OCR Enhancement, Medieval Scripts, Character Recognition, Machine Learning
Toward Socially Aware Vision-Language Models: Evaluating Cultural Competence Through Multimodal Story Generation
arxiv.org·18h
Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation
arxiv.org·18h
Interpretable Early Failure Detection via Machine Learning and Trace Checking-based Monitoring
arxiv.org·18h
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill \& Decode Inference
arxiv.org·1d
MSNav: Zero-Shot Vision-and-Language Navigation with Dynamic Memory and LLM Spatial Reasoning
arxiv.org·18h
Learning to Detect Label Errors by Making Them: A Method for Segmentation and Object Detection Datasets
arxiv.org·18h
An Efficient Dual-Line Decoder Network with Multi-Scale Convolutional Attention for Multi-organ Segmentation
arxiv.org·18h
Loading...Loading more...