Breaking the SFT Plateau: Multimodal Structured Reinforcement Learning for Chart-to-Code Generation
arxiv.org·3h
Can Large Language Models (LLMs) Describe Pictures Like Children? A Comparative Corpus Study
arxiv.org·3h
Color Spike Data Generation via Bio-inspired Neuron-like Encoding with an Artificial Photoreceptor Layer
arxiv.org·3h
MOON: Generative MLLM-based Multimodal Representation Learning for E-commerce Product Understanding
arxiv.org·1d
YOLO11-CR: a Lightweight Convolution-and-Attention Framework for Accurate Fatigue Driving Detection
arxiv.org·3h
Loading...Loading more...