RegionRAG: Region-level Retrieval-Augumented Generation for Visually-Rich Documents
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
LLM-Centric RAG with Multi-Granular Indexing and Confidence Constraints
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
R²D²: Perception-Guided Task & Motion Planning for Long-Horizon Manipulation
developer.nvidia.com·1d
Real-time AI Systems
Flag this post
Un-Attributability: Computing Novelty From Retrieval & Semantic Similarity
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
InertialAR: Autoregressive 3D Molecule Generation with Inertial Frames
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
VISTA Score: Verification In Sequential Turn-based Assessment
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Generating Accurate and Detailed Captions for High-Resolution Images
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Reasoning Models Sometimes Output Illegible Chains of Thought
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Mitigating Semantic Collapse in Partially Relevant Video Retrieval
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Personalized AI Scaffolds Synergistic Multi-Turn Collaboration in Creative Work
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Dual-Stream Diffusion for World-Model Augmented Vision-Language-Action Model
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Culture Cartography: Mapping the Landscape of Cultural Knowledge
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Expressive Range Characterization of Open Text-to-Audio Models
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Multi-Representation Attention Framework for Underwater Bioacoustic Denoising and Recognition
arxiv.org·1d
🔢Quantization of LLMs
Flag this post
A Machine Learning-Based Framework to Shorten the Questionnaire for Assessing Autism Intervention
arxiv.org·1d
📊AI Performance Profiling
Flag this post