Auditing LLM Editorial Bias in News Media Exposure
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Independent Clinical Evaluation of General-Purpose LLM Responses to Signals of Suicide Risk
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Dialogue as Discovery: Navigating Human Intent Through Principled Inquiry
arxiv.org·1d
💬Prompt optimizations for LLM serving
Flag this post
LLM-Centric RAG with Multi-Granular Indexing and Confidence Constraints
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Make Sense of Video Analytics by Integrating NVIDIA AI Blueprints
developer.nvidia.com·1d
📊AI Performance Profiling
Flag this post
Thought Branches: Interpreting LLM Reasoning Requires Resampling
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Culture Cartography: Mapping the Landscape of Cultural Knowledge
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Reasoning Models Sometimes Output Illegible Chains of Thought
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Depth and Autonomy: A Framework for Evaluating LLM Applications in Social Science Research
arxiv.org·5d
🧠Large Language Models (LLMs)
Flag this post
AURA: A Reinforcement Learning Framework for AI-Driven Adaptive Conversational Surveys
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
VISTA Score: Verification In Sequential Turn-based Assessment
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
From product to system network challenges in system of systems lifecycle management
arxiv.org·1d
🔧Systems-level optimizations for LLM serving
Flag this post
Can MLLMs Read the Room? A Multimodal Benchmark for Verifying Truthfulness in Multi-Party Social Interactions
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
"Koyi Sawaal Nahi Hai": Reimagining Maternal Health Chatbots for Collective, Culturally Grounded Care
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Patient-Centered Summarization Framework for AI Clinical Summarization: A Mixed-Methods Design
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
AstuteRAG-FQA: Task-Aware Retrieval-Augmented Generation Framework for Proprietary Data Challenges in Financial Question Answering
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
DialectalArabicMMLU: Benchmarking Dialectal Capabilities in Arabic and Multilingual Language Models
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Beyond Visualization: Building Decision Intelligence Through Iterative Dashboard Refinement
arxiv.org·1d
🔍Retrieval-augmented generation
Flag this post
Reasoning Curriculum: Bootstrapping Broad LLM Reasoning from Math
arxiv.org·4d
🧠Large Language Models (LLMs)
Flag this post
Loading...Loading more...