Algorithmic Assistance with Recommendation-Dependent Preferences
arxiv.org·21h
🧠Large Language Models (LLMs)
Flag this post
LLM-Centric RAG with Multi-Granular Indexing and Confidence Constraints
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Challenges in Credit Assignment for Multi-Agent Reinforcement Learning in Open Agent Systems
arxiv.org·1d
🤖Agents using LLMs
Flag this post
Application of predictive machine learning in pen & paper RPG game design
arxiv.org·21h
📊AI Performance Profiling
Flag this post
Cross-Platform Evaluation of Reasoning Capabilities in Foundation Models
arxiv.org·4d
📊AI Performance Profiling
Flag this post
OpenSIR: Open-Ended Self-Improving Reasoner
arxiv.org·21h
💬Prompt optimizations for LLM serving
Flag this post
ARC-GEN: A Mimetic Procedural Benchmark Generator for the Abstraction and Reasoning Corpus
arxiv.org·21h
🔍Retrieval-augmented generation
Flag this post
Patient-Centered Summarization Framework for AI Clinical Summarization: A Mixed-Methods Design
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Reasoning Models Sometimes Output Illegible Chains of Thought
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
AURA: A Reinforcement Learning Framework for AI-Driven Adaptive Conversational Surveys
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
SpatialTraceGen: High-Fidelity Traces for Efficient VLM Spatial Reasoning Distillation
arxiv.org·21h
🧠Large Language Models (LLMs)
Flag this post
Make Sense of Video Analytics by Integrating NVIDIA AI Blueprints
developer.nvidia.com·2d
📊AI Performance Profiling
Flag this post
Disciplined Biconvex Programming
arxiv.org·21h
Model optimizations in LLMs
Flag this post
STACKFEED: Structured Textual Actor-Critic Knowledge Base Editing with FeedBack
arxiv.org·21h
🧠Large Language Models (LLMs)
Flag this post
Surfacing Subtle Stereotypes: A Multilingual, Debate-Oriented Evaluation of Modern LLMs
arxiv.org·21h
🧠Large Language Models (LLMs)
Flag this post
Human-AI Programming Role Optimization: Developing a Personality-Driven Self-Determination Framework
arxiv.org·21h
📊AI Performance Profiling
Flag this post
Do Math Reasoning LLMs Help Predict the Impact of Public Transit Events?
arxiv.org·21h
🧠Large Language Models (LLMs)
Flag this post