Adaptive Human-Computer Interaction Strategies Through Reinforcement Learning in Complex
arxiv.org·1d
Real-time AI Systems
AURA: A Reinforcement Learning Framework for AI-Driven Adaptive Conversational Surveys
arxiv.org·1d
🧠Large Language Models (LLMs)
Beyond Visualization: Building Decision Intelligence Through Iterative Dashboard Refinement
arxiv.org·1d
🔍Retrieval-augmented generation
Personalized AI Scaffolds Synergistic Multi-Turn Collaboration in Creative Work
arxiv.org·1d
🔍Retrieval-augmented generation
Do Not Step Into the Same River Twice: Learning to Reason from Trial and Error
arxiv.org·4d
🧠Large Language Models (LLMs)
Empowering RepoQA-Agent based on Reinforcement Learning Driven by Monte-carlo Tree Search
arxiv.org·4d
🤖Agents using LLMs
Expressive Range Characterization of Open Text-to-Audio Models
arxiv.org·1d
🧠Large Language Models (LLMs)
Reward Collapse in Aligning Large Language Models
arxiv.org·4d
🧠Large Language Models (LLMs)
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
arxiv.org·4d
🧠Large Language Models (LLMs)
Simplifying Preference Elicitation in Local Energy Markets: Combinatorial Clock Exchange
arxiv.org·1d
🌐Distributed LLM Systems
Challenges in Credit Assignment for Multi-Agent Reinforcement Learning in Open Agent Systems
arxiv.org·1d
🤖Agents using LLMs
zFLoRA: Zero-Latency Fused Low-Rank Adapters
arxiv.org·4d
🔧Systems-level optimizations for LLM serving
Un-Attributability: Computing Novelty From Retrieval & Semantic Similarity
arxiv.org·1d
🧠Large Language Models (LLMs)
Testing Cross-Lingual Text Comprehension In LLMs Using Next Sentence Prediction
arxiv.org·5d
🧠Large Language Models (LLMs)
RAVR: Reference-Answer-guided Variational Reasoning for Large Language Models
arxiv.org·5d
🧠Large Language Models (LLMs)