Adaptive Human-Computer Interaction Strategies Through Reinforcement Learning in Complex
arxiv.org·1d
⚡Real-time AI Systems
Contrastive Knowledge Transfer and Robust Optimization for Secure Alignment of Large Language Models
arxiv.org·1d
🧠Large Language Models (LLMs)
AURA: A Reinforcement Learning Framework for AI-Driven Adaptive Conversational Surveys
arxiv.org·1d
🧠Large Language Models (LLMs)
Beyond Visualization: Building Decision Intelligence Through Iterative Dashboard Refinement
arxiv.org·1d
🔍Retrieval-augmented generation
Personalized AI Scaffolds Synergistic Multi-Turn Collaboration in Creative Work
arxiv.org·1d
🔍Retrieval-augmented generation
Questionnaire meets LLM: A Benchmark and Empirical Study of Structural Skills for Understanding Questions and Responses
arxiv.org·4d
🧠Large Language Models (LLMs)
Do Not Step Into the Same River Twice: Learning to Reason from Trial and Error
arxiv.org·4d
🧠Large Language Models (LLMs)
Empowering RepoQA-Agent based on Reinforcement Learning Driven by Monte-carlo Tree Search
arxiv.org·4d
🤖Agents using LLMs
Expressive Range Characterization of Open Text-to-Audio Models
arxiv.org·1d
🧠Large Language Models (LLMs)
Reward Collapse in Aligning Large Language Models
arxiv.org·4d
🧠Large Language Models (LLMs)
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
arxiv.org·4d
🧠Large Language Models (LLMs)
Positivity-preserving Well-balanced PAMPA Schemes with Global Flux quadrature for One-dimensional Shallow Water Models
arxiv.org·1d
🔢Quantization of LLMs
Simplifying Preference Elicitation in Local Energy Markets: Combinatorial Clock Exchange
arxiv.org·1d
🌐Distributed LLM Systems
Challenges in Credit Assignment for Multi-Agent Reinforcement Learning in Open Agent Systems
arxiv.org·1d
🤖Agents using LLMs
Implicature in Interaction: Understanding Implicature Improves Alignment in Human-LLM Interaction
arxiv.org·5d
🧠Large Language Models (LLMs)
zFLoRA: Zero-Latency Fused Low-Rank Adapters
arxiv.org·4d
🔧Systems-level optimizations for LLM serving
Un-Attributability: Computing Novelty From Retrieval & Semantic Similarity
arxiv.org·1d
🧠Large Language Models (LLMs)
From product to system network challenges in system of systems lifecycle management
arxiv.org·1d
🔧Systems-level optimizations for LLM serving