Ariadne: A Controllable Framework for Probing and Extending VLM Reasoning Boundaries
arxiv.org·1d
⚙️AI Infrastructure Automation
Flag this post
Understanding Code Agent Behaviour: An Empirical Study of Success and Failure Trajectories
arxiv.org·1d
🤖Agents using LLMs
Flag this post
Algorithmic Assistance with Recommendation-Dependent Preferences
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Graph-Enhanced Policy Optimization in LLM Agent Training
arxiv.org·5d
🤖Agents using LLMs
Flag this post
STACKFEED: Structured Textual Actor-Critic Knowledge Base Editing with FeedBack
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Surfacing Subtle Stereotypes: A Multilingual, Debate-Oriented Evaluation of Modern LLMs
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R
arxiv.org·1d
📊AI Performance Profiling
Flag this post
AURA: A Reinforcement Learning Framework for AI-Driven Adaptive Conversational Surveys
arxiv.org·2d
🧠Large Language Models (LLMs)
Flag this post
ParaScopes: What do Language Models Activations Encode About Future Text?
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Coverage Analysis and Optimization of FIRES-Assisted NOMA and OMA Systems
arxiv.org·1d
🔧Systems-level optimizations for LLM serving
Flag this post
Challenges in Credit Assignment for Multi-Agent Reinforcement Learning in Open Agent Systems
arxiv.org·2d
🤖Agents using LLMs
Flag this post
Computation as a Game
arxiv.org·1d
💬Prompt optimizations for LLM serving
Flag this post
ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation
arxiv.org·1d
🔍Retrieval-augmented generation
Flag this post
PROPEX-RAG: Enhanced GraphRAG using Prompt-Driven Prompt Execution
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
InertialAR: Autoregressive 3D Molecule Generation with Inertial Frames
arxiv.org·2d
🔍Retrieval-augmented generation
Flag this post