RLAC: Reinforcement Learning with Adversarial Critic for Free-Form Generation Tasks
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Understanding Code Agent Behaviour: An Empirical Study of Success and Failure Trajectories
arxiv.org·1d
🤖Agents using LLMs
Flag this post
Adding New Capability in Existing Scientific Application with LLM Assistance
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Fast Answering Pattern-Constrained Reachability Queries with Two-Dimensional Reachability Index
arxiv.org·1d
🌐Distributed LLM Systems
Flag this post
Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
What a diff makes: automating code migration with large language models
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Simplifying Preference Elicitation in Local Energy Markets: Combinatorial Clock Exchange
arxiv.org·2d
🌐Distributed LLM Systems
Flag this post
Analyzing Sustainability Messaging in Large-Scale Corporate Social Media
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
How Code Execution Drives Key Risks in Agentic AI Systems
developer.nvidia.com·6d
🔧Systems-level optimizations for LLM serving
Flag this post
LongCat-Flash-Omni Technical Report
arxiv.org·1d
Real-time AI Systems
Flag this post
Chain of Time: In-Context Physical Simulation with Image Generation Models
arxiv.org·1d
Real-time AI Systems
Flag this post
Split Learning-Enabled Framework for Secure and Light-weight Internet of Medical Things Systems
arxiv.org·1d
Real-time AI Systems
Flag this post