AI Agents

Benchmarking Open-Ended Multi-Agent Coordination in Language Agents

 💬LLMs  Content type: Academic
arxiv.org

Multi-agent rendezvous in fluid flows via reinforcement learning

 🎮Reinforcement Learning  Content type: Academic
arxiv.org

Quantitative Promise Theory: Intentionality and Inference in Autonomous Agents

 📐Scaling Laws  Content type: Academic
arxiv.org

DexFuture: Hierarchical Future-State Visuomotor Targeting for Bimanual Dexterous Tool Use

 🎮Reinforcement Learning  Content type: Academic
arxiv.org

Counterexample Guided Learning in the Large using Reasoning Agents

 💬LLMs  Content type: Academic
arxiv.org

Brain-Prompt Injection: A Route-Safety Audit for BCI-LLM Agents

 🎮Reinforcement Learning  Content type: Academic
arxiv.org

FlowBank: Query-Adaptive Agentic Workflows Optimization through Precompute-and-Reuse

 🖥️ML Systems  Content type: Academic
arxiv.org

Self-evolving LLM agents with in-distribution Optimization

 🎮Reinforcement Learning  Content type: Academic
arxiv.org

A Barrier-Modulated Architecture for Safe Affine Formation Control in Second-Order Multi-Agent Systems

 🎮Reinforcement Learning  Content type: Academic
arxiv.org

Phi-Actor-Critic: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria

 🎮Reinforcement Learning  Content type: Academic
arxiv.org

Decision-Aware Memory Cards: Counterfactual-Inspired Context Selection and Compression for Tool-Using LLM Agents

 🎮Reinforcement Learning  Content type: Academic
arxiv.org

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

 🎮Reinforcement Learning  Content type: Academic
arxiv.org

Agent Economics: An Entropy-Controlled Pluralistic Alignment Framework for Preventing Artificial Hivemind in Autonomous Agents

 🔄Transformers  Content type: Academic
arxiv.org

Embodied-BenchClaw: An Autonomous Multi-Agent System for Embodied Spatial Intelligence Benchmark Construction

 📐Scaling Laws  Content type: Academic
arxiv.org

Humans' ALMANAC: A Human Collaboration Dataset of Action-Level Mental Model Annotations for Agent Collaboration

 💬LLMs  Content type: Academic
arxiv.org

Goal-Autopilot: A Verifiable Anti-Fabrication Firewall for Unattended Long-Horizon Agents

 🎮Reinforcement Learning  Content type: Academic
arxiv.org

Beyond Goodhart's Law: A Dynamic Benchmark for Evaluating Compliance in Multi-Agent Systems

 📐Scaling Laws  Content type: Academic
arxiv.org

TAPO: Tool-Aware Policy Optimization via Credit Transfer for Multimodal Search Agents

 🎮Reinforcement Learning  Content type: Academic
arxiv.org

DuplexOmni: Real-Time Listening, Seeing, Thinking, and Speaking for Full-Duplex Interaction

 🔄Transformers  Content type: Academic
arxiv.org

Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents

 🧠AI Research  Content type: Academic
arxiv.org
