🤖 AI Agents - Bingran · Scour

Benchmarking Open-Ended Multi-Agent Coordination in Language Agents

💬LLMs Academic

Multi-agent rendezvous in fluid flows via reinforcement learning

🎮Reinforcement Learning Academic

Quantitative Promise Theory: Intentionality and Inference in Autonomous Agents

📐Scaling Laws Academic

DexFuture: Hierarchical Future-State Visuomotor Targeting for Bimanual Dexterous Tool Use

🎮Reinforcement Learning Academic

Counterexample Guided Learning in the Large using Reasoning Agents

💬LLMs Academic

Brain-Prompt Injection: A Route-Safety Audit for BCI-LLM Agents

🎮Reinforcement Learning Academic

FlowBank: Query-Adaptive Agentic Workflows Optimization through Precompute-and-Reuse

🖥️ML Systems Academic

Self-evolving LLM agents with in-distribution Optimization

🎮Reinforcement Learning Academic

A Barrier-Modulated Architecture for Safe Affine Formation Control in Second-Order Multi-Agent Systems

🎮Reinforcement Learning Academic

Phi-Actor-Critic: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria

🎮Reinforcement Learning Academic

Decision-Aware Memory Cards: Counterfactual-Inspired Context Selection and Compression for Tool-Using LLM Agents

🎮Reinforcement Learning Academic

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

🎮Reinforcement Learning Academic

Agent Economics: An Entropy-Controlled Pluralistic Alignment Framework for Preventing Artificial Hivemind in Autonomous Agents

🔄Transformers Academic

Embodied-BenchClaw: An Autonomous Multi-Agent System for Embodied Spatial Intelligence Benchmark Construction

📐Scaling Laws Academic

Humans' ALMANAC: A Human Collaboration Dataset of Action-Level Mental Model Annotations for Agent Collaboration

💬LLMs Academic

Goal-Autopilot: A Verifiable Anti-Fabrication Firewall for Unattended Long-Horizon Agents

🎮Reinforcement Learning Academic

Beyond Goodhart's Law: A Dynamic Benchmark for Evaluating Compliance in Multi-Agent Systems

📐Scaling Laws Academic

TAPO: Tool-Aware Policy Optimization via Credit Transfer for Multimodal Search Agents

🎮Reinforcement Learning Academic

DuplexOmni: Real-Time Listening, Seeing, Thinking, and Speaking for Full-Duplex Interaction

🔄Transformers Academic

Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents

🧠AI Research Academic

No more posts from Bingran's subscribed feeds.

Scour all 25258 feeds Learn more about Feeds

Sign up or log in to see more results

Log in to enable infinite scrolling