AI Agents

Feeds to Scour
SubscribedAll
Scoured 57 posts in 6.5 ms

ADK Arena: Evaluating Agent Development Kits via LLM-as-a-Developer

 🔗LLM Workflows  Content type: Academic
arxiv.org·

Self-Paced Curriculum Reinforcement Learning for Autonomous Superbike Racing in Simulation

 🤖AI Coding  Content type: Academic
arxiv.org·

Latent Reasoning Guidance for Parallel Code Translation

 🧠LLMs  Content type: Academic
arxiv.org·

From Holistic Evaluation to Structured Criteria: Rubrics Across the Evolving LLM Landscape

 🔗LLM Workflows  Content type: Academic
arxiv.org·

Entropy-Based Evaluation of AI Agents: A Lightweight Framework for Measuring Behavioral Patterns

 📚RAG  Content type: Academic
arxiv.org·

Representational Similarity and Model Behavior in Multi-Agent Interaction

 🔗LLM Workflows  Content type: Academic
arxiv.org·

RSC: Decentralized Rigid Formation Flocking for Large-Scale Swarms via Hybrid Predictive Control and Online Reconfiguration

 🤖AI Coding  Content type: Academic
arxiv.org·

MAVIS: Multi-Agent Video Retrieval via Structured Video Understanding

 🔗LLM Workflows  Content type: Academic
arxiv.org·

Plan First, Judge Later, Run Better: A DMAIC-Inspired Agentic System for Industrial Anomaly Detection

 🔗LLM Workflows  Content type: Academic
arxiv.org·

FALSIFYBENCH: Evaluating Inductive Reasoning in LLMs with Rule Discovery Games

 🧠LLMs  Content type: Academic
arxiv.org·

TianJi-Environ: An Autonomous AI Scientist for Atmospheric Environmental Research

 🔗LLM Workflows  Content type: Academic
arxiv.org·

SCOUT: Semantic scene COverage via Uncertainty-guided Traversal

 🧠LLMs  Content type: Academic
arxiv.org·

Strabo: Declarative Specification and Implementation of Agentic Interaction Protocols

 🔗LLM Workflows  Content type: Academic
arxiv.org·

Cascading Hallucination in Agentic RAG: The CHARM Framework for Detection and Mitigation

 📚RAG  Content type: Academic
arxiv.org·

Trustworthy Smart Fabs via Professional Proxies: Scaling Safe and Sustainable by Design (SSbD) through Industrial Data Spaces

 🎼Data Orchestration  Content type: Academic
arxiv.org·

The Saturation Trap and the Subjectivity of Intervention Timing: Why Affect-Based Triggers and LLM Judges Fail to Time Interventions on Autonomous Agents

 🧠LLMs  Content type: Academic
arxiv.org·

AutoLab: Can Frontier Models Solve Long-Horizon Auto Research and Engineering Tasks?

 🤖AI Coding  Content type: Academic
arxiv.org·

No more posts from cwensel's subscribed feeds.

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help