Agents using LLMs

Feeds to Scour
SubscribedAll
Scoured 74 posts in 7.2 ms

VASO: Formally Verifiable Self-Evolving Skills for Physical AI Agents

 ⚙️AI Infrastructure Automation  Content type: Academic
arxiv.org·

Stability Without Safety: Gain Manipulation Attacks on Agentic Cyber-Physical Systems

 ⚙️AI Infrastructure Automation  Content type: Academic
arxiv.org·

AI Coding Agents in Social Science: Methodologically Diverse, Empirically Consistent, Interpretively Vulnerable

 🧠Large Language Models (LLMs)  Content type: Academic
arxiv.org·

ADK Arena: Evaluating Agent Development Kits via LLM-as-a-Developer

 💬Prompt optimizations for LLM serving  Content type: Academic
arxiv.org·

SKILL.nb: Selective Formalization and Gated Execution for Durable Agent Workflows

 ⚙️AI Infrastructure Automation  Content type: Academic
arxiv.org·

Learning Multi-Agent Communication Protocol: Study on Information Entropy Efficiency in MARL

 🌐Distributed LLM Systems  Content type: Academic
arxiv.org·

Where Instruction Hierarchy Breaks: Diagnosing and Repairing Failures in Reasoning Language Models

 📊AI Performance Profiling  Content type: Academic
arxiv.org·

Self-Paced Curriculum Reinforcement Learning for Autonomous Superbike Racing in Simulation

 Model optimizations in LLMs  Content type: Academic
arxiv.org·

What Should Agents Say? Action-state Communication for Efficient Multi-Agent Systems

 🧠Large Language Models (LLMs)  Content type: Academic
arxiv.org·

SwarmSense-DNN: A Trustworthy and Decentralized Neural Framework for Proactive Anomaly Defense in Consumer IoT

 🧠Large Language Models (LLMs)  Content type: Academic
arxiv.org·

Representational Similarity and Model Behavior in Multi-Agent Interaction

 🔍Retrieval-augmented generation  Content type: Academic
arxiv.org·

SCALE: Scalable Cross-Attention Learning with Extrapolation for Agentic Workflow Scheduling

 📊AI Performance Profiling  Content type: Academic
arxiv.org·

The Impossibility of Eliciting Latent Knowledge

 🧠Large Language Models (LLMs)  Content type: Academic
arxiv.org·

The Token Not Taken: Sampling, State, and the Variability of AI Agent Outputs

 Real-time AI Systems  Content type: Academic
arxiv.org·

Efficient Multi-Agent Optimization of Optical Power in S+C+L-Band Systems

 🔧Systems-level optimizations for LLM serving  Content type: Academic
arxiv.org·

Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields

 📊AI Performance Profiling  Content type: Academic
arxiv.org·

EGTR-Review: Efficient Evidence-Grounded Scientific Peer Review Generation via Multi-Agent Teacher Distillation

 🧠Large Language Models (LLMs)  Content type: Academic
arxiv.org·

WebMCP Tool Surface Poisoning: Runtime Manipulation Attacks on LLM Agents

 🔧Systems-level optimizations for LLM serving  Content type: Academic
arxiv.org·

ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research

 📊AI Performance Profiling  Content type: Academic
arxiv.org·

SubtleMemory: A Benchmark for Fine-Grained Relational Memory Discrimination in Long-Horizon AI Agents

 📊AI Performance Profiling  Content type: Academic
arxiv.org·

No more posts from pleto's subscribed feeds.

Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help