Agents using LLMs

Feeds to Scour
SubscribedAll
Scoured 74 posts in 4.3 ms

MedCTA: A Benchmark for Clinical Tool Agents

 📊AI Performance Profiling  Content type: Academic
arxiv.org·

Vortex: Efficient and Programmable Sparse Attention Serving for AI Agents

 💬Prompt optimizations for LLM serving  Content type: Academic
arxiv.org·

IS-CoT: Breaking the Long-form Generation Collapse via Interleaved Structural Thinking

 🧠Large Language Models (LLMs)  Content type: Academic
arxiv.org·

A Five-Plane Reference Architecture for Runtime Governance of Production AI Agents

 ⚙️AI Infrastructure Automation  Content type: Academic
arxiv.org·

From Holistic Evaluation to Structured Criteria: Rubrics Across the Evolving LLM Landscape

 🏋️LLM training frameworks  Content type: Academic
arxiv.org·

Personal AI Agent for Camera Roll VQA

 📊AI Performance Profiling  Content type: Academic
arxiv.org·

Exploration Structure in LLM Agents for Multi-File Change Localization

 Model optimizations in LLMs  Content type: Academic
arxiv.org·

Autonomous Incident Resolution at Hyperscale: An Agentic AI Architecture for Network Operations

 ⚙️AI Infrastructure Automation  Content type: Academic
arxiv.org·

Notes2Skills: From Lab Notebooks to Certainty-Aware Scientific Agent Skills

 🧠Large Language Models (LLMs)  Content type: Academic
arxiv.org·

CollabSim: A CSCW-Grounded Methodology for Investigating Collaborative Competence of LLM Agents through Controlled Multi-Agent Experiments

 🧠Large Language Models (LLMs)  Content type: Academic
arxiv.org·

VATS: Exploiting Implicit Authority in Error-Path Injection via Systematic Mutation

 🔧Systems-level optimizations for LLM serving  Content type: Academic
arxiv.org·

LLMs+Graphs: Toward Graph-Native, Synergistic AI Systems

 🧠Large Language Models (LLMs)  Content type: Academic
arxiv.org·

The End of Software Engineering: How AI Agents Are Fundamentally Restructuring the Software Paradigm

 ⚙️AI Infrastructure Automation  Content type: Academic
arxiv.org·

Benchmarking Open-Ended Multi-Agent Coordination in Language Agents

 🧠Large Language Models (LLMs)  Content type: Academic
arxiv.org·

Comparing Sentiment Contagion in AI-Agent and Human Social Networks: Evidence from MOLTBOOK

 🧠Large Language Models (LLMs)  Content type: Academic
arxiv.org·

RAILS: Verification-Native Clearing For Agentic Commerce

 ⚙️AI Infrastructure Automation  Content type: Academic
arxiv.org·

SG2Loc: Sequential Visual Localization on 3D Scene Graphs

 🧠Large Language Models (LLMs)  Content type: Academic
arxiv.org·

SHIELDS: Automating OS Hardening with Iterative Multi-Agent Remediation

 ⚙️AI Infrastructure Automation  Content type: Academic
arxiv.org·

InquiTree: Evaluating AI Agents in the Scientific Inquiry Loop with Paper-Derived Research Trees

 ⚙️AI Infrastructure Automation  Content type: Academic
arxiv.org·

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

 ⚙️AI Infrastructure Automation  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help