🤖 Agents using LLMs - pleto · Scour

MedCTA: A Benchmark for Clinical Tool Agents

📊AI Performance Profiling Academic

Vortex: Efficient and Programmable Sparse Attention Serving for AI Agents

💬Prompt optimizations for LLM serving Academic

IS-CoT: Breaking the Long-form Generation Collapse via Interleaved Structural Thinking

🧠Large Language Models (LLMs) Academic

A Five-Plane Reference Architecture for Runtime Governance of Production AI Agents

⚙️AI Infrastructure Automation Academic

From Holistic Evaluation to Structured Criteria: Rubrics Across the Evolving LLM Landscape

🏋️LLM training frameworks Academic

Personal AI Agent for Camera Roll VQA

📊AI Performance Profiling Academic

Exploration Structure in LLM Agents for Multi-File Change Localization

✨Model optimizations in LLMs Academic

Autonomous Incident Resolution at Hyperscale: An Agentic AI Architecture for Network Operations

⚙️AI Infrastructure Automation Academic

Notes2Skills: From Lab Notebooks to Certainty-Aware Scientific Agent Skills

🧠Large Language Models (LLMs) Academic

CollabSim: A CSCW-Grounded Methodology for Investigating Collaborative Competence of LLM Agents through Controlled Multi-Agent Experiments

🧠Large Language Models (LLMs) Academic

VATS: Exploiting Implicit Authority in Error-Path Injection via Systematic Mutation

🔧Systems-level optimizations for LLM serving Academic

LLMs+Graphs: Toward Graph-Native, Synergistic AI Systems

🧠Large Language Models (LLMs) Academic

The End of Software Engineering: How AI Agents Are Fundamentally Restructuring the Software Paradigm

⚙️AI Infrastructure Automation Academic

Benchmarking Open-Ended Multi-Agent Coordination in Language Agents

🧠Large Language Models (LLMs) Academic

Comparing Sentiment Contagion in AI-Agent and Human Social Networks: Evidence from MOLTBOOK

🧠Large Language Models (LLMs) Academic

RAILS: Verification-Native Clearing For Agentic Commerce

⚙️AI Infrastructure Automation Academic

SG2Loc: Sequential Visual Localization on 3D Scene Graphs

🧠Large Language Models (LLMs) Academic

SHIELDS: Automating OS Hardening with Iterative Multi-Agent Remediation

⚙️AI Infrastructure Automation Academic

InquiTree: Evaluating AI Agents in the Scientific Inquiry Loop with Paper-Derived Research Trees

⚙️AI Infrastructure Automation Academic

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

⚙️AI Infrastructure Automation Academic

Log in to enable infinite scrolling