AI Assistants

Feeds to Scour
SubscribedAll
Scoured 74 posts in 5.0 ms

VESTA: A Fully Automated Scenario Generation and Safety Evaluation Framework for LLM Agents

 🧠LLMs  Content type: Academic
arxiv.org·

DeployBench: Benchmarking LLM Agents for Research Artifact Deployment

 🧠LLMs  Content type: Academic
arxiv.org·

AGENTSERVESIM: A Hardware-aware Simulator for Multi-Turn LLM Agent Serving

 🧠LLMs  Content type: Academic
arxiv.org·

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

 🤖AI Coding Tools  Content type: Academic
arxiv.org·

LLM Agent-Assisted Reverse Engineering with Quantitative Readability Metrics

 🧠LLMs  Content type: Academic
arxiv.org·

TRACE: Trajectory Reasoning through Adaptive Cross-Step Evidence Aggregation for LLM Agents

 🤖AI Coding Tools  Content type: Academic
arxiv.org·

Provably Auditable and Safe LLM Agents from Human-Authored Ontologies

 🧠LLMs  Content type: Academic
arxiv.org·

SecureClaw: Clawing Back Control of LLM Agents

 🧠LLMs  Content type: Academic
arxiv.org·

Context-Fractured Decomposition Attacks on Tool-Using LLM Agents: Exploiting Artifact Provenance Gaps

 🧠LLMs  Content type: Academic
arxiv.org·

Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents

 🧠LLMs  Content type: Academic
arxiv.org·

MemToolAgent overview with a simple restaurant booking scenario where the agent retrieves similar memories, receives feedback on an invalid time format, and generates a reflection to update its memory

 🧠LLMs  Content type: Academic
arxiv.org·

Will the Agent Recuse Itself? Measuring LLM-Agent Compliance with In-Band Access-Deny Signals

 🛡️Security Advisories  Content type: Academic
arxiv.org·

Memory Beyond Recall: A Dual-Process Cognitive Memory System for Self-Evolving LLM Agents

 🧠LLMs  Content type: Academic
arxiv.org·

REFLECT: Intervention-Supported Error Attribution for Silent Failures in LLM Agent Traces

 🧠LLMs  Content type: Academic
arxiv.org·

From Untrusted Input to Trusted Memory: A Systematic Study of Memory Poisoning Attacks in LLM Agents

 🛡️Memory Safety  Content type: Academic
arxiv.org·

Causal Agent Replay: Counterfactual Attribution for LLM-Agent Failures

 🧠LLMs  Content type: Academic
arxiv.org·

Plan First, Judge Later, Run Better: A DMAIC-Inspired Agentic System for Industrial Anomaly Detection

 🏛️Software Architecture  Content type: Academic
arxiv.org·

Decision-Aware Memory Cards: Counterfactual-Inspired Context Selection and Compression for Tool-Using LLM Agents

 🧠LLMs  Content type: Academic
arxiv.org·

OpenSkill: Open-World Self-Evolution for LLM Agents

 🧠LLMs  Content type: Academic
arxiv.org·

Caught in the Act(ivation): Toward Pre-Output and Multi-Turn Detection of Credential Exfiltration by LLM Agents

 🧠LLMs  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help