✍️ Prompt Engineering
Specific
prompt design, system prompts, prompt techniques, LLM prompting
BEACON: Behavioral Entropy Aggregation for Cross-Model Hallucination Detection in Large Language Models
🤖 Agentic Engineering · Content type: Academic · arxiv.org · 2 days ago
Declarative Skills for AI Agents in Knowledge-Grounded Tool-Use Workflows
🤖 AI Agents · Content type: Academic · arxiv.org · 3 days ago
Mutation Without Variation: Convergence Dynamics in LLM-Driven Program Evolution
🧠 LLMs · Content type: Academic · arxiv.org · 6 days ago
LLM-Guided Neural Architecture Search for Robust Co-Design of Physical Neural Networks
🧠 LLMs · Content type: Academic · arxiv.org · 1 day ago
You Only Index Once: Cross-Layer Sparse Attention with Shared Routing
🤖 Agentic Engineering · Content type: Academic · arxiv.org · 6 days ago
When No Answer Is Correct: Diagnosing Absent Answer Detection for MLLMs in Video Understanding
🤖 Agentic Engineering · Content type: Academic · arxiv.org · 2 days ago
Think Fast: Estimating No-CoT Task-Completion Time Horizons of Frontier AI Models
🤖 Agentic Engineering · Content type: Academic · arxiv.org · 3 days ago
Proxy Reward Internalization and Mechanistic Exploitation: A Learned Precursor to Reward Hacking and Its Generalization
🤖 Agentic Engineering · Content type: Academic · arxiv.org · 2 days ago
Arithmetic Pedagogy for Language Models
🤖 Agentic Engineering · Content type: Academic · arxiv.org · 6 days ago · Hacker News
VisualLeakBench: Reproducible Action-Boundary Propagation Failures in Vision-Language Agents
⚙️ AI Automation · Content type: Academic · arxiv.org · 2 days ago
Quantum-Inspired Trace-Augmented Evidence Selection for Reasoning over Structured Hypothesis Spaces
⚛️ Quantum Computing · Content type: Academic · arxiv.org · 3 days ago
LLM-Based Code Documentation Generation and Multi-Judge Evaluation
🧠 LLMs · Content type: Academic · arxiv.org · 1 day ago
Towards Autonomous Accelerator Design: FPGA Accelerator Generation with SECDA
🤖 Agentic Engineering · Content type: Academic · arxiv.org · 1 day ago
IMUG-Bench: Benchmarking Unified Multimodal Models on Interleaved Understanding and Generation
🤖 Agentic Engineering · Content type: Academic · arxiv.org · 2 days ago
Dep-LLM: Training-Free Depression Diagnosis via Evidence-Guided Structured Multi-factor with Reliable LLM Reasoning
🤖 Agentic Engineering · Content type: Academic · arxiv.org · 1 day ago
CogManip: Benchmarking Manipulative Behavior in Multi-Turn Interactions with Large Language Model
🧠 LLMs · Content type: Academic · arxiv.org · 6 days ago
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment
🤖 Multi-Agent Systems · Content type: Academic · arxiv.org · 1 day ago
How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions
🤖 Agentic Engineering · Content type: Academic · arxiv.org · 2 days ago
UrduMMLU: A Massive Multitask Benchmark for Urdu Language Understanding
🧠 LLMs · Content type: Academic · arxiv.org · 3 days ago
Domain-Conditioned Safety in Frontier Computer-Using Agents: A 793-Episode Browser Benchmark, a Coding-Domain Cross-Reference, and a Reproducibility Audit of Recent Red-Teaming
⚙️ AI Automation · Content type: Academic · arxiv.org · 6 days ago