Reasoning Models

Version Control and Agent Audit Platform

 💾Agent Memory
cognatoai.com··Hacker News

Tight Sample Complexity of Transformers

 ✍️Prompt Engineering  Content type: Academic
arxiv.org·

Ex150salmon review: Failure after only 14 days

 🔁Spaced Repetition  Content type: News
exfatloss.com··Hacker News

OpenMedReason: Scientific Reasoning Supervision for Medical Vision-Language Models

 👁️Multimodal AI  Content type: Academic
arxiv.org·

Dropout-GRPO: Variational Stochasticity for Continuous Latent Reasoning

 🎯Reinforcement Learning  Content type: Academic
arxiv.org·

Contextual Identity Laundering: How Claude’s Image Refusal Can Be Routed Through Web Search

 ✍️Prompt Engineering
lesswrong.com·

RecurGuard: Runtime Monitoring for Reasoning-Token Consumption Attacks

 Inference  Content type: Academic
arxiv.org·

Bootstrapped Monitoring: Leveraging Transparent Reasoning to Oversee Stronger AI Agents

 ✍️Prompt Engineering  Content type: Academic
arxiv.org·

When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models

 ✍️Prompt Engineering  Content type: Academic
arxiv.org·

Calibration Drift Under Reasoning: How Chain-of-Thought Budgets Induce Overconfidence in Large Language Models

 ✍️Prompt Engineering  Content type: Academic
arxiv.org·

The Periodic Table of LLM Reasoning: A Structured Survey of Reasoning Paradigms, Methods, and Failure Modes

 ✍️Prompt Engineering  Content type: Academic
arxiv.org·

Visual Para-Thinker++: A Single-Policy Multi-Agent Framework for Visual Reasoning

 🤖AI Agents  Content type: Academic
arxiv.org·

LLMs+Graphs: Toward Graph-Native, Synergistic AI Systems

 🔗Graph Neural Networks  Content type: Academic
arxiv.org·

UniReason-Med: A Shared Grounded Reasoning Interface for 2D-to-3D Transfer in Medical VQA

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

Training Deliberative Monitors for Black-Box Scheming Detection

 🎛️Fine-tuning
lesswrong.com·

MODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning

 🧠LLMs  Content type: Academic
arxiv.org·

The Shibboleth Effect: Auditing the Cross-Lingual Distributional Skew of Large Language Models

 🧠LLMs  Content type: Academic
arxiv.org·

Building Better Activation Oracles

 ✍️Prompt Engineering
lesswrong.com·

IS-CoT: Breaking the Long-form Generation Collapse via Interleaved Structural Thinking

 ✍️Prompt Engineering  Content type: Academic
arxiv.org·

Benchmarking Large Language Models for Safety Data Extraction

 ✍️Prompt Engineering  Content type: Academic
arxiv.org·