🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
✓ Automated Theorem Proving

Proof Assistants, Interactive Verification, Proof Search, Tactical Reasoning

Agentic-R1: Distilled Dual-Strategy Reasoning
arxiv.org·2d
🎮Verification Games
JavaScript is easy, but how good is Claude Code at writing Malbolge code?
somethingwithai.substack.com·19h·
Discuss: Substack
🐫Embedded OCaml
Automated Reasoning for Vulnerability Management by Design
arxiv.org·2d
🛡️seL4
Most comprehensive review of AI coding agents for Kotlin/Android tasks
jasonpearson.dev·1d·
Discuss: Hacker News
🧱Immutable Infrastructure
Position: We Need An Algorithmic Understanding of Generative AI
arxiv.org·9h
🎮Verification Games
Truth-value judgment in language models: 'truth directions' are context sensitive
arxiv.org·9h
➡️Category Theory
AI Evals: How To Systematically Improve and Evaluate AI
newsletter.eng-leadership.com·2d·
Discuss: r/programming
👁️System Observability
LLMs exploit our tolerance for sloppiness
humprog.org·2d·
Discuss: Hacker News
🎮Verification Games
SEO, Logorrhoea and the Rise of Sick AI
purpleorca.co.uk·4h·
Discuss: Hacker News
➡️Category Theory
I Changed My Mind: AI Will Replace Us
medium.com·18h·
Discuss: Hacker News
🤖Program Synthesis
Structured Prompts, Better Outcomes? Exploring the Effects of a Structured Interface with ChatGPT in a Graduate Robotics Course
arxiv.org·9h
⚙️PL Implementation
Mastering Language Models: A Deep Dive into Input Parameters
megaputer.com·19h·
Discuss: Hacker News
🧪Property-Based Testing
zkSDK: Streamlining zero-knowledge proof development through automated trace-driven ZK-backend selection
arxiv.org·2d
🔄Reproducible Builds
The hidden cost of AI reliance
codebytom.blog·1d·
Discuss: Hacker News, r/programming
🐫Embedded OCaml
Claude Code/Cursor is using grep? Are we devolving
news.ycombinator.com·4h·
Discuss: Hacker News
🤖Program Synthesis
Argumentative Characterizations of (Extended) Disjunctive Logic Programs
arxiv.org·2d
🎮Verification Games
How to scale RL to 10^26 FLOPs
blog.jxmo.io·16h·
Discuss: Hacker News
🎮Verification Games
A Compositional Approach to Diagnosing Faults in Cyber-Physical Systems
arxiv.org·2d
🔍Formal Verification
Improving AEBS Validation Through Objective Intervention Classification Leveraging the Prediction Divergence Principle
arxiv.org·9h
🔍Formal Verification
KeyKnowledgeRAG (K^2RAG): An Enhanced RAG method for improved LLM question-answering capabilities
arxiv.org·9h
🎮Verification Games
Loading...Loading more...
AboutBlogChangelogRoadmap