ProofOfThought: LLM-based reasoning using Z3 theorem proving
dev.to·17h·
Discuss: DEV
SMT Integration
Property-based testing of batch-invariant operations
mmaaz.ca·7h·
Discuss: Hacker News
🧪Property-Based Testing
Beyond the Final Answer: Evaluating the Reasoning Trajectories of Tool-Augmented Agents
arxiv.org·2h
🔍Concolic Testing
Three important things to get right for successful AI Coding
kau.sh·13h
Proof Automation
How to Train an LLM to Do Proofs: Beyond Verifiable Rewards
tobysimonds.com·1d·
Discuss: Hacker News
🎯Interactive Provers
ProofOfThought: LLM-based reasoning using Z3 theorem proving
dev.to·1d·
Discuss: DEV
🧮Z3 Solver
Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)
magazine.sebastianraschka.com·19h·
Discuss: Hacker News
🌳Context free grammars
Prompting Techniques for Specialised LLMs
dev.to·13h·
Discuss: DEV
🔗Constraint Handling
A grand week
blog.mitrichev.ch·17h·
🧮SMT Solvers
Python PEP 636 – Structural Pattern Matching: Tutorial
peps.python.org·19h·
Discuss: Hacker News
📝Concrete Syntax
Seriously Testing LLMs
satisfice.com·5h
🔍Concolic Testing
94% of AI Developers Ignore This Theorem Prover. Here's Why That's Costing Millions.
dev.to·23h·
Discuss: DEV
⚙️Proof Engineering
A Practical Guide to Generating Unit Tests with AI Code Assistants
qt.io·15m
📏Code Metrics
MathArena Apex: Unconquered Final-Answer Problems
matharena.ai·1d·
Discuss: Hacker News
🧮SMT Solvers
Automated Verification of Code Logic & Security Vulnerabilities via Hyperdimensional Semantic Analysis
dev.to·2h·
Discuss: DEV
📏Code Metrics
PRISM-Physics: Causal DAG-Based Process Evaluation for Physics Reasoning
arxiv.org·2h
Effect Handlers
Estimated tokens to merge (ETM) & other notes
gmays.com·11h
🌀Brotli Internals
LLM Prompt Fixed Point: the Ultimate Prompt
funcall.blogspot.com·2d·
Effect Handlers
Atomic and Saturated Models
functor.network·2d·
Discuss: Hacker News
🔢Denotational Semantics
An alternative to knowledge graphs for storing loosely structured content
fleetingswallow.com·16h·
Discuss: Hacker News
🕸️Knowledge Graphs