How to Train an LLM to Do Proofs: Beyond Verifiable Rewards
tobysimonds.com·1d·Discuss: Hacker News
🎯Interactive Provers
Automatic Building Code Review: A Case Study
arxiv.org·10h
📏Code Metrics
ProofOfThought: LLM-based reasoning using Z3 theorem proving
dev.to·1d·Discuss: DEV
🧮Z3 Solver
The Power of Three: Ternary Logic, Triolectics, and Three Sided Football
sothismedias.com·4h·Discuss: Hacker News
🧮Theoretical Computer Science
Quit Begging Your LLM: Master the Art of Task Framing
hackernoon.com·13h
🧮Theorem Proving
Higher-Level Design Patterns
qouteall.fun·3d·Discuss: Hacker News
Algebraic Effects
Training Dynamics of Parametric and In-Context Knowledge Utilization in Language Models
arxiv.org·10h
🌲Parse Trees
Property-based testing of batch-invariant operations
mmaaz.ca·15h·Discuss: Hacker News
🧪Property-Based Testing
Adventures on the AI Coding side of things
medium.com·6h·Discuss: Hacker News
🌍Cultural Algorithms
Retrv-R1: A Reasoning-Driven MLLM Framework for Universal and Efficient Multimodal Retrieval
arxiv.org·10h
🗂️Vector Search
Solving 2-SAT
nima101.github.io·5d·Discuss: Hacker News
🔗Constraint Handling
High-Quality Pull-Request Descriptions
racecondition.software·18h·Discuss: Hacker News
⚙️Proof Engineering
The future of your code is no-code
pleasedontdeploy.com·1h·Discuss: Hacker News
📏Code Metrics
An Overview of Modern Memory Management Architectures in LLM Agents
vinithavn.medium.com·1d·Discuss: Hacker News
💾Persistence Strategies
A Primer on Memory Consistency and Cache Coherence, Second Edition
link.springer.com·20h·Discuss: r/programming
Cache Coherence
Language Agnostic Programming: Why you may still need code
joaquimrocha.com·22h·Discuss: Hacker News
💻Programming languages
Autoreview: The Dragon Hatchling – The Missing Link Between the Transformer and
arxiviq.substack.com·1d·Discuss: Substack
🔲Cellular Automata
Explore Briefly, Then Decide: Mitigating LLM Overthinking via Cumulative Entropy Regulation
arxiv.org·3d
Effect Handlers
DiffuSpec: Unlocking Diffusion Language Models for Speculative Decoding
arxiv.org·10h
🎙️Whisper
Mitigating Modal Imbalance in Multimodal Reasoning
arxiv.org·10h
Bidirectional Typing