An enough week
blog.mitrichev.ch·16h
🧮Z3 Solver
Is ChatGPT-5 Able to Provide Proofs for Advanced Mathematics?
machinelearningmastery.com·3d
🎯Proof Tactics
OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference
arxiv.org·8h
💻Local LLMs
Show HN: I built a local AI agent desk toy
blog.simone.computer·1d
🎙️Whisper
On the Pure Quantum Polynomial Hierarchy and Quantified Hamiltonian Complexity
arxiv.org·1d
⚛️Quantum Algorithms
Valid Stopping for LLM Generation via Empirical Dynamic Formal Lift
arxiv.org·1d
💻Programming languages
The Alignment Auditor: A Bayesian Framework for Verifying and Refining LLM Objectives
arxiv.org·2d
💻Local LLMs
Think Natively: Unlocking Multilingual Reasoning with Consistency-Enhanced Reinforcement Learning
arxiv.org·1d
🧮SMT Solvers
CAM: A Constructivist View of Agentic Memory for LLM-Based Reading Comprehension
arxiv.org·2d
📝Concrete Syntax
94% of Developers Waste Tokens on Reasoning LLMs. Here's Why.
dev.to·6h
💻Local LLMs
Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data
arxiv.org·1d
🧠Learned Indexes
Which Heads Matter for Reasoning? RL-Guided KV Cache Compression
arxiv.org·8h
📼Cassette Combinators
Two-Stage Voting for Robust and Efficient Suicide Risk Detection on Social Media
arxiv.org·8h
💾Binary Linguistics
IKNet: Interpretable Stock Price Prediction via Keyword-Guided Integration of News and Technical Indicators
arxiv.org·8h
🧠Learned Indexing
Do We Really Need SFT? Prompt-as-Policy over Knowledge Graphs for Cold-start Next POI Recommendation
arxiv.org·8h
🎯Content Recommendation
Certifiable Safe RLHF: Fixed-Penalty Constraint Optimization for Safer Language Models
arxiv.org·3d
🔗Parser Combinators
Effective and Stealthy One-Shot Jailbreaks on Deployed Mobile Vision-Language Agents
arxiv.org·8h
🕵️Vector Smuggling