Model Quantization, Inference Optimization, GGUF Format, Privacy-preserving AI

An enough week
blog.mitrichev.ch·1d·
🧮Z3 Solver
Who watches the watchers? LLM on LLM evaluations
stackoverflow.blog·1d
📏Code Metrics
Tool or Agent? The impact of AI in your code and in your wallet It all boils down to math again!
blog.codeminer42.com·1d
Proof Automation
The key to conversational speech recognition
datasciencecentral.com·1d
🎵Audio ML
Expanding the Action Space of LLMs to Reason Beyond Language
arxiv.org·19h
🌳Context free grammars
OpenAI's inflated valuation, as I understand it
taloranderson.com·7h·
Discuss: Hacker News
🧠Intelligence Compression
How the Rise of Tabular Foundation Models Is Reshaping Data Science
towardsdatascience.com·1d
🧠Machine Learning
[D] Anyone using smaller, specialized models instead of massive LLMs?
reddit.com·1d·
🎯Performance Proofs
Show HN: Comparegpt.io – Trustworthy Mode to reduce LLM hallucinations
news.ycombinator.com·22h·
Discuss: Hacker News
🔍BitFunnel
What is a Large Language Model (LLM)
dev.to·6h·
Discuss: DEV
🌀Brotli Internals
AI Guardrails, Gateways, Governance Nightmares
go.mcptotal.io·16h·
Discuss: Hacker News
🎯Threat Hunting
LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?
arxiv.org·19h
🔗Parser Combinators
The Alignment Auditor: A Bayesian Framework for Verifying and Refining LLM Objectives
arxiv.org·2d
🧮Kolmogorov Complexity
InferenceMAX – open-source Inference Frequent Benchmarking
github.com·3h·
Discuss: Hacker News
Performance Mythology
TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning
arxiv.org·1d
🔨Compilers
100 Poisoned Examples Can Hijack Any AI Model (Even GPT-4-Scale LLMs)
dev.to·1d·
Discuss: DEV
Proof Automation
Integral Signatures of Activation Functions: A 9-Dimensional Taxonomy and Stability Theory for Deep Learning
arxiv.org·19h
🧠Machine Learning
In-Depth Analysis: "Attention Is All You Need"
dev.to·8h·
Discuss: DEV
🧠Intelligence Compression