Kimi Linear: An Expressive, Efficient Attention Architecture
arxiviq.substack.com·10h·
Discuss: Substack
🧠LLM Inference
The NVIDIA Dependency Nightmare on Ubuntu: A Deep Dive for Data Science
pub.towardsai.net·2h
🤖AI
Feature Infrastructure Engineering: A Comprehensive Guide
mlfrontiers.substack.com·17h·
Discuss: Substack
🎛️Feed Filtering
$5K inference rig build specs? Suggestions please.
reddit.com·19h·
Discuss: r/LocalLLaMA
📊Model Serving Economics
EP187: Why is DeepSeek-OCR such a BIG DEAL?
blog.bytebytego.com·17h
💾Prompt Caching
Announce "orb" as a runtime abstraction and "razor-rpc"
reddit.com·23h·
Discuss: r/rust
🌐Axum
Smaller Surfaces
nrempel.com·11h·
Discuss: Hacker News
⚙️Language Runtimes
Objects as Random Access Memory
tbr.bearblog.dev·6h
🗜️Compaction
From hours to seconds: AI tools to detect animal calls
seangoedecke.com·2h·
Discuss: Hacker News
🔢BitNet
Cognotik: A New FOSS AI Coding Assistant. For JetBrains IDEs
github.com·8h·
Discuss: Hacker News
🔧Developer tools
Weak-To-Strong Generalization
lesswrong.com·6h
🔤Tokenization
Speedrunning an RL Environment
sidb.in·22h·
Discuss: Hacker News
🕳LLM Vulnerabilities
Joy & Curiosity #60
registerspill.thorstenball.com·1h
🪄Prompt Engineering
Is 'human' a risky AGI target
nullsy.com·9h·
Discuss: Hacker News
🎭Claude
Research roundup: 6 cool science stories we almost missed
arstechnica.com·16h
🍄Mycorrhizal Networks
L16 Benchmark: How Prompt Framing Affects Truth, Drift, and Sycophancy in GEMMA-2B-IT vs PHI-2
colab.research.google.com·19h·
Discuss: r/LocalLLaMA
🪄Prompt Engineering
Machine Scheduler in LLVM – Part II
myhsu.xyz·3h·
Discuss: Hacker News
⚙️Mechanical Sympathy
The End of Cloud Inference
docs.google.com·17h·
Discuss: Hacker News
📊Model Serving Economics
Beyond Brute Force: 4 Secrets to Smaller, Smarter, and Dramatically Cheaper AI
hackernoon.com·18h
🤖AI