🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🧠 LLM Inference

Quantization, Attention Mechanisms, Batch Processing, KV Caching

Parallelization Strategies in Neural Networks
nwktimes.blogspot.com·15h·
Discuss: Hacker News
⚡Hardware Acceleration
How Can You Tell if You've Instilled a False Belief in Your LLM?
lesswrong.com·6h
🏆LLM Benchmarking
i had to prompt inject the @united airlines bot because it kept refusing to connect me with a human
threadreaderapp.com·23h
📊ModernBERT
LLMs, Quantum Measurement, and a Primitive of Consciousness
understoryai.substack.com·16h·
Discuss: Substack
🪄Prompt Engineering
Baseten raises $150M Series D for inference infra but where’s the real bottleneck?
reddit.com·6h·
Discuss: r/LocalLLaMA
📊Model Serving Economics
AI could one day replace tutors, but its reliability still lags
phys.org·7h
🏆LLM Benchmarking
Knowledge and memory
robinsloan.com·15h
🪄Prompt Engineering
GLM 4.5 with Claude Code is a killer combination
docs.z.ai·22h·
Discuss: Hacker News
🏆LLM Benchmarking
[Level 1] Building Personalized Text Summarization - Following up on Personal Chatbot Success
colab.research.google.com·7h·
Discuss: r/LocalLLaMA
👨‍💻AI Coding
Let's generate our own LLM fine-tuning dataset (100% local):
threadreaderapp.com·17h
👨‍💻AI Coding
Show HN: Reverse vs. Vectorized Forward Ad: A Performance Exploration in C
raph5.github.io·13h·
Discuss: Hacker News
⚡SIMD Optimization
GPT-5 Thinking in ChatGPT is shockingly good at search and demonstrates the potential of combining tool calling with chain-of-thought reasoning (Simon Willison/...
techmeme.com·1h
💳Content Monetization
LifeGPT: Generative pretrained transformer model for cellular automata
nature.com·16h·
Discuss: Hacker News
🆕New AI
LLMs Are Adaptive Data Organisms
worldgov.org·6h·
Discuss: Hacker News
🏆LLM Benchmarking
5 ML Mistakes That Scream “Student” (And How to Fix Them)
pub.towardsai.net·9h
🛡️AI Security
Follow up experiments on preventative steering
lesswrong.com·19h
🛡️AI Safety
“Everyone knows” what an autoencoder is… but there's an important complementary picture missing from most introductory material.
threadreaderapp.com·2h
📊Embeddings
EP179: Kubernetes Explained
blog.bytebytego.com·8h
💾Persistence Strategies
Under the Hood of Fuzzy Search: Building a Search Engine 15 times fuzzier than Lucene
andrewjsaid.com·4h·
Discuss: r/programming
💻Programming languages
EPYC vs. Xeon for Hybrid Inference Server?
reddit.com·9h·
Discuss: r/LocalLLaMA
⚙️Mechanical Sympathy
Loading...Loading more...
AboutBlogChangelogRoadmap