Feeds to Scour
SubscribedAll
Scoured 19870 posts in 664.4 ms
Understanding LLM Inference Engines: Inside Nano-vLLM (Part 1)
neutree.ai·1d·
🧠LLM Inference
Preview
Report Post
Training LLMs with Fault Tolerant HSDP on 100,000 GPUs
arxiv.org·1d
📦Batch Embeddings
Preview
Report Post
Writing an LLM from scratch, part 32a -- Interventions: training a baseline model
gilesthomas.com·3h·
Discuss: Hacker News
🏆LLM Benchmarking
Preview
Report Post
Beyond Two Towers: Re-architecting the Serving Stack for Next-Gen Ads Lightweight Ranking Models…
medium.com·1d
📊Feed Optimization
Preview
Report Post
luml-ai/luml: LUML is an open-source MLOps/LLMOps platform, allowing to build and deploy AI/ML models in a matter of minutes.
github.com·14h·
Discuss: Hacker News
🦙Ollama
Preview
Report Post
Thoughts on Toby Ords' AI Scaling Series
lesswrong.com·4h
🏆LLM Benchmarking
Preview
Report Post
Self-Optimizing Football Chatbot Guided by Domain Experts on Databricks
databricks.com·11h
🏆LLM Benchmarking
Preview
Report Post
The Architecture of Trust: Guardrails for Production Generative AI Applications and the Llama
pub.towardsai.net·1h
🛡️AI Security
Preview
Report Post
ML for Energy-Performance-Aware Scheduling On Heterogeneous Multicore Architectures (Cambridge)
semiengineering.com·1d
📊Model Serving Economics
Preview
Report Post
Understanding Efficiency: Quantization, Batching, and Serving Strategies in LLM Energy Use
arxiv.org·2d
🧠LLM Inference
Preview
Report Post
AI Cost Considerations Every Engineer Should Know
vantage.sh·9h·
Discuss: Hacker News
💰Tokenomics
Preview
Report Post
LLM Fine-tuning Providers. Customize Large Language Models on… | by Xin Cheng | Feb, 2026
billtcheng2013.medium.com·1d
🏆LLM Benchmarking
Preview
Report Post
Together AI welcomes Alon Gavrielov as VP of Infrastructure Strategy
together.ai·1d
🖥GPUs
Preview
Report Post
Show HN: Polymcp and Ollama for Simple Local and Cloud LLM Execution
news.ycombinator.com·1d·
Discuss: Hacker News
🦙Ollama
Preview
Report Post
How Grab Built a Vision LLM to Scan Images
blog.bytebytego.com·13h
🔤Tokenization
Preview
Report Post
Owning a $5M data center
blog.comma.ai·11h
🏠Self-hosting
Preview
Report Post
jameshaydon/sentinel - MCP guardrailing for LLM agents using logic programming
github.com·18m
📋MCP
Preview
Report Post
Postdoc in Milan on scalability for high-dimensional Bayesian learning
statmodeling.stat.columbia.edu·1d
🧠Inference Serving
Preview
Report Post
LLMs as the new high level language
federicopereiro.com·1d·
🪄Prompt Engineering
Preview
Report Post
The maturity gap in ML pipeline infrastructure
chainguard.dev·2d·
Discuss: r/programming
🛡️AI Security
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help