ML Systems

machine learning infrastructure, MLOps, model serving, training

Feeds to Scour
SubscribedAll
Scoured 136 posts in 5.6 ms

AMD Radeon RX 9070 GRE vs. Nvidia GeForce RTX 5070

 🎮GPGPU
club386.com·

Machinic Psychopharmacology: Do LLMs Self-Medicate?

 📈Trading Systems
lesswrong.com··Hacker News

The Practitioner’s Guide to AgentOps

 🚀Performance Engineering

Built an open-source LLMOps Gateway with Docker, Kubernetes, CI/CD and Monitoring

 🌐Networking  Content type: Code
github.com··r/devops, r/reactjs
Less-relevant results

ASUS ExpertBook Ultra Flagship Business Laptop Debuts In SEA Markets, Featuring Sub-1kg Chassis & Intel Core Ultra X7 Processor

 🖥️Systems Programming
pokde.net·

Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good

 🚀Performance Engineering  Content type: Blog
towardsai.net·

Redis Data Integration in Redis Cloud is now GA in AWS

 📈Trading Systems  Content type: Blog
redis.io·

New comment by christyfthk in "Ask HN: Who is hiring? (June 2026)"

 ⚙️C++  Content type: Discussion

AI Native Landscape Launches as a Standalone Site

 🚀Performance Engineering  Content type: Blog
jimmysong.io·

Integrate OpenShift AI and PG Airman MCP Server

 📈Trading Systems
developers.redhat.com·

Using local LLMs for agentic coding

 🎮GPGPU  Content type: Blog
blog.alexewerlof.com·

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

 Cache Optimization  Content type: News  Content type: Blog
blog.google··Hacker News

Where to Host Your Open-Source Model (Under 10B Parameters)

 🚀Performance Engineering
digitalocean.com·

Youssof Altoukhi (@Youssofal_)

 🎯Low Latency
xcancel.com··r/LocalLLaMA

Understanding Agentic AI Infrastructure

 📈Trading Systems  Content type: Blog
mirantis.com·

not much happened today | AINews

 🚀Performance Engineering
news.smol.ai·

Local LLMs, Buy a GPU, and the Case for Cognitive Security

 🎮GPGPU

Build a local voice agent with Red Hat OpenShift AI

 🎮GPGPU
developers.redhat.com·

[AINews] FrontierCode: Benchmarking for Code Quality over Slop

 🚀Performance Engineering  Content type: News
latent.space
·

huawei-csl/KVarN: KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. Calibration-free, one flag.

 🔀Parallel Computing  Content type: Code
github.com··Hacker News
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help