Inference

Feeds to Scour
SubscribedAll
Scoured 348 posts in 6.7 ms

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

 🚀Model Releases  Content type: News  Content type: Blog
developer.nvidia.com·

Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script

 🧠AI  Content type: Code
github.com··Hacker News

Ultrafast machine learning on FPGAs via Kolmogorov-Arnold Networks

 🤖Machine Learning

For Robotaxis, Safety Must Be Built In, Not Bolted On

 🧠AI  Content type: Blog
blogs.nvidia.com·

Vadzo Imaging Introduces HDR MIPI CSI-2 Embedded Cameras Recommended for Drone and UAV Applications

 🤖Machine Learning  Content type: News
einpresswire.com·

google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation

 💬LLMs

Show HN: Ext-Infer

 🦀Rust

🇳🇱 Go/Golang job: Senior Backend Engineer (Go) | Studio AI at Creative Fabrica (Amsterdam, Netherlands)

 👨‍🏫Karpathy
golangprojects.com·

Why agentic AI needs an open inference stack

 🕵️AI Agents
redhat.com·

MLPerf and the rise of latency-aware LLM benchmarking

 Transformers
edn.com·

TFLite Edge Model Quantizer Snippet

 💬LLMs

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support

 🎨Diffusion Models
phoronix.com·

LC-QAT: Data-Efficient 2-Bit QAT for LLMs via Linear-Constrained Vector Quantization

 💬LLMs  Content type: Academic
arxiv.org·

The latest Gemma 4 models use a training trick to slash their on-device memory footprint

 🧠AI
androidauthority.com·

What's in the Box? A Field Guide to AI Models

 🧠AI  Content type: Blog
iankduncan.com·

Google’s DiffusionGemma is 4x faster than its other Gemma models

 🎨Diffusion Models
thenewstack.io·

MiMo-v2.5-Pro-UltraSpeed: 1T model with 1000 TPS

 🎯Fine-Tuning  Content type: Blog

A field journal on Ray Data and Daft for multimodal data lake (14 minute read)

 📊AI Evals  Content type: Blog
mehulbatra.medium.com·

Azure OpenAI Architecture: The Decisions That Actually Matter (Part 2)

 💬LLMs

Latest technical articles & videos.

 🎯Fine-Tuning
certdepot.net·
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help