CUDA

Feeds to Scour
SubscribedAll
Scoured 43 posts in 6.5 ms

Microsoft Weekly: Surface Laptop Ultra, Windows 11 context menus, Build 2026 recap, and more

 🏗️AI Infra
neowin.net·

Vortex 3.0 Released As Full-Stack, Open-Source RISC-V GPU Now With 3D Pipeline

 💻GPU Computing
phoronix.com·
Less-relevant results

Vortex expands open RISC-V graphics

 💻GPU Computing
jonpeddie.com·

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

 💻GPU Computing
smolhub.com··r/LocalLLaMA

Edge AI deployment made easy for system integrators

 🏗️AI Infra
edn.com·

Nvidia's best GPU feature is hiding in VLC's settings, and you're probably missing it

 💻GPU Computing
xda-developers.com·

Build a local voice agent with Red Hat OpenShift AI

 🧠LLMs
developers.redhat.com·

Density Field State Space Models: 1-Bit Distillation, Efficient Inference, and Knowledge Organization in Mamba-2

 🏗️AI Infra  Content type: Academic
arxiv.org·

Five labs, five minds: building a multi-model finance drama on small models

 🏗️AI Infra  Content type: Blog
huggingface.co·

Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script

 🧠LLMs  Content type: Code
github.com··Hacker News

This Is the Hidden ‘AI Tax’ That Founders Need to Budget For

 💻GPU Computing
entrepreneur.com·

Holding the FP8 Quality Ceiling at 8-Bit Weights and Activations: INT8 and GGUF Post-Training Quantization of Ideogram 4.0 for Consumer GPUs

 💻GPU Computing  Content type: Academic
arxiv.org·

NetX-lab/Frontier: Frontier: A Discrete-Event Simulator for Modern LLM Serving

 🧠LLMs  Content type: Code
github.com··Hacker News

Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon

 🐧Operating Systems  Content type: Blog
tridao.me··Hacker News

SET: Stream-Event-Triggered Scheduling for Efficient CUDA Graph Pipelines

 🏗️AI Infra  Content type: Academic
arxiv.org·

Gated DeltaNet, From First Principles

 💻GPU Computing  Content type: Blog

🥇Top AI Papers of the Week

 🏗️AI Infra  Content type: News
nlp.elvissaravia.com·

sgl-project/sglang-omni: SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

 💻GPU Computing  Content type: Code
github.com·

Beyond Per-Token Pricing: A Concurrency-Aware Methodology for LLM Infrastructure Cost Estimation

 🧠LLMs  Content type: Academic
arxiv.org·

Open source building blocks for computational design. Est. 2006

 💻GPU Computing
thi.ng··Hacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help