Computer Graphics

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

 Flash Attention  Content type: Code
github.com · Hacker News

AgentCompile: An LLM-Guided Compiler for Direct CUDA Inference

 🤖AI  Content type: Academic
arxiv.org

RenderLab – Prototype rendering techniques and renderers in the browser

 🎮Game Development

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

 💬LLMs
smolhub.com · r/LocalLLaMA

DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200

 💬LLMs  Content type: News

Open source building blocks for computational design. Est. 2006

 💻Programming Languages
thi.ng · Hacker News

An introduction to the Linux graphics stack

 📚Speculative Fiction
crosscat.me

Unsloth Gemma 4 QAT

 Quantization
unsloth.ai

Path-Traced Inverse Rendering with Global Illumination in 3D Gaussian Fields

 🎮Game Development  Content type: Academic
arxiv.org

nex-agi/Nex-N2-mini • Huggingface

 🤖AI

I stopped using most of Rust’s advanced features for my ML library

 🤖AI  Content type: Code
github.com · r/rust

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

 🖥️GPU Programming  Content type: Academic
arxiv.org

sgl-project/sglang-omni: SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

 Hardware Acceleration  Content type: Code
github.com

AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis

 🤖AI  Content type: Academic
arxiv.org · Hacker News

ASTRA-sim 3.0: Next-Level Distributed Machine Learning Simulations via High-Fidelity GPU and Infrastructure Modeling

 🖥️GPU Programming  Content type: Academic
arxiv.org

Density Field State Space Models: 1-Bit Distillation, Efficient Inference, and Knowledge Organization in Mamba-2

 🖥️GPU Programming  Content type: Academic
arxiv.org

Parallel Causal Associative Fields: Gated Sparse Memory for Long-Context Language Modeling

 Transformers  Content type: Academic
arxiv.org

Does anyone know what PCIe mode was used for these benchmarks?

 💬LLMs  Content type: Code
github.com · r/LocalLLaMA

Communication Strategy Selection for Multi-GPU 3D FDTD with Convolutional Perfectly Matched Boundary Layers

 🖥️GPU Programming  Content type: Academic
arxiv.org

LLM-Based Porting of Optimized C++ to CUDA Through Deoptimization and Reoptimization

 🖥️GPU Programming  Content type: Academic
arxiv.org
