Operator Fusion, Memory Bandwidth, Graph Optimization, Intermediate Elimination

I built 10k robots simulation with collision avoidance in WebGPU (HTML)
physical-ai.ghost.io·19h·
🔧PTX
Flag this post
Feature Stores 2.0: The Next Frontier of Scalable Data Engineering for AI
hackernoon.com·2d
ONNX Runtime
Flag this post
Researchers want to kill the vibe, propose better model for AI coding
theregister.com·1h
🤖AI Coding Tools
Flag this post
Geonum – geometric number library for unlimited dimensions with O(1) complexity
github.com·4d·
Discuss: Hacker News
✂️CUTLASS
Flag this post
Show HN: Linguistic RL – A 7B model discovers Occam's Razor through reflection
github.com·1h·
🎓Model Distillation
Flag this post
MeixnerNet: Adaptive and Robust Spectral Graph Neural Networks with Discrete Orthogonal Polynomials
arxiv.org·3d
🔀Operator Fusion
Flag this post
LDBT instead of DBTL: combining machine learning and rapid cell-free testing
nature.com·2d
🎓Model Distillation
Flag this post
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
arxiv.org·3d
📊Gradient Accumulation
Flag this post
Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals
arxiv.org·4d
🧮cuDNN
Flag this post
KTransformers Open Source New Era: Local Fine-tuning of Kimi K2 and DeepSeek V3
reddit.com·3d·
Discuss: r/LocalLLaMA
ONNX Runtime
Flag this post
Understanding Support Vector Machines SVM: Origins, Working, and Real-World Applications
dev.to·4d·
Discuss: DEV
📊Gradient Accumulation
Flag this post
My Quest for Speed: How a Clickhouse Type Improvement Led Me Down a Caching Rabbit Hole in Rust
dev.to·15h·
Discuss: DEV
📊Profiling Tools
Flag this post
Merging Continual Pretraining Models for Domain-Specialized LLMs: A Case Study in Finance
arxiv.org·2d
🔗NCCL
Flag this post
Unlocking AI Vision with the Wisdom of Cats: Building Generalizable Models
dev.to·2d·
Discuss: DEV
🧮cuDNN
Flag this post
Google Unveils Ironwood, Its ‘Most Powerful’ and ‘Energy-Efficient’ AI Chip to Date
techrepublic.com·1h
🎯Tensor Cores
Flag this post
Quantum-Resistant Federated Learning: Securing Distributed Model Training Against Future Cryptanalytic Attacks
dev.to·1d·
Discuss: DEV
📉Model Quantization
Flag this post