Enabling Trillion-Parameter Models on AWS EFA
research.perplexity.ai·10h·
Discuss: Hacker News
Performance Engineering
Flag this post
H-FA: A Hybrid Floating-Point and Logarithmic Approach to Hardware Accelerated FlashAttention
arxiv.org·1d
🔐ChaCha20
Flag this post
Moving past speculation: How deterministic CPUs deliver predictable AI performance
venturebeat.com·3d
🏗Computer Architecture
Flag this post
Crushing ML Latency: The (Un)Official Best Practices for Systems Optimisation
pub.towardsai.net·5h
🚀Performance
Flag this post
Powering the Future of AI: L40S GPU Server vs H100 GPU Server
dev.to·5h·
Discuss: DEV
🎮WebGPU
Flag this post
KTransformers Open Source New Era: Local Fine-tuning of Kimi K2 and DeepSeek V3
reddit.com·21h·
Discuss: r/LocalLLaMA
📱Edge AI
Flag this post
Supercharging the ML and AI Development Experience at Netflix
netflixtechblog.com·14h
💬Prompt Engineering
Flag this post
BoxLambda OS Software Architecture, First Draft
epsilon537.github.io·7h·
Discuss: Hacker News
💻Operating Systems
Flag this post
Ubuntu Blog: Edge Networking gets smarter: AI and 5G in action
ubuntu.com·3h
🌍Edge Computing
Flag this post
Radar Trends to Watch: November 2025
oreilly.com·22h
🎭Program Synthesis
Flag this post
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·2d·
Discuss: Hacker News
🎮WebGPU
Flag this post
How to code MPU-6050 on STM32CubeIDE?
dev.to·1h·
Discuss: DEV
🎛️Microcontrollers
Flag this post
A Soft‑Fork Proposal for Blockchain‑Based Distributed AI Computation
hackernoon.com·1d
⛓️Blockchain
Flag this post
NOWS: Neural Operator Warm Starts for Accelerating Iterative Solvers
arxiv.org·5h
🔢NumPy
Flag this post
Geonum – geometric number library for unlimited dimensions with O(1) complexity
github.com·1d·
Discuss: Hacker News
📏Linear Types
Flag this post
A hitchhiker's guide to CUDA programming
seanzhang.me·5d·
Discuss: Hacker News
🔀SIMD Programming
Flag this post
Why stop at 1 million tokens when you can have 10? My journey to extreme context on a gaming GPU. [P]
reddit.com·22h·
📱Edge AI
Flag this post
Detailed Technical Documentation on AI Implementation Logic (Taking Large Language Models as an Example )
nbtab.com·1d·
Discuss: DEV
📱Edge AI
Flag this post
Why is AI Generated Rust slow when compared with Go/C#/Node/JavaScript
srid68.github.io·19h·
Discuss: Hacker News
🦀Rust
Flag this post