Show HN: Everything it took to run an LLM at 10k tok/s on H200s
relace.aiยท22hยท
Discuss: Hacker News
๐Ÿ“Code Metrics
Flag this post
Long-Context Modeling with Dynamic Hierarchical Sparse Attention for On-Device LLMs
arxiv.orgยท1d
๐Ÿ’ปLocal LLMs
Flag this post
A Little Update on My RSS Setup
512pixels.netยท1h
๐Ÿ“ฐRSS Reading Practices
Flag this post
Everything About Transformers
krupadave.comยท9h
๐Ÿ“Text Parsing
Flag this post
Computing High-Frequency Factors in Real Time for Quantitative Models
medium.comยท26mยท
Discuss: Hacker News
๐ŸŒŠStreaming Databases
Flag this post
TheStageAI/TheWhisper: up to 3x faster optimized Whisper models for streaming and on-device use
github.comยท18hยท
๐ŸŽ™๏ธWhisper
Flag this post
๐Ÿš€ Go Faster: Cutting the Slack in GC with Smart Memory Allocation
dev.toยท1dยท
Discuss: DEV
๐Ÿง Memory Allocators
Flag this post
Issue 496
haskellweekly.newsยท3h
๐Ÿ”—Functional Compilers
Flag this post
Tracking an evolving Discord-based RAT family
reversinglabs.comยท1d
๐Ÿฆ Malware Analysis
Flag this post
Challenges in Building Natural, Lowโ€‘Latency, Reliable Voice Assistants
hackernoon.comยท9h
๐ŸŽตAudio Streaming
Flag this post
Writing an LLM from scratch, part 25 โ€“ instruction fine-tuning
gilesthomas.comยท18hยท
Discuss: Hacker News
โšกProof Automation
Flag this post
Bringing Vision-Language Intelligence to RAG with ColPali
towardsdatascience.comยท21h
๐Ÿ“Concrete Syntax
Flag this post
Java Generics and Collections โ€ข Maurice Naftalin & Stuart Marks โ€ข GOTO 2025
youtube.comยท2h
ฮปLambda Formalization
Flag this post
Torchforge โ€“ a PyTorch native library for scalable RL post-training
pytorch.orgยท4hยท
Discuss: Hacker News
๐Ÿ–ฅ๏ธGame Emulation
Flag this post
Kafka is Fast โ€“ I'll use Postgres
topicpartition.ioยท1dยท
๐ŸŒŠStreaming Databases
Flag this post
Vectorizing for Fun and Performance
ibm.comยท22hยท
Discuss: Hacker News
โšกSIMD Vectorization
Flag this post
Lessons from scaling live events at Patreon
patreon.comยท1dยท
๐ŸŒŠStreaming Systems
Flag this post
How fast can an LLM go?
fergusfinn.comยท5hยท
Discuss: Hacker News
๐ŸŽฏEmulator Accuracy
Flag this post
Falcon: A Comprehensive Chinese Text-to-SQL Benchmark for Enterprise-Grade Evaluation
arxiv.orgยท11h
๐Ÿ‡จ๐Ÿ‡ณChinese Computing
Flag this post