Attention Optimization, Memory Efficiency, Transformer Acceleration, IO-Aware

Bandits in Your LLM Gateway
tensorzero.com·17h
🏎️TensorRT
Government Lawyers Oversight Watchdog, Amazon Music, DeepSeek, More: Tuesday Afternoon ResearchBuzz, November 11, 2025
researchbuzz.me·11h
📊Profiling
A hypothetical search service on S3 with Tantivy and warm cache on NVMe
shayon.dev·1d
ONNX Runtime
Generative AI and the bullshit singularity
daedtech.com·21h
🤖AI Coding Tools
AOC Q27G4ZMN 27-inch QHD Mini LED 240 Hz gaming monitor review: Incredible performance and value
tomshardware.com·19h
⏱️Benchmarking
Rewrite of Gemini API (AI content)
funcall.blogspot.com·13h
🤖AI Coding Tools
The Era of Haves and Have-Nots
kellblog.com·13h
ONNX Runtime
Why Document Image Compression is Crucial in Web Apps - And How to Do It Properly
dev.to·21h
🏎️TensorRT
Against Powerful Text Editors
lesswrong.com·2d
🐕Ruff
Wired for Words: Understanding Language and the Brain
psychologytoday.com·2d
🧩Attention Kernels
Stellar Blockchain to Power Turbo Energy’s Tokenized Clean Energy Financing Initiative
coindesk.com·17h
🔗NCCL
Testing Data Stories with Simulated Audiences
cacm.acm.org·17h
🏎️TensorRT
Baseten takes on hyperscalers with new AI training platform that lets you own your model weights
venturebeat.com·1d
ONNX Runtime
AI Papers to Read in 2025
towardsdatascience.com·6d
📊Gradient Accumulation
IonQ's Q3: Inflection Is Near
seekingalpha.com·1d
🐕Ruff
r/SillyTavernAI
reddit.com·2d
💡LSP
Weekly #45-2025: PHP Tricks, LLM Collaboration, SQL Speedups, and the Future of Web Payments
dev.to·3d
💡LSP
I used NotebookLM to help me build a PC, and it was better than any other tool
xda-developers.com·2d
🎯Tensor Cores
Imaginarium: Vision-guided High-Quality 3D Scene Layout Generation
paperium.net·3d
🏎️TensorRT