jimman's Top Finds
Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
sebastianraschka.com·14h·
Discuss: r/LLM
LLM Optimization
Good abstractions for humans turn out to be good abstractions for LLMs
betweentheprompts.com·2h·
Discuss: Hacker News
✍️Prompt Engineering
A Practitioner's Guide to Kolmogorov-Arnold Networks
arxiviq.substack.com·23h·
Discuss: Substack
LLM Optimization
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·12h
Model Efficiency
Relation-Aware Bayesian Optimization of DBMS Configurations Guided by Affinity Scores
arxiv.org·12h
Model Efficiency
Orchestrating Chaos: Unleashing the Power of Bio-Inspired AI for Autonomous System Design by Arvind Sundararajan
dev.to·12h·
Discuss: DEV
✍️Prompt Engineering
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·16h·
Discuss: Hacker News
Model Efficiency
How to Use Multimodal AI Models With Docker Model Runner
docker.com·4h
🤖AI
We found embedding indexing bottleneck in the least expected place: JSON parsing
nixiesearch.substack.com·1h·
Discuss: Substack
LLM Optimization
AI Summarization Optimization
schneier.com·5h
✍️Prompt Engineering
Small Vs. Large Language Models
semiengineering.com·9h·
Discuss: Hacker News
Model Efficiency
Scaling Coding-Agent RL to 32x H100s. 160% Improvement on Stanford's TBench
github.com·5h·
LLM Optimization
Reasoning Models Sometimes Output Illegible Chains of Thought
arxiv.org·12h
✍️Prompt Engineering
Dive into Systems
diveintosystems.org·50m·
Discuss: Hacker News
✍️Prompt Engineering
Can-t stop till you get enough
cant.bearblog.dev·23h·
Discuss: Hacker News
🤖AI
Enhanced Richardson Extrapolation via Adaptive Kernel Regression and Uncertainty Quantification
dev.to·3h·
Discuss: DEV
Model Efficiency
Generation at the Speed of Thought: Speculative Decoding
bittere.substack.com·1d·
Discuss: Substack
LLM Optimization
Building Syllabi – Agentic AI with Vercel AI SDK, Dynamic Tool Loading, and RAG
dev.to·15h·
Discuss: DEV
✍️Prompt Engineering
The Illustrated NeurIPS 2025: A Visual Map of the AI Frontier
newsletter.languagemodels.co·2h
LLM Optimization
Document-Driven Development in Next.js: How I Stopped Losing My Mind Managing Requirements
danielkliewer.com·5h·
✍️Prompt Engineering