jimman's Feed · Scour

chatprd.ai·9h

🔍AI Interpretability

Hitting 1,000 tokens per second on a single RTX 5090

blog.alpindale.net·3h·

Discuss: Hacker News

⚡Model Efficiency

SDFP: Speculative Decoding with FIT-Pruned Models for Training-Free and Plug-and-Play LLM Acceleration

arxiv.org·2d

⚡LLM Optimization

The Importance of Prompts in the AI Era (and Why Prompt Sharing Platforms Matter)

dev.to·20h·

Discuss: DEV

✍️Prompt Engineering

From Prediction to Compilation: A Manifesto for Intrinsically Reliable AI

news.ycombinator.com·14h·

Discuss: Hacker News

✍️Prompt Engineering

When Clever Hardware Hacks Bite Back: A Password Keeper Device Autopsy

hackaday.com·1d

Quantization-Aware Distillation

ternarysearch.blogspot.com·23h·

Discuss: Hacker News

⚡LLM Optimization

Is Your Machine Learning Pipeline as Efficient as it Could Be?

kdnuggets.com·2d

⚡Model Efficiency

Show HN: A Prompting Framework for Non-Vibe-Coders

github.com·18h·

Discuss: Hacker News

✍️Prompt Engineering

Show HN: Model Training Memory Simulator

czheo.github.io·16h·

Discuss: Hacker News

⚡Model Efficiency

AI Workflows with human-in-the-loop

weavemind.ai·17h·

Discuss: Hacker News

✍️Prompt Engineering

A Guide to Effective Prompt Engineering

blog.bytebytego.com·4d

✍️Prompt Engineering

Sign up or login to customize your feed and get personalized topic recommendations

Understanding LLM Inference Engines: Inside Nano-vLLM (Part 2)

neutree.ai·2d·

Discuss: Hacker News

⚡LLM Optimization

Zero-Latency Local AI: Tuning Your Linux Kernel for LLM Inference 🐧🧠

dev.to·1d·

Discuss: DEV

⚡LLM Optimization

Experiments in building bespoke tools with AI

knlb.dev·1h·

Discuss: Hacker News

✍️Prompt Engineering

Software Engineering with AI: Beyond Vibe-Coding

principalengineer.com·11h

✍️Prompt Engineering

isledb: An embedded key-value engine built on object storage in Go

reddit.com·15h·

Discuss: r/golang

The Five Types of Programmers (2010)

stevenbenner.com·5h·

Discuss: Hacker News

🛠️Developer Tools

Determining Energy Efficiency Sweet Spots in Production LLM Inference

arxiv.org·2d

⚡Model Efficiency

Mechanistic Interpretability: Peeking Inside an LLM

towardsdatascience.com·3d

⚡LLM Optimization

Loading more...