💻 Local LLMs - nmarshall · Scour

SDFP: Speculative Decoding with FIT-Pruned Models for Training-Free and Plug-and-Play LLM Acceleration

arxiv.org·4d

Understanding LLM Inference Engines: Inside Nano-vLLM (Part 2)

neutree.ai·4d·

Discuss: Hacker News

Drifting models

breno.bearblog.dev·1d

🎲Procedural Generation

How Yelp Built “Yelp Assistant”

blog.bytebytego.com·23h

🌊Event Streaming

Convolutional Neural Networks using Logarithmic Data Representation

dev.to·2d·

Discuss: DEV

The State of Agentic Graph RAG

localoptimumai.substack.com·4h·

Discuss: Substack

ML-LIB: Machine Learning Library Proposed For The Linux Kernel

phoronix.com·3d·

Discuss: Hacker News

Testing 80 LLMs on spatial reasoning on grids

mihai.page·1d·

Discuss: Hacker News

🏗️AI Infrastructure

Automating Inference Optimizations with NVIDIA TensorRT LLM AutoDeploy

developer.nvidia.com·21h

Optimized LLM Inference Engines

rishirajacharya.com·6d

🏗️AI Infrastructure

A one-prompt attack that breaks LLM safety alignment

microsoft.com·23h·

Discuss: Hacker News

🏠Self-hosted AI

The Rise of Local Speech Recognition

oatmealapp.com·1d·

Discuss: Hacker News

🗣️Speech Synthesis

PriMod4AI: Lifecycle-Aware Privacy Threat Modeling for AI Systems using LLM

arxiv.org·4d

🏠Self-hosted AI

NotebookLM: The AI that only learns from you

byandrev.dev·2d·

Discuss: Hacker News

🏗️AI Infrastructure

Show HN: Kore – Stack based language where compiler is the reward function

github.com·12h·

Discuss: Hacker News

SAE Feature Matchmaking (Layer-to-Layer) by Mitali M

greaterwrong.com·11h

Allium is an LLM-native language for sharpening intent alongside implementation

juxt.github.io·1d·

Discuss: Hacker News

Mechanistic Interpretability: Peeking Inside an LLM

towardsdatascience.com·5d

🤖AI Inference

Large Language Models (LLMs): Navigating the Future

dev.to·5d·

Discuss: DEV

🏗️AI Infrastructure

LLMs Are Prediction Machines

kaelandt.github.io·1d·

Discuss: Hacker News

Loading more...