🎯 Fine-tuning - saeedesmaili · Scour

vLLM Transformers Backend: Bridging Hugging Face Compatibility and High-Performance Inference

🧠LLM Inference Blog

odsc.medium.com·

How to reduce capability degradation from off-model SFT

🎮Reinforcement Learning

lesswrong.com·

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

⚡CUDA Blog

blogs.nvidia.com·

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

🧠LLM Inference

deemwar-products.github.io··Hacker News

Anthropic releases Claude Fable 5 and Mythos 5 with major gains in coding and science

Posting for authoring

turingpost.com·

The week AI infrastructure crossed from a technology story to a financial one

🧠LLM Inference News

Architecture-Aware Reinforcement Learning Makes Sliding-Window Attention Competitive in Math Reasoning

🧠Transformers Academic

DiffusionGemma: 4x Faster Text Generation

🤖Data science News Blog

blog.google··Hacker News, r/LocalLLaMA, r/singularity

Latest technical articles & videos.

certdepot.net·

Domain-Specific Small Language Models (Manning)

i-programmer.info·

tantara/worldcup-sim: Explore and simulate the 2026 FIFA World Cup — typed tournament data, an LLM simulation kernel, and in-browser TTS commentary.

🤖Data science Code

github.com··Hacker News

Show HN: Bosun – a small model that keeps an agent's memory graph clean

🔤Tokenization

huggingface.co··Hacker News

fc2

I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.

saintlex.sbs··DEV

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

🤖Data science

smolhub.com··r/LocalLLaMA

Substrate Asymmetry in User-Side Memory: A Diagnostic Framework

🧠LLMs Academic

iOS 27 Security: What WWDC 2026’s AI Features Mean for Mobile App Risk

🤖Data science Blog

nowsecure.com·

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

🧠LLM Inference

vettedconsumer.com··Hacker News

Hacker News Cohort Collectively Dismisses Anthropic and Champions Chinese Models over Fable's Fumble

🤖Automation Discussion

news.ycombinator.com··r/LocalLLaMA

Sign up or log in to see more results

Log in to enable infinite scrolling