Quantization, Attention Mechanisms, Batch Processing, KV Caching
Shrinking LLMs With Self-Compression
semiengineering.com·7h
Economics of Claude 3 Inference
lesswrong.com·20h
A Conversation with Val Bercovici about Disaggregated Prefill / Decode
fabricatedknowledge.com·18h
Using a Framework Desktop for local AI
frame.work·19h
AI cloud infrastructure gets faster and greener: NPU core improves inference performance by over 60%
techxplore.com·16h
Efficient MultiModal Data Pipeline
huggingface.co·14h
How I use LLMs to learn new subjects
seangoedecke.com·14h
Detailed Study of Performance Modeling For LLM Implementations At Scale (imec)
semiengineering.com·18h
Energy-Based Transformers are Scalable Learners and Thinkers
lesswrong.com·24m