🧠 LLM Inference - akapaka · Scour

EyesOff: Why Some Models Quantize Better Than Others

ym2132.github.io·10h·

Discuss: Hacker News

🤖Machine Learning

Guney-olu/nanoslg: A from-scratch implementation of distributed LLM inference in simple readable Python

github.com·2d·

Discuss: Hacker News, r/LLM

Beyond Kuramoto Models: Associative Memory and Plastic Synapses in ML Ensembles

hackernoon.com·17h

🤖Machine Learning

Architectural and Mathematical Foundations of Machine Learning: A Rigorous Synthesis of Theory, Geometry, and Implementation

chizkidd.github.io·19h·

Discuss: Hacker News

🤖Machine Learning

First look: Run LLMs locally with LM Studio

infoworld.com·23h

GLM 5 is already on huggingface!

huggingface.co·15h·

Discuss: r/LocalLLaMA

Biases in the Blind Spot: Detecting What LLMs Fail to Mention

arxiv.org·1d·

Discuss: Hacker News

Statistical Models for the Latent Space: From Gaussian VAE to Kuramoto-Enhanced S-VAE

hackernoon.com·2d

🤖Machine Learning

Overview of end-to-end encrypted AI inference for Confer

news.ycombinator.com·14h·

Discuss: Hacker News

🤖Machine Learning

AI-augmented data quality engineering

infoworld.com·2d

🤖Machine Learning

Digitizing the "Shokunin": How we encoded a Master's hammer strike into AI

yusukekaizen.substack.com·2h·

Discuss: Substack

🤖Machine Learning

Transformer-Based Memory Forecasting: Leveraging Anonymized Aggregates for Personal Insights

novice.media·11h·

Discuss: Hacker News

Show HN: Fighting the War Against Expensive Reinforcement Learning

cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app·1h·

Discuss: Hacker News

🤖Machine Learning

Show HN: Latent-k – Persistent dependency map to reduce AI coding token usage

latentk.org·19h·

Discuss: Hacker News

Memsearch,an agent memory with md as source of truth(inspired by OpenClaw)

zilliztech.github.io·6h·

Discuss: Hacker News

How We Built the Fastest Kimi K2.5 on Artificial Analysis

baseten.co·17h·

Discuss: Hacker News

🤖Machine Learning

Automating Inference Optimizations with NVIDIA TensorRT LLM AutoDeploy

developer.nvidia.com·2d·

Discuss: Hacker News

🤖Machine Learning

Dear Agent: Prove it.

rijnard.com·4h·

Discuss: Hacker News

Ask HN: Where does this adversarial prize mechanism break?

news.ycombinator.com·7h·

Discuss: Hacker News

A Note on Flat Abstract Syntax Trees

gist.github.com·2d·

Discuss: Hacker News

🕸️WebAssembly

Loading more...