🔢 BitNet Inference - emschwartz · Scour

PackInfer: Compute- and I/O-Efficient Attention for Batched LLM Inference

arxiv.org·1d

🧠LLM Inference

the mathematics of compression in database systems

bitsxpages.com·11h

💾Binary Formats

Quantization-Aware Distillation

ternarysearch.blogspot.com·2d·Discuss: Hacker News

🧠LLM Inference

Large Language Models Live in Time

lesswrong.com·16h

🧠LLM Inference

Import AI 444: LLM societies; Huawei makes kernels with AI; ChipBench

importai.substack.com·17h·Discuss: Substack

🏆LLM Benchmarking

Math ∩ Programming

jeremykun.com·1d

🌳Data Structures

Automating Inference Optimizations with NVIDIA TensorRT LLM AutoDeploy

developer.nvidia.com·12h

🏗️LLM Infrastructure

A Note on Flat Abstract Syntax Trees

gist.github.com·12h·Discuss: Hacker News

📏Linear Types

DeepChopper model improves RNA sequencing research by mitigating chimera artifacts

phys.org·10h

🧠LLM Inference

vermaden.wordpress.com·19h

🔍Binary Analysis

Deep networks learn to parse uniform-depth context-free languages from local statistics

arxiv.org·1d

How can computing for AI and other demands be more energy efficient?

techxplore.com·2d

A practical systems engineering guide: Architecting AI-ready infrastructure for the agentic era

thenewstack.io·8h

🏗️LLM Infrastructure

AI-augmented data quality engineering

infoworld.com·21h

🔍AI Interpretability

LocalGPT: A local AI assistant with persistent memory in a single binary

localgpt.app·11h·Discuss: Hacker News

🏗️LLM Infrastructure

Expectation and Copysets

buttondown.com·12h·Discuss: Hacker News

💾Binary Formats

XL-MSDigger: a deep learning-based, versatile solution for cross-linking mass spectrometry

nature.com·3h

📇Vector Indexing

The Shape of Code » Dennard scaling a necessary condition for Moore’s law

shape-of-code.com·1d

🔬Chip Fabrication

25W06. Learning a language with the machine

z1nz0l1n.com·1d

🔤Tokenization

Bulk RRAM Could Be AI’s Memory Wall Solution

spectrum.ieee.org·18h·Discuss: r/hardware

🧠Memory Hierarchy Design
