🗺️ Region Inference - abnv · Scour

LLM-CoOpt: A Co-Design and Optimization Framework for Efficient LLM Inference on Heterogeneous Platforms

arxiv.org·13h

datavorous/spheni: An in-memory vector search library in C++ with Python bindings

github.com·4h·

Discuss: Hacker News

🔀SIMD Programming

Free(): Learning to Forget in Malloc-Only Reasoning Models

arxiv.org·1d

🔗Linear Lisp

Deterministic Inference with EigenAI

deterministicinference.com·4m

✨Effect Inference

AI Code’s Logic Can Now Be Checked From Within, Bypassing External Tests

quantumzeitgeist.com·4h

🎭Program Synthesis

The Machine Learning Practitioner’s Guide to Speculative Decoding

machinelearningmastery.com·7h

🚀Tokenizer Performance

Parallel Track Transformers: Enabling Fast GPU Inference with Reduced Synchronization

machinelearning.apple.com·1d

⚡Parallel Parsing

Avoiding UB but "safe" data race in a lock-free slab allocator - help - The Rust Programming Language Forum

users.rust-lang.org·19m

🔒Rust Borrowing

Cache-aware disaggregated inference for up to 40% faster long-context LLM serving

together.ai·18h

⏲️Embedded GC

Show HN: Latent-k – Persistent dependency map to reduce AI coding token usage

latentk.org·5h·

Discuss: Hacker News

🦀MIR Optimization

Wavelet Meets Adam: Compressing Gradients for Memory-Efficient Training

chipublib.idm.oclc.org·2h

📐Succinct Data Structures

Your ML Model Is Training on the Future

dev.to·1h·

Discuss: DEV

📈Query Optimization

Architectural and Mathematical Foundations of Machine Learning: A Rigorous Synthesis of Theory, Geometry, and Implementation

chizkidd.github.io·5h·

Discuss: Hacker News

🔍ML Language

A Note on Flat Abstract Syntax Trees

gist.github.com·2d·

Discuss: Hacker News

🌳Tree Walking

Writing a ONNX Neural Network Inference Engine from Scratch in C to run image classification with MobileNetV2

flexw.github.io·2d·

Discuss: r/C_Programming

🔍ML Language

Large Language Models for Mortals book

andrewpwheeler.com·8h

Computer vision segmentation model—deep learning for categorizing microplastic debris

frontiersin.org·1d

✨Effect Inference

Compiler-Driven Static Analysis Locking Context Checking Merged For Linux 7.0

phoronix.com·7h

🏃Escape Analysis

Deep C Dives: Undefined Behavior

i-programmer.info·5h

📚Stack Allocation

Fast Museum Searches: Go Concurrency and Caching

pkg.go.dev·2h·

Discuss: DEV

🗑️Stack Scanning GC

Loading more...