Binary Neural Networks, Low-precision Training, Efficient Inference, Weight Compression

LB-MCTS: Synergizing Large Language Models and Bayesian Optimization for Efficient CASH
arxiv.org·1d
🧠LLM Inference
Everything Moe
ianbarber.blog·1d·
Discuss: Hacker News
🔤Tokenization
Gated DeltaNet: The “Surgical Eraser” Solving Linear Attention’s Memory Problem
pub.towardsai.net·1d
🧠LLM Inference
Less Is More -- Until It Breaks: Security Pitfalls of Vision Token Compression in Large Vision-Language Models
arxiv.org·1d
🔤Tokenization
A Visual Guide to Quantization
newsletter.maartengrootendorst.com·2d
🔬RaBitQ
Co-optimization Approaches For Reliable and Efficient AI Acceleration (Peking University et al.)
semiengineering.com·13h
Hardware Acceleration
Uncovering Unfaithful CoT in Deceptive Models
lesswrong.com·5h
🛡️AI Security
featurestorebook/mlfs-book: O'Reilly book - Building Machine Learning Systems with a feature store: batch, real-time, and LLMs
github.com·6h·
Discuss: Hacker News
🏗️LLM Infrastructure
Exploring Text Compression
denvaar.dev·1d
📝Text Compression
Memory layout matters: Reducing metric storage overhead by 4x in a Rust TSDB
baarse.substack.com·15h·
Discuss: r/rust
🏹Apache Arrow
ChatGPT’s Laws of Machine Learning
shruggingface.com·1d
🛡️AI Security
Model-agnostic linear-memory online learning in spiking neural networks
nature.com·2d
🔢BitNet Inference
Implementing Dino from Scratch
logits.bearblog.dev·1d·
Discuss: Hacker News
🎯Qdrant
Analog hardware may solve Internet of Things' speed bumps and bottlenecks
techxplore.com·10h
🖥️Hardware Architecture
Why AI Needs GPUs and TPUs: The Hardware Behind LLMs
blog.bytebytego.com·2d
Hardware Acceleration
FastMCP 3.0
producthunt.com·1d
📋MCP
The Convolutional Neural Network
cocakoala.substack.com·3d·
Discuss: Substack
📊Vector Databases
High-security-risk AI apps: Millions of data sets open on the net
europedigital.cloud·1d
🛡️AI Security
Why it’s critical to move beyond overly aggregated machine-learning metrics
news.mit.edu·1d
🏆LLM Benchmarking
Inside Mixedbread: How We Built Multimodal Late-Interaction at Billion Scale
mixedbread.com·2d·
Discuss: Hacker News
🎨Chroma
