Model Quantization, ONNX Runtime, Embedded Inference, TinyML

Scoured 3978 posts in 34.3 ms
The AI Engineering Stack
bowtiedraptor.substack.com·22h·
Discuss: Substack
AI-Driven DevOps
Disaggregated machine learning via in-physics computing at radio frequency
science.org·2d·
Discuss: Hacker News
📉Model Quantization
The three AI bets
alearningaday.blog·1d·
Discuss: Hacker News
AI Ethics & Alignment
Week 5 in Data Science: Image recognition neural network with 90% accuracy
igorstechnoclub.com·1d·
Discuss: Hacker News
🗂️Vector Databases
I bought a €9k GH200 “desktop” to save $1.27 on Claude Code (vLLM tuning notes)
dnhkng.github.io·7h·
Discuss: r/LocalLLaMA
🚀Performance
Out-of-Context: Constrained Tool Based Exploration of Context
gojiberries.io·17h·
Discuss: Hacker News
🧩LLM Integration
Taming P99s in OpenFGA: How We Built a Self-Tuning Strategy Planner
auth0.com·2d·
Discuss: Lobsters
🚀Performance
AI as the Engine of Application State
jonwoodlief.com·1d·
Discuss: Hacker News
💬AI Code Assistants
AI features for no one
pcloadletter.dev·14h·
🛡️AI Security
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting
arxiv.org·1d·
Discuss: r/LocalLLaMA
🛡Resilience Engineering
DeepSeek To Release Next Flagship AI Model With Strong Coding Ability
theinformation.com·2d·
📉Model Quantization
Creating a perceptron for logical operations
knowledge.dev·18h·
Discuss: Hacker News
📉Model Quantization
The "Good Will Hunting" Problem in Generative AI
medium.com·17h·
Discuss: Hacker News
🛡️AI Security
Ask HN: Idea) Autoregressive joint embedding predictor model
news.ycombinator.com·1d·
Discuss: Hacker News
🔢Embeddings
The Dirty Secret of Million-Token Context Windows
deadneurons.substack.com·1d·
Discuss: Substack
🧱Chunking
Don't fall into the anti-AI hype
antirez.com·6h·
AI-Driven DevOps
AI’s Memorization Crisis
theatlantic.com·1d·
🔍RAG
The Coming AI Compute Crunch
martinalderson.com·1d·
💸Affordable LLMs
Sparse attention 3 – inefficiency of extracting similar content
kindxiaoming.github.io·4h
🧱Chunking
HW-Accelerated Physical AI Framework For Resource-Constrained Edge Devices (ASU)
semiengineering.com·3d
📉Model Quantization
