Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
📱 Edge AI
Model Quantization, ONNX Runtime, Embedded Inference, TinyML
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
4939
posts in
70.4
ms
From
Monolith
to Micro-Brain:
Architecting
Scalable AI Inference in .NET
dev.to
·
8h
·
Discuss:
DEV
💬
Prompt Engineering
Quantization-Aware
Distillation
ternarysearch.blogspot.com
·
1h
·
Discuss:
Hacker News
📉
Model Quantization
How I
squeezed
a
BERT
sentiment analyzer into 1GB RAM on a $5 VPS
mohammedeabdelaziz.github.io
·
12h
·
Discuss:
Hacker News
📉
Model Quantization
Hello Edge: Keyword
Spotting
on
Microcontrollers
paperium.net
·
1d
·
Discuss:
DEV
📉
Model Quantization
LAI
#113: The Engineering Work That
Decides
Whether AI Holds Up
pub.towardsai.net
·
2d
💬
AI Code Assistants
Continual
learning and the post
monolith
AI era
baseten.co
·
1d
·
Discuss:
Hacker News
🧱
Chunking
Life at the Edge
asadk.com
·
16h
·
Discuss:
Hacker News
🌍
Edge Computing
Building the Future with AI That
Acts
devxt.com
·
4h
·
Discuss:
Hacker News
🤖
spec-driven ai-assisted development
Understanding LLM Inference
Engines
: Inside
Nano-vLLM
(Part 2)
neutree.ai
·
1d
·
Discuss:
Hacker News
🧩
LLM Integration
Hypernetworks
: Neural Networks for
Hierarchical
Data
blog.sturdystatistics.com
·
2d
·
Discuss:
Hacker News
🔢
Embeddings
Human-like Search for Modern
Applications
anvitra.ai
·
1h
·
Discuss:
Hacker News
🗂️
Vector Databases
gharasathi
(
घर
ासाठी) — A Privacy-First Household AI Running on a $200 Mini PC
amazon.com.au
·
14m
·
Discuss:
DEV
💸
Affordable LLMs
ML-LIB
: Machine Learning Library Proposed For The Linux Kernel
phoronix.com
·
1d
·
Discuss:
Hacker News
🧩
LLM Integration
Released:
DeepBrainz-R1
— reasoning-first small models for agentic workflows (
4B
/ 2B
huggingface.co
·
2d
·
Discuss:
Hacker News
,
r/LocalLLaMA
🧩
LLM Integration
Agentic
Coding and the Problem of
Oracles
epkconsulting.substack.com
·
9h
·
Discuss:
Substack
,
r/programming
🔄
Autonomous Agents
a
proposal
for AI that's on your side
r.github.io
·
1d
·
Discuss:
Hacker News
💬
AI Code Assistants
Fast
Autoscheduling
for Sparse ML
Frameworks
ajroot.pl
·
3d
·
Discuss:
Hacker News
,
r/Compilers
🗂️
Vector Databases
Running Local LLMs as Your AI Coding
Assistant
on Apple
Silicon
dev.to
·
23h
·
Discuss:
DEV
💬
Prompt Engineering
Building Highly Efficient Inference System for
Recommenders
Using
PyTorch
pytorch.org
·
1d
·
Discuss:
Hacker News
📉
Model Quantization
ggml
: backend-agnostic tensor parallelism by
JohannesGaessler
· Pull Request #19378
github.com
·
2d
·
Discuss:
r/LocalLLaMA
🚀
Performance
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help