Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
⚡ Quantization
Model Compression, INT8, Weight Quantization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
81
posts in
49.9
ms
Intentional
Model
Selection — How to Actually Choose the Right Gemma 4 Variant for Your Workload
🤖
LLM Inference
dev.to
·
5d
·
DEV
AlexRosito67/xyron-mnist-esp32: Implementation of Xyron (Neural network CLI tool in C++ with configurable layers and activation functions)
📶
ESP32
github.com
·
20h
·
DEV
Cohere cracks lossless
quantization
and native citations with first full Apache 2.0 licensed open
model
Command A+
🔄
AI Workflows
venturebeat.com
·
5h
Stop Fixing Your Prompts — Fix Your Thinking Style Instead (A Claude Code Experiment)
📋
AGENTS.md
bestaiweb.ai
·
1d
·
DEV
Benchmark and optimize LLMs on-device with AI Edge Portal
📊
LLM Evaluation
cloud.google.com
·
10h
How I Shipped an Autonomous Agentic System on a 2026 Serverless-GPU Stack
📊
Compute Markets
medium.com
·
2d
How Auto Transport Companies Are Leveraging AI for Precision Logistics
⚙️
AI Automation
haulin.ai
·
22h
·
DEV
To tame enterprise AI chaos, open source rallies around a standard execution layer
🏛
Sovereign AI Infrastructure
siliconangle.com
·
6d
Ollama Cheat Sheet: Local LLMs,
Models
, API &
Integration
(2026)
🦙
Ollama
meshworld.in
·
2d
·
DEV
When the Sensitivity Metric Lies: A Drift-Inversion in Mixed-Precision LLM
Quantization
⚡
Inference
dev.to
·
15h
·
DEV
Convergent Abstraction Hypothesis
⚡
Inference
lesswrong.com
·
6d
Untangling 40-Year-Old COBOL Monoliths with Gemma 4 (Yes, Completely Offline)
🦙
Ollama
github.com
·
2d
·
DEV
Issue 651
📊
Data Science
datascienceweekly.substack.com
·
6d
·
Substack
Lighthouse Attention: The Training-Time Hierarchy That Makes Quadratic Attention Practical Again
🤖
LLM Inference
dev.to
·
1d
·
DEV
I Thought Fine-Tuning LLMs Needed Expensive GPUs. I Was Wrong.
🤖
LLM Inference
dev.to
·
19h
·
DEV
On-Device AI for Construction Safety: Why I'm Skipping the Cloud Entirely
🏛
Sovereign AI Infrastructure
dev.to
·
2d
·
DEV
KTransformers' 5 Hidden Uses That Make 671B
Models
Run on Your Laptop 🔥
⚡
Hardware Acceleration
dev.to
·
23h
·
DEV
The Central Bank of
Intelligence
: Navigating the Token Economy
💰
Tokenomics
dev.to
·
6d
·
DEV
GPU Bottleneck Analyzer, NVIDIA Rubin VRAM Demands, and Qwen VRAM Optimization
🟩
Nvidia
dev.to
·
2d
·
DEV
When
Models
Eat the World: Supply Chain Quality for AI-Dependent Systems
🎯
AI Reliability
dev.to
·
21h
·
DEV
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help