Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
💫 slick production values
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
15585
posts in
17.5
ms
Fused
INT8
Weight-Only Quantization in
Pallas
🧮
Vector Databases
rishirajacharya.com
·
3d
·
…
You guys seen this? 1-bit model with an
MMLU-R
of 65.7, 8B
params
🚀
Performance
huggingface.co
·
1d
·
r/LocalLLaMA
·
…
DRAM
pricing is killing the *everything* market. We just had a vendor
uplift
our...
🤪
outlandish and economical
news.ycombinator.com
·
21h
·
Hacker News
·
…
“เอกนัฏ” จ่
อตร
ึงค่าไฟ 3.88 บาท หลั
งถวายส
ัตย์ฯ พร้อมรื้อโครงสร้างพลังงานทันที
⭐
★★★★★
thairath.co.th
·
15h
·
…
วิกฤตน้ำมั
นกระทบ
3 ค่
ายขนส
่ง KEX-Flash-J&T ประกาศขึ้นค่าส่ง 3 บาท/ชิ้น เริ่ม 1 เม.ย. นี้
📼
Cassette
thestandard.co
·
2d
·
…
Order of
Values
🤪
outlandish and economical
sebastian.graphics
·
5d
·
…
march 30
oem
⭐
★★★★★
combatdavey.net
·
2d
·
…
Pure C implementation of the
TurboQuant
paper (
ICLR
2026) for KV cache compression in LLM inference.
🦙
Ollama
github.com
·
1d
·
r/LocalLLaMA
·
…
Scaling a
Monolith
to 1M
LOC
: 113 Pragmatic Lessons from Tech Lead to CTO
🚀
Performance
semicolonandsons.com
·
6d
·
Lobsters
,
Hacker News
·
…
I spent 96 hours setting up dual
DGX
Sparks and a Mac Studio M3 Ultra for the same
397B
model. Neither won.
🚀
Performance
alooftwaffle.substack.com
·
5d
·
r/LocalLLaMA
·
…
Bigoish
: Test the
empirical
computational complexity of Rust algorithms
🧮
Vector Databases
docs.rs
·
6d
·
Lobsters
,
Hacker News
·
…
Inference
Engines
— A visual deep dive into the journey of a token down the transformer
layers
🧮
Vector Databases
femiadeniran.com
·
4d
·
r/LocalLLaMA
·
…
google/gemma-4-31B-it
🦙
Ollama
huggingface.co
·
3h
·
r/LocalLLaMA
·
…
2026 Week 13
📈
escalating concern
blog.lmorchard.com
·
6d
·
…
Under New Management
📈
escalating concern
adventuresinclaude.ai
·
4d
·
…
Some Local
Aspects
of AI
🦙
Ollama
blog.raymond.burkholder.net
·
4d
·
…
Block-Sparse Attention Kernel via
JAX/Pallas
🚀
Performance
rishirajacharya.com
·
3d
·
…
ggml
: allow prefetching tensor overrides by
am17an
· Pull Request #21067
🚀
Performance
github.com
·
5d
·
r/LocalLLaMA
·
…
My new favorite warp speed !
qwen3.5-35b-a3b-turbo-swe-v0.0.1
🕹️
PICO-8
huggingface.co
·
3d
·
r/LocalLLaMA
·
…
Llama.cpp with
Turboquant
, Heavy-Hitter Oracle (H2O), and
StreamingLLM
. Even more performance!
🦙
Ollama
github.com
·
5d
·
r/LocalLLaMA
·
…
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help