Scour
🤖 AI (Broad): Claude, Llama, Local LLMs
RedToasty/llama.cpp_qts: Fixing --split-mode tensor, with different KV cache quantization types.
🚀 Model Serving · github.com · 3d · r/LocalLLaMA
Local LLMs are ready for real work
🚀 Model Serving · thelurkreport.beehiiv.com · 2d · r/LocalLLaMA
GPU Memory Math for LLMs: Formula That Tells You What Fits on Your GPU
🧠 Deep Learning · theahmadosman.substack.com · 7h · Substack, r/LocalLLaMA
Document-tuning instills durable animal compassion in LLMs (and generalizes to humans)
💬 Natural Language Processing · lesswrong.com · 1h
bytedance released an open source model that attempts to do just about anything with only 3b parameters
🧮 Vector Databases · huggingface.co · 1d · r/LocalLLaMA, r/singularity
An AI Ingredients List Assumes You Know What the Ingredients Are
🛠️ Feature Engineering · pathandpayload.com · 3d
Tried every Hermes Agent alternative so you don't have to (2026 roundup)
🦆 DuckDB · composio.dev · 2d · r/LocalLLaMA
Grok Build Technical Decoder: Complete Wiki
🔨 LLVM · sharathdevulapalli.com · 14h
Testing MiniMax M2.7 via API on three real ML and coding workflows
🚀 Model Serving · andlukyane.com · 3d · Hacker News
Why does off-model SFT degrade capabilities?
🚀 Model Serving · lesswrong.com · 4h
Why Ruby Still Feels Like Home After All These Years
📦 Parquet · caio.ca · 1d · Lobsters
If I Were Emperor of New AI Safety Researcher Training...
🛠️ Feature Engineering · lesswrong.com · 5h
BrunoArsioli/llama-optimus: Lightweight Python tool using Optuna for tuning llama.cpp flags: towards optimal tok/s for your machine
🔨 LLVM · github.com · 11h · r/LocalLLaMA
Synthetic Persona Pretraining: Alignment from Token Zero
🛠️ Feature Engineering · lesswrong.com · 14h
Qwen3.6-27B-UD-Q4_K_XL.gguf · unsloth/Qwen3.6-27B-MTP-GGUF at main
📦 uv · huggingface.co · 3d · r/LocalLLaMA
slokam-ai/localgcp: LocalStack for GCP. One Go binary emulating 14 Google Cloud services locally: Vertex AI, BigQuery, Spanner, Firestore, Pub/Sub, Cloud Storage, Bigtable, Cloud SQL, Memorystore, Cloud Tasks, KMS, Secret Manager, Cloud Run, Cloud Logging. Zero cloud bills.
📦 Parquet · github.com · 13h · Lobsters, Hacker News
Classifier Context Rot: Monitor Performance Degrades with Context Length
📓 Note-Taking · lesswrong.com · 2d
NLA Verbalizations on AuditBench: Llama 70B
💬 Natural Language Processing · lesswrong.com · 4d
anishathalye/ai-agent-security-lecture: Guest lecture in MIT 6.566 on AI Agent Security
🤖 Transformers · github.com · 2d · Lobsters, Hacker News
Jackrong/Qwopus3.5-9B-Coder-GGUF
🚀 Model Serving · huggingface.co · 3d · r/LocalLLaMA