🎯 Fine-tuning
Specific
LoRA, PEFT, instruction tuning, model fine-tuning, QLoRA
Scoured 385 posts in 5.0 ms
vLLM Transformers Backend: Bridging Hugging Face Compatibility and High-Performance Inference
🧠 LLM Inference · Content type: Blog · odsc.medium.com · 2 hours ago
How to reduce capability degradation from off-model SFT
🎮 Reinforcement Learning · lesswrong.com · 3 days ago
NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
⚡ CUDA · Content type: Blog · blogs.nvidia.com · 1 day ago
Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM
🧠 LLM Inference · deemwar-products.github.io · 6 days ago · Hacker News
Anthropic releases Claude Fable 5 and Mythos 5 with major gains in coding and science
🪨 Obsidian · oodaloop.com · 1 day ago
Posting for authoring
📔 Journaling · turingpost.com · 4 days ago
The week AI infrastructure crossed from a technology story to a financial one
🧠 LLM Inference · Content type: News · mlwhiz.com · 1 day ago
Architecture-Aware Reinforcement Learning Makes Sliding-Window Attention Competitive in Math Reasoning
🧠 Transformers · Content type: Academic · arxiv.org · 20 hours ago
DiffusionGemma: 4x Faster Text Generation
🤖 Data science · Content type: News, Blog · blog.google · 1 day ago · Hacker News, r/LocalLLaMA, r/singularity
Latest technical articles & videos.
🧠 LLMs · certdepot.net · 5 days ago
Domain-Specific Small Language Models (Manning)
📡 RSS · i-programmer.info · 1 day ago
tantara/worldcup-sim: Explore and simulate the 2026 FIFA World Cup — typed tournament data, an LLM simulation kernel, and in-browser TTS commentary.
🤖 Data science · Content type: Code · github.com · 8 hours ago · Hacker News
Show HN: Bosun – a small model that keeps an agent's memory graph clean
🔤 Tokenization · huggingface.co · 6 hours ago · Hacker News
fc2
🧠 LLMs · yog.ink · 4 days ago
I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.
🤖 AI Agents · saintlex.sbs · 21 hours ago · DEV
1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
🤖 Data science · smolhub.com · 3 days ago · r/LocalLLaMA
Substrate Asymmetry in User-Side Memory: A Diagnostic Framework
🧠 LLMs · Content type: Academic · arxiv.org · 20 hours ago
iOS 27 Security: What WWDC 2026’s AI Features Mean for Mobile App Risk
🤖 Data science · Content type: Blog · nowsecure.com · 4 hours ago
GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)
🧠 LLM Inference · vettedconsumer.com · 5 days ago · Hacker News
Hacker News Cohort Collectively Dismisses Anthropic and Champions Chinese Models over Fable's Fumble
🤖 Automation · Content type: Discussion · news.ycombinator.com · 2 days ago · r/LocalLLaMA