Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLM Training
🧠 LLM Training
Specific
LLM training, pretraining, RLHF, model training, arxiv ML
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
318
posts in
8.8
ms
Parameter-Efficient
Fine-Tuning
with Learnable Rank
⚡
LLM Inference
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Parameter-Efficient Fine-Tuning with Learnable Rank
LeLab Is
Hugging
Face
’s New Browser-Based GUI for the LeRobot Ecosystem
🖥️
Self-Hosting
Content type:
News
hackster.io
·
1d
1 day ago
Actions for LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem
New comment by bkjlblh in "Claude Fable 5"
⚡
LLM Inference
Content type:
Discussion
news.ycombinator.com
·
22h
22 hours ago
·
Hacker News
Actions for New comment by bkjlblh in "Claude Fable 5"
Timing Trick Cuts Energy Used in
LLM
Training
by Up to 14 Percent
⚙️
Systems Programming
Content type:
News
spectrum.ieee.org
·
5h
5 hours ago
Actions for Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent
Train
Models
Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
⚡
LLM Inference
Content type:
News
Content type:
Blog
developer.nvidia.com
·
1d
1 day ago
Actions for Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
Critical
Hugging
Face
Transformers
flaw ran attacker code on a routine model load
⚡
LLM Inference
siliconangle.com
·
6d
6 days ago
Actions for Critical Hugging Face Transformers flaw ran attacker code on a routine model load
If Claude Fable stops helping you, you’ll never know
⚡
LLM Inference
simonwillison.net
·
15h
15 hours ago
·
Hacker News
Actions for If Claude Fable stops helping you, you’ll never know
NeuroBait: I
fine-tuned
a
model
to spark dopamine for ADHD brain
⚡
LLM Inference
Content type:
Blog
huggingface.co
·
1d
1 day ago
Actions for NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain
Hugging
Face
Transformers
RCE flaw enables stealthy compromise via AI model configs
⚡
LLM Inference
csoonline.com
·
6d
6 days ago
Actions for Hugging Face Transformers RCE flaw enables stealthy compromise via AI model configs
Less-relevant results
DiffusionGemma: 4x Faster Text Generation
⚡
LLM Inference
Content type:
News
Content type:
Blog
blog.google
·
31m
31 minutes ago
·
Hacker News
Actions for DiffusionGemma: 4x Faster Text Generation
Hugging
Face
Transformers
flaw enables RCE via malicious model configs
⚡
LLM Inference
4sysops.com
·
3d
3 days ago
Actions for Hugging Face Transformers flaw enables RCE via malicious model configs
LLM
are universal simulators
⚡
LLM Inference
invertedpassion.com
·
1d
1 day ago
·
Hacker News
Actions for LLM are universal simulators
Domain-Specific Small Language
Models
(Manning)
⚡
LLM Inference
i-programmer.info
·
1h
1 hour ago
Actions for Domain-Specific Small Language Models (Manning)
Malicious
Hugging
Face
Models
Could Trigger Remote Code Execution
🕸️
axum
techrepublic.com
·
4d
4 days ago
Actions for Malicious Hugging Face Models Could Trigger Remote Code Execution
libertywing/FlashMemory-Deepseek-V4: FlashMemory DS-V4 Retriever: a lightweight retriever that sparsifies DeepSeek-V4 CSA KV-cache. Weights available on
Hugging
Face
.
⚡
LLM Inference
Content type:
Code
github.com
·
16h
16 hours ago
Actions for libertywing/FlashMemory-Deepseek-V4: FlashMemory DS-V4 Retriever: a lightweight retriever that sparsifies DeepSeek-V4 CSA KV-cache. Weights available on Hugging Face.
Google Colab CLI opens runtimes to Claude Code and Codex
🔄
Async Runtimes
helpnetsecurity.com
·
2d
2 days ago
·
r/ClaudeAI
Actions for Google Colab CLI opens runtimes to Claude Code and Codex
A generalist biomedical vision-language
model
via multi-CLIP knowledge distillation
⚡
LLM Inference
Content type:
Academic
nature.com
·
16h
16 hours ago
Actions for A generalist biomedical vision-language model via multi-CLIP knowledge distillation
Nvidia Nemotron 3 Ultra
⚡
LLM Inference
research.nvidia.com
·
6d
6 days ago
·
Hacker News
Actions for Nvidia Nemotron 3 Ultra
DiffusionGemma: The Developer Guide
⚡
LLM Inference
Content type:
Blog
developers.googleblog.com
·
16h
16 hours ago
Actions for DiffusionGemma: The Developer Guide
Finetuning
masking challenges narrow-task evaluation of cell foundation
models
⚡
LLM Inference
Content type:
Academic
biorxiv.org
·
3d
3 days ago
Actions for Finetuning masking challenges narrow-task evaluation of cell foundation models
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help