Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Model Training
⚙️ Model Training
pretraining, fine-tuning, training run, compute, loss curve
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
837
posts in
7.7
ms
If Claude Fable stops helping you, you’ll never know
💬
LLMs
simonwillison.net
·
1d
1 day ago
·
Hacker News
Actions for If Claude Fable stops helping you, you’ll never know
The Enormous Potential For Microsoft Frontier
Fine
Tuning
🧠
AI Research
joshbersin.com
·
6d
6 days ago
Actions for The Enormous Potential For Microsoft Frontier Fine Tuning
Researchers
trained
an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information
🎮
Reinforcement Learning
venturebeat.com
·
2d
2 days ago
·
Hacker News
Actions for Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information
Fine
tuning
classification in Elixir
📉
Deep Learning
elixirstatus.com
·
3d
3 days ago
Actions for Fine tuning classification in Elixir
Predictive Processing: Conscious when
Training
💬
LLMs
lesswrong.com
·
14h
14 hours ago
Actions for Predictive Processing: Conscious when Training
Introducing North Mini Code: Cohere’s First
Model
For Developers
🔄
Transformers
Content type:
Blog
huggingface.co
·
1d
1 day ago
·
Hacker News
Actions for Introducing North Mini Code: Cohere’s First Model For Developers
Evolution of crystal field and intra-ionic interactions in ilmenite $A{\mathrm{IrO}}_{3}$ ($A=\mathrm{Mg}$, Zn, Cd) and hyperhoneycomb $β\text{−}{\mathrm{ZnIrO}...
🔍
Interpretability
link.aps.org
·
2d
2 days ago
Actions for Evolution of crystal field and intra-ionic interactions in ilmenite $A{\mathrm{IrO}}_{3}$ ($A=\mathrm{Mg}$, Zn, Cd) and hyperhoneycomb $β\text{−}{\mathrm{ZnIrO}...
A new chapter of efficient foundation
models
for medical imaging
🖥️
ML Systems
techcommunity.microsoft.com
·
1d
1 day ago
Actions for A new chapter of efficient foundation models for medical imaging
Ideogram 4.0 launches with 2K resolution and top open-weight ranking
📐
Scaling Laws
alternativeto.net
·
6d
6 days ago
Actions for Ideogram 4.0 launches with 2K resolution and top open-weight ranking
Probabilistic Contrastive
Pretraining
for Multi-task ADME Property Prediction
💬
LLMs
Content type:
Academic
arxiv.org
·
10h
10 hours ago
Actions for Probabilistic Contrastive Pretraining for Multi-task ADME Property Prediction
Nvidia DGX Spark GB10 – AI
Models
and Guide with
vLLM
and Autonomous Script
💬
LLMs
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
Actions for Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script
A generalist biomedical vision-language
model
via multi-CLIP knowledge distillation
💬
LLMs
Content type:
Academic
nature.com
·
1d
1 day ago
Actions for A generalist biomedical vision-language model via multi-CLIP knowledge distillation
Why Shrinking an AI
Model
Often Makes It More Useful
💬
LLMs
siliconopera.com
·
4d
4 days ago
Actions for Why Shrinking an AI Model Often Makes It More Useful
pLM-Guided Inverse Folding for Antibody Sequence Design
🔍
Interpretability
Content type:
Academic
biorxiv.org
·
4d
4 days ago
Actions for pLM-Guided Inverse Folding for Antibody Sequence Design
Tracing Eval-Awareness Emergence Through
Training
of OLMo 3
🎮
Reinforcement Learning
lesswrong.com
·
1d
1 day ago
Actions for Tracing Eval-Awareness Emergence Through Training of OLMo 3
Latest technical articles & videos.
📄
arXiv
certdepot.net
·
5d
5 days ago
Actions for Latest technical articles & videos.
Hubs or Fringes:
Pretraining
Data Selection via Web Graph Centrality
💬
LLMs
Content type:
Academic
arxiv.org
·
10h
10 hours ago
Actions for Hubs or Fringes: Pretraining Data Selection via Web Graph Centrality
Small Experiments, Cheaper Decisions: A Case Study in Staged Promotion for
Micro-Pretraining
💬
LLMs
Content type:
Academic
arxiv.org
·
10h
10 hours ago
Actions for Small Experiments, Cheaper Decisions: A Case Study in Staged Promotion for Micro-Pretraining
google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation
🔥
PyTorch
huggingface.co
·
3d
3 days ago
·
r/LocalLLaMA
Actions for google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation
fix(gateway): fail closed for unknown
model
auth · openclaw/openclaw@85343ea
💬
LLMs
Content type:
Code
github.com
·
6d
6 days ago
Actions for fix(gateway): fail closed for unknown model auth · openclaw/openclaw@85343ea
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help