Model Training
training pipeline, epochs, loss function, optimizer
Scoured 264 posts in 7.1 ms
Discrete Diffusion Modelling by Estimating the Ratios of the Data Distribution
🤖 Machine Learning · Content type: News, Blog · leetarxiv.substack.com · 1 day ago · Substack, r/programming
Predictable Scaling Laws of Optimal Hyperparameters for LLM Continued Pre-training
🖥️ Systems ML · Content type: Academic · arxiv.org · 6 days ago
Introducing a new database category - the predictive database
🤖 Machine Learning · Content type: Blog · aito.ai · 1 day ago · Hacker News
Stop hand-tuning kernels: How Neuron Agentic Development accelerates AWS Trainium optimizations
⚙️ Systems Programming · Content type: Blog · aws.amazon.com · 13 hours ago
LLM are universal simulators
🖥️ Systems ML · invertedpassion.com · 2 days ago · Hacker News
New comment by Ishan1907 in "Ask HN: Who wants to be hired? (June 2026)"
🤖 Machine Learning · drive.google.com · 6 days ago · Hacker News
Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent
🤖 Machine Learning · Content type: News · spectrum.ieee.org · 18 hours ago · Hacker News
Vibe Diaries: Training Nanochat
🤖 Machine Learning · vibediary.dev · 2 days ago · Hacker News
Multilevel Stochastic Gradient Descent for Risk-Averse PDE-Constrained Optimization
🕸️ Neural Networks · Content type: Academic · arxiv.org · 2 days ago
Alleged Fable sabotage of an ML project
🤖 Machine Learning · xcancel.com · 2 hours ago · Hacker News
I Let an AI Agent Run 40 Experiments While I Slept
🧠 Deep Learning · Content type: Blog · oreilly.com · 6 days ago
Welcome to Machine Learning With Manya: The Ultimate Adventure Map!
🤖 Machine Learning · Content type: Blog · medium.com · 2 days ago
youyeetoo updates R1 SBC and lists K1 N100-based x86 computer
🛠️ ML Frameworks · linuxgizmos.com · 1 hour ago
Intro — Sehastrajit
🤖 Machine Learning · Content type: Blog · medium.com · 2 days ago
Adaptive Learning Rates with Surrogate Probability for Follow-the-Perturbed-Leader
🔗 Distributed Training · Content type: Academic · arxiv.org · 6 days ago
ml-from-scratch-book/code: Companion code for Machine Learning From Scratch — 10 core ML algorithms built from scratch with NumPy, compared with Scikit-learn and PyTorch.
🤖 Machine Learning · Content type: Code · github.com · 1 day ago · Hacker News
Apple WWDC On-Device AI Deep Dive - Google Docs
🤖 Machine Learning · gist.is · 7 hours ago · Hacker News
Human-Like Neural Nets by Catapulting
🧠 Deep Learning · gwern.net · 4 days ago · Hacker News
The 4-Stage AI Asset Lifecycle: How to Manage Your Models, Datasets, and Labels Without Losing Track
🔄 MLOps · sitepoint.com · 5 days ago
Hyperparameter Learning for Latent Factorization of Tensors for Representation Learning to Large-scale Dynamic Weighted Directed Network
🕸️ Neural Networks · Content type: Academic · arxiv.org · 1 day ago