Model Training

Feeds to Scour
SubscribedAll
Scoured 264 posts in 7.1 ms

Discrete Diffusion Modelling by Estimating the Ratios of the Data Distribution

 🤖Machine Learning  Content type: News  Content type: Blog

Predictable Scaling Laws of Optimal Hyperparameters for LLM Continued Pre-training

 🖥️Systems ML  Content type: Academic
arxiv.org·

Introducing a new database category - the predictive database

 🤖Machine Learning  Content type: Blog
aito.ai··Hacker News

Stop hand-tuning kernels: How Neuron Agentic Development accelerates AWS Trainium optimizations

 ⚙️Systems Programming  Content type: Blog
aws.amazon.com·

LLM are universal simulators

 🖥️Systems ML

New comment by Ishan1907 in "Ask HN: Who wants to be hired? (June 2026)"

 🤖Machine Learning

Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent

 🤖Machine Learning  Content type: News
spectrum.ieee.org
··Hacker News

Vibe Diaries: Training Nanochat

 🤖Machine Learning
vibediary.dev··Hacker News

Multilevel Stochastic Gradient Descent for Risk-Averse PDE-Constrained Optimization

 🕸️Neural Networks  Content type: Academic
arxiv.org·

Alleged Fable sabotage of an ML project

 🤖Machine Learning
xcancel.com··Hacker News

I Let an AI Agent Run 40 Experiments While I Slept

 🧠Deep Learning  Content type: Blog
oreilly.com·

Welcome to Machine Learning With Manya: The Ultimate Adventure Map!

 🤖Machine Learning  Content type: Blog
medium.com·

youyeetoo updates R1 SBC and lists K1 N100-based x86 computer

 🛠️ML Frameworks
linuxgizmos.com·

Intro — Sehastrajit

 🤖Machine Learning  Content type: Blog
medium.com·

Adaptive Learning Rates with Surrogate Probability for Follow-the-Perturbed-Leader

 🔗Distributed Training  Content type: Academic
arxiv.org·

ml-from-scratch-book/code: Companion code for Machine Learning From Scratch — 10 core ML algorithms built from scratch with NumPy, compared with Scikit-learn and PyTorch.

 🤖Machine Learning  Content type: Code
github.com··Hacker News

Apple WWDC On-Device AI Deep Dive - Google Docs

 🤖Machine Learning
gist.is··Hacker News

Human-Like Neural Nets by Catapulting

 🧠Deep Learning
gwern.net··Hacker News

The 4-Stage AI Asset Lifecycle: How to Manage Your Models, Datasets, and Labels Without Losing Track

 🔄MLOps
sitepoint.com·

Hyperparameter Learning for Latent Factorization of Tensors for Representation Learning to Large-scale Dynamic Weighted Directed Network

 🕸️Neural Networks  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help