Scour
LLM Training
🧠 LLM Training
Specific
LLM training, pretraining, RLHF, model training, arxiv ML
Scoured 318 posts in 5.1 ms
Data-Constrained Language Model Pretraining: Improved Regularization and Scaling Laws
⚡ LLM Inference
Content type: Academic
arxiv.org · 2 days ago
Less-relevant results
ju4nv1e1r4/nlp_engine_inference: An inference engine for NLP models.
⚡ LLM Inference
Content type: Code
github.com · 6 days ago · r/rust
Reinventing Entropy
⚙️ Systems Programming
Content type: News, Blog
3blue1brown.substack.com · 3 days ago · Substack
Mythograph Atelier #1 - Abstract Art That Means Something to You
🕸️ axum
Content type: Blog
huggingface.co · 2 days ago
AI Chart Understanding Breakthrough: MIT-IBM Dataset Lets Small Models Beat GPT-4o
⚡ LLM Inference
techtimes.com · 6 days ago
Fisher-Guided Progressive Parameter Selection for Adaptive Fine-Tuning
⚡ LLM Inference
Content type: Academic
arxiv.org · 12 hours ago
The crash that vanished: control and emergence in a five-model economy
🦀 Rust
Content type: Blog
huggingface.co · 2 days ago
Multi-Hop Knowledge Composition is Bound by Pretraining Exposure
⚡ LLM Inference
Content type: Academic
arxiv.org · 1 day ago
If Claude Fable stops helping you, you'll never know
⚡ LLM Inference
Content type: Blog
jonready.com · 19 hours ago · Lobsters, Hacker News
Amazing Digital Dentures (a failed project)
📝 Long-form Tech Essays
Content type: Blog
huggingface.co · 2 days ago
Mix, Don't Pick: Why Synthetic Corpus Composition Matters for Time Series Foundation Model Pretraining
⚡ LLM Inference
Content type: Academic
arxiv.org · 12 hours ago
Room360: Video-to-3D Spatial Reconstruction Platform
🖥️ Self-Hosting
Content type: Blog
huggingface.co · 2 days ago
Benchmarking Empirical Privacy Protection for Adaptations of Large Language Models
⚡ LLM Inference
Content type: Academic
arxiv.org · 1 day ago
Advancing the State-of-the-Art in Empirical Privacy Auditing
λ Type Theory
Content type: Academic
arxiv.org · 12 hours ago
Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining
⚡ LLM Inference
Content type: Blog
huggingface.co · 6 days ago
In-Context Learning for Latent Space Bayesian Optimization
⚡ LLM Inference
Content type: Academic
arxiv.org · 1 day ago
Unifying Local Communications and Local Updates for LLM Pretraining
⚡ LLM Inference
Content type: Academic
arxiv.org · 12 hours ago
How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent
⚡ LLM Inference
Content type: Blog
huggingface.co · 6 days ago · Hacker News
A Unifying Lens on Reward Uncertainty in RLHF
⚡ LLM Inference
Content type: Academic
arxiv.org · 1 day ago
A Controlled Audit of Pretraining Contamination in Public Medical Vision-Language Benchmarks
📄 CS Papers
Content type: Academic
arxiv.org · 12 hours ago