Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLM Training
🧠 LLM Training
Specific
LLM training, pretraining, RLHF, model training, arxiv ML
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
233
posts in
22.3
ms
kaggle.com
·
3d
3 days ago
If a 270M
Model
Already Worked, Why Did I
Fine-Tune
a 7B One?
Discussed on
DEV
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for If a 270M Model Already Worked, Why Did I Fine-Tune a 7B One?
Lobsters
·
3d
3 days ago
What's the advice for
LLM
poisoning of artwork these days?
Discussed on
Lobsters
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for What's the advice for LLM poisoning of artwork these days?
medium.com
·
5d
5 days ago
From Intern to AI Agent: How
Hugging
Face
’s
ML
Intern Is Redefining Work
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for From Intern to AI Agent: How Hugging Face’s ML Intern Is Redefining Work
arXiv
·
2d
2 days ago
Comparing
Transformers
and Hybrid
Models
at the Token Level
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Comparing Transformers and Hybrid Models at the Token Level
Hugging Face
·
3d
3 days ago
baidu/Unlimited-OCR
Covered by
5 sources
See all sources covering this story
including
The Rundown AI
,
VentureBeat
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for baidu/Unlimited-OCR
arXiv
·
2d
2 days ago
Provably
Efficient
Policy-Reward
Co-Pretraining
for Adversarial Imitation Learning
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Provably Efficient Policy-Reward Co-Pretraining for Adversarial Imitation Learning
news.smol.ai
·
6d
6 days ago
not much happened today | AINews
Covers
6 stories
See all stories this covers
including
GLM-5.2 is the new leading open weights model on Artificial Analysis
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for not much happened today | AINews
GitHub
·
5d
5 days ago
Show HN: Pragmatiq – open-source framework for foundational
models
in banking
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Pragmatiq – open-source framework for foundational models in banking
GitHub
·
5d
5 days ago
Out of Stealth (Kinda)
Covers
uv
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Out of Stealth (Kinda)
arXiv
·
1d
1 day ago
Transformer-Based
Language
Models
Across Domain Verticals: Architectures, Applications and Critical Assessment
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Transformer-Based Language Models Across Domain Verticals: Architectures, Applications and Critical Assessment
Hugging Face
·
2d
2 days ago
Shipping
huggingface
_hub every week with AI, open tools, and a human in the loop
Covers
Opencode – open-source alternative to Claude Code
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Shipping huggingface_hub every week with AI, open tools, and a human in the loop
fig.inc
·
5d
5 days ago
Breaking Browser-Use
Models
Using Domain Randomization
Discussed on
Hacker News
and
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Breaking Browser-Use Models Using Domain Randomization
GitHub
·
5d
5 days ago
Show HN: I built an
11-LLM
consensus engine to detect AI hallucination
Covers
Show HN: An AI that reliably builds full-stack apps by preventing LLM mistakes
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: I built an 11-LLM consensus engine to detect AI hallucination
arXiv
·
5h
5 hours ago
Natural Ungrokking: Asymmetric Control of Which Rules Survive
Pretraining
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Natural Ungrokking: Asymmetric Control of Which Rules Survive Pretraining
Hugging Face
·
1d
1 day ago
Qwen-AgentWorld-35B-A3B: a 3B-active MoE
trained
to simulate MCP, terminal, SWE, Android, web and OS environments
Covers
2 stories
See all stories this covers
including
vllm-project/vllm
Covered by
3 sources
See all sources covering this story
including
GitHub
,
indiehacker.news
Discussed on
r/LocalLLaMA
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Qwen-AgentWorld-35B-A3B: a 3B-active MoE trained to simulate MCP, terminal, SWE, Android, web and OS environments
arXiv
·
1d
1 day ago
TuringViT: Making SOTA Vision
Transformers
Accessible to All
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for TuringViT: Making SOTA Vision Transformers Accessible to All
arXiv
·
1d
1 day ago
A Physics-Informed Fourier-Wavelet
Transformer
for Multiscale Computational Fluid Dynamics Surrogate
Modeling
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for A Physics-Informed Fourier-Wavelet Transformer for Multiscale Computational Fluid Dynamics Surrogate Modeling
arXiv
·
1d
1 day ago
Aligning MusicLLM with Emotion using
Instruction
Tuning
and Feedback-Driven Alignment
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Aligning MusicLLM with Emotion using Instruction Tuning and Feedback-Driven Alignment
Hugging Face
·
6d
6 days ago
MosaicLeaks: Can your research agent keep a secret?
Covered by
tldr.tech
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for MosaicLeaks: Can your research agent keep a secret?
arXiv
·
1d
1 day ago
Tri-Efficient
Transfer
Learning for Point Cloud Videos
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Tri-Efficient Transfer Learning for Point Cloud Videos
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report