🧠 LLM Training
Specific
LLM training, pretraining, RLHF, model training, arxiv ML
Scoured 244 posts in 15.5 ms
🧠 LLM Research · arXiv · 2 days ago
moBERTo: A Modern Encoder for Portuguese via Continued Pretraining of ModernBERT
⚙️ LLM Fine-tuning · kaggle.com · 4 days ago
QLoRA: Fine-Tuning a 7B Model on a 16GB GPU (It Shrank to 5.4GB in Front of Me)
Discussed on DEV
🧠 LLM Research · arXiv · 2 days ago
Provably Efficient Policy-Reward Co-Pretraining for Adversarial Imitation Learning
🤖 AI Development · GitHub · 5 days ago
Show HN: Alloy – a PyTorch backend and inference engine for Apple Silicon
Discussed on Hacker News
🔀 LoRA · arXiv · 20 hours ago
Memory-Efficient Policy Libraries with Low-Rank Adaptation in Reinforcement Learning
🎮 Reinforcement Learning · arXiv · 1 day ago
Weight-Space Geometry of Offline Reasoning Training
🎯 Post-training · arXiv · 1 day ago
Aligning MusicLLM with Emotion using Instruction Tuning and Feedback-Driven Alignment
🔬 ML Research · arXiv · 20 hours ago
The Geometry of Sequential Learning: Lie-Bracket Prediction of Transfer Order
🧠 LLM Research · arXiv · 20 hours ago
Emergent Capabilities Arise Randomly from Learning Sparse Attention Patterns
🧠 LLM Research · arXiv · 1 day ago
TuringViT: Making SOTA Vision Transformers Accessible to All
🎮 Reinforcement Learning · arXiv · 20 hours ago
Towards Scalable Multi-Task Reinforcement Learning with Large Decision Models
🧠 LLM Research · arXiv · 2 days ago
Where Does the Signal Live? A Web Data Recipe for Medical Encoder Pretraining
🧠 LLM Research · arXiv · 20 hours ago
Natural Ungrokking: Asymmetric Control of Which Rules Survive Pretraining
🏆 LLM Benchmarking · arXiv · 20 hours ago
Cliff Tokens: Identifying Single-Token Failure Triggers in LLM Mathematical Reasoning
🧠 LLM Engineering · arXiv · 2 days ago
Priority-Aware Learning-Unlearning Correction for Dynamic Decentralized LoRA Fine-Tuning
🧠 LLM · arXiv · 20 hours ago
EPTS: Elastic Post-Training Sparsity for Efficient Large Language Model Compression
🧠 LLM Research · arXiv · 2 days ago
Technical Report for the ICRA 2026 GOOSE 2D Fine-Grained Semantic Segmentation Challenge: Pretraining-Diverse Ensemble of Foundation Vision Encoders for Robust ...
🔍 Interpretability · arXiv · 20 hours ago
Perfect Detection, Failed Control: The Geometry of Knowing vs. Steering in Language Models
🧠 LLM Engineering · arXiv · 2 days ago
Enhancing LLMs for Graph Tasks via Graph-aware LoRA Generation
🎯 RLHF · arXiv · 2 days ago
Repeated post-training is not Self-improving: Diagnosing Scientific Amnesia in Continual DPO Pipelines