Model Training
pretraining, fine-tuning, training run, compute, loss curve
Scoured 340 posts in 6.8 ms
Lost in the Non-convex Loss Landscape: How to Fine-tune the Large Time Series Model?
🧠 AI Research · Academic · arxiv.org · 2 days ago
ViP-VL: Vietnamese Self-supervised Speech Pretraining Model with Vector-Quantization Learning
💬 LLMs · Academic · arxiv.org · 6 hours ago
High-Dimensional Theory of LoRA Fine-Tuning in a Solvable Attention Model
🔄 Transformers · Academic · arxiv.org · 6 days ago
Emergent Misalignment Can Be Induced by Sycophancy and Reversed via Alignment Gating
📐 Scaling Laws · Academic · arxiv.org · 2 days ago
Corpus Augmentation for Sign Language Translation via LLM-Guided Video Stitching
💬 LLMs · Academic · arxiv.org · 6 hours ago
Emergence of Context Characteristics Sensitivity in Large Language Models
🎮 Reinforcement Learning · Academic · arxiv.org · 2 days ago
Multilingual Fine-Tuning via Localized Gradient Conflict Resolution
💬 LLMs · Academic · arxiv.org · 6 days ago
Simplicity Suffices for Parameter Noise Injection in Stochastic Gradient Descent
📉 Deep Learning · Academic · arxiv.org · 6 hours ago
Stage-1 Controls the Entropy Regime, Not the Outcome
🎮 Reinforcement Learning · Academic · arxiv.org · 2 days ago
On the Geometry of On-Policy Distillation
🎮 Reinforcement Learning · Academic · arxiv.org · 3 days ago
Harness In-Context Operator Learning with Chain of Operators
💬 LLMs · Academic · arxiv.org · 6 hours ago
In-Context Learning for Latent Space Bayesian Optimization
💬 LLMs · Academic · arxiv.org · 2 days ago
Pretraining Recurrent Networks without Recurrence
📉 Deep Learning · Academic · arxiv.org · 6 days ago
The Art of Interrogation: Consistency Amplifies Factuality in Spatial Reasoning
🎮 Reinforcement Learning · Academic · arxiv.org · 6 hours ago
A Unifying Lens on Reward Uncertainty in RLHF
🎮 Reinforcement Learning · Academic · arxiv.org · 2 days ago
Categorical Prior Lock-in: Why In-Context Learning Fails for Structured Data
💬 LLMs · Academic · arxiv.org · 6 hours ago
Optimal Rates for Generalization of Gradient Descent Methods with Deep Neural Networks
📉 Deep Learning · Academic · arxiv.org · 3 days ago
Benchmarking Empirical Privacy Protection for Adaptations of Large Language Models
💬 LLMs · Academic · arxiv.org · 2 days ago
World Pilot: Steering Vision-Language-Action Models with World-Action Priors
💬 LLMs · Academic · arxiv.org · 6 hours ago
From Shortcuts to Reasoning: Robust Post-Training of Theory of Mind with Reinforcement Learning
🧠 AI Research · Academic · arxiv.org · 2 days ago