Model Training

Feeds to Scour
SubscribedAll
Scoured 340 posts in 6.8 ms

Lost in the Non-convex Loss Landscape: How to Fine-tune the Large Time Series Model?

 🧠AI Research  Content type: Academic
arxiv.org·

ViP-VL: Vietnamese Self-supervised Speech Pretraining Model with Vector-Quantization Learning

 💬LLMs  Content type: Academic
arxiv.org·

High-Dimensional Theory of LoRA Fine-Tuning in a Solvable Attention Model

 🔄Transformers  Content type: Academic
arxiv.org·

Emergent Misalignment Can Be Induced by Sycophancy and Reversed via Alignment Gating

 📐Scaling Laws  Content type: Academic
arxiv.org·

Corpus Augmentation for Sign Language Translation via LLM-Guided Video Stitching

 💬LLMs  Content type: Academic
arxiv.org·

Emergence of Context Characteristics Sensitivity in Large Language Models

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Multilingual Fine-Tuning via Localized Gradient Conflict Resolution

 💬LLMs  Content type: Academic
arxiv.org·

Simplicity Suffices for Parameter Noise Injection in Stochastic Gradient Descent

 📉Deep Learning  Content type: Academic
arxiv.org·

Stage-1 Controls the Entropy Regime, Not the Outcome

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

On the Geometry of On-Policy Distillation

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Harness In-Context Operator Learning with Chain of Operators

 💬LLMs  Content type: Academic
arxiv.org·

In-Context Learning for Latent Space Bayesian Optimization

 💬LLMs  Content type: Academic
arxiv.org·

Pretraining Recurrent Networks without Recurrence

 📉Deep Learning  Content type: Academic
arxiv.org·

The Art of Interrogation: Consistency Amplifies Factuality in Spatial Reasoning

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

A Unifying Lens on Reward Uncertainty in RLHF

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Categorical Prior Lock-in: Why In-Context Learning Fails for Structured Data

 💬LLMs  Content type: Academic
arxiv.org·

Optimal Rates for Generalization of Gradient Descent Methods with Deep Neural Networks

 📉Deep Learning  Content type: Academic
arxiv.org·

Benchmarking Empirical Privacy Protection for Adaptations of Large Language Models

 💬LLMs  Content type: Academic
arxiv.org·

World Pilot: Steering Vision-Language-Action Models with World-Action Priors

 💬LLMs  Content type: Academic
arxiv.org·

From Shortcuts to Reasoning: Robust Post-Training of Theory of Mind with Reinforcement Learning

 🧠AI Research  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help