Model Training

Feeds to Scour
SubscribedAll
Scoured 340 posts in 9.6 ms

Breaking the Ice: Analyzing Cold Start Latency in vLLM

 🖥️ML Systems  Content type: Academic
arxiv.org··Hacker News

Detecting Sensitive Personal Information in Japanese Pre-Training Corpora for Large Language Models

 💬LLMs  Content type: Academic
arxiv.org·

Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

On Subquadratic Architectures: From Applications to Principles

 📐Scaling Laws  Content type: Academic
arxiv.org·

The Fine-Tuning Trap: Evaluating Negative Transfer and the Role of PEFT in Sub-1B Mathematical Reasoning

 📐Scaling Laws  Content type: Academic
arxiv.org·

FiberTune: Preserving Action-Fiber Visual Residuals in Vision-Language-Action Fine-Tuning

 🔄Transformers  Content type: Academic
arxiv.org·

Bridging the Morphology Gap: Adapting VLA Models to Dexterous Manipulation via Intent-Conditioned Fine-Tuning

 📐Scaling Laws  Content type: Academic
arxiv.org·

Two Bridges, One Pathway: From VLMs to Generalizable VLAs with Embodied Trajectory-Coupled Data

 💬LLMs  Content type: Academic
arxiv.org·

Epistemic Injustice in Language Models: An Audit of Pretraining Filters and Guardrails

 💬LLMs  Content type: Academic
arxiv.org·

Breaking the Tokenizer Barrier: On-Policy Distillation across Model Families

 💬LLMs  Content type: Academic
arxiv.org·

SwiftCTS: Fast Cross-Design Prediction and Pareto Optimization of Clock Tree Metrics via Few-Shot Calibration

 🔍Interpretability  Content type: Academic
arxiv.org·

Reinforcement Learning for Flow-Matching Policies with Density Transport

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

APT: Action Expert Pretraining Improves Instruction Generalization of Vision-Language-Action Policies

 💬LLMs  Content type: Academic
arxiv.org·

PC Layer: Polynomial Weight Preconditioning for Improving LLM Pre-Training

 💬LLMs  Content type: Academic
arxiv.org·

SAFER-Nav: Enhancing Safety for Visual Robot Navigation via Segmentation-Aware Fine-Tuning

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Predictive Coding with Bayesian Priors via Proximal Gradients

 📉Deep Learning  Content type: Academic
arxiv.org·

SceneMiner: Identity-Preserving Multi-Task Fine-Tuning for Unified BEV Scene Mining

 🔄Transformers  Content type: Academic
arxiv.org·

FlowPRO: Reward-Free Reinforced Fine-Tuning of Flow-Matching VLAs via Proximalized Preference Optimization

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

RCAP: Robust, Class-Aware, Probabilistic Dynamic Dataset Pruning

 📉Deep Learning  Content type: Academic
arxiv.org·

Defending Against Malicious Finetuning by Scaling Train-time Adversarial Attacks

 📐Scaling Laws  Content type: Academic
arxiv.org·

No more posts from Bingran's subscribed feeds.

Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help