Model Training
Epistemic Injustice in Language Models: An Audit of Pretraining Filters and Guardrails
💬LLMs Content type: Academic
Breaking the Tokenizer Barrier: On-Policy Distillation across Model Families
💬LLMs Content type: Academic
Reinforcement Learning for Flow-Matching Policies with Density Transport
🎮Reinforcement Learning Content type: Academic
PC Layer: Polynomial Weight Preconditioning for Improving LLM Pre-Training
💬LLMs Content type: Academic
SceneMiner: Identity-Preserving Multi-Task Fine-Tuning for Unified BEV Scene Mining
🔄Transformers Content type: Academic
RCAP: Robust, Class-Aware, Probabilistic Dynamic Dataset Pruning
📉Deep Learning Content type: Academic
Defending Against Malicious Finetuning by Scaling Train-time Adversarial Attacks
📐Scaling Laws Content type: Academic