💬 LLMs - Bingran · Scour

Does Topic Sentiment Cause Perceived Ideology? Comparing Human and LLM Annotations in Political News Articles

⚙️Model Training Academic

Muon Learns More Robust and Transferable Features than Adam

⚙️Model Training Academic

Epistemic Injustice in Language Models: An Audit of Pretraining Filters and Guardrails

⚙️Model Training Academic

World Pilot: Steering Vision-Language-Action Models with World-Action Priors

⚙️Model Training Academic

BUDDY: BUdget-Driven DYnamic Depth Routing for Adaptive Large Language Model Inference

🖥️ML Systems Academic

Dual-Stance Evaluation of Sycophancy: The Structure of Agreement and the Limits of Intervention

🔍Interpretability Academic

A Regret Minimization Framework on Preference Learning in Large Language Models

🎮Reinforcement Learning Academic

GuardNet: Ensemble Strategies of Shallow Neural Networks for Robust Prompt Injection and Jailbreak Detection

📉Deep Learning Academic

The Structural Attention Tax: How Retrieval Format Hijacks In-Context Learning Independent of Content

🔄Transformers Academic

MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding

⚙️Model Training Academic

Towards Tight Bounds for Streaming Attention

🧠AI Research Academic

Less-relevant results

Substrate Asymmetry in User-Side Memory: A Diagnostic Framework

🎮Reinforcement Learning Academic

In-Context Learning for Latent Space Bayesian Optimization

⚙️Model Training Academic

TrustMargin: Training-Free Arbitration between Parametric Memory and Retrieved Evidence in Large Language Models

⚙️Model Training Academic

Translate-R1: Cost-Aware Translation Tool Use via Reinforcement Learning

🤖AI Agents Academic

How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions

⚙️Model Training Academic

What Do People Actually Want From AI? Mapping Preference Plurality

🎮Reinforcement Learning Academic

A Unifying Lens on Reward Uncertainty in RLHF

🎮Reinforcement Learning Academic

APT: Action Expert Pretraining Improves Instruction Generalization of Vision-Language-Action Policies

⚙️Model Training Academic

PC Layer: Polynomial Weight Preconditioning for Improving LLM Pre-Training

⚙️Model Training Academic

