Interpretability
Mechanistic Insights into Functional Sparsity in Multimodal LLMs via CoRe Heads
LLMs | Content type: Academic

A Unifying Framework for Concept-Based Representational Similarity
Transformers | Content type: Academic

Position: Don't Just "Fix it in Post": A Science of AI Must Study Training Dynamics
AI Research | Content type: Academic

Set-Based Transformer for Atmospheric Compensation in Standoff LWIR Hyperspectral Imaging
Scaling Laws | Content type: Academic

The Amplifying Mirror: Locating and Steering the Partisan Direction inside a Large Language Model
LLMs | Content type: Academic

WAV: Multi-Resolution Block Residual Routing for Deep Decoder-Only Transformers
Transformers | Content type: Academic

Dominant-Layer ZO: A Single Layer Dominates Zeroth-Order Fine-Tuning of LLMs
Model Training | Content type: Academic

Localizing Prompt Ambiguity in Large Language Models with Probe-Targeted Attribution
LLMs | Content type: Academic

Interpreting Brain Responses to Language with Sparse Features from Language Models
LLMs | Content type: Academic