ML Systems
BIDENT: Heterogeneous Operator-level Mapping for Efficient Edge Inference
ย ๐Scaling Laws ย Content type: AcademicSpectrumKV: Per-Token Mixed-Precision KV Cache Transfer for Prefill-Decode Disaggregated LLM Serving
ย ๐ฌLLMs ย Content type: AcademicStageFrontier: Synchronization-Aware Stage Accounting for Distributed ML Training
ย ๐Deep Learning ย Content type: AcademicSTAR-KV: Low-Rank KV Cache Compression via Soft Thresholding for Adaptive Rank Control
ย ๐Scaling Laws ย Content type: AcademicSABLE: GPU-Based Power Flow Accelerator for Sparsity-Aware Batched Learning
ย ๐Deep Learning ย Content type: AcademicDeployBench: Benchmarking LLM Agents for Research Artifact Deployment
ย ๐คAI Agents ย Content type: AcademicNo more posts from Bingran's subscribed feeds.