artificial intelligence
Reinforcement Learning for Flow-Matching Policies with Density Transport
聽鈿欙笍AI Automation 聽Content type: AcademicA Regret Minimization Framework on Preference Learning in Large Language Models
聽馃LLMs 聽Content type: AcademicAPEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing
聽馃LLMs 聽Content type: AcademicSynthetic Benchmarks Overstate Forward-Forward Scaling: Real-Data Limits of Layer-Local Training
聽馃LLMs 聽Content type: AcademicPrincipled Agent Debate: Adversarial Arbitration for Sycophancy Reduction in Large Language Models
聽馃LLMs 聽Content type: AcademicDiffoR: A Unified Continuous Generative Framework for Universal Ordinal Regression
聽馃LLMs 聽Content type: AcademicP-Cast Precision in FP8 Attention: Sink-Induced Collapse and the Optimality of S=2^8
聽馃LLMs 聽Content type: AcademicNext-Token Prediction Learns Generalisable Representations of Sleep Physiology
聽馃LLMs 聽Content type: AcademicBioVid: Autoregressive Video Generation with Biological Behavior Semantic Comprehension
聽馃帹AI for Creators 聽Content type: AcademicNo more posts from MarkGao's subscribed feeds.