artificial intelligence

Feeds to Scour
SubscribedAll
Scoured 39 posts in 8.1 ms

A Unifying Lens on Reward Uncertainty in RLHF

馃LLMsContent type: Academic
arxiv.org

Reinforcement Learning for Flow-Matching Policies with Density Transport

鈿欙笍AI AutomationContent type: Academic
arxiv.org

Pretraining Recurrent Networks without Recurrence

馃LLMsContent type: Academic
arxiv.org

Semantic Cache Distillation: Efficient State Transfer via Reuse and Selective Patching

鈿涳笍Quantum ComputingContent type: Academic
arxiv.org

FAIR-Calib: Frontier-Aware Instability-Reweighted Calibration for Post-Training Quantization of Diffusion Large Language Models

馃LLMsContent type: Academic
arxiv.org

FADA: Accessible fetal ultrasound interpretation and annotation with a selectively distilled unified vision-language model

馃LLMsContent type: Academic
arxiv.org

A Regret Minimization Framework on Preference Learning in Large Language Models

馃LLMsContent type: Academic
arxiv.org

STAR-KV: Low-Rank KV Cache Compression via Soft Thresholding for Adaptive Rank Control

馃彧Small Business AIContent type: Academic
arxiv.org

EEGDancer: Dynamic Emotion Latent Space Masked Modeling with Reinforcement Learning for EEG Continuous Emotion Prediction

馃敩NeurotechContent type: Academic
arxiv.org

Minimum Distortion Quantization with Specified Output Distribution

馃LLMsContent type: Academic
arxiv.org

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

馃LLMsContent type: Academic
arxiv.org

Synthetic Benchmarks Overstate Forward-Forward Scaling: Real-Data Limits of Layer-Local Training

馃LLMsContent type: Academic
arxiv.org

Principled Agent Debate: Adversarial Arbitration for Sycophancy Reduction in Large Language Models

馃LLMsContent type: Academic
arxiv.org

Hidden Consensus:Preference-Validity Compression in Human Feedback

馃LLMsContent type: Academic
arxiv.org

EgoPressDiff: Multimodal Video Diffusion for Egocentric UV-Domain Hand-Pressure Estimation

馃帹AI for CreatorsContent type: Academic
arxiv.org

DiffoR: A Unified Continuous Generative Framework for Universal Ordinal Regression

馃LLMsContent type: Academic
arxiv.org

P-Cast Precision in FP8 Attention: Sink-Induced Collapse and the Optimality of S=2^8

馃LLMsContent type: Academic
arxiv.org

Next-Token Prediction Learns Generalisable Representations of Sleep Physiology

馃LLMsContent type: Academic
arxiv.org

BioVid: Autoregressive Video Generation with Biological Behavior Semantic Comprehension

馃帹AI for CreatorsContent type: Academic
arxiv.org

No more posts from MarkGao's subscribed feeds.

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help