LLMs

Feeds to Scour
SubscribedAll
Scoured 17 posts in 6.6 ms

LLM-Based Code Documentation Generation and Multi-Judge Evaluation

鉁嶏笍Prompt EngineeringContent type: Academic
arxiv.org

EvalStop: Using World Feedback to Detect and Correct Reward Overoptimization in Multi-Tenant RLHF Platforms

馃artificial intelligenceContent type: Academic
arxiv.org

Phantom transitions in language model fine-tuning

鉁嶏笍Prompt EngineeringContent type: Academic
arxiv.org

Towards Robust Arabic Speech Emotion Recognition with Deep Learning

馃artificial intelligenceContent type: Academic
arxiv.org

Evaluating Advanced Prompting on Gemini Flash for Multi-Hop Biomedical QA

鉁嶏笍Prompt EngineeringContent type: Academic
arxiv.org

From Symbolic to Geometric: Enabling Spatial Reasoning in Large Language Models

鉁嶏笍Prompt EngineeringContent type: Academic
arxiv.org

A Regret Minimization Framework on Preference Learning in Large Language Models

馃artificial intelligenceContent type: Academic
arxiv.org
Less-relevant results

Hidden Consensus:Preference-Validity Compression in Human Feedback

馃artificial intelligenceContent type: Academic
arxiv.org

BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization

馃artificial intelligenceContent type: Academic
arxiv.org

A Unifying Lens on Reward Uncertainty in RLHF

馃artificial intelligenceContent type: Academic
arxiv.org

Automated Pronunciation Evaluation for Korean Toddler Speech using Speech Diarization and Self-Supervised Learning

馃artificial intelligenceContent type: Academic
arxiv.org

Sparse Mixture-of-Experts Reward Models Learn Interpretable and Specialized Experts for Personalized Preference Modeling

馃artificial intelligenceContent type: Academic
arxiv.org

Principled Agent Debate: Adversarial Arbitration for Sycophancy Reduction in Large Language Models

馃artificial intelligenceContent type: Academic
arxiv.org

GenTI: Benchmarking LLMs for Autonomous IDPS Rule Generation for Unseen Attacks

鉁嶏笍Prompt EngineeringContent type: Academic
arxiv.org

PolyBuild: An End-to-End Method for Polygonal Building Contour Extraction from High-Resolution Remote Sensing Images

馃artificial intelligenceContent type: Academic
arxiv.org

Low-Rank Decay for Grokking in Scale-Invariant Transformers: A Spectral-Geometric View

馃artificial intelligenceContent type: Academic
arxiv.org

Signed Dual Attention: Capturing Signed Dependencies in Time Series Forecasting

馃artificial intelligenceContent type: Academic
arxiv.org

No more posts from MarkGao's subscribed feeds.

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help