recommendation systems, LLM, large language model


Consistent Probabilistic Social Choice Revisited

Q-Learning
Content type: Academic
arxiv.org

A Regret Minimization Framework on Preference Learning in Large Language Models

reinforcement learning, deep learning, machine learning
Content type: Academic
arxiv.org

Traits Run Deeper: Trait-Specific Asymmetric Fusion for Personality Assessment

Transformers
Content type: Academic
arxiv.org

SkelDPO: A Skeleton-Guided Direct Preference Optimization Framework for Efficient Code Generation

RLHF
Content type: Academic
arxiv.org

ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations

reinforcement learning, deep learning, machine learning
Content type: Academic
arxiv.org

Beyond Rubrics: Exploration-Guided Evaluation Skills for Reward Modeling

RLHF
Content type: Academic
arxiv.org

Adaptive Loss Balancing for Noise-Robust GRPO in Generative Recommendation

Information Retrieval
Content type: Academic
arxiv.org

SIDInspector: A Mapping-First Diagnostic Resource for Semantic-ID Tokenizers

Information Retrieval
Content type: Academic
arxiv.org

DynaCF: Mitigating Shortcut Learning in Reward Models via Dynamic Counterfactual Sensitivity

Causal Inference
Content type: Academic
arxiv.org

DREAM: Dynamic Refinement of Early Assignment Mappings

Causal Inference
Content type: Academic
arxiv.org

Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions

RLHF
Content type: Academic
arxiv.org

SSRLive: Live Streaming Recommendation with Dynamic Semantic ID

Information Retrieval
Content type: Academic
arxiv.org

STELLAR: Spatio-Temporal Environmental Learning with Latent Alignment and Refinement for Long-Tailed Species Distribution Modeling

reinforcement learning, deep learning, machine learning
Content type: Academic
arxiv.org

Generalized Rank-based Evaluation for Knowledge Graph Completion: Perspectives, Framework, and Analyses

RAG
Content type: Academic
arxiv.org

Gryphon: A Unified Architecture for Semantic-ID Generation and Item-Level Scoring in Industrial Recommendations

Information Retrieval
Content type: Academic
arxiv.org
