🤖 ML
machine learning, neural networks, deep learning, training
EvalStop: Using World Feedback to Detect and Correct Reward Overoptimization in Multi-Tenant RLHF Platforms
⚙️ Mechanical Sympathy
Content type: Academic
arxiv.org · 6 days ago
The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
🎯 Embedding Models
Content type: Academic
arxiv.org · 1 day ago
A Unifying Lens on Reward Uncertainty in RLHF
🎲 Probability
Content type: Academic
arxiv.org · 1 day ago
Pretraining Recurrent Networks without Recurrence
🔍 SPLADE
Content type: Academic
arxiv.org · 5 days ago
Hybridizing Equilibrium Propagation with Ising Machines for Efficient Energy-Based Learning
⚛️ Quantum Computing
Content type: Academic
arxiv.org · 1 day ago
Beyond Patches: Superpixel Token-based Transformers for Attribute-Specific Fashion Retrieval
🔍 Information Retrieval
Content type: Academic
arxiv.org · 13 hours ago
Multilingual Sentiment Aware Text Summarization A Reinforcement Learning Approach for Consistency Maintenance
🔍 Information Retrieval
Content type: Academic
arxiv.org · 1 day ago
Sequential Data Poisoning in LLM Post-Training
🗜️ Compression Algorithms
Content type: Academic
arxiv.org · 6 days ago
Reinforcement Learning for Flow-Matching Policies with Density Transport
⚙️ Adaptive Execution
Content type: Academic
arxiv.org · 1 day ago
Signed Dual Attention: Capturing Signed Dependencies in Time Series Forecasting
🔍 Information Retrieval
Content type: Academic
arxiv.org · 6 days ago
Toward Compiler World Models: Learning Latent Dynamics for Efficient Tensor Program Search
🧮 Constraint Solvers
Content type: Academic
arxiv.org · 1 day ago
Towards Tight Bounds for Streaming Attention
🧩 Complexity Theory
Content type: Academic
arxiv.org · 2 days ago
GenAutoML: An Agentic Framework for Dynamic Architecture Generation and Optimization in Time-Series Analysis
🔄 Incremental Computation
Content type: Academic
arxiv.org · 5 days ago
Perturbative Contrastive Physical Learning
🎯 Physics Simulation
Content type: Academic
arxiv.org · 1 day ago
GOTabPFN: From Feature Ordering to Compact Tokenization for Tabular Foundation Models on High-Dimensional Data
💰 Cost-Based Optimization
Content type: Academic
arxiv.org · 5 days ago
A Regret Minimization Framework on Preference Learning in Large Language Models
🎯 Embedding Models
Content type: Academic
arxiv.org · 1 day ago
BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization
💰 Cost-Based Optimization
Content type: Academic
arxiv.org · 6 days ago
Reconstructing Multi-Decadal Forest Disturbances: A Spatio-Temporal Transformer Approach
〰️ Signal Processing
Content type: Academic
arxiv.org · 2 days ago
Q-VGM: Q-Guided Value-Gradient Matching for Flow-Matching VLA Policies
🧮 SMT Solvers
Content type: Academic
arxiv.org · 1 day ago
Sparse Mixture-of-Experts Reward Models Learn Interpretable and Specialized Experts for Personalized Preference Modeling
📉 Embeddings Optimization
Content type: Academic
arxiv.org · 6 days ago