Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🎮 Reinforcement Learning
Specific
RL, reward functions, policy gradient, RLHF
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
188
posts in
24.7
ms
Decoupling KL and Trajectories: A Unified Perspective for SFT, DAgger, Offline
RL
, and OPD in LLM Distillation
📐
ML Theory
arxiv.org
·
2d
Eric Jang – Building AlphaGo from scratch
♟️
Game Theory
dwarkesh.com
·
5d
·
Hacker News
PopuLoRA: Co-Evolving LLM Populations for Reasoning Self-Play
🤖
AI Agents
vmax.ai
·
7h
·
Hacker News
Long Context Pre-Training w/ Lighthouse Attention
💬
LLMs
mail.bycloud.ai
·
1d
Reinforcement
Learning
: An Introduction (2nd Edition)
📐
ML Theory
chizkidd.github.io
·
5d
Can EPF be attached for liabilities? What happens if employer recovers contribution but doesn’t deposit? FAQs answered
📈
Business News
livemint.com
·
11h
Agora-1: The
Multi-Agent
World Model
🤖
AI Agents
odyssey.ml
·
2d
·
Hacker News
GRIP-VLM:
RL
for Efficient Vision-Language Models
💬
LLMs
startuphub.ai
·
6d
Massachusetts' Institute of Technology Introduction to
Deep
Learning
🧠
Neural Networks
i-programmer.info
·
1d
MegaTrain Full Precision Training of 100B+ Parameter LLMs on a Single GPU
💬
LLMs
github.com
·
3d
·
Hacker News
Courts grants personal protection order against mother who repeatedly cursed at daughter during writing exercise
📡
Information Theory
channelnewsasia.com
·
2d
·
r/singapore
not much happened today
📖
Open Source
news.smol.ai
·
6d
Less-relevant results
Brasada Capital Q1 2026 Client Letter
📈
Business News
seekingalpha.com
·
4h
2D map of 26,741M/CV papers from CVPR, NeurIPS, ICML, ICLR (2024–2025)
📐
ML Theory
matejgazda.com
·
6d
·
Hacker News
This DIY Robot Kit Puts Humanoid Development in Your Garage for $15,000
💻
Tech News
gadgetreview.com
·
2d
·
r/artificial
Australia’s Submarine Problems
♟️
Game Theory
thediplomat.com
·
16h
‘
Marquee
project’ on underwater vehicles to kickstart AUKUS pillar two
💻
Tech News
watoday.com.au
·
23h
Deep
Reinforcement
Learning
Framework for Diversified Portfolio Management Across Global Equity Markets
🧠
Neural Networks
arxiv.org
·
2d
LLM Inference
💬
LLMs
iop.systems
·
2h
AI researchers report gaps in
agent
reliability and safety
🤖
AI Agents
kite.kagi.com
·
5d
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help