Scour
🎮 RL
reinforcement learning, reward modeling, policy gradient
Scoured 177 posts in 7.4 ms
You're doing it wrong
🧩 Behavioral Economics
Content type: News
understandably.com · 2 days ago
Deep Reinforcement Learning for Adaptive Power Allocation in ISAC Systems with Mobile Target
🌐 World Models
Content type: Academic
arxiv.org · 11 hours ago
Spotlight On: Dreamplug Technologies Private Limited (CRED), a New Principal Participating Organization
🏋️ Pretraining
Content type: Blog
blog.pcisecuritystandards.org · 3 days ago
Combermere and Harrison College reach Under-15 basketball final
💬 LLMs
cbc.bb · 4 days ago
cakewalk wyrm
💬 LLMs
thevalleybelow.id · 3 days ago
KinematicRL: A Sim-to-Real Reinforcement Learning Framework For Social Navigation With Kinodynamic Feasibility
🌐 World Models
Content type: Academic
arxiv.org · 11 hours ago
AI-powered living business intelligence network
🌐 World Models
atlasforgex.com · 1 day ago · Hacker News
Reinforcement Learning Disrupts Gradient-Based Adversarial Optimization
🌐 World Models
Content type: Academic
arxiv.org · 11 hours ago
Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…
💬 LLMs
Content type: Blog
medium.com · 6 days ago
Fenn Tower Through Time: The Story of CSU’s Enduring Landmark
📈 Economics
Content type: Academic
csuohio.edu · 1 day ago
Space-sampled Value Decay: Forgetting Mechanisms for Non-stationary Deep Reinforcement Learning
🌐 World Models
Content type: Academic
arxiv.org · 11 hours ago
23 Years Ago, This Hit Comedy Hit Theaters as a Secret ‘Fight Club’ Parody, and Nobody Noticed
📊 ML
Content type: News
vice.com · 1 day ago
Hey-Meadow/meadow-mind: Zero training, second-level reactions (~400ms). A language-rule decision mind on a local 7B diffusion LM.
🌐 World Models
Content type: Code
github.com · 22 hours ago · Hacker News
Stack Overflow didn't just help AI learn to code
🧠 AI
zozo123.github.io · 4 days ago · Hacker News
Students discover long-lost Roman villa under high school gym
🤖 AI Agents
Content type: News
popsci.com · 2 days ago
Lodge School teams advance to volleyball quarter-finals
💬 LLMs
cbc.bb · 4 days ago
Multi-agent rendezvous in fluid flows via reinforcement learning
🌐 World Models
Content type: Academic
arxiv.org · 11 hours ago
I got so mad at poke(rogue)like that I trained a RL agent to beat it for me
🌐 World Models
thiagolira.blot.im · 3 days ago · Hacker News
Sequent: scale and automation for higher confidence in alignment
🧠 AI
lesswrong.com · 1 day ago
‘I can breathe again’: What district leaders said they’ve heard about their school cellphone bans
💬 LLMs
Content type: News
chalkbeat.org · 16 hours ago