Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
73043
posts in
869.5
ms
Difficulty-Estimated
Policy Optimization
arxiv.org
·
1d
📊
Optimization
Prism
: Spectral
Parameter
Sharing for Multi-Agent Reinforcement Learning
arxiv.org
·
1d
📮
Multi-producer Queues
The price of intelligence
cyb3rops.medium.com
·
1d
🧠
Cognitive Science
Manufacturing
QMS
Software
samrian.com
·
13h
·
Discuss:
Hacker News
⚓
Anchors
Learning by
hand
is better than learning by AI
blog.engora.com
·
10h
·
Discuss:
Hacker News
🎭
Program Synthesis
Everything I know about good system design
seangoedecke.com
·
3h
⚙️
Systems Programming
Building LLMs in
Resource-Constrained
Environments
: A Hands-On Perspective
infoq.com
·
17h
💬
Prompt Engineering
Tips
lonestation.itch.io
·
2d
🔍
Miniselect
Predicting
operators
reliability
for control room alarm management using knowledge-based Bayesian networks
sciencedirect.com
·
3d
🏠
Home Automation
(8) AI Meets Brain: Memory Systems from
Cognitive
Neuroscience
to Autonomous Agents
arxiviq.substack.com
·
18h
·
Discuss:
Substack
💬
Prompt Engineering
Part 5: Reward Engineering: How to Shape
Behaviors
in
Financial/Robotic
Tasks
dev.to
·
4d
·
Discuss:
DEV
📊
Dynamic Programming
Rethinking
imitation
learning with Predictive
Inverse
Dynamics Models
microsoft.com
·
4d
🔲
Cellular Automata
Hybrid Model‑Based / Model‑Free Reinforcement Learning for Energy‑Efficient Autonomous Warehouse Robot Navigation with Real‑Time
Obstacle
Prediction **
Abstra
...
freederia.com
·
3d
🤖
Robotics
Show HN: We added
AGENTS.md
to 120 challenges so AI
teaches
instead of codes
frontendmentor.io
·
13h
·
Discuss:
Hacker News
💬
Prompt Engineering
Choice
as an
emergent
feature
oop.bearblog.dev
·
1d
📖
Interactive Fiction
Skills:
teaching
AI agents to act
consistently
dev.to
·
10h
·
Discuss:
DEV
🤖
Automation
A
GTM
guide to AI models
revengine.substack.com
·
2d
·
Discuss:
Substack
💬
Prompt Engineering
Designing
a Cost-Efficient
Agentic
System
p.agnihotry.com
·
11h
·
Discuss:
Hacker News
⚓
Anchors
On
Recursive
Self-Improvement
(Part I)
hyperdimensional.co
·
18h
⚓
Anchors
Habit
Detection For Home
Assistant
hackaday.com
·
1d
🏠
Home Automation
Loading...
Loading more...
« Page 2
•
Page 4 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help