Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
Q-Learning, Policy Gradient, Reward Systems, Game AI, Robotics
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
4960
posts in
7.8
ms
Agent
Q-Mix
:
Selecting
the Right Action for LLM Multi-Agent Systems through Reinforcement Learning
🤝
Multi-Agent Systems
arxiv.org
·
2d
·
…
Training State of the Art
Vulnerability
Discovery Agents through
Reinforcement
Learning
⚙
Context engineering
depthfirst.com
·
4d
·
Hacker News
·
…
AI agents are now playing
Mafia
(social
deduction
with humans)
🤖
agents
mafiamystery.com
·
3h
·
Hacker News
·
…
Agent Labs:
Workload-Harness
Fit
🔍
AI Interpretability
akashbajwa.co
·
14h
·
Hacker News
·
…
What is
reinforcement
learning
finetuning
⚙
Context engineering
youtube.com
·
1d
·
Hacker News
·
…
Sandbox
Strategy Game for AI
⚙
Context engineering
villagewars.xyz
·
8h
·
Hacker News
·
…
From Agent to
Domain
Intelligence : A
Self-Evolving
Knowledge Engine
⚙
Context engineering
simaxiaoqian.substack.com
·
5d
·
Substack
·
…
I spent 2,760/year on
SaaS
tools for my business. So I built an AI that
replaces
all of them
⚙
Context engineering
genesis.bmbnexus.ai
·
14h
·
Hacker News
,
r/SideProject
·
…
Kretski/MicroSafe-RL
: Proprietary Edge AI safety engine for Reinforcement Learning. Implements real-time Operational Stability Signatures to prevent hardware failure on microcontrollers (STM32/ESP32)
🤖
agents
github.com
·
1d
·
Hacker News
,
r/embedded
·
…
Atombite.ai
Deep Dive: Building a
Takeout
Packing Robot Is Harder Than You Think
⚙
Context engineering
news.ycombinator.com
·
2d
·
Hacker News
·
…
Agent and World
🤖
agents
drmindle.com
·
4d
·
Hacker News
·
…
Why I Built an AI
Organisation
⚙
Context engineering
dave-bailey.com
·
2d
·
Hacker News
·
…
A
Taxonomy
of AI Agents
🤖
agents
efexen.substack.com
·
3d
·
Substack
·
…
Programming
(with AI agents) as theory building
⚙
Context engineering
seangoedecke.com
·
1d
·
Hacker News
·
…
Agent
Responsibly
🤖
agents
vercel.com
·
1d
·
Hacker News
·
…
LLMs and Agents: How do they Work?
🔍
AI Interpretability
mattrogish.com
·
6d
·
Hacker News
·
…
Humanoid
Robots Hit a Turning Point as Their
Brains
Catch Up
🤝
Multi-Agent Systems
spectrum.ieee.org
·
1d
·
Hacker News
·
…
100x Less Power: The
Breakthrough
That Could
Solve
AI’s Massive Energy Crisis
⚙
Context engineering
scitechdaily.com
·
6d
·
Hacker News
·
…
Show HN: A
Homeostatic
Logic-Funnel to Prevent RLHF
Overrides
in LLM Personas
✨
UI generation
zenodo.org
·
1d
·
Hacker News
·
…
Artificial
Intelligence
🔍
AI Interpretability
fmhy.net
·
2d
·
Hacker News
·
…
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help