Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
Q-Learning, Policy Gradients, Environments, Rewards
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112776
posts in
546.4
ms
Can We Really Learn One Representation to
Optimize
All
Rewards
?
arxiv.org
·
1d
🔀
Transformers
Think Longer to Explore Deeper: Learn to Explore In-Context via
Length-Incentivized
Reinforcement Learning
arxiv.org
·
1d
🔀
Transformers
The AI Jobs
Non-Apocalypse
: An Update
aei.org
·
16h
🤖
AI
How to
Leverage
Explainable
AI for Better Business Decisions
towardsdatascience.com
·
1d
🤖
AI
How low-bit
inference
enables
efficient AI
dropbox.tech
·
1h
·
Discuss:
Hacker News
🤖
AI
Human-like
metacognitive
skills will reduce LLM
slop
and aid alignment and capabilities
lesswrong.com
·
1d
🔀
Transformers
Simulating
Users with State Alignment Beats Response
Imitation
humanlm.stanford.edu
·
4h
🔀
Transformers
Inversion
of Control
mandar.dev
·
18h
·
Discuss:
Hacker News
🤖
AI
I gave my
OpenClaw
GTM
assistant a brain. Here's what happened
shawnharris.com
·
15h
·
Discuss:
Hacker News
🔀
Transformers
A New LLM System for
Synthesis
Planning
science.org
·
12h
🏗️
Data Engineering
London-based
Stanhope
AI raises €6.7 million for adaptive AI in
robotics
and defence applications
europedigital.cloud
·
1d
🤖
AI
DaVinci-Agency
: A
Shortcut
to Long-Horizon AI Agents
hackernoon.com
·
12h
🤖
AI
Worlds
: A Simulation Engine for Agentic
Pentesting
dreadnode.io
·
1d
·
Discuss:
Hacker News
🌐
Distributed Systems
Optimization of interpretable
hydropower
reservoir operation rules by
denoising
diffusion probabilistic model, parallel chaotic cooperation search algorithm and...
sciencedirect.com
·
18h
🔧
Feature Engineering
Computer Vision Agent
npmjs.com
·
1h
·
Discuss:
Hacker News
🤖
AI
Researchers propose a self-distillation fix for ‘
catastrophic
forgetting
’ in LLMs
infoworld.com
·
2d
🌐
Distributed Systems
How to
spend
your
bonus
kill-the-newsletter.com
·
15h
🔍
Query Languages & APIs
The
Rational
Use of
Cognitive
Resources
press.princeton.edu
·
4d
🔀
Transformers
Distributed Training Across Mixed GPUs:
Solving
the
Heterogeneous
Fleet Problem
shardpool.aurora-sentient.net
·
1h
·
Discuss:
DEV
🔪
Database Sharding
Persistent
memory for AI agents, local-first and open source
engram-ai.dev
·
15h
·
Discuss:
Hacker News
🌐
Distributed Systems
Loading...
Loading more...
« Page 1
•
Page 3 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help