Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
Q-Learning, Policy Gradients, Environments, Rewards
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112771
posts in
269.2
ms
Learning beyond Teacher:
Generalized
On-Policy Distillation with Reward
Extrapolation
arxiv.org
·
23h
·
Discuss:
Hacker News
🔀
Transformers
Provable
Offline Reinforcement Learning for Structured Cyclic
MDPs
arxiv.org
·
23h
🤖
AI
Survival in the
Thorny
Jungle
: Tracking Wild Animals & Catching Stream Fish Alone
youtube.com
·
11h
🌐
Distributed Systems
Agentic AI Chip Design,
Networking
Chip, Edge AI:
Embedded
Week Insights
embedded.com
·
3h
🌐
Distributed Systems
FinovateEurope
2026: From AI
Hype
To Bank‑Ready Execution
forrester.com
·
1d
🏗️
Data Engineering
The 4
Precision
Formats
: How to Train AI 2× Faster with Half the Memory
pub.towardsai.net
·
14h
🤖
AI
AI Agents Now
ADAPT
To
Messy
Real-World Problems, Not Just Perfect Tests
quantumzeitgeist.com
·
1d
🤖
AI
AI captures
particle
accelerator
behavior to optimize machine performance
phys.org
·
13h
🤖
AI
GPU-Serving
Two-Tower
Models for Lightweight Ads Engagement Prediction
medium.com
·
4h
🧭
Vector Databases
Microsoft Tests AI
Marketplace
Simulation
i-programmer.info
·
9h
🏗️
Data Engineering
Recursive
self-improvement
from AI models
marginalrevolution.com
·
3d
·
Discuss:
Hacker News
🤖
AI
Diffusion Models for
ARC-AGI
: A
Retrospective
christopherhwood.com
·
2d
·
Discuss:
Hacker News
🔀
Transformers
Building
Physical
Agentic
AI
dansitu.substack.com
·
11h
·
Discuss:
Substack
🌐
Distributed Systems
AI
Outperforms
Humans in
Countless
Areas
psychologytoday.com
·
10h
🔀
Transformers
Navigation/Route
Calculation
System
dev.to
·
10h
·
Discuss:
DEV
🔍
Query Languages & APIs
A
masterclass
in AI security
operations
redcanary.com
·
1d
🤖
AI
Olmix
: A framework for data mixing throughout
LM
development
allenai.org
·
12h
🏗️
Data Engineering
What Murder Mystery 2 reveals about
emergent
behaviour
in online games
artificialintelligence-news.com
·
12h
🤖
AI
At-home movement state classification using totally
implantable
cortical-basal
ganglia
neural interface
science.org
·
14h
🔀
Transformers
Scaling
LLM Post-Training at Netflix
netflixtechblog.com
·
20h
🔧
Feature Engineering
Sign up or log in to see more results
Sign Up
Login
« Page 2
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help