Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
reinforcement learning
🎯 reinforcement learning
artificial intelligence,deep learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
28
posts in
17.6
ms
Agents
Need Work Data: A Primer on RLWD, or
Reinforcement
Learning
on Work Data
🤖
llm
anjalishriva.com
·
4d
4 days ago
·
Hacker News
Actions for Agents Need Work Data: A Primer on RLWD, or Reinforcement Learning on Work Data
Demystifying Hidden-State Recurrence: Switchable Latent Reasoning with
On-Policy
Reinforcement
Learning
📱
Edge AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
·
Hacker News
Actions for Demystifying Hidden-State Recurrence: Switchable Latent Reasoning with On-Policy Reinforcement Learning
Anthropic Is Taking AI Welfare Seriously. I’m Not Sure It Knows What It’s Measuring.
🤖
llm
lesswrong.com
·
15h
15 hours ago
·
Hacker News
Actions for Anthropic Is Taking AI Welfare Seriously. I’m Not Sure It Knows What It’s Measuring.
Mi50 32GB / GFX906 - vLLM Qwen 3.5 Configuration for Qwen 3.5:9B AWQ-4bit
🤖
llm
huggingface.co
·
2d
2 days ago
·
r/LocalLLaMA
Actions for Mi50 32GB / GFX906 - vLLM Qwen 3.5 Configuration for Qwen 3.5:9B AWQ-4bit
I got so mad at poke(rogue)like that I
trained
a
RL
agent
to beat it for me
🎛️
Control theory
thiagolira.blot.im
·
6d
6 days ago
·
Hacker News
Actions for I got so mad at poke(rogue)like that I trained a RL agent to beat it for me
vrtnis/tycoon-learning-environment
: A JAX transport-economy
learning
environment for route planning, cargo flow, financing, and replayable
agent
benchmarks.
🏋️
Isaac Gym
Content type:
Code
github.com
·
1d
1 day ago
·
Hacker News
Actions for vrtnis/tycoon-learning-environment: A JAX transport-economy learning environment for route planning, cargo flow, financing, and replayable agent benchmarks.
Some Ethical Problems with AI
🐝
Swarm Intelligence
Content type:
Blog
arkvis.com
·
13h
13 hours ago
·
Hacker News
Actions for Some Ethical Problems with AI
Introduction to (Multimodal) LLM-as-a-Judge
🤖
llm
Content type:
News
Content type:
Blog
yinghonglan.substack.com
·
11h
11 hours ago
·
Substack
Actions for Introduction to (Multimodal) LLM-as-a-Judge
Researchers
trained
an open source AI search
agent
, Harness-1, that outperforms GPT-5.4 on recalling relevant information
📱
Edge AI
venturebeat.com
·
5d
5 days ago
·
Hacker News
Actions for Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information
The Era of
Multi-Agent
Imagined Experience
🤖
llm
odyssey.ml
·
1d
1 day ago
·
Hacker News
Actions for The Era of Multi-Agent Imagined Experience
SkyPilot Sandboxes: Run
Agent
Code on Your Own Kubernetes, at Scale
🤖
llm
Content type:
Blog
blog.skypilot.co
·
5d
5 days ago
·
Hacker News
Actions for SkyPilot Sandboxes: Run Agent Code on Your Own Kubernetes, at Scale
Recursive Self-Improvement
🐍
Python
Content type:
News
Content type:
Blog
ana15.substack.com
·
2d
2 days ago
·
Substack
Actions for Recursive Self-Improvement
Issue 655
🤖
llm
Content type:
News
Content type:
Blog
datascienceweekly.substack.com
·
2d
2 days ago
·
Substack
Actions for Issue 655
Inside soccer’s data renaissance
👁️
Computer vision
Content type:
News
technologyreview.com
·
3d
3 days ago
·
Hacker News
Actions for Inside soccer’s data renaissance
AI-powered living business
intelligence
network
📱
Edge AI
atlasforgex.com
·
3d
3 days ago
·
Hacker News
Actions for AI-powered living business intelligence network
Kimi K2.7-Code: open-source coding
model
with better token efficiency
🤖
llm
8
articles covering this post
huggingface.co
·
2d
2 days ago
·
Hacker News
,
r/LocalLLaMA
·
Cited by 8 articles
Actions for Kimi K2.7-Code: open-source coding model with better token efficiency
Beyond Dexterity: Why Contact May Define the Next Era of Robotics
🤖
Robotics
Content type:
Video
Content type:
News
spectrum.ieee.org
·
4d
4 days ago
·
Hacker News
Actions for Beyond Dexterity: Why Contact May Define the Next Era of Robotics
Why LLMs (still) lack taste
🤖
llm
beyondtheprior.com
·
5d
5 days ago
·
Hacker News
Actions for Why LLMs (still) lack taste
Introducing the Third Generation of Apple’s Foundation
Models
📱
Edge AI
28
articles covering this post
machinelearning.apple.com
·
6d
6 days ago
·
Hacker News
,
r/apple
·
Cited by 28 articles
Actions for Introducing the Third Generation of Apple’s Foundation Models
Apple's New AI
Models
Contain 'None' of Google's Gemini Assistant
📱
Edge AI
Content type:
News
macrumors.com
·
4d
4 days ago
·
Hacker News
Actions for Apple's New AI Models Contain 'None' of Google's Gemini Assistant
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help