Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🎮 强化学习
智能体, 奖励函数, Q学习, 策略优化
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
564
posts in
15.0
ms
How’s it going?
Reinforcement
learning
in language models recruits a
functional
welfare axis
💬
NLP
functionalwelfare.com
·
6d
·
Hacker News
AI model predicts building fire spread, redirecting evacuees to safer exits in real time
👁️
计算机视觉
techxplore.com
·
2d
·
Hacker News
Off-Policy
RL Replay Buffer Memory
Leak
: Fix 2M Step Crash
🗂️
知识管理
tildalice.io
·
4d
Nvidia enters Windows AI PC race with new RTX Spark chip: All major announcements at Computex 2026
💬
NLP
indianexpress.com
·
5d
Learning
to replenish: A hybrid
deep
reinforcement
learning
for dynamic inventory management in the pharmaceutical supply chains
👁️
计算机视觉
Academic
arxiv.org
·
2d
Microsoft Build 2026: Be yourself at work
🤖
人工智能
Blog
blogs.microsoft.com
·
4d
Build 2026: Organizations Can Unlock Enterprise Intelligence with Microsoft IQ
👁️
计算机视觉
petri.com
·
4d
Lessons and Concepts from
Reinforcement
Learning
👁️
计算机视觉
Blog
shahzaib.bearblog.dev
·
6d
Postdoc position in philosophy of science with focus on astrophysics (Jagiellonian University, 3 years, full time)
🧠
认知科学
Blog
takingupspacetime.wordpress.com
·
4d
Copilot super app leaks 🤖, Minimax M3 ➕, Nvidia N1X ⚡️
🗂️
知识管理
tldr.tech
·
6d
A
Functional
Taxonomy of World Models – Fei Fei Li
🤖
机器学习
Blog
drfeifei.substack.com
·
3d
·
Substack
Build 2026: Microsoft tops Google in image generation while playing catch-up on reasoning
👁️
计算机视觉
the-decoder.com
·
3d
Agentic
Monte Carlo: Simulating
Reinforcement
Learning
for Black-Box Agents
👁️
计算机视觉
Academic
arxiv.org
·
2d
New comment by stuartjohnson in "Ask HN: Who wants to be hired? (June 2026)"
🤖
人工智能
drive.google.com
·
5d
·
Hacker News
The RL Flywheel That Actually Works
🤖
机器学习
discord.gg
·
5d
·
DEV
AI Weather Models, Tech Layoffs, & Anthropic IPO
👁️
计算机视觉
briefing.forwardfuture.ai
·
4d
DeepSeek
fundraising 💰, Meta model delays ⌛ , Gemma 4 12B 🤖
🤖
人工智能
tldr.tech
·
3d
Representation
Learning
Enables Scalable Multitask
Deep
Reinforcement
Learning
👁️
计算机视觉
Academic
arxiv.org
·
2d
Opus 4.8, OpenRouter, Cognition, Snowflake, and a papal warning
👁️
计算机视觉
Blog
thesequence.substack.com
·
6d
·
Substack
I made a kernel 2.2x faster. It made my training loop 3x slower
🤖
人工智能
Blog
kyrieblunders.bearblog.dev
·
4d
·
Hacker News
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help