Reinforcement Learning

Feeds to Scour
SubscribedAll
Scoured 388 posts in 7.8 ms

Experts weigh in on Anthropic’s Fable 5, Mythos 5 releases

 📐Formal Methods
sdtimes.com·

I got so mad at poke(rogue)like that I trained a RL agent to beat it for me

 🤖Machine Learning

Space-sampled Value Decay: Forgetting Mechanisms for Non-stationary Deep Reinforcement Learning

 💬LLMs  Content type: Academic
arxiv.org·

How AI chatbots become better learning coaches

 💬LLMs
techxplore.com·

🥇Top AI Papers of the Week

 🤖AI  Content type: News
nlp.elvissaravia.com·

Mbodi AI (YC P25) Is Hiring Founding Machine Learning Engineer (Robotics)

 🤖AI

San Francisco Construction Security Company: Complete Guide to Protecting Your Job Site in 2026

 💻Tech Industry  Content type: Blog
medium.com·

CCKS: Consensus-based Communication and Knowledge Sharing

 🖧Distributed Systems  Content type: Academic
arxiv.org·

Edge AI enabled MIMO MC-CDMA for 6G optimizing spectrum and energy efficiency with SIC and deep reinforcement learning

 🤖Machine Learning  Content type: Academic
nature.com·

The Exploit Always Wins

 ✍️Prompt Engineering  Content type: Blog
abhishek-shankar.com·

Comp.compilers: Paper: MileStone: A Multi-Objective Compiler Phase Ordering Framework for Graph-based IR-Level Optimization

 ⚙️Compilers
compilers.iecc.com·

You're doing it wrong

 🍳Cooking  Content type: News
understandably.com·

Variational Proximal Policy Optimization

 🤖Machine Learning  Content type: Academic
arxiv.org·

Bridging Multi-Vector and Learned-Sparse Retrieval, A Diagnostic Framework for Robust Semantic IDs, and More!

 💬LLMs  Content type: News  Content type: Blog

SLUUG Talk: Demystifying Large Language Models on Linux

 🤖AI  Content type: Code
github.com··DEV

Geometrically Averaged Hard Target Updates for Linear Q-Learning

 🤖Machine Learning  Content type: Academic
arxiv.org·

Sequent: scale and automation for higher confidence in alignment

 🤖AI
lesswrong.com·

HERO: Hindsight-Enhanced Reflection from Environment Observations for Agentic Self-Distillation

 🏗️AI Infrastructure  Content type: Academic
arxiv.org·

BeatpulseLabs raises $1.8M pre-seed to scale AI training data

 🤖Machine Learning  Content type: News
tech.eu·

Fast and Highly Expressive Policy Learning for Offline Reinforcement Learning via Bootstrapped Flow Q-Learning

 🏗️AI Infrastructure  Content type: Academic
arxiv.org·
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help