reinforcement learning

Feeds to Scour
SubscribedAll
Scoured 353 posts in 8.0 ms

Spotlight On: Dreamplug Technologies Private Limited (CRED), a New Principal Participating Organization

 🧩operations research  Content type: Blog

Optimisation over non-stationary distributions creates weirder minds

 🧩operations research
lesswrong.com·

You're doing it wrong

 🏃‍♀️running  Content type: News
understandably.com·

Startup Ricursive to Create an End-to-End AI Model for Chip Design

 🧩operations research  Content type: News
eetimes.com·

Stack Overflow didn't just help AI learn to code

 🦀Rust

Weekly Research Recap

 🧩operations research  Content type: News
quantseeker.com·

Beyond Dexterity: Why Contact May Define the Next Era of Robotics

 🧩operations research  Content type: Video  Content type: News

Bridging Multi-Vector and Learned-Sparse Retrieval, A Diagnostic Framework for Robust Semantic IDs, and More!

 📊linear programming  Content type: News  Content type: Blog

Edge AI enabled MIMO MC-CDMA for 6G optimizing spectrum and energy efficiency with SIC and deep reinforcement learning

 📊linear programming  Content type: Academic
nature.com·

The Effective Sample Size

 🧩operations research

Nvidia Nemotron 3 Ultra

 🦀Rust

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

 📊linear programming  Content type: Academic
arxiv.org·

What is MBPO? A Beginner’s Guide to Efficient Reinforcement Learning

 📊linear programming  Content type: Blog

Daimon Robotics and Galbot jointly launches RobOmni for benchmarking tactile perception and dexterous manipulation

 🧩operations research
therobotreport.com·

Major Types of Machine Learning

 📊linear programming  Content type: Blog
medium.com·
Less-relevant results

BYD Great Han Arrives: D-Class Sedan, Directly Challenging BBA in the 300,000 Yuan Class

 🧩operations research
autonews.gasgoo.com·

Vibe Diaries: Training Nanochat

 🦀Rust
vibediary.dev··Hacker News

SLUUG Talk: Demystifying Large Language Models on Linux

 📊linear programming  Content type: Code
github.com··DEV

Comp.compilers: Paper: MileStone: A Multi-Objective Compiler Phase Ordering Framework for Graph-based IR-Level Optimization

 📊linear programming
compilers.iecc.com·

A Unifying Lens on Reward Uncertainty in RLHF

 📊linear programming  Content type: Academic
arxiv.org·
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help