🎮 Reinforcement Learning - livfan · Scour

Reinforcement Learning from Human Feedback

arxiv.org·1d

♟️Game Theory

🥇Top AI Papers of the Week

nlp.elvissaravia.com·18h

Hybrid neural–cognitive models reveal how memory shapes human reward learning

nature.com·1d

🤖Machine Learning

Multi-Agent Reinforcement Learning (MARL): Practical Guide to Cooperative and Competitive Learning

dev.to·3d·

Discuss: DEV

♟️Game Theory

Adaptive Exploration for Latent-State Bandits

arxiv.org·3d

Hybrid Model‑Based / Model‑Free Reinforcement Learning for Energy‑Efficient Autonomous Warehouse Robot Navigation with Real‑Time Obstacle Prediction **Abstra...

freederia.com·3d

Main Content || Math ∩ Programming

jeremykun.com·10h

i10e-lab/HelloRL: A fully modular framework to make Reinforcement Learning quick and easy

github.com·2d·

Discuss: Hacker News

Adaptive Neuro-Symbolic Planning for smart agriculture microgrid orchestration in hybrid quantum-classical pipelines

dev.to·23h·

Discuss: DEV

♟️Game Theory

6 AI Agents, One Company

voxyz.space·13m

AI Agents as Accountability Partners: Configurable Nudging for Your Goals

blog.turtleand.com·13h·

Discuss: DEV

Quantization-Aware Distillation

ternarysearch.blogspot.com·1d·

Discuss: Hacker News

🤖Machine Learning

Choice as an emergent feature

oop.bearblog.dev·14h

🎮Game Design

Why reinforcement learning breaks at scale, and how a new method fixes it

techxplore.com·4d

Performance Tip of the Week #94: Decision making in a data-imperfect world

abseil.io·1d

🤖Machine Learning

On Economics of A(S)I Agents

lesswrong.com·1d

♟️Game Theory

Scientists reveal the alien logic of AI: hyper-rational but stumped by simple concepts

psypost.org·1d

♟️Game Theory

Cooperative Autonomous Navigation of Legged Robots in Unstructured Terrains Using Hierarchical Reinforcement Learning — ## Abstract Legged robotic plat...

freederia.com·2d

I Let AI Agents Train Their Own Models. Here's What Actually Happened.

hamzamostafa.com·4h·

Discuss: Hacker News

Label-Consistent Backdoor Attacks

paperium.net·16h·

Discuss: DEV

🤖Machine Learning

Loading more...