🎮 Reinforcement Learning - hussoster · Scour

Why Robotics Is a Pre-Paradigm Field

🧠Deep Learning News

whattotelltherobot.com··Hacker News

Towards End to End Motion Planning and Execution for Autonomous Underwater Vehicles Using Reinforcement Learning

🧠Neural Network Architectures Academic

What is MBPO? A Beginner’s Guide to Efficient Reinforcement Learning

🤖Transformer Architecture Blog

ujangriswanto08.medium.com·

Model predictive task sampling for efficient and robust adaptation

🚀Model Deployment Academic

World Model Self-Distillation: Training World Models to Solve General Tasks

🎲Synthetic Data Generation Academic

UNIQ: Conformal Calibration for Adaptive Conservatism in Offline Reinforcement Learning

🤖AI Academic

Reinforcement learning in linear embedding space unlocks generalizable control across soft robot configurations

🤖AI Academic

Deep reinforcement learning for process design: Review and perspective

🧠Neural Network Architectures Academic

Dmsh: A Multi-Agent Reinforcement Learning Framework for All-Quad Mesh Generation

🧠Neural Network Architectures Academic

GIFT: LLM-Guided State-Reward Interface for Financial Reinforcement Learning

🤖AI Academic

CFCamo: A Counterfactual Detect-or-Abstain Framework for Camouflaged Object Detection

🧠Deep Learning Academic

Geometry-Aware Reinforcement Learning for 2D Irregular Nesting

🤖Transformer Architecture Academic

Development of COVID-19 Booster Vaccine Policy by Microsimulation and Q-learning

🧠Neural Network Architectures Academic

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

🔄LSTM Networks Academic

Performance Variation in Deep Reinforcement Learning

🔄LSTM Networks Academic

Event-Driven Reinforcement Learning Enables Long-Horizon Control in Semiconductor Fabrication

🔄LSTM Networks Academic

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

📈Time Series Forecasting Academic

On Advantage Estimates for Max@K Policy Gradients

🤖AI Academic

Bellman-Taylor Score Decoding for Markov Decision Processes with State-Dependent Feasible Action Sets

🔄LSTM Networks Academic

Path Planning Using Deep Deterministic Policy Gradient: A Reinforcement Learning Approach

🧠Neural Network Architectures Academic

Log in to enable infinite scrolling