🤖 reinforcement learning - ddboline · Scour

Spotlight On: Dreamplug Technologies Private Limited (CRED), a New Principal Participating Organization

🧩operations research Blog

blog.pcisecuritystandards.org·

Optimisation over non-stationary distributions creates weirder minds

🧩operations research

lesswrong.com·

You're doing it wrong

🏃‍♀️running News

understandably.com·

Startup Ricursive to Create an End-to-End AI Model for Chip Design

🧩operations research News

Stack Overflow didn't just help AI learn to code

zozo123.github.io··Hacker News

Weekly Research Recap

🧩operations research News

quantseeker.com·

Beyond Dexterity: Why Contact May Define the Next Era of Robotics

🧩operations research Video News

spectrum.ieee.org

··Hacker News

Bridging Multi-Vector and Learned-Sparse Retrieval, A Diagnostic Framework for Robust Semantic IDs, and More!

📊linear programming News Blog

recsys.substack.com

Edge AI enabled MIMO MC-CDMA for 6G optimizing spectrum and energy efficiency with SIC and deep reinforcement learning

📊linear programming Academic

The Effective Sample Size

🧩operations research

alex.smola.org··Hacker News

Nvidia Nemotron 3 Ultra

research.nvidia.com··Hacker News

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

📊linear programming Academic

What is MBPO? A Beginner’s Guide to Efficient Reinforcement Learning

📊linear programming Blog

ujangriswanto08.medium.com·

Daimon Robotics and Galbot jointly launches RobOmni for benchmarking tactile perception and dexterous manipulation

🧩operations research

therobotreport.com·

Major Types of Machine Learning

📊linear programming Blog

Less-relevant results

BYD Great Han Arrives: D-Class Sedan, Directly Challenging BBA in the 300,000 Yuan Class

🧩operations research

autonews.gasgoo.com·

Vibe Diaries: Training Nanochat

vibediary.dev··Hacker News

SLUUG Talk: Demystifying Large Language Models on Linux

📊linear programming Code

github.com··DEV

Comp.compilers: Paper: MileStone: A Multi-Objective Compiler Phase Ordering Framework for Graph-based IR-Level Optimization

📊linear programming

compilers.iecc.com·

A Unifying Lens on Reward Uncertainty in RLHF

📊linear programming Academic

Sign up or log in to see more results

Log in to enable infinite scrolling