🤖 reinforcement learning - ddboline · Scour

Variance Reduction Based Experience Replay for Policy Optimization

arxiv.org·6d

📊linear programming

Mode-Dependent Rectification for Stable PPO Training

arxiv.org·6d

📊linear programming

Part 5: Reward Engineering: How to Shape Behaviors in Financial/Robotic Tasks

dev.to·6d·

Discuss: DEV

🧩operations research

Multi-Agent Reinforcement Learning (MARL): Practical Guide to Cooperative and Competitive Learning

dev.to·6d·

Discuss: DEV

📊linear programming

An attempt at a First-Proof AI challenge

abhvio.us·3d·

Discuss: Hacker News

📊linear programming

Manufacturing QMS Software

samrian.com·2d·

Discuss: Hacker News

🧩operations research

Oatmeal - Constraint propagation for fun

eli.li·4d·

Discuss: Lobsters, Hacker News

📊linear programming

Autonomous PRD Agent

minicodemonkey.github.io·3d·

Discuss: Hacker News

🧩operations research

AI Orchestrators Decision Table

gist.github.com·3d·

Discuss: Hacker News

📊linear programming

EU AI Act Compliance for Enterprise AI Systems: What Your Engineering Team Needs to Build

medium.com·3d·

Discuss: Hacker News

📊linear programming

Mathematical Resolution of P vs NP through Informational Noise Subtraction and Linear O(n) Mapping

zenodo.org·4d·

Discuss: Hacker News

📊linear programming

From Prediction to Compilation: A Manifesto for Intrinsically Reliable AI

news.ycombinator.com·3d·

Discuss: Hacker News

🧩operations research

Rule #1 for coding with AI agents

zknill.io·2d·

Discuss: Hacker News

🧩operations research

Continual learning and the post monolith AI era

baseten.co·5d·

Discuss: Hacker News

📊linear programming

AI Workflows with human-in-the-loop

weavemind.ai·4d·

Discuss: Hacker News

🧩operations research

How We Give AI Agents Long-Term Memory Without Blowing the Budget

metaduck.com·3d·

Discuss: DEV, Hacker News

📊linear programming

Double Rootlessness: AI's Cognitive Illusion and Systemic Risk Amplification

news.ycombinator.com·3d·

Discuss: Hacker News

🧩operations research

OpenClaw: I gave an AI my credit card and let it loose on Amazon

codedojo.com·2d·

Discuss: Hacker News

Experiments in building bespoke tools with AI

knlb.dev·3d·

Discuss: Hacker News

🧩operations research

Quantization-Aware Distillation

ternarysearch.blogspot.com·4d·

Discuss: Hacker News

📊linear programming

Loading more...