🤖 reinforcement learning - ddboline · Scour

GAS: Enhancing Reward-Cost Balance of Generative Model-assisted Offline Safe RL

arxiv.org·5d

📊linear programming

Back to Basics: Revisiting Exploration in Reinforcement Learning for LLM Reasoning via Generative Probabilities

arxiv.org·5d

📊linear programming

Meta-Optimized Continual Adaptation for deep-sea exploration habitat design with embodied agent feedback loops

dev.to·3d·

Discuss: DEV

🧩operations research

Stochastic Gradient Descent Optimizes Over-parameterized Deep ReLU Networks

dev.to·3d·

Discuss: DEV

📊linear programming

Show HN: Self-improvement platform

upstep.me·1d·

Discuss: Hacker News

📊linear programming

Is this a game... or is it real? What's the difference?

bryan-murdock.blogspot.com·2d·

Discuss: Hacker News

🏃‍♀️running

Barn Owls Know When to Wait (iuSTDP part 2)

blog.typeobject.com·3d·

Discuss: Hacker News

📊linear programming

Heuristics for lab robotics, and where its future may go

owlposting.com·2d·

Discuss: Hacker News

🧩operations research

Show HN: First AI Employee – Treat AI as a hire, not a chatbot

site-beige-ten.vercel.app·2d·

Discuss: Hacker News

📊linear programming

A one-prompt attack that breaks LLM safety alignment

microsoft.com·2d·

Discuss: Hacker News

📊linear programming

Oatmeal - Constraint propagation for fun

eli.li·3d·

Discuss: Lobsters, Hacker News

📊linear programming

Manufacturing QMS Software

samrian.com·2d·

Discuss: Hacker News

🧩operations research

The AI Training Asymmetry

tostracker.app·4d·

Discuss: Hacker News

📊linear programming

Persistent Memory API for AI Agents

memoclaw.com·2d·

Discuss: Hacker News

📊linear programming

OpenClaw: I gave an AI my credit card and let it loose on Amazon

codedojo.com·2d·

Discuss: Hacker News

Autonomous PRD Agent

minicodemonkey.github.io·3d·

Discuss: Hacker News

🧩operations research

AI Orchestrators Decision Table

gist.github.com·2d·

Discuss: Hacker News

📊linear programming

Rule #1 for coding with AI agents

zknill.io·2d·

Discuss: Hacker News

🧩operations research

An attempt at a First-Proof AI challenge

abhvio.us·3d·

Discuss: Hacker News

📊linear programming

EU AI Act Compliance for Enterprise AI Systems: What Your Engineering Team Needs to Build

medium.com·2d·

Discuss: Hacker News

📊linear programming
