ddboline's Feed · Scour

Distributional Reinforcement Learning with Diffusion Bridge Critics

arxiv.org·3d

🤖reinforcement learning

A Newbie's First Contribution to (Rust for) Linux

blog.buenzli.dev·11h·

Discuss: Hacker News

i10e-lab/HelloRL: A fully modular framework to make Reinforcement Learning quick and easy

github.com·2d·

Discuss: Hacker News

🤖reinforcement learning

GAS: Enhancing Reward-Cost Balance of Generative Model-assisted Offline Safe RL

arxiv.org·3d

🤖reinforcement learning

AWS Lambda with Rust and Closure Syntax

aws.amazon.com·12h·

Discuss: r/rust

Adaptive Neuro-Symbolic Planning for smart agriculture microgrid orchestration in hybrid quantum-classical pipelines

dev.to·21h·

Discuss: DEV

🤖reinforcement learning

An attempt at a First-Proof AI challenge

abhvio.us·19h·

Discuss: Hacker News

📊linear programming

Accelerate your discovery by parallelizing experiments

magellink.com·14h·

Discuss: Hacker News

🧩operations research

On Economics of A(S)I Agents

lesswrong.com·1d

🤖reinforcement learning

Splitwise alternative that makes sense

divvy.club·21h·

Discuss: Hacker News

📊linear programming

Show HN: ShapeGuard – Shape Contracts for NumPy and Jax

news.ycombinator.com·2h·

Discuss: Hacker News

C and Undefined Behaviour

lelanthran.com·16h·

Discuss: Hacker News, r/C_Programming, r/programming

Barn Owls Know When to Wait (iuSTDP part 2)

blog.typeobject.com·1d·

Discuss: Hacker News

🤖reinforcement learning

Oatmeal - Constraint propagation for fun

eli.li·1d·

Discuss: Lobsters, Hacker News

📊linear programming

Learning Models with Uniform Performance via Distributionally Robust Optimization

dev.to·1d·

Discuss: DEV

🤖reinforcement learning

Quantization-Aware Distillation

ternarysearch.blogspot.com·1d·

Discuss: Hacker News

🤖reinforcement learning

LLMs Are Prediction Machines

kaelandt.github.io·12h·

Discuss: Hacker News

🤖reinforcement learning

I Let AI Agents Train Their Own Models. Here's What Actually Happened.

hamzamostafa.com·2h·

Discuss: Hacker News

🤖reinforcement learning

OvidijusParsiunas/are-you-random: 🎲 Browser game that predicts your "random" choices

github.com·21h·

Discuss: Hacker News

🤖reinforcement learning

Heterogeneous Processing: A Strategy for Augmenting Moore's Law (2006)

linuxjournal.com·17h·

Discuss: Hacker News

🧩operations research
