ddboline's Top Finds
Reinforcement Learning: How Machines Learn to Make Smart Choices Like You Do
dev.to·1d·
Discuss: DEV
🤖reinforcement learning
Help with AI Fatigue
news.ycombinator.com·2h·
Discuss: Hacker News
Stop vibe coding your unit tests
andy-gallagher.com·1d·
Discuss: Hacker News
🦀Rust
Periodic Skill Discovery
arxiv.org·15h
🤖reinforcement learning
Mathematical exploration and discovery at scale
terrytao.wordpress.com·16h·
Discuss: Hacker News
🤖reinforcement learning
Expected Value Analysis in AI Product Management
towardsdatascience.com·4h
🧩operations research
LazyLLM, Easiest and laziest way for building multi-agent LLMs applications
github.com·20h·
Discuss: Hacker News
🤖reinforcement learning
Optimal Boundary Control of Diffusion on Graphs via Linear Programming
arxiv.org·15h
📊linear programming
Petri Dish Neural Cellular Automata
pub.sakana.ai·1d·
Discuss: Hacker News
🤖reinforcement learning
[Deep Dive] How We Solved Poker: From Academic Bots to Superhuman AI (1998-2025)
gist.github.com·18h·
Discuss: r/programming
🤖reinforcement learning
Can-t stop till you get enough
cant.bearblog.dev·4d·
Discuss: Hacker News
🦀Rust
Up and Down the Ladder of Abstraction
worrydream.com·1d·
Discuss: Hacker News
🤖reinforcement learning
Going Beyond Expert Performance via Deep Implicit Imitation Reinforcement Learning
arxiv.org·15h
🤖reinforcement learning
The Complexity Cliff: Why Reasoning Models Work Right Up Until They Don't
rewire.it·20h·
Discuss: Hacker News
🤖reinforcement learning
Shrinking the Variance: Shrinkage Baselines for Reinforcement Learning with Verifiable Rewards
arxiv.org·15h
🤖reinforcement learning
Disciplined Biconvex Programming
arxiv.org·2d
🧩operations research
10 Polars One-Liners for Speeding Up Data Workflows
kdnuggets.com·6h
Coding on Paper
thepalindrome.org·5h·
Discuss: Hacker News
Online Learning to Rank under Corruption: A Robust Cascading Bandits Approach
arxiv.org·15h
🤖reinforcement learning