ddboline's Top FindsLoading...
Accelerating MySQL Query Optimization via Reinforcement Learning & Hypergraph Analysis
dev.to·3h·
Discuss: DEV
🤖reinforcement learning
Flag this post
Show HN: I Vibe-Coded a TUI for AWS Logs Insights in Rust
github.com·1h·
Discuss: Hacker News
🦀Rust
Flag this post
Power Constrained Nonstationary Bandits with Habituation and Recovery Dynamics
arxiv.org·17h
🤖reinforcement learning
Flag this post
The Reinforcement Learning Handbook: A Guide to Foundational Questions
towardsdatascience.com·8h
🤖reinforcement learning
Flag this post
Moves Are Broken
youtube.com·3h
🦀Rust
Flag this post
Discrete Fourier Transform: Introduction (2020)
chciken.com·6h·
Discuss: Hacker News
📊linear programming
Flag this post
GPT-4 Functions as Monoidal Structures: Sequential ∘ and Parallel ⊗
lightcapai.medium.com·6h·
Discuss: Hacker News
🧩operations research
Flag this post
American Wind Farms
tech.marksblogg.com·15h·
Discuss: Hacker News
📊linear programming
Flag this post
A Guide to My Organizational Workflow
cachestocaches.com·22h·
Discuss: Hacker News
🧩operations research
Flag this post
Reasoning with Sampling: Your Base Model Is Smarter Than You Think
aakaran.github.io·5h·
Discuss: Hacker News
🤖reinforcement learning
Flag this post
Rodrigo Girão Serrão: A generator, duck typing, and a branchless conditional walk into a bar
mathspp.com·2d
🦀Rust
Flag this post
Sign up or login to customize your feed and get personalized topic recommendations
Harness the Power of Atlas Search and Vector Search with $RankFusion
mongodb.com·7h·
Discuss: Hacker News
🧩operations research
Flag this post
Algorithmic Alchemy: Transmuting Dynamic Programming with Gradients by Arvind Sundararajan
dev.to·2d·
Discuss: DEV
🤖reinforcement learning
Flag this post
Learning Without Critics? Revisiting GRPO in Classical Reinforcement Learning Environments
arxiv.org·17h
🤖reinforcement learning
Flag this post
SampCert: Verified Foundations for Differential Privacy (PLDI 2025)
dl.acm.org·5h·
Discuss: Hacker News
Flag this post
The Orchestrator Pattern: Routing Conversations to Specialized AI Agents
dev.to·1d·
Discuss: DEV
🤖reinforcement learning
Flag this post
Explaining Human Choice Probabilities with Simple Vector Representations
arxiv.org·17h
🤖reinforcement learning
Flag this post
Dynamic Freight Route Optimization via Multi-Agent Reinforcement Learning with Adaptive Risk Aversion
dev.to·15h·
Discuss: DEV
🤖reinforcement learning
Flag this post