Algorithmic Alchemy: Transmuting Dynamic Programming with Gradients by Arvind Sundararajan
♟Chess Programming
Flag this post
Equilibrium Policy Generalization: A Reinforcement Learning Framework for Cross-Graph Zero-Shot Generalization in Pursuit-Evasion Games
arxiv.org·15h
♟Chess Programming
Flag this post
News for October 2025
ptreview.sublinear.info·21h
⭕Consistent Hashing
Flag this post
Superhuman AI for Multiplayer Poker
🎯Bitboards
Flag this post
Computation as a Game
arxiv.org·15h
♟Chess Programming
Flag this post
How Transformer Models Detect Anomalies in System Logs
hackernoon.com·1d
👁️Observability
Flag this post
original ↗
allendowney.com·20h
🎯Bitboards
Flag this post
AI Function Calling: Composing and Decomposing Functions for Complex Tasks
💬Natural Language Processing
Flag this post
Information Gain-based Policy Optimization: A Simple and Effective Approach forMulti-Turn LLM Agents
🤖Machine Learning
Flag this post
Bayesian Natural Gradient Fine-Tuning of CLIP Models via Kalman Filtering
arxiv.org·15h
🤖Machine Learning
Flag this post
Masked Softmax Layers in PyTorch
🎯Bitboards
Flag this post
What to Do When Your Credit Risk Model Works Today, but Breaks Six Months Later
towardsdatascience.com·1h
🤖Machine Learning
Flag this post
Thoughts on "Static Retrival Revisited"
curiouscoding.nl·21h
🌲B-Trees
Flag this post
Cutting LLM Batch Inference Time in Half: Dynamic Prefix Bucketing at Scale
⭕Consistent Hashing
Flag this post
Loading...Loading more...