Equilibrium Policy Generalization: A Reinforcement Learning Framework for Cross-Graph Zero-Shot Generalization in Pursuit-Evasion Games
arxiv.org·2h
♟Chess Programming
News for October 2025
ptreview.sublinear.info·8h
⭕Consistent Hashing
Superhuman AI for Multiplayer Poker
🎯Bitboards
Computation as a Game
arxiv.org·2h
♟Chess Programming
How Transformer Models Detect Anomalies in System Logs
hackernoon.com·13h
👁️Observability
original ↗
allendowney.com·7h
🎯Bitboards
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents
🤖Machine Learning
Bayesian Natural Gradient Fine-Tuning of CLIP Models via Kalman Filtering
arxiv.org·2h
🤖Machine Learning
Masked Softmax Layers in PyTorch
🎯Bitboards
Robust Control Synthesis via Persistent Homology-Guided Network Pruning
🔗Distributed Systems
Relation-Aware Bayesian Optimization of DBMS Configurations Guided by Affinity Scores
arxiv.org·1d
🌲B-Trees
Naïve Shuffle Algorithm (2007)
🎯Bitboards
Benchmarking Generative AI Against Bayesian Optimization for Constrained Multi-Objective Inverse Design
arxiv.org·2h
🤖Machine Learning
Disciplined Biconvex Programming
arxiv.org·2h
♟Chess Programming
Improving in chess is hard. I built the world's most accurate human-like chess AI to help me.
🎯Bitboards
Fast Answering Pattern-Constrained Reachability Queries with Two-Dimensional Reachability Index
arxiv.org·2h
🌲B-Trees