Deep Reinforcement Learning Book
deepreinforcementlearningbook.org·2d·
Discuss: Hacker News
🏋️Isaac Gym
Superhuman AI for Multiplayer Poker
science.org·1h·
Discuss: Hacker News
🏋️Isaac Gym
Kimi Linear: An Expressive, Efficient Attention Architecture
arxiviq.substack.com·11m·
Discuss: Substack
📱Edge AI
The AI Capability Gap
blog.dwac.dev·9h·
📱Edge AI
Extending on-policy distillation for any teacher/student pair
reddit.com·3d·
Discuss: r/LocalLLaMA
🏋️Isaac Gym
PPO for LLMs: A Guide for Normal People
cameronrwolfe.substack.com·5d·
Discuss: Substack
🤖llm
Emergent Introspective Awareness in Large Language Models
lesswrong.com·2d
🤖llm
How Well Does RL Scale?
tobyord.com·2d·
Discuss: Hacker News
📱Edge AI
Speedrunning an RL Environment
sidb.in·12h·
Discuss: Hacker News
🏋️Isaac Gym
AI and Intuition
theheasman.com·1h·
Discuss: Hacker News
📱Edge AI
Humanity Needs Democratic Control of AI
jacobin.com·11h·
Discuss: Hacker News
📱Edge AI
On-Policy Distillation
thinkingmachines.ai·5d·
🤖llm
Bridging Minds and Machines
ofcarbonandsilicon.substack.com·4d·
Discuss: Substack
👁️‍🗨️Multimodal Sensing
Books for Robots (Only)
jmadden.org·6h·
Discuss: Hacker News
📱Edge AI
Emergent introspective awareness in large language models
transformer-circuits.pub·1d·
Discuss: Hacker News
🤖llm
Transforming Expense Management with AI Agent Orchestration
insideaiagents.com·1d·
Discuss: Hacker News
📱Edge AI
Context-Bench: Benchmarking LLMs on Agentic Context Engineering
letta.com·1d·
Discuss: Hacker News
🤖llm
Your Transformer is Secretly an EOT Solver
elonlit.com·1d·
Discuss: Hacker News
📱Edge AI
The Abode of Salvation
rohanparanjpe.substack.com·1d·
Discuss: Substack
🧮Mathematics
Design for Learning
kris.pengy.ca·6d·
Discuss: Hacker News
🚗Autonomous Vehicles