Reinforcement learning, RL, robotics, mjx, mujoco, mels

Clever Hans Couldn't Do Arithmetic, and LLMs Don't Understand
codemanship.wordpress.com·13h·
Discuss: Hacker News
💬AI
Early Bytes of Creativity – Forgotten Mainframe Games, Part I (2023)
zeitgame.net·8h·
Discuss: Lobsters
🦀Rust
Do Startups Dream of Electric Robots
partenit.io·14h·
Discuss: Hacker News
💬AI
Improving Cursor Tab with RL
cursor.com·22h·
Discuss: Hacker News
💬AI
What I Learned Starting an AI-Only Fantasy Football League
brooklynhacker.com·3h·
Discuss: Hacker News
💬AI
Building a shared world with systems we don't understand–what could go wrong?
syntheticauth.ai·1d·
Discuss: Hacker News
💬AI
How to calibrate a large-scale agent-based model?
mcrcsm.substack.com·1d·
Discuss: Substack
💬AI
K2-Think: A Parameter-Efficient Reasoning System
arxiviq.substack.com·9h·
Discuss: Substack
🌿SCM
The Data Backbone of LLM Systems
infoq.com·1d·
Discuss: Lobsters
💬AI
My (speculative) master plan for immortality
maxwellnye.com·1d·
Discuss: Hacker News
💬AI
Building multi-agent tools for engineering
newstoretech.substack.com·12h·
Discuss: Substack
🌿SCM
The functional form of value normalization in human reinforcement learning
elifesciences.org·1d·
Discuss: Hacker News
💬AI
Outcome-based Exploration for LLM Reasoning
arxiv.org·3d·
Discuss: Hacker News
💬AI
An introduction to program synthesis
mchav.github.io·1d·
💬AI
Causal Artificial Intelligence [Free Textbook]
causalai-book.net·4d·
Discuss: Hacker News
💬AI
Don't Build an RL Environment Startup
benanderson.work·4d·
Discuss: Hacker News
🌿SCM
Shape-changing tensegrity-blocks enable self-assembling robotic structuress
nature.com·9h·
Discuss: Hacker News
🌿SCM
Show HN: Asimov's three laws, a working implementation (don't use in production)
maybedont.ai·3d·
Discuss: Hacker News
💬AI
Terminators: AI-driven robot war machines on the march
theregister.com·16h·
Discuss: Hacker News
💬AI
Why Most LLM Chatbots Never Make It to Production
humansignal.com·12h·
Discuss: Hacker News
🌿SCM