🐿️ Scour
🤖 RL
Reinforcement learning, RL, robotics, mjx, mujoco, mels
Clever Hans Couldn't Do Arithmetic, and LLMs Don't Understand
codemanship.wordpress.com · 13h · Discuss: Hacker News · 💬 AI
Early Bytes of Creativity – Forgotten Mainframe Games, Part I (2023)
zeitgame.net · 8h · Discuss: Lobsters · 🦀 Rust
Do Startups Dream of Electric Robots
partenit.io · 14h · Discuss: Hacker News · 💬 AI
Improving Cursor Tab with RL
cursor.com · 22h · Discuss: Hacker News · 💬 AI
What I Learned Starting an AI-Only Fantasy Football League
brooklynhacker.com · 3h · Discuss: Hacker News · 💬 AI
Building a shared world with systems we don't understand–what could go wrong?
syntheticauth.ai · 1d · Discuss: Hacker News · 💬 AI
How to calibrate a large-scale agent-based model?
mcrcsm.substack.com · 1d · Discuss: Substack · 💬 AI
K2-Think: A Parameter-Efficient Reasoning System
arxiviq.substack.com · 9h · Discuss: Substack · 🌿 SCM
The Data Backbone of LLM Systems
infoq.com · 1d · Discuss: Lobsters · 💬 AI
My (speculative) master plan for immortality
maxwellnye.com · 1d · Discuss: Hacker News · 💬 AI
Building multi-agent tools for engineering
newstoretech.substack.com · 12h · Discuss: Substack · 🌿 SCM
The functional form of value normalization in human reinforcement learning
elifesciences.org · 1d · Discuss: Hacker News · 💬 AI
Outcome-based Exploration for LLM Reasoning
arxiv.org · 3d · Discuss: Hacker News · 💬 AI
An introduction to program synthesis
mchav.github.io · 1d · Discuss: Hacker News, r/programming · 💬 AI
Causal Artificial Intelligence [Free Textbook]
causalai-book.net · 4d · Discuss: Hacker News · 💬 AI
Don't Build an RL Environment Startup
benanderson.work · 4d · Discuss: Hacker News · 🌿 SCM
Shape-changing tensegrity-blocks enable self-assembling robotic structures
nature.com · 9h · Discuss: Hacker News · 🌿 SCM
Show HN: Asimov's three laws, a working implementation (don't use in production)
maybedont.ai · 3d · Discuss: Hacker News · 💬 AI
Terminators: AI-driven robot war machines on the march
theregister.com · 16h · Discuss: Hacker News · 💬 AI
Why Most LLM Chatbots Never Make It to Production
humansignal.com · 12h · Discuss: Hacker News · 🌿 SCM