Scaling Coding-Agent RL to 32x H100s. 160% Improvement on Stanford's TBench
github.com·1d·
🧠AI
Flag this post
Adaptive Human-Computer Interaction Strategies Through Reinforcement Learning in Complex
arxiv.org·1d
👆human-computer interaction
Flag this post
Why agents do not write most of our code – a reality check
octomind.dev·19h·
Discuss: Hacker News
🧠AI
Flag this post
The Oversight Game: Learning to Cooperatively Balance an AI Agent's Safety and Autonomy
arxiv.org·4d
🧠AI
Flag this post
New prompt injection papers: Agents Rule of Two and The Attacker Moves Second
simonwillison.net·1d·
Discuss: Hacker News
🤖agents
Flag this post
There is no such thing as conscious artificial intelligence – Nature
nature.com·5h·
Discuss: Hacker News
🧠AI
Flag this post
Simplifying Preference Elicitation in Local Energy Markets: Combinatorial Clock Exchange
arxiv.org·1d
🧠AI
Flag this post
Partially Observable Multi-Agent Reinforcement Learning with Information Sharing
arxiv.org·5d
🧠AI
Flag this post
What's up with Anthropic predicting AGI by early 2027?
lesswrong.com·20h·
Discuss: Hacker News
🧠AI
Flag this post
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·1d·
Discuss: Hacker News
🧠AI
Flag this post
A Practitioner's Guide to Kolmogorov-Arnold Networks
arxiviq.substack.com·1d·
Discuss: Substack
🧠AI
Flag this post
Maestro – The orchestration engine that replicates human judgment
news.ycombinator.com·6h·
Discuss: Hacker News
UI generation
Flag this post
Show HN: An AI that keeps your internal documentation alive
davia.ai·17h·
Discuss: Hacker News
👆human-computer interaction
Flag this post
The Noise and the Signal
russmiles.substack.com·7h·
Discuss: Substack
🌊Stream Processing
Flag this post
When AI Trading Agents Compete: Adverse Selection of Meta-Orders by Reinforcement Learning-Based Market Making
arxiv.org·1d
🧠AI
Flag this post
Adaptive Control for a Physics-Informed Model of a Thermal Energy Distribution System: Qualitative Analysis
arxiv.org·1d
🧠AI
Flag this post
A Project Is Not a Bundle of Tasks
secondthoughts.ai·12h·
Discuss: Hacker News
🧠AI
Flag this post
Agents Are Commoditizing the Complement
andreasfragner.com·16h·
Discuss: Hacker News
🤖agents
Flag this post
Show HN: GPU-accelerated sandboxes for running AI coding agents in parallel [video]
youtube.com·3d·
Discuss: Hacker News
🤖agents
Flag this post
The race to train AI robots how to act human in the real world
latimes.com·3h·
Discuss: Hacker News
🧠AI
Flag this post