Why your AI evals keep breaking
atla-ai.comยท2hยท
Discuss: Hacker News
๐ŸŽจAI UX Design
Flag this post
Introducing Agent-o-rama: build, trace, evaluate, and monitor stateful LLM agents in Java or Clojure
blog.redplanetlabs.comยท19hยท
Discuss: Hacker News
๐ŸƒAgentic
Flag this post
Transducer: Composition, Abstraction, Performance
funktionale-programmierung.deยท2hยท
Discuss: Hacker News
๐Ÿ”ŒMCP Protocol
Flag this post
Scaling Coding-Agent RL to 32x H100s. 160% Improvement on Stanford's TBench
github.comยท1dยท
๐ŸƒAgentic
Flag this post
The Noise and the Signal
russmiles.substack.comยท7hยท
Discuss: Substack
๐Ÿ”ŒMCP Protocol
Flag this post
Freephdlabor: Customizable multiagent research automation system
freephdlabor.github.ioยท5dยท
Discuss: Hacker News
๐ŸƒAgentic
Flag this post
The Constrained Application Protocol (CoAP)
datatracker.ietf.orgยท19hยท
Discuss: Hacker News
๐Ÿ”ŒMCP Protocol
Flag this post
The APM paradox
honeybadger.ioยท1dยท
๐Ÿ”ŒMCP Protocol
Flag this post
After the Last Git Commit
gist.github.comยท1dยท
Discuss: Hacker News
๐Ÿ”ŒMCP Protocol
Flag this post
What data do coding agents send, and where to?
chasersystems.comยท41mยท
Discuss: Hacker News
๐ŸPython development
Flag this post
Google's Jeff Dean on the Coming Era of Virtual Engineers
sequoiacap.comยท1dยท
Discuss: Hacker News
๐ŸŽจAI UX Design
Flag this post
Absurd Workflows: Durable Execution With Just Postgres
lucumr.pocoo.orgยท1dยท
๐Ÿ”ŒMCP Protocol
Flag this post
The AI Monetization Playbook
ondeviceguy.substack.comยท1dยท
Discuss: Substack
๐ŸŽจAI UX Design
Flag this post
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.ioยท1dยท
Discuss: Hacker News
๐Ÿ”ŒMCP Protocol
Flag this post
The AI Capability Gap
blog.dwac.devยท2dยท
๐Ÿ”ŒMCP Protocol
Flag this post
Human-agent collaboration for long-running agents
actively.aiยท3dยท
Discuss: Hacker News
๐ŸŽจAI UX Design
Flag this post
Kubernetes + Ceph: Your Freedom from the Cloud Cartel
oneuptime.comยท15hยท
๐Ÿ”ŒMCP Protocol
Flag this post
Show HN: An AI that keeps your internal documentation alive
davia.aiยท16hยท
Discuss: Hacker News
๐ŸŽจAI UX Design
Flag this post
How to design effective agent workflows?
boliv.substack.comยท3dยท
Discuss: Substack
๐Ÿ”ŒMCP Protocol
Flag this post
Enforcing Architecture in an Agent-Driven Codebase
phoebe.workยท21hยท
Discuss: Hacker News
๐ŸPython development
Flag this post