🎮 Reinforcement Learning - emmmmdty · Scour

Show HN: Fighting the War Against Expensive Reinforcement Learning

cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app·7h·

Discuss: Hacker News

Rollout-Training Co-Design for Efficient LLM-Based Multi-Agent Reinforcement Learning

arxiv.org·1d

Found-RL: foundation model-enhanced reinforcement learning for autonomous driving

arxiv.org·9h

🔄Transformers

A multi-agent reinforcement learning approach to autonomous aircraft taxiing with taxiing time, fuel consumption, and emission optimization

sciencedirect.com·1d

check out this article on Reinforcement Learning with R: Origins, Real-Life Applications, and Practical Implementation

dev.to·2d·

Discuss: DEV

🔄Transformers

Multi AI Agent Systems with crewAI

deeplearning.ai·3h

A training principle for drifting models

breno.bearblog.dev·3h

🤖Machine Learning

A masterclass in AI security operations

redcanary.com·1h

Your AI Strategy Has a Human-Shaped Hole

superiortech.io·48m·

Discuss: Hacker News

Feedback Control for Computer Systems

janert.org·7h

Observe emergent behavior in autonomous multi-agent LLM networks

agents.glide2.app·1d·

Discuss: Hacker News

AI Agents Explained in 3 Levels of Difficulty

kdnuggets.com·1d·

Discuss: Hacker News

Why the future of AI belongs to models that simulate reality

sifted.eu·4h

Robotics Motion Learning: Training Linked Robot Arms with Kuramoto Models

hackernoon.com·23h

GLM-5: From Vibe Coding to Agentic Engineering

simonwillison.net·20h·

Discuss: Hacker News

JupyterPS/VBAF: Visual Business Automation Framework - PowerShell-based reinforcement learning for education and business automation

github.com·2d·

Discuss: Hacker News

Recursive self-improvement from AI models

marginalrevolution.com·1d·

Discuss: Hacker News

I Pitted 3 AI Agents Against Each Other. The Result Was Scary.

pub.towardsai.net

·1d

I benchmarked 4 CLI coding agents on an NP-hard optimization problem I solved by hand 8 years ago. One of them beat me.

charlesazam.com·14m·

Discuss: Hacker News

Task-Completion Time Horizons of Frontier AI Models

metr.org·22h·

Discuss: Hacker News

Loading more...