🎯 reinforcement learning - plooh · Scour

Agents Need Work Data: A Primer on RLWD, or Reinforcement Learning on Work Data

anjalishriva.com··Hacker News

Demystifying Hidden-State Recurrence: Switchable Latent Reasoning with On-Policy Reinforcement Learning

📱Edge AI Academic

arxiv.org··Hacker News

Anthropic Is Taking AI Welfare Seriously. I’m Not Sure It Knows What It’s Measuring.

lesswrong.com··Hacker News

Mi50 32GB / GFX906 - vLLM Qwen 3.5 Configuration for Qwen 3.5:9B AWQ-4bit

huggingface.co··r/LocalLLaMA

I got so mad at poke(rogue)like that I trained a RL agent to beat it for me

🎛️Control theory

thiagolira.blot.im··Hacker News

vrtnis/tycoon-learning-environment: A JAX transport-economy learning environment for route planning, cargo flow, financing, and replayable agent benchmarks.

🏋️Isaac Gym Code

github.com··Hacker News

Some Ethical Problems with AI

🐝Swarm Intelligence Blog

arkvis.com··Hacker News

Introduction to (Multimodal) LLM-as-a-Judge

🤖llm News Blog

yinghonglan.substack.com··Substack

Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information

venturebeat.com··Hacker News

The Era of Multi-Agent Imagined Experience

odyssey.ml··Hacker News

SkyPilot Sandboxes: Run Agent Code on Your Own Kubernetes, at Scale

🤖llm Blog

blog.skypilot.co··Hacker News

Recursive Self-Improvement

🐍Python News Blog

ana15.substack.com··Substack

Issue 655

🤖llm News Blog

datascienceweekly.substack.com··Substack

Inside soccer’s data renaissance

👁️Computer vision News

technologyreview.com··Hacker News

AI-powered living business intelligence network

atlasforgex.com··Hacker News

Kimi K2.7-Code: open-source coding model with better token efficiency

huggingface.co··Hacker News, r/LocalLLaMA·Cited by 8 articles

Beyond Dexterity: Why Contact May Define the Next Era of Robotics

🤖Robotics Video News

spectrum.ieee.org··Hacker News

Why LLMs (still) lack taste

beyondtheprior.com··Hacker News

Introducing the Third Generation of Apple’s Foundation Models

machinelearning.apple.com··Hacker News, r/apple·Cited by 28 articles

Apple's New AI Models Contain 'None' of Google's Gemini Assistant

📱Edge AI News

macrumors.com··Hacker News
