🎮 强化学习 - doxnotejun · Scour

How’s it going? Reinforcement learning in language models recruits a functional welfare axis 💬NLP

functionalwelfare.com·6d·Hacker News

AI model predicts building fire spread, redirecting evacuees to safer exits in real time 👁️计算机视觉

techxplore.com·2d·Hacker News

Off-Policy RL Replay Buffer Memory Leak: Fix 2M Step Crash 🗂️知识管理

tildalice.io·4d

Nvidia enters Windows AI PC race with new RTX Spark chip: All major announcements at Computex 2026 💬NLP

indianexpress.com·5d

Learning to replenish: A hybrid deep reinforcement learning for dynamic inventory management in the pharmaceutical supply chains 👁️计算机视觉 Academic

Microsoft Build 2026: Be yourself at work 🤖人工智能 Blog

blogs.microsoft.com·4d

Build 2026: Organizations Can Unlock Enterprise Intelligence with Microsoft IQ 👁️计算机视觉

Lessons and Concepts from Reinforcement Learning 👁️计算机视觉 Blog

shahzaib.bearblog.dev·6d

Postdoc position in philosophy of science with focus on astrophysics (Jagiellonian University, 3 years, full time) 🧠认知科学 Blog

takingupspacetime.wordpress.com·4d

Copilot super app leaks 🤖, Minimax M3 ➕, Nvidia N1X ⚡️ 🗂️知识管理

A Functional Taxonomy of World Models – Fei Fei Li 🤖机器学习 Blog

drfeifei.substack.com·3d·Substack

Build 2026: Microsoft tops Google in image generation while playing catch-up on reasoning 👁️计算机视觉

the-decoder.com

·3d

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents 👁️计算机视觉 Academic

New comment by stuartjohnson in "Ask HN: Who wants to be hired? (June 2026)" 🤖人工智能

drive.google.com·5d·Hacker News

The RL Flywheel That Actually Works 🤖机器学习

discord.gg·5d·DEV

AI Weather Models, Tech Layoffs, & Anthropic IPO 👁️计算机视觉

briefing.forwardfuture.ai·4d

DeepSeek fundraising 💰, Meta model delays ⌛ , Gemma 4 12B 🤖 🤖人工智能

Representation Learning Enables Scalable Multitask Deep Reinforcement Learning 👁️计算机视觉 Academic

Opus 4.8, OpenRouter, Cognition, Snowflake, and a papal warning 👁️计算机视觉 Blog

thesequence.substack.com·6d·Substack

I made a kernel 2.2x faster. It made my training loop 3x slower 🤖人工智能 Blog

kyrieblunders.bearblog.dev·4d·Hacker News

Sign up or log in to see more results

Log in to enable infinite scrolling