🎮 RL - zongyuzhang · Scour

Phi-Actor-Critic: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria

🕵️AI Agents Academic

Less-relevant results

Major Types of Machine Learning

👁️VLMs Blog

Microsoft Research's Lens proves detailed captions matter more than raw scale for training efficient image generators

🔓Open-source Models News

the-decoder.com

·

Lodge School teams advance to volleyball quarter-finals

🎭Multimodal AI

Siri AI is powered by Gemini models, but is not Gemini – what does that mean?

🔓Open-source Models

Geometrically Averaged Hard Target Updates for Linear Q-Learning

⚡Quantization Academic

Comp.compilers: Paper: MileStone: A Multi-Objective Compiler Phase Ordering Framework for Graph-based IR-Level Optimization

🖥️Inference Compute

compilers.iecc.com·

Are Classical Machine Learning Jobs Dying?

💹AI in Finance Blog

Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…

🧠LLMs Blog

Model predictive task sampling for efficient and robust adaptation

🖥️Inference Compute Academic

Fast and Highly Expressive Policy Learning for Offline Reinforcement Learning via Bootstrapped Flow Q-Learning

🧠LLMs Academic

Memoirs of a Learning Machine: Autobiographical Self-Training and the Self-Training Gap

🕵️AI Agents

zenodo.org··Hacker News

Robots are closing in on human-like judgments, addressing a key challenge in physical AI

🤖Embodied AI

techxplore.com·

Beyond Dexterity: Why Contact May Define the Next Era of Robotics

🦾Robotics Video News

spectrum.ieee.org

··Hacker News

Hey-Meadow/meadow-mind: Zero training, second-level reactions (~400ms). A language-rule decision mind on a local 7B diffusion LM.

🔧Tool Use Code

github.com··Hacker News

Mult-DPO: Multinomial Direct Preference Optimization for Recommender Systems

👁️VLMs Academic

Google DeepMind's Susan Zhang argues abundant AI content shifts the premium from raw intelligence to human relationships and social dynamics

🔓Open-source Models News

Weekly Research Recap

💹AI in Finance News

quantseeker.com·

local AI agents for Cursor with pre-tuned marketplace/commu

🕵️AI Agents

locaible.com··Hacker News

I built a machine that turns AI papers into interactive explainers

🧠LLMs Blog

Sign up or log in to see more results

Log in to enable infinite scrolling