RL

Scoured 452 posts in 9.8 ms

Phi-Actor-Critic: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria

 🕵️AI Agents  Content type: Academic
arxiv.org·
Less-relevant results

Major Types of Machine Learning

 👁️VLMs  Content type: Blog
medium.com·

Microsoft Research's Lens proves detailed captions matter more than raw scale for training efficient image generators

 🔓Open-source Models  Content type: News
the-decoder.com·

Lodge School teams advance to volleyball quarter-finals

 🎭Multimodal AI
cbc.bb·

Siri AI is powered by Gemini models, but is not Gemini – what does that mean?

 🔓Open-source Models
9to5mac.com·

Geometrically Averaged Hard Target Updates for Linear Q-Learning

 Quantization  Content type: Academic
arxiv.org·

Comp.compilers: Paper: MileStone: A Multi-Objective Compiler Phase Ordering Framework for Graph-based IR-Level Optimization

 🖥️Inference Compute
compilers.iecc.com·

Are Classical Machine Learning Jobs Dying?

 💹AI in Finance  Content type: Blog
medium.com·

Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…

 🧠LLMs  Content type: Blog
medium.com·

Model predictive task sampling for efficient and robust adaptation

 🖥️Inference Compute  Content type: Academic
nature.com·

Fast and Highly Expressive Policy Learning for Offline Reinforcement Learning via Bootstrapped Flow Q-Learning

 🧠LLMs  Content type: Academic
arxiv.org·

Memoirs of a Learning Machine: Autobiographical Self-Training and the Self-Training Gap

 🕵️AI Agents
zenodo.org··Hacker News

Robots are closing in on human-like judgments, addressing a key challenge in physical AI

 🤖Embodied AI
techxplore.com·

Beyond Dexterity: Why Contact May Define the Next Era of Robotics

 🦾Robotics  Content type: Video  Content type: News

Hey-Meadow/meadow-mind: Zero training, second-level reactions (~400ms). A language-rule decision mind on a local 7B diffusion LM.

 🔧Tool Use  Content type: Code
github.com··Hacker News

Mult-DPO: Multinomial Direct Preference Optimization for Recommender Systems

 👁️VLMs  Content type: Academic
arxiv.org·

Google DeepMind's Susan Zhang argues abundant AI content shifts the premium from raw intelligence to human relationships and social dynamics

 🔓Open-source Models  Content type: News
digg.com·

Weekly Research Recap

 💹AI in Finance  Content type: News
quantseeker.com·

local AI agents for Cursor with pre-tuned marketplace/commu

 🕵️AI Agents

I built a machine that turns AI papers into interactive explainers

 🧠LLMs  Content type: Blog
blog.skz.dev·