AI Research

Feeds to Scour
SubscribedAll
Scoured 807 posts in 8.3 ms

The Emergence of Reproducibility and Generalizability in Diffusion Models

 🧮Embedding Models  Content type: Academic
arxiv.org·

LLM Research Papers: The 2026 List (January to May)

 🎮Reinforcement Learning  Content type: News

Score-based diffusion models for accurate crystal-structure inpainting and reconstruction of hydrogen positions

 🧠Machine Learning  Content type: Academic
nature.com·

How to Implement a Model-Free RL Algorithm: A Step-by-Step Guide

 🎮Reinforcement Learning  Content type: Blog

Discrete Diffusion Modelling by Estimating the Ratios of the Data Distribution

 🧠Machine Learning  Content type: News  Content type: Blog

Forgis-Labs/HEPA: HEPA: Self-supervised horizon-conditioned event predictive architecture for time series. Spotlight at FMSD @ ICML 2026.

 🧠Machine Learning  Content type: Code
github.com··Hacker News

Backpropagation Without the Magic: A First-Principles Derivation

 🧠Machine Learning  Content type: Blog
medium.com
·

Q-Learning (Reinforcement learning): Bellman Equation, Markov Decision Processes, Q-Values, and…

 🎮Reinforcement Learning  Content type: Blog
medium.com
·

Reinforcement Learning and Optimal Control Book (RIP Dimitri Bertsekas)

 🎮Reinforcement Learning  Content type: Academic
web.mit.edu··Hacker News

Improving Generalization and Data Efficiency with Diffusion in Offline Multi-agent RL

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Attention Based Interpretability With Concept Transformer

 🧮Embedding Models  Content type: Blog
medium.com
·

Time-slip in AI sepsis models may inflate results, risking under- or overtreatment

 🎮Reinforcement Learning
medicalxpress.com·

Reinforcement Learning for Flow-Matching Policies with Density Transport

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

ProcessThinker: Enhancing Multi-modal Large Language Models Reasoning via Rollout-based Process Reward

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

SLUUG Talk: Demystifying Large Language Models on Linux

 🎮Reinforcement Learning  Content type: Code
github.com··DEV

Evaluating the Representation Space of Diffusion Models via Self-Supervised Principles

 🧮Embedding Models  Content type: Academic
arxiv.org·

Neuron-based Personality Trait Induction in Large Language Models

 ✍️Prompt Engineering  Content type: Academic
arxiv.org·

NightFeats @ MMU-RAGent NeurIPS 2025: A Context-Optimized Multi-Agent RAG System for the Text-to-Text Track

 🧮Embedding Models  Content type: Academic
arxiv.org·

SVoT: State-aware Visualization-of-Thought for Spatial Reasoning via Reinforcement Learning

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Fast and Highly Expressive Policy Learning for Offline Reinforcement Learning via Bootstrapped Flow Q-Learning

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help