Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Research
🧠 AI Research
AI breakthroughs, research papers, arXiv, deep learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
807
posts in
8.3
ms
The Emergence of Reproducibility and Generalizability in
Diffusion
Models
🧮
Embedding Models
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for The Emergence of Reproducibility and Generalizability in Diffusion Models
LLM
Research
Papers
: The 2026 List (January to May)
🎮
Reinforcement Learning
Content type:
News
magazine.sebastianraschka.com
·
4d
4 days ago
·
Hacker News
Actions for LLM Research Papers: The 2026 List (January to May)
Score-based
diffusion
models
for accurate crystal-structure inpainting and reconstruction of hydrogen positions
🧠
Machine Learning
Content type:
Academic
nature.com
·
8h
8 hours ago
Actions for Score-based diffusion models for accurate crystal-structure inpainting and reconstruction of hydrogen positions
How to Implement a
Model-Free
RL Algorithm: A Step-by-Step Guide
🎮
Reinforcement Learning
Content type:
Blog
ujangriswanto08.medium.com
·
3h
3 hours ago
Actions for How to Implement a Model-Free RL Algorithm: A Step-by-Step Guide
Discrete
Diffusion
Modelling
by Estimating the Ratios of the Data Distribution
🧠
Machine Learning
Content type:
News
Content type:
Blog
leetarxiv.substack.com
·
1d
1 day ago
·
Substack
,
r/programming
Actions for Discrete Diffusion Modelling by Estimating the Ratios of the Data Distribution
Forgis-Labs/HEPA: HEPA: Self-supervised horizon-conditioned event predictive
architecture
for time series. Spotlight at FMSD @
ICML
2026.
🧠
Machine Learning
Content type:
Code
github.com
·
22h
22 hours ago
·
Hacker News
Actions for Forgis-Labs/HEPA: HEPA: Self-supervised horizon-conditioned event predictive architecture for time series. Spotlight at FMSD @ ICML 2026.
Backpropagation
Without the Magic: A First-Principles Derivation
🧠
Machine Learning
Content type:
Blog
medium.com
·
6d
6 days ago
Actions for Backpropagation Without the Magic: A First-Principles Derivation
Q-Learning
(
Reinforcement
learning
): Bellman Equation, Markov Decision Processes, Q-Values, and…
🎮
Reinforcement Learning
Content type:
Blog
medium.com
·
2d
2 days ago
Actions for Q-Learning (Reinforcement learning): Bellman Equation, Markov Decision Processes, Q-Values, and…
Reinforcement
Learning
and Optimal Control Book (RIP Dimitri Bertsekas)
🎮
Reinforcement Learning
Content type:
Academic
web.mit.edu
·
5d
5 days ago
·
Hacker News
Actions for Reinforcement Learning and Optimal Control Book (RIP Dimitri Bertsekas)
Improving Generalization and Data Efficiency with
Diffusion
in Offline Multi-agent RL
🎮
Reinforcement Learning
Content type:
Academic
arxiv.org
·
4h
4 hours ago
Actions for Improving Generalization and Data Efficiency with Diffusion in Offline Multi-agent RL
Attention Based Interpretability With Concept
Transformer
🧮
Embedding Models
Content type:
Blog
medium.com
·
6d
6 days ago
Actions for Attention Based Interpretability With Concept Transformer
Time-slip in
AI
sepsis
models
may inflate results, risking under- or overtreatment
🎮
Reinforcement Learning
medicalxpress.com
·
5d
5 days ago
Actions for Time-slip in AI sepsis models may inflate results, risking under- or overtreatment
Reinforcement
Learning
for Flow-Matching Policies with Density Transport
🎮
Reinforcement Learning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Reinforcement Learning for Flow-Matching Policies with Density Transport
ProcessThinker: Enhancing Multi-modal
Large
Language
Models
Reasoning via Rollout-based Process Reward
🎮
Reinforcement Learning
Content type:
Academic
arxiv.org
·
4h
4 hours ago
Actions for ProcessThinker: Enhancing Multi-modal Large Language Models Reasoning via Rollout-based Process Reward
SLUUG Talk: Demystifying
Large
Language
Models
on Linux
🎮
Reinforcement Learning
Content type:
Code
github.com
·
4d
4 days ago
·
DEV
Actions for SLUUG Talk: Demystifying Large Language Models on Linux
Evaluating the Representation Space of
Diffusion
Models
via Self-Supervised Principles
🧮
Embedding Models
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Evaluating the Representation Space of Diffusion Models via Self-Supervised Principles
Neuron-based Personality Trait Induction in
Large
Language
Models
✍️
Prompt Engineering
Content type:
Academic
arxiv.org
·
4h
4 hours ago
Actions for Neuron-based Personality Trait Induction in Large Language Models
NightFeats @ MMU-RAGent
NeurIPS
2025: A Context-Optimized Multi-Agent RAG System for the Text-to-Text Track
🧮
Embedding Models
Content type:
Academic
arxiv.org
·
4h
4 hours ago
Actions for NightFeats @ MMU-RAGent NeurIPS 2025: A Context-Optimized Multi-Agent RAG System for the Text-to-Text Track
SVoT: State-aware Visualization-of-Thought for Spatial Reasoning via
Reinforcement
Learning
🎮
Reinforcement Learning
Content type:
Academic
arxiv.org
·
4h
4 hours ago
Actions for SVoT: State-aware Visualization-of-Thought for Spatial Reasoning via Reinforcement Learning
Fast and Highly Expressive Policy
Learning
for Offline
Reinforcement
Learning
via Bootstrapped Flow
Q-Learning
🎮
Reinforcement Learning
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Fast and Highly Expressive Policy Learning for Offline Reinforcement Learning via Bootstrapped Flow Q-Learning
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help