Scour
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Scoured 75009 posts in 1.12 s
Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models
arxiv.org · 2d
💬 Prompt Engineering
Rethinking the Trust Region in LLM Reinforcement Learning
arxiv.org · 3d
💬 Prompt Engineering
Power as a function of action coordination and decision calculation
dev.to · 1d · Discuss: DEV
🧠 Cognitive Science
**Title**
dev.to · 1d · Discuss: DEV
⚡ LMAX Disruptor
Human-like Search for Modern Applications
anvitra.ai · 1d · Discuss: Hacker News
🎯 Vector Search
Adaptive marketing: Proven strategies for growing companies
blog.hubspot.com · 2d
📡 Content Syndication
Abstract: Personalized chronic disease management remains a critical challenge due to the heterogeneity of patient trajectories and the imperative ...
freederia.com · 2d
📈 Differential Dataflow
Why RAG Failed Us for SRE and How We Built Dynamic Memory Retrieval Instead
drdroid.io · 2d · Discuss: Hacker News
🛡️ RAII
Your AI Companion
pocketmindai.com · 2d · Discuss: r/InternetIsBeautiful
💬 Prompt Engineering
Hyper-Accurate Cerebellar Microcircuit Modeling via Dynamic Stochastic Differential Equation Projection and Reinforcement Learning Optimization for Enhanced Motor Skill Acquisition
freederia.com · 4d
📊 Dynamic Programming
Why Files Are Not Enough as Memory for AI Agents
medium.com · 16h · Discuss: Hacker News
🧠 Memory Models
Neural population geometry and optimal coding of tasks with shared latent structure
nature.com · 2d
🧮 Embeddings
Unsupervised Learning NO. 515
newsletter.danielmiessler.com · 1d
💬 Prompt Engineering
Routing in a Sparse Graph: a Distributed Q-Learning Approach
towardsdatascience.com · 5d
🕸️ Graph Theory
AI ‘thinking Budget’ Revealed In Landmark Study Of Self-Reflecting Machines
quantumzeitgeist.com · 2d
💬 Prompt Engineering
NotebookLM: The AI that only learns from you
byandrev.dev · 1d · Discuss: Hacker News
📓 Jupyter Notebooks
Your Agent Is Slow Because of Inference
futureagi.com · 2d · Discuss: DEV
💬 Prompt Engineering
Is Your Machine Learning Pipeline as Efficient as it Could Be?
kdnuggets.com · 2d
📱 Edge AI
OvidijusParsiunas/are-you-random: 🎲 Browser game that predicts your "random" choices
github.com · 18h · Discuss: Hacker News
📖 Interactive Fiction
Building the Future with AI That Acts
devxt.com · 1d · Discuss: Hacker News
🎭 Program Synthesis