Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
123623
posts in
452.2
ms
Nonparametric
Bayesian Optimization for General
Rewards
arxiv.org
·
1d
🧠
Machine Learning
Boltzmann
Reinforcement Learning for Noise resilience in Analog
Ising
Machines
arxiv.org
·
19h
🤖
AI
AI Agents Explained in 3
Levels
of
Difficulty
kdnuggets.com
·
1d
💬
Prompt Engineering
The Machine Learning
Practitioner
’s Guide to
Speculative
Decoding
machinelearningmastery.com
·
13h
🗣️
LLMs
GLM-5
: From
Vibe
Coding to Agentic Engineering
simonwillison.net
·
5h
💬
Prompt Engineering
Beyond the
Hype
: Why Machine Learning is the Strategic
Backbone
of Modern AI
pub.towardsai.net
·
1d
💬
Prompt Engineering
Machine learning reveals hidden
landscape
of
robust
information storage
phys.org
·
1d
💬
Prompt Engineering
Genuine learning biases persist after accounting for
temporally
decreasing
learning rates: Insight from fitting six datasets
pnas.org
·
11h
🧠
Machine Learning
Human Review Is the
Bottleneck
satyaborg.com
·
8h
·
Discuss:
Hacker News
💬
Prompt Engineering
Schedules
of Reinforcement in
Psychology
(Examples)
simplypsychology.org
·
1d
·
Discuss:
Hacker News
🧠
Cognitive Science
Technology is a tool, not a
replacement
for experience
healio.com
·
1d
📵
Digital Minimalism
Unlock
Customer Insights with
Theta
Intelligence
medium.com
·
1d
💬
Prompt Engineering
Beyond the
Prompt
- Why and How to
Fine-tune
Your Own Models
devblogs.microsoft.com
·
7h
💬
Prompt Engineering
Quantization-Aware
Distillation
ternarysearch.blogspot.com
·
3d
·
Discuss:
Hacker News
🗣️
LLMs
Ai’s ‘
steering
’ Made Far More
Precise
With New Fine-Tuning Technique
quantumzeitgeist.com
·
1d
💬
Prompt Engineering
Risk-preference-aware
optimal scheduling and profit allocation of load
aggregators
and charging operators
sciencedirect.com
·
1d
💬
Prompt Engineering
New Research Shows AI Agents Learn
Altruism
From Human
Behavior
pymnts.com
·
2d
🤖
AI
Demand‑Controlled
Ventilation
in Multi‑
Occupancy
Offices: A Reinforcement‑Learning Approach to Adaptive CO₂ Threshold Optimization and Energy‑Efficiency Analysis
freederia.com
·
5d
💬
Prompt Engineering
The
benefit
of
AI-assisted
coding isn't just about coding faster
johnlindblad.substack.com
·
5h
·
Discuss:
Substack
💬
Prompt Engineering
Dopaminergic
mechanisms supporting hippocampal
postencoding
dynamics in humans
pnas.org
·
11h
🧠
Cognitive Science
Loading...
Loading more...
« Page 3
•
Page 5 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help