🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Scoured 73211 posts in 1.39 s
Do We Need Adam? Surprisingly Strong and Sparse Reinforcement Learning with SGD in LLMs
arxiv.org · 7h
📱 Edge AI
Reinforcement Learning from Human Feedback
arxiv.org · 2d
🎲 Deterministic Simulation
Tips
lonestation.itch.io · 2d
🔍 Miniselect
Cross Entropy Derivatives, Part 6: Using gradient descent to reach the final result
dev.to · 1d · Discuss: DEV
📊 Optimization
Personalized Adaptive Feedback System for Early Detection and Intervention of Fine‑Motor Skill Development in Preschool Children Using Wearable IMU Sensors and Reinforcement Learning
freederia.com · 4d
🧭 Inertial Navigation
Learning Models with Uniform Performance via Distributionally Robust Optimization
dev.to · 2d · Discuss: DEV
📊 Optimization
On Recursive Self-Improvement (Part I)
hyperdimensional.co · 1d
⚓ Anchors
Habit Detection For Home Assistant
hackaday.com · 1d
🏠 Home Automation
Clawdbot and the Rise of AI Agents: How Autonomous AI Is Changing the Way We Work
inoru.com · 21h · Discuss: DEV
🛡️ AI Security
Hypernetworks: Neural Networks for Hierarchical Data
blog.sturdystatistics.com · 4d · Discuss: Hacker News
📱 Edge AI
Designing a Cost-Efficient Agentic System
p.agnihotry.com · 18h · Discuss: Hacker News
⚓ Anchors
Barn Owls Know When to Wait (iuSTDP part 2)
blog.typeobject.com · 2d · Discuss: Hacker News
🚦 Wait-Free Algorithms
**Abstract:** This paper introduces a novel approach to automated credit risk assessment and early warning systems leveraging a hierarchical Bayesian network...
freederia.com · 3d
💰 TigerBeetle
learning by reverse engineering
clymup.com · 2d
🔍 Reverse Engineering
Exploiting large language model with reinforcement learning for generative job recommendations
eurekalert.org · 4d
💬 Prompt Engineering
userface.ai
userface.ai · 1d
🦙 Ollama
Fastfood: Approximate Kernel Expansions in Loglinear Time
paperium.net · 2d · Discuss: DEV
💡 Photon
ben guo 🪽 on X: "How to code better with AI using this one weird trick"
x.com · 1d · Discuss: X
💬 Prompt Engineering
Show HN: We added AGENTS.md to 120 challenges so AI teaches instead of codes
frontendmentor.io · 20h · Discuss: Hacker News
💬 Prompt Engineering
Scientists reveal the alien logic of AI: hyper-rational but stumped by simple concepts
psypost.org · 2d
💬 Prompt Engineering