Scour
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Scoured 121094 posts in 1.96 s
Habit Detection For Home Assistant
hackaday.com · 3d
📵 Digital Minimalism
Barn Owls Know When to Wait (iuSTDP part 2)
blog.typeobject.com · 4d · Discuss: Hacker News
🤖 AI
The Conductor’s Model for Enterprise Go-To-Market
brajeshwar.com · 3d
💬 Prompt Engineering
Choice as an emergent feature
oop.bearblog.dev · 3d
🧠 Cognitive Science
Show HN: Model Training Memory Simulator
czheo.github.io · 4d · Discuss: Hacker News
🗣️ LLMs
The Passive AI Learning Stack That Changed the Way I Learn
donnfelker.com · 3d
💬 Prompt Engineering
The Evolution of a Lean Programmer
unnamed.website · 3d · Discuss: Hacker News
🐚 Shell Scripting
The price of intelligence
cyb3rops.medium.com · 3d
🤖 AI
Writing a ONNX Neural Network Inference Engine from Scratch in C to run image classification with MobileNetV2
flexw.github.io · 3d · Discuss: r/C_Programming
🤖 AI
Adversarial Reasoning: Multiagent World Models for closing the Simulation Gap
latent.space · 4d · Discuss: Hacker News
💬 Prompt Engineering
ben guo 🪽 on X: "How to code better with AI using this one weird trick"
x.com · 3d · Discuss: X
💬 Prompt Engineering
A GTM guide to AI models
revengine.substack.com · 4d · Discuss: Substack
💬 Prompt Engineering
My Workflow for Agentic Coding
szymonkrajewski.pl · 3d
💬 Prompt Engineering
Why Files Are Not Enough as Memory for AI Agents
medium.com · 3d · Discuss: Hacker News
💬 Prompt Engineering
When Optimization Works: The Role of Convexity in Business Decisions
pub.towardsai.net · 3d
🤖 AI
The Two-Board Problem: Training Environment for Research Agents
lesswrong.com · 3d
🤖 AI
This sub-field focuses on adapting inverse reinforcement learning (IRL) techniques to scenarios with multiple autonomous agents competing for limited resourc...
freederia.com · 6d
💬 Prompt Engineering
Supervised Learning of Functional Outcomes with Predictors at Different Scales: A Functional Gaussian Process Approach
arxiv.org · 1d
🧠 Machine Learning
PRoFL-IoV: A privacy-preserving and robust federated learning framework for short-term load forecasting in the internet of vehicles
sciencedirect.com · 1d
🧠 Machine Learning
*Robust Hierarchical Reinforcement Learning for Bipedal Robots Performing Dynamic Balance on Sloped Terrains under Partial Sensor Failure*
freederia.com · 6d
💬 Prompt Engineering