🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Scoured 121271 posts in 745.1 ms
Travel Recommendations of Tomorrow: Generative Artificial Intelligence and Travel Planning
onlinelibrary.wiley.com · 3d
💬 Prompt Engineering
How computers evolved from simple calculation tools to today’s AI systems
i.redd.it · 3d · Discuss: r/computers
🤖 AI
The price of intelligence
cyb3rops.medium.com · 3d
🤖 AI
Barn Owls Know When to Wait (iuSTDP part 2)
blog.typeobject.com · 4d · Discuss: Hacker News
🤖 AI
The Conductor’s Model for Enterprise Go-To-Market
brajeshwar.com · 3d
💬 Prompt Engineering
RL-Only Neural Network Training
yager.io · 5d
🤖 AI
A Language For Agents
lucumr.pocoo.org · 3d · Discuss: Lobsters, Hacker News
💬 Prompt Engineering
Writing a ONNX Neural Network Inference Engine from Scratch in C to run image classification with MobileNetV2
flexw.github.io · 3d · Discuss: r/C_Programming
🤖 AI
On Recursive Self-Improvement (Part I)
hyperdimensional.co · 2d
💬 Prompt Engineering
When Optimization Works: The Role of Convexity in Business Decisions
pub.towardsai.net · 3d
🤖 AI
The Two-Board Problem: Training Environment for Research Agents
lesswrong.com · 3d
🤖 AI
Abstract: This study introduces a novel, data-driven framework leveraging hyperdimensional computing (HDC) and recurrent neural networks (RNNs) to predic ...
freederia.com · 6d
🧠 Machine Learning
Cross Entropy Derivatives, Part 6: Using gradient descent to reach the final result
dev.to · 3d · Discuss: DEV
🧠 Machine Learning
Efficient Unsupervised Environment Design through Hierarchical Policy Representation Learning
arxiv.org · 1d
💬 Prompt Engineering
Value‑Aligned Inverse Reinforcement Learning for Equitable Ride‑Sharing Dispatch in Urban Micro‑Mobility Networks. Abstract: Ride‑sharing platf ...
freederia.com · 6d
💬 Prompt Engineering
Physics-Informed Neural Networks for Inverse PDE Problems
pub.towardsai.net · 4d
🤖 AI
Data-Centric Interpretability for LLM-based Multi-Agent Reinforcement Learning
lesswrong.com · 5d
🗣️ LLMs
Energy-efficient robust control of vehicle platoons under cut-in disturbances: Integrating temporal-aware policy and barrier-constrained search
sciencedirect.com · 20h
💬 Prompt Engineering
Learning Models with Uniform Performance via Distributionally Robust Optimization
dev.to · 4d · Discuss: DEV
🧠 Machine Learning
Beyond Uniform Credit: Causal Credit Assignment for Policy Optimization
arxiv.org · 1d
💬 Prompt Engineering