Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
74656
posts in
1.22
s
Robust
Online Learning
arxiv.org
·
22h
📊
Optimization
Distributional
Reinforcement Learning with Diffusion Bridge
Critics
arxiv.org
·
3d
📊
Optimization
A Simple
Method
for
Commonsense
Reasoning
dev.to
·
1d
·
Discuss:
DEV
🎴
Anki
Linear
Regression
: An
Overview
dev.to
·
3d
·
Discuss:
DEV
📊
Optimization
Adapting
to
technological
change
rhollick.wordpress.com
·
5d
⚡
Incremental Computation
Self-Optimizing Football
Chatbot
Guided by Domain Experts on
Databricks
databricks.com
·
6d
💬
Prompt Engineering
Beyond
Pilot
Purgatory
oreilly.com
·
5d
⚓
Anchors
The Dual
Pillars
of
Embodied
Autonomy: A Technical Deep Dive into Language-Action Models and…
pub.towardsai.net
·
5d
🤖
Robotics
Behavioral and
electroencephalographic
dataset
simultaneously
acquired during the Iowa gambling task
nature.com
·
5d
🧠
Cognitive Science
The
Agentic
Trust Framework: Zero Trust
Governance
for AI Agents
cloudsecurityalliance.org
·
5d
·
Discuss:
Hacker News
🛡️
AI Security
Sequential Attention: Making AI models
leaner
and faster without
sacrificing
accuracy
research.google
·
5d
·
Discuss:
Hacker News
,
r/LocalLLaMA
💬
Prompt Engineering
Collaborative risk-resistant
distributionally
robust dispatch and benefit allocation scheme for
interconnected
distribution systems
sciencedirect.com
·
4d
🎯
Quorum Systems
Private Data Space Model
privatedata.space
·
4d
🔗
Intrusive Containers
Agent development
workflow
coreweave.com
·
5d
💬
Prompt Engineering
Goodbye
Smartwatches
, Hello Health AI on Your
Wrist
news.ycombinator.com
·
4d
·
Discuss:
Hacker News
📈
Prometheus
AI for People
justsitandgrin.im
·
4d
·
Discuss:
Hacker News
💬
Prompt Engineering
Tspo
Shows 13.6% Gain, Resolving Double
Homogenization
In Policy Optimization
quantumzeitgeist.com
·
5d
⚓
Anchors
A
generalizable
foundation model for analysis of human brain
MRI
nature.com
·
3d
🖼️
Halide
A Neuro Symbolic Architecture For Induced
Epistemic
Agency and System 2 Reasoning in
Quantized
Large Language Models
papers.ssrn.com
·
4d
·
Discuss:
Hacker News
💬
Prompt Engineering
New AI
Quiz
Generator
learvo.com
·
6d
·
Discuss:
Hacker News
🎴
Anki
Loading...
Loading more...
« Page 8
•
Page 10 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help