Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
75271
posts in
682.0
ms
GAS: Enhancing
Reward-Cost
Balance of Generative Model-assisted Offline Safe
RL
arxiv.org
·
2d
💬
Prompt Engineering
Meta-Optimized Continual Adaptation for deep-sea exploration
habitat
design with
embodied
agent feedback loops
dev.to
·
22h
·
Discuss:
DEV
🔲
Cellular Automata
Stochastic Gradient Descent
Optimizes
Over-parameterized Deep
ReLU
Networks
dev.to
·
21h
·
Discuss:
DEV
🔬
Deep Learning
Dynamic Metabolic Flux Optimization by Reinforcement‑Learning‑Guided Feed Control for *E. coli*
Bioprocesses
**Abstract** We present a scalable framework
tha
...
freederia.com
·
2d
⚡
LMAX Disruptor
Dynamic Pedestrian Flow Optimization in Smart Tunnels Using Multi‑Agent Reinforcement Learning **Abstract** Rapid
urbanization
has produced urban tunnels
tha
...
freederia.com
·
2d
⚓
Anchors
Teach
your models to act, not just be
thoughtbot.com
·
2d
⚓
Anchors
Beyond Transformers.
Physics-Centric
Machine Learning for
Analog
semiwiki.com
·
3d
📱
Edge AI
Nonlinear random walks on
hypergraphs
characterized
by higher-order interactions
sciencedirect.com
·
1d
🕸️
Graph Theory
TTT-Discover
optimizes
GPU kernels 2x faster than human experts — by training during inference
venturebeat.com
·
2d
⚡
Hardware Acceleration
In (highly
contingent
!) defense of
interpretability-in-the-loop
ML training
lesswrong.com
·
2d
📊
Earley Parser
Loss Distribution Collapse: A
Structural
Theory of Dataset
Degradation
zenodo.org
·
2d
·
Discuss:
Hacker News
📈
Delta Encoding
Why do tree-based models still
outperform
deep learning on
tabular
data?
paperium.net
·
23h
·
Discuss:
DEV
🌳
Tree-sitter
Low-dimensional materials for
intracellular
electrophysiology
: advances from synthesis to applications
nature.com
·
2d
🧬
Computational Biology
Agent
Evaluation
: How to Test and
Measure
Agentic AI Performance
machinelearningmastery.com
·
3d
🚀
Performance
Boundary
Engineering
cabreza.substack.com
·
2d
·
Discuss:
Substack
⚓
Anchors
Writing an LLM from scratch, part
32d
--
Interventions
: adding attention bias
gilesthomas.com
·
1d
·
Discuss:
Hacker News
💬
Prompt Engineering
Text classification with Python 3.14's
zstd
module • Max
Halford
maxhalford.github.io
·
2d
·
Discuss:
Lobsters
,
Hacker News
🗜️
Zstd
KAYAP
: Hardening Drone Stability via Neural Differential
Manifolds
github.com
·
3d
·
Discuss:
DEV
🤖
Robotics
AI
Copilots
2026: Everyday
Helpers
Across Home Work and Education
windowsforum.com
·
2d
💬
Prompt Engineering
Leveraging the capabilities of
physics-informed
neural networks for channel optimization in
PEM
fuel cells
sciencedirect.com
·
1d
💾
PMem Programming
Loading...
Loading more...
« Page 4
•
Page 6 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help