Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
72930
posts in
1.06
s
When
RL
Meets Adaptive
Speculative
Training: A Unified Training-Serving System
arxiv.org
·
1d
📦
Folly
Alleviating
Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based
GRPO
arxiv.org
·
1d
🌀
Naiad
Stochastic Gradient Descent
Optimizes
Over-parameterized Deep
ReLU
Networks
dev.to
·
2d
·
Discuss:
DEV
🔬
Deep Learning
Fastfood
: Approximate Kernel Expansions in
Loglinear
Time
dev.to
·
2d
·
Discuss:
DEV
🎯
Qdrant
Dynamic Metabolic Flux Optimization by Reinforcement‑Learning‑Guided Feed Control for *E. coli*
Bioprocesses
**Abstract** We present a scalable framework
tha
...
freederia.com
·
3d
⚡
LMAX Disruptor
Dynamic Pedestrian Flow Optimization in Smart Tunnels Using Multi‑Agent Reinforcement Learning **Abstract** Rapid
urbanization
has produced urban tunnels
tha
...
freederia.com
·
4d
⚓
Anchors
Proposal: A Framework for
Discovering
Alien Physics via Optimal
Compression
lesswrong.com
·
3d
🔀
Procedural Generation
Representational
drift reflects ongoing balancing of stochastic changes by
Hebbian
learning
pnas.org
·
5d
🔄
Memory Ordering
Humane
, adaptive AI
bootstrapping
natemeyvis.com
·
3d
⚓
Anchors
Teach
your models to act, not just be
thoughtbot.com
·
4d
⚓
Anchors
Information
Retrieval
Part 2: How To Get Into Model Training Data
searchenginejournal.com
·
5d
🧠
Machine Learning
Growth through Games
pctmagazine.org
·
4d
🎮
Game Design
Understanding LLM Inference
Engines
: Inside
Nano-vLLM
(Part 2)
neutree.ai
·
3d
·
Discuss:
Hacker News
📱
Edge AI
Mechanistic
Interpretability:
Peeking
Inside an LLM
towardsdatascience.com
·
4d
💬
Prompt Engineering
Nonlinear random walks on
hypergraphs
characterized
by higher-order interactions
sciencedirect.com
·
2d
🕸️
Graph Theory
Jokes
on You AI: Turning the
Tables
dev-log.me
·
1d
·
Discuss:
Hacker News
💬
Prompt Engineering
AI Agents 2.0: AI Agents that can Learn(6 learning
types
that make memory
persistent
)
pub.towardsai.net
·
4d
💬
Prompt Engineering
Loss Distribution Collapse: A
Structural
Theory of Dataset
Degradation
zenodo.org
·
4d
·
Discuss:
Hacker News
📈
Delta Encoding
Why do tree-based models still
outperform
deep learning on
tabular
data?
paperium.net
·
2d
·
Discuss:
DEV
🌳
Tree-sitter
In (highly
contingent
!) defense of
interpretability-in-the-loop
ML training
lesswrong.com
·
3d
📊
Earley Parser
Loading...
Loading more...
« Page 6
•
Page 8 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help