Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
80311
posts in
463.3
ms
Do We Need Adam?
Surprisingly
Strong and Sparse Reinforcement Learning with
SGD
in LLMs
arxiv.org
·
8h
🔄
Meta-Learning
Preference
Conditioned
Multi-Objective Reinforcement Learning:
Decomposed
, Diversity-Driven Policy Optimization
arxiv.org
·
8h
🎯
Predictive Coding
Cross Entropy
Derivatives
, Part 6: Using gradient
descent
to reach the final result
dev.to
·
1d
·
Discuss:
DEV
🎯
Predictive Coding
RoomKit
,
Pipecat
, TEN Framework, LiveKit Agents: Choosing the Right Conversational AI Framework
dev.to
·
22h
·
Discuss:
DEV
🔄
Meta-Learning
Building LLMs in
Resource-Constrained
Environments
: A Hands-On Perspective
infoq.com
·
1d
🧠
Neuromorphic Hardware
Hypernetworks
: Neural Networks for
Hierarchical
Data
blog.sturdystatistics.com
·
4d
·
Discuss:
Hacker News
🎯
Predictive Coding
**Abstract:** This paper introduces a novel approach to automated credit risk assessment and early warning systems leveraging a
hierarchical
Bayesian
network...
freederia.com
·
3d
🎯
Predictive Coding
Choice
as an
emergent
feature
oop.bearblog.dev
·
1d
🎯
Predictive Coding
**Abstract:** This paper introduces a novel reinforcement learning (RL) framework for automating the design of optimal control
pulses
for trapped ion
qubits
....
freederia.com
·
5d
🧠
Neuromorphic Hardware
Barn
Owls
Know When to Wait (
iuSTDP
part 2)
blog.typeobject.com
·
2d
·
Discuss:
Hacker News
🧠
Neuromorphic Hardware
Clawdbot
and the Rise of AI Agents: How
Autonomous
AI Is Changing the Way We Work
inoru.com
·
22h
·
Discuss:
DEV
🧠
Neuromorphic Hardware
The
Conductor
’s Model for
Enterprise
Go-To-Market
brajeshwar.com
·
1d
🎯
Predictive Coding
The price of intelligence
cyb3rops.medium.com
·
1d
🧠
Neuromorphic Hardware
Large Language Models Live in Time
lesswrong.com
·
22h
🧠
Neuromorphic Hardware
Designing
a Cost-Efficient
Agentic
System
p.agnihotry.com
·
19h
·
Discuss:
Hacker News
🎯
Predictive Coding
AI-augmented
data quality engineering
infoworld.com
·
1d
🤖
Machine Learning
Show HN: We added
AGENTS.md
to 120 challenges so AI
teaches
instead of codes
frontendmentor.io
·
21h
·
Discuss:
Hacker News
🧭
Axon Guidance
On
Recursive
Self-Improvement
(Part I)
hyperdimensional.co
·
1d
🧠
Neuromorphic Hardware
A Language For Agents
lucumr.pocoo.org
·
1d
·
Discuss:
Lobsters
,
Hacker News
,
Hacker News
🌳
recursive neural networks
Show HN: Model Training Memory
Simulator
czheo.github.io
·
2d
·
Discuss:
Hacker News
🧠
Neuromorphic Hardware
Loading...
Loading more...
« Page 3
•
Page 5 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help