Feeds to Scour
SubscribedAll
Scoured 253652 posts in 4.52 s
Basics of Reinforcement Learning for LLMs
cameronrwolfe.substack.comยท14hยท
Discuss: Substack
๐Ÿ“ŠDynamic Programming
Preview
Report Post
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
reddit.comยท1dยท
๐Ÿ“ŠDynamic Programming
Preview
Report Post
Deep Reinforcement Learning: An Overview
paperium.netยท2dยท
Discuss: DEV
๐Ÿ“ŠDynamic Programming
Preview
Report Post
Learning General Policies with Policy Gradient Methods
arxiv.orgยท4d
๐Ÿ“ŠOptimization
Preview
Report Post
Deep Reinforcement Learning: An Overview
dev.toยท2dยท
Discuss: DEV
๐Ÿ“ŠDynamic Programming
Preview
Report Post
Introducing the XLab AI Security Guide
lesswrong.comยท10h
๐Ÿ›ก๏ธAI Security
Preview
Report Post
TIL every time you remember something, your brain slightly rewrites that memory instead of replaying it exactly
frontiersin.orgยท11hยท
๐ŸŽดAnki
Preview
Report Post
Building a Neural Network from scratch
pub.towardsai.net
ยท2d
๐Ÿ“ฑEdge AI
Preview
Report Post
Chad Dorsey - Concord, Massachusetts, United States | Professional Profile
linkedin.comยท5h
๐Ÿ’ฌPrompt Engineering
Preview
Report Post
Book Review: Why Machines Learn
philippdubach.comยท1dยท
Discuss: Hacker News
๐Ÿ’ฌPrompt Engineering
Preview
Report Post
Two-layer coordinated operation of multi-energy system considering carbon-oriented collaborative pricing mechanism via two-stage stochastic programming approach
sciencedirect.comยท2d
๐Ÿ“ŠDynamic Programming
Preview
Report Post
Claude's take on RLHF and self-doubt
future.forem.comยท20hยท
Discuss: DEV
๐ŸŽดAnki
Preview
Report Post
๐ŸŽฒ Learning is about building personal context
simeongriggs.devยท1d
๐ŸŽดAnki
Preview
Report Post
This AI Paper from Stanford and Harvard Explains Why Most 'Agentic AI' Systems Feel Impressive in Demos and then Completely Fall Apart in Real Use
www-marktechpost-com.cdn.ampproject.orgยท1d
๐Ÿ’ฌPrompt Engineering
Preview
Report Post
Chapter 2 The Targeted Learning Roadmap
tlverse.orgยท1d
๐Ÿง Machine Learning
Preview
Report Post
Hj Hornbeck
freethoughtblogs.comยท19h
๐ŸŒŠCALM Theorem
Preview
Report Post
Self-Supervised Temporal Pattern Mining for circular manufacturing supply chains with embodied agent feedback loops
dev.toยท6hยท
Discuss: DEV
โšกLMAX Disruptor
Preview
Report Post
Demonstration-Guided Continual Reinforcement Learning in Dynamic Environments
arxiv.orgยท4d
๐Ÿ“ŠDynamic Programming
Preview
Report Post
(292) Deep Learning Systems Course
youtube.comยท2d
๐Ÿ”ฌDeep Learning
Preview
Report Post