🐿️ Scour
🔄 Reinforcement Learning
How to instantly be better at things
bigthink.com · 3h
🚣 Rowing
How a Tiny Brain Region Helps You Learn Complex Movements, One Neuron at a Time
simonsfoundation.org · 3h
🚣 Rowing
VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning
arxiv.org · 14h
🚣 Rowing
Latent Preference Bandits
arxiv.org · 3d
🚣 Rowing
modded-nanogpt: Analyzing value-embedding-, UNet-, and x0-lambdas
snimu.github.io · 18h
🚣 Rowing
Mixture-of-Agents (MoA): Improving LLM Quality through Multi-Agent Collaboration
hackernoon.com · 14h
World Politics and Events
Epidemic Control on a Large-Scale-Agent-Based Epidemiology Model using Deep Deterministic Policy Gradient
arxiv.org · 14h
🤝 International Relations
One More Machine Learning Article: When Computers Start to Think for Themselves
dev.to · 2d · Discuss: DEV
🚣 Rowing
I Think with AI
rolando.is · 12h · Discuss: Hacker News
🤝 International Relations
🏗️ Part 1: Foundation - Basic RAG and Agentic Concepts
dev.to · 19h · Discuss: DEV
🚣 Rowing
Social Welfare in Battery Charging Games
arxiv.org · 14h
🤝 International Relations
Predictive Trust Degradation Mitigation via Dynamic Behavioral Anomaly Detection in Collaborative Robots
dev.to · 20h · Discuss: DEV
🚣 Rowing
Rock-Paper-Scissors with Neural-Networks!
dev.to · 2d · Discuss: DEV
🚣 Rowing
Enhanced Pyrolysis Process Optimization via Dynamic Kinetic Modeling & AI Feedback Loop
dev.to · 1d · Discuss: DEV
🤝 International Relations
Adaptive Predictive Control via Hyperdimensional State Compression and Real-time Feedback
dev.to · 2d · Discuss: DEV
🚣 Rowing
Import AI 424: Facebook improves ads with RL; LLM and human brain similarities; and mental health and chatbots
importai.substack.com · 6h · Discuss: Substack
🤝 International Relations
Enhancing the Scalability of Classical Surrogates for Real-World Quantum Machine Learning Applications
arxiv.org · 14h
🚣 Rowing
Self-attention mechanism explained
jtlicardo.com · 1d · Discuss: Hacker News
🚣 Rowing
Reward boosts cognitive control during working memory maintenance
nature.com · 18h
🤝 International Relations
SKATE, a Scalable Tournament Eval: Weaker LLMs differentiate between stronger ones using verifiable challenges
arxiv.org · 14h
🚣 Rowing