🎯 Reinforcement Learning - Scourface · Scour

Dynamic Regret via Discounted-to-Dynamic Reduction with Applications to Curved Losses and Adam Optimizer

arxiv.org·13h

🌳recursive neural networks

Projected Gradient Ascent for Efficient Reward-Guided Updates with One-Step Generative Models

arxiv.org·13h

🔄Meta-Learning

Skills: teaching AI agents to act consistently

dev.to·23h·

Discuss: DEV

🎯Predictive Coding

Stochastic Gradient Descent Optimizes Over-parameterized Deep ReLU Networks

dev.to·2d·

Discuss: DEV

🌳recursive neural networks

Dynamic Pedestrian Flow Optimization in Smart Tunnels Using Multi‑Agent Reinforcement Learning **Abstract** Rapid urbanization has produced urban tunnels tha...

freederia.com·4d

🧠Neuromorphic Hardware

Hierarchical Reinforcement Learning for Multi‑Arm Collaborative Assembly of Aerospace Composite Panels: Joint Kinematic Constraint‑Aware Policy with Curriculum‑Based Reward Shaping

freederia.com·4d

🎯Predictive Coding

Nonlinear random walks on hypergraphs characterized by higher-order interactions

sciencedirect.com·3d

🧠Neuromorphic Hardware

AI Agents 2.0: AI Agents that can Learn(6 learning types that make memory persistent)

pub.towardsai.net

·4d

🧠Neuromorphic Hardware

Loss Distribution Collapse: A Structural Theory of Dataset Degradation

zenodo.org·4d·

Discuss: Hacker News

🔄Meta-Learning

Why do tree-based models still outperform deep learning on tabular data?

paperium.net·2d·

Discuss: DEV

🌳recursive neural networks

Humane, adaptive AI bootstrapping

natemeyvis.com·4d

🔄Meta-Learning

Listen to Yourself

thestoicmanual.com·3d

🔗Synaptic Plasticity

Building the Future with AI That Acts

devxt.com·2d·

Discuss: Hacker News

🧠Neuromorphic Hardware

Self-Learning AI Agents: A High-Level Overview

digitalocean.com·6d

🔄Meta-Learning

In (highly contingent!) defense of interpretability-in-the-loop ML training

lesswrong.com·4d

🎯Predictive Coding

Writing an LLM from scratch, part 32d -- Interventions: adding attention bias

gilesthomas.com·3d·

Discuss: Hacker News

🔄Meta-Learning

Neural population geometry and optimal coding of tasks with shared latent structure

nature.com·4d

🎯Predictive Coding

Text classification with Python 3.14's zstd module • Max Halford

maxhalford.github.io·4d·

Discuss: Lobsters, Hacker News

🌳recursive neural networks

Your AI Companion

pocketmindai.com·4d·

Discuss: r/InternetIsBeautiful

🧠Neuromorphic Hardware

Understanding LLM Inference Engines: Inside Nano-vLLM (Part 2)

neutree.ai·4d·

Discuss: Hacker News

🧠Neuromorphic Hardware

Loading more...