🎯 Reinforcement Learning - Scourface · Scour

Provable Offline Reinforcement Learning for Structured Cyclic MDPs

arxiv.org·12h

🔄Meta-Learning

Optimistic Training and Convergence of Q-Learning -- Extended Version

arxiv.org·4d

🔄Meta-Learning

Gibbs Measures from Deep Shaped Multilayer Perceptrons

link.aps.org·1d

🌳recursive neural networks

Optimal timing for superintelligence

feeds.feedblitz.com·17h

🎯Predictive Coding

Static Design to Adaptive Control: How Artificial Intelligence Improves Modern Material Handling Equipment Systems

hackread.com·6h

polyrhachis/macrograd: A lightweight autograd engine inspired by PyTorch and micrograd

github.com·3h·

Discuss: Hacker News

🔄Meta-Learning

myctrl.tools·2h

In defense of wasting time

fastcompany.com·22h

🌱Neuroplasticity

A “Toolbox” Pipeline for Robots That See, Read, and Act

hackernoon.com·17h

🔌Neural Interfaces

Order parameters and phase transitions of continual learning in deep neural networks

pnas.org·2d

🌳recursive neural networks

Memory and Learning layer be built in-house or bought externally?

medium.com·3d·

Discuss: Hacker News

🧠Neuromorphic Hardware

Architectural and Mathematical Foundations of Machine Learning: A Rigorous Synthesis of Theory, Geometry, and Implementation

chizkidd.github.io·2d·

Discuss: Hacker News

🎯Predictive Coding

The 4 Mixture of Experts Architectures: How to Train 100B Models at 10B Cost

pub.towardsai.net

·1d

🔄Meta-Learning

MiniMaxAI/MiniMax-M2.5

huggingface.co·3h·

Discuss: Hacker News, r/LocalLLaMA

🎯Predictive Coding

Hybrid neural–cognitive models reveal how memory shapes human reward learning

nature.com·6d

🎯Predictive Coding

Shel-y/q-drift: Quantum-inspired CLI to analyze structural fragility and decision drift in distributed systems using Shannon Entropy and Signal Decay models.

github.com·8h·

Discuss: DEV

📡Signal Processing

Scaling LLM Post-Training at Netflix

netflixtechblog.com·9h

🔄Meta-Learning

Building Intelligent Bank Approval Workflows with Symfony 7.4 and Symfony AI

dev.to·2h·

Discuss: DEV

Olmix: A framework for data mixing throughout LM development

allenai.org·1h

🎯Predictive Coding

Active learning Kriging with functional dimension reduction for reliability analysis of stochastic dynamical systems

sciencedirect.com·1d

🎯Predictive Coding

Loading more...