Dynamic Resource Allocation in Vertiport Battery Swapping via Reinforcement Learning
dev.toยท11hยท
Discuss: DEV
๐Ÿง Neuromorphic Hardware
Flag this post
Understanding the Design of Optimizers with me
dev.toยท1dยท
Discuss: DEV
๐ŸŽฏPredictive Coding
Flag this post
Connectivity Structure and Dynamics of Nonlinear Recurrent Neural Networks
journals.aps.orgยท7h
๐Ÿง Neuromorphic Hardware
Flag this post
Writing an LLM from scratch, part 27 โ€“ what's left, and what's next?
gilesthomas.comยท6hยท
Discuss: Hacker News
๐Ÿ”„Meta-Learning
Flag this post
Iterative Foundation Model Fine-Tuning on Multiple Rewards
arxiv.orgยท2h
๐Ÿ”„Meta-Learning
Flag this post
Bio-Inspired Neuron Synapse Optimization for Adaptive Learning and Smart Decision-Making
arxiv.orgยท2h
๐Ÿง Neuromorphic Hardware
Flag this post
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning (Paper Review)
pub.towardsai.netยท2d
๐ŸŽฏPredictive Coding
Flag this post
Learning Complementary Policies for Human-AI Teams
arxiv.orgยท2h
๐Ÿ”„Meta-Learning
Flag this post
Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail
arxiv.orgยท2h
๐Ÿ”„Meta-Learning
Flag this post
From Parrot to Partner - How Reinforcement Learning Taught LLMs to Talk Like Humans
dev.toยท1dยท
Discuss: DEV
๐Ÿ”„Meta-Learning
Flag this post
Fast, Scalable LDA in C++ with Stochastic Variational Inference
github.comยท16hยท
Discuss: r/cpp
๐ŸŽฏPredictive Coding
Flag this post
Casing Collar Identification using AlexNet-based Neural Networks for Depth Measurement in Oil and Gas Wells
arxiv.orgยท2h
๐Ÿค–Machine Learning
Flag this post
A Practitioner's Guide to Kolmogorov-Arnold Networks
arxiviq.substack.comยท1dยท
Discuss: Substack
๐ŸŽฏPredictive Coding
Flag this post
Online Energy Storage Arbitrage under Imperfect Predictions: A Conformal Risk-Aware Approach
arxiv.orgยท2h
๐Ÿง Neuromorphic Hardware
Flag this post
Yes, you should understand backprop (2016)
karpathy.medium.comยท2dยท
Discuss: Hacker News
๐Ÿ”„Meta-Learning
Flag this post
Probabilistic Robustness for Free? Revisiting Training via a Benchmark
arxiv.orgยท2h
๐Ÿ”„Meta-Learning
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.comยท3d
๐ŸŽฏPredictive Coding
Flag this post