🎯 Reinforcement Learning - Scourface · Scour

Optimistic Training and Convergence of Q-Learning -- Extended Version

arxiv.org·4d

🔄Meta-Learning

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

arxiv.org·16h

🔄Meta-Learning

MiniMaxAI/MiniMax-M2.5

huggingface.co·6h·

Discuss: Hacker News, r/LocalLLaMA

🎯Predictive Coding

A “Toolbox” Pipeline for Robots That See, Read, and Act

hackernoon.com·20h

🔌Neural Interfaces

Scaling LLM Post-Training at Netflix

netflixtechblog.com·12h

🔄Meta-Learning

Multi objective optimization of a discrete fracture geothermal reservoir using Bi-LSTM network

sciencedirect.com·2h

🌳recursive neural networks

Shel-y/q-drift: Quantum-inspired CLI to analyze structural fragility and decision drift in distributed systems using Shannon Entropy and Signal Decay models.

github.com·12h·

Discuss: DEV

📡Signal Processing

Olmix: A framework for data mixing throughout LM development

allenai.org·4h

🎯Predictive Coding

GLM-5: Targeting complex systems engineering and long-horizon agentic tasks

news.ycombinator.com·1h·

Discuss: Hacker News

🔄Meta-Learning

A training principle for drifting models

breno.bearblog.dev·1d

🔄Meta-Learning

Generalized Lanczos method for systematic optimization of neural-network quantum states

link.aps.org·1d

🧠Neuromorphic Computing

The democratization of AI data poisoning and how to protect your organization

csoonline.com·10h

🔒Cybersecurity

Product Forecasting through Time Series Analysis (Modelling)

pub.towardsai.net·21h

🎯Predictive Coding

Hybrid neural–cognitive models reveal how memory shapes human reward learning

nature.com·6d

🎯Predictive Coding

Recursive self-improvement from AI models

marginalrevolution.com·3d·

Discuss: Hacker News

🌳recursive neural networks

Human-like metacognitive skills will reduce LLM slop and aid alignment and capabilities

lesswrong.com·1d

🔄Meta-Learning

How to ground AI agents in accurate, context-rich data

thenewstack.io·8h

🤖Machine Learning

Ai’s Inner Workings Revealed By Model Trained On One Billion Data Points

quantumzeitgeist.com·1d

🎯Predictive Coding

AI Inference Needs A Mix-And-Match Memory Strategy

semiengineering.com·1d

🧠Neuromorphic Hardware

blog.engora.com·2d·

Discuss: Hacker News

🎯Predictive Coding

Sign up or log in to see more results