Simulated Human Learning in a Dynamic, Partially-Observed, Time-Series Environment
arxiv.org·22h
Automatic Differentiation
Flag this post
October 2024 Progress in Guaranteed Safe AI
lesswrong.com·9h
🗣️Large Language Models
Flag this post
Deep dive into the small details of micrograd
omrishneor.github.io·2d·
Discuss: Hacker News
Automatic Differentiation
Flag this post
Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization
arxiv.org·22h
🗣️Large Language Models
Flag this post
Thinking through how pretraining vs RL learn
dwarkesh.com·3d·
Discuss: Hacker News
📊Optimization
Flag this post
Show HN: MQ-AGI A neuro-symbolic architecture for modular AGI
news.ycombinator.com·1d·
Discuss: Hacker News
🗣️Large Language Models
Flag this post
Distribution Matching Distillation Meets Reinforcement Learning
arxiv.org·2d
Automatic Differentiation
Flag this post
Foundations for autonomous finance – Part I
ldstn.substack.com·2d·
Discuss: Substack
📊Optimization
Flag this post
Robots learn from experience making espresso, building boxes and folding laundry
pi.website·3d·
Discuss: Hacker News
Automatic Differentiation
Flag this post
Enhancing Reinforcement Learning in 3D Environments through Semantic Segmentation: A Case Study in ViZDoom
arxiv.org·2d
🧠Deep Learning
Flag this post
Dominance: The Standard Everyday Solution To Akrasia
lesswrong.com·5h
🎯Decision Theory
Flag this post
Olmo 3: America’s truly open reasoning models
interconnects.ai·13h·
Discuss: Hacker News
🗣️Large Language Models
Flag this post
AGI Doesn't Need More Parameters – It Needs an Epistemic Loop
researchgate.net·9h·
Discuss: Hacker News
🤖AI
Flag this post
COMPASS: Context-Modulated PID Attention Steering System for Hallucination Mitigation
arxiv.org·22h
🗣️Large Language Models
Flag this post
Function-on-Function Bayesian Optimization
arxiv.org·2d
📊Optimization
Flag this post
The Latent Role of Open Models in the AI Economy
papers.ssrn.com·23h·
Discuss: Hacker News
🤖AI
Flag this post
Interactive language learning with Claude Code
github.com·18h·
Discuss: Hacker News
🤖AI
Flag this post
AI for bio needs real-time data
coherenceneuro.substack.com·12h·
Discuss: Substack
⏱️Time Series Analysis
Flag this post