🎯 Reinforcement Learning - hello · Scour

Do We Need Adam? Surprisingly Strong and Sparse Reinforcement Learning with SGD in LLMs

arxiv.org·7h

Reinforcement Learning from Human Feedback

arxiv.org·2d

🎲Deterministic Simulation

lonestation.itch.io·2d

Cross Entropy Derivatives, Part 6: Using gradient descent to reach the final result

dev.to·1d·

Discuss: DEV

📊Optimization

Personalized Adaptive Feedback System for Early Detection and Intervention of Fine‑Motor Skill Development in Preschool Children Using Wearable IMU Sensors and Reinforcement Learning

freederia.com·4d

🧭Inertial Navigation

Learning Models with Uniform Performance via Distributionally RobustOptimization

dev.to·2d·

Discuss: DEV

📊Optimization

On Recursive Self-Improvement (Part I)

hyperdimensional.co·1d

Habit Detection For Home Assistant

hackaday.com·1d

🏠Home Automation

Clawdbot and the Rise of AI Agents: How Autonomous AI Is Changing the Way We Work

inoru.com·21h·

Discuss: DEV

🛡️AI Security

Hypernetworks: Neural Networks for Hierarchical Data

blog.sturdystatistics.com·4d·

Discuss: Hacker News

Designing a Cost-Efficient Agentic System

p.agnihotry.com·18h·

Discuss: Hacker News

Barn Owls Know When to Wait (iuSTDP part 2)

blog.typeobject.com·2d·

Discuss: Hacker News

🚦Wait-Free Algorithms

**Abstract:** This paper introduces a novel approach to automated credit risk assessment and early warning systems leveraging a hierarchical Bayesian network...

freederia.com·3d

💰TigerBeetle

learning by reverse engineering

clymup.com·2d

🔍Reverse Engineering

Exploiting large language model with reinforcement learning for generative job recommendations

eurekalert.org·4d

💬Prompt Engineering

userface.ai·1d

Fastfood: Approximate Kernel Expansions in Loglinear Time

paperium.net·2d·

Discuss: DEV

ben guo 🪽 on X: "How to code better with AI using this one weird trick"

x.com·1d·

Discuss: X

💬Prompt Engineering

Show HN: We added AGENTS.md to 120 challenges so AI teaches instead of codes

frontendmentor.io·20h·

Discuss: Hacker News

💬Prompt Engineering

Scientists reveal the alien logic of AI: hyper-rational but stumped by simple concepts

psypost.org·2d

💬Prompt Engineering

Loading more...