A Learning-Based Control Barrier Function for Car-Like Robots: Toward Less Conservative Collision Avoidance
arxiv.org·8h
⚙Engineering
Flag this post
Evaluating LLMs' Reasoning Over Ordered Procedural Steps
arxiv.org·1d
🕹Game development
Flag this post
Adaptive PID Control for Robotic Systems via Hierarchical Meta-Learning and Reinforcement Learning with Physics-Based Data Augmentation
arxiv.org·8h
⚙Engineering
Flag this post
Deep Self-Evolving Reasoning
🕹Game development
Flag this post
CG-TTRL: Context-Guided Test-Time Reinforcement Learning for On-Device Large Language Models
arxiv.org·8h
🕹Game development
Flag this post
Study finds AI models store memories and logic in different neural regions
⚙Engineering
Flag this post
Foundational Automatic Evaluators: Scaling Multi-Task Generative EvaluatorTraining for Reasoning-Centric Domains
🕹Game development
Flag this post
Controlled Vocabularies
⚙Engineering
Flag this post
When does Claude sabotage code? An Agentic Misalignment follow-up
lesswrong.com·1d
🕹Game development
Flag this post
Transforming animation with machine learning
medium.com·1d
🕹Game development
Flag this post
Large Language Models Develop Novel Social Biases Through Adaptive Exploration
arxiv.org·8h
🕹Game development
Flag this post
AdvisingWise: Supporting Academic Advising in Higher Educations Through a Human-in-the-Loop Multi-Agent Framework
arxiv.org·8h
🕹Game development
Flag this post
Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives
arxiv.org·8h
🕹Game development
Flag this post
Textual Self-attention Network: Test-Time Preference Optimization through Textual Gradient-based Attention
arxiv.org·8h
🕹Game development
Flag this post
Claude Projects, Sub-Agents, or Skills? Here’s How to Actually Choose
pub.towardsai.net·13h
🕹Game development
Flag this post
Fixing Enterprise Apps with AI: The T+n Problem
oreilly.com·1d
🕹Game development
Flag this post
AgentExpt: Automating AI Experiment Design with LLM-based Resource Retrieval Agent
arxiv.org·1d
⚙Engineering
Flag this post
Loading...Loading more...