baidu/ERNIE-4.5-VL-28B-A3B-Thinking released. Curious case..
huggingface.co·8h·
Discuss: r/LocalLLaMA
🕹Game development
Flag this post
A Learning-Based Control Barrier Function for Car-Like Robots: Toward Less Conservative Collision Avoidance
arxiv.org·8h
Engineering
Flag this post
Evaluating LLMs' Reasoning Over Ordered Procedural Steps
arxiv.org·1d
🕹Game development
Flag this post
Deep Self-Evolving Reasoning
dev.to·1d·
Discuss: DEV
🕹Game development
Flag this post
CG-TTRL: Context-Guided Test-Time Reinforcement Learning for On-Device Large Language Models
arxiv.org·8h
🕹Game development
Flag this post
Study finds AI models store memories and logic in different neural regions
arstechnica.com·14h·
Engineering
Flag this post
Foundational Automatic Evaluators: Scaling Multi-Task Generative EvaluatorTraining for Reasoning-Centric Domains
paperium.net·12h·
Discuss: DEV
🕹Game development
Flag this post
Controlled Vocabularies
jessicatalisman.substack.com·1d·
Discuss: Substack
Engineering
Flag this post
When does Claude sabotage code? An Agentic Misalignment follow-up
lesswrong.com·1d
🕹Game development
Flag this post
Transforming animation with machine learning
medium.com·1d
🕹Game development
Flag this post
Large Language Models Develop Novel Social Biases Through Adaptive Exploration
arxiv.org·8h
🕹Game development
Flag this post
The Next Frontier in NLP: Smarter Agents, Not Just Bigger Models
dev.to·1d·
Discuss: DEV
🕹Game development
Flag this post
Leveraging Synthetic Data for Enhanced AI Agent Evaluation
dev.to·2d·
Discuss: DEV
🕹Game development
Flag this post
Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives
arxiv.org·8h
🕹Game development
Flag this post
Textual Self-attention Network: Test-Time Preference Optimization through Textual Gradient-based Attention
arxiv.org·8h
🕹Game development
Flag this post
Claude Projects, Sub-Agents, or Skills? Here’s How to Actually Choose
pub.towardsai.net·13h
🕹Game development
Flag this post
Fixing Enterprise Apps with AI: The T+n Problem
oreilly.com·1d
🕹Game development
Flag this post
AgentExpt: Automating AI Experiment Design with LLM-based Resource Retrieval Agent
arxiv.org·1d
Engineering
Flag this post