Is GRPO Broken?
neelsomaniblog.comยท10hยท
Discuss: Hacker News
๐Ÿ›ก๏ธByzantine Consensus
[R] DeepSeek 3.2's sparse attention mechanism
reddit.comยท1dยท
๐Ÿ—๏ธAI Infrastructure
Quantum Agents: The Algorithmic Alchemists Reshaping Discovery
dev.toยท1hยท
Discuss: DEV
๐Ÿค–AI agents
H1B-KV: Hybrid One-Bit Caches for Memory-Efficient Large Language Model Inference
arxiv.orgยท3d
๐Ÿ—๏ธAI Infrastructure
Less Is More: Recursive Reasoning with Tiny Networks
github.comยท2dยท
Discuss: Hacker News
๐Ÿ“ฑEdge AI
Automated Fault Isolation & Healing in Linear Control Systems via Multi-Modal Data Fusion & Reinforcement Learning
dev.toยท5hยท
Discuss: DEV
๐Ÿ’งHydroponics Control
ARMOR: High-Performance Semi-Structured Pruning via Adaptive Matrix Factorization
arxiv.orgยท3d
๐ŸŽฏVector Databases
LinVideo: A Post-Training Framework towards O(n) Attention in Efficient Video Generation
arxiv.orgยท1d
๐Ÿ—๏ธAI Infrastructure
VideoNorms: Benchmarking Cultural Awareness of Video Language Models
arxiv.orgยท1d
๐ŸŽคVoice Interfaces
ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall
arxiv.orgยท1d
๐Ÿ—๏ธAI Infrastructure
Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization
arxiv.orgยท4d
๐Ÿ Self-hosted AI
Off-Trajectory Reasoning: Can LLMs Collaborate on Reasoning Trajectory?
arxiv.orgยท2d
๐Ÿ“ฑEdge AI
Stress-Testing Model Specs Reveals Character Differences among Language Models
arxiv.orgยท1d
๐ŸŽ™๏ธWhisper
Effective and Stealthy One-Shot Jailbreaks on Deployed Mobile Vision-Language Agents
arxiv.orgยท1d
๐Ÿค–AI agents
Out-of-Distribution Generalization in Climate-Aware Yield Prediction with Earth Observation Data
arxiv.orgยท1d
๐Ÿ“ฑEdge AI
Certifiable Safe RLHF: Fixed-Penalty Constraint Optimization for Safer Language Models
arxiv.orgยท4d
๐ŸŽ™๏ธWhisper
Think Natively: Unlocking Multilingual Reasoning with Consistency-Enhanced Reinforcement Learning
arxiv.orgยท2d
๐Ÿ—๏ธAI Infrastructure
Covert Quantum Learning: Privately and Verifiably Learning from Quantum Data
arxiv.orgยท2d
๐Ÿ”Decentralized Identity