DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning
🧩Neurodiverse
NOAH: Benchmarking Narrative Prior driven Hallucination and Omission in Video Large Language Models
arxiv.org·6h
🧩Neurodiverse
Seeing Shapes: Unveiling Neural Network Vision with Fourier Geometry by Arvind Sundararajan
🧩Neurodiverse
When Bias Pretends to Be Truth: How Spurious Correlations Undermine Hallucination Detection in LLMs
arxiv.org·6h
🧩Neurodiverse
Cross-Modal Fine-Tuning of 3D Convolutional Foundation Models for ADHD Classification with Low-Rank Adaptation
arxiv.org·6h
🧩Neurodiverse
Can Training Dynamics of Scale-Invariant Neural Networks Be Explained by the Thermodynamics of an Ideal Gas?
arxiv.org·6h
🧩Neurodiverse
Explainable Deep Learning-based Classification of Wolff-Parkinson-White Electrocardiographic Signals
arxiv.org·6h
🧩Neurodiverse
Adaptive PID Control for Robotic Systems via Hierarchical Meta-Learning and Reinforcement Learning with Physics-Based Data Augmentation
arxiv.org·6h
🧩Neurodiverse
CG-TTRL: Context-Guided Test-Time Reinforcement Learning for On-Device Large Language Models
arxiv.org·6h
🧩Neurodiverse
Enhancing Robustness of Graph Neural Networks through p-Laplacian
arxiv.org·6h
🧩Neurodiverse
More Agents Helps but Adversarial Robustness Gap Persists
arxiv.org·6h
🧩Neurodiverse
A Low-Rank Method for Vision Language Model Hallucination Mitigation in Autonomous Driving
arxiv.org·6h
🧩Neurodiverse
Optimizing Diversity and Quality through Base-Aligned Model Collaboration
arxiv.org·6h
🧩Neurodiverse
[D] Information geometry, anyone?
🧩Neurodiverse
Simulating Clifford Circuits with Gaussian Elimination
arxiv.org·6h
🧩Neurodiverse