Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance
arxiv.org·4d
Nearest-Better Network for Visualizing and Analyzing Combinatorial Optimization Problems: A Unified Tool
arxiv.org·4d
SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation
arxiv.org·10h
Towards Interpretable Renal Health Decline Forecasting via Multi-LMM Collaborative Reasoning Framework
arxiv.org·4d
HRIPBench: Benchmarking LLMs in Harm Reduction Information Provision to Support People Who Use Drugs
arxiv.org·5d
Loading...Loading more...