graphical interfaces, gesture interfaces, multimodal interactions, voice interactions
MAP: Mitigating Hallucinations in Large Vision-Language Models with Map-Level Attention Processing
arxiv.org·11h
Model Misalignment and Language Change: Traces of AI-Associated Language in Unscripted Spoken English
arxiv.org·1d
Harnessing Textual Semantic Priors for Knowledge Transfer and Refinement in CLIP-Driven Continual Learning
arxiv.org·11h
Loading...Loading more...