RSS feed in an Astro blog
amanhimself.devยท2d
When Images Speak Louder: Mitigating Language Bias-induced Hallucinations in VLMs through Cross-Modal Guidance
arxiv.orgยท7h
SyncLipMAE: Contrastive Masked Pretraining for Audio-Visual Talking-Face Representation
arxiv.orgยท7h
Loading...Loading more...