graphical interfaces, gesture interfaces, multimodal interactions, voice interactions
BASIC: Boosting Visual Alignment with Intrinsic Refined Embeddings in Multimodal Large Language Models
arxiv.org·2d
ParallelSearch: Train your LLMs to Decompose Query and Search Sub-queries in Parallel with Reinforcement Learning
arxiv.org·21h