Analyze-Prompt-Reason: A Collaborative Agent-Based Framework for Multi-Image Vision-Language Reasoning
arxiv.org·1d
Harnessing Textual Semantic Priors for Knowledge Transfer and Refinement in CLIP-Driven Continual Learning
arxiv.org·13h
Rein++: Efficient Generalization and Adaptation for Semantic Segmentation with Vision Foundation Models
arxiv.org·13h
Vision transformer-based multi-camera multi-object tracking framework for dairy cow monitoring
arxiv.org·13h
An Evolving Scenario Generation Method based on Dual-modal Driver Model Trained by Multi-Agent Reinforcement Learning
arxiv.org·13h