Learning Generalizable and Efficient Image Watermarking via Hierarchical Two-Stage Optimization
arxiv.org·5h
TAR-TVG: Enhancing VLMs with Timestamp Anchor-Constrained Reasoning for Temporal Video Grounding
arxiv.org·1d
CLUE: Leveraging Low-Rank Adaptation to Capture Latent Uncovered Evidence for Image Forgery Localization
arxiv.org·1d
Natural Language-Driven Viewpoint Navigation for Volume Exploration via Semantic Block Representation
arxiv.org·1d
LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation
arxiv.org·1d
SOFA: Deep Learning Framework for Simulating and Optimizing Atrial Fibrillation Ablation
arxiv.org·1d
Dynamic Pattern Alignment Learning for Pretraining Lightweight Human-Centric Vision Models
arxiv.org·1d
MIND: A Noise-Adaptive Denoising Framework for Medical Images Integrating Multi-Scale Transformer
arxiv.org·1d