Training Kindai OCR with parallel textline images and self-attention feature distance-based loss
arxiv.org·49m
LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation
arxiv.org·1d
Fast and Generalizable parameter-embedded Neural Operators for Lithium-Ion Battery Simulation
arxiv.org·1d
CATP: Contextually Adaptive Token Pruning for Efficient and Enhanced Multimodal In-Context Learning
arxiv.org·1d
Loading...Loading more...