Codebook Learning, K-means Clustering, Compression, Product Quantization
Layer-Wise Analysis of Self-Supervised Representations for Age and Gender Classification in Children's Speech
arxiv.org·23h
ViMoNet: A Multimodal Vision-Language Framework for Human Behavior Understanding from Motion and Video
arxiv.org·1d
Structured Kernel Regression VAE: A Computationally Efficient Surrogate for GP-VAEs in ICA
arxiv.org·1d
A Vision-Language Pre-training Model-Guided Approach for Mitigating Backdoor Attacks in Federated Learning
arxiv.org·23h