CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models
arxiv.org · 2d
Optimal Condition for Initialization Variance in Deep Neural Networks: An SGD Dynamics Perspective
arxiv.org · 1d
Pair Programming: When Explanations Go Too Far
hackernoon.com · 2d
Utilizing Vision-Language Models as Action Models for Intent Recognition and Assistance
arxiv.org · 2d
Med-GLIP: Advancing Medical Language-Image Pre-training with Large-scale Grounded Dataset
arxiv.org · 5d
Improving Text Style Transfer using Masked Diffusion Language Models with Inference-time Scaling
arxiv.org · 2d
An Efficient Medical Image Classification Method Based on a Lightweight Improved ConvNeXt-Tiny Architecture
arxiv.org · 2d
ToxiFrench: Benchmarking and Enhancing Language Models via CoT Fine-Tuning for French Toxicity Detection
arxiv.org · 2d