speech-to-text, model, local, whisper, whisper.cpp, input, voice, recognition, ai
Knowing or Guessing? Robust Medical Visual Question Answering via Joint Consistency and Contrastive Learning
arxiv.org·3d
Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance
arxiv.org·1d
Loading...Loading more...