Attention Mechanisms, Large Language Models, BERT, Encoder-Decoder Architecture
Listening Before Speaking
lesswrong.comยท1d
Learning Generalizable and Efficient Image Watermarking via Hierarchical Two-Stage Optimization
arxiv.orgยท49m
Audio-Thinker: Guiding Audio Language Model When and How to Think via Reinforcement Learning
arxiv.orgยท1d
A Risk Taxonomy and Reflection Tool for Large Language Model Adoption in Public Health
arxiv.orgยท49m
Multi-modal Policies with Physics-informed Representations in Complex Fluid Environments
arxiv.orgยท49m
"Pull or Not to Pull?'': Investigating Moral Biases in Leading Large Language Models Across Ethical Dilemmas
arxiv.orgยท1d
Information Bottleneck-based Causal Attention for Multi-label Medical Image Recognition
arxiv.orgยท1d
Affordance-R1: Reinforcement Learning for Generalizable Affordance Reasoning in Multimodal Large Language Model
arxiv.orgยท2d
Multimodal AI Systems for Enhanced Laying Hen Welfare Assessment and Productivity Optimization
arxiv.orgยท1d
Loading...Loading more...