I Taught an AI to Dream
๐ง OpenAI
Flag this post
Validating Deep Models for Alzheimer's 18F-FDG PET Diagnosis Across Populations: A Study with Latin American Data
arxiv.orgยท18h
๐๏ธVision Transformers
Flag this post
Bayesian Natural Gradient Fine-Tuning of CLIP Models via Kalman Filtering
arxiv.orgยท18h
๐คMachine learning
Flag this post
The Curvature Rate {\lambda}: A Scalar Measure of Input-Space Sharpness in Neural Networks
arxiv.orgยท18h
๐๏ธVision Transformers
Flag this post
Explore More, Learn Better: Parallel MLLM Embeddings under Mutual Information Minimization
arxiv.orgยท18h
๐คMachine learning
Flag this post
Latent Domain Prompt Learning for Vision-Language Models
arxiv.orgยท18h
๐ง OpenAI
Flag this post
Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning
arxiv.orgยท1d
๐ง OpenAI
Flag this post
OmniBrainBench: A Comprehensive Multimodal Benchmark for Brain Imaging Analysis Across Multi-stage Clinical Tasks
arxiv.orgยท18h
๐Altair
Flag this post
VLM6D: VLM based 6Dof Pose Estimation based on RGB-D Images
arxiv.orgยท18h
๐บGeometric Learning
Flag this post
RยฒDยฒ: Perception-Guided Task & Motion Planning for Long-Horizon Manipulation
developer.nvidia.comยท1d
๐บGeometric Learning
Flag this post
A Hybrid Deep Learning and Forensic Approach for Robust Deepfake Detection
arxiv.orgยท1d
๐ฅPyTorch
Flag this post
SciTextures: Collecting and Connecting Visual Patterns, Models, and Code Across Science and Art
arxiv.orgยท18h
๐Altair
Flag this post
FOCUS: Efficient Keyframe Selection for Long Video Understanding
arxiv.orgยท1d
๐ทOpenCV
Flag this post
A filtering scheme for confocal laser endomicroscopy (CLE)-video sequences for self-supervised learning
arxiv.orgยท18h
๐บGeometric Learning
Flag this post
Diagnosing Hallucination Risk in AI Surgical Decision-Support: A Sequential Framework for Sequential Validation
arxiv.orgยท18h
๐ง OpenAI
Flag this post
FreeSliders: Training-Free, Modality-Agnostic Concept Sliders for Fine-Grained Diffusion Control in Images, Audio, and Video
arxiv.orgยท18h
๐ง OpenAI
Flag this post
Text-guided Fine-Grained Video Anomaly Detection
arxiv.orgยท18h
๐๏ธVision Transformers
Flag this post
A Retrospect to Multi-prompt Learning across Vision and Language
arxiv.orgยท18h
๐๏ธVision Transformers
Flag this post
Loading...Loading more...