GDM: Consistency Training Helps Limit Sycophancy and Jailbreaks in Gemini 2.5 Flash
lesswrong.com·11h
🧠OpenAI
Flag this post
Petri Dish Neural Cellular Automata
🔢NumPy
Flag this post
Don’t Just Normalize, Batch Normalize! A Guide to Stable Neural Networks
pub.towardsai.net·1d
👁️Vision Transformers
Flag this post
AI Maps the Brain’s Hidden Bridge, Revealing Genetic Links to Mental Health
neurosciencenews.com·7h
📊Data Science
Flag this post
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
arxiv.org·23h
🧠OpenAI
Flag this post
Building a Multimodal RAG That Responds with Text, Images, and Tables from Sources
towardsdatascience.com·1d
🧠OpenAI
Flag this post
Spot The Ball: A Benchmark for Visual Social Inference
arxiv.org·23h
👁️Vision Transformers
Flag this post
Generating Accurate and Detailed Captions for High-Resolution Images
arxiv.org·1d
🧠OpenAI
Flag this post
UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning
👁️Vision Transformers
Flag this post
Show HN: ReadMyMRI DICOM native preprocessor with multi model consensus/ML pipes
🧠OpenAI
Flag this post
A systematic evaluation of uncertainty quantification techniques in deep learning: a case study in photoplethysmography signal analysis
arxiv.org·23h
🔥PyTorch
Flag this post
Deep Generative Models for Enhanced Vitreous OCT Imaging
arxiv.org·23h
👁️Vision Transformers
Flag this post
EVTAR: End-to-End Try on with Additional Unpaired Visual Reference
arxiv.org·23h
🤗Hugging Face
Flag this post
A generative adversarial network optimization method for damage detection and digital twinning by deep AI fault learning: Z24 Bridge structural health monitorin...
arxiv.org·23h
👁️Vision Transformers
Flag this post
Few-Shot Multimodal Medical Imaging: A Theoretical Framework
arxiv.org·23h
👁️Vision Transformers
Flag this post
Loading...Loading more...