Vision Transformers

Feeds to Scour
SubscribedAll
Scoured 47 posts in 7.4 ms

Beyond Humans: Multispecies Animal Face Recognition Using Transfer Learning

ย ๐Ÿ‘Computer vision ย Content type: Academic
arxiv.orgยท

Page image classifier fine-tuned on century-spanning archives of scanned documents for further content-specific processing

ย ๐Ÿ‘Computer vision ย Content type: Academic
arxiv.orgยท

Uncertainty-Aware Adaptive Sensor Fusion for Autonomous Navigation

ย ๐Ÿ‘Computer vision ย Content type: Academic
arxiv.orgยท
Less-relevant results

CL-CLIP: CLIP-Based Continual Learning Framework with Cost-Volume Category Decoupling for Object Detection

ย ๐Ÿ‘Computer vision ย Content type: Academic
arxiv.orgยท

An Open-Source Two-Stage Computer Vision Pipeline for Fine-Grained Vehicle Classification using Vision Transformers

ย ๐Ÿ‘Computer vision ย Content type: Academic
arxiv.orgยท

Vision-Assisted Foundation Model for Solving Multi-Task Vehicle Routing Problems

ย ๐Ÿ‘Computer vision ย Content type: Academic
arxiv.orgยท

LRMIL: Efficient Low-Resolution Multiple Instance Learning via High-Resolution Knowledge Distillation for Whole Slide Image Classification

ย ๐Ÿ”Grad-CAM ย Content type: Academic
arxiv.orgยท

SynIB: Informational Bottleneck for Maximizing Synergy in Multimodal Learning

ย ๐Ÿ”Grad-CAM ย Content type: Academic
arxiv.orgยท

AMN: An Adaptive Multi-Scale Fusion Network with Boundary and Uncertainty Modeling for Nuclei Segmentation

ย ๐Ÿ”Grad-CAM ย Content type: Academic
arxiv.orgยท

Don't waste SAM

ย ๐Ÿ‘Computer vision ย Content type: Academic
arxiv.orgยท

Textual Supervision Enhances Geospatial Representations in Vision-Language Models

ย ๐Ÿ”Grad-CAM ย Content type: Academic
arxiv.orgยท

HarmoView: Harmonizing Multi-View Constraints for Identity-Consistent Video Generation

ย ๐Ÿ“ทOpenCV ย Content type: Academic
arxiv.orgยท

SlideCheck: Guiding Self-Supervised Pretraining of Pathology Foundation Models via Dataset Distributions

ย ๐Ÿ”Grad-CAM ย Content type: Academic
arxiv.orgยท

T-SAR-JEPA: Self-Supervised Temporal Anomaly Detection in SAR Amplitude Stacks via Latent Prediction

ย ๐Ÿ”ฌscikit-learn ย Content type: Academic
arxiv.orgยท

How Much MRI Preprocessing Is Enough? A Cost-Utility Study for Brain MRI Foundation Models

ย ๐Ÿ”Grad-CAM ย Content type: Academic
arxiv.orgยท

Kwai Keye-VL-2.0 Technical Report

ย ๐Ÿง OpenAI ย Content type: Academic
arxiv.orgยท

Human-Centered Benchmarking of Driver Monitoring Models

ย ๐Ÿ”Grad-CAM ย Content type: Academic
arxiv.orgยท

LatentWave: JEPA Pretraining for Wireless Foundation Models

ย ๐Ÿง OpenAI ย Content type: Academic
arxiv.orgยท

Reconstructing Multi-Decadal Forest Disturbances: A Spatio-Temporal Transformer Approach

ย ๐Ÿค–Machine learning ย Content type: Academic
arxiv.orgยท

A Unifying View of Attention Sinks: Two Algorithms, Two Solutions

ย ๐Ÿ‘Computer vision ย Content type: Academic
arxiv.orgยท

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help