Masked Softmax Layers in PyTorch
mcognetta.github.io·21h·
Discuss: Hacker News
🤖Machine learning
Latent Domain Prompt Learning for Vision-Language Models
arxiv.org·7h
🔍Grad-CAM
Discriminately Treating Motion Components Evolves Joint Depth and Ego-Motion Learning
arxiv.org·7h
🔺Geometric Learning
AI for Service: Proactive Assistance with AI Glasses
paperium.net·12h·
Discuss: DEV
🧠OpenAI
Everything About Transformers
krupadave.com·5d
🧠OpenAI
How We Built a Custom Vision LLM to Improve Document Processing at Grab
engineering.grab.com·12h·
Discuss: Hacker News
🧠OpenAI
Spot The Ball: A Benchmark for Visual Social Inference
arxiv.org·7h
🔍Grad-CAM
Automated Defect Prediction via Cross-Entropy Regularized Graph Neural Networks for Microservice Architectures
dev.to·11h·
Discuss: DEV
🧠OpenAI
FOCUS: Efficient Keyframe Selection for Long Video Understanding
arxiv.org·1d
🔍Grad-CAM
Trace Anything: Representing Any Video in 4D via Trajectory Fields
paperium.net·1d·
Discuss: DEV
🔍Grad-CAM
[R] We were wrong about SNNs. The bottleneck isn't binary/sparsity, it's frequency.
reddit.com·1d·
🔥PyTorch
FG-CLIP 2: A Bilingual Fine-grained Vision-Language Alignment Model
paperium.net·1d·
Discuss: DEV
🔍Grad-CAM
Validating Deep Models for Alzheimer's 18F-FDG PET Diagnosis Across Populations: A Study with Latin American Data
arxiv.org·7h
🔍Grad-CAM
Will Spiking Neural Nets Revolutionize AI by Mimicking Brain Efficiency? by Arvind Sundararajan
dev.to·9h·
Discuss: DEV
🔥PyTorch
No More Manual Masking: The Science That Makes AI Batch Background Removal Tools So Accurate
dev.to·7h·
Discuss: DEV
👁Computer vision
FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts
arxiv.org·7h
🧠OpenAI
Dual-Stream Diffusion for World-Model Augmented Vision-Language-Action Model
arxiv.org·1d
🔍Grad-CAM
Deep Neural Watermarking for Robust Copyright Protection in 3D Point Clouds
arxiv.org·1d
☁️Point Cloud Processing
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models
paperium.net·2d·
Discuss: DEV
🔍Grad-CAM