👁️ VLMs
Specific
vision language models, visual LLM, multimodal model
Scoured 261 posts in 6.1 ms
Decoding Pedestrian Crossing Intention from Egocentric Vision via Vision Language Models
🎭 Multimodal AI
Content type: Academic
arxiv.org · 2 days ago
NVlabs/Eagle: Eagle: Frontier Vision-Language Models with Data-Centric Strategies
🎭 Multimodal AI
Content type: Code
github.com · 5 days ago
Less-relevant results
Disquiet Junto Project 0754: The Blip
🎭 Multimodal AI
llllllll.co · 13 hours ago
A generalist biomedical vision-language model via multi-CLIP knowledge distillation
🎭 Multimodal AI
Content type: Academic
nature.com · 1 day ago
OpenCV 5 Debuts with Improved ONNX Support and Native AI Upgrades
🧠 LLMs
Content type: News
hackster.io · 20 hours ago
OpenCV 5.0 Released With Rewritten DNN Engine, Built-In LLM & VLM Support
🎭 Multimodal AI
phoronix.com · 4 days ago · Hacker News
Sale Sharks: Blip or decline after season of strife for Prem club?
🎭 Multimodal AI
Content type: News
bbc.com · 1 day ago
openpilot 0.11.1
🎭 Multimodal AI
Content type: Blog
blog.comma.ai · 6 days ago
MSUE: Multi-Modal Soccer Understanding Expert
🎭 Multimodal AI
Content type: Academic
arxiv.org · 7 hours ago
dimitrisdimitrov5-blip/Phantomix: The open-source AI browser agent. Free alternative to OpenAI Operator.
🔓 Open-source Models
Content type: Code
github.com · 4 hours ago · Hacker News
Scott Bessent says America's in a 'manufacturing renaissance' and Wall Street largely agrees. So where are the jobs?
🎭 Multimodal AI
fortune.com · 6 days ago
Pinterest inks $4 billion AI deal with AWS, the largest infrastructure commitment in its history
🎭 Multimodal AI
aboutamazon.com · 6 days ago
Modeling Complex Behaviors: Multi-Personality Composition and Dynamic Switching in Vision-Language Models
🎭 Multimodal AI
Content type: Academic
arxiv.org · 1 day ago
A Blip on a Telescope in a Colorado Parking Lot Bolstered a Space Mission That Has Found Thousands of Planets … and Counting
🎭 Multimodal AI
smithsonianmag.com · 6 days ago
A primordial black hole nicknamed ‘Phoebe’ may help solve the mystery of dark matter
🎭 Multimodal AI
scientificamerican.com · 6 days ago
VL-DINO: Leveraging CLIP Vision-Language Knowledge for Open-Vocabulary Object Detection
🎭 Multimodal AI
Content type: Academic
arxiv.org · 7 hours ago
Geometric Coastline Localization using Vision-Language Models
🎭 Multimodal AI
Content type: Academic
arxiv.org · 1 day ago
OpenMedReason: Scientific Reasoning Supervision for Medical Vision-Language Models
💡 AI Reasoning
Content type: Academic
arxiv.org · 7 hours ago
Pinterest Deepens AWS Partnership with US$4bn Cloud Deal
🎭 Multimodal AI
Content type: News
aimagazine.com · 3 days ago
Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language Models
🎭 Multimodal AI
Content type: Academic
arxiv.org · 7 hours ago