Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Multimodal AI
👁️ Multimodal AI
Vision-Language, Image-Text, Multimodal Models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
323
posts in
10.6
ms
OpenCV 5 release - New DNN engine with enhanced ONNX and
LLM/VLM
support, Intel, Arm, and RISC-V hardware optimizations - CNX Software
🤖
LLMs
Content type:
News
cnx-software.com
·
1d
1 day ago
Actions for OpenCV 5 release - New DNN engine with enhanced ONNX and LLM/VLM support, Intel, Arm, and RISC-V hardware optimizations - CNX Software
openpilot 0.11.1
⚡
Compute Scaling
Content type:
Blog
blog.comma.ai
·
6d
6 days ago
Actions for openpilot 0.11.1
Task-Aligned Stability Analysis of
Vision-Language
Models
for Autonomous Driving Hazard Detection
🛡️
AI Safety
Content type:
Academic
arxiv.org
·
6h
6 hours ago
Actions for Task-Aligned Stability Analysis of Vision-Language Models for Autonomous Driving Hazard Detection
Advisor: Give Any
Model
a Lifeline to a Smarter One
✨
Gemini
Content type:
Blog
openrouter.ai
·
1d
1 day ago
Actions for Advisor: Give Any Model a Lifeline to a Smarter One
My LLM API Bill Hit $847/Month. Here is the Open-Source Proxy That Cut It to $89.
✨
Gemini
kaithorne.gumroad.com
·
5d
5 days ago
·
DEV
Actions for My LLM API Bill Hit $847/Month. Here is the Open-Source Proxy That Cut It to $89.
Multimodal
Browser
AI
with Transformers.js for
Images
and Speech
🤖
LLMs
machinelearningmastery.com
·
22h
22 hours ago
Actions for Multimodal Browser AI with Transformers.js for Images and Speech
linzhiqiu/t2v_metrics: Evaluating
text-to-image/video/3D
models
with VQAScore
📈
AI Progress
Content type:
Code
github.com
·
1d
1 day ago
·
Hacker News
Actions for linzhiqiu/t2v_metrics: Evaluating text-to-image/video/3D models with VQAScore
Google’s Sergey Brin Sees A Path To AGI But Not Beyond It via @sejournal, @martinibuster
🧬
Computational Biology
searchenginejournal.com
·
5d
5 days ago
Actions for Google’s Sergey Brin Sees A Path To AGI But Not Beyond It via @sejournal, @martinibuster
I Tracked Every Penny I Spent on
AI
APIs for a Month
📈
AI Progress
yixintoken.com
·
7h
7 hours ago
·
DEV
Actions for I Tracked Every Penny I Spent on AI APIs for a Month
New open-source voice
model
listens nonstop and decides every 0.4 seconds whether to speak or stay silent
⚡
Transformers
the-decoder.com
·
4d
4 days ago
Actions for New open-source voice model listens nonstop and decides every 0.4 seconds whether to speak or stay silent
Vibe Rounds Concept Document : Dr. Avinash Kumar Gupta : Free Download, Borrow, and Streaming
✨
Gemini
archive.org
·
1d
1 day ago
·
Hacker News
Actions for Vibe Rounds Concept Document : Dr. Avinash Kumar Gupta : Free Download, Borrow, and Streaming
Claude vs
GPT-4
: Which
AI
API Is Better for Developers? (2026)
📈
AI Progress
kalyna.pro
·
5d
5 days ago
·
DEV
Actions for Claude vs GPT-4: Which AI API Is Better for Developers? (2026)
ApertureLab · Synthetic Aperture Sonar Simulator
⚡
Compute Scaling
gergltd.com
·
14h
14 hours ago
·
Hacker News
Actions for ApertureLab · Synthetic Aperture Sonar Simulator
know the mother tongue of your LLMs
⚡
Transformers
mothertoken.inigoimaz.com
·
1d
1 day ago
·
Hacker News
Actions for know the mother tongue of your LLMs
Seeing Before Colliding: Anticipatory Safe RL with Frozen
Vision-Language
Models
🤖
Reinforcement Learning
Content type:
Academic
arxiv.org
·
6h
6 hours ago
Actions for Seeing Before Colliding: Anticipatory Safe RL with Frozen Vision-Language Models
Qwen3.7-Plus is Alibaba's bid to turn
multimodal
AI
into a full-blown autonomous agent
📈
AI Progress
the-decoder.com
·
5d
5 days ago
Actions for Qwen3.7-Plus is Alibaba's bid to turn multimodal AI into a full-blown autonomous agent
Start Up No.2680: Apple to relaunch Siri *again*, jet fuel shortage hits Brazil, astrophysicists see LLM future, and more
📈
AI Progress
Content type:
Blog
theoverspill.blog
·
3d
3 days ago
Actions for Start Up No.2680: Apple to relaunch Siri *again*, jet fuel shortage hits Brazil, astrophysicists see LLM future, and more
OpenCV 5 Debuts with Improved ONNX Support and Native
AI
Upgrades
⚡
Transformers
Content type:
News
hackster.io
·
19h
19 hours ago
Actions for OpenCV 5 Debuts with Improved ONNX Support and Native AI Upgrades
Pinterest Deepens AWS Partnership with US$4bn Cloud Deal
📈
AI Progress
Content type:
News
aimagazine.com
·
2d
2 days ago
Actions for Pinterest Deepens AWS Partnership with US$4bn Cloud Deal
Junior Architects with Shaky Logic: Testing
AI
’s Real-World Coding Skills – article review
📈
AI Progress
Content type:
Blog
metrics.blogg.gu.se
·
6d
6 days ago
Actions for Junior Architects with Shaky Logic: Testing AI’s Real-World Coding Skills – article review
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help