Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Multimodal AI
🖼️ Multimodal AI
multimodal, vision language model, VLM, 多模态
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
328
posts in
7.2
ms
Do VLMs
Reason
Like Engineers? A Benchmark and a Stage-wise Evaluation
⚙️
MLOps
Content type:
Academic
arxiv.org
·
22h
22 hours ago
Actions for Do VLMs Reason Like Engineers? A Benchmark and a Stage-wise Evaluation
Claude vs
GPT-4
: Which
AI
API Is Better for Developers? (2026)
🧠
LLM
kalyna.pro
·
5d
5 days ago
·
DEV
Actions for Claude vs GPT-4: Which AI API Is Better for Developers? (2026)
I just built a small OCR tool that runs completely offline in your browser.
💬
NLP
uploadless.app
·
3d
3 days ago
·
r/u_Character-Ad5614
Actions for I just built a small OCR tool that runs completely offline in your browser.
NVIDIA's Cosmos 3: The World's First Fully Open
AI
Omnimodel
🤖
AI Agents
Content type:
News
aimagazine.com
·
1d
1 day ago
Actions for NVIDIA's Cosmos 3: The World's First Fully Open AI Omnimodel
ApertureLab · Synthetic Aperture Sonar Simulator
🤗
Hugging Face
gergltd.com
·
6h
6 hours ago
·
Hacker News
Actions for ApertureLab · Synthetic Aperture Sonar Simulator
openpilot 0.11.1
🎯
Fine-tuning
Content type:
Blog
blog.comma.ai
·
5d
5 days ago
Actions for openpilot 0.11.1
ChatGPT Is $20 a Month, This App Gives You
GPT
, Claude, and Gemini for a Year for $29.99
🤗
Hugging Face
techpowerup.com
·
13h
13 hours ago
Actions for ChatGPT Is $20 a Month, This App Gives You GPT, Claude, and Gemini for a Year for $29.99
know the mother tongue of your LLMs
🧠
LLM
mothertoken.inigoimaz.com
·
1d
1 day ago
·
Hacker News
Actions for know the mother tongue of your LLMs
How to Defend Against Prompt Injection in Production
🧠
LLM
Content type:
Reference
leanpub.com
·
2d
2 days ago
·
DEV
Actions for How to Defend Against Prompt Injection in Production
My LLM API Bill Hit $847/Month. Here is the Open-Source Proxy That Cut It to $89.
⚙️
MLOps
kaithorne.gumroad.com
·
5d
5 days ago
·
DEV
Actions for My LLM API Bill Hit $847/Month. Here is the Open-Source Proxy That Cut It to $89.
A new chapter of efficient foundation
models
for medical
imaging
🤗
Hugging Face
techcommunity.microsoft.com
·
13h
13 hours ago
Actions for A new chapter of efficient foundation models for medical imaging
linzhiqiu/t2v_metrics: Evaluating
text-to-image/video/3D
models
with VQAScore
⚡
Machine Learning
Content type:
Code
github.com
·
1d
1 day ago
·
Hacker News
Actions for linzhiqiu/t2v_metrics: Evaluating text-to-image/video/3D models with VQAScore
Turn multiple
AI
subscriptions into one $60 lifetime plan with
GPT-4o
, Claude, and Gemini included
🤖
AI Agents
pcworld.com
·
3d
3 days ago
Actions for Turn multiple AI subscriptions into one $60 lifetime plan with GPT-4o, Claude, and Gemini included
One Token per
Multimodal
Evidence: Latent Memory for Resource-Constrained QA
📚
RAG
Content type:
Academic
arxiv.org
·
22h
22 hours ago
Actions for One Token per Multimodal Evidence: Latent Memory for Resource-Constrained QA
New open-source voice
model
listens nonstop and decides every 0.4 seconds whether to speak or stay silent
🎯
Fine-tuning
the-decoder.com
·
4d
4 days ago
Actions for New open-source voice model listens nonstop and decides every 0.4 seconds whether to speak or stay silent
OpenCV 5 Debuts with Improved ONNX Support and Native
AI
Upgrades
🔬
Deep Learning
Content type:
News
hackster.io
·
12h
12 hours ago
Actions for OpenCV 5 Debuts with Improved ONNX Support and Native AI Upgrades
Rivian Doesn’t Care How Much You Like Interior Buttons, Voice Control Is Better
🤖
AI Agents
Content type:
News
carscoops.com
·
2d
2 days ago
Actions for Rivian Doesn’t Care How Much You Like Interior Buttons, Voice Control Is Better
What TTS Throws Away
📚
RAG
amaldavid.com
·
4d
4 days ago
·
Hacker News
Actions for What TTS Throws Away
Pinterest Deepens AWS Partnership with US$4bn Cloud Deal
⚙️
MLOps
Content type:
News
aimagazine.com
·
2d
2 days ago
Actions for Pinterest Deepens AWS Partnership with US$4bn Cloud Deal
OpenCV 5.0 Released With Rewritten DNN Engine, Built-In LLM &
VLM
Support
🔬
Deep Learning
phoronix.com
·
4d
4 days ago
·
Hacker News
Actions for OpenCV 5.0 Released With Rewritten DNN Engine, Built-In LLM & VLM Support
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help