Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Vision Language Model
👁 Vision Language Model
Specific
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
172
posts in
6.8
ms
docs: document pdf web tool tests · openclaw/openclaw@1e8609a
🔥
PyTorch
Content type:
Code
github.com
·
6d
6 days ago
Actions for docs: document pdf web tool tests · openclaw/openclaw@1e8609a
Task-Aligned Stability Analysis of
Vision-Language
Models
for Autonomous Driving Hazard Detection
👁
Computer vision
Content type:
Academic
arxiv.org
·
12h
12 hours ago
Actions for Task-Aligned Stability Analysis of Vision-Language Models for Autonomous Driving Hazard Detection
A new chapter of efficient foundation
models
for medical
imaging
🔥
PyTorch
techcommunity.microsoft.com
·
1d
1 day ago
Actions for A new chapter of efficient foundation models for medical imaging
AI-Based Medication Monitoring, Subreddit Spam, Chipotle Chatbots, More: ResearchBuzz AI Update, June 6, 2026
📱
Edge AI
researchbuzz.me
·
5d
5 days ago
Actions for AI-Based Medication Monitoring, Subreddit Spam, Chipotle Chatbots, More: ResearchBuzz AI Update, June 6, 2026
AVIS: Adaptive Test-Time Scaling for
Vision-Language
Models
👁
Computer vision
Content type:
Academic
arxiv.org
·
12h
12 hours ago
Actions for AVIS: Adaptive Test-Time Scaling for Vision-Language Models
Apple rebuilt its on-device AI stack at WWDC 2026
🔥
PyTorch
Content type:
Blog
ziraph.com
·
2d
2 days ago
·
Hacker News
Actions for Apple rebuilt its on-device AI stack at WWDC 2026
Reroute, Don't Remove: Recoverable
Visual
Token Routing for
Vision-Language
Models
👁
Computer vision
Content type:
Academic
arxiv.org
·
12h
12 hours ago
Actions for Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language Models
Human-driven sea-level rise has quadrupled the frequency of coastal sea-level extremes since 1900
🤖
AI
Content type:
Academic
nature.com
·
1d
1 day ago
Actions for Human-driven sea-level rise has quadrupled the frequency of coastal sea-level extremes since 1900
Gemma 4 QAT
models
: Optimizing model compression for mobile and laptop efficiency
📱
Edge AI
Content type:
News
Content type:
Blog
blog.google
·
6d
6 days ago
·
Hacker News
Actions for Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency
One Stone, Three Birds: Self-adaptive Optimal Transport for
Multi-VLM
Selection, Adaptation, and Ensembling
📱
Edge AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for One Stone, Three Birds: Self-adaptive Optimal Transport for Multi-VLM Selection, Adaptation, and Ensembling
Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB
🤖
AI
Content type:
Blog
ziraph.com
·
5d
5 days ago
·
Hacker News
Actions for Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB
Are Reasoning
Vision-Language
Models
Robust to Semantic Visual Distractions?
👁
Computer vision
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Are Reasoning Vision-Language Models Robust to Semantic Visual Distractions?
From Prompts to Tokens: Internalizing Causal Supervision in
Vision-Language
Model
for Multi-Image Causal Reasoning
👁
Computer vision
Content type:
Academic
arxiv.org
·
12h
12 hours ago
Actions for From Prompts to Tokens: Internalizing Causal Supervision in Vision-Language Model for Multi-Image Causal Reasoning
Google Gemma 4 12B nearly matches 26B benchmarks — and runs on your laptop
📱
Edge AI
thenewstack.io
·
6d
6 days ago
Actions for Google Gemma 4 12B nearly matches 26B benchmarks — and runs on your laptop
DiffusionGemma 26B A4B results on my 5090
🔥
PyTorch
huggingface.co
·
1d
1 day ago
·
r/LocalLLaMA
Actions for DiffusionGemma 26B A4B results on my 5090
World
Model
Self-Distillation: Training World Models to Solve General Tasks
👁
Computer vision
Content type:
Academic
arxiv.org
·
12h
12 hours ago
Actions for World Model Self-Distillation: Training World Models to Solve General Tasks
OmniGameArena: A Unified UE5 Benchmark for
VLM
Game Agents with Improvement Dynamics
📱
Edge AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for OmniGameArena: A Unified UE5 Benchmark for VLM Game Agents with Improvement Dynamics
DataXflowGen for GenAI-driven
model
generation
🤖
AI
Content type:
Academic
nature.com
·
5d
5 days ago
Actions for DataXflowGen for GenAI-driven model generation
MLingualFC: Evaluating Jailbreak Vulnerabilities in Multilingual
Vision-Language
Models
📱
Edge AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for MLingualFC: Evaluating Jailbreak Vulnerabilities in Multilingual Vision-Language Models
fix(media-understanding): preserve native
vision
skip with
imageModel
… · openclaw/openclaw@d1cb6cd
🎯
Object Detection
Content type:
Code
github.com
·
4d
4 days ago
Actions for fix(media-understanding): preserve native vision skip with imageModel… · openclaw/openclaw@d1cb6cd
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help