CLIP

Feeds to Scour
SubscribedAll
Scoured 227 posts in 9.6 ms

OpenMedReason: Scientific Reasoning Supervision for Medical Vision-Language Models

 🤖ai  Content type: Academic
arxiv.org·
Less-relevant results

Why Robotics Is a Pre-Paradigm Field

 🤖Machine Learning  Content type: News

AVIS: Adaptive Test-Time Scaling for Vision-Language Models

 🤗Hugging Face  Content type: Academic
arxiv.org·

Textual Supervision Enhances Geospatial Representations in Vision-Language Models

 🤗Hugging Face  Content type: Academic
arxiv.org·

4DP-QA: Scalable QA for 4D Perception in Vision Language Models

 🤗Hugging Face  Content type: Academic

AI, Orientalism, and “Calm” Art

 📊Data Science
nightingaledvs.com·

LabVLA: Grounding Vision-Language-Action Models in Scientific Laboratories

 🤖AI Engineering  Content type: Academic
arxiv.org·

Xpeng CEO takes helm of robotics unit to drive physical AI transition

 🤖AI Engineering
cnevpost.com·

World Pilot: Steering Vision-Language-Action Models with World-Action Priors

 🤖AI  Content type: Academic
arxiv.org·

The Concerning, Unchecked Rise of E2E AI in Physical Applications

 🤖Machine Learning  Content type: News
eetimes.com·

Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language Models

 🔍RAG  Content type: Academic
arxiv.org·

Building Pakistan Notice Helper: A Small AI Tool for a Very Local Safety Problem

 🤖ai  Content type: Blog
huggingface.co·

Trajectory-Level Redirection Attacks on Vision-Language-Action Models

 🤖AI  Content type: Academic
arxiv.org·

Task-Aligned Stability Analysis of Vision-Language Models for Autonomous Driving Hazard Detection

 🤗Hugging Face  Content type: Academic
arxiv.org·

NXP Computex Keynote 2026 Coverage

 🧠LLM Inference
servethehome.com·

From Prompts to Tokens: Internalizing Causal Supervision in Vision-Language Model for Multi-Image Causal Reasoning

 🧠LLMs  Content type: Academic
arxiv.org·

MedSIGHT: Towards Grounded Visual Comprehension in Medical Large Vision-Language Models

 🧠LLMs  Content type: Academic
arxiv.org·

Iterative Visual Thinking: Teaching Vision-Language Models Spatial Self-Correction through Visual Feedback

 🤗Hugging Face  Content type: Academic
arxiv.org·

When Does Language Matter? Multilingual Instructions Reveal Step-wise Language Sensitivity in Vision-Language-Action Models

 🤖AI  Content type: Academic

CLASP: Language-Driven Robot Skill Selection and Composition using Task-Parameterized Learning

 🤖ai  Content type: Academic
arxiv.org·
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help