CLIP


A generalist biomedical vision-language model via multi-CLIP knowledge distillation

 🤗Hugging Face  Content type: Academic
nature.com·

Refining Vision-Language Models For Lithography Defect Detection

 🤖ai
semiengineering.com·

VLGA: Vision-Language-Geometry-Action Models for Autonomous Driving

 🤗Hugging Face  Content type: Academic
arxiv.org·

Less-relevant results

Computer Vision and Geometry Group | Robot Learning

 🤖ai

Issue 655

 📊Data Science  Content type: News  Content type: Blog

NVIDIA's Cosmos 3: The World's First Fully Open AI Omnimodel

 🤖AI  Content type: News
aimagazine.com·

ApertureLab · Synthetic Aperture Sonar Simulator

 🤖ai
gergltd.com··Hacker News

Mi50 32GB / GFX906 - vLLM Qwen 3.5 Configuration for Qwen 3.5:9B AWQ-4bit

 🤖ai

TEVI: Text-Conditioned Editing of Visual Representations via Sparse Autoencoders for Improved Vision-Language Alignment

 🤗Hugging Face  Content type: Academic
arxiv.org·

Are Emotion Reading Robots Still Missing What Matters Most?

 🤗Hugging Face  Content type: News
spectrum.ieee.org·

Nvidia Accelerates Healthcare AI Race With Transcription Model

 🤖AI
pymnts.com·

1% Stake, 100% Strategy: How Big Is Toyota's Game?

 🤖Machine Learning
autonews.gasgoo.com·

VL-DINO: Leveraging CLIP Vision-Language Knowledge for Open-Vocabulary Object Detection

 🤗Hugging Face  Content type: Academic
arxiv.org·

Can 15 European startups reshape physical AI? Google DeepMind just bet on them

 🚀Startups
ppc.land··r/europe

DOCX, PDFs Were Not Built for AI. This New Open Standard Wants to Change That

 🧠LLMs
itsfoss.com·

A disease-centric vision-language foundation model for precision oncology in kidney cancer

 📊Data Science  Content type: Academic
nature.com·

PixelRAG beats text parsers on accuracy and cuts AI agent token costs 10x

 🔍RAG
venturebeat.com·

Accreted Intelligence — it does your work, and every action makes it smarter

 💬NLP
accint.xyz··Hacker News

APT: Action Expert Pretraining Improves Instruction Generalization of Vision-Language-Action Policies

 🤖AI  Content type: Academic
arxiv.org·

OpenCV 5 release - New DNN engine with enhanced ONNX and LLM/VLM support, Intel, Arm, and RISC-V hardware optimizations - CNX Software

 🤖Machine Learning  Content type: News
cnx-software.com·
