🖼️ Multimodal AI - tfriedel · Scour

Decoding Pedestrian Crossing Intention from Egocentric Vision via Vision Language Models

👁️Computer Vision Academic

NVlabs/Eagle: Eagle: Frontier Vision-Language Models with Data-Centric Strategies

🔍Fine-Grained Classification Code

Mi50 32GB / GFX906 - vLLM Qwen 3.5 Configuration for Qwen 3.5:9B AWQ-4bit

huggingface.co · r/LocalLLaMA

A generalist biomedical vision-language model via multi-CLIP knowledge distillation

🏷️Label Noise Academic

SpaceX IPO hype is massive — and especially dangerous for investors over 50

🛰️Geospatial AI

marketwatch.com

Vibe Coding Specificity Foundation Models

🧠LLMs Academic

A new chapter of efficient foundation models for medical imaging

techcommunity.microsoft.com

OpenCV 5.0 Released With Rewritten DNN Engine, Built-In LLM & VLM Support

👁️Computer Vision

phoronix.com · Hacker News

Can robots read the room?

🧠LLMs News Academic

news.cornell.edu

OpenCV 5 release - New DNN engine with enhanced ONNX and LLM/VLM support, Intel, Arm, and RISC-V hardware optimizations - CNX Software

👁️Computer Vision News

cnx-software.com

openpilot 0.11.1

✍️Prompt Engineering Blog

blog.comma.ai

NVIDIA's Cosmos 3: The World's First Fully Open AI Omnimodel

🤖AI Agents News

aimagazine.com

Adapting Vision-Language Models from Iconic to Inclusive for Multi-Label Recognition Without Labels

🏷️Label Noise Academic

ApertureLab · Synthetic Aperture Sonar Simulator

👁️Computer Vision

gergltd.com · Hacker News

OpenCV 5 Is Here: The Biggest Leap in Years for Computer Vision

👁️Computer Vision

opencv.org · Hacker News

Disquiet Junto Project 0754: The Blip

✍️Prompt Engineering

Apple Reveals New AI Architecture Built Around Google Gemini Models

🛰️Geospatial AI News

macrumors.com · Hacker News

dimitrisdimitrov5-blip/Phantomix: The open-source AI browser agent. Free alternative to OpenAI Operator.

🔌MCP Code

github.com · Hacker News

Mbodi AI (YC P25) Is Hiring Founding Machine Learning Engineer (Robotics)

ycombinator.com · Hacker News

MSUE: Multi-Modal Soccer Understanding Expert

🔍Fine-Grained Classification Academic
