👁️ Multimodal AI - jobz · Scour

BioCoach uses AI and biomechanics to give real-time exercise feedback at home

thebrighterside.news · r/artificial

mtmd : add video input support by ngxson · Pull Request #24269 · ggml-org/llama.cpp

🗄️Vector Databases Code

github.com · r/LocalLLaMA

Florida's lawsuit against OpenAI and CEO Altman treats ChatGPT as a defective product and public nuisance

⚖️AI Governance

the-decoder.com

VL-DINO: Leveraging CLIP Vision-Language Knowledge for Open-Vocabulary Object Detection

🗄️Vector Databases Academic

Scott Bessent says America's in a 'manufacturing renaissance' and Wall Street largely agrees. So where are the jobs?

💎Token Economics

Are Reasoning Vision-Language Models Robust to Semantic Visual Distractions?

🧠Reasoning Models Academic

A Blip on a Telescope in a Colorado Parking Lot Bolstered a Space Mission That Has Found Thousands of Planets … and Counting

smithsonianmag.com

A primordial black hole nicknamed ‘Phoebe’ may help solve the mystery of dark matter

scientificamerican.com

OmniGameArena: A Unified UE5 Benchmark for VLM Game Agents with Improvement Dynamics

💾Agent Memory Academic

World Model Self-Distillation: Training World Models to Solve General Tasks

🎛️Fine-tuning Academic

fix(sessions): preserve user model override across daily/idle rollove… · openclaw/openclaw@8e81bf7

🔌MCP Code

MLingualFC: Evaluating Jailbreak Vulnerabilities in Multilingual Vision-Language Models

🧠LLMs Academic

Two Bridges, One Pathway: From VLMs to Generalizable VLAs with Embodied Trajectory-Coupled Data

🎯Reinforcement Learning Academic

fix: preserve Foundry Responses reasoning replay ids · openclaw/openclaw@248dfb2

✍️Prompt Engineering Code

MSUE: Multi-Modal Soccer Understanding Expert

📊Model Evaluation Academic

The Last Visible Pixel: Probing Fine-Scale Perception in Vision-Language Models

⚡Inference Academic

Vision Language Model Helps Private Information De-Identification in Vision Data

🎛️Fine-tuning Academic

Decoding Pedestrian Crossing Intention from Egocentric Vision via Vision Language Models

🌐World Models Academic

CheXanatomy: Anatomy-Aware Vision-Language Modeling for Chest Radiographs

🗄️Vector Databases Academic

OpenMedReason: Scientific Reasoning Supervision for Medical Vision-Language Models

🧠Reasoning Models Academic
