CLIP

Scoured 228 posts in 8.1 ms

Accreted Intelligence — it does your work, and every action makes it smarter

 💬NLP
accint.xyz··Hacker News

DAM-VLA: Decoupled Asynchronous Multimodal Vision Language Action model

 🤗Hugging Face  Content type: Academic
arxiv.org·

A New Electric Hypercar Just Packed 3,154 HP and a 550km/h Top Speed Into a Prototype GT - Yanko Design

 🗺️Product Management
yankodesign.com·

john-rocky/coreai-model-zoo: Community model zoo + knowledge base for Apple Core AI (iOS/macOS 27): Qwen3.5 & Gemma 4 converted end-to-end, verified on-device (iPhone 17 Pro GPU/ANE), conversion gotchas, custom Metal kernels, Swift runner

 🤖ai  Content type: Code
github.com··Hacker News

Pinterest Deepens AWS Partnership with US$4bn Cloud Deal

 🤖AI Engineering  Content type: News
aimagazine.com·

Adapting Vision-Language Models from Iconic to Inclusive for Multi-Label Recognition Without Labels

 🤗Hugging Face  Content type: Academic
arxiv.org·

How Desktop AI Hubs Could Deflect Over 56.23 TWh of Industrial Data Center Load by 2035

 🧠LLMs
futurumgroup.com·

OpenCV 5.0 Computer Vision Library Released with Rewritten DNN Engine

 🧠LLM Inference
linuxiac.com·

LAST: Bridging Vision-Language and Action Manifolds via Gromov-Wasserstein Alignment

 🤗Hugging Face  Content type: Academic
arxiv.org·

I made a zero cost browser-use tool – let AI click and type on webpages for you

 🧠LLM Inference  Content type: Code
github.com··Hacker News

The Sequence Radar #873: Last Week in AI: Soccer, S-1s, and Supermodels

 🤖AI  Content type: News  Content type: Blog

Robotics will not have a clean Llama moment

 🧠LLMs
therobotreport.com·

RoboProcessBench: Benchmarking Process-Aware Understanding in Vision-Language Robotic Manipulation

 🤖AI Engineering  Content type: Academic
arxiv.org·

From Traditional Automation to Embodied Wireless Intelligence: Vision-Language-Action Empowered Physics-Aware Communication Networks

 🤖AI Engineering  Content type: Academic
arxiv.org·

OpenCV Introduces New DNN Inference Engine

 🤖Machine Learning
i-programmer.info·

OpenMedQ: Broad Open Pretraining for Medical Vision-Language Models

 🤖Machine Learning  Content type: Academic
arxiv.org·

Can robots read the room?

 🤖AI  Content type: News  Content type: Academic
news.cornell.edu·

GIVE: Grounding Human Gestures in Vision-Language-Action Models

 🤗Hugging Face  Content type: Academic
arxiv.org·

A Dataset for Dynamic Human Preferences for Vision Language Models

 🤗Hugging Face  Content type: Academic
arxiv.org·

PP-OCRv6: From 1.5M to 34.5M Parameters, Surpassing Billion-Scale VLMs on OCR Tasks

 🔍RAG  Content type: Academic
arxiv.org·
