Multimodal AI

Feeds to Scour
SubscribedAll
Scoured 55 posts in 8.5 ms

Bringing the latest Gemini models to Apple developers

 🍎iOS  Content type: Video  Content type: News  Content type: Blog
blog.google
··Hacker News

OpenCV 5.0 Released With Rewritten DNN Engine, Built-In LLM & VLM Support

 💻Operating Systems
phoronix.com··Hacker News

Multimodal Browser AI with Transformers.js for Images and Speech

 💻Operating Systems

linzhiqiu/t2v_metrics: Evaluating text-to-image/video/3D models with VQAScore

 🎨Generative AI  Content type: Code
github.com··Hacker News

ApertureLab · Synthetic Aperture Sonar Simulator

 🎨Generative AI
gergltd.com··Hacker News

What TTS Throws Away

 🤖ChatGPT
amaldavid.com··Hacker News

Apple Reveals New AI Architecture Built Around Google Gemini Models

 Gemini  Content type: News
macrumors.com··Hacker News

OpenCV 5 Is Here: The Biggest Leap in Years for Computer Vision

 💻Operating Systems

Vibe Rounds Concept Document : Dr. Avinash Kumar Gupta : Free Download, Borrow, and Streaming

 🤖AI
archive.org··Hacker News

Google's latest DiffusionGemma open AI model comes with a 4x speed boost

 🤖AI Tools  Content type: News
arstechnica.com·

know the mother tongue of your LLMs

 🧠LLM

Apple’s new Siri AI knows when to shut up

 🤖ChatGPT  Content type: News
theverge.com
·

Slow Token, Fast Action – Learning in Robotics

 💬Natural Language Processing  Content type: Blog

New Apple feature automatically changes your compromised passwords

 🍎iOS  Content type: News
bleepingcomputer.com·
Less-relevant results

A Plea to the Labs: Let the Models Diagnose.

 🧠LLMs  Content type: Blog

Siri AI and the Latest in Apple Intelligence: The MacStories Overview

 🍎Apple
macstories.net·

Florian Brand, Prime Intellect research engineer, adopts Gemma 4 E4B 6-bit quantized as his primary local Mac LLM

 🤖AI  Content type: News
digg.com··Hacker News

Launch HN: Transload (YC P26) – Measuring freight items with CCTV

 🔄n8n  Content type: Discussion

Agentic Search Models with OpenSearch and Elasticsearch

 💬Prompt Engineering  Content type: Blog
bonsai.io··Hacker News

What is Agentic RAG? Building Multi-Agent Agentic RAG Systems

 🤖AI Tools
pub.towardsai.net
·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help