BestBlogs.dev

Google DeepMind Releases Gemini Omni: A Major Leap in Multimodal Understanding and Editing

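📌 One-Sentence Summary
Demis Hassabis announces Gemini Omni, a major advance in multimodal AI that can process video, audio, and images and build entirely new scenes.

📝 Summary
This tweet from Demis Hassabis announces Gemini Omni, Google DeepMind's new multimodal AI model. The model represents a major leap in world understanding and multimodal editing: it accepts photos, video, and audio as input and generates entirely new scenes. Hassabis emphasizes its ability to handle any input and any output, starting with video, and highlights the tool's interactive nature, which lets users supply their own videos and iterate on ideas. This marks a foundational step for Gemini Omni toward a more general AI interface.

📊 Article Meta
AI Screening: 92
Featured: Yes
Source: Demis Hassabis (@demishassabis)
Author: Demis Hassabis
Category: Artificial Intelligence
Language: English
Read Time: 2 min
Word Count: 294
Tags: Gemini ...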

