Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
✨ Gemini
Specific
Google AI, Multimodal Models, Large Context, API Access
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
24672
posts in
77.2
ms
OpenVLThinkerV2
: A
Generalist
Multimodal Reasoning Model for Multi-domain Visual Tasks
🔮
pplx-embed-v1
arxiv.org
·
1d
Google AI Edge
Gallery
📱
Edge AI Optimization
simonwillison.net
·
5d
Google’s Gemini AI can answer your questions with 3D models and
simulations
🆕
New AI
theverge.com
·
1d
·
r/artificial
Recall
– local multimodal
semantic
search for your files
🎨
Chroma
github.com
·
5d
·
Hacker News
Google says the Gemini app can now generate interactive 3D models and
simulations
; users must select the Pro model in the prompt bar (Emma
Roth/The
Verge)
🆕
New AI
techmeme.com
·
1d
Multimodal
Embedding
&
Reranker
Models with Sentence Transformers
✖️
Cross-encoders
huggingface.co
·
2d
Dreaming
in Google AI
🔬
AI Labs
mleddy.blogspot.com
·
4d
·
Blogger
MTA-Agent
: An Open
Recipe
for Multimodal Deep Search Agents
🔗
Hybrid Search
arxiv.org
·
2d
PanLUNA
: An Efficient and Robust Query-Unified Multimodal Model for Edge
Biosignal
Intelligence
📱
Edge AI Optimization
arxiv.org
·
4d
Uni-ViGU
: Towards Unified Video Generation and Understanding via A Diffusion-Based Video Generator
📦
Batch Embeddings
arxiv.org
·
1d
Multimodal
Latent
Reasoning via Predictive
Embeddings
🔮
pplx-embed-v1
arxiv.org
·
1d
Firebolt-VL
: Efficient Vision-Language Understanding with Cross-Modality Modulation
🧠
LLM Inference
arxiv.org
·
4d
HiRO-Nav
: Hybrid ReasOning Enables Efficient Embodied Navigation
🧠
LLM Inference
arxiv.org
·
1d
Tree-of-Evidence: Efficient "System 2" Search for
Faithful
Multimodal
Grounding
🧠
LLM Inference
arxiv.org
·
1d
PRIME: Prototype-Driven Multimodal Pretraining for Cancer
Prognosis
with Missing
Modalities
📊
Embeddings
arxiv.org
·
3d
Fundus-R1
: Training a
Fundus-Reading
MLLM
with Knowledge-Aware Reasoning on Public Data
🎨
Chroma
arxiv.org
·
1d
Scalable and
Explainable
Learner-Video
Interaction Prediction using Multimodal Large Language Models
🎯
Semantic Tokens
arxiv.org
·
4d
Multimodal
Large Language Models for
Multi-Subject
In-Context Image Generation
📦
Batch Embeddings
arxiv.org
·
1d
ARIA
: Adaptive
Retrieval
Intelligence Assistant -- A Multimodal RAG Framework for Domain-Specific Engineering Education
🔄
LLM RAG Pipelines
arxiv.org
·
2d
Generative AI for Video Trailer Synthesis: From
Extractive
Heuristics
to Autoregressive Creativity
🏗️
LLM Infrastructure
arxiv.org
·
3d
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help