Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Generative AI
🎨 Generative AI
Image Generation, Video Generation, Multimodal AI
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
226
posts in
6.8
ms
Kingdom Hearts IV is coming to Switch 2, PlayStation 5, Xbox Series X, and PC
🎨
Generative Art
rpgsite.net
·
1d
1 day ago
Actions for Kingdom Hearts IV is coming to Switch 2, PlayStation 5, Xbox Series X, and PC
Video-Rate
Streaming Stylization on a Vision-Aware MLLM-Conditioned Edit
Diffusion
: Asymmetric Batched Inference on a Distilled UNet + MLLM
Text
Encoder
💬
Natural Language Processing
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Video-Rate Streaming Stylization on a Vision-Aware MLLM-Conditioned Edit Diffusion: Asymmetric Batched Inference on a Distilled UNet + MLLM Text Encoder
BLM-SGAN: Bidirectional Language
Modeling
for Semantic-Spatial
Text-to-Image
Generation
💬
Natural Language Processing
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for BLM-SGAN: Bidirectional Language Modeling for Semantic-Spatial Text-to-Image Generation
Breaking the Lock-in: Diversifying
Text-to-Image
Generation
via Representation Modulation
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
3d
3 days ago
Actions for Breaking the Lock-in: Diversifying Text-to-Image Generation via Representation Modulation
Seeing is Believing: Aligning Prompt Rewriting with Visual Anchors for
Text-to-Image
Generation
💬
Prompt Engineering
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Seeing is Believing: Aligning Prompt Rewriting with Visual Anchors for Text-to-Image Generation
STEDiff: Strengthening
Text
Embedding for
Text-to-Image
Alignment in
Diffusion
Model
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for STEDiff: Strengthening Text Embedding for Text-to-Image Alignment in Diffusion Model
NutriMLLM:
Multimodal
Large Language
Models
for Dietary Micronutrient Analysis
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for NutriMLLM: Multimodal Large Language Models for Dietary Micronutrient Analysis
Faithful, Enriched, and Precise: Benchmarking Natural-Science Illustration
Generation
by T2I
models
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Faithful, Enriched, and Precise: Benchmarking Natural-Science Illustration Generation by T2I models
Customization under Fire: Plugin Poisoning in
Text-to-Image
Ecosystem
🚀
Indie Hacking
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Customization under Fire: Plugin Poisoning in Text-to-Image Ecosystem
ZIPP:Zero-shot
Image
Personalization from Personas
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for ZIPP:Zero-shot Image Personalization from Personas
Can We Predict The Human Preference For
Text-to-Image
Content Prior To
Generation
And Is It Even Useful To Do So?
🎨
Generative Art
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Can We Predict The Human Preference For Text-to-Image Content Prior To Generation And Is It Even Useful To Do So?
Late-Layer
Fusion is Enough: Dual-Path Vision Token Routing for
Multimodal
Large Language
Models
under Visual Saturation
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Late-Layer Fusion is Enough: Dual-Path Vision Token Routing for Multimodal Large Language Models under Visual Saturation
Conditional Vendi Score: Prompt-Aware Diversity Evaluation for
Generative
AI
Models
and LLMs
💻
Operating Systems
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Conditional Vendi Score: Prompt-Aware Diversity Evaluation for Generative AI Models and LLMs
sketch-plot: Progressive Editing for
Text-to-Image
Academic Figures
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for sketch-plot: Progressive Editing for Text-to-Image Academic Figures
Assessing the Geographic Diversity of
AI
's Platial Representations in
Image
Generation
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Assessing the Geographic Diversity of AI's Platial Representations in Image Generation
Consistent-Inversion: Reverse Consistency Guidance for Structure-Preserving Visual Editing
🤖
AI Tools
Content type:
Academic
arxiv.org
·
3d
3 days ago
Actions for Consistent-Inversion: Reverse Consistency Guidance for Structure-Preserving Visual Editing
EditSSC: Toward Editable Semantic Occupancy Scenes with Unconditional
Diffusion
Models
🤖
AI Tools
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for EditSSC: Toward Editable Semantic Occupancy Scenes with Unconditional Diffusion Models
Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions
Can You Trust What You See? Human and
AI
Detection of Synthetic Legal Evidence
♊
Gemini
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Can You Trust What You See? Human and AI Detection of Synthetic Legal Evidence
OmniGen-AR: AutoRegressive
Any-to-Image
Generation
🎨
Generative Art
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for OmniGen-AR: AutoRegressive Any-to-Image Generation
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help