Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Generative AI
🎨 Generative AI
Diffusion Models, Image Generation, Video Synthesis, Multimodal AI
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
141
posts in
9.0
ms
UniCanvas: A
Diffusion-base
Unified
Model
for
Text-in-Image
Joint Generation
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for UniCanvas: A Diffusion-base Unified Model for Text-in-Image Joint Generation
How
Image
Generation
Actually Works
🎲
Procedural Generation
pub.towardsai.net
·
3d
3 days ago
Actions for How Image Generation Actually Works
Evaluating the Representation
Space
of
Diffusion
Models
via Self-Supervised Principles
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Evaluating the Representation Space of Diffusion Models via Self-Supervised Principles
Data assimilation for subsurface flow using
latent
diffusion
model
parameterization: performance of ensemble-Kalman and Monte Carlo techniques
🎲
Bayesian Inference
Content type:
Academic
arxiv.org
·
18h
18 hours ago
Actions for Data assimilation for subsurface flow using latent diffusion model parameterization: performance of ensemble-Kalman and Monte Carlo techniques
TrioPose: Native Triple-Stream
Diffusion
Transformers for Pose-Guided
Text-to-Image
Generation
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for TrioPose: Native Triple-Stream Diffusion Transformers for Pose-Guided Text-to-Image Generation
Optimality of FSQ Tokens for Continuous
Diffusion
for Categorical Data with Application to
Text-to-Speech
🧠
LLM
Content type:
Academic
arxiv.org
·
18h
18 hours ago
Actions for Optimality of FSQ Tokens for Continuous Diffusion for Categorical Data with Application to Text-to-Speech
Unified Safe In-context
Image
Generation
in
Multimodal
Diffusion Transformers via Restricting Unsafe Information Flows
🔐
Cryptography
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Unified Safe In-context Image Generation in Multimodal Diffusion Transformers via Restricting Unsafe Information Flows
Efficient and Training-Free
Single-Image
Diffusion
Models
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
6d
6 days ago
·
Hacker News
Actions for Efficient and Training-Free Single-Image Diffusion Models
NutriMLLM:
Multimodal
Large Language
Models
for Dietary Micronutrient Analysis
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for NutriMLLM: Multimodal Large Language Models for Dietary Micronutrient Analysis
Late-Layer
Fusion is Enough: Dual-Path Vision Token Routing for
Multimodal
Large Language
Models
under Visual Saturation
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Late-Layer Fusion is Enough: Dual-Path Vision Token Routing for Multimodal Large Language Models under Visual Saturation
Diffusion
Models
for Adaptive Sequential Data
Generation
🛡️
Privacy Engineering
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for Diffusion Models for Adaptive Sequential Data Generation
STREAM: Stochastic Riemannian Flow Matching with Anisotropic Decoder for Digital Histopathology
Image
Generation
🎲
Procedural Generation
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for STREAM: Stochastic Riemannian Flow Matching with Anisotropic Decoder for Digital Histopathology Image Generation
Geometry-Aware Dataset Condensation for
Diffusion
Model
Training
🛡️
Privacy Engineering
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for Geometry-Aware Dataset Condensation for Diffusion Model Training
BLM-SGAN: Bidirectional Language
Modeling
for Semantic-Spatial
Text-to-Image
Generation
💬
NLP
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for BLM-SGAN: Bidirectional Language Modeling for Semantic-Spatial Text-to-Image Generation
FreeAnimate: Training-Free Human
Image
Animation with Preview-Guided Denoising
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for FreeAnimate: Training-Free Human Image Animation with Preview-Guided Denoising
Where Should Knowledge Enter? A Layered Framework for Knowledge Infusion in
Multimodal
Iterative
Generative
Mo
📚
Content Curation
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for Where Should Knowledge Enter? A Layered Framework for Knowledge Infusion in Multimodal Iterative Generative Mo
Less Is More: Training-Free Acceleration Framework of 3D
Diffusion
Models
for Low-Count PET Denoising via Global-Local Trajectory Reduction
🤖
LLM Inference
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Less Is More: Training-Free Acceleration Framework of 3D Diffusion Models for Low-Count PET Denoising via Global-Local Trajectory Reduction
The Invisible Hand of Physics: When
Video
Diffusion
Models
Know More Than They Show
👁️
Multimodal AI
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for The Invisible Hand of Physics: When Video Diffusion Models Know More Than They Show
Seeing is Believing: Aligning
Prompt
Rewriting with Visual Anchors for
Text-to-Image
Generation
🧠
LLM
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Seeing is Believing: Aligning Prompt Rewriting with Visual Anchors for Text-to-Image Generation
ZIPP:Zero-shot
Image
Personalization from Personas
🧠
LLM
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for ZIPP:Zero-shot Image Personalization from Personas
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help