Generative AI

Feeds to Scour
SubscribedAll
Scoured 141 posts in 9.0 ms

UniCanvas: A Diffusion-base Unified Model for Text-in-Image Joint Generation

 👁️Multimodal AI  Content type: Academic
arxiv.org·

How Image Generation Actually Works

 🎲Procedural Generation
pub.towardsai.net
·

Evaluating the Representation Space of Diffusion Models via Self-Supervised Principles

 👁️Multimodal AI  Content type: Academic
arxiv.org·

Data assimilation for subsurface flow using latent diffusion model parameterization: performance of ensemble-Kalman and Monte Carlo techniques

 🎲Bayesian Inference  Content type: Academic
arxiv.org·

TrioPose: Native Triple-Stream Diffusion Transformers for Pose-Guided Text-to-Image Generation

 👁️Multimodal AI  Content type: Academic
arxiv.org·

Optimality of FSQ Tokens for Continuous Diffusion for Categorical Data with Application to Text-to-Speech

 🧠LLM  Content type: Academic
arxiv.org·

Unified Safe In-context Image Generation in Multimodal Diffusion Transformers via Restricting Unsafe Information Flows

 🔐Cryptography  Content type: Academic
arxiv.org·

Efficient and Training-Free Single-Image Diffusion Models

 👁️Multimodal AI  Content type: Academic
arxiv.org··Hacker News

NutriMLLM: Multimodal Large Language Models for Dietary Micronutrient Analysis

 👁️Multimodal AI  Content type: Academic
arxiv.org·

Late-Layer Fusion is Enough: Dual-Path Vision Token Routing for Multimodal Large Language Models under Visual Saturation

 👁️Multimodal AI  Content type: Academic
arxiv.org·

Diffusion Models for Adaptive Sequential Data Generation

 🛡️Privacy Engineering  Content type: Academic
arxiv.org·

STREAM: Stochastic Riemannian Flow Matching with Anisotropic Decoder for Digital Histopathology Image Generation

 🎲Procedural Generation  Content type: Academic
arxiv.org·

Geometry-Aware Dataset Condensation for Diffusion Model Training

 🛡️Privacy Engineering  Content type: Academic
arxiv.org·

BLM-SGAN: Bidirectional Language Modeling for Semantic-Spatial Text-to-Image Generation

 💬NLP  Content type: Academic
arxiv.org·

FreeAnimate: Training-Free Human Image Animation with Preview-Guided Denoising

 👁️Multimodal AI  Content type: Academic
arxiv.org·

Where Should Knowledge Enter? A Layered Framework for Knowledge Infusion in Multimodal Iterative Generative Mo

 📚Content Curation  Content type: Academic
arxiv.org·

Less Is More: Training-Free Acceleration Framework of 3D Diffusion Models for Low-Count PET Denoising via Global-Local Trajectory Reduction

 🤖LLM Inference  Content type: Academic
arxiv.org·

The Invisible Hand of Physics: When Video Diffusion Models Know More Than They Show

 👁️Multimodal AI  Content type: Academic
arxiv.org·

Seeing is Believing: Aligning Prompt Rewriting with Visual Anchors for Text-to-Image Generation

 🧠LLM  Content type: Academic
arxiv.org·

ZIPP:Zero-shot Image Personalization from Personas

 🧠LLM  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help