Diffusion Models

Latent Diffusion Policy: Shaping Latent Spaces for Diffusion-Based Robotic Manipulation

 🤖Embodied AI  Content type: Academic
arxiv.org·

Continuous Language Diffusion as a Decoder-Interface Problem

 🔀Multimodal AI  Content type: Academic
arxiv.org·

ARAPDiffusion: ARAP Regularization for Diffusion-Based Deformable Shape Space Learning

 👁️Computer Vision  Content type: Academic
arxiv.org·

Less-relevant results

Spectrally Regularized Latent Flow Matching for Turbulence Generation

 🫧Gaussian Splatting  Content type: Academic
arxiv.org·

Holding the FP8 Quality Ceiling at 8-Bit Weights and Activations: INT8 and GGUF Post-Training Quantization of Ideogram 4.0 for Consumer GPUs

 NeRF  Content type: Academic
arxiv.org·

Diffusion Models for Adaptive Sequential Data Generation

 🧊3D Generation  Content type: Academic
arxiv.org·

Echo-DM: Ultrasound Marker Removal via Conditional Latent Diffusion and Region-Aware Fusion

 👁️Computer Vision  Content type: Academic
arxiv.org·

HyFAD: Hybrid Time-Frequency Diffusion with Frequency-Aware Embedding for Time Series Imputation

 🗂️Semantic Segmentation  Content type: Academic
arxiv.org·

No Free Lunch for Synthetic Images under Data Scarcity Conditions

 👁️Computer Vision  Content type: Academic
arxiv.org·

The Score Hamiltonian: Mapping Diffusion Models to Adiabatic Transport

 🫧Gaussian Splatting  Content type: Academic
arxiv.org·

Show HN: Magenta Real-Time Music Generation on iPhone, Without the GPU

 🧊3D Generation  Content type: Code
github.com · Hacker News

Consistent-Inversion: Reverse Consistency Guidance for Structure-Preserving Visual Editing

 👁️Computer Vision  Content type: Academic
arxiv.org·

Anchor-Conditioned Compositional Control for Landscape Image Generation

 👁️Computer Vision  Content type: Academic
arxiv.org·

Flash-WAM: Modality-Aware Distillation for World Action Models

 👁️Computer Vision  Content type: Academic
arxiv.org·

Test-time Adversarial Takeover: A Real-time Hijacking Interface against Robotic Diffusion Policies

 🤖Embodied AI  Content type: Academic
arxiv.org·

NSVQ: Mitigating Codebook Collapse by Stabilizing Encoder Drift in Vector Quantization

 👁️Computer Vision  Content type: Academic
arxiv.org·

STREAM: Stochastic Riemannian Flow Matching with Anisotropic Decoder for Digital Histopathology Image Generation

 👁️Computer Vision  Content type: Academic
arxiv.org·

Mean Flow Distillation: Robust and Stable Distillation for Flow Matching Models

 👁️Computer Vision  Content type: Academic
arxiv.org·

tetherto/qvac: QVAC - Local AI SDK and libraries for building private, cross-platform, peer-to-peer AI applications. Run LLMs, speech-to-text, translation, and more locally on Linux, macOS, Windows, Android, and iOS.

 🔀Multimodal AI  Content type: Code
github.com·

Speech Meets ELF: Audio Conditional Continuous-Target Diffusion for Speech Recognition and Translation

 🔀Multimodal AI  Content type: Academic
arxiv.org·