Video Prediction

Feeds to Scour
SubscribedAll
Scoured 34 posts in 6.7 ms

WorldFly: A World-Model-Based Vision-Language-Action Model for UAV Navigation

 🌍World Model  Content type: Academic
arxiv.org·

BiWM: Advancing Open-Source Interactive Video World Models with Bidirectional Autoregression

 🔭Novel View Synthesis  Content type: Academic
arxiv.org·

Targeting World Models to Compromise Robot Learning Pipelines

 🌍World Model  Content type: Academic
arxiv.org·

Business World Model

 🌍World Model  Content type: Academic
arxiv.org·

Latent Spatial Memory for Video World Models

 🔭Novel View Synthesis  Content type: Academic
arxiv.org·

One Lens, Many Worlds : A Capability-Typed Interface for World-Model Interpretability

 🌍World Model  Content type: Academic
arxiv.org·

Towards World Models in Biomedical Research

 🌍World Model  Content type: Academic
arxiv.org·

Unifying Object-Centric World Models and Diffusion Policy: A Hierarchical Framework for Multi-Stage Robotic Tasks

 🌍World Model  Content type: Academic
arxiv.org·

WorldOlympiad: Can Your World Model Survive a Triathlon?

 🎬Video/3D/4D generation  Content type: Academic
arxiv.org·

PRISM: PRior-guided Imagination Sampling in world Models

 🔭Novel View Synthesis  Content type: Academic
arxiv.org·

Data assimilation for subsurface flow using latent diffusion model parameterization: performance of ensemble-Kalman and Monte Carlo techniques

 🌍World Model  Content type: Academic
arxiv.org·

UniCanvas: A Diffusion-base Unified Model for Text-in-Image Joint Generation

 🔭Novel View Synthesis  Content type: Academic
arxiv.org·

ATM: Action-Consistency Transfer Matrix for Diagnosing and Improving Latent World Models

 🌍World Model  Content type: Academic
arxiv.org·

Monte Carlo Pass Search: Using Trajectory Generation for 3D Counterfactual Pass Evaluation in Football

 🔭Novel View Synthesis  Content type: Academic
arxiv.org·

Prisma-World: Camera-Controllable Multi-Agent Video World Model

 🔭Novel View Synthesis  Content type: Academic
arxiv.org·

LiAuto-GeoX: Efficient Grounded Driving Transformer

 🔭Novel View Synthesis  Content type: Academic
arxiv.org·

What Makes Video World Model Latents Action-Relevant: Prediction over Reconstruction

 🔭Novel View Synthesis  Content type: Academic
arxiv.org·

Bypassing Copyright Protection in Diffusion-based Customization via Two-Stage Latent Feature Optimization

 🔭Novel View Synthesis  Content type: Academic
arxiv.org·

DisCo: World Models with Discrete Camera Motion Control

 🔭Novel View Synthesis  Content type: Academic
arxiv.org·

ReflectiChain: Epistemic Grounding in LLM-Driven World Models for Supply Chain Resilience

 🌍World Model  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help