TY - JOUR
T1 - DDLP
T2 - Unsupervised Object-centric Video Prediction with Deep Dynamic Latent Particles
AU - Daniel, Tal
AU - Tamar, Aviv
N1 - Publisher Copyright:
© 2024, Transactions on Machine Learning Research. All rights reserved.
PY - 2024
Y1 - 2024
N2 - We propose a new object-centric video prediction algorithm based on the deep latent particle (DLP) representation of Daniel & Tamar (2022a). In comparison to existing slot-or patch-based representations, DLPs model the scene using a set of keypoints with learned parameters for properties such as position and size, and are both efficient and interpretable. Our method, deep dynamic latent particles (DDLP), yields state-of-the-art object-centric video prediction results on several challenging datasets. The interpretable nature of DDLP allows us to perform “what-if” generation – predict the consequence of changing properties of objects in the initial frames, and DLP’s compact structure enables efficient diffusionbased unconditional video generation. Videos, code and pre-trained models are available: https://taldatech.github.io/ddlp-web/.
AB - We propose a new object-centric video prediction algorithm based on the deep latent particle (DLP) representation of Daniel & Tamar (2022a). In comparison to existing slot-or patch-based representations, DLPs model the scene using a set of keypoints with learned parameters for properties such as position and size, and are both efficient and interpretable. Our method, deep dynamic latent particles (DDLP), yields state-of-the-art object-centric video prediction results on several challenging datasets. The interpretable nature of DDLP allows us to perform “what-if” generation – predict the consequence of changing properties of objects in the initial frames, and DLP’s compact structure enables efficient diffusionbased unconditional video generation. Videos, code and pre-trained models are available: https://taldatech.github.io/ddlp-web/.
UR - http://www.scopus.com/inward/record.url?scp=85208973004&partnerID=8YFLogxK
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85208973004
VL - 2024
JO - Transactions on Machine Learning Research
JF - Transactions on Machine Learning Research
ER -