Title | ||
---|---|---|
Cycle Consistency Based Method for Learning Disentangled Representation for Stochastic Video Prediction |
Abstract | ||
---|---|---|
Video frame prediction is an interesting computer vision problem of predicting the future frames of a video sequence from a given set of context frames. Video prediction models have found wide-scale perspective applications in autonomous navigation, representation learning, and healthcare. However, predicting future frames is challenging due to the high dimensional and stochastic nature of video data. This work proposes a novel cycle consistency loss to disentangle video representation into a low dimensional time-dependent pose and time-independent content latent factors in two different VAE based video prediction models. The key motivation behind cycle consistency loss is that future frame predictions are more plausible and realistic if they reconstruct the previous frames. The proposed cycle consistency loss is also generic because it can be applied to other VAE-based stochastic video prediction architectures with slight architectural modifications. We validate our disentanglement hypothesis and the quality of long-range predictions on standard synthetic and challenging real-world datasets such as Stochastic Moving MNIST and BAIR. |
Year | DOI | Venue |
---|---|---|
2022 | 10.1007/978-3-031-06433-3_23 | IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT III |
Keywords | DocType | Volume |
Video frame prediction, Variational autoencoders, Cyclic consistency | Conference | 13233 |
ISSN | Citations | PageRank |
0302-9743 | 0 | 0.34 |
References | Authors | |
0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ujjwal Tiwari | 1 | 1 | 1.04 |
P. Aditya Sreekar | 2 | 0 | 0.34 |
Anoop Namboodiri | 3 | 0 | 0.34 |