Real-Time View Synthesis with Multiplane Image Network using Multimodal Supervision

1Mid Sweden University 2Technical University Berlin 3HTW Berlin - University of Applied Sciences

RT-MPINet Generates MPIs from Single Image

Abstract

We present a real-time multiplane image (MPI) network. Unlike existing MPI based approaches that often rely on a separate depth estimation network to guide the network for estimating MPI parameters, our method directly predicts these parameters from a single RGB image. To guide the network we present a multimodal training strategy utilizing joint supervision from view synthesis and depth estimation losses. More details can be found in the paper.

In-the-wild Examples

Click to open interactive Viewer

Movement mode:

Results

COCO Dataset: Different View Synthesis Methods Against Ours

The visual comparison against SinMPI, TMPI, and AdaMPI.

Ours TMPI
Ours AdaMPI
Ours SinMPI
Ours TMPI
Ours AdaMPI
Ours SinMPI

FPS Rate on RTX 2070 Super

We compare the FPS rate on different resolutions against other methods when rendering end-to-end.

Note: When rendering from predicted MPIs, the rendering speed will be same for all methods.

BibTeX

Will be added later