modeling the conditional entropy between frames. Unlike prior learning-based approaches,
we reduce complexity by not performing any form of explicit transformations between frames
and assume each frame is encoded with an independent state-of-the-art deep image
compressor. We first show that a simple architecture modeling the entropy between the
image latent codes is as competitive as other neural video compression works and video …