Skip to main navigation Skip to search Skip to main content

Adapting MIMO video restoration networks to low latency constraints

Valéry Dewil, Zhe Zheng, Arnaud Barral, Lara Raad, Nao Nicolas, Ioannis Cassagne, Jean-Michel Morel, Gabriele Facciolo, Bruno Galerne, Pablo Arias

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

MIMO (multiple input, multiple output) approaches are a recent trend in neural network architectures for video restoration problems, where each network evaluation produces multiple output frames. The video is split into non-overlapping stacks of frames that are processed independently, resulting in a very appealing trade-off between output quality and computational cost. In this work we focus on the low-latency setting by limiting the number of available future frames. We find that MIMO architectures suffer from problems that have received little attention so far, namely (1) the performance drops significantly due to the reduced temporal receptive field, particularly for frames at the boundaries of the stack, (2) there are strong temporal discontinuities at stack transitions which induce a step-wise motion artifact. We propose two simple solutions to alleviate these problems: recurrence across MIMO stacks to boost the output quality by implicitly increasing the temporal receptive field, and overlapping of the output stacks to smooth the temporal discontinuity at stack transitions. These modifications can be applied to any MIMO architecture. We test them on three state-of-the-art video denoising networks with different computational cost. The proposed contributions result in a new state-of-the-art for low-latency networks, both in terms of reconstruction error and temporal consistency. As an additional contribution, we introduce a new benchmark consisting of drone footage that highlights temporal consistency issues that are not apparent in the standard benchmarks. © 2024. The copyright of this document resides with its authors.
Original languageEnglish
Title of host publication35th British Machine Vision Conference 2024
PublisherBritish Machine Vision Association, BMVA
Number of pages15
Publication statusPublished - Nov 2024
EventThe 35th British Machine Vision Conference (BMVC 2024) - Glasgow, United Kingdom
Duration: 25 Nov 202428 Nov 2024

Publication series

NameBritish Machine Vision Conference, BMVC

Conference

ConferenceThe 35th British Machine Vision Conference (BMVC 2024)
PlaceUnited Kingdom
CityGlasgow
Period25/11/2428/11/24

Funding

Work partially financed by DGA and FMJH PhD scholarships. It was also performed using HPC resources from GENCI-IDRIS (grants 2023-AD011014015 and AD011011801R3) and from the \u201CM\u00E9socentre\u201D computing center of CentraleSup\u00E9lec and ENS Paris-Saclay supported by CNRS and R\u00E9gion \u00CEle-de-France. Centre Borelli is also with Universit\u00E9 Paris Cit\u00E9, SSA and INSERM.

Fingerprint

Dive into the research topics of 'Adapting MIMO video restoration networks to low latency constraints'. Together they form a unique fingerprint.

Cite this