TEACHER-STUDENT LEARNING WITH MULTI-GRANULARITY CONSTRAINT TOWARDS COMPACT FACIAL FEATURE REPRESENTATION

Shurun Wang, Shiqi Wang*, Wenhan Yang, Xinfeng Zhang, Shanshe Wang, Siwei Ma

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

In this paper, we propose a novel end-to-end feature compression scheme by leveraging the representation and learning capability of deep neural networks, towards intelligent front-end equipped analysis with promising accuracy and efficiency. In particular, the extracted features are compactly coded in an end-to-end manner by optimizing the rate- distortion cost to achieve feature-in-feature representation. The multi-granularity constraint is further imposed, serving as the optimization objective to make the feature compression more "healthier" from the perspective of ultimate utility. More specifically, the analysis accuracy is considered in the coarse granularity level constraint, ensuring the capability of facial analysis with the reconstructed feature. Furthermore, at the fine granularity level the feature fidelity is involved to preserve the original feature quality. Moreover, a latent code level teacher-student enhancement model is proposed to efficiently transfer the low bit-rate representation into a high bit- rate one. Such a strategy further allows us to adaptively shift the representation cost to decoding computations, leading to more flexible feature compression with enhanced decoding capability. We verify the effectiveness of the proposed model with the facial feature, and experimental results reveal better compression performance in terms of rate-accuracy compared with existing models.
Original languageEnglish
Title of host publication2021 IEEE International Conference on Acoustics, Speech and Signal Processing
Subtitle of host publicationPROCEEDINGS
PublisherIEEE
Pages8503-8507
ISBN (Electronic)9781728176055
ISBN (Print)9781728176062
DOIs
Publication statusPublished - Jun 2021
Event2021 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2021) - Virtual, Toronto, Ontario, Canada
Duration: 6 Jun 202111 Jun 2021

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)1520-6149
ISSN (Electronic)2379-190X

Conference

Conference2021 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2021)
PlaceCanada
CityToronto, Ontario
Period6/06/2111/06/21

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 4 - Quality Education
    SDG 4 Quality Education

Research Keywords

  • Feature compression
  • deep learning
  • teacher-student network
  • multi-granularity

Fingerprint

Dive into the research topics of 'TEACHER-STUDENT LEARNING WITH MULTI-GRANULARITY CONSTRAINT TOWARDS COMPACT FACIAL FEATURE REPRESENTATION'. Together they form a unique fingerprint.

Cite this