Skip to main navigation Skip to search Skip to main content

Neural-Driven Multi-Band Processing for Automatic Equalization and Style Transfer

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

33 Downloads (CityUHK Scholars)

Abstract

We present a Neural-Driven Multi-Band Processor (NDMP), an innovative audio processing framework that augments a static six-band Parametric Equalizer (PEQ) with per-band compression, optimized via neural inference for two tasks: Automatic Equalization (AutoEQ) and Production Style Transfer (NDMP-ST). NDMP is trained using a self-supervised strategy where reference audio is degraded with random NDMP parameters and gain, eliminating the need for paired input-target data. As a fully differentiable operator, NDMP supports end-to-end training with audio-domain losses. The model learns to predict tonal and dynamic parameters that match the degraded input to the reference. At inference, AutoEQ adjusts tonal balance for unseen inputs without a reference, while NDMP-ST imposes production-style characteristics by predicting tailored parameters. We evaluated the MUSDB18 dataset with objective metrics (e.g., SI-SDR, SNR, PESQ, STOI) and listening tests and NDMP consistently outperforms traditional PEQ. NDMP offers an efficient and robust neural enhancement framework for modern audio production, enriching static processing with learned dynamic control.

© 2025 Parakrant Sarkar et al.
Original languageEnglish
Title of host publicationProceedings of the 28th International Conference on Digital Audio Effects (DAFx25)
PublisherDigital Audio Effects (DAFx)
Pages382-389
Publication statusPublished - Sept 2025
Event28th International Conference on Digital Audio Effects - Mole Vanvitelliana, Ancona, Italy
Duration: 2 Sept 20255 Sept 2025
https://dafx25.dii.univpm.it/

Publication series

NameDAFx proceedings series
ISSN (Print)2413-6700
ISSN (Electronic)2413-6689

Conference

Conference28th International Conference on Digital Audio Effects
Abbreviated titleDAFx25
PlaceItaly
CityAncona
Period2/09/255/09/25
Internet address

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

Fingerprint

Dive into the research topics of 'Neural-Driven Multi-Band Processing for Automatic Equalization and Style Transfer'. Together they form a unique fingerprint.

Cite this