A Hierarchical Taxonomy For Deep State Space Models

Shiqin Tang, Pengxing Feng, Shujian Yu, Yining Dong, S. Joe Qin

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Modeling nonlinear dynamical systems is a challenging task in fields such as speech processing, music generation, and video prediction. This paper introduces a hierarchical framework for Deep State Space Models (DSSMs), categorizing them by their conditional independence properties and Markov assumptions and positioning existing models within this framework, including the Stochastic Recurrent Neural Network (SRNN), Variational Recurrent Neural Network (VRNN), and Recurrent State Space Model (RSSM). We discuss different options for the inference networks and demonstrate how integrating normalizing flows can enhance model flexibility by capturing complex distributions. Our work not only clarifies the relationships among existing models but also paves the way for the development of new, more effective approaches for modeling nonlinear dynamics. In particular, we propose the Autoregressive State Space Model (ArSSM) and evaluate its effectiveness in speech and polyphonic music modeling tasks. © 2025 IEEE.
Original languageEnglish
Title of host publicationProceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
PublisherIEEE
ISBN (Electronic)979-8-3503-6874-1
ISBN (Print)979-8-3503-6875-8
DOIs
Publication statusPublished - 2025
Event50th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025) - Hyderabad International Convention Centre, Hyderabad, India
Duration: 6 Apr 202511 Apr 2025
https://2025.ieeeicassp.org/

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)1520-6149
ISSN (Electronic)2379-190X

Conference

Conference50th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025)
Abbreviated titleICASSP2025
Country/TerritoryIndia
CityHyderabad
Period6/04/2511/04/25
Internet address

Research Keywords

  • Deep State Space Models
  • Dynamical Variational Autoencoders
  • Hierarchical Taxonomy
  • Normalizing Flows

Fingerprint

Dive into the research topics of 'A Hierarchical Taxonomy For Deep State Space Models'. Together they form a unique fingerprint.

Cite this