Skip to main navigation Skip to search Skip to main content

Large Audio AI Models for Fixed-Media Electronics in "Prelude: To Listening": The use of AI and traditional Chinese instruments in sonic art

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

83 Downloads (CityUHK Scholars)

Abstract

This paper explores the innovative integration of Large Audio AI Models (LMs) with traditional Chinese instruments in the concert version of the work "Prelude: To Listening". Through the use of AI-generated audio accompaniment for the instruments sanxian, sanshin, and Chinese percussions, the project pushes the boundaries of sonic art and mixed music. The study examines the creative potential of LMs, such as AudioLDM2 and Audiocraft , in generating novel sounds that deviate from exact replication, thereby offering fresh avenues for artistic expression. The paper highlights the conceptual innovation of using film subtitles as musical prompts, which imbues the audio with additional narrative and emotional layers. It also discusses the challenges of debugging LMs and the limitations of controlling audio features through text, emphasizing the need for more sophisticated methods to bridge text inputs and audio outputs. The performance, led by artist Ryo IKESHIRO and supported by a team of collaborators, showcases the potential of LMs in the evolution of experimental music and sound arts. The code and audio are available at https://github.com/prelude-to-listening/concert.
Original languageEnglish
Title of host publicationThe International Conference on AI and Musical Creativity (AIMC 2024)
Number of pages21
Publication statusPublished - Sept 2024
Event2024 International Conference on AI and Musical Creativity (AIMC 2024) - University of Oxford, Oxford, United Kingdom
Duration: 9 Sept 202411 Sept 2024

Conference

Conference2024 International Conference on AI and Musical Creativity (AIMC 2024)
PlaceUnited Kingdom
CityOxford
Period9/09/2411/09/24

Funding

The work for the performance by XXXXX and this paper by XXXXX and XXXXX was supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. CityU 21602822).

Research Keywords

  • Large Audio AI Models
  • audio generation
  • mixed music
  • fixed-media electronics
  • Sonic Art

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Large Audio AI Models for Fixed-Media Electronics in "Prelude: To Listening": The use of AI and traditional Chinese instruments in sonic art'. Together they form a unique fingerprint.

Cite this