A General Framework for Visualization of Sound Collections in Musical Interfaces

Gerard Roma, Anna Xambó, Owen Green, Pierre Alexandre Tremblay

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)


While audio data play an increasingly central role in computer-based music production, interaction with large sound collections in most available music creation and production environments is very often still limited to scrolling long lists of file names. This paper describes a general framework for devising interactive applications based on the content-based visualization of sound collections. The proposed framework allows for a modular combination of different techniques for sound segmentation, analysis, and dimensionality reduction, using the reduced feature space for interactive applications. We analyze several prototypes presented in the literature and describe their limitations. We propose a more general framework that can be used flexibly to devise music creation interfaces. The proposed approach includes several novel contributions with respect to previously used pipelines, such as using unsupervised feature learning, content-based sound icons, and control of the output space layout. We present an implementation of the framework using the SuperCollider computer music language, and three example prototypes demonstrating its use for data-driven music interfaces. Our results demonstrate the potential of unsupervised machine learning and visualization for creative applications in computer music.

Original languageEnglish
Article number11926
Number of pages22
JournalApplied Sciences (Switzerland)
Issue number24
Publication statusPublished - 15 Dec 2021


Dive into the research topics of 'A General Framework for Visualization of Sound Collections in Musical Interfaces'. Together they form a unique fingerprint.

Cite this