Sound Sharing and Retrieval

Frederic Font, Gerard Roma, Xavier Serra

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Multimedia sharing has experienced an enormous growth in recent years, and sound sharing has not been an exception. Nowadays one can find online sound sharing sites in which users can search, browse, and contribute large amounts of audio content such as sound effects, field and urban recordings, music tracks, and music samples. This poses many challenges to enable search, discovery, and ultimately reuse of this content. In this chapter we give an overview of different ways to approach such challenges. We describe how to build an audio database by outlining different aspects to be taken into account. We discuss metadata-based descriptions of audio content and different searching and browsing techniques that can be used to navigate the database. In addition to metadata, we show sound retrieval techniques based on the extraction of audio features from (possibly) unannotated audio. We end the chapter by discussing advanced approaches to sound retrieval and by drawing some conclusions about present and future of sound sharing and retrieval. In addition to our explanations, we provide code examples that illustrate some of the concepts discussed.
Original languageEnglish
Title of host publicationComputational Analysis of Sound Scenes and Events
EditorsTuomas Virtanen, Mark Plumbley, Dan Ellis
PublisherSpringer, Cham
Pages279-301
Number of pages23
ISBN (Electronic)9783319634500
ISBN (Print)9783319634494
Publication statusPublished - 2018

    Fingerprint

Cite this

Font, F., Roma, G., & Serra, X. (2018). Sound Sharing and Retrieval. In T. Virtanen, M. Plumbley, & D. Ellis (Eds.), Computational Analysis of Sound Scenes and Events (pp. 279-301). Springer, Cham.