Sound Sharing and Retrieval

Frederic Font, Gerard Roma, Xavier Serra

Research output: Chapter in Book/Report/Conference proceedingChapter

3 Citations (Scopus)

Abstract

Multimedia sharing has experienced an enormous growth in recent years, and sound sharing has not been an exception. Nowadays one can find online sound sharing sites in which users can search, browse, and contribute large amounts of audio content such as sound effects, field and urban recordings, music tracks, and music samples. This poses many challenges to enable search, discovery, and ultimately reuse of this content. In this chapter we give an overview of different ways to approach such challenges. We describe how to build an audio database by outlining different aspects to be taken into account. We discuss metadata-based descriptions of audio content and different searching and browsing techniques that can be used to navigate the database. In addition to metadata, we show sound retrieval techniques based on the extraction of audio features from (possibly) unannotated audio. We end the chapter by discussing advanced approaches to sound retrieval and by drawing some conclusions about present and future of sound sharing and retrieval. In addition to our explanations, we provide code examples that illustrate some of the concepts discussed.
Original languageEnglish
Title of host publicationComputational Analysis of Sound Scenes and Events
EditorsTuomas Virtanen, Mark Plumbley, Dan Ellis
PublisherSpringer, Cham
Chapter10
Pages279-301
Number of pages23
Edition1st
ISBN (Electronic)9783319634500
ISBN (Print)9783319634494, 9783319875590
DOIs
Publication statusPublished - 22 Sep 2017

Fingerprint Dive into the research topics of 'Sound Sharing and Retrieval'. Together they form a unique fingerprint.

  • Cite this

    Font, F., Roma, G., & Serra, X. (2017). Sound Sharing and Retrieval. In T. Virtanen, M. Plumbley, & D. Ellis (Eds.), Computational Analysis of Sound Scenes and Events (1st ed., pp. 279-301). Springer, Cham. https://doi.org/10.1007/978-3-319-63450-0_10