Sound Sharing and Retrieval

Frederic Font, Gerard Roma, Xavier Serra

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Multimedia sharing has experienced an enormous growth in recent years, and sound sharing has not been an exception. Nowadays one can find online sound sharing sites in which users can search, browse, and contribute large amounts of audio content such as sound effects, field and urban recordings, music tracks, and music samples. This poses many challenges to enable search, discovery, and ultimately reuse of this content. In this chapter we give an overview of different ways to approach such challenges. We describe how to build an audio database by outlining different aspects to be taken into account. We discuss metadata-based descriptions of audio content and different searching and browsing techniques that can be used to navigate the database. In addition to metadata, we show sound retrieval techniques based on the extraction of audio features from (possibly) unannotated audio. We end the chapter by discussing advanced approaches to sound retrieval and by drawing some conclusions about present and future of sound sharing and retrieval. In addition to our explanations, we provide code examples that illustrate some of the concepts discussed.
LanguageEnglish
Title of host publicationComputational Analysis of Sound Scenes and Events
EditorsTuomas Virtanen, Mark Plumbley, Dan Ellis
PublisherSpringer, Cham
Pages279-301
Number of pages23
ISBN (Electronic)9783319634500
ISBN (Print)9783319634494
Publication statusPublished - 2018

Fingerprint

Acoustic waves
Metadata

Cite this

Font, F., Roma, G., & Serra, X. (2018). Sound Sharing and Retrieval. In T. Virtanen, M. Plumbley, & D. Ellis (Eds.), Computational Analysis of Sound Scenes and Events (pp. 279-301). Springer, Cham.
Font, Frederic ; Roma, Gerard ; Serra, Xavier. / Sound Sharing and Retrieval. Computational Analysis of Sound Scenes and Events. editor / Tuomas Virtanen ; Mark Plumbley ; Dan Ellis. Springer, Cham, 2018. pp. 279-301
@inbook{52db295a9f76476b9661d6e26be43d16,
title = "Sound Sharing and Retrieval",
abstract = "Multimedia sharing has experienced an enormous growth in recent years, and sound sharing has not been an exception. Nowadays one can find online sound sharing sites in which users can search, browse, and contribute large amounts of audio content such as sound effects, field and urban recordings, music tracks, and music samples. This poses many challenges to enable search, discovery, and ultimately reuse of this content. In this chapter we give an overview of different ways to approach such challenges. We describe how to build an audio database by outlining different aspects to be taken into account. We discuss metadata-based descriptions of audio content and different searching and browsing techniques that can be used to navigate the database. In addition to metadata, we show sound retrieval techniques based on the extraction of audio features from (possibly) unannotated audio. We end the chapter by discussing advanced approaches to sound retrieval and by drawing some conclusions about present and future of sound sharing and retrieval. In addition to our explanations, we provide code examples that illustrate some of the concepts discussed.",
author = "Frederic Font and Gerard Roma and Xavier Serra",
year = "2018",
language = "English",
isbn = "9783319634494",
pages = "279--301",
editor = "Tuomas Virtanen and Mark Plumbley and Dan Ellis",
booktitle = "Computational Analysis of Sound Scenes and Events",
publisher = "Springer, Cham",

}

Font, F, Roma, G & Serra, X 2018, Sound Sharing and Retrieval. in T Virtanen, M Plumbley & D Ellis (eds), Computational Analysis of Sound Scenes and Events. Springer, Cham, pp. 279-301.

Sound Sharing and Retrieval. / Font, Frederic; Roma, Gerard; Serra, Xavier.

Computational Analysis of Sound Scenes and Events. ed. / Tuomas Virtanen; Mark Plumbley; Dan Ellis. Springer, Cham, 2018. p. 279-301.

Research output: Chapter in Book/Report/Conference proceedingChapter

TY - CHAP

T1 - Sound Sharing and Retrieval

AU - Font, Frederic

AU - Roma, Gerard

AU - Serra, Xavier

PY - 2018

Y1 - 2018

N2 - Multimedia sharing has experienced an enormous growth in recent years, and sound sharing has not been an exception. Nowadays one can find online sound sharing sites in which users can search, browse, and contribute large amounts of audio content such as sound effects, field and urban recordings, music tracks, and music samples. This poses many challenges to enable search, discovery, and ultimately reuse of this content. In this chapter we give an overview of different ways to approach such challenges. We describe how to build an audio database by outlining different aspects to be taken into account. We discuss metadata-based descriptions of audio content and different searching and browsing techniques that can be used to navigate the database. In addition to metadata, we show sound retrieval techniques based on the extraction of audio features from (possibly) unannotated audio. We end the chapter by discussing advanced approaches to sound retrieval and by drawing some conclusions about present and future of sound sharing and retrieval. In addition to our explanations, we provide code examples that illustrate some of the concepts discussed.

AB - Multimedia sharing has experienced an enormous growth in recent years, and sound sharing has not been an exception. Nowadays one can find online sound sharing sites in which users can search, browse, and contribute large amounts of audio content such as sound effects, field and urban recordings, music tracks, and music samples. This poses many challenges to enable search, discovery, and ultimately reuse of this content. In this chapter we give an overview of different ways to approach such challenges. We describe how to build an audio database by outlining different aspects to be taken into account. We discuss metadata-based descriptions of audio content and different searching and browsing techniques that can be used to navigate the database. In addition to metadata, we show sound retrieval techniques based on the extraction of audio features from (possibly) unannotated audio. We end the chapter by discussing advanced approaches to sound retrieval and by drawing some conclusions about present and future of sound sharing and retrieval. In addition to our explanations, we provide code examples that illustrate some of the concepts discussed.

UR - http://www.springer.com/gb/book/9783319634494#aboutBook

M3 - Chapter

SN - 9783319634494

SP - 279

EP - 301

BT - Computational Analysis of Sound Scenes and Events

A2 - Virtanen, Tuomas

A2 - Plumbley, Mark

A2 - Ellis, Dan

PB - Springer, Cham

ER -

Font F, Roma G, Serra X. Sound Sharing and Retrieval. In Virtanen T, Plumbley M, Ellis D, editors, Computational Analysis of Sound Scenes and Events. Springer, Cham. 2018. p. 279-301