Improving single-network single-channel separation of musical audio with convolutional layers

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Most convolutional neural network architectures explored so far for musical audio separation follow an autoencoder structure, where the mixture is considered to be a corrupted version of the original source. On the other hand, many approaches based on deep neural networks make use of several networks with different objectives for estimating the sources. In this paper we propose a discriminative approach based on traditional convolutional neural network architectures for image classification and speech recognition. Our results show that this architecture performs similarly to current state of the art approaches for separating singing voice, and that the addition of convolutional layers allows improving separation results with respect to using only fully-connected layers.

Original languageEnglish
Title of host publicationLatent Variable Analysis and Signal Separation
Subtitle of host publication14th International Conference, LVA/ICA 2018, Guildford, UK, July 2–5, 2018, Proceedings
EditorsSharon Gannot, Yannick Deville, Russell Mason, Mark D. Plumbley, Dominic Ward
PublisherSpringer Verlag
Pages306-315
ISBN (Electronic)9783319937649
ISBN (Print)9783319937632
DOIs
Publication statusPublished - 6 Jun 2018
Event14th International Conference on Latent Variable Analysis and Signal Seperation - University of Surrey, Guildford, United Kingdom
Duration: 2 Jul 20186 Jul 2018
http://cvssp.org/events/lva-ica-2018/ (Link to Conference Website)

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10891 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th International Conference on Latent Variable Analysis and Signal Seperation
Abbreviated titleLVA / ICA 2018
CountryUnited Kingdom
CityGuildford
Period2/07/186/07/18
Internet address

    Fingerprint

Cite this

Roma, G., Green, O., & Tremblay, P. A. (2018). Improving single-network single-channel separation of musical audio with convolutional layers. In S. Gannot, Y. Deville, R. Mason, M. D. Plumbley, & D. Ward (Eds.), Latent Variable Analysis and Signal Separation: 14th International Conference, LVA/ICA 2018, Guildford, UK, July 2–5, 2018, Proceedings (pp. 306-315). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10891 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-319-93764-9_29