Discriminative Enhancement for Single Channel Audio Source Separation Using Deep Neural Networks

Emad M Grais, Gerard Roma, Andrew JR Simpson, Mark D Plumbley

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Citations (Scopus)

Abstract

The sources separated by most single channel audio source separation techniques are usually distorted and each separated source contains residual signals from the other sources. To tackle this problem, we propose to enhance the separated sources to decrease the distortion and interference between the separated sources using deep neural networks (DNNs). Two different DNNs are used in this work. The first DNN is used to separate the sources from the mixed signal. The second DNN is used to enhance the separated signals. To consider the interactions between the separated sources, we propose to use a single DNN to enhance all the separated sources together. To reduce the residual signals of one source from the other separated sources (interference), we train the DNN for enhancement discriminatively to maximize the dissimilarity between the predicted sources. The experimental results show that using discriminative enhancement decreases the distortion and interference between the separated sources.
Original languageEnglish
Title of host publicationLatent Variable Analysis and Signal Separation
EditorsPetr Tichavský, Massoud Babaie-Zadeh, Olivier J.J. Michel, Nadège Thirion-Moreau
PublisherSpringer, Cham
Pages236-246
Number of pages11
ISBN (Electronic)9783319535470
ISBN (Print)9783319535463
DOIs
Publication statusPublished - 15 Feb 2017
Externally publishedYes
Event13th International Conference on Latent Variable Analysis and Signal Separation - Grenoble-Alpes University, Grenoble, France
Duration: 21 Feb 201723 Feb 2017
Conference number: 13
http://www.lva-ica-2017.com/ (Link to Conference Website)

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume10169
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference13th International Conference on Latent Variable Analysis and Signal Separation
Abbreviated titleLVA/ICA 2017
Country/TerritoryFrance
CityGrenoble
Period21/02/1723/02/17
Internet address

Fingerprint

Dive into the research topics of 'Discriminative Enhancement for Single Channel Audio Source Separation Using Deep Neural Networks'. Together they form a unique fingerprint.
  • Improving single-network single-channel separation of musical audio with convolutional layers

    Roma, G., Green, O. & Tremblay, P. A., 6 Jun 2018, Latent Variable Analysis and Signal Separation: 14th International Conference, LVA/ICA 2018, Guildford, UK, July 2–5, 2018, Proceedings. Gannot, S., Deville, Y., Mason, R., Plumbley, M. D. & Ward, D. (eds.). Springer Verlag, p. 306-315 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 10891 LNCS).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    File
    6 Citations (Scopus)
  • Combining mask estimates for single channel audio source separation using deep neural networks

    Grais, E. M., Roma, G., Simpson, A. JR. & Plumbley, M. D., Sep 2016, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. p. 3339-3343 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Open Access
    23 Citations (Scopus)

Cite this