Abstract
Most convolutional neural network architectures explored so far for musical audio separation follow an autoencoder structure, where the mixture is considered to be a corrupted version of the original source. On the other hand, many approaches based on deep neural networks make use of several networks with different objectives for estimating the sources. In this paper we propose a discriminative approach based on traditional convolutional neural network architectures for image classification and speech recognition. Our results show that this architecture performs similarly to current state of the art approaches for separating singing voice, and that the addition of convolutional layers allows improving separation results with respect to using only fully-connected layers.
| Original language | English |
|---|---|
| Title of host publication | Latent Variable Analysis and Signal Separation |
| Subtitle of host publication | 14th International Conference, LVA/ICA 2018, Guildford, UK, July 2–5, 2018, Proceedings |
| Editors | Sharon Gannot, Yannick Deville, Russell Mason, Mark D. Plumbley, Dominic Ward |
| Publisher | Springer Verlag |
| Pages | 306-315 |
| ISBN (Electronic) | 9783319937649 |
| ISBN (Print) | 9783319937632 |
| DOIs | |
| Publication status | Published - 6 Jun 2018 |
| Event | 14th International Conference on Latent Variable Analysis and Signal Seperation - University of Surrey, Guildford, United Kingdom Duration: 2 Jul 2018 → 6 Jul 2018 http://cvssp.org/events/lva-ica-2018/ (Link to Conference Website) |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 10891 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 14th International Conference on Latent Variable Analysis and Signal Seperation |
|---|---|
| Abbreviated title | LVA / ICA 2018 |
| Country/Territory | United Kingdom |
| City | Guildford |
| Period | 2/07/18 → 6/07/18 |
| Internet address |
|
Fingerprint
Dive into the research topics of 'Improving single-network single-channel separation of musical audio with convolutional layers'. Together they form a unique fingerprint.-
Discriminative Enhancement for Single Channel Audio Source Separation Using Deep Neural Networks
Grais, E. M., Roma, G., Simpson, A. J. & Plumbley, M. D., 15 Feb 2017, Latent Variable Analysis and Signal Separation. Tichavský, P., Babaie-Zadeh, M., Michel, O. J. J. & Thirion-Moreau, N. (eds.). Springer, Cham, p. 236-246 11 p. (Lecture Notes in Computer Science; vol. 10169).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
Open Access8 Link opens in a new tab Citations (Scopus) -
Two-stage single-channel audio source separation using deep neural networks
Grais, E. M., Roma, G., Simpson, A. J. & Plumbley, M. D., 1 Sept 2017, In: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25, 9, p. 1773-1783 11 p.Research output: Contribution to journal › Article › peer-review
46 Link opens in a new tab Citations (Scopus) -
Combining mask estimates for single channel audio source separation using deep neural networks
Grais, E. M., Roma, G., Simpson, A. J. & Plumbley, M. D., Sept 2016, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. p. 3339-3343 5 p.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
Open Access23 Link opens in a new tab Citations (Scopus) -
Single-Channel Audio Source Separation Using Deep Neural Network Ensembles
Grais, E. M., Roma, G., Simpson, A. J. & Plumbley, M. D., 26 May 2016, Audio Engineering Society Convention 140. Audio Engineering Society, 9494Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
Open Access -
Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network
Simpson, A. J., Roma, G. & Plumbley, M. D., 15 Aug 2015, Latent Variable Analysis and Signal Separation. Vincent, E., Yeredor, A., Koldovský, Z. & Tichavský, P. (eds.). Springer, Cham, p. 429-436 8 p. (Lecture Notes in Computer Science; vol. 9237).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
72 Link opens in a new tab Citations (Scopus)
Projects
- 1 Finished
-
FluCoMa: Fluid Corpus Manipulations
Tremblay, P. A. (PI), Green, O. (CoI), Roma, G. (CoI), Harker, A. (CoI), Clarke, M. (CoI) & Dufeu, F. (CoI)
1/09/17 → 28/02/23
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver