Feature Extraction of Binaural Recordings for Acoustic Scene Classification

Sławomir Zieliński, Hyunkook Lee

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Binaural technology becomes increasingly popular in the multimedia systems. This paper identifies a set of features of binaural recordings suitable for the automatic classification of the four basic spatial audio scenes representing the most typical patterns of audio content distribution around a listener. Moreover, it compares the five artificial-intelligence-based methods applied to the classification of binaural recordings. The results show that both the spatial and the spectro-temporal features are essential to accurate classification of binaurally rendered acoustic scenes. The spectro-temporal features appear to have a stronger influence on the classification results than the spatial metrics. According to the obtained results, the method based on the support vector machine, exploiting the features identified in the study, yields the classification accuracy approaching 84%.
LanguageEnglish
Title of host publicationIEEE Federated Conference on Computer Science and Information Systems, FedCSIS 2018
PublisherIEEE
Publication statusAccepted/In press - 1 Jul 2018
EventIEEE Federated Conference on Computer Science and Information Systems
- Poznan, Poland
Duration: 9 Sep 201812 Sep 2018
https://fedcsis.org/2018/ (Link to Conference Website)

Conference

ConferenceIEEE Federated Conference on Computer Science and Information Systems
Abbreviated titleFedCSIS
CountryPoland
CityPoznan
Period9/09/1812/09/18
Internet address

Fingerprint

acoustics
artificial intelligence
multimedia
method

Cite this

Zieliński, S., & Lee, H. (Accepted/In press). Feature Extraction of Binaural Recordings for Acoustic Scene Classification. In IEEE Federated Conference on Computer Science and Information Systems, FedCSIS 2018 IEEE.
Zieliński, Sławomir ; Lee, Hyunkook. / Feature Extraction of Binaural Recordings for Acoustic Scene Classification. IEEE Federated Conference on Computer Science and Information Systems, FedCSIS 2018. IEEE, 2018.
@inproceedings{ad6bcdce03a74fbc9c764f6298f9f418,
title = "Feature Extraction of Binaural Recordings for Acoustic Scene Classification",
abstract = "Binaural technology becomes increasingly popular in the multimedia systems. This paper identifies a set of features of binaural recordings suitable for the automatic classification of the four basic spatial audio scenes representing the most typical patterns of audio content distribution around a listener. Moreover, it compares the five artificial-intelligence-based methods applied to the classification of binaural recordings. The results show that both the spatial and the spectro-temporal features are essential to accurate classification of binaurally rendered acoustic scenes. The spectro-temporal features appear to have a stronger influence on the classification results than the spatial metrics. According to the obtained results, the method based on the support vector machine, exploiting the features identified in the study, yields the classification accuracy approaching 84{\%}.",
keywords = "Spatial scene, Classification, Machine learning, Artificial Intelligence, Binaural audio",
author = "Sławomir Zieliński and Hyunkook Lee",
note = "{\circledC} {\circledC} 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.",
year = "2018",
month = "7",
day = "1",
language = "English",
booktitle = "IEEE Federated Conference on Computer Science and Information Systems, FedCSIS 2018",
publisher = "IEEE",

}

Zieliński, S & Lee, H 2018, Feature Extraction of Binaural Recordings for Acoustic Scene Classification. in IEEE Federated Conference on Computer Science and Information Systems, FedCSIS 2018. IEEE, IEEE Federated Conference on Computer Science and Information Systems
, Poznan, Poland, 9/09/18.

Feature Extraction of Binaural Recordings for Acoustic Scene Classification. / Zieliński, Sławomir; Lee, Hyunkook.

IEEE Federated Conference on Computer Science and Information Systems, FedCSIS 2018. IEEE, 2018.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Feature Extraction of Binaural Recordings for Acoustic Scene Classification

AU - Zieliński,Sławomir

AU - Lee,Hyunkook

N1 - © © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

PY - 2018/7/1

Y1 - 2018/7/1

N2 - Binaural technology becomes increasingly popular in the multimedia systems. This paper identifies a set of features of binaural recordings suitable for the automatic classification of the four basic spatial audio scenes representing the most typical patterns of audio content distribution around a listener. Moreover, it compares the five artificial-intelligence-based methods applied to the classification of binaural recordings. The results show that both the spatial and the spectro-temporal features are essential to accurate classification of binaurally rendered acoustic scenes. The spectro-temporal features appear to have a stronger influence on the classification results than the spatial metrics. According to the obtained results, the method based on the support vector machine, exploiting the features identified in the study, yields the classification accuracy approaching 84%.

AB - Binaural technology becomes increasingly popular in the multimedia systems. This paper identifies a set of features of binaural recordings suitable for the automatic classification of the four basic spatial audio scenes representing the most typical patterns of audio content distribution around a listener. Moreover, it compares the five artificial-intelligence-based methods applied to the classification of binaural recordings. The results show that both the spatial and the spectro-temporal features are essential to accurate classification of binaurally rendered acoustic scenes. The spectro-temporal features appear to have a stronger influence on the classification results than the spatial metrics. According to the obtained results, the method based on the support vector machine, exploiting the features identified in the study, yields the classification accuracy approaching 84%.

KW - Spatial scene

KW - Classification

KW - Machine learning

KW - Artificial Intelligence

KW - Binaural audio

M3 - Conference contribution

BT - IEEE Federated Conference on Computer Science and Information Systems, FedCSIS 2018

PB - IEEE

ER -

Zieliński S, Lee H. Feature Extraction of Binaural Recordings for Acoustic Scene Classification. In IEEE Federated Conference on Computer Science and Information Systems, FedCSIS 2018. IEEE. 2018