A Cautionary Tale For Phonetic Analysis

The Variability of Speech Between and Within Recording Sessions

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

This paper investigates within- and between-session variability using a subset of 60 British English male speakers from the WYRED project. Three separate speaking tasks were compared using extracted i-vector PLDA scores within iVOCALISE. Different speaker pairs from contemporaneous (within-session) recordings and non-contemporaneous (between-session) recordings were tested. A within-session, between-task comparison was also performed in order to consider variation in speech style in addition to non-contemporaneity. EER and Cllr values indicate that non-contemporaneity is not the only factor which needs to be taken into account when conducting phonetic analysis or evaluating speaker comparison systems, as speech style also seems to play an important role. Further analysis supports the requirement for (forensic/socio-) phoneticians to sample data from the entirety of a recording, especially if the nature of the speech elicitation may change during the task, as the degree of variability is dependent on which portion of the sound file is sampled.
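For readers unfamiliar with the two metrics named in the abstract, the following is a minimal sketch of how an equal error rate (EER) and a log-likelihood-ratio cost (Cllr) can be computed from lists of same-speaker and different-speaker scores. It is not the procedure used in the paper or in iVOCALISE; the function names `cllr` and `eer` and the toy scores are hypothetical. The Cllr function assumes its inputs are calibrated natural-log likelihood ratios, which raw PLDA scores may not be.

```python
import math

def cllr(same_llrs, diff_llrs):
    """Log-likelihood-ratio cost. Inputs are assumed to be natural-log LRs."""
    # Penalty for same-speaker trials: log2(1 + 1/LR)
    same_cost = sum(math.log2(1 + math.exp(-l)) for l in same_llrs) / len(same_llrs)
    # Penalty for different-speaker trials: log2(1 + LR)
    diff_cost = sum(math.log2(1 + math.exp(l)) for l in diff_llrs) / len(diff_llrs)
    return 0.5 * (same_cost + diff_cost)

def eer(same_scores, diff_scores):
    """Approximate EER: scan observed scores as thresholds and return the
    error rate where miss and false-alarm rates are closest."""
    best_gap, best_rate = None, None
    for t in sorted(set(same_scores) | set(diff_scores)):
        # Miss: a same-speaker score falls below the threshold
        miss = sum(s < t for s in same_scores) / len(same_scores)
        # False alarm: a different-speaker score is at or above the threshold
        false_alarm = sum(d >= t for d in diff_scores) / len(diff_scores)
        gap = abs(miss - false_alarm)
        if best_gap is None or gap < best_gap:
            best_gap, best_rate = gap, (miss + false_alarm) / 2
    return best_rate

# Toy scores (invented for illustration only)
print(cllr([0.0], [0.0]))      # uninformative system: LR = 1 for every trial
print(eer([1, 3], [0, 2]))     # overlapping score distributions
```

A system whose likelihood ratios are all 1 gives Cllr = 1, so values below 1 indicate that the scores carry some useful information; lower EER and Cllr both indicate better performance.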
Original language: English
Title of host publication: Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019
Editors: Sasha Calhoun, Paola Escudero, Marija Tabain, Paul Warren
Place of Publication: Canberra
Publisher: Australasian Speech Science and Technology Association Inc.
Pages: 3090-3094
Number of pages: 5
ISBN (Print): 9780646800691
Publication status: Published - Aug 2019
Event: 19th International Congress of the Phonetic Sciences - Melbourne Convention and Exhibition Centre, Melbourne, Australia
Duration: 5 Aug 2019 to 9 Aug 2019
https://www.icphs2019.org/

Conference

Conference: 19th International Congress of the Phonetic Sciences
Abbreviated title: ICPhS 2019
Country: Australia
City: Melbourne
Period: 5/08/19 to 9/08/19

Cite this

Ross, S., Earnshaw, K., & Gold, E. (2019). A Cautionary Tale For Phonetic Analysis: The Variability of Speech Between and Within Recording Sessions. In S. Calhoun, P. Escudero, M. Tabain, & P. Warren (Eds.), Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019 (pp. 3090-3094). Canberra: Australasian Speech Science and Technology Association Inc.
Ross, Sula; Earnshaw, Katherine; Gold, Erica. / A Cautionary Tale For Phonetic Analysis: The Variability of Speech Between and Within Recording Sessions. Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019. editor / Sasha Calhoun; Paola Escudero; Marija Tabain; Paul Warren. Canberra: Australasian Speech Science and Technology Association Inc., 2019. pp. 3090-3094
@inproceedings{32fe5fa24ab4415a8facc43b995088e4,
title = "A Cautionary Tale For Phonetic Analysis: The Variability of Speech Between and Within Recording Sessions",
abstract = "This paper investigates within- and between-session variability using a subset of 60 British English male speakers from the WYRED project. Three separate speaking tasks were compared using extracted i-vector PLDA scores within iVOCALISE. Different speaker pairs from contemporaneous (within-session) recordings and non-contemporaneous (between-session) recordings were tested. A within-session, between-task comparison was also performed in order to consider variation in speech style in addition to non-contemporaneity. EER and Cllr values indicate that non-contemporaneity is not the only factor which needs to be taken into account when conducting phonetic analysis or evaluating speaker comparison systems, as speech style also seems to play an important role. Further analysis supports the requirement for (forensic/socio-) phoneticians to sample data from the entirety of a recording, especially if the nature of the speech elicitation may change during the task, as the degree of variability is dependent on which portion of the sound file is sampled.",
keywords = "Non-contemporaneous, Forensic speaker comparison, phonetic analysis, i-vector, Variation",
author = "Sula Ross and Katherine Earnshaw and Erica Gold",
year = "2019",
month = "8",
language = "English",
isbn = "9780646800691",
pages = "3090--3094",
editor = "Sasha Calhoun and Paola Escudero and Marija Tabain and Paul Warren",
booktitle = "Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019",
publisher = "Australasian Speech Science and Technology Association Inc.",

}

Ross, S, Earnshaw, K & Gold, E 2019, A Cautionary Tale For Phonetic Analysis: The Variability of Speech Between and Within Recording Sessions. in S Calhoun, P Escudero, M Tabain & P Warren (eds), Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019. Australasian Speech Science and Technology Association Inc., Canberra, pp. 3090-3094, 19th International Congress of the Phonetic Sciences, Melbourne, Australia, 5/08/19.


TY - GEN

T1 - A Cautionary Tale For Phonetic Analysis

T2 - The Variability of Speech Between and Within Recording Sessions

AU - Ross, Sula

AU - Earnshaw, Katherine

AU - Gold, Erica

PY - 2019/8

Y1 - 2019/8

AB - This paper investigates within- and between-session variability using a subset of 60 British English male speakers from the WYRED project. Three separate speaking tasks were compared using extracted i-vector PLDA scores within iVOCALISE. Different speaker pairs from contemporaneous (within-session) recordings and non-contemporaneous (between-session) recordings were tested. A within-session, between-task comparison was also performed in order to consider variation in speech style in addition to non-contemporaneity. EER and Cllr values indicate that non-contemporaneity is not the only factor which needs to be taken into account when conducting phonetic analysis or evaluating speaker comparison systems, as speech style also seems to play an important role. Further analysis supports the requirement for (forensic/socio-) phoneticians to sample data from the entirety of a recording, especially if the nature of the speech elicitation may change during the task, as the degree of variability is dependent on which portion of the sound file is sampled.

KW - Non-contemporaneous

KW - Forensic speaker comparison

KW - phonetic analysis

KW - i-vector

KW - Variation

M3 - Conference contribution

SN - 9780646800691

SP - 3090

EP - 3094

BT - Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019

A2 - Calhoun, Sasha

A2 - Escudero, Paola

A2 - Tabain, Marija

A2 - Warren, Paul

PB - Australasian Speech Science and Technology Association Inc.

CY - Canberra

ER -

Ross S, Earnshaw K, Gold E. A Cautionary Tale For Phonetic Analysis: The Variability of Speech Between and Within Recording Sessions. In Calhoun S, Escudero P, Tabain M, Warren P, editors, Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019. Canberra: Australasian Speech Science and Technology Association Inc. 2019. p. 3090-3094