A Cautionary Tale For Phonetic Analysis: The Variability of Speech Between and Within Recording Sessions

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper investigates within and between session variability using a subset of 60 British English male speakers from the WYRED project. Three separate speaking tasks were compared using extracted i-vector PLDA scores within iVOCALISE. Different speaker pairs from contemporaneous (within-session) recordings and non-contemporaneous (between-session) recordings were tested. A within-session, between-task comparison was also performed in order to consider variation in speech style in addition to non-contemporaneity. EER and Cllr values indicate that non-contemporaneity is not the only factor which needs to be taken into account when conducting phonetic analysis or evaluating speaker comparison systems, as speech style also seems to play an important role. Further analysis supports the requirement for (forensic/socio-) phoneticians to sample data from the entirety of a recording, especially if the nature of the speech elicitation may change during the task, as the degree of variability is dependent on which portion of the sound file is sampled.
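For readers unfamiliar with the two metrics named in the abstract, the following is a minimal sketch of how an equal error rate (EER) and a log-likelihood-ratio cost (Cllr) are commonly computed from same-speaker and different-speaker comparison scores. It is not the iVOCALISE implementation and does not reproduce the paper's procedure: the function names `eer` and `cllr` are hypothetical, the scores are assumed to be natural-log likelihood ratios, and the example values are invented purely for illustration.

```python
import math


def cllr(same_llrs, diff_llrs):
    """Log-likelihood-ratio cost, assuming natural-log LRs as input.

    Same-speaker scores are penalised when low, different-speaker
    scores when high; a system that always outputs LLR = 0 scores 1.0.
    """
    same_cost = sum(math.log2(1 + math.exp(-x)) for x in same_llrs) / len(same_llrs)
    diff_cost = sum(math.log2(1 + math.exp(x)) for x in diff_llrs) / len(diff_llrs)
    return 0.5 * (same_cost + diff_cost)


def eer(same_scores, diff_scores):
    """Approximate EER by sweeping a threshold over the observed scores.

    Returns the mean of the false-acceptance and false-rejection rates
    at the threshold where the two rates are closest.
    """
    best_gap, best_rate = float("inf"), None
    for t in sorted(set(same_scores) | set(diff_scores)):
        # Same-speaker pairs scoring below the threshold are false rejections.
        frr = sum(s < t for s in same_scores) / len(same_scores)
        # Different-speaker pairs at or above the threshold are false acceptances.
        far = sum(s >= t for s in diff_scores) / len(diff_scores)
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap, best_rate = gap, (far + frr) / 2
    return best_rate


if __name__ == "__main__":
    # Invented scores, for illustration only.
    same = [1.0, 2.0, -1.0, 3.0]
    diff = [-2.0, -3.0, 1.5, -1.5]
    print("EER:", eer(same, diff))
    print("Cllr:", cllr(same, diff))
```

Lower values are better for both metrics. A Cllr above 1 indicates a system whose likelihood ratios are more misleading than an uninformative system that always reports LR = 1.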
Language: English
Title of host publication: Proceedings of International Congress of Phonetic Sciences, August 2019, Melbourne
Pages: 1-5
Number of pages: 5
Publication status: Accepted/In press - 12 Feb 2019
Event: 19th International Congress of the Phonetic Sciences - Melbourne Convention and Exhibition Centre, Melbourne, Australia
Duration: 5 Aug 2019 to 9 Aug 2019
https://www.icphs2019.org/

Conference

Conference: 19th International Congress of the Phonetic Sciences
Abbreviated title: ICPhS 2019
Country: Australia
City: Melbourne
Period: 5/08/19 to 9/08/19
Fingerprint

analysis
speech
sound
systems comparison
project
comparison

Cite this

Ross, S., Earnshaw, K., & Gold, E. (Accepted/In press). A Cautionary Tale For Phonetic Analysis: The Variability of Speech Between and Within Recording Sessions. In Proceedings of International Congress of Phonetic Sciences, August 2019, Melbourne (pp. 1-5)
Ross, Sula ; Earnshaw, Katherine ; Gold, Erica. / A Cautionary Tale For Phonetic Analysis : The Variability of Speech Between and Within Recording Sessions. Proceedings of International Congress of Phonetic Sciences, August 2019, Melbourne. 2019. pp. 1-5
@inproceedings{32fe5fa24ab4415a8facc43b995088e4,
title = "A Cautionary Tale For Phonetic Analysis: The Variability of Speech Between and Within Recording Sessions",
abstract = "This paper investigates within and between session variability using a subset of 60 British English male speakers from the WYRED project. Three separate speaking tasks were compared using extracted i-vector PLDA scores within iVOCALISE. Different speaker pairs from contemporaneous (within-session) recordings and non-contemporaneous (between-session) recordings were tested. A within-session, between-task comparison was also performed in order to consider variation in speech style in addition to non-contemporaneity. EER and Cllr values indicate that non-contemporaneity is not the only factor which needs to be taken into account when conducting phonetic analysis or evaluating speaker comparison systems, as speech style also seems to play an important role. Further analysis supports the requirement for (forensic/socio-) phoneticians to sample data from the entirety of a recording, especially if the nature of the speech elicitation may change during the task, as the degree of variability is dependent on which portion of the sound file is sampled.",
author = "Sula Ross and Katherine Earnshaw and Erica Gold",
year = "2019",
month = "2",
day = "12",
language = "English",
pages = "1--5",
booktitle = "Proceedings of International Congress of Phonetic Sciences, August 2019, Melbourne",
}

Ross, S, Earnshaw, K & Gold, E 2019, A Cautionary Tale For Phonetic Analysis: The Variability of Speech Between and Within Recording Sessions. in Proceedings of International Congress of Phonetic Sciences, August 2019, Melbourne. pp. 1-5, 19th International Congress of the Phonetic Sciences, Melbourne, Australia, 5/08/19.

A Cautionary Tale For Phonetic Analysis : The Variability of Speech Between and Within Recording Sessions. / Ross, Sula; Earnshaw, Katherine; Gold, Erica.

Proceedings of International Congress of Phonetic Sciences, August 2019, Melbourne. 2019. p. 1-5.

TY - GEN
T1 - A Cautionary Tale For Phonetic Analysis
T2 - The Variability of Speech Between and Within Recording Sessions
AU - Ross, Sula
AU - Earnshaw, Katherine
AU - Gold, Erica
PY - 2019/2/12
Y1 - 2019/2/12
AB - This paper investigates within and between session variability using a subset of 60 British English male speakers from the WYRED project. Three separate speaking tasks were compared using extracted i-vector PLDA scores within iVOCALISE. Different speaker pairs from contemporaneous (within-session) recordings and non-contemporaneous (between-session) recordings were tested. A within-session, between-task comparison was also performed in order to consider variation in speech style in addition to non-contemporaneity. EER and Cllr values indicate that non-contemporaneity is not the only factor which needs to be taken into account when conducting phonetic analysis or evaluating speaker comparison systems, as speech style also seems to play an important role. Further analysis supports the requirement for (forensic/socio-) phoneticians to sample data from the entirety of a recording, especially if the nature of the speech elicitation may change during the task, as the degree of variability is dependent on which portion of the sound file is sampled.
M3 - Conference contribution
SP - 1
EP - 5
BT - Proceedings of International Congress of Phonetic Sciences, August 2019, Melbourne
ER -

Ross S, Earnshaw K, Gold E. A Cautionary Tale For Phonetic Analysis: The Variability of Speech Between and Within Recording Sessions. In Proceedings of International Congress of Phonetic Sciences, August 2019, Melbourne. 2019. p. 1-5