Speaker identification in courtroom contexts - Part III: Groups of collaborating listeners compared to forensic voice comparison based on automatic-speaker-recognition technology

Agnes S Bali; Nabanita Basu; Philip Weber; Claudia Rosas-Aguilar; Gary Edmond; Kristy A Martire; Geoffrey Stewart Morrison

doi:10.1016/j.forsciint.2024.112048

Speaker identification in courtroom contexts - Part III: Groups of collaborating listeners compared to forensic voice comparison based on automatic-speaker-recognition technology

Forensic Sci Int. 2024 Jul:360:112048. doi: 10.1016/j.forsciint.2024.112048. Epub 2024 May 6.

Authors

Agnes S Bali¹, Nabanita Basu², Philip Weber², Claudia Rosas-Aguilar³, Gary Edmond⁴, Kristy A Martire¹, Geoffrey Stewart Morrison⁵

Affiliations

¹ School of Psychology, University of New South Wales, Sydney, New South Wales, Australia.
² Forensic Data Science Laboratory, Aston University, Birmingham, UK.
³ Instituto de Lingüística y Literatura, Universidad Austral de Chile, Valdivia, Chile.
⁴ School of Law, Society & Criminology, University of New South Wales, Sydney, New South Wales, Australia.
⁵ Forensic Data Science Laboratory, Aston University, Birmingham, UK; Forensic Evaluation Ltd, Birmingham, UK. Electronic address: geoff-morrison@forensic-evaluation.net.

PMID: 38733653
DOI: 10.1016/j.forsciint.2024.112048

Abstract

Expert testimony is only admissible in common-law systems if it will potentially assist the trier of fact. In order for a forensic-voice-comparison expert's testimony to assist a trier of fact, the expert's forensic voice comparison should be more accurate than the trier of fact's speaker identification. "Speaker identification in courtroom contexts - Part I" addressed the question of whether speaker identification by an individual lay listener (such as a judge) would be more or less accurate than the output of a forensic-voice-comparison system that is based on state-of-the-art automatic-speaker-recognition technology. The present paper addresses the question of whether speaker identification by a group of collaborating lay listeners (such as a jury) would be more or less accurate than the output of such a forensic-voice-comparison system. As members of collaborating groups, participants listen to pairs of recordings reflecting the conditions of the questioned- and known-speaker recordings in an actual case, confer, and make a probabilistic consensus judgement on each pair of recordings. The present paper also compares group-consensus responses with "wisdom of the crowd" which uses the average of the responses from multiple independent individual listeners.

Keywords: Admissibility; Forensic voice comparison; Likelihood ratio; Speaker identification; Validation.

MeSH terms

Adult
Biometric Identification / methods
Cooperative Behavior
Expert Testimony
Female
Forensic Sciences* / methods
Humans
Male
Speech Recognition Software
Voice*