You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
This paper looks at a methodology of quantifying the speaking voice, by which temporal and spectral features of the voice are extracted and processed to create a numeric code that identifies speakers, so those speakers can be searched in a database much like fingerprints. The parameters studied include: (1) average fundamental frequency (F0) of the speech signal over time, (2) standard deviation of the F0, (3) the slope and (4) sign of the FO contour, (5) the average energy, (6) the standard deviation of the energy, (7) the spectral energy contained from 50 Hz to 1,000 Hz, (8) the spectral energy from 1,000 Hz to 5,000 Hz, (9) the Alpha Ratio, (10) the average speaking rate, and (11) the total duration of the spoken sentence.
Author (s): Popolo, Peter S.;
Sanders, Richard W.;
Titze, Ingo R.;
Affiliation:
National Center for Voice & Speech; University of Iowa; University of Colorado at Denver
(See document for exact affiliation information.)
AES Convention: 123
Paper Number:7274
Publication Date:
2007-10-06
Session subject:
Audio Forensics
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Popolo, Peter S.; Sanders, Richard W.; Titze, Ingo R.; 2007; Quantifying the Speaking Voice: Generating a Speaker Code as a Means of Speaker Identification Using a Simple Code-Matching Technique [PDF]; National Center for Voice & Speech; University of Iowa; University of Colorado at Denver; Paper 7274; Available from: https://aes.org/publications/elibrary-page/?id=14332
Popolo, Peter S.; Sanders, Richard W.; Titze, Ingo R.; Quantifying the Speaking Voice: Generating a Speaker Code as a Means of Speaker Identification Using a Simple Code-Matching Technique [PDF]; National Center for Voice & Speech; University of Iowa; University of Colorado at Denver; Paper 7274; 2007 Available: https://aes.org/publications/elibrary-page/?id=14332
@inproceedings{Popolo2007quantifying,
title={{Quantifying the Speaking Voice: Generating a Speaker Code as a Means of Speaker Identification Using a Simple Code-Matching Technique}},
author={Popolo, Peter S. and Sanders, Richard W. and Titze, Ingo R.},
year={2007},
month={oct},
booktitle={Journal of the Audio Engineering Society},
publisher={Paper 7274; AES Convention 123; October 2007},
number={7274},
organization={AES},
}
TY – paper
TI – Quantifying the Speaking Voice: Generating a Speaker Code as a Means of Speaker Identification Using a Simple Code-Matching Technique
AU – Popolo, Peter S.
AU – Sanders, Richard W.
AU – Titze, Ingo R.
PY – 2007
JO – Journal of the Audio Engineering Society
VL – 7274
Y1 – October 2007
Notifications