You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
We present an analysis of prediction intervals for a non-intrusive method to estimate the clarity index (C50). The method employed to estimate C50 is a data driven approach that extracts multiple features from a reverberant speech signal which are then used to train a bidirectional long-short term memory model which maps the feature space into the target C50 value. The prediction intervals are derived from the standard deviation of the per-frame C50 estimates. This approach was shown to provide a coverage probability of 80%, i.e. 80% of times the ground truth lies between the estimated intervals, where the interval bounds are computed by using 5.6 times the standard deviation of the per-frame estimates. This accuracy is shown to be consistent with other noisy reverberant environments.
Author (s): Peso Parada, Pablo;
Sharma, Dushyant;
Naylor, Patrick A.;
van Waterschoot, Toon;
Affiliation:
Imperial College London, London, UK; KU Leuven, Leuven, Belgium; Nuance Communications, Inc., Marlow, UK
(See document for exact affiliation information.)
Publication Date:
2016-01-06
Session subject:
Paper Session 5
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Peso Parada, Pablo; Sharma, Dushyant; Naylor, Patrick A.; van Waterschoot, Toon; 2016; Analysis of Prediction Intervals for Non-Intrusive Estimation of Speech Clarity Index [PDF]; Imperial College London, London, UK; KU Leuven, Leuven, Belgium; Nuance Communications, Inc., Marlow, UK; Paper 5-2; Available from: https://aes.org/publications/elibrary-page/?id=18077
Peso Parada, Pablo; Sharma, Dushyant; Naylor, Patrick A.; van Waterschoot, Toon; Analysis of Prediction Intervals for Non-Intrusive Estimation of Speech Clarity Index [PDF]; Imperial College London, London, UK; KU Leuven, Leuven, Belgium; Nuance Communications, Inc., Marlow, UK; Paper 5-2; 2016 Available: https://aes.org/publications/elibrary-page/?id=18077
@inproceedings{Peso2016analysis,
title={{Analysis of Prediction Intervals for Non-Intrusive Estimation of Speech Clarity Index}},
author={Peso Parada, Pablo and Sharma, Dushyant and Naylor, Patrick A. and van Waterschoot, Toon},
year={2016},
month={jan},
booktitle={Journal of the Audio Engineering Society},
publisher={Paper 5-2; AES Conference: 60th International Conference: Dereverberation and Reverberation of Audio, Music, and Speech; January 2016},
number={5-2},
organization={AES},
}
Notifications