You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
Automatic Speech Recognition (ASR) allows a computer to identify the words that a person speaks into a microphone and convert it to written text. One of the most challenging situations for ASR is the cocktail-party environment. Although source separation methods have already been investigated to deal with this problem, the separation process is not perfect and the resulting artifacts pose an additional problem to ASR performance in case of using separation methods based on time-frequency masks. Recently, the authors proposed a specific training method to deal with simultaneous speech situations in practical ASR systems. In this paper, we study how the speech recognition performance is affected by selecting different combinations of separation algorithms both at the training and test stages of the ASR system under different acoustic conditions. The results show that, while different separation methods produce different types of artifacts, the overall performance of the method is always increased when using any cocktail-party training.
Author (s): Marti, Amparo;
Cobos, Máximo;
Lopez, José J.;
Affiliation:
Universitat de València, Valencia, Spain; Universitat Politècnica de València, Valencia, Spain
(See document for exact affiliation information.)
AES Convention: 132
Paper Number:8635
Publication Date:
2012-04-06
Session subject:
Analysis and Synthesis and Content Management
DOI:
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Marti, Amparo; Cobos, Máximo; Lopez, José J.; 2012; Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training [PDF]; Universitat de València, Valencia, Spain; Universitat Politècnica de València, Valencia, Spain; Paper 8635; Available from: https://aes.org/publications/elibrary-page/?id=16273
Marti, Amparo; Cobos, Máximo; Lopez, José J.; Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training [PDF]; Universitat de València, Valencia, Spain; Universitat Politècnica de València, Valencia, Spain; Paper 8635; 2012 Available: https://aes.org/publications/elibrary-page/?id=16273
@inproceedings{Marti2012evaluating,
title={{Evaluating the Influence of Source Separation Methods in Robust Automatic Speech Recognition with a Specific Cocktail-Party Training}},
author={Marti, Amparo and Cobos, Máximo and Lopez, José J.},
year={2012},
month={apr},
booktitle={Journal of the Audio Engineering Society},
publisher={Paper 8635; AES Convention 132; April 2012},
number={8635},
organization={AES},
}
Notifications