Comparison of Noise Robust Methods in Large Vocabulary Speech Recognition (2010)
AUTHORS:
Keronen Sami
,
Remes Ulpu,
Palomäki Kalle,
Virtanen Tuomas,
Kurimo Mikko
BOOKTITLE:
Proceedings of the 18th European Signal Processing Conference, EUSIPCO 2010
INTERNALPDF:
internalpdf/eusipco2010.pdf
@inproceedings{ keronen10.eusipco, author = {Keronen, Sami and Remes, Ulpu and Palom\"{a}ki, Kalle and Virtanen, Tuomas and Kurimo, Mikko}, title = "Comparison of Noise Robust Methods in Large Vocabulary Speech Recognition", booktitle = "Proceedings of the 18th European Signal Processing Conference, EUSIPCO 2010", year = "2010", corerank = "NA", month = "August", responsibleauthor = "Keronen, Sami", flags = "AIRC", pdf = "eusipco2010.pdf", address = "Aalborg, Denmark", keywords = "noise robust, speech recognition, DPMC", unitcode = "T3060=100", impactfactor = "A4", abstract = "In this paper, a comparison of three fundamentally different noise robust approaches is carried out. The recognition performances of multicondition training, Data-driven Parallel Model Combination (DPMC), and cluster-based missing data reconstruction methods implemented in a large vocabulary continuous speech recognition system are evaluated with Finnish language speech data consisting of real recordings in noisy environments. All three methods improve the recognition accuracy substantially in poor signal-to-noise ratio (SNR) conditions when compared to a baseline system trained on clean speech. DPMC and missing data reconstruction systems give the best performance on high SNR conditions. On low SNR conditions, the performance of multicondition trained system is ranked the best, DPMC the second best and missing data reconstruction the third." }