Improving bottleneck features for Vietnamese large vocabulary continuous speech recognition system using deep neural networks

Bao Quoc Nguyen; Thang Tat Vu; Mai Chi Luong

doi:10.15625/1813-9663/31/4/5944

Improving bottleneck features for Vietnamese large vocabulary continuous speech recognition system using deep neural networks

Bao Quoc Nguyen, Thang Tat Vu, Mai Chi Luong

Author affiliations

Authors

Bao Quoc Nguyen
Thang Tat Vu Institute of Information Technology, Vietnam Academy of Science and Technology
Mai Chi Luong Institute of Information Technology, Vietnam Academy of Science and Technology

DOI:

https://doi.org/10.15625/1813-9663/31/4/5944

Keywords:

Deep bottleneck features, neural network, Vietnamese speech recognition.

Abstract

In this paper, the pre-training method based on denoising auto-encoder is investigated and proved to be good models for initializing bottleneck networks of Vietnamese speech recognition system that result in better recognition performance compared to base bottleneck features reported previously. The experiments are carried out on the dataset containing speeches on Voice of Vietnam channel (VOV). The results show that the DBNF extraction for Vietnamese recognition decreases relative word error rate by 14 % and 39 % compared to the base bottleneck features and MFCC baseline, respectively.

Metrics

PDF views

228

Downloads

Published

03-01-2016

How to Cite

[1]

B. Q. Nguyen, T. T. Vu, and M. C. Luong, “Improving bottleneck features for Vietnamese large vocabulary continuous speech recognition system using deep neural networks”, J. Comput. Sci. Cybern., vol. 31, no. 4, p. 267, Jan. 2016.

Download Citation

Issue

Vol. 31 No. 4 (2015)

Section

Computer Science

License

1. We hereby assign copyright of our article (the Work) in all forms of media, whether now known or hereafter developed, to the Journal of Computer Science and Cybernetics. We understand that the Journal of Computer Science and Cybernetics will act on my/our behalf to publish, reproduce, distribute and transmit the Work.
2. This assignment of copyright to the Journal of Computer Science and Cybernetics is done so on the understanding that permission from the Journal of Computer Science and Cybernetics is not required for me/us to reproduce, republish or distribute copies of the Work in whole or in part. We will ensure that all such copies carry a notice of copyright ownership and reference to the original journal publication.
3. We warrant that the Work is our results and has not been published before in its current or a substantially similar form and is not under consideration for another publication, does not contain any unlawful statements and does not infringe any existing copyright.
4. We also warrant that We have obtained the necessary permission from the copyright holder/s to reproduce in the article any materials including tables, diagrams or photographs not owned by me/us.

Improving bottleneck features for Vietnamese large vocabulary continuous speech recognition system using deep neural networks

Authors

DOI:

Keywords:

Abstract

Metrics

Downloads

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)