Automatic identification of Vietnamese dialects

Journal of Computer Science and Cybernetics, (2016), 18–29

PHAM NGOC HUNG1,2, TRINH VAN LOAN1,2, NGUYEN HONG QUANG2
1 Faculty of Information Technology, Hung Yen University of Technology and Education
2 School of Information and Communication Technology, Hanoi University of Science and Technology
1,2 pnhung@; 1,2 loantv@; 2 quangnh@

Abstract. Dialect identification has been studied for many languages around the world; nevertheless, research on signal processing for Vietnamese dialects is still limited, and few works have been published. Vietnamese has many different dialects, and dialectal features have an important influence on speech recognition systems. If the dialect is known during the speech recognition process, recognition performance improves, because the corpus of these systems is normally organized by dialect. In our experiments, MFCC coefficients, formants, their corresponding bandwidths, and the fundamental frequency with its variants are the input parameters for a GMM. The experimental results on the Vietnamese dialect corpus show that the identification performance of the baseline increases when fundamental frequency information is added to the MFCC coefficients. The best recognition rate is obtained by combining the formants and their bandwidths with F0 normalized by its average and standard deviation.

Keywords. Fundamental frequency, MFCC, formant, bandwidth, GMM, Vietnamese dialects, identification.

1. INTRODUCTION

Vietnamese is a tonal language with many different dialects. This diversity of dialects remains a great challenge for Vietnamese recognition systems. In other words, the pronunciation modality of the word ...
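
The approach named in the abstract (F0 normalized by its average and standard deviation, with one GMM scored per dialect) can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the choice to leave unvoiced frames at 0.0, the diagonal covariances, and the model parameters are all assumptions made for illustration, since this excerpt does not state the toolkit or the settings used.

```python
import math
from statistics import mean, pstdev


def normalize_f0(f0_hz):
    """Z-normalize voiced F0 values; unvoiced frames (F0 <= 0) stay at 0.0.

    Assumption: unvoiced frames are marked with 0 by the pitch tracker.
    """
    voiced = [f for f in f0_hz if f > 0]
    if len(voiced) < 2:
        return [0.0 for _ in f0_hz]
    mu, sigma = mean(voiced), pstdev(voiced)
    if sigma == 0:
        return [0.0 for _ in f0_hz]
    return [(f - mu) / sigma if f > 0 else 0.0 for f in f0_hz]


def gmm_frame_loglik(x, weights, means, variances):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM."""
    comp_logs = []
    for w, m, v in zip(weights, means, variances):
        ll = math.log(w)
        for xi, mi, vi in zip(x, m, v):
            ll += -0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
        comp_logs.append(ll)
    # Log-sum-exp over mixture components for numerical stability.
    top = max(comp_logs)
    return top + math.log(sum(math.exp(c - top) for c in comp_logs))


def identify_dialect(frames, models):
    """Return the dialect whose GMM gives the highest total log-likelihood.

    `models` maps a (hypothetical) dialect label to (weights, means, variances).
    """
    scores = {
        name: sum(gmm_frame_loglik(x, *params) for x in frames)
        for name, params in models.items()
    }
    return max(scores, key=scores.get)
```

In a full system each frame vector would concatenate the MFCC coefficients, formants, bandwidths, and normalized F0, and the GMM parameters would be estimated from training speech for each dialect rather than written by hand.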
