Báo cáo hóa học: " A Computationally Efficient Mel-Filter Bank VAD Algorithm for Distributed Speech Recognition Systems"

Tuyển tập báo cáo các nghiên cứu khoa học quốc tế ngành hóa học dành cho các bạn yêu hóa học tham khảo đề tài: A Computationally Efficient Mel-Filter Bank VAD Algorithm for Distributed Speech Recognition Systems | EURASIP Journal on Applied Signal Processing 2005 4 487-497 2005 Hindawi Publishing Corporation A Computationally Efficient Mel-Filter Bank VAD Algorithm for Distributed Speech Recognition Systems Damjan Vlaj Institute of Electronics Faculty of Electrical Engineering and Computer Science University ofMaribor Smetanova 17 2000 Maribor Slovenia Email Bojan Kotnik Institute of Electronics Faculty of Electrical Engineering and Computer Science University ofMaribor Smetanova 17 2000 Maribor Slovenia Email Bogomir Horvat Institute of Electronics Faculty of Electrical Engineering and Computer Science University ofMaribor Smetanova 17 2000 Maribor Slovenia Email Zdravko KaCiC Institute of Electronics Faculty of Electrical Engineering and Computer Science University ofMaribor Smetanova 17 2000 Maribor Slovenia Email kacic@ Received 18 March 2004 Revised 23 September 2004 Recommended for Publication by Douglas O Shaughnessy This paper presents a novel computationally efficient voice activity detection VAD algorithm and emphasizes the importance of such algorithms in distributed speech recognition DSR systems. When using VAD algorithms in telecommunication systems the required capacity of the speech transmission channel can be reduced if only the speech parts of the signal are transmitted. A similar objective can be adopted in DSR systems where the nonspeech parameters are not sent over the transmission channel. A novel approach is proposed for VAD decisions based on mel-filter bank MFB outputs with the so-called Hangover criterion. Comparative tests are presented between the presented MFB VAD algorithm and three VAD algorithms used in the and DSR advanced front-end Standards. These tests were made on the Aurora 2 database with different signal-to-noise SNRs ratios. In the speech recognition tests the proposed MFB VAD outperformed all the three VAD algorithms used in the standards by .

Không thể tạo bản xem trước, hãy bấm tải xuống
TÀI LIỆU LIÊN QUAN
TÀI LIỆU MỚI ĐĂNG
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.