Chemistry report: "Detection and Separation of Speech Event Using Audio and Video Information Fusion and Its Application to Robust Speech Interface"

A collection of international scientific research reports in chemistry, for chemistry enthusiasts to consult, on the topic: Detection and Separation of Speech Event Using Audio and Video Information Fusion and Its Application to Robust Speech Interface

EURASIP Journal on Applied Signal Processing 2004:11, 1727-1738
© 2004 Hindawi Publishing Corporation

Detection and Separation of Speech Event Using Audio and Video Information Fusion and Its Application to Robust Speech Interface

Futoshi Asano (1), Kiyoshi Yamamoto (2), Isao Hara (1), Jun Ogata (1), Takashi Yoshimura (1), Yoichi Motomura (1), Naoyuki Ichimura (1), Hideki Asoh (1)

(1) Information Technology Research Institute, National Institute of Advanced Industrial Science and Technology, Tsukuba 305-8568, Japan
Emails: isao-hara@ yoshimur@ nic@
(2) Department of Computer Science, Tsukuba University, Tsukuba 305-8573, Japan
Email: kyama@

Received 11 November 2003; Revised 3 February 2004; Recommended for Publication by Chin-Hui Lee

A method of detecting speech events in a multiple-sound-source condition using audio and video information is proposed. For detecting speech events, sound localization using a microphone array and human tracking by stereo vision are combined by a Bayesian network. From the inference results of the Bayesian network, information on the time and location of speech events can be obtained. The information on the detected speech events is then utilized in the robust speech interface. A maximum likelihood adaptive beamformer is employed as a preprocessor of the speech recognizer to separate the speech signal from environmental noise. The coefficients of the beamformer are kept updated based on the information of the speech events. The information on the speech events is also used by the speech recognizer for extracting the speech segment.

Keywords and phrases: information fusion, sound localization, human tracking, adaptive beamformer, speech recognition.

1. INTRODUCTION

Detection of speech events is an important issue in automatic speech recognition (ASR) in a real environment with background noise and interferences. Also the detection of the presence or .
