Hindawi Publishing Corporation
EURASIP Journal on Audio, Speech, and Music Processing
Volume 2007, Article ID 27616, 8 pages
doi:10.1155/2007/27616

Research Article
Detection and Separation of Speech Events in Meeting Recordings Using a Microphone Array

Futoshi Asano,1 Kiyoshi Yamamoto,1 Jun Ogata,1 Miichi Yamada,2 and Masami Nakamura2

1 Information Technology Research Institute, National Institute of Advanced Industrial Science and Technology, Tsukuba Central 2, 1-1-1 Umezono, Tsukuba 305-8568, Japan
2 Advanced Media, Inc., 48F Sunshine 60 Building, 3-1-1 Higashi-Ikebukuro, Toshima-Ku, Tokyo 170-6048, Japan

Received 2 November 2006; Revised 14 February 2007; Accepted 19 April 2007

Recommended by Stephen Voran

When applying automatic speech recognition (ASR) to meeting recordings that include spontaneous speech, the performance of ASR is greatly reduced by the overlap of speech events. In this paper, a method of separating the overlapping speech events by using an adaptive beamforming (ABF) framework is proposed. The main feature of this method is that all the information necessary for the adaptation of ABF, including microphone calibration, is obtained from meeting recordings based on the results of speech-event detection. The performance of the separation is evaluated via ASR using real meeting recordings.

Copyright © 2007 Futoshi Asano et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

1. INTRODUCTION

The analysis, structuring, and automatic transcription of meeting recordings have attracted considerable attention in recent years [1-5].
Especially for small, informal meetings, a major difficulty is that the discussion consists of spontaneous speech, and various types of unexpected speech or nonspeech events may occur. One such event is a response by a listener, such as "Uh-huh" or "I see," inserted in short pauses in the main