We have witnessed signi cant progress in NLP applications such as information extraction IE , summarization, machine translation, cross-lingual information retrieval CLIR , etc. The progress will be accelerated by advances in speech technology, which not only enables us to interact with systems via speech but also to store and retrieve texts input via speech. The progress of NLP applications in this decade has been mainly accomplished by the rapid development of corpus-based and statistical techniques, while rather simple techniques have been used as far as the structural aspects of language are concerned. In this paper, we will discuss how we.