Tuyển tập các báo cáo nghiên cứu về y học được đăng trên tạp chí y học Wertheim cung cấp cho các bạn kiến thức về ngành y đề tài: Text-mining and information-retrieval services for molecular biology. | Review Text-mining and information-retrieval services for molecular biology Martin Krallinger and Alfonso Valencia Address Protein Design Group National Center of Biotechnology CNB-CSIC Cantoblanco E-28049 Madrid Spain. Correspondence Martin Krallinger. E-mail martink@. Alfonso Valencia. E-mail valencia@ Published 28 June 2005 Genome Biology 2005 6 224 doi gb-2005-6-7-224 The electronic version of this article is the complete one and can be found online at http 2005 6 7 224 2005 BioMed Central Ltd Abstract Text-mining in molecular biology - defined as the automatic extraction of information about genes proteins and their functional relationships from text documents - has emerged as a hybrid discipline on the edges of the fields of information science bioinformatics and computational linguistics. A range of text-mining applications have been developed recently that will improve access to knowledge for biologists and database annotators. The use of large-scale experimental techniques and bioinfor-matic tools has increased the pace at which biologists produce relevant information. This also promotes the growth of the scientific literature which contains information on those experimental results in the form of free text that is structured in a way that makes it straightforward for humans to read but more difficult for computers to interpret automatically. As a consequence there is increasing interest in methods that can handle collections of biological texts. Such methods include systems that efficiently retrieve and classify documents in response to complex user queries and beyond this systems that carry out a deeper analysis of the literature to extract specific associations such as proteinprotein interactions and protein functions. This deeper analysis is called text-mining. The complex and concise nature of the scientific literature means that the use of textmining tools developed for generic texts is often impractical