Data collection (Chakrabarti, 2002; Novak, 2004b). The main idea of this approach is to use seeds "first [...] the discussions proceeded too quickly, so that not everyone could follow the arguments,

KNOWLEDGE ACCESS AND THE SEMANTIC WEB 151

In order to offer such search facilities, Swoogle builds an index of semantic web documents, defined as web-accessible documents written in a semantic web language. A specialised crawler has been built using a range of heuristics to identify and index semantic web documents. The creators of Swoogle are building an ontology dictionary based on the ontologies discovered by Swoogle.

Semantic Browsing

Web browsing complements searching as an important aspect of information-seeking behaviour. Browsing can be enhanced by the exploitation of semantic annotations, and below we describe three systems which offer a semantic approach to information browsing.

Magpie (Domingue et al., 2004) is an internet browser plug-in which assists users in the analysis of web pages. Magpie adds an ontology-based semantic layer onto web pages on-the-fly as they are browsed. The system automatically highlights key items of interest, and for each highlighted term it provides a set of services (e.g. contact details, current projects, related people) when you right-click on the item. This relies, of course, on the availability of a domain ontology appropriate to the page being browsed.

CS AKTiveSpace (Glaser et al., 2004) is a semantic web application which provides a way to browse information about the UK Computer Science Research domain by exploiting information from a variety of sources, including funding agencies and individual researchers. The application exploits a wide range of semantically heterogeneous and distributed content.
AKTiveSpace retrieves information related to almost two thousand active Computer Science researchers and over 24 000 research projects, with information being contained within 1000 published papers located in different university web sites. This content is gathered on a continuous basis using a variety of methods, including harvesting publicly available data from institutional web sites, bulk translation from existing databases as well