Hàng chục công cụ khác để quản lý dữ liệu ngôn ngữ có sẵn, một số được khảo sát (Bird & Simons, 2003). Xem thêm các thủ tục tố tụng của các hội thảo LaTeCH về công nghệ ngôn ngữ cho dữ liệu di sản văn hóa. | linguistic annotations. An extended discussion of web crawling is provided by Croft Metzler Strohman 2009 . Full details of the Toolbox data format are provided with the distribution Buseman Buseman Early 1996 and with the latest distribution freely available from http computing toolbox . For guidelines on the process of constructing a Toolbox lexicon see http computing ddp . More examples of our efforts with the Toolbox are documented in Bird 1999 and Robinson Aumann Bird 2007 . Dozens of other tools for linguistic data management are available some surveyed by Bird Simons 2003 . See also the proceedings of the LaTeCH workshops on language technology for cultural heritage data. There are many excellent resources for XML . http and for writing Python programs to work with XML http doc lib . Many editors have XML modes. XML formats for lexical information include OLIF http and LIFT http p lift-standard . For a survey of linguistic annotation software see the Linguistic Annotation Page at http annotation . The initial proposal for standoff annotation was Thompson McKelvie 1997 . An abstract data model for linguistic annotations called annotation graphs was proposed in Bird Liberman 2001 . A generalpurpose ontology for linguistic description GOLD is documented at http . For guidance on planning and constructing a corpus see Meyer 2002 and Farghaly 2003 . More details of methods for scoring inter-annotator agreement are available in Artstein Poesio 2008 and Pevzner Hearst 2002 . Rotokas data was provided by Stuart Robinson and Iu Mien data was provided by Greg Aumann. For more information about the Open Language Archives Community visit http www . or see Simons Bird 2003 . Exercises 1. In Example 11-2 the new field appeared at the bottom of the entry. Modify this program so that it inserts the new .