File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/94/c94-1092_concl.xml
Size: 1,682 bytes
Last Modified: 2025-10-06 13:57:14
<?xml version="1.0" standalone="yes"?> <Paper uid="C94-1092"> <Title>ANNOTATING 200 MILLION WORDS: THE BANK OF ENGLISH PROJECT</Title> <Section position="7" start_page="566" end_page="567" type="concl"> <SectionTitle> (i ACKNOWLEDCI~MENTS </SectionTitle> <Paragraph position="0"> .~lmehd Lhauks are due I,o llarper ColliNs Ihddishers, (i;lasgow, for l)ernlission to us,~ boi,h Collm:; C()IIUIM) aml (Mllhl.~; English \[)ieiJouary ill eleetronic form. I%rs.nally, \[ am greatly imlebted to Pasi 'l'apalmln~m for solutions to :m hw:d('ulahle mmd;er of techuieal \]n'ol)lems and t(} Arm \/outihdnen I(n' guidance and supcrvisiorl during t, hi.q project. \] wish to thallk also l)r(fl '. Fred I<arlss(>n, ,\]uha II~fikld\]/i, I<ari Pitldhlen and Sari Salmi,~uo r~>r reviewing earlier drafts .r this paper.</Paragraph> <Paragraph position="1"> The table above shows the size of the 11 batches ailnotll.ted so far in words and the lnlnlber of new lexieal enl,rles a derived fi'om them.</Paragraph> <Paragraph position="2"> B An Example of the EN(-ICG analysed sentence (from the American Books data) The original text: <g> The situation at Stangord, to be examined in more detail later, is hardly unique.</Paragraph> <Paragraph position="3"> Annot, ated text: Syntactic tags, listed in \[Tapa,uainen, 1994; Voutilainen, 1992\] are marked with an at-sign (@). The shallow syntax distinguishes faur wn'b chain hlbels and nominal head and nlodiller functioris. Modilier fimctions have ~ pointer (> or <) 1,o t, he head to the right or to the lefl;, respectively. PP and adverbial attachnmnt is solved when it can be done reliably.</Paragraph> </Section> class="xml-element"></Paper>