File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/06/w06-3328_abstr.xml

Size: 926 bytes

Last Modified: 2025-10-06 13:45:41

<?xml version="1.0" standalone="yes"?>
<Paper uid="W06-3328">
  <Title>Bootstrapping and Evaluating Named Entity Recognition in the Biomedical Domain</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> We demonstrate that bootstrapping a gene name recognizer for FlyBase curation from automatically annotated noisy text is more effective than fully supervised training of the recognizer on more general manually annotated biomedical text. We present a new test set for this task based on an annotation scheme which distinguishes gene names from gene mentions, enabling a more consistent annotation. Evaluating our recognizer using this test set indicates that performance on unseen genes is its main weakness. We evaluate extensions to the technique used to generate training data designed to ameliorate this problem.</Paragraph>
  </Section>
</Paper>