File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/05/p05-1071_abstr.xml
Size: 719 bytes
Last Modified: 2025-10-06 13:44:28
<?xml version="1.0" standalone="yes"?> <Paper uid="P05-1071"> <Title>Arabic Tokenization, Part-of-Speech Tagging and Morphological Disambiguation in One Fell Swoop</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> We present an approach to using a morphological analyzer for tokenizing and morphologically tagging (including part-of-speech tagging) Arabic words in one process. We learn classifiers for individual morphological features, as well as ways of using these classifiers to choose among entries from the output of the analyzer. We obtain accuracy rates on all tasks in the high nineties.</Paragraph> </Section> class="xml-element"></Paper>