File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/96/c96-2192_abstr.xml
Size: 756 bytes
Last Modified: 2025-10-06 13:48:43
<?xml version="1.0" standalone="yes"?> <Paper uid="C96-2192"> <Title>Tagging Spoken Language Using Written Language Statistics</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> This paper reports on two experiments with a probabilistic part-of-speech tagger, trained on a tagged corpus of written Swedish, being used to tag a corpus of (transcribed) spoken Swedish. The results indicate that with very little adaptations an accuracy rate of 85% can be achieved, with an accuracy rate for known words of 90%. In addition, two different treatments of pauses were explored but with no significant gain in accuracy under either condition.</Paragraph> </Section> class="xml-element"></Paper>