File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/03/n03-2028_abstr.xml
Size: 1,144 bytes
Last Modified: 2025-10-06 13:42:48
<?xml version="1.0" standalone="yes"?> <Paper uid="N03-2028"> <Title>LM Studies on Filled Pauses in Spontaneous Medical Dictation Jochen Peters</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> We investigate the optimal LM treatment of abundant filled pauses (FP) in spontaneous monologues of a professional dictation task.</Paragraph> <Paragraph position="1"> Questions addressed here are (1) how to deal with FP in the LM history and (2) to which extent can the LM distinguish between positions with high and low FP likelihood. Our results differ partly from observations reported on dialogues. Discarding FP from all LM histories clearly improves the performance. Local perplexities, entropies and word rankings at positions following FP suggest that most FP indicate hesitations rather than restarts. Proper prediction of FP allows to distinguish FP from word positions by a doubled FP probability.</Paragraph> <Paragraph position="2"> Recognition experiments confirm the improvements found in our perplexity studies.</Paragraph> </Section> class="xml-element"></Paper>