<?xml version="1.0" standalone="yes"?> <Paper uid="W06-3202"> <Title>Improving Syllabification Models with Phonotactic Knowledge</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> We report on a series of experiments with probabilistic context-free grammars predicting English and German syllable structure. The treebank-trained grammars are evaluated on a syllabification task. The grammar used by Müller (2002) serves as a point of comparison. As she evaluates the grammar only for German, we re-implement the grammar and experiment with additional phonotactic features. Using bi-grams within the syllable, we can model the dependency on the previous consonant in the onset and coda. A 10-fold cross-validation procedure shows that syllabification can be improved by incorporating this type of phonotactic knowledge. Compared to the grammar of Müller (2002), syllable boundary accuracy increases from 95.8% to 97.2% for English, and from 95.9% to 97.2% for German. Moreover, our experiments with different syllable structures indicate that there are dependencies between the onset and the nucleus for German but not for English. The analysis of one of our phonotactic grammars shows that interesting phonotactic constraints are learned. For instance, unvoiced consonants are the most likely first consonants, and liquids and glides are preferred as second consonants in two-consonantal onsets.</Paragraph> </Section> </Paper>