File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/00/c00-1048_concl.xml
Size: 2,040 bytes
Last Modified: 2025-10-06 13:52:44
<?xml version="1.0" standalone="yes"?> <Paper uid="C00-1048"> <Title>A Rule Induction Approach to Modeling Regional Pronunciation Variation.</Title> <Section position="7" start_page="332" end_page="332" type="concl"> <SectionTitle> 6 Concluding remarks </SectionTitle> <Paragraph position="0"> In this 1):~l)er, we hz~ve prol)osed the llSe o\]' rule induction |;eclmiques to h'.m'n to adapt i)rommciation ret)resel~tations to regional vayiants, and to study the linguisti(: ast)e(:ts ()f su('h wu'intion. A qmnltitative and qualitative rarelysis was given of the t)honemie ditlbxences discovered 1)y these teehni(tues when trained on the Celex dnt~d)ase (Dutch) and the Fonilex (t~tat)ase (Flemish). \]n order to stu(ty the relationshi l) between both pronunciation syst(;lllS~ we \]l&VC lll}L(te llS(? Of tWO rifle in(hu:tion techniques, nnmely 3h:mlsformation-Based Error-Driven Learning (Brill, 1995) and C5.0 (Quinlan, 1993). Studying the accuracy of 1)oth systems, we noted that M'ter ~l)plication of the transtbrmation rules that were learned t)y the TBEDL method, 73% of the differences (m the word level and 80% of the ditl'crences on the t)honeme level was covered by the rules. The C5.0 I)ercentages are some 3('/o higher. This (:()rresl)onds with an overall a(:(:ura(:y in 1)redicting the 1)ronun(:iation of n lq('.mish word l)rommcb~tion ti'om the l)utch pronunciation of about 89% for TBEDL and 90% for C5.0 (about 99% at i)honeme level for l)oth).</Paragraph> <Paragraph position="1"> A qualitative analysis of the first ten rules produced by both methods, suggested that l)oth TBEDL and 05.0 extract valuable rules describing the most important linguistic differences between Dutch mid Flelnish on the consonant and the vowel level. The C5.() production rules, however, are more numerous and more dit\[icult to interpret. The results of the transtbrnlation-based le~rnillg approach are clearly more understandMfle than those of a classification-based lenrning approach for this problem.</Paragraph> </Section> class="xml-element"></Paper>