File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/05/w05-0212_metho.xml

Size: 26,802 bytes

Last Modified: 2025-10-06 14:09:52

<?xml version="1.0" standalone="yes"?>
<Paper uid="W05-0212">
  <Title>A Software Tool for Teaching Reading Based on Text-to-Speech Letter-to-Phoneme Rules</Title>
  <Section position="4" start_page="77" end_page="78" type="metho">
    <SectionTitle>
2 Software Design
</SectionTitle>
    <Paragraph position="0"> The Word Pronunciation tool allows a student to enter any word or name - whether it is in any dictionary or not - and see our set of rules that account for its pronunciation. The screen capture below shows the output for the word &amp;quot;photograph&amp;quot;. The student can also hear the word pronounced, either normally or syllable-by-syllable.</Paragraph>
    <Paragraph position="1">  The system uses the International Phonetic Alphabet (IPA) to represent phonetic transcriptions, because most of our target population, adult foreign-born learners of English as a second language, were familiar with this alphabet, since it is used in many English learners' dictionaries.</Paragraph>
    <Paragraph position="2"> In addition, a student can click on any rule and see other words obeying the same rule as well as words that are exceptions to the rule.</Paragraph>
    <Paragraph position="3">  The Letter Pattern tool allows a student to enter a letter or sequence of letters (ie, a letter pattern) and see the rules that apply to that pattern and exceptions to those rules. For example, a student confused by the fact that &amp;quot;how&amp;quot; and &amp;quot;snow&amp;quot; don't rhyme can enter the letter pattern &amp;quot;ow&amp;quot; and view the various generalizations (rules) that determine the pronunciation of this letter string in different contexts, as well as words that don't follow these generalizations (exceptions). The software underlying this tool allows the user to tailor the output to his needs of the moment. For example, one can choose 1-syllable versus multisyllabic words as targets for the rules, how many sample words to output by default, and how big a vocabulary from which to draw words. A simple example of the operation of this tool is illustrated in Figure 2.</Paragraph>
    <Paragraph position="4"> Figure 2: Letter Pattern tool display While we developed the Letter Pattern tool for general use by learners, we used its underlying search engine in exercises designed to diagnose and teach pronunciation rules.</Paragraph>
    <Paragraph position="5"> We implemented a framework for a self-paced set of exercises that allows the user to work alone to diagnose his pronunciation-rule weaknesses and learn the rules necessary to correct his errors. We used this framework to assess the effectiveness of our rules and system for learners.</Paragraph>
    <Paragraph position="6"> In the typical exercise interaction, the user sees a sequence of words and must choose the correct pronunciation for each. He indicates his choice of  In this prototype, we worked on presenting the segmental rules, that is, rules for pronouncing phonemes. Although our letter-to-phoneme software assigns lexical stress (to indicate which syllable bears primary stress in polysyllabic words), the stress assignment algorithm is quite complicated. Although the algorithm is accurate, it is too complicated for a human to apply for learning. We also ignored the rules for morphological decomposition, such as for analyzing &amp;quot;walking' into &amp;quot;walk&amp;quot; plus &amp;quot;ing&amp;quot; or &amp;quot;snowman&amp;quot; into &amp;quot;snow&amp;quot; plus &amp;quot;man&amp;quot;. Words with the letter pattern eigh  pronunciation by clicking on one of several options, represented in the International Phonetic Alphabet (IPA) or by clicking on a speaker-symbol, so that he can hear the options spoken. An example of a test item would be the nonsense word &amp;quot;doke&amp;quot;. If the user chooses an incorrect pronunciation for this word, he is told the correct pronunciation, as well as the relevant rule, which in this case is that an 'o' followed by a single consonant followed by a final 'e' is pronounced /oU /. The user can choose to see actual examples of the rule in action (&amp;quot;smoke&amp;quot;, &amp;quot;home&amp;quot;, etc.) and other rules involving 'o' (eg., the default pronunciation /a :/, as in &amp;quot;hot&amp;quot;). In some cases, the exercises tested real English words, and in others, &amp;quot;nonsense&amp;quot; words (words that do not exist in English but are possible as words, because they have letter sequences occurring in English words). Only through knowing the general rules of English pronunciation can a student correctly predict the pronunciation of words he has never seen before. Figure 3 shows the exercise for knowledge of the letter &amp;quot;a&amp;quot; in the nonsense  This &amp;quot;lesson&amp;quot; screen highlights the relevant pronunciation rule in the word. Because subjects told us that our pronunciation rule syntax, derived from our LTP rules, was &amp;quot;too mathematical&amp;quot; and hard to understand, the screen also displays an English language explanation for each rule (e.g., &amp;quot;In the letter pattern a - any letter - e, the letter a is pronounced as /eI /&amp;quot;). In our prototype, we developed a simple text-generation algorithm to translate from our &amp;quot;mathematical&amp;quot; rule syntax (&amp;quot;a - /eI /, in a.e&amp;quot;) into normal English for the rules that we tested in our evaluation. Going forward, however, we will need to produce the explanations via a more sophisticated algorithm or simply hand-prepare explanations for the rules.</Paragraph>
    <Paragraph position="7"> A subject can click on the &amp;quot;See all words&amp;quot; link to see more English words in which that rule applies. After the &amp;quot;lesson&amp;quot; the learner is given the opportunity to try again, in order to reinforce the correct pronunciation.</Paragraph>
    <Paragraph position="8"> The design of the prototype incorporates several features that are important to its extension to a full learning system. First, the set of exercises is tabledriven, so that is relatively easy to add a new set of exercises. This feature is important since a complete system will need a large number of exercises.</Paragraph>
    <Paragraph position="9"> Second, the system is designed so that the corpus of words that serve as examples of the rules can be changed easily. This feature is important since different user groups (e.g., adult nonnative speakers, children, speakers with reading disabilities) may require different kinds of words as examples.</Paragraph>
  </Section>
  <Section position="5" start_page="78" end_page="83" type="metho">
    <SectionTitle>
3 Experimental Results
</SectionTitle>
    <Paragraph position="0"> In addition to developing lexical resource tools, we conducted an experiment to determine (1) if our software could be useful in teaching nonnative speakers of English how to pronounce English words, and if so, (2) if both commonly-taught pronunciation rules and pronunciation rules that are idiosyncratic to the E-Speech letter-to-phoneme system can effectively be taught.</Paragraph>
    <Paragraph position="1"> We considered testing the lexical resource tools directly by giving students lists of words and instructing them to use the tools to learn the pronunciation rules for the words. However, we felt that a more efficient way of testing our software would be to develop a set of exercises to diagnose and teach various pronunciation rules and then to test how effectively students learned from the exercises. We developed the design of the exercises You may need help with the pronunciation rule for a in jate. The correct pronunciation for jate is dZ eI t Here are the pronunciation rules for all the letters in jate  Pronunciation rule j - dZ - Pronunciation rule a - eI , in a . e Pronunciation rule t - t Pronunciation rule e# - (not pronounced) Let's look at the rule for pronouncing a in this word: - Pronunciation rule a - eI , in a . e  This rule says: In the letter pattern a - any letter - e, the letter a is pronounced as eI . Examples: There are 2049 words in English where this rule applies. Here are 5 of them: made meI d state steI t make meI k same seI m place pleI s see all words Let's get started. Here is item 1 of 22 items. Choose the correct pronunciation for this word.  based on informal comments and results of pretests with more than 40 nonnative speakers of English.</Paragraph>
    <Section position="1" start_page="79" end_page="80" type="sub_section">
      <SectionTitle>
3.1 Experimental Design
</SectionTitle>
      <Paragraph position="0"> We sought to improve nonnative speakers' word pronunciation competence, aiming toward giving them the competence of native speakers of English.</Paragraph>
      <Paragraph position="1"> Therefore, we included both native and nonnative speakers as subjects. 10 nonnative speakers of English and 7 native speakers of English successfully completed the final set of exercises. Six non-native subjects were undergraduates or graduate students at Montclair State University who had been assigned to an English as a Second Language course based on their performance on an English language test administered by the university. The other four were nonnative speakers of English in Brazil, Bolivia, and Germany. Native languages of the subjects were German, Portuguese, Korean, Spanish, Polish, Bangla (Bangladesh), and Urhobo (Nigeria). The native English-speaking subjects were high school or college students who grew up in New Jersey.</Paragraph>
      <Paragraph position="2">  The subjects were assigned logins to the system and were instructed to complete a series of exercises, each of which would present different English pronunciation rules. A subject logged in to the system with a web-browser over the internet, saw a printed word and a set of possible pronunciations for the word (as described above). The student was instructed to listen to the set of choices and to choose the pronunciation that he thought was correct. Subjects were told that each exercise would consist of two parts. The first part of each exercise would identify the pronunciation rules with which a subject might need help and then teach the rule; the second part of the exercise would determine whether teaching the pronunciation rules was effective. In the teaching part of the exercise, each rule was presented several times, as it applied to different words. Subjects were allowed to repeat the first part of each exercise as many times as they wished, until they felt comfortable about proceeding to the test part of the exercise.</Paragraph>
      <Paragraph position="3">  Nonnative subjects were told that we had developed software for teaching word pronunciation and we needed nonnative speakers to try the software and see if it helped them to improve their word pronunciation. English-speaking subjects were told that we had developed software for teaching word pronunciation to nonnative subjects, and that we needed to compare the students' performance with native speakers' performance.</Paragraph>
      <Paragraph position="4"> Our software logged the students' choice for each word in each part of each exercise and scored it as correct (1) or incorrect (0). We computed the percentage of correct choices, which we call the word pronunciation score. We also logged the number of times a student practiced with the diagnosis/lesson portion of each exercise, and the amount of time a student spent with each item.</Paragraph>
      <Paragraph position="5"> The basic exercises were: Basic: 1-syllable nonsense words representing &amp;quot;basic&amp;quot; rules, rules that are extremely common in English words. These are productive rules (English speakers apply them in nonsense words), and rules capturing these generalizations are commonly taught in phonics programs. Specifically the exercise teaches: * a is pronounced /eI / in the letter sequence a - any letter - e , as in make * a is pronounced /ae/ by default, as in cat and analogous rules for the letters e, i, o, u.</Paragraph>
      <Paragraph position="6"> The other exercises taught and tested rules from the LTP system, using English words rather than nonsense words as the material. These were: LTP1: the basic rules for the letter a, plus the trisyllabic laxing rule (which we call the 3-syllable rule), which causes underlying long vowels and diphthongs to shorten to a lax vowel in antepenul- null * a is pronounced /a :/ in /w/a, as in watch and quality ... except ...</Paragraph>
      <Paragraph position="7"> * a is pronounced /ae/ in /w/a before the phonemes /k/, /g/, /m/, /ng /, as in wag, swam, quack, swang * a is pronounced /eI / in a - any letter - e , as in wake Since each of these exercises included only several rules, the final exercise, LTP-all, recapped the other exercises, in order to assess how well students could integrate all the rules.</Paragraph>
      <Paragraph position="8"> LTP-all: an integrated exercise: all the rules for the letter a that were presented in the previous exercises, plus the rule * a - /eI / in aste, as in paste We chose this particular set of LTP rules because they would allow us to compare the &amp;quot;basic&amp;quot; rules, common to many phonics programs, and rules in our LTP system that are not taught in phonics programs. All the non-basic rules in our experiment were pronunciation rules for the letter &amp;quot;a&amp;quot; and were chosen because they applied to many English words and represented a variety of formal types of rules. For example, some had relatively simple contexts (e.g., &amp;quot;alt&amp;quot;), and some had complicated contexts. For most of the rules, the context was specified in terms of the surrounding letters (for example, the letter a when followed by any letter and the letter e). For one rule, the context was specified in terms of the surrounding phonemes. This latter type of rule is complicated because it requires that a learner first identify the phonemes for the letters surrounding the target letter &amp;quot;a&amp;quot;.</Paragraph>
      <Paragraph position="9"> In the exercise on &amp;quot;basic rules&amp;quot;, we used nonsense words to teach and test pronunciation rules. Our reasoning was that the strongest test of whether a student knows the rules is to test his pronunciation of nonsense words, since the only way he could possibly know how to pronounce a nonsense word is by applying the rules. Further, we felt there was strong evidence that the &amp;quot;basic rules&amp;quot; are productive in English. That is, native speakers of English know these rules and apply them in novel words and nonsense words. For example, English speakers pronounce the &amp;quot;a&amp;quot; in nonsense words with &amp;quot;a&amp;quot; - consonant - &amp;quot;e&amp;quot; at the end of a word, (e.g., &amp;quot;pake&amp;quot;, &amp;quot;glape&amp;quot;, &amp;quot;nade&amp;quot;) as /eI /. Consequently, we felt that teaching and testing with nonsense words would help give nonnative speakers the same competence as native speakers.</Paragraph>
      <Paragraph position="10"> However, for the other exercises we used real English words. Our reasoning was that the non-basic rules, although they apply to classes of English words, may not be productive in English. That is, native speakers of English might not apply the rules to nonsense words, even though the rule governs a class of existing English words. For example, in English, &amp;quot;oo&amp;quot; is most commonly /u:/ (as in &amp;quot;coo&amp;quot; and &amp;quot;cool&amp;quot;), but when the &amp;quot;oo&amp;quot; is followed by a final &amp;quot;k&amp;quot;, the vowel is almost always pronounced /U / (e.g., &amp;quot;took&amp;quot;, &amp;quot;book&amp;quot;, &amp;quot;cook&amp;quot;, &amp;quot;brook&amp;quot;, &amp;quot;crook&amp;quot;, &amp;quot;snook&amp;quot;, though there are a few exceptions: &amp;quot;kook&amp;quot;, &amp;quot;spook&amp;quot;). The question thus arises whether native speakers of English, who obviously know how to pronounce these words, have internalized the &amp;quot;ook rule&amp;quot; and apply it in novel words. Native speakers do not always pronounce novel &amp;quot;ook&amp;quot; words with /U /; instead, they sometimes use /u:/ in nonsense words, like &amp;quot;mook&amp;quot;, &amp;quot;dook&amp;quot;, &amp;quot;vook&amp;quot; (see Treiman et. al., 2003). Of course, the fact that a rule is not productive does not mean that it is not useful for teaching students how to pronounce words; clearly it would be useful for students to know that &amp;quot;ook&amp;quot; is usually pronounced /U k/. However, since we wanted to compare nonnatives' performance to natives' performance, and we were primarily concerned with teaching nonnative speakers how to pronounce real English words, we chose to teach and test real, as opposed to nonsense, words.</Paragraph>
      <Paragraph position="11">  We did include one test of non-basic-rules that used nonsense words, anticipating that native speakers might perform differently from nonnatives, if the nonnatives, who had been explicitly taught pronunciation rules, applied them to nonsense words, even if natives did not apply them productively in nonsense words.</Paragraph>
    </Section>
    <Section position="2" start_page="80" end_page="82" type="sub_section">
      <SectionTitle>
3.2 Experimental Results
</SectionTitle>
      <Paragraph position="0"> We present our results informally, without statistical analysis of significance, primarily because we have to date collected data a relatively small number of subjects. Consequently, we interpret our results as preliminary.</Paragraph>
      <Paragraph position="1">  We attempted to choose words that have relatively low frequency-ofoccurrence, to minimize the chances that a nonnative speaker would simply know the word.</Paragraph>
      <Paragraph position="3"> &amp;quot;before&amp;quot; (word pronunciation scores before subjects were offered any lessons) and &amp;quot;after&amp;quot; (word pronunciation scores from the test parts of the exercises, after the lessons). As would be expected, native speakers had higher word pronunciation scores than nonnative speakers. Further, nonnative speakers had higher word pronunciation scores after completing the lessons than they did before the lessons, although, overall, they did not achieve native speakers' level of word pronunciation.</Paragraph>
      <Paragraph position="4"> Thus, our data suggests that, overall, nonnative speakers were able to learn aspects of word pronunciation from our system.</Paragraph>
      <Paragraph position="5">  among the subjects. Some nonnative subjects' scores increased much more than others', and several subjects' scores did not increase or increased only slightly. Nonnative subjects with higher &amp;quot;before&amp;quot; scores, in general, did not increase as much as the nonnatives with low &amp;quot;before&amp;quot; scores, probably because their scores were high to start with.  across subjects, for the different exercises, which represented different sets of pronunciation rules.</Paragraph>
      <Paragraph position="6"> We wanted to know whether some exercises proved more learnable than others. In general, as shown at the right side of the Figure 7, for nonnative speakers, for each exercise, the word pronunciation scores were higher after the lessons than before, although the effects were greater for some exercises than others. For native speakers, in contrast, there were no systematic differences in the before versus after scores. However, overall, scores were higher for some exercises than for other exercises even for native speakers. Examination of the native speakers' &amp;quot;incorrect&amp;quot; responses suggests that dialectal issues may have caused some native speakers to choose responses that we did not anticipate. For example, for the word &amp;quot;waffle&amp;quot;, some native subjects chose the pronunciation /wo :f@ l/, although we had assumed that the pronunciation in these subjects' dialect was /wa :f@ l/.</Paragraph>
      <Paragraph position="7"> Figures 8 and 9 suggest that some rules were useful to nonnative subjects. For example, the &amp;quot;basic&amp;quot; rules, in general, were effective; the nonsense words tested after the lessons elicited higher scores than those tested before any lessons. Of the other, letter-to-phoneme-based rules, the &amp;quot;war&amp;quot; rule and the 3-syllable rule seemed to be effective (the &amp;quot;before&amp;quot; bar for &amp;quot;war&amp;quot; is not displayed in Figure 9, because the before score was extremely low, only</Paragraph>
    </Section>
    <Section position="3" start_page="82" end_page="83" type="sub_section">
      <SectionTitle>
Rules
</SectionTitle>
      <Paragraph position="0"> A complicated rule, the /w/a rule (i.e., the rule that the letter &amp;quot;a&amp;quot; after the phoneme /w/ is pronounced /a :/), appeared not to be useful to nonnative subjects. We found no evidence for the effect of teaching for another rule, the &amp;quot;aste&amp;quot; rule, because all nonnative subjects knew the pronunciation of the &amp;quot;aste&amp;quot; words before the lessons. However, there were differences between subjects. One source of this difference is probably due to differences in the nonnative subjects' pre-existing English knowledge; that is some subjects knew some word pronunciations in advance of the lessons.</Paragraph>
      <Paragraph position="1"> Consequently, for some rules we obtained data for only a few subjects.</Paragraph>
      <Paragraph position="2"> As discussed above, since the items in the exercises testing our letter-to-phoneme rules were English words (even though there were not common words) how do we know that subjects' performance was due to our lessons; perhaps the subjects simply knew the words' pronunciation before participating in our experiments? How well do nonnative speakers apply the rules they learned to words that we can be sure they have never seen before? One of our exercises, LTP-all included a section that contained only nonsense words (e.g., &amp;quot;laterous&amp;quot;, &amp;quot;plar&amp;quot;, &amp;quot;swarg&amp;quot;, &amp;quot;falt&amp;quot;).  words versus words in the Integrated Rules Test LTP-all after lessons Figure 10 presents the results of the nonsense word portion of LTP-all: pronunciation scores for real English words versus nonsense words for non-native subjects and the analogous scores for native speakers, collapsed across subjects. Although there were between-subject differences, on average, both sets of subjects had lower scores for nonsense words than for words. If subjects based their scores for all test items - words as well as nonsense words - entirely on the word pronunciation rules that we included in our exercises, then we would expect their scores to be the same for words and nonsense words. Since words have an empirically correct pronunciation (they are given in a dictionary, for example), native speakers may be relying on a stored phonological representation for the word items. For nonsense words, however, the subjects must rely on rules or other principles. If subjects used rules or principles for pronouncing nonsense words different from the ones we expected, then the nonsense word scores would be lower than those for words. However, the pronunciation scores for the nonsense words for the non-native subjects were higher than that for natives.</Paragraph>
      <Paragraph position="3"> This fact suggests that the nonnatives were, in fact, applying the pronunciation rules they had learned in our lessons to the nonwords.</Paragraph>
      <Paragraph position="4">  In summary, our preliminary results indicate that: (1) our software was useful in teaching non-native speakers of English how to pronounce English words; (2) both &amp;quot;basic&amp;quot; pronunciation rules and some novel, proprietary pronunciation rules were useful for teaching word pronunciation.</Paragraph>
    </Section>
  </Section>
  <Section position="6" start_page="83" end_page="83" type="metho">
    <SectionTitle>
4 Future Research
</SectionTitle>
    <Paragraph position="0"> The major directions for our future research are to select and reformulate LTP rules that are useful for teaching and then to obtain more user data for nonnative speakers and for native-English-speaking children learning to read.</Paragraph>
    <Paragraph position="1"> First, we intend to produce a complete learner's rule system for English, based on the entire set of text-to-speech LTP rules. Since our text-to-speech system contains roughly 800 LTP rules, which we believe is too large a number for learners, a significant task is to reduce the number of rules. We intend to focus on rules that apply to large numbers of words and remove rules that apply to few words, relegating the words to which they apply to the exceptions dictionary. We also intend to attempt to collapse rules that apply in similar contexts.</Paragraph>
    <Paragraph position="2"> Second, we need to determine whether learners can, in general, understand a pronunciation system that requires rule ordering. In our system, some rules are labeled as &amp;quot;default&amp;quot; rules, meaning that they apply if no other rules apply. Consequently, learners must know all the rules in order to know when to apply the default rule. If the number of rules is too large, learners may need a system in which each rule is independently unambiguous.</Paragraph>
    <Paragraph position="3"> Third, we need to recruit more nonnative speakers of English who are good candidates for improving their word pronunciation skills. Some of our subjects in this experiment had relatively high word pronunciation scores before exposure to our lessons, so that observing any effect of our lessons was inherently limited. Therefore, we would like to recruit more subjects with lower word pronunciation ability, in order to get a better picture of the effectiveness of our system. Can we predict which students will have high pronunciation scores and which will have low scores based on a student's report of his or her experience in English  tion scores for each subject at the beginning of the exercises, before doing lessons. As shown, the reported length of time a student had studied English was not correlated with word pronunciation scores. Consequently, we cannot depend on a student's reported length of time studying English as predictive of his word pronunciation abilities. Instead, we will need to screen prospective subjects via pretesting.</Paragraph>
    <Paragraph position="4"> Finally, although we have developed our system for nonnative speakers, we would like to test our system with native-English-speaking children learning to read. However, it is likely that the user interface and corpus of exemplar words will need to be different for the child population.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML