File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/83/e83-1006_intro.xml
Size: 2,632 bytes
Last Modified: 2025-10-06 14:04:22
<?xml version="1.0" standalone="yes"?> <Paper uid="E83-1006"> <Title>at Urbana-Champalgn English as a Second Language Foreign Language Building</Title> <Section position="2" start_page="0" end_page="0" type="intro"> <SectionTitle> ABSTRACT </SectionTitle> <Paragraph position="0"> The speech synthesis group at the Computer-Based Education Research Laboratory (CERL) of the University of Illinois at Urbana-Champalgn is developing a diphone speech synthesis system based on pltch-adaptive short-tlme Fourier transforms.</Paragraph> <Paragraph position="1"> This system accepts the phonemic specification of an utterance along with pitch, time, and amplitude warping functions in order to produce high quality speech output from stored dlphone templates.</Paragraph> <Paragraph position="2"> This paper describes the operation of a program which operates as a front end for the dlphone speech synthesis system. The UTTER (for &quot;Unmarked Text Transcription by Expert Rule&quot;) system maps English text onto a phoneme string, which is then used as an input to the dlphone speech synthesis system. The program is a two-tiered Expert System which operates first on the word level and then on the (vowel or consonant) cluster level. The system's knowledge about pronunciation is organized in two decision trees automatically generated by an induction algorithm on a dynamically specified &quot;training set&quot; of examples.</Paragraph> <Paragraph position="3"> in that they are often unable to cope with a letter pattern that maps onto more than one phoneme pattern. Extreme cases are those words which, although differing in pronunciation, share orthographic representations (an analogous problem exists in speech recognition, where words which share phonemic representations differ in orthographic representation, and therefore possibly in semantic interpretation). A notable exception is the MIT speech synthesis system fAllen81\] which is llngulstlcally-based, but not solely phoneme-based.</Paragraph> <Paragraph position="4"> A desirable feature in any rule-based system is the ability to automatically acquire or modify its own rules. Previous work \[Oakey81\] applies this automatic inference process to the text-to-phoneme transcription problem. Unfortunately, Onkey's system is strlctly letter-based and suffers from the same deficiencies as other nonilnguistlcally-based systems.</Paragraph> <Paragraph position="5"> The UTTER system is an attempt to provide a llngulstlcally-based transcription system which has the ability to automatically acquire its own rule base.</Paragraph> </Section> class="xml-element"></Paper>