File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/96/c96-2154_intro.xml
Size: 1,393 bytes
Last Modified: 2025-10-06 14:06:06
<?xml version="1.0" standalone="yes"?> <Paper uid="C96-2154"> <Title>Modeling Topic Coherence for Speech Recognition</Title> <Section position="3" start_page="913" end_page="913" type="intro"> <SectionTitle> 2 Speech Recognition System </SectionTitle> <Paragraph position="0"> This research is being done in collaboration wittl SRI, which is providing the base of the combined speech recognition system. (Digalakis et.al., 1995). We use the N-best hypotheses produced by the Sill system, alon G with their acoustic and language model scores. There are two acoustic scores and four language scores. Language scores are namely the word trigram model, two kinds of part of speech 5-gram model and the number of tokens. Note that none of their language models take long-range dependencies into account. We combine these scores with the score produced by our sublanguage (:omponent an(1 our cache inodel score, and then select the hypothesis with the, highest combined score as the output of our system. The system structure is shown in Figure 1.</Paragraph> <Paragraph position="1"> The relative weights of the eight scores are determined by an optimization procedure on a training data set, which was produced under the same conditions as our evaluation data set, trot has no overlap with tile evaluation data set. The actual conditions will be presented later.</Paragraph> </Section> class="xml-element"></Paper>