File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/89/h89-2041_intro.xml
Size: 2,175 bytes
Last Modified: 2025-10-06 14:04:51
<?xml version="1.0" standalone="yes"?> <Paper uid="H89-2041"> <Title>TIED MIXTURES IN THE LINCOLN ROBUST CSR 1</Title> <Section position="3" start_page="0" end_page="293" type="intro"> <SectionTitle> INTRODUCTION </SectionTitle> <Paragraph position="0"> Single Gaussian per state speaker-dependent (SD) HMM recognizers and low-order Gaussian mixture per state speaker-independent (SI) HMM recognizers have been shown to work fairly well for 1000-word vocabulary, continuous speech recognition \[10,11\]. However, a SD system would require about 30,000 Gaussians to cover the word-internal triphones of English and a SI system would require at least 100,000. The strategy of one or more individual Gaussians per state is appropriate for small vocabulary systems, but becomes unwieldy for large vocabulary systems. Interpolation is often required to cluster models, smooth models, or to predict models which are not observed in training--but there is no clean strategy for interpolating independent Gaussian mixtures--either the mean(s) are changed or the mixture order increases each time another model is included into an interpolated model.</Paragraph> <Paragraph position="1"> Tied mixtures \[3,2,4\] offer a solution for these problems while retaining a basic continuous observation HMM system. (Gaussian tied mixtures are mixtures which share a common pool of Gaussians.) They are mixtures, and thus avoid the unimodal distribution limitation of single Gaussians. Unlike independent mixtures, they interpolate well by interpolating the weights of the corresponding Gaussians. And since the pool of Gaussians is of a given size, a mixture order cannot exceed this size. In effect, they form a middle ground between the histograms of discrete observation systems and non-tied-mixture systems. Tied mixtures can also be viewed as a discrete observation system modified to allow a simultaneous match to many templates with the degree of template match included. In contrast to the discrete observation system, there is no quantization error and the &quot;templates&quot;(Gaussians) can be jointly optimized with the rest of the HMM.</Paragraph> </Section> class="xml-element"></Paper>