File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/91/h91-1096_metho.xml
Size: 3,917 bytes
Last Modified: 2025-10-06 14:12:45
<?xml version="1.0" standalone="yes"?> <Paper uid="H91-1096"> <Title>A Robust Preprocessor for Speech-Recognition Systems</Title> <Section position="1" start_page="0" end_page="0" type="metho"> <SectionTitle> PROJECT GOALS </SectionTitle> <Paragraph position="0"> Signifion is designing a preprocessor chip for speech-recognition systems operating in noisy environments.</Paragraph> <Paragraph position="1"> Current preprocessors, such as those based on moving Fourier transforms or linear predictive coding, are linear and not effective when speech is embedded in high levels of noise.</Paragraph> <Paragraph position="2"> Specifically we are: * Developing a nonlinear transmission line speech preprocessor that accurately simulates the mechanics of the inner ear over the entire range of sound pressure levels found in speech.</Paragraph> <Paragraph position="3"> * Preprocessing speech with the nonlinear transmission line and demonstrating that there is a substantial improvement in the signal-to-noise ratio at the output taps of the line as compared with the output of conventional linear speech preprocessors.</Paragraph> <Paragraph position="4"> * Assessing the feasibility of creating a digital transmission line on a chip to use as a preprocessor in the CMU, BBN, or MIT DARPA funded speech recognition systems.</Paragraph> <Paragraph position="5"> laser&quot; that traps and amplifies quiet sounds, thereby aiding in their detection and analysis. This system was modeled as nonlinear transmission line with active circuit elements and negative feedback.</Paragraph> <Paragraph position="6"> * Developed a fourth-order Runge-Kutta method for solving the nonlinear transmission-line equation. * Solved the nonlinear transmission-line equation for a variety of simple nonlinearities using clicks and pure tones as inputs.</Paragraph> <Paragraph position="7"> * Determined the form of the nonlinearity by comparing solutions of the transmission-line equation with Rhode's corresponding data.</Paragraph> <Paragraph position="8"> * Showed that two frequencies interact with one another by affecting each other's region of amplification. Such two-tone suppression, the auditory analogue of visual lateral inhibition, leads to formant sharpening. * Developed a transmission-line parallel-processing algorithm for the Connection Machine. * Studied the feasilibility of creating digital VLSI chip preprocessors that emulate the nonlinear transmission line. Preliminary results are encouraging.</Paragraph> </Section> <Section position="2" start_page="0" end_page="0" type="metho"> <SectionTitle> RECENT RESULTS </SectionTitle> <Paragraph position="0"> * Deduced the mechanics of hearing at low sound pressure levels from measurements of basilar membrane motion. The mechanical properties of individual sections of the cochlea have been determined by examining their collective response to tones of different frequency.</Paragraph> <Paragraph position="1"> The conclusion is that each section acts like a harmonic oscillator with negative damping. The oscillator is controlled by negative feedback that drives the oscillator with a force proportional to the displacement of the oscillator at an earlier time. The time delay is approximately 1.75 times the oscillator's period. Thus, the inner ear is &quot;active,&quot; creating sound that interferes &quot;intelligently&quot; with the incoming sound.</Paragraph> <Paragraph position="2"> * Showed that the inner ear acts as a &quot;subthreshold</Paragraph> </Section> <Section position="3" start_page="0" end_page="422" type="metho"> <SectionTitle> PLANS FOR THE COMING YEAR </SectionTitle> <Paragraph position="0"> * Solve the nonlinear transmission line equations for noisy speech and provide a preliminary qualitative assessment of the signal processing capabilities of the nonlinear transmission line.</Paragraph> </Section> class="xml-element"></Paper>