File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/03/w03-0605_abstr.xml
Size: 1,307 bytes
Last Modified: 2025-10-06 13:43:05
<?xml version="1.0" standalone="yes"?> <Paper uid="W03-0605"> <Title>An Architecture for Word Learning using Bidirectional Multimodal Structural Alignment</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> Learning of new words is assisted by contextual information. This context can come in several forms, including observations in non-linguistic semantic domains, as well as the linguistic context in which the new word was presented. We outline a general architecture for word learning, in which structural alignment coordinates this contextual information in order to restrict the possible interpretations of unknown words. We identify spatial relations as an applicable semantic domain, and describe a system-in-progress for implementing the general architecture using video sequences as our non-linguistic input. For example, when the complete system is presented with &quot;The bird dove to the rock,&quot; with a video sequence of a bird flying from a tree to a rock, and with the meanings for all the words except the preposition &quot;to,&quot; the system will register the unknown &quot;to&quot; with the corresponding aspect of the bird's trajectory.</Paragraph> </Section> class="xml-element"></Paper>