File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/04/n04-4011_abstr.xml
Size: 986 bytes
Last Modified: 2025-10-06 13:43:31
<?xml version="1.0" standalone="yes"?> <Paper uid="N04-4011"> <Title>Performance Evaluation and Error Analysis for Multimodal Reference Resolution in a Conversation System</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> Multimodal reference resolution is a process that automatically identifies what users refer to during multimodal human-machine conversation. Given the substantial work on multimodal reference resolution; it is important to evaluate the current state of the art, understand the limitations, and identify directions for future improvement. We conducted a series of user studies to evaluate the capability of reference resolution in a multimodal conversation system. This paper analyzes the main error sources during real-time human-machine interaction and presents key strategies for designing robust multimodal reference resolution algorithms.</Paragraph> </Section> class="xml-element"></Paper>