File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/04/n04-4011_abstr.xml

Size: 986 bytes

Last Modified: 2025-10-06 13:43:31

<?xml version="1.0" standalone="yes"?>
<Paper uid="N04-4011">
  <Title>Performance Evaluation and Error Analysis for Multimodal Reference Resolution in a Conversation System</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> Multimodal reference resolution is a process that automatically identifies what users refer to during multimodal human-machine conversation. Given the substantial work on multimodal reference resolution; it is important to evaluate the current state of the art, understand the limitations, and identify directions for future improvement. We conducted a series of user studies to evaluate the capability of reference resolution in a multimodal conversation system. This paper analyzes the main error sources during real-time human-machine interaction and presents key strategies for designing robust multimodal reference resolution algorithms.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML