File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/06/n06-2034_intro.xml
Size: 1,396 bytes
Last Modified: 2025-10-06 14:03:32
<?xml version="1.0" standalone="yes"?> <Paper uid="N06-2034"> <Title>Using Phrasal Patterns to Identify Discourse Relations</Title> <Section position="2" start_page="0" end_page="0" type="intro"> <SectionTitle> 1 Introduction </SectionTitle> <Paragraph position="0"> Identifying discourse relations is important for many applications, such as text/conversation understanding, single/multi-document summarization and question answering. (Marcu and Echihabi 2002) proposed a method to identify discourse relations between text segments using Naive Bayes classifiers trained on a huge corpus.</Paragraph> <Paragraph position="1"> They showed that lexical pair information extracted from massive amounts of data can have a major impact.</Paragraph> <Paragraph position="2"> We developed a system which identifies the discourse relation between two successive sentences in Japanese. On top of the lexical information previously proposed, we added phrasal pattern information. A phrasal pattern includes at least three phrases (bunsetsu segments) from two sentences, where function words are mandatory and content words are optional. For example, if the first sentence is &quot;X should have done Y&quot; and the second sentence is &quot;A did B&quot;, then we found it very likely that the discourse relation is CONTRAST (89% in our Japanese corpus).</Paragraph> </Section> class="xml-element"></Paper>