XML Viewer - n06-1055

File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/evalu/06/n06-1055_evalu.xml
Size: 2,454 bytes
Last Modified: 2025-10-06 13:59:40
<?xml version="1.0" standalone="yes"?>
<Paper uid="N06-1055">
  <Title>Semantic Role Labeling of Nominalized Predicates in Chinese</Title>
  <Section position="6" start_page="436" end_page="436" type="evalu">
    <SectionTitle>
5 Related Work
</SectionTitle>
    <Paragraph position="0"> Compared with large body of work on the SRL of verbal predicates, there has been relatively little work done in analyzing the predicate-argument structure of nominalized predicates. There are even less work done for the nominalized predicates for Chinese. (Hull and Comez, 1996) implemented a rule-based system for identifying the arguments for nominal predicates and (Lapata, 2002) has a system that interprets the relation between the head of noun compound and its head, but no meaningful comparison can be made between our work and theirs. Perhaps the closest work to that of ours is that of (Pradhan et al., 2004a), where they reported preliminary work for analyzing the predicate-argument structure of Chinese nominalizations, using a small data set of 630 proposition for 22 nominalizations taken from the Chinese Treebank. Since different data sets are used, the results cannot be meaningfully compared.</Paragraph>
    <Paragraph position="1"> The results reported here for nominalized predicates are consistent with what Xue and Palmer (2005) reported for the SRL of Chinese verbs with regard to the role of the parser in their semantic role labeling system: there is a substantial performance drop when the automatic parser is used. At present, improvement in Chinese parsing is hindered by insufficient training material. Although the Chinese Treebank has a decent size of 500K words, it is evenly divided into two portions of very different sources, Xinhua newswire from mainland China and Sinorama magazines from Taiwan. Due to their very different styles, training on one portion of the data does not help or even hurt the parsing accuracy of the other portion. The lack of sufficient training material is compounded by inherent properties of the Chinese language that makes Chinese parsing particularly difficult. Chinese segmentation is a much more difficult problem than tokenization of English text and Chinese words do not have morphological clues that can help parsing decisions. We believe further improvement in SRL accuracy will be to a large extent contingent on the parsing accuracy, which requires more training material.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML