File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/06/p06-2123_concl.xml
Size: 780 bytes
Last Modified: 2025-10-06 13:55:27
<?xml version="1.0" standalone="yes"?> <Paper uid="P06-2123"> <Title>Segmentation</Title> <Section position="6" start_page="966" end_page="967" type="concl"> <SectionTitle> 5 Conclusions </SectionTitle> <Paragraph position="0"> In this work, we proposed a subword-based IOB tagging method for Chinese word segmentation. The approach outperformed the character-based method using both the MaxEnt and CRF approaches. We also successfully employed the confidence measure to make a confidence-dependent word segmentation.</Paragraph> <Paragraph position="1"> By setting the confidence threshold, R-oov and R-iv can be changed accordingly. This approach is effective for performing desired segmentation based on users' requirements to R-oov and R-iv.</Paragraph> </Section> class="xml-element"></Paper>