File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/05/p05-3031_concl.xml
Size: 878 bytes
Last Modified: 2025-10-06 13:54:58
<?xml version="1.0" standalone="yes"?> <Paper uid="P05-3031"> <Title>Reformatting Web Documents via Header Trees</Title> <Section position="7" start_page="123" end_page="123" type="concl"> <SectionTitle> 5 Conclusions and Future Work </SectionTitle> <Paragraph position="0"> This paper proposed a method for reformatting web documents by extracting header trees that give hierarchical structures of web documents. Preliminary experiments showed that the proposed algorithm was effective compared with some baseline methods. However, the performance of the algorithm on some of the test documents was not sufficient for practical use. We plan to improve the performance by, for example, using larger amount of training examples. Finding other reformatting strategies in addition to the ones proposed in this paper is also important future work.</Paragraph> </Section> class="xml-element"></Paper>