File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/93/m93-1018_concl.xml
Size: 1,471 bytes
Last Modified: 2025-10-06 13:57:04
<?xml version="1.0" standalone="yes"?> <Paper uid="M93-1018"> <Title>Japanese ERR UND OVG SUB REC PR E ALL OBJECT S MATCHED ONLY TEXT FILTERING</Title> <Section position="6" start_page="214" end_page="214" type="concl"> <SectionTitle> CONCLUSION </SectionTitle> <Paragraph position="0"> Overall, the data-driven architecture in SOLOMON allowed for minimum work on processing modules whe n working on different languages and domains. We ported the system to Spanish in a week for the demonstration given, at the MUC-5 conference .</Paragraph> <Paragraph position="1"> Although we successfully acquired large amounts of domain data from domain texts in both languages , using both statistical methods and newly developed user-friendly knowledge acquisition tools, we recogniz e the need to move even more quickly to new domains and languages . We plan to continue our work on automatic acquisition of lexicons, knowledge bases, and links between them in multiple languages .</Paragraph> <Paragraph position="2"> Tuning performance of each module (e.g. parsing, discourse analysis) as well as the' performance o f the whole system to a particular task more rapidly is another research issue we identified . We believe that developing automatic evaluation and training algorithms for such automated module/system tuning is crucial to develop a data extraction system that produces optimal results .</Paragraph> <Paragraph position="3"> 21 5</Paragraph> </Section> class="xml-element"></Paper>