File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/06/w06-2206_abstr.xml
Size: 1,094 bytes
Last Modified: 2025-10-06 13:45:29
<?xml version="1.0" standalone="yes"?> <Paper uid="W06-2206"> <Title>Spotting the 'Odd-one-out': Data-Driven Error Detection and Correction in Textual Databases</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> We present two methods for semi-automatic detection and correction of errors in textual databases. The first method (horizontal correction) aims at correcting inconsistent values within a database record, while the second (vertical correction) focuses on values which were entered in the wrong column. Both methods are data-driven and language-independent.</Paragraph> <Paragraph position="1"> We utilise supervised machine learning, but the training data is obtained automatically from the database; no manual annotation is required. Our experiments show that a significant proportion of errors can be detected by the two methods. Furthermore, both methods were found to lead to a precision that is high enough to make semi-automatic error correction feasible.</Paragraph> </Section> class="xml-element"></Paper>