File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/96/c96-2136_abstr.xml
Size: 913 bytes
Last Modified: 2025-10-06 13:48:36
<?xml version="1.0" standalone="yes"?> <Paper uid="C96-2136"> <Title>Context-Based Spelling Correction for Japanese OCR</Title> <Section position="2" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> We present a novel spelling correction method for those languages that have no delimiter between words, such as Japanese, Chinese, and Thai. It consists of an approximate word matching method and an N-best word segmentation algorithm using a statistical language model. For OCR errors, the proposed word-based correction method outperforms the conventional character-based correction method. When the baseline character recognition accuracy is 90%, it achieves 96.0% character recognition accuracy and 96.3% word segmentation accuracy, while the character recognition accuracy of character-based correction is</Paragraph> </Section> </Paper>