File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/05/h05-1039_concl.xml
Size: 2,234 bytes
Last Modified: 2025-10-06 13:54:31
<?xml version="1.0" standalone="yes"?> <Paper uid="H05-1039"> <Title>Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP), pages 307-314, Vancouver, October 2005. c(c)2005 Association for Computational Linguistics Combining Deep Linguistics Analysis and Surface Pattern Learning: A Hybrid Approach to Chinese Definitional Question Answering</Title> <Section position="7" start_page="313" end_page="313" type="concl"> <SectionTitle> 6 Conclusions and Future Work </SectionTitle> <Paragraph position="0"> We have explored a hybrid approach for definitional question answering by combining deep linguistic analysis and surface pattern learning. For the first time, we have answered four questions regarding Chinese definitional QA: deep linguistic analysis and automatic pattern learning are complementary and may be combined; patterns are powerful in answering biographical questions; only a small amount of annotation (2 days) is required to obtain good performance in a biographical QA system; copulas and appositions are the most useful linguistic features; relation extraction also helps.</Paragraph> <Paragraph position="1"> Answering &quot;What-is&quot; questions is more challenging than answering &quot;Who-is&quot; questions. To improve the performance on &quot;What-is&quot; questions, we could divide &quot;What-is&quot; questions into finer classes such as organization, location, disease, and general substance, and process them specifically.</Paragraph> <Paragraph position="2"> Our current pattern matching is based on simple POS tagging which captures only limited syntactic information. We generalize words to their corresponding POS tags. Another possible improvement is to generalize using automatically derived word clusters, which provide semantic information.</Paragraph> <Paragraph position="3"> Acknowledgements This material is based upon work supported by the Advanced Research and Development Activity (ARDA) under Contract No. NBCHC040039. We are grateful to Linnea Micciulla for proof reading and three anonymous reviewers for suggestions on improving the paper.</Paragraph> </Section> class="xml-element"></Paper>