File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/98/p98-2153_intro.xml

Size: 3,467 bytes

Last Modified: 2025-10-06 14:06:39

<?xml version="1.0" standalone="yes"?>
<Paper uid="P98-2153">
  <Title>Hypertext Authoring for Linking Relevant Segments of Related Instruction Manuals</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
1 Introduction
</SectionTitle>
    <Paragraph position="0"> In reading traditional paper based manuals, we should use their indices and table of contents in order to know where the contents we want to know are written. In fact, it is not an easy task especially for novices. Recent years, electronic manuals in a form of hypertext like Help of Microsoft Windows became widely used. Unfortunately it is very expensive to make a hypertext manual by hand especially in case of a large volume of manual which consists of several separated volumes. In a case of such a large manual, the same topic appears at several places in different volumes. One of them is an introductory explanation for a novice. Another is a precise explanation for an advanced user. It is very useful to jump from one of them to another of them directly by just clicking a button of mouse in reading a manual text on a browser like NetScape. This type of access is realized by linking them in hypertext format by hypertext authoring.</Paragraph>
    <Paragraph position="1"> Automatic hypertext authoring has been focused on in these years, and much work has been done. For instance, Basili et al. (1994) use document structures and semantic information by means of natural language processing technique to set hyperlinks on plain texts.</Paragraph>
    <Paragraph position="2"> The essential point in the research of automatic hypertext authoring is the way to find semantically relevant parts where each part is characterized by a number of key words. Actually it is very similar with information retrieval, IR henceforth, especially with the so called passage retrieval (Salton et al., 1993). J.Green (1996) does hypertext authoring of newspaper articles by word's lexical chains which are calculated using WordNet. Kurohashi et al. (1992) made a hypertext dictionary of the field of information science. They use linguistic patterns that are used for definition of terminology as well as thesaurus based on words' similarity. Furner-Hines and Willett (1994) experimentally evaluate and compare the performance of several human hyper linkers. In general, however, we have not yet paid enough attention to a full-automatic hyper linker system, that is what we pursue in this paper.</Paragraph>
    <Paragraph position="3"> The new ideas in our system are the following points:  1. Our target is a multi-volume manual that describes the same hardware or software but is different in their granularity of descriptions from volume to volume.</Paragraph>
    <Paragraph position="4"> 2. In our system, hyper links are set not between  an anchor word and a certain part of text but between two segments, where a segment is a smallest formal unit in document, like a subsubsection of ~TEX if no smaller units like subsubsubsection are used.</Paragraph>
    <Paragraph position="5"> 3. We find pairs of relevant segments over two volumes, for instance, between an introductory manual for novices and a reference manual for advanced level users about the same software or hardware.</Paragraph>
    <Paragraph position="6"> 4. We use not only tf.idf based vector space model but also words' co-occurrence information to measure the similarity between segments.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML