File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/94/a94-1023_metho.xml

Size: 24,890 bytes

Last Modified: 2025-10-06 14:13:35

<?xml version="1.0" standalone="yes"?>
<Paper uid="A94-1023">
  <Title>Understanding Location Descriptions in the LEI System</Title>
  <Section position="4" start_page="0" end_page="0" type="metho">
    <SectionTitle>
2 Previous Research
</SectionTitle>
    <Paragraph position="0"> Talmy \[1983\], Herskovits \[1986\], and Andr6 etal. \[1986\] among others have documented the many problems in interpreting and using spatial prepositions. For example, in and on have similar but different meanings: &amp;quot;in the car&amp;quot; means within the car, while &amp;quot;on the car,&amp;quot; means on top of the car. However, &amp;quot;on the bus/plane,&amp;quot; means within the bus or plane. Also, each preposition typically has several different meanings or usages. For example, one says &amp;quot;at home,&amp;quot; but &amp;quot;at the bank,&amp;quot; and the meaning of &amp;quot;the plane is at Honolulu airport,&amp;quot; is within the area of Honolulu airport, but the meaning of &amp;quot;the dog is at the telephone pole&amp;quot; is not within the telephone pole, but near it. These context dependent usages make interpretation and application of spatial prepositions problematic.</Paragraph>
    <Paragraph position="1"> Kuipers \[1985\], Davis \[1986\], Peuquet and Ci-Xiang \[1987\], and Frank \[1991\] have investigated qualitative and/or quantitative reasoning techniques for dealing with spatial relations. Freeman \[1975\] and Mark and Frank \[1991\] have identified commonly used spatial relations.</Paragraph>
  </Section>
  <Section position="5" start_page="0" end_page="138" type="metho">
    <SectionTitle>
3 Characteristics of the Domain
</SectionTitle>
    <Paragraph position="0"> Although general purpose natural language processing (NLP) is beyond current state-of-the-art, limited domains have frequently been amenable to NLP using specific techniques because the domains use a &amp;quot;sublanguage,&amp;quot; a fairly restricted subset of a general natural language, which may have its own syntax and peculiarities. In this case, an analysis of one thousand three hundred and forty sample location descriptions from the Bishop Museum's Herbarium Pacificum collection (accumulated by about two hundred different collectors over a period of 160 years) shows a highly restricted use of language that is amenable to understanding using specialized techniques.</Paragraph>
    <Paragraph position="1"> Because these descriptions are meant to be read later by a reader who is not at the site, they contain very few observer relative descriptions (e.g., behind). Also, there  are limits to the scale of the descriptions. For example, out of a thousand descriptions that were located manually, about half of the descriptions were judged to be accurate to within 1/3 of a mile and 73% accurate to within 1 mile. At the other end, there were no descriptions with accuracy in the meter range and the best descriptions were only accurate to within several tens of meters.</Paragraph>
    <Paragraph position="2"> A typical location description is: &amp;quot;Punaluu Valley; Castle Trail from Punaluu to Kaluanui Valley + stream, on E. side of Northern Koolaus.&amp;quot; Associated accession information typically includes the date, collector name(s), genus, elevation, the museum's collection number, and the collector's accession number. This sublanguage is made up mostly of named objects (e.g., &amp;quot;Punaluu Valley&amp;quot; and &amp;quot;Castle Trail&amp;quot;) and prepositional phrases (e.g., from Punaluu to Kaluanui Valley + stream, on E.</Paragraph>
    <Paragraph position="3"> side of Northern Koolaus). The relation of the collection location to the unmodified named objects is almost always &amp;quot;within,&amp;quot; that is, the collection location is within the area designated by the named geographic object. The interpretation of the prepositions is somewhat simpler than the general case because the sublanguage deals only with a two dimensional cartographic space supplemented by elevation markings. This sublanguage is relatively simple syntactically, but there are still many problems for automatic interpretation of the sublanguage.</Paragraph>
    <Paragraph position="4"> One of the most common problems in interpreting this sublanguage is the inconsistent use of names. For example, Waikane-Schofield Trail also appears as Schofield-Waikane Trail, Schofield-Waikane Ditch Trail, Schofield Trail, W-S Ditch Trail, Schofield Waikane Trail, and Waikane Ditch Trail. A mountain like Kaala may be referred to as Mt Kaala, Mt. Kaala, Kaala Mrs., Kaala Mountain(s), Mount Kaala, Kaala Puu (puu is the Hawaiian word for mountain), Kaala Summit, or Kaala Range.</Paragraph>
    <Paragraph position="5"> Another problem is that names are often not unique. For example, Manoa is the name for Manoa Falls, Manoa Valley, Manoa Valley Park, Manoa Triangle Park, Manoa Stream, Manoa Elementary School, Manoa Japanese Language School, Manoa Tunnel, and Manoa Falls.</Paragraph>
    <Paragraph position="6"> When Manoa appears by itself, which Manoa is meant is usually clear from the context. In many cases, the same name is even used for the same type of object (e.g., many cities have Elm Streets and Main Streets). Luckily, similar objects with shared names tend to be geographically separated, otherwise confusion would result. Another very frequent problem is missing names. Collectors often use generic terms like stream and gulch to refer to landmarks that do have names. As before, the context is usually enough to find the correct object reference even without knowing the name. This heavy reliance on context for disambiguation is also a frequent problem for general purpose NLP systems.</Paragraph>
    <Paragraph position="7"> A difficult problem is the interpretation of ordinal numberings, which are used to differentiate forks of streams and branches of valleys. For example, &amp;quot;3rd branch S of S fork of Kahanaiki Stream,&amp;quot; refers to the third branch after the main South fork of Kahanaiki Stream counting from the head of the stream. Unfortunately, this description could also refer to the 3rd branch following the path of the collector going up the stream.</Paragraph>
    <Paragraph position="8"> Similarly, &amp;quot;Honolulu Valley, 4th small gulch,&amp;quot; refers to 4th small gulch counting from the open head of the valley, although this could easily be interpreted as the 4th small gulch along some trail that might enter the valley from some pass over the mountains at the tail end of the valley. Another problem is the occasional use of land coverage types such as &amp;quot;middle Metrosideros forest,&amp;quot; &amp;quot;wooded gulch,&amp;quot; and &amp;quot;Fern Forest.&amp;quot; Not only do most geographical databases lack land coverage information, but such information changes frequently over time. Also, descriptions sometimes refer to rainfall frequency, sun exposure or other ephemeral attributes of the area: &amp;quot;in dryish forest,&amp;quot; &amp;quot;wet valley,&amp;quot; &amp;quot;deep shade in wet gulch,&amp;quot; and &amp;quot;shady hillside.&amp;quot; Even after converting the location descriptions into the appropriate spatial relations, there are still many problems in the correct interpretation of the relations. For example, &amp;quot;along a stream&amp;quot; does not mean that the collection site was in the stream, but within some distance of the stream. The problem is what exactly is the value of that distance. Even cardinal directions like &amp;quot;north of&amp;quot; are fuzzy concepts. The region north of a point can be bounded by two vectors pointing NE and NW (the triangular model), but this model fails when computing north of an object that is elongated in the E-W direction. Some spatial relations like &amp;quot;in front of, .... behind,&amp;quot; and &amp;quot;beyond&amp;quot; are relative to the observer's direction. Although these are not very frequent (only 25 cases in the 1340 sample descriptions), they still appear. Other spatial relations like &amp;quot;above&amp;quot; are dependent on understanding the slope of elevation around the object.</Paragraph>
    <Paragraph position="9"> To solve some of these problems, we have developed and implemented the LEI system to partially automate interpretation of this sublanguage. LEI is described in the following section.</Paragraph>
  </Section>
  <Section position="6" start_page="138" end_page="142" type="metho">
    <SectionTitle>
4 The LEI System
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="138" end_page="139" type="sub_section">
      <SectionTitle>
4.1 Organization
</SectionTitle>
      <Paragraph position="0"> The LEI 3 (Location and Elevation Interpreter) system is an implementation of our algorithms for interpreting the sublanguage of location description labels for biological specimens. LEI is composed of four main components: the language analyzer PPI, the geographical reasoner GR, the user interface LEIview, and the geographic knowledge base GKB. The geographic knowledge base contains an object-oriented description of geographical objects such as valleys, streams, and waterfalls with their associated locations and names. The user interface displays maps and allows users to add or modify object locations. The language analyzer parses the English loca-tion description and produces a collection of spatial relations that relate the actual collection point to geograph- null and their associated names from the geographic knowledge base. The geographical reasoner translates spatial relations from the language analyzer into polygons and performs polygon intersection calculations to obtain the area specified by the spatial relations. Each component is described in detail below.</Paragraph>
      <Paragraph position="1"> 4.2 GKB, the Geographic Knowledge Base LEI uses three U.S. Geological Survey (USGS) digital cartographic databases as the starting point for GKB, the Geographic Knowledge Base. These include the DLG (Digital Line Graph), GNIS (Geographic Name Information System), and DEM (Digital Elevation Model). Unfortunately, these databases are not object-oriented, that is, they do not link the names in GNIS to the object locations in DLG. The GNIS database contains only names, USGS quadrangles, a feature class, and the coordinates of the name as it appears on a USGS map. The DLG database contains a hierarchical organization of points, line segments (composed of points), and areas (composed of line segments) along with a two-level type hierarchy composed of major and minor codes. Unfortunately, the 60 plus GNIS feature classes do not correspond to the over 200 DLG major and minor codes. The DEM database consists of a raster style set of elevation values. To convert the three USGS databases into a useful object-oriented format requires human intervention to associate the names with the line and area objects (point objects are already correctly located by the GNIS coordinates). This process uses the LEIview component to display the appropriate section of the map to the user, then displays names one by one along with the geographically closest object of the same type as the name. The user can confirm the match, ask for alternatives, or modify the set of line segments or areas to give the actual location of the named object. In many cases, there are no corresponding objects, so the user must draw the outline of the area from scratch. This is required for all valleys and mountains since these are missing from the DLG specifications.</Paragraph>
      <Paragraph position="2"> Determining the closest object of the same type requires matching the GNIS feature class to appropriate DLG major and minor codes. This is done using LEI's own type hierarchy that includes type classes corresponding to each GNIS feature class and to each DLG major and minor code along with many bridging type classes and higher level types. Given a GNIS feature class, LEI first indexes into LEI's type hierarchy to find the corresponding LEI type. If this type has a corresponding DLG code, then that is the most likely match. Less likely matches consist of any subtypes that might have associated DLG codes. If there are no DLG codes at this type level nor at subtype levels, then LEI searches up the hierarchy for supertypes that have associated DLG codes.</Paragraph>
      <Paragraph position="3"> Using this algorithm, the matching process manages to find the correct match most of the time, so the user's time is freed to worry about the many missing entries and errors in USGS databases (e.g., rivers that extend into what should be coastlines, disconnected lines, etc.).</Paragraph>
    </Section>
    <Section position="2" start_page="139" end_page="139" type="sub_section">
      <SectionTitle>
4.3 LEIview, the User Interface
</SectionTitle>
      <Paragraph position="0"> The LEIview component provides a graphical interface that allows users to view maps; zoom; scroll; rearrange, add, and delete layers of the map (including DLG lines, GNIS names, and DEM elevations); search for named objects; enter points, line segments, or areas for new objects; modify existing objects; and view the results of interpreting location descriptions (both the English description and the area resulting from processing are displayed). LEIview is written in C under X windows with Motif widgets.</Paragraph>
      <Paragraph position="1"> LEIview is used to associate names with object locations in building the GKB geographic knowledge base. It is also used to display the results of interpreting location descriptions. When there are sections of the description that are not comprehensible to the PPI language analyzer, LEI sends the description to LEIview, which displays the description with the incomprehensible parts highlighted and displays the regions corresponding to the understood portions of the description. The user can tell LEI to ignore the unknown parts of the description, delay processing this description until later, send the description back for reprocessing, or add new geographic objects by entering new points, line segments, or areas and selecting the corresponding words in the description. Any new objects are stored in GKB and the correspondence between the words and the new object are stored in the PPI language knowledge base.</Paragraph>
    </Section>
    <Section position="3" start_page="139" end_page="140" type="sub_section">
      <SectionTitle>
4.4 PPI, the Language Analyzer
</SectionTitle>
      <Paragraph position="0"> The PPI (Prepositional Phrase Interpreter) component is responsible for parsing the natural language location descriptions and converting them into spatial relations. PPI uses the PAU 4 parser and understander \[Chin, 1992\] to interpret the English descriptions and convert them into spatial relations represented in the MERA (Meta Entity Relation Attribute) semantic-network-style knowledge representation language \[Takeda et al., 1992\]. PAU is an all-paths, chart-based, unification parser that completely integrates syntactic and semantic processing.</Paragraph>
      <Paragraph position="1"> Figure 1 shows the MERA graph for the grammar rule, PP ~-- Prep NP (i.e., a Prepositional-Phrase is a Preposition followed by Noun-Phrase), along with its semantic interpretation. The node PP-pat represents the left-hand-side of the rule, and the relations Pea (pattern component A) and Pcb (pattern component B) point to the components on the right-hand-side of the rule. The Ref relation denotes the meaning of the rule: a Geographic-object that has a Spatial-relation to some other Geographic-object.</Paragraph>
      <Paragraph position="2"> The Unify relation between the Prep and the Spatial-relation indicates that the meaning of the Prep should be unified with the relation, Spatial-relation. Likewise, the Unify relation between the NP and the lower Geographic-object indicates that the meaning of the NP should be unified with the lower Geographic-object Figure 2 speci- null The interpretation of the PP &amp;quot;along Ainapo&amp;quot; demonstrates how the integration of syntactic and semantic processing in PAU allows the early use of semantic constraints to reject semantically anomalous parses. &amp;quot;Along Ainapo&amp;quot; is ambiguous because Ainapo is both a trail and an area. However, since &amp;quot;along&amp;quot; only applies to linear objects such as trails, the Ainapo area interpretation is rejected by PAU. This happens when PAU is applying the grammar rule of Figure 1. When unifying the meaning of the Prep &amp;quot;along&amp;quot; (shown in Figure 2) with the Spatial-relation, the result is a Near relation. However, the sources and sinks of both relations must also be unified. This changes the lower Geographic-object into a Linear-object as seen in Figure 3, which shows the state of the &amp;quot;PP ~-- Prep NP&amp;quot; rule just before parsing &amp;quot;Ainapo.&amp;quot; In PAU, both meanings of Ainapo are tried in parallel.</Paragraph>
      <Paragraph position="3"> The area meaning of Ainapo is rejected because an Areaobject cannot unify with a Linear-object. This leaves only the Ainapo trail meaning to parse successfully.</Paragraph>
      <Paragraph position="4"> Table 1 shows the spatial relations in PPI along with the corresponding prepositions.</Paragraph>
    </Section>
    <Section position="4" start_page="140" end_page="141" type="sub_section">
      <SectionTitle>
4.5 GR, the Geographical Reasoner
</SectionTitle>
      <Paragraph position="0"> The GR (Geographical Reasoner) component takes the output from the PPI component, which is a set of spatial  relations, converts these into polygons, performs polygon overlay operations to find the common intersection of all the polygons, computes the center of the minimum bounding rectangle (mbr) of the polygon intersection, then returns the coordinates and centroid of the mbr. GR like PPI is written in Common LISP and converses with LEIview through UNIX sockets.</Paragraph>
      <Paragraph position="1"> The first step is the most difficult since there are no generally accepted algorithms for converting spatial relations into areas. For the spatial relations based on cardinal directions such as East-of, Peuquet and Zhan (1987) give a complex algorithm for determining if one polygon is in a particular directional relationship with another. Their algorithm takes into account the shapes of the polygons (e.g., east of an elongated N-S polygon covers a different area than east of a small point polygon) and considers polygons that partially enclose or intertwine one another. Their algorithm is a refinement of the basic triangular model (in which North is the open-ended triangular region between two vectors pointing NE and NW), but it still does not give any limits concerning the distance between the polygons. Unfortunately there is no absolute distance that forms the edge of the region North-of some polygon. In a sense, the edge is given by the limit of human sight in that direction.</Paragraph>
      <Paragraph position="2"> The algorithm currently used in GR for interpreting cardinal-direction relations around an geographic object starts by computing the minimum bounding rectangle (mbr) for the object. The area next to the mbr with the same size as the mbr is taken as the meaning of the spatial relation. Since the resultant area is the same size as the original object, this makes the meaning of cardinal directions relative to the size of the reference object, taking into account the fact that larger objects are visible from farther away. Cardinal directions relative to point objects are interpreted as a square, 500 meters on a side, lying in the appropriate direction.</Paragraph>
      <Paragraph position="3"> The observer or object oriented relations such as Adjacent-to (&amp;quot;beside waterfall,&amp;quot; &amp;quot;on Kona-Hilo Hwy&amp;quot;), Beyond (&amp;quot;1 1/2 mile beyond end of 20 Mile Road,&amp;quot; &amp;quot;at back of Waihoi Valley&amp;quot;), Front-of (no examples in the sample data), Right-of (&amp;quot;right hand side of Kupu Kai Gap&amp;quot;), and Left-of (&amp;quot;Kulani Prison Road, toward Kulani Prison from intersection w/Volcano Road, left road-</Paragraph>
    </Section>
    <Section position="5" start_page="141" end_page="142" type="sub_section">
      <SectionTitle>
Spatial-relation Prepositions
</SectionTitle>
      <Paragraph position="0"> Adjacent-to adjacent to, beside, next to, on At-elevation above, at, below, down, up  side&amp;quot;) require understanding the orientation of the object or observer. Currently in GR, only object-oriented relations are processed. Given an object with a front, back, and sides (left and right), the corresponding relations are Front-of, Beyond, and Adjacent-to (Left-of and Rightof). These areas are calculated from the object's mbr in a similar fashion to the cardinal direction relations.</Paragraph>
      <Paragraph position="1"> The At-elevation relation with respect to a given altitude requires computing the subregion of the common intersection area that is within 40 meters 5 of the given elevation. The At-elevation relation relative to an object (e.g., &amp;quot;above Schofield&amp;quot;) requires computing the prevalent slope of the terrain around the object. GR takes a 200 meter square on the up/down side of the object.</Paragraph>
      <Paragraph position="2"> Between, From, and Toward are handled by taking the mbr of the two objects, then computing the two corner points on each mbr that is furthest on either side from the line connecting the centers of the mbrs. These four points are then connected to form the area between the two objects.</Paragraph>
      <Paragraph position="3"> The Near relation is converted into a buffer zone around the area. Currently GR uses a fixed distance of 200 meters for simplicity, however further study is needed to determine if this corresponds to most people's interpretation. There may be individual, cultural, or regional differences in interpretation. Also, the size of the buffer zone may depend on the size of the geographic object. null Currently GR does not handle references to terrain type, wetness, or typical sun exposure because this type of data is not available in the USGS databases. References to ordinal forks and branches are assumed to start from the head of the rivers or valleys. Generic terms are handled after processing all other spatial relations by exhaustively searching for any instances of the same type  (or subtypes) that intersect with the intersection of the other known areas. In cases of multiple matches, the user is asked to help disambiguate through LEIview.</Paragraph>
    </Section>
  </Section>
  <Section position="7" start_page="142" end_page="142" type="metho">
    <SectionTitle>
5 Future Directions
</SectionTitle>
    <Paragraph position="0"> Because collectors often collect specimens on trips (either day hikes or multi-day camping expeditions), an analysis of the path of the collectors should yield valuable information about the location of collection. Specimens are typically labeled with the collection date and the collector's accession number, which provides the relative time of collection for specimens on that day. This information can be used to disambiguate location descriptions and to pinpoint vague locations. For example, in Hawaii, there are not only two Waihee Streams, but also a Waihee River. In a location description that mentions, &amp;quot;along Waihee Stream,&amp;quot; there is ambiguity as to which of these three waterways is actually meant. In the current version of LEI, disambiguation is possible only if the description contains more information that specifies an area that intersects with only one of the three streams. By adding reasoning about time using accession dates and numbers and combining this with reasoning about paths, LEI could determine that it is unlikely that the collector stopped collecting specimens along one Waihee Stream, flew to another island to collect a specimen along another Waihee Stream, then flew back to continue collecting along the first Waihee Stream.</Paragraph>
    <Paragraph position="1"> This type of reasoning can also help to pinpoint which part of Waihee Stream is meant by &amp;quot;along Waihee Stream.&amp;quot; If LEI knows that the previous specimen was collected at point A and the following specimen was collected at point B, then LEI can make the reasonable assumption that this specimen was collected somewhere near the intersection of Waihee Stream and a region between points A and B. Using this type of reasoning, LEI can even make a reasonable guess about the collection location of specimens that have no location labels (provided only that they have an accession number and accession date given by the collector and the adjacent specimen numbers can be located). Adding such reasoning about time and paths would improve the accuracy of LEI's processing. null</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML