File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/86/c86-1087_intro.xml

Size: 3,738 bytes

Last Modified: 2025-10-06 14:04:33

<?xml version="1.0" standalone="yes"?>
<Paper uid="C86-1087">
  <Title>PROCESSING CLINICAl NARRAI\]}VES IN IIUNGAR\]:AN G4bor PrGsz~.ky National Lduca Liona\] Library arid Museum Computer Oepar l:hqen</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
2. MORPIIOI.OGICAL ANALYSIS
</SectionTitle>
    <Paragraph position="0"> II/0 PSJrtit phase :i.,s the \[mrphoi!ogical ana\].vs;Js o\[' w\[~rd forms. Ilunflarian \]oa free word order language~ there:\[LiFe the I'o\]_e oJ: .~;u:\[f:i xes is very important #rom the viewpoJ.nL of ideM:i1:ying phrasal eons LJtuents. A lot o1: oynl;actJc and ~\](.mlanLJc inPSormaUon (iRJlnber, person, possessien, case, Lense, mood etc.) are car;.'\]ed by these e\]emenl;s. The cone~Yl:enwL\]oo of stem and suf1:Jxes J s someti.mes \]'atller complex: U/ere ace suffixes that have di1:fererYL stem--dependent 1:erms a\[id stem'.:\] that have different suffix-dependent forms.</Paragraph>
    <Paragraph position="1"> lhus the \]exJ.coH musL conLaJri a\]:l Lhe peso:iDle varianl:s o1: the sLems as ir/depenclenL enLc~os or we have I:o def\]fle an a\].!\]or\]thm feF collstrucLirlo I:hR real stem,.; \[rom Lhe arclfi.l~honemea o1.&amp;quot; Lhe \]ex:\]enn. We have eheuen the 1:ormer a\]/:ernaL~ve.</Paragraph>
    <Paragraph position="2"> Ihe :jex:ieoll consists of four parts but only concepLua\]\].y. From l:he point o\]: view of the a\].gorJthm, J'L J s an Jrltegra\] who l.e. The reasons why wo dJ.s-L:iHgu:ish :its parts are. as 1:ellows: (J) All the N\[ processing proorams o1: an aggluLinaL-J ve :\[aiigl.ta!\]e nlus L knew all_.1: :!.he g.~lElmaLical~jgr~hemes of l:ix.' \]anfluage.</Paragraph>
    <Paragraph position="3"> (\] \] ) File d:ictJ oI~ary nPS c onlmo~LxpressJuer\]s J s not i,~cPssary hut JL is a useful part of ai\] NI sysLems.</Paragraph>
    <Paragraph position="4"> lhis modu\]e can be eli\[a\]:'!jud by the user.</Paragraph>
    <Paragraph position="5"> \[ i I ) II1,. cib bd(J \[ I L.A El,till ~,\[Jiil,{\]\]~) I,IU\] {~ 0;: i (.,)~ \[\] ~ J I:he IexJna\]. elements l:haL is needed for Lhe acl:ua\] I:ype of applicaL:ion (I)\[I queryillg, updaLJno, informal:Joe exl:racl:Jon, Lrans\]al:Jon etc.).</Paragraph>
    <Paragraph position="6"> (iv) The ~peeial \]exi.con conl:ains terms ef l:he aeLua\] applLca\[Jon field ~ our case the \[ePS111s oPS medical scJ once). Ih:i s fllodu\] e Call, O\[ course, he en\] arced hy I;he user.</Paragraph>
    <Paragraph position="7"> After updatJn!\] the :lexicon, eni:ries w\]\]\] he arranged in a\]phahel:Jca\] ord{~r.</Paragraph>
    <Paragraph position="8"> The inorpho\]oqica} analyzer Js a f\]lqJ I;e stale aul:omates (F~ZTA-~?&amp;quot;I~\[.~:CGR~ ~-~r3-te hc ana\]yzed from the :inpul: sequence of words and searches the dictionary in order t\[\] find the :i.npuL werd. :\[f the \]eft part of the word matches a dietiunary entry, Ule enLry's JnformatJona\] iJari: must be cepJed I:o the worl&lt;\]ng buffer. lhe content of this buffer wJl\] be the input to l:he ,sy~rLactical analyzer. Then the automaton begins to work from right i:o \].e\[t. Its oul:put is the sequence of l:he Jnformatiena\] parts of i:he grammatical i/lOrpheme,s sLanding al:ter Lhe sLem w\[e ideITLi.fied a short while ago. \]:2 l:lle Jn\[ormatJon of t:he stem arld l:he suffixes are nol: compaUb\]e or there remained an unprooessed part in l:he word, the a\].go\].'il:l-m\] tries Lo ana\]yze the word as a compound once more and if this proce,as faJ\]s then J L asks the user what l:o do.</Paragraph>
    <Paragraph position="9"> FLoure 2 J.s an J\]\]ustratlon oPS this llrocess. (The &amp;quot;origin&amp;quot; of the erll:ries is marked by G, C, A and S, l:hat is (jrammal:Jcal, eommon, actual and special \].exJ-con, respeetive.ly. )</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML