WORD AND OBJECT IN DISEASE DESCRIPTIONS* 
M.S. Blois, D.D. Sherertz, M.S. Tuttle 
Section on Medical Information Science 
University of Calif(rnia, San Francisco 
Experiments were conducted on a book, Current Medical 
Information and Terminolog~, (AMA, Chicago, 1971, edited 
by Burgess Gordon, M.D.), which is a compendium of 3262 
diseases, each of which is defined by a collection of 
attributes. The original purpose of the book was to 
introduce a standard nomenclature of disease names, and 
the attributes are organized in conventional medical 
form: a definition consists of a brief description of 
the relevant symptoms, signs, laboratory findings, and 
the like. Each disease is, in addition, assigned to one 
(or at most two) of eleven disease categories which en- 
umerate physiological systems (skin, respiratory, card- 
iovascular, etc.). While the editorial style of the 
book is highly telegraphic, with many attributes being 
expressed as single words, it is nevertheless easily 
readable (see Figure i). 
The vocabulary employed consists of about 19,000 distinct 
"words" (determined by a lexical definition), roughly 
divided equally between common English words and medical 
terms. We measured word frequency by "disease occur- 
rence", (the number of disease definitions in which a 
given word occurs one or more times). By this measure, 
only seven words occurred in more than half the disease 
definitions, and about 40% of the vocabulary occurred in 
only a single disease definition. (Table i lists the 
words at the top of the frequency list together with the 
number of occurrences.) 
Assisted by the facilities of the TMuNIX operating sys- 
tem, we created a series of inverted files (from a 
magnetic tape of the CMIT text), and developed a set of 
interactive programs to form a word-and-context query 
system. This system has enabled us to study the problem 
of inferring term reference in this large sample of text 
(some 333,000 word occurrences), within the context of 
diseases. 
An interesting early result was the ease with which many 
medical terms could be algorithmically separated from 
co~on English words. After adjusting for the fact that 
some disease categories are larger than others, we de- 
fined an entropy-like measure of the distribution of 
word occurrences over the eleven physiological categor- 
ies as a measure of category specificity. We reasoned 
that some medical terms such as 'murmur', while not 
specific to any particular heart disease, are specific 
to heart disease generally. This term would not, for 
example, be used in describing endocrine disorders. 
Such a word would be expected to occur in category 04 
(cardiovascular disease) frequently, and not in the 
other categories. Such a term would, by our measure, 
have a low 'entropy'. A com~non English word like 'of', 
would be used in the descriptions of all kinds of dis- 
ease, and would accordingly have a high 'entropy'. 
Tables 2 and 3 show the top and bottom of the list of 
all words occurring in two or more diseases sorted by 
this entropy measure. In these lists, as our hypothesis 
seems to imply, low 'entropy' corresponds to high 
'specificity', and high 'entropy' to low 'specificity'. 
This separation of medical terms from common English 
words, by algorithmic means, is facilitated by the 
context supplied by the notion of 'disease category', 
and the fact that this was represented in the CMIT text. 
* This work was supported in part by grants from The 
Commonwealth Fund, and from the National Library of 
Medicine (i KI0 LM00014). 
Our second experiment investigated the co-occurrence 
properties of some medical terms. Aware that many medi- 
cal diagnostic programs have assumed attribute independ- 
ence, we sought to shed light on the appropriateness of 
the assumption by evaluating it in terms of word co- 
occurrence in disease definitions. 
Since the previously described procedure had given us a 
means of selecting medical terms from common English 
words, it was possible to produce lists of 'pure' 
medical terms. We then wrote a program which formed all 
pairs of such terms (ignoring order). We defined an 
'association measure' (A) which measured the difference 
between the observed co-occurrences of term-pairs (they 
could co-occur in any location in the definition and in 
either order), and the co-occurrences expected from 
chance alone. Tables 4 and 5 show the top and bottom of 
a list of all pairs formed from the low entropy terms in 
the previous experiment. The first 1120 terms were 
chosen, that is, those having an entropy of 2.0 napiers 
cr less. The pair list was then sorted by this associa- 
tion measure, A. 
Word pairs which are found to be highly associated, 
appear to do so for two reasons. The test, which is 
trivial, is that some word pairs are semantically one 
word despite their being lexically, two. Comon 
examples would be 'white House' and 'Hong Kong'; medical 
examples are 'vital capacity', 'axis deviation', and 
'slit lamp'. These could have been avoided algorithmic- 
ally by not taking adjacent words in forming the term- 
pairs, without any significant overall effect. The 
second reasons for high frequency word co-occurrence is 
that both words are causally related through underlying 
physiological mechanisms. It is these which had the 
greatest interest for us, and the measure A, may be 
viewed as a measure of the non-independence of the symp- 
toms or signs themselves. 
The term pairs which are negatively associated, have 
this property for the same reason. If the two terms are 
used typically in the descriptions of different diseases, 
they are less likely to co-occur than by chance. (In a 
baseball story on the sports page, we would not find 
'pass', 'punt', or 'tackle'). These negatively assoc- 
iated pairs may have value in diagnostic programs for 
the recognition of two or more diseases in a given 
patient, a problem not satisfactorily dealt with by even 
the most sophisticated of current programs. 
Finally, an extension of the entropy concept permits one 
to generate (algorithmically) the vocabularies used by 
the medical specialties (which correspond to the disease 
categories represented in CMIT. This is done by assign- 
ing terms which occur predominantly in one category to a 
single vocabulary and then sorting by entropy. Tables 6 
and 7 show the vocabularies used in dermatology and gas- 
troenterology (as derived from CMIT). These vocabular- 
ies, it will be noted, can be used as 'hit lists' for 
the purpose of recognizing the content of medical texts. 
In su~nary, we see the ability to differentiate medical 
terms from common words by context, and the ability to 
relate the medical words by meaning, as two of the first 
steps toward text processing algorithms that preserve 
and can manipulate the semantic content of words in med- 
ical texts. 
TMuNIX is a trademark of Bell Laboratories. 
149 
COLORADO TICK FEVER 00 2217 
AT FEVER, MOUNTAIN; FEVER, MOUNTAIN TICK. 
ET VIRUS TRANSMITTED BY TICK DERMACENTOR 
ANDERSONI. 
SM CHILL~; HEADACHE; PHOTOPHOB~A; BACK- 
ACHe; PAIN IN EYE; MYALGIA; ANOREXIA; 
NAUSEA; VOMITING; PROSTRATION. 
SG SEASONAL, MARCH TO JULI', IN WESTERN 
UNITED STATES; INCUBATION PERIOD 4-6 
DAYS; ONSET ABRUPT; POSSIBLY SLIGHT 
~; SUSTAINED FEVER, 102-104 F 
OR HIGHER SIGNIFICANT; PULSE RATE 
INCREASED. COURSE : IN PREVENTION, 
R~OVAL OF TICK FRDM SKIN; APPLICA- 
TIONS TO SKIN OF TURPENTINE, IODINE, 
R~'TONE; REMOVAL OF TICK BY INSERTION 
OF NEEDLE BETWEEN MOUTH PARTS; ASPIRIN 
FOR PAIN; ANTIBIOTIC TREATMENT IN- 
~-~CTIVE. 
CM ENCEPHALITIS, MENINGITIS ESPECIALLY 
IN CHILDREN. 
LB WBC DECREASED: MONOCYTOSIS; COmPLE- 
MENT-FIXATION TEST POSITIVE; INJECT- 
TZGN OF SERUM OR CSF KILLING SUCKLING 
MICE; NEUTRALIZATION OF VIRUS WITH 
II~UNE SERUM RESULTING IN SURVIVAL. 
Figure i. Typical disease 'definition' 
~aken from CMIT 
32~6 of 587 stall 364 other 
2865 in 492 possible 360 acute 
2485 p~msibly 489 se~re 368 years 
2315 wir.~ 470 ~St 358 ~silure 
2104 course 473 disease 349 ~et~m 
2010 to 457 pr~sura 349 large 
1953 or 447 a~sence 341 o~spnea 
1488 by 44~ trau~e 341 early 
1379 usually 443 chronic 34e ~akness 
1194 lain 442 edema 339 nausea 
900 as 435 ~rcent 338 ~enderness 
945 on 434 ~rea~ment 337 infl,emmm~icm 
889 from 432 vomitlr~ 337 mass 
812 infection 431 later 336 awe 
766 features 426 ah-ent 335 w~hLn 
749 unknown 422 camll~n 332 Lf 
738 at 421 asymp~oma~Lc 331 lower 
716 cells 420 durlr~j 328 ~ellinq 
699 associated 415 rarely 327 necrosL~ 
682 increased 414 hereditary 325 los "Ave 
674 onset: 401 lesLons 324 heaOaehe 
666 ~ssue 396 ~han 318 frequent 
650 bloo~ ~90 a~bominal 316 w~c 
627 nor~l 389 more 315 area 
619 sKLn 389 often 313 hemrrhac)e 
603 and \]83 into 313 infil~ra~Iom 
596 ~or \]82 ~pe 309 oh~r.cuccion 
575 rare 381 ~one 304 fom 
553 fever "5 in~l~P ~nt 301 conqenl~al 
541 IOSs 369 ~specially 3~I enlar~e~n~ 
538 after 367 areas 391 progressive 
Table i. The hiqhes~ frequency words 
used in CMIT, ~oge~er with r~e number 
of disease definitions in which ~he 
word occurs a~ leas~ once. 
u*s162 
X*OZO+ 
Z.bZ1+ 
Z .(}369 
I.(}311 
l.Oll\[ 
1.0422 
1.04~1 
IoU~I 
1.011~5 
l.ll3~ 
1.12~M 
L.|261 
l.ltl5 
1.1504 
L.15242 
L* II~2 
L*l;l+ 
1.1112 
1.21212 
1.2:162 
L*2Z92 
1*2~95 
1.2351 
1.231¢ 
I* ;P~I3 
L*lill 
L.273(} 
L.2tI3 
~ .30011 
1.\]019 
~. JlOS2 
1.3101 
1.3120 
|.31~6 
I*2115 
1.31m 
L.3211 
1.32~+ 
1.3269 
L.3327 
1o33~ 
1.338~ 
1.2315 
\[.)378 
! • ~3'J5 
L*3a39 
L.35~0 
L.33(}5 
1.3~05 
L. \]6~.1 
l.3~03 
1,3783 
.02 •~ .uZ .U2 •vt .,,Z .n; .n; •oJ .r)2 .~Z lens ~lJ 
.05 .u* .o! •o. + .u2 •U3 .r~2 •uu •OZ .Ul .77 come l,s 
.++2 •,t .t\]2 .,2 .~, .u+ .u2 .ul •7~ .u2 .~3 pbL lu 
.u~ ., u •ui .~, .o~ +oz .oz .hi .u~ •o+ •o2 .m~.mcor*tlo. s. 
• OZ .Ol •05 .ul .7~ .£1 .O2 .OI .02 .O! .02 plrmlr 86 
• 03 .OI .Ul .02 *IT .03 .02 .n; .0+ .02 .03 *crX.l, 45 
.O~ ,OI ,ul .a)2.0Z .u3 .02 .UI .05 ,(}3 .77 c1111ty 33 
• U3 .0~ .(}2.0Z .OZ .02 .01 .(}1 .u) .(}$ .76 trXl ~3 
• (}L .0| .Of .fl2 .(}) *02 .(}3 .(}I .;6 .(}Z .02 omclke Z(} 
.(}5 .(}I .01 .(}Z *(}3 *(}3 .~| .02 .76 .03 .02 lodtml 27 
• (}i .(} ...... ) .(}2 .... lJ3 ..... ) .Of .O\] hlqLo~t, $L 
.Ul .Of .(}2 .02 *;+ .~ .(}2 .(}Z .(}$ .(}2 .03 q,--,l 
• 02 .01 *O2 *(}I *H . llb .02 *02 .02 .Of .(}| l~ItOl |+ +I 
• (}$ .03 *U: .02 .02 .1|, .(}Z .0| *(}3 "toOl .1,~ +,~elllUac°m ,9 .(}; .01 .(}! +72 .+~1 .0; .(}Z .(}2.0Z . .(}! 
.0~ .00 .U2 .02 .lU .ul .Ol .(}l .12 .(}2 .(}I *©I l~O 
..... + ..... , ...... , ..... 5 ..... 3 I~rOlleh,co~? 26 
• 07 .U2 .Or .OZ .UU! *02 .l+l .01 .(}3 .0+ .7~ ¢llu~t$) 
• (}2 .03 .(}I *02 •0~ .02 *(}I .(}I .(}) .O? .72 lltlu 61 
• 01 .U3 *(}3 .05 .(}2 .(}+ .0+ .12 .02 .(}Z .0~ ,rechrl; m 
.0| .0) .0+ +o) .(}2 .U6 .02 .72 .(}5 .0+ .(}3 urllthl~l 58 
• Of *OZ *02 .(}) .(}2 .Ok .O~ .72 *(~ *02 .I~ ¢~ll¢Im¢oP7 $3 
• 03 .02 .(}~ .(}3 .02 .U* *02 .Of .(}$ .I~ .72 vtC+l~ ~I 
.02 .1(} .01 ,02 .O2 ,02 .(}2.0C ,(}3 .01.0~ leld/rmts 93 
.O3 .01 .U2 *U3 .oa .ul .(}z .12 .05 .o2 .(}) ¢llrvtz i~ 
.m, .(}~ .o) .(}2 .lo .o~ .o£ .o+, .o~ .(}\] .(}2 *tr~U 6s 
.(}S .(}I .02 .02 ,0~ ,Of .(}I .OL ,0~ .08 .39 vtl&o~ 192 
• 02 .U3 .02 .(}3 .D* .Q& .(}2 .(}l .OS .(}2 .11 lncrlocvL4r 23 
.O& .0l .(}2 .(}5 .02.0~ .02 .1(} .O5 .(}2 ,05 p~ril ~5 
.0~ .ul .0~ .Oa .(}I .05 .(}I .t9 .01 .(}~ .02 ,ter.l 
.(}| .(}~ .02 .0~ .89 .(}$ .0~ .(}1 .06 .02 .0~ llllo¢lrdtol +iphy )0 
• 01 *O! .02 .05 .k4 .02 .(}I *Of *07 .it .(}I mcrlm1+ 15 
• (}I .02 .02 .05 .0~ .0~ .05 .05 .i) .U2 .(}3 l~m 21 
.(}l .61 *(}2 .02 .Ul .02 .06.0a .(}6.0a .0+. +1~1 ~5 
.(}I .0\] .0~ .OZ .02 .05 .02 .(}1 .*9 .(}3 .02 hormoM ~ 
.0~ .Ol .ul .61 .O5 .O5 .05 .02 .(}6 *01 .02 mruq 57 
.O2 .01 .(}2.0e .~I .0; .~ .01 .(}5 .02 .(}3 ~rll 28 
• (}3 .01 .02 .O2 .0~ .08 .02 .68 .(}I .Of .(}2 u/lrlm 98 
.Ok .(}2 .+~ .i+ .02 *US ,O~ *03 .0~ ,02 *0& lLVeO~l ~& 
.(}) .~ .(}2 .(}1 .(}| .02 .OZ .(}7 .GB .05 .05 pt~ultm 52 
• U3 .(}2 .Ul .(}3 .Sl .05 .Of .01 .01 .0~ .02 lo¢~l tO 
.03 ., .03 .020~ .0~ ., .OZ .OI .05 .02 .05 t~l~l~ 22 
.02 .02 .02 • .U2 .kl .U3 .02 .07 .03 .0~ 
• 02 .03 .0~ .02 .02 .03 .0~ .69 .(}& .OL .02 ~|~1 91 
• 03 .02 *(}2 .(}3 .bM .(}3 .UZ .03 .0~ .~2 .0~ ~l~lo~ 29 
.(}3 .02 .01 .0~ oZO .u3 .03 .(}I .0~ .(}1 .66 ¢~lJ~er &l 
• (}3 .+J2 .02 .u) +o~ .o& .~2 .O2 .68 .02 .u? hymtlLyml L| 
.0, .U3 .O2 .o~ .92 .(}2 .01 .00 .05 .(}8 .67 ~* 113 
.O4.0L .(}! .0~ .65 .O5 .01 .Ul .~ *07 *(}I v~mtrlcuJ.i/ /tO 
,(}2 .05 .(}2 .U2 ,02 .(}+ .02 .(}1 .(}5 .15 .65 ~mLl 29 
.|5 .03 .(}I .01 .ut .O2 .01 .91 .07 .08 .61 cornelL 86 
• 02 .02 .02 .m .U2 .13 ......... 2 .02 -- 2, 
• (}2 .02 .02 .03 .65 .12 .0~ .Ol .~1 .02 .(}2 v41Lvl 55 
.0~ .~I .Of .02 .65 .U3 .0) .OL .(}~ .OB .05 .+vl ~? 
• Ul .05 *(}2 *03 .U2.0a .02 *Ol .il .02 .65 ileel 2~ 
• 02 .02 .02 .65 .02 .0~ .0~ .(}I .£U .OZ .03 mm*oc~oru 28 
Table 2. The lowest 'enr.ropy' words 
in CMIT, in order of increasing 'enT.ropy' 
The enl=ropy is given in P.he first column; 
,':he ent.ries in ~h~ nezt ii columns are 
~he percent of occurrences in the 11 
disease cat.Dries (body as a whole, skin, 
musculo-skeletal, respiratory, cardiovas- 
cular, heroic and lympha1:ic, GI, GU, endo- 
crine, nervous, organs of special sense)• 
150 
L, J62b . +0 .J5 .o~ ,tl .13 .U7 . ~ .10 ,ll .07 .13 de~rl! Iz~ 
z. Jh2v . :~ .06 .US .u9 .\]& .13 .C5 .u7 .41 .09 .09 absen¢ +~+ 
~.j+Jt} . + .12 .09 .07 .11 .10 .05 .\[Jb .13 .11 .ou bLo¢¢~7 z6226 
J.3635 5 .uY .LL ,o7 .LU .(J7 .10 *I0 .13 .IL .()7 common 
Z*~bJ? *L/ .OJJ .00 *(J7 .05 .IL .IU .10 .|~ .0~ .09 ihLn& 4 
2.~640 .LU .|0 .19 *(39 *LU *09 .(~9 .09 *~5 .16 .06 wtthln JJ5 
Z.Sb4Z ,03 ,\[1 ,12 .UY .08 .U9 .|3 ,LU ,06 .00 ,U5 marked 159 
2.36~7 .44 ,UG .It *(~ *07 .L3 ,U9 *Q~ .0~ .LO .IL tndl¢aclvl 20 
Z.3b6U .~9 .04 .0~ .07 .o9 .|L .0~ ,LL' .09 .13 .|2 mtlder 46 
Z.3667 .1~ .06 ,08 .13 .Lt ,~7 .09 ,10 .07 ,06 .09 ul*k 6S 
2.36b~ .O7 .09 .0~ .O6 .L3 .4O .|| .10 .|! .O9 .~6 o(tln 389 
2.36;6 *LI .05 .09 .10 .09 .07 .07 *11 .|3 .07 *LI st~L4 &6 
Z.368| ,12 .11 .06 .09 *08.0Y .O7 .07 .1~ .08 .09 2 130 
2.3687 .rJV .09 .O9 .10 .O6 .00 .09 .I~ .12 *05 .07 large 369 
2.3701 .0~ .07 *|\] .I)9 ,)3 *06 ,09 ,|0 ,07 .07 .11 causing Z56 
Z,3706 .LO .06 .10 .tO .10 ,1~ .Iu ,06 .07 .{)9 .O7 i*verl ~Bq 
Z.37|| .06 .06 .12 .12 *10 .09 .ll .U7 .|4 .09 .09 \[i¢1 425 
Z.37L6 .09 .I0 ,I| .OS .13 .09 .08 .08 .06 .10 .tt vttflou¢ Zt6 
Z.3716 .O9.0g .08 .08 .L! ,O9 .|3 .12 *09 .r)y .O5 tE 332 
~.37|8 .LO .oq .O9 .0~ .13 .IU .09 .05 .1( .07 .09 L,cce*ltn8 (2J 
2.372~ .13 .LO .09 .LO .O3 .~}9 .~7 .0~ .11 .O8 .~9 for 596 
Z.\]727 .UZ .O7 .13 .07 .ll .12 .n6 ,06 .1l .09 .o~ thaa 396 
Z.37~6 .06 .|1 .10 .O8 .0S .£o .11 .11 .~b .|1 .~)6 molt t78 
Z.37~6 .07 .12 .13 .07 .It| .10 .;u .09 .nb .08 .~6 eich 30 
Z.\]766 .09 .0~ .09 .09 .07 .10 .00 .06 .LL .13 .Oq OOli¢ 67A 
2.37~8 .IL .09 .07 .06 .07 .11 .06 .08 .07 .10 .L& accumaSat~oa 6i 
L\]782 .U9 ,10 .U7 .0~ ,11 .42 .O7 .06 .1| .1! ,n7 poor 55 
L3776 .U7 .11 .L~ .00 .u6 .09 .O6 .O6 .Lt .10 .07 more \]89 
2.3780 .09 .09 ,10 .L2 .n9 .0~ .10 .|( .LO .Oh .07 plrltltln¢ L2t 
2.3783 .10 .U9 .12 .03 .O8 .10 .O7 .O9 .lO .11 .09 mnd ~03 
2.3792 .ub .¢9 .O9 .4U .U6 .11 .L~ .O9 .\[2 .0~ .10 type 382 
2.3793 .ub .09 .o8 ,07 .IU ,10 .~g .IU .L\] .09 .09 =||ely ~|5 
Z,179~ .08 .08 .06 .ou .u~ .12 ,LZ .10 .O9 .It .O7 vlrlabll 203 
~.379~ .O9 .0~ .00 .lO .12 .¢)8 .u's .O0 .0~ .13 .O8 casll 26O 
2.JSUt .09 ,09 .10 ,07 .~19 .32 ,ZO .12 .07 *0~" .O7 frequen¢ 3|8 
Z.J~15 ,Ub .tO .08 ,14 .~}9.0S .o; .0~ .IL .11 .It facet ~31 
Z.3619 .08 .08 .tO .11 .\[2 .O9 .03 .L0 .10 .09 .0b du¢lal 620 
~.~\[ .u7 .Iu .|0 *|4 .I| .08 .4(' .06 .10 .t)9 .03 esge¢laL3y \]69 
2.3U~7 .O6 .I1 .ll .u6 .O9 .:)8 .09 .11 .o~ .LO .10 ulullly L379 
2.366/ ,IZ .10 .07 .|0 .07 ,07 ,09 .O9 .09 .11 .og $eneral 78 
2.3~5 .II .09 .U9 ,G9 .09 .09 .07 .07 .08 .09 ,12 i~ 9HO 
Z.3663 .08 *|0 ,IU .09 ,O6 .08 .|0 .|0 ,07 .09 ,33 O@ 3206 
2.3~8 .09 .O9.0B .O8 .u9 .IU .06 .11 .O7 .06 .IL fr~ 389 
2.3~92 .09 .~g .O8.0B .~7 .LU ,i! .09 .41 ,07 .10 i|ter 516 
2.J699 .O6 ,|| .10 ,O8 .09 .\[}8 .10 .10 ,0~ .08 .1\[ vlcn 3315 
2.390Z .O9 .O9.0q .O8 ,09 .LO .O8 ,10 .07 .(~q .12 eirly )61 
Z.3911 .06 ,II .10 ,06 .~)8 .0~ .I0 .\[0 .O8 .09 .IL In 2B65 
2.393~ .39 *LI .Oq .08 *08 .09 .O9 .IU .09.0S .ll by L~fl8 
Z.39(9 .~9 ,1| .O9 .0~ .0~ .10 .O8 .09 .~)6 .09 ,11 ¢o~11 2\[0~ 
L\]93~ .07 .1~ ,10 *08 .LO .O9 .1~ .10 .08.0q .10 ot 1953 
2.3950 .08 .LO .10 .09 .O8 .~ .09 .10 .+)q .09 .111 polltbly 2kOS 
~,3955.0S .LO ,16 .~)9 .09 ,o~ ,t~9 .O9 .ng ,09 .lu to ZOU. 
Table 3. The highest 'entropy' words in 
CHIT. Note that these are conu~on English 
words. 
A :e4t} Pt) uo u~ 
u.9~Z~ Z3 .9e (23 0) 
0.~500 53 .96 (53 1) 
u.9k9Z ZI .V6 (~l O) 
u.9671 2~ *gb (Z~ U) 
0.9~70 21 .96 (2X ~) 
0.~0 i9 .95 119 ~) 
0.~2Z 27 *~? (Z7 U) 
0.936~ 5Y ,91 (58 t) 
U.9380 33 .~1 (33 L) 
u.9321 27 .91 (27 ~) 
0.9305 ~l *96 (~1 L) 
u.~30l L~ .~* (l~ 0) 
U.~Z~7 12 .95 (|7 O) 
u.9279 Lb .9~ (16 O) 
0*9262 Ih .9~ (|b O) 
U*~2~7 21 .~6 (Zl o) 
U*92Ub Z7 .~3 (Zb U) 
~.~19! 11 ,~2 (11 U) 
0.912b J~ .94 (33 L) 
u*9|2b L~ .94 (lb o) 
U*906~ 19 .95 (19 O) 
0*9U~6 11 .92 (It 0) 
0.9036 ~ .9~ (Z~ u) 
0.~U33 23 .~Z (22 O) 
0.903Z ~l .~L (Zo O) 
0.6992 Z7 *~3 (26 ~) 
0.0965 21 .91 (20 O) 
0,695~ ~ .90 (8 u) 
0,6946 tz .93 (1~ O) 
U.6~12 Jo .~ (29 l) 
0.8Y06 53 .93 (50 I) 
0.890~ 9 *YL (9 u) 
U.8~t ~6 .93 (tJ 13 
~.8891 11 .92 (II u) 
0,~886 L3 .Y3 (13 U) 
o.e061 23 .~2 (Z2 u) 
0.6677 Z6 .~b (26 2) 
O.d6/b Z9 .~u 427 o) 
o.6667 Z9 .~u (27 O) 
u.~666 2Y .~7 (Z9 2) 
o.~a66 29 .97 (2~ 2) 
u.6663 L2 .~) (12 O) 
0,~861 ~1 .~1 (20 o) 
U.6S33 23 ,¥2 (22 0) 
Q.6~)3 lU .~2 (tO O) 
o.6626 55 .91 4~L I) 
u.6~oe 7 .69 (P u) 
U*6793 30 .9L (26 U) 
O.d/b8 11 .~Z (|I O) 
9,6733 ~ .90 (u o) 
0.8;23 3k .~Y (31 0) 
et ut ej u) ~t~+j 
.~1 (25) .Ul (:31 VIal-carl 
.03 (IUJ) .OZ 1531 tnhlllttOe--¢Lv 
.u1 (Z~) ,0l (Z|) Illtl--¢urct¢i 
.01 (i6) ,+l (2~) ~lr{ull~OnInOtl 
.0~ (931 °02 (~) d¢lbttll+'Slml1~LiUl 
*U3 (L08) .Ul (33) per-¢u6~c 
.03 (lOU) .UL (Z/) ~*r-~mcl¢ 
.OJ (150) ,01 (&t) l©s'qtl 
*U| (23) .UU (|k) inl\[niqte¢O¢ll 
.02 (bO) °O\[ (I)) 81~¢~4dd31~1 
*02 ($3) .UI (16) cZv-vlpo¢ 
.u2 (57) .U| (16) ocCU~l¢lOnll-Vl~r 
,U~ (|0\]) *O4 (~1) Cnhl\[l¢tOfl,',~lihl¢lll 
.o! (33) .01 (221 ¢,bt¢-=eCl¢ 
.o0 (12) .go (It) lift-tamp 
.05 41501 *gl (19) Icg--p-¢ 
.UZ (56) .UU (LJ) bLo¢k-bund|e=.b¢lnch 
.u~ (iu3) .o! (2~) Iohl~lt\[Ol~ll~+lJ{lCtMte 
,~2 (~2) ,~1 {{{) (~lEa~ltl{\[{qlpl{=~ 
.o~ (J~) ,~\[ (Z~) ¢L~iihlfl{I 
.O0 (|i) .0U (6) ~l~n\[i-b{~dlthl~{ 
.UO (l&) *UU (6) \[llUeL~t--ll{nO~OptldSll 
.0J (\[\[U) .00 (12) {~i{{~E~oi{~ld 
.0k (116) .OZ ($3) ate-flY 
*03 (69) .Oi (46) *lElctl-~llpectld 
*03 11101 .UU (11) ~*llL-rhtnol¢O~y 
.04 (148) .OU (i3| jlundtct't|p¢ 
,O} (10J) ,ul (Z3) inhllltlon--pereut4~eo~m 
.uz (50) .01 (zg) rnytn~llLop 
.02 (~J) .u! (29) civ~nutlcture 
.08 (Z6~) .01 (Z9) =oemtl-mec~m=yttc 
.OA (LJT) .uu (LZ) m=rrme-trythrotd 
*03 (1~) .01 (Z|) ~&lCt¢C--¢lChlEl~l 
*Ui (1161 .~{ (2\]) d~r-plfCU¢lnlll<~ll 
*03 (108) *OU (|U) pi¢*\[{\[l~ 
.03 {Vb) .U~ (55) ¢tlCttOnl--ldvlrll 
.ul (Zb) *UU (71 b¢onchoIcopy--6¢ottcnoK¢lphy 
.u& (1161 .UL (S&) atr-ppm 
.U3 187) *U\[ 130) glllrtC--tlVlll 
.05 (130) .OO (11) e¢8--b~ndll--O¢lO¢h 
.03 (ab) *UO (U) ~¢~¢--~0\[01y1¢031¢ 
• ^ I+iLJ ELi Uo op 
-0.166| ILU .UL (U L2) 
-.4J.lOb\] +£ .Ui (U LU) 
-U.IU39 lSU .~l (I 171 
-U.IU|9 6~ .02 (U 7) 
-0.u~95 55 .u~ (U O) 
~1.u~bY 53 .UZ (u b) 
-u.u~dZ 51 ,02 (U ~) 
~.0976 5b .UZ (U ~) 
-O,OQ/6 69 .02 (U 5) 
• "U.0968 ~1 .02 (U 5) 
','0,U¥43 Y3 *UI (U 9) 
-U.U¥38 41 .u2 (O ~) 
-U.U938 170 .U2 (J ZU) 
• .'O.UV3Z ~0 .02 (11 ~) 
-O.U~2b 80 oU| (U ~) 
-b.O~t~ ?3 .01 (U 2) 
-~.0907 36 .03 (0 ~) 
-(J.U~OU 35 .03 (0 ~) 
-O*UBgb 8~ ,OZ (U ~) 
-'~.0693 3~ .4)3 (O ~) 
-0.U887 60 .02 (0 6) 
-O.U~a5 ~3 .,~ (0 }) 
• "U*U~76 32 .03 (U 3) 
-~.U~15 56 .U2 (u ~) 
-0,O&TZ 5S .UZ (0 ~) 
• ,U.0872 b5 .U\] (I 7) 
~).UBb7 84 .U3 (L 7) 
-U.UBbI 97 .03 (2 ll) 
-0.U~67 31 .03 (0 3) 
-~.U~; 31 *UJ (U 3) 
-~J,0866 53 ,UZ (0 ~) 
-0.0886 ~ .02 (0 5) 
-0.u866 $3 .U2 (O 5) 
• "0.0~65 L29 .U3 (3 15) 
-0.0663 $2 .02 (0 5) 
-0.O~bL 95 *03 (2 Ll) 
-0.0058 30 *03 (U 3) 
• .~.uaS8 b2 .03 (L 7) 
-u.~$u 30 ,03 (O 3) 
-0.0~5~ 30 .03 (U 3) 
-U.U8§$ 50 .02 (U 5) 
-0.U~$$ SO .02 (O ~) 
-0.0~53 61 .U3 (1 7) 
-0.0~8 29 .03 (U 3) 
~J*O86~ Z9 .03 (O 3) 
-~.O~k8 bU .03 (1 7) 
-O.U6~ 29 ,03 (0 3) 
-o.oa4a Z~ .u3 (0 3) 
-o.ua~ 2~ .03 (U 3) 
-U.U837 58 *U3 (L 6) 
-0.0~37 ZU .03 (0 3) 
'-0.U~37 28 .03 (O 3\] 
-O.uu\]? Zb .U4 (U 3) 
Pi ui P( uj ~:i-uJ 
.IZ (3~1) .03 (IIU) Bo,i-vlniriculat 
.lZ (381) .UJ (9() bone-v4~inil 
.12 (SH|) .05 (15t*) bone-(c; 
.L2 (36|) .HZ (64) 6one-ceivtx 
• 12 (\]8l) .02 (55) 6Onm--$¢¢L¢¢u¢1 
• 12 (3~L) .UZ (5\]) oonl*¢ELI 
• IZ (\]6\[) .UZ (54} bone--pJ¢oxylaUb£ 
.|2 (36|) .UZ 450) ~ona-¢agt,e¢erzza¢lon 
• |~ (36L) .U2 (50) 0Qnl--¢hy¢6m 
• 12 13811 .U~ (kY) ~onm-Rilucoma 
• IZ (361) .O2 (~v) bonI-p 
• IZ (SUI) .U~ (&7) bone~dve 
.LU (~&l) .OJ (~3) uylpnll"epLaa¢=ll 
• |2 (~+\[) .U1 (4|) 6one-<Ill 
• lZ (161) .0~ (LTU) 6one-clsh¢ 
• 12 (~6l) *U| ({U) OONV--I¢B¢L/¢¢y 
.|0 (34|) .U2 (6U) oylpflel--nltvII 
• lO (3~\[) ,oz (73) ~ylpnU--lclJp 
• 12 (3~\[) .+I (36) hoel-placenca 
• 12 (3011 .0L 135} 0oma-~t+lug 
.Iu (341) .U2 (b~) 0ylpnel-.ucethral 
.|2 (sKi) .UI (34) bone-eo¢£um 
• IU (34|) .U2 (80) dys~ne4-~L¢ 
• 12 (38L) .Ui (33) boue-c~11~ry 
.A2 ()d|) .01 (J2) oone-pulmo, x¢ 
• 1(I (S&|) .U2 (~o) dylpnel-~ypet~eta¢os~l 
.IU (361) .02 (5~) dylpneJ-knte 
• 12 (38|) .03 (05) botm-a¢¢Xal 
• 12 (~8|) .0& (64) Dune"~rech¢ll 
°|2 (38l) *03 (97) bonm-loundM 
• ~2 (~l) ,()l (~A) 6Ont--p@E¢ntuB 
• 12 (3&;) .Ol (3l) bo,e-ova~y 
• |U (S&l) .OZ (53) uyspnel-cylmolcopy 
• |U ()&l) *02 (~) dylpnll--d~Mk 
• 12 (~83) .04 (1~) 6oal-lc~eiy 
• |2 (3~1) .03 (~) bO~l-vin{E\[{\[I 
• 12 (J~l) .Ol (30) 6onl-lnstocl~dXo\[rlpnY 
• LZ (~81) .UZ (62) oone-conJunccivi 
.lZ (J~l) .UI (30) 6onl-lelUl 
.LZ (JbA) .uL (~U) ~one-lxe¢¢~ol~l 
.IU (3&l) ,IJZ (50) dylpn~&-~en~l 
.lu (Jil) .o2 (SU} dyl~ll--6lnlv~Or 
• |Z ()UI) .UZ (hi) bone-.d~apncai~ 
.12 (381) .01 (29) bo~l--pupEl 
• |2 (~8|) .UZ (6U) boal~ave$ 
.LZ (\]~l) .UI (29) 8oM-glLlbLadalr 
• 12 ()8|) .0| (29) bonl-sborclon 
.12 (38|) *UZ (56) bone-ure¢,rl 
• lZ (3l+L) .UL (28) bone-¢ou~uactivlL 
• ~2 (3811 *U| 423) bone-~llld 
• 12 ()61) *OL (2~) oonl--envLron~nt 
Table 5. The bottom of the word-pair 
list, showing the negativaly correlating 
words. 
Table 4. The top of the word-pair list in 
decreasing order of associaUion value (A). 
151 
1.2252 ~ 76 
1. 3089 71 
i..19u2 9 59 
..467: ~ 17 
1.4E05 1 4(, 
1.6940 2 33 . 
;.6259 6 25 
1. 6267 4 ~2 
1.6619 6 58 
i. 704.7 1 28 
1.71.7.7 5 24 
1.7209 0 2 < ) 
1.'7246 2 19 
1.730'7 ° 18 
1.7441 ° 19 
1..7511 10 39 
1.'7590 2 25 
I.'7619 O 1.7 
1.7712 ;) 21 
1.7619 1'7 98 
1.7821 3 22 
1.8192 2 26 
1.83(18 10 24 
1.8391 0 IG 
1.8395 a 16 
1.8420 4 4.7 
1.8436 ° 1.7 
1.0905 19 24 
1.8521 1 19 
1.0580 0 16 
1.0~q.7 g 12 
1.9U22 17 31 
1.9209 82 29 
1,9242 29 
1.9251 0 13 
1.9283 ° 14 
1.9337 ~ 20 
1.9339 15 
1.9347 0 17 
1.9407 I 3.7 
1.9489 9 1.7 
1.9~.24 4 23 
1.9'701 J 21 
1.9.795 U I0 
1.9.775 5 19 
1.9.78.7 4 3"7 
I. 9796 2 16 
1.~84\] U 11 
1.~?e ~ 17 
1.~8.76 15 
1.9S83 l 1~ 
1.9926 I 13 
1.9994 0 Ii 
2.0008 ~ 13 
2.0026 ~4 
2.0032 6 29 
u 
0 
~ S 
0 
O 0 
3 0 
9 0 
0 0 
1 
0 0 
S 2 
0 0 
0 9 
o ~ 
1 0 
0 0 
2 0 
1 U 
l 0 
0 0 
3 2 
4 C 
0 
0 0 0 
2 = 
4 5 
0 0 
0 2 
~ 0 
2 ~ 0 
0 0 
° : 
0 
2 04°0 
e 3400 
1"511 
e 2 A L 
an 0 0 
1 ) 1 g 
e91 
~O 0 
109 
LSO~ 
2 10 
18 ~ 2 ~ 
0 1 0 o 
l 0 0 
09~0 
OlO0 
3go 
1091 
00 0 
3902 
4 5O 
¢ 
9 11 I °~ 
0 ~ 
C 1 O 
OlnU 
10o0 
1 5 
epAdermis 93 
~erm~s 95 
~mules ~G 
a~m~r~zs 44 
~ypor~erar.c.lsiJ~ 56 
epAder=ai 49 
~uiel 31 
seal Anq 41 
~lp "73 
inwlutic~ 22 
Fapuie 32 
~orny 21 
• era¢in 19 
s¢¢l¢u~ 21 
eruPtian 54 
corAu~ 34 
cornm~m 16 
melanin 26 
;~ruri~ 185 
pustules 28 
~uil~e 39 
solel 40 
s~ales 19 
nilR~le 19 
~nt lltra¢l .74 
;~r~te¢ltO~tl 21 
palm 40 
hypa rpiqlm~'.st ion 22 
~cis 19 
ictlthy~s i s 12 
S I~cche~ 54 
c~c 16 
xe¢lcosiJ 17 
3 {o111c~lar 32 
cneeu 21 
reel 29 
ci rc~mic~ipad 65 
ct~ust~r~ 27 
~rm~ 44 
s~eac )S 
m.dDepiderm~l 10 
4 leant rq 34 
pl~q,,.m~ 3? 
3 sunl i~hc 2~ 
vet ruc~,s 4 
SCaly 22 
r idRel 25 
hyperKera¢o¢~¢ 1.7 
MAts 13 
ec~m~ 21 
nevus 28 
~JC~CM 38 
1.5869 
1.5848 2 1 4 0 g 
1.~182 0 0 1 0 1 
1.6338 2 0 1 0 1 
1.6441 1 2 g 0 1 
1.662'7 3 9 1 1 3 
1.6686 I 3 I 0 3 
1.6836 4 ° 0 1 
1.6967 9 1,..7 : 9 I 2 ~ 
..7~1 \] : : : ,o 
1.'7445 
1.7659 IS 2 
1.'7051 I O 3 I g 1.=2~ 9 , : : : 
1.|077 O 3 
1.8149 1 0 1 9 0 
1.8188 
2.8410 2 9 l ° P 
2.8424 3 1 ° 2 9 
1.8M7 '7 O O 1 1 1.,2 1, ~ : o2 oo 
2.OTiS O 
1.8741 2 1 $ 1 9 
1.875.7 ° 0 ° 1 e 
° 1 1 
2.899.7 
1.8975 4 0 
1.8991 1 O 1 
l.I I 9 I 9 0 
l. 5~9,e $ 1 5 0 S 
1.91"22 9 O 9 O 9 
1.91'72 0 0 9 g B 
1.9172 O 9 ° ° 9 
1.9224 4 S I ° 9 
1.9238 ~ O 0 : 3 
1.9634 9 0 1 
1.9728 I ° 9 g 9 
1.9'736 i 0 ° 1 9 
1.9775 
1.~812 ° 1 ° 0 e 
1.9815 O 9 9 0 0 
9 0 .9.2::, 
1.9888 2 2 0 
1.9990 1 1 ° 0 
1.9933 3 9 I O 0 
2.~g~3 1 9 gg ~ O 
2.9093 I ° 0 
2.8099 O ° 0 ° 9 
2.0119 2 2 9 1 O 
44 0 1~9rlun 
34 4 5 0 0 oolon 
2.7 6 0 du=dsmal 40 
O 9 9 0 du~m~m 34 
21 0 0 0 par A ¢onLt LI .72 
33 9 1 0 1 UNdld¢41flt 47 
26 3 4 0 a bile 39 
26 | 2 0 9 bi 1181'y 49 
33 4 9 1 0 ml) A q4141¢ r tc 
O 14 0 0 9 0 94stroloo~y 15 
11 9 1 0 0 ur¢~l Ill.pan 21 
39 6 4 G 0 conic i peUon 78 
22 
9 ~1 9 0 . 0 : 0 :~:,.~'21" 
14 1 0 0 0 ¢o1¢~11 ¢ 17 
e 13 1 O 9 ° m 15 
12 O 0 0 ° bep 16 
13 O 0 9 g p/lot ic 16 
° 15 0 0 ° ° ~1110¢y 21 
21 2 3 1 ° blllrul~n 42 , ~ , ~ 1 9,-. 58 
9 0 9 pe¢i~rtall 10 
,, I I ' :..1 21 g 21 1 0 ~ S 
3 0 4 9 SCOOP 35 .7 ~ 0 ~ , 9oA.--. N 
16 0 @ 0 0 mnee.cet ic 
1 15 1 1 0 9 pie i sr.allts 22 
11 J g O 0 IIRpt 13 
9 11 A O 9 8 pcoctmnc~q.v 12 
33 4 2 ) 0 ir.=ol¢lnl 64 
9 ° 0 O ° ct~oLlrl A ¢1s 9 
° 9 8 0 0 0 ¢hO I ecys r.Oc) tamy 9 
9 0 O O U uOpMqOICOW 9 
I 15 O O 2 9 an~Z 26 
10 O 1 0 0 veL"| ¢tm 19 
69 O : : 0 Ancrm¢lc 11 
2 0 0 °°st rllC~Xmy 11 
9 I I 0 I i hCulma~'~ 1 on 12 
9 29 G 2 ° 0 loOpS 20 
5 0 2 ° ° ~nopeq~ I dale 8 
1 1 0 9 lt~a~|d 6 
O 13 4 0 0 e aues~,cosa 24 
11 I 1 e o i IrJr~ 17 
8 0 0 I O ~lorr~tdr ta 15 
, ,, 0 i ~ I .... ,~ 19 
Ii 3 9 1 2 ~IYP~ 19 
O 3 0 0 lun~¢al 19 ° g . ' iJ 
co11¢II 
Table 6. A word list generated algorinh- 
mically which constitutes a dermatological 
vocabulary. The disease category 'skin' 
is represented by the third colu~nn. 
Table 7. A word list qenerated algorith- 
mically which constitutes a vocabulary of 
gastroenterology. The eighth column 
represents ~he disease category 'digestive 
SySte~ t . 
152 
