CC~fl~TER-PRODUCED REPRES~EATION OF DIALECTAL VARIATION: 
INITIAL FRICATIVES IN SOUTHERN BRITISH ~GLISH 
W. Nelson Francis, Jan Svartvik, and Gerald M. Rubln 
Department of Linguistics, Brown University 
It has become apparent to some dialectolo6ists that di~lectol- 
ogy, particularly in its interpretive phase, is a branch of linguis- 
tics particularly adapted to the use of computers. The dialectol0- 
gist typically deals with large bodies of data, usually in the form 
of single words and short phrases, and he is interested in sorting 
and comparing individual items on many bases: phonological, morpho- 
logical, lexlcal, and geographical. The major obstacle that has 
prevented widespread use of computers in dialect study is the fact 
that the data for most of the great dialect surveys have been collec- 
I ted, recorded, and in most cases edited prior to the computer age. 
Thus the problem of preparing large bodies of data, much of it in 
narrow phonetic transcription, for computer use has been formidable. 
One of the aims of the present paper is to show that results can be 
obtained relatively easily by computerized sorting and mapping that 
would take endless hours by traditional methods, and hopefully to 
encourage others to invest time and money in preparing data for the 
computer rather than in time-cc~suming hand sorting and map-making. 
Accordingly we sought a proble~ I that would be canplex enough to 
reveal the advantages of computerized dialectolo~y while at the same 
2 
time involving a body of data small enough %0 be quickly prepared. 
Since two of the three,authors are specialists in English (the third 
is a computer specialist), we ~tural/y ~urned to the published 
volumes of the 5'u~e~ d of ~dZ'~h D~a/~c~, 2 which embody carefully 
controlled data, collected with professional skill, and presented in 
convenient tabular form in meticulously edited and printed volumes. 
And since une of the two areas coveredby the volumes in print at the 
time the study was ~adertaken (May 1969) was the south of England, 
the problem of the voicing of initial fricatives in the southwest 
naturally suggested itself. This lyrohlem had the further advs~tage, 
for our purposes, of dea//ng with cm~sonants (s inkier than vowels in 
most varieties of English) in /D/tial position, hence easily sorted 
and exsmined. The selection of this proble~ has proved to be a 
happy one. 
The area covered by Volume ~ of SED comprises the ten southern- 
most counties of England, which, with their key numbers in the S~w;ey, 
are 31 Somersetshire, 32 Wiltshire, 33 Berkshire, 3~ Surrey, 35 Kent, 
36 Cornwall, 37 Devonshire, 38 Dorsetshire, 39 Hampshire, and h0 
Sussex. A :~ "~i of 75 localities in this area were reported os by 
the Su~ey; in what follows these will be identified by a four-digit 
r 
number, the first two digits indicating the county and the remaining 
two the locality. Thus 3906 stands for Burley, the sixth locality 
listed in hampshire, accordi~to the numbered list on p. 31 of the 
Introduction to the 8ul-oey. The data selected for examination 
3 
included all those words beginning with graphic f-, s-, or th- 
followed by a vowel or voiced consonant which were starred in the 
~ED questionnaire. 3 To this list we later added a few non-starred 
words which showed universal distribution and were otherwise of 
interest. The final list contained 68 words, of which 27 are f-words, 
22 s-words, 16 Sh-words and 3 ah-words (i.e. words beginning with /~/ 
in standard English). We took only the first recorded form from 
each locality; this is presumably a citation form, produced by 8n 
informant in response to a question, and recorded in narrow IPA trans- 
cription. The 59 cases where no response was given were coded XXX 
in our computer code. 
The corpus thus comprised 68 x 75 or 5100 items, including the 
59 blanks. Our c~,puter expert then produced 68 decks of punch-cards, 
one for each word~ each deck containing 75 cards, one for each local- 
ity, These were numbered at the left for locality and on the right 
for the reference number of the item in the SED questionnaire. ~ A 
coding system was devised which preserved all significant features of 
the phonetic transcription while passing over apparently irrelevant 
fine points (see Appendix A), and the words were transcribed in this 
code directly onto the cards for the guidance of the key-puncher, 
who then punched the coded words in a fixed place on the cards. 
Subsequently the standard spelling was inserted by the computer to 
the left of the coded phonetic spelling. This whole process took 
about a dozen hours of the investigators' time (not counting the 
relatively simple progras~ing involved) and about the same amount of 
the key-puncher's time. The result was a body of data consisting of 
5100 entries of the following sort: 
3101 FINGER FIgG)R. 6 7 7 
This is to be interpreted as indicating that at locality 3101 (Weston 
in Somerset) the word f~nger, which appears as item VI.7.7 of the 
SED questionnaire, is pronounced \[f,ngar-\] (or perhaps more accu- 
rately /ftngar-/ in the quasi-phonemic transcription used). 
The nature of the problem with which we are dealing ms~ be most 
simply introduced by an excerpt frc~ the full treatment given to the 
voicing of initial fricatives in Middle English by Horn and Lehnert 
(195~, Vol. If, ~37): 
In gewissen Mnndarten sind in alter Zeit im Wortanlaut die 
starken und stimmlosen Reibelaute f, s und ~ vor Vokalen und 
schwachen, stimmhaften Konsonanten schwach und stimmhaft 
gew0rden: f- • v-, s- > z-, ~- • 6- . . . Die neuenglische Hoch- 
sprache hat einige ~6rter aus den Mundarten aufgen~n... 
Die starken und stimmlosen Reibelaute im Wortanlaut sind 
in Kent und im sUdSstlichen Mittelland schwach und stimmhaft 
geworden. In me. Handsehriften wird v f'dr anlautendes ~ vor 
vor Vok~len geschrieben, z f~r enlautendes e vor Vokalen, . . . 
Wit d~rfen annehmen, da~ aueh stimmloses th- stimmhaft geworden 
ist, da heutige Mundarten f'dr ~- ebenso wie f~r f- ,rod s- 
I 
stimmhafte Laute haben... 
Aus der Tatsache, da~ franzSsische Lehnw~rter im Me. ~- mud 
s- beibehalten . . . geht hervor, da~ die stimmlosen Reibalaute in 
den englischen WSrtern schon vor der Aufnahme der franzSsischen 
LehnwSrter stimmhaft geworden waren... 
5 
Der Lautwandel hat sich im Laufe der Zeit von Kent aus h'oer 
die s~dliehen und angrenzenden ~stlichen Grafschaften ausge- 
dehnt, ~uld zwar hat er heute in dlesen Gebieten anlautendes 
ebenso ergriffen wie fund s. Vereinzelt ist auch anlautendes 
~, zu ~ geworden... In den heutigen Mundarten yon Kent, Surrey 
und Sussex sind die anlautenden stinunhaften Laute unter dem 
Einfluss des Hochenglischen durch stimmlose ersetzt worden, 
w~d das ~stliche Herefordshire, Teile von Gloucestershire, 
das westliche Berkshire, und besonders Wiltshire, Dorsetshire, 
Scaersetshire told Devonshire stinunhafte Reibelaute im Anlaut 
aufweisen. 
Horn-Lehnert go on to point out that by analogy and under the 
influence of Standard English, initial f- and s- in French loanwords 
have become voiced. This is borne out in our corpus by the words 
~, ~Grmer, and ~ers, which have initial v- in more than half 
the localities included. As our subsequent discussion and maps will 
show, Horn-Lehnert should have included in the initial voiced frica- 
tive area the western half of Hampshire and all but the southwest tip 
of Cornwall. Since the SED records for Herefordshire and Gloucester- 
shire have not yet been published, we have not been able to include 
these counties in our survey. 
The traditional assumptions, then, are that ~-, s-jand ~-. 
bee~ne voiced (except before voiceless consonants) in initial posi- 
tion at an early date--certainly before the 13th century--in the 
southeast; that this affected all native words ; that this change sub- 
sequently spread into the old West Saxon area of the southwest ; that 
6 
after this spread voiceless initial fricatives were restored in the 
southeast; and that analogy, dialect borrowing, and the influence of 
standard English worked variously to blur the excepticmless charac- 
ter of the sound change, to produce voicing of initial 2- and a- 
in French loanwords and of initial ~- in native words like 8hiZlir~ 
as well as French words like sure, and othe~se to create a mixed 
situation in the whole southern area. 5 Our project was to see what 
light the records of the SED can throw on this situation by exploit- 
ing them in some of the many ways made possible by computer techno- 
logy. 
The first step was to sort the data in as many ways as we felt 
would be productive. Accordingly our computer expert produced four 
lists of the 5100 items, sorted as follows : 
List I: sorted first by keyword (the standard English graphic 
word identifying the item) alphabetically; then by locality. 
This list presents the data in the same kind of order in which 
it appears in the SED records and allows easy inspection of all 
versions of each word in one list. 
List II: sorted first by locality and than by keyword alphabet- 
ically. 'Fais list brings together in one place all the records 
from each locality and thus permits comparison of the 8mount of 
voicing recorded fr~n various localities. 
List Ill: sorted first by locality and then by citation (the 
recorded local form) alphanumerically. This list even more 
6 
7 
graphically reveals the -mount of initial voicing; it also makes 
it easy to surmise from inspection whether or not incidence of 
initial voicing might be influenced by the following vowel or 
cG~Is onsIlt. 
List IV: sorted first on the second and following characters of 
the citation form, in linguistically significant order (i.e. by 
vowels and consonants in articulatory order), then by the first 
character. This list greatly facilitates looking into the 
question of the possible influence of following sounds on initial 
voicing. 
These lists, though interesting in themselves, were cmm~idered 
primarily as intermediate diagnosti~ procedures, to be used to guide 
us in future ~ortlng, counting, and eventually mapping. Even the most 
cursory inspeetiou of them revealed what we had suspected frc~ our 
first look at the data: that there is tremendous variatio~ within 
the relevant area both from word to word and from locality to local- 
i ty. It ~ertainly seemed, at least so far as this feature in this 
region is concerned, that the maxim attributed to Gilli~ron, "Every 
word has its own history," is true. 
Accordingly we asked our c~nputer expert for various more soph- 
isticated sortings and counts, and for two kinds of maps, to be pro- 
duced by the CalCump plotter (see Appendix B). These were the 
following: 
i. A table for each of the four sets of words, listing all words 
7 
8 
in the set and counting the number of each occnrrf~ initial conson- 
ant, the words to be ranked in descending order of the n~ber of 
initial voiced consonants recorded. ~ese are reproduced here as 
Tables i - 4. 
2. A list of all possible ~vowels, dipht~s, and second conson- 
ants in each set of words, counted and tabulated in terms of each 
possible initial consonant. A portion of this list is reproduced 
herewith as Table 5. 
3. An individual map for each of the 68 words, showing the 
initial consonant recorded for each locality. Eight of these are 
reproduced here as Maps 6-13. 
4. Four proportional maps, one for each set of words, indica- 
ting at each locality the voiceless:voiced ratios; thus for the 
~h-words the legend 3/13 occurring at B905 indicates that at that 
locality only three of the 16 d-words begin with voiceless consau- 
ants. Map I shows the proportianal map for the e-words as it came 
from the plotter, while maps derived frc~ these proportional maps to 
reveal the varyiug distributions more clearly are included as Maps 
2-5. 
Tables i - ~ s~pport our suspicion that each word has its own 
unique distribution with regard to the initial c~nsonant. Thus Table 
1 uhows that ~ the f-words the proportion of voiceless to voiced 
ranges fr~ 20:5~ in F~.LT~ to 52:22 in FOAL. Even in those cases 
vhere the proportions are the same, recourse to List I ~eveals ~hat 
8 
Word 
TABLE i, F-Words 
Ref. \[V\] \[B\] \[~\] It\] \[el 
8A 
Vcls. Vcd. Total Total Missing 
1, FELLIES 1.9.9 1'9 5 ¢ 
2. FURROW 2.3.1 53 ¢ ¢ 
3. FOOT 6.10.1 h6 ¢ 1 
h. FRIDAY-1 7.4.h h6 ¢ ¢ 
5. FERN h.10.13 h5 ¢ 
6. FINGER 6.7.7 h5 ¢ 
7. FIRE 5.3.1 h5 ~ " 
8. FRIDAY-2 7.4.7 4h I~ (~ 
9. FIVE 7.5.6 h2 ~ 
i0. FLEAS h.8.h b2 @ 
ii. FROGS 4.9.6 h2 ~ @ 
12. FIND 9.3.2 hl ~ 
13. FOX h.5.11 )41 ¢ ¢ 
114. FURTHER 9.2.1 b,l ¢ ¢ 
15. FLIES h.8.5 .hO ~6 ¢ 
16. FLOUR 5.6.1 )40 ¢ @ 
17. FIRST 7.2.1 39 ¢ 
18. FLOOR 5.2.7 38 ~ i 
19. FARMER 8.h.7 38 ¢ ¢ 
20. FLITCH 3.12.3 38 ¢ ¢ 
21. FLOWERS 8.5.13 36 ¢ ¢ 
22. FORTy 7.1. lh 35 @ ¢ 
23. FATHER 8.1.i. 31 ~ 
24. FIGHT 3.13.6 31 ¢ 
25. FORD h.i. 3 22 ~ i 
26. FOAL 3. h. i 22 ,16 9 I 
27. FORKS 1.7.9 22 ¢ ¢ 
19 1 20 
21 ¢ 21 
28 ¢ 28 
26 3 29 
30 ¢ 30 
30 ¢ 30 
30 ¢ 30 . 
28 ¢ 28 
33 ~ 33 
33 ~ 33 
33 ¢ 33 
314 ¢ 3~ 
31t ¢ 314 
3h ¢ 3h 
3 b, ¢ 3)4 
35 ¢ 35 
36 ¢ 36 
36 ¢ 36 
37 ¢ 37 
26 ¢ 26 
39 ¢ 39 
~,o ¢ ~o 
b3 ¢ h3 
hi ¢ bz 
52 ¢ 52 
ho ¢ 40 
5h 1 
53 1 
1,7 ¢ 
1~6 @ 
1,5 ¢ 
1,5 ¢ 
h5 ¢ 
hi, 3 
h2 ¢ 
1,1 ¢ 
,4o ¢ 
39 ¢ 
39 ¢ 
38 ¢ 
38 11 
36 ¢ 
35 ¢ 
31 
31 1 
23 ii 
22 1 
22 13 
9 
Word Ref. 
TABLE 2, S-Words 
\[z\] \[z\] ~ Is\] \[~ 
8B 
TotaV i cls" TotV al cd" Missing 
i. SATUROAX 7.4.5 46 ¢ 29 ~ 29 46 ¢ 
2. S~E 6.3.2 46 ~ 29 ¢ 29 46 ¢ 
3. six 7.1.5 42 l 32 ~ 32 43 ¢ 
4. SOW-N 3.8.6 43 @ 31 @ 31 43 1 
5. SUCK 3.7.1 42 ~ 32 ~ 32 42 1 
6. SEV~ 7.1.6 41 ~ 34 @ 34 41 @ 
7. SADDLE 1.5.6 40 ~ 33 ~ 33 40 2 
8. SOUTH 7.6.25 40 ~ 35 @ 35 40 
9. SIGHT 8.2.9 39 ~ 36 ~ 36 39 
i0~ SILVER 7.7.7 39 @ 35 ~ 35 39 1 
ii. SEW 5.10.3 37 ~ 38 ~ 38 37 
12. SOOT 5.4.6 37 ~ 38 ~ 38 37 
13- SWEAT 6.13.5 33 1 41 ~ 41 34 
14. SECOND 7.2.3 31 ~ 44 @ 44 31 @ 
15. SUET 5.7.6 30 ~ 44 1 45 30 
16. SWEAR 8.8.9 23 @ 52 ~ 52 23 g 
17. SEXTON 8.5.h 22 @ 53 @ 53 22 @ 
18. SNOW 7.6.13 15 1 59 @ 59 16 
19. SMOKVS 5. i. 4 14 1 60 @ 60 15 @ 
20. SNOUT 3.9.1 4 ~ 70 ~ 70 4 i 
21. SLEDGE 1.9.1 3 @ 68 ~ 68 3 4 
22. STITCH 5.10.4 i @ 74 @ 74 i @ 
k 
I0 
Word Ref. 
TABLE 3, TH-Words 
\[B\] \[d\] Iv\] \[e\] Is\] VcIs. Total 
8C 
Vcd. Missing 
Total 
i. THATCH-N 2.7.6 31 8 6 29 ¢ 29 h5 1 
2. THATCH-V 2.7.5 32 7 6 29 ¢ 29 h5 i 
3. THUMB 6.7.6 45 ¢ ¢ 30 ¢ 30 h5 ¢ 
h. THISTLE • 2.2.2 19 22 3 28 2 30 4h ¢ 
5. THIRSTY 6.13.10 h3 @ ¢ 31 ¢ 31 43 1 
6. THIRTEEN 7.i.i ! 43 ¢ ¢ 32 ¢ 32 43 ¢ 
7. THRESH 2.8.1 i 42 ¢ 31 ¢ 31 43 i 
8. THURSDAY 7.4.3 43 ~ ~ 32 ~ 32 43 
9. THREE 7.1.3 ~ 42 ~ 33 ~ 33 h2 
i0. THIMBLE 5.10.9 4i ~ ~ 34 ~ 34 4i 
Ii. THIRD 7.2.h 4i ¢ @ 3h ~ 3h hi 
12. THIRTY 7.1.13 4I ~ ~ 31, @ 34 4i 
13. THOUSAND 7.1.16 40 ¢ ¢ 35 0 35 40 
lb. THREAD 7.10.2 ~ 39 ~ 36 ~ 36 39 
15. THUNDER 7.6.21 39 ~ ¢ 36 ¢ 36 39 
16. THIGH 6.9.3 33 ~ ~ ~ ~ Ill 33 i 
Word 
i. SUGAR 
2. SHILLING 
3. SURE 
Ref. 
5.8.10 
7.7.5 
9.7.12 
TABLE h, SH-Words 
\[~\] \[~\] \[z\] 
20 i i 
z8 ¢ ¢ 
z3 ¢ ¢ 
\[~\] Vcls. Vcd. Missing 
; Total Total 
52 52 22 1 
57 57 18 ¢ 
62 62 13 ¢ 
qq 
TABLE 5 
"~FOR A WE H~VE: 
47 > FROM T WORDS 
8 D FROM T WORDS 
62 F FROM F WORDS 
15 S FROM S WORDS 
82 V FROM F WORDS 
6 V FROM T WORDS 
80 Z FROM S WORDS 
? 8 FROM T WORDS 
"~/FOR A. WE HAVE: 
2 S FROM S WORDS 
V/FOR A~ WE HAVE: 
2 $ FROM S WORDS 
6 > FROM T WORDS 
4 S FROM S WORDS 
I V FROM F WORDS 
13 Z FROM S WORDS 
2 8 FROM T WORDS ~ " 
FOR A: WE HAVE: 
2 > FROM T WORDS 
49 F FROM F WORDS 
31 V FROM F WORDS 
,4 Z FROM S WORDS 
FOR A! WE HAVE: 
7 > FROM T WORDS 
17 D FROM T WORDS 
36 F FROM F WORDS 
10 S FROM S WOR~S 
19 V FROM F WORDS 
6 Z FROM S WORDS 
10 8 FROM T WORDS 
8D 
12 
9 
the geographical distribution is not identical. Thus items 5 - 7 on 
the table all have a voiceless-voiced ratio of 30:45, but the voice- 
less localities, at least in the area to the west of the major iso- 
gloss to be described below, are not the same. FERN (see Map 13) 
is voiceless at 3102, 3604, 3902, and 3907 and voiced at 3405 and 
h003~ FINGER is voiceless at 3405, 3902~ 3907, and 4003 and voiced 
at 3102 and 3604; FIRE is voiceless at 3102, 3405, 3604, and 4003, 
and voiced at 3902 and 3907. In tabular form: 
FERN F V F F F V . 
FINGER V F ~ V F F F 
FIRE F F F V V F 
The same kind of discrepancy is shown by items 9- ii, with 33:42 
ratio, and items 12 - 14, with 34:41. 
Table i also shows that neither etymology nor following conson- 
ant seems to affect the distribution markedly. The three French 
words, FLOUR, FARMER, and FLOWERS, appear in the middle of the list, 
with ratios of 35:40, 37:38, and 39:36 respectively. The words with 
initial ~- are also in the middle, ranging from FLEAS 33:42 to 
FLOWERS 39: 36. 
The three last words on the list, with ratios 41:23, 52:22, and 
40:22, all show some peculiarity in the recordings indicating that 
for many of the informants they were unfamiliar, learned, or bookish 
words, hence more likely to have standard English pronunciation. Thus 
13 
i0 
FORD, the expected response to the question "Sometimes there is no 
bridge (over a rivulet). What do you call that shallow place where 
you can walk across?" (SED IV.1.3), was not known, not found, or 
not recorded in seven localities, replaced by a local term (splash, 
sluice) in four others, and given as a "suggested word" (i.e. one 
pronounced first by the field worker) in nine others. Though FOAL 
is recorded from all but one locality in answer to III.~.l, it is 
revealed by the answers to another question (III.4.6) that in many 
localities the preferred word is colt. The records also show that 
FORKS (the agricultural kind), not recorded from 13 localities, 
yields to ~ other preferred local words (picks, prongs, spuds) in many 
others. Likewise it has a mixed etymology, being derivable from 
Latin furca either through OE forca or Anglo-Norman fourque. 
At the top of the list two words, FELLIES and FURROW, show 
unusual distribution in that voiced forms extend well into the usual- 
ly voiceless areas of Berkshire, Sussex, and the tip of Cornwall (see 
Maps 6 and 7). FELLIES also shows six instances of the substitution 
of the dental fricative for the labial, which otherwise occurs only 
in FRIDAY, and which is reversed in six occurrences of /v/ in THATCH. 
If these five words are set aside, the range of voiceless-voiced 
i 
ratios, from 28:~7 to 43:3~ much more closely resembles that of the 
th-words (Table 3), which range from 29:45 to 41:33. 
Observations in similar detail can also be made on the basis of 
Tables 2 - 4, but exigencies of space suggest abridgement. 6 The 
IOA 
MAP 6 
Fm 
Z 
If II If 
__1 
z 
CE 
Z 
15 
10B 
i 
l 
J 
t 
? 
/ 
MAP 7 
II II 
C~ 
II 
X 
II n 
H 
J 
I 
Zb_ 
16 
ii 
marked drop-off in voiced forms in the last five words of Table 2 
implies that, in contrast to initial ~-, the voicing of initial 8- 
is much reduced by a following nasal or /i/ (this will be discussed 
further below). Voicing is not to be expected before a voiceless 
consonant; we included STITCH because it shows one freakish occur- 
rence of initial \[z\] at 3901. In this set the French words~ SECOND, 
SUE~f, and SEXTON, do appear well down the list, with ratios of ~4:31, 
~5:30, and 53:22 respectively. If the words of French origin and 
those with a consonant after the initial 8- are set aside, the 
remaining range of ratios, from 29:46 to 38:37, comes much closer to 
that of the other sets. 
The th-words in Table 3 are, of course, all native words, since 
neither Norman nor Central French has had initial dental fricatives 
at any time when borrowing into English could occur. Likewise the 
only consonant that occurs at all frequently after initial /8/ or /~/ 
is /r/. Hence the range of ratios of the whole set is narrower than 
those of the f- and 8-words. Noteworthy is the fact that throughout 
the voiced area, the initial cluster /@r/ appears as /dr/, presumably 
as a result of a later change of /~/ to /d/ in this environment.7 
(See Map 8 for THRESH). This same change has occurred in THISTLE in 
western Somerset, Devon, and Cornwall (see Map 9), and in THATCH (Map 
i0) in south Devon and eastern Cornwall. These two words also show 
initial /v/ sporadically in Somerset and Dorset, presumably a substi- 
tution the opposite of that occurring in FELLIES in five localities. 
17 
1.1.A 
a~ 
co b 
ao X ¢3 
2 
MAP 8 
Z ~ 
C~ ~ 
n I! 
Z ~ x 
ffD LD kk 
U II 
__1 
CE -1- 
0"3 
CE\] H Z\] 
Zk--- 
I-----t 
18 
> 
^ ^ ^ 
^ 
^ 
^ ^ ^ 
^ 
^ ^ 
13..B 
MAP 9 
-0 ~) ~ 
II II It 
Z r../2 (~ X 
II II II 
^ ~ 
--J L~ 
CZ _~ 
Z~- 
0 
19 
°m' cD ¢x~ 
~ . ~ A 
^ ^ 
^ A ^ ^ 
,~ i1~ .~ ~ ^^ : ~ 
.'~ ^ 
^ ^ 
^ ~ ^ 
MAP I0 
~'~ CrJ 
It II 
H H H 
II II II 
! 
{ 
__J {. 
}__ t-- 
D 
20 
'a el. 
lID 
MAP I I 
or-) 
F-- 
Z 
CE 
H 
to ~ 
Z II II 0"1 
(_3 
• --J z C\]Z ~, 
J 
I.---i 
F--.-4 
I----4 
21 
12 
One locality, 3805 (Kingston, Dorset), even has /v/ in THISTLE and 
I~/ in FELLIES. Why only two of the 16 %h-words should show this 
kind of substitutic~ remains unexplained. ~here is certainly nothing 
in their etymology to account for it. 
Table ~ lists the only three sh-words that show more than a few 
scattered instances of initial /~/" The other ~tarred words of this 
sort, with the number of occurrences of initial voicing for each, are 
8heuf ~, s~ep i, she~f 3j 8hoe 2, and sh~;eZ 3. Even the three 
words listed have a ratio characteristic of the low end of the other 
lists: 52:22 for SUGAR (a French loanword), 57:18 for the native 
word "SHILLING, and 62:13 for another French word, SURE. Presumably 
the two French words had sc~ellhing like initial \[s J-\] at the time of 
borrowing, while 8~ILL~N~ and the other native words had /s~-/ until 
later OE. Instances of voicing in these words is to be attributed 
wholly to analogy, prest~ably with the s-words. As might be 
expected, the heaviest concentration of voiced forms is in the "hard- 
core" voicing areas in Devonshire and western Hampshire (see Map 5), 
but there are scattered instances in Sc~ereet, Dorset, and Wilts. 
(see Map ii for SHILLING). 
Just as Tables i - 4 reveal the wide variety in voiceless-voiced 
ratio for individual words, so Maps 2 - 5 show the same variety for 
individual localities. These m~ps have been simplified from the 
original cce~uter-produced proportional maps described above, which 
were in the form shown in Map 1. Taken together, they de~nstrate 
6 
22 
13 
several interesting points. All four of them show a strongly marked 
isogloss on the eastern boundary of the voicing area. This line 
starts o~ the coast east of Portsmouth, trends in a northwesterly 
directic~ across the middle of Hampshire, follows the Wiltshire - 
Berkshire line for a short distance, and then turns vest across 
northern Wiltshire toward the Cotswolds. The plotting of its sub- 
sequent course must await the publication of the records fr~ Glou- 
cestershire and Hereford. It marks a sharp division quite different 
frem the grading noticeable within the voicing area. At one point 
about ten miles north of Winchester it passes between two localities 
(3901 and 3902) showing respectively total voicing and total voice- 
lessness in the th-words. A little to the north of this it sepa- 
rates two localities--Burbage, Wilts., and Inkpen, Berks.--which are 
only 8 miles apart and yet show voiceless voiced ratios of @:16 and 
15:1 respectively in the th-words. It is equally well marked for the 
other sets, though in the case of the f-words the ~explicable eastern 
extension of voicing in two words--FELLIES AND FURR0W--creates the 
appearance of a transitional area (Map 2). 
The maps also show that the voicing did not extend to the south- 
west tip of Cornwall except in a few words. Presumably the English 
brought into this formerly Celtic area was more strongly influenced 
by standard English. The extreme case is represented by 3607, which 
has voicing in only two of the 68 words: FELLIES and FURROW, which, 
as we have seen above also extend beyond the voicing area on the east. 
23 
13A 
MAP i 
Or-) 
I-- 
Z 
E3 
0 
,.o ~_,, L_L__I C~Q 
-~ ~ C__~ ~ 
7 
cl~ m',l o, ) , 
o-~ ~''" a) ~ ~,, 
r ~ 
I.-.---I 
m,J 
2.4 
MAP 2 
~,~~ 
~,-" OHUgll 
25 
13C 
t 
L. 
I 
II 
t, 
MAP 3 
~. o~o 
H ~" ,~ L,-~. H~I 
~ ,-4 L ~- ~I 
' ~Hm 
2G 
13D 
MAP 4 
~ o ~ 0 
-r" 
27 
MAP 5 
O ~ H 
I ~I 
28 
Maps 2 - 5 also reveal rather similar patterns of voicing in 
different parts of the region. Just west of the main eastern iso- 
gloss is an area where voicing is virtually total for all sets of 
words. The area differs in size from one set to another, being 
larger for the f- and th-words than for the s- and sh-words, but its 
heart is western Hampshire, Dorset, and part of southern Wiltshire. 
It is separated from the other area of almost total voicing in Devon- 
shire by a mixed area in Somerset and Wiltshire where the proportion 
of voicing among the words examined ranges from 25% to 75%. 
Finally there are fringe areas in the east and in southern Cornwall 
where ~ly a few of the words~those high on the lists in Tables 
1 - h--show voicing. In the light of the traditional historical view 
of a feature spreading west from Kent, as ex~pressed in the quotation 
from Horn-Lehnert above, this distribution is a bit puzzling. Full 
exploration of its implications must be postponed to a later study. 
One possible explanation of the variation in the incidence of 
voicing from one word to another is to be sought in the influence of 
the sounds immediately following the initial fricative. Chart I 
displm~vs some of the characteristics of the four sets with regard to 
to following phonemes. The sh-words have far less initial voicing 
than the other sets. No more thou 22 localities have initial voicing 
for any single word in this set. On the other hand, the s-words dem- 
omStrate the widest range: frc~ voicing in 46 localities to a long 
tail of words with no initial voicing anywhere. The most stable is 
29 
lkA i 
3O 
15 
the th- set, which has between 45 and 33 localities with voicing. 
This means that for all words but one in this set, over half the 
localities have a voiced initial fricative. This is the highest 
proportion for any set. 
The sound following the initial fricative has been indicated on 
Chart I in terms of three categories: 
+ vowel (unmarked) 
+ voiced consonant (marked with square) 
+ voiceless consonant (marked with dot) 
There is a striking correlation between voiceless initial fricative 
and a following voiceless consonant in the s-words (no other set has 
a voiceless consonant in second position). With the exception of 
spring (72:3) and STITCH (74:1), the association is absolute for 
Ik/ school, scratch, sky, squirrel 
(s) + /p/ spade, speak, spokes 
It~ stars, steal, stile, stool, straw 
0nly STITCH has been included in the computation. 
Voiced consonants in second position of s-words also associate 
with initial /s/ rather than /z/: 
~ /w/ SWRAT (41:34~ SWEAR (52:23) 
J/n/ SNOW (59:16), SNObT (70:4) <s> + 
! I /m/ 
SMOKE (60:15) 
~/1/ SLEDGE (68:3) 
In the case of the other sets, however, there is no clear connection: 
31 
~th> 
<f> 
÷ 
1 
16 
Ir/ THRESH (31:~3), THREE (33:42), THREAD (36:39) 
/r/ FRIDAY-I (29:46), FRIDAY-2 (28:44) 
FROGS (33:42). 
/i/ FLEAS (33:42), FLIES (34:~0), FLOUR (35:40)~ 
FLOOR (36:39)~ FLITCH (26:38), FLOWERS (39:36) 
Vowels present a more complex situation. Table 6 shows that for 
f-, s-, and th-words, high and low front vowels /y, i; ~/ and low 
central vowels /a, a/ associate with initial voicing. For the three 
sets taken together the proportion of voiced initial fricatives occur- 
ring with these vowels following is between 60 and 63 per cent. 
TABLE 6 
PROPORTION OF VOICELESS TO VOICED INITIAL FRICATIVES 
IN RELATION TO THE FOLLOWING VOWEL IN F-, S-, TH-WORDS 
Mid 
Front 
62% 
189/308 
36% 248/lhO 
6o% 
Low 119/176 
Central Back 
5O% 6/0 iO9/lO8 
55% 5~% ~35/5h0 h3/50 
63~ ~o% 
197/329 252/167 
There is negative association between voicing and mid front /e, e, c£/ 
as well as low back vowels /D, 3/. A study of t~e values for the 
individual sets reveals that initial voicing is particularly associ- 
ated with high and low front vowels in f-words (72% and 78% respec- 
tively) and with low central vowels in 8- and th-words (77% and 83%). 
32 
16..~ 
MAP 12 
O~) 
Z 
CE 
Z 
II II 
_J 
CE 
~) 
33 
16B 
t~'tL" ~" ~=, ;='~, 
MAP 13 
p-- 
Z 
• II II 
__J 
I--- z rY- 
. I----I III 
ZLL 
3~ 
17 
In f- and th-words low back vowels have a particularly low sssocia- 
ti(m with initial voicing (20% and 30%, respectively). Vowel length 
appears to have no appreciable correlation with the voicing or 
unvoicing of the initial fricative. 
Rounded high front vowels /Y, y/ showed a more marked associa- 
tion with initial voicing than unrounded high front vowels /I, i/. 
The ratios for all f- and s-words with rounded front vowels are: 
FOOT @:16; SOOT 1:15, SUCK @:7, SUET 1:15. If we make a table for all 
the second vowels in SOOT (Map 12), we get: 
Central Back 
High i:15 @:i 32:19 
1:2 Mid 
Low 
There is, of course, a possibility here of pseudo-correlation, caused 
for instance by the possibility that those localities which have 
the fronted /Y/ for standard English /U/ in words like SOOT happen 
to be located in the area of strongest voicing. Investigation of this 
possibility must await further study. 
The tendency for high front vowels generally to co-occur with 
voiced initial fricatives is, however, obvious in words like FERN 
(Map 13) where no rounding is involved: 
Front Central ~:ib 
Mid @:2 30:21 
Low @:6 
35 
18 
Finally, to illustrate the difference in correlation with low central 
and low back vowels, we may cite the figures for FORTY. The 
voiceless:voiced ratio with a low central vowel \[a, a:, a, a:\] is 
13:17~ while with a low back \[D, ~\] it is 27:18. 
It is clear that even more study, of individual words and indi- 
vidual localities, is needed before all the complications of this 
one dialect feature can be unraveled. We should, for example~ take 
into account the second and third responses for many of the words~ 
many of which were taken from incidental conversation and hence are 
inclined to be more natural. Even casual inspection of the data 
indicates that they show a much higher incidence of initial voicing 
than do the citation forms. But we hope that this paper has shown 
that, given adequate and convenient data, the computer can be of 
inestimable aid to the dialectologist. 
36 
19 
NOTES 
1An exception is the Dictionary of American Regional English (DARE), 
being prepared at the University of Wisconsin under the direction 
of Frederic O. Cassidy, which is employing seine sophisticated 
computer techniques. 
2See under Orton and Dieth in Bibliography. This work will hence- 
forth be referred to as SED or the Survey. 
3The starred words are those which were included primarily for their 
phonolosical imports~ce. Fieldworkers were instructed to obtain 
them at all costs, even if they had to suggest the word and ask 
the informant to pronounce it. In most cases words were chosen 
,that have universal distribution in the dialects, but occasionally 
a word thought to be common turned out to be unfamiliar or even 
unknown, as in the case of FORKS, FORD, end FLITCH in our corpus. 
~he questionnaire is divided into nine books, each of which is subdi- 
vided into sections containing several questions. An item is thus 
identified by a ~hree-part number, e.g. VIII.4.6, indicating 
question 6 in section 4 of book VIII. We changed the Roman num- 
erals to Arabic in the interest of simpler coding. 
5For an alt~rn~ive theory, holding that initial fricatives were 
already voiced in the language of the Jutes and Frisians who settled 
Kent, see Bennett 1955 in Bibliography. 
6We hope to explore the linguistic implications of this project more 
fully in a later article. 
7One instance of /~r-/ in THRESH is reported from 3905 Hambledon, 
Hants., which is Just within the eastern border of the voicing 
8~ea. 
37 
38 
2i 
Kils--, Sir James, The Dialect of the Ne~ Forest in HumFshlre. 
Publications of the Philological Society 4, Oxford, 
1913. 
Wright, Joseph, The EW~lish Dialect Gr~nmar. Oxford, 1905, repr. 
1968. S|278, 310, 313, 320• 322. 
• An Elemen,~n~ Middle English Gmmm, ur, 2nd ed. 
Oxford, 1928. ~236. 
39 
APPENDIX A, CODING SYST\]~4 22 
The following system was used in coding the data for the computer. 
VOWELS OOiSO~aTS 
z= \[ii ~3 8= \[el 
E = \[e\] D = \[d\] 
= \[eel B = \[b\] 
A= \[aa~a~a\] z= \[z ~\] 
) = \[~ ~\] ~= \[z\]'- 
7= \[A÷X'X\] F= \[f\] 
= \[~@~3 2 = \[~3 
u= \[uu\] $= \[f\] (i.e. \[~\]) 
5- \[~3 w = \[w\] 
= \[~1\] L = \[! ~'\] 
0 = \[o o q 5\] R = \[r ~'\] also "r-coloring" 
9 = \[~3 
DIACRITICS M = \[m\] 
.= : j= \[j\] 
= " 3 = \[~\] (i.e. \[~\]) 
? = \[~\] 
XXX = n.a., n.k., n.r., etc. 
40 
23 
APP~IX B, PREPARIRG THE MAPS 
by Gerald M. Rubin 
When it was first decided to produce dialect maps by computer, 
several methods were discussed. Output could be intricately placed 
c~ a printed page, and then a map outline could be superimposed on 
that page by hand. In this way we could achieve our basic goal, that 
of having the computer de the tedious task of sorting and tallying 
the linguistic data to be displayed, while leaving a minimum of work 
to the researcher. But this idea was not pliable enough to let us 
represent a map as it actually is. The squareness of the format and 
the constant distance between characters on the printed page made it 
impossible to reproduce any map with sufficient accuracy. 
A second method discussed was to output the entire map and data 
on a visual display unit such as a CRT scope. Here we could draw the 
map, but our printing format was again too strict. This method also 
is expensive, since it requires the use of an on-line scope. 
We finally decided upon an off-line plotter. The one we used 
was a CalCump #563 Digital Plotter. This machine takes a conputer 
tape which has been prodneed by an on-line computer program and 
draws the date in the tape onto a roll of paper. In order to visual- 
ize how the plotter works, imagine a set of coordinate axes with a 
y-axis about 12" long and an infinitely long x-axis. This grid is 
the piece of paper to he plotted on. The instructions to the plotter 
are simple. They boil down to two: , ~ lower or raise the pen point 
(so it will or will not write as it moves) and move the pen in a 
straight line to location (x,y) on the grid. In this manner anything 
can be drawn, fr~ straight lines to circles, letters, and numbers 
(curves are actually made up of very short straight line segments ). 
The routines used to produce CalComp tapes are FORTRAN sub- 
routines. It was therefore necessary to write a FORTRAN program 
whose input would be (i) instructions for drawing the outline of a 
map and plotting localities within it, and (2) linguistic data in a 
specially processed format. The output would be the tape which 
directs the plotter. 
24 
Two methods were tried out for producing the map outline. In 
the first, a transparent grid was placed over a map and coordinates 
of "bends" in the outline were recorded. Thus a map would be pro- 
duced by moving in a straight line from one bend to another. The 
map produced by this method gave only a rough approximation of the 
original because we could not find a grid small enough to represent 
accurately all the bends and curves in the outline of the map. The 
second method was much more successful. A digitizer was used to 
measure and record the coordinates of bonds in the outline and the 
positions of the localities within the area covered. The digitizer 
(~sed by courtesy of the American Mathematical Society, Providence, 
R.I.) consists of a large table and a "bomb-sight" connected to a 
paper-tape punch. The crosshairs of the bombsight are moved along 
the outline of the map, and all coordinates are accurately measured 
and p~ched onto paper tape. This paper tape then becomes the direct 
input to the computer. Special codes were used to instruct the plot- 
ter to lift the pen (to move away and draw an island, for example) or 
to indicate that the next coordinates would be the positions of 
localities on the map. When the program was completed, the instruc- 
tions for drawing the outline of England and marking the coordinates 
of the 311 localities of the SurUey of EngZish D4uleot8 required 
about 4000 x and y measurements. \[For the present paper only the 
bottom quarter of the map, with 75 localities, was used.\] 
The raw linguistic data was keypunched onto cards using the 
phonetic coding discussed above (Appendix A). Also on the cards were 
the numbers of the county and locality and the keywords. For example, 
the card for SUGAR, county 31, locality ~, was 
310~ SU~A~ ;%G)R: 5. 8.10 
f 
Since the corpus of data included 68 words for each of the 75 loc- 
alities in Southern England, the card input consisted of 5100 cards. 
These records could be sorted by phonetic word (citation), by locality, 
by key~ord, or by any combination of these. 
42 
25 
A PL/I program was written to examine the records and to output 
the linguistic results which were to be drawn onto a map. This 
output was then read in by the FORTRAN plotting program. The input 
to the plotting program was of a standard format: 
MAP NB~4BER; COUNTY; LOCALITY; NUMBER OF SYMBOL$; LINE NUMBER; MESSAGE 
For example, the card 
3 33 5 7 3-F/I-V 
meant "on map #3 draw the seven characters 3-F/I-V at locality 5 in 
county 33." "Line number" was used to produce the legend at the 
bottom of each map. The legend was drawn whenever the county and 
locality numbers were @. For example, the instruction 
2 ~ ~ 18 i INITIAL CONSONANTS 
meant "~ map #2 write the above message at the position for line i 
of the legend." 
The input to the plotting program came in sorted by map number. 
Whenever a new map number was read in (meaning that all the data for 
the previous map had been plotted), the program would issue instruc- 
tions to move over on the plotting paper and draw a new map outline. 
Data would then be drawn on this new map until a different one was 
requested. Thus one run of each of the PL/I and FORTRAN programs 
could produce any number of related or even unrelated maps. 
It should be apparent that the great virtues of this method of 
producing both working and finished dialectal maps are speed and 
accuracy. Anyone who has produced even rough working maps by 
stamping or drawing symbols on an outline map knows how time- 
consuming and tedious this process is, and how subject it is to 
error. The computer never wearies and never (we trustl) makes an~ 
error. As a result, the dialectologist can experiment with all kinds 
of working maps and select those which are interesting or significant. 
This should add a new dimension to the study of linguistic geography. 
~3 

References

Bennett, W. H., "The Southern English Development of Germanic 
Initial \[f s )\]," Language 31:367-71 (1955). 

Elworthy, F. T., The Dialect of West Somerset. English Dialect 
Society, Series D, vol. 20, 1875. 

Hewett, Sarah, i"hs P~sant Speech of Devon. Lundon, 1892 

Horn, W., and M. Lmhnert, I~m~t und Leben: Engl~sche Luutgesuhichte 
der n~eren Zeit, vol. II. Berlin, 1954. 

Jacobson, Ulf, Phonologi~l Dialect Constituents in the Vox~ulary 
of St~ English. Lund Studies in English 31, Lund, 
1962. 

Jennings, J. K., The Dialect of the West of England, Purt4ouZarly 
SG, ersetshire... 2nd ed., London, 1869. 

Jordan, R., rev. H. C. Matthes, Hatch tier MitteDm,gllschen G~- 
matik, vol. I, §159. Heidelberg, 1932. 

KJederqvist, J., The Dialect of Pewsey (Wiltshire). Transactions 
of the Philological Society, London, 1903-0~. 

Kruisinga, E., A Grammar of the Dialect of West Somerset. Bonner 
Beitr~ge zur Anglistik 18, 1905. 

Kurath, H., "The Loss of Long Consonants snd the Rise of Voiced 
Fricatives in Middle English," la.guage 32:~35-~5 (1956). 

Matthews, W., "South Western Dialect in 11he Early Modern Period." 
NeophiL~logus, 2~:193-209, 1939. 

Moore, Samuel, rev. by A. H. Marckwardt, Historical Outlines of 
English Sounds and Inflections. Ann Arbor, 1951. p. llS. 

Moss~, F., tr. by J. A. Walker, A Handbook ofNiddZe ~gllsh. Balti- 
more, 1952. §~, pp. 39f, also Fig. 6, p. 38. 

Orton, Harold, and Eugen Dieth, Survey of EngZish DiaL~uts. Intro- 
duot/on, by Harold Ortca (Leeds, 1962);, The Busi~ Material, 
vol. h, The Southern Counties, ed. H. Ortc~ and M. F. 
Wakelin (Leeds, 1967). 

Widen, B., ~udies on the Dorset Dialeot. Lund Stud/es in English 
16, Lurid, 19~9. 
