Proceedings of the Workshop on Statistical Machine Translation, pages 7–14,
New York City, June 2006. c©2006 Association for Computational Linguistics
Initial Explorations in English to Turkish Statistical Machine Translation
˙Ilknur Durgar El-Kahlout
Faculty of Enginering
and
Natural Sciences
Sabancı University
Istanbul, 34956, Turkey
ilknurdurgar@su.sabanciuniv.edu
Kemal Oflazer
Faculty of Engineering
and
Natural Sciences
Sabancı University
Istanbul, 34956, Turkey
oflazer@sabanciuniv.edu
Abstract
This paper presents some very prelimi-
nary results for and problems in develop-
ing a statistical machine translation sys-
tem from English to Turkish. Starting with
a baseline word model trained from about
20K aligned sentences, we explore various
ways of exploiting morphological struc-
ture to improve upon the baseline sys-
tem. As Turkish is a language with com-
plex agglutinative word structures, we ex-
periment with morphologically segmented
and disambiguated versions of the parallel
texts in order to also uncover relations be-
tween morphemes and function words in
one language with morphemes and func-
tions words in the other, in addition to re-
lations between open class content words.
Morphological segmentation on the Turk-
ish side also conflates the statistics from
allomorphs so that sparseness can be al-
leviated to a certain extent. We find
that this approach coupled with a simple
grouping of most frequent morphemes and
function words on both sides improve the
BLEU score from the baseline of 0.0752
to 0.0913 with the small training data. We
close with a discussion on why one should
not expect distortion parameters to model
word-local morpheme ordering and that a
new approach to handling complex mor-
photactics is needed.
1 Introduction
The availability of large amounts of so-called par-
allel texts has motivated the application of statisti-
cal techniques to the problem of machine translation
starting with the seminal work at IBM in the early
90’s (Brown et al., 1992; Brown et al., 1993). Statis-
tical machine translation views the translation pro-
cess as a noisy-channel signal recovery process in
which one tries to recover the input “signal” e, from
the observed output signal f.1
Early statistical machine translation systems used
a purely word-based approach without taking into
account any of the morphological or syntactic prop-
erties of the languages (Brown et al., 1993). Lim-
itations of basic word-based models prompted re-
searchers to exploit morphological and/or syntac-
tic/phrasal structure (Niessen and Ney, (2004),
Lee,(2004), Yamada and Knight (2001), Marcu and
Wong (2002), Och and Ney (2004),Koehn et al.
(2003), among others.)
In the context of the agglutinative languages sim-
ilar to Turkish (in at least morphological aspects) ,
there has been some recent work on translating from
and to Finnish with the significant amount of data
in the Europarl corpus. Although the BLEU (Pap-
ineni et al., 2002) score from Finnish to English is
21.8, the score in the reverse direction is reported
as 13.0 which is one of the lowest scores in 11 Eu-
ropean languages scores (Koehn, 2005). Also, re-
ported from and to translation scores for Finnish are
the lowest on average, even with the large number of
1Denoting English and French as used in the original IBM
Project which translated from French to English using the paral-
lel text of the Hansards, the Canadian Parliament Proceedings.
7
sentences available. These may hint at the fact that
standard alignment models may be poorly equipped
to deal with translation from a poor morphology lan-
guage like English to an complex morphology lan-
guage like Finnish or Turkish.
This paper presents results from some very pre-
liminary explorations into developing an English-to-
Turkish statistical machine translation system and
discusses the various problems encountered. Start-
ing with a baseline word model trained from about
20K aligned sentences, we explore various ways of
exploiting morphological structure to improve upon
the baseline system. As Turkish is a language with
agglutinative word structures, we experiment with
morphologically segmented and disambiguated ver-
sions of the parallel text, in order to also uncover
relations between morphemes and function words in
one language with morphemes and functions words
in the other, in addition to relations between open
class content words; as a cursory analysis of sen-
tence aligned Turkish and English texts indicates
that translations of certain English words are actu-
ally morphemes embedded into Turkish words. We
choose a morphological segmentation representa-
tion on the Turkish side which abstracts from word-
internal morphological variations and conflates the
statistics from allomorphs so that data sparseness
can be alleviated to a certain extent.
This paper is organized as follows: we start with
the some of the issues of building an SMT system
into Turkish followed by a short overview Turk-
ish morphology to motivate its effect on the word
alignment problem with English. We then present
results from our explorations with a baseline sys-
tem and with morphologically segmented parallel
aligned texts, and conclude after a short discussion.
2 Issues in building a SMT system for
Turkish
The first step of building an SMT system is the com-
pilation of a large amount of parallel texts which
turns out to be a significant problem for the Turkish
and English pair. There are not many sources of such
texts and most of what is electronically available
are parallel texts diplomatic or legal domains from
NATO, EU, and foreign ministry sources. There
is also a limited amount data parallel news corpus
available from certain news sources. Although we
have collected about 300K sentence parallel texts,
most of these require significant clean-up (from
HTML/PDF sources) and we have limited our train-
ing data in this paper to about 22,500 sentence sub-
set of these parallel texts which comprises the sub-
set of sentences of 40 words or less from the 30K
sentences that have been cleaned-up and sentence
aligned.2 a03
The main aspect that would have to be seri-
ously considered first for Turkish in SMT is the
productive inflectional and derivational morphol-
ogy. Turkish word forms consist of morphemes
concatenated to a root morpheme or to other mor-
phemes, much like “beads on a string” (Oflazer,
1994). Except for a very few exceptional cases,
the surface realizations of the morphemes are con-
ditioned by various local regular morphophonemic
processes such as vowel harmony, consonant assim-
ilation and elisions. Further, most morphemes have
phrasal scopes: although they attach to a partic-
ular stem, their syntactic roles extend beyond the
stems. The morphotactics of word forms can be
quite complex especially when multiple derivations
are involved. For instance, the derived modifier
saˇglamlas¸tırdıˇgımızdaki 4 would be bro-
ken into surface morphemes as follows:
saˇglam+las¸+tır+dıˇg+ımız+da+ki
Starting from an adjectival root saˇglam, this word
form first derives a verbal stem saˇglamlas¸, meaning
“to become strong”. A second suffix, the causative
surface morpheme +tır which we treat as a verbal
derivation, forms yet another verbal stem meaning
“to cause to become strong” or “to make strong (for-
tify)”. The immediately following participle suffix
2We are rapidly increasing our cleaned-up text and expect to
clean up and sentence align all within a few months.
3As the average Turkish word in running text has between
2 and 3 morphemes we limited ourselves to 40 words in the
parallel texts in order not to exceed the maximum number of
words recommended for GIZA++ training.
4Literally, “(the thing existing) at the time we caused (some-
thing) to become strong”. Obviously this is not a word that one
would use everyday, but already illustrates the difficulty as one
Turkish “word” would have to be aligned to a possible discon-
tinues sequence of English words if we were to attempt a word
level alignment. Turkish words (excluding noninflecting fre-
quent words such as conjunctions, clitics, etc.) found in typical
running text average about 10 letters in length. The average
number of bound morphemes in such words is about 2.
8
+dıˇg, produces a participial nominal, which inflects
in the normal pattern for nouns (here, for 1a0a1 per-
son plural possessor which marks agreement with
the subject of the verb, and locative case). The final
suffix, +ki, is a relativizer, producing a word which
functions as a modifier in a sentence, modifying a
noun somewhere to the right.
However, if one further abstracts from the mor-
phophonological processes involved one could get a
lexical form
saˇglam+lAs¸+DHr+DHk+HmHz+DA+ki
In this representation, the lexical morphemes ex-
cept the lexical root utilize meta-symbols that stand
for a set of graphemes which are selected on the
surface by a series of morphographemic processes
which are rooted in morphophonological processes
some of which are discussed below, but have noth-
ing whatsoever with any of the syntactic and se-
mantic relationship that word is involved in. For
instance, A stands for back and unrounded vowels
a and e, in orthography, H stands for high vow-
els ı, i, u and ¨u, and D stands for d and t, repre-
senting alveolar consonants. Thus, a lexical mor-
pheme represented as +DHr actually represents 8
possible allomorphs, which appear as one of +dır,
+dir, +dur, +d¨ur, +tır, +tir, +tur, +t¨ur depending
on the local morphophonemic context. Thus at this
level of representation words that look very differ-
ent on the surface, look very similar. For instance,
although the words masasında ’on his table’ and def-
terinde ’in his notebook’ in Turkish look quite dif-
ferent, the lexical morphemes except for the root
are the same: masasında has the lexical structure
masa+sH+ndA, while defterinde has the lexical
structure defter+sH+ndA.
The use of this representation is particularly im-
portant for Turkish for the following reason. Allo-
morphs which differ because of local word-internal
morphographemic and morphotactical constraints
almost always correspond to the same words or units
in English when translated. When such units are
considered by themselves as the units in alignment,
statistics get fragmented and the model quality suf-
fers. On the other hand, this representation if di-
rectly used in a standard SMT model such as IBM
Model 4, will most likely cause problems, since
now, the distortion parameters will have to take on
the task of generating the correct sequence of mor-
phemes in a word (which is really a local word-
internal problem to be solved) in addition to gen-
erating the correct sequence of words.
3 Aligning English–Turkish Sentences
If an alignment between the components of paral-
lel Turkish and English sentences is computed, one
obtains an alignment like the one shown in Figure
1, where it is clear that Turkish words may actually
correspond to whole phrases in the English sentence.
Figure 1: Word level alignment between a Turkish
and an English sentence
One major problem with this situation is that even
if a word occurs many times in the English side,
the actual Turkish equivalent could be either miss-
ing from the Turkish part, or occur with a very low
frequency, but many inflected variants of the form
could be present. For example, Table 1 shows the
occurrences of different forms for the root word
faaliyet ’activity’ in the parallel texts we experi-
mented with. Although, many forms of the root
word appear, none of the forms appear very fre-
quently and one may even have to drop occurrences
of frequency 1 depending on the word-level align-
ment model used, further worsening the sparseness
problem.5
To overcome this problem and to get the max-
imum benefit from the limited amount of parallel
texts, we decided to perform morphological analy-
sis of both the Turkish and the English texts to be
able to uncover relationships between root words,
suffixes and function words while aligning them.
5A noun root in Turkish may have about hundred inflected
forms and substantially more if productive derivations are con-
sidered, meanwhile verbs can have thousands of inflected and
derived forms if not more.
9
Table 1: Forms of the word faaliyet ’activity’
Wordform Count Gloss
faaliyet 3 ’activity’
faaliyete 1 ’to the activity’
faaliyetinde 1 ’in its activity’
faaliyetler 3 ’activities’
faaliyetlere 6 ’to the activities’
faaliyetleri 7 ’their activities’
faaliyetlerin 7 ’of the activities’
faaliyetlerinde 1 ’in their activities’
faaliyetlerine 5 ’to their activities’
faaliyetlerini 1 ’their activities (acc.)’
faaliyetlerinin 2 ’of their activities’
faaliyetleriyle 1 ’with their activities’
faaliyette 2 ’in (the) activity’
faaliyetteki 1 ’that which is in activity’
Total 41
On the Turkish side, we extracted the lexical mor-
phemes of each word using a version of the mor-
phological analyzer (Oflazer, 1994) that segmented
the Turkish words along morpheme boundaries and
normalized the root words in cases they were de-
formed due to a morphographemic process. So
the word faaliyetleriyle when segmented into lexical
morphemes becomes faaliyet +lAr +sH +ylA. Am-
biguous instances were disambiguated statistically
(K¨ulekc¸i and Oflazer, 2005).
Similarly, the English text was tagged using Tree-
Tagger (Schmid, 1994), which provides a lemma
and a POS for each word. We augmented this pro-
cess with some additional processing for handling
derivational morphology. We then dropped any tags
which did not imply an explicit morpheme (or an
exceptional form). The complete set of tags that are
used from the Penn-Treebank tagset is shown in Ta-
ble 2.6 To make the representation of the Turkish
texts and English texts similar, tags are marked with
a ’+’ at the beginning of all tags to indicate that such
tokens are treated as “morphemes.” For instance,
the English word activities was segmented as activ-
6The tagset used by the TreeTagger is a refinement of Penn-
Treebank tagset where the second letter of the verb part-of-
speech tags distinguishes between ”be” verbs (B), ”have” verbs
(H) and other verbs (V),(Schmid, 1994).
ity +NNS. The alignments we expected to obtain are
depicted in Figure 2 for the example sentence given
earlier in Figure 1.
Table 2: The set of tags used to mark explicit mor-
phemes in English
Tag Meaning
JJR Adjective, comparative
JJS Adjective, superlative
NNS Noun, plural
POS Possessive ending
RBR Adverb, comparative
RBS Adverb, superlative
VB Verb, base form
VBD Verb, past tense
VBG Verb, gerund or present participle
VBN Verb, past participle
VBP Verb, non3rd person singular present
VBZ Verb, 3rd person singular present
Figure 2: “Morpheme” alignment between a Turkish
and an English sentence
4 Experiments
We proceeded with the following sequence of exper-
iments:
(1) Baseline: As a baseline system, we used a
pure word-based approach and used Pharaoh Train-
ing tool (2004), to train on the 22,500 sentences, and
decoded using Pharaoh (Koehn et al., 2003) to ob-
tain translations for a test set of 50 sentences. This
gave us a baseline BLEU score of 0.0752.
(2) Morpheme Concatenation: We then trained
the same system with the morphemic representation
10
of the parallel texts as discussed above. The de-
coder now produced the translations as a sequence
of root words and morphemes. The surface words
were then obtained by just concatenating all the
morphemes following a root word (until the next
root word) taking into just morphographemic rules
but not any morphotactic constraints. As expected
this “morpheme-salad” produces a “word-salad”, as
most of the time wrong morphemes are associated
with incompatible root words violating many mor-
photactic constraints. The BLEU score here was
0.0281, substantially worse than the baseline in (1)
above.
(3) Selective Morpheme Concatenation: With
a small script we injected a bit of morphotactical
knowledge into the surface form generation process
and only combined those morphemes following a
root word (in the given sequence), that gave rise to
a valid Turkish word form as checked by a morpho-
logical analyzer. Any unused morphemes were ig-
nored. This improved the BLEU score to 0.0424
which was still below the baseline.
(4) Morpheme Grouping: Observing that certain
sequence of morphemes in Turkish texts are trans-
lations of some continuous sequence of functional
words and tags in English texts, and that some mor-
phemes should be aligned differently depending on
the other morphemes in their context, we attempted
a morpheme grouping. For example the morpheme
sequence +DHr +mA marks infinitive form of a
causative verb which in Turkish inflects like a noun;
or the lexical morpheme sequence +yAcAk +DHr
usually maps to “it/he/she will”. To find such groups
of morphemes and functional words, we applied a
sequence of morpheme groupings by extracting fre-
quently occuring n-grams of morphemes as follows
(much like the grouping used by Chiang (2005): in a
series of iterations, we obtained high-frequency bi-
grams from the morphemic representation of paral-
lel texts, of either morphemes, or of previously such
identified morpheme groups and neighboring mor-
phemes until up to four morphemes or one root 3
morpheme could be combined. During this process
we ignored those combinations that contain punctu-
ation or a morpheme preceding a root word. A simi-
lar grouping was done on the English side grouping
function words and morphemes before and after root
words.
The aim of this process was two-fold: it let fre-
quent morphemes to behave as a single token and
help Pharaoh with identification of some of the
phrases. Also since the number of tokens on both
sides were reduced, this enabled GIZA++ to produce
somewhat better alignments.
The morpheme level translations that were ob-
tained from training with this parallel texts were then
converted into surface forms by concatenating the
morphemes in the sequence produced. This resulted
in a BLEU score of 0.0644.
(5) Morpheme Grouping with Selective Mor-
pheme Concatenation: This was the same as (4)
with the morphemes selectively combined as in (3).
The BLEU score of 0.0913 with this approach was
now above the baseline.
Table 3 summarizes the results in these five exper-
iments:
Table 3: BLEU scores for experiments (1) to (4)
Exp. System BLEU
(1) Baseline 0.0752
(2) Morph. Concatenation. 0.0281
(3) Selective Morph. Concat. 0.0424
(4) Morph. Grouping and Concat. 0.0644
(5) Morph. Grouping + (3) 0.0913
In an attempt to factor out and see if the transla-
tions were at all successful in getting the root words
in the translations we performed the following: We
morphologically analyzed and disambiguated the
reference texts, and reduced all to sequences of root
words by eliminating all the morphemes. We per-
formed the same for the outputs of (1) (after mor-
phological analysis and disambiguation), (2) and (4)
above, that is, threw away the morphemes ((3) is
the same as (2) and (5) same as (4) here). The
translation root word sequences and the reference
root word sequences were then evaluated using the
BLEU (which would like to label here as BLEU-r
for BLEU root, to avoid any comparison to previous
results, which will be misleading. These scores are
shown in Figure 4.
The results in Tables 3 and 4 indicate that with the
standard models for SMT, we are still quite far from
even identifying the correct root words in the trans-
11
Table 4: BLEU-r scores for experiments (1), (2) and
(4)
Exp. System BLEU
(1) Baseline 0.0955
(2) Morph. Concatenation. 0.0787
(4) Morph. Grouping 0.1224
lations into Turkish, let alone getting the morphemes
and their sequences right. Although some of this
may be due to the (relatively) small amount of paral-
lel texts we used, it may also be the case that splitting
the sentences into morphemes can play havoc with
the alignment process by significantly increasing the
number of tokens per sentence especially when such
tokens align to tokens on the other side that is quite
far away.
Nevertheless the models we used produce some
quite reasonable translations for a small number of
test sentences. Table 5 shows the two examples of
translations that were obtained using the standard
models (containing no Turkish specific manipula-
tion except for selective morpheme concatenation).
We have marked the surface morpheme boundaries
in the translated and reference Turkish texts to in-
dicate where morphemes are joined for exposition
purposes here – they neither appear in the reference
translations nor in the produced translations!
5 Discussion
Although our work is only an initial exploration
into developing a statistical machine translation sys-
tem from English to Turkish, our experiments at
least point out that using standard models to deter-
mine the correct sequence of morphemes within the
words, using more powerful mechanisms meant to
determine the (longer) sequence of words in sen-
tences, is probably not a good idea. Morpheme or-
dering is a very local process and the correct se-
quence should be determined locally though the ex-
istence of morphemes could be postulated from sen-
tence level features during the translation process.
Furthermore, insisting on generating the exact se-
quence of morphemes could be an overkill. On
the other hand, a morphological generator could
take a root word and a bag of morphemes and
Table 5: Some good SMT results
Input: international terrorism also remains to be an important
issue .
Baseline: ulus+lararası ter¨orizm de ¨onem+li kal+mıs¸+tır . bir
konu ol+acak+tır
Selective Morpheme Concatenation: ulus+lararası ter¨orizm
de ol+ma+ya devam et+mek+te+dir ¨onem+li bir sorun+dur .
Morpheme Grouping: ulus+lararası ter¨orizm de ¨onem+li bir
sorun ol+ma+ya devam et+mek+te+dir .
Reference Translation: ulus+lararası ter¨orizm de ¨onem+li bir
sorun ol+ma+ya devam et+mek+te+dir .
Input: the initiation of negotiations will represent the
beginning of a next phase in the process of accession
Baseline: m¨uzakere+ler+in g¨or+¨us¸+me+ler yap+ıl+acak bir
der+ken as¸ama+nın hasar+ı s¨urec+i bas¸langıc+ı+nı 15+’i
Selective Morpheme Concatenation: initiation m¨uzakere+ler
temsil ed+il+me+si+nin bas¸langıc+ı bir as¸ama+sı+nda katılım
s¨urec+i+nin ertesi
Morpheme Grouping: m¨uzakere+ler+in bas¸la+ma+sı+nın
bas¸langıc+ı+nı temsil ed+ecek+tir katılım s¨urec+i+nin bir
sonra+ki as¸ama
Reference Translation: m¨uzakere+ler+in bas¸la+ma+sı ,
katılım s¨urec+i+nin bir sonra+ki as¸ama+sı+nın bas¸langıc+ı+nı
temsil ed+ecek+tir
generate possible legitimate surface words by tak-
ing into account morphotactic constraints and mor-
phographemic constraints, possibly (and ambigu-
ously) filling in any morphemes missing in the trans-
lation but actually required by the morphotactic
paradigm. Any ambiguities from the morphologi-
cal generation could then be filtered by a language
model.
Such a bag-of-morphemes approach suggests that
we do not actually try to determine exactly where the
morphemes actually go in the translation but rather
determine the root words (including any function
words) and then associate translated morphemes
with the (bag of the) right root word. The resulting
sequence of root words and their bags-of-morpheme
can be run through a morphological generator which
can handle all the word-internal phenomena such as
proper morpheme ordering, filling in morphemes or
even ignoring spurious morphemes, handling local
morphographemic phenomena such as vowel har-
mony, etc. However, this approach of not placing
morphemes into specific position in the translated
output but just associating them with certain root
words requires that a significantly different align-
ment and decoding models be developed.
Another representation option that could be em-
12
ployed is to do away completely with morphemes on
the Turkish side and just replace them with morpho-
logical feature symbols (much like we did here for
English). This has the advantage of better handling
allomorphy – all allomorphs including those that are
not just phonological variants map to the same fea-
ture, and homograph morphemes which signal dif-
ferent features map to different features. So in a
sense, this would provide a more accurate decom-
position of the words on the Turkish side, but at the
same time introduce a larger set of features since
default feature symbols are produced for any mor-
phemes that do not exist on the surface. Removing
such redundant features from such a representation
and then using reduced features could be an interest-
ing avenue to pursue. Generation of surface words
would not be a problem since, our morphological
generator does not care if it is input morphemes or
features.
6 Conclusions
We have presented the results of our initial explo-
rations into statistical machine translation from En-
glish to Turkish. Using a relatively small parallel
corpus of about 22,500 sentences, we have exper-
imented with a baseline word-to-word translation
model using the Pharaoh decoder. We have also ex-
perimented with a morphemic representation of the
parallel texts and have aligned the sentences at the
morpheme level. The decoder in this cases produces
root word and morpheme sequences which are then
selectively concatenated into surface words by pos-
sibly ignoring some morphemes which are redun-
dant or wrong. We have also attempted a simple
grouping of root words and morphemes to both help
the alignment by reducing the number of tokens in
the sentences and by already identifying some pos-
sible phrases. This grouping of morphemes and the
use of selective morpheme concatenation in produc-
ing surface words has increased the BLEU score
for our test set from 0.0752 to 0.0913. Current
ongoing work involves increasing the parallel cor-
pus size and the development of bag-of-morphemes
modeling approach to translation to separate the
sentence level word sequencing from word-internal
morpheme sequencing.
7 Acknowledgements
This work was supported by T ¨UB˙ITAK (Turkish
Scientific and Technological Research Foundation)
project 105E020 ”Building a Statistical Machine
Translation for Turkish and English”.

References
Peter F. Brown, Stephen A. Della Pietra, Vincent J.
Della Pietra, John D. Lafferty, and Robert L. Mercer.
1992. Analysis, statistical transfer, and synthesis in
machine translation. In Proceeding of TMI: Fourth
International Conference on Theoretical and Method-
ological Issues in MT, pages 83–100.
Peter F Brown, Stephen A Della Pietra, Vincent J
Della Pietra, and Robert L Mercer. 1993. The mathe-
matics of statistical machine translation: Parameter es-
timation. Computational Linguistics, 19(2):263–311.
David Chiang. 2005. A hierarchical phrase-based model
for statistical machine translation. In Proceedings of
the 43rd Annual Meeting of the Association for Com-
putational Linguistics (ACL’05), pages 263–270, Ann
Arbor, Michigan, June. Association for Computational
Linguistics.
Philipp Koehn, Franz J. Och, and Daniel Marcu. 2003.
Statistical phrase-based translation. In Proceedings of
HLT/NAACL.
Philip Koehn. 2005. Europarl: A parallel corpus for sta-
tistical machine translation. In MT Summit X, Phuket,
Thailand.
M. Oguzhan K¨ulekc¸i and Kemal Oflazer. 2005. Pro-
nunciation disambiguation in turkish. In Pinar Yolum,
Tunga G¨ung¨or, Fikret S. G¨urgen, and Can C. ¨Ozturan,
editors, ISCIS, volume 3733 of Lecture Notes in Com-
puter Science, pages 636–645. Springer.
Young-Suk Lee. 2004. Morphological analysis for sta-
tistical machine translation. In Proceedings of HLT-
NAACL 2004 - Companion Volume, pages 57–60.
Daniel Marcu and William Wong. 2002. A phrase-
based, joint probability model for statistical machine
translation. In In Proceedings of the Conference on
Empirical Methods in Natural Language Processing
(EMNLP-02), Philadelphia.
Sonja Niessen and Hermann Ney. 2004. Statisti-
cal machine translation with scarce resources using
morpho-syntatic information. Computational Linguis-
tics, 30(2):181–204.
Franz J. Och and Hermann Ney. 2004. The alignment
template approach to statistical machine translation.
Computational Linguistics, 30(4):417–449.
Kemal Oflazer. 1994. Two-level description of Turk-
ish morphology. Literary and Linguistic Computing,
9(2):137–148.
Kishore Papineni, Todd Ward Salim Roukos, and Wei-
Jing Zhu. 2002. Bleu: a method for automatic eval-
uation of machine translation. In Proceedings of the
40th Annual Meeting of the Association for Computa-
tional Linguistics (ACL’02), pages 311–318, Philadel-
phia, July. Association for Computational Linguistics.
Helmut Schmid. 1994. Probabilistic part-of-speech tag-
ging using decision trees. In Proceedings of Interna-
tional Conference on New Methods in Language Pro-
cessing.
Kenji Yamada and Kevin Knight. 2001. A syntax-based
statistical translation model. In Proceedings of the
39th Annual Meeting of the Association for Compu-
tational Linguistics, pages 00–00, Toulouse.
