Finnish is not an Indo-European language, but belongs to the Finno-Ugric
group, which again belongs to the Uralic group (*). Distinctions between
a-, i- and d-suffixes can be made in Finnish, but they are much
less sharply separated than in an Indo-European language. The system of
endings is extremely elaborate, but strictly defined, and applies equally to
all nominals, that is, to nouns, adjectives and pronouns. Verb endings have a
close similarity to nominal endings, which again makes Finnish very different
from any Indo-European language.
More problematical than the endings themselves is the change that can be
effected in a stem as a result of taking a particular ending. A stem typically
has two forms, strong and weak, where one class of ending follows the
strong form and the complementary class the weak. Normalising strong and weak
forms after ending removal is not generally possible, although the common case
where strong and weak forms only differ in the single or double form of a
final consonant can be dealt with.
Finnish includes the following accented forms,
-
ä ö
The following letters are vowels:
-
a e i o u y ä ö
R1 and
R2 are then defined in the usual way
(see the note on R1 and R2).
Do each of steps 1, 2, 3, 4, 5 and 6.
Step 1: particles etc
-
Search for the longest among the following suffixes in R1, and perform the
action indicated
- (a) kin kaan kään ko kö han hän pa pä
- delete if preceded by n, t or a vowel
- (b) sti
- delete if in R2
(Of course, the n, t or vowel of 1(a) need not be in R1: only
the suffix removed must be in R1. And similarly below.
Step 2: possessives
-
Search for the longest among the following suffixes in R1, and perform the
action indicated
- si
- delete if not preceded by k
- ni
- delete
- if preceded by kse, replace with ksi
- nsa nsä mme nne
- delete
- an
- delete if preceded by one of ta ssa sta lla lta na
- än
- delete if preceded by one of tä ssä stä llä ltä nä
- en
- delete if preceded by one of lle ine
The remaining steps require a few definitions.
Define a v (vowel) as one of a e i o u y ä ö.
Define a V (restricted vowel) as one of a e i o u ä ö.
So Vi means a V followed by letter i.
Define LV (long vowel) as one of aa ee ii oo uu ää öö.
Define a c (consonant) as a character other than a v.
So cv means a c followed by a v.
Step 3: cases
-
Search for the longest among the following suffixes in R1, and perform the
action indicated
- hXn preceded by X, where X is a V other than u (a/han, e/hen etc)
- siin den tten preceded by Vi
- seen preceded by LV
- a ä preceded by cv
- tta ttä preceded by e
- ta tä ssa ssä sta stä lla llä lta ltä lle na nä ksi ine
- delete
- n
- delete, and if preceded by LV or ie, delete the last vowel
So aarteisiin -> aartei, the longest matching suffix being siin,
preceded as it is by Vi. But adressiin -> adressi. The longest
matching suffix is not siin, because there is no preceding Vi, but n,
and then the last vowel of the preceding LV is removed.
Step 4: other endings
-
Search for the longest among the following suffixes in R2, and perform the
action indicated
- mpi mpa mpä mmi mma mmä
- delete if not preceded by po
- impi impa impä immi imma immä eja ejä
- delete
Step 5: plurals
-
If an ending was removed in step 3, delete a final i or j if in R1;
otherwise, if an ending was not removed in step 3, delete a final t in
R1 if it follows a vowel, and, if a t is removed, delete a final mma or
imma in R2, unless the mma is preceded by po.
Step 6: tidying up
-
Do in turn steps (a), (b), (c), (d), restricting all tests to the region
R1.
a) If R1 ends LV delete the last letter
b) If R1 ends cX, c a consonant and X one of a ä e i, delete the last
letter
c) If R1 ends oj or uj delete the last letter
d) If R1 ends jo delete the last letter
Do step (e), which is not restricted to R1.
e) If the word ends with a double consonant followed by zero or more vowels,
remove the last consonant (so eläkk -> eläk, aatonaatto ->
aatonaato)
|