Links to resources | ||||||||||||||||||||||||||
|
Here is a sample of vocabulary, with the stemmed forms that will be generated with the algorithm.
| ||||||||||||||||||||||||||
|
This stemming algorithm removes the inflectional suffixes of nouns. Nouns are inflected for case, person/possession and number. Letters in Hungarian include the following accented forms,
For example:
t ó b a n consonant-vowel
|.....| R1 is 'a b a n'
a b l a k a n vowel-consonant
|.........| R1 is 'l a k a n'
a c s o n y vowel-digraph
|.....| R1 is 'o n y'
c v s
--->|<--- null R1 region
‘Delete if in R1’ means that the suffix should be removed if it is in
region R1 but not if it is outside.
Do steps 1 to 9 in turn Step 1: Remove instrumental case
| ||||||||||||||||||||||||||
The full algorithm in Snowball
|