[Snowball-discuss] Re: SnowBall German stemming

From: Martin Porter (martin_porter@softhome.net)
Date: Wed Mar 13 2002 - 18:39:24 GMT


>Thank you for your quick and kind answer! The problem with the special
>characters is not the ISO Latin code - I checked that already. Moreover,
>the stemmer algorithm is translating (not always, but often ;-)) these
>special characters (ü,ä,ö) into the right vowel (u, a, o). E.g. "übung"
>becomes "ubung" but "bücher" becomes "büch" instead of "buch"!!

I cannot see what is going on here. If you look at

http://snowball.sourceforge.net/german/diffs.txt

you can see that bu"cher goes to buch okay. If you can send a failing test
program I'll have a look at it, but I really think the stemmer works as
advertised. Please check further ...

Martin
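
A failing test program of the kind requested above could be as small as
the following Python sketch. It is an illustration only: decode_umlauts
and failing_cases are hypothetical helper names, the stem argument stands
for whatever stemmer binding is under test (no particular API is assumed),
and reading bu"cher as bücher is an assumption based on the notation used
in the reply.

```python
# Assumption: in the ASCII notation used in the reply, a lowercase vowel
# followed by a double quote stands for its umlaut form (bu"cher = bücher).
UMLAUTS = {'a"': "ä", 'o"': "ö", 'u"': "ü"}


def decode_umlauts(word):
    """Turn the ASCII umlaut notation into real letters (lowercase only)."""
    for ascii_form, letter in UMLAUTS.items():
        word = word.replace(ascii_form, letter)
    return word


def failing_cases(stem, expected):
    """Return (word, got, want) for each word whose stem differs from want.

    stem     -- any callable mapping a word to its stem (hypothetical hook)
    expected -- dict mapping input words to the stems they should produce
    """
    failures = []
    for word, want in expected.items():
        got = stem(word)
        if got != want:
            failures.append((word, got, want))
    return failures


if __name__ == "__main__":
    # Stand-in stemmer that returns its input unchanged; swap in the real one.
    def identity(word):
        return word

    # Word/stem pairs taken from this thread.
    expected = {decode_umlauts('bu"cher'): "buch", "übung": "ubung"}
    for word, got, want in failing_cases(identity, expected):
        print(f"{word}: got {got!r}, expected {want!r}")
```

If failing_cases returns an empty list for the real stemmer, the problem is
more likely in how the words reach the stemmer than in the algorithm itself.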
