Dear Snowball people,
I have built a stemmer in Snowball for Hungarian (well, four actually) and
tested it at CLEF 2005. The paper written about it can be viewed at
http://staff.science.uva.nl/~mdr/Publications/Files/clef2005-proc-adhoc.pdf
I imagine there is no point in having four stemmers, but I can send the
code of the best stemmer along with a description, and I can compile a word
list of 30,000 Hungarian words. Should I attach this material to a
message for the mailing list? Is there any other information that
is needed?
Thank you
Anna Tordai