[Snowball-discuss] Mismatch between vocab.txt and output.txt

From: Olly Betts (olly@survex.com)
Date: Mon Oct 14 2002 - 01:17:02 BST


I've found mismatches for french and finnish stemmers.

The first disagreement for finnish is that the stemmer produces
"aachenin" but output.txt contains "aachen".

And the first for french is "abaisai" when output.txt contains "abaiss".

I can generate full lists if they're useful, but I assume you have
testing scripts of your own...

Are the stemmers wrong, or being miscompiled, or is output.txt just
out of date for these two?

Cheers,
    Olly



This archive was generated by hypermail 2.1.3 : Thu Sep 20 2007 - 12:02:43 BST