Portuguese stemmer error.
The problem was in the Snowball script, which failed to restore the
cursor to the beginning of the word at one significant point. The fix was
simply to insert an 'and' into the script. So we now have a slightly altered
Snowball script in place, a new file output.txt (and diffs.txt) in which the
words insuficiência, deficiência, deficiências, eficiência, impaciência etc.
are correctly stemmed, and new C and Java versions.
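
To illustrate the kind of bug described above, here is a minimal Python sketch. It is not the Snowball script or the actual fix: the names Cursor, starts_with, sequence, and_ and run are all hypothetical, and it assumes that an 'and' runs its second command from the same cursor position as the first, whereas plain sequencing leaves the cursor wherever the first command moved it.

```python
class Cursor:
    """Hypothetical stand-in for a stemmer's position in the word."""
    def __init__(self, text):
        self.text = text
        self.pos = 0


def starts_with(prefix):
    """Command: succeed and advance the cursor if `prefix` is at the cursor."""
    def command(cur):
        if cur.text.startswith(prefix, cur.pos):
            cur.pos += len(prefix)
            return True
        return False
    return command


def sequence(first, second):
    """Run `second` from wherever `first` left the cursor (no restore)."""
    def command(cur):
        return first(cur) and second(cur)
    return command


def and_(first, second):
    """Assumed 'and' behaviour: restore the cursor before running `second`."""
    def command(cur):
        start = cur.pos
        if not first(cur):
            return False
        cur.pos = start  # the restore step that plain sequencing lacks
        return second(cur)
    return command


def run(command, text):
    """Apply a command to a fresh cursor over `text`."""
    return command(Cursor(text))


first = starts_with("ab")
second = starts_with("a")
without_restore = run(sequence(first, second), "abc")
with_restore = run(and_(first, second), "abc")
```

Without the restore, the second test is tried from the wrong position and fails; with it, both tests see the word from the same starting point.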
Thanks to Frederick Brault for finding this,
Martin