Olly Betts wrote:
> I've just discovered that this patch incorrectly converted romanian1
> files to utf-8, but they were already in utf-8 (the "make check" rule
> didn't catch this because romanian1 isn't built into libstemmer by
> default). Sorry about that.
>
> This patch reverts those files to their original state:
>
> http://oligarchy.co.uk/xapian/patches/snowball-fix-overencoding-of-romanian1.patch
I've reverted the files.
> A related issue - there are a small number of examples in the hungarian
> vocabulary which contain upper case ASCII letters. Would it make sense
> to just change these to lower case for consistency with the other test
> vocabularies?
I think it would make sense to change these to lower case, so I've done
so. It doesn't change the output.txt file at all (as expected).
-- Richard
This archive was generated by hypermail 2.1.3 : Thu Sep 20 2007 - 12:02:49 BST