Re: [Snowball-discuss] More patches

From: Richard Boulton (richard@lemurconsulting.com)
Date: Fri Feb 16 2007 - 11:35:09 GMT


Olly Betts wrote:
> I've just discovered that this patch incorrectly converted romanian1
> files to utf-8, but they were already in utf-8 (the "make check" rule
> didn't catch this because romanian1 isn't built into libstemmer by
> default). Sorry about that.
>
> This patch reverts those files to their original state:
>
> http://oligarchy.co.uk/xapian/patches/snowball-fix-overencoding-of-romanian1.patch

I've reverted the files.

> A related issue - there are a small number of examples in the hungarian
> vocabulary which contain upper case ASCII letters. Would it make sense
> to just change these to lower case for consistency with the other test
> vocabularies?

I think it would make sense to change these to lower case, so I've done
so. It doesn't change the output.txt file at all (as expected).

-- 
Richard



This archive was generated by hypermail 2.1.3 : Thu Sep 20 2007 - 12:02:49 BST