Re: [Snowball-discuss] a simple algorithm problem

From: Martin Porter (martin.porter@grapeshot.co.uk)
Date: Thu Jan 06 2005 - 10:20:43 GMT


>Presumably this still restricts Snowball to code points in the BMP? Or
>does it just restrict it to recognising and doing things with
>characters at code points in the BMP, passing through any others?

It would be the latter. Since stemming is applicable to a system of
languages, all of whose characters are, I would assert, in the BMP, I do
not think that is a problem.
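
As a rough illustration of that pass-through behaviour, here is a minimal
sketch in Python. It is not Snowball's code and not how the compiler or
runtime is written. The names is_bmp and process_bmp_only are made up for
this example, and the "transform" argument merely stands in for whatever
a stemmer would do to a character it recognises.

```python
BMP_MAX = 0xFFFF  # highest code point in the Basic Multilingual Plane


def is_bmp(ch: str) -> bool:
    """Return True if the single character ch lies in the BMP."""
    return ord(ch) <= BMP_MAX


def process_bmp_only(text: str, transform) -> str:
    """Apply transform to BMP characters; copy all others through unchanged.

    Hypothetical helper: 'transform' is any function from one character
    to a string, standing in for real stemming logic.
    """
    return "".join(transform(ch) if is_bmp(ch) else ch for ch in text)


if __name__ == "__main__":
    # U+10428 lies outside the BMP, so it is never handed to the transform.
    sample = "a\U00010428"
    print(process_bmp_only(sample, str.upper) == "A\U00010428")
```

The point is only that characters above U+FFFF are never inspected or
altered; they come out exactly as they went in.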

>What's the character encoding of snowball scripts at the moment?

The scripts themselves are in ASCII, and ASCII assumptions are made in the
Snowball compiler.


