Re: [Snowball-discuss] Porter2 problem

From: Olly Betts (olly@survex.com)
Date: Thu May 25 2006 - 19:36:24 BST


On Thu, May 25, 2006 at 12:14:54PM -0600, Neil wrote:
> Just have a question about the porter 2 algorithm, maybe someone has run
> into this before (I'd be surprised if they hadn't).
>
> When I stem the word sun I get sun (no surprise there)
> But sunny becomes sunni

This isn't a bug - it's how the stemming algorithms work. The stem
isn't necessarily a word itself (though it generally looks mostly like
one.) What really matters is that words the stemmer should conflate
produce the same stem, and words it shouldn't conflate produce different
stems.

Cheers,
    Olly



This archive was generated by hypermail 2.1.3 : Thu Sep 20 2007 - 12:02:48 BST