On Thu, May 25, 2006 at 12:14:54PM -0600, Neil wrote:
> Just have a question about the porter 2 algorithm, maybe someone has run
> into this before (I'd be surprised if they hadn't).
>
> When I stem the word sun I get sun (no surprise there)
> But sunny becomes sunni
This isn't a bug - it's how the stemming algorithms work. The stem
isn't necessarily a word itself (though it generally looks mostly like
one.) What really matters is that words the stemmer should conflate
produce the same stem, and words it shouldn't conflate produce different
stems.
Cheers,
Olly
This archive was generated by hypermail 2.1.3 : Thu Sep 20 2007 - 12:02:48 BST