I just finished implementing the porter stemmer in PHP, then discovered
porter2. So, I'm upgrading. :-)
On step 2, though, you have added ' bli --> ble ' as well as 'li
preceded by 'b' --> remove li '
so, for words ending in 'bli' which is more correct? removing the li or
change the i to e?
I know you say to take the longest first, but is that the intended
action on 'bli' vs. 'li' preceded by a b?
thanks,
wendy
This archive was generated by hypermail 2.1.3 : Thu Sep 20 2007 - 12:02:43 BST