I made a few alterations to snowball yesterday (though none which should
affect the functioning of snowball):
* correct a typo in russian/stemmer.html (algorith -> algorithm)
* moved the stemmer definitions from files named "stem" to "stem.sbl"
I needed the files to have an extension to make the build system work
properly. I updated all the links in webpages which pointed to these.
* fixed a bug in snowball itself: in the generated header file: if an
external prefix was specified, close_env wasn't being prefixed.
* made a build system, which encompasses the following, at the moment:
- builds snowball
- generates all the stem.[ch] files
- generates a program "stemmer" for each language
- uses the stemmer program to generate output.txt from voc.txt. The
timestamp of output.txt is only modified if the contents have
changed.
I also began work on making the build system build a single library,
which will be able to perform stemming for any of the supported
libraries (and will be a lot more convenient for IR people such as
myself to use than including scripts individually). As part of this
work, I changed the generated stem.[ch] files to use prefixes, to avoid
symbol conflicts when I link them together. The prefix used is
"language_" for each language.
This means that, for example, the english stemmer now has
english_create_env() english_close_env() and english_stem().
I believe some pieces of documentation may need to be changed to reflect
this: I havn't gone through and checked for this yet.
The build system may be invoked by running "make" in the "website"
directory.
-- Richard_______________________________________________ Snowball-discuss mailing list Snowball-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/snowball-discuss
_____________________________________________________________________ VirusChecked by the Incepta Group plc _____________________________________________________________________
This archive was generated by hypermail 2.1.3 : Thu Sep 20 2007 - 12:02:40 BST