slackbuilds/python/PyStemmer
B. Watson 329f159ece python/PyStemmer: Wrap README at 72 columns.
Signed-off-by: B. Watson <yalhcru@gmail.com>
2022-03-14 04:07:07 -04:00
..
PyStemmer.SlackBuild python/PyStemmer: Updated for version 2.0.1. 2021-07-25 16:56:07 +07:00
PyStemmer.info python/PyStemmer: Updated for version 2.0.1. 2021-07-25 16:56:07 +07:00
README python/PyStemmer: Wrap README at 72 columns. 2022-03-14 04:07:07 -04:00
slack-desc python/PyStemmer: Add python3 support. 2019-05-04 08:16:01 +07:00

README

Snowball stemming algorithms, for information retrieval

Stemming algorithms

PyStemmer provides access to efficient algorithms for calculating a
"stemmed" form of a word. This is a form with most of the common
morphological endings removed; hopefully representing a common
linguistic base form. This is most useful in building search
engines and information retrieval software; for example, a search
with stemming enabled should be able to find a document containing
"cycling" given the query "cycles".

PyStemmer provides algorithms for several (mainly european) languages,
by wrapping the libstemmer library from the Snowball project in a
Python module.

It also provides access to the classic Porter stemming algorithm for
english: although this has been superceded by an improved algorithm,
the original algorithm may be of interest to information retrieval
researchers wishing to reproduce results of earlier experiments.