PyStemmer

Snowball stemming algorithms, for information retrieval

These details have not been verified by PyPI

Project links

Homepage

Project description

Stemming algorithms

PyStemmer provides access to efficient algorithms for calculating a “stemmed” form of a word. This is a form with most of the common morphological endings removed; hopefully representing a common linguistic base form. This is most useful in building search engines and information retrieval software; for example, a search with stemming enabled should be able to find a document containing “cycling” given the query “cycles”.

PyStemmer provides algorithms for several (mainly european) languages, by wrapping the libstemmer library from the Snowball project in a Python module.

It also provides access to the classic Porter stemming algorithm for english: although this has been superseded by an improved algorithm, the original algorithm may be of interest to information retrieval researchers wishing to reproduce results of earlier experiments.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

2.2.0.3

Oct 10, 2024

2.2.0.2

Oct 8, 2024

2.2.0.1

Jan 16, 2023

2.2.0

Nov 29, 2022

2.0.1

Jul 15, 2020

2.0.0.1

Mar 21, 2020

This version

2.0.0

Mar 21, 2020

1.3.0

Feb 25, 2013

1.2.0

Aug 9, 2011

1.1.0

Nov 6, 2009

1.0.1

Jun 19, 2006

1.0

Jun 11, 2006

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PyStemmer-2.0.0.tar.gz (558.8 kB view details)

Uploaded Mar 21, 2020 Source

File details

Details for the file PyStemmer-2.0.0.tar.gz.

File metadata

Download URL: PyStemmer-2.0.0.tar.gz
Upload date: Mar 21, 2020
Size: 558.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.0.0 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.8.2

File hashes

Hashes for PyStemmer-2.0.0.tar.gz
Algorithm	Hash digest
SHA256	`4c5aec02e874b300432541ddf6576ca49d159f7b8b76cf3719902d471daccdce`
MD5	`5f4a0c73ef6c418880f297fdab6be27f`
BLAKE2b-256	`2672e8c9fc268ca49d05d57c9ef43a412356beee3ece53651e2b2951a02521c6`