A fast and accurate part-of-speech tagger for TextBlob.
Project description
A fast and accurate part-of-speech tagger based on the Averaged Perceptron. For use with TextBlob.
Implementation by Matthew Honnibal, a.k.a. syllog1sm. Read more about it here.
NOTE: The PerceptronTagger became a part of TextBlob core as of version 0.6.3 but will be deprecated in favor of this extension in version 0.7.0. TextBlob’s extension system is in its experimental stages, as is this extension.
Install
If you have pip installed (you should), run
$ pip install -U git+https://github.com/sloria/TextBlob.git@dev $ pip install textblob-aptagger
Usage
>>> from text.blob import TextBlob
>>> from textblob_aptagger import PerceptronTagger
>>> blob = TextBlob("Simple is better than complex.", pos_tagger=PerceptronTagger())
>>> blob.tags
[('Simple', u'NN'), ('is', u'VBZ'), ('better', u'JJR'), ('than', u'IN'), ('complex', u'JJ')]
Requirements
Python >= 2.6 or >= 3.3
License
MIT licensed. See the bundled LICENSE file for more details.
Changelog
0.1.0 (unreleased)
First release.
Ports the PerceptronTagger from TextBlob 0.6.3.
0.0.1 (09/22/2013)
Experimental release.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file textblob-aptagger-0.0.1.tar.gz
.
File metadata
- Download URL: textblob-aptagger-0.0.1.tar.gz
- Upload date:
- Size: 3.4 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8fea4a2802e38491cbf0fc3e3336df3777e7168e4ccba71c265b8179abd5b2ed |
|
MD5 | 18704510ac9581e4fa18e16f27bc4a0c |
|
BLAKE2b-256 | c7a33add8f50bb35c90e711eb676330c1618df0b9fdf92fba36a545a2712e68d |
File details
Details for the file textblob_aptagger-0.0.1-py2.py3-none-any.whl
.
File metadata
- Download URL: textblob_aptagger-0.0.1-py2.py3-none-any.whl
- Upload date:
- Size: 3.4 MB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 760d46c19b95a3ce0cc92cfaf7d20324780b5c8052caf8d36a7023fde335bbb4 |
|
MD5 | 2aa7f312207abcbff512b230a97580d7 |
|
BLAKE2b-256 | ef0dbd6f9a2a693413faa70364c436df316ae821ef70d0d78e7911acb99f22bc |