
Topic modeling with latent Dirichlet allocation

Topic modeling with latent Dirichlet allocation. lda aims for simplicity.

lda implements latent Dirichlet allocation (LDA) using collapsed Gibbs sampling. LDA is described in Blei et al. (2003) and Pritchard et al. (2000).
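For readers unfamiliar with collapsed Gibbs sampling, the following is a minimal pure-Python sketch of the idea. It is not the package's own code: the function name `collapsed_gibbs_lda`, the hyperparameter names `alpha` and `eta`, their default values, and the toy corpus are all assumptions made for illustration. Each token's topic assignment is resampled in turn, conditioned on the counts of every other assignment.

```python
import random


def collapsed_gibbs_lda(docs, n_topics, vocab_size, n_iter=50,
                        alpha=0.1, eta=0.01, seed=0):
    """Illustrative sampler. docs is a list of lists of integer word ids."""
    rng = random.Random(seed)
    # Count tables maintained by the sampler.
    ndz = [[0] * n_topics for _ in docs]                # doc d, topic z
    nzw = [[0] * vocab_size for _ in range(n_topics)]   # topic z, word w
    nz = [0] * n_topics                                 # tokens per topic

    # Start from random topic assignments.
    assignments = []
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            z = rng.randrange(n_topics)
            zs.append(z)
            ndz[d][z] += 1
            nzw[z][w] += 1
            nz[z] += 1
        assignments.append(zs)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                z = assignments[d][i]
                ndz[d][z] -= 1
                nzw[z][w] -= 1
                nz[z] -= 1
                # p(z = k | everything else) is proportional to
                # (ndz + alpha) * (nzw + eta) / (nz + V * eta).
                weights = [
                    (ndz[d][k] + alpha) * (nzw[k][w] + eta)
                    / (nz[k] + vocab_size * eta)
                    for k in range(n_topics)
                ]
                z = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = z
                ndz[d][z] += 1
                nzw[z][w] += 1
                nz[z] += 1

    # Smoothed point estimates of the two distributions.
    doc_topic = [
        [(ndz[d][k] + alpha) / (len(doc) + n_topics * alpha)
         for k in range(n_topics)]
        for d, doc in enumerate(docs)
    ]
    topic_word = [
        [(nzw[k][w] + eta) / (nz[k] + vocab_size * eta)
         for w in range(vocab_size)]
        for k in range(n_topics)
    ]
    return doc_topic, topic_word


# Toy corpus (hypothetical): four documents over a four-word vocabulary.
docs = [[0, 0, 1, 1], [0, 1, 0], [2, 3, 3], [3, 2, 2, 3]]
doc_topic, topic_word = collapsed_gibbs_lda(docs, n_topics=2, vocab_size=4)
```

Every row of `doc_topic` and `topic_word` sums to one, so each can be read as a probability distribution. A sketch like this is far too slow for real corpora.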

Installation

pip install lda

Getting started

lda.LDA implements latent Dirichlet allocation (LDA). The interface follows conventions found in scikit-learn.

>>> import numpy as np
>>> import lda
>>> X = np.array([[1,1], [2, 1], [3, 1], [4, 1], [5, 8], [6, 1]])
>>> model = lda.LDA(n_topics=2, n_iter=100, random_state=1)  # n_iter=100 is an illustrative value
>>> doc_topic = model.fit_transform(X)  # estimate of document-topic distributions
>>> model.components_  # estimate of topic-word distributions; model.topic_word_ is an alias
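A topic-word matrix is easier to interpret once each row is reduced to its most probable words. The helper below is a small sketch of that step; `top_words`, the vocabulary, and the probability values are hypothetical stand-ins rather than output of the package. It indexes rows by position, so it should also accept a NumPy array such as `model.components_`, given a vocabulary list in column order.

```python
def top_words(topic_word, vocab, n=3):
    """Return the n highest-probability words for each topic (row)."""
    result = []
    for row in topic_word:
        # Rank word indices by descending probability within this topic.
        ranked = sorted(range(len(vocab)), key=lambda w: row[w], reverse=True)
        result.append([vocab[w] for w in ranked[:n]])
    return result


# Hypothetical values standing in for a fitted topic-word matrix.
topic_word = [[0.5, 0.3, 0.1, 0.1],
              [0.05, 0.05, 0.6, 0.3]]
vocab = ["cat", "dog", "stock", "bond"]

tops = top_words(topic_word, vocab, n=2)
for k, words in enumerate(tops):
    print("topic", k, ":", " ".join(words))
```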

Requirements

Python 3 is required. The following packages are also required

Caveat

lda aims for simplicity over speed. If you are working with large corpora or want faster, more sophisticated topic models, consider using hca or MALLET. hca is written in C and MALLET is written in Java.

License

lda is licensed under Version 2.0 of the Mozilla Public License.
