lda: Topic modeling with latent Dirichlet allocation
lda implements latent Dirichlet allocation (LDA) using collapsed Gibbs
sampling. LDA is described in Blei et al. (2003) and Pritchard et al. (2000).
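For intuition, here is a minimal sketch of what one sweep of a collapsed Gibbs
sampler for LDA does, where alpha and beta are symmetric Dirichlet
hyperparameters. This is illustrative only: the function and variable names
are assumptions for the sketch, not lda's internals (which are written in
Cython)::

    import numpy as np

    def gibbs_sweep(z, doc_of, word_of, ndk, nkw, nk, alpha, beta, rng=None):
        """Resample the topic of every token once (one Gibbs sweep).

        z[i] is the current topic of token i; doc_of[i] and word_of[i] are
        its document and word ids.  ndk, nkw, and nk are the document-topic,
        topic-word, and topic count tables, kept in sync with z.
        """
        rng = rng if rng is not None else np.random.default_rng()
        n_topics, vocab_size = nkw.shape
        for i in range(len(z)):
            d, w, k = doc_of[i], word_of[i], z[i]
            # Remove token i from the count tables.
            ndk[d, k] -= 1
            nkw[k, w] -= 1
            nk[k] -= 1
            # Collapsed conditional: p(z_i = k | z_-i, w) is proportional to
            # (ndk[d, k] + alpha) * (nkw[k, w] + beta) / (nk[k] + V * beta).
            p = (ndk[d] + alpha) * (nkw[:, w] + beta) / (nk + vocab_size * beta)
            k = rng.choice(n_topics, p=p / p.sum())
            # Put token i back with its newly sampled topic.
            z[i] = k
            ndk[d, k] += 1
            nkw[k, w] += 1
            nk[k] += 1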
Install lda with pip::

    pip install lda
lda.LDA implements latent Dirichlet allocation (LDA). The interface follows
conventions found in scikit-learn.
>>> import numpy as np
>>> import lda
>>> X = np.array([[1, 1], [2, 1], [3, 1], [4, 1], [5, 8], [6, 1]])
>>> model = lda.LDA(n_topics=2, n_iter=100, random_state=1)
>>> doc_topic = model.fit_transform(X)  # estimate of document-topic distributions
>>> model.components_  # estimate of topic-word distributions; model.topic_word_ is an alias

Python 2.7 or Python 3.3+ is required. numpy is also required.
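Continuing the example above, the rows of model.components_ can be sorted to
list each topic's most probable words. The two-entry vocabulary here is a
made-up stand-in for the two columns of X:

>>> vocab = ("word0", "word1")  # hypothetical labels for the columns of X
>>> for k, dist in enumerate(model.components_):
...     top = np.array(vocab)[np.argsort(dist)[::-1]]
...     print("topic {}: {}".format(k, " ".join(top)))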
lda aims for simplicity. (It happens to be fast, as essential parts are
written in C via Cython_.) If you are working with a very large corpus you may
wish to use more sophisticated topic models such as those implemented in hca_
and MALLET_. hca is written in C and MALLET is written in Java. Unlike
lda, hca can use more than one processor at a time.
- Documentation: http://pythonhosted.org/lda
- Source code: https://github.com/ariddell/lda/
- Issue tracker: https://github.com/ariddell/lda/issues
lda is licensed under Version 2.0 of the Mozilla Public License.