PyLDA

PyLDA is a Latent Dirichlet Allocation topic modeling package, developed by the Cloud Computing Research Team in University of Maryland, College Park.

Please download the latest version from our GitHub repository.

Please send any bugs of problems to Ke Zhai (kzhai@umd.edu).

Install and Build

This package depends on many external python libraries, such as numpy, scipy and nltk.

Launch and Execute

Assume the PyLDA package is downloaded under directory $PROJECT_SPACE/src/, i.e.,

$PROJECT_SPACE/src/PyLDA

To prepare the example dataset,

tar zxvf associated-press.tar.gz

To launch PyLDA, first redirect to the directory of PyLDA source code,

cd $PROJECT_SPACE/src/PyLDA

and run the following command on example dataset,

python -m launch_train --input_directory=./associated-press --output_directory=./ --number_of_topics=10 --training_iterations=100

The generic argument to run PyLDA is

python -m launch_train --input_directory=$INPUT_DIRECTORY/$CORPUS_NAME --output_directory=$OUTPUT_DIRECTORY --number_of_topics=$NUMBER_OF_TOPICS --training_iterations=$NUMBER_OF_ITERATIONS

You should be able to find the output at directory $OUTPUT_DIRECTORY/$CORPUS_NAME.

Under any circumstances, you may also get help information and usage hints by running the following command

python -m launch_train --help

Name	Name	Last commit message	Last commit date
Latest commit Zhai, Ke and Zhai, Ke commie parsed data Mar 24, 2019 9b6899e · Mar 24, 2019 History 134 Commits
parsed	parsed	commie parsed data	Mar 24, 2019
raw	raw	commit new datasets	Mar 24, 2019
vocab/prior/tree	vocab/prior/tree	update inference modules	Jul 10, 2016
.gitignore	.gitignore	update	Sep 23, 2015
README.md	README.md	Update README.md	May 18, 2018
__init__.py	__init__.py	major organization refactoring	Sep 23, 2015
associated-press.tar.gz	associated-press.tar.gz	rename dataset	Sep 24, 2015
hybrid.py	hybrid.py	update	May 8, 2017
inferencer.py	inferencer.py	update to include gamma distribuion	Apr 23, 2017
launch_test.py	launch_test.py	major organization refactoring	Sep 23, 2015
launch_train.py	launch_train.py	update to include gamma distribuion	Apr 23, 2017
monte_carlo.py	monte_carlo.py	update to include gamma distribuion	Apr 23, 2017
variational_bayes.py	variational_bayes.py	update to include gamma distribuion	Apr 23, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PyLDA

Install and Build

Launch and Execute

About

Releases

Packages

Contributors 2

Languages

kzhai/PyLDA

Folders and files

Latest commit

History

Repository files navigation

PyLDA

Install and Build

Launch and Execute

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages