Dr. Shi Zhong

Austin, TX 78739

Selected Publications

2008

2007

2006

2005

2004

2003

2002 and before

Technical Reports

Software Packages
(from Shi Zhong's research)

All programs listed below are distributed under GNU GPL license, thus are free for you to use or distribute. But please be aware that these programs come with absolutely no warranty what so ever, and in no way will I be held responsible for loss of properties due to the use of these software packages.

Model-based Text Clustering (in Matlab)

Brief description: The package contains model-based hard k-means and soft EM clustering algorithms for text applications. Probabilistic models implemented include multivariate Bernoulli, multinomial, and von Mises-Fisher models. Deterministic annealing version of all the above algorithms are also included.

Source code: [textclust.zip] For usage, read the README file.

Text data: [docdata.zip] in Matlab format. For a description of this data, see the reference below.

Reference: Shi Zhong and Joydeep Ghosh, "Generative model-based clustering of documents: a comparative study," Knowledge and Information Systems (KAIS), Vol. 8, 2005. pp. 374-384.

Online Spherical K-Means Clustering (in C & Matlab)

Source code: [C code | Matlab interface | Utility program that can convert text data from Matlab format to CCS format required by the C code.]

Reference: Shi Zhong, "Efficient Online Spherical K-means Clustering," In Proc. IEEE Int. Joint Conf. Neural Networks (IJCNN 2005), Montreal, Canada, July 31-August 4, 2005. pp. 3180-3185.

Coupled Hidden Markov Models (in Matlab)

Source code: [dchmm.zip]

References: Shi Zhong and Joydeep Ghosh, "Coupled Hidden Markov Models," Tech. Report, ECE Dept., University of Texas at Austin, June, 2001.

 

Copyright shi-zhong.com. All rights reserved.

Austin, TX 78739