Graphical Models Based Hierarchical Probabilistic Community Discovery in Large-scale Social Networks

Document Type


Department or Administrative Unit

Finance and Supply Chain Management

Publication Date



Real-world social networks, while disparate in nature, often comprise of a set of loose clusters (a.k.a. communities), in which members are better connected to each other than to the rest of the network. In addition, such communities are often hierarchical, reflecting the fact that some communities are composed of a few smaller, sub-communities. Discovering the complicated hierarchical community structure can gain us deeper understanding about the networks and the pertaining communities. This paper describes a hierarchical Bayesian model based scheme namely hierarchical social network-pachinko allocation model (HSN-PAM), for discovering probabilistic, hierarchical communities in social networks. This scheme is powered by a previously developed hierarchical Bayesian model. In this scheme, communities are classified into two categories: super-communities and regular-communities. Two different network encoding approaches are explored to evaluate this scheme on research collaborative networks, including CiteSeer. The experimental results demonstrate that HSN-PAM is effective for discovering hierarchical community structures in large-scale social networks.


This article was originally published in International Journal of Data Mining, Modeling and Management. The full-text article from the publisher can be found here.

Due to copyright restrictions, this article is not available for free download through ScholarWorks @ CWU.


International Journal of Data Mining, Modeling and Management


Copyright © 2010 Inderscience Enterprises Ltd.