Abstract
Estimating the number of communities is one of the fundamental problems in community detection. We re-examine the Bayesian paradigm for stochastic block models (SBMs) and propose a “corrected Bayesian information criterion” (CBIC), to determine the number of communities and show that the proposed criterion is consistent under mild conditions as the size of the network and the number of communities go to infinity. The CBIC outperforms those used in Wang and Bickel and Saldana, Yu, and Feng which tend to underestimate and overestimate the number of communities, respectively. The results are further extended to degree corrected SBMs. Numerical studies demonstrate our theoretical results.
Original language | English (US) |
---|---|
Pages (from-to) | 1771-1783 |
Number of pages | 13 |
Journal | Journal of the American Statistical Association |
Volume | 115 |
Issue number | 532 |
DOIs | |
State | Published - 2020 |
Keywords
- Consistency
- Degree corrected stochastic block model
- Network data
- Stochastic block model
ASJC Scopus subject areas
- Statistics and Probability
- Statistics, Probability and Uncertainty