This paper describes the application of co-occurrence and latent Dirichlet allocation (LDA)-based topic analyses in stem cell-related literature research. On account of the deficiency of parameter estimation in LDA, this study integrated co-occurrence theory and clustering judgement indicators and constructed an ATNLDA (Auto Topic Number LDA) model for topic segmentation. Next, ATNLDA was used to determine the optimal topic number of stem cell research literatures from 2006 to 2011 in PubMed, which was then used for topic segmentation of research content in stem cell data set. After stem cell research topics were obtained, they were analysed in terms of topic label, topic research content and interrelation between topics. The results verified that application of ATNLDA in topic segmentation in stem cell literature research is effective and feasible. Current deficiencies of ATNLDA and future study plan were also discussed.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.