2016
DOI: 10.1186/s40064-016-3252-8
|View full text |Cite
|
Sign up to set email alerts
|

An overview of topic modeling and its current applications in bioinformatics

Abstract: BackgroundWith the rapid accumulation of biological datasets, machine learning methods designed to automate data analysis are urgently needed. In recent years, so-called topic models that originated from the field of natural language processing have been receiving much attention in bioinformatics because of their interpretability. Our aim was to review the application and development of topic models for bioinformatics.DescriptionThis paper starts with the description of a topic model, with a focus on the under… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
170
0
4

Year Published

2017
2017
2022
2022

Publication Types

Select...
6
2
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 298 publications
(197 citation statements)
references
References 65 publications
1
170
0
4
Order By: Relevance
“…The LDA was introduced by Blei et al (2003) and became a popular model for uncovering latent topics in large text corpora and other kinds of discrete data (Griffiths, Steyvers, & Tenenbaum, 2007;Grün & Hornik, 2011;Kosinski et al, 2016;Liu, Tang, Dong, Yao, & Zhou, 2016;Poldrack et al, 2012). According to Blei and Lafferty (n.d.):…”
Section: Topic Modeling: Latent Dirichlet Allocation (Lda)mentioning
confidence: 99%
“…The LDA was introduced by Blei et al (2003) and became a popular model for uncovering latent topics in large text corpora and other kinds of discrete data (Griffiths, Steyvers, & Tenenbaum, 2007;Grün & Hornik, 2011;Kosinski et al, 2016;Liu, Tang, Dong, Yao, & Zhou, 2016;Poldrack et al, 2012). According to Blei and Lafferty (n.d.):…”
Section: Topic Modeling: Latent Dirichlet Allocation (Lda)mentioning
confidence: 99%
“…This helps science foundation administrators to better understand the content and context of funding portfolios in order to help promote future science funding plans. A topic model is presented for simultaneously modeling papers, authors, and publication venues known as Author conference topic model [73]. It includes data of 14134 authors 10716 papers and 1434 conferences from Arnet Miner.…”
Section: Scientific Researchmentioning
confidence: 99%
“…With this generative model, the LDA identifies a set of topics that have most likely generated a given collection of documents. In the end, LDA generates two matrices, one that describes the relationship between topics and document and the other describing the relationship between topics and words [9]. In the first matrix, topics with the highest contributions are those that best characterize a document.…”
Section: Introductionmentioning
confidence: 99%