2016
DOI: 10.1101/054676
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

MPAthic: Quantitative Modeling of Sequence-Function Relationships for massively parallel assays

Abstract: Massively parallel assays (MPAs) are being rapidly adopted for studying a wide range of DNA, RNA, and protein sequence-function relationships. However, the software available for quantitatively modeling these relationships is severely limited. Here we describe MPAthic, a software package that enables the rapid inference of such models from a variety of MPA datasets. Using both simulated and previously published data, we show that the modeling capabilities of MPAthic greatly improve on those of existing softwar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
5
3

Relationship

6
2

Authors

Journals

citations
Cited by 11 publications
(11 citation statements)
references
References 59 publications
0
11
0
Order By: Relevance
“…The identified binding sites are further interrogated by performing information-based modeling with the Sort-Seq data. Here, we generate energy matrix models ( 13 , 25 ) that describe the sequence-dependent energy of interaction of a transcription factor at each putative binding site. For each matrix, we use a convention that the wild-type sequence is set to have an energy of zero (an example energy matrix is in Fig.…”
Section: Resultsmentioning
confidence: 99%
“…The identified binding sites are further interrogated by performing information-based modeling with the Sort-Seq data. Here, we generate energy matrix models ( 13 , 25 ) that describe the sequence-dependent energy of interaction of a transcription factor at each putative binding site. For each matrix, we use a convention that the wild-type sequence is set to have an energy of zero (an example energy matrix is in Fig.…”
Section: Resultsmentioning
confidence: 99%
“…[27,62], we use a Markov Chain Monte Carlo (MCMC) algorithm to infer a set of energy values (in arbitrary units) for each energy matrix position that maximizes the mutual information between binding site sequence and fluorescence bin. This inference is performed using the MPAthic software package [63].…”
Section: B Bayesian Inference Of Energy Matrix Modelsmentioning
confidence: 99%
“…General purpose methods are ones that can flexibly analyze data from a range of study designs. While it is often of interest to study the effect of sequence features on the estimated activity levels of MPRA sequences (using tools such as MPAthic (Ireland and Kinney, 2016)), typically some sort of differential analysis is needed first to group interesting sequences together. This would usually involve comparing the activity of each putative regulatory sequence of interest to a suitable negative control.…”
Section: Introductionmentioning
confidence: 99%