2017
DOI: 10.1186/s13059-017-1272-5
|View full text |Cite|
|
Sign up to set email alerts
|

A statistical framework for analyzing deep mutational scanning data

Abstract: Deep mutational scanning is a widely used method for multiplex measurement of functional consequences of protein variants. We developed a new deep mutational scanning statistical model that generates error estimates for each measurement, capturing both sampling error and consistency between replicates. We apply our model to one novel and five published datasets comprising 243,732 variants and demonstrate its superiority in removing noisy variants and conducting hypothesis testing. Simulations show our model ap… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
227
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
8

Relationship

2
6

Authors

Journals

citations
Cited by 185 publications
(247 citation statements)
references
References 49 publications
(81 reference statements)
3
227
0
Order By: Relevance
“…However, models for computing error estimates that consider sampling noise and the distribution of scores for variants between replicate experiments are available. 71,72 As the field moves forward, unification of data analysis and error-estimation methods will enable comparisons of data quality across MAVEs just as it has for earlier technologies. 73 Fifth, interpretation of MAVE results can be complicated by interactions between variants in different functional elements or between variants and environmental effects.…”
Section: Limitations Of Maves and How To Overcome Themmentioning
confidence: 99%
“…However, models for computing error estimates that consider sampling noise and the distribution of scores for variants between replicate experiments are available. 71,72 As the field moves forward, unification of data analysis and error-estimation methods will enable comparisons of data quality across MAVEs just as it has for earlier technologies. 73 Fifth, interpretation of MAVE results can be complicated by interactions between variants in different functional elements or between variants and environmental effects.…”
Section: Limitations Of Maves and How To Overcome Themmentioning
confidence: 99%
“…The fitness values for the mutations common across all replicates correlated strongly (Appendix Fig S4) (Pearson's correlation coefficient between 0.81 and 0.98). We averaged the fitness scores for common mutations across replicates using the Fisher scoring iteration‐based maximum‐likelihood estimates (Materials and Methods; Rubin et al , ). We observed a bimodal distribution of fitness effects for all mutations at each rifampicin concentration (Fig E).…”
Section: Resultsmentioning
confidence: 99%
“…Finally, we wrote a custom code to extract information about the nucleotide and amino acid changes corresponding to each qrowdot output. Fitness calculations for resistance to rifampicin : Sample code for calculating the resistance to rifampicin has been provided ( https://github.com/Alaksh/CREPE-Analysis-Code). Fitness calculations for resistance to rifampicin with each replicate were estimated using the two time point enrichment score calculation algorithm described previously (Rubin et al , ). The fitness for each variant was estimated as follows: fitness,f=logCnormali,normalsel+0.5Cnormalwt,normalsel+0.5logCnormali,normalinput+0.5Cnormalwt,normalinput+0.5where C i is the total count for a variant “i” in the library and C wt is the total count for the wild‐type reference in the library.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Experimental mutational sensitivity information is now available for multiple proteins . Deep mutational scanning has also been used to study protein‐protein interactions and to affinity rank peptide ligands for receptor proteins .…”
Section: Discussionmentioning
confidence: 99%