2007
DOI: 10.1016/s1672-0229(07)60013-8
|View full text |Cite
|
Sign up to set email alerts
|

Generation of Synthetic Transcriptome Data with Defined Statistical Properties for the Development and Testing of New Analysis Methods

Abstract: We have previously developed a combined signal/variance distribution model that accounts for the particular statistical properties of datasets generated on the Applied Biosystems AB1700 transcriptome system. Here we show that this model can be efficiently used to generate synthetic datasets with statistical properties virtually identical to those of the actual data by aid of the JAVA application ace.map creator 1.0 that we have developed. The fundamentally different structure of AB1700 transcriptome profiles r… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2010
2010
2010
2010

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(5 citation statements)
references
References 7 publications
(20 reference statements)
0
5
0
Order By: Relevance
“…It is independent of cross-hybridizing monkey RNA to human arrays ( Figure 1B ). Importantly, synthetic, random pseudo-data generated using our mathematical model display the same distribution properties ( 16 ) ( Figure 1C ). When analyzing more carefully the parameter value distributions of the entire set of 750 arrays ( Figure S1 and Data File S1 ), these initial findings are confirmed.…”
Section: Resultsmentioning
confidence: 79%
See 4 more Smart Citations
“…It is independent of cross-hybridizing monkey RNA to human arrays ( Figure 1B ). Importantly, synthetic, random pseudo-data generated using our mathematical model display the same distribution properties ( 16 ) ( Figure 1C ). When analyzing more carefully the parameter value distributions of the entire set of 750 arrays ( Figure S1 and Data File S1 ), these initial findings are confirmed.…”
Section: Resultsmentioning
confidence: 79%
“…We have already shown previously how this invariance of the data structure of AB1700 experiments can be efficiently used to draw random pseudo-data from the parameterized model we had previously presented ( 16 ) . In order to even better capture the statistical properties of individual experiments, and after having observed a certain variability in the signal and signal-variance ranges that is only poorly captured by the parameters of the two lognormal distributions, we have decided to add the natural logarithm of the signal range and the signal-variance range, as estimated on the 99% quantile in both dimensions, as independent characteristic parameters to our model.…”
Section: Resultsmentioning
confidence: 98%
See 3 more Smart Citations