2003
DOI: 10.1038/nbt861
|View full text |Cite
|
Sign up to set email alerts
|

Discovery of uncharacterized cellular systems by genome-wide analysis of functional linkages

Abstract: We introduce a general computational method, applicable on a genome-wide scale, for the systematic discovery of uncharacterized cellular systems. Quantitative analysis of the coinheritance of pairs of genes among different organisms, calculated using phylogenetic profiles, allows the prediction of thousands of functional linkages between the corresponding proteins. A comparison of these functional linkages to known pathways reveals that calculated linkages are comparable in accuracy to genome-wide yeast two-hy… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
163
0

Year Published

2003
2003
2012
2012

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 196 publications
(165 citation statements)
references
References 44 publications
2
163
0
Order By: Relevance
“…We used the same transcription regulation data as described above under "Biological Coherence of Predicted Complexes." For any pair of proteins, we compute the mutual information between the profile vectors of the two proteins using the method described Date and Marcotte (48).…”
Section: Constructing a Reference List Of Positive And Negative Complmentioning
confidence: 99%
“…We used the same transcription regulation data as described above under "Biological Coherence of Predicted Complexes." For any pair of proteins, we compute the mutual information between the profile vectors of the two proteins using the method described Date and Marcotte (48).…”
Section: Constructing a Reference List Of Positive And Negative Complmentioning
confidence: 99%
“…For both methods, we analyzed 424 bacterial genome sequences (31 archeaebacteria and 393 eubacteria, downloaded from NCBI in December 2006). C. elegans protein sequences were aligned to protein sequences encoded by the 424 bacterial genomes, using the program BLASTP with default settings (Altschul et al 1997) and alignment scores analyzed as in Date and Marcotte (2003) with the modification of using discretized BLASTP E-values when calculating mutual information between phylogenetic profiles. We used bins of equal numbers of E-values rather than equal intervals of E-values, accounting for the nonuniform E-value distribution.…”
Section: Inferring Functional Associations From Phylogenetic Profilesmentioning
confidence: 99%
“…Computational clustering of genes according to their connectivities [39][40][41][42][43] shows that these features may correspond to known cellular machines (e.g., the ribosome, the proteasome, etc.) 44,45 but may also be totally unexplored [46][47][48][49] . The network as a stand-alone description of gene function is not directly interpretable by us.…”
Section: Low High Mediummentioning
confidence: 99%
“…Processes can thus be defined at different levels of complexity, which results directly in a hierarchical organization of subnetworks, networks, supernetworks. The networks thus implicitly capture the hierarchical organization of biological processes in the cell, and this hierarchy can be explored by clustering the genes according to their network connections, the genes' precise functions being reflected in their memberships in different clusters [39][40][41][42][43][44][45][46][47][48][49] . Third, however one defines a process in the full network, these processes are highly interlinked.…”
Section: Properties and Limitations Of Networkmentioning
confidence: 99%