2010
DOI: 10.1186/1471-2164-11-s2-s2
|View full text |Cite
|
Sign up to set email alerts
|

Global protein interactome exploration through mining genome-scale data in Arabidopsis thaliana

Abstract: BackgroundMany essential cellular processes, such as cellular metabolism, transport, cellular metabolism and most regulatory mechanisms, rely on physical interactions between proteins. Genome-wide protein interactome networks of yeast, human and several other animal organisms have already been established, but this kind of network reminds to be established in the field of plant.ResultsWe first predicted the protein protein interaction in Arabidopsis thaliana with methods, including ortholog, SSBP, gene fusion,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(12 citation statements)
references
References 46 publications
(59 reference statements)
0
12
0
Order By: Relevance
“…There are some reported methods for extracting negative datasets, such as: (1) Negative datasets are constructed by using random pairs which exclude the experimentally detected interactions [1], and as there are discordant numbers between high-confidence interactions and random pairs, the scale and structure of networks should be balanced between negative and positive datasets. This method may include undetected PPIs; (2) Negative examples are chosen based on the categories of their distinct functions, such as sub-cellular localization (can be accessed by tools such as LOCATE [38], PSORTdb 3.0 [39], LocDB [40]) and annotations (such as KEGG pathways, gene ontology (GO), and Enzyme Commission (EC)) [22,41]. However, these methods can also lead to biases due to varying definitions of categories [42]; (3) Another alternative approach is based on topological policy: choose pairs of separated proteins in existing PPI networks to represent non-interactions: defining negative samples as the protein pairs with the shortest path lengths exceed the median shortest paths in a GSP network [43], or further construct a GSN network based on the principle of keeping the composition and degree of a node identical to the GSP network [20].…”
Section: Defining Gold Standard Datasetsmentioning
confidence: 99%
See 3 more Smart Citations
“…There are some reported methods for extracting negative datasets, such as: (1) Negative datasets are constructed by using random pairs which exclude the experimentally detected interactions [1], and as there are discordant numbers between high-confidence interactions and random pairs, the scale and structure of networks should be balanced between negative and positive datasets. This method may include undetected PPIs; (2) Negative examples are chosen based on the categories of their distinct functions, such as sub-cellular localization (can be accessed by tools such as LOCATE [38], PSORTdb 3.0 [39], LocDB [40]) and annotations (such as KEGG pathways, gene ontology (GO), and Enzyme Commission (EC)) [22,41]. However, these methods can also lead to biases due to varying definitions of categories [42]; (3) Another alternative approach is based on topological policy: choose pairs of separated proteins in existing PPI networks to represent non-interactions: defining negative samples as the protein pairs with the shortest path lengths exceed the median shortest paths in a GSP network [43], or further construct a GSN network based on the principle of keeping the composition and degree of a node identical to the GSP network [20].…”
Section: Defining Gold Standard Datasetsmentioning
confidence: 99%
“…These features may represent a particular source of information such as correlations of gene expression, phylogenetic profiles, sequence-based signatures, GO functional annotation and chemical properties. There are many modes to encode evidence sources into a featured vector, to choose statistical standard and data dimensions, and to check the normalization affect or the reliability of different computational predictions [22,45]. …”
Section: Strategy For Integrative Analysismentioning
confidence: 99%
See 2 more Smart Citations
“…Protein interactions can be inferred by homology to known interactions in other organisms (interaction ortholog or interlog) (Geisler- Lee et al, 2007), using indirect evidence or literature (Cui et al, 2008) or integrated methods (Xu et al, 2010). Interlogs can be filtered using functional association data to improve prediction reliability (De Bodt et al, 2009).…”
Section: Large-scale Protein-protein Interactions: the Interactomementioning
confidence: 99%