2013
DOI: 10.1186/1471-2148-13-146
|View full text |Cite
|
Sign up to set email alerts
|

EGN: a wizard for construction of gene and genome similarity networks

Abstract: BackgroundIncreasingly, similarity networks are being used for evolutionary analyses of molecular datasets. These networks are very useful, in particular for the analysis of gene sharing, lateral gene transfer and for the detection of distant homologs. Currently, such analyses require some computer programming skills due to the limited availability of user-friendly freely distributed software. Consequently, although appealing, the construction and analyses of these networks remain less familiar to biologists t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
56
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
7
2
1

Relationship

2
8

Authors

Journals

citations
Cited by 53 publications
(56 citation statements)
references
References 50 publications
0
56
0
Order By: Relevance
“…These translated peptide sequences were then combined with the translated proteins from the diatom genomes Fragilariopsis cylindrus CCMP1102 version 1.0, P. tricornutum CCMP632 version 2.0, Pseudonitzschia multiseries CLN-47 version 1.0, and T. pseudonana CCMP1335 version 3.0, which were collected from the Joint Genome Institute database (genome.jgi-psf.org/). A protein similarity network was then created using EGN, a software program that automates the reconstruction of gene networks from protein sequences through reciprocal BLASTp analysis (e-value <1e-5, 20% hit identity threshold, 5% best reciprocal threshold of best e-value, 90% minimal match coverage threshold) (80,81). Networks were then visualized and manipulated using Cytoscape 3.0, where the layout of the network was produced using an edge-weighted, spring-embedded model based on e-value, meaning that genes that are closer together are more similar (82,83).…”
Section: Methodsmentioning
confidence: 99%
“…These translated peptide sequences were then combined with the translated proteins from the diatom genomes Fragilariopsis cylindrus CCMP1102 version 1.0, P. tricornutum CCMP632 version 2.0, Pseudonitzschia multiseries CLN-47 version 1.0, and T. pseudonana CCMP1335 version 3.0, which were collected from the Joint Genome Institute database (genome.jgi-psf.org/). A protein similarity network was then created using EGN, a software program that automates the reconstruction of gene networks from protein sequences through reciprocal BLASTp analysis (e-value <1e-5, 20% hit identity threshold, 5% best reciprocal threshold of best e-value, 90% minimal match coverage threshold) (80,81). Networks were then visualized and manipulated using Cytoscape 3.0, where the layout of the network was produced using an edge-weighted, spring-embedded model based on e-value, meaning that genes that are closer together are more similar (82,83).…”
Section: Methodsmentioning
confidence: 99%
“…Two similarity network analyses, where each node represents a dpo sequence and an edge represents a sequence similarity/identity were performed using EGN (Halary et al 2013). The first network was constructed from a data set of 91 Glomeromycotan dpo sequences (from our collection and retrieved from GenBank).…”
Section: Methodsmentioning
confidence: 99%
“…Cluster maps can also provide a general overview of taxonomic distribution and sequence similarity, parameters that are not easily inferred from phylogenetic trees. Clustering is best thought of as a representation of sequence data as a similarity network, allowing evolutionary biologists to draw inferences about sequence evolution that are complementary to answers based on phylogeny (for a thoughtful introduction to the use of similarity networks, see Halary et al 2013).…”
Section: Origin and Evolution Of The Cytoskeletonmentioning
confidence: 99%