Comparison of the current RefSeq, Ensembl and EST databases for counting genes and gene discovery

Larsson, Thomas; Murray, Christian G.; Hill, Tobias; Fredriksson, Robert; Schiöth, Helgi B.

doi:10.1016/j.febslet.2004.12.046

Cited by 32 publications

(21 citation statements)

References 34 publications

(44 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Paired-end reads were mapped to the RefSeq database (National Center for Biotechnology Information (NCBI) build 37) using the Burrows-Wheeler Aligner (BWA) software with default parameters that allow up to 3 alignments for each read and up to 2 mismatches for the seed sequence (the first 25 bp of each read) [15]. Reads that failed to map to RefSeq were mapped to the Ensembl database, which includes additional transcripts and pseudogenes [16]. Remaining unmapped reads were mapped to the human genome assembly (NCBI build 37).…”

Section: Methodsmentioning

confidence: 99%

An integrated transcriptome and epigenome analysis identifies a novel candidate gene for pancreatic cancer

et al. 2013

View full text Add to dashboard Cite

BackgroundPancreatic cancer is a highly lethal cancer with limited diagnostic and therapeutic modalities.MethodsTo begin to explore the genomic landscape of pancreatic cancer, we used massively parallel sequencing to catalog and compare transcribed regions and potential regulatory elements in two human cell lines derived from normal and cancerous pancreas.ResultsBy RNA-sequencing, we identified 2,146 differentially expressed genes in these cell lines that were enriched in cancer related pathways and biological processes that include cell adhesion, growth factor and receptor activity, signaling, transcription and differentiation. Our high throughput Chromatin immunoprecipitation (ChIP) sequence analysis furthermore identified over 100,000 regions enriched in epigenetic marks, showing either positive (H3K4me1, H3K4me3, RNA Pol II) or negative (H3K27me3) correlation with gene expression. Notably, an overall enrichment of RNA Pol II binding and depletion of H3K27me3 binding were seen in the cancer derived cell line as compared to the normal derived cell line. By selecting genes for further assessment based on this difference, we confirmed enhanced expression of aldehyde dehydrogenase 1A3 (ALDH1A3) in two larger sets of pancreatic cancer cell lines and in tumor tissues as compared to normal derived tissues.ConclusionsAs aldehyde dehydrogenase (ALDH) activity is a key feature of cancer stem cells, our results indicate that a member of the ALDH superfamily, ALDH1A3, may be upregulated in pancreatic cancer, where it could mark pancreatic cancer stem cells.

show abstract

Section: Methodsmentioning

confidence: 99%

An integrated transcriptome and epigenome analysis identifies a novel candidate gene for pancreatic cancer

et al. 2013

View full text Add to dashboard Cite

show abstract

“…Today alternative splicing mechanisms, including exon skipping, alternative exon insertions, use of alternative 5′ splice site and 3′ splice site, and intron retention, are known to be one of the most important mechanisms in providing complexity of eukaryotic proteomes. These mechanisms facilitate the production of a much higher number of possible proteins than 25,000-30,000, which are the number of protein coding genes that have been identified in the human genome (InternationalHumanGenomeSequencingConsortium, 2004;Larsson et al, 2005). Estimations from several studies conclude that 40-60% of the human genes undergo alternative splicing, most of which affect the coding sequence leading to the formation of either functional or non-functional protein products .…”

Section: Introductionmentioning

confidence: 98%

Identification of novel splice variants of Adhesion G protein-coupled receptors

Bjarnadóttir

Geirardsdottir

Ingemansson

et al. 2007

Gene

Self Cite

View full text Add to dashboard Cite

“…There are several examples of domain changes in the N termini through alternative splicing [21,25,93] and the fact that their introns are frequent and long make them very interesting to study with respect to non-classical transcription. The increasing number of ESTs and the possibility of using genome assembly to aid alignment of expression and gene databases has provided good opportunities to study alternative splice variants [79,94]. In a recent study, we used ESTs and full-length mRNA sequences to systematically analyse splice variants for the Adhesion GPCRs.…”

Section: Alternative Splicing and Role Of Intronsmentioning

confidence: 99%

The Adhesion GPCRs: A unique family of G protein-coupled receptors with important roles in both central and peripheral tissues

Bjarnadóttir

Fredriksson

Schiöth

2007

Cell. Mol. Life Sci.

122

101

View full text Add to dashboard Cite

G protein-coupled receptors (GPCRs) are a diverse superfamily of membrane-bound receptors. The second largest subgroup of GPCRs, the Adhesion GPCRs, has 33 members in humans. Phylogenetic analysis of the entire repertoire of the seven transmembrane- domain (7TM) regions of GPCRs shows that the Adhesion GPCRs form a distinct family. Adhesion GPCRs are characterised by (1) long N termini with multiple functional domains often found in other proteins such as tyrosine kinases, integrins and cadherins, (2) highly complex genomic structure with multiple introns and splice variants and (3) a 7TM region that has no clear similarities with 7TM from other GPCRs. Several Adhesion GPCRs are known to have a role in the immune system but it is becoming more evident that many have important roles in the CNS. We speculate that the overall structural construction of the Adhesion GPCRs allows them to participate in different types of cell guidance.

show abstract

Comparison of the current RefSeq, Ensembl and EST databases for counting genes and gene discovery

Cited by 32 publications

References 34 publications

An integrated transcriptome and epigenome analysis identifies a novel candidate gene for pancreatic cancer

An integrated transcriptome and epigenome analysis identifies a novel candidate gene for pancreatic cancer

Identification of novel splice variants of Adhesion G protein-coupled receptors

The Adhesion GPCRs: A unique family of G protein-coupled receptors with important roles in both central and peripheral tissues

Contact Info

Product

Resources

About