Deletions spanning chromosome 5q31.2 are among the most common recurring cytogenetic abnormalities detectable in myelodysplastic syndromes (MDS). Prior genomic studies have suggested that haploinsufficiency of multiple 5q31.2 genes may contribute to MDS pathogenesis. However, this hypothesis has never been formally tested. Therefore, we designed this study to systematically and comprehensively evaluate all 28 chromosome 5q31.2 genes and directly test whether haploinsufficiency of a single 5q31.2 gene may result from a heterozygous nucleotide mutation or microdeletion. We selected paired tumor (bone marrow) and germline (skin) DNA samples from 46 de novo MDS patients (37 without a cytogenetic 5q31.2 deletion) and performed total exonic gene resequencing (479 amplicons) and array comparative genomic hybridization (CGH). We found no somatic nucleotide changes in the 46 MDS samples, and no cytogenetically silent 5q31.2 deletions in 20/20 samples analyzed by array CGH. Twelve novel single nucleotide polymorphisms were discovered. The mRNA levels of 7 genes in the commonly deleted interval were reduced by 50% in CD34+ cells from del(5q) MDS samples, and no gene showed complete loss of expression. Taken together, these data show that small deletions and/or point mutations in individual 5q31.2 genes are not common events in MDS, and implicate haploinsufficiency of multiple genes as the relevant genetic consequence of this common deletion.
With the proliferation of high-throughput technologies, genome-level data analysis has become common in molecular biology. Bioinformaticians are developing extensive resources to annotate and mine biological features from high-throughput data. The underlying database management systems for most bioinformatics software are based on a relational model. Modern non-relational databases offer an alternative that has flexibility, scalability, and a non-rigid design schema. Moreover, with an accelerated development pace, non-relational databases like CouchDB can be ideal tools to construct bioinformatics utilities. We describe CouchDB by presenting three new bioinformatics resources: (a) geneSmash, which collates data from bioinformatics resources and provides automated gene-centric annotations, (b) drugBase, a database of drug-target interactions with a web interface powered by geneSmash, and (c) HapMap-CN, which provides a web interface to query copy number variations from three SNP-chip HapMap datasets. In addition to the web sites, all three systems can be accessed programmatically via web services.
Deletions involving the long arm of chromosome 5 are among the most common recurring cytogenetic abnormalities detectable in human myelodysplastic syndromes (MDS). The minimally deleted segment has been mapped by several groups to a 2.3 megabase interval at 5q31.2. This region contains 21 annotated genes, one pseudogene, 4 small RNAs, and 2 predicted genes (SPOCK1, KLHL3, HNRPA0, NPY6R, MYOT, PKD2L2, C5orf5, WNT8A, NME5, BRD8, KIF20A, CDC23, GFRA3, CDC25C, FAM53C, JMJD1B, REEP2, EGR1, ETF1, HSPA9B, SNORD63, LOC391836, LOC729429, CTNNA1, LRRTM2, SIL1, MATR3, SNORA74A). Extensive study of many of these genes by other investigators has raised the possibility that gene(s) in this interval contribute to MDS pathogenesis by haploinsufficiency. To address this hypothesis, we selected samples from 46 de novo MDS patients who had adequate amounts of paired tumor (bone marrow) and germline (skin) DNA available. The 46 patients include all FAB subtypes, with IPSS scores ranging from 0 to 3 (median = 1) and a median blast count of 5%. The majority of these samples (n=37) do not have cytogenetically apparent 5q31.2 deletions. We first asked whether these samples might contain deletions in this interval below the limit of detection by cytogenetics. We used a custom oligomer-based array comparative genomic hybridization (aCGH) platform developed by NimbleGen Systems, Inc. that has ∼385,000 probes spanning human chromosome 5, providing an average probe spacing of 500 base pairs. Analysis of aCGH data for 20 patients demonstrated that cytogenetically apparent 5q31.2 deletions could be detected by this platform, but no cytogenetically “silent” interstitial deletions were seen. We next asked whether genes in this interval might instead be targeted by point mutations. To define the sensitivity of our sequence-based studies, we used standard agarose gel electrophoresis and the Agilent LabChip to detect FLT3 internal tandem duplications (ITD) in 3 of 90 MDS samples.
We chose one sample with an ITD, prepared 6 serial dilutions of its MDS DNA in wild-type genomic DNA, and performed PCR amplification followed by DNA sequencing of the PCR products. We could detect the FLT3 ITD when 12% of the alleles harbored the mutation, that is, when ∼20–25% of cells contained a heterozygous mutation. From these results, we are confident that our strategy can detect mutant alleles in unpurified MDS bone marrow samples, but it may miss mutations occurring in a rare cell. We then designed and validated primers for 415 amplicons covering the coding region and proximal introns of all 28 genes in the 5q31.2 interval and produced 7.2 megabases of sequence for these genes in the 46 patient samples. No somatic changes were identified. Twelve novel non-synonymous SNPs were discovered. The allele and genotype frequencies of 49 known SNPs in these genes were similar to the frequencies in race-matched HapMap individuals, apart from an enrichment of a non-synonymous CDC25C SNP in the MDS cohort (rs3734166, odds ratio = 1.87, 95% confidence interval = 1.06–3.32, p = 0.036). The over-representation of this SNP and of the other rare novel SNPs in the MDS population suggests that they may mark susceptibility alleles for MDS. Taken together, these data suggest that small deletions and/or point mutations of genes in 5q31.2 are not common events in the pathogenesis of MDS, and that larger deletions leading to haploinsufficiency of several genes in the interval appear to be the dominant mechanism.
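The relationship between the 12% mutant-allele detection limit and the ∼20–25% cell fraction quoted above can be checked with simple arithmetic. The sketch below is illustrative only and is not the authors' analysis. It assumes a diploid locus with no copy-number change, in which each mutant cell carries exactly one mutant allele out of two. The function names are hypothetical.

```python
def heterozygous_cell_fraction(mutant_allele_fraction: float) -> float:
    """Fraction of cells carrying a heterozygous mutation.

    Assumption (not from the source): the locus is diploid in every cell,
    so each mutant cell contributes one mutant and one wild-type allele.
    """
    if not 0.0 <= mutant_allele_fraction <= 0.5:
        raise ValueError("a heterozygous mutation cannot exceed 50% of alleles")
    return 2.0 * mutant_allele_fraction


def mutant_allele_fraction(cell_fraction: float) -> float:
    """Inverse conversion: expected mutant-allele fraction for a cell fraction."""
    if not 0.0 <= cell_fraction <= 1.0:
        raise ValueError("cell fraction must lie between 0 and 1")
    return cell_fraction / 2.0


if __name__ == "__main__":
    # Detection limit reported in the abstract: 12% of alleles.
    cells = heterozygous_cell_fraction(0.12)
    print(f"12% mutant alleles corresponds to about {cells:.0%} of cells")
```

Under this assumption, a 12% allele fraction corresponds to roughly a quarter of cells, which is consistent with the ∼20–25% range stated in the abstract.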