SUMMARY Survival rates for the childhood cancer neuroblastoma have not substantively improved despite dramatic escalation in chemotherapy intensity. Like most human cancers, this embryonal malignancy can be inherited, but the genetic etiology of familial and sporadically occurring neuroblastoma was largely unknown. Here we show that germline mutations in the anaplastic lymphoma kinase gene (ALK) explain the majority of hereditary neuroblastomas, and that activating mutations can also be somatically acquired. We first identified a significant linkage signal at the short arm of chromosome 2 (maximum nonparametric LOD=4.23 at rs1344063) using a whole-genome scan in neuroblastoma pedigrees. Resequencing of regional candidate genes identified three separate missense mutations in the tyrosine kinase domain of ALK (G1128A, R1192P and R1275Q) that segregated with the disease in eight separate families. Examination of 491 sporadically occurring human neuroblastoma samples showed that the ALK locus was gained in 22.8%, and highly amplified in an additional 3.3%, and that these aberrations were highly associated with death from disease (P=0.0003). Resequencing of 194 high-risk neuroblastoma samples showed somatically acquired mutations within the tyrosine kinase domain in 12.4%. Nine of the ten mutations map to critical regions of the kinase domain and were predicted to be oncogenic drivers with high probability. Mutations resulted in constitutive phosphorylation consistent with activation, and targeted knockdown of ALK mRNA resulted in profound growth inhibition of 4 of 4 cell lines harboring mutant or amplified ALK, as well as 2 of 6 wild type for ALK. Our results demonstrate that heritable mutations of ALK are the major cause of familial neuroblastoma, and that germline or acquired activation of this cell surface kinase is a tractable therapeutic target for this lethal pediatric malignancy.
Initial expectations for genome-wide association studies were high, as such studies promised to rapidly transform personalized medicine with individualized disease risk predictions, prevention strategies and treatments. Early findings, however, revealed a more complex genetic architecture than was anticipated for most common diseases - complexity that seemed to limit the immediate utility of these findings. As a result, the practice of utilizing the DNA of an individual to predict disease has been judged to provide little to no useful information. Nevertheless, recent efforts have begun to demonstrate the utility of polygenic risk profiling to identify groups of individuals who could benefit from the knowledge of their probabilistic susceptibility to disease. In this context, we review the evidence supporting the personal and clinical utility of polygenic risk profiling.
The limitations of genome-wide association (GWA) studies that focus on the phenotypic influence of common genetic variants have motivated human geneticists to consider the contribution of rare variants to phenotypic expression. The increasing availability of high-throughput sequencing technology has enabled studies of rare variants, but will not be sufficient for their success since appropriate analytical methods are also needed. We consider data analysis approaches to testing associations between a phenotype and collections of rare variants in a defined genomic region or set of regions. Ultimately, although a wide variety of analytical approaches exist, more work is needed to refine them and determine their properties and power in different contexts.
No abstract
Recent results indicate that genome-wide association studies (GWAS) have the potential to explain much of the heritability of common complex phenotypes, but methods are lacking to reliably identify the remaining associated single nucleotide polymorphisms (SNPs). We applied stratified False Discovery Rate (sFDR) methods to leverage genic enrichment in GWAS summary statistics data to uncover new loci likely to replicate in independent samples. Specifically, we use linkage disequilibrium-weighted annotations for each SNP in combination with nominal p-values to estimate the True Discovery Rate (TDR = 1−FDR) for strata determined by different genic categories. We show a consistent pattern of enrichment of polygenic effects in specific annotation categories across diverse phenotypes, with the greatest enrichment for SNPs tagging regulatory and coding genic elements, little enrichment in introns, and negative enrichment for intergenic SNPs. Stratified enrichment directly leads to increased TDR for a given p-value, mirrored by increased replication rates in independent samples. We show this in independent Crohn's disease GWAS, where we find a hundredfold variation in replication rate across genic categories. Applying a well-established sFDR methodology we demonstrate the utility of stratification for improving power of GWAS in complex phenotypes, with increased rejection rates from 20% in height to 300% in schizophrenia with traditional FDR and sFDR both fixed at 0.05. Our analyses demonstrate an inherent stratification among GWAS SNPs with important conceptual implications that can be leveraged by statistical methods to improve the discovery of loci.
Summary Unlike humans or mice, some species have limited genome encoded combinatorial diversity potential, yet mount a robust antibody response. Cows are unusual in having exceptionally long CDR H3 loops and few V-regions, but the mechanism for creating diversity is not understood. Deep sequencing revealed that ultralong CDR H3s contain a remarkable complexity of cysteines, suggesting that disulfide-bonded mini-domains may arise during repertoire development. Indeed, crystal structures of two cow antibodies reveal that these CDR H3s form a very unusual architecture composed of a β-strand “stalk” that supports a structurally diverse, disulfide-bonded, “knob” domain. Sequence analysis suggests that diversity arises from somatic hypermutation of an ultralong DH with a severe codon bias towards mutation to cysteine. These unusual antibodies can be elicited to recognize defined antigens through the knob domain. Thus, the bovine immune system produces an antibody repertoire composed of CDR H3s of unprecedented length that fold into a diversity of mini-domains generated through combinations of somatically generated disulfides.
Compaction and looping of the ∼2.5-Mb Igh locus during V(D)J rearrangement is essential to allow all V H genes to be brought in proximity with D H -J H segments to create a diverse antibody repertoire, but the proteins directly responsible for this are unknown. Because CCCTC-binding factor (CTCF) has been demonstrated to be involved in long-range chromosomal interactions, we hypothesized that CTCF may promote the contraction of the Igh locus. ChIP sequencing was performed on pro-B cells, revealing colocalization of CTCF and Rad21 binding at ∼60 sites throughout the V H region and 2 other sites within the Igh locus. These numerous CTCF/cohesin sites potentially form the bases of the multiloop rosette structures at the Igh locus that compact during Ig heavy chain rearrangement. To test whether CTCF was involved in locus compaction, we used 3D-FISH to measure compaction in pro-B cells transduced with CTCF shRNA retroviruses. Reduction of CTCF binding resulted in a decrease in Igh locus compaction. Long-range interactions within the Igh locus were measured with the chromosomal conformation capture assay, revealing direct interactions between CTCF sites 5′ of DFL16 and the 3′ regulatory region, and also the intronic enhancer (Eμ), creating a D H -J H -Eμ-C H domain. Knockdown of CTCF also resulted in the increase of antisense transcription throughout the D H region and parts of the V H locus, suggesting a widespread regulatory role for CTCF. Together, our findings demonstrate that CTCF plays an important role in the 3D structure of the Igh locus and in the regulation of antisense germline transcription and that it contributes to the compaction of the Igh locus.A ntigen receptors are created through the highly regulated lineage-specific process of V(D)J recombination, creating a diverse repertoire of Ig and T-cell receptors. The generation of the mouse Ig heavy chain in pro-B cells begins with D H -to-J H rearrangement on both alleles, followed by V H -to-D H J H rearrangement. In order for the >100 functional murine V H genes spread across ∼2.5 Mb to gain access to the single D-J rearrangement on that allele, the Igh locus undergoes contraction and looping during the pro-B-cell stage of B-cell differentiation (1-5). By measuring spatial distances between 11 small probes spread throughout the Igh locus, Jhunjhunwala et al. (2) demonstrated that distal and proximal V H genes were approximately equidistant from the D genes specifically at the pro-B-cell stage when the V H genes are rearranging. Computational as well as geometrical approaches have suggested that the locus is organized into rosette-like clusters of loops that compact during rearrangement. Several proteins have been reported to influence Igh locus compaction, including Pax5, YY1, and Ikaros (5-7). These proteins and others, such as Ezh2 (8), are also necessary for the rearrangement of distal V H genes but not proximal V H genes. This is most likely a consequence of the lack of locus compaction in the absence of these proteins. How all these proteins funct...
Protein kinases are one of the largest families of evolutionarily related proteins and comprise one of the most abundant gene families in humans. Here we survey kinase gene mutations from the perspective of human disease phenotypes and further analyse the structural features of mutant kinases, including mutational hotspots. Our evaluation of the genotype-phenotype relationship across 915 human kinase mutations - that underlie 67 single-gene diseases, mainly inherited developmental and metabolic disorders and also certain cancers - enhances our understanding of the role of kinases in development, kinase dysfunction in pathogenesis and kinases as potential targets for therapy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.