Basal cell carcinoma (BCC) of the skin is the most common malignant neoplasm in humans. BCC is primarily driven by the Sonic Hedgehog (Hh) pathway. However, its phenotypic variation remains unexplained. Our genetic profiling of 293 BCCs found the highest mutation rate in cancer (65 mutations/Mb). Eighty-five percent of the BCCs harbored mutations in Hh pathway genes (PTCH1, 73% or SMO, 20% (P = 6.6 × 10(-8)) and SUFU, 8%) and in TP53 (61%). However, 85% of the BCCs also harbored additional driver mutations in other cancer-related genes. We observed recurrent mutations in MYCN (30%), PPP6C (15%), STK19 (10%), LATS1 (8%), ERBB2 (4%), PIK3CA (2%), and NRAS, KRAS or HRAS (2%), and loss-of-function and deleterious missense mutations were present in PTPN14 (23%), RB1 (8%) and FBXW7 (5%). Consistent with the mutational profiles, N-Myc and Hippo-YAP pathway target genes were upregulated. Functional analysis of the mutations in MYCN, PTPN14 and LATS1 suggested their potential relevance in BCC tumorigenesis.
APOBEC3A and APOBEC3B, cytidine deaminases of the APOBEC family, are among the main factors causing mutations in human cancers. APOBEC deaminates cytosines in single-stranded DNA (ssDNA). A fraction of the APOBEC-induced mutations occur as clusters ("kataegis") in single-stranded DNA produced during repair of double-stranded breaks (DSBs). However, the properties of the remaining 87% of nonclustered APOBEC-induced mutations, the source and the genomic distribution of the ssDNA where they occur, are largely unknown. By analyzing genomic and exomic cancer databases, we show that >33% of dispersed APOBEC-induced mutations occur on the lagging strand during DNA replication, thus unraveling the major source of ssDNA targeted by APOBEC in cancer. Although methylated cytosine is generally more mutation-prone than nonmethylated cytosine, we report that methylation reduces the rate of APOBEC-induced mutations by a factor of roughly two. Finally, we show that in cancers with extensive APOBEC-induced mutagenesis, there is almost no increase in mutation rates in late replicating regions (contrary to other cancers). Because late-replicating regions are depleted in exons, this results in a 1.3-fold higher fraction of mutations residing within exons in such cancers. This study provides novel insight into the APOBEC-induced mutagenesis and describes the peculiarity of the mutational processes in cancers with the signature of APOBEC-induced mutations.
Clustering of mutations has been observed in cancer genomes as well as for germline de novo mutations (DNMs). We identified 1,796 clustered DNMs (cDNMs) within whole-genome-sequencing data from 1,291 parent-offspring trios to investigate their patterns and infer a mutational mechanism. We found that the number of clusters on the maternal allele was positively correlated with maternal age and that these clusters consisted of more individual mutations with larger intermutational distances than those of paternal clusters. More than 50% of maternal clusters were located on chromosomes 8, 9 and 16, in previously identified regions with accelerated maternal mutation rates. Maternal clusters in these regions showed a distinct mutation signature characterized by C>G transversions. Finally, we found that maternal clusters were associated with processes involving double-strand-breaks (DSBs), such as meiotic gene conversions and de novo deletion events. This result suggested accumulation of DSB-induced mutations throughout oocyte aging as the mechanism underlying the formation of maternal mutation clusters.
Biological mechanisms underlying human germline mutations remain largely unknown. We statistically decompose variation in the rate and spectra of mutations along the genome using volume-regularized nonnegative matrix factorization. The analysis of a sequencing dataset (TOPMed) reveals nine processes that explain the variation in mutation properties between loci. We provide a biological interpretation for seven of these processes. We associate one process with bulky DNA lesions that resolve asymmetrically with respect to transcription and replication. Two processes track direction of replication fork and replication timing, respectively. We identify a mutagenic effect of active demethylation primarily acting in regulatory regions and a mutagenic effect of LINE repeats. We localize a mutagenic process specific to oocytes from population sequencing data. This process appears transcriptionally asymmetric.
Populations of different species vary in the amounts of genetic diversity they possess. Nucleotide diversity π, the fraction of nucleotides that are different between two randomly chosen genotypes, has been known to range in eukaryotes between 0.0001 in Lynx lynx and 0.16 in Caenorhabditis brenneri. Here, we report the results of a comparative analysis of 24 haploid genotypes (12 from the United States and 12 from European Russia) of a split-gill fungus Schizophyllum commune. The diversity at synonymous sites is 0.20 in the American population of S. commune and 0.13 in the Russian population. This exceptionally high level of nucleotide diversity also leads to extreme amino acid diversity of protein-coding genes. Using whole-genome resequencing of 2 parental and 17 offspring haploid genotypes, we estimate that the mutation rate in S. commune is high, at 2.0 × 10−8 (95% CI: 1.1 × 10−8 to 4.1 × 10−8) per nucleotide per generation. Therefore, the high diversity of S. commune is primarily determined by its elevated mutation rate, although high effective population size likely also plays a role. Small genome size, ease of cultivation and completion of the life cycle in the laboratory, free-living haploid life stages and exceptionally high variability of S. commune make it a promising model organism for population, quantitative, and evolutionary genetics.
Gene expression levels can be subject to selection. We hypothesized that the age of gene origin is associated with expression constraints, given that it affects the level of gene integration into the functional cellular environment. By studying the genetic variation affecting gene expression levels (cis expression quantitative trait loci [cis-eQTLs]) and protein levels (cis protein QTLs [cis-pQTLs]), we determined that young, primate-specific genes are enriched in cis-eQTLs and cis-pQTLs. Compared to cis-eQTLs of old genes originating before the zebrafish divergence, cis-eQTLs of young genes have a higher effect size, are located closer to the transcription start site, are more significant, and tend to influence genes in multiple tissues and populations. These results suggest that the expression constraint of each gene increases throughout its lifespan. We also detected a positive correlation between expression constraints (approximated by cis-eQTL properties) and coding constraints (approximated by Ka/Ks) and observed that this correlation might be driven by gene age. To uncover factors associated with the increase in gene-age-related expression constraints, we demonstrated that gene connectivity, gene involvement in complex regulatory networks, gene haploinsufficiency, and the strength of posttranscriptional regulation increase with gene age. We also observed an increase in heritability of gene expression levels with age, implying a reduction of the environmental component. In summary, we show that gene age shapes key gene properties during evolution and is therefore an important component of genome function.
Mismatch repair (MMR) is one of the main systems maintaining fidelity of replication. Differences in correction of errors produced during replication of the leading and the lagging DNA strands were reported in yeast and in human cancers, but the causes of these differences remain unclear. Here, we analyze data on human cancers with somatic mutations in two of the major DNA polymerases, delta and epsilon, that replicate the genome. We show that these cancers demonstrate a substantial asymmetry of the mutations between the leading and the lagging strands. The direction of this asymmetry is the opposite between cancers with mutated polymerases delta and epsilon, consistent with the role of these polymerases in replication of the lagging and the leading strands in human cells, respectively. Moreover, the direction of strand asymmetry observed in cancers with mutated polymerase delta is similar to that observed in MMR-deficient cancers. Together, these data indicate that polymerase delta (possibly together with polymerase alpha) contributes more mismatches during replication than its leading-strand counterpart, polymerase epsilon; that most of these mismatches are repaired by the MMR system; and that MMR repairs about three times more mismatches produced in cells during lagging strand replication compared with the leading strand.
Mutation rate varies between sites in the genome. Part of this variation can be explained by well-recognized short nucleotide contexts, but a large component of this variation remains cryptic. We used data on interspecies divergence and intraspecies polymorphism in Drosophila and Hominidae to analyze variation of the average rate of the 12 possible kinds of single-nucleotide mutations and in the transition/transversion ratio κ at single-nucleotide resolution. Both the average mutation rate and κ vary by a factor of ~3 between nucleotide sites. The characteristic scale of variation in κ is up to at least ~30 nucleotides in Drosophila and ~5 nucleotides in Hominidae. Genome segments with locally elevated mutation rates possess lower values of κ; however, a substantial fraction of variation in κ cannot be directly explained by the local mutation rates.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.