SummaryUndifferentiated sarcomas (USARCs) of adults are diverse, rare, and aggressive soft tissue cancers. Recent sequencing efforts have confirmed that USARCs exhibit one of the highest burdens of structural aberrations across human cancer. Here, we sought to unravel the molecular basis of the structural complexity in USARCs by integrating DNA sequencing, ploidy analysis, gene expression, and methylation profiling. We identified whole genome duplication as a prevalent and pernicious force in USARC tumorigenesis. Using mathematical deconvolution strategies to unravel the complex copy-number profiles and mutational timing models we infer distinct evolutionary pathways of these rare cancers. In addition, 15% of tumors exhibited raised mutational burdens that correlated with gene expression signatures of immune infiltration, and good prognosis.
The evaluation of weight of evidence for forensic DNA profiles has been a subject of controversy since their introduction over 20 years ago. Substantial progress has been made for standard DNA profiles, but new issues have arisen in recent years with the advent of more sensitive profiling techniques, allowing profiles to be recovered from minuscule amounts of possibly degraded DNA. These low-template DNA profiles suffer from enhanced stochastic effects, including dropin, dropout, and stutter, which pose problems for DNA profile evaluation. These problems are now beginning to be overcome with the emergence of several statistical models and software. We first review the general principles of statistical evaluation of DNA profile evidence, and we then focus on low-template DNA profiles, briefly reviewing the main statistical models and software. We cover methods that use allele presence/absence and those that use electropherogram peak heights, focusing on the likelihood ratio as measure of evidential weight.
We estimate the population genetics parameter (also referred to as the fixation index) from short tandem repeat (STR) allele frequencies, comparing many worldwide human subpopulations at approximately the national level with continental-scale populations. is commonly used to measure population differentiation, and is important in forensic DNA analysis to account for remote shared ancestry between a suspect and an alternative source of the DNA. We estimate comparing subpopulations with a hypothetical ancestral population, which is the approach most widely used in population genetics, and also compare a subpopulation with a sampled reference population, which is more appropriate for forensic applications. Both estimation methods are likelihood-based, in which is related to the variance of the multinomial-Dirichlet distribution for allele counts. Overall, we find low values, with posterior 97.5 percentiles when comparing a subpopulation with the most appropriate population, and even for inter-population comparisons we find . These are much smaller than single nucleotide polymorphism-based inter-continental estimates, and are also about half the magnitude of STR-based estimates from population genetics surveys that focus on distinct ethnic groups rather than a general population. Our findings support the use of up to 3% in forensic calculations, which corresponds to some current practice.
The gains and losses of DNA that emerge as a consequence of mitotic errors and chromosomal instability are prevalent in cancer. These copy number alterations contribute to cancer initiaition, progression and therapeutic resistance. Here, we present a conceptual framework for examining the patterns of copy number alterations in human cancer using whole-genome sequencing, whole-exome sequencing, and SNP6 microarray data making it widely applicable to diverse datasets. Deploying this framework to 9,873 cancers representing 33 human cancer types from the TCGA project revealed a set of 19 copy number signatures that explain the copy number patterns of 93% of TCGA samples. 15 copy number signatures were attributed to biological processes of whole-genome doubling, aneuploidy, loss of heterozygosity, homologous recombination deficiency, and chromothripsis. The aetiology of four copy number signatures are unexplained and some cancer types have unique patterns of amplicon signatures associated with extrachromosomal DNA, disease-specific survival, and gains of proto-oncogenes such as MDM2. In contrast to base-scale mutational signatures, no copy number signature associated with known cancer risk factors. The results provide a foundation for exploring patterns of copy number changes in cancer genomes and synthesise the global landscape of copy number alterations in human cancer by revealing a diversity of mutational processes giving rise to copy number changes.
Undifferentiated pleomorphic sarcoma (UPS) is a highly aggressive soft tissue tumor. A subset of UPS is characterized by a CITED2-PRDM10 or a MED12-PRDM10 gene fusion. Preliminary data suggest that these so-called PRDM10-rearranged tumors (PRT) are clinically more indolent than classical high-grade UPS, and hence important to recognize. Here, we assessed the spectrum of accompanying mutations and the gene expression profile in PRT using genomic arrays and sequencing of the genome (WGS) and transcriptome (RNA-seq). The fusion protein's function was further investigated by conditional expression of the CITED2-PRDM10 fusion in a fibroblast cell line, followed by RNA-seq and an assay for transposase-accessible chromatin (ATAC-seq). The CADM3 gene was found to be differentially up-regulated in PRT and cell lines and was also evaluated for expression at the protein level using immunohistochemistry (IHC). The genomic analyses identified few and nonrecurrent mutations in addition to the structural variants giving rise to the gene fusions, strongly indicating that the PRDM10-fusions represent the critical driver mutations. RNA-seq of tumors showed a distinct gene expression profile, separating PRT from high-grade UPS and other soft tissue tumors. CADM3 was among the genes that was consistently and highly expressed in both PRT and fibroblasts expressing CITED2-PRDM10, suggesting that it is a direct target of the PRDM10 transcription factor. This conclusion is in line with sequencing data from ATAC-seq, showing enrichment of PRDM10 binding sites, suggesting that the amino-terminal fusion partner contributes by making the DNA more accessible to PRDM10 binding.
When evaluating the weight of evidence (WoE) for an individual to be a contributor to a DNA sample, an allele frequency database is required. The allele frequencies are needed to inform about genotype probabilities for unknown contributors of DNA to the sample. Typically databases are available from several populations, and a common practice is to evaluate the WoE using each available database for each unknown contributor. Often the most conservative WoE (most favourable to the defence) is the one reported to the court. However the number of human populations that could be considered is essentially unlimited and the number of contributors to a sample can be large, making it impractical to perform every possible WoE calculation, particularly for complex crime scene profiles. We propose instead the use of only the database that best matches the ancestry of the queried contributor, together with a substantial FST adjustment. To investigate the degree of conservativeness of this approach, we performed extensive simulations of one- and two-contributor crime scene profiles, in the latter case with, and without, the profile of the second contributor available for the analysis. The genotypes were simulated using five population databases, which were also available for the analysis, and evaluations of WoE using our heuristic rule were compared with several alternative calculations using different databases. Using FST = 0.03, we found that our heuristic gave WoE more favourable to the defence than alternative calculations in well over 99% of the comparisons we considered; on average the difference in WoE was just under 0.2 bans (orders of magnitude) per locus. The degree of conservativeness of the heuristic rule can be adjusted through the FST value. We propose the use of this heuristic for DNA profile WoE calculations, due to its ease of implementation, and efficient use of the evidence while allowing a flexible degree of conservativeness.
HighlightsThe behaviour of multi-replicate LRs with respect to the inverse match probability is proposed as a method to validate forensic LR software.We perform lab-based and simulated experiments of one-, two- and three-contributor CSPs, as well as investigating a real-world CSP.LRs rise towards the IMP with additional replicates, while never exceeding it. Additionally, the LR from multiple low-template replicates can exceed that from a single good-quality sample.We validate likeLTD by demonstrating that it adheres to the expected behaviours.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.