We conducted comprehensive integrative molecular analyses of the complete set of tumors in The Cancer Genome Atlas (TCGA), consisting of approximately 10,000 specimens and representing 33 types of cancer. We performed molecular clustering using data on chromosome-arm-level aneuploidy, DNA hypermethylation, mRNA, and miRNA expression levels and reverse-phase protein arrays, of which all, except for aneuploidy, revealed clustering primarily organized by histology, tissue type, or anatomic origin. The influence of cell type was evident in DNA-methylation-based clustering, even after excluding sites with known preexisting tissue-type-specific methylation. Integrative clustering further emphasized the dominant role of cell-of-origin patterns. Molecular similarities among histologically or anatomically related cancer types provide a basis for focused pan-cancer analyses, such as pan-gastrointestinal, pan-gynecological, pan-kidney, and pan-squamous cancers, and those related by stemness features, which in turn may inform strategies for future therapeutic development.
SUMMARY
Liver cancer has the second highest worldwide cancer mortality rate and has limited therapeutic options. We analyzed 363 hepatocellular carcinoma (HCC) cases by whole exome sequencing and DNA copy number analyses, and 196 HCC also by DNA methylation, RNA, miRNA, and proteomic expression. DNA sequencing and mutation analysis identified significantly mutated genes including LZTR1, EEF1A1, SF3B1, and SMARCA4. Significant alterations by mutation or down-regulation by hypermethylation in genes likely to result in HCC metabolic reprogramming (ALB, APOB, and CPS1) were observed. Integrative molecular HCC subtyping incorporating unsupervised clustering of five data platforms identified three subtypes, one of which was associated with poorer prognosis in three HCC cohorts. Integrated analyses enabled development of a p53 target gene expression signature correlating with poor survival. Potential therapeutic targets for which inhibitors exist include WNT signaling, MDM4, MET, VEGFA, MCL1, IDH1, TERT, and immune checkpoint proteins CTLA-4, PD-1, and PD-L1.
The factors regulating the expression of microRNAs (miRNAs), a ubiquitous family of ∼22-nt noncoding regulatory RNAs, remain undefined. However, it is known that miRNAs are first transcribed as a largely unstructured precursor, termed a primary miRNA (pri-miRNA), which is sequentially processed in the nucleus, to give the ∼65-nt pre-miRNA hairpin intermediate, and then in the cytoplasm, to give the mature miRNA. Here we have sought to identify the RNA polymerase responsible for miRNA transcription and to define the structure of a full-length human miRNA. We show that the pri-miRNA precursors for nine human miRNAs are both capped and polyadenylated and report the sequence of the full-length, ∼3433-nt pri-miR-21 RNA. This pri-miR-21 gene sequence is flanked 5 by a promoter element able to transcribe heterologous mRNAs and 3 by a consensus polyadenylation sequence. Nuclear processing of pri-miRNAs was found to be efficient, thus largely preventing the nuclear export of full-length pri-miRNAs. Nevertheless, an intact miRNA stem-loop precursor located in the 3 UTR of a protein coding gene only moderately inhibited expression of the linked open reading frame, probably because the 3 truncated mRNA could still be exported and expressed. Together, these data show that human pri-miRNAs are not only structurally similar to mRNAs but can, in fact, function both as pri-miRNAs and mRNAs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.