Whole transcriptome sequencing (RNA-seq) has become a standard for cataloguing and monitoring RNA populations. One of the main bottlenecks, however, is to correctly identify the different classes of RNAs among the plethora of reconstructed transcripts, particularly those that will be translated (mRNAs) from the class of long non-coding RNAs (lncRNAs). Here, we present FEELnc (FlExible Extraction of LncRNAs), an alignment-free program that accurately annotates lncRNAs based on a Random Forest model trained with general features such as multi k-mer frequencies and relaxed open reading frames. Benchmarking versus five state-of-the-art tools shows that FEELnc achieves similar or better classification performance on GENCODE and NONCODE data sets. The program also provides specific modules that enable the user to fine-tune classification accuracy, to formalize the annotation of lncRNA classes and to identify lncRNAs even in the absence of a training set of non-coding RNAs. We used FEELnc on a real data set comprising 20 canine RNA-seq samples produced by the European LUPA consortium to substantially expand the canine genome annotation to include 10 374 novel lncRNAs and 58 640 mRNA transcripts. FEELnc moves beyond conventional coding potential classifiers by providing a standardized and complete solution for annotating lncRNAs and is freely available at https://github.com/tderrien/FEELnc.
Long non-coding RNAs (lncRNAs) are a family of heterogeneous RNAs that play major roles in multiple biological processes. We recently identified an extended repertoire of more than 10,000 lncRNAs of the domestic dog however, predicting their biological functionality remains challenging. In this study, we have characterised the expression profiles of 10,444 canine lncRNAs in 26 distinct tissue types, representing various anatomical systems. We showed that lncRNA expressions are mainly clustered by tissue type and we highlighted that 44% of canine lncRNAs are expressed in a tissue-specific manner. We further demonstrated that tissue-specificity correlates with specific families of canine transposable elements. In addition, we identified more than 900 conserved dog-human lncRNAs for which we show their overall reproducible expression patterns between dog and human through comparative transcriptomics. Finally, co-expression analyses of lncRNA and neighbouring protein-coding genes identified more than 3,400 canine lncRNAs, suggesting that functional roles of these lncRNAs act as regulatory elements. Altogether, this genomic and transcriptomic integrative study of lncRNAs constitutes a major resource to investigate genotype to phenotype relationships and biomedical research in the dog species.
Due to its rarity, histiocytic sarcoma (HS) is not well known and treatment remains a challenge. Using the canine sponteanous model, we highlighted the association of PTPN11 somatic mutations to a disseminated subtype of HS in humans and dogs. Thanks to in-house canine HS cell lines, we tested MAP kinase inhibitors as a proof of concept of the interest of this model for the development and selection of efficient therapies.
Canine cancers represent a tremendous natural resource due to their incidence and striking similarities to human cancers, sharing similar clinical and pathologic features as well as oncogenic events, including identical somatic mutations. Considering the importance of gene fusions as driver alterations, we explored their relevance in canine cancers. We focused on three distinct human-comparable canine cancers representing different tissues and embryonic origins. Through RNA-Seq, we discovered similar gene fusions as those found in their human counterparts: - in B-cell lymphoma, - in glioma, and - in dermatofibrosarcoma protuberans-like. We showed not only similar partner genes but also identical breakpoints leading to oncogene overexpression. This study demonstrates similar gene fusion partners and mechanisms in human-dog corresponding tumors and allows for selection of targeted therapies in preclinical and clinical trials with pet dogs prior to human trials, within the framework of personalized medicine. .
Histiocytic sarcoma (HS) is a rare but aggressive cancer in both humans and dogs. The spontaneous canine model, which has clinical, epidemiological, and histological similarities with human HS and specific breed predispositions, provides a unique opportunity to unravel the genetic basis of this cancer. In this study, we aimed to identify germline risk factors associated with the development of HS in canine-predisposed breeds. We used a methodology that combined several genome-wide association studies in a multi-breed and multi-cancer approach as well as targeted next-generation sequencing, and imputation We combined several dog breeds (Bernese mountain dogs, Rottweilers, flat-coated retrievers, and golden retrievers), and three hematopoietic cancers (HS, lymphoma, and mast cell tumor). Results showed that we not only refined the previously identified HS risk CDKN2A locus, but also identified new loci on canine chromosomes 2, 5, 14, and 20. Capture and targeted sequencing of specific loci suggested the existence of regulatory variants in non-coding regions and methylation mechanisms linked to risk haplotypes, which lead to strong cancer predisposition in specific dog breeds. We also showed that these canine cancer predisposing loci appeared to be due to the additive effect of several risk haplotypes involved in other hematopoietic cancers such as lymphoma or mast cell tumors as well. This illustrates the pleiotropic nature of these canine cancer loci as observed in human oncology, thereby reinforcing the interest of predisposed dog breeds to study cancer initiation and progression.
White coat color in mammals has been selected several times during the domestication process. Numerous dog breeds are fixed for one form of white coat color that involves darkly pigmented skin. The genetic basis of this color, due to the absence of pigment in the hairs, was suggested to correspond to extreme dilution of the phaeomelanin, by both the expression of only phaeomelanin (locus E) and its extreme dilution (locus I). To go further, we performed genome-wide association studies (GWAS) using a multiple breed approach. The first GWAS, using 34 white dogs and 128 non-white dogs, including White Shepherds, Poodles, Cotons de Tulear and Bichons allowed us to identify two significantly associated loci on the locus E and a novel locus on chromosome 20. A second GWAS using 15 other breeds presenting extreme phaeomelanin dilution confirmed the position of locus I on the chromosome 20 (position 55 Mb pcorrected = 6 × 10−13). Using whole-genome sequencing, we identified a missense variant in the first exon of MFSD12, a gene recently identified to be involved in human, mouse and horse pigmentation. We confirmed the role of this variant in phaeomelanin dilution of numerous canine breeds, and the conserved role of MFSD12 in mammalian pigmentation.
Canine oral melanoma is the first malignancy of the oral cavity in dogs and is characterized by a local invasiveness and a high metastatic propensity. A better knowledge of genetic alterations is expected to improve management of this tumour. Copy number alterations are known characteristics of mucosal melanomas both in dogs and humans. The goal of this study was to explore the prognostic value of somatic focal amplifications on chromosomes (Canis Familiaris [CFA]) 10 and 30 in canine oral melanoma. The cohort included 73 dogs with oral melanoma confirmed by histology, removed surgically without adjuvant therapy and with a minimal follow-up of 6 months. Epidemiological, clinical and histological data were collected and quantitative-PCR were performed on formalin-fixed paraffin-embedded (FFPE) samples to identify specific focal amplifications. The 73 dogs included in the study had a median survival time of 220 days. Focal amplifications on CFA 10 and 30 were recurrent (49.3% and 50.7% of cases, respectively) and CFA 30 amplification was significantly associated with the amelanotic phenotype (P = .046) and high mitotic index (MI; P = .0039). CFA 30 amplification was also linked to poor prognosis (P = .0005). Other negative prognostic factors included gingiva location (P = .003), lymphadenomegaly (P = .026), tumour ulceration at diagnosis (P = .003), MI superior to 6 mitoses over 10 fields (P = .001) and amelanotic tumour (P = .029). In multivariate analyses using Cox proportional hazards regression, CFA 30 amplification (Hazard ratio [HR] = 2.08; P = .011), tumour location (HR = 2.20; P = .005) and histological pigmentation (HR = 1.87; P = .036) were significantly associated with shorter survival time. Focal amplification of CFA 30 is linked to an aggressive subset and constitutes a new prognostic factor.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.