BackgroundThe methylotrophic yeast Pichia pastoris is widely used as a bioengineering platform for producing industrial and biopharmaceutical proteins, studying protein expression and secretion mechanisms, and analyzing metabolite synthesis and peroxisome biogenesis. With the development of DNA microarray and mRNA sequence technology, the P. pastoris transcriptome has become a research hotspot due to its powerful capability to identify the transcript structures and gain insights into the transcriptional regulation model of cells under protein production conditions. The study of the P. pastoris transcriptome helps to annotate the P. pastoris transcript structures and provide useful information for further improvement of the production of recombinant proteins.ResultsWe used a massively parallel mRNA sequencing platform (RNA-Seq), based on next-generation sequencing technology, to map and quantify the dynamic transcriptome of P. pastoris at the genome scale under growth conditions with glycerol and methanol as substrates. The results describe the transcription landscape at the whole-genome level and provide annotated transcript structures, including untranslated regions (UTRs), alternative splicing (AS) events, novel transcripts, new exons, alternative upstream initiation codons (uATGs), and upstream open reading frames (uORFs). Internal ribosome entry sites (IRESes) were first identified within the UTRs of genes from P. pastoris, encoding kinases and the proteins involved in the control of growth. We also provide a transcriptional regulation model for P. pastoris grown on different carbon sources.ConclusionsWe suggest that the IRES-dependent translation initiation mechanism also exists in P. pastoris. Retained introns (RIs) are determined as the main AS event and are produced predominantly by an intron definition (ID) mechanism. Our results describe the metabolic characteristics of P. pastoris with heterologous protein production under methanol induction and provide rich information for further in-depth studies of P. pastoris protein expression and secretion mechanisms.
Compared to classical antibodies, camel heavy chain antibodies (HCAbs) are smaller in size due to lack of the light chain and the first constant domain of the heavy chain (CH1 region). The variable regions of HCAbs (VHHs) are more soluble and stable than that of conventional antibodies (VHs). Even with such simple structure, they are still functional in antigen binding. Although HCAbs have been extensively investigated over the past two decades, most efforts have been based upon low throughput sequence analysis, and there are only limited reports trying to analyze and describe the complete immune repertoire (IR) of camel HCAbs. Here we leveraged the high-throughput data generated by Next Generation Sequencing (NGS) of the variable domains of the antibody heavy chains from three Bactrian camels to conduct in-depth comparative analyses of the immunoglobulin repertoire. These include analyses of the complementary determining region 3 (CDR3) length and distribution, mutation rate, antibody characteristic amino acids, the distribution of the cysteine (Cys) codons, and the non-classical VHHs. We found that there is higher diversity in the CDR2 than in the other sub-regions, and there is a higher mutation rate in the VHHs than in the VHs (P < 0.05). In addition to substitutions at amino acid (AA) residue positions NO.49/50/52 between VH and VHH clones, we also observed other substitutions at the positions NO.40/54/57/96/101 that could lead to additional structural alterations. We also found that VH-derived VHH clones, referred to as non-classical VHH clones in this study, accounted for about 8% of all clones. Further, only 5%-10% clones had the Trp > Arg AA substitution at the first position of framework 4 for all types of clones. We present, for the first time, a relatively complete picture of the Bactrian camel antibody immune repertoire, including conventional antibody (Ab) and HCAbs, using PCR and in silico analysis based on high-throughput NGS data.
The available promoters in the Pichia pastoris expression platform are still limited. We selected and identified a novel strong constitutive promoter, P GCW14 , and tested its promoter activity using enhanced green fluorescent protein (EGFP) as a reporter. Potential promoter regions of P GCW14 were cloned upstream of the EGFP gene and promoter activity was analyzed by measuring fluorescence intensity. P GCW14 exhibited significantly stronger promoter activity than the classic strong constitutive promoters P TEF1 and P GAP under various carbon sources, suggesting that P GCW14 is a strong and constitutive promoter. Hence, P GCW14 can be used as a promoter for high-level expression of heterologous proteins.
By predicting the potential signal peptides from proteins that are naturally secreted by Pichia pastoris, we identified three possible endogenous signal peptides: Scw, Dse and Exg. We compared their capability to mediate the secretion of enhanced green fluorescent protein (EGFP) and Candida antarctica lipase B (CALB) with that of the Saccharomyces cerevisiae α-factor prepro-signal. EGFP entered the secretory pathway of P. pastoris and was efficiently secreted into the culture medium by all three endogenous peptides. Further, these three putative endogenous signal peptides were also effective in secreting CALB. These endogenous signal peptides thus have the potential to mediate the efficient secretion of heterologous proteins in P. pastoris.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.