Predicting Cell-Penetrating Peptides: Building and Interpreting Random Forest based prediction Models

Yadahalli, Shilpa; Verma, Chandra

doi:10.1101/2020.10.15.341149

Cited by 4 publications

(6 citation statements)

References 42 publications

“…Previous studies have suggested that uptake efficiency of CPPs are correlated with sequence length and basic residue (arginine or lysine) positions (Futaki et al, 2007; Liu et al, 2016; Yadahalli & Verma, 2020). To further evaluate whether peptide P1 is sensitive to changes in amino acid sequences, peptide truncation (Figure 2(A)) and single mutation (Figure 2(B)) prediction by CellPPD were performed.…”

Section: Results (mentioning)

confidence: 99%
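
The truncation and single-mutation scan described in this statement can be sketched as follows. This is a minimal illustration, not the citing authors' procedure: the sequence of peptide P1 is not given here, so the well-known TAT(47-57) peptide is used as a stand-in, and the helper names `truncations` and `single_mutants` are hypothetical. The sketch only enumerates variants; scoring them with a predictor such as CellPPD is a separate step that is not shown.

```python
# Illustrative sketch: enumerate truncation and single-substitution variants
# of a peptide so that each variant can later be scored by a CPP predictor.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def truncations(seq, min_len):
    """Return N- and C-terminal fragments of seq, down to min_len residues."""
    out = []
    for n in range(len(seq) - 1, min_len - 1, -1):
        out.append(("N-terminal fragment", seq[:n]))   # keeps the first n residues
        out.append(("C-terminal fragment", seq[-n:]))  # keeps the last n residues
    return out


def single_mutants(seq):
    """Yield (label, mutant) for every single-residue substitution of seq."""
    for i, wild in enumerate(seq):
        for aa in AMINO_ACIDS:
            if aa != wild:
                # Label follows the usual convention, e.g. "R3A" = Arg at position 3 to Ala
                yield f"{wild}{i + 1}{aa}", seq[:i] + aa + seq[i + 1:]


peptide = "YGRKKRRQRRR"  # TAT(47-57), an assumed stand-in; NOT peptide P1
frags = truncations(peptide, min_len=9)
mutants = list(single_mutants(peptide))
print(len(frags), "fragments;", len(mutants), "single mutants")
```

A peptide of length L yields 19 × L single mutants, so even short peptides produce a few hundred candidates to score.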

“…Previous studies have suggested that uptake efficiency of CPPs are correlated with sequence length and basic residue (arginine or lysine) positions (Futaki et al, 2007; Liu et al, 2016; Yadahalli & Verma, 2020). To further evaluate whether […] Truncation analysis in Figure 2(A) suggested that 15-mer and 10-mer truncated peptide P1 fragments have significantly decreased penetration property, although the first 10-mer of N-terminal peptide P1 still had a higher score than the full-length peptide P1, which may be due to the core motif that determines the penetration property of peptide P1.…”

Section: Penetration Properties and Immunogenicity Prediction Of Peptide P1 (mentioning)

confidence: 99%

In silico identification and experimental validation of cellular uptake by a new cell penetrating peptide P1 derived from MARCKS

et al. 2021

Viral vectors for vaccine delivery are challenged by recently reported safety issues like immunogenicity and risk for cancer development, and thus there is a growing need for the development of non-viral vectors. Cell penetrating peptides (CPPs) are non-viral vectors that can enter plasma membranes efficiently and deliver a broad range of cargoes. Our bioinformatic prediction and wet-lab validation data suggested that peptide P1 derived from MARCKS protein phosphorylation site domain is a new potential CPP candidate. We found that peptide P1 can efficiently internalize into various cell lines in a concentration-dependent manner. Receptor-mediated endocytosis pathway is the major mechanism of P1 penetration, although P1 also directly penetrates the plasma membrane. We also found that peptide P1 has low cytotoxicity in cultured cell lines as well as mouse red blood cells. Furthermore, peptide P1 not only can enter into cultured cells itself, but it also can interact with plasmid DNA and mediate the functional delivery of plasmid DNA into cultured cells, even in hard-to-transfect cells. Combined, these findings indicate that P1 may be a promising vector for efficient intracellular delivery of bioactive cargos.

“…csv Listing 1: Example TRILL commands for CPP workflow Cell penetrability is an example of a protein function that BLAST/HMMs routinely fail in identifying due to convergent properties without sharing common ancestry. Utilizing Dataset E from Yadahalli 2020 [8], we first trained an XGBoost classifier on the protein embeddings from ESM2-150M and then achieved an F1 of 0.876 on a held-out 25% of the CPPs (Figure 5). We then finetuned ProtGPT2 on the 955 CPPs for 10 epochs with a learning rate of 1e-5.…”

Section: Workflow 2: Family Based Protein Generation (mentioning)

confidence: 99%
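
The evaluation protocol mentioned in this statement (hold out 25% of the data, then report F1) can be sketched in plain Python. This is an assumption-laden illustration only: the ESM2 embeddings and the XGBoost classifier are omitted, the labels and predictions below are made-up toy values, and `holdout_split` and `f1_score` are hypothetical helpers written for this sketch rather than TRILL functions.

```python
import random


def holdout_split(items, test_fraction=0.25, seed=0):
    """Shuffle items reproducibly and hold out test_fraction of them."""
    idx = list(range(len(items)))
    random.Random(seed).shuffle(idx)
    n_test = round(len(items) * test_fraction)
    test = [items[i] for i in idx[:n_test]]
    train = [items[i] for i in idx[n_test:]]
    return train, test


def f1_score(y_true, y_pred, positive=1):
    """F1 = harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    if tp == 0:
        return 0.0  # no true positives: precision and/or recall is zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy data (assumed): 1 = CPP, 0 = non-CPP
train, test = holdout_split(list(range(8)))
toy_f1 = f1_score([1, 1, 0, 0], [1, 0, 1, 0])
print(len(train), len(test), toy_f1)
```

In a real workflow the classifier would be fitted on the training split only, and the F1 computed on predictions for the held-out split.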

“…While these methods rely on evolutionary relationships to link related sequences through homology, machine learning based methods have shown success for functional comparisons without needing shared ancestry. For example, researchers have been able to predict whether a given protein is a cell-penetrating peptide, regardless of actual homology TRILL [8]. These predictions were enabled by extracting amino acid frequencies and biochemical properties for each protein and using this data to train random-forest classifiers.…”

Section: Introduction (mentioning)

confidence: 99%
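
The feature extraction summarized in this statement (amino acid frequencies plus biochemical properties per sequence) might look like the sketch below. It is a simplified assumption, not the descriptor set used by Yadahalli and Verma: only residue composition, length, basic-residue fraction, and a crude net charge (K and R counted as +1, D and E as -1) are computed, and the function names are hypothetical. The resulting dictionaries could be turned into feature vectors for a random-forest classifier.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def composition(seq):
    """Fraction of each standard amino acid in seq."""
    seq = seq.upper()
    counts = Counter(seq)
    return {aa: counts[aa] / len(seq) for aa in AMINO_ACIDS}


def simple_properties(seq):
    """A few coarse descriptors; a real model would use many more."""
    seq = seq.upper()
    basic = sum(seq.count(aa) for aa in "KR")   # lysine and arginine
    acidic = sum(seq.count(aa) for aa in "DE")  # aspartate and glutamate
    return {
        "length": len(seq),
        "basic_fraction": basic / len(seq),
        "net_charge": basic - acidic,  # crude estimate, ignores His and termini
    }


peptide = "YGRKKRRQRRR"  # TAT(47-57), used purely as an example input
features = {**composition(peptide), **simple_properties(peptide)}
print(features["R"], features["basic_fraction"], features["net_charge"])
```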

TRILL: Orchestrating Modular Deep-Learning Workflows for Democratized, Scalable Protein Analysis and Engineering

Martinez, Murray, Thomson

2023

Preprint

Deep-learning models have been rapidly adopted by many fields, partly due to the deluge of data humanity has amassed. In particular, the petabases of biological sequencing data enable the unsupervised training of protein language models that learn the "language of life." However, due to their prohibitive size and complexity, contemporary deep-learning models are often unwieldy, especially for scientists with limited machine learning backgrounds. TRILL (TRaining and Inference using the Language of Life) is a platform for creative protein design and discovery. Leveraging several state-of-the-art models such as ESM-2, DiffDock, and RFDiffusion, TRILL allows researchers to generate novel proteins, predict 3-D structures, extract high-dimensional representations of proteins, functionally classify proteins and more. What sets TRILL apart is its ability to enable complex pipelines by chaining together models and effectively merging the capabilities of different models to achieve a sum greater than its individual parts. Whether using Google Colab with one GPU or a supercomputer with hundreds, TRILL allows scientists to effectively utilize models with millions to billions of parameters by using optimized training strategies such as ZeRO-Offload and distributed data parallel. Therefore, TRILL not only bridges the gap between complex deep-learning models and their practical application in the field of biology, but also simplifies the orchestration of these models into comprehensive workflows, democratizing access to powerful methods. Documentation: https://trill.readthedocs.io/en/latest/home.html.

“…These predictors generally help with the design of a first generation CPP, but may also help to further modify already known CPPs to suit specific cargo and application. There are still some limitations on current prediction models as they are dependent on the quality of input data and the data used for training (Yadahalli and Verma, 2020).…”

Section: Prediction Of CPPs (mentioning)

confidence: 99%

Approaches for evaluation of novel CPP-based cargo delivery systems

Porosk, Langel

2022

Front. Pharmacol.

Cell penetrating peptides (CPPs) can be broadly defined as relatively short synthetic, protein derived or chimeric peptides. Their most remarkable property is their ability to cross cell barriers and facilitate the translocation of cargo, such as drugs, nucleic acids, peptides, small molecules, dyes, and many others across the plasma membrane. Over the years there have been several approaches used, adapted, and developed for the evaluation of CPP efficacies as delivery systems, with the fluorophore attachment as the most widely used approach. It has become progressively evident, that the evaluation method, in order to lead to successful outcome, should concede with the specialties of the delivery. For characterization and assessment of CPP-cargo a combination of research tools of chemistry, physics, molecular biology, engineering, and other fields have been applied. In this review, we summarize the diverse, in silico, in vitro and in vivo approaches used for evaluation and characterization of CPP-based cargo delivery systems.
