2022
DOI: 10.1101/2022.06.12.495804
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

TMbed – Transmembrane proteins predicted through Language Model embeddings

Abstract: Background: Despite the immense importance of transmembrane proteins (TMP) for molecular biology and medicine, experimental 3D structures for TMPs remain about 4-5 times underrepresented compared to non-TMPs. Today's top methods can accurately predict structures for many, but the annotations of the transmembrane regions remains a limiting step for proteome-wide predictions. Results: Here, we present a novel method, dubbed TMbed. Inputting embeddings from protein Language Models (in particular ProtT5), TMbed co… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(6 citation statements)
references
References 85 publications
0
6
0
Order By: Relevance
“… Panel A: residue level features: secondary structure, transmembrane topology, disordered residues, small molecule, nucleic or metal binding residues, residue conservation and average variation (23; 39; 42; 13; 29); Panel B: sequence-level features: predicted subcellular localization (64), and an excerpt of predicted GO-annotations (38); Panel C: effect of SAVs (wild-type sequence on x-axis, mutations on y-axis; darker color=higher effect) (42); and Panel D : predicted 3D structure (45). Interactive version at https://embed.predictprotein.org/#/Q9NZC2.…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“… Panel A: residue level features: secondary structure, transmembrane topology, disordered residues, small molecule, nucleic or metal binding residues, residue conservation and average variation (23; 39; 42; 13; 29); Panel B: sequence-level features: predicted subcellular localization (64), and an excerpt of predicted GO-annotations (38); Panel C: effect of SAVs (wild-type sequence on x-axis, mutations on y-axis; darker color=higher effect) (42); and Panel D : predicted 3D structure (45). Interactive version at https://embed.predictprotein.org/#/Q9NZC2.…”
Section: Resultsmentioning
confidence: 99%
“…Per-protein features predicted solely with ProtT5 embeddings as input currently include subcellular location (64), and Gene Ontology terms (GO) (38). Per-residue predictions solely with ProtT5 embeddings as input include: conservation (42); helical transmembrane regions, transmembrane beta barrels, along with signal peptides (13); binding for various ligands (39); intrinsically disordered regions (29); secondary structure (23). LambdaPP also predicts the effect of introducing single amino acid variants (SAV) in the input sequence upon molecular function, which uses the predicted conservation with a BLOSUM62-score (26) of the SAV as input (42).…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations