Galectins (formerly known as “S-type lectins”) are a subfamily of soluble proteins that typically bind β-galactoside carbohydrates with high specificity. They are present in many forms of life, from nematodes and fungi to animals, where they perform a wide range of functions. Particularly in humans, different types of galectins have been described differing not only in their tissue expression but also in their cellular location, oligomerization, fold architecture and carbohydrate-binding affinity. This distinct yet sometimes overlapping distributions and physicochemical attributes make them responsible for a wide variety of both intra- and extracellular functions, including tremendous importance in immunity and disease. In this review, we aim to provide a general description of galectins most important structural features, with a special focus on the molecular determinants of their carbohydrate-recognition ability. For that purpose, we structurally compare the human galectins, in light of recent mutagenesis studies and novel X-ray structures. We also offer a detailed description on how to use the solvent structure surrounding the protein as a tool to get better predictions of galectin-carbohydrate complexes, with a potential application to the rational design of glycomimetic inhibitory compounds. Finally, using Gal-1 and Gal-3 as paramount examples, we review a series of recent advances in the development of engineered galectins and galectin inhibitors, aiming to dissect the structure-activity relationship through the description of their interaction at the molecular level.
Summary
UDP-glucose:glycoprotein glucosyltransferase (UGGT) flags misfolded glycoproteins for ER retention. We report crystal structures of full-length
Chaetomium thermophilum
UGGT (
Ct
UGGT), two
Ct
UGGT double-cysteine mutants, and its TRXL2 domain truncation (
Ct
UGGT-ΔTRXL2).
Ct
UGGT molecular dynamics (MD) simulations capture extended conformations and reveal clamping, bending, and twisting inter-domain movements. We name “Parodi limit” the maximum distance on the same glycoprotein between a site of misfolding and an N-linked glycan that can be reglucosylated by monomeric UGGT
in vitro
, in response to recognition of misfold at that site. Based on the MD simulations, we estimate the Parodi limit as around 70–80 Å. Frequency distributions of distances between glycoprotein residues and their closest N-linked glycosylation sites in glycoprotein crystal structures suggests relevance of the Parodi limit to UGGT activity
in vivo
. Our data support a “one-size-fits-all adjustable spanner” UGGT substrate recognition model, with an essential role for the UGGT TRXL2 domain.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.