“…These amino acids are
also found frequently in human and murine CDR H3 regions (3, 39), but in a
survey of the complete UniProtKB/SwissProt sequence database as of February 2016
(http://web.expasy.org/docs/relnotes/relstat.html), frequencies for
these particular amino acids in all available protein sequences are 2.9%
(Tyr), 7.1% (Gly), and 6.6% (Ser), so that they are indeed
over-represented in these ultralong CDR H3s. Synthetic libraries utilizing antibody
scaffolds as well as other protein folds have been developed using very small
subsets of amino acids as recognition elements, with tyrosine, serine, and glycine
proving the best minimal combination (40) for
effective antigen recognition.…”