Subunit monomers and oligomers of crystalloid-type legumins are major components of SDS-soluble fractions from Metasequoia glyptostroboides (Dawn redwood, Taxodiaceae) seed proteins. The subunits are made up of disulfide linked a-polypeptides and p-polypeptides with molecular masses of 33 kDa and 23 -25 kDa, respectively. Unusually for legumins, those from Metasequoia are glycosylated and the carbohydrate moieties are residing in the C-terminal region of the respective P-polypeptides. A Metasequoia endosperm cDNA library has been constructed and legumin-encoding transcripts representing two divergent gene subfamilies have been characterized. Intersubfamily comparisons reveal 75 % identity at the amino acid level and the values range from 53-34% when the legumin precursors deduced were compared with those from angiosperms. The predicted sequences together with data from amino acid sequencing prove that post-translational processing of Metasequoia prolegumins is directed to two different processing sites, each of them specific for one of the legumin subfamilies. The sites involved differ in their relative position and in the junction to be cleaved: Metasequoia legumin precursors MgLegl8 and MgLeg26 contain the conventional post-translational Asn-Gly processing site, which is generally regarded as highly conserved. In contrast, the MgLeg4 precursor is lacking this site and post-translational cleavage is directed to an unusual Asn-Thr processing site located in its hypervariable region, causing Nterminal extension of the /I-polypeptide relative to those hitherto known. Evidence is given that the unusual variant of processing also occurs in other conifers. Phylogenetic analysis reveals the precursors concerned as representatives of a distinct legumin subfamily, originating from duplication of an ancestral gene prior to or at the beginning of Taxodiaceae diversification.Keywords : seed storage protein ; legumin ; post-translational processing ; Metasequoia ; gymnosperms.Legumins are the most widely distributed seed storage proteins in angiosperms and their structure, biosynthesis and intracellular transfer are well conserved throughout the species investigated [l -41. They are synthesized as prepro-type precursors which are cotranslationally transported into the endoplasmic reticulum with the simultaneous cleavage of N-terminal signal peptides. Conversion of prolegumins into their mature forms involves further cleavage, taking place in the vacuole and catalyzed by specific proteases. The most invariable characteristic of known prolegumins is the site where cleavage occurs to form the mature a-polypeptides (~4 0 kDa) and /I-polypeptides (~2 0 kDa), which remain paired by a disulfide bond: with only a few exceptions cleavage is regularly directed to an Asn-Gly junction, and the position of this site is absolutely invariant in all prolegumins known; this pattern of processing has been viewed as highly conserved during evolution [5 -91. However, recent in vitro studies prove that the maturation proteases involved are sign...