SummaryThe calpain family is named for the calcium dependence of the papain-like, thiol protease activity of the well-studied ubiquitous vertebrate enzymes calpain-1 (µ-calpain) and calpain-2 (m-calpain).Proteins showing sequence relatedness to the catalytic core domains of these enzymes are included in this ancient and diverse eukaryotic protein family. Calpains are examples of highly modular organization, with several varieties of amino-terminal or carboxy-terminal modules flanking a conserved core. Acquisition of the penta-EF-hand module involved in calcium binding (and the formation of heterodimers for some calpains) seems to be a relatively late event in calpain evolution. Several alternative mechanisms for binding calcium and associating with membranes/phospholipids are found throughout the family. The gene family is expanded in mammals, trypanosomes and ciliates, with up to 26 members in Tetrahymena, for example; in striking contrast to this, only a single calpain gene is present in many other protozoa and in plants. The many isoforms of calpain and their multiple splice variants complicate the discussion and analysis of the family, and challenge researchers to ascertain the relationships between calpain gene sequences, protein isoforms and their distinct or overlapping functions. In mammals and plants it is clear that a calpain plays an essential role in development. There is increasing evidence that ubiquitous calpains participate in a variety of signal transduction pathways and function in important cellular processes of life and death. In contrast to relatively promiscuous degradative proteases, calpains cleave only a restricted set of protein substrates and use complex substraterecognition mechanisms, involving primary and secondary structural features of target proteins. The detailed physiological significance of both proteolytically active calpains and those lacking key catalytic residues requires further study.
Gene organization and evolutionary historyThis review focuses on the eukaryotic calpains, although genome databases reveal bacteria, but no archaea, with sequences related to the catalytic core domains (domains dI and dII) of the classical calpains, the criterion used for designating a protein as a calpain. Only single copies of calpain-coding genes are found in the small number of sequenced or partially sequenced protozoan genomes, such as those of the apicomplexan parasites Plasmodium falciparum, Theileria annulata and Cryptosporidium parvum [1][2][3], and of the amitochondrial parasite Entamoeba histolytica [4]. No calpain-like sequences were identified in the human pathogen Giardia lamblia, a diplomonad often considered to be the most basal eukaryotic organism [5]. Protozoan calpains lack a domain containing EF-hand-type Ca 2+ -binding sites, as also do plant and fungal calpains, and thus it seems likely that the proposed cysteine proteasecalmodulin gene fusion leading to the classical calpain structure (for earlier reviews see [6][7][8]) occurred exclusively within the animal lineage....