The whey acidic protein (WAP) domain is a conserved motif, containing eight cysteines found in a characteristic 4-disulphide core arrangement, that is present in a number of otherwise unrelated proteins. WAP motifs are present in SLPI and ela®n, two antiproteinases located on chromosome 20q12-13, in a locus rich in poorly characterized WAP domain proteins. One of these proteins, which contains two WAP domains, is HE4 (also known as WFDC2), originally described as an epididymis speci®c protein but more recently suggested to be a putative serum tumour marker for ovarian cancer. We have shown that HE4 is expressed in a number of normal human tissues outside of the male reproductive system, including regions of the respiratory tract and nasopharynx, as well as in a subset of lung tumour cell lines. Comparison of multiple HE4 cDNAs and RT ± PCR products with genomic sequence allowed the elucidation of the genomic organization. These studies revealed that HE4 can undergo a complex series of alternative splicing events that can potentially yield ®ve distinct WAP domain containing protein isoforms. These results cast doubt on the potential role of HE4 as a serum tumour marker speci®c for ovarian cancer and open the door to understanding the function of multiple WAP domain containing protein isoforms arising from a single gene.
Background: The Whey Acidic Protein domain is an evolutionarily conserved motif found in a number of proteins, the best studied of which are antiproteinases involved in the innate immune defence of multiple epithelia. We recently characterised the WFDC2 gene which encodes a two WAP domain-containing protein, initially suggested as a marker for epididymis, and showed that it is highly expressed in the lung and salivary gland. The precise location of WFDC2 protein in these sites has not been described.
The PLUNC family of human proteins are candidate host defense proteins expressed in the upper airways. The family subdivides into short (SPLUNC) and long (LPLUNC) proteins, which contain domains predicted to be structurally similar to one or both of the domains of bactericidal/permeability-increasing protein (BPI), respectively. In this article we use analysis of the human, mouse, and rat genomes and other sequence data to examine the relationships between the PLUNC family proteins from humans and other species, and between these proteins and members of the BPI family. We show that PLUNC family clusters exist in the mouse and rat, with the most significant diversification in the locus occurring for the short PLUNC family proteins. Clear orthologous relationships are established for the majority of the proteins, and ambiguities are identified. Completion of the prediction of the LPLUNC4 proteins reveals that these proteins contain approximately a 150-residue insertion encoded by an additional exon. This insertion, which is predicted to be largely unstructured, replaces the structure homologous to the 40s hairpin of BPI. We show that the exon encoding this region is anomalously variable in size across the LPLUNC proteins, suggesting that this region is key to functional specificity. We further show that the mouse and human PLUNC family orthologs are evolving rapidly, which supports the hypothesis that these proteins are involved in host defense. Intriguingly, this rapid evolution between the human and mouse sequences is replaced by intense purifying selection in a large portion of the N-terminal domain of LPLUNC4. Our data provide a basis for future functional studies of this novel protein family.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.