Domain of unknown function 239 (DUF239) is a conserved sequence found in the catalytic site of Neprosins which are specific secreted prolyl endopeptidases found in the Nepenthes genus. Neprosins participate in the nitrogen cycle by digesting preys trapped in the pitcher of these carnivorous plants. Apart from that, DUF239s have been poorly documented in plants. We have identified 50 genes containing DUF239coding sequences in the Arabidopsis genome that are distributed across six distinct phylogenetic clusters. The chromosomal distribution suggests that several genes are the result of recent duplication events, with up to eight genes found in a strict tandem distribution. In Arabidopsis, most of DUF239-containing sequences are also associated to a Neprosin-activating domain (DUF4409) and an amino-terminal α-helix which corresponds to the typical domain organization of the Neprosins described in the Nepenthes genus. Analysis of Arabidopsis transcriptomic datasets reveals that 39 genes are exclusively expressed in reproductive organs, mainly during seed development and more specifically in the endosperm (23 genes). The peculiar expression pattern of the DUF239 gene family in Arabidopsis suggests new functions of Neprosin-like proteins in plants during seed development.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.