Substantial evidence supports the role of the procollagen C-propeptide in the initial association of procollagen polypeptides and for triple helix formation. To evaluate the role of the propeptide domains on triple helix formation, human recombinant type I procollagen, pN-collagen (procollagen without the C-propeptides), pC-collagen (procollagen without the N-propeptides), and collagen (minus both propeptide domains) heterotrimers were expressed in Saccharomyces cerevisiae. Deletion of the N-or C-propeptide, or both propeptide domains, from both pro␣-chains resulted in correctly aligned triple helical type I collagen. Protease digestion assays demonstrated folding of the triple helix in the absence of the N-and C-propeptides from both pro␣-chains. This result suggests that sequences required for folding of the triple helix are located in the helical/ telopeptide domains of the collagen molecule. Using a strain that does not contain prolyl hydroxylase, the same folding mechanism was shown to be operative in the absence of prolyl hydroxylase. Normal collagen fibrils were generated showing the characteristic banding pattern using this recombinant collagen. This system offers new opportunities for the study of collagen expression and maturation.Collagen is the single most abundant protein found in animals. In the human body, it is expressed in most tissues and plays a structural, as well as a signaling, role in the development, maintenance, and repair of tissues and organs. 20 different collagen types are coded by more than 30 genes. Assembly of trimeric collagen intracellularly and formation of collagen fibers in the extracellular matrix is the result of a complex multistep process (1, 2). Within the endoplasmic reticulum, the individual procollagen polypeptides undergo several co-and post-translational modifications, including hydroxylation of specific prolyl and lysyl residues, selection and alignment of three procollagen polypeptides, and disulfide bond formation among the C-propeptides. Experimental evidence suggests folding of the triple helix begins at the C terminus and propagates toward the N terminus. Prior to triple helix formation, prolyl hydroxylase converts proline in the Y position of GXY triplets to hydroxyproline. Hydrogen bonding between the ␣-chains of the triple helix increases the denaturation temperature of the molecule, preventing it from unfolding at the animal's body temperature (3). Triple helical procollagen is secreted from the cell, and the N-and C-terminal propeptides are removed by specific N-and C-proteinases. The resulting collagen monomers, consisting of triple helical and telopeptide regions, undergo a self-assembly process to generate collagen fibril intermediates and then mature collagen fibers. These fibers are further stabilized by covalent cross-links within the triple helix and telopeptide regions, providing strength and support to the surrounding tissue.