To derive structural information about the vesicular stomatitis virus (VSV) nucleocapsid (N) protein, the N protein and the VSV phosphoprotein (P protein) were expressed together in Escherichia coli. The N and P proteins formed soluble protein complexes of various molar ratios when coexpressed. The major N/P protein complex was composed of 10 molecules of the N protein, 5 molecules of the P protein, and an RNA. A soluble N protein-RNA oligomer free of the P protein was isolated from the N/P protein-RNA complex using conditions of lowered pH. The molecular weight of the N protein-RNA oligomer, 513,879, as determined by analytical ultracentrifugation, showed that it was composed of 10 molecules of the N protein and an RNA of approximately 90 nucleotides. The N protein-RNA oligomer had the appearance of a disk with outer diameter, inner diameter, and thickness of 148 ؎ 10 Å, 78 ؎ 9 Å, and 83 ؎ 8 Å, respectively, as determined by electron microscopy. RNA in the complexes was protected from RNase digestion and was stable at pH 11. This verified that N/P protein complexes expressed in E. coli were competent for encapsidation. In addition to coexpression with the full-length P protein, the N protein was expressed with the C-terminal 72 amino acids of the P protein. This portion of the P protein was sufficient for binding to the N protein, maintaining it in a soluble state, and for assembly of N protein-RNA oligomers. With the results provided in this report, we propose a model for the assembly of an N/P protein-RNA oligomer.Vesicular stomatitis virus (VSV) is a nonsegmented, negative-stranded RNA virus belonging to the rhabdovirus family. The 11,161-nucleotide genome of VSV (21) contains five genes and is encapsidated by the nucleocapsid (N) protein to form the ribonucleoprotein (RNP) complex. Associated with the RNP are the phosphoprotein (P protein) and the large polymerase subunit (L protein) (12). These three components are the transcriptionally active unit of the virus (12). The two remaining genes encode the matrix (M) protein and glycoprotein (G). The five genes are arranged in the order 3Ј N-P-M-G-L 5Ј, and the relative abundance of each individual transcript is related to its genomic position. The amounts of the individual mRNAs decrease with distance from the 3Ј end of the genome due to a single polymerase entry site at the 3Ј end of the genome (14) and localized attenuation at each gene junction (3,25,46). Thus, the N and P mRNAs are produced in the greatest abundance. The two proteins encoded by these genes form complexes in virus-infected cells (36, 39) and in vitro when expressed simultaneously (8,28,34,40). In each case, the N/P protein complexes were observed as multiple species with various molar ratios. The optimal molar ratio for efficient viral replication was found to be between 1:1 and 2:1 (N to P protein), while ratios substantially above or below this range had a negative effect on replication (28,34,36,40,47). The P protein when provided in an appropriate molar ratio to the N protein can preven...