Therapeutic proteins and antibodies represent a $125 billion annual market. Chinese Hamster Ovary (CHO) derived cell lines are the preferred host cells for the production of therapeutic proteins. Here, we present a draft genomic sequence of the CHO-K1 ancestral cell line. The assembly comprises 2.45Gb genomic sequence with 24,383 predicted genes. We associate most scaffolds to 21 microfluidically-isolated chromosomes to identify chromosomal locations of genes. Furthermore, we investigate genes involved in glycosylation, which affects therapeutic protein quality, and viral susceptibility genes, which affect cell engineering and regulatory concerns. Specifically, homologs for most human glycosylation-associated genes are identified in the CHO-K1 genome, although 141 are not expressed under exponential growth. In addition, many important viral entry genes are present in the genome but not expressed, which may explain the unusual viral resistance property of CHO cell lines. We demonstrate how the availability of this genome sequence may facilitate genome-scale science for biopharmaceutical protein production.
Chinese hamster ovary (CHO) cells, first isolated in 1957, are the preferred production host for many therapeutic proteins. Although genetic heterogeneity among CHO cell lines has been well documented, a systematic, nucleotide-resolution characterization of their genotypic differences has been stymied by the lack of a unifying genomic resource for CHO cells. Here we report a 2.4-Gb draft genome sequence of a female Chinese hamster, Cricetulus griseus, harboring 24,044 genes. We also resequenced and analyzed the genomes of six CHO cell lines from the CHO-K1, DG44 and CHO-S lineages. This analysis identified hamster genes missing in different CHO cell lines, and detected >3.7 million single-nucleotide polymorphisms (SNPs), 551,240 indels and 7,063 copy number variations. Many mutations are located in genes with functions relevant to bioprocessing, such as apoptosis. The details of this genetic diversity highlight the value of the hamster genome as the reference upon which CHO cells can be studied and engineered for protein production.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.