Structural genomics has as its goal the provision of structural information for all possible ORF sequences through a combination of experimental and computational approaches. The access to genome sequences and cloning resources from an ever-widening array of organisms is driving high-throughput structural studies by the New York Structural Genomics Research Consortium. In this report, we outline the progress of the Consortium in establishing its pipeline for structural genomics, and some of the experimental and bioinformatics efforts leading to structural annotation of proteins. The Consortium has established a pipeline for structural biology studies, automated modeling of ORF sequences using solved (template) structures, and a novel high-throughput approach (metallomics) to examining the metal binding to purified protein targets. The Consortium has so far produced 493 purified proteins from >1077 expression vectors. A total of 95 have resulted in crystal structures, and 81 are deposited in the Protein Data Bank (PDB). Comparative modeling of these structures has generated >40,000 structural models. We also initiated a high-throughput metal analysis of the purified proteins; this has determined that 10%-15% of the targets contain a stoichiometric structural or catalytic transition metal atom. The progress of the structural genomics centers in the U.S. and around the world suggests that the goal of providing useful structural information on most all ORF domains will be realized. This projected resource will provide structural biology information important to understanding the function of most proteins of the cell.The complete genomes of a number of organisms have been sequenced and many more are underway. This progress in gene sequencing has shifted the landscape of biology, such that goals related to understanding the structure and function of each gene product, as well as their interactions within the cellular environment that lead to the behavior of complex systems are within reach, or at least to be contemplated. The sequencing of model organisms from bacterial species to human has allowed the identification of genes both essential to function, as well as genes that give rise to the diversity of life forms. Although the exact numbers and natures of the genes is still open to question, recent estimates place the numbers at <20,000 for Caenorhabditis elegans and Caenorhabditis briggsae and ∼30,000 for humans (Waterston et al. 2002;Stein et al. 2003). Our ability to recognize genes, their exons and introns, and their potential splice variants, has matured dramatically. This progress has driven highly successful attempts to develop resources to make available ORFs for rapid and highly parallel structural and functional studies of genes (Reboul et al. 2003). The success of these efforts are outlined in this issue, and the leveraging of these ORF sequences to examine protein activity, localization, protein structure, and proteinprotein interactions are examples of the value of these resources. Structural biology fa...