Motivation: The identification of suitable conditions for crystallization is a rate-limiting step in protein structure determination. The pH of an experiment is an important parameter and has the potential to be used in data-mining studies to help reduce the number of crystallization trials required. However, the pH is usually recorded as that of the buffer solution, which can be highly inaccurate.Results: Here, we show that a better estimate of the true pH can be predicted by considering not only the buffer pH but also any other chemicals in the crystallization solution. We use these more accurate pH values to investigate the disputed relationship between the pI of a protein and the pH at which it crystallizes.Availability and implementation: Data used to generate models are available as Supplementary Material.Contact:
julie.wilson@york.ac.ukSupplementary information:
Supplementary data are available at Bioinformatics online.
In a large-scale study using data from the Protein Data Bank, some of the many reported findings regarding the crystallization of proteins were investigated.
The crystallization of proteins is dependent on the careful control of numerous parameters, one of these being pH. The pH of crystallization is generally reported as that of the buffer; however, the true pH has been found to be as many as four pH units away. Measurement of pH with a meter is time-consuming and requires the reformatting of the crystallization solution. To overcome this, a high-throughput method for pH determination of buffered solutions has been developed with results comparable to those of a pH meter.
In macromolecular crystallization, success is often dependent on the pH of the experiment. However, little is known about the pH of reagents used, and it is generally assumed that the pH of the experiment will closely match that of any buffering chemical in the solution. We use a large dataset of experimentally measured solution pH values to show that this assumption can be very wrong and generate a model that can be used to successfully predict the overall solution pH of a crystallization experiment. Furthermore, we investigate the time dependence of the pH of some polyethylene glycol polymers widely used in protein crystallization under different storage conditions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.