In the pharmaceutical industry, chemometrics is rapidly establishing itself as a tool that can be used at every step of product development and beyond: from early development to commercialization. This set of multivariate analysis methods allows the extraction of information contained in large, complex data sets thus contributing to increase product and process understanding which is at the core of the Food and Drug Administration's Process Analytical Tools (PAT) Guidance for Industry and the International Conference on Harmonisation's Pharmaceutical Development guideline (Q8). This review is aimed at providing pharmaceutical industry professionals an introduction to multivariate analysis and how it is being adopted and implemented by companies in the transition from "quality-by-testing" to "quality-by-design". It starts with an introduction to multivariate analysis and the two methods most commonly used: principal component analysis and partial least squares regression, their advantages, common pitfalls and requirements for their effective use. That is followed with an overview of the diverse areas of application of multivariate analysis in the pharmaceutical industry: from the development of real-time analytical methods to definition of the design space and control strategy, from formulation optimization during development to the application of quality-by-design principles to improve manufacture of existing commercial products.
Near-infrared spectroscopy (NIRS) is known to be a suitable technique for rapid fermentation monitoring. Industrial fermentation media are complex, both chemically (ill-defined composition) and physically (multiphase sample matrix), which poses an additional challenge to the development of robust NIRS calibration models. We investigated the use of NIRS for at-line monitoring of the concentration of clavulanic acid during an industrial fermentation. An industrial strain of Streptomyces clavuligerus was cultivated at 200-L scale for the production of clavulanic acid. Partial least squares (PLS) regression was used to develop calibration models between spectral and analytical data. In this work, two different variable selection methods, genetic algorithms (GA) and PLS-bootstrap, were studied and compared with models built using all the spectral variables. Calibration models for clavulanic acid concentration performed well both on internal and external validation. The two variable selection methods improved the predictive ability of the models up to 20%, relative to the calibration model built using the whole spectra. ß 2005 Wiley Periodicals, Inc.
Bootstrap-based methods have been applied for spectral variable selection in near (NIR) and mid-infrared (MIR) spectroscopy applications. In this paper, an extension of those methods for the selection of spectral intervals instead of single spectral variables is proposed. This approach, interval partial least square (PLS)-Bootstrap (iPLS-Bootstrap), was compared against the PLS-Bootstrap method and the use of the whole spectral region for model development. These methods were tested on a NIR spectral dataset obtained from at-line monitoring of an industrial fermentation process, by correlating the spectra with the concentration of the active pharmaceutical ingredient (API). The performance of the models was evaluated based on the predictive ability for both cross-validation and external validation. For the dataset used, iPLS-Bootstrap enabled to improve the model predictive ability, with a greater impact on external validation. The decrease observed in RMSEP relative to the full-spectrum and PLS-Bootstrap model was, respectively, 14 and 6%.
Purpose: Understanding and predicting the flow of bulk pharmaceutical materials could be key in enabling pharmaceutical manufacturing by continuous direct compression (CDC). This study examines whether, by taking powder and bulk measurements, and using statistical modelling, it would be possible the flow of a range of materials likely to be used in CDC. Methods: More than 100 materials were selected for study, from four pharmaceutical companies. Particle properties were measured by static image analysis, powder surface area and surface energy techniques, and flow by shear cell measurements. The data was then analysed and a range of statistical modelling techniques were used, to build predictive models for flow. Results: Using the results from static image analysis a model could be built which allowed the prediction of likely flow in a shear cell, which can be related to performance in a CDC system. Only a small amount of powder was required for the image analysis. Surface area did not add to the precision of the model, and the available surface energy technique did not correlate with flow. Conclusions: A small sample of powder can be examined by Static image analysis, and this data can be used to give an early read on likely flow of a material in a CDC system or other pharmaceutical process, allowing early intervention (if necessary) to improve the characteristics of a material, early in development.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.