Abstract:Multivariate statistical analysis using principal components can reveal patterns and structures within a data set and give insights into process performance and operation. The output medium is usually a two dimensional screen, however, so it is a challenge to visualize the multidimensional structure of a data set by means of a two-dimensional plot. A method of visualization is described in the form of a hierarchical classification tree that can be used to view the structure within a multivariate principal comp… Show more
“…Methods for principal component analysis have been widely documented. The method used in this article was derived from Thornhill et al and uses singular value decomposition. The output data from the simulations were first transformed by mean centering and stored as a comma separated variable file which was then imported.…”
Section: Methodsmentioning
confidence: 99%
“…re‐routing to larger tanks) and the Monte Carlo simulations were repeated with the improved process. The complexity of each dataset was then reduced using principal component analysis combined with clustering to reduce the dimensionality and eliminate noise . The results were compared to assess the impact of the process change.…”
Section: Methodsmentioning
confidence: 99%
“…Although PCA might reduce hundreds of datasets to a few principal components, it does not automatically identify clusters of batches with similar characteristics for further examination. Hence in this article, algorithms adapted from Thornhill et al are used to achieve hierarchical and k ‐means clustering of the datasets so as to identify significant clusters of batches. Multidimensional visualization of each cluster's characteristics in terms of the raw data (e.g.…”
ABSTRACT:This paper describes a decision-support tool that integrates Monte Carlo simulation data derived using a stochastic discrete-event simulation model to mimic process fluctuations with advanced multivariate statistical techniques to help pinpoint the potential root causes of sub-optimal short term facility fit issues. Principal component analysis combined with clustering algorithms was used to analyse the complex datasets from complete industrial batch processes for biopharmaceuticals. The challenge of visualising the multidimensional nature of the dataset was addressed using hierarchical and K-means clustering as well as parallel co-ordinate plots to help identify process fingerprints and characteristics of clusters leading to sub-optimal facility fit issues. Industrially-relevant case studies are presented that focus on technology transfer challenges for therapeutic antibodies moving from early phase to late phase clinical trials. The case study details how sub-optimal facility fit can be alleviated by allocating alternative product pool tanks. The impact of this operational change is then assessed by reviewing an updated process fingerprint.
“…Methods for principal component analysis have been widely documented. The method used in this article was derived from Thornhill et al and uses singular value decomposition. The output data from the simulations were first transformed by mean centering and stored as a comma separated variable file which was then imported.…”
Section: Methodsmentioning
confidence: 99%
“…re‐routing to larger tanks) and the Monte Carlo simulations were repeated with the improved process. The complexity of each dataset was then reduced using principal component analysis combined with clustering to reduce the dimensionality and eliminate noise . The results were compared to assess the impact of the process change.…”
Section: Methodsmentioning
confidence: 99%
“…Although PCA might reduce hundreds of datasets to a few principal components, it does not automatically identify clusters of batches with similar characteristics for further examination. Hence in this article, algorithms adapted from Thornhill et al are used to achieve hierarchical and k ‐means clustering of the datasets so as to identify significant clusters of batches. Multidimensional visualization of each cluster's characteristics in terms of the raw data (e.g.…”
ABSTRACT:This paper describes a decision-support tool that integrates Monte Carlo simulation data derived using a stochastic discrete-event simulation model to mimic process fluctuations with advanced multivariate statistical techniques to help pinpoint the potential root causes of sub-optimal short term facility fit issues. Principal component analysis combined with clustering algorithms was used to analyse the complex datasets from complete industrial batch processes for biopharmaceuticals. The challenge of visualising the multidimensional nature of the dataset was addressed using hierarchical and K-means clustering as well as parallel co-ordinate plots to help identify process fingerprints and characteristics of clusters leading to sub-optimal facility fit issues. Industrially-relevant case studies are presented that focus on technology transfer challenges for therapeutic antibodies moving from early phase to late phase clinical trials. The case study details how sub-optimal facility fit can be alleviated by allocating alternative product pool tanks. The impact of this operational change is then assessed by reviewing an updated process fingerprint.
“…Other measures reported in the literature include the cosine similarity [21] and related correlation measure [22], and Dynamic Time Warp (DTW) [23]. However, these measures have properties which are less desirable for the detection of transient disturbances.…”
Section: Detecting Anomalous Segments With Nearest Neighborsmentioning
“…Stonier et al [13] describe work where a database -driven simulation tool is used to mimic the stochastic nature of industrial mAb manufacturing processes when transferred to large -scale facilities using Monte Carlo simulation. The complexity of each dataset was then reduced using PCA combined with clustering to reduce the dimensionality and eliminate noise [17] . The tool predicted an unacceptable likelihood of product loss upon the transfer.…”
Section: Predicting Short -Term Facility Fit Upon Tech Transfer To Lamentioning
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.