2017
DOI: 10.1107/s1600576717015229
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical clustering for multiple-crystal macromolecular crystallography experiments: the ccCluster program

Abstract: This article describes ccCluster, a software providing an intuitive graphical user interface (GUI) and multiple functions to perform hierarchical cluster analysis on multiple crystallographic datasets. The program makes it easier for users to choose, in the case of multi-crystal data collection, those datasets that will be merged together to give good final statistics. It provides a simple GUI to analyse the dendrogram and various options for automated clustering and data merging.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
44
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 36 publications
(46 citation statements)
references
References 31 publications
2
44
0
Order By: Relevance
“…pCC-Selection scaling. In parallel to unit-cell clustering, another hierarchical clustering is performed based on pairwise cross correlation (pCC) values [described by Giordano et al (2012) and Santoni et al (2017)]. Correlations between symmetry-allowed common reflections from any two datasets are calculated with the cctbx library flex module.…”
Section: Ssx Data Scaling and Merging Utilitymentioning
confidence: 99%
See 1 more Smart Citation
“…pCC-Selection scaling. In parallel to unit-cell clustering, another hierarchical clustering is performed based on pairwise cross correlation (pCC) values [described by Giordano et al (2012) and Santoni et al (2017)]. Correlations between symmetry-allowed common reflections from any two datasets are calculated with the cctbx library flex module.…”
Section: Ssx Data Scaling and Merging Utilitymentioning
confidence: 99%
“…Decoupling these two types of errors and discarding datasets with large systematic errors (non-isomorphous) is not trivial. Various concepts have been developed for crystallographic data selection, scaling and merging such as hierarchical cluster analysis (HCA) based on cross-correlation among datasets (Giordano et al, 2012;Santoni et al, 2017), unit-cell-based hierarchical clustering (BLEND; Foadi et al, 2013), genetic algorithm (CODGAS; Zander et al, 2016) and xscale_ isocluster (Diederichs, 2017). All these concepts have been successfully demonstrated on various cases to provide complete and high-quality datasets by careful data selection and merging.…”
Section: Introductionmentioning
confidence: 99%
“…The result was evaluated by POINTLESS to determine the best-fitted space group and the data merged again by XSCALE utilizing the given space group. The SciPy functions Linkage and Dendrogram (Jones et al, 2001) were utilized to perform the hierarchical clustering of the correlation coefficients from XSCALE, using the distance definition described by Giordano et al (2012) and Santoni et al (2017). The resultant dendrogram was plotted via Python matplotlib (Hunter, 2007) for further outlier removal (Fig.…”
Section: Data Processingmentioning
confidence: 99%
“…unit-cell clustering followed by correlation coefficient clustering, was found to be necessary as the correlation coefficient clustering was not able to completely distinguish the difference in unit cells. Compared with other unit-cell clustering methods that use the edge lengths (Santoni et al, 2017) or face diagonal lengths (Giordano et al, 2012), this coordinate transformation method considers both the length and orientation of the volume diagonal vector, and worked very well for the datasets presented.…”
Section: Data Processingmentioning
confidence: 99%
“…Crystal radiation damage can be outrun due to the increased dose-rate Schubert et al, 2016;Chapman et al, 2014) making it possible to collect small wedges of data at room temperature. Data collection and data processing tools have evolved around these new developments and it is now standard practice using in situ data collection to be able to obtain full structural information from partial datasets on multiple samples (Foadi et al, 2013;Zander et al, 2015;Santoni et al, 2017;Stellato et al, 2014). It is a huge step forward for many challenging projects to be able to analyse samples in situ, bypassing the additional sample manipulation associated with cryo-cooling methods.…”
Section: Introductionmentioning
confidence: 99%