The Chemistry Development Kit (CDK) is a freely available open-source Java library for Structural Chemo-and Bioinformatics. Its architecture and capabilities as well as the development as an open-source project by a team of international collaborators from academic and industrial institutions is described. The CDK provides methods for many common tasks in molecular informatics, including 2D and 3D rendering of chemical structures, I/O routines, SMILES parsing and generation, ring searches, isomorphism checking, structure diagram generation, etc. Application scenarios as well as access information for interested users and potential contributors are given.
The Chemistry Development Kit (CDK) is a freely available open-source Java library for Structural Chemoand Bioinformatics. Its architecture and capabilities as well as the development as an open-source project by a team of international collaborators from academic and industrial institutions is described. The CDK provides methods for many common tasks in molecular informatics, including 2D and 3D rendering of chemical structures, I/O routines, SMILES parsing and generation, ring searches, isomorphism checking, structure diagram generation, etc. Application scenarios as well as access information for interested users and potential contributors are given.
The mass spectrometry (MS)-based analysis of free polysaccharides and glycans released from proteins, lipids and proteoglycans increasingly relies on databases and software. Here, we review progress in the bioinformatics analysis of protein-released N- and O-linked glycans (N- and O-glycomics) and propose an e-infrastructure to overcome current deficits in data and experimental transparency. This workflow enables the standardized submission of MS-based glycomics information into the public repository UniCarb-DR. It implements the MIRAGE (Minimum Requirement for A Glycomics Experiment) reporting guidelines, storage of unprocessed MS data in the GlycoPOST repository and glycan structure registration using the GlyTouCan registry, thereby supporting the development and extension of a glycan structure knowledgebase.
Formalin-fixed paraffin-embedded (FFPE) tissue is considered as an appropriate alternative to frozen/fresh tissue for proteomic analysis. Here we study formalin-induced alternations on a proteome-wide level. We compared LC-MS/MS data of FFPE and frozen human kidney tissues by two methods. First, clustering analysis revealed that the biological variation is higher than the variation introduced by the two sample processing techniques and clusters formed in accordance with the biological tissue origin and not with the sample preservation method. Second, we combined open modification search and spectral counting to find modifications that are more abundant in FFPE samples compared to frozen samples. This analysis revealed lysine methylation (+14 Da) as the most frequent modification induced by FFPE preservation. We also detected a slight increase in methylene (+12 Da) and methylol (+30 Da) adducts as well as a putative modification of +58 Da, but they contribute less to the overall modification count. Subsequent SEQUEST analysis and X!Tandem searches of different datasets confirmed these trends. However, the modifications due to FFPE sample processing are a minor disturbance affecting 2-6% of all peptide-spectrum matches and the peptides lists identified in FFPE and frozen tissues are still highly similar.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.