Background The Chinese National Infrastructure of Cell Line stores and distributes cell lines for biomedical research in China. This study aims to represent and integrate the information of NICR cell lines into the community-based Cell Line Ontology (CLO). Results We have aligned, represented, and added all identified 2704 cell line cells in NICR to CLO. We also proposed new ontology design patterns to represent the usage of cell line cells as disease models by inducing tumor formation in model organisms, and the relations between cell line cells and their expressed or overexpressed genes or proteins. The resulting CLO-NICR ontology also includes the Chinese representation of the NICR cell line information. CLO-NICR was merged into the general CLO. To serve the cell research community in China, the Chinese version of CLO-NICR was also generated and deposited in the OntoChina ontology repository. The usage of CLO-NICR was demonstrated by DL query and knowledge extraction. Conclusions In summary, all identified cell lines from NICR are represented by the semantics framework of CLO and incorporated into CLO as a most recent update. We also generated a CLO-NICR and its Chinese view (CLO-NICR-Cv). The development of CLO-NICR and CLO-NIC-Cv allows the integration of the cell lines from NICR into the community-based CLO ontology and provides an integrative platform to support different applications of CLO in China.
AbstractcaArray is an open-source, open development, web and programmatically accessible array data management system developed at National Cancer Institute. It was developed to support the exchange of array data across the Cancer Biomedical Informatics Grid (caBIG™), a collaborative information network that connect scientists and practitioners through a shareable and interoperable infrastructure to share data and knowledge. caArray adopts a federated model of local installations, in which data deposited are shareable across caBIG™. Comprehensive in annotation yet easy to use has always been a challenge to any data repository system. To alleviate this difficulty, caArray accepts data upload using the MAGE-TAB, a spreadsheet-based format for annotating and communicating microarray data in a MIAME-compliant fashion ("http://www.mged.org/mage-tab":http://www.mged.org/mage-tab). MAGE-TAB is built on community standards – MAGE, MIAME, and Ontology. The components and work flow of MAGE-TAB files are organized in such a way which is already familiar to bench scientists and thus minimize the time and frustration of reorganizing their data before submission. The MAGE-TAB files are also structured to be machine readable so that they can be easily parsed into database. Users can control public access to experiment- and sample-level data and can create collaboration groups to support data exchange among a defined set of partners. All data submitted to caArray at NCI will go through strict curation by a group of scientists against these standards to make sure that the data are correctly annotated using proper controlled vocabulary terms and all required information are provided. Two of mostly used ontology sources are MGED ontology ("http://mged.sourceforge.net/ontologies/MGEDontology.php":http://mged.sourceforge.net/ontologies/MGEDontology.php) and NCI thesaurus ("http://nciterms.nci.nih.gov/NCIBrowser/Dictionary.do":http://nciterms.nci.nih.gov/NCIBrowser/Dictionary.do). The purpose of data curation is to ensure easy comparison of results from different labs and unambiguous report of results. Data will also undergo automatic validation process before parsed into database, in which minimum information requirement and data consistency with the array designs are checked. Files with error found during validation are flagged with error message. Curators will re-examine those files and make necessary corrections before re-load the files. The iteration repeats until files are validated successfully. Data are then imported into the system and ready for access through the portal or through API. Interested parties are encouraged to review the installation package, documentation, and source code available from "http://caarray.nci.nih.gov":http://caarray.nci.nih.gov.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.