J. Montagnat, A. Frohner, D. Jouvenot, C. Pera et al. © 2007 Kluwer Academic Publishers. Printed in the Netherlands.
Abstract. The medical community produces and manipulates a tremendous volume of digital data that requires computerized archiving, processing and analysis. Grid infrastructures are promising for dealing with the challenges arising in computerized medicine, but the manipulation of medical data on such infrastructures faces two problems: interconnecting medical information systems with grid middleware, and preserving patients' privacy in a wide, distributed multi-user system. These constraints often limit the use of grids for manipulating sensitive medical data.
This paper describes our design of a medical data management system that takes advantage of the advanced gLite data management services, developed in the context of the EGEE project, to fulfill the stringent needs of the medical community. It ensures medical data protection through strict data access control, anonymization and encryption. The multi-level access control provides the flexibility needed for implementing complex medical use-cases. Data anonymization prevents the exposure of the most sensitive data to unauthorized users, and data encryption guarantees data protection even when the data is stored at remote sites. Moreover, the developed prototype provides a grid Storage Resource Manager (SRM) interface to standard medical DICOM servers, thereby enabling transparent access to medical data without interfering with medical practice.
Keywords: Secure grid storage, gLite middleware, medical data management
1. Context Objectives
Many scientific areas benefit from the large and distributed storage capabilities provided by grid infrastructures. On top of physical storage resources, the EGEE [17] grid data management system eases the manipulation of large data volumes and provides high-level functionality such as data distribution, replication and optimized access.
To build a data management system that can adapt to heterogeneous data storage resources (disks, tapes, silos...), the grid community has adopted standard interfaces to virtualize the underlying resources. In particular, gLite [23], the next-generation EGEE middleware, has adopted the Storage Resource Manager (SRM) interface [29] standardized in the context of the Open Grid Forum [28]. The SRM's primary concern is to provide efficient access to large volumes of data. It provides, among other services, prefetching of data files recorded on secondary storage, management of storage space and reservation of storage resources. However, it provides neither access control nor data protection, which severely limits its usability for applications manipulating sensitive data.
In this paper, we address the problem of sensitive data management on the EGEE grid infrastructure and we introduce a data management service designed to handle medical records on grids. We first motivate our approach through an in-depth requirement analysis of data management in the medical ar...
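As a rough illustration of the anonymization step named in the abstract, the following Python sketch strips identifying fields from an image header and replaces the patient identifier with a keyed pseudonym. It is not the paper's implementation: the `anonymize` helper, the short list of identifying fields and the choice of HMAC-SHA256 are assumptions made for illustration only.

```python
import hashlib
import hmac

# Hypothetical list of identifying header fields (assumption);
# a real de-identification profile covers many more attributes.
IDENTIFYING_FIELDS = {"PatientName", "PatientBirthDate", "PatientAddress"}


def anonymize(header: dict, secret_key: bytes) -> dict:
    """Return a copy of `header` without identifying fields.

    The patient ID is replaced by a keyed pseudonym so that records of
    the same patient can still be linked by holders of `secret_key`.
    """
    # Drop the directly identifying fields; the input dict is not modified.
    cleaned = {k: v for k, v in header.items() if k not in IDENTIFYING_FIELDS}
    if "PatientID" in cleaned:
        digest = hmac.new(
            secret_key,
            str(cleaned["PatientID"]).encode("utf-8"),
            hashlib.sha256,
        )
        # Truncated hex digest used as the pseudonym (illustrative choice).
        cleaned["PatientID"] = digest.hexdigest()[:16]
    return cleaned
```

Calling `anonymize({"PatientID": "12345", "PatientName": "Doe^Jane", "Modality": "MR"}, key)` keeps `Modality`, removes `PatientName`, and yields the same pseudonym for the same patient ID and key on every call.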
The Large Hadron Collider (LHC) is preparing for data taking at the end of 2009. The Worldwide LHC Computing Grid (WLCG) provides data storage and computational resources for the high energy physics community. Operating the heterogeneous WLCG infrastructure, which integrates 140 computing centers in 33 countries all over the world, is a complicated task. Reliable monitoring is one of the crucial components of the WLCG for providing the functionality and performance that is required by the LHC experiments. The Experiment Dashboard system provides monitoring of the WLCG infrastructure from the perspective of the LHC experiments and covers the complete range of their computing activities. This work describes the architecture of the Experiment Dashboard system and its main monitoring applications and summarizes current experiences by the LHC experiments, in particular during service challenges performed on the WLCG over the last years.
The Disk Pool Manager (DPM) is a lightweight solution for grid-enabled disk storage management. Operated at more than 240 sites, it has the widest distribution of all grid storage solutions in the WLCG infrastructure. It provides an easy way to manage and configure disk pools, and exposes multiple interfaces for data access (rfio, xroot, nfs, gridftp and http/dav) and control (srm). During the last year we have been working on providing stable, high-performance data access to our storage system using standard protocols, while extending the storage management functionality and adapting both configuration and deployment procedures to reuse common building blocks. In this contribution we cover in detail the extensive evaluation we have performed of our new HTTP/WebDAV and NFS 4.1 frontends, in terms of functionality and performance. We summarize the issues we faced and the solutions we developed to turn them into valid alternatives to the existing grid protocols, namely the additional work required to provide multi-stream transfers for high-performance wide area access, support for third-party copies, credential delegation, and the required changes in the experiment and fabric management frameworks and tools. We describe new functionality that has been added to ease system administration, such as different filesystem weights and a faster disk drain, and new configuration and monitoring solutions based on the industry standards Puppet and Nagios. Finally, we explain some of the internal changes we had to make in the DPM architecture to better handle the additional load from the analysis use cases.
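The abstract does not say how filesystem weights are applied. One plausible reading, sketched below in Python, is that a new file is placed on a filesystem chosen with probability proportional to its weight. The `pick_filesystem` helper and the filesystem names are hypothetical and are not taken from DPM.

```python
import random


def pick_filesystem(weights: dict, rng: random.Random) -> str:
    """Pick a filesystem name with probability proportional to its weight.

    Filesystems with a weight of zero (or less) are never selected.
    This is an illustrative assumption, not DPM's actual selection code.
    """
    candidates = [(name, w) for name, w in weights.items() if w > 0]
    if not candidates:
        raise ValueError("no filesystem with a positive weight")
    names, positive_weights = zip(*candidates)
    # random.Random.choices performs weighted sampling with replacement.
    return rng.choices(names, weights=positive_weights, k=1)[0]
```

With weights such as `{"fs1": 1, "fs2": 3, "fs3": 0}`, `fs2` would receive roughly three times as many new files as `fs1`, and `fs3` would receive none, which is one way an administrator could steer writes away from a nearly full or soon-to-be-drained disk.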