The
Architecture descriptionThe ASCI grid architecture provides the software infrastructure for an integrated information and simulation environment for the nuclear weapons complex in 2004. The Distributed Resource Management (DRM) project provides grid services to higher level applications. The initial implementation has two development thrusts. Fkst, ASCI requires a core integrated environment for job submission to classified ASCI platforms at the three weapons laboratories:Los Alamos (LANL), Lawrence Livermore (LLNL), and Sandia (SNL). G1obus provides these core services, with some extensions for specific ASCI requirements. Second, a software layer above the core services provides a set of common capabilities that support higher-level problem solving environmen( PSES). These capabilities include complex work management and resource brokering. This architecture demonstrates the following features: The use of Globus services for accessing resources in an independent ASCI environment Kerberos-based authentication, grid information authorization and accesss control A Globus interface to the two-tier architecture of Sandia's Computational Plant (Cplantm), a distributed computing resource built of commodity parts CORBA-based work management services to support the coordinated use of multiple resources and complex task sequencing CORBA-based criteria-driven resource brokering that extends job requestiresource matching to support software resources and load-balancing Java-based client for generic work requests Web-based monitoring Integration of the grid infrastructure with CORBA, Java, and Globus-based PSES and tools.The DRM grid services address three aspects of the shift from platform-centric to network-centric computing: software resources, work flow, and co-scheduling. Software resources serve a large class of users and problems where many resources are suitable, and the primary interest is how soon the results can be obtained. Some simulation codes are run by a number of users at different sites, and must be available for many of the grid computing resources. Multiple versions of these codes are maintained to support different users and to ensure reproducibility of results. Requests to "run code X" are satisfied by software resources and brokering. Workflow services manage complex task sequencing in the network environment, analogous to the complex scripts often submitted in a platform environment. Workflow can coordinate computational processing steps, computation and visualization, computation and data movement, or other resource usage. Subtasks are scheduled independently.'Sartdiit is a multiprogram laboratory operated by .%ndia Corporation, a