2015 IEEE/ACM 2nd International Symposium on Big Data Computing (BDC) 2015
DOI: 10.1109/bdc.2015.33

Any Data, Any Time, Anywhere: Global Data Access for Science

Abstract: Data access is key to science driven by distributed high-throughput computing (DHTC), an essential technology for many major research projects such as High Energy Physics (HEP) experiments. However, achieving efficient data access becomes quite difficult when many independent storage sites are involved because users are burdened with learning the intricacies of accessing each system and keeping careful track of data location. We present an alternate approach: the Any Data, Any Time, Anywhere infrastructure. Co…

Citations: cited by 19 publications (13 citation statements)
References: 12 publications
“…The proxy service, operated by the facility administrators, has a valid credential for accessing experiment data. Data is downloaded on demand from an infrastructure like CMS's "Any Data, Any Time, Anywhere" (AAA) data federation [32]. In the current deployment, because we can guarantee all authenticated users are valid CMS members, we assume any authenticated access to the proxy service is allowed to access CMS data.…”
Section: Integration With Data Access | Citation type: mentioning
confidence: 99%
“…Due to the increasing reliability and capacity of wide area network links CMS adapted its computing model for Run-2 to allow for remote data access, which was basically excluded before. CMS commissioned a global data federation [30] that comprises all Grid Storage Elements (SE). It is sufficient to just know the logical file name (LFN) and the URL of an entry point to access any CMS file that is presently hosted on disk storage.…”
Section: CMS Configuration | Citation type: mentioning
confidence: 99%
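
The statement above says that a logical file name (LFN) plus the URL of an entry point is enough to reach a file in the federation. A minimal sketch of that idea follows, assuming an XRootD-style URL in which the host and the absolute path are separated by a double slash. The helper name `federation_url`, the host `redirector.example.org`, and the LFN are hypothetical placeholders, not values taken from the cited papers.

```python
def federation_url(entry_point: str, lfn: str) -> str:
    """Join a federation entry point and a logical file name (LFN).

    Assumes an XRootD-style layout: root://host//absolute/path
    """
    if not lfn.startswith("/"):
        raise ValueError("LFN is expected to be an absolute path")
    # Drop any trailing slash on the entry point, then add one separator
    # slash; together with the LFN's leading slash this gives the '//'.
    return entry_point.rstrip("/") + "/" + lfn


# Hypothetical entry point and LFN, for illustration only.
url = federation_url("root://redirector.example.org", "/store/data/example.root")
print(url)
```

The caller never names the storage site that holds the file; in a federation of this kind, locating a replica is the entry point's job.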
“…Almost all kinds of jobs (except for very IO intensive jobs like skims and digitization using classical/non-premixed pileup) are allowed to run at NERSC. Job input that isn't available locally will be read remotely from another CMS site [3]. CHEP 2018 Figure 2.…”
Section: HEPCloud Integration Into CMS Workflow Management | Citation type: mentioning
confidence: 99%