2011
DOI: 10.1088/1742-6596/331/6/062018
|View full text |Cite
|
Sign up to set email alerts
|

Popularity framework to process dataset traces and its application on dynamic replica reduction in the ATLAS experiment

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
7
1

Relationship

4
4

Authors

Journals

citations
Cited by 12 publications
(14 citation statements)
references
References 2 publications
0
14
0
Order By: Relevance
“…These datasets are produced from the RAW dataset that was the most popular among RAW datasets based on information from DQ2 Popularity service on September 2011 [3]: "data11_7TeV.00184130.physics_JetTauEtmiss.merge.RAW". The obtained data will help to track the behavior of derived datasets usage.…”
Section: Evaluation Of Panda Datamentioning
confidence: 99%
See 1 more Smart Citation
“…These datasets are produced from the RAW dataset that was the most popular among RAW datasets based on information from DQ2 Popularity service on September 2011 [3]: "data11_7TeV.00184130.physics_JetTauEtmiss.merge.RAW". The obtained data will help to track the behavior of derived datasets usage.…”
Section: Evaluation Of Panda Datamentioning
confidence: 99%
“…Data deletion is also demand driven (by the Replica Reduction Agent), reducing the numbers of replicas for unpopular data to get space for more popular data [3]. This dynamic model has led to substantial improvements in efficient utilization of storage and processing resources [4].…”
Section: Introductionmentioning
confidence: 99%
“…The popularity service described in reference [9] uses the recorded trace messages to analyse the popular and unpopular files, and support dynamic replica reduction.…”
Section: Popularity and Dynamic Deletion Of Unpopular Replicamentioning
confidence: 99%
“…The strategy was first to distribute the minimal replicas as 'primary' and some extra as 'secondary' that are foreseen to be used, adding more 'secondary' replicas following the usage and needs, and removing unused 'secondary' replicas to ensure enough free space for further prompt replication, especially for new data or popular datasets. A system to measure data set popularity was established that recorded the number of accesses per dataset and per file from different activities, and another system for auto-cleaning that selects dataset replicas to be deleted based on the popularity accounting was developed for that purpose [8,10].…”
Section: Data Distribution Over the Gridmentioning
confidence: 99%