Many algorithms can uniformly distribute data to storage nodes in a storage system. However, it cannot avoid load imbalance because data has different popularity. To resolve this issue, we propose a novel dynamic replication scheme, namely, Active Replica Management (ARM). ARM actively establishes optimal number of copies for hotspot data according to data access behaviors and then efficiently distributes copies to other storage nodes based on current amount of copies related to hotspot data. To improve storage utilization, ARM automatically and gradually dereplicates the useless copies of hotspot data when they become nonhotspot data. ARM resolves load imbalance by allocating dynamic copies to adequate storage nodes, and hence it can prevent partial storage nodes from overburdening. Simulation results demonstrate that ARM is an efficient scheme with excellent performance on load balancing, significantly closer to Optimal Load Balancing (OLB). In addition, ARM’s performance outperforms both Static Load Balancing (SLB) and No Replica schemes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.