“…The data grow exponentially in the widely used applications, such as IoT embeddings, artificial intelligence and cloud computing, which require efficient and large-scale storage capacities [7,17,18]. To save space and improve storage efficiency, data deduplication [31,41] becomes an efficient middleware to eliminate the duplicate data, and has been widely used in current storage systems [11, 24-26, 32, 39], especially for storage backup systems [14,19,30].…”