Big data has become popular for processing, storing and managing massive volumes of data. The clustering of datasets has become a challenging issue in the field of big data analytics. The K-means algorithm is best suited for finding similarities between entities based on distance measures with small datasets. Existing clustering algorithms require scalable solutions to manage large datasets. This study presents two approaches to the clustering of large datasets using MapReduce. The first approach, K-Means Hadoop MapReduce (KM-HMR), focuses on the MapReduce implementation of standard K-means. The second approach enhances the quality of clusters to produce clusters with maximum intra-cluster and minimum inter-cluster distances for large datasets. The results of the proposed approaches show significant improvements in the efficiency of clustering in terms of execution times. Experiments conducted on standard K-means and proposed solutions show that the KM-I2C approach is both effective and efficient.
This paper presents a robust and blind watermarking scheme for copyright protection of images in discrete wavelet transform domain based on the support vector machines (SVMs). This scheme is based on the relation between the coefficients in various sub bands in discrete wavelet transform decomposition. The proposed scheme is very secured and robust to various attacks, viz., Low pass Filtering, Salt & Pepper noise, Gamma Correction, JPEG Compression, Row-Column Copying, Row-column blanking, Bit plane removal, Cropping, Resize and Histogram Equalization etc. Experimental results show that the proposed scheme has significant improvements in both robustness and imperceptibility and superior to an algorithm proposed by Li et al. in terms of Normalized Cross correlation (NC) and Peak Signal to Noise Ratio (PSNR).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.