Machine learning is a process in which computer is used to train and calculate input data and output results in a complex, multi task simulation. In data analysis, we can use machine learning to carry out experimental research and theoretical verification. In order to improve the ability of data analysis, we need to use machine learning and data mining methods to better process data. In this paper, experimental method and principal component analysis method are mainly used to test and discuss the fusion of machine learning in data analysis. The experimental results show that the CPU utilization rate in Scheme 4 is about 85% on average. The reason why the CPU of the Scribe center server is reduced is that after receiving data, there is less data to decompress, which reduces the CPU utilization.