In order to realize the real-time and accurate prediction of dissolved oxygen concentration in the sewage treatment process, a prediction model of dissolved oxygen concentration in the sewage treatment process based on a data identification algorithm was proposed. Combined with the data characteristics of the sewage treatment process, a new sample similarity measure is defined to extract more representative modeling data. In the improved algorithm, in order to improve the quality of the initial members of the basic fireworks algorithm, the chaos algorithm is integrated. The search mechanism of the basic fireworks algorithm is improved, and the optimization process is divided into two stages based on the set criteria, and two groups are used simultaneously. The results show that compared with the basic FWA algorithm, the CFWA algorithm makes better use of the chaotic search mechanism. On the one hand, it avoids the excessive random or blind selection of the initial weight threshold of the neural network in the initial stage; on the other hand, in the optimization process of the weight threshold, two types of search mechanisms, FWA and COA, are used to give full play to their respective strengths and to continuously conduct information exchange and mutual cooperation between groups and individuals. The number of times is better than the basic FWA algorithm, and the training error and generalization error of the CFWA model in the simulation results of the soft sensor model are also better than those of the FWA model, which fully verifies the effectiveness of the CFWA algorithm. It is proved that the data recognition algorithm can effectively predict sewage treatment. It is proved that the data recognition algorithm can effectively predict the dissolved oxygen concentration in wastewater treatment process. It provides a new measurement method for some key process variables that cannot be measured or are difficult to measure in complex chemical processes.