In order to improve the effect of spoken English training, this paper combines multimedia information technology to reform the teaching of spoken English training, and integrates BP neural network English into spoken English training. Moreover, this paper combines the actual needs of spoken English training and the teaching framework of the multimedia system to construct the data set, clean up the data set, and implement the word vector representation of students and professionals. In addition, this paper constructs the entire system framework of the spoken English resource recommendation algorithm based on the graph convolutional neural network, and combines the BP deep neural network algorithm to construct the spoken English training system. Finally, this paper designs an experiment to evaluate the effect of this system. The experimental research results show that the multimedia based on the BP deep neural network proposed in this paper has a good effect in the application research of spoken English training, and can effectively promote the effect of spoken English training of students.