PM2.5 (Particulate Matter) and PM10 are the most common pollutants, and the increasing of concentration in the air will threaten people’s health. The machine learning method has recently been of particular interest to many researchers due to its effectiveness in air quality prediction models. Many solutions employing deep learning-based techniques such as Convolutional Neural Networks (CNN), Long Short-Term Memory (LSTM), and hybrid CNN-LSTM models to enhance air quality prediction accuracy have been developed. This paper proposes a hybrid Encoder STM model for predicting the next day to the next five days’ PM2.5 and PM10 concentrations in Hanoi. Additionally, we proposed five extended features to increase the accuracy of prediction. Then other models, namely the LSTM model and the Bidirectional LSTM model, are also considered for PM2.5 and PM10 concentration prediction. Our results show that the proposed approaches outperform other state-of-the-art deep learning-based methods on both Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) due to low error and the small number of features.