The biomimetic electronic nose (e-nose) is an emerging technology for identifying and monitoring complex gas molecules. However, because gas mixtures are complex and varied, traditional regression algorithms predict gas concentrations from e-nose data with limited accuracy. This paper addresses that difficulty with a fusion network model that combines a transformer-based multikernel feature fusion (TMKFF) module with a 1DCNN_LSTM network to improve the regression prediction of gas mixture concentrations measured by a portable electronic nose. Experimental results show that the fusion network significantly outperforms single models such as the convolutional neural network (CNN) and the long short-term memory (LSTM) network. The model accurately predicts the concentrations of multiple target gases, such as SO₂, NO₂, and CO, in a gas mixture, and it is particularly beneficial for predicting low-concentration SO₂. The coefficients of determination (R²) are 93%, 98%, and 99%, respectively, indicating that the model explains most of the variation in the target gas concentrations. The root-mean-square errors (RMSE) are 0.0760, 0.0711, and 3.3825, respectively, and the mean absolute errors (MAE) are 0.0507, 0.0549, and 2.5874, respectively, indicating relatively small prediction errors. The method holds significant potential for practical applications in atmospheric pollution detection and other molecular detection tasks in complex environments.
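For readers less familiar with the three evaluation metrics reported above, the following minimal sketch shows how R², RMSE, and MAE are conventionally computed for one target gas. It uses the standard textbook definitions and is not the authors' evaluation code; the function name `regression_metrics` and the sample concentration values are hypothetical and chosen only for illustration.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return (R2, RMSE, MAE) for two equal-length sequences of concentrations.

    Standard definitions (an assumption; the paper's exact procedure is not shown):
      R2   = 1 - SS_res / SS_tot
      RMSE = sqrt(mean((y_true - y_pred)^2))
      MAE  = mean(|y_true - y_pred|)
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(y_true)
    mean_true = sum(y_true) / n

    # Residual sum of squares: error left unexplained by the model.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Total sum of squares: variation of the true values around their mean.
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R2 is undefined when all true values are identical")

    r2 = 1.0 - ss_res / ss_tot
    rmse = math.sqrt(ss_res / n)
    mae = sum(abs(t - p) for t, p in zip(y_true, y_pred)) / n
    return r2, rmse, mae


if __name__ == "__main__":
    # Hypothetical true and predicted concentrations for a single gas.
    true_conc = [0.5, 1.0, 1.5, 2.0]
    pred_conc = [0.6, 0.9, 1.6, 1.9]
    r2, rmse, mae = regression_metrics(true_conc, pred_conc)
    print(f"R2={r2:.4f}  RMSE={rmse:.4f}  MAE={mae:.4f}")
```

Because RMSE and MAE carry the units of the measured concentration, their magnitudes are only comparable across gases measured over similar ranges, whereas R² is dimensionless.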