Existing fixed pattern noise reduction (FPNR) methods are easily affected by the motion state of the scene and working condition of the image sensor, which leads to over smooth effects, ghosting artifacts as well as slow convergence rate. To address these issues, we design an innovative cascade convolution neural network (CNN) model with residual skip connections to realize single frame blind FPNR operation without any parameter tuning. Moreover, a coarse-fine convolution (CF-Conv) unit is introduced to extract complementary features in various scales and fuse them to pick more spatial information. Inspired by the success of the visual attention mechanism, we further propose a particular spatial-channel noise attention unit (SCNAU) to separate the scene details from fixed pattern noise more thoroughly and recover the real scene more accurately.Experimental results on test data demonstrate that the proposed cascade CNN-FPNR method outperforms the existing FPNR methods in both of visual effect and quantitative assessment. affected by the fixed pattern noise (FPN), which is mainly caused by the spatial non-uniform response of individual detectors in the sensor [6][7]. More seriously, spatial FPN generally drifts with time, which makes the problem be more challenging [8][9][10][11]. As a result, the FPN causes a significant decline in imaging quality and decreases the precision for object detection and recognition. To meet this challenge, the cost-effective fixed pattern noise reduction (FPNR) techniques based on signal processing are continually investigated and applied in nearly all the infrared imaging systems.Existing FPNR algorithms are mainly divided into two primary categories: reference-based FPNR (RB-FPNR) and scene-based FPNR (SB-FPNR) [12][13][14]. The RB-FPNR methods remove the FPN according to fixed calibration parameters calculated from the response of blackbody radiation at different temperatures [15]. Unfortunately, such a calibration requires the camera to halt the normal operation and update the calibration parameters due to the inherent temporal drift of detector characteristics [16]. Given this fact, most of the recent researches have focused on developing SB-FPNR methods, such as neural networks (NN) [17], temporal high-pass filter (THPF) [18,19] and constant-statistics (CS) method [20][21]. As for SB-FPNR algorithms, the calibration parameters are iteratively updated by utilizing the information extracted from inter-frame motion, therefore, ghosting artifacts and over smooth effects resulted from the sudden deceleration of scene motion often seriously degrade the noise reduction performance, moreover, the relatively slow convergence process occurred in scene switching is unacceptable for most of the practical applications.In recent years, convolution neural network (CNN) [22] models were explored deeply and applied in various image processing tasks [23], such as image super resolution [24,25], image denoising [26], and sketch synthesis [27][28][29]. To the best of our knowledge, CNN based FPNR m...