In industry, real-time fault detection and diagnosis methods are required to secure processes, reduce damage to products, and avoid possible system failures. Recently, Long Short-Term Memory (LSTM) neural networks are used as an approach to fault detection in industrial process operation because of their strength sequential data processing, such as time series processing. However, LSTM neural networks demand more effort computational to inferring and training when compared to other kinds of neural network architectures. Then, considering IIoT (Industrial Internet of Things) embedded systems have limited memory capacity and small battery charges, strategies to speed up inference in LSTM neural networks and enhance their performance became necessary. In this way, this paper proposes a basis to compress LSTM neural networks with pruning techniques in software. Our pruning approach removes redundant parameters of the LSTM neural network by zeroing absolute synaptic weight values below a threshold. Then, we retrain the pruned model to readjust nonzero weights. We used the Tennessee Eastman Process benchmark to assess our approach. Finally, the paper presents the accuracy, precision, recall and F1-Score for both faulty data sets, varying the network's sparsity and comparing sparsities with performance parameters of the proposed network.