Developing Deep Learning Based Intelligent Perception Algorithm for Thickener Equipment Monitoring Zuochen Liu1,2,†, Qingkai Wang1,2,†, Jiandong Dang3,†, Xiaochun Wang3,†, Kang Li1,2,∗,†, Roberto Avogadro4,†, Dumitru Roman4,† and Xiang Ma4,∗,† 1 State Key Laboratory of Intelligent Optimized Manufacturing in Mining & Metallurgy Process, Beijing 100160, China 2 Beijing General Research Institute of Mining and Metallurgy (BGRIMM), Beijing 100160, China 3 Anhui Tongguan (Lujiang) Mining Co., Ltd, Hefei 231500, Aahui, China 4 SINTEF AS, POB 124, Blindern, 0314 Oslo, Norway Abstract Dense dehydration by use of thickener equipment is a critical process in mineral processing, directly impacting the quality of concentrate products. However, current industrial practices lack effective methods for monitoring the condition of thickeners. This makes it difficult for operators to accurately perceive the internal state of the thickener, leading to a relatively inefficient production process. This paper addresses the challenge of real-time detection of one of the key process parameters, feed rate, in thickener by employing a data-driven model-ling approach with intelligent perception. The upstream processes are analyzed, and relevant variables are selected for modeling. The Long Short-Term Memory (LSTM) - Recurrent Neural Network (RNN) and the Gated Recurrent Unit (GRU) - RNN with the Particle Swarm Optimization (PSO) algorithm are used to build and train the models. The results show that the PSO-GRU-based intelligent perception model for thickener feed rate estimation achieves higher accuracy and shorter training times compared with the LSTM-RNN model. Keywords intelligent perception, deep learning, LSTM-RNN, GRU-PSO, thickener 1 1. Introduction Mineral processing is a crucial and indispensable step after geological exploration and mining, and before metallurgical or chemical processing in many material production value chains. It involves converting raw, complex mineral resources into standardized, orderly concentrates that meet specifications for further processing in smelters or chemical plants [1]. The mined ore is sequentially subjected to primary crushing, vibrating screening, fine crushing, grinding, flotation and final thickening, as shown in Figure 1. Moisture content is a critical quality indicator for concentrated products, making thickening and dewatering a crucial step in the mineral processing workflow. The thickening process plays an essential role in achieving solid-liquid separation, with the thickener being the key piece of equipment in this operation [2]. In actual production, the operation of thickening and filtration relies heavily on assessing the operating conditions of the thickener, with the feed rate being a critical factor. The feed rate is a nonlinear, dynamic variable, and industrial operations often depend on manual experience, which can be subjective and imprecise. This makes it difficult for operators to accurately perceive the internal state of the thickener, leading to a relatively inefficient production process. RuleML+RR'24: Companion Proceedings of the 8th International Joint Conference on Rules and Reasoning, September 16‐ ‐22, 2024, Bucharest, Romania ∗ Corresponding author † These authors contributed equally. likang@bgrimm.com (K. Li); xiang.ma@sintef.no (X. Ma) 0000-0003-1867-3919 (K. Li); 0000-0001-6465-0254 (X. Ma) © 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR ceur-ws.org Workshop ISSN 1613-0073 Proceedings Issues such as low filtration efficiency and high energy consumption consequently arise. Therefore, optimizing thickener control requires accurate detection of the feed rate. In practice, ore feed rate detection typically relies on offline sampling or the installation of online analysis devices. However, due to the limitations of industrial environments, current detection methods fail to meet the requirements for real-time and accurate measurement [3]. Figure 1. Schematic diagram of conventional beneficiation process and equipment. With the development of soft sensing technology, data-driven regression models have been widely used to measure variables that are difficult to obtain directly in real-time. Moreover, recent years have seen breakthrough advancements in new intelligent sensing technologies, particularly in deep learning. Dynamic neural networks, represented by Recurrent Neural Networks (RNN), excel at capturing the temporal dynamics in sequential data. However, standard RNNs face issues like vanishing and exploding gradients when modeling long-time series, due to their indiscriminate memory of past data. To tackle this issue, Long Short-Term Memory (LSTM) networks were designed to selectively forget past information by incorporating gating units [4]. While LSTM offers superior performance, it comes with increased complexity and numerous parameters. The Gated Recurrent Unit (GRU) was thus introduced to simplify the LSTM structure while retaining its powerful performance [5]. In this study, the upstream processes of thickening are analyzed, and relevant data is used to implement intelligent perception of thickener feed rate using LSTM and GRU based models. The simulation results, along with actual data, verify that deep learning methods can effectively solve the problem of online detection of thickener feed rate. 2. Constructing and processing of datasets The mineral processing workflow is a continuous process that includes crushing, grinding, flotation, and thickening/filtration, with each step closely interconnected. The output of one stage typically serves as the input for the next, forming a tightly integrated process chain. From prior knowledge, it is understood that the feed rate of the thickener is closely related to the grinding and flotation processes. Given the complexity of these processes which involve numerous pieces of equipment and sensors, this study selects 58 auxiliary variables from the grinding and flotation stages that have a strong correlation with the thickener feed rate. The auxiliary variables' data can be detected in real-time at the industrial site and stored in a database. The raw data consists of actual operational data from the plant, with a sampling interval of 5 seconds, resulting in a total of 200,000 data sets. The selected time period for these samples effectively captures a wide range of conditions during normal process operation. The raw data, directly sourced from on-site industrial detection equipment, inevitably contains missing values and random errors. Using such unprocessed data for model building can significantly reduce predictive accuracy. Therefore, data preprocessing is essential and involves the following steps: (1) Handling missing values Given the 5-second sampling interval, there are instances where equipment or sensor data change during sampling, leading to unsuccessful data capture. To address this, missing values in the raw data are identified and filled using forward filling, where the missing data point is replaced with the value from the preceding timestamp. (2) Outlier detection This study employs a moving median method to detect and eliminate outliers. Specifically, a moving window of length 70 is applied sequentially across the dataset. Within each window, any data point that deviates from the local median by more than five times the local standard deviation is classified as an outlier. (3) Standardization and inverse standardization The auxiliary variables exhibit significant differences in their values and distributions, rendering them unsuitable for direct input into the model. To mitigate the effects of differing dimensions among input variables, the data are standardized to have a mean of 0 and a standard deviation of 1. This standardized data serves as the input for model training. Consequently, the model's output, which is based on the standardized input, must undergo inverse standardization to revert to the original scale. The equations for standardization and inverse standardization are as follows: 𝑋 (1) 𝑥 𝑋 𝑥 –𝑥 𝑥 (2) where 𝑥 : the i‐th denormalized data, 𝑋 : the normalized data, 𝑥 and 𝑥 : the minimum and maximum values of the sample set with N number of data, respectively. (4) Evaluation criteria The root mean square error (RMSE) is selected to evaluate the difference between the observed value 𝑦 and the true value 𝑝𝑟𝑒𝑑 of the model, and the calculated mean relative error (MRE) is selected to measure the overall deviation between the observed value and the true value to evaluate the quality of the model, as defined by: ∑ 𝑅𝑀𝑆𝐸 (3) ∑ | | 𝑀𝑅𝐸 (4) 3. Intelligent perception models for thickener feed rate prediction Intelligent perception models are developed for thickener feed rate evaluation using traditional deep learning-based LSTM-RNN and GRU-RNN. A subset of 10,000 data samples is selected from the dataset, and models are built using both this subset and the full dataset of 200,000 samples. The dataset contains 58 input variables, with the thickener feed rate as the output variable. The data is divided into training, validation, and test sets with a ratio of 0.7:0.15:0.15. The simulation software Matlab© R2023b is used to build and run the models in a 64-bit Windows 11 stationary PC with an Intel(R) Core(TM) i9-12900H CPU and a 16.00 GB RAM. 3.1. Intelligent perception model based on LSTM‐RNN LSTM-RNN is a special recurrent neural network, composed of input gates, for-getting gates, and output gates which can process and memorize long-term series data. It can effectively alleviate the problems of gradient vanishing and gradient explosion in ordinary RNNs, and is suitable for solving the regression and prediction problems of long-term series. Figure 2 shows the process of building the intelligent perception model based on LSTM. The hyperparameters are selected for random search optimization, and the Adam optimizer is used for training. FC LSTM LSTM . . . LSTM . . . Figure 2. Structural diagram of the intelligent perception model based on LSTM. The RMSE and the MRE for the predicted feed rate in ton/hour (t/h) using the test set containing 10,000 samples are 0.210 and 4.1%, respectively, and are 0.307 and 5.2% for the full dataset, respectively, as shown in Figure 3. However, the LSTM model's complexity, longer training time, higher hardware requirements, and increased number of parameters make the training process more challenging, which poses difficulties for its application in industrial settings. To address these issues, the use of a GRU network will be explored. 11.5 15 Actual Actual 11 Predicted Predicted 10.5 10 9.5 10 9 8.5 8 7.5 5 0 500 1000 1500 0 0.5 1 1.5 2 2.5 3 Sampling index Sampling index 104 (a) 10,000 samples out of the full dataset (b) Full dataset Figure 3. LSTM-RNN predicted feed rate output (t/h), compared with the actual vaules. 3.2. Intelligent perception model based on improved GRU The GRU model is a further improved structural model of the LSTM, merging the input and forgetting gates of the LSTM and using only two gating units: the update gate and the reset gate. At the same time, the particle swarm (PSO) algorithm is used for hyperparameter optimization, which is an optimization algorithm based on swarm intelligence and finds the optimal solution through information sharing between individuals [6]. The structure of the GRU-based intelligent perception model is similar to that of the LSTM model in which the LSTM network is replaced by the GRU network, and the PSO algorithm is used to optimize the hyperparameters of the GRU. As shown in Figure 4, the RMSE and the MRE for the predicted feed rate by PSO-GRU using the test set containing 10,000 samples are 0.153 and 3.5%, respectively, and are 0.136 and 3.4% for the full dataset, respectively. Actual result comparison shows that the performance of the PSO- GRU network can well meet the measurement accuracy requirements in the production line for the sample dataset of 10,000 sets, and the complete dataset with 200,000 sets. Additionally, the training time for the PSO-GRU model is 35min and 31s, shortened by 19% compared with of the LSTM model of 43min and 50s. 11.5 18 Actual Actual 11 Predicted 16 Predicted 10.5 14 10 9.5 12 9 10 8.5 8 8 7.5 6 7 4 0 500 1000 1500 0 0.5 1 1.5 2 2.5 3 Sampling index Sampling index 4 10 (a) 10,000 samples out of the full dataset (b) Full dataset Figure 4. PSO-GRU predicted feed rate output (t/h), compared with the actual values. 4. Conclusions The feed rate of the thickener is a crucial parameter in its operation. This study proposes several intelligent perception models to accurately predict the thickener feed rate, utilizing relevant variables from the grinding and flotation processes to address the challenge of online detection. Two intelligent perception models are developed based on LSTM and GRU-RNN. The best performed GRU-RNN model and its advantages over LSTM network are analyzed. This deep learning-based approach could be applied to actual production, providing a foundation for the optimized control of the thickener. Acknowledgements This work was supported partially by the National Key R&D Program of China (Grant No. 2021YFC2902703),the Open Foundation of State Key Laboratory of Process Automation in Mining & Metallurgy/Beijing Key Laboratory of Process Automation in Mining & Metallurgy, China (Grant No. BGRIMM-KZSKL-2022-4, Grant No. BGRIMM-KZSKL-2023-10), Horizon Europe funded enRichMyData (Grant No. 101070284), and the bilaterial project "BigDataMine" jointly funded by National Key R&D Program of China (Grant No. 2019YFE0105000) and the Research Council of Norway (Grant No. 309691). References [1] Hodouin D., Jämsä-Jounela S.-L, Carvalho M.T., Bergh L., “State of the art and challenges in mineral processing control”. Control Engineering Practice 9.9 (2001): 995-1005. [2] Shmueli G., Sarstedt M., Hair J. F., Cheah, J. H., Ting, H., Vaithilingam, S., Ringle, C. M., “Predictive model assessment in PLS-SEM: guidelines for using PLSpredict”. European Journal of Marketing 53.11 (2019): 2322–2347. [3] Chen, H., Huang, W., Huang, J., Cao, C., Yang, L., He, Y., Zeng, L., “Multi-fault condition monitoring of slurry pump with principle component analysis and sequential hypothesis test”. International Journal of Pattern Recognition and Artificial Intelligence 34.7 (2020): 2059019. [4] Graziani, S., Xibilia, M. G., “Design of a soft sensor for an industrial plant with unknown delay by using deep learning”. in: Proceedings of the 2019 IEEE International Instrumentation and Measurement Technology Conference, IEEE, Auckland, 2019, pp. 1–6. [5] Chung, J., Gulcehre, C., Cho, K. H., Bengio, Y., “Empirical evaluation of gated recurrent neural networks on sequence modeling". arXiv preprint arXiv: 1412.3555, 1–9 (2014). [6] Kennedy, J., Eberhart, R., “Particle swarm optimization”. in: Proceedings of the International Conference on Neural Networks, IEEE, Perth, 1995, pp. 1942–1948.