<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Utilization of Machine Learning in Recognition of Rocks and Mock-mines by Sonar Chirp Signals</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Yurii Kryvenchuk</string-name>
          <email>yurii.p.kryvenchuk@lpnu.ua</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mykhailo Dmytryshyn</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>CEUR Workshop Proceedings</string-name>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Lviv, Ukraine</string-name>
        </contrib>
      </contrib-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>3. Methods</title>
      <sec id="sec-2-1">
        <title>3.1. Dataset</title>
<p>The research utilized the "Connectionist Bench (Sonar, Mines vs. Rocks)" dataset from the UCI
Machine Learning Repository. The dataset is a CSV file in which sonar patterns are stored. These
patterns result from bouncing sonar signals off a metal cylinder and off rocks, each explored
across various angles and conditions. The transmitted sonar signals are frequency-modulated
chirps, ascending in frequency, and were captured from diverse aspect angles, spanning 90 degrees
for the cylinder and 180 degrees for the rock. Each pattern consists of 60 numerical values within
the range of 0.0 to 1.0. These numbers denote the energy within specific frequency bands,
integrated over defined time periods. Notably, the integration aperture for higher frequencies
occurs later in time because those frequencies are transmitted later in the chirp.</p>
<p>The labels assigned to each record are "R" for rocks and "M" for mines (metal cylinders). While
the records are ordered by ascending aspect angle, the labels do not directly encode the angle
information.</p>
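        <p>For orientation, the snippet below loads the CSV and confirms its shape and class balance. It
is a minimal sketch that assumes the UCI file has been saved locally under the hypothetical name
"sonar.csv" with no header row.</p>
        <preformat># Quick inspection of the dataset: 208 patterns, 60 features plus one label column.
# "sonar.csv" is an assumed local path to the UCI sonar CSV.
import pandas as pd

data = pd.read_csv("sonar.csv", header=None)
print(data.shape)                       # (208, 61)
print(data.iloc[:, 60].value_counts())  # M: 111 mines, R: 97 rocks</preformat>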
      </sec>
      <sec id="sec-2-2">
        <title>3.2. Data processing and organization methods</title>
<p>In the context of the work on "Utilization of Machine Learning in Recognition of Rocks and
Mock-mines by Sonar Chirp Signals," Label Encoding is employed to convert the class labels
(categories) into numerical values. For the task of recognizing rocks ("R") and mock-mines ("M")
based on sonar chirp signals, the classes can be encoded into numerical values.</p>
        <p>For example, if there is a column with class labels like
["R", "M", "R", "R", "M", "M", "R", "M", "R", "R"],
Label Encoding can be used to transform these classes into numerical values, for instance:
[1, 0, 1, 1, 0, 0, 1, 0, 1, 1]</p>
        <p>Here, "R" has been assigned the value 0, and "M" has been assigned the value 1. This
conversion allows machine learning algorithms to work with the data, as many algorithms
require numerical values for both input and output.</p>
<p>Label Encoding can be performed using libraries such as scikit-learn in Python, utilizing the
LabelEncoder class; a minimal sketch is given at the end of this subsection. This encoding is
particularly useful when dealing with categorical data in machine learning models.</p>
        <p>Ensemble methods, specifically AdaBoost-SAMME, were employed to leverage the strengths of
multiple weak learners. Decision trees, logistic regression, and random forests were individually
used as base classifiers within the ensemble framework to assess their impact on classification
accuracy. Various neural network architectures were also explored, with techniques such as
dropout and L2 regularization applied to mitigate overfitting and enhance generalization
performance. The performance of each model was assessed using accuracy obtained as a
cross-validation score.</p>
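        <p>The following sketch illustrates the pipeline described above: encoding the labels with
LabelEncoder and scoring AdaBoost-SAMME ensembles built on the three base classifiers via
cross-validation. It is a minimal illustration, not the authors' exact code: the file name
"sonar.csv" and all hyperparameters are assumptions, and scikit-learn 1.2 or later is assumed
for the estimator keyword.</p>
        <preformat>import pandas as pd
from sklearn.preprocessing import LabelEncoder
from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# 60 feature columns; the label ("R"/"M") sits in column 60.
data = pd.read_csv("sonar.csv", header=None)
X, labels = data.iloc[:, :60].values, data.iloc[:, 60].values

# LabelEncoder assigns integers alphabetically: "M" -> 0, "R" -> 1.
y = LabelEncoder().fit_transform(labels)

base_classifiers = {
    "decision tree": DecisionTreeClassifier(max_depth=1),
    "logistic regression": LogisticRegression(max_iter=1000),
    "random forest": RandomForestClassifier(n_estimators=10),
}
for name, base in base_classifiers.items():
    # algorithm="SAMME" selects the discrete SAMME variant named in the text.
    ensemble = AdaBoostClassifier(estimator=base, algorithm="SAMME",
                                  n_estimators=50, random_state=0)
    scores = cross_val_score(ensemble, X, y, cv=5)  # accuracy by default
    print(f"AdaBoost-SAMME ({name}): {scores.mean():.2%}")</preformat>
        <p>For the neural networks, dropout and L2 regularization can be combined as in the sketch
below. The layer sizes and rates shown are illustrative assumptions, not the architecture
reported in this study; TensorFlow/Keras is assumed.</p>
        <preformat>import tensorflow as tf
from tensorflow.keras import layers, regularizers

model = tf.keras.Sequential([
    layers.Input(shape=(60,)),                  # one input per energy band
    layers.Dense(30, activation="relu",
                 kernel_regularizer=regularizers.l2(1e-3)),  # L2 penalty
    layers.Dropout(0.5),                        # dropout against overfitting
    layers.Dense(1, activation="sigmoid"),      # probability of "mine"
])
model.compile(optimizer="adam", loss="binary_crossentropy",
              metrics=["accuracy"])</preformat>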
      </sec>
      <sec id="sec-2-3">
        <title>3.3. ML Methods</title>
<p>In this study, a diverse set of machine learning algorithms has been employed to discern patterns
and classify sonar signals. The algorithms chosen demonstrate versatility in handling the
complexity of the data and offer a comprehensive exploration of the recognition task. The
following algorithms have been applied: AdaBoost-SAMME (with decision tree, logistic regression,
and random forest base classifiers), decision trees (with and without minimal cost-complexity
pruning), Gaussian process classification with the Laplace approximation, k-nearest neighbors
vote, logistic regression (unregularized and with L1, L2, and combined L1/L2 penalties),
multi-layer perceptrons (with and without L2 regularization), neural networks (with dropout, L2
regularization, and their combination), and random forests, as sketched below.</p>
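        <p>A hedged sketch of how the remaining classifier families can be instantiated with
scikit-learn follows; all hyperparameters shown are assumptions for illustration, not the
settings used in the study.</p>
        <preformat>from sklearn.gaussian_process import GaussianProcessClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.neural_network import MLPClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier

models = {
    # scikit-learn's GP classifier uses the Laplace approximation internally.
    "Gaussian process - Laplace approximation": GaussianProcessClassifier(),
    "K-nearest neighbors vote": KNeighborsClassifier(n_neighbors=5),
    "Logistic Regression - L1": LogisticRegression(penalty="l1",
                                                   solver="liblinear"),
    "Logistic Regression - L1 and L2": LogisticRegression(
        penalty="elasticnet", solver="saga", l1_ratio=0.5, max_iter=5000),
    # alpha is the L2 penalty strength of the multi-layer perceptron.
    "Multi-layer Perceptron - L2": MLPClassifier(alpha=1e-3, max_iter=2000),
    # ccp_alpha enables minimal cost-complexity pruning.
    "Decision Tree - min cost complexity pruning":
        DecisionTreeClassifier(ccp_alpha=0.01),
    "Random Forest": RandomForestClassifier(n_estimators=100),
}</preformat>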
      </sec>
      <sec id="sec-2-4">
        <title>3.4. Overfitting</title>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>4. Experiment</title>
      <sec id="sec-3-1">
<title>4.1. Dataset Preprocessing</title>
      </sec>
      <sec id="sec-3-2">
        <title>4.2. Evaluation</title>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>5. Results</title>
      <sec id="sec-4-1">
        <title>Algorithm</title>
      </sec>
      <sec id="sec-4-2">
        <title>AdaBoost-Samme - decision tree</title>
      </sec>
      <sec id="sec-4-3">
        <title>AdaBoost-Samme - logistic regression</title>
      </sec>
      <sec id="sec-4-4">
        <title>AdaBoost-Samme - random forest</title>
      </sec>
      <sec id="sec-4-5">
        <title>Decision Tree</title>
      </sec>
      <sec id="sec-4-6">
        <title>Accuracy</title>
        <p>71.12%
79.76%
87.50%
73.52%</p>
      </sec>
      <sec id="sec-4-7">
        <title>Decision Tree - min cost complexity pruning</title>
      </sec>
      <sec id="sec-4-8">
        <title>Gaussian process - Laplace approximation</title>
      </sec>
      <sec id="sec-4-9">
        <title>K-nearest neighbors vote</title>
      </sec>
      <sec id="sec-4-10">
        <title>Logistic Regression</title>
      </sec>
      <sec id="sec-4-11">
        <title>Logistic Regression - L1</title>
      </sec>
      <sec id="sec-4-12">
        <title>Logistic Regression - L2</title>
      </sec>
      <sec id="sec-4-13">
        <title>Logistic Regression - L1 and L2</title>
      </sec>
      <sec id="sec-4-14">
        <title>Multi-layer Perceptron</title>
      </sec>
      <sec id="sec-4-15">
        <title>Multi-layer Perceptron - L2</title>
      </sec>
      <sec id="sec-4-16">
        <title>Neural Network</title>
      </sec>
      <sec id="sec-4-17">
        <title>Neural Network - dropout</title>
      </sec>
      <sec id="sec-4-18">
        <title>Neural Network - L2</title>
      </sec>
      <sec id="sec-4-19">
        <title>Neural Network - dropout and L2</title>
        <p>Random Forest</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>6. Discussions</title>
      <sec id="sec-5-1">
        <title>6.1. Effectiveness of methods</title>
      </sec>
      <sec id="sec-5-2">
        <title>6.2. Comparison with Previous Research</title>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusions</title>
<p>systems.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>X.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <surname>R</surname>
          </string-name>
          , Dong,
          <string-name>
            <surname>Z</surname>
          </string-name>
          ,Lv,
          <source>Deep Learning-Based Classification of Raw Hydroacoustic Signal, Journal of Marine Science and Engineering</source>
          <volume>11</volume>
          (
          <year>2023</year>
          ). https://doi.org/10.3390/jmse11010003.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Jason</given-names>
            <surname>Brownlee</surname>
          </string-name>
          ,
          <source>Dropout Regularization in Deep Learning Models with Keras</source>
          ,
          <year>2022</year>
          . URL: https://machinelearningmastery.com
          <article-title>/dropout-regularization-deep-learning-modelskeras/.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Steiniger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kraus</surname>
          </string-name>
          , T. Meisen,
          <article-title>Survey on deep learning based computer vision for sonar imagery</article-title>
          ,
          <source>Engineering Applications of Artificial Intelligence</source>
          <volume>114</volume>
          (
          <year>2022</year>
          ). https://doi.org/10.1016/j.engappai.
          <year>2022</year>
          .105157
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>D.</given-names>
            <surname>Karimanzira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Renkewitz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Shea</surname>
          </string-name>
          ,
          <source>Object Detection in Sonar Images, Electronics</source>
          <volume>9</volume>
          (
          <year>2020</year>
          ). https://doi.org/10.3390/electronics9071180
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Fernandes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Junior</surname>
          </string-name>
          ,
          <article-title>Deep Learning Models for Passive Sonar Signal Classification of Military Data</article-title>
          ,
          <source>Remote Sensing 14</source>
          , (
          <year>2022</year>
          ). https://doi.org/10.3390/rs14112648
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6] OpenGenus, Advantages and Disadvantages of Logistic Regression,
          <year>2024</year>
          . URL: https://iq.opengenus.org/advantages-and
          <article-title>-disadvantages-of-logistic-regression/.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>IBM</surname>
          </string-name>
          ,
          <article-title>What is a Decision Tree?</article-title>
          ,
          <year>2024</year>
          . URL: https://www.ibm.com/topics/decisiontrees#:~:text=
          <source>A%20decision%20tree%20is%20a,internal%20nodes%20and%20leaf%20 nodes.</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Nagpal</surname>
          </string-name>
          ,
          <source>L1 and L2 Regularization Methods</source>
          ,
          <year>2017</year>
          . URL: https://towardsdatascience.com/l1-and
          <string-name>
            <surname>-</surname>
          </string-name>
          l2
          <string-name>
            <surname>-</surname>
          </string-name>
          regularization
          <article-title>-methods-ce25e7fc831c</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tian</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y. Zhang,</surname>
          </string-name>
          <article-title>A comprehensive survey on regularization strategies in machine learning</article-title>
          ,
          <source>Information Fusion</source>
          , (
          <year>2021</year>
          ). https://doi.org/10.1016/j.inffus.
          <year>2021</year>
          .
          <volume>11</volume>
          .005
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Shubham</surname>
          </string-name>
          ,
          <source>An Overview of Regularization Techniques in Deep Learning</source>
          ,
          <year>2023</year>
          . URL: https://www.analyticsvidhya.com/blog/tag/regularization-in
          <article-title>-deep-learning/.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>L.</given-names>
            <surname>Ruhela</surname>
          </string-name>
          , Droput Rtgularization,
          <year>2023</year>
          . URL: https://ruhelalakshya.medium.com/dropout - regularization-b27885b4c55b.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>N.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          , G. Hinton, (Eds.),
          <article-title>Dropout: A Simple Way to Prevent Neural Networks from Overfitting</article-title>
          ,
          <source>Journal of Machine Learning Research</source>
          <volume>15</volume>
          , (
          <year>2014</year>
          ):
          <fpage>1929</fpage>
          -
          <lpage>1958</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>G.</given-names>
            <surname>Geeks</surname>
          </string-name>
          ,
          <source>Logistic Regression in Machine Learning</source>
          ,
          <year>2023</year>
          . URL: https://www.geeksforgeeks.org/understanding-logistic-regression/.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>