<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Methods for Predicting Failures in a Smart Home</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Viktoriia</forename><surname>Zhebka</surname></persName>
							<email>viktoria_zhebka@ukr.net</email>
							<affiliation key="aff0">
								<orgName type="institution">State University of Information and Communication Technologies</orgName>
								<address>
									<addrLine>7 Solomenskaya str</addrLine>
									<postCode>03110</postCode>
									<settlement>Kyiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Pavlo</forename><surname>Skladannyi</surname></persName>
							<email>p.skladannyi@kubg.edu.ua</email>
							<affiliation key="aff1">
								<orgName type="institution">Borys Grinchenko Kyiv University</orgName>
								<address>
									<addrLine>18/2 Bulvarno-Kudriavska str</addrLine>
									<postCode>04053</postCode>
									<settlement>Kyiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Yurii</forename><surname>Bazak</surname></persName>
							<email>jura.bazak@gmail.com</email>
							<affiliation key="aff0">
								<orgName type="institution">State University of Information and Communication Technologies</orgName>
								<address>
									<addrLine>7 Solomenskaya str</addrLine>
									<postCode>03110</postCode>
									<settlement>Kyiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Andrii</forename><surname>Bondarchuk</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">State University of Information and Communication Technologies</orgName>
								<address>
									<addrLine>7 Solomenskaya str</addrLine>
									<postCode>03110</postCode>
									<settlement>Kyiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Kamila</forename><surname>Storchak</surname></persName>
							<email>kpstorchak@ukr.net</email>
							<affiliation key="aff0">
								<orgName type="institution">State University of Information and Communication Technologies</orgName>
								<address>
									<addrLine>7 Solomenskaya str</addrLine>
									<postCode>03110</postCode>
									<settlement>Kyiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Methods for Predicting Failures in a Smart Home</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">D2551776594D36449BAA413D2021925C</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T20:20+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Long short-term memory (LSTM)</term>
					<term>Machine learning</term>
					<term>Data processing</term>
					<term>Forecasting</term>
					<term>Smart home</term>
					<term>Failure</term>
					<term>Information technology</term>
					<term>ORCID: 0000-0003-4051-1190 (V. Zhebka)</term>
					<term>ORCID: 0000-0002-7775-6039 (P. Skladannyi)</term>
					<term>ORCID: 0009-0000-6098-2809 (Y. Bazak)</term>
					<term>ORCID: 0000-0001-5124-5102 (A. Bondarchuk)</term>
					<term>ORCID: 0000-0001-9295-4685 (K. Storchak)</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Methods for predicting possible failures in smart home systems and analyzing the data required for this have been considered in the study. A study of machine learning methods has been carried out: their features, advantages, and disadvantages have been identified, the metrics of each method have been studied, and the effectiveness of methods for predicting failures in a smart home has been established. It has been found that the Long Short-Term Memory (LSTM) model is distinguished by its ability to work with data sequences and store information for a long time. The characteristics of the LSTM method and its algorithm have been studied in detail. The study emphasizes the importance of collecting and processing various data, such as sensor data, energy consumption, and information about devices and users. The results of the study can be useful for the further development of smart home control systems to improve their reliability and efficiency.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Smart homes are becoming increasingly common thanks to the development of the Internet of Things (IoT) and smart technologies <ref type="bibr">[1]</ref>. They provide automation and convenient control of various systems such as lighting, heating, security, energy efficiency, and many others <ref type="bibr">[2,</ref><ref type="bibr">3]</ref>.</p><p>However, as the complexity of these systems grows, so does the likelihood of failures or problems. Network instability, software errors, and faulty devices can all lead to unpredictable situations that affect the usability and security of a smart home <ref type="bibr">[4]</ref>.</p><p>Predicting failures in a smart home is therefore a relevant problem, and machine learning methods are well suited to it. Machine learning algorithms make it possible to analyze large amounts of data and to identify the deviations and patterns that precede failures. This approach allows possible problems to be predicted and preventive measures to be taken before they occur.</p><p>Today, smart home failure prediction is mostly based on reactive data analysis: systems detect anomalous situations or failures only after they occur, which makes it difficult to avoid potential problems.</p><p>However, using machine learning methods such as classification, clustering, and prediction algorithms, it is possible to develop systems that predict failures in a smart home in advance.</p><p>Such systems analyze data from sensors, IoT devices, control systems, energy consumption, and other sources to identify patterns and anomalies that may precede disruptions <ref type="bibr">[5,</ref><ref type="bibr">6]</ref>. 
Based on this information, machine learning systems can build predictive models that respond to certain signals or changes in normal operation, warning of potential problems or taking steps to prevent them <ref type="bibr">[7,</ref><ref type="bibr">8]</ref>.</p><p>This area of research is still evolving, but it promises to improve smart home control systems by enabling them to predict and prevent possible failures in advance, providing greater reliability and security for users.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Research Results</head><p>Machine learning techniques can help detect and avoid some disruptions before they occur, or restore system operation faster after a failure. They can detect anomalies in performance early, which makes it possible to prevent problems or respond to them quickly and, in turn, reduces the impact of these failures on the smart home.</p><p>Machine learning algorithms are compared on four different data representations: original, balanced, normalized, and standardized. The original data is unchanged from the selected data except for the removal of timestamp values. Balancing is performed by undersampling the failure-free data in the training set: failure-free data inputs are randomly selected and removed from the dataset, resulting in equal numbers of failure and no-failure cases. Undersampling can, however, erase important information from the data, leading to poorer algorithm performance. The main advantage of data balancing is that it reduces the resources and time required to train the algorithms. Data normalization refers to scaling feature values to the range from zero to one. Scaling is performed using (1) separately for each feature by finding its minimum and maximum values. The data is also standardized by considering each feature separately: the mean is subtracted from each feature value, and the result is divided by the standard deviation, as shown in (2). This results in values centered around zero with unit variance. An additional pre-processing step, performed before normalizing and standardizing the data, is the conversion of the time features of the day of the week and the hour. To indicate that the difference between hours 23 and 0 is the same as the difference between hours 22 and 23, the values are converted to cyclic representations using the Fourier transform <ref type="bibr">[6]</ref>. 
The transformation calculates the sine and cosine values for each feature, as shown in equation (3). Thus, each feature is replaced by the corresponding sine and cosine features. The calculation depends on the total number of distinct feature values N, which is 24 for hours and 7 for days of the week.</p><formula xml:id="formula_0">x_n = (x − min) / (max − min), (1)
x_n = (x − µ) / σ, (2)
x_sin = sin(2πx/N), x_cos = cos(2πx/N), (3)</formula><p>where x is the feature value, min is the minimum value of the feature, max is the maximum value of the feature, µ is the mean value of the feature, σ is the standard deviation of the feature, and N is the total number of distinct feature values. Some of the algorithms may have problems with dimensionality and run much more slowly than others. This can be addressed by reducing the number of dimensions using principal component analysis. The method is applied only to specific algorithms and specific data representations, depending on the speed of learning and prediction. In addition, some of the algorithms work only with a certain input format.</p><p>Many machine learning algorithms can be used to predict device failures, and they differ in many properties and features <ref type="bibr">[9]</ref>. They can be supervised or unsupervised; they can solve classification, regression, or clustering problems; and they can belong to different families such as deep learning, tree-based, probabilistic, or linear methods. The total number of algorithms considered for comparison was limited because of their lengthy setup and training. The algorithms were selected based on several criteria:</p><p>1. Supervised learning: it is assumed that labeled data are available. 2. Practical use: some algorithms are used more often than others for predictive maintenance. 3. Diversity: algorithms were chosen to represent different families, tasks, and capabilities, such as online learning or prediction over time. 
Based on these criteria, ten algorithms have been selected, nine of which are implemented as classification algorithms and one as a time series regression algorithm (Tables <ref type="table" target="#tab_1">1 and 2</ref>) <ref type="bibr">[10,</ref><ref type="bibr">11]</ref>. There are representatives of different types. All algorithms support online learning either implicitly or through certain variations.</p></div>
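The pre-processing steps described above, min-max normalization (1), standardization (2), and cyclic encoding of the hour and day-of-week features (3), can be sketched in Python. This is an illustrative reconstruction, not the authors' implementation; the function names are assumptions:

```python
import math

def min_max_normalize(values):
    """Scale feature values to [0, 1], per equation (1)."""
    lo, hi = min(values), max(values)
    return [(x - lo) / (hi - lo) for x in values]

def standardize(values):
    """Center values around zero with unit variance, per equation (2)."""
    mu = sum(values) / len(values)
    sigma = math.sqrt(sum((x - mu) ** 2 for x in values) / len(values))
    return [(x - mu) / sigma for x in values]

def cyclic_encode(value, n):
    """Replace a cyclic feature (hour: n=24, weekday: n=7) with its sine
    and cosine representation, per equation (3), so that hour 23 and
    hour 0 end up close together in feature space."""
    angle = 2 * math.pi * value / n
    return math.sin(angle), math.cos(angle)

hours = [0, 6, 12, 23]
print(min_max_normalize(hours))
print([cyclic_encode(h, 24) for h in hours])
```

A quick check of the cyclic encoding confirms the motivation given in the text: the encoded distance between hour 23 and hour 0 is small, while the distance between hour 12 and hour 0 is large.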
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 1</head><p>List of machine learning methods and their brief description </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Support Vector Machine</head><p>An algorithm that determines the optimal boundary of separation between classes using support vectors <ref type="bibr" target="#b1">[17]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Logistic Regression</head><p>A classification method that uses a logistic function to determine the probability of an object belonging to a certain class <ref type="bibr">[12]</ref>.</p><p>Stochastic Gradient Descent An optimization algorithm that uses a gradient to find the minimum of a loss function with a randomly selected subset of data <ref type="bibr">[16]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Multi-Layer Perceptron</head><p>A neural network with one or more hidden layers is used for classification and regression based on weighting coefficients <ref type="bibr">[12]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>LSTM</head><p>A type of recurrent neural network designed to store and use information over a long period to predict failures or events <ref type="bibr">[16,</ref><ref type="bibr" target="#b1">17]</ref>.</p><p>A = (TP + TN) / (TP + TN + FP + FN), (4) where TP is the number of true positives (correctly classified positive cases), TN is the number of true negatives (correctly classified negative cases), FP is the number of false positives (negative cases incorrectly classified as positive), and FN is the number of false negatives (positive cases incorrectly classified as negative).</p><p>Precision determines the percentage of correctly identified positive cases among all cases identified as positive, which is useful when working with imbalanced classes.</p><formula xml:id="formula_1">P = TP / (TP + FP).</formula><p>(5)</p><p>Recall displays the percentage of correctly identified positive cases among all actual positive cases, which is important for identifying important cases that would otherwise be missed.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><formula xml:id="formula_2">R = TP / (TP + FN). (6)</formula><p>The F1 score uses the harmonic mean of precision and recall to show how well the model solves the classification task.</p><formula xml:id="formula_4">F1 = 2PR / (P + R). (7)</formula><p>ROC-AUC measures the area under the ROC curve and evaluates the model's performance across different classification thresholds, helping to determine its ability to make correct predictions.</p><formula xml:id="formula_5">ROC-AUC = ∫₀¹ R(S) dS. (8)</formula><p>Since the ROC-AUC formula is an integral, this area is usually approximated by numerical methods such as the trapezoidal rule or Simpson's rule.</p><p>The recall (R) and the false positive rate (S) are determined from the confusion matrix for binary classification: recall is calculated using formula (6), and the false positive rate using formula (9).</p><formula xml:id="formula_7">S = FP / (FP + TN). (9)</formula><p>The confusion matrix provides detailed information about the actual and predicted classes, which helps to estimate the level of correctness and the errors for each class and is important when analyzing the model.</p><p>The confusion matrix helps to evaluate the performance of a classification model by visualizing actual and predicted values. 
It is the basis for calculating various metrics, such as accuracy, sensitivity, specificity, and the F1 score.</p><p>These metrics are crucial for evaluating the effectiveness of algorithms and for choosing the one that best suits a particular task, depending on its requirements <ref type="bibr" target="#b2">[18,</ref><ref type="bibr" target="#b3">19]</ref>.</p><p>Table <ref type="table" target="#tab_2">3</ref> and Fig. <ref type="figure" target="#fig_0">1</ref> show the performance of different machine learning methods in terms of the main metrics: accuracy, classification accuracy, completeness, F1-average, and ROC-AUC. The score of "High," "Average," or "Very high" in the "Effectiveness" column is a generalized characterization of each method's performance based on these metrics. The study has found that the LSTM model is distinguished by its ability to work with data sequences and to store information for a long time. This makes LSTM effective for analyzing time series, such as sensor data in a smart home, where information is usually sequential in time.</p><p>The LSTM model is capable of storing information for a long time, allowing it to understand and analyze a sequence of real-time sensor data effectively. By using mechanisms that ensure that some information is forgotten and other information is retained, the LSTM can take into account long-term dependencies and the importance of individual events in time series. LSTM can adapt to and learn from different amounts of data, including large amounts of data from smart home sensors, which allows for more accurate failure prediction. The LSTM model can adapt to changing conditions and detect changes in time series, which allows failures and anomalies to be predicted in real-time. 
LSTM can process a variety of data types (text, numbers, sequences, etc.), making it versatile for use in various forecasting and analysis scenarios.</p><p>It is therefore not surprising that this algorithm showed the best results for predicting failures in a smart home.</p><p>The LSTM model has the following elements at each time step t: a forget gate f_t, which decides what information to discard from the cell state; an input gate i_t, which decides what new information to store; a cell state c_t, which carries long-term information; an output gate o_t, which decides what information to output; and a hidden state h_t, which is passed to the next time step.</p></div>
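The gating structure that gives LSTM its long-term memory can be illustrated with a single textbook LSTM step in plain Python. This is the standard scalar formulation, not the authors' specific model; all weights below are hypothetical placeholders:

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def lstm_step(x, h_prev, c_prev, w):
    """One scalar LSTM time step (textbook formulation). w maps each
    gate name to (input weight, recurrent weight, bias); the weights
    here are illustrative, not learned values."""
    f = sigmoid(w["f"][0] * x + w["f"][1] * h_prev + w["f"][2])    # forget gate
    i = sigmoid(w["i"][0] * x + w["i"][1] * h_prev + w["i"][2])    # input gate
    g = math.tanh(w["g"][0] * x + w["g"][1] * h_prev + w["g"][2])  # candidate
    o = sigmoid(w["o"][0] * x + w["o"][1] * h_prev + w["o"][2])    # output gate
    c = f * c_prev + i * g      # new cell state: keep some old, add some new
    h = o * math.tanh(c)        # new hidden state
    return h, c

# Hypothetical weights; a real model learns these from sensor data.
weights = {k: (0.5, 0.1, 0.0) for k in ("f", "i", "g", "o")}
h, c = 0.0, 0.0
for x in [0.2, 0.8, 0.1]:       # a short normalized sensor sequence
    h, c = lstm_step(x, h, c, weights)
```

The forget and input gates are what let the cell state retain or discard information over many steps, which is the property the text credits for LSTM's strength on long sensor time series.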
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Discussion</head><p>Machine learning helps to avoid certain problems by analyzing previous data and recognizing patterns, but it cannot predict absolutely all possible scenarios, especially those arising from unpredictable factors or third-party interventions.</p><p>A smart home system that uses machine learning methods proves to be better than a system without this technology: as the study results show, a smart home using failure prediction methods performs on average 22% better than a similar system without prediction (Fig. <ref type="figure" target="#fig_2">3</ref>). Machine learning allows the system to adapt to changes in the environment and user requirements, respond more quickly to new conditions, and optimize resource use. This helps to improve the system's efficiency in managing energy, comfort, safety, and user satisfaction <ref type="bibr" target="#b4">[20]</ref>.</p><p>Machine learning allows the system to predict and avoid failures, which ensures greater reliability and durability of the system. This approach also allows for increased automation, helping the system perform routine tasks without user intervention <ref type="bibr" target="#b5">[21,</ref><ref type="bibr" target="#b6">22]</ref>. Overall, a machine learning system remains the preferred choice due to its ability to predict, optimize, and adapt to changes, enabling it to provide more efficient and convenient smart home management <ref type="bibr" target="#b7">[23,</ref><ref type="bibr" target="#b8">24]</ref>. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusions</head><p>The study results have shown a wide range of modern technologies, sensors, and control systems used in smart homes. The overview has shown that existing technologies have the potential to improve convenience, security, and energy efficiency.</p><p>The analysis of available machine learning methods indicates their potential in predicting </p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Comparison of machine learning methods</figDesc><graphic coords="5,72.00,72.00,451.95,187.15" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Smart home system performance with and without machine learning methods</figDesc><graphic coords="7,304.80,287.40,155.25,307.93" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: The process of integrating an information system into a smart home</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0"><head></head><label></label><figDesc></figDesc><graphic coords="1,0.00,191.15,594.96,459.74" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>Comparative characteristics of machine learning methods</figDesc><table><row><cell>Method</cell><cell>Principle of operation</cell><cell>Application area</cell><cell>Advantages</cell><cell>Disadvantages</cell></row><row><cell>k-Nearest Neighbor</cell><cell>Determining the class of an object through its nearest neighbors</cell><cell>Detecting anomalies, predicting failures</cell><cell>Easy to implement, no training required</cell><cell>Sensitive to outliers, high computational costs</cell></row><row><cell>Decision Tree</cell><cell>Decision-making based on sequential splits by features</cell><cell>Anomaly detection, failure classification</cell><cell>Ease of interpretation, accommodates conditions</cell><cell>Prone to overfitting, instability</cell></row><row><cell>Random Forest</cell><cell>Tree ensemble to avoid overfitting</cell><cell>Detecting anomalies, predicting failures</cell><cell>High accuracy, consistency of solutions</cell><cell>A large number of hyperparameters, training time</cell></row><row><cell>Extreme Gradient Boosting</cell><cell>Using gradient boosting to improve accuracy</cell><cell>Failure prediction, anomaly detection</cell><cell>High accuracy, less prone to overfitting</cell><cell>A large number of hyperparameters, complexity of interpretation</cell></row><row><cell>Naive Bayes</cell><cell>Using Bayes' theorem for probabilistic classification</cell><cell>Filtering anomalies, detecting failure patterns</cell><cell>Efficiency for small data, simplicity of the model</cell><cell>Predictions are not flexible enough</cell></row><row><cell>Support Vector Machine</cell><cell>Determining the optimal boundary of separation between classes</cell><cell>Classification of anomalies, forecasting failures</cell><cell>Efficiency in high-dimensional spaces, flexibility</cell><cell>Data preparation requirements, high complexity of tuning</cell></row><row><cell>Logistic Regression</cell><cell>Determining the probability of an object belonging to a certain class</cell><cell>Failure classification, anomaly detection</cell><cell>Interpretability, ease of implementation</cell><cell>Requires a linearly separable boundary</cell></row><row><cell>Stochastic Gradient Descent</cell><cell>Using a gradient to optimize the loss function</cell><cell>Model training, failure analysis</cell><cell>Fast learning, efficient for big data</cell><cell>Hyperparameter requirements, tendency to get stuck</cell></row><row><cell>Multi-Layer Perceptron</cell><cell>A neural network with one or more hidden layers</cell><cell>Pattern recognition, time series forecasting</cell><cell>Ability to solve complex problems</cell><cell>Requires a lot of data for training, training time</cell></row><row><cell>LSTM</cell><cell>Recurrent neural network for long-term memorization</cell><cell>Time sequence analysis, failure prediction</cell><cell>Ability to recognize dependencies over time</cell><cell>High computational costs, complexity of setup</cell></row><row><cell cols="5">The main metrics for evaluating different machine learning algorithms in prediction or classification tasks allow for an objective comparison of the effectiveness of these algorithms.</cell></row><row><cell cols="5">Accuracy represents the percentage of correctly classified cases in the total number of cases, which gives a general idea of the algorithm's accuracy [11]:</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3</head><label>3</label><figDesc>Effectiveness of different machine learning methods by key metrics</figDesc><table><row><cell>Method</cell><cell>Accuracy</cell><cell>Classification accuracy</cell><cell>Completeness</cell><cell>F1-average</cell><cell>ROC-AUC</cell><cell>Effectiveness</cell></row><row><cell>k-Nearest Neighbor</cell><cell>0.85</cell><cell>0.81</cell><cell>0.89</cell><cell>0.85</cell><cell>0.92</cell><cell>High</cell></row><row><cell>Decision Tree</cell><cell>0.78</cell><cell>0.82</cell><cell>0.75</cell><cell>0.76</cell><cell>0.85</cell><cell>High</cell></row><row><cell>Random Forest</cell><cell>0.81</cell><cell>0.85</cell><cell>0.79</cell><cell>0.80</cell><cell>0.88</cell><cell>High</cell></row><row><cell>Extreme Gradient Boosting</cell><cell>0.87</cell><cell>0.88</cell><cell>0.86</cell><cell>0.87</cell><cell>0.94</cell><cell>High</cell></row><row><cell>Naive Bayes</cell><cell>0.75</cell><cell>0.79</cell><cell>0.72</cell><cell>0.73</cell><cell>0.82</cell><cell>Average</cell></row><row><cell>Support Vector Machine</cell><cell>0.82</cell><cell>0.84</cell><cell>0.80</cell><cell>0.81</cell><cell>0.89</cell><cell>High</cell></row><row><cell>Logistic Regression</cell><cell>0.79</cell><cell>0.83</cell><cell>0.77</cell><cell>0.78</cell><cell>0.86</cell><cell>Average</cell></row><row><cell>Stochastic Gradient Descent</cell><cell>0.80</cell><cell>0.82</cell><cell>0.79</cell><cell>0.80</cell><cell>0.87</cell><cell>Average</cell></row><row><cell>Multi-Layer Perceptron</cell><cell>0.84</cell><cell>0.86</cell><cell>0.82</cell><cell>0.83</cell><cell>0.91</cell><cell>High</cell></row><row><cell>LSTM</cell><cell>0.88</cell><cell>0.90</cell><cell>0.87</cell><cell>0.88</cell><cell>0.95</cell><cell>Very high</cell></row></table></figure>
		</body>
		<back>
			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The step-by-step algorithm for training an LSTM model includes the following steps:</p><p>1. Data preparation:</p><p>• Input: receive a dataset containing time series or sequences. The approach presented in this study takes advantage of LSTM to predict time series. The LSTM is implemented using Keras (a high-level neural network interface that simplifies the process of creating and training artificial neural networks; it is a machine learning library that runs on top of the TensorFlow, Theano, and Microsoft Cognitive Toolkit frameworks) as a sequential model with two LSTM layers and a dense output layer. It receives a sequence of inputs (normalized feature values without outliers) and outputs a sequence of failure values.</p><p>It was decided to use a single data input, as this significantly reduces the training time and is sufficient for the algorithm to recognize failure patterns. The length of the input sequence determines the runtime. Prediction performance is tested on three different input sequence lengths: 1 (1 second), 300 (5 minutes), and 1800 (30 minutes). As a result, three different LSTM models were built. For training, a dataset with 70% failures was used, split into training and test sets in the proportion of 25-75% without shuffling. In addition, the data was prepared by creating output sequences for each data record, which were then used for training.</p><p>Based on the conducted research, a data prediction platform has been developed. Once the system has been successfully integrated into the smart home, the implementation process takes place, and the system becomes an active part of the home environment. However, this is only the beginning: further support, optimization, and continuous improvement of the system play a key role in ensuring its long-term and efficient operation in a smart home, adapting to changing needs and conditions (Fig. <ref type="figure">2</ref>). 
and managing risks in smart homes. The considered models have shown high accuracy in predicting failures.</p><p>The following areas can be considered for further development of this work:</p><p>• Improving machine learning methods to increase the accuracy of failure prediction. • In-depth study of the impact of the introduction of machine learning systems on the functioning of a smart home. Based on the obtained data, it is possible to build a methodology for predicting failures in a smart home, which will be the direction of the authors' next research. </p></div>
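The data preparation described in the annex, an unshuffled 25/75 train-test split and fixed-length input windows of 1, 300, or 1800 time steps, can be sketched as follows. The function names and the toy series are illustrative assumptions, not the authors' code:

```python
def train_test_split_ordered(records, train_frac=0.25):
    """Split sequential data without shuffling, preserving time order
    (25% training / 75% test, as in the study)."""
    cut = int(len(records) * train_frac)
    return records[:cut], records[cut:]

def make_windows(series, length):
    """Build fixed-length input windows (e.g. 1, 300, or 1800 steps)
    paired with the value right after each window as the target."""
    windows = []
    for start in range(len(series) - length):
        windows.append((series[start:start + length], series[start + length]))
    return windows

series = list(range(10))           # stand-in for a sensor/failure series
train, test = train_test_split_ordered(series)
pairs = make_windows(series, length=3)
```

Keeping the split unshuffled matters for time series: shuffling would leak future observations into the training set and inflate the measured prediction performance.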
<div xmlns="http://www.tei-c.org/ns/1.0"><head>References</head></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Problems of Computer Engineering</title>
	</analytic>
	<monogr>
		<title level="m">Conference</title>
		<title level="s">Collection of abstracts</title>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="125" to="127" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Machine Learning: Methods and Models: a Textbook for Bachelors</title>
		<author>
			<persName><forename type="first">K</forename><surname>Kononova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Masters and Doctors of Philosophy in Specialty 051</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
		<respStmt>
			<orgName>V. N. Karazin Kharkiv National University</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Economics</note>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<author>
			<persName><forename type="first">Y</forename><surname>Bondarenko</surname></persName>
		</author>
		<title level="m">Manual for the Study of the Discipline</title>
				<imprint>
			<publisher>Lira</publisher>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
	<note>Statistical Analysis of Data</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">A Review of Machine Learning Methods in the Task of forecasting Financial Time Series</title>
		<author>
			<persName><forename type="first">K</forename><surname>Hureeva</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Kudin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lisnyak</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computer Science and Applied Mathematics</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="18" to="28" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<author>
			<persName><forename type="first">I</forename><surname>Puleko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Yefimenko</surname></persName>
		</author>
		<title level="m">Architecture and Technologies of the Internet of Things: a Textbook. State University</title>
				<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
		<respStmt>
			<orgName>Zhytomyr Polytechnic</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Coding for Information Systems Security and Viability</title>
		<author>
			<persName><forename type="first">B</forename><surname>Zhurakovskyi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information Technologies and Security</title>
		<imprint>
			<biblScope unit="volume">2859</biblScope>
			<biblScope unit="page" from="71" to="84" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Optimization Algorithms of Smart City Wireless Sensor Network Control</title>
		<author>
			<persName><forename type="first">M</forename><surname>Moshenchenko</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Cybersecurity Providing in Information and Telecommunication Systems</title>
		<imprint>
			<biblScope unit="volume">3188</biblScope>
			<biblScope unit="page" from="32" to="42" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Smart Factory of Industry 4.0: Key Technologies, Application Case, and Challenges</title>
		<author>
			<persName><forename type="first">B</forename><surname>Chen</surname></persName>
		</author>
		<idno type="DOI">10.1109/ACCESS.2017.2783682</idno>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="6505" to="6519" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">A Gap Analysis of Internet-of-Things Platforms</title>
		<author>
			<persName><forename type="first">J</forename><surname>Mineraud</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.comcom.2016.03.015</idno>
	</analytic>
	<monogr>
		<title level="j">Computer Communications</title>
		<imprint>
			<biblScope unit="volume">89</biblScope>
			<biblScope unit="issue">90</biblScope>
			<biblScope unit="page" from="5" to="16" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
