<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Learning partial correlation graph for multivariate sensor data and detecting sensor com munities in smart buildings</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Xiang Xie</string-name>
          <email>xiang.xie@newcastle.ac.uk</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manuel Herrera</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tejal Shah</string-name>
          <email>tejal.shah@newcastle.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mohamad Kassem</string-name>
          <email>mohamad.kassem@newcastle.ac.uk</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Philip James</string-name>
          <email>philip.james@ncl.ac.uk</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute for Manufacturing, University of Cambridge</institution>
          ,
          <addr-line>Cambridge, CB3 0FS</addr-line>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>School of Computing, Newcastle University</institution>
          ,
          <addr-line>Newcastle upon Tyne, NE4 5TG</addr-line>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>School of Engineering, Newcastle University</institution>
          ,
          <addr-line>Newcastle upon Tyne, NE1 7RU</addr-line>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
      </contrib-group>
      <fpage>201</fpage>
      <lpage>211</lpage>
      <abstract>
        <p>The storage and processing of massive time series data collected from smart buildings consume considerable computational resources. However, major information redundancy can be found in the smart building data. This paper proposed a partial correlation graph based approach to map the dependencies among sensors and detect the sensor communities in which the sensors are strongly “net” correlated. Specifically, the sparse partial correlation estimation method is used to learn the partial correlation graph. The Louvain algorithm is used to detect the communities of sensors by optimising the graph modularity. The case study demonstrates that the proposed method can identify spare sensors in the detected sensor communities and thus enhance the computational feasibility of smart building applications.</p>
      </abstract>
      <kwd-group>
        <kwd>Smart building</kwd>
        <kwd>computational feasibility</kwd>
        <kwd>partial correlation graph</kwd>
        <kwd>community detection</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        In recent years, the Internet of Things (IoT) has become increasingly popular in the realm of
smart buildings, leading to a more livable and sustainable indoor environment. By deploying
various IoT sensors and devices, a great amount of data is generated reflecting diverse aspects
of buildings’ operations [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The heterogeneous data contains valuable information that can
be used to facilitate better-informed decision-making [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Leveraging machine learning and
many big data analytic methods, the sensor data is transformed into information and further
mined to extract knowledge. This allows machines to gain better insights and wisdom into
the building systems, following the Data-Information-Knowledge-Wisdom (DIKW) pyramid
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. However, strong spatiotemporal dependencies exist between the multivariate time series
generated by multiple sensors. It is unsustainable to treat each sensor as an independent
individual without considering the spatial correlation and temporal dynamics among them
CEUR
CEUR
Workshop
Proceedings
      </p>
      <p>
        ceur-ws.org
ISSN1613-0073
[
        <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
        ]. Because computations generate 100 megatons of CO2 emissions per year, accounting for
3% global carbon footprint. For smart buildings, redundant information is computed repeatedly,
rarely contributing to new knowledge while generating extra carbon for computing. For
example, the window open/close sensors are redundant to a certain extent if indoor and ambient
temperatures are monitored for unconditioned spaces.
      </p>
      <p>
        Graphical models have been proven as useful tools for analysing multivariate time series [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ],
which specify conditional independence relationships among a collection of random variables.
The vertices (nodes) in the graph represent variables, while the directed or undirected edges
(links) reflect the causalities or dependencies between variables. The dynamic relationships
over time among the variables help to determine and explain the causation or association
mechanisms of the underlying systems. The application of graphical time series models can
be seen in a wide range of areas, such as financial market analysis [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and brain interactivity
analysis [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. In particular, for high-dimensional stationary multivariate Gaussian time series,
the Gaussian graphical model, an undirected graph of partial correlation coeficients, allows for
sparse modelling of the underlying association structure amongst observed variables. This is
an eficient tool for modelling complex systems of myriad variables by building them using a
smaller proportion of variables.
      </p>
      <p>With the ubiquitous IoT sensors in smart buildings, buildings are modelled as systems with
humans in the loop. This requires huge computation and storage resources as well as advanced
software that implements innovative algorithms for analysing high-dimensional sensor data. In
this paper, the Gaussian graphical model, also known as the partial correlation graph, is used
to model the relationship underlying the multivariate time series data collected from smart
buildings. Based on the learned partial correlation graph, the communities of sensors could
be formed, in which a few sensors would represent the overall pattern of a community with
similar spatiotemporal features. It leads to a computationally feasible strategy to model the
diverse spatiotemporal processes for buildings in a dimension-reduced manner.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Literature review</title>
      <sec id="sec-2-1">
        <title>2.1. Feature selection for multivariate time series</title>
        <p>
          Data analysis is a computationally intensive process, particularly when dealing with large
datasets containing a significant number of variables. In this context, these “variables” are also
called “features”. The computational complexity, in terms of space or time, which measures the
total amount of either time or memory taken by an algorithm to run, often increases dramatically
with the feature dimensionality [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. To avoid the so-called “curse of dimensionality”, feature
extraction and feature selection techniques are proposed respectively. Feature extraction
methods aim to project high-dimensional data into low-dimensional subspace. Principal Component
Analysis (PCA), multidimensional scaling (MDS) and Isometric Mapping (ISOMAP) are typical
feature extraction methods [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. However, the new set of features built from the original feature
set lacks clear physical interpretations. On the other hand, feature selection methods remove
irrelevant and redundant dimensions from the raw dataset with minimal loss of information.
By filtering out unrepresentative features, the process of feature selection helps to extract
meaningful insights from the original dataset, reduce the complexity of data analysis, release
the computational burden, avoid the overfitting problems, and improve the generalisation and
interpretability capacity of the corresponding machine learning approaches [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
        </p>
        <p>
          In general, feature selection methods can be classified into three categories, which are filter,
wrapper, and embedded methods [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. Filter methods measure and rank feature importance
based on certain feature evaluation criteria quantifying their statistical characteristics, such as
Pearson correlation coeficient, Fisher score [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ], and mutual information [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. Independent of
the machine learning approach adopted for data analysis, the filter methods are computationally
eficient by simply filtering out lowly ranked features. As the alternative, wrapper methods
search for a subset of features that optimises the performance of the predefined machine learning
approach. Typical search strategies include sequential search,hill-climbing search, best-first
search, branch-and-bound search, and heuristic genetic algorithms. However, for very large 
features within 2 search space, the wrapper methods become computationally infeasible [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ].
Embedded methods provide a trade-of between filter and wrapper methods. Inheriting the
merits of the wrapper and filter methods, embedded methods introduce sparse regularisation
terms to the optimisation objective of a machine learning model to penalise complex models
and reduce the dimensionality of the input features.
        </p>
        <p>
          Many existing feature selection methods are based on an illusory assumption that features
are independent of each other while ignoring the inherent feature structures [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. In the case of
smart buildings, the inherent feature structures mainly come from the spatial proximity and
functional association of sensors inside a building. Incorporating such spatial structure can help
select more important features for smart building applications. In this paper, a partial correlation
graph of multivariate time series data from smart buildings is learned, where undirected edges
with weights indicate the pairwise dependencies between features represented by vertices.
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Partial correlation graph of time series</title>
        <p>
          Partial correlation graphs have long been used to explicitly map the dependencies between
various stationary variables in multiple domains, such as river stage forecasting [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. By
identifying the nonzero entities in the partial correlation matrix (equivalent or related to
inverse covariance matrix, concentration matrix and precision matrix), the topology of the
partial correlation graph can be reconstructed and the conditional dependencies among the
observed variables can be elucidated [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. The partial correlation coeficients, ranging from −1
to 1, encode the dependence between two variables after eliminating the influence of all the
remaining variables. When determining the conditional dependency between two variables of
interest A and B, the conventional Pearson correlation coeficient may give misleading results
when another confounding variable C has the so-called common cause or common efect on
both A and B. By computing the partial correlation coeficients, the spurious correlation can be
eliminated where only the “unbiased relationship” of A and B remains.
        </p>
        <p>
          Estimating partial correlation coeficients comes down to calculating the inverse of the
covariance matrix. Studies have shown that partial correlations are proportional to not only the
multiple linear regression coeficients but also the of-diagonal entries of the inverse covariance
matrix [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. Especially, in cases where the number of variables is greater than the number
of observations, the rank of the covariance matrix is equal to the number of observations,
and methods like generalised inverse or pseudo-inverse need to be adopted to tackle this
illposed inverse problem. However, these methods often fail to provide accurate and sparse
interpretable solutions. To address this issue, regularisation techniques are explored to impose
sparse constraints on the inverse covariance matrix. The overall sparsity assumption of the
partial correlation matrix is reasonable and taken for granted for many real-world problems.
L1-norm regularisation and elastic net regularisation, which is the combination of L1- and
L2-norm regularisation, have been used to extract sparse nonzero partial correlation coeficients,
with the elastic net regularisation showing stronger robustness for the estimation.
        </p>
        <p>
          In the graphical model for the multivariate time series, the number of possible edges between
vertices grows quadratically. Fortunately, there usually exists a corresponding sparse graph such
that the edges directly linked to each vertice are few [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. By taking this sparsity into account, it
is possible to develop graph models with good generalisation and predictive capability from far
fewer samples. This is the desired character for smart building applications, where the analysis
can be conducted and the decisions can be made relying on fewer data instead of all.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <sec id="sec-3-1">
        <title>3.1. Estimation of partial correlation graph</title>
        <p>Considering a weighted undirected graph  = { , ℰ }
with vertices indexed by  = {1, 2, ⋯ , }
and a corresponding set of undirected edges ℰ ⊆ ℝ× , let   ∈ ℝ
 denote the time series
observed from the vertice  and the edge from vertice  to vertice  has a weight   (,  ∈ [ 1, ]
The weighted undirected graph becomes the partial correlation graph when the respective
).
weights   are assigned the partial correlation coeficients between these variables
 and  .</p>
        <p>More precisely, the edge between vertice  and  exists if and only if   and   are conditionally
independent given the remaining  − 2</p>
        <p>variables. Suppose the multivariate Gaussian time
series data  = [ 1,  2, ⋯ ,   ] has positive-definite covariance matrix  = (
covariance matrix  =  −1. The (, ) element in  , represented by   , is nonzero when   and  
 ) and inverse
observed by vertice  and  are conditionally dependent straightforward.</p>
        <p>
          Acquiring the inverse of the covariance matrix becomes an ill-posed problem when the
number of observations  is less than the number of variables  . In this case, the covariance
matrix  is singular and non-invertible. A computationally eficient sparse partial correlation
matrix estimator, Sparse PArtial Correlation Estimation (SPACE) method, is proposed in [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ].
SPACE method alternates between solving for the partial correlation matrix  and for the
diagonal of the precision matrix  with the objective function of:
        </p>
        <p>1
2 =1
(, Θ) =
∑ ||  − ∑  
  ||22 +</p>
        <p>∑
1≤&lt;≤</p>
        <p>|  |

≠
  = −</p>
        <p>jj
√  ii
 ij
√ ii jj
(1)
(2)</p>
        <p>To solve this, a Least Absolute Shrinkage and Selection Operator (LASSO) problem is
formulated. Typically, the optimisation problem converges after 2 to 3 iterations.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Inference of sensor communities in smart buildings</title>
        <p>
          By 2026, the number of sensors deployed in smart buildings will exceed one billion, according
to a study from Juniper Research. With this growth, more smart building applications will
emerge to gain insights and make better-informed decisions using the vast amount of building
data collected [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. Assuming  sensors are deployed in a specific area of a building, the data
accumulates with time according to the sensor sampling rate or the frequency of changes in
sensor data values. However, the time series data from these  sensors are not independent
of each other due to the spatial correlations. To reflect the dependencies among these sensor
readings, a partial correlation graph of the multivariate time series collected from  sensors
is learned, as illustrated in Figure 1. These sensors can be of the same type, such as carbon
dioxide (CO2) sensors within the same space. Typically, they provide similar readings under the
well-mixed assumption when the sensor sampling interval is greater than 10 minutes. Strong
dependencies also exist between data generated by diferent types of sensors. For instance, the
volatile organic compound (VOC) concentration is highly correlated with the CO2 concentration
because CO2 serves as a common surrogate indicator for indoor air quality.
        </p>
        <p>To identify intensively tied sensors with considerable data redundancy, the community
detection approach is used to cluster dependent sensors based on the estimated partial correlation
graph [21]. A community refers to a group of vertices that are closely connected to each other
but less connected to the vertices outside the group. Detected communities contain groups of
sensors that generate highly correlated time series. Louvain algorithm is a classical method to
extract communities from networks, which optimises the defined modularity of communities
indicating the density of (weighted) edges within communities with respect to edges outside
communities [22]. The choice of Louvain algorithm is due to its properties of computational
eficiency and scalability that make it suitable even for large-size networks. The modularity
score is formulated as:
, where  is the sum of all edge weights in the undirected graph,   denotes the weight of the
edge between vertice  and  ,   and   , represents the sum of weights connecting vertice  and
 ,   and   are the communities of the vertices, and (⋅) is the Kronecker delta function. The
Louvain algorithm alternatively conducts modularity optimization and community aggregation.
This process is repeated until no further increase in modularity is possible and the detected
communities stabilise, leading to an optimal partition of the partial correlation graph into
communities of sensors.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Case study</title>
      <p>To validate the proposed methodology, the urban science building (USB) of Newcastle University,
a part of the Newcastle Helix site within Newcastle city centre, is used as the testbed. With
over 4,000 sensors, computing technology is embedded throughout the building’s structure,
making it one of the most intensively monitored buildings in the UK. Figure 2 (a) shows the
exterior of the urban science building and Figure 2 (b) shows the layout of its second floor and
the included spaces (red cubes indicating the centre of spaces) where sensors are deployed.</p>
      <p>In this case study, the carbon dioxide (CO2) concentration sensors on the building’s second
lfoor are used to demonstrate the information redundancy residing in the collected sensor data.
Figure 3 presents the CO2 concentrations measured by 24 sensors, in the unit of ppm (parts per
million). The 24-hour data was collected on Feb 15, 2023, which is a Wednesday. The sensor data
is cleaned and preprocessed to impute the missing sensor values and resampled to 15-minute
intervals. The sensors are labelled with the respective room names they are located in (i.e.,
‘R’+floor_number+‘.’+room_number). Note that, more than one sensor may be deployed in
one open space, in which the sensors inside are specified by the additional zone number (i.e.,
‘R’+floor_number+‘.’+room_number+‘-Z’+zone_number).</p>
      <p>The partial correlation graph is learned based on the collected daily CO2 data. Figure 4
visualises the partial correlation graph of the sensor time series, in which the edges indicate the
dependencies between the CO2 concentrations measured at proximate locations. The stronger
the dependency is, the thicker the corresponding edge is. The highest partial correlation
coeficient emerges between the CO2 concentrations measured in R2.037-Z1 and R2.037-Z2,
followed by the CO2 concentrations measured in R2.058-Z1 and R2.058-Z2. The Louvain
C1
C2
C2
C5
C2
C1</p>
      <p>C2
C2
C2
C5
C5
C1</p>
      <p>C2
C2
C1
C1
C5
C1</p>
      <p>C3
C4
C1
C1
C2
C2
algorithm is used to detect the communities of sensors, the results of which are given in Table 1.
Besides, the community numbers are shown in the top right corner of each time series in Figure
3. The detected 5 communities make sense to a certain extent. For example, rooms R2.048
and R2.037 are open areas next to each other and therefore share similar CO2 concentration
readings. Same for the room R2.060 and the neighbouring zone R2.048-Z4. An interesting
phenomenon can be observed. Rooms like R.027 and R.060 are clustered into diferent groups
although showing a correlation in-between. This is because they are clustered based on their
proximity to the centroid of that cluster, rather than the ”distance” between them.</p>
      <p>The detected communities of sensors can help to identify the spare sensors deployed in near
spaces. In practice, the deployment of IoT sensors in smart buildings largely follows intuition.
In the case of USB, at least one CO2 sensor is deployed in each occupied space, and for large
open spaces, one CO2 sensor is responsible for monitoring an individual zone. The learned
partial correlation graph and the detected communities of sensors indicate that some sensors
located in proximate locations basically provide the same piece of information. In such cases,
virtual sensors can be defined by fusing multivariate time series data from duplicated sensors,
which reduces the size of sensor data to be stored and processed.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Discussion</title>
      <p>To deal with the massive time series data from smart buildings, a partial correlation graph-based
methodology is proposed in this paper. The partial correlation graph reflects the conditional
independence relationships among the multivariate time series data generated by multiple
sensors distributed in diferent spaces. However, the conditional independence relationship
does not necessarily correspond to the causality. This is why some distanced sensors can still
appear in the same community. In the case of USB, some sensors deployed near the corridors
emerge in the same community, probably because the CO2 concentrations are afected by
similar occupants’ activities. To bring the spatial information into the equation, the spatial
adjacency graph, which describes the spatial relationships among sensors, can be converted
from the semantic models using ontology such as the Building Topology Ontology (BOT) [23].
By integrating the partial correlation graph and the spatial adjacency graph, the detected
communities would be more physically interpretable. Furthermore, because the dependencies
between sensor data rely on human activities as well, a sliding window approach will be applied
to detect the changes in the sensor communities over time. The dynamic sensor communities
with weekday/weekend, season, and year patterns are expected.</p>
      <p>Admittedly, the building semantic model can be enriched using the spatiotemporal features
extracted from the sensor data. But we reserve the opinion that the semantic model is not the
best repository for such spatiotemporal-wise knowledge considering the uncertainties and more
importantly dynamics. The learned graph in the case study only reflects the spatiotemporal
pattern of a specific day, and we expect weekly, monthly, seasonal or annual changes in
the acquired patterns. The semantic model, as it is, is not suitable for this type of dynamic
information, unless the objects and relations can be timestamped. Alternatively, these periodic
spatiotemporal patterns can be encoded in machine learning models. Further studies are needed
to tackle this challenge.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion</title>
      <p>The emergence of the smart building concept brings great opportunities and challenges. One
of the main challenges is the massive data generated every minute and every second. For the
multivariate time series data from smart buildings, the partial correlation graph is learned using
the sparse partial correlation estimation method, which maps the dependencies among the
sensors in the smart building. Leveraging the learned partial correlation graph, the sensors are
clustered into diferent communities, in which the sensors are strongly tied. The urban science
building of Newcastle University is used as the case study. The results demonstrate that the
proposed methodology can uniquely identify spare sensors in a detected community of sensors
that barely provide extra information to smart building applications. It leads to a computationally
feasible approach to reduce the volume of sensor data with minimum information loss.
count? exploring the applicability of smart building applications in the post-pandemic
period, Sustainable Cities and Society 69 (2021) 102804.
[20] M. U. Younus, S. ul Islam, I. Ali, S. Khan, M. K. Khan, A survey on software defined
networking enabled smart buildings: Architecture, challenges and use cases, Journal of
Network and Computer Applications 137 (2019) 62–77.
[21] B. S. Khan, M. A. Niazi, Network community detection: A review and visual survey, arXiv
preprint arXiv:1708.00977 (2017).
[22] V. D. Blondel, J.-L. Guillaume, R. Lambiotte, E. Lefebvre, Fast unfolding of communities
in large networks, Journal of statistical mechanics: theory and experiment 2008 (2008)
P10008.
[23] M. H. Rasmussen, M. Lefrançois, G. F. Schneider, P. Pauwels, BOT: The building topology
ontology of the w3c linked building data group, Semantic Web 12 (2021) 143–161.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A. P.</given-names>
            <surname>Plageras</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. E.</given-names>
            <surname>Psannis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Stergiou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. B.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <article-title>Eficient IoT-based sensor big data collection-processing and analysis in smart buildings</article-title>
          ,
          <source>Future Generation Computer Systems</source>
          <volume>82</volume>
          (
          <year>2018</year>
          )
          <fpage>349</fpage>
          -
          <lpage>357</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>B.</given-names>
            <surname>Qolomany</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Al-Fuqaha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Benhaddou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Alwajidi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Qadir</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. C. Fong,</surname>
          </string-name>
          <article-title>Leveraging machine learning and big data for smart buildings: A comprehensive survey</article-title>
          ,
          <source>IEEE Access 7</source>
          (
          <year>2019</year>
          )
          <fpage>90316</fpage>
          -
          <lpage>90356</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Ardolino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rapaccini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Saccani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Gaiardelli</surname>
          </string-name>
          , G. Crespi,
          <string-name>
            <given-names>C.</given-names>
            <surname>Ruggeri</surname>
          </string-name>
          ,
          <article-title>The role of digital technologies for the service transformation of industrial companies</article-title>
          ,
          <source>International Journal of Production Research</source>
          <volume>56</volume>
          (
          <year>2018</year>
          )
          <fpage>2116</fpage>
          -
          <lpage>2132</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Royal</given-names>
            <surname>Society</surname>
          </string-name>
          ,
          <article-title>Digital technology and the planet: Harnessing computing to achieve net zero</article-title>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>L.</given-names>
            <surname>Lannelongue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Grealey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Inouye</surname>
          </string-name>
          ,
          <article-title>Green algorithms: quantifying the carbon footprint of computation</article-title>
          ,
          <source>Advanced science 8</source>
          (
          <year>2021</year>
          )
          <fpage>2100707</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J. K.</given-names>
            <surname>Tugnait</surname>
          </string-name>
          ,
          <article-title>On sparse high-dimensional graphical model learning for dependent time series</article-title>
          ,
          <source>Signal Processing 197</source>
          (
          <year>2022</year>
          )
          <fpage>108539</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>T.</given-names>
            <surname>Millington</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Niranjan</surname>
          </string-name>
          ,
          <article-title>Partial correlation financial networks</article-title>
          ,
          <source>Applied Network Science</source>
          <volume>5</volume>
          (
          <year>2020</year>
          )
          <fpage>1</fpage>
          -
          <lpage>19</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Fang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Liang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. D.</given-names>
            <surname>Calhoun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.-P.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Aberrant brain connectivity in schizophrenia detected via a fast gaussian graphical model</article-title>
          ,
          <source>IEEE journal of biomedical and health informatics 23</source>
          (
          <year>2018</year>
          )
          <fpage>1479</fpage>
          -
          <lpage>1489</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>O. Y.</given-names>
            <surname>Al-Jarrah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. D.</given-names>
            <surname>Yoo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Muhaidat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. K.</given-names>
            <surname>Karagiannidis</surname>
          </string-name>
          ,
          <string-name>
            <surname>K. Taha,</surname>
          </string-name>
          <article-title>Eficient machine learning for big data: A review</article-title>
          ,
          <source>Big Data Research</source>
          <volume>2</volume>
          (
          <year>2015</year>
          )
          <fpage>87</fpage>
          -
          <lpage>93</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>P.</given-names>
            <surname>Ray</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. S.</given-names>
            <surname>Reddy</surname>
          </string-name>
          , T. Banerjee,
          <article-title>Various dimension reduction techniques for high dimensional data analysis: a review</article-title>
          ,
          <source>Artificial Intelligence Review</source>
          <volume>54</volume>
          (
          <year>2021</year>
          )
          <fpage>3473</fpage>
          -
          <lpage>3515</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>B.</given-names>
            <surname>Chizi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Maimon</surname>
          </string-name>
          ,
          <article-title>Dimension reduction and feature selection, Data mining and knowledge discovery handbook (</article-title>
          <year>2010</year>
          )
          <fpage>83</fpage>
          -
          <lpage>100</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          , K. Cheng,
          <string-name>
            <given-names>S.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Morstatter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. P.</given-names>
            <surname>Trevino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tang</surname>
          </string-name>
          , H. Liu,
          <article-title>Feature selection: A data perspective, ACM computing surveys (CSUR) 50 (</article-title>
          <year>2017</year>
          )
          <fpage>1</fpage>
          -
          <lpage>45</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>L.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Ding</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <article-title>Feature selection using fisher score and multilabel neighborhood rough sets for multilabel classification</article-title>
          ,
          <source>Information Sciences 578</source>
          (
          <year>2021</year>
          )
          <fpage>887</fpage>
          -
          <lpage>912</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bennasar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Hicks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Setchi</surname>
          </string-name>
          ,
          <article-title>Feature selection using joint mutual information maximisation</article-title>
          ,
          <source>Expert Systems with Applications</source>
          <volume>42</volume>
          (
          <year>2015</year>
          )
          <fpage>8520</fpage>
          -
          <lpage>8532</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>M.</given-names>
            <surname>Ghaemi</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.-R.</surname>
          </string-name>
          Feizi-Derakhshi,
          <article-title>Feature selection using forest optimization algorithm</article-title>
          ,
          <source>Pattern Recognition</source>
          <volume>60</volume>
          (
          <year>2016</year>
          )
          <fpage>121</fpage>
          -
          <lpage>129</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>S. R.</given-names>
            <surname>Venna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Katragadda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Raghavan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Gottumukkala</surname>
          </string-name>
          ,
          <article-title>River stage forecasting using enhanced partial correlation graph</article-title>
          ,
          <source>Water Resources Management</source>
          <volume>35</volume>
          (
          <year>2021</year>
          )
          <fpage>4111</fpage>
          -
          <lpage>4126</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>J.</given-names>
            <surname>Peng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <article-title>Partial correlation estimation by joint sparse regression models</article-title>
          ,
          <source>Journal of the American Statistical Association</source>
          <volume>104</volume>
          (
          <year>2009</year>
          )
          <fpage>735</fpage>
          -
          <lpage>746</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>A.</given-names>
            <surname>Venkitaraman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zachariah</surname>
          </string-name>
          ,
          <article-title>Learning sparse graphs for prediction of multivariate data processes</article-title>
          ,
          <source>IEEE Signal Processing Letters</source>
          <volume>26</volume>
          (
          <year>2019</year>
          )
          <fpage>495</fpage>
          -
          <lpage>499</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>X.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Herrera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Parlikad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Schooling</surname>
          </string-name>
          ,
          <article-title>Does historical data still</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>