<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Performance Comparison of Graph Analytic Methods for Supporting Enterprise Architecture Model Maintenance</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nikhitha Rajashekar</string-name>
          <email>nikhitha93@gmail.com</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Simon Hacks</string-name>
          <email>hacks@swc.rwth-aachen.de</email>
          <email>shacks@kth.se</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nuno Silva</string-name>
          <email>nuno.silva@tecnico.ulisboa.pt</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science and Engineering, Technical University of Lisbon</institution>
          ,
          <addr-line>Lisbon</addr-line>
          ,
          <country country="PT">Portugal</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Division of Network and Systems Engineering, KTH Royal Institute of Technology</institution>
          ,
          <country country="SE">Sweden</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Research Group Software Construction, RWTH Aachen University</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Enterprise Architecture (EA) models help enterprise architects to make business decisions and support organisations in understanding and analysing their structure. Creating and maintaining such an EA model is expensive and time-consuming in large organisations. In this paper, the authors provide a review of how graph analytic methods can be used to support EA model maintenance. To achieve the best results for the duplicate identification, it is necessary to evaluate and optimise different graph analytic and Machine Learning (ML) algorithms to support the maintenance process.</p>
      </abstract>
      <kwd-group>
        <kwd>Enterprise Architecture</kwd>
        <kwd>Machine Learning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        The practice of Enterprise Architecture (EA) is, inter alia, about providing a
blueprint and roadmap for aligning business strategy with IT [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Usually, in
large organisations, it is an effortful task for a single individual to understand
how the various enterprise components connect and work together. Thus, EA
can be used to create a visual representation of the enterprise system.
      </p>
      <p>
        In today's organisations, it is necessary to deliver insights quickly and analyse
large amounts of enterprise data. In this context, it is essential to understand
and model the EA components and relationships. EA models comprise
a set of components, representing the enterprise's assets, and the links between
them. This interpretation allows EA models to be understood as graphs.
Graph-based tools provide an advantage to enterprise architects by providing a unified
view of EA that helps in identifying and understanding it [
        <xref ref-type="bibr" rid="ref21 ref27">27, 21</xref>
        ].
      </p>
      <p>Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).</p>
      <p>
        To provide an effective analysis of the enterprise, the underlying EA repository
needs to be sound and up-to-date. Maintaining an EA model that is up-to-date
and consistent with the enterprise can be an effortful task due to the size
and complexity of EA models, frequent changes in the architecture, and the
challenges of collecting and managing EA information from different stakeholders
[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Due to the increase in information systems, as a manifestation of the digital
transformation age, the documentation of enterprise information as an inherent
part of EA model design is often regarded as time-consuming, cost-intensive,
and error-prone [
        <xref ref-type="bibr" rid="ref15 ref8 ref9">9, 8, 15</xref>
        ].
      </p>
      <p>
        Previously, we elaborated on different Machine Learning (ML)
techniques to support enterprise architects by identifying possible duplicates within
EA models, which cause an unnecessary expansion [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. However, in our previous
research, we identified potential algorithms that can support enterprise
architects, but we did not focus on the run time of the algorithms. Their long
computation times prevent our approach from being adopted in working environments.
To cope with this challenge, we set up an ML study addressing the following
research question:
      </p>
      <p>Which graph analytic method performs best to identify duplicate components
in EA models?</p>
      <p>In line with the above, we compare different approaches which address a
specific kind of challenge in EA model maintenance: the identification of
duplicate components in the EA repository. In the following sections, we first discuss
the related literature. We then present the case study, including the conducted
research methodology, and introduce the different graph analytic approaches to
investigate the EA models. Finally, we illustrate the experimental results, point
out the identified limitations, and conclude our work.</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Previous research [
        <xref ref-type="bibr" rid="ref27 ref29 ref7">7, 27, 29</xref>
        ] was motivated by analysing EA models as graphs
and applying different ML concepts to EA models to provide decision support
for enterprise architects. The work of Dreyfus and Iyer [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] focuses on representing
the complexity of information systems architecture in social network terms and
then capturing insights from the graph representation.
      </p>
      <p>
        Different similarity approaches have been proposed in the literature to
identify the similarity and the differences between models and their components to
be matched. The work of Dijkman et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] presents three similarity metrics to
investigate matching of similar business process models in a given repository,
namely (i) structural similarity that compares element labels by considering
topology structure of business process models; (ii) behavioural similarity that
compares element labels by considering behavioural semantics of process
models; and (iii) label matching similarity that compares elements based on words
in the labels of business process model elements (string edit distance).
      </p>
      <p>
        The work of Aier and Schönherr [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] presents a clustering approach for
determining the structure of Service Oriented Architectures (SOA). The paper shows the
application of clustering algorithms in supporting the design of a SOA. However,
the publication does not present the evaluation criteria for comparing different
clustering methods. Discovering communities or clusters has been extensively
studied in various research areas [
        <xref ref-type="bibr" rid="ref11 ref25">11, 25</xref>
        ]. Several research papers discuss the
comparison of community detection algorithms [
        <xref ref-type="bibr" rid="ref22 ref23 ref32">22, 23, 32</xref>
        ].
      </p>
      <p>
        Our research is also linked to the field of model reuse for UML
diagrams. Ganser et al. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] developed an approach to encourage the usage of models
or rather parts of models. To this end, they transform the diagrams into graphs
and store them in a library to recommend them to the modeller if necessary.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Research Approach</title>
      <p>
        As already stated, we aim to identify the best-performing algorithm to identify
duplicate components in EA models. Therefore, we compare different ML
algorithms inspired by the ML life-cycle model provided by [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. The main idea is
to improve the performance of a model by comparing different algorithms and
metrics:
        <list list-type="order">
          <list-item><p>Raw data: Any EA model can be expressed in XML, which is mainly used for storage and communication purposes. Thus, we first acquire our EA model in XML format, which serves as input for our approach.</p></list-item>
          <list-item><p>Data preprocessing: This step includes transforming raw data into a relevant format and cleaning the data set. We parse the raw data obtained from the elements and relations files into a graph data object using the igraph library [<xref ref-type="bibr" rid="ref5">5</xref>]. Next, we normalise the data to standardise the range of the independent variables and features.</p></list-item>
          <list-item><p>Feature engineering: The next step is to extract or compute relevant graph feature statistics. The features extracted from the data directly influence the predictive models and, consequently, the results. Extracting the right feature set can boost the performance of a model.</p></list-item>
          <list-item><p>Data modelling: After extracting the relevant data set and feature set, the next step is to perform the core ML task. Making use of supervised/unsupervised learning algorithms, we can evaluate similarity and investigate the clusters/communities formed to analyse EA models in a different context.</p></list-item>
          <list-item><p>Model evaluation: The model evaluation mainly supports finding the best-performing model based on suitable performance metrics that represent the data and how well the chosen algorithm will work in the future.</p></list-item>
          <list-item><p>Performance improvement: Iterating over different graph analytic or ML algorithms and feature sets can yield optimised results.</p></list-item>
        </list>
      </p>
      <p>In order to evaluate different algorithms, it is essential to know what the optimal
solution for a particular scenario looks like. Therefore, we sketched an
airport departure system, which depicts the steps passengers go through before
boarding an aircraft. This case study strategy provides a richer understanding
of the context for the people and the researcher involved. Further, it is used for
the evaluation of all algorithms presented in the following.</p>
      <p>
        This example is modelled as an EA model based on ArchiMate 3.0.1. The
model incorporates the ArchiMate layers business, application, and technology.
It consists of 171 different elements and 250 relations. An excerpt of the model
is presented in Figure 1.
      </p>
      <p>
        We depict the EA model as a graph, representing ArchiMate components as
nodes and relationships between the components as edges. Analysing the EA
model as a graph thus provides a deeper understanding. As there are several graph
analytic techniques, we focus on graph-based similarity and community detection
(unsupervised learning).
      </p>
      <p>
        Considering an EA model as a graph-like structure, we apply graph similarity
techniques to assess the match between EA models. Graph similarity involves
determining a degree of similarity between two graphs [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], quantified by a similarity
score between 0 (no similarity) and 1 (complete similarity).
      </p>
      <p>Computing node similarity. Node matching similarity is based on examining
the content of the components' names. Therefore, we compare EA models
whose component names differ only slightly, for example because of typing
errors, while both models express the same content.</p>
      <p>
        Syntactic Similarity. The syntactic approach relates to both the occurrence
of terms and the number of words in the text. In this paper, we use the Cosine
similarity index [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and Jaccard similarity index [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] as a syntactic approach to
detect the match between components of two EA models. These two algorithms
serve as a starting point for our research and will be extended in the future by other
algorithms such as Levenshtein, N-Gram, Q-Gram, or the Sørensen-Dice coefficient.
      </p>
      <p>To find the Cosine and Jaccard similarity scores between the phrases
(component names) present in two EA models, the phrases are split into words, and
the words are then converted to binary vectors.</p>
      <p>
        Apart from the cosine similarity, another well-known measure for determining
the degree of similarity is the Jaccard similarity index. The Jaccard similarity
index measures the similarity between finite sample sets and is defined as the
cardinality of the intersection of the sets divided by the cardinality of the union of
the sample sets [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
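      <p>The two syntactic measures described above can be sketched as follows. This is a minimal illustration, assuming whitespace tokenisation and lower-casing; the function names and the two component names are hypothetical and are not taken from the case study model or from the implementation used in this paper.</p>
      <preformat>
```python
import math


def tokens(name):
    """Lower-case a component name and split it into a set of words."""
    return set(name.lower().split())


def cosine_binary(a, b):
    """Cosine similarity of the binary word vectors of two names."""
    wa, wb = tokens(a), tokens(b)
    if not wa or not wb:
        return 0.0
    # For binary vectors the dot product equals the number of shared words,
    # and each vector norm is the square root of its word count.
    return len(wa.intersection(wb)) / math.sqrt(len(wa) * len(wb))


def jaccard(a, b):
    """Jaccard index: size of the intersection over size of the union."""
    wa, wb = tokens(a), tokens(b)
    union = wa.union(wb)
    if not union:
        return 0.0
    return len(wa.intersection(wb)) / len(union)


# Hypothetical component names, not taken from the case study model.
name_1 = "Identifying boarding pass"
name_2 = "Identify boarding pass"
print(round(cosine_binary(name_1, name_2), 3))  # 0.667
print(round(jaccard(name_1, name_2), 3))        # 0.5
```
      </preformat>
      <p>In this toy case two of three words are shared, so the cosine score (2/3) is higher than the Jaccard score (2/4), because the Jaccard index divides by the size of the union rather than by the geometric mean of the set sizes.</p>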
      <p>
        Latent Semantic Analysis. An alternative method to compare the similarity
between EA models is based on fetching the relations between words in the texts.
LSA [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] is a corpus-based method which does not use the semantic network,
grammar, syntax, and dictionaries. LSA applies a statistical analysis of the
latent structures of the documents by finding the underlying meaning or concepts
between the documents. It maps the words in a document into a concept
space and then compares them within that space.
      </p>
      <p>
        Computing edge similarity. The second similarity metric uses the structure
of the EA model. This method is applicable when the node correspondence
is known and the edges do not vary significantly. The main idea behind this
approach is that a node in one graph is said to be similar to a node in another graph
if they share a common set of neighbours. This works recursively [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ].
      </p>
      <p>Architect 1 models the application layer as shown in Figure 2a, which contains
a realisation relationship between Airline administration
support (Application Component) and Identifying boarding pass (Application
Service) and between Boarding control (Application Component) and Security
(Application Service). Architect 2 models a similar kind of application layer, as
shown in Figure 2b. Both models have an equal number of components with
varying edge connections. Since both application components collaborate
within Boarding &amp; departure control system (Application Collaboration),
there is a similarity between the models generated by the two architects
irrespective of the edge connections.</p>
      <p>
        Motivated by the work of Liben-Nowell and Kleinberg [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], we selected the
most widely used structural similarity measures based on edges, namely Jaccard
[
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], Dice [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ], and Adamic-Adar [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] similarity indexes.
      </p>
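      <p>The three neighbourhood-based indexes named above can be sketched for a single pair of vertices as follows. This is an illustrative sketch using the standard textbook definitions on a plain adjacency mapping; the toy graph and the function names are hypothetical, and a graph library may differ in details such as the treatment of self-loops.</p>
      <preformat>
```python
import math

# Hypothetical undirected toy graph as an adjacency mapping (node -> neighbours).
graph = {
    "A": {"B", "C"},
    "B": {"A", "C", "D"},
    "C": {"A", "B", "D"},
    "D": {"B", "C"},
}


def jaccard_index(g, u, v):
    """Shared neighbours divided by all neighbours of either node."""
    union = g[u].union(g[v])
    return len(g[u].intersection(g[v])) / len(union) if union else 0.0


def dice_index(g, u, v):
    """Twice the shared neighbours divided by the sum of both degrees."""
    total = len(g[u]) + len(g[v])
    return 2 * len(g[u].intersection(g[v])) / total if total else 0.0


def adamic_adar_index(g, u, v):
    """Sum over shared neighbours of 1 / log(degree of that neighbour)."""
    return sum(
        1.0 / math.log(len(g[w]))
        for w in g[u].intersection(g[v])
        if len(g[w]) > 1  # log(1) = 0 would divide by zero
    )


print(jaccard_index(graph, "B", "C"))              # 0.5
print(round(dice_index(graph, "B", "C"), 3))       # 0.667
print(round(adamic_adar_index(graph, "A", "D"), 3))
```
      </preformat>
      <p>Evaluating such a function for every pair of vertices yields the vertex similarity matrix of a graph. Adamic-Adar differs from the other two in that it is not bounded by 1 and weights rarely connected shared neighbours more strongly.</p>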
      <sec id="sec-3-1">
        <title>Nearest Neighbour</title>
        <p>
          The nearest neighbour approach [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ] can provide a graph-based recommendation
for enterprise architects in order to avoid modelling already existing elements.
Early work in this domain was attempted by Aier and Schönherr [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. Thus, finding
communities in a network helps to investigate the roles in an EA model, where
components in a single cluster have similar kinds of roles compared to those in other
clusters [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ].
        </p>
        <p>Figure 2: (a) Application view of Model 1; (b) Application view of Model 2.</p>
        <p>
          We present a technique based on an unsupervised learning method to determine
the communities [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. Considering an EA model as a graph [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ], we apply different
community detection algorithms and analyse the results [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ]. The community
detection method tries to group a set of vertices having a higher probability of
being interconnected than being connected to the members of other groups. In
this way, it is possible to investigate similar kinds of grouped components.
        </p>
        <p>
          Here, similarity refers to the components having a common subset of
neighbours. Identifying communities in an EA model provides decision support for
enterprise architects to place new components into the model repository based
on the identified roles and positions of the components. When the model is
large and complex, the community detection technique remains
robust regarding execution speed and outperforms the SimRank approach, which is
discussed by Borozanov et al. [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ].
        </p>
        <p>
          Once a community detection algorithm is implemented and the network is
partitioned into communities, it is important to interpret the results to know
which algorithm performs best and detects meaningful communities. Since no
gold standard for the community assignment is known, an effective
approach to evaluate the formed communities is to use internal criteria such as
"modularity" [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ].
        </p>
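        <p>To make the internal criterion concrete, the following sketch computes the modularity of a given partition for an undirected, unweighted graph, using the common formulation that sums, per community, the fraction of edges inside the community minus the squared fraction of edge ends attached to it. The function name, the toy graph, and the two partitions are hypothetical and serve only as an illustration.</p>
        <preformat>
```python
def modularity(edges, membership):
    """Newman modularity of a partition of an undirected, unweighted graph.

    edges: list of (u, v) pairs; membership: dict node -> community label.
    """
    m = len(edges)
    if m == 0:
        return 0.0
    inside = {}   # number of edges with both ends in the community
    degree = {}   # sum of node degrees per community
    for u, v in edges:
        cu, cv = membership[u], membership[v]
        degree[cu] = degree.get(cu, 0) + 1
        degree[cv] = degree.get(cv, 0) + 1
        if cu == cv:
            inside[cu] = inside.get(cu, 0) + 1
    return sum(
        inside.get(c, 0) / m - (degree[c] / (2 * m)) ** 2
        for c in degree
    )


# Hypothetical toy graph: two triangles joined by one bridge edge.
edges = [(1, 2), (2, 3), (1, 3), (4, 5), (5, 6), (4, 6), (3, 4)]
split = {1: "a", 2: "a", 3: "a", 4: "b", 5: "b", 6: "b"}
merged = {n: "all" for n in split}
print(round(modularity(edges, split), 3))  # 0.357
print(modularity(edges, merged))           # 0.0
```
        </preformat>
        <p>Placing all nodes in one community gives a score of 0, whereas separating the two triangles gives a positive score, which is why a higher modularity is read as a better partition.</p>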
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Results</title>
      <p>In this section, we outline the evaluation results to select the best performing
similarity method and the best performing community detection algorithm that
provides automatic view generation of EA models.
</p>
      <sec id="sec-4-1">
        <title>Similarity Between Enterprise Architecture Models</title>
      </sec>
      <sec id="sec-4-2">
        <title>Node Similarity Evaluation</title>
        <p>Syntactic Similarity. Concerning our example from section 4.1, the Cosine and
Jaccard similarity measures are applied to assess the syntactic similarity between
the component names of the EA models. The comparison results for our case show
that cosine similarity (0.700) yields a score closer to
1 than the Jaccard index (0.537). Thus, we conclude that cosine similarity performs
better than the Jaccard index in capturing the similarity.</p>
        <p>Latent Semantic Analysis. Knowing that the models from section 4.1 convey
similar meaning (and from the results presented above), we conclude that
LSA yields the highest score (0.832), the one closest to 1. Thus, LSA outperforms
the Cosine and Jaccard similarity measures in capturing relatedness between EA
models. Therefore, any one of the models can be added to the EA repository.
The LSA technique is thus more efficient in finding similarity between EA models
than simple cosine similarity (0.700), based on the type of content present
in the component names. For a certain threshold (similarity score) closer to
1, enterprise architects can decide upon adding a new model into the repository.</p>
        <p>Structural similarity evaluation. In order to assess the structural similarity
between two graphs, we first calculate the similarity between pairs of
vertices by constructing similarity matrices for the graphs using the Jaccard, Dice, and
Adamic-Adar indexes.</p>
        <p>The association between the two similarity matrices is then computed
with a correlation test to determine whether the two graphs are structurally
similar. The correlation coefficient can range in value from -1 to +1.
The larger the absolute value of the coefficient, the stronger the relationship
between the variables. We used the Pearson correlation coefficient to compute
the association between two similarity matrices.</p>
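        <p>The matrix comparison can be sketched as below. This is a minimal sketch that assumes both similarity matrices have the same shape and node order and that all entries, including the diagonal, enter the correlation; these choices, the function names, and the two small matrices are assumptions for illustration and are not taken from the evaluation in this paper.</p>
        <preformat>
```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant matrix")
    return cov / math.sqrt(var_x * var_y)


def matrix_correlation(sim_a, sim_b):
    """Correlate two similarity matrices entry by entry.

    Assumes both matrices have the same shape and the same node order.
    """
    flat_a = [value for row in sim_a for value in row]
    flat_b = [value for row in sim_b for value in row]
    return pearson(flat_a, flat_b)


# Hypothetical 3x3 vertex-similarity matrices for two models.
sim_model_1 = [[1.0, 0.5, 0.0], [0.5, 1.0, 0.25], [0.0, 0.25, 1.0]]
sim_model_2 = [[1.0, 0.4, 0.1], [0.4, 1.0, 0.3], [0.1, 0.3, 1.0]]
print(round(matrix_correlation(sim_model_1, sim_model_2), 3))
```
        </preformat>
        <p>A value close to +1 indicates that pairs of vertices that are similar in one model tend to be similar in the other as well.</p>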
        <p>Our results show that the best association was obtained using the Jaccard
similarity coefficient (0.968), whose correlation score is closer to 1 than that of the Dice
similarity coefficient (0.956) and the Adamic-Adar similarity coefficient (0.853).
Thus, we choose the Jaccard similarity index as the best-performing metric for
finding the structural similarity.</p>
      </sec>
      <sec id="sec-4-3">
        <title>Comparison Between Community Detection Algorithms</title>
        <p>In this section, we discuss the approach to find the best-performing community
detection algorithm. Since we do not know to which communities the components
should belong, we use the modularity score (an internal measure) as a general criterion to find
the best-performing algorithm. We consider two different cases in determining
the communities of similar kinds of EA components. Thus, for each case, we
evaluate the performance of different community detection algorithms.</p>
        <p>In the following, we consider that the graph does not contain isolated nodes.
Table 1 summarises the resulting community size, modularity score, execution
time, and ranking of different community detection algorithms.</p>
        <p>As shown in Table 1, spinglass takes the top position. The higher the
modularity score, the better the network partition. Leading eigenvector and walktrap
return almost equal results, with a modularity score of 0.75. In this situation,
the running time, t, is considered as a tie-break criterion. Since walktrap computes
faster than leading eigenvector, the user can choose between the
algorithms based on their running time. The edge-betweenness
algorithm performed worst in this case, with the lowest modularity score of about
0.579. A lower modularity score indicates a larger number of nodes
in a single community, which makes it harder to understand the distribution
of communities formed within a network.</p>
        <p>Table 1. Community detection algorithms compared (columns: Algorithm, Size, Modularity): Edge-betweenness, Infomap, Label propagation, Leading eigenvector, Spinglass, Walktrap.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>With the goal of supporting EA model maintenance, this paper presented a
performance-based comparison between different graph similarity and
community detection algorithms as candidates for providing a better user experience
and suitability in a working environment. The presented approaches enable
enterprise architects to track changes in EA models and to avoid polluting the
EA repository with duplicate components.</p>
      <p>One limitation of the similarity methods is that the computation time is linear
in the product of the graph sizes, due to the size of the similarity matrix. A
further challenge is to determine the similarity between networks and to define
a measure of similarity. For instance, it would be difficult for a domain expert to
choose a threshold on the similarity measures to decide whether two models are
similar. Another limitation is that the igraph community detection algorithms
do not handle overlapping communities.</p>
      <p>
        Moreover, abstracting EA models as graphs leads to a loss of information, as
it means a focus on the structural facet of the EA model. However, enterprise
architects also encode information by visually grouping elements close to each other.
This information gets lost in our transformation into a graph. Lastly, we
focused on finding similarities based on the component names. As future work,
additional similarity approaches can be used to find similarity between EA
models based on the description information of an EA component. Furthermore,
applying other graph similarity measures may be useful to identify duplicate
components in EA models (e.g., [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ]).
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Adamic</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Adar</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          :
          <article-title>Friends and neighbors on the web</article-title>
          .
          <source>Social Networks</source>
          <volume>25</volume>
          (
          <issue>3</issue>
          ),
          <fpage>211</fpage>
          –
          <lpage>230</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Aier</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schönherr</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Integrating an enterprise architecture using domain clustering (</article-title>
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Alhadi</surname>
            ,
            <given-names>A.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deraman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yussof</surname>
            ,
            <given-names>W.N.J.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mohamed</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          , et al.:
          <article-title>An ensemble similarity model for short text retrieval</article-title>
          .
          <source>In: International Conference on Computational Science and Its Applications</source>
          . pp.
          <fpage>20</fpage>
          –
          <lpage>29</lpage>
          . Springer (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Borozanov</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hacks</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          :
          <article-title>Using machine learning techniques for evaluating the similarity of enterprise architecture models - technical paper</article-title>
          .
          <source>In: Advanced Information Systems Engineering - 31st International Conference, CAiSE</source>
          . pp.
          <fpage>563</fpage>
          –
          <lpage>578</lpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Csardi</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nepusz</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>The igraph software package for complex network research</article-title>
          .
          <source>InterJournal, Complex Systems</source>
          <volume>1695</volume>
          (
          <issue>5</issue>
          ),
          <fpage>1</fpage>
          –
          <lpage>9</lpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Dijkman</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Van Dongen</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , Kaarik, R.,
          <string-name>
            <surname>Mendling</surname>
          </string-name>
          , J.:
          <article-title>Similarity of business process models: Metrics and evaluation</article-title>
          .
          <source>Information Systems</source>
          <volume>36</volume>
          (
          <issue>2</issue>
          ),
          <fpage>498</fpage>
          –
          <lpage>516</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Dreyfus</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Iyer</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Enterprise architecture: A social network perspective</article-title>
          .
          <source>In: HICSS'06. Proceedings of the 39th Annual Hawaii International Conference on System Sciences</source>
          ,
          <year>2006</year>
          . vol.
          <volume>8</volume>
          .
          <publisher-name>IEEE</publisher-name>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Farwick</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Agreiter</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Breu</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ryll</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Voges</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hanschke</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Automation processes for enterprise architecture management</article-title>
          .
          <source>In: 2011 IEEE 15th International Enterprise Distributed Object Computing Conference Workshops</source>
          . pp.
          <fpage>340</fpage>
          –
          <lpage>349</lpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Farwick</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Agreiter</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Breu</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ryll</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Voges</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hanschke</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Requirements for automated enterprise architecture model maintenance</article-title>
          .
          <source>In: 13th International Conference on Enterprise Information Systems (ICEIS)</source>
          , Beijing (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Farwick</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pasquazzo</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Breu</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schweda</surname>
            ,
            <given-names>C.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Voges</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hanschke</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>A meta-model for automated enterprise architecture model maintenance</article-title>
          .
          <source>In: 2012 IEEE 16th International Enterprise Distributed Object Computing Conference</source>
          . pp.
          <fpage>1</fpage>
          –
          <lpage>10</lpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Fortunato</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Community detection in graphs</article-title>
          .
          <source>Physics reports</source>
          <volume>486</volume>
          (
          <issue>3-5</issue>
          ),
          <fpage>75</fpage>
          –
          <lpage>174</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Ganser</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lichter</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roth</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rumpe</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Staged model evolution and proactive quality guidance for model libraries</article-title>
          .
          <source>Software Quality Journal</source>
          <volume>24</volume>
          (
          <issue>3</issue>
          ),
          <fpage>675</fpage>
          –
          <lpage>708</lpage>
          (Sep
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Jain</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jain</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chauhan</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thakur</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          :
          <article-title>Information retrieval using cosine and Jaccard similarity measures in vector space model</article-title>
          .
          <source>International Journal of Computer Applications</source>
          <volume>164</volume>
          (
          <issue>6</issue>
          ) (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Jonkers</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lankhorst</surname>
            ,
            <given-names>M.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>ter Doest</surname>
            ,
            <given-names>H.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Arbab</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bosma</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wieringa</surname>
            ,
            <given-names>R.J.</given-names>
          </string-name>
          :
          <article-title>Enterprise architecture: Management tool and blueprint for the organisation</article-title>
          .
          <source>Information Systems Frontiers</source>
          <volume>8</volume>
          (
          <issue>2</issue>
          ),
          <fpage>63</fpage>
          –
          <lpage>66</lpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Kaisler</surname>
            ,
            <given-names>S.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Armour</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valivullah</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Enterprise architecting: Critical problems</article-title>
          .
          <source>In: Proceedings of the 38th Annual Hawaii International Conference on System Sciences</source>
          . pp.
          <fpage>224b</fpage>
          –
          <lpage>224b</lpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Koutra</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parikh</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramdas</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xiang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Algorithms for graph similarity and subgraph matching</article-title>
          .
          <source>In: Proc. Ecol. Inference Conf</source>
          .
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Landauer</surname>
            ,
            <given-names>T.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Foltz</surname>
            ,
            <given-names>P.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laham</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>An introduction to latent semantic analysis</article-title>
          .
          <source>Discourse Processes</source>
          <volume>25</volume>
          (
          <issue>2-3</issue>
          ),
          <fpage>259</fpage>
          –
          <lpage>284</lpage>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Liben-Nowell</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kleinberg</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>The link-prediction problem for social networks</article-title>
          .
          <source>Journal of the Association for Information Science and Technology</source>
          <volume>58</volume>
          (
          <issue>7</issue>
          ),
          <fpage>1019</fpage>
          –
          <lpage>1031</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Newman</surname>
            ,
            <given-names>M.E.</given-names>
          </string-name>
          :
          <article-title>Modularity and community structure in networks</article-title>
          .
          <source>Proceedings of the National Academy of Sciences</source>
          <volume>103</volume>
          (
          <issue>23</issue>
          ),
          <fpage>8577</fpage>
          –
          <lpage>8582</lpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Niwattanakul</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singthongchai</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Naenudorn</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wanapu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Using of Jaccard coefficient for keywords similarity</article-title>
          .
          <source>In: Proceedings of the International MultiConference of Engineers and Computer Scientists</source>
          . vol.
          <volume>1</volume>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>O'Neill</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Extreme connectivity: EAs need enterprise graphs</article-title>
          .
          <uri>https://www.infoworld.com/article/3045210/enterprise-architecture/extreme-connectivity-eas-need-enterprise-graphs.html/</uri>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Orman</surname>
            ,
            <given-names>G.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Labatut</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>A comparison of community detection algorithms on artificial networks</article-title>
          .
          <source>In: International Conference on Discovery Science</source>
          . pp.
          <fpage>242</fpage>
          –
          <lpage>256</lpage>
          . Springer (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Orman</surname>
            ,
            <given-names>G.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Labatut</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cherifi</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>Qualitative comparison of community detection algorithms</article-title>
          .
          <source>In: International conference on digital information and communication technology and its applications</source>
          . pp.
          <fpage>265</fpage>
          –
          <lpage>279</lpage>
          . Springer (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Orman</surname>
            ,
            <given-names>G.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Labatut</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cherifi</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>Comparative evaluation of community detection algorithms: a topological approach</article-title>
          .
          <source>Journal of Statistical Mechanics: Theory and Experiment</source>
          <volume>2012</volume>
          (
          <issue>08</issue>
          ) (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Porter</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Onnela</surname>
            ,
            <given-names>J.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mucha</surname>
            ,
            <given-names>P.J.</given-names>
          </string-name>
          :
          <article-title>Communities in networks</article-title>
          .
          <source>Notices of the AMS</source>
          <volume>56</volume>
          (
          <issue>9</issue>
          ),
          <fpage>1082</fpage>
          –
          <lpage>1097</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Saha</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ghrera</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Nearest neighbor search in complex network for community detection</article-title>
          .
          <source>arXiv preprint arXiv:1511.07210</source>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Santana</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Souza</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Simon</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fischbach</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Moura</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>Network science applied to enterprise architecture analysis: Towards the foundational concepts</article-title>
          .
          <source>In: 2017 IEEE 21st International Enterprise Distributed Object Computing Conference (EDOC)</source>
          . pp.
          <fpage>10</fpage>
          –
          <lpage>19</lpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Sapp</surname>
            ,
            <given-names>C.E.</given-names>
          </string-name>
          :
          <article-title>Preparing and Architecting for Machine Learning</article-title>
          .
          <uri>https://www.gartner.com/binaries/content/assets/events/keywords/catalyst/catus8/preparing_and_architecting_for_machine_learning.pdf/</uri>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Simon</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fischbach</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>IT landscape management using network analysis</article-title>
          .
          <source>In: Enterprise Information Systems of the Future</source>
          , pp.
          <fpage>18</fpage>
          –
          <lpage>34</lpage>
          . Springer (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Sørensen</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>A method of establishing groups of equal amplitude in plant sociology based on similarity of species and its application to analyses of the vegetation on Danish commons</article-title>
          .
          <source>Biol. Skr.</source>
          <volume>5</volume>
          ,
          <fpage>1</fpage>
          –
          <lpage>34</lpage>
          (
          <year>1948</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Yan</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>P.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Han</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Substructure similarity search in graph databases</article-title>
          .
          <source>In: Proceedings of the 2005 ACM SIGMOD international conference on Management of data</source>
          . pp.
          <fpage>766</fpage>
          –
          <lpage>777</lpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Algesheimer</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tessone</surname>
            ,
            <given-names>C.J.</given-names>
          </string-name>
          :
          <article-title>A comparative analysis of community detection algorithms on artificial networks</article-title>
          .
          <source>Scientific Reports</source>
          <volume>6</volume>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>