<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Probabilistic Metamodel Merging</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Robert</forename><surname>Lagerström</surname></persName>
							<email>robertl@ics.kth.se</email>
							<affiliation key="aff0">
								<orgName type="department">Industrial information and control systems</orgName>
								<orgName type="institution">The Royal Institute of Technology</orgName>
								<address>
									<addrLine>Osquldas v.12</addrLine>
									<postCode>10044</postCode>
									<settlement>Stockholm</settlement>
									<country key="SE">Sweden</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Moustafa</forename><surname>Chenine</surname></persName>
							<email>moustafac@ics.kth.se</email>
							<affiliation key="aff0">
								<orgName type="department">Industrial information and control systems</orgName>
								<orgName type="institution">The Royal Institute of Technology</orgName>
								<address>
									<addrLine>Osquldas v.12</addrLine>
									<postCode>10044</postCode>
									<settlement>Stockholm</settlement>
									<country key="SE">Sweden</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Pontus</forename><surname>Johnson</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Industrial information and control systems</orgName>
								<orgName type="institution">The Royal Institute of Technology</orgName>
								<address>
									<addrLine>Osquldas v.12</addrLine>
									<postCode>10044</postCode>
									<settlement>Stockholm</settlement>
									<country key="SE">Sweden</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ulrik</forename><surname>Franke</surname></persName>
							<email>ulrikf@ics.kth.se</email>
							<affiliation key="aff0">
								<orgName type="department">Industrial information and control systems</orgName>
								<orgName type="institution">The Royal Institute of Technology</orgName>
								<address>
									<addrLine>Osquldas v.12</addrLine>
									<postCode>10044</postCode>
									<settlement>Stockholm</settlement>
									<country key="SE">Sweden</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Probabilistic Metamodel Merging</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">032E12D7072FB5197AEFA3F9EE1B9007</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T23:15+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>Metamodel merging, Bayesian networks</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper proposes the use Bayesian networks for the automatic merging of metamodels. The proposed Bayesian networks calculate the probability that a merge of two metamodel elements is suitable, thus suggesting what to merge.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>In recent years, both researchers and practitioners have discovered the emerging possibilities of using metamodels and ontologies. As a consequence, a large quantity of metamodels and ontologies now exists within all sorts of different applications. The distributed environment of metamodel and ontology development has led to large overlaps within and between metamodels. Since these diverse metamodels often describe similar aspects of systems, developers and users would gain large benefits if metamodels could easily be merged and aligned with each other. Therefore, integration and merging of metamodels and ontologies has received an increasing interest lately <ref type="bibr">[1][2]</ref>.</p><p>This paper proposes an approach to metamodel merging where a probabilistic inference engine is employed to evaluate candidate metamodel concepts suitable for merging. The proposed approach uses Bayesian networks to assess the probability that a merge of two elements is suitable. The Bayesian network evaluates a merge based on various syntactic and semantic characteristics of the candidate concepts, such as the similarity of names and associations to other concepts.</p><p>The concepts described in this paper all pertain to a certain type of metamodels called abstract models. Abstract models have previously been proposed as a notation for describing and analyzing enterprise systems <ref type="bibr" target="#b2">[3]</ref>. The models represent the architectures of these systems as well as other expert knowledge and empirical observations that can be instantiated and used for analysis. An abstract model contains classes and class associations, augmented with attributes and attribute associations. A UML description of abstract models can be seen in Fig. <ref type="figure" target="#fig_0">1</ref>. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Merging Metamodels using Bayesian Networks</head><p>A Bayesian network, B=(G, P), is a representation of a joint probability distribution, where G=(V, E) is a directed acyclic graph consisting of vertices, V, and edges, E. The vertices denote a domain of random variables X 1 ,…, X n , also called chance nodes. Each chance node, X i , may take on a value x i from the finite domain Val(X i ). The edges denote causal dependencies between the nodes, i.e. how the nodes relate to each other. The second component, P, of the network B, describes a conditional probability distribution for each chance node, P(X i ), given its parents Pa(X i ) in G. More comprehensive treatment on Bayesian networks can be found in e.g. Jensen <ref type="bibr" target="#b3">[4]</ref>.</p><p>When merging abstract models, there are two concerns. Firstly, do any of the classes in the source models represent the same concepts? Secondly, when two classes have been merged, do any of the attributes in the merged class represent the same concepts? If these two concerns are correctly handled, then all associations separately holding in the source models will also be correctly transferred into the target model. Therefore, two Bayesian networks were developed; one describing class merges and one describing attribute merges, c.f. Fig. <ref type="figure" target="#fig_1">2</ref>.</p><p>The nodes in the class merge network have the following scales: Class Merge = {Yes, No}, Class Similarity Association = {Yes, No}, Class Names = {Identical, Similar, Dissimilar}, Class References = {SamePublication, SameAuthor, DifferentAuthors}, Class Attributes = {All, Some, None}, and Class Associations = {All, Some, None}.</p><p>The nodes in the attribute merge network have the following scales: Attribute Merge = {Yes, No}, Attribute Similarity Association = {Yes, No}, Attribute Names = {Identical, Similar, Dissimilar}, Attribute References = {SamePublication, SameAuthor, DifferentAuthors}, and Attribute Associations = {All, Some, None}. To illustrate the application of the Class Merge network c.f. the Bayesian network screenshot from GeNIe <ref type="bibr" target="#b4">[5]</ref> in Fig. <ref type="figure" target="#fig_2">3</ref>. Assume that a package, i.e. a set of abstract models, contains the abstract models presented in Fig. <ref type="figure" target="#fig_3">4</ref>. The method starts by comparing the pairs of classes from the models presented in Fig. <ref type="figure" target="#fig_3">4</ref>. Then, the classes with the highest probability are merged. In this example, the class pair Documentation and Document received the probability P = 100 %, and are therefore merged. The next step is to compare all pairs of attributes in the merged class Documentation. Then, the attributes with the highest probability are merged. In this example, the pair Readability and Readability received the probability P = 100 %, and are therefore merged.</p><p>It all starts over from the beginning by comparing all pairs of classes and iterates until no classes receive probabilities over a predefined merging threshold. The resulting model in this example is presented in Fig. <ref type="figure">5</ref>.</p><formula xml:id="formula_0">Aff ec ts [1] Affects [1][2] Influences [1][2]</formula><p>Inf lue nc es <ref type="bibr" target="#b1">[2]</ref> Fig. <ref type="figure">5</ref>. The resulting abstract model after employing the proposed Bayesian networks.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusions</head><p>This paper addresses the issue of metamodel merging, using the probabilistic framework of Bayesian networks. It was shown that Bayesian networks can be used to guide the merging of metamodels, by considering some key features of the classes and attributes at hand: basically their names, references, and associations. With this information it is possible to discern the probability that the concepts are sufficiently similar to be merged.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. UML description of abstract models.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. Bayesian networks representing class and attribute merge.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Fig. 3 .</head><label>3</label><figDesc>Fig. 3. The Bayesian network for class merges with example values.</figDesc><graphic coords="3,205.80,286.86,200.40,99.00" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Fig. 4 .</head><label>4</label><figDesc>Fig. 4. Abstract models to be tested for possible merges.</figDesc></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0">Proceedings of CAiSE'08 Forum</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">The PROMPT suite: interactive tools for ontology merging and mapping</title>
		<author>
			<persName><forename type="first">N</forename><surname>Noy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Musen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Internationl Journal of Human-Computer Studies</title>
		<imprint>
			<biblScope unit="volume">59</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="983" to="1024" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Learning to map between ontologies on the semantic web</title>
		<author>
			<persName><forename type="first">A</forename><surname>Doan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Madhaven</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Domingos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Halevy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">the proceedings of the 11 th International Conference on World Wide Web</title>
				<imprint>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">A Tool for Enterprise Architecture Analysis</title>
		<author>
			<persName><forename type="first">P</forename><surname>Johnson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Johansson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Sommestad</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ullberg</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 11 th IEEE International Enterprise Distributed Object Computing Conference (EDOC)</title>
				<meeting>the 11 th IEEE International Enterprise Distributed Object Computing Conference (EDOC)<address><addrLine>Annapolis USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2007">2007</date>
			<biblScope unit="volume">11</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Bayesian Networks and Decision Graphs</title>
		<author>
			<persName><forename type="first">F</forename><surname>Jensen</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2001">2001</date>
			<publisher>Springer-Verlag</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<ptr target="http://genie.sis.pitt.edu/about.html" />
		<title level="m">GeNIe: Decision System Laboratories, About GeNIe and SMILE</title>
				<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
		<respStmt>
			<orgName>University of Pittsburgh</orgName>
		</respStmt>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
