<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Similarity Measurement about Ontology-based Semantic Web Services</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Xia</forename><surname>Wang</surname></persName>
							<email>xia.wang@deri.org</email>
							<affiliation key="aff0">
								<orgName type="department">Digital Enterprise Research Institute IDA Business Park</orgName>
								<address>
									<settlement>Lower Dangan Galway</settlement>
									<country key="IE">Ireland</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Yihong</forename><surname>Ding</surname></persName>
							<email>ding@cs.byu.edu</email>
							<affiliation key="aff1">
								<orgName type="department">Dept. of Computer Science</orgName>
								<orgName type="institution">Brigham Young University Provo</orgName>
								<address>
									<settlement>Utah</settlement>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Yi</forename><surname>Zhao</surname></persName>
							<email>yi.zhao@fernuni-hagen.de</email>
							<affiliation key="aff2">
								<orgName type="department">Chair of Computer Engineering Fernuniversität Hagen</orgName>
								<address>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Similarity Measurement about Ontology-based Semantic Web Services</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">DAA0D3CDA3F27AF4159913447876B07A</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T06:47+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Measurement of semantic similarity between Web services is an important factor for Web service discovery, composition, and even execution. Semantic Web services (SWS) are usually specified based on ontologies. The measurement of semantic similarity between Web services thus can be reduced to computing semantic distances between ontologies. In this paper, we briefly surveyed three major existing ontology-distance-computation algorithms and enhanced them to measure the single and multiple ontolgies similarity in SWS context. Based on this survey, we summarized a new hybrid ontology-similarity-measurement methodology that measures similarity between Semantic Web Services.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Service similarity is crucial to service discovery, selection, composition, and even execution. Especially semantic service discovery aims to locate the best matched service, it mostly depends on the measurement of the similarity between an user's service requirements and the profiles of published services. Currently, the semantic Ontology languages for services, such as the OWL-S<ref type="foot" target="#foot_0">1</ref> and the WSMO<ref type="foot" target="#foot_1">2</ref> , are required to semantically represent service capabilities, including non-function information (including Qos of service), functional information (IOPE of services operations, denoting input, output, precondition and effect). The ser-vice discovery, therefore, focuses on the matchmaking of service capability <ref type="bibr" target="#b14">[15]</ref> and QoS, while less work is done on ontology-based services selection.</p><p>Moreover, Ontology receives great attention in the progressively emerging Semantic Web <ref type="bibr" target="#b1">[2]</ref> and Semantic Web Services <ref type="bibr" target="#b5">[6]</ref>, by formally defining the concepts and relationships in a machine understandable way and enabling knowledge sharing and reuse. As the elements of the representation of semantic services, the similarity of the ontologies used is crucial to service similarity, especially when considering the discovery and execution.</p><p>Ontology similarity which is related to Ontology mapping is a well known topic in information retrieval, database integration systems, and artificial intelligence fields. Also, there is a wealth of work on similarity measures of Ontology concepts and concept-related notions <ref type="bibr" target="#b18">[19]</ref>. However, the measurement of similarity of ontologies and concepts itself is not easy, additionally many specific features (see section 2.2) in Semantic Web Service description environment.</p><p>After surveying the previous Ontology similarity measures and their application situations, in our SWS context two methods are combined to adapt to calculate the semantic distance of single formal Ontology concepts, e.g. code and zip in the examples of figure <ref type="figure" target="#fig_0">1</ref>. The approaches are a fuzzy-weighted associative network (edge-based measure) and an information-theoretical approach (contentbased measure).</p><p>Also regarding the compound ontology concepts in semantic service context, for example, in figure <ref type="figure" target="#fig_0">1</ref> the concepts findZipCodeDistance and CalcDistTwoZipsKm, are not the formal single terms as the ones in WordNet 4 , in this case the traditional method (e.g. edit distance of strings) is not useful to measure their semantic similarity. Therefore, we refine the hierarchical clustering algorithm to calculate the distance of two compound concept terms, similar to <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b3">4]</ref>.</p><p>In this paper we aim to solve the ontology similarity in a semantic service environment. First, we differentiate two cases of service ontology concepts: single and compound ontology concept to measure their similarity in service context. Then, a hybrid Ontology-similarity measurement is proposed by combining and refining three existing methods. Finally, we define our ontology similarity-based model as sim S = Σsim O ∈ [0, 1] to improve the service selection; This model fuzzily and quantitatively measures the service similarity basing on service ontology similarity.</p><p>This paper is structured as follows. In Section 2 we state the occurring problems of Ontology in Semantic Web Service description context, and investigate the specific name features of ontology concept. A Ontology concept distance definition and three refined ontology similarity algorithms are discussed with examples of single formal term and compound term in Section 3. The service similarity measurement is defined in Section 4. In Section 5 and 6 we respectively discuss related works, and give conclusion and indications for future work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Ontology in SWS Description Context</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Problem Statements</head><p>In order to illustrate the challenge of measuring the similarity of semantic Web services, we extract a set of zip code related services from the dataset of OWL-S annotated Web Services of the University College Dublin 3 . In Figure <ref type="figure" target="#fig_0">1</ref>., there are snatch description of four services, which are used for looking up a zip code or calculating the distance between two places according to the given zip codes. The information shown is retrieved from the wsdl documents of the respective service.</p><p>Current service matchmaking algorithms normally focus on measuring the syntactic (as service name, service text description and so on) and semantic (as service capabilities) of service. Taking sws4 and sws5 of Figure <ref type="figure" target="#fig_0">1</ref> as examples, if we assume that zip and code have the similar meaning, intuitively, by comparing service name and operation name, service sws4 and sws5 are regarded as similar from the signature level; and by matching their operation, as both opera-tion2, they also have similar inputs and outputs, so that sws4 and sws5 are concluded as similar services. This means that 4 WordNet, an online lexical reference system, http : //wordnet.princeton.edu/ 3 The Semantic Web Services Repository at the Smart Media Institute in University College Dublin, http : //moguntia.ucd.ie/repository/. Further, if we assume that a machine can understand some similarity between {zip, ZipCode, Zip Code 1, code, code1} and {CaleDisT woZipsKm, f indZipCode Distance}, then intuitively and naively from the above example services of Figure <ref type="figure" target="#fig_0">1</ref>, we know that sws1:operation1 is similar to sws5:operation1; sws2:operation1 is similar to sws3:operation2 and sws4:operatio2; and sws3:operation1 is similar to sws4:operation1.</p><p>Obviously, similarity, whether syntactic or semantic, the matching of ontology concepts used in service description is a critical challenge. If a machine can not understand the meaning of service concepts, it also cannot infer the imply relationships, then the automatic matching and discovery of services is impossible.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Naming Conventions for Ontologies</head><p>Intuitively and in ontology-related work, when ontology terminology is mentioned, it mostly means the terms in thesauri, e.g., Wordnet. On the other hand, the ontology terms defined in applications is very different from the formal words. Generally, the ontology concept used in service semantic descriptions are most compound terms, which are named depending on service developers by their ontology knowledge, experience and wonted. The situation is made worse by the following practices (parts of examples from Fig. <ref type="figure" target="#fig_0">1</ref>.):</p><p>Abbreviations Names are not given in their correct forms, but shortened, e.g. CalcDistTwoZipsKm;</p><p>Associated words with capitalization or delimiters Words have the form of associations of several words parts (full word or abbreviation) with delimiters, normally a part's first letter capitalized, and sometimes also using underscore, dash or space, e.g., LogIn, AcctName, ArrivalAirport In.</p><p>Words with suffix and prefix Examples are hasFlavour, locatedIn.</p><p>Variations or misspelling Names may be variations of word often due to grammatical flexion, e.g., Booking, madeFromGrape; And defined words are in misspelling format for machine.</p><p>Free inventions Any other cases the traditional similarity measures (based, e.g., on WordNet) are prevented to work.</p><p>Considering the above compound concept terms, the existing ontology measure algorithms can not work. Moreover, the data clustering algorithm from data mining field can be borrowed to apply to this case. This paper enhances the clustering algorithm in <ref type="bibr" target="#b3">[4]</ref> to measure the semantic closeness of composed terms.</p><p>3 Ontology Similarity</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Ontology Concept Distance</head><p>To semantically measure Ontology concept distance, we should consider both concept structure and concept content. Fortunately, both of these information are prolifically provided by service description. Here, we define the semantic distance dis of the assumed concepts C and D (which could be single formal term or compound term) as:</p><formula xml:id="formula_0">dis = w 1 * Dis s + w 2 * Dis i + w 3 * Dis c , 3 i=1 w i = 1 (1)</formula><p>where Dis s is the distance basing on the structure of concept in service Ontology, the Dis i basing on the common contents shared by concepts, and the Dis c is only used to measure the compound concept terms by clustering concepts, basing on the concept elements co-occurrence. Formulae 1 not only considers the different concept naming features, but also make up the loss of any single approach, because the service description context is just a structure and a short piece of text, not a corpus or thesaurus.</p><p>In the following sections, we will present the detail explanation of every distance measurement.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Fuzzy-weighted Associative Network</head><p>Concepts in a hierarchical taxonomy are all related by certain relationships, based on which concepts can be represented in an associative network consisting of nodes and edges, where nodes denote concepts, edge denotes the binary relationship of the two linked concepts. Also for service description Ontology, such associative network with fuzzy-weighted value on each link can be constructed, in which the similarity of concepts can be measured by the shortest distance as <ref type="bibr" target="#b4">[5]</ref> and <ref type="bibr" target="#b16">[17]</ref>, which is defined as Dis s in our context.</p><p>As the detailed explanation by <ref type="bibr" target="#b4">[5]</ref> and correspondence to OWL-Lite, we define four concept relations as generalization (e.g., superclass), specification (e.g., subclass), negative association (e.g., disjoined) and positive association (e.g., equivalent).</p><p>Therefore, the distances of arbitrary two nodes in the network can be calculated based on Tables 1-3 <ref type="bibr" target="#b16">[17]</ref>. In Table <ref type="table">.</ref>1 s, g, p and n represent explicit relationships, that is, each two notes relationship can be evaluated basing on triangular norms. τ in Table <ref type="table">2</ref>. are the triangular norms (t-norms), which is defined in Table3, where α or β are fuzzy-weighted strength values of relations (0 ≤ α, β ≤ 1), n is the degree of dependence (−∞ ≤ n ≤ ∞) between the relationships, details please refer to <ref type="bibr" target="#b8">[9]</ref>. In the tables those fields are marked with X for which there is no definition. Therefore, the relationship of two arbitrary concepts can easily be inferred by traveling through the associative network.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>g s p n g g p p n s p s p n p p p p n n n n n X</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 1. Kind of paths</head><formula xml:id="formula_1">g s p n g τ 3 τ 1 τ 2 τ 2 s τ 1 τ 3 τ 2 τ 2 p τ 2 τ 2 τ 3 τ 3 n τ 2 τ 2 τ 3 X Table 2. Strength of paths τ 1 (α, β) = max(0, α + β − 1) n = −1 τ 2 (α, β) = αβ n = 0 τ 3 (α, β) = min(α, β) n = ∞</formula><p>Table <ref type="table">3</ref>. T-norms function</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3">Information-theoretical Approach</head><p>The definition of similarity between the concepts C and D relates to the concepts' commonality and difference <ref type="bibr" target="#b10">[11]</ref>:</p><formula xml:id="formula_2">sim(C, D) = I(common(C, D)) I(description(C, D)) = log P (common(C, D)) log P (description(C, D)) (2)</formula><p>where common(C, D) is a proposition that states the commonalities between C and D, I(common(C, D)) is the amount of information contained in this proposition and, similarly, I(description(C, D)) is a proposition describing what C and D are. In our service context, we refine the similarity expression as follows to calculate the distance Dis i :</p><formula xml:id="formula_3">Dis i = |C ∩ D| |C ∩ D| + γ|D/C| + δ|C/D| , γ, δ ∈ [0, 1]<label>(3)</label></formula><p>where C and D are two Ontology concept classes of OWL-Lite, |C ∩ D| is the number of common elements of C and D, e.g., the number of shared attributes, instances and relational classes, γ and δ are weight values defining the relative importance of their non-common characteristics.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4">Ontology Distance for Single Formal Term</head><p>Given two single-form Ontology concept terms from two differen Web service description, as t 1 and t 2 , which are respectively described by a set of other class terms as their properties, instances and relational members (e.g., "g,s,n,p"). There are two cases:</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure 2. Example of distance in Associative Network</head><p>• Two terms t 1 and t 2 are organized in one hierarchical structure, which is transformed to a fuzzy weighted associative network of Section 3.2.</p><p>Reconsidering the example in Fig. <ref type="figure" target="#fig_0">1</ref>, it assumes that in the Zip service application domain, terms have the relationships (which are all experimental data, not the real value) cp.Fig. <ref type="figure">2</ref>. For example, the distance of term State and Zip is examined, the shortest path is path = {State, P lace, Code, Zip} with State =⇒ g,0.9 P lace, P lace =⇒ p,0.9 Code and Code =⇒ s,0.9 Zip. So that it hold that τ 2 (τ 2 (0.9, 0.9)0.9) = 0.729), following Table <ref type="table">1</ref> • Terms, t 1 and t 2 , are concept classes, respectively conof a set of properties and instances as ontology vocabulary according to Section 3. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.5">Concept Clustering for Compound Term</head><p>Clustering is also a well known approach group data on the basis of a certain similarity criteria. We adapt this clustering mechanism here to group the compound Ontology concept terms, which are from different service description Ontologies, e.g. findZipCodeDistance and Cal-cDistTwoZipsKm, in order to calculate their similarity by the distance dis c .</p><p>In the clustering algorithm, the association rule of two terms t 1 and t 2 is defined as follows <ref type="bibr" target="#b3">[4]</ref>:</p><formula xml:id="formula_4">t 1 −→ t 2 (s, c)</formula><p>where, the support s is the probability s = P (t 1 ) = Tt 1 T that t 1 occurs in T , T is the cardinality of the ontology terms' domain, T t 1 is the cardinality of the set which contains t 1 , the confidence c is the occurrence probability of t 2 in the case that t 1 occurred, i.e., c = P (t</p><formula xml:id="formula_5">2 |t 1 ) = T t 1 ,t 2 T t 1</formula><p>, with T t 1 ,t 2 is the cardinality of the set containing both t 1 and t 2 . The distance of two terms is weighted by their conditional probability c. The center of a cluster is the term which has the highest occurrence probability of the cluster.</p><p>In detail, including the natural language term extraction the clustering algorithm is used by us as follows:</p><p>1. Read service description document .owl, move all OWL-Lite tags, extract names and parameters, and delete redundancies in the vocabularies. The result is a bag of unique words including composted concept terms, denoting T = {t 1 , t 2 , ...}.</p><p>2. Preprocess all composted terms in T as follows.</p><p>Suppose that t i ∈ T is a composite term, we split it up on the basis of its delimiters, such as capital letters, into several parts. Then, we deal with each part towards extracting the word stem by removing stop words, suffixes and prefixes, restituting abbreviations or correcting misspelling, deleting redundant vocabulary terms and so on, resulting in the set t i = {t i1 , t i2 ...}. Substituting t i by all t ij ∈ t i for all i, ultimately yields T .</p><p>3. Compute the values s and c for any two terms in T , store them into a table in descending order, cluster them on the basis of their confidence c ≥ τ c and support s ≥ τ s (τ s and τ c are thresholds either assigned or obtained experimentally), resulting in the set T = (X 1 , X 2 , ..., X k ) of k clusters.</p><p>Roughly speaking, X i , 1 ≤ i ≤ k is a cluster including those terms whose co-occurrence probabilities exceed the threshold τ c . In traditional agglomeration clustering algorithms, T is an intermediate result, while in our context we should improve it in order to find an optimal clustering for our computation. This is the rationale of the algorithm's further steps.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>4.</head><p>In each X i ⊆ T , remove the frequent and rare parameters to avoid the query expansion and over-fitting problems, which are discussed in the field of information retrieval <ref type="bibr" target="#b7">[8]</ref>.</p><p>5. Split and merge the clusters in T , in order to wipe off the noise terms and optimize clusters by agglomerating terms according to concentric circularities with different radii.</p><p>The inner circularity consists of those terms, which are, at least, close to half of the other terms. Similarly, the terms in the outer circularity are, at least, close to a quarter of the other ones. They are called them 1 2 radius <ref type="bibr" target="#b3">[4]</ref>. And wiping off the terms, which are not in any circularity.</p><p>For example, to merge two cluster X 1 and X 2 , when ∀i ∈</p><formula xml:id="formula_6">X 1 ∪ X 2 , j|j ∈ X 1 ∪ X 2 , i = j, i −→)j(c &gt; τ c1 ) ≥ 1 2 ( X 1 + X 2 −1)<label>(4)</label></formula><p>Now, when calculating the distance between two random composite terms, here we used c 1 and c 2 distinctively, first, preprocess them using step (2) to obtain c 1 = {c 11 , c 12 , ...} and c 2 = {c 21 , c 22 , ...}, and then measure their similarity dis c by the probability of pairs of two terms to occur in the same cluster. As measure the maximum, minimum or or mean may be employed. Here we take the maximum as the optimistic way, the formula is as follows,</p><formula xml:id="formula_7">Dis c = max(sim(t 1i , t 2j )|∀t 1i ∈ t 1 , t 2j ∈ t 2 ), if t 1i , t 2j ∈ X k 0,</formula><p>otherwise.</p><p>(</p><p>Obviously, such a formula implies as extreme case, that is, all of the sub-terms of c 1 and c 2 have been wiped off as the noise words, such case have no way to scale the distance of ontology concepts. This part of work is right what our experiment will analysis, to evaluate the frequency of its occurrence.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Related Work</head><p>Similarity of ontologies has widely been researched, e.g., in the fields of information retrieval, artificial intelligence, databases, and especially in data mining and web mining. Many similarity measures are applied, e.g., Bernstein et al. in <ref type="bibr" target="#b2">[3]</ref> use two ways to measure the semantic similarity of objects in an ontology, which are organized in a hierarchical ontology structure, viz., the edge-based <ref type="bibr" target="#b9">[10]</ref> (a shorter path from one node to the other) and the node-based <ref type="bibr" target="#b13">[14]</ref> (the notion of shared information content) approach. Actually, they present five different distance measures of ontologies, where ontology distance stands for the shortest path through a common ancestor in a directed acyclic graph. However, computational degree and weight of edge are not considered. The vector space approaches computing cosine or Euclidean distances of k-dimensional vectors <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b12">13]</ref> do not easily apply to nominal concepts, as it is difficult to represent them as vectors. The Full-text Retrieval Method (TF/IDF) is mostly used in information retrieval <ref type="bibr" target="#b0">[1]</ref> to compare documents, which are considered as bags of words. However, it is inadequate for structure concepts as semantic relations between them are ignored.</p><p>The work most closely related to ours are the studies on ontologies in the semantic web or in semantic web services, such as <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b3">4]</ref> and <ref type="bibr" target="#b11">[12]</ref>. While they consider to cluster the similar terms, and most recur to TF/IDF to measure concept similarity, we follow Dong's notion of name clustering agglomeration algorithms. Maedche et al. also propose an approach to cluster ontology-based data, using the hierarchical clustering algorithm to consider instances of concept similarity. Hau et al. elaborate a metric to measure the similarity of semantic services annotated with OWL ontologies. They mainly depend on the information-theoretic approach to match similar ontology instances. Doan et al. computes the common information content of ontologies to scale their similarity. We combine multiple approaches to adapt to SWS environments. Based on a study of definitions and features of ontologies expressed in OWL, and from a computational point of view, we calculate the distance of two ontologies.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Conclusions</head><p>Ontology similarity is unquestionable important for Semantic Web Service similarity when we consider the semantic service discovery, selection, composition, and even execution. This paper tries to propose a ontology similaritybased approach to measure service similarity and presents the primary work on it. The contributions of this paper are summarized as, 1) analysis the ontology similarity problem in semantic service context, and classify the ontology concept name features used by service description; 2) present a hybrid ontology concept distance method, and further to measure the service similarity.</p><p>As the complexity of ontology-based service similarity, under our model, there is still a lot left for our future work, including the set matching of the ontology-based concept and its type, also the detailed implementation and evaluation. However, fortunately the preliminary experiments show that this new methodology works well.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 .</head><label>1</label><figDesc>Figure 1. Snatch of Semantic Web Services Description</figDesc><graphic coords="2,308.87,72.18,244.20,110.70" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>3 .</head><label>3</label><figDesc>Assuming that their cardinality are |t 1 | = 9, |t 2 | = 6, and they share the number of elements |t 1 ∩ |t 2 | = 5, we obtain Dis i (t 1 , t 2 ) = 5 5+4+1 = 0.5, where γ, δ = 0.5.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head></head><label></label><figDesc>-3, that means State p,0.729 Zip, finally we get Dis s (t 1 , t 2 ) = 0.729.</figDesc><table /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">OWL-S, http : //www.w3.org/Submission/OW L − S/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">WSMO, http : //www.wsmo.org</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_2">Service SimilarityIn our previous work<ref type="bibr" target="#b15">[16]</ref>, a semantic service model for selection is proposed as s = (N F, F, Q, C). By this model, the service selection can happen by filtering single property as Non-functional (in this model, only the service name and service category and short service text description defined as non-function) or combined properties as Non-functional, functional (basing on logical subsumption computing) together with qualities of services. Obviously, either Nonfunction or function-based based selection, the ontology concept similarity is critical fact for service selection.Under this selection model, we define an ontology-based service similarity algorithm. Especially when the nonfunction properties are considered during service selection, because the non-functional related service selection is ontology based.The idea is to measure the service similarity by the similarity of service name, service operations name, which are defined as Ontology concepts. We do not compare the whole piece service Ontologies, for example sim SO :(SO i ) × (SO j ) → [0..1],where SO i is the service ontology for service s i ; We only consider how similar two single ontology concepts are in service ontology context, assim(c i , c j ) = {f (c i , c j ) | c i ∈ SO i ∧ c j ∈ SO j } and the function f (c i , c j ) = min k=1,...,j dis(c i , c k ).Therefore, our work is different from Ontology mapping.The proposed Ontology-based service selection basically measure by the service name concepts and operations similarity, called lexical semantic level. It is defined as sim Service = sim Concepts + sim oP eration , where sim Concept is the sum similarity of all the concepts of services, and sim oP eration is the sum similarity of the operation parameters with their data types.</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">R</forename><surname>Baeza-Yates</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Ribeiro-Neto</surname></persName>
		</author>
		<title level="m">Modern Information Retrieval</title>
				<imprint>
			<publisher>ACM Press</publisher>
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">The Semantic Web</title>
		<author>
			<persName><forename type="first">T</forename><surname>Berners-Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hendler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Lassila</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Scientific American</title>
		<imprint>
			<biblScope unit="volume">284</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="34" to="43" />
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">How Similar Is It? Towards Personalized Similarity Measures in Ontologies</title>
		<author>
			<persName><forename type="first">A</forename><surname>Bernstein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Kaufmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Buerki</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Klein</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Tagung Wirtschaftsinformatik</title>
				<imprint>
			<date type="published" when="2005-02">February 2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Simlarity Search for Web Services</title>
		<author>
			<persName><forename type="first">X</forename><surname>Dong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Alon</surname></persName>
		</author>
		<author>
			<persName><surname>Halevy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Madhavan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Nemes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Zhang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">VLDB</title>
		<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Semantic vs. structural resemblance of classes</title>
		<author>
			<persName><forename type="first">P</forename><surname>Fankhauser</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kracker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">J</forename><surname>Neuhold</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM SIGMOD RECORD</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="59" to="63" />
			<date type="published" when="1991">1991</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Ontology languages for the Semantic Web, Intelligent Systems</title>
		<author>
			<persName><forename type="first">A</forename><surname>Gomez-Perez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Corcho</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE</title>
		<imprint>
			<biblScope unit="page" from="54" to="60" />
			<date type="published" when="2002-01">Jan/Feb pp. 2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">A Semantic Similarity Measure for Semantic Web Services</title>
		<author>
			<persName><forename type="first">J</forename><surname>Hau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Darlington</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2005">2005</date>
			<publisher>Conference WWW2005</publisher>
			<pubPlace>Japan</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">Automatic keyword classification for information retrieval</title>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">S</forename><surname>Jones</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1971">1971</date>
			<publisher>Archon Books</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">Fuzzy Sets, Uncertainty and Information</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">L</forename><surname>Klir</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">A</forename><surname>Folger</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1988">1988</date>
			<publisher>Prentice Hall</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Information Retrieval Based on Conceptual Distance in IS-A Hierarchies</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">H</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">J</forename><surname>Lee</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J. of Documentation</title>
		<imprint>
			<biblScope unit="volume">49</biblScope>
			<biblScope unit="page" from="188" to="207" />
			<date type="published" when="1993">1993</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">An Information-Theoretic Definition of Similarity</title>
		<author>
			<persName><forename type="first">D</forename><surname>Lin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Fifteenth International Conference on Machine Learning</title>
				<imprint>
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Clustering Ontologybased Metadata in the Semantic Web</title>
		<author>
			<persName><forename type="first">A</forename><surname>Maedche</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Zacharias</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Pro. of the Joint Conferences ECML&apos;02 and PKDD&apos;02</title>
				<imprint>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">Machine Learning</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">M</forename><surname>Mitchell</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1997">1997</date>
			<publisher>McGraw-Hill</publisher>
			<pubPlace>New York</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language</title>
		<author>
			<persName><forename type="first">P</forename><surname>Resnik</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J. of Artificial Intelligence Research</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page" from="95" to="130" />
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<title level="m" type="main">Semantic Matching of Web Services Capabilities</title>
		<author>
			<persName><forename type="first">M</forename><surname>Paolucci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Kawmura</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Payne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Sycara</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2002">2002</date>
			<pubPlace>ISWC, Italy</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Selection Model of Semantic Web Service</title>
		<author>
			<persName><forename type="first">X</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Wolfgan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">7th International FLINS Conference on Applied Artificial Intelligence</title>
				<imprint>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">LARKS: Dynamic Matchmaking Among Heterogeneous Software Agents in Cyberspace</title>
		<author>
			<persName><forename type="first">K</forename><surname>Sycara</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Widoff</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Klusch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">G</forename><surname>Lu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Autonomous Agents and Multi-Agent Systems</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="173" to="203" />
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<author>
			<persName><forename type="first">I</forename><surname>Witten</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Frank</surname></persName>
		</author>
		<title level="m">Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations</title>
				<imprint>
			<publisher>Morgan Kaufmann</publisher>
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Comparing concepts in differentiated ontologies</title>
		<author>
			<persName><forename type="first">P</forename><surname>Weinstein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Birmingham</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of KAW-99</title>
				<meeting>of KAW-99</meeting>
		<imprint>
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
