<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Licenses Compatibility and Composition in the Web of Data</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Serena</forename><surname>Villata</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">INRIA Sophia Antipolis</orgName>
								<address>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Fabien</forename><surname>Gandon</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">INRIA Sophia Antipolis</orgName>
								<address>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Licenses Compatibility and Composition in the Web of Data</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">94B10A948CAE13AD0AAF64D0A7363DC0</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T07:05+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The absence of clarity concerning the licensing terms does not encourage the reuse of the data by the consumers, thus preventing further publication and consumption of datasets, at the expense of the growth of the Web of Data itself. In this paper, we propose a general framework to attach the licensing terms to the data queried on the Web of Data. In particular, our framework addresses the following issues: (i) the various license schemas are collected and aligned taking as reference the Creative Commons schema, (ii) the compatibility of the licensing terms concerning the data affected by the query is verified, and (iii) if compatible, the licenses are combined into a composite license. The framework returns the composite license as licensing term about the data resulting from the query.</p><p>The author acknowledges the support of the DataLift Project ANR-10-CORD-09.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Heath and Bizer <ref type="bibr" target="#b4">[5]</ref> underline that "the absence of clarity for data consumers about the terms under which they can reuse a particular dataset, and the absence of common guidelines for data licensing, are likely to hinder use and reuse of data". Therefore, all Linked Data on the Web should include explicit licensing terms, or waiver statements <ref type="bibr" target="#b7">[8]</ref>. Data consumers need to know the licenses on the data to avoid misusing and even illegal reuse of the data. Following these highlights, several machine-readable licensing terms have been proposed, leading to a huge amount of unrelated but comparable ways of expressing the licenses. The scenario we consider in this paper is visualized in Figure <ref type="figure" target="#fig_0">1</ref>. When consumers query the Web of Data, results from different datasets, and thus released under different licensing terms, are provided. Our objective is to build a composite license which combines the elements from the distinct licenses associated to the selected data. The research questions we answer in this paper are: <ref type="bibr" target="#b0">(1)</ref> How to verify the compatibility among the licensing terms associated to a query result?, and (2) How to compose, if compatible, the distinct licensing terms for creating a composite license?</p><p>We answer the research questions by adopting Semantic Web languages only, and reusing the Creative Commons (CC) licenses schema <ref type="bibr" target="#b0">[1]</ref>. In particular, we choose the CC vocabulary to define the anatomy of our licenses. Licenses are composed by models (cc:Permission, cc:Requirement, cc:Prohibition) which specify the permissions allowed by the license, the required behaviors and the prohibited use and reuse of the data. The models are composed by elements like ShareAlike, Attribution, and many others, which specify the precise permissions, requirements, and prohibitions associated to the license. We choose this vocabulary because it provides a general schema for licenses specification. We are aware that there are works which cannot be considered as creative works <ref type="bibr" target="#b7">[8,</ref><ref type="bibr" target="#b4">5]</ref>. For addressing this issue and covering a wider range of machine-readable license specifications, we align CC with the other schemas representing licenses.</p><p>The set of licenses is evaluated as compatible if a number of compatibility rules are satisfied. These rules specify the subsumption relations among the licenses' elements and evaluate the compatibility among Permissions, Requirements and Prohibitions elements. If the pre-requisite of licenses compatibility is satisfied, the licenses are composed. We propose rules to deal with the composition of the licenses whose elements subsume other elements, and of the constraints expressed by the licenses. We define some basic heuristics to address the composition. Using these heuristics, the Composite License Generation algorithm returns the composite license. This license will be the only license attached to the data returned by the query. The composite license is retuned together with the query result using the standard SPARQL query results XML format by means of the &lt;link&gt;<ref type="foot" target="#foot_0">1</ref> element.</p><p>In this paper, we do not introduce yet another licensing schema. We consider RDF machine-readable license specifications only, and we do not consider MPEG-21<ref type="foot" target="#foot_1">2</ref> . Moreover, we do not provide verification compliance about the reuse of the data by the consumer. We extend and adapt existing proposals for licenses compatibility and composition in the area of service license analysis <ref type="bibr" target="#b3">[4]</ref> to the Web of Data scenario.</p><p>The reminder of the paper is as follows: Section 2 compares the existing literature with the proposed approach. Section 3 formalizes the licenses structure and presents the alignment with the other vocabularies. Section 4 defines the compatibility rules and the evaluation algorithm, and Section 5 shows how the licenses are composed.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Related work</head><p>The Creative Commons Rights Expression Language (ccREL) <ref type="bibr" target="#b0">[1]</ref> is the standard recommended by Creative Commons for machine-readable expression of copyright licensing terms and related information. Miller et al. <ref type="bibr" target="#b7">[8]</ref> propose the Open Data Commons waivers and licenses<ref type="foot" target="#foot_2">3</ref> that try to eliminate or fully license any rights that cover databases and data. The Waiver vocabulary<ref type="foot" target="#foot_3">4</ref> defines properties to use when describing waivers of rights over data and content where a waiver is the voluntary relinquishment or surrender of some known right or privilege. Iannella <ref type="bibr" target="#b5">[6]</ref> presents the Open Digital Rights Language (ODRL), a language for the Digital Rights Management community for expressing rights information over content. The ODRL-S language<ref type="foot" target="#foot_4">5</ref> extends ODRL by implementing the clauses of service licensing. Gangadharan et al. <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b2">3]</ref> address the issue of service license composition and compatibility analysis. They specify a matchmaking algorithm which verifies whether two service licenses are compatible. In case of a positive answer, the services are composed and the framework determines the license of the composite service. Even if the Compatibility Evaluation algorithm we propose follows the basic steps of <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b2">3]</ref>, there are several differences between the two approaches. The application scenarios are different (service composition vs Web of Data) and this opens different problems. In particular, the subsumption and compatibility rules we define are different, and this affects also the algorithm definition. The composite license mirrors the same differences. Truong et al. <ref type="bibr" target="#b8">[9]</ref> address the issue of analyzing data contracts which leads to the definition of a contract composition where first the comparable contractual terms from the different data contracts are retrieved, and second an evaluation of the new contractual terms for the data mashup is addressed. This work concentrates on data contracts and not on data licenses. However, there are also some common points like the use of RDF for the representation of data licenses/contracts. Krotzsch and Speiser <ref type="bibr" target="#b6">[7]</ref> present a semantic framework for evaluating ShareAlike recursive statements, developing a general policy modelling language for supporting selfreferential policies as expressed by CC. We address another kind of problem that is the composition of the licenses constraining the data into a unique composite license.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Creative Commons licensing language</head><p>The Creative Commons licenses are a family of license models for publishing resources on the Web, enabling the reuse of the data instead of the forbidden by default approach of traditional copyright specifications. The basic anatomy of such licenses<ref type="foot" target="#foot_5">6</ref> is composed by a Work which is a potentially copyrightable work, a License that is a set of permissions/requirements/prohibitions to the users of a Work, and a Jurisdiction that is the legal jurisdiction of a license.   Figure <ref type="figure" target="#fig_2">2</ref>.a shows the structure of a license in our framework. This structure is as desired general enough to include possible extensions to the CC vocabulary. Each license L is composed by a set of models. In particular, the models are cc:Permission, cc:Requirement, and cc:Prohibition, and express the kind of actions respectively permitted, required and prohibited on the data. A huge amount of vocabularies includes properties and classes about the licensing terms for data reuse. In order to achieve interoperability, we align the CC vocabulary with these vocabularies, in some cases very heterogeneous, including licensing terms. A summarization of the alignment about licensing terms<ref type="foot" target="#foot_6">7</ref> is visualized in Figure <ref type="figure" target="#fig_4">3</ref>. Note that in the CC vocabulary it holds that cc:license subPropertyOf dcterms:license, and cc:License subClassOf dcterms:LicenseDocument. The alignment considers the following vocabularies: the Description of a Project vocabulary (doap) <ref type="foot" target="#foot_7">8</ref> , the Ontology Metadata vocabulary (omv) <ref type="foot" target="#foot_8">9</ref> , the Data Dictionary for Preservation Metadata (premis) 10 , the Vocabulary Of Attribution and Governance (voag) 11 , the NEPOMUK Informa-tion Element Ontology (nie)<ref type="foot" target="#foot_11">12</ref> , the Music Ontology (mo)<ref type="foot" target="#foot_12">13</ref> , the Good Relations vocabulary (gr)<ref type="foot" target="#foot_13">14</ref> , and the myExperiment Base Ontology (meb) <ref type="foot" target="#foot_14">15</ref> . </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Licenses compatibility analysis</head><p>For providing a definition of the Compatibility Evaluation algorithm, we need to define a set of compatibility rules assessing the possible compatibility among the elements composing the licenses, following <ref type="bibr" target="#b3">[4]</ref> for service licenses. First, there are certain elements which are broader in scope of permission than other elements. Consider the following example where two named graphs<ref type="foot" target="#foot_15">16</ref> R 1 and R 2 respectively are associated to different licenses where the elements are cc:Reproduction and cc:Sharing. The consumer's query returns triples from both R 1 and R 2 , thus the compatibility between the two licenses has to be verified. Reproduction permits making multiple copies, and Sharing permits commercial derivatives <ref type="bibr" target="#b0">[1]</ref>.</p><p>In our model these two licenses are evaluated as compatible because cc:Sharing subsumes cc:Reproduction. In this context, subsumption means that there is a compatibility if a certain element el i is more permissive, i.e., it accepts more, than the other element el j . For instance, we have that cc:Sharing is more permissive than cc:Reproduction where only multiple copies are allowed but no commercial derivatives. The way the two licenses then are combined into the composite license L c depends on the redefinition rule applied by the data provider (Section 5). The subsumption rules for the cc:Permission elements are shown in Table <ref type="table" target="#tab_1">1</ref>. Table <ref type="table" target="#tab_2">2</ref>.a shows whether an element el 1 is compatible with another element el 2 , i.e., el 1 el 2 , for cc:Permission elements. Rules are expressed under the form of truth tables where elements are evaluated as compatible T , or incompatible F . The rationale behind Table <ref type="table" target="#tab_2">2</ref>.a is that the elements are compatible if there is a subsumption relation between them. Thus, the only case in which the compatibility does not hold is when the elements are cc:Sharing and cc:Distribution where no subsumption relation is present because cc:Sharing allows for non-commercial distribution only, while cc:Distribution allows commercial distribution as well. Note that compatibility in our rules is bi-directional, where (el 1 compatible with el 2 ) is equivalent to (el 2 compatible with el 1 )<ref type="foot" target="#foot_16">17</ref> .</p><p>As in <ref type="bibr" target="#b3">[4]</ref>, a possible situation which may arise is that one license specifies clauses which are not specified by the other license, e.g., one license specifies cc:Prohibition and the other license does not specify this clause. Table 2.b shows the compatibility rules for cc:Requirement, cc:Prohibition and cc:Permission elements against Unspecified elements<ref type="foot" target="#foot_17">18</ref> . Concerning cc:Requirement, the requirement for specification of Attribution does not affect the compatibility with Unspecified (the same holds for Notice, SourceCode, and CopyLeft), and ShareAlike affects the composite license L c by requiring that L c is similar to the original license. Concerning cc:Prohibition, these elements are incompatible with Unspecified. For instance, commercial use is the default setting of the licenses, thus we cannot assume compatibility if commercial use is denied by NonCommercial<ref type="foot" target="#foot_18">19</ref> . The same holds for the cc:HighIncomeNationUse element. Concerning cc:Permission, we follow the "conservative" approach in handling Unspecified elements where unspecification means a denial of compatibility <ref type="bibr" target="#b2">[3]</ref>.</p><p>Finally, the Compatibility Evaluation algorithm has to verify whether and how cc:Requirement elements and cc:Prohibition elements are compatible due to the constraints they specify. Assume two licenses L 1 and L 2 where L 1 contains a clause specifying the requirement for cc:Attribution and L 2 contains a clause specifying the prohibition cc:CommercialUse. Are these clauses compatible? The answer is that, despite Table <ref type="table" target="#tab_2">2</ref>.b, they are compatible as far as the composition of the two licenses includes both the elements, i.e., it satisfies both the cc:CommercialUse clause and the cc:Attribution clause. This means that these license sets are non-disjoint. The rules detailed in Tables 1-3 provide the basic components of our Compatibility Evaluation algorithm. Let L(C) = {L 1 (R 1 ), . . . , L n (R n )} be the set of licenses associated to the n named graphs affected by the consumer's query where L i (R i ) is the license associated with named graph R i , then our objective is to compose these licenses into the composite license L c . Definition 2 defines that two licenses are compatible if for all their clauses l i , these clauses are compatible. Definition 2. A license L i is compatible with another license L j , L i L j , if ∀l i=1,...,n ∈ L i , ∀l j=1,...,m ∈ L j it holds that l i l j .</p><p>The Compatibility Evaluation algorithm (Figure <ref type="figure" target="#fig_6">4</ref>) compares the clauses of the licenses to be composed. Two licenses are compatible if the models in both the licenses are compatible<ref type="foot" target="#foot_19">20</ref> . The models are compatible if (i) the models are the same (m i ≡ m j ), (ii) the models are composed by elements which satisfy  the compatibility rules (Table <ref type="table" target="#tab_3">3</ref>), and (iii) their elements are compatible. The elements compatibility is checked with respect to the following features: (i) the elements are the same (el i ≡ el j ), (ii) the elements satisfy the subsumption rules (Table <ref type="table" target="#tab_1">1</ref>) (SubsumptionRules(el i , el j )), (iii) the elements satisfy the compatibility rules against Unspecified (Table <ref type="table" target="#tab_2">2</ref>.b) (UnspecifiedRules(el i , el j )), (iv) the elements satisfy the compatibility rules (Table <ref type="table" target="#tab_3">3</ref>) (IntersectionRules(el i , el j )), and (v) all the nested elements (if any) are compatible.</p><p>Example 1. Assume we want to analyze two licenses L 1 and L 2 , visualized in Figure <ref type="figure">5</ref>. Following our algorithm, we first compare the two licenses at the model level. Both licenses contain the model cc:Permission, and both of them permit cc:Reproduction. As the models are of the same type and the elements are of the same type as well, then the compatibility is verified. License L 1 contains the model cc:Requirement with the element cc:Notice. This model is not specified by license L 2 , but following the compatibility rules of Table <ref type="table" target="#tab_2">2</ref>.b, the algorithm evaluates that the element cc:Notice is compatible with an Unspecified element. Compatibility is maintained. Finally, the algorithm evaluates the model cc:Prohibition of L 2 , looking for the same model and the same @prefix cc: http://creativecommons.org/ns. @prefix : http://example/licenses. :lic1 a cc:License; cc:permits cc:Reproduction; cc:requires cc:Notice.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>(a)</head><p>THE MODELS:</p><p>-Permission -Requirement THE ELEMENTS @prefix cc: http://creativecommons.org/ns. @prefix : http://example/licenses. :lic2 a cc:License; cc:permits cc:Reproduction; cc:prohibits cc:CommercialUse.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>(b)</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>THE MODELS:</head><p>-Permission -Prohibition THE ELEMENTS Fig. <ref type="figure">5</ref>. An example of compatibility evaluation which leads to a false result.</p><p>element (cc:CommercialUse) in license L 1 . The model cc:Prohibition is not specified in L 1 . From Table <ref type="table" target="#tab_2">2</ref>.b, the algorithm evaluates cc:CommercialUse and Unspecified incompatible. The compatibility evaluation of the licenses leads to a false answer.</p><p>The following properties hold for the license compatibility relation we define.</p><p>Property 1 (Symmetry). Let L i , L j be two compatible licenses, then ∀l i ∈ L i , ∀l j ∈ L j it holds that if l i l j then l j l i .</p><p>The proof follows from the definition of the Compatibility Evaluation algorithm and the composition rules. The symmetry property ensures that</p><formula xml:id="formula_0">L i L j ≡ L j L i .</formula><p>Property 2 (Associativity). Let L i , L j , L k be three licenses then it holds that</p><formula xml:id="formula_1">(L i L j ) L k = L i (L j L k ). Proof. (Associativity ⇒) Let the clause l ∈ (L i L j ) L k , then either l ∈ (L i L j ) or l ∈ L k (or both). This means that either l ∈ L i , or l ∈ L j , or l ∈ L k . Thus l ∈ L i or l ∈ (L j L k ). This means that l ∈ L i (L j L k ) where (L i L j ) L k ⊂ L i (L j L k ). (Associativity ⇐) Let the clause l ∈ L i (L j L k ), then either l ∈ L i or l ∈ (L j ) L k ) (or both). This means that either l ∈ L i , or l ∈ L j , or l ∈ L k . Thus l ∈ (L i L j ) or l ∈ L k . This means that l ∈ (L i L j ) L k where L i (L j L k ) ⊂ (L i L j ) L k .</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Licenses Composition</head><p>The result of the Compatibility Evaluation algorithm is an answer of the kind true/false. If the answer is false, then the licenses considered for composition are not compatible. In this case, we leave to the data provider to decide the strategy to deal with this situation: (i) the data queried by the consumer is returned without attaching any license, (ii) the data is returned together with the more constraining license among L(C), and (iii) the data is returned together with the less constraining license among L(C).</p><p>If the licenses, instead, are compatible then we compose the licenses in L(C) such that the resulting composite license L c satisfies the licensing clauses specified by the licenses in L(C). In particular, the definition of the composite license satisfies the following properties, where is the composition relation. First, the composite license L c can be generated only if all the licenses composing it are compatible (Section 4). Property 3. Let L 1 , . . . , L n be n licenses to be composed in a composite license</p><formula xml:id="formula_2">L c . If it holds that L 1 L 2 . . . L n , then L c = L 1 . . . L n .</formula><p>Second, the composite license, if allowed, is consistent with the set of licenses used to compose it, due to the Compatibility Evaluation module. Property 4. Let L 1 (R 1 ), . . . , L n (R n ) be the licenses associated to named graphs R 1 , . . . , R n . The composite license L c for R 1 , . . . , R n is consistent with</p><formula xml:id="formula_3">L 1 (R 1 ), . . . , L n (R n ).</formula><p>We now define the algorithm which generates L c from the set of compatible licenses. The first step consists in providing the redefinition rules the algorithm applies in case a subsumption relation holds among these license elements. Table <ref type="table" target="#tab_1">1</ref> shows the redefinition rules given the subsumption relation among the elements. We identify two kinds of redefinition: a redefinition where the more permissive element is included in L c , and another redefinition which follows the converse. The second step consists in detailing the composition which results from the rules expressed in Table <ref type="table" target="#tab_3">3</ref>. The composition of these license elements is achieved by assigning both these elements to the composite license L c . These rules are necessary to keep the consistency of L c w.r.t. the licenses composing it. These rules overcome the compatibility rules against Unspecified, and the heuristics which guide the composition. This leads to the generation of some popular licensing terms: Attribution-ShareAlike, Attribution-NonCommercial, and ShareAlike-NonCommercial, namely. This is a minimum set of composed licenses. However, other composition rules can be defined, depending on the requirements of the data provider.</p><p>The algorithm for Composite License Generation is listed in Figure <ref type="figure" target="#fig_8">6</ref>. The pre-requisite for license composability is that the licenses in L(C) are compatible with each others. If so, L(C) are composed as follows: the ElementsSelectionHeuristic procedure adopts the heuristic h to combine the elements of each license together in a single license L c .  The three basic heuristics we consider in our framework are the following <ref type="bibr" target="#b1">[2]</ref>: OR-composition : If there is at least one of the licenses involved in the composition that owns a clause then also the composite license owns it; AND-composition : if all the licenses involved in the composition own a clause then also the composite license owns it; Constraining-value : the most constraining clause among the ones offered by the licenses is included in the composite license.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Algorithm 4: Composite License Generation Algorithm</head><p>In this paper, we do not argue in favor or against these three possibilities, and a qualitative evaluation of the best composition heuristics is left as future work. We leave to the data provider the choice of her best strategy for composing the licenses, e.g., AND-composition typically leads to a shorter and simpler license, while OR-decomposition leads to a more complex license where all the clauses present in L(C) are included. @prefix cc: http://creativecommons.org/ns. @prefix : http://example/licenses. :lic1 a cc:License; cc:permits cc:Reproduction; cc:requires cc:ShareAlike.</p><p>@prefix cc: http://creativecommons.org/ns. @prefix : http://example/licenses.</p><p>:lic2 a cc:License; cc:permits cc:DerivativeWorks; cc:prohibits cc:CommercialUse.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>(b)</head><p>@prefix cc: http://creativecommons.org/ns. @prefix : http://example/licenses.</p><p>:licComposite a cc:License; cc:permits cc:DerivativeWorks; cc:requires cc:ShareAlike; cc:prohibits cc:CommercialUse. Example 2. Assume we want to analyze two licenses L 1 and L 2 , visualized in Figure <ref type="figure" target="#fig_9">7</ref>.a-b. Following our Compatibility Evaluation algorithm, we first compare the two licenses at the model level. Both licenses contain the model cc:Permission, the first element is cc:Reproduction, and the second element is cc:DerivativeWorks. Even if the two elements are not the same, we know from Table <ref type="table" target="#tab_1">1</ref> that there is a subsumption relation between the element cc:Reproduction and cc:DerivativeWorks. This means that we can redefine one element as the other element when we build the composite license. The first license does not contain the model cc:Prohibition and the second license does not contain the model cc:Requirement. However, the two respective elements cc:ShareAlike and cc:CommercialUse are compatible due to Table <ref type="table" target="#tab_3">3</ref>, and then the two licenses are compatible. Thus composition is allowed, following Table <ref type="table" target="#tab_3">3</ref> and the composition rules defined above. L c is obtained using the more permissive permission (Table <ref type="table" target="#tab_1">1</ref>), and the OR-composition heuristic (Figure <ref type="figure" target="#fig_9">7</ref>.c). L c permits cc:DerivativeWorks (from the redefinition rules of Table <ref type="table" target="#tab_1">1</ref>), requires cc:ShareAlike, and prohibits cc:CommercialUse.</p><p>In this paper, we present a framework to associate licensing terms to the data on the Web of Data. The result of the consumer's query is not only the requested data but also its licensing terms. We adopt the CC licenses vocabulary because it provides a general schema for data licensing, and we align it with the other vocabularies including license specifications. First, we verify the compatibility among the licenses associated to the various named graphs affected by the consumer's query. Second, given the compatibility among the composed licenses, we construct L c which aggregates the clauses of the distinct licenses following the heuristics we propose. The composite license is finally returned to the consumer together with the requested data.</p><p>The first point to be addressed is a legal and social validation of the proposed framework, answering research questions like "Can data be licensed at all depending on the jurisdiction?" and "Can non CC licenses be related to CC licenses in a legally correct way to extend the framework to non CC licenses and waivers?". This means to evaluate, not only quantitatively the performance of the algorithms to retrieve remote licensing statements referenced in the data, but also the legal value of the composite license. We will address an evaluation of the disparate named graphs that might be used in a "typical" query, and their relative proportions that have compatible or incompatible licenses. Moreover, we plan to integrate our framework with the W3C provenance working group's activities <ref type="foot" target="#foot_20">21</ref> . Finally, we are defining more complex heuristics like the one looking for the minimal composite license.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. The architecture of the license compatibility and composition framework.</figDesc><graphic coords="2,264.91,116.14,69.27,83.06" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. The basic structure of our licenses (a), and the CC models and elements (b).</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 2 .</head><label>2</label><figDesc>Figure2.a shows the structure of a license in our framework. This structure is as desired general enough to include possible extensions to the CC vocabulary. Each license L is composed by a set of models. In particular, the models are cc:Permission, cc:Requirement, and cc:Prohibition, and express the kind of actions respectively permitted, required and prohibited on the data. Figure 2.b shows the set of elements associated to each model in CC. Our general model of licenses takes into account for generality purposes also the possible values assigned to the elements.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Fig. 3 .</head><label>3</label><figDesc>Fig. 3. Alignment between the CC vocabulary and other vocabularies.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Algorithm 1 :</head><label>1</label><figDesc>LicensesCompatibility Input: Licenses Li, Lj Output: true if Li Lj , false otherwise if ∀mi : mi ∈ Li, ∃mj : mj ∈ Lj ← ModelsCompatibility(mi, mj ) = true ∧ ∀mj : mj ∈ Lj , ∃mi : mi ∈ Li ← ModelsCompatibility(mi, mj ) = true then compatible = true; else compatible = f alse; return compatible; Algorithm 2: ModelsCompatibility Input: Models mi, mj Output: true if mi mj , false otherwise if ((mi ≡ mj ) ∨ IntersectionRules(mi, mj ))∧ ∀eli : eli ∈ mi, ∃elj : elj ∈ mj ← ElementsCompatibility(eli, elj ) = true ∧ ∀elj : elj ∈ mj , ∃eli : eli ∈ mi ← ElementsCompatibility(eli, elj ) = true then compatible = true; else compatible = f alse; return compatible; Algorithm 3: ElementsCompatibility Input: Elements eli, elj Output: true if eli elj , false otherwise if ((eli ≡ elj ) ∨SubsumptionRules(eli, elj ) ∨ UnspecifiedRules(eli, elj ) ∨ IntersectionRules(eli, elj ))∧ ∀eli : eli ∈ mi, ∃elj : elj ∈ mj ← ElementsCompatibility(eli, elj ) = true ∧ ∀elj : elj ∈ mj , ∃eli : eli ∈ mi ← ElementsCompatibility(eli, elj ) = true then compatible = true; else compatible = f alse; return compatible;</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_6"><head>Fig. 4 .</head><label>4</label><figDesc>Fig. 4. The compatibility evaluation procedure.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_7"><head>Input:</head><label></label><figDesc>Compatible Licenses L1, . . . , Ln Output: Composite license Lc Lc ← ElementsSelectionHeuristic h (L(C)); return Lc;</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_8"><head>Fig. 6 .</head><label>6</label><figDesc>Fig.6. The procedure for generating the composite license.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_9"><head>Fig. 7 .</head><label>7</label><figDesc>Fig. 7. Compatibility evaluation and composition which leads to a true result.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 1 .</head><label>1</label><figDesc>Compatibility rules for subsumption relation for cc:Permission elements.</figDesc><table><row><cell>Subsumption</cell><cell cols="2">More permissive Less permissive</cell></row><row><cell>DerivativeW orks Sharing</cell><cell>DerivativeW orks</cell><cell>Sharing</cell></row><row><cell>Distribution Reproduction</cell><cell>Distribution</cell><cell>Reproduction</cell></row><row><cell cols="3">DerivativeW orks Reproduction DerivativeW orks Reproduction</cell></row><row><cell>Sharing Reproduction</cell><cell>Sharing</cell><cell>Reproduction</cell></row><row><cell cols="3">DerivativeW orks Distribution DerivativeW orks Distribution</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 2 .</head><label>2</label><figDesc>Table 3 shows the compatibility rules applied to the requirements cc:Attribution and cc:ShareAlike and the (a) Compatibility rules among cc:Permission elements, (b) Compatibility rules among cc:Requirement, cc:Prohibition and cc:Permission elements against Unspecified.</figDesc><table><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>el1</cell><cell>el2</cell><cell>el1</cell><cell>el2</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="2">N otice</cell><cell>U nspecif ied</cell><cell>T</cell></row><row><cell>el1 Sharing Reproduction Reproduction DerivativeW orks el2 DerivativeW orks Distribution Reproduction Sharing Distribution DerivativeW orks Sharing Distribution</cell><cell>el1</cell><cell>T T T T T F</cell><cell>el2</cell><cell cols="3">Attribution ShareAlike SourceCode CopyLef t N onCommercial HighIncomeN ationU se U nspecif ied U nspecif ied U nspecif ied U nspecif ied U nspecif ied U nspecif ied Reproduction U nspecif ied Distribution U nspecif ied</cell><cell>T T T T F F F F</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="3">DerivativeW orks</cell><cell>U nspecif ied</cell><cell>F</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="2">Sharing</cell><cell>U nspecif ied</cell><cell>F</cell></row><row><cell>el1</cell><cell></cell><cell></cell><cell>el2</cell><cell>el1</cell><cell cols="2">el2 L1 L2</cell></row><row><cell cols="2">Attribution</cell><cell cols="3">ShareAlike</cell><cell>T</cell><cell>el1 ∧ el2</cell></row><row><cell cols="5">Attribution N onCommercial</cell><cell>T</cell><cell>el1 ∧ el2</cell></row><row><cell cols="5">ShareAlike N onCommercial</cell><cell>T</cell><cell>el1 ∧ el2</cell></row></table><note>cc:CommercialUse prohibition where, with a slight abuse of notation, we write L 1 L 2 = el 1 ∧ el 2 to indicate that the composition of the licenses will include both the elements.</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 3 .</head><label>3</label><figDesc>Compatibility</figDesc><table /><note>rules among cc:Requirement and cc:Prohibition elements.</note></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">http://www.w3.org/TR/rdf-sparql-XMLres/#head</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">http://mpeg.chiariglione.org/standards/mpeg-21/mpeg-21.htm</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">http://opendatacommons.org/licenses/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">http://vocab.org/waiver/terms/.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_4">http://odrl.net/Profiles/Services/WD-20080402.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_5">http://creativecommons.org/ns</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_6">The research of the vocabularies including licensing terms has been addressed using the LOV dataset (available at http://labs.mondeca.com/dataset/lov/).</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_7">http://usefulinc.com/ns/doap</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="9" xml:id="foot_8">http://omv2.sourceforge.net/index.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="10" xml:id="foot_9">http://bit.ly/MVTZkr</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="11" xml:id="foot_10">http://voag.linkedmodel.org/schema/voag</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="12" xml:id="foot_11">http://www.semanticdesktop.org/ontologies/2007/01/19/nie/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="13" xml:id="foot_12">http://purl.org/ontology/mo/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="14" xml:id="foot_13">http://purl.org/goodrelations/v1</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="15" xml:id="foot_14">http://rdf.myexperiment.org/ontologies/base/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="16" xml:id="foot_15">The discussion about the use of named graphs in RDF 1.1 can be found at http://www.w3.org/TR/rdf11-concepts</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="17" xml:id="foot_16">Differently from software compatibility, it is not a matter of having the same element changed in different versions, but to have different elements to be compared, e.g., Sharing and Reproduction.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="18" xml:id="foot_17">We interpret unspecified elements as "do not care", as in<ref type="bibr" target="#b3">[4]</ref>.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="19" xml:id="foot_18">Note that in this section we refer to the element cc:CommercialUse as NonCommercial for keeping clear that it is a prohibition.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="20" xml:id="foot_19">In this paper, we do not consider the presence of sub-elements and of numerical values assigned to the elements, e.g., financial use, however the algorithm can easily be extended for treating these cases as well.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="21" xml:id="foot_20">http://www.w3.org/2011/prov/</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">ccREL: The Creative Commons Rights Expression Language</title>
		<author>
			<persName><forename type="first">H</forename><surname>Abelson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Adida</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Linksvayer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Yergler</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Web Service Contracts: Specification, Selection and Composition</title>
		<author>
			<persName><forename type="first">M</forename><surname>Comerio</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2009">2009</date>
		</imprint>
		<respStmt>
			<orgName>University of Milano-Bicocca</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">PhD thesis</note>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Consumer-specified service license selection and composition</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">R</forename><surname>Gangadharan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">L</forename><surname>Truong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Treiber</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>D'andrea</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Dustdar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Iannella</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Weiss</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ICCBSS, IEEE</title>
				<imprint>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="194" to="203" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Service license composition and compatibility analysis</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">R</forename><surname>Gangadharan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Weiss</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>D'andrea</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Iannella</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ICSOC, LNCS</title>
				<imprint>
			<date type="published" when="2007">2007</date>
			<biblScope unit="volume">4749</biblScope>
			<biblScope unit="page" from="257" to="269" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<author>
			<persName><forename type="first">T</forename><surname>Heath</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Bizer</surname></persName>
		</author>
		<title level="m">Linked Data: Evolving the Web into a Global Data Space</title>
				<imprint>
			<publisher>Morgan &amp; Claypool</publisher>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<author>
			<persName><forename type="first">R</forename><surname>Iannella</surname></persName>
		</author>
		<title level="m">Open Digital Rights Language (ODRL)</title>
				<imprint>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Sharealike your data: Self-referential usage policies for the semantic web</title>
		<author>
			<persName><forename type="first">M</forename><surname>Krötzsch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Speiser</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ISWC, LNCS</title>
				<imprint>
			<date type="published" when="2011">2011</date>
			<biblScope unit="volume">7031</biblScope>
			<biblScope unit="page" from="354" to="369" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">Open data commons, a license for open data</title>
		<author>
			<persName><forename type="first">P</forename><surname>Miller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Styles</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Heath</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2008">2008</date>
			<publisher>LDOW</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">On analyzing and developing data contracts in cloud-based data marketplaces</title>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">L</forename><surname>Truong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">R</forename><surname>Gangadharan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Comerio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Dustdar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">De</forename><surname>Paoli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">AP-SCC, IEEE</title>
				<imprint>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="174" to="181" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
