<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Using rules for assessing and improving data quality: A case study for the Norwegian State of Estate report</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Ling</forename><surname>Shi</surname></persName>
							<email>ling.shi@statsbygg.no</email>
						</author>
						<author>
							<persName><forename type="first">Dumitru</forename><surname>Roman</surname></persName>
							<email>dumitru.roman@sintef.no</email>
							<affiliation key="aff1">
								<orgName type="institution">SINTEF</orgName>
								<address>
									<addrLine>Blindern</addrLine>
									<postBox>Pb. 124</postBox>
									<postCode>0314</postCode>
									<settlement>Oslo</settlement>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff0">
								<address>
									<addrLine>1 Statsbygg, Dep</addrLine>
									<postBox>Pb. 8106</postBox>
									<postCode>0032</postCode>
									<settlement>Oslo</settlement>
									<country key="NO">Norway</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Using rules for assessing and improving data quality: A case study for the Norwegian State of Estate report</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">3491007A3B532A47D5BB689ED9E3B91D</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T22:43+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Rule-based approach</term>
					<term>Data quality assessment</term>
					<term>Data integration</term>
					<term>Report service</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The Norwegian State of Estate (SoE) report service -a service providing information about central government properties in Norway -is a result of integrating cross-domain government data originating from the Norwegian cadastral system, Business Entity Register, Building Accessibility Register and Statsbygg's property management system. This paper presents a rule-based approach to assess and improve the quality of the data upon which the SoE service is built. The approach develops a set of rules to specify a common data schema, rules for data quality assessment, and three dedicated measurement metrics for data integration. Application scenarios of the approach in identifying data inconsistencies in the sources are exemplified with strategies to improve data quality.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>A State of Estate (SoE) report 1 produces a complete list of state-owned real estates 2 , and represents a key input to the decision making process of the government or other stakeholders to increase the effectiveness of the public resources allocation. The SoE report in Norway is published as an attachment 3 to the proposed parliamentary resolution No.1 every four years by Statsbygg 4 on behalf of the Ministry of Local Government and Modernization 5 . The current reporting process is manual, static and errorprone and the report is outdated when it is produced, therefore the report does not properly support the decision making process. A new State of Estate (SoE) report generation process was introduced in <ref type="bibr" target="#b0">[1]</ref> to carry out the reporting task in a more effective way, realized as a reporting service. It aims to provide users with a dynamic and up-to-date report, including data visualization features, to better support the users' decision making process.</p><p>The new SoE reporting service reuses existing government data, from both open and proprietary sources, and integrates them in a way that can serve as a basis for the creation of the SoE service. The data sources include the Norwegian cadastral system <ref type="foot" target="#foot_0">6</ref> , Business Entity Register<ref type="foot" target="#foot_1">7</ref> , Building Accessibility Register<ref type="foot" target="#foot_2">8</ref> , and Statsbygg's property management system. Though data are collected from the most authoritative government agencies, they are not 100% consistent with each other and the inconsistency is one of the main challenges to create the SoE service. Our focus and contribution in this paper is to establish a rule-based approach which develops a set of rules to assess and improve the data quality. A rule-based approach is suitable in this context, quick to implement, and easy to document and understand.</p><p>The rest of this paper is structured as follows. Section 2 describes the SoE report service case focusing on the value proposition. Section 3 presents the rule-based approach for data quality assessment and improvement. Section 4 summarizes the paper and outlines possibilities for further work. The SoE service allows the property owners in the public sector to do quality assessment <ref type="bibr" target="#b1">[2]</ref> on data of their own real estates. It should also provide the reporting and visualization functions of state-owned properties to the above mentioned customer groups.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Norwegian SoE value proposition</head><p>The value proposition canvas<ref type="foot" target="#foot_3">9</ref> for property owners in the public sector is shown as an example in Fig. <ref type="figure" target="#fig_0">1</ref> and Fig. <ref type="figure" target="#fig_1">2</ref>. The property owners' pains and gains are listed up in the customer segment profile. Improved data quality, reliability, completeness and accessibility are the main gains against the pains on static reports, manual data collection and quality control and missing records. The value proposition map designs the SoE report service and its Gain Creators and Pain Relievers, including data quality requirements on improved quality and completeness of the report and reduced number of missing buildings.  In order to meet the data quality requirements illustrated in the value proposition canvas, we established a rule-based approach to assess and improve the data quality firstly of the source data and thereafter the result data of integration. Data quality rules are contextual <ref type="bibr" target="#b2">[3]</ref> and the rules developed in this section are therefore valid within the context of the SoE report service though the general method to categorize and define the rules can be reusable in other contexts. The following sub-sections cover the rules to specify a common data schema in Sub-section 3.1, rules for data quality assessment in Sub-section 3.2, measurement metrics for data integration is introduced in Subsection 3.3, and strategies for improving data quality is presented in Sub-section 3.4.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Rules to specify a common data schema</head><p>Data inconsistency and redundancy are well-known challenges in cross-domain data integration. For example, the cadastral ownership information and building information are registered both in the cadastral system and Statsbygg's property management system with different updating status; a property owner's organization number and name are registered both in the cadastral system and Business Entity Register but the cadastral system and the Business Entity Register are not synchronized. This subsection presents several steps: firstly to decide the master source systems for the involved domains, afterwards to define rules to specify a common data schema and integration keys.</p><p>As a first step we make a decision on which source system is the master for each domain or sub-domain involved in the data integration. The government organizational structure reflects the domain responsibility for government data. Both the Business Entity Register and the Cadastral system are national registers and provide data with relatively high quality, therefore those two systems are defined as the master or primary data sources for the correspondingly organization domain and cadastral domain. Statsbygg's property management system is defined as a supplementary data source for the cadastral domain. The Building Accessibility Register is defined as a supplementary data source for the cadastral building sub-domain.</p><p>Though each source system has its own data schema, there is no common data schema available for this data integration process. The next step is to define rules and exceptions to build a common data schema on the class and attribute levels.</p><p>Rules to specify a common data schema on the class levels. This type of rule decides which source system is the master for a specified class. For example, the "Organization" class from the Business Entity Register and the "Building" class from the cadastral system are the master classes with national unique identifiers. However there are also exceptions because of some special business rules in practice. For example: Buildings less than 15 square meters are not required to be registered in the cadastral register, neither do the embassy buildings in foreign countries. A supplementary unique identifier for the "Building" class is needed to handle the exception buildings without national unique identifier from the cadastral system.</p><p>Rules to specify a common data schema on the attribute levels. The rules decide which source system is the master for some specified attributes. For example: the Building Accessibility Register is the master for the accessibility attributes of a building though the "Building" class in the cadastral system is defined as the master class.</p><p>The last step in this sub-section defines rules to specify attributes that can be used to connect heterogeneous data sources (integration keys). The integration keys are normally the unique identifiers of the master classes. For example, the organization number for a real estate owner is an integration key to connect the cadastral system to Business Entity Register. There are exceptions in cases such as a supplementary unique identifier is needed to cover buildings less than 15 square meters. Both the primary and supplementary unique identifiers are used in the integration to return a complete building list for the SoE report service.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Rules for data quality assessment</head><p>The result data is an integrated result of multiple source data. Both the source and result data should be screened for potential syntactic and semantic errors using data quality rules generated from existing domain models or expert knowledge. Examples of different types of data quality rules include:  Mandatory: The property owner is mandatory for a property ownership record. The rule is broken when the property owner is missing.  Data type: The area field of a building should be numeric. The rule is broken when the area field set to text "N/A".  Data length: A municipality number should be four digits. The rule is broken when a municipality number is made of three digits.  Uniqueness: The cadastral building number should be unique. It breaks the rule when one cadastral building number is registered on more than one building in the Statsbygg's property management system.  Cardinality: A cadastral parcel is located in a municipality. The rule is broken if the municipality field is missing.  Data domain and range: The valid values of cadastral parcel ownership types in this case should be either owned or leased. Including other values than those two breaks the rule.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3">Measurement metrics for data integration</head><p>In addition to the above rules, Table <ref type="table">1</ref> shows three measurement metrics that are dedicated to identify quality problems in the data integration and measure the quality of the integration result.</p><p>The integration keys are attributes used to connect heterogeneous data sources, and they are currently registered manually in the referring systems (systems that refer to the key attributes). There is no automatic updating of the values of integration keys after registration. For example the organization number is registered as an identifier for property owners when an ownership record is created in the cadastral system and it is then kept to be static in the cadastral system and does not follow the changes or deletions in Business Entity Register. Some of the integration keys are not mandatory fields in the referring systems. Here are two examples of the Key Value Quality metrics: 1) the percentage of outdated organization numbers for the property owners in the cadastral system; 2) the percentage of missing or outdated cadastral building numbers in the Statsbygg's property management system.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 1. Measurement metrics type on data integration</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Metrics type Description</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Key Value Quality</head><p>The number or percentage of missing or outdated key values registered in source data</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Integration Quality</head><p>The number or percentage of incorrect integrations in the result data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Non-matched rows</head><p>The number of non-matched rows in an integration identified using an outer join or similar techniques.</p><p>The Integration Quality metric measures the percentage of correct integrations in the integration result. Though the key values may exist in the source system for master data, it could also refer to a wrong data item. The cadastral ownership data contains both the property owners' organization number and name. The organization number is used as an integration key to integrate the cadastral ownership data with Business Entity Register. We identify afterwards the deviation between organization names from two data sources to measure the correctness of the integration result. The non-matched rows metric returns the rows from one system that is not able to be integrated with another system. This metric is especially useful when a supplementary source data has partly more updated information on some specified data items than the primary source data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4">Strategies for improving data quality</head><p>Examples of measurement metrics and corresponding quality improvement strategies are presented in Table <ref type="table" target="#tab_1">2</ref>. The result of data integration <ref type="bibr" target="#b3">[4]</ref> for the SoE service is provisioned as RDF/Linked Data through a Linked Data generation process supported by DataGraft<ref type="foot" target="#foot_4">10</ref> [5][6] <ref type="bibr" target="#b6">[7]</ref> and is available via the proDataMarket marketplace<ref type="foot" target="#foot_5">11</ref> . SPARQL queries have been used as the underlying mechanism to assess the data quality. The query results help the responsible staff assess and improve the data quality in the source systems by following the suggested strategy for quality improvement. The updated source data with better data quality are then reloaded to the integration process to produce an updated result with improved quality. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Non-matched rows</head><p>The properties were sold to a non-central government organization after the previous report was made.</p><p>No specific actions needed.</p><p>The properties are abroad. There has been organization change with the owner and the owner's organization number is no longer valid in the business entity register.</p><p>Update the owner's organization number and name in the cadastral system.</p><p>The ownership change between organizations in the public sector is not always officially registered in the cadastral system.</p><p>Inform the current owner organization to update the ownership in the cadastral system. The owner's organization is not officially registered as central government organization in the business entity register.</p><p>Update the organization in the business entity register if it is applicably or add it to the manual exception list of the central government organization data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Summary and outlook</head><p>This paper introduced the State of Estate report service together with its value proposition and the rule-based approach to address data quality issues. The report service is a result of integrating cross-domain data from multiple sources as the cadastral system, Business Entity Register, Building Accessibility Register and Statsbygg's property management system. A set of rules are developed to meet the data quality requirements on SoE report service, including rules to specify a common data schema, rules for data quality assessment and measurement metrics for data integration. Strategies for improving data quality are also presented. The rule-based approach is quick to implement and easily understandable both by domain experts and data engineers.</p><p>For the further work, the identified rules shall be transformed to executable rules if possible such that they can be applied directly in semantic reasoning to automate the quality assessment process. The suggested quality improvement strategies can also be half or fully automated to increase effectivity.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Value proposition canvas -customer segment profile</figDesc><graphic coords="3,145.68,166.68,320.64,214.50" type="vector_box" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. Value proposition canvas -value proposition map</figDesc><graphic coords="3,172.56,425.40,277.68,203.46" type="vector_box" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 .</head><label>2</label><figDesc>Examples of measurement metrics and quality improvement actions</figDesc><table><row><cell>SPARQL query</cell><cell>Type of meas-</cell><cell>Possible reasons of</cell><cell>Strategy to improve data</cell></row><row><cell>to identify:</cell><cell>urement met-</cell><cell>mismatch</cell><cell>quality</cell></row><row><cell></cell><cell>rics</cell><cell></cell><cell></cell></row><row><cell>The owner name</cell><cell>Integration</cell><cell>Delayed or missing</cell><cell>Update the owner names</cell></row><row><cell>difference between</cell><cell>Quality</cell><cell>updates of owner</cell><cell>in the cadastral system.</cell></row><row><cell>cadastral system</cell><cell></cell><cell>names in the cadas-</cell><cell></cell></row><row><cell>and business entity</cell><cell></cell><cell>tral system.</cell><cell></cell></row><row><cell>register 12</cell><cell></cell><cell></cell><cell></cell></row><row><cell>The state-owned</cell><cell>Non-matched</cell><cell>The properties were</cell><cell>No specific actions needed</cell></row><row><cell>properties that are</cell><cell>rows</cell><cell>acquired after the</cell><cell>though it reflects partially</cell></row><row><cell>missing in the</cell><cell></cell><cell>previous report was</cell><cell>the quality of the previous</cell></row><row><cell>previous SoE report 13</cell><cell></cell><cell>made. The properties were</cell><cell>SoE report.</cell></row><row><cell></cell><cell></cell><cell>forgotten to be regis-</cell><cell></cell></row><row><cell></cell><cell></cell><cell>tered in the previous</cell><cell></cell></row><row><cell></cell><cell></cell><cell>SoE report.</cell><cell></cell></row><row><cell>The state-owned</cell><cell></cell><cell></cell><cell></cell></row><row><cell>properties from the</cell><cell></cell><cell></cell><cell></cell></row><row><cell>previous SoE</cell><cell></cell><cell></cell><cell></cell></row><row><cell>report that are</cell><cell></cell><cell></cell><cell></cell></row><row><cell>missing in the</cell><cell></cell><cell></cell><cell></cell></row><row><cell>resulting SoE</cell><cell></cell><cell></cell><cell></cell></row><row><cell>report 14</cell><cell></cell><cell></cell><cell></cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_0">http://www.kartverket.no/en/Land-Registry-and-Cadestre/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_1">https://www.brreg.no/home/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_2">https://byggforalle.no/uu/sok.html?&amp;locale=en</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="9" xml:id="foot_3">https://strategyzer.com/canvas/value-proposition-canvas</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="10" xml:id="foot_4">https://datagraft.io/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="11" xml:id="foot_5">https://prodatamarket.eu/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="12" xml:id="foot_6">https://datagraft.io/prodatamarket_publisher/queries/soe-query1-the-owner-name-difference</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="13" xml:id="foot_7">https://datagraft.io/prodatamarket_publisher/queries/soe-query2-missing-soe-records</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="14" xml:id="foot_8">https://datagraft.io/prodatamarket_publisher/queries/soe-query3-missing-result-soe-records</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Acknowledgements. The work in this paper is partly supported by the EC funded project proDataMarket (Grant number: 644497).</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Norwegian State of Estate: A Reporting Service for the State-Owned Properties in Norway</title>
		<author>
			<persName><forename type="first">L</forename><surname>Shi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">E</forename><surname>Pettersen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Østhassel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Nikolov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Khorramhonarnama</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J</forename><surname>Berre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Roman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Symposium on Rules and Rule Markup Languages for the Semantic Web</title>
				<imprint>
			<publisher>Springer International Publishing</publisher>
			<date type="published" when="2015-08">2015. August</date>
			<biblScope unit="page" from="456" to="464" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Data quality assessment</title>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">L</forename><surname>Pipino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">W</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">Y</forename><surname>Wang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Communications of the ACM</title>
		<imprint>
			<biblScope unit="volume">45</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="211" to="218" />
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Discovering data quality rules</title>
		<author>
			<persName><forename type="first">F</forename><surname>Chiang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">J</forename><surname>Miller</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the VLDB Endowment</title>
				<meeting>the VLDB Endowment</meeting>
		<imprint>
			<date type="published" when="2008">2008</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="1166" to="1177" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Data integration: the teenage years</title>
		<author>
			<persName><forename type="first">A</forename><surname>Halevy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Rajaraman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ordille</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 32nd international conference on Very large data bases</title>
				<meeting>the 32nd international conference on Very large data bases</meeting>
		<imprint>
			<publisher>VLDB Endowment</publisher>
			<date type="published" when="2006-09">2006. September</date>
			<biblScope unit="page" from="9" to="16" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">DataGraft: One-Stop-Shop for Open Data Management</title>
		<author>
			<persName><forename type="first">D</forename><surname>Roman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Nikolov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Putlier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Sukhobok</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Elvesaeter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J</forename><surname>Berre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Ye</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dimitrov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Simov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zarev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Moynihan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Roberts</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Berlocher</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Heath</surname></persName>
		</author>
		<idno type="DOI">10.3233/SW-170263</idno>
	</analytic>
	<monogr>
		<title level="m">To appear in the Semantic Web Journal (SWJ) -Interoperability, Usability, Applicability</title>
				<imprint>
			<publisher>IOS Press</publisher>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
	<note>published and printed</note>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">DataGraft: Simplifying Open Data Publishing</title>
		<author>
			<persName><forename type="first">D</forename><surname>Roman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dimitrov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Nikolov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Putlier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Sukhobok</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Elvesaeter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J</forename><surname>Berre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Ye</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Simov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Petkov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ESWC (Satellite Events</title>
				<imprint>
			<biblScope unit="volume">2016</biblScope>
			<biblScope unit="page" from="101" to="106" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">DataGraft: A Platform for Open Data Publishing</title>
		<author>
			<persName><forename type="first">D</forename><surname>Roman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dimitrov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Nikolov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Putlier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Elvesaeter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Simov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Petkov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">the Joint Proceedings of the 4th International Workshop on Linked Media and the 3rd Developers Hackshop</title>
				<imprint>
			<publisher>LIME/SemDev@ESWC</publisher>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
