<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">SoCK: SHACL on Completeness Knowledge</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Muhammad</forename><forename type="middle">Jilham</forename><surname>Luthfi</surname></persName>
							<email>muhammad.jilham@ui.ac.id</email>
							<affiliation key="aff0">
								<orgName type="department">Faculty of Computer Science</orgName>
								<orgName type="institution">Universitas Indonesia</orgName>
								<address>
									<settlement>Depok</settlement>
									<region>West Java</region>
									<country key="ID">Indonesia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Fariz</forename><surname>Darari</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Faculty of Computer Science</orgName>
								<orgName type="institution">Universitas Indonesia</orgName>
								<address>
									<settlement>Depok</settlement>
									<region>West Java</region>
									<country key="ID">Indonesia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Amanda</forename><forename type="middle">Carrisa</forename><surname>Ashardian</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Faculty of Computer Science</orgName>
								<orgName type="institution">Universitas Indonesia</orgName>
								<address>
									<settlement>Depok</settlement>
									<region>West Java</region>
									<country key="ID">Indonesia</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">SoCK: SHACL on Completeness Knowledge</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">4B2FFA7EC07BCD3E93EB257EFF1C378E</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T02:23+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>SHACL</term>
					<term>Completeness</term>
					<term>Patterns</term>
					<term>Shapes</term>
					<term>Validation</term>
					<term>DBpedia</term>
					<term>Wikidata</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The proliferation of applications based on knowledge graphs (KGs) in the recent years has created increasing demands for high-quality KGs. Completeness is an important quality aspect concerning the breadth, depth, and scope of data in KGs. In that light, describing and validating completeness over KGs have become a must go in order to make an informed decision whether or not to use (parts of) KGs. In this paper, we propose SoCK (short for SHACL on Completeness Knowledge), a pattern-oriented framework to support the creation and validation of knowledge about completeness in KGs. The framework relies on SHACL, a W3C recommendation for validating KGs against a collection of constraints. In SoCK, we first offer a number of patterns capturing how completeness requirements are typically expressed in a high-level way. Such completeness patterns can then be instantiated in various manners over different KG domains. These instantiations result in SHACL shapes that can be validated against KGs to provide a completeness profile of the KGs. As a proof-of-concept, we implement and demonstrate the SoCK framework as a Python library, creating over 360k SHACL shapes for real-world KGs (in our case, DBpedia and Wikidata) based on the aforementioned completeness patterns. We also develop a web app to serve as an information point for anything about SoCK, available at https://sock.cs.ui.ac.id/.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The current massive development of knowledge graphs (KGs) may cause various problems related to data quality <ref type="bibr" target="#b0">[1]</ref>. The quality of data can affect application quality and is related to problems in publishing and using data <ref type="bibr" target="#b1">[2]</ref>. An aspect of quality is completeness of information. Completeness measures how much information is contained in a dataset <ref type="bibr" target="#b2">[3]</ref>. It is an aspect of data quality related to the breadth, depth, and scope of information contained in data <ref type="bibr" target="#b3">[4]</ref>. The aspect of completeness is one of the most important ones in measuring data quality and can indirectly affect other aspects, such as data accuracy and consistency <ref type="bibr" target="#b4">[5]</ref>. As for open KGs (e.g., DBpedia and Wikidata), their collaborative nature causes the diversity of data to be higher and that data quality, especially the aspect of completeness, is of particular concern <ref type="bibr" target="#b5">[6]</ref>.</p><p>Figure <ref type="figure" target="#fig_0">1</ref> shows two Wikidata entities of type hotel with different level of completeness in that Beverly Hills Hotel is missing the hotel rating information as opposed to that of Hotel Indonesia (as of <ref type="bibr">July 10, 2022)</ref>. Nevertheless, both hotels are still complete with respect to the information of: instance of, country, and review score. Such a case indicates that the quality of applications using hotel data on Wikidata may suffer from data incompleteness depending on whether hotel rating information is required in the applications or not. One way to be better informed about completeness in KGs is by means of data validation. In 2017, W3C introduced a standard for validating KG data called Shapes Constraint Language (SHACL) <ref type="bibr" target="#b6">[7]</ref>. This allows the higher need for consumption of good quality data through a validation process. SHACL validates data by applying a set of conditions that are combined into a "shape" and expressed in an RDF graph. Nevertheless, to the best of our knowledge there has been no research that specifically and systematically prioritizes SHACL as a basis for checking the quality of KG data on the aspect of completeness.</p><p>Validating data completeness in a KG could be done by collecting similar cases of completeness to form a completeness pattern. Such a pattern can then be reused and adapted to various domains in different KGs. An ontology design pattern (ODP) is a modeling solution related to repeated problems in ontologies and KGs <ref type="bibr" target="#b7">[8]</ref>. Our study aims to fill the gap from previous studies in developing a pattern-oriented solution based on SHACL for checking KG completeness. Particularly, we focus on four issues that revolve around the problem of completeness patterns: (𝑖) identification of completeness patterns; (𝑖𝑖) instantiation of completeness patterns into SHACL shapes; (𝑖𝑖𝑖) development of a (Python) library to support our completeness validation process and a web app to provide information points; and (𝑖𝑣) evaluation of completeness over real-world KGs, that is, DBpedia and Wikidata.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Preliminaries</head><p>Data Completeness. Data quality provides an outlook to the fitness for use by data consumers <ref type="bibr" target="#b3">[4]</ref>. Poor data quality can pose substantial risks to decision making in organizations. Beyond accuracy, data quality aspects include relevancy, timeliness, &amp; completeness. Data completeness concerns the breadth, depth, and scope of data w.r.t. the task at hand <ref type="bibr" target="#b3">[4]</ref>.</p><p>In the context of KGs, the quality of completeness can be classified into seven types <ref type="bibr" target="#b4">[5]</ref>. The first two are schema completeness, which is the degree to which properties of classes are sufficiently represented, and property completeness, which concerns the completeness of property values (e.g., Albert Einstein was married twice). The third type is population completeness, which checks the coverage of real-world objects as stored in a KG. The fourth one is interlinking completeness, referring to the degree to which entities in different KGs are interlinked. The last three types are: currency completeness, examining how property values are well represented over time; metadata completeness, observing if sufficient metadata is available; and labeling completeness, which has to do with the existence of human-and machine-readable labels.</p><p>SHACL. Shapes Constraint Language (SHACL) is a W3C recommendation to describe constraints and validate KGs against those constraints <ref type="bibr" target="#b6">[7]</ref>. Such constraints are provided as SHACL shapes, which themselves can be written as an RDF graph <ref type="bibr" target="#b8">[9]</ref>. In SHACL, RDF graphs representing shapes are called "shapes graphs" and that these shapes graphs can then be employed to validate RDF graphs (= "data graphs"). SHACL offers some advantages in that SHACL is high-level, declarative, concise, and yet feature-rich <ref type="bibr" target="#b9">[10]</ref>.</p><p>SHACL comes in two flavors: SHACL Core and SHACL-SPARQL. The former captures features frequently needed for the representation of shapes and constraints, whereas the latter extends the former by adding advanced features of constraints based on SPARQL, a query language for RDF KGs. SHACL Core supports a variety of basic constraint components, such as value types, cardinalities, and string filters. On the other hand, SHACL-SPARQL provides more expressiveness due to the flexibility of SPARQL SELECT queries in defining SHACL constraints.</p><p>SHACL use cases include documentation, user interface generation, validation during RDF data production/consumption, and quality control <ref type="bibr" target="#b10">[11]</ref>. In the context of quality control, a typical scenario for SHACL usage is that first a domain expert expresses quality requirements in natural language and then a knowledge engineer translates these requirements into SHACL shapes, to be validated against KGs of interest. Figure <ref type="figure" target="#fig_2">2</ref> illustrates a SHACL shape for the constraint that all instances of type person must have a name and a birth date. Validating a KG against that constraint may uncover the quality issue as to whether there exist person instances in that KG without a name or a birth date.</p><p># t h e s e p r e f i x e s a r e u s e d t h r o u g h o u t t h i s p a p e r @ p r e f i x e x : &lt; h t t p : / / e x a m p l e . o r g / &gt; . @ p r e f i x s h : &lt; h t t p : / / w w w . w 3 . o r g / n s / s h a c l # &gt; . @ p r e f i x x s d : &lt; h t t p : / / w w w . w 3 . o r g / 2 0 0 1 / X M L S c h e m a # &gt; .  Patterns. Ontology design patterns (ODPs) or for simplicity, patterns, are a modeling solution to recurrent problems in ontology designs <ref type="bibr" target="#b11">[12,</ref><ref type="bibr" target="#b7">8]</ref>. Patterns capture problems commonly encountered in the development of ontologies &amp; KGs, and make more explicit and reusable best practices in solving such problems. Benefits of using patterns include easier ontology design processes by both knowledge engineers and domain experts, as well as increased quality &amp; interoperability of developed ontologies &amp; KGs <ref type="bibr" target="#b12">[13]</ref>. Several interesting ontologies and KG-based applications developed using pattern-based approaches are chess games <ref type="bibr" target="#b13">[14]</ref>, historic slave trade <ref type="bibr" target="#b14">[15]</ref>, and open data conversion to Wikidata <ref type="bibr" target="#b15">[16]</ref>.</p><p>Gangemi and Presutti <ref type="bibr" target="#b7">[8]</ref> present a classification of patterns for ontology design. Those six families of patterns are: structural patterns, correspondence patterns, content patterns, reasoning patterns, presentation patterns, and lexico-syntactic patterns. In this work, we focus on content patterns, which encode conceptual solutions for ontology &amp; KG (quality) modeling problems. In principle, content patterns do not depend on any specific language. Nevertheless, in our work we rely on SHACL as a reference formalism for representing reusable, practical building blocks of content patterns.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Completeness Patterns and Instantiations</head><p>This section introduces data completeness patterns developed based on SHACL and approaches to instantiating them.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Completeness Patterns</head><p>Issa et al. <ref type="bibr" target="#b4">[5]</ref> synthesized previous studies that discussed the completeness aspect of KGs. Of the seven completeness types proposed, we use five types as completeness patterns. We also add one additional completeness pattern which we will see below. Each completeness pattern may have several SHACL patterns, that is, patterns that generalize from the SHACL syntax <ref type="bibr" target="#b6">[7]</ref> with the addition of %% V A R I A B L E %% that later can be instantiated into SHACL shapes. Table <ref type="table" target="#tab_0">1</ref> shows eight SHACL patterns according to the six completeness patterns.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Schema completeness pattern (SC).</head><p>A completeness pattern about the extent to which entities in a KG have the required properties for a class, such as entities of the class "Human" must have the properties (among others) of name and date of birth. The definition of mandatory properties in the same class may differ depending on the specific needs for the task at hand.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Property completeness pattern (PC).</head><p>A completeness pattern related to the degree to which property values of a specific property in a particular entity are present. For example, the entity Joe Biden must have four values for a "child" property. This completeness only focuses on one property with varying cardinalities among entities. A population is complete whenever the number of its members equals C O U N T . The use of s h : i n v e r s e P a t h makes it possible to count the subjects of  This pattern extends that of IC1 by adding a requirement that the linked entity must come from a specific N A M E S P A C E -U R I (e.g., http: //dbpedia.org/resource/).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>This pattern extends the pattern LDC1 by adding the</head><formula xml:id="formula_0">L A N G U A G E re- quirement for the given L A B E L -O R - D E S C R I P T I O N -P R O P E R T Y .</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>No-value completeness pattern (NVC).</head><p>A completeness pattern that deals with to which entities in a KG capture information about the absence of a property value. In this context, property values might be "unavailable" due to two possible reasons: the property values do not exist in the real world, or they are intentionally removed for privacy or confidentiality reasons. For example, the entity Keanu Reeves has no children. To be more precise, this pattern is a special case of the property completeness pattern. Nevertheless, the no-value completeness pattern deserves a special attention due to its peculiarity in that shapes instantiated from the no-value completeness pattern may represent negative facts (i.e., the non-existence) <ref type="bibr" target="#b16">[17]</ref> and any KG would trivially satisfy the shape.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Population completeness pattern (POC).</head><p>A completeness pattern related to the extent to which entities in the real world are present as members (or instances) of a population. For example, all provincial entities in Indonesia are members of the class "Provinces in Indonesia". In such a pattern, the membership of a population is usually expressed through typing properties like r d f : t y p e , d c t : s u b j e c t , d b o : t y p e (for DBpedia), and w d t : P 3 1 (for Wikidata).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Label &amp; description completeness pattern (LDC).</head><p>A completeness pattern related to the degree to which entities in a KG have human-readable labels and descriptions. One of the most commonly used label properties is r d f s : l a b e l . An alternative property is s k o s : p r e f L a b e l .</p><p>In addition, the language aspect of the label can be considered in this pattern. Apart from the label, the pattern also concerns description properties, such as </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Interlinking completeness pattern (IC).</head><p>A completeness pattern concerning the degree to which the same entity links to each other in various KGs. For example, the entity "Indonesia" on DBpedia is linked to the same entity on Wikidata. One of the commonly used properties to define such linking is o w l : s a m e A s . Other general properties with the same intention are s c h e m a : s a m e A s and s k o s : e x a c t M a t c h . Furthermore, there are also properties that specifically link to particular external authorities, such as ISBN, DOI, and VIAF. It is worth noting that the pattern IC1 in Table <ref type="table" target="#tab_0">1</ref> would only check the existence of interlinking properties regardless of the number of property values. We argue that attempting to impose some value counting on interlinking properties would be tricky to do since there can be an arbitrary number of KGs which may contain the entities to be linked. However, should one want to check the existence of interlinking properties to entities in some specific KG, the pattern IC2 which features namespace qualifiers can be utilized.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Completeness Pattern Instantiations</head><p>Given the above completeness patterns, a question arises as to how one may instantiate those patterns. We identify a number of such approaches: manual, spreadsheet, automatic, ontology, and statistics. We also list the advantages and disadvantages of each approach in Table <ref type="table" target="#tab_3">2</ref>.</p><p>Manual. In this approach, a user instantiates a completeness pattern by hand. For example, with respect to the pattern SC1, the user will selectively choose the properties of a class that must exist. Afterwards, the selected properties will be substituted into the property variables of the SHACL pattern.</p><p>Spreadsheet. This approach gathers completeness information (e.g., number of children and number of authors) into a spreadsheet. Next, a program reads the spreadsheet and generates the corresponding SHACL shapes based on the collected completeness information. Here we use this approach to instantiate the property completeness pattern.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Automatic.</head><p>The automatic approach relies on data from KGs themselves to provide completeness information. Wikidata, for example, has the information of "properties for this type" (P1963) which lists properties that normally apply for a class. Furthermore, Wikidata also owns counting properties like "number of children" (P1971) which may give hints on the cardinality of property values (in this case, the property "child"). External APIs may also be leveraged to provide completeness information, such as Crossref for the number of complete authors of a scholarly article.<ref type="foot" target="#foot_0">1</ref> </p><p>Ontology. One may also instantiate completeness patterns by utilizing the ontology structure of a KG, such as using r d f s : d o m a i n . <ref type="foot" target="#foot_1">2</ref> Such ontological information provides properties that typically apply for a class. More specifically, referring to the statement of P r d f s : d o m a i n C , we can assume that property P associates with instances of C .</p><p>Statistics. Statistical information may be useful for instantiating completeness patterns. Over a class, one may list the frequency of property usage in a class, and then take the top-N most frequent properties as the mandatory properties of the class. It is time consuming and is not as flexible nor as detailed as the manual approach.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Automatic</head><p>Gives the benefit of quick and easy generations over an abundance of completeness instantiations (of some form).</p><p>Requires quality checking over generated instantiations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Ontology</head><p>Uses terminological knowledge from ontologies made by domain experts, thus enabling the reuse of existing ontologies.</p><p>The quality of the results depends on the source of ontology being used and that a poor quality ontology would tend to produce a lower quality of instantiations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Statistics</head><p>Data driven and does not require domain understanding.</p><p>Manual checking is still required to ensure that the captured instantiations make sense.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">SoCK Library and Web App</head><p>In this section, we present our SoCK library for completeness instatiation and validation, as well as our SoCK web app as an information point for the SoCK framework.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">SoCK Library</head><p>The SoCK library provides a Python-based implementation of instantiating and validating completeness patterns. <ref type="foot" target="#foot_2">3</ref> The library supports the pattern instantiation process as discussed in Subsection 3.2. We use the library to create SHACL shapes based on the aforementioned completeness patterns for real-world KGs like DBpedia and Wikidata. The SoCK library can also be used to validate the completeness of KGs based on the instantiated patterns. The validation process requires both the data graph and (completeness) shapes graph as the input. Thus, the first step is SPARQL querying to get the corresponding data of relevant entities and properties. Here we rely on SPARQLWrapper<ref type="foot" target="#foot_3">4</ref> for querying and RDFLib<ref type="foot" target="#foot_4">5</ref> for RDF data creation and manipulation. The library currently does not support validation over remote data graphs and hence, the data graphs to be validated have to be available locally in the same machine where the library resides. Next, the collected data graph is validated against the (completeness) shapes graph from the instantiation process as described before. The validation results in validation reports, informing the completeness profiles of KG entities. We reuse PySHACL <ref type="foot" target="#foot_5">6</ref> to support the validation step. The whole flow of validation process is displayed in Figure <ref type="figure" target="#fig_4">3</ref>. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">SoCK Web App</head><p>The SoCK web app is developed to serve as an information point about the SoCK framework. The web app is available at https://sock.cs.ui.ac.id, including a demo video at https://youtu.be/ FtJ3HO_6YHcq. This web app offers features for users in learning about completeness patterns and creating their instances or shapes. For example, an instance of the label &amp; description pattern is the SHACL shape of "American Film". <ref type="foot" target="#foot_6">7</ref> Another example is an instance of the population completeness pattern, about "G20 Nations". <ref type="foot" target="#foot_7">8</ref> Further pattern instances can be viewed at https://sock.cs.ui.ac.id/instance/?page=1.</p><p>The SoCK web app uses the client-server architecture. In particular, it adapts a three-tier architecture that separates the application into three logical layers: presentation, application, and database. The presentation relies on basic web stack (i.e., HTML, CSS, and JavaScript), whereas the database makes use of SQLite. In developing the application, we employ the Django framework.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Case Studies: Wikidata and DBpedia</head><p>In this section, we discuss the evaluation results of our SoCK framework to real-world KGs, that is, Wikidata and DBpedia. We create SHACL shapes out of our presented completeness patterns and validate those shapes to the KGs. Overall, we successfully validate 928,310 entities from Wikidata and DBpedia with 360,162 completeness pattern instances (= shapes). Further evaluation discussion for each completeness pattern is provided as follows.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Schema Completeness Validation.</head><p>Here we instantiate the pattern SC1 via three approaches, that is, automatic, ontology, and statistics, collecting in total 1,106 SHACL shapes. Through the automatic approach using "properties for this type" (P1963), we successfully validate 469,891 Wikidata entities using 1,095 pattern instances. In the validation, we record for each entity the completeness percentage of properties listed in "properties for this type". The validation results are as follows: 35% entities are complete for 0-19% of properties, 17% entities for 20-39% of properties, 17% entities for 40-59% of properties, 11% entities for 60-79% of properties, and 20% entities for 80-100% of properties.</p><p>Next, through the ontology approach, we validate 5,000 DBpedia sample entities using five pattern instances from the class of "Hotel", "Magazine", "Museum", "University", and "Person". The validation results show that the completeness of DBpedia entities from those classes are still quite low in that 78% of all entities have no more than 10% of the corresponding class properties from the ontology approach.</p><p>As for the last schema completeness experiment, we validate 6,000 DBpedia sample entities from the class of "Country", "Person", "Activity", "Film", "Museum", and "University" using six pattern instances generated with statistics approach, where we take top-10 most frequent properties from each class. The result indicates that entities on DBpedia with property selection using a statistical approach have a good average completeness value in that almost 70% of the entities are complete for more than 65% of class properties.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Property Completeness Validation.</head><p>In this validation scenario, we make 357,892 shapes of the pattern PC1 to validate the property completeness of entities. On the automatic approach, we validate 357,749 Wikidata sample entities for the completeness of the number of authors from "Scholarly Article" (Q13442814), the number of children from "Human" (Q5), the number of episodes from "Television Series Season" (Q3464665), the number of seasons from "Television Series" (Q5398426), and the number of participants from "Sports Season" (Q27020041). Then, on the spreadsheet approach, we gather completeness information about the number of children of Indonesian actors, the number of cities from Indonesian provinces, and so on. The validation results from both approaches show a relatively good completeness value, ranging from 60-70%.</p><p>No-Value Completeness Validation. We conduct experiments using this pattern through an automatic approach, instantiating the pattern NVC1 based on Wikidata. Here we take all entities having no children, as stated by the value of zero (0) for the property of "number of children" (P1971). There are in total 1,277 SHACL shapes generated for no-value completeness. The validation results are trivially 100% complete by the definition of no-value completeness.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Population Completeness Validation.</head><p>We create eleven population completeness pattern instances with the code POC1. We examine on DBpedia the populations of cantons of Switzerland, continents, countries in Africa and Europe, G20 nations, Indonesian active volcanoes, Indonesian legislative election events, Indonesian provinces, NATO nations, oceans, and Summer Olympics events. Based on the experiment results, DBpedia includes in total 92.8% of the entities from all the populations above. This shows that DBpedia has covered almost all the population entities tested.</p><p>Label &amp; Description Completeness Validation. Using nine label &amp; description pattern instances, we conduct validation experiments involving 69,045 entities on DBpedia and Wikidata. On DBpedia, we validate 5,000 sample entities from the class of mountain, politician, country, disease, and musical artist, for the existence of a basic label &amp; description property according to the pattern of the code LDC1. Based on the validation results, more than 90% of the entities have a complete label and description property, such as r d f s : l a b e l (99.96%), r d f s : c o m m e n t (94.96%), and d b o : a b s t r a c t (94.96%). As for Wikidata, we validate 64,045 entities using instantiations of the pattern with the code LDC02 to check whether the entities have a label and description property with a proper language tag. We examine the entities from the class "National Hero of Indonesia", "South Korean Music Group", "American Film", and "Japanese Manga". The validation results show that over 98.72% of entities have a r d f s : l a b e l and 93.80% of entities have s c h e m a : d e s c r i p t i o n with a proper language tag. However, only 28.41% of the entities capture the property of alias (skos:altLabel) so it indicates that the s k o s : a l t L a b e l property is rather incomplete. One possible reason is there is sometimes no alias for entities of interest.</p><p>We also compare the label &amp; description completeness of DBpedia entities that have equivalent Wikidata entities through o w l : s a m e A s . This way, we can have a fair comparison between DBpedia and Wikidata. The validation checks whether entities from the class of country, mountain, film, hotel, and song, have a label &amp; description in English according to the pattern code LDC2. Based on the validation results of 5,000 entities, DBpedia entities have completeness of 99.86% for the label property (rdfs:label) and 99.06% for the description property (rdfs:comment). Meanwhile, the corresponding Wikidata entities have completeness of 99.5% for the label property (rdfs:label) and 95.04% for the description property (schema:description). Note that Wikidata relies on s c h e m a : d e s c r i p t i o n for describing entities instead of r d f s : c o m m e n t . From the above results, we can say that DBpedia entities have slightly higher label &amp; description completeness than those of Wikidata.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Interlinking Completeness Validation.</head><p>We experiment with interlinking completeness patterns, creating ten completeness pattern instances (i.e., five with the code IC1 and five with IC2), to validate 9,194 sample entities of DBpedia and Wikidata. On DBpedia, we check entities from the country, actor, island, museum, and hotel classes. The validation result shows that the completeness of the o w l : s a m e A s property is the highest with 95.24% DBpedia entities linked to Wikidata and 72.36% linked to YAGO. Meanwhile, the completeness of the s c h e m a : s a m e A s property is only about 20.80%, and even smaller for s k o s : e x a c t M a t c h , which is only 1.40%. This shows that most entities on DBpedia are not yet complete in covering external entities through s c h e m a : s a m e A s and s k o s : e x a c t M a t c h properties.</p><p>On Wikidata, we examine the existence of external ID properties for entities of the class country, hotel, film, singer-songwriter, and university. The validation result shows that interlinking completeness varies: countries are mostly complete for VIAF ID (96%) and GND ID (96%); hotels are somewhat incomplete -reaching the highest of only 25% for Agoda hotel ID;<ref type="foot" target="#foot_8">9</ref> films are the most complete for IMDb ID (99%); singers &amp; songwriters are the most complete for VIAF ID (97%); and universities are also the most complete for VIAF ID (86%).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Related Work</head><p>A number of research studies have investigated the problem of describing and checking KG completeness. Prasojo et al. <ref type="bibr" target="#b17">[18]</ref> introduce SP-statements, in the form of (𝑠, 𝑝), declaring that the entity 𝑠 is complete for all values of the property 𝑝 for the KG of interest. Such SP-statements are practically relevant for entity-centric KGs, such as DBpedia and Wikidata. A more generic approach to describe completeness is provided in <ref type="bibr" target="#b18">[19,</ref><ref type="bibr" target="#b19">20]</ref> in that more expressive completeness statements based on arbitrary basic graph patterns (BGPs), including their consequences to query completeness, are well-studied.</p><p>In <ref type="bibr" target="#b20">[21]</ref>, Wisesa et al. develop a framework for completeness profiling. The framework analyzes the completeness of a KG based on the Class-Facet-Attribute (CFA) profiles. A class represents a set of entities with a number of facets, which allow attribute completeness to be analyzed. The framework can be used, for example, to answer the question: how does the completeness of the attributes "residence" and "eye color" fare for humans with the occupation of actor?</p><p>Now, we would like to report on several use cases of SHACL in data quality. Hammond et al. <ref type="bibr" target="#b21">[22]</ref> discuss how SHACL can be leveraged to supervise the quality of ETL pipelines from various data sources to build the Springer Nature SciGraph. The graph describes the Springer Nature publishing world, containing data about sponsors, research projects, conferences, publications, and affiliations from across the research landscape. Cimmino et al. <ref type="bibr" target="#b22">[23]</ref> introduce an approach to generate SHACL shapes automatically from OWL ontologies. To this end, they provide two resources: (𝑖) Astrea-KG, a KG for mappings between ontology constraint patterns and SHACL constraint patterns; and (𝑖𝑖) Astrea, a tool implementing the mappings from the Astrea-KG for concrete ontologies. The tool has been experimented over various real-world ontologies to create automatically SHACL restrictions, such as s h : d a t a t y p e and s h : m i n I n c l u s i v e , in order to maintain KG quality. Pandit et al. <ref type="bibr" target="#b23">[24]</ref> envision reusing ODP axioms to define SHACL shapes. They provide a simple case study in the context of validating the data quality of microblogs (e.g., Twitter posts). In that study, constraints and relationships within the ODP, defined using RDFS subclasses and OWL cardinality restrictions, are mapped to the corresponding SHACL shapes using s h : c l a s s and s h : q u a l i f i e d ( M a x / M i n ) C o u n t conditions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">Conclusions</head><p>Through this paper, we present SoCK (SHACL on Completeness Knowledge), a pattern-oriented framework for describing and validating the completeness of KGs. We propose six completeness patterns based on general data quality problems related to completeness, namely those patterns of schema completeness, property completeness, no-value completeness, population completeness, label &amp; description completeness, and interlinking completeness. From those patterns,</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Data on Wikidata about (a) Beverly Hills Hotel and (b) Hotel Indonesia Kempinski Jakarta</figDesc><graphic coords="2,297.64,84.19,208.35,201.19" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head></head><label></label><figDesc>e x : P e r s o n S h a p e a s h : N o d e S h a p e ; s h : t a r g e t C l a s s e x : P e r s o n ; s h : p r o p e r t y [ a s h : P r o p e r t y S h a p e ; s h : p a t h e x : n a m e ; s h : m i n C o u n t 1 ] ; s h : p r o p e r t y [ a s h : P r o p e r t y S h a p e ; s h : p a t h e x : b i r t h D a t e ; s h : m i n C o u n t 1 ] .</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: A SHACL shape in Turtle for "every person must have a name and a birth date"</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head></head><label></label><figDesc>r d f s : c o m m e n t , s c h e m a : d e s c r i p t i o n , and d b o : a b s t r a c t (for DBpedia).</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Validation flow in SoCK</figDesc><graphic coords="9,141.38,204.54,312.52,72.12" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 :</head><label>1</label><figDesc>List of SHACL patterns based on completeness patterns</figDesc><table><row><cell>Code</cell><cell>SHACL Pattern</cell><cell>Description</cell></row><row><cell>SC1</cell><cell>%% SHAPE-NAME %%</cell><cell>All members of a class are re-</cell></row><row><cell></cell><cell>a sh:NodeShape;</cell><cell>quired to have the mandatory</cell></row><row><cell></cell><cell>sh:targetClass %% CLASS %%; sh:property [ a sh:PropertyShape; sh:path %% PROPERTY-01 %%;</cell><cell>properties from P R O P E R T Y -0 1 to P R O P E R T Y -N N . A SHACL shape in-</cell></row><row><cell></cell><cell>sh:minCount 1 ];</cell><cell>stantiating such a pattern is ex-</cell></row><row><cell></cell><cell>sh:property [ a sh:PropertyShape;</cell><cell>emplified in Figure 2.</cell></row><row><cell></cell><cell>sh:path %% PROPERTY-NN %%;</cell><cell></cell></row><row><cell></cell><cell>sh:minCount 1 ].</cell><cell></cell></row><row><cell>PC1</cell><cell>%% SHAPE-NAME %%</cell><cell>An entity (= node) is complete</cell></row><row><cell></cell><cell>a sh:NodeShape;</cell><cell>for P R O P E R T Y whenever the num-</cell></row><row><cell></cell><cell>sh:targetNode %% NODE %%; sh:property [ a sh:PropertyShape; sh:path %% PROPERTY %%;</cell><cell>ber of the property values equals C O U N T .</cell></row><row><cell></cell><cell>sh:minCount %% COUNT %% ].</cell><cell></cell></row></table><note>NVC1 %% SHAPE-NAME %% a sh:NodeShape; sh:targetNode %% NODE %%; sh:property [ a sh:PropertyShape; sh:path %% PROPERTY %%; sh:minCount 0 ]. An entity (= node) is complete for P R O P E R T Y despite the absence of property values. POC1 %% SHAPE-NAME %% a sh:NodeShape; sh:targetNode %% NODE %%; sh:property [ a sh:PropertyShape; sh:path [ sh:inversePath %% TYPE-PROPERTY %% ]; sh:minCount %% COUNT %% ].</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 2 :</head><label>2</label><figDesc>Advantages and Disadvantages of Various Completeness Instantiation Approaches</figDesc><table><row><cell>Approach</cell><cell>Advantages</cell><cell>Disadvantages</cell></row><row><cell>Manual</cell><cell>Enables a more detailed and cus-</cell><cell>Manual checking can be time con-</cell></row><row><cell></cell><cell>tomized making of shapes thus re-</cell><cell>suming, requires prior knowledge</cell></row><row><cell></cell><cell>sulting in a completeness evaluation</cell><cell>about SHACL, and is prone to hu-</cell></row><row><cell></cell><cell>with high precision and quality.</cell><cell>man errors.</cell></row><row><cell cols="2">Spreadsheet Enables easier data collection with</cell><cell></cell></row><row><cell></cell><cell>high quality.</cell><cell></cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://www.crossref.org/documentation/retrieve-metadata/rest-api/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">https://www.w3.org/TR/rdf-schema/#ch_domain</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">Our library is available at https://github.com/JillyCS15/sock-validator</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">https://sparqlwrapper.readthedocs.io/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_4">https://rdflib.readthedocs.io/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_5">https://pypi.org/project/pyshacl/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_6">https://sock.cs.ui.ac.id/instance/show-pattern-instance-detail/12/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_7">https://sock.cs.ui.ac.id/instance/show-pattern-instance-detail/5/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="9" xml:id="foot_8">Out of the external ID properties of VIAF ID, Booking.com numeric ID, Agoda hotel ID, Hotels.com hotel ID, and TripAdvisor ID</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>we manage to create over 360k instances (i.e., SHACL shapes) and validate nearly 1 million Wikidata and DBpedia entities using our SoCK Python library (https://github.com/JillyCS15/ sock-validator). To disseminate our framework, we develop a SoCK web app, accessible via https://sock.cs.ui.ac.id, which provides a catalog of completeness patterns as well as instances and use cases.</p><p>There are a few things that can be done to improve the SoCK framework. Currently, five patterns are formed based on <ref type="bibr" target="#b4">[5]</ref>. The other two patterns, namely, currency completeness and metadata completeness, are not yet investigated in this work due to the time limitations, and are left for future research. At the moment, our evaluation is limited to Wikidata and DBpedia entities. Using entities in other KGs, such as YAGO and KBpedia, or even enterprise KGs, may help further explore the general quality of KGs. The current version of our work only concerns the syntactic level of RDF graphs and not yet the semantic level. Investigating the consequences in incorporating the semantics (and inferences) of RDFS and OWL would be relevant for future directions. Working on the semantic level would also be potential in better generalizing our SHACL-based approach and aligning SHACL &amp; ODPs for describing and validating KG completeness. Last but not least, it is also of our interest to analyze the completeness of ontologies, that is, how complete is an ontology in capturing some domain of interest (i.e., whether the classes and properties are sufficient enough).</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Evaluating the quality of the LOD cloud: An empirical investigation</title>
		<author>
			<persName><forename type="first">J</forename><surname>Debattista</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Lange</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Auer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Cortis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Semantic Web</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Data on the Web Best Practices</title>
		<ptr target="https://www.w3.org/TR/dwbp/" />
	</analytic>
	<monogr>
		<title level="m">W3C Recommendation</title>
				<editor>
			<persName><forename type="first">B</forename><forename type="middle">F</forename><surname>Lóscio</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Burle</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">N</forename><surname>Calegari</surname></persName>
		</editor>
		<imprint>
			<date type="published" when="2017-01-31">31 January 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Quality assessment for linked data: A survey</title>
		<author>
			<persName><forename type="first">A</forename><surname>Zaveri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Rula</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Maurino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Pietrobon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Lehmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Auer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Semantic Web</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Beyond accuracy: What data quality means to data consumers</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">Y</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Strong</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J. Manag. Inf. Syst</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<date type="published" when="1996">1996</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Knowledge graph completeness: A systematic literature review</title>
		<author>
			<persName><forename type="first">S</forename><surname>Issa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Adekunle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Hamdi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">S</forename><surname>Cherfi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dumontier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Zaveri</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Non-parametric Class Completeness Estimators for Collaborative Knowledge Graphs -The Case of Wikidata</title>
		<author>
			<persName><forename type="first">M</forename><surname>Luggen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">E</forename><surname>Difallah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Sarasua</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Demartini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Cudré-Mauroux</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ISWC</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">Shapes Constraint Language (SHACL), W3C Recommendation</title>
		<ptr target="https://www.w3.org/TR/shacl/" />
		<editor>H. Knublauch, D. Kontokostas</editor>
		<imprint>
			<date type="published" when="2017-07-20">20 July 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Ontology design patterns</title>
		<author>
			<persName><forename type="first">A</forename><surname>Gangemi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Presutti</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Handbook on Ontologies, International Handbooks on Information Systems</title>
				<editor>
			<persName><forename type="first">S</forename><surname>Staab</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><surname>Studer</surname></persName>
		</editor>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">RDF Primer, W3C Recommendation</title>
		<ptr target="http://www.w3.org/TR/rdf-primer/" />
		<editor>F. Manola, E. Miller</editor>
		<imprint>
			<date type="published" when="2004-02-10">10 February 2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">SHACL Use Cases and Requirements, W3C Working Group Note</title>
		<ptr target="https://www.w3.org/TR/shacl-ucr/" />
		<editor>S. Steyskal, K. Coyle</editor>
		<imprint>
			<date type="published" when="2017-07-20">20 July 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Validating RDF Data</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">E L</forename><surname>Gayo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Prud'hommeaux</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Boneva</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Kontokostas</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Synthesis Lectures on the Semantic Web: Theory and Technology</title>
				<imprint>
			<publisher>Morgan &amp; Claypool Publishers</publisher>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Ontology Design Patterns for Semantic Web Content</title>
		<author>
			<persName><forename type="first">A</forename><surname>Gangemi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ISWC</title>
				<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Ontology Design Patterns in WebProtege</title>
		<author>
			<persName><forename type="first">K</forename><surname>Hammar</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ISWC Posters &amp; Demos</title>
				<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<title level="m" type="main">An ontology design pattern for chess games</title>
		<author>
			<persName><forename type="first">A</forename><surname>Krisnadhi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Rodríguez-Doncel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Hitzler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Cheatham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Karima</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Amini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Coleman</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2015">2015</date>
			<publisher>WOP</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">The enslaved ontology: Peoples of the historic slave trade</title>
		<author>
			<persName><forename type="first">C</forename><surname>Shimizu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Hitzler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Hirt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Rehberger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">G</forename><surname>Estrecha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Foley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Sheill</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Hawthorne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Mixter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Watrall</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Carty</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Tarr</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">JWS</title>
		<imprint>
			<biblScope unit="volume">63</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Faiz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">M F</forename><surname>Wisesa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">A</forename><surname>Krisnadhi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Darari</surname></persName>
		</author>
		<title level="m">OD2WD: From Open Data to Wikidata through Patterns</title>
				<imprint>
			<publisher>WOP</publisher>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Representing and querying negative knowledge in RDF</title>
		<author>
			<persName><forename type="first">F</forename><surname>Darari</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ESWC Posters and Demos</title>
				<imprint>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Managing and Consuming Completeness Information for Wikidata Using COOL-WD</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">E</forename><surname>Prasojo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Darari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Razniewski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Nutt</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">COLD@ISWC</title>
				<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Completeness statements about RDF data sources and their use for query answering</title>
		<author>
			<persName><forename type="first">F</forename><surname>Darari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Nutt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Pirrò</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Razniewski</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ISWC</title>
				<imprint>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title level="m" type="main">Enabling fine-grained RDF data completeness assessment</title>
		<author>
			<persName><forename type="first">F</forename><surname>Darari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Razniewski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">E</forename><surname>Prasojo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Nutt</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2016">2016</date>
			<publisher>ICWE</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<title level="m" type="main">Wikidata Completeness Profiling Using ProWD</title>
		<author>
			<persName><forename type="first">A</forename><surname>Wisesa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Darari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Krisnadhi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Nutt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Razniewski</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2019">2019</date>
			<publisher>K-CAP</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Data integration and disintegration: Managing Springer Nature SciGraph with SHACL and OWL</title>
		<author>
			<persName><forename type="first">T</forename><surname>Hammond</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Pasin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Theodoridis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ISWC Posters &amp; Demos</title>
				<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<monogr>
		<title level="m" type="main">Astrea: Automatic Generation of SHACL Shapes from Ontologies</title>
		<author>
			<persName><forename type="first">A</forename><surname>Cimmino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Fernández-Izquierdo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>García-Castro</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2020">2020</date>
			<publisher>ESWC</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<title level="m" type="main">Using ontology design patterns to define SHACL shapes</title>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">J</forename><surname>Pandit</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>O'sullivan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lewis</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2018">2018</date>
			<publisher>WOP</publisher>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
