1. Introduction

Jindřich Mynarz

mynarzjindrich@gmail.com 0 1 2

Kateřina Haniková

0 1 2

Vojtěch Svátek

0 1 2 0 Department of Information and Knowledge Engineering, Prague University of Economics and Business 1 KGCW'23: 4th International Workshop on Knowledge Graph Construction 2 Nám. W. Churchilla 4 , 130 67, Praha 3 , Czech Republic

We propose how to construct knowledge graphs using a method based on user requirements formulated as competency questions. It suggests to formalize the competency questions as SPARQL queries wrapped as SHACL constraints allowing them to be used as automated tests. The method defines a process to guide knowledge graph construction by feedback from tests. It aims to reduce the engineering efort required to construct a knowledge graph that meets the requirements, while assessing the quality of semantic artifacts produced in this efort. We intend the method to provide a solid engineering basis for knowledge graph construction that uses the lessons learnt from software development and is built on open standards. We demonstrate using the method to construct a knowledge graph about antigen covid tests and reflect on the proposed method.

testing knowledge graphs competency questions

1. Introduction

Knowledge graphs use graph-based data models to capture knowledge commonly combined from large and diverse data sources [ 1 ]. Constructing knowledge graphs can be an intricate and openended task that is in many ways an art rather than a craft. 1 While there are resources covering how to build knowledge graphs, there is little on what to build: how to elicit, formulate, and validate requirements for a knowledge graph. Overall, there is a shortage of solid engineering practices guiding through this task.

We propose how to construct knowledge graphs using a standards-based method founded upon user requirements formulated as competency questions (CQs). It suggests to formalize the CQs as SPARQL queries [ 3 ] and wrap them as SHACL constraints [ 4 ] in order to allow them to be executed as automated tests. The method defines a process guiding knowledge graph construction that is based on feedback from tests. It aims to reduce the efort required to construct a knowledge graph that meets its requirements and passes quality assessment of semantic artifacts, such as ontologies, that are made in the process. CEUR Workshop Proceedings 1The same argument was made about designing ontologies by Soldatova et al. [ 2 ].

2. Motivation

Following a method for knowledge graph construction ofers several benefits. In general, a method helps decide what to do next. In particular, it helps to overcome the blank canvas paralysis at the beginning and jump-start the work. Moreover, a method breaks down the efort required for knowledge graph construction into sub-tasks, which can help organize and coordinate a team working on them.

The direction provided by a method anchored in user requirements can reduce the unfocused upfront efort and help avoid over-engineering. For example, it discourages needless efort spent on achieving a lossless transformation that preserves all source data in the produced knowledge graph even though it might not be needed. It can help avoid premature abstraction and premature optimization, such as for performance or readability. Thanks to the tests checking if the user requirements are satisfied, we get an early proof of value instead of speculating about it. Additionally, the explicit links between requirements and tests allow to assess the requirements traceability and coverage.

3. Related Work

When outlining this method, we can learn from ontology engineering, as it has a head start of several decades on knowledge graphs. There is a long tradition of using competency questions as requirements for ontologies [ 5 ]. Much of this experience can be reused, since ontologies serve as essential building blocks of knowledge graphs imbuing them with explicit semantics. Knowledge graphs in turn can be considered as ontologies populated with data. Creating tests for ontologies out of CQs is also nothing new [ 6, 7 ]. More recently, it was adopted for knowledge graphs too [ 8 ].

Broader still, we can adopt the practices established in software engineering. The hereby presented method borrows from the test-driven development cycle [ 9 ], which is commonly characterized by the following steps: • Add a test • Run all tests, expecting the new test to fail • Write the simplest code that passes the new test • All tests should pass • Refactor as needed

A fundamental feature of this cycle is that it intertwines development with testing, so that tests exercise the developed artifacts and provide feedback informing further work. Tests provide a controlled way to evolve the artifacts under construction in response to changing requirements, such as when their scope is extended or when they are refined iteratively. The field of ontology engineering is already adopting agile development approaches using tests. Ontology-specific methods, such as SAMOD [ 10 ] or Linked Open Terms [ 11 ], are being proposed.

In summary, the hereby described method borrows approaches either from ontology engineering or software development in general and proposes a novel way how to combine them for knowledge graph construction. The following text characterizes the method and demonstrates its use for building a knowledge graph about antigen covid tests.

User requirements Competency questions SPARQL queries formulated as translate to SHACL

constraints Knowledge graph validate executed by induced from extracted from

Ontology

terms uses combine expressed by

Source

data is input to

Transformation to RDF produces 4. Method

We present a method for test-driven knowledge graph construction. The method proposes a sequence of steps and feedback loops between them. Each step produces or consumes one or more artifacts, such as ontologies or SPARQL queries. An overview of the relations between the key artifacts used by the method is depicted in Figure 1. The method involves the following steps: 1. Start by using knowledge elicitation techniques [ 12 ] to gather user requirements. User requirements can be gathered from subject-matter experts or prospective users of the knowledge graph. Formulate the user requirements as CQs [ 5 ]. CQs are analytical questions that the target knowledge graph is expected to answer. In order to avoid misinterpretation, they should be reviewed by those who expressed the requirements the CQs capture. 2. Analyze the CQs to extract terms and relations. CQs can be analyzed using the linguistic notion of presupposition, which can be defined as ”a condition that must be met for a linguistic expression to have a denotation” [ 6 ]. Viewed this way, CQs presuppose the terms with which they can be expressed. The terms to extract from CQs are typically nouns or noun phrases that refer to entities from the domain of the knowledge graph under construction. They can be considered lexical surface forms of these entities. The terms can be identified manually or with the help of tools, such as part-of-speech tagging. The relations to extract might be represented by verbs connecting the terms co-occurring in a CQ. Less direct mappings between entities and surface forms are possible too, such as when a surface form provides the context disambiguating an entity. 3. Formalize the terms and relations in a minimum viable ontology using RDF Schema [ 13 ].

While terms typically map to classes (i.e. instances of r d f s : C l a s s ), relations map either to properties (i.e. instances of r d f : P r o p e r t y ) or n-ary relations represented by classes. Document the ontology with definitions sourced from and validated by subject-matter experts to create a shared understanding. The ontology should make minimal ontological commitment to aid its evolution, so limit its upfront axiomatization, such as by using OWL restrictions. The resulting ontology shall serve as an explicit and machine-readable conceptualization of the knowledge graph’s domain. Formalizing the ontology may in turn reveal ambiguity in the CQs. Ambiguous CQs cannot be formalized reliably since their misinterpretations may occur. In such cases, the sources of ambiguity should be revised with subject-matter experts while aiming to reformulate the CQs in an unambiguous way. 4. Translate the CQs to SPARQL queries expressed by the ontology. SPARQL queries make the CQs executable. Here, we expect a manual translation of the CQs to SPARQL queries. Automated translation was attempted by others, such as in [ 14 ]. Start with syntactically valid SPARQL queries of hypothetical data expressed by the ontology. A CQ may translate to a SPARQL graph pattern composed of triple patterns joining the terms co-occurring in the question via specific relations. The way CQs should be translated to SPARQL queries largely depends on their expected answers. Ren et al. [ 6 ] suggest that we might be ”more interested in checking if a CQ can be meaningfully answered, instead of directly answering a CQ.” When using such existential quantification of the desired answers, a SPARQL query formalizing a CQ can be expected to return non-empty results. What it means is that the query can answer a CQ using a given knowledge graph. It does not verify if the answer is correct. Conversely, some CQs may describe universal invariants of their domain. Queries encoding these invariants are akin to property-based testing [15]. In case stricter guarantees are needed, the expected correct answers can be included in the queries and validation constraints in SHACL implementing the CQs, corresponding to the usual example-based testing. 5. Wrap the queries as SPARQL-based SHACL constraints to automate their execution.

SHACL defines a target 2 of each constraint. In our context, a target sets the scope of a CQ. The scope of the data graph validated by a given constraint may either encompass the entire knowledge graph or cover its subset. For instance, existentially quantified queries checking the complete data graph can target it by using s h : t a r g e t N o d e [ ] . It is also possible to rewrite a part of a SPARQL query translating a CQ as target selection in SHACL. Alternatively, the prerequisite part of the query can be represented as a SPARQLbased target [16]. An entity-centric partitioning of a given knowledge graph can be implemented by extracting concise bounded descriptions [17] of the targeted entities. Some data sources of knowledge graphs may natively partition data to independent subsets, such as API responses or messages from a queue, that can be efectively validated by specific subsets of SHACL constraints. SHACL can also define the passing criteria of SPARQL-based constraints. An s h : a s k query is expected to return the boolean t r u e , while

2https://www.w3.org/TR/shacl/#targets

any results produced by an s h : s e l e c t query are treated as violations. Specific expected results can be either hard-coded into the queries or specified via s h : h a s V a l u e in case a single value is expected. Note however, that when including the expected results in SPARQL-based constraints, we may run into the limitations of SHACL due to pre-binding of variables.3 For example, the V A L U E S clause, which could represent the expected results of S E L E C T queries, is not allowed. 6. Develop a transformation of the source data to the target knowledge graph that is expected to meet the constraints. This can be implemented in many ways, largely dependent on the format of the source data, so we will not cover it here. For an overview of these approaches, see e.g., Fensel et al. [18] 7. Validate the knowledge graph under development with the SHACL constraints. If the validation fails, resolve the reported violations by fixing the artifacts created in the previous steps. Fix the most primary artifact causing the violations, i.e. their root cause. For instance, data transformation shall not work around an insuficiently expressive ontology. Similarly, imposing a more expressive data model on the knowledge graph might reveal errors in its source data. In this way, the finer structure of its ontology makes the previously hidden data quality issues visible. Some feedback may even indicate ill-formed CQs in need of revision. 8. Refactor the artifacts. The SHACL specification of the knowledge graph is a key artifact to refactor. It can serve as a formal specification of the anticipated use of the ontology. Moreover, SHACL can transcend a single ontology and specify how to combine terms reused from multiple ontologies. During the refactoring, SHACL core constraints can be induced from SPARQL-based constraints to represent the frequently queried patterns in data. For instance, aided by the knowledge about the domain, you can infer universally quantified axioms represented as SHACL constraints from a sample of existentially quantified axioms represented as SPARQL queries. In particular, SHACL can prescribe the expected relations in data. More complex or distant relations might even require the expressivity of SHACL to be represented, using property paths or SPARQL graph patterns. For example, even though SHACL does not support conditional constraints directly, they can be rewritten according to the inference rule of material implication → ⇔ ¬ ∨ .

5. Non-functional Requirements

Note that the above-described method tests if a knowledge graph meets the given user requirements. It does not evaluate how well these requirements are satisfied. Absence of errors does not imply that the knowledge graph is sound. Therefore, the functional user requirements should be complemented by non-functional requirements describing the desired qualities of the solution. These requirements can be checked and refactored by using alternative sources of feedback, such as: Code review

Elicit expert insight from ontology and data engineers.

3https://www.w3.org/TR/shacl/#pre-binding

User feedback

The results produced by SPARQL queries formalizing the CQs can be judged to be incorrect by users. User feedback can identify wrong interpretation of CQs or false assumptions. It might prompt revisiting any of the previously described steps or lead to adding more CQs.

Usability testing

Usability may manifest in developer experience of writing SPARQL queries on a knowledge graph. In particular, complexity and verbosity of SPARQL queries may be considered. In this way, usability of the queries indirectly reflects the usability of the ontology. For example, verbosity may be caused by duplicate data that can be abstracted to the ontology, while complexity can be reduced by introducing ontological shortcuts. Verbose SPARQL queries may also suggest a need for introducing abstraction or additional axioms into the ontology. In fact, some CQs can be answered directly using the knowledge formalized in the ontology, since SHACL expects the validated data graph4 to include the ontologies it populates.

Performance testing

Performance can be evaluated indirectly using the execution time of SHACL validation. This time is proportional to the execution times of the SPARQL queries the constraints include.

The non-functional requirements can also be covered by tests, such as query performance tests. These tests can provide additional feedback informing the test-driven development. Using the above-mentioned feedback is a balancing act of multi-objective optimization. For example, performance feedback may be incompatible with the feedback about verbosity or CQs may pose mutually conflicting requirements. Addressing the feedback thus requires making informed trade-ofs.

6. Limitations

We are aware of several limitations of the proposed method. Here we recount some of them and suggest how to remedy them. The method may cause the developed artifacts outlined in the Figure 1 to overfit the user requirements in scope, which would hinder the reuse of the artifacts. This shortcoming can be remedied by including more CQs or focusing on reusability and abstraction during refactoring. User requirements for the knowledge graph under construction can be more complex than what SPARQL can express. One way around it is formalizing the requirements in a more expressive programming language that extends SHACL, such as in [19]. Ultimately, in order to ameliorate the limitations of this method, it is best combined with other methods, such as those for ontology design (e.g., Kendall and McGuinness [20]).

4https://www.w3.org/TR/shacl/#data-graph 7. Case Study: Antigen Covid Tests Knowledge Graph

We used the described method to create a knowledge graph about antigen tests for SARS-CoV-2.

Its source data was originally gathered to create an overview of diferent evaluations of selected rapid antigen tests available on the Czech market. Data analyses were carried out [21] in order to validate the assumption that the claims on test quality provided by the manufacturers difer significantly from what is verified by the laboratories of independent organizations.

The data was collected manually from several regulatory bodies, such as SÚKL5 and EU HSC6, and was combined with performance evaluations of the tests, such as their sensitivity and specificity, that came from independent studies from various countries. The data was collected into an XML file. The final destination of the data was originally twofold. One was the actual portal that provided a graphical visualization of various properties of the tests.7 The other was a dashboard allowing to perform analyses online. While the XML data was already structured, it lacked any ontological grounding, and some of its features were tuned towards presentation aspects within the website.

We set to create a knowledge graph out of it to open it to a wider reuse and allow performing retrospective analyses of this data. All artifacts we developed in this efort, such as CQs, are available as open source8 and the knowledge graph is released.9

Understanding the source data and its domain was a key prerequisite of our work. The data covers 158 antigen SARS-CoV-2 tests and their evaluations from several sources. Each source had its way of describing the domain, which we needed to comprehend first to represent the source’s data faithfully and make it commensurable. We also needed to become aware of the purpose for which the data was collected and how it was represented to serve this purpose. The data was originally structured in XML for display on a web page, so we needed to map its document-oriented structure and visual encoding into semantics. Knowing the source data well, we could begin with the knowledge graph construction.

Step 1: The first step of the proposed method is gathering user requirements. We started with capturing user requirements formulated as CQs, such as ”What is the sensitivity of given tests according to their manufacturers?” We came up with 19 CQs in total, covering both basic information look-up and more complex analytical questions. We removed one of the CQs later upon finding that it was subsumed by another CQ. The complete list of the CQs is available online.10

Step 2: We continued with analysis of the CQs and extraction of terms and relations, such as ”sensitivity”, ”test”, or ”has manufacturer” from the given example question. Since we did not have many CQs, we extracted the terms and relations manually.

Step 3: The third step was to create a minimum viable ontology from the extracted terms and relations, which we did mostly by reusing existing ontology terms and using RDF Schema; see 5https://www.sukl.cz/prehled-testu-k-diagnostice-onemocneni-covid-19 6https://health.ec.europa.eu/health-security-and-infectious-diseases/crisis-management/covid-19-diagnostic-tests_ en 7https://covidtesty.vse.cz/english/test-evaluation-older-data/ 8https://github.com/KIZI/antigen-covid-tests-knowledge-graph 9https://github.com/KIZI/antigen-covid-tests-knowledge-graph/releases/tag/v1.0.0 10https://github.com/KIZI/antigen-covid-tests-knowledge-graph/wiki/Competency-questions Listing 1. For what we did not find suitable terms to reuse, we formalised terms in the simple Antigen Covid Test Ontology. This ontology aimed to allow expressing the CQs in SPARQL.

Listing 1: Example ontology terms :DiagnosticSensitivity a rdfs:Class ;

rdfs:label ”Diagnostic sensitivity”@en . :AntigenCovidTest a rdfs:Class ;

rdfs:label ”Antigen covid test”@en . :hasManufacturer a rdf:Property ; rdfs:label ”Has manufacturer”@en .

Step 4: Having a minimum viable ontology, we were able to translate the CQs into SPARQL queries, such as in the example Listing 2. Each question was translated manually.

Listing 2: Example competency question in SPARQL

PREFIX act: <https://covidtesty.vse.cz/vocabulary#> PREFIX dcterms: <http://purl.org/dc/terms/> PREFIX ncit: <http://purl.obolibrary.org/obo/NCIT_> PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX schema: <http://schema.org/> ASK { ?test a act:AntigenCovidTest ;

schema:manufacturer ?manufacturer . }

Step 5: We wrapped the SPARQL queries as SHACL constraints, such as in the example Listing 3 that defines a SPARQL-based constraint component 11 applied to the entire data graph. The s h : m e s s a g e captures the CQ in natural language, which helps to identify which question failed to validate in the validation reports. The s h : a s k contains the CQ translated to a SPARQL query prepared in the previous step.

Listing 3: Example competency question in SHACL :DatasetShape a sh:NodeShape ; rdfs:label ”Competency questions applicable to the entire dataset”@en ; sh:targetNode [] ; :cq12 [] . 11https://www.w3.org/TR/shacl/#sparql-constraint-components :CQ12 a sh:ConstraintComponent ; rdfs:label ”Manufacturer-declared test sensitivity”@en ; sh:parameter [

sh:path :cq12 ] ; sh:nodeValidator [ a sh:SPARQLAskValidator ; sh:message ”””What is the sensitivity of given tests

according to their manufacturers?”””@en ; sh:prefixes :prefixes ; sh:ask ””” ASK { ?test a act:AntigenCovidTest ;

schema:manufacturer ?manufacturer .

Step 6: We started with implementing a transformation of the source data to RDF. We converted the input XML data into RDF/XML via an XSL transformation followed by SPARQL Update operations for post-processing. For example, we used post-processing for normalization of code-list values and manufacturers’ names. We also implemented few traditional unit tests of XSL functions within the transformation.

Step 7: The resulting data was tested by the CQs implemented in SHACL. We automated the data processing and test execution by a shell script based on Jena command-line tools.12 Given that this is a small knowledge graph of around 10 thousand RDF triples, we validated it as a whole.

Step 8: The final step of the proposed method is about refactoring the developed artifacts. Our development work proceeded in several iterations guided by continuous feedback from the script for data transformation and validation. When the knowledge graph satisfied the CQs, we continued with refactoring the semantic artifacts we created. For example, we wanted to avoid blank nodes to make the data transformation results better comparable, so we improved the XSL transformation to generate IRIs instead. We also abstracted a parent class for evaluations in the ontology and adjusted the SHACL shapes accordingly.

Some errors were not detected by the tests implementing the CQs. For example, when querying the knowledge graph, we found that the data transformation to RDF mistakenly created IRIs of manufacturers based on their antigen covid test identifiers. Consequently, the 12https://jena.apache.org/documentation/tools resulting data related each manufacturer to one antigen covid test, so that it was not possible to group multiple tests by the same manufacturer. A common response to discovering errors not covered by tests is to improve the test coverage to enable detecting the errors found and avoid future regressions. We followed this practice and added a CQ comparing a given manufacturer’s antigen covid tests. The CQ presupposed that there are manufacturers ofering more than one test and would therefore fail in case of the above-described error. Apart from fixing the way we generated manufacturers’ IRIs, we spent further efort on de-duplicating manufacturers via post-processing by SPARQL Update operations.

While we had CQs about manufacturers’ tests, the implementations of these CQs generally asked if manufacturers’ tests matching certain criteria exist. Being encoded as SPARQL ASK queries, any non-empty query results satisfied these CQs. They did not detect when the results were not the expected ones, such as when not showing all manufacturer’s tests in the example of duplicate manufacturers above. We addressed this limitation by asking a more specific CQ presupposing more specific assertions. The more specific assertions we make about our domain, the better we can detect invalid data describing it. Yet when the domain changes or when our understanding of the domain is flawed, such assertions are more likely to become invalid. Consequently, there is a trade-of to be made between an assertion’s specificity and its durability in face of change.

We encountered several other challenges during the development of the knowledge graph, such as handling the structure of the source data originally designed for a web presentation or frequent changes in relevant legislation and evaluation studies. Since the source data was collected manually, it was inconsistent and required fixes. Some of these inconsistencies manifested as duplicates and were detected by our test suite. Structuring the data as a knowledge graph and more rigorous testing thus helped to make the errors visible. Future work on this knowledge graph can be aimed at automatic updates of the data and expanding its coverage beyond the Czech market for antigen covid tests.

8. Conclusions

The hereby proposed method guides through the open-ended process of knowledge graph construction. It breaks the process down into a well-defined sequence of steps, feedback loops between them, and semantic artifacts that are produced or consumed in the process. The central contribution of the method is allowing to test if the produced knowledge graph satisfies the requirements for its construction. This is done via competency questions formulated as SPARQL queries embedded in SHACL shapes for test automation. As such, one of its key advantages is being based on the established semantic web standards.

We shared the experience of applying the method to build a knowledge graph about antigen covid tests. The method aided in collaborative development of this knowledge graph, allowing its incremental refinement without regressions. Since constructing knowledge graphs from the same user requirements using multiple methods, while comparing their outcomes, is prohibitively expensive, we expect future improvements of the proposed method to come from its applications on knowledge graphs with diferent requirements.

Acknowledgments

This research was supported by CHIST-ERA within the CIMPLE project (CHIST-ERA-19-XAI003). Ekaputra (Eds.), Knowledge Engineering and Knowledge Management, Springer, Cham, 2022, pp. 19–35. [15] G. Fink, M. Bishop, Property-based testing: A new approach to testing for assurance, SIGSOFT Software Engineering Notes 22 (1997) 74––80. URL: https://doi.org/10.1145/ 263244.263267. doi:1 0 . 1 1 4 5 / 2 6 3 2 4 4 . 2 6 3 2 6 7 . [16] SHACL-AF, SHACL advanced features, 2017. URL: https://www.w3.org/TR/shacl-af. [17] P. Stickler, CBD, 2005. URL: https://www.w3.org/Submission/CBD. [18] D. Fensel, U. Şimşek, K. Angele, E. Huaman, E. Kärle, O. Panasiuk, I. Toma, J. Umbrich, A. Wahler, How to Build a Knowledge Graph, Springer, Cham, 2020, pp. 11–68. URL: https://doi.org/10.1007/978-3-030-37439-6_2. doi:1 0 . 1 0 0 7 / 9 7 8 - 3 - 0 3 0 - 3 7 4 3 9 - 6 _ 2 . [19] SHACL-js, SHACL JavaScript extensions, 2017. URL: https://www.w3.org/TR/shacl-js. [20] E. F. Kendall, D. L. McGuinness, Ontology engineering, volume 18 of Synthesis lecture on the semantic web: theory and technology, Morgan and Claypool, 2019. [21] T. Kliegr, J. Jarkovský, H. Jiřincová, J. Kuchař, T. Karel, R. Tachezy, Role of population and test characteristics in antigen-based SARS-CoV-2 diagnosis, czechia, august to november 2021, Euro Surveillance 27 (2022). URL: https://doi.org/10.2807/1560-7917.ES.2022.27.33. 2200070.

[1]

Hogan , E. Blomqvist,

Cochez , C. D'amato , G. D. Melo , C.

Gutierrez , S.

Kirrane , J. E. L.

Gayo , R.

Navigli , S.

Neumaier , A.-C. N.

Ngomo , A.

Polleres , S. M.

Rashid , A.

Rula , L.

Schmelzeisen , J.

Sequeda , S.

Staab , A.

Zimmermann , Knowledge graphs , ACM Computing Survey 54 ( 2021 ). URL: https://doi.org/10.1145/3447772. doi:1 0 . 1 1 4 5 / 3 4 4 7 7 7 2 .

[2]

Soldatova ,

Panov ,

Džeroski , Ontology engineering: From an art to a craft , in: V. Tamma , M.

Dragoni , R.

Gonçalves , A . Ławrynowicz (Eds.), Ontology Engineering, Springer, Cham, 2016 , pp. 174 - 181 .

[3] SPARQL, SPARQL 1.1 query language , 2013 . URL: http://www.w3.org/TR/sparql11-query.

[4]

SHACL

, Shapes constraint language (SHACL ), 2017 . URL: https://www.w3.org/TR/shacl.

[5]

Gruninger ,

M. S.

Fox , The design and evaluation of ontologies for enterprise engineering , in: Workshop on Implemented Ontologies, European Conference on Artificial Intelligence (ECAI) , 1994 .

[6]

Ren ,

Parvizi ,

Mellish ,

J. Z.

Pan , K. van Deemter ,

Stevens , Towards competency question-driven ontology authoring , in: V. Presutti , C. d'Amato , F.

Gandon , M. d'Aquin, S.

Staab , A . Tordai (Eds.), The Semantic Web: Trends and Challenges , Springer, Cham, 2014 , pp. 752 - 767 . URL: https://link.springer.com/content/pdf/10.1007/978-3- 319 -07443-6_ 50 .pdf.

[7]

Zemmouchi-Ghomari ,

A. R.

Ghomari , Translating natural language competency questions into SPARQL queries: a case study , in: The First International Conference on Building and Exploring Web Based Environments , 2013 , pp. 81 - 86 .

[8]

J. Z.

Pan ,

Matentzoglu ,

Jay ,

Vigo ,

Zhao , Understanding Author Intentions: Test Driven Knowledge Graph Construction , Springer, Cham, 2017 , pp. 1 - 26 . URL: https: //doi.org/10.1007/978-3- 319 -49493- 7 _1. doi:1 0 . 1 0 0 7 / 9 7 8 - 3 - 3 1 9 - 4 9 4 9 3 - 7 _ 1 .

[9]

Beck , Test-driven development: by example , Addison-Wesley , 2003 .

[10]

Peroni , A simplified agile methodology for ontology development , in: OWL: Experiences and directions - reasoner evaluation , Springer, Cham, 2016 , pp. 55 - 69 .

[11]

Poveda-Villalón ,

Fernández-Izquierdo ,

Fernández-López , R. García-Castro, LOT: An industrial oriented ontology engineering framework , Engineering Applications of Artificial Intelligence 111 ( 2022 ). URL: https://www.sciencedirect.com/science/article/pii/ S0952197622000525. doi:h t t p s : / / d o i . o r g / 1 0 . 1 0 1 6 / j . e n g a p p a i . 2 0 2 2 . 1 0 4 7 5 5 .

[12]

N. R.

Shadbolt ,

P. R.

Smart , Knowledge elicitation, 4 ed., CRC Press, Boca Raton, 2015 , pp. 163 - 200 .

[13] RDFS, RDF Schema 1.1 , 2014 . URL: https://www.w3.org/TR/rdf-schema.

[14]

Espinoza-Arias ,

Garijo ,

Corcho , Extending ontology engineering practices to facilitate application development , in: O. Corcho , L.

Hollink , O.

Kutz , N.

Troquard , F. J.