<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>OSKGC: A benchmark for Ontology Schema-based Knowledge Graph Construction from Text</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dali Wang</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mizuho Iwaihara</string-name>
        </contrib>
      </contrib-group>
      <abstract>
        <p>Knowledge graph construction is a critical task in natural language processing, providing structured information that supports downstream applications such as recommendation systems and question answering. To ensure the accuracy and completeness of the information stored in knowledge graphs, it is essential to continuously update them based on new textual data while maintaining consistency with the existing knowledge graph architecture. To this end, constructing knowledge graphs from text based on a predefined ontology schema is crucial. However, existing benchmark only considers simplistic schema, lacking subsumptions. A new task shall be constructed, that requires not only accurate extraction of factual triples, but also structural alignment between the constructed knowledge graph and the predefined hierarchical ontology schema. For benchmarking this task, we introduce OSKGC, a benchmark dataset specifically designed for ontology schema-based knowledge graph construction, along with a new evaluation metric that measures the structural similarity between the constructed knowledge graphs and the predefined ontology schema. Compared to existing datasets, OSKGC ensures alignment between textual data, triples, and the ontology schema, provides fine-grained ontology annotations, and incorporates hierarchy within the ontology schema. To validate the utility of OSKGC, we propose two baseline methods under joint extraction and pipeline settings and conduct experiments using several mainstream large language models. Source Repo: https://github.com/HeraclesWang/OSKGC DOI: https://zenodo.org/records/15297020</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Benchmark</kwd>
        <kwd>Knowledge Graph</kwd>
        <kwd>Knowledge Graph Construction</kwd>
        <kwd>Large Language Models</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        In recent years, the task of extracting triples from text to construct knowledge graphs has garnered
significant attention in the field of information extraction [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. Despite containing rich semantic
information, text is challenging to use directly for complex tasks, such as automated reasoning and
question answering [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ]. Knowledge graphs, in contrast, provide structured means of storing
information, typically representing facts in the form of triples, where each triple consists of a subject,
predicate, and object. For instance, the triple (Alabama, country, United States) represents the fact that
Alabama belongs to the country United States. Numerous studies have demonstrated that knowledge
graphs efectively support downstream tasks, including question answering [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ], decision-making
[
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ], recommendation systems [
        <xref ref-type="bibr" rid="ref10 ref11 ref9">9, 10, 11</xref>
        ], and search engines [
        <xref ref-type="bibr" rid="ref12 ref13">12, 13</xref>
        ]. However, knowledge graphs
often sufer from issues of incompleteness due to their collaborative or semi-automatic construction
processes [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Researchers have long sought to use newly retrieved information to augment existing
knowledge graphs [
        <xref ref-type="bibr" rid="ref15 ref16">15, 16</xref>
        ]. A critical step in bridging unstructured text with structured knowledge
involves extracting triples that align with the schema of existing knowledge graphs. This task has
become a research hotspot, especially with the increasing demand for precise and consistent knowledge
representation [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
      <p>
        Current tasks for constructing knowledge graphs from text can be categorized into two main settings:
Open information extraction (OIE) setting and ontology-driven, as shown in Figure 1. The OIE setting
takes text as input and produces triples as output. For example, given the input text "The capital
of France is Paris," the output would be the triple (France, capital, Paris). To evaluate this setting,
benchmark datasets such as WebNLG [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] and NYT [19] have been widely used. However, they only
provide text-triple pairs without ofering an ontology-based schema for constraint, which we refer
to as "ontology schema" in this paper, to constrain the extracted triples. While suitable for open
triple extraction, these datasets often fail to ensure the consistency and standardization of constructed
triples, making them less applicable for tasks requiring domain-specific knowledge graph updates. In
contrast, the ontology-driven setting takes both text and a predefined ontology as input and output
triples. TEXT2KGBENCH [20] represents one of the few works in this domain, introducing an
ontologydriven dataset to evaluate the performance of large language models in constructing knowledge graphs.
However, its ontology schema is critically simple, primarily exhibiting a single star-shaped structure
that inadequately captures the hierarchical information of fine-grained entity types. This limitation
reduces its efectiveness for handling complex tasks requiring precise distinctions between entity types.
Additionally, mismatches between the ontology schema and the text-triple data undermine its utility,
highlighting the need for a high-quality dataset with better alignment.
      </p>
      <p>To address these gaps, we propose a novel task setting for constructing knowledge graphs from text
based on an ontology schema. In our setting, the input consists of the text and a predefined ontology
schema, and the output includes both triples and their corresponding schemas. We evaluate not only
the factual extraction performance but also the alignment between the schema of the constructed
knowledge graph and the predefined ontology schema. This aims to extract structured information from
new texts to expand the knowledge graph without altering its existing architecture. To support this
task, we propose a benchmark dataset, OSKGC, along with an evaluation metric tailored to our setting.
Considering the need to design distinct ontology schemas for diferent thematic domains, we selected
WebNLG, which characterized by its clear topic separation, as the foundation. We provide fine-grained
annotations for entities, such as subdividing "politician" into categories like president, vice president,
and mayor, to better reflect the types of entities in the text. Furthermore, we introduce a subsumption
hierarchy derived from DBpedia [21] to support fine-grained entity type recognition. Through these
enhancements, OSKGC spans multiple domains, features fine-grained annotations, and ensures
wellaligned ontology schemas. Furthermore, we adopt several mainstream LLMs as baselines and conducted
experiments using joint extraction and pipeline settings to confirm OSKGC as a challenging benchmark.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>Constructing knowledge graphs from unstructured text typically involves extracting entities and their
relations and organizing them into structured triples. In this field, several benchmark datasets have
been proposed for model training and evaluation, which are summarized in Table 1. CoNLL04 [22] as an
early representative benchmark, employed manual annotation, clearly defining entity types and relation
categories, providing a solid experimental platform for named entity recognition (NER) and relation
extraction (RE). However, it lacks fine-grained entity type classification and does not incorporate a
formal ontology schema, focusing primarily on general-domain relation extraction task.</p>
      <p>With the integration of large-scale textual resources and knowledge bases, distant supervision has
been adopted to automatically construct training data. This approach assumes that if two entities have
a certain relation in a knowledge base, they also express the same relation when they co-occur in the
same sentence. Typical datasets such as NYT [19] and REBEL [23] were constructed using distant
supervision, automatically annotating large volumes of triple data, significantly improving the eficiency
of construction. However, this method also introduces considerable noise, such as incorrect entity
pairings or missing relation labels. Moreover, such datasets typically do not define entity types or build
extensible ontology schemas, which limits their support for canonicalization of extracted triples and
knowledge graph expansion.</p>
      <p>
        Other works attempt to generate corresponding text directly from structured data to construct
training data, such as WebNLG [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] and TekGen [24]. These datasets pair triples with natural language
descriptions, aiming to train models to generate factual text or recover triples from text. The WebNLG
dataset uses manual annotation and covers multiple thematic domains, providing good topic
diferentiation. In contrast, TekGen was built using distant supervision, lacking entity type definitions and
the design of ontology schema. Furthermore, the recently proposed TEXT2KGBENCH [20] introduced
a more systematic ontology design for ontology-driven triple extraction. However, the constructed
ontology schema is overly simplistic, with each category forming a star-shaped structure centered
around a core ontology, which makes it incapable of handling associations between entities with deep
hierarchical structures. In addition, the ontology schema does not align well with the text and triples,
because the context of the text was not fully considered during construction, resulting in low data
quality.
      </p>
      <p>Overall, most mainstream datasets today focus primarily on triple extraction itself, with limited
attention to ontology schema construction and fine-grained entity typing. These limitations hinder
the ability of current datasets to support incremental updates and structural alignment within existing
KG resources, thereby restricting the scalability of KGs in practical applications. In contrast, OSKGC
provides fine-grained entity annotations along with an explicitly ontology schema with hierarchy,
iflling the gap left by existing datasets.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Benchmark Construction</title>
      <p>
        In this section, we present the construction method of the OSKGC dataset, consisting of 57 categories,
where each category has two main components: The ontology schema and the corresponding text-triple
pairs. To ensure alignment among the text, triples, and ontology schema in the dataset, we construct the
ontology schema from scratch based on WebNLG [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] and cleaned the text-triple pairs. Additionally,
we introduce an evaluation metric designed to assess the alignment between the knowledge graph
constructed by the model and the predefined ontology schema.
      </p>
      <sec id="sec-3-1">
        <title>3.1. Construction of Ontology Schema</title>
        <p>The construction of the ontology schema serves two primary objectives: (1) to generate a schema label
for each text-triple pair, which is used for training and evaluation; and (2) to build a predefined ontology
schema for each category, which serves as a structural guide for knowledge graph construction. In
WebNLG, the data are presented as pairs of text and corresponding triples, without explicit entity type
information. We selected data entries that contain one, two, or three triples as the basis for constructing
our ontology schema. To ensure alignment between the ontology schema and the text-triple data, we
ifrst annotate the entities in each triple with their corresponding entity types, thereby transforming
each triple into a type-level schema that serves as the label for the associated data entry. Subsequently,
based on the subsumption hierarchy of the DBpedia ontology, we enrich each annotated entity type by
recursively adding more general types, thus constructing a subsumption hierarchy. Finally, for each
category, we aggregate all schema labels associated with the data entries. For every entity type in the
head and tail positions of these schema labels, we identify its corresponding root-level general type
within the subsumption hierarchy. Using the relations from the schema labels, we link these root-level
types to form a core structural schema in the form of triples. By summarizing the core structural
schemas and their associated hierarchical structures, we construct a predefined ontology schema for
each category. The complete ontology schema construction process is illustrated in Figure 2, consisting
of three main steps: annotating entity types, constructing the subsumption hierarchy, and constructing
relations.</p>
        <sec id="sec-3-1-1">
          <title>3.1.1. Annotating Entity Types</title>
          <p>In this step, we annotate the entity types in the triples of each data entry. For each entity in the triples,
we query DBpedia to find the deepest level type of the entity and use it as the initial type for the entity.
We manually reviewed the queried entity types to ensure the accuracy of the annotations. On this
basis, we performed fine-grained annotation manually for each entity. Specifically, we used external
knowledge sources including Wikipedia and Wikidata [25] as references to assign more precise entity
types to each entity. For example, for the entity "Rome," the initial type obtained from DBpedia was
"City." After manual annotation, it was refined to "CapitalCity." In this process, we also referred to the
context of each entity in the text to ensure that the annotated entity type aligns with the content in
the text. This is crucial due to the polysemy of entities. For example, the entity "Christian Panucci"
was a soccer player during his early career and later became a soccer manager. For such polysemous
entities, we annotate them with the type that aligns with the context of the text, ensuring that the
constructed ontology schema remains consistent with the textual content. After this step, we obtain
the corresponding entity type for each entity in diferent text contexts.</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>3.1.2. Constructing Hierarchy</title>
          <p>In this step, we add a subsumption hierarchy to all entity types. The goal is to efectively categorize
the annotated entity types into a hierarchy. Specifically, we use the hierarchy of DBpedia ontology
classes as a reference. For each entity type, we query its hierarchical path to the root node. For example,
for the entity type "President," the obtained path is "President → Politician → Person → Animal →
Eukaryote → Species → Thing". For these paths, we removed the nodes that do not contribute to
efective classification. Specifically, we remove overly general nodes that are close to the root and retain
the nodes starting from the most general node in the path that can identify the annotated entity type.
For example, for the entity type "President", the most general entity type that can identify it is "Person".
Therefore, the retained path is "President → Politician → Person", with overly general nodes above the
"Person" level removed. In addition, some nodes in the hierarchy of DBpedia have sibling nodes that do
not have instantiated entities in OSKGC. These nodes not only fail to contribute to efective classification
but also add unnecessary complexity to our hierarchy. Therefore, we consider them redundant and
remove them from the paths where they appear. For example, for the entity type "FormulaOneRacer",
its path is "FormulaOneRacer → RacingDriver → MotorsportRacer → Athlete → Person". Since the
sibling nodes of "MotorsportRacer" and "RacingDriver" do not have instantiated entities in OSKGC,
these siblings are removed. Since we have performed fine-grained annotation of entity types, not all
entity types are included in DBpedia. For these entity types, we add them to appropriate positions
within the hierarchy.</p>
          <p>In OSKGC, not all entity types have a hierarchy. Since the purpose of introducing the hierarchy is to
classify the finely annotated entity types, we did not build additional hierarchies for entity types that
can be distinct on their own. For instance, entity types like "EthnicGroup" and "Language", which are
inherently easy to distinguish, do not need additional hierarchy for classification. For all entity types
with a hierarchy, we define the manually annotated entity type in the schema label as the golden label,
and all the higher-level nodes in the hierarchy as silver labels. For example, for the type "President,"
the golden label is "President," while the silver labels include "Politician" and "Person". The golden
label represents the fine-grained and precise entity type, whereas the silver labels correspond to more
general entity types. These silver labels are used to evaluate the structural similarity of the constructed
knowledge graph.</p>
        </sec>
        <sec id="sec-3-1-3">
          <title>3.1.3. Constructing Relations</title>
          <p>In this step, we construct relations between entity types based on the instantiated triples from the
WebNLG corpus. Specifically, we connected annotated entity types and their root-level types within the
hierarchy using relations derived from the triples, which allowed us to generate the schema label for the
text-triple data and the predefined ontology schema. For example, given the triple (United States, leader,
Joe Biden), where the type of "United States" is "Country" and the type of "Joe Biden" is "President," the
schema label for the data entry is (Country, leader, President). Subsequently, based on the hierarchy, we
determined that the root nodes corresponding to the entity types "Country" and "President" are "Place"
and "Person", respectively. Thus, we derived (Place, leader, Person) and included it in the predefined
ontology schema. Figure 3 illustrates a portion of the predefined ontology schema we constructed as
an example. Since the entity types connected by relations in the ontology schema are root nodes in
the hierarchy, which represents the most general classification of the entity types, the schema labels
specific to each data instance are not directly reflected here. We followed the classification standards
of the WebNLG and divided the data into three groups based on the number of triples they contain.
These were further subdivided into 19 thematic categories, resulting in a total of 57 categories. For each
category, we independently summarized a predefined ontology schema.</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Cleaning Text-Triple Pairs</title>
        <p>Apart from the ontology schema, another crucial component of OSKGC is the text-triple pairs. This
portion of the data is sourced from WebNLG. To ensure consistency and alignment between the text
and the triples, we conducted a thorough review and validation of the text-triple pairs. As shown in
Table 2, the issues we found in WebNLG primarily include inconsistencies between entities in the text
and those in the triples, mismatches between the semantics of the text and the triples, and triples in
the labels containing information not present in the text. For entity inconsistency, an example is such
that the text mentions "American English" as a language, while the tail entity in the triple is "English
Algorithm 1: Compute Structural Similarity (SS)</p>
        <p>Initialize _ ←
foreach triple (ℎ, , ) in  do</p>
        <p>
          0
Input: Ontology hierarchy ℋ
Output: Structural similarity score  ∈ [
          <xref ref-type="bibr" rid="ref1">0, 1</xref>
          ]
        </p>
        <p>Find (ℎ, , ) in  such that  = ;</p>
        <p>, Golden schemas , Predicted schemas 
ℎ ←
_ ←</p>
        <p>_ + (ℎ × );
ComputeSS(ℋ, ℎ, ℎ),  ←</p>
        <p>ComputeSS(ℋ, , );
 ←
return ;
⎩
⎧⎨ (︂_/||,
_ ×
︁( || )︁ 2)︂
| |</p>
        <p>if || ≥ |  |
/||, otherwise
Americans," which is an ethnic group. In such cases, we correct the entity in the triple to ensure it
aligns with the text. In cases of semantic inconsistency, the triples may include the correct entities,
but the semantic information they convey does not align with the text. For example, the text states
that both "Bacon" and "Sausage" are the main ingredients of "Bacon Explosion." However, the triples
in the label indicate only that "Sausage" is the main ingredient, while "Bacon" is merely an ingredient.
Such inconsistencies can mislead the model, thereby afecting the evaluation of the results. To resolve
this issue, we modified the triples to ensure they accurately reflect the content of the text. In cases
where the triples in the label contain information not present in the text, for example, the label includes
a triple (Asterix, alternativeName, Astérix), but the text does not mention this information. In such
cases, we removed the triples that were not related to the text. Through the above cleaning process, we
corrected the errors in WebNLG and ensured mutual alignment between the text and the triples, thus
ensuring the quality of OSKGC.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Evaluation Metric</title>
        <p>In our task setting, we aim to extract correct triples from the text while ensuring that the constructed
knowledge graph aligns with the predefined ontology schema. To this end, we propose an evaluation
metric called Structural Similarity (SS) to measure the degree of alignment between the schema of the
constructed knowledge graph and the predefined ontology schema, as shown in Algorithm 1.</p>
        <p>The proposed metric begins by evaluating the structural similarity between individual pairs of entity
types, specifically between a gold entity type  and a predicted entity type . Let  denote the path
length from the gold type  to the root node of the ontology. Based on the structural relation between 
and  within the ontology hierarchy, we define two types of matching cases: If the predicted type  lies
on the ancestral path from the gold type  to the root, the match is considered an ancestor match. In
this case, the structural similarity score is computed using Equation 1,</p>
        <p>︂(
SS = exp −  · 
 )︂
where  is the path length from  to , and  is a parameter that controls the decay rate of the score,
which we set to 2. This design reflects the intuitive principle that the closer the predicted type is to
the gold type in the hierarchy, the higher the SS score it receives. If the predicted type  is not on the
ancestral path of  but they share a lowest common ancestor (LCA), the match is regarded as a lowest
common ancestor (LCA) match. In this case, the similarity score simultaneously considers the distances
from both  and  to their LCA. Additionally, we introduce local structural entropy as a penalty factor
to model the complexity of the local topological structure around the predicted node. The scoring is
defined by Equation 2,</p>
        <p>SS = exp −  ·  × exp −  ·  + log2( + 1)
︂(
 )︂
︂(
′
︂)
(1)
(2)
where  and ′ denote the path lengths from  and  to the LCA, respectively;  is the number of
sibling nodes of the predicted type ; and  is a parameter that controls the strength of the entropy
penalty, which we set to 1.5. In graph representation learning, structural entropy is commonly used to
quantify the complexity and uncertainty of graph structures [26]. The local structural entropy term
log2( + 1) is introduced to capture the branching complexity of the subtree where the predicted node
resides. A larger number of sibling nodes indicates higher local uncertainty and an increased likelihood
of misclassification, thus resulting in a greater penalty. This design is consistent with the principle of
entropy in information theory, which is used to quantify uncertainty and complexity in systems [27]. If
no common ancestor exists between  and , they belong to diferent connected components in the
ontology, and the SS score is 0.</p>
        <p>For each predicted schema, the final SS score is calculated by multiplying the SS scores of its head and
tail entity types. At the data entry level, where a single entry may contain multiple schemas, the overall
SS is computed by summing the scores of all schemas and dividing by the number of corresponding
golden labels. Notably, when the number of predicted triples exceeds that of the golden triples, we
introduce an additional penalty factor to suppress redundancy in predictions, thereby ensuring that the
metric maintains strong discriminative power and robustness across results of varying scales.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Dataset Statistics</title>
      <p>In this section, we present the statistical information of OSKGC. OSKGC consists of two components:
text-triple-schema pairs and predefined ontology schema. Each data entry consists of a textual instance
labeled with its corresponding subgraph of the knowledge graph in triple form and the schema label
associated with the subgraph. For each category, we provide its predefined ontology schema. As
shown in Table 3, we divide the data into three groups based on the number of triples contained in the
text: those with one triple, two triples, and three triples. Within each group, the text data is further
categorized into 19 categories based on the topic of the text, resulting in a total of 57 categories. For
each category, although there are overlapping parts in the ontology schema, each also has unique
components. Therefore, we provide a dedicated predefined ontology schema for each category. The
data in each category are divided into a training set, a validation set, and a test set in a 7:1:2 ratio. We
ensured that the test set contains entities that do not appear in the training or validation sets. OSKGC
contains a total of 207 entity types, 382 relations, and 10,183 text entries. The knowledge graph includes
3,446 entities, and the predefined ontology schema consists of 691 edges, 207 of which represent the
subsumption hierarchy. The maximum depth of the hierarchy is 4, with an average of 1.6. Among the
entity types that have descendant nodes, the maximum in-degree is 21, with an average of 4.6.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Experiment</title>
      <p>In this section, we introduce the baseline experiments conducted on OSKGC. To evaluate the performance
of OSKGC, we employed two experimental setups: joint extraction and pipeline, as shown in Figure 4.
We conducted experiments using several current mainstream LLMs, including open-source models
Llama3-8b [28], Phi-3-small [29], Qwen2.5-7b [30], Mistral-7b [31] and proprietary models GPT-4o [32],
Gemini 1.5 pro [33] and Claude 3.5 sonnet [34].</p>
      <sec id="sec-5-1">
        <title>5.1. Joint Extraction Setup</title>
        <p>In this part of the experiment, we designed a single prompt to jointly extract entities and relations,
enabling the construction of the knowledge graph in one step, as shown in Prompt 1. The goal is
to evaluate whether the LLM is capable of handling a large amount of complex ontology schema
information. In this experimental setup, the prompt consist of the entity types, relations, and hierarchy
from the ontology schema along with the test text. Additionally, we provide a one-shot example to
assist the model in understanding and specify the format for the output. The input ontology schema
information is raw data without any additional processing, and the ontology schema for data belonging
to the same category is constant. We used two methods to select the one-shot example: random selection
and similarity-based selection. For random selection, we randomly choose a fixed data sample from
the training set of each category to serve as the example. All test data within the category share this
example. For similarity-based selection, we used SBERT [35] to find the most similar text in the training
set of the corresponding category for each test data based on text similarity. Each test data entry has its
own unique example. The output includes the triples and their corresponding ontology schema.</p>
        <p>Prompt 1: Joint Extraction
Your task is to construct a knowledge graph from the input text based on the given ontology schema. The goal is to extract triples based
on the given ontology schema’s entity types, relations, and hierarchy, and provide the most accurate corresponding ontology schema
possible.</p>
        <p>Ontology schema:
entity type: {entity type}
relation: {relation}
hierarchy: {hierarchy}
Example text: {example text}
Example output: {example output}
The output format must strictly follow the example, with no additional text or explanations.</p>
        <p>Input text: {text}
Output:
Prompt 2: Entity Recognition
Your task is to find all the named entities in the given text, including numbers, codes, and dates. You need to find at least two entities.
Only focus on named entities and never extract adjectives or numerical units. Please strictly follow the format of the following example
and only provide the named entities you find, without any additional words.</p>
        <p>Example text: {example text}
Output: {example output}
Text: {input text}
Output:
Prompt 3: Entity Typing
Please select the appropriate entity type for each given entity from the candidate entity types based on the text and your knowledge.
Text: {text}
Entities: {entities}
Candidate entity types: {candidate types}
Each square bracket in the candidate entity types contains a candidate entity type. If a candidate entity type is followed by a colon, then
each entity type enclosed in square brackets within the subsequent curly brace is a sub-type of that entity type.
Example:
Text: {example text}
Entities: {example entities}
Output: {example output}
Please select only one most appropriate entity type from the brackets for each entity, strictly following the format shown in the example,
without any additional words or explanation.</p>
        <p>Output:
Prompt 4: Relation Extraction
Please select relations from the candidates to connect given entities in the text, if they exist.</p>
        <p>Example:
Text: {example text}
Given entities: {example entities}
Candidate relations: {example relations}
Output: {example output}
Please strictly follow the output format in the example without any additional words.</p>
        <p>Text: {input text}
Given entities: {entities}
Candidate relations: {relations}</p>
        <p>Output:</p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Pipeline Setup</title>
        <p>In addition to the joint extraction method, we also adopted a pipeline experimental setup to construct
the knowledge graph step by step. Specifically, the process is divided into three steps: Entity recognition,
entity typing, and relation extraction, with separate prompts designed for each step. Similar to the
joint extraction experimental setup, we provided a one-shot example in the prompt, using both random
selection and similarity-based selection methods. Entity recognition aims to identify entity names from
the text. As shown in Prompt 2, we input the text into the prompt. To guide the LLM in generating the
expected output, we provided a one-shot example to specify the output format. The entity typing step
involves assigning the corresponding entity types to the extracted entity names. The prompt template
is shown in Prompt 3. The inputs for this step include the entity names output from entity recognition
step, the candidate entity types for the category of the input data, the input text, and a one-shot example.
The output is the corresponding entity type for each entity. The candidate entity types are derived
from the ontology schema of the category to which the input data belongs. Finally, relation extraction
involves identifying the relations between entities to form triples. The prompt template for this step is
shown in Prompt 4. The input for this step includes the entity names, candidate relations, the input text,
and a one-shot example. For the candidate relations, we did not use all the relations from the category
of the input data. Instead, we traced back the entity types obtained in the entity typing step to their
root node types based on the hierarchy. Then, we compiled all the relations existing between these
root nodes as the candidate relations. Through this approach, we eliminated the relations that were
irrelevant to the input data, thereby reducing the number of candidate relations and streamlining the
prompt.</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.3. Experimental Results</title>
        <p>Table 4 present the experimental results of open-source LLMs and proprietary LLMs. Precision, recall,
and F1 are metrics used to evaluate the triples extracted by the model. SS is the metric we proposed
to assess the similarity between the ontology schema constructed by the model and the predefined
ontology schema. Regarding the selection of one-shot example, the results from both the pipeline
and joint extraction setups indicate that using examples similar to the input text yields better results
than random selection. Comparing the pipeline and joint extraction experimental setups, we can
observe that the results of the joint extraction setup generally outperform those of the pipeline setup in
terms of the evaluation of the extracted triples. However, when evaluating the similarity between the
constructed knowledge graph’s ontology schema and our predefined ontology schema, the pipeline
approach generally performs better. These baseline results leave room for improvement in future work.</p>
      </sec>
      <sec id="sec-5-4">
        <title>5.4. Error Analysis</title>
        <p>We examined the responses generated by the baseline LLMs and found that the main issues include
incorrect entity extraction, relation extraction errors, mismatches between the constructed triples and
the semantic meaning of the text, hallucinations, and inconsistencies between the structure of the
triples and the predefined ontology schema. Table 5 presents examples of such errors from Claude.
For relation errors, the model generated "locatedIn," which was not among the candidate relations; the
correct relation should have been "location." Regarding entity errors, the model produced the entity
"Bininigt," which difers from the "Binignit" mentioned in the text. In the case of factual errors, the
output triple (Faversham, IsA, Town) does not align with any semantic evidence in the text. The model
also produced hallucinated entities not mentioned in the input text, such as "A Wizard of Mars." Lastly,
the structure of some output triples violated the predefined ontology schema, for example, the subject
and object were reversed compared to the expected format. In particular, these examples come from
Claude, a proprietary model that generally performed better among the baselines. These types of error
are even more frequent in open-source models with fewer parameters.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Discussion</title>
      <p>The primary task setting described in this paper for OSKGC involves constructing triples and their
corresponding schemas from text based on a predefined ontology schema. However, this is not the
only task applicable to the dataset. OSKGC provides aligned data for text, triples, and schemas, as
well as a predefined ontology schema at the category level. Researchers can define other potential
task settings by using the fine-grained annotations provided in OSKGC. Inverse tasks are also possible,
for example, given an entity type label such as "airport" and a text as context, one could formulate a
question-answering task to predict the specific entity name.</p>
      <p>In terms of baseline experiments, we primarily focus on evaluating whether current LLMs, known
for their powerful information processing capabilities, can directly handle our complex knowledge
graph construction task. According to the baseline results, while LLMs demonstrate better performance
than previous similar work [20] under various evaluation metrics, they still sufer from issues such as
incorrect entity extraction, failure to conform to the predefined ontology schema, and hallucinations,
even with high-performing proprietary LLMs. In this work, we did not incorporate multi-model
frameworks, external knowledge graph lookups, or retrieval-augmented generation techniques, which
we consider promising directions for future research.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusion</title>
      <p>In this paper, we proposed a novel task setting for constructing knowledge graphs from text based
on ontology schema. Additionally, we introduced a corresponding benchmark dataset, OSKGC, and
an evaluation metric. Compared to existing datasets, OSKGC features fine-grained annotations and
incorporates hierarchy. While ensuring quality, OSKGC introduces more complex ontology schemas,
thus increasing the level of challenge. We conducted baseline experiments with joint extraction and
pipeline settings on OSKGC using the current mainstream LLMs. The baseline results indicate that
there is room for improvement on our fine-grained dataset.</p>
    </sec>
    <sec id="sec-8">
      <title>8. Acknowledgments</title>
      <p>Weixi Wang contributed to the construction of OSKGC. This work was in part supported by JSPS
KAKENHI Grant Number 22K12044.</p>
    </sec>
    <sec id="sec-9">
      <title>Declaration on Generative AI</title>
      <p>The authors have not employed any Generative AI tools.
micro-planning, in: 55th Annual Meeting of the Association for Computational Linguistics, ACL
2017, Association for Computational Linguistics (ACL), 2017, pp. 179–188.
[19] S. Riedel, L. Yao, A. McCallum, Modeling relations and their mentions without labeled text, in:
Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD
2010, Barcelona, Spain, September 20-24, 2010, Proceedings, Part III 21, Springer, 2010, pp. 148–163.
[20] N. Mihindukulasooriya, S. Tiwari, C. F. Enguix, K. Lata, Text2kgbench: A benchmark for
ontologydriven knowledge graph generation from text, in: International Semantic Web Conference,
Springer, 2023, pp. 247–265.
[21] S. Auer, C. Bizer, G. Kobilarov, J. Lehmann, R. Cyganiak, Z. Ives, Dbpedia: A nucleus for a web of
open data, in: international semantic web conference, Springer, 2007, pp. 722–735.
[22] D. Roth, W.-t. Yih, A linear programming formulation for global inference in natural language
tasks, in: Proceedings of the eighth conference on computational natural language learning
(CoNLL-2004) at HLT-NAACL 2004, 2004, pp. 1–8.
[23] P.-L. H. Cabot, R. Navigli, Rebel: Relation extraction by end-to-end language generation, in:</p>
      <p>Findings of the Association for Computational Linguistics: EMNLP 2021, 2021, pp. 2370–2381.
[24] O. Agarwal, H. Ge, S. Shakeri, R. Al-Rfou, Knowledge graph based synthetic corpus generation for
knowledge-enhanced language model pre-training, arXiv preprint arXiv:2010.12688 (2020).
[25] D. Vrandečić, M. Krötzsch, Wikidata: a free collaborative knowledgebase, Communications of the</p>
      <p>ACM 57 (2014) 78–85.
[26] D. Zou, H. Peng, X. Huang, R. Yang, J. Li, J. Wu, C. Liu, P. S. Yu, Se-gsl: A general and efective
graph structure learning framework through structural entropy optimization, in: Proceedings of
the ACM Web Conference 2023, 2023, pp. 499–510.
[27] C. E. Shannon, A mathematical theory of communication, The Bell system technical journal 27
(1948) 379–423.
[28] A. Dubey, A. Jauhri, A. Pandey, A. Kadian, A. Al-Dahle, A. Letman, A. Mathur, A. Schelten, A. Yang,</p>
      <p>A. Fan, et al., The llama 3 herd of models, arXiv preprint arXiv:2407.21783 (2024).
[29] M. Abdin, J. Aneja, H. Awadalla, A. Awadallah, A. A. Awan, N. Bach, A. Bahree, A. Bakhtiari,
J. Bao, H. Behl, et al., Phi-3 technical report: A highly capable language model locally on your
phone, arXiv preprint arXiv:2404.14219 (2024).
[30] A. Yang, B. Yang, B. Zhang, B. Hui, B. Zheng, B. Yu, C. Li, D. Liu, F. Huang, H. Wei, et al., Qwen2.</p>
      <p>5 technical report, arXiv preprint arXiv:2412.15115 (2024).
[31] A. Q. Jiang, A. Sablayrolles, A. Mensch, C. Bamford, D. S. Chaplot, D. de las Casas, F.
Bressand, G. Lengyel, G. Lample, L. Saulnier, L. R. Lavaud, M.-A. Lachaux, P. Stock, T. L. Scao,
T. Lavril, T. Wang, T. Lacroix, W. E. Sayed, Mistral 7b, 2023. URL: https://arxiv.org/abs/2310.06825.
arXiv:2310.06825.
[32] A. Hurst, A. Lerer, A. P. Goucher, A. Perelman, A. Ramesh, A. Clark, A. Ostrow, A. Welihinda,</p>
      <p>A. Hayes, A. Radford, et al., Gpt-4o system card, arXiv preprint arXiv:2410.21276 (2024).
[33] G. Team, P. Georgiev, V. I. Lei, R. Burnell, L. Bai, A. Gulati, G. Tanzer, D. Vincent, Z. Pan, S. Wang,
et al., Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context, arXiv
preprint arXiv:2403.05530 (2024).
[34] A. Anthropic, The claude 3 model family: Opus, sonnet, haiku, Claude-3 Model Card 1 (2024) 4.
[35] N. Reimers, Sentence-bert: Sentence embeddings using siamese bert-networks, arXiv preprint
arXiv:1908.10084 (2019).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Su</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <article-title>A novel cascade binary tagging framework for relational triple extraction</article-title>
          , arXiv preprint arXiv:
          <year>1909</year>
          .
          <volume>03227</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>H.</given-names>
            <surname>Ye</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , S. Deng,
          <string-name>
            <given-names>M.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Tan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <article-title>Contrastive triple extraction with generative transformer</article-title>
          ,
          <source>in: Proceedings of the AAAI conference on artificial intelligence</source>
          , volume
          <volume>35</volume>
          ,
          <year>2021</year>
          , pp.
          <fpage>14257</fpage>
          -
          <lpage>14265</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>W.</given-names>
            <surname>Zhong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Tang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Duan</surname>
          </string-name>
          , Ar-lsat:
          <article-title>Investigating analytical reasoning of text</article-title>
          ,
          <source>arXiv preprint arXiv:2104.06598</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Asai</surname>
          </string-name>
          , E. Choi,
          <article-title>Challenges in information-seeking qa: Unanswerable questions and paragraph retrieval</article-title>
          , arXiv preprint arXiv:
          <year>2010</year>
          .
          <volume>11915</volume>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bordes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chopra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Weston</surname>
          </string-name>
          ,
          <article-title>Question answering with subgraph embeddings</article-title>
          ,
          <source>arXiv preprint arXiv:1406.3676</source>
          (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L.</given-names>
            <surname>Dong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <article-title>Question answering over freebase with multi-column convolutional neural networks</article-title>
          ,
          <source>in: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing</source>
          (Volume
          <volume>1</volume>
          :
          <string-name>
            <surname>Long</surname>
            <given-names>Papers)</given-names>
          </string-name>
          ,
          <year>2015</year>
          , pp.
          <fpage>260</fpage>
          -
          <lpage>269</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>L.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Yan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <article-title>An automatic machining process decision-making system based on knowledge graph</article-title>
          ,
          <source>International journal of computer integrated manufacturing 34</source>
          (
          <year>2021</year>
          )
          <fpage>1348</fpage>
          -
          <lpage>1369</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>L. T. H.</given-names>
            <surname>Lan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. M.</given-names>
            <surname>Tuan</surname>
          </string-name>
          , T. T. Ngan,
          <string-name>
            <given-names>N. L.</given-names>
            <surname>Giang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. T. N.</given-names>
            <surname>Ngoc</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Van Hai</surname>
          </string-name>
          , et al.,
          <article-title>A new complex fuzzy inference system with fuzzy knowledge graph and extensions in decision making, Ieee Access 8 (</article-title>
          <year>2020</year>
          )
          <fpage>164899</fpage>
          -
          <lpage>164921</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Cao</surname>
          </string-name>
          , M. Liu, T.-S. Chua,
          <article-title>Kgat: Knowledge graph attention network for recommendation</article-title>
          ,
          <source>in: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery &amp; data mining</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>950</fpage>
          -
          <lpage>958</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Zhuang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Qin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Xiong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <article-title>A survey on knowledge graphbased recommender systems</article-title>
          ,
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          <volume>34</volume>
          (
          <year>2020</year>
          )
          <fpage>3549</fpage>
          -
          <lpage>3568</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <article-title>Knowledge graph convolutional networks for recommender systems</article-title>
          ,
          <source>in: The world wide web conference</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>3307</fpage>
          -
          <lpage>3313</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>J.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Xia</surname>
          </string-name>
          ,
          <article-title>Web of scholars: A scholar knowledge graph</article-title>
          ,
          <source>in: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>2153</fpage>
          -
          <lpage>2156</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>C.</given-names>
            <surname>Xiong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Power</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Callan</surname>
          </string-name>
          ,
          <article-title>Explicit semantic ranking for academic search via knowledge graph embedding</article-title>
          ,
          <source>in: Proceedings of the 26th international conference on world wide web</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>1271</fpage>
          -
          <lpage>1279</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Qiu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <article-title>Knowledge graph representation with jointly structural and textual encoding</article-title>
          ,
          <source>arXiv preprint arXiv:1611.08661</source>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>X.</given-names>
            <surname>Lv</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Yao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Zeng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , L. Hou,
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <article-title>Step out of kg: Knowledge graph completion via knowledgeable retrieval and reading comprehension</article-title>
          ,
          <source>arXiv preprint arXiv:2210.05921</source>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>K.</given-names>
            <surname>Zeng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Jin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Lv</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Hou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Pang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Qi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          , et al.,
          <article-title>Xlore 3: A large-scale multilingual knowledge graph from heterogeneous wiki knowledge resources</article-title>
          ,
          <source>ACM Transactions on Information Systems</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Peng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <article-title>A comprehensive survey on automatic knowledge graph construction</article-title>
          ,
          <source>ACM Computing Surveys</source>
          <volume>56</volume>
          (
          <year>2023</year>
          )
          <fpage>1</fpage>
          -
          <lpage>62</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>C.</given-names>
            <surname>Gardent</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Shimorina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Narayan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Perez-Beltrachini</surname>
          </string-name>
          ,
          <article-title>Creating training corpora for nlg</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>