<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <article-id pub-id-type="doi">10.1145/3502223.3502248</article-id>
      <title-group>
        <article-title>Datasets of Mystery Stories for Knowledge Graph Reasoning Challenge</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kouji Kozaki</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shusaku Egami</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kyoumoto Matsushita</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Takanori Ugai</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Takahiro Kawamura</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ken Fukuda</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Fujitsu Limited</institution>
          ,
          <addr-line>Kanagawa</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>National Agriculture and Food Research Organization</institution>
          ,
          <addr-line>Ibaraki</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>National Institute of Advanced Industrial Science and Technology (AIST)</institution>
          ,
          <addr-line>Tokyo</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Osaka Electro-Communication University</institution>
          ,
          <addr-line>Osaka</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>31</volume>
      <fpage>0000</fpage>
      <lpage>0003</lpage>
      <abstract>
        <p>With the increasing application of AI systems across various social domains, the explanation for the AI decision is becoming important to guarantee the security and safety of such AI systems. Therefore, the Special Interest Group on Semantic Web and Ontology of JSAI started the Knowledge Graph Reasoning Challenge in 2018. It calls for techniques for reasoning and/or estimating criminals with a reasonable explanation based on knowledge graphs representing well-known stories of Sherlock Holmes. The challenges were held four times in Japan and once as an international event. Through them, 35 works were submitted in total. In organizing the challenge, we have been developing knowledge graphs about eight Holmes mystery stories. They have been extended and improved through the challenges. This paper reports on the knowledge graphs of mystery stories developed and published for the Knowledge Graph Reasoning Challenge.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;knowledge graph</kwd>
        <kwd>knowledge modeling for story</kwd>
        <kwd>dataset</kwd>
        <kwd>reasoning</kwd>
        <kwd>explainable AI</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>• They can allow for the design of tasks that are virtually closed (e.g., which have answers
and can control the constraints that lead to them) while including complex relationships
in the real world.
• Some tasks can be solved without including probabilistic processing or machine learning,
such as uncertain information or photographic evidence, or without supplementing
common knowledge that is not written explicitly, thus encouraging the fusion of estimation
and inference.
• They have an explanatory quality to human beings that the reader must agree with in
order for it to work as a novel.</p>
      <p>• The stories are widely known to the public and easily attract interest.</p>
      <p>A total of 35 entries have been submitted through the five challenges held to date. The
techniques used in these works include search and inference based on knowledge processing
using knowledge graphs and ontologies, machine learning using knowledge graph embedding,
and natural processing techniques, and so on.</p>
      <p>In organizing the challenge, we have been developing knowledge graphs eight Holmes
mystery stories in total. They have been extended and improved through the challenges. We believe
that these knowledge graphs are not only the target data for the challenge, but also a valuable
dataset for reasoning and explanation tasks using knowledge graphs. In this paper, we report
on the knowledge graphs of mystery stories developed and published for the Knowledge Graph
Reasoning Challenge.</p>
      <p>In the following sections of this paper, Section 2, we present an overview of the technical
challenges we have faced so far. Section 3 describes the schema design and the knowledge
graphs published for the of challenge. Section 4 lists the related works on event-centered
knowledge graphs, and finally, in Section 5, we summarize the results and discuss future
challenges.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Knowledge Graph Reasoning Challenge</title>
      <sec id="sec-2-1">
        <title>2.1. Purpose of the Challenge</title>
        <p>The Knowledge Graph Reasoning Challenge is a technical contest organized by the Special
Interest Group on Semantic Web and Ontology of the Japanese Society for Artificial
Intelligence (SIG-SWO of JSAI). The contest was launched in response to the growing interest in AI
technologies, particularly deep learning, and the associated emergence of issues related to the
explainability of AI systems. It calls for techniques for reasoning and/or estimating criminals
with a reasonable explanation based on knowledge graphs representing well-known stories of
Sherlock Holmes.</p>
        <p>The task for the challenge is to correctly identify the culprit and causes of incidents using
inference and estimation techniques. However, since it can be generalized as a kind of
knowledge graph completion 1, it can be positioned as a generic problem setting that can be applied
1For example, we can regard it is a completion task to find any missing relationship between a particular person
and the culprit.
to the construction of various knowledge bases including knowledge graphs, information
extraction and relation extraction, knowledge updating and maintenance, and so on. Moreover,
in addition to the focus on real social problems and the emphasis on the explainability of the
results, there are some unique dificulties, such as described in the following points:
• Real-world problems are all individual cases, and similar scenes do not necessarily
appear more than once. Therefore, knowledge or data is not necessarily big data, making
learning dificult.
• Rather than explaining single relationships by approximation in vector space, they must
be assembled or chained together to derive the goal as a whole.</p>
        <p>• The knowledge graph includes false statements spoken by the characters.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Target Stories and Tasks</title>
        <p>In this study, we provided the contents of eight of Sherlock Holmes’s short mystery stories
as knowledge graphs in total with tasks which should be solved using them. The target stories
that provided the knowledge graphs and their respective tasks are listed below.
• The Speckled Band: Who killed Julia? (criminal &amp; explanation)
• The Devil’s Foot: Who killed the victims? (criminal &amp; explanation)
• The Crooked Man: Why did Barclay die? (explanation)
• The Dancing Men: Break the codes (code breaking)
• The Abbey Grange: Who killed Lord Blackenstall? (criminal &amp; explanation)
• The Resident Patient: Who killed Blessington? (criminal &amp; explanation)
• Silver Blaze: Who took out the White Silver Blaze? (criminal &amp; explanation)
The contents of these eight mystery stories were converted into knowledge graphs based on
events and scenes. Participants in the technical challenge developed AI systems using these
data together with their own external knowledge, which was added and created as necessary.
The knowledge graph and past proposed techniques are available on the oficial website 2.</p>
        <p>Details of the knowledge graphs and some examples are given in Section 3.</p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. History</title>
        <p>We held the 1st Japan domestic knowledge graph reasoning challenge in 2018. Then, we had
ifve challenges in total every year. Four of them are Japan domestic version, and the other is
international challenge.</p>
        <p>There were three categories for application to the challenge, as follows
1. Main track: develop a system to solve one or more tasks of the target stories.
2. Tool track: develop tools to solve partially any of the tasks (e.g., suspect estimation, alibi
verification, motive explanation, and so forth).
3. Idea track: derive ideas on how to realize any of the above (possibly without system
implementation).</p>
        <p>The total number of proposals was 35 (16 in the main track, 9 in the tools track, and 10 in
the ideas track). The following is a summary of the knowledge graph reasoning challenge held
so far, along with the number of submitted works:
• 1st Knowledge Graph Reasoning Challenge 2018:
– Provided KG: The Speckled Band
– Number of submitted works: 8 (Main Track: 5, Idea Track: 3)
• 2nd Knowledge Graph Reasoning Challenge 2019:
– Diference from the previous year: Introduction of the Tool Track (solving partial
tasks) and addition of four KGs
– Provided KGs: The Speckled Band, A Case of Identity, The Crooked Man, The
Dancing Men, and The Devil’s Foot
– Number of submitted works: 9 (Main Track: 4, Tool Track: 2, Idea Track: 3)
• 3rd Knowledge Graph Reasoning Challenge 2020:
– Diference from the previous year: Refinement of the existing KGs and addition of
three KGs
– Provided KGs: The Speckled Band, A Case of Identity, The Crooked Man, The
Dancing Men, The Devil’s Foot, The Abbey Grange, Silver Blaze, and The Resident
Patient
– Number of Submitted Works: 7 (Main Track: 3, Tool Track: 2, Idea Track: 2)
• 1st Knowledge Graph Reasoning Challenge for Students 2021:
– Diference from the previous year: Applicants were limited to students to foster
young talent. In addition, the existing KGs were refined.
– Provided KGs: same as the previous year
– Number of Submitted Works: 5 (Main Track: 2, Tool Track: 3)
• 1st International Knowledge Graph Reasoning Challenge 2023 (IKGRC2023):
– The challenge was internationalized. In addition, the existing KGs were refined.
– Provided KGs: same as the previous year
– Number of Submitted Works: 6 (Main Track: 2, Tool Track: 2, Idea Track: 2)</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Knowledge Graphs of Mystery Stories</title>
      <sec id="sec-3-1">
        <title>3.1. Knowledge graph schema</title>
        <p>We decided on a basic policy of describing the people, things, and places involved in each scene,
focusing on the scenes depicted in the scenes and the relationships between the scenes. When
designing the schema, in addition to expressiveness to represent the subject novels, we also
considered the ease of constructing the knowledge graph and the convenience of providing it
as data for inference processing, and decided on a schema with mainly 5W1H edges, focusing
on scenes. Thus, a mystery story is represented by each scene and the relationships among
scenes. Each scene3 in a mystery story is assigned a unique internationalized resource
identiifer (IRI), which is used as the subject to describe a scene in the story by adding information
about people, organizations, and places as objects. The relationships between scenes explain
the causal relationships of chronological actions and events by referring to the IRIs. This is
how a series of storylines is expressed. In addition, rules and table data can be linked to
describe common sense data such as axioms and to represent information such as timetables. The
content of the story is stored as literal values for natural language processing. Figure 1 shows
an example knowledge graph.</p>
        <p>The following basic properties are provided for describing each scene. In order to summarize
the information associated with a scene, these properties take the scene as their subject. Note
that it is not in the general &lt;subject, predicate, object&gt; format. Figure 2 shows an example
scene description.</p>
        <p>• subject: a person or thing that is the subject in the description of the scene.
• hasPredicate: a predicate that describes the content of the scene.
• hasProperty: a property of the person or thing that is the subject of the scene description.
• Objects that describe the details of the scene: who/whom, where, when, what, how, etc.
• Relationships between scenes: then, if, because, etc.
• time: an absolute time when the scene occurred (xsd:DateTime).</p>
        <p>• source: original text of the scene (English/Japanese; literal).</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Procedure of knowledge graph construction</title>
        <p>The following procedure was used to convert the eight mystery stories into knowledge graphs:
1. Extract sentences necessary for deduction from mystery stories (in Japanese) whose
copyrights have expired. For each novel, about 300 to 500 sentences were extracted manually.
Those sentences were selected mainly where they were closely related to the identification
of the culprits.
3Only scenes that are judged to be necessary for the deduction are converted to knowledge graphs.
2. Rewriting the original text into sentences with clear a subject and object (i.e., short
sentences). One short sentence corresponds to one scene on the knowledge graph.
3. Assign semantic roles (e.g., 5W1H) to phrases using natural language processing tools. The
results are output as a predicate and an object for each scene in a spreadsheet, and are
visually checked at the end.
4. Control orthographical variants. We eliminate any notational distortions on a
novel-bynovel basis and across novels as much as possible during the construction phase.
5. Add relationships between scenes (e.g., temporal relationships).
6. Translate the source text into English and convert the entire text into a knowledge graph.</p>
        <p>Note that the series of tasks were performed by part time students and software engineers
(general programmers, not advanced knowledge engineers). The costs for knowledge graph
construction of each story are as follows: (Step 1) 3 hours per person, (Step 2) 20 hours per
person, (Step 3) 5 hours per person, (Step 4) 7 hours per person, (Step 5) 3 hours per person,
and (Step 6) about 1 hour per person.</p>
        <p>
          In addition to the above procedure, we have made a series of improvements to the content
of the descriptions in the knowledge graph. In [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] , we present a guideline consisting of ten
items/steps we found through the refinement process of the knowledge graphs.
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Example queries</title>
        <p>The latest version of the knowledge graph constructed for the Challenges is available on
GitHub4 and provides a SPARQL endpoint for searching. Here, we show some examples of
SPARQL queries for the knowledge graph focusing on scenes. In the SPARQL endpoint, graph
IRI is set for each novel. Therefore, the users can specify the target novels for searching using
FROM clause.</p>
        <p>Listing1 shows a SPARQL query for obtaining all triples of scene 036 in “The Speckled Band”.
Because each scene is represented by triples whose subjects are IRI of a scene, the users can
obtain details of scene descriptions to get its property-object pairs. If a kind of property is
specified in the query (e.g., kgc:hasPredicate), its object is obtained.</p>
        <p>Contrary to Listing1, by specifying a property-object pair, it is possible to retrieve scenes
containing that combination. For example, Listing2 shows a SPARQL query for obtaining all
scenes whose subjects (performers of the action in the scene) are Holmes in “The Speckled
Band”.</p>
        <p>Listing 1: Get all triples of Scene 036 in “The Speckled Band”
PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;
PREFIX xsd: &lt;http://www.w3.org/2001/XMLSchema#&gt;
PREFIX kgc: &lt;http://kgc.knowledge-graph.jp/ontology/kgc.owl#&gt;
PREFIX pred: &lt;http://kgc.knowledge-graph.jp/data/predicate/&gt;
PREFIX sb: &lt;http://kgc.knowledge-graph.jp/data/SpeckledBand/&gt;
SELECT ?p ?o
FROM &lt;http://kgc.knowledge-graph.jp/data/SpeckledBand&gt;
WHERE {</p>
        <p>sb:036 ?p ?o .
}
Query result:
?s
kgc:source
kgc:source
rdf:type
kgc:hasPredicate
kgc:subject
kgc:when
kgc:when
kgc:where
kgc:whom
kgc:time
?p
” ジュリアは２年前に海軍少佐とハロウで知り合う”@ja
”Julia gets acquainted with Major Navy two years ago at Harrow”@en
kgc:Situation
pred:meet
sb:Julia
sb:2_years_ago
sb:1880-12-24T10
sb:Harrow
sb:lieutenant_commander
”1880-12-24T10:00:00”^^xsd:dateTime
4https://github.com/KnowledgeGraphJapan/KGRC-RDF/tree/ikgrc2023
Listing 2: Get all scenes whose subjects (performers of the action in the scene) is Holmes in
“The Speckled Band”
PREFIX kgc: &lt;http://kgc.knowledge-graph.jp/ontology/kgc.owl#&gt;
PREFIX sb: &lt;http://kgc.knowledge-graph.jp/data/SpeckledBand/&gt;
SELECT ?s
FROM &lt;http://kgc.knowledge-graph.jp/data/SpeckledBand&gt;
WHERE {</p>
        <p>?s kgc:subject sb:Holmes .</p>
        <p>Listing3, shows a query to obtain all relationships among scenes in “The Speckled Band”. In
this query, subjects and objects of triples are specified as scenes (sub-class of kgc:Scene). These
relationships represent the flow of the story. It is an important feature of this knowledge graph
of mystery stories.</p>
        <p>Listing 3: Get all relationships among scenes in “The Speckled Band”.</p>
        <p>SPARQL query:
PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;
PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;
PREFIX kgc: &lt;http://kgc.knowledge-graph.jp/ontology/kgc.owl#&gt;
PREFIX sb: &lt;http://kgc.knowledge-graph.jp/data/SpeckledBand/&gt;
SELECT ?s ?p ?o
FROM &lt;http://kgc.knowledge-graph.jp/data/SpeckledBand&gt;
WHERE {
?s rdf:type/rdfs:subClassOf kgc:Scene .
?o rdf:type/rdfs:subClassOf kgc:Scene .</p>
        <p>?s ?p ?o .
}
?s
sb:001
sb:002
sb:014
Query result(parts):
?p
kgc:at_the_same_time
kgc:at_the_same_time
kgc:then
?o
sb:002
sb:003
sb:015
sb:015
sb:016
...</p>
        <p>kgc:then
kgc:if</p>
        <p>Listing4 is an example to search across diferent stories. Because the same action is
represented using the same IRI across all stories, it is possible to search across them by specifying
the IRI of action. Its targets are selected using FROM clause.</p>
        <p>Listing 4: Get all scenes whose predicate is “meet” across the eight stories with its subjects and
target objects.
PREFIX kgc: &lt;http://kgc.knowledge-graph.jp/ontology/kgc.owl#&gt;
PREFIX pred: &lt;http://kgc.knowledge-graph.jp/data/predicate/&gt;
PREFIX sb: &lt;http://kgc.knowledge-graph.jp/data/SpeckledBand/&gt;
PREFIX df: &lt;http://kgc.knowledge-graph.jp/data/DevilsFoot/&gt;
PREFIX dm: &lt;http://kgc.knowledge-graph.jp/data/DancingMen/&gt;
PREFIX ci: &lt;http://kgc.knowledge-graph.jp/data/ACaseOfIdentity/&gt;
PREFIX cm: &lt;http://kgc.knowledge-graph.jp/data/CrookedMan/&gt;
PREFIX ag: &lt;http://kgc.knowledge-graph.jp/data/AbbeyGrange/&gt;
PREFIX rp: &lt;http://kgc.knowledge-graph.jp/data/ResidentPatient/&gt;
SELECT DISTINCT ?s ?subj ?obj
FROM &lt;http://kgc.knowledge-graph.jp/data/SpeckledBand&gt;
FROM &lt;http://kgc.knowledge-graph.jp/data/DevilsFoot&gt;
FROM &lt;http://kgc.knowledge-graph.jp/data/SilverBlaze&gt;
FROM &lt;http://kgc.knowledge-graph.jp/data/DancingMen&gt;
FROM &lt;http://kgc.knowledge-graph.jp/data/ACaseOfIdentity&gt;
FROM &lt;http://kgc.knowledge-graph.jp/data/CrookedMan&gt;
FROM &lt;http://kgc.knowledge-graph.jp/data/AbbeyGrange&gt;
FROM &lt;http://kgc.knowledge-graph.jp/data/ResidentPatient&gt;
WHERE {
?s kgc:hasPredicate pred:meet .
?s kgc:subject ?subj.</p>
        <p>?s kgc:whom ?obj.
}
Query result(parts):
?s
cm:077
cm:178
cm:178
cm:240
cm:240
sb:036
sb:081
sb:152
sb:152
df:438
?subj
cm:Barclay
cm:Nancy
cm:Morrison
cm:Holmes
cm:Watson
sb:Julia
sb:Helen
sb:Holmes
sb:Watson
df:Sterndale
?obj
cm:Nancy
cm:Henry
cm:Henry
cm:Henry
cm:Henry
sb:lieutenant_commander
sb:Roylott
sb:Helen
sb:Helen
df:Mortimer</p>
        <p>number of scenes hasPredicate hasProperty hasPart subject infoSource infoReceiver</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Published Knowledge Graphs</title>
        <p>
          Knowledge graphs of mystery stories were constructed based on the above schema and
published as open datasets for the Knowledge Graph Reasoning Challenge. They were extended
and refined through five challenges according to feedbacks from participants and discussion
among organizers. Some of the findings from these processes have been compiled into
guidelines for knowledge graph construction [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
        </p>
        <p>Here we compare the published knowledge graphs of the eight mystery stories and describe
their appearance. Since these knowledge graphs are structured around scenes, comparisons
between novels are also made around scenes. First, we compare the properties used to
deifne scenes. Then, the relationships between scenes are compared, and finally the predicates
(actions) used in each scene are compared.</p>
        <p>Table 1 compares number of scenes in each novel and main properties used to define scenes.
These knowledge graphs consist of around 300 to 500 scenes while “The Dancing Men” has only
200 scenes. It is because the main topic of “The Dancing Men” is deciphering the cipher. Each
scene must have hasPredicate or hasProperty property. hasPredicate represent a predicate that
describes the content of the scene and hasProperty represents a property of the person or thing
that is the subject of the scene description. Subjects of the predicate/property are described
by subject property. The number of these three properties does not vary much among novels.
infoSource property is used to describe information sources when its scene type is Thought or
Statement. infoReceiver property shows receiver of some remarks while it is used only once
across all novels. hasPart property shows parts of some scenes while it is used only in “The
Speckled Band”.
total
sentence into multiple scenes, but the usage of the two novels needs to be closely examined.
Other than what, then is the most commonly used to indicate the order of scenes. This is not
surprising, since the order of scenes is important in a story. The use of because and why to
express reasons varies widely from novel to novel.</p>
        <p>Finally, we compare the use of predicates for actions. In the eight novels as a whole, 665
diferent actions are used. Of these actions, 53 were used more than 10 times. Table 5
compares the top 10 most frequently used predicates across the eight novels. The table shows that
predicates for basic actions such as say, have, and exist are frequently used. However, there
are diferences in the actions used in each novel.</p>
      </sec>
      <sec id="sec-3-5">
        <title>3.5. Example Works using the Knowledge Graphs</title>
        <p>
          Through the challenges, various approaches have been proposed by researchers from private
companies and universities to develop AI systems that can infer and explain the truth of
mysterious crimes. The lists of proposed works and summaries of each work are shown in the [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
There are two major approaches to use the knowledge graphs for reasoning. One is logical
explanation based on ontology and the other is interpretation of knowledge graph embedding.
        </p>
        <p>
          For example, Ugai et al. (IKGRC2023, Main Track) [
          <xref ref-type="bibr" rid="ref3">3, 6</xref>
          ] considered that the analysis of
motive, opportunity, and means is necessary to identify the culprit, and they prepared this
knowledge as external knowledge and combined it with the event-centric knowledge graphs
provided in this challenge to make inferences. Specifically, ontologies about the motive and
means of murder are manually created and used for inference. The knowledge graphs were
extended with external knowledge by linking these ontologies to the knowledge graphs of
mystery stories via ConceptNet and WordNet. The extended knowledge graphs were embedded in
a low-dimensional vector space to estimate the scores that each character could possess for the
motive and means of the crime. These scores were then utilized to infer the culprit. Ontologies
of motives and means help logically explain the reasons deduced.
        </p>
        <p>Kurokawa (KGRC2020, Main Track) [7] converted the event-centric knowledge graphs
provided in this challenge into the standard triples and proposed an approach to predict the
culprits of the stories using link prediction with multiple knowledge graph embedding methods
(TransE [8], PTransE [9], R-GCN [10]). To integrate multiple KGs, Kurokawa applied
parameter sharing proposed in ITransE, which shares the same embeddings for common entities, e.g.,
”Holmes” and ”Watson,” and for common relations, e.g., ”kill.” In addition, an external
commonsense KG, ConceptNet [11], was introduced to enrich semantic information. Kurokawa
also conducted an experiment using XKE [12] and GNNExplainer [13] to show plausible graph
paths as a basis for the prediction results.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Related Work</title>
      <p>Knowledge graphs can be used to describe static relationships between things, such as in
product data, a thesaurus, and human relationships, as well as events that occur in space and time,
such as observational data. In recent years, knowledge graphs of events or scenes, such as
video content [14], historical fact [15], news [16], and narrative content [17] have been
actively studied. VirtualHome2KG [14] can generate event-centric knowledge graphs of the
content of videos of daily activities simulated in virtual space and demonstrate various
applications, including accident risk detection. EventKG [15] is a knowledge graph describing 690,000
contemporary and historical events and incidents for the purpose of answering questions and
generating histories (timelines) from a specific perspective. The schema is based on the
aforementioned Simple Event Model (SEM) [18] and is extended to express temporal relationships,
and so forth. It has many similarities with our schema, such as definitions of relationships
among events. However, the granularity of its target events is considerably larger than in
our scenes, and it is dificult to represent information such as who, when, and how for each
scene using EventKG’s model (although it is possible to describe them, it would be a complex
graph, and it would be dificult to construct and search the dataset). ECKG [ 16] provides its
own model to annotate information extracted when building a knowledge graph directly from
news events written in a natural language. It provides a unique model. However, the model
is simple (only who, what, where, and when) because automatic extraction is the subject
matter. Drammer [17] is not simply a chronological representation and comparison of narrative
content, but is fiction–specific. It is an ontology that includes conflicts between characters,
segmentations of the narrative, and definitions of emotion and belief for more dramatic
representations. It was constructed by analyzing many dramas, but its purpose is diferent from
that of this study, which is intended to represent facts (including falsehoods) in the real world.</p>
      <p>By contrast, in this study we (1) constructed knowledge graphs that convert the background
of the case and the characters into knowledge, using a mystery novel as a subject, and (2)
conducted a technical challenge to identify correctly the culprit and cause of a case or an
accident from given information using inference and estimation techniques to explain the reasons
(evidence, tricks, etc.) for such identification appropriately.</p>
      <p>As for this technical challenge, in top conferences of AI and neural networks, such as IJCAI,
AAAI, NIPS, and ICML, papers and workshops that have “explainability” as a keyword and
that analyze the properties of AI models have significantly increased since 2016. However, no
other research activity exists like the challenge discussed in this work, which uses knowledge
graphs, including social problems as common test sets, and tries to solve the problems with
explainability (i.e., using XAI), aiming to integrate inductive estimation and deductive reasoning.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>In this paper, we discussed datasets of mystery stories for the Knowledge Graph Reasoning
Challenge. Its feature is a knowledge graph schema focusing on scenes of novels so that the
user can search its contents according to the flow of the stories. They were built on eight
Sherlock Holmes short stories and published as open datasets.</p>
      <p>Through comparison of the knowledge graphs across novels, we discussed how the proposed
schema and vocabularies are used in them. Each novel has a narrative flow that makes sense
to the reader, and these are structured as a knowledge graph. Therefore, we believe that such
a comparison ofers suggestions for considering the representation of stories as knowledge.
In this comparison, we observed that there is variation in properties and the vocabulary of
actions from novel to novel. Based on the results of this comparison, it is necessary to review
the schema, systematize the vocabulary, and examine the policy for constructing a knowledge
graph.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This paper is based on results obtained from a project, JPNP20006, commissioned by the New
Energy and Industrial Technology Development Organization (NEDO), and Japan Society
Promotion of Science (JSPS) KAKENHI Grant Numbers JP19H04168 and JP22K18008.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>T.</given-names>
            <surname>Kawamura</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Egami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Tamura</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Hokazono</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ugai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Koyanagi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Nishino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Okajima</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Murakami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Takamatsu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sugiura</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Shiramatsu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Kozaki</surname>
          </string-name>
          ,
          <source>Report on the First Knowledge Graph Reasoning</source>
          Challenge 2018 -
          <article-title>Toward the eXplainable AI System -</article-title>
          ,
          <source>9th Joint Int'l Semantic Tech. Conf. (JIST2019)</source>
          (
          <year>2019</year>
          )
          <fpage>18</fpage>
          -
          <lpage>34</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>K.</given-names>
            <surname>Kozaki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Egami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Matsushita</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ugai</surname>
          </string-name>
          , T. Kawamura,
          <article-title>Knowledge graph reasoning techniques through studies on mystery stories -report on the knowledge graph reasoning challenge 2018 to 2020, 1st</article-title>
          <source>Int'l Workshop on Knowledge Graph Reasoning for Explainable Artificial Intelligence (KGR4XAI)</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T.</given-names>
            <surname>Ugai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Koyanagi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Nishino</surname>
          </string-name>
          ,
          <article-title>A logical approach to criminal case investigation</article-title>
          ,
          <source>1st Int'l Workshop on Knowledge Graph Reasoning for Explainable Artificial Intelligence (KGR4XAI)</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Katsushima</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Anada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Egami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Fukuda</surname>
          </string-name>
          ,
          <article-title>A criminal detection of mystery novel using the principal components regression analysis considering co-occurrence words</article-title>
          ,
          <source>1st Int'l Workshop on Knowledge Graph Reasoning for Explainable Artificial Intelligence (KGR4XAI)</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>T.</given-names>
            <surname>Kawamura</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Egami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Matsushita</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ugai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Fukuda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Kozaki</surname>
          </string-name>
          ,
          <article-title>Contextualized scene knowledge graphs for xai benchmarking</article-title>
          ,
          <source>IJCKG '22</source>
          ,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2023</year>
          , pp.
          <fpage>64</fpage>
          -
          <lpage>72</lpage>
          . URL: https://doi.org/10.1145/3579051. 3579061. doi:
          <volume>10</volume>
          .1145/3579051.3579061.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>