<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Sparklis over PEGASE Knowledge Graph: A New Tool for Pharmacovigilance?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Carlos Bobed</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Laura Douze</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sébastien Ferré</string-name>
          <email>sebastien.ferre@irisa.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Romaric Marcilly</string-name>
          <email>romaric.marcilly@univ-lille.fr</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Univ Rennes, CNRS, IRISA Campus de Beaulieu</institution>
          ,
          <addr-line>35042 Rennes</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Univ. Lille, INSERM, CHU Lille, CIC-IT / Evalab 1403 Centre d'Investigation clinique</institution>
          ,
          <addr-line>EA 2694, F-59000 Lille</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Pharmacovigilance is concerned with studying the adverse effects of pharmaceutical products. In this field, pharmacovigilance specialists experience several difficulties when searching and exploring their patient data, despite the existence of standardized terminologies (MedDRA). In this paper, we present our approach to enhance the way pharmacovigilance specialists search and explore their data. First, we have developed a knowledge graph that relies on the OntoADR ontology to semantically enrich the MedDRA terminology with SNOMED CT concepts, and that includes anonymized patient data from FAERS (the FDA's Adverse Event Reporting System). Second, we have chosen and extended a semantic search tool, Sparklis, according to the user requirements that we have identified in pharmacovigilance. We report the results of a usability evaluation performed by human factors specialists to check the benefits of our proposal.</p>
      </abstract>
      <kwd-group>
        <kwd>Knowledge Graph</kwd>
        <kwd>Semantic Search</kwd>
        <kwd>Query Building</kwd>
        <kwd>Pharmacovigilance</kwd>
        <kwd>MedDRA</kwd>
        <kwd>Sparklis</kwd>
        <kwd>Usability</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Pharmacology has provided humanity with a huge improvement in life quality. However,
while new drugs are thoroughly tested before being released for general consumption,
all their possible side effects cannot be completely foreseen. Thus, we need methods
to discover and track those adverse effects in order to improve the safety and efficacy
of drugs. Pharmacovigilance is defined by the World Health Organization (WHO) as
"the science and activities relating to the detection, assessment, understanding and
prevention of adverse effects or any other drug-related problem".</p>
      <p>Pharmacovigilance specialists manage and report adverse drug reactions (ADRs)
noticed by healthcare professionals and patients to different healthcare authorities. To do
so, they must code their reports using one or several terms that closely capture the
original verbatim description of the ADRs. In this context, standardized vocabularies
are clearly useful to unify the coding of the reports. For this purpose, MedDRA
(Medical Dictionary for Regulatory Activities; MedDRA® is a trademark of the IFPMA)
is recommended by the ICH (International Council for Harmonisation of Technical
Requirements for Pharmaceuticals for Human Use)
for the electronic transmission of individual case safety reports [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], and is used in most
countries. However, as pointed out by Bousquet et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], "its main limitation comes
from its standard terminological format, which restricts the possibility of accessing
terms based on their semantics". Therefore, depending on their experience, expertise,
and interpretation of the meaning of the MedDRA terms, two pharmacovigilance
specialists may use different MedDRA terms to code the same report. To solve this
problem, Bousquet et al. proposed OntoADR [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], an ontology that describes MedDRA
terms by their actual semantics, expressed with a combination of SNOMED CT classes
and properties. However, until now, pharmacovigilance specialists could not use those
semantic descriptions because they lacked appropriate tools to help them build the
semantic queries required to retrieve sets of MedDRA terms.
      </p>
      <p>
        In this paper, we present our solution to support pharmacovigilance specialists in the
exploration and search of their database of patient cases, the first step in the process
of detecting new adverse effects of drugs. First, we studied the existing tools and found
a lack of proper tools to ease the documentation stage of pharmacovigilance specialists' work.
Then, we built a knowledge graph that integrates different knowledge sources and adopts
Semantic Web standards, which makes all the relevant data easily
accessible and provides the flexibility required to extend the graph on demand. Finally, in
order to improve the way pharmacovigilance specialists search for cases, we adopted and
extended Sparklis [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], a query builder that eases the exploration and querying of any
SPARQL endpoint without requiring users to master SPARQL itself. This work takes place in
the PEGASE project, whose aim is to improve pharmacovigilance and signal detection.
      </p>
      <p>The rest of the paper is organized as follows. Section 2 presents a literature analysis of
usability evaluations of controlled vocabulary searching tools along with a benchmark of
existing tools in pharmacovigilance. Section 3 describes the PEGASE Knowledge Graph.
Section 4 describes Sparklis and the extensions we have implemented. Section 5 presents
the knowledge graph validation we have performed, and the qualitative usability study
we have carried out on Sparklis. Finally, Section 6 draws some conclusions and presents
the next stages of the project.</p>
      <p>This research is supported by ANR project PEGASE (ANR-16-CE23-0011-08), project
TIN2016-78011-C4-3-R (AEI/FEDER, UE), and DGA/FEDER.</p>
    </sec>
    <sec id="sec-2">
      <title>Analysis of Existing Tools</title>
      <p>
        Our main objective was to identify the desirable usability qualities and the defects of
the tools used by pharmacovigilance specialists. To do so, we performed: (i) a literature
review of usability evaluations of tools supporting searches using controlled vocabularies
(not limited to pharmacovigilance), and (ii) an evaluation of the tools currently used
by French pharmacovigilance specialists. A total of 908 papers were identified in the
PubMed, Web of Science, and Scopus databases, six of which met all eligibility criteria
and were analyzed in depth. The eligibility criteria are omitted due to space restrictions;
the most discriminative one was to include an objective evaluation of the interfaces
(see http://bit.ly/2qA9Jkl for details on the selection process). The six papers are:
1) Walji et al. [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], on coding dental diagnoses;
2) Bakhshi-Raiez et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], on coding the reason for admission to the intensive care
unit (ICU); 3) Peute et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], on searching ICU patient data; 4) Shiri et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], on searching keywords over a multilingual thesaurus; 5) Vega-Gorgojo et al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], on searching for collaborative tools; and 6) Sutcliffe et al. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], on visual
user interfaces for information search.
      </p>
      <p>In addition, we visited four French pharmacovigilance
centers to identify the software in use, and identified four main tools:</p>
      <p>- BNPV (National Bank of PharmacoVigilance): the main tool used by
pharmacovigilance specialists to report and search cases of ADRs.</p>
      <p>- MedDRA browser: it allows searching MedDRA terms along with Standardized
MedDRA Queries (SMQs).</p>
      <p>- PA.PV: a tool provided by the French Agency for the Safety of Health Products
to help pharmacovigilance specialists choose MedDRA terms and SMQs.</p>
      <p>- Vigilyze: a worldwide pharmacovigilance database provided by the WHO.</p>
      <p>Two human factors specialists went through the functionalities of each software tool
to identify its significant usability qualities and defects. A total of 62 desirable usability
features or defects were drawn from the literature review, and 86 from the benchmark
of the most used software. Overall, the main lessons learned were:</p>
      <p>1. Searches should include synonyms and related terms.</p>
      <p>2. One should be able to search terms either by entering keywords (without hierarchy
level constraints) or by browsing the term hierarchy.</p>
      <p>3. Lists and hierarchies of terms should be ordered in a meaningful way.</p>
      <p>4. The query, the number of results, and the list of results should be displayed on the same page.</p>
      <p>5. Results should automatically be updated with the query.</p>
      <p>6. Users should be provided with hierarchy information to distinguish similar terms.</p>
      <p>7. Users should be informed of the number of ADR reports each term triggers.</p>
      <p>8. Users should be helped to build their request by an intuitive interface.</p>
      <p>9. It should be possible to save a request for reuse.</p>
      <p>
        Those recommendations led to the choice of Sparklis and to its adaptation to the
pharmacovigilance context. Indeed, Sparklis [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] (see Section 4 for a description) already
meets criteria 4, 5, 8, and 9 to a large extent (as well as criterion 7 in an approximate way) by
guiding its users in the incremental building of their request, while returning results and lists of
suggested terms at every step. The main missing feature was the handling of hierarchies,
which concerns criteria 2, 3, and 6; we added it as part of the project.
      </p>
    </sec>
    <sec id="sec-3">
      <title>PEGASE Knowledge Graph</title>
      <p>
        In this section, we detail the knowledge sources we have used to build the PEGASE
Knowledge Graph and its structure. First, we present MedDRA and OntoADR [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], the
core ontology we use, and then we move on to the additional sources we have included,
Standardized MedDRA Queries (SMQs) and anonymized patient data (obtained from
FAERS). The core structure of the PEGASE Knowledge Graph can be seen in Figure 1.
It currently contains 3,257,389 triples without taking into account FAERS data (with
the patient data of three months, it grows to 28,125,629 triples).
      </p>
      <p>
        OntoADR. This ontology [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] is the result of progressive efforts to semantically
structure the Medical Dictionary for Regulatory Activities (MedDRA) terminology.
In the context of pharmacovigilance, MedDRA is a reference terminology to code and
give a precise description of ADRs and related issues [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. In particular, OntoADR is
an ontology which connects MedDRA terms to SNOMED CT concepts, making it
possible to search for MedDRA terms semantically, e.g. to search for all terms about
ADRs taking place in the skin. To establish the mapping between MedDRA terms and
SNOMED CT, OntoADR exploits the UMLS metathesaurus [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], which provides an
initial alignment. In addition, several curation steps and algorithms were applied
to further refine the set of relationships.
      </p>
      <p>In order to include OntoADR in our Knowledge Graph, we had to make some
adaptations. To model MedDRA, we introduced the concept MedDRATerm, which has five
subconcepts corresponding to the five levels of the MedDRA hierarchy (see Figure 1).
However, to model the hierarchy relationship between terms, instead of using the subclass
relationship (i.e., formal subsumption), we introduced the property medDRA parent.
In this way, we can navigate the hierarchy without triggering unexpected inferences.
For example, "Aortic aneurysm" is a Preferred Term (PT), whose medDRA parent is
"Aortic aneurysm and dissections", a High Level Term (HLT).</p>
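      <p>As an illustration of this modelling choice, the following minimal Python sketch stores the hierarchy as plain parent links and walks them upwards, with no subsumption reasoning involved. The two terms are those of the example above; the dictionary layout and the function name ancestors are assumptions made for illustration and are not taken from the PEGASE implementation.</p>
      <preformat>
```python
# Toy fragment of the MedDRA hierarchy stored as "term -> list of parents".
# Only the PT/HLT pair comes from the example in the text; the data layout
# is an illustrative assumption, not the actual PEGASE representation.
MEDDRA_PARENT = {
    "Aortic aneurysm": ["Aortic aneurysm and dissections"],
}

# Level of each term (PT = Preferred Term, HLT = High Level Term).
LEVEL = {
    "Aortic aneurysm": "PT",
    "Aortic aneurysm and dissections": "HLT",
}


def ancestors(term):
    """Return all terms reachable through parent links, nearest first."""
    found = []
    todo = list(MEDDRA_PARENT.get(term, []))
    while todo:
        current = todo.pop(0)
        if current in found:  # already visited, also guards against cycles
            continue
        found.append(current)
        todo.extend(MEDDRA_PARENT.get(current, []))
    return found


if __name__ == "__main__":
    for t in ancestors("Aortic aneurysm"):
        print(t, LEVEL[t])
```
      </preformat>
      <p>Because the links are ordinary property values rather than subclass axioms, following them is a simple graph traversal and a reasoner has nothing to infer from them.</p>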
      <p>
        To include the SNOMED CT definitions, we had to adapt their representation level.
On the one hand, we had MedDRA terms, all of which were instances; on the other
hand, we had SNOMED CT terms, all of which were concepts. To resolve this mismatch,
we materialized the SNOMED CT concept hierarchy and treated the concepts as instances
(with a slight abuse of language: we flattened them in the RDF graph and allowed for
meta-modeling, i.e., classes of SNOMED CT concepts).
This also allowed us to introduce additional hierarchies that provide different navigation
dimensions. In particular, we introduced a top-level hierarchy of SNOMED CT
metaconcepts based on the semantic tags that SNOMED CT uses to further refine the
concepts' meaning (see Figure 2). Note that this grouping coexists with the subclass
hierarchy of SNOMED CT concepts. This does not lead to inconsistencies because our
knowledge graph is in RDFS, not in OWL. OntoADR relationships between MedDRA
terms and SNOMED CT concepts (see [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] for the complete list) were included as they are.
      </p>
      <p>Continuing the previous example, "Aortic aneurysm" (MedDRA term) is related
to "Aneurysm" (SNOMED concept) via associatedMorphology (OntoADR
property), and to "Abdominal Aorta Structure" (SNOMED concept) via findingSite
(OntoADR property). Furthermore, "Abdominal Aorta Structure" is a BodyStructure
(SNOMED metaconcept) and is linked by rdfs:subClassOf to "Descending aorta structure"
(SNOMED concept). Note how we use the metaconcept hierarchy to group the SNOMED
concepts according to their semantic tags.</p>
      <p>
        SMQs. SMQs are "groupings of MedDRA terms, ordinarily at the Preferred Term
(PT) level that relate to a defined medical condition or area of interest" [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. In general,
SMQs can be seen as disjunctions of terms that are used together in order to perform
searches in a standardized way, although they can be grouped in more complex ways.
We added each SMQ as a new node, related to the terms that it includes. The inclusion
of SMQs in the graph has two main benefits: on the one hand, pharmacovigilance specialists are
used to working with SMQs and now have them directly accessible; on the other
hand, SMQs extend the querying capabilities, as the full power of
SPARQL is now available on top of them.
      </p>
      <p>FAERS Data. Finally, we integrated a source of patient data to show how the
integration capabilities of our knowledge graph can ease the work of pharmacovigilance
specialists. FAERS (FDA Adverse Event Reporting System) is a dataset containing
anonymized reports of adverse drug events, which is gathered and made public
quarterly by the U.S. Food and Drug Administration (FDA). The patient data provided
by FAERS is split into seven large tables, which we integrated as shown in the
resulting model in Figure 3. This model was obtained after an evaluation round with
the human factors specialists in the project's team, in which we brought the FAERS
model closer to the cognitive process of pharmacovigilance specialists.</p>
    </sec>
    <sec id="sec-4">
      <title>Sparklis on the PEGASE Knowledge Graph</title>
      <p>Sparklis (http://www.irisa.fr/LIS/ferre/sparklis/, which includes the presented extensions)
is a tool for retrieving information from RDF knowledge graphs, with the
objective of reconciling the expressivity of SPARQL 1.1 with the usability of point-and-click
user interfaces. Here we present its principles as well as the main extensions that we
have made to address the needs of the PEGASE project.</p>
      <sec id="sec-4-1">
        <title>Sparklis: a SPARQL Query Builder in Natural Language</title>
        <p>
          Sparklis is a query builder in natural language that allows people to explore and query
SPARQL endpoints without any knowledge of SPARQL [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. Sparklis is implemented
as a Web client running entirely in the browser, which directly connects to SPARQL
endpoints to retrieve query results and suggested query elements. It covers a large subset
of SPARQL 1.1 SELECT queries: basic graph patterns including cycles, UNION, OPTIONAL,
NOT EXISTS, FILTER, BIND, complex expressions, aggregations, GROUP BY, and ORDER BY. All
those features can be combined in a flexible way. Results are presented as tables, and also
on maps and as slideshows. A configuration panel offers a few options to
adapt to different endpoints (e.g., GET/POST, labelling properties, and language tags).
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>Applying Sparklis to the PEGASE Knowledge Graph</title>
        <p>Sparklis requires very little configuration to be applied to the PEGASE Knowledge
Graph. It is enough to provide the URL of the SPARQL endpoint (not given here
because of restrictive licences on MedDRA and SNOMED), and to choose
property rdfs:label for the labelling of entities, classes, and properties. As the end
users are French pharmacovigilance specialists, we also set the user interface and
the labels to the French language.</p>
        <p>Figure 4 shows a screenshot of Sparklis on PEGASE data, taken during the process
of building a query (a screencast of the whole query building is available at
http://www.irisa.fr/LIS/common/documents/pegase2018/#ExtraCase). The current query
(at the top) selects preferred terms (PT)
in MedDRA whose finding site is (a subconcept of) "Skin and subcutaneous tissue
structure", and whose associated morphology is (a subconcept of) various morphologic
abnormalities. A first abnormality, "Blister", has already been selected, and the user
is in the process of selecting (at the center) a disjunction of three more abnormalities
("Vesicle", "Vesiculobullous rash", "Vesicular rash"). The keyword "vesic" was entered at
the top of the list of suggested terms in order to ease their retrieval among a long list of
suggestions. Sparklis suggests modifications to be applied to the current focus (here, the
focus is on the associated morphology of the selected preferred terms): at the middle
left, Sparklis suggests classes and properties to refine the query; at the middle center,
individuals denoting associated morphologies relevant to the query; and at the middle
right, query modifiers and operators (e.g., "and", "or", "number of"). The table of
results of the current query is shown at each step. Here, it shows the selected preferred
terms along with their finding sites and associated morphologies.</p>
        <p>Handling hierarchies. As the example in Figure 4 shows, it is important to take into
account hierarchies of MedDRA terms or SNOMED concepts in the evaluation of queries.
For instance, when asking for "preferred terms whose finding site is Skin...", what is
really meant is: "preferred terms whose finding site is any subconcept of Skin...", i.e.,
any part of the skin (e.g., "epidermis", "subepidermal region"). The correct translation
to SPARQL requires the use of property paths like</p>
        <p>?x ontoadr:findingSite/rdfs:subClassOf* snomed:Skin .</p>
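        <p>To make the meaning of this property path concrete, the following Python sketch evaluates it over a handful of toy triples: one findingSite step followed by zero or more rdfs:subClassOf steps. The MedDRA term identifiers, the triple list, and the helper names are hypothetical and chosen for illustration only; a real SPARQL engine evaluates such paths itself.</p>
        <preformat>
```python
# Toy triples. The meddra:Term* identifiers are hypothetical; the skin and
# epidermis concepts echo the example in the text.
TRIPLES = [
    ("meddra:TermA", "ontoadr:findingSite", "snomed:Epidermis"),
    ("meddra:TermB", "ontoadr:findingSite", "snomed:AbdominalAorta"),
    ("meddra:TermC", "ontoadr:findingSite", "snomed:Skin"),
    ("snomed:Epidermis", "rdfs:subClassOf", "snomed:Skin"),
    ("snomed:Skin", "rdfs:subClassOf", "snomed:BodyStructure"),
]


def objects(subject, predicate):
    """All objects o such that (subject, predicate, o) is a triple."""
    return [o for s, p, o in TRIPLES if s == subject and p == predicate]


def reaches(start, predicate, target):
    """True if target is reachable from start in zero or more steps."""
    todo, seen = [start], set()
    while todo:
        node = todo.pop()
        if node == target:
            return True
        if node in seen:
            continue
        seen.add(node)
        todo.extend(objects(node, predicate))
    return False


def terms_with_finding_site_under(concept):
    """Evaluate: ?x ontoadr:findingSite/rdfs:subClassOf* concept ."""
    found = set()
    for s, p, o in TRIPLES:
        if p == "ontoadr:findingSite" and reaches(o, "rdfs:subClassOf", concept):
            found.add(s)
    return sorted(found)


if __name__ == "__main__":
    print(terms_with_finding_site_under("snomed:Skin"))
```
        </preformat>
        <p>The zero-step case matters: a term whose finding site is the skin concept itself is returned along with terms located in any of its subconcepts.</p>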
        <p>A number of problems made such a query impossible to express in Sparklis, and thus required an
extension to handle hierarchies in the building and evaluation of queries. First, it was
possible to build a sequence of properties but not to apply the *-operator (transitive closure).
Therefore, it was possible to reach the parent or grandparent concepts but not all
ancestors at once. Second, even if the *-operator had been available, it would have been tedious to explicitly
cross the property defining the hierarchy (rdfs:subClassOf in the example) and to
apply transitive closure. Third, once an element of the hierarchy was selected ("Skin..."
in the example) and the query focus set on it, only that element was visible, whereas
the user would expect to see the part of the hierarchy below and above that element.</p>
        <p>We solved those problems by modifying Sparklis with several extensions and
adaptations. What made this particularly tricky is that the new features had to combine smoothly
with the numerous existing features, e.g., crossing properties forward and backward, and
Boolean coordinations of verb phrases and noun phrases. It was also important to do
this in a generic way so as to handle various kinds of hierarchies. The PEGASE knowledge
graph has two hierarchies based on two different properties (medDRA parent for
MedDRA terms, rdfs:subClassOf for SNOMED concepts), and other applications could
have other hierarchies, e.g., hierarchies of geographical places or historical periods.</p>
        <p>First, we added a (hierarchy) in query construct that can turn any property into a
hierarchy. It not only applies transitive closure on the property but also translates the current
focus in a special way, so that all terms below and above the selected terms are retrieved and
can be displayed as a tree that renders the hierarchical relationships between the terms. For
example, the SPARQL translation of the above example, when the focus is on "Skin...", is</p>
        <p>?x ontoadr:findingSite ?y .
?y rdfs:subClassOf* snomed:Skin ; rdfs:subClassOf* ?focus .</p>
        <p>Second, we added schema-level declarations to automate the use of the hierarchy
feature. For example, a declaration states that the range of property findingSite is
hierarchically organized by property rdfs:subClassOf. Therefore, as soon as property
findingSite is crossed, a hierarchy construct based on rdfs:subClassOf is inserted
in the query so that the user immediately sees a hierarchy of concepts in which terms
can be selected. We stated a similar declaration for each OntoADR property whose
range is made of SNOMED concepts.</p>
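        <p>The effect of such schema-level declarations can be sketched as a lookup performed when a property is crossed. In the Python sketch below, the mapping HIERARCHY_OF_RANGE, the function cross_property, and the fixed variable names ?y and ?focus are assumptions made for illustration; the way Sparklis actually stores declarations and generates SPARQL is not described here.</p>
        <preformat>
```python
# Hypothetical declarations: for each property, the property that organises
# its range as a hierarchy. The mapping format is an assumption.
HIERARCHY_OF_RANGE = {
    "ontoadr:findingSite": "rdfs:subClassOf",
    "ontoadr:associatedMorphology": "rdfs:subClassOf",
}


def cross_property(subject_var, prop, selected):
    """Return SPARQL triple patterns for crossing prop towards selected.

    If the range of prop is declared hierarchical, the pattern mirrors the
    two-line translation given in the text; otherwise a plain triple
    pattern is produced.
    """
    hier = HIERARCHY_OF_RANGE.get(prop)
    if hier is None:
        return [f"{subject_var} {prop} {selected} ."]
    return [
        f"{subject_var} {prop} ?y .",
        f"?y {hier}* {selected} ; {hier}* ?focus .",
    ]


if __name__ == "__main__":
    for line in cross_property("?x", "ontoadr:findingSite", "snomed:Skin"):
        print(line)
```
        </preformat>
        <p>With this design, the user never has to mention rdfs:subClassOf explicitly: declaring the hierarchy once per property is enough for every query that crosses it.</p>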
        <p>Other extensions. Finally, a few other less original, yet important, extensions were
implemented. We added support for full-text search (compatible so far with the Jena
Fuseki and Virtuoso RDF stores) and for multi-selection. The former was required
to speed up keyword searches and improve robustness (e.g., handling accented
letters). The latter, while it does not increase the expressivity of Sparklis, allows
building coordinations of values in far fewer interactions.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Evaluation</title>
      <p>
        We performed the expert evaluation in two successive steps: 1) the human factors
specialists in the project team checked Sparklis against the list of recommendations
derived from the literature analysis, comparing it to the main tools used by
pharmacovigilance specialists in France (see Section 2); then, 2) they analyzed Sparklis over the
PEGASE Knowledge Graph, after the extensions were implemented (see Section 4.3),
by performing a cognitive walkthrough [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] in order to identify remaining usability issues.
      </p>
      <sec id="sec-5-1">
        <title>Qualitative Evaluation</title>
        <p>
          Two human factors specialists checked to what extent the extended version of Sparklis
meets the requirements learned from the literature review and the benchmark of
existing tools. Table 1 compares the BNPV tool and the extended Sparklis with respect to
the requirements identified in Section 2. The evaluation showed that Sparklis
adheres better to the requirements than the BNPV, the tool currently in use.
Therefore, Sparklis may provide better support for pharmacovigilance specialists' searches.
Nonetheless, there is still room for improvement regarding: criterion 1, as synonyms
and related terms are not considered; criterion 7, as Sparklis provides the number of ADR
reports linked to a given term based only on a sample of reports; and criterion 8, as, although
it guides users during the request building, the current interface is not sufficiently
intuitive for untrained pharmacovigilance specialists.
        </p>
        <p>
          A cognitive walkthrough [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] was performed, following an ISO standard [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], to identify
remaining usability issues in Sparklis. This inspection method focuses on how easy and
intuitive it is for new users to interact with an interactive device. Representative
pharmacovigilance specialists were asked to provide four prototypical use cases. Following those
use cases, two human factors specialists went through the sequences of tasks supported
by Sparklis, putting themselves in the pharmacovigilance specialists' shoes. For each subtask, they
asked themselves four standardized questions:
        </p>
        <p>- Will pharmacovigilance specialists try to achieve the effect the subtask has?</p>
        <p>- Will they notice that the correct action is available?</p>
        <p>- Will they understand that the wanted subtask can be achieved by the action?</p>
        <p>- Do they get appropriate feedback?</p>
        <p>For each question, at each subtask, usability problems leading to the answer "no" were
recorded. While the evaluation was performed on Sparklis directly, we prepared some
illustrative videos to show how the user can deal with the use cases using Sparklis, which
are available at http://www.irisa.fr/LIS/common/documents/pegase2018/.</p>
        <p>Despite the potential support provided by Sparklis to pharmacovigilance specialists,
its current graphical user interface might not be intuitive enough from a
pharmacovigilance specialist's perspective. While Sparklis has been shown to successfully reduce the
gap between users and SPARQL queries, it still requires training in order to get through
the first stage of the learning curve. This is especially relevant in this context, where
the users (pharmacovigilance specialists) might have neither expertise nor training
in computer and information sciences, and their related logic, concepts, and wording.
For instance, Sparklis provides all the flexibility of SPARQL in order to build
queries; however, it can be argued that this flexibility might confuse pharmacovigilance
specialists, who use a very limited range of requests on a daily basis.</p>
        <p>From this evaluation, we conclude that we might need two different
versions of the Sparklis interface depending on the user profile (i.e., advanced and beginner
users): 1) for advanced users, an interface presenting all the relevant information to
help pharmacovigilance specialists build a request, and 2) for beginners, a simplified
graphical user interface presenting only the logical operators and concepts that are
most likely to be used, with the others available on demand. In this way, we will allow the whole
range of pharmacovigilance specialists to take advantage of its power from the start.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusions and Future Work</title>
      <p>
        To overcome the limitations of current pharmacovigilance tools, we have built a knowledge
graph that integrates: 1) MedDRA, the main pharmacovigilance vocabulary, 2)
OntoADR, its semantic enhancement using SNOMED CT, 3) SMQs, and 4) patient data.
In order to search and explore that graph, we have adopted Sparklis, a SPARQL query
builder in natural language. We conducted a literature review along with a benchmark of
existing tools, which led to the detection of several requirements that pharmacovigilance
tools have to meet, and which were taken into account in a Sparklis extension. Finally,
we carried out an evaluation of the interface following an ISO ergonomics standard [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ],
showing the potential of our proposal. Usability evaluation with end-users is in progress.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <article-title>ICH guideline E2B (R2), Electronic transmission of individual case safety reports</article-title>
          , Final Version 2.3, Document Revision February,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. UMLS Website. http://www.nlm.nih.gov/research/umls/, accessed: September
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name><surname>Bakhshi-Raiez</surname>, <given-names>F.</given-names></string-name>,
          <string-name><surname>de Keizer</surname>, <given-names>N.</given-names></string-name>,
          <string-name><surname>Cornet</surname>, <given-names>R.</given-names></string-name>,
          <string-name><surname>Dorrepaal</surname>, <given-names>M.</given-names></string-name>,
          <string-name><surname>Dongelmans</surname>, <given-names>D.</given-names></string-name>,
          <string-name><surname>Jaspers</surname>, <given-names>M.</given-names></string-name>:
          <article-title>A usability evaluation of a SNOMED CT based compositional interface terminology for intensive care</article-title>.
          <source>Int. J. of Medical Informatics</source>
          <volume>81</volume>(<issue>5</issue>),
          <fpage>351</fpage>-<lpage>362</lpage>
          (<year>2012</year>)
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Bousquet</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sadou</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Souvignet</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jaulent</surname>
            ,
            <given-names>M.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Declerck</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Formalizing MedDRA to support semantic reasoning on adverse drug reaction terms</article-title>
          .
          <source>Journal of Biomedical Informatics</source>
          <volume>49</volume>
          ,
          <fpage>282</fpage>
          –
          <lpage>291</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Ferre</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Sparklis: An expressive query builder for SPARQL endpoints with guidance in natural language</article-title>
          .
          <source>Semantic Web: Interoperability, Usability, Applicability</source>
          <volume>8</volume>
          (
          <issue>3</issue>
          ),
          <fpage>405</fpage>
          –
          <lpage>418</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Harrison</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mozzicato</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>MedDRA®: The tale of a terminology: Side effects of drugs essay</article-title>
          .
          <source>Side Effects of Drugs Annual</source>
          , vol.
          <volume>31</volume>
          , pp.
          <fpage>xxxiii</fpage>
          –
          <lpage>xli</lpage>
          .
          <publisher-name>Elsevier</publisher-name>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7. ICH:
          <article-title>Introductory Guide for Standardised MedDRA Queries (SMQs) Version 21.0</article-title>
          , Document Revision March
          ,
          <year>2018</year>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <article-title>Ergonomics of human-system interaction – Part 210: Human-centred design for interactive systems</article-title>
          . Standard, International Organization for Standardization, Geneva, CH
          (Mar
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Peute</surname>
            ,
            <given-names>L.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>de Keizer</surname>
            ,
            <given-names>N.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jaspers</surname>
            ,
            <given-names>M.W.</given-names>
          </string-name>
          :
          <article-title>The value of retrospective and concurrent think aloud in formative usability testing of a physician data query tool</article-title>
          .
          <source>Journal of Biomedical Informatics</source>
          <volume>55</volume>
          ,
          <fpage>1</fpage>
          –
          <lpage>10</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Polson</surname>
            ,
            <given-names>P.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lewis</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rieman</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wharton</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Cognitive walkthroughs: a method for theory-based evaluation of user interfaces</article-title>
          .
          <source>Int. J. of Man-Machine Studies</source>
          <volume>36</volume>
          (
          <issue>5</issue>
          ),
          <fpage>741</fpage>
          –
          <lpage>773</lpage>
          (
          <year>1992</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Shiri</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruecker</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bouchard</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Doll</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fiorentino</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>User evaluation of Searchling and T-saurus: Multilingual thesaurus-enhanced visual interfaces for digital libraries</article-title>
          .
          <source>Canadian Journal of Information and Library Science</source>
          <volume>37</volume>
          (
          <issue>2</issue>
          ),
          <fpage>137</fpage>
          –
          <lpage>160</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Sutcliffe</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ennis</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Evaluating the effectiveness of visual user interfaces for information retrieval</article-title>
          .
          <source>Int. J. Human-Computer Studies</source>
          <volume>53</volume>
          (
          <issue>5</issue>
          ),
          <fpage>741</fpage>
          –
          <lpage>763</lpage>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Vega-Gorgojo</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bote-Lorenzo</surname>
            ,
            <given-names>M.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Asensio-Perez</surname>
            ,
            <given-names>J.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gomez-Sanchez</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dimitriadis</surname>
            ,
            <given-names>Y.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jorrín-Abellán</surname>
            ,
            <given-names>I.M.</given-names>
          </string-name>
          :
          <article-title>Semantic search of tools for collaborative learning with the ontoolsearch system</article-title>
          .
          <source>Computers and Education</source>
          <volume>54</volume>
          (
          <issue>4</issue>
          ),
          <fpage>835</fpage>
          –
          <lpage>848</lpage>
          (May
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Walji</surname>
            ,
            <given-names>M.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kalenderian</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tran</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kookal</surname>
            ,
            <given-names>K.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nguyen</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tokede</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>White</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vaderhobli</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramoni</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stark</surname>
            ,
            <given-names>P.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kimmes</surname>
            ,
            <given-names>N.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schoonheim-Klein</surname>
            ,
            <given-names>M.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patel</surname>
            ,
            <given-names>V.L.</given-names>
          </string-name>
          :
          <article-title>Detection and characterization of usability problems in structured data entry interfaces in dentistry</article-title>
          .
          <source>Int. J. Medical Informatics</source>
          <volume>82</volume>
          (
          <issue>2</issue>
          ),
          <fpage>128</fpage>
          –
          <lpage>138</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>