<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>MeSH2Matrix: Machine learning-driven biomedical relation classification based on the MeSH keywords of PubMed scholarly publications</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Houcemeddine Turki</string-name>
          <email>turkiabdelwaheb@hotmail.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bonaventure F. P. Dossou</string-name>
          <email>femipancrace.dossou@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chris Chinenye Emezue</string-name>
          <email>chris.emezue@gmail.com</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mohamed Ali Hadj Taieb</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mohamed Ben Aouicha</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hanen Ben Hassen</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Afif Masmoudi</string-name>
          <email>afif.masmoudi@fss.usf.tn</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Data Engineering and Semantics Research Unit, Faculty of Sciences of Sfax, University of Sfax</institution>
          ,
          <addr-line>Sfax</addr-line>
          ,
          <country country="TN">Tunisia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Jacobs University Bremen</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Laboratory of Probability and Statistics, Faculty of Sciences of Sfax, University of Sfax</institution>
          ,
          <addr-line>Sfax</addr-line>
          ,
          <country country="TN">Tunisia</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Technical University of Munich</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <fpage>45</fpage>
      <lpage>60</lpage>
      <abstract>
        <p>Biomedical relation classification has been significantly improved by the application of advanced machine learning techniques on the raw texts of scholarly publications. Despite this improvement, the reliance on large chunks of raw text makes these algorithms sufer in generalization, precision and reliability. However, the use of the distinctive characteristics of bibliographic metadata can prove efective in achieving a better performance for this challenging task. In this research paper, we introduce an approach for biomedical relation classification using the qualifiers of co-occurring Medical Subject Headings (MeSH). First of all, we introduce MeSH2Matrix, our dataset consisting of 46,469 biomedical relations curated from PubMed publications using our approach. Using MeSH2Matrix, we build and train three machine learning models (SVM, D-Model and C-Net) to evaluate the eficiency of our approach for biomedical relation classification. Our best model achieves an accuracy of 70.78% for 195 classes and 83.09% for five superclasses. Our results will hopefully shed light on developing better algorithms for biomedical ontology construction based on the MeSH keywords of PubMed publications. For reproducibility purposes, MeSH2Matrix as well as all our source codes are made publicly accessible at https://github. com/SisonkeBiotik-Africa/MeSH2Matrix.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Biomedical Relation Classification</kwd>
        <kwd>MeSH Keywords</kwd>
        <kwd>PubMed Records</kwd>
        <kwd>MeSH qualifiers</kwd>
        <kwd>Machine Learning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Biomedical ontologies are currently having a growing place in driving intelligent systems
and information retrieval for clinical decision support and natural language processing [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ].
Ontologies are constituted of concepts related to each other using statements in the form of
triples: Subject (Concept), Predicate (Relation Type), and Object (Concept) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. As a result
they can be easily enriched, processed, validated, and reused by machines [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Nevertheless,
biomedical ontologies are mainly created through human curation by experts (e.g., physicians
and ontologists), consortiums (e.g., Open Biomedical Ontologies Consortium), and institutions
(e.g. NIH National Center for Biomedical Ontology) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], while machines are delegated particular
tasks in ontology engineering: biomedical relation extraction and classification [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], biomedical
ontology matching and integration [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], and biomedical ontology evaluation and validation
[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. For these tasks, free-form text from large collections of scholarly publications are usually
processed, while the metadata, particularly bibliographic metadata, of the analyzed publications
are not leveraged [
        <xref ref-type="bibr" rid="ref6 ref9">6, 9</xref>
        ].
      </p>
      <p>
        Although these systems achieve high accuracy rates (F1-Score of biomedical relation
classification with raw texts from 63.5 to 84.3), the consideration of the characteristics of scholarly
publications could help refine their results to achieve better outputs [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. In addition, the
development of customized algorithms driven by the structure and the logic behind facets of the
extracted publications – such as controlled keywords – could bring more trustworthy results
using fewer computer resources [
        <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
        ].
      </p>
      <p>
        In this context, Medical Subject Headings (MeSH) keywords used to annotate PubMed scholarly
publications can be a very useful resource for biomedical relation extraction and classification
[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Based on MeSH taxonomy, it always assigns the same concept to various publications
in PubMed using the same term [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Consequently, the use of MeSH keywords as input
for biomedical ontology engineering can be more eficient than the use of user-generated
bibliometric metadata and raw texts of scholarly publications [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. In this research paper,
we propose an approach for biomedical relation classification using the associations of the
MeSH keywords in PubMed records. For this, we will use the biomedical relations between
MeSH terms as revealed by Wikidata, an open and collaborative knowledge graph available
at https://www.wikidata.org [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], to construct the training dataset named MeSH2Matrix for
biomedical relation classification using our method.
      </p>
      <p>We will begin by giving an overview of MeSH Keywords and how they have contributed so
far to enhancing tasks in Biomedical Informatics (Section 2.1). Then, we will describe Wikidata
as a biomedical knowledge resource and explain how it can be used to extract biomedical
relations between the MeSH terms (Section 2.3). After that, we will explain the principles of
our proposed approach for the biomedical relation classification based on the MeSH keywords
of PubMed scholarly publications and explicate our method for the creation of our training
dataset from PubMed and Wikidata (Section 3.1). Later, we will provide a description of the
MeSH2Matrix dataset and assess its quality by comparing its main features to previous research
ifndings (Section 3.2). Subsequently, we will describe our experiments for the development of the
biomedical relation classification machine-learning algorithms to be trained on MeSH2Matrix
and outline our results for the biomedical relation classification based on MeSH2Matrix (Section
4). Finally, we will conclude our experiments and provide future directions for our research
paper (Section 5).</p>
    </sec>
    <sec id="sec-2">
      <title>2. Overview</title>
      <sec id="sec-2-1">
        <title>2.1. MeSH keywords as a valuable input</title>
        <p>
          As a controlled vocabulary, MeSH supports sixteen types of biomedical concepts ranging from
anatomical structures to symptoms, diseases, and drugs1 [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. This broad coverage of MeSH
Terms makes them useful to represent the topics of all scholarly publications [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. The value
of these terms is further increased by the regular revision of MeSH to include new concepts
such as COVID-192 and SARS-CoV-23 and to cover updates in the biomedical nomenclature
[
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. That is why it has been used to annotate PubMed records for years by the human
curators of the bibliographic database to enable consistent indexing of scholarly papers and
intuitive data mining [
          <xref ref-type="bibr" rid="ref13 ref15">13, 15</xref>
          ]. Beyond its contribution to the enhanced topic granularity
in PubMed, MeSH keywords have the ability to represent facets of a given topic through
the use of predefined subheadings known as MeSH qualifiers providing more precision to
the MeSH keywords of the PubMed records [
          <xref ref-type="bibr" rid="ref10 ref9">9, 10</xref>
          ]. Currently, there are 89 MeSH qualifiers
representing all the characteristics and features of a biomedical entity as revealed at https:
//www.nlm.nih.gov/mesh/subhierarchy.html.
        </p>
        <p>
          These interesting features of the MeSH keywords encouraged its usage beyond information
seeking. MeSH keywords are gaining an increasing popularity in biomedical information
retrieval as they allow better accuracy for relation extraction and classification from PubMed
[
          <xref ref-type="bibr" rid="ref15 ref16">15, 16</xref>
          ]. This is due to the restriction of the considered publications to the ones that are most
likely to include the required information [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] and the analysis of the co-occurrences of the
MeSH Keywords [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. The better output of clinical knowledge engineering driven by the MeSH
terms of the PubMed publications has proven the value of MeSH Keywords, especially when
assigned controlled qualifiers, to classify biomedical relations [
          <xref ref-type="bibr" rid="ref10 ref9">9, 10</xref>
          ]. Essentially, the qualifiers
of two co-occurring MeSH keywords can inform us of the nature of the semantic relation
between them as shown in Table 1. This is particularly motivated by the fact that biomedical
publications usually have narrow research scope and do not consequently study multiple and
unrelated facets of a given topic unless the publication type is an encyclopedic review [? ]. The
        </p>
        <sec id="sec-2-1-1">
          <title>1https://www.nlm.nih.gov/bsd/disted/meshtutorial/meshtreestructures/index.html 2https://www.ncbi.nlm.nih.gov/mesh/2052179 3https://www.ncbi.nlm.nih.gov/mesh/2052180</title>
          <p>
            qualifiers of the MeSH keywords can be easily found as they are simply separated from their
corresponding headings using a slash (/). The MeSH keywords of PubMed scholarly publications
can be retrieved from the NCBI Entrez API using the Biopython Python Library [
            <xref ref-type="bibr" rid="ref17 ref18">17, 18</xref>
            ]. The
structured format of MeSH keywords and their simple retrieval from the PubMed bibliographic
database motivate their usage for the biomedical relation extraction and classification.
          </p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Wikidata as a biomedical semantic resource</title>
        <p>
          Wikidata was created in October 2012 as a knowledge database to support structured data in
Wikipedia such as interlanguage links and infoboxes [? ]. However, it has grown over the past
years to become one of the largest free and open knowledge graphs covering various range of
ifelds, particularly biomedicine [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. Its collaborative and crowdsourcing-based enrichment,
regular updates according to recent advances in the major areas of interest, etc make it the most
adequate knowledge base to support ever-changing scholarly evidences, mainly in the context
of the COVID-19 pandemic [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. Currently, Wikidata represents various types of medical
entities as items, such as drugs, diseases, genes, proteins, organs, and symptoms [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. These
items are linked to their equivalent entities in external knowledge resources, mainly MeSH
[
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. Every entity is assigned a language-independent identifier (so-called Q-number) as well
as its main names (labels), glosses (descriptions), and alternative names (aliases) in a variety of
natural languages, particularly English as shown in Fig. 1 [
          <xref ref-type="bibr" rid="ref12 ref19">12, 19</xref>
          ]. Furthermore, entities are
related to other ones using semantic relations (Subject – Predicate – Object) where the relation
type (Predicate) is also a Wikidata entity having its own identifier ( P-number) and semantic
description [
          <xref ref-type="bibr" rid="ref12 ref20">12, 20</xref>
          ].
        </p>
        <p>
          There are two major kinds of relation types: taxonomic or non-taxonomic ones [
          <xref ref-type="bibr" rid="ref12 ref19">12, 19</xref>
          ].
Taxonomic relations are hierarchical ones that link an entity to its parent class or constituents
(e.g., instance of [P31] – Fig. 2) [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. Non-taxonomic relation types are non-hierarchical ones
that are assigned to define specialized knowledge such as biomedical information (e.g., signs
or symptoms [P780] – Fig.3) [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. A non-taxonomic relation can be symmetric: where the
subject-object inversion would not afect the meaning of the statement. If this condition is not
fulfilled, the relation in non-symmetric [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. Such a classification of biomedical relation types
can be easily inferred from the semantic information about Wikidata relation types, particularly
the property constraints providing conditions for the definition of relational statements (Fig.
4) [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. These conditions are not only useful for the validation of semantic information about
the semantic relations but also to recognize the distinctive features of relation types. Moreover,
Wikidata items are also matched to their equivalents in external resources using non-relational
statements in the form of triples where the predicate reveals the aligned resource and the object is
the identifier of the concept in the external database [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. Particularly, MeSH Descriptor ID [P486]
statements align between MeSH terms and Wikidata items as revealed by Fig. 5. As Wikidata
knowledge graph is developed using the Resource Description Framework (RDF) format, it can
be easily processed to get subsets of semantic information needed to drive knowledge-based
applications using a variety of tools, particularly the MediaWiki API4, the Wikidata SPARQL
query service5, and the Wikibase Integrator Python Library6.
        </p>
        <sec id="sec-2-2-1">
          <title>4https://www.wikidata.org/w/api.php 5https://query.wikidata.org 6https://github.com/LeMyst/WikibaseIntegrator</title>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. MeSH2Matrix</title>
      <sec id="sec-3-1">
        <title>3.1. Principles and dataset generation</title>
        <p>
          We develop our approach upon the assumption that the qualifiers of two co-occurring MeSH
terms can outline the type of semantic relation between them, as previously shown in Table 1
[
          <xref ref-type="bibr" rid="ref10 ref9">9, 10</xref>
          ]. Let t1 and t2 be two semantically related MeSH terms that are not assigned a relation
type. Our method proposes to first search for the PubMed scholarly publications having both t1
and t2 as MeSH headings. Then for each of the found records, we extract the qualifiers q1 and
q2 (e.g., therapeutic use for Sofosbuvir/therapeutic use) respectively corresponding to t1 and t2
(e.g., Sofosbuvir for Sofosbuvir/therapeutic use). This will enable the creation of (q1, q2) couples
as shown in Fig. 6. When a term is assigned two or more qualifiers (e.g., t2/Z/U for Paper 3 –
Fig. 6), this means that a paper deals with a facet of a characteristic of the considered topic. In
such a situation, we consider it as though the qualifiers were independently assigned to the
MeSH term for the paper (e.g., t2/Z and t2/U for Paper 3 – Fig. 6). We restrict the number of
considered publications to the 100 most relevant research papers according to PubMed Best
Match search algorithm [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ]. This will prevent matters related to the timeout limit of the NCBI
PubMed API (Error 429). After the couples of MeSH qualifiers are retrieved, we draw a matrix of
correspondence ( ) – this is a square matrix of the qualifiers ( 1, ..., 89)7 where each element
7There are currently 89 pre-defined MeSH qualifiers as revealed at https://www.nlm.nih.gov/mesh/subhierarchy.
html.
, is the number of records featuring both t1/qi and t2/qj as MeSH keywords divided by the
total number of records with the two MeSH terms t1 and t2 (Equation 1).
        </p>
        <p>As a practical example, as of March 6, 2022, there are 32 PubMed records where Hepatitis B
and Sofosbuvir are featured together as MeSH headings. From these 32 publications, there are
15 papers where drug therapy and therapeutic use are the respective qualifiers to Hepatitis B and
Sofosbuvir. In this situation, the value that will be represented for the association between drug
therapy and therapeutic use in the Hepatitis B-Sofosbuvir matrix is 15/32 = 0.469.</p>
        <p>Floating-point representations (noted r) in this matrix will generally range between 0 and 1.
This matrix of correspondence will be used as an input to find the nature of the semantic relation
between t1 and t2.</p>
        <p>, =
 (1/, 2/ ) .</p>
        <p>(1, 2)
(1)</p>
        <p>To construct our dataset, we first retrieve biomedical relations from Wikidata where the
subject and the object are matched to their equivalent Medical Subject Headings (MeSH). This
is accomplished with the SPARQL query featured in Fig. 7. The output of the query is saved as a
tab-separated values (TSV) file to allow its automatic processing. After that, we use the Wikidata
identifiers (also called P-numbers) of the extracted relation types as labels for the generated
matrices. Then, we find the oficial names of the subjects and objects of the relations in MeSH
based on their MeSH Descriptor ID [P486], which is then used to retrieve their associations
in PubMed using the NCBI Entrez API. Finally, we extract the (q1, q2) couples and return the
qualifiers’ matrices for every subject-object association. Every matrix is assigned the relation
type corresponding to the subject-object association as a label. For a better analysis of our
proposed approach, we extract the features of the considered Wikidata relation types and verify
their names as well as if they are taxonomic, symmetric, or biomedical through the application
of SPARQL queries on Wikidata using Wikibase Integrator coupled with human validation.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Results and Discussion</title>
        <p>As of December 12, 2021, our SPARQL query (Fig. 7) has successfully retrieved 81,000 biomedical
relations between MeSH Terms from Wikidata. This is a very significant amount of information
as Wikidata only includes 99,208 semantic relations between MeSH concepts8. We have chosen
81,000 as the number of considered relations in order to simplify the performed computations
for the analysis of our proposed approach. When analyzing the biomedical relations extracted
for building our dataset, we found out that the supported relation types can be classified into
ifve categories:
• Non-Biomedical Non-Symmetric (156 relation types, 17,758 relations),
• Biomedical Non-Symmetric (53 relation types, 27,429 relations),
• Non-Biomedical Symmetric (12 relation types, 9,000 relations),
• Biomedical Symmetric (3 relation types, 1,441 relations), and
• Taxonomic (3 relation types, 25,372 relations).</p>
        <p>
          This goes in line with the coverage of various aspects of biomedical knowledge in Wikidata
as a multidisciplinary knowledge graph [
          <xref ref-type="bibr" rid="ref12 ref19">12, 19</xref>
          ]. The extraction of the associations between the
subject and object of every semantic relation in the MeSH keywords of PubMed publications has
shown that most of the associations are likely to be found in a limited number of publications
(Fig. 8A) and that commonly available MeSH associations in the PubMed records are rare (25,227
associations [42.7%] each available in 100 papers or more – Green dot in Fig. 8A). This is evident
as scientific productivity follows Lotka’s Law, an inverse power law that describes the uneven
distribution of research outputs [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]. When seeing if the extraction of qualifiers describing the
MeSH associations has been successful in generating matrices, we found that the probability
of the creation of qualifiers’ matrices tends to increase with the augmentation of the number
of PubMed publications including the MeSH association before reaching a plateau near 1 at
8For a live update: https://w.wiki/4JN9
twenty publications (Fig. 8B). The existence of biomedical relations in Wikidata that cannot
be found in PubMed records and that do not consequently return matrices of correspondence
could be explained by the fact that Wikidata is subject to include irrelevant biomedical relations
as it is collaboratively edited by human users without any restriction [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. By contrast, it
is important to reveal that the proportion of the generation of the matrices for the MeSH
associations available in 100 publications or more is below the plateau with a rate of 73.3% (Red
dots in Fig. 8B). To identify the reason behind such an unexpected behavior, we compute the
quotient of the MeSH associations not generating matrices and available at least in 100 PubMed
papers out of the overall number of the MeSH associations not having qualifiers’ matrices for
every class of Wikidata relation types. We found out that this rate is significantly higher for
taxonomic (6,133 out of 13,411, 45.7%) and non-biomedical non-symmetric (1,712 out of 8,335,
20.5%) relations than for biomedical non-symmetric (1,137 out of 9,498, 12.0%), non-biomedical
symmetric (189 out of 2,647, 7.1%), and biomedical symmetric (7 out of 640, 1.1%). This proves
the ability of the MeSH qualifiers to better represent biomedical or symmetric relations than
generic and non-symmetric ones.
        </p>
        <p>
          The obtained dataset of qualifiers’ matrices represented 46,469 relations covering the five
classes of semantic relation types (195 supported relation types, 54.2% of the matrices based
on 100 publications or more): 17,931 biomedical non-symmetric, 11,961 taxonomic, 9,423
nonbiomedical non-symmetric, 6,353 non-biomedical symmetric, and 801 biomedical symmetric
relations. Unsurprisingly, the most represented relation types in the dataset have been dominated
by taxonomic (e.g., subclass of [P279] and instance of [P31]) or biomedical non-symmetric relation
types (e.g., drug and therapy used for treatment [P2176]). This makes our dataset more inclusive
than other available corpora for biomedical relation classification only covering a few relation
types, particularly drug interactions, drug adverse efects , and drug-disease relations [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Biomedical relation classification using MeSH2Matrix</title>
      <p>
        Machine learning-based approaches handle biomedical relation classification as a supervised
learning classification task, where labelled data is used to train models. In this paper, we provide
benchmark results on our dataset, using three machine learning models:
SVM: Support vector machines (SVMs) [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] are best suited for samples with many features
because of their ability to learn is independent of the features space [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. They have been used
exensively in biomedical classification tasks [
        <xref ref-type="bibr" rid="ref25 ref26 ref27 ref28">25, 26, 27, 28</xref>
        ] due to their ability to generalize
well with data consisting of sparse high-dimensional features. For our baseline, we trained a
linear support vector machine. For this, we transformed each 89 × 89 matrix into a single 7921
feature vector.
      </p>
      <p>
        D-Model: Neural networks (NNs) have produced state-of-the-art results in the area of relation
classification [
        <xref ref-type="bibr" rid="ref28 ref29 ref30 ref31">28, 29, 30, 31</xref>
        ]. The major advantage of neural network based approaches lies
in thier ability to directly learn the latent feature representation from the labeled training
data without requiring experts to carefully craft them [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. For our experiments with neural
networks, we designed D-Model, a simple multi-layer perceptron with an input layer of output
feature size of 3, 960, a hidden layer of 1, 980 and an output layer with an output feature size
corresponding to the number of classes [rationale for the choice of the size of neurons: 1). we
tested diferent sizes and this gave the best result, and 2). we followed [
        <xref ref-type="bibr" rid="ref32 ref33 ref34">32, 33, 34</xref>
        ] in keeping the
hidden layer size between input layer size and output layer size]. ReLU activation function [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ]
was used between the input and hidden layers to introduce non-linearity. The output layer is
connected to a softmax activation function which converts the model’s output into a probability
over the classes. Although NNs have shown great promise for relation classification, they are
highly susceptible to overfitting [
        <xref ref-type="bibr" rid="ref36">36</xref>
        ] and require lots of hyperparameter tuning. Therefore, we
experimented with regularization techniques (early stopping and dropout) the hyperparameters
(learning rate, batch size, etc) in order to produce the best performing D-Model.
C-Net: Convolutional neural networks (CNNs) are a type of neural networks that can
successfully capture the spatial and temporal dependencies in an image through the application
of convolution operation and relevant filters. Their potential was first witnessed in computer
vision around 2012 [
        <xref ref-type="bibr" rid="ref37">37</xref>
        ], and since then have been used extensively even in biomedical relation
classification [
        <xref ref-type="bibr" rid="ref30 ref38 ref39">38, 39, 30</xref>
        ]. CNNs perform well on an image dataset better due to the reduction
in the number of parameters involved and reusability of their weights - they are therefore
best suited for image-type data. Furthermore, with CNNs we can work directly with the
2dimensional matrix (compared to transforming it for SVM and D-Model). To explore the impact
of CNNs on MeSH2Matrix, we decided to interprete our feature matrix as spatially correlated
features and designed C-Net, a simple CNN-based architecture made up of four convolution
layers (each layer consisting of a 2-dimensional convolution, batch normalization [
        <xref ref-type="bibr" rid="ref40">40</xref>
        ], a ReLU
activation function [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ] and max-pooling) and two fully connected layers. After passing through
the fully connected layers, the final layer uses the softmax activation function which is used to
get probabilities of the input matrix being in a particular class. CNN-based models, while being
very promising, require practical knowledge to configure the model architecture with regard to
the performance [
        <xref ref-type="bibr" rid="ref41">41</xref>
        ], and to set the hyperparameters for the best optimization [42]. Similarly,
we conducted hyperparameter tuning and optimization in order to explore the reasonable ranges
for the sensitive hyperparameters of the classification model.
      </p>
      <sec id="sec-4-1">
        <title>4.1. Experiments</title>
        <p>We performed two rounds of classification: one with all relation types (195), and another with 5
categories obtained after grouping the initial 195 relation types (see section 3.2 for more details
on grouping). We split our dataset into training (33, 457 samples), validation (13, 012 samples)
for early stopping, regularization and hyperparameter tuning - and testing (9, 294 samples) - for
the final evaluation of the model. For SVM training, we merged the training and validation set,
making a total of 46, 469 samples for training. For the training of D-Model and C-Net, we used
the Adam optimizer [43]. The code for all our deep learning experiments was written using the
PyTorch deep learning framework [44], while for SVM we implemented the training using the
LinearSVC package.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Results</title>
        <p>
          Table 2 shows the results of the three benchmark models on the 195-classification and
5classification tasks. The metric being used are accuracy and multi-class F1-score (which is a
metric that combines the precision and recall of the model). It is clear that the three proposed
models achieved acceptable accuracy measures that go in line with the recent advances in
biomedical relation classification ( F1-Score between 0.65 and 0.85) [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
        </p>
        <p>
          We see a notable improvement in the probabilistic methods (D-Model and C-Net) over SVM.
DModel performs the best, although outperforming C-Net by a small margin. Another observation
is that for all the models, their performance on the 5-class takes was much better than the
195-class ones. This could be because the consideration of five generalized superclasses actually
reduces the complexity of the task, making it easier for the model to learn [45]. For example,
class generalization allowed us to be get rid of the closely related taxonomic relation types
(i.e., instance of [P31], subclass of [P279], and part of [P361]) and eliminate the efect of the
confusion between these three relation types on the accuracy of the models. Another possible
reason could be that grouping increased the distribution of some minority classes (classes with
a very few samples). On the one hand, due to the enrichment of Wikidata thanks to human
eforts, important Wikidata statements can be mistakenly defined for minor relation types
[
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. The efect of such deficient relations will become insignificant when the generalization
occurs. On the other hand, as of December 12, 2021, 4,522 (4.4%) out of the 99,208 Wikidata
relations between MeSH Terms are having the same subject and object as another supported
semantic relations between MeSH Concepts9. Statements having the same subjects and objects
but diferent relation types are likely to be merged together due to the class generalization,
allowing to reduce the confusion between slightly overlapping relation types.
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>In this research paper, we proposed a novel approach for the classification of biomedical relations
based on the association between the qualifiers of two semantically related MeSH keywords
of PubMed scholarly publications. We generated MeSH2Matrix as a training dataset (covering
195 relation types involved in five superclasses) to enable the MeSH-based biomedical relation
classification and we trained three benchmarking machine-learning models ( SVM, D-Model and
C-Net) to evaluate the eficiency of our approach to classify various types of biomedical relations.
We found an interesting eficiency of our approach in biomedical relation classification ( F1-Score
&gt; 0.66) proving the promising value of using Bibliometric-Enhanced Information Retrieval
towards the improvement of biomedical relation classification. For reproducibility purposes,
our source code and dataset are currently available at https://github.com/SisonkeBiotik-Africa/
MeSH2Matrix. As a future direction of this work, we propose to expand our approach into a
method for converting subsets of the MeSH taxonomy into biomedical ontologies. Furthermore,
we propose to further analyze the efect of the generalization of relation types to their respective
superclasses on the accuracy rates of our models to study the behavior of our approach.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This work has been supported by the Tunisian Ministry of Higher Education and Scientific
Research (MoHESR) within the framework of the Federated Research Project PRFCOV19-D1-P1
and by Craig Newmark Philanthropies, Facebook, and Microsoft through the WikiCred Grant
Initiative. We thank the Sisonkebiotik Community, particularly Chris Fourie (University of the
Witwatersrand, South Africa), for contributing to the development of the research collaboration
that resulted in this research paper. We thank Dr. Ahmed Ben Abdelaziz (University of Sousse,
Tunisia) for introducing the Medical Subject Headings to the first author of this research work
in March 2016. We also thank Mr. Colin Leong (University of Dayton, United States of America)
for his useful comments and discussion.</p>
      <sec id="sec-6-1">
        <title>9Live data: https://w.wiki/4itd</title>
        <p>https://aclanthology.org/I17-1026.
[42] J. Bergstra, Y. Bengio, Random search for hyper-parameter optimization, J. Mach. Learn.</p>
        <p>Res. 13 (2012) 281–305.
[43] D. P. Kingma, J. Ba, Adam: A method for stochastic optimization, in: Y. Bengio, Y. LeCun
(Eds.), 3rd International Conference on Learning Representations, ICLR 2015, San Diego,
CA, USA, May 7-9, 2015, Conference Track Proceedings, 2015. URL: http://arxiv.org/abs/
1412.6980.
[44] A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin,
N. Gimelshein, L. Antiga, A. Desmaison, A. Kopf, E. Yang, Z. DeVito, M. Raison, A. Tejani,
S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, S. Chintala, Pytorch: An imperative style,
high-performance deep learning library, in: H. Wallach, H. Larochelle, A. Beygelzimer,
F. d'Alché-Buc, E. Fox, R. Garnett (Eds.), Advances in Neural Information Processing
Systems, volume 32, Curran Associates, Inc., 2019. URL: https://proceedings.neurips.cc/paper/
2019/file/bdbca288fee7f92f2bfa9f7012727740-Paper.pdf.
[45] H. Turki, M. A. Hadj Taieb, M. Ben Aouicha, How knowledge-driven class generalization
afects classical machine learning algorithms for mono-label supervised classification,
in: Proceedings of the 21st International Conference on Intelligent Systems Design and
Applications, Springer, Cham, Online, 2021. doi:10.1007/978-3-030-96308-8_59.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Bona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. W.</given-names>
            <surname>Prior</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. N.</given-names>
            <surname>Zozus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Brochhausen</surname>
          </string-name>
          ,
          <article-title>Enhancing clinical data and clinical research data with biomedical ontologies - insights from the knowledge representation perspective</article-title>
          ,
          <source>Yearbook of Medical Informatics</source>
          <volume>28</volume>
          (
          <year>2019</year>
          )
          <fpage>140</fpage>
          -
          <lpage>151</lpage>
          . doi:
          <volume>10</volume>
          .1055/ s-0039-1677912.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kulmanov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. Z.</given-names>
            <surname>Smaili</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Hoehndorf</surname>
          </string-name>
          ,
          <article-title>Semantic similarity and machine learning with ontologies</article-title>
          ,
          <source>Briefings in Bioinformatics</source>
          <volume>22</volume>
          (
          <year>2021</year>
          )
          <article-title>bbaa199</article-title>
          . doi:
          <volume>10</volume>
          .1093/ bib/bbaa199.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bandrowski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Brinkman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Brochhausen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. H.</given-names>
            <surname>Brush</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Bug</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Chibucos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Clancy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Courtot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Derom</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumontier</surname>
          </string-name>
          , et al.,
          <article-title>The ontology for biomedical investigations</article-title>
          ,
          <source>PLOS ONE 11</source>
          (
          <year>2016</year>
          )
          <article-title>e0154556</article-title>
          . doi:
          <volume>10</volume>
          .1371/journal.pone.
          <volume>0154556</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R.</given-names>
            <surname>Hoehndorf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumontier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. V.</given-names>
            <surname>Gkoutos</surname>
          </string-name>
          , Evaluation of research in biomedical ontologies,
          <source>Briefings in Bioinformatics</source>
          <volume>14</volume>
          (
          <year>2012</year>
          )
          <fpage>696</fpage>
          -
          <lpage>712</lpage>
          . doi:
          <volume>10</volume>
          .1093/bib/bbs053.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>B. M.</given-names>
            <surname>Konopka</surname>
          </string-name>
          ,
          <article-title>Biomedical ontologies-a review</article-title>
          ,
          <source>Biocybernetics and Biomedical Engineering</source>
          <volume>35</volume>
          (
          <year>2015</year>
          )
          <fpage>75</fpage>
          -
          <lpage>86</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.bbe.
          <year>2014</year>
          .
          <volume>06</volume>
          .002.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <article-title>Neural network-based approaches for biomedical relation classification: A review</article-title>
          ,
          <source>Journal of Biomedical Informatics</source>
          <volume>99</volume>
          (
          <year>2019</year>
          )
          <article-title>103294</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.jbi.
          <year>2019</year>
          .
          <volume>103294</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>D.</given-names>
            <surname>Oliveira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Pesquita</surname>
          </string-name>
          ,
          <article-title>Improving the interoperability of biomedical ontologies with compound alignments</article-title>
          ,
          <source>Journal of Biomedical Semantics</source>
          <volume>9</volume>
          (
          <year>2018</year>
          ).
          <source>doi:10.1186/ s13326-017-0171-8.</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Amith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Lossio-Ventura</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          <article-title>Tao, Assessing the practice of biomedical ontology evaluation: Gaps and opportunities</article-title>
          ,
          <source>Journal of Biomedical Informatics</source>
          <volume>80</volume>
          (
          <year>2018</year>
          )
          <fpage>1</fpage>
          -
          <lpage>13</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.jbi.
          <year>2018</year>
          .
          <volume>02</volume>
          .010.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>H.</given-names>
            <surname>Turki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Hadj Taieb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Ben</given-names>
            <surname>Aouicha</surname>
          </string-name>
          , G. Fraumann,
          <string-name>
            <given-names>C.</given-names>
            <surname>Hauschke</surname>
          </string-name>
          , L. Heller,
          <article-title>Enhancing knowledge graph extraction and validation from scholarly publications using bibliographic metadata</article-title>
          ,
          <source>Frontiers in Research Metrics and Analytics</source>
          <volume>6</volume>
          (
          <year>2021</year>
          )
          <article-title>694307</article-title>
          . doi:
          <volume>10</volume>
          .3389/frma.
          <year>2021</year>
          .
          <volume>694307</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>H.</given-names>
            <surname>Turki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Hadj Taieb</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Ben Aouicha, Mesh qualifiers, publication types and relation occurrence frequency are also useful for a better sentence-level extraction of biomedical relations</article-title>
          ,
          <source>Journal of Biomedical Informatics</source>
          <volume>83</volume>
          (
          <year>2018</year>
          )
          <fpage>217</fpage>
          -
          <lpage>218</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.jbi.
          <year>2018</year>
          .
          <volume>05</volume>
          .011.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>H.</given-names>
            <surname>Turki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Hadj Taieb</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Ben Aouicha, Enhancing filter-based parenthetic abbreviation extraction methods</article-title>
          ,
          <source>Journal of the American Medical Informatics Association</source>
          <volume>28</volume>
          (
          <year>2021</year>
          )
          <fpage>668</fpage>
          -
          <lpage>669</lpage>
          . doi:
          <volume>10</volume>
          .1093/jamia/ocaa314.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>H.</given-names>
            <surname>Turki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Shafee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Hadj Taieb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Ben</given-names>
            <surname>Aouicha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Vrandečić</surname>
          </string-name>
          ,
          <string-name>
            <surname>D. Das</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Hamdi</surname>
          </string-name>
          ,
          <article-title>Wikidata: A large-scale collaborative ontological medical database</article-title>
          ,
          <source>Journal of Biomedical Informatics</source>
          <volume>99</volume>
          (
          <year>2019</year>
          )
          <article-title>103292</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.jbi.
          <year>2019</year>
          .
          <volume>103292</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>N.</given-names>
            <surname>Baumann</surname>
          </string-name>
          ,
          <article-title>How to use the medical subject headings (mesh</article-title>
          ),
          <source>International Journal of Clinical Practice</source>
          <volume>70</volume>
          (
          <year>2016</year>
          )
          <fpage>171</fpage>
          -
          <lpage>174</lpage>
          . doi:
          <volume>10</volume>
          .1111/ijcp.12767.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>L.</given-names>
            <surname>Leydesdorf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Comins</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. A.</given-names>
            <surname>Sorensen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bornmann</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Hellsten</surname>
          </string-name>
          ,
          <article-title>Cited references and medical subject headings (mesh) as two diferent knowledge representations: Clustering and mappings at the paper level</article-title>
          ,
          <source>Scientometrics</source>
          <volume>109</volume>
          (
          <year>2016</year>
          )
          <fpage>2077</fpage>
          -
          <lpage>2091</lpage>
          . doi:
          <volume>10</volume>
          .1007/ s11192-016-2119-7.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Figler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.-C.</given-names>
            <surname>Tu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          , F. Cheng,
          <article-title>Characterization of the mechanism of drug-drug interactions from pubmed using mesh terms</article-title>
          ,
          <source>PLOS ONE 12</source>
          (
          <year>2017</year>
          )
          <article-title>e0173548</article-title>
          . doi:
          <volume>10</volume>
          .1371/journal.pone.
          <volume>0173548</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>T.</given-names>
            <surname>Tran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kavuluru</surname>
          </string-name>
          ,
          <article-title>Distant supervision for treatment relation extraction by leveraging mesh subheadings</article-title>
          ,
          <source>Artificial Intelligence in Medicine</source>
          <volume>98</volume>
          (
          <year>2019</year>
          )
          <fpage>18</fpage>
          -
          <lpage>26</lpage>
          . doi:
          <volume>10</volume>
          .1016/j. artmed.
          <year>2019</year>
          .
          <volume>06</volume>
          .002.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>B.</given-names>
            <surname>Chapman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Chang</surname>
          </string-name>
          , Biopython:
          <article-title>Python tools for computational biology</article-title>
          ,
          <source>ACM SIGBIO Newsletter</source>
          <volume>20</volume>
          (
          <year>2000</year>
          )
          <fpage>15</fpage>
          -
          <lpage>19</lpage>
          . doi:
          <volume>10</volume>
          .1145/360262.360268.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>P. J.</given-names>
            <surname>Cock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Antao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. T.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. A.</given-names>
            <surname>Chapman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Cox</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Dalke</surname>
          </string-name>
          , I. Friedberg,
          <string-name>
            <given-names>T.</given-names>
            <surname>Hamelryck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Kauf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Wilczynski</surname>
          </string-name>
          , et al.,
          <article-title>Biopython: Freely available python tools for computational molecular biology and bioinformatics</article-title>
          ,
          <source>Bioinformatics</source>
          <volume>25</volume>
          (
          <year>2009</year>
          )
          <fpage>1422</fpage>
          -
          <lpage>1423</lpage>
          . doi:
          <volume>10</volume>
          .1093/bioinformatics/btp163.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>H.</given-names>
            <surname>Turki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Hadj Taieb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Shafee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Lubiana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Jemielniak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Ben</given-names>
            <surname>Aouicha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. E. Labra</given-names>
            <surname>Gayo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. A.</given-names>
            <surname>Youngstrom</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Banat</surname>
          </string-name>
          ,
          <string-name>
            <surname>D. Das</surname>
          </string-name>
          , et al.,
          <article-title>Representing covid-19 information in collaborative knowledge graphs: The case of wikidata</article-title>
          ,
          <source>Semantic Web</source>
          <volume>13</volume>
          (
          <year>2022</year>
          )
          <fpage>233</fpage>
          -
          <lpage>264</lpage>
          . doi:
          <volume>10</volume>
          .3233/sw-210444.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>H.</given-names>
            <surname>Turki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Jemielniak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Hadj Taieb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. E. Labra</given-names>
            <surname>Gayo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Ben</given-names>
            <surname>Aouicha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Banat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Shafee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Prud'Hommeaux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Lubiana</surname>
          </string-name>
          ,
          <string-name>
            <surname>D. Das</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Mietchen</surname>
          </string-name>
          ,
          <article-title>Using logical constraints to validate statistical information about covid-19 in collaborative knowledge graphs: the case of wikidata</article-title>
          ,
          <source>Zenodo</source>
          (
          <year>2021</year>
          ). doi:
          <volume>10</volume>
          .5281/zenodo.4008358.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>N.</given-names>
            <surname>Fiorini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Canese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Starchenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Kireev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Miller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Osipov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kholodov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ismagilov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mohan</surname>
          </string-name>
          , et al.,
          <article-title>Best match: New relevance search for pubmed</article-title>
          ,
          <source>PLOS Biology 16</source>
          (
          <year>2018</year>
          )
          <article-title>e2005343</article-title>
          . doi:
          <volume>10</volume>
          .1371/journal.pbio.
          <volume>2005343</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>L.</given-names>
            <surname>Egghe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rousseau</surname>
          </string-name>
          ,
          <article-title>Theory and practice of the shifted lotka function</article-title>
          ,
          <source>Scientometrics</source>
          <volume>91</volume>
          (
          <year>2012</year>
          )
          <fpage>295</fpage>
          -
          <lpage>301</lpage>
          . doi:
          <volume>10</volume>
          .1007/s11192-011-0539-y.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>N.</given-names>
            <surname>Cristianini</surname>
          </string-name>
          , E. Ricci, Support Vector Machines,
          <string-name>
            <surname>Springer</surname>
            <given-names>US</given-names>
          </string-name>
          , Boston, MA,
          <year>2008</year>
          , pp.
          <fpage>928</fpage>
          -
          <lpage>932</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-0-
          <fpage>387</fpage>
          -30162-4_
          <fpage>415</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>T.</given-names>
            <surname>Joachims</surname>
          </string-name>
          ,
          <article-title>Text categorization with support vector machines: Learning with many relevant features</article-title>
          , in: C.
          <string-name>
            <surname>Nédellec</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          Rouveirol (Eds.),
          <source>Machine Learning: ECML-98</source>
          , Springer Berlin Heidelberg, Berlin, Heidelberg,
          <year>1998</year>
          , pp.
          <fpage>137</fpage>
          -
          <lpage>142</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>A.</given-names>
            <surname>Ben Abacha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Zweigenbaum</surname>
          </string-name>
          ,
          <article-title>A hybrid approach for the extraction of semantic relations from MEDLINE abstracts</article-title>
          ,
          <source>in: Computational Linguistics and Intelligent Text Processing</source>
          , Springer Berlin Heidelberg,
          <year>2011</year>
          , pp.
          <fpage>139</fpage>
          -
          <lpage>150</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>642</fpage>
          -19437-5_
          <fpage>11</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>T.</given-names>
            <surname>Mavropoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Liparas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Symeonidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vrochidis</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Kompatsiaris</surname>
          </string-name>
          ,
          <article-title>A hybrid approach for biomedical relation extraction using finite state automata and random forest-weighted fusion</article-title>
          , in: A.
          <string-name>
            <surname>Gelbukh</surname>
          </string-name>
          (Ed.),
          <source>Computational Linguistics and Intelligent Text Processing</source>
          , Springer International Publishing, Cham,
          <year>2018</year>
          , pp.
          <fpage>450</fpage>
          -
          <lpage>462</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>A. W.</given-names>
            <surname>Muzafar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Azam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Qamar</surname>
          </string-name>
          ,
          <article-title>A relation extraction framework for biomedical text using hybrid feature set</article-title>
          ,
          <source>Computational and Mathematical Methods in Medicine</source>
          <year>2015</year>
          (
          <year>2015</year>
          )
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          . doi:
          <volume>10</volume>
          .1155/
          <year>2015</year>
          /910423.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>D.</given-names>
            <surname>Zeng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <article-title>Relation classification via convolutional deep neural network</article-title>
          ,
          <source>in: Proceedings of COLING</source>
          <year>2014</year>
          ,
          <source>the 25th International Conference on Computational Linguistics: Technical Papers</source>
          , Dublin City University and Association for Computational Linguistics, Dublin, Ireland,
          <year>2014</year>
          , pp.
          <fpage>2335</fpage>
          -
          <lpage>2344</lpage>
          . URL: https://aclanthology. org/C14-1220.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <surname>C. dos Santos</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Xiang</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Zhou</surname>
          </string-name>
          ,
          <article-title>Classifying relations by ranking with convolutional neural networks</article-title>
          ,
          <source>in: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing</source>
          (Volume
          <volume>1</volume>
          :
          <string-name>
            <surname>Long</surname>
            <given-names>Papers)</given-names>
          </string-name>
          ,
          <source>Association for Computational Linguistics</source>
          , Beijing, China,
          <year>2015</year>
          , pp.
          <fpage>626</fpage>
          -
          <lpage>634</lpage>
          . doi:
          <volume>10</volume>
          .3115/v1/
          <fpage>P15</fpage>
          -1061.
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Peng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <article-title>Deep learning for extracting protein-protein interactions from biomedical literature</article-title>
          ,
          <source>in: BioNLP</source>
          <year>2017</year>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , Vancouver, Canada„
          <year>2017</year>
          , pp.
          <fpage>29</fpage>
          -
          <lpage>38</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>W17</fpage>
          -2304.
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>A.</given-names>
            <surname>Rios</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kavuluru</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <article-title>Generalizing biomedical relation classification with neural adversarial domain adaptation</article-title>
          ,
          <source>Bioinformatics</source>
          <volume>34</volume>
          (
          <year>2018</year>
          )
          <fpage>2973</fpage>
          -
          <lpage>2981</lpage>
          . doi:
          <volume>10</volume>
          .1093/ bioinformatics/bty190.
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>J.</given-names>
            <surname>Heaton</surname>
          </string-name>
          , Introduction to Neural
          <source>Networks for Java, 2nd Edition</source>
          , 2nd ed.,
          <source>Heaton Research</source>
          , Inc.,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>D.</given-names>
            <surname>Stathakis</surname>
          </string-name>
          ,
          <article-title>How many hidden layers</article-title>
          and nodes?,
          <source>International Journal of Remote Sensing</source>
          <volume>30</volume>
          (
          <year>2009</year>
          )
          <fpage>2133</fpage>
          -
          <lpage>2147</lpage>
          . URL: https://doi.org/10.1080/01431160802549278. doi:
          <volume>10</volume>
          .1080/ 01431160802549278. arXiv:https://doi.org/10.1080/01431160802549278.
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <surname>H. B. Demuth</surname>
            ,
            <given-names>M. H.</given-names>
          </string-name>
          <string-name>
            <surname>Beale</surname>
          </string-name>
          ,
          <string-name>
            <surname>O. De Jess</surname>
          </string-name>
          , M. T. Hagan, Neural Network Design, 2nd ed.,
          <string-name>
            <surname>Martin</surname>
            <given-names>Hagan</given-names>
          </string-name>
          , Stillwater,
          <string-name>
            <surname>OK</surname>
          </string-name>
          , USA,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>A. F.</given-names>
            <surname>Agarap</surname>
          </string-name>
          ,
          <article-title>Deep learning using rectified linear units (relu</article-title>
          ),
          <year>2018</year>
          . arXiv:
          <year>1803</year>
          .08375.
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>L.</given-names>
            <surname>Mou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Meng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Yan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Jin</surname>
          </string-name>
          ,
          <article-title>How transferable are neural networks in NLP applications?</article-title>
          ,
          <source>in: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing</source>
          , Association for Computational Linguistics, Austin, Texas,
          <year>2016</year>
          , pp.
          <fpage>479</fpage>
          -
          <lpage>489</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>D16</fpage>
          -1046.
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>A.</given-names>
            <surname>Krizhevsky</surname>
          </string-name>
          , I. Sutskever,
          <string-name>
            <given-names>G. E.</given-names>
            <surname>Hinton</surname>
          </string-name>
          ,
          <article-title>Imagenet classification with deep convolutional neural networks</article-title>
          , in: F. Pereira,
          <string-name>
            <given-names>C. J. C.</given-names>
            <surname>Burges</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bottou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. Q.</given-names>
            <surname>Weinberger</surname>
          </string-name>
          (Eds.),
          <source>Advances in Neural Information Processing Systems</source>
          , volume
          <volume>25</volume>
          ,
          <string-name>
            <surname>Curran</surname>
            <given-names>Associates</given-names>
          </string-name>
          , Inc.,
          <year>2012</year>
          . URL: https://proceedings.neurips.cc/paper/2012/file/ c399862d3b9d6b76c8436e924a68c45b-Paper.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>S.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Tang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Drug-drug interaction extraction via convolutional neural networks</article-title>
          ,
          <source>Computational and Mathematical Methods in Medicine</source>
          <year>2016</year>
          (
          <year>2016</year>
          )
          <article-title>6918381</article-title>
          . doi:
          <volume>10</volume>
          .1155/
          <year>2016</year>
          /6918381.
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>C.</given-names>
            <surname>Quan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Hua</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Sun</surname>
          </string-name>
          , W. Bai,
          <article-title>Multichannel convolutional neural network for biological relation extraction</article-title>
          ,
          <source>BioMed Research International</source>
          <year>2016</year>
          (
          <year>2016</year>
          )
          <article-title>1850404</article-title>
          . doi:
          <volume>10</volume>
          .1155/
          <year>2016</year>
          /1850404.
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>S.</given-names>
            <surname>Iofe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Szegedy</surname>
          </string-name>
          ,
          <article-title>Batch normalization: Accelerating deep network training by reducing internal covariate shift</article-title>
          , in: F.
          <string-name>
            <surname>Bach</surname>
          </string-name>
          , D. Blei (Eds.),
          <source>Proceedings of the 32nd International Conference on Machine Learning</source>
          , volume
          <volume>37</volume>
          <source>of Proceedings of Machine Learning Research</source>
          , PMLR, Lille, France,
          <year>2015</year>
          , pp.
          <fpage>448</fpage>
          -
          <lpage>456</lpage>
          . URL: https://proceedings.mlr.press/v37/iofe15.html.
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Wallace</surname>
          </string-name>
          ,
          <article-title>A sensitivity analysis of (and practitioners' guide to) convolutional neural networks for sentence classification</article-title>
          ,
          <source>in: Proceedings of the Eighth International Joint Conference on Natural Language Processing</source>
          (Volume
          <volume>1</volume>
          :
          <string-name>
            <surname>Long</surname>
            <given-names>Papers)</given-names>
          </string-name>
          ,
          <source>Asian Federation of Natural Language Processing</source>
          , Taipei, Taiwan,
          <year>2017</year>
          , pp.
          <fpage>253</fpage>
          -
          <lpage>263</lpage>
          . URL:
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>