<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Philipp Scharpf</string-name>
          <email>philipp.scharpf@uni-konstanz.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Moritz Schubotz</string-name>
          <email>moritz.schubotz@fiz-karlsruhe.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bela Gipp</string-name>
          <email>gipp@uni-wuppertal.de</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>FIZ Karlsruhe</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Konstanz</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Wuppertal</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Documents from Science, Technology, Engineering, and Mathematics (STEM) disciplines usually contain a signi cant amount of mathematical formulae alongside text. Some Mathematical Information Retrieval (MathIR) systems, e.g., Mathematical Question Answering (MathQA), exploit knowledge from Wikidata. Therefore, the mathematical information needs to be stored in items. In the last years, there have been e orts to de ne several properties and seed formulae together with their constituting identi ers into Wikidata. This paper summarizes the current state, challenges, and discussions related to this endeavor. Furthermore, some data mining methods (supervised formula annotation and concept retrieval) and applications (question answering and classi cation explainability) of the mathematical information are outlined. Finally, we discuss community feedback and issues related to integrating Mathematical Entity Linking (MathEL) into Wikidata and Wikipedia, which was rejected in 33% and 12% of the test cases, for Wikidata and Wikipedia respectively. Our long-term goal is to populate Wikidata, such that it can serve a variety of automated math reasoning tasks and AI systems.</p>
      </abstract>
      <kwd-group>
        <kwd>Wikidata</kwd>
        <kwd>Mathematical Information Retrieval</kwd>
        <kwd>Mathematical Entity Linking</kwd>
        <kwd>Mathematical Question Answering</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Mathematical Information Retrieval (MathIR) systems, such as Document
Recommender (DocRec), Mathematical Question Answering (MathQA), and
Automatic Document Classi cation (ADC), need to process and query
mathematical formulae. Since Wikidata has been proven useful as a semantic grounding
database for Natural Language Processing (NLP) approaches and applications,
it was consequential to transfer and adapt classical IR and NLP methods to the
special case of mathematical knowledge. In 2016, we implemented support for
mathematical properties, such as `de ning formula' (P2534), which were
pro? Copyright © 2021 for this paper by its authors. Use permitted under Creative
Commons License Attribution 4.0 International (CC BY 4.0). This work was supported
by DFG grant GI-1259-1.
posed4 and used5. Later, additional properties to include the semantics of the
formula identi ers were added 6.</p>
      <p>Our long-term goal is to build math.wikipedia.org, a large collaborative,
semiformal, machine-readable, language-independent mathematics encyclopedia. Its
purpose will be to provide the backbone for automated reasoning tasks, concept
entity linking, knowledge-graph population, question answering, and more.
#R e t r i e v e a l l i t e m s w i t h ` d e f i n i n g f o r m u l a ' p r o p e r t y P2534
SELECT ? f o r m u l a WHERE f</p>
      <p>? i t e m wdt : P2534 ? f o r m u l a .
g
g
4 https://www.wikidata.org/w/index.php?title=Property:P2534&amp;oldid=303933381
5 https://www.wikidata.org/w/index.php?title=Q35875&amp;oldid=303968820
6 https://www.wikidata.org/w/index.php?title=Property:P4934&amp;oldid=646697942
7 https://query.wikidata.org</p>
      <p>The remainder of this paper is structured as follows. In Section 2, we
describe how the knowledge can be distilled by annotating mathematical
documents (papers, articles, etc.). We show how this can be accelerated using an
annotation recommender system. In Section 3, we present standards and
systems for benchmarking the knowledge for Mathematical Information Retrieval
(MathIR) experiments. Mathematical Entity Retrieval and Linking methods are
introduced, and community feedback on incorporating MathEL data into
Wikidata and Wikipedia is discussed. Section 4 outlines MathQA and DCE as two
example applications of MathEL and concludes with an outlook to challenges
and future work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Mathematical Entity Annotation</title>
      <p>The process of Mathematical Entity Linking can be comprised of 1)
Mathematical Entity Annotation and 2) Mathematical Entity Retrieval. In this chapter,
we start with 1) by presenting approaches for document annotation and its
acceleration by annotation recommendation.
2.1</p>
      <sec id="sec-2-1">
        <title>Document Annotation</title>
        <p>
          Document annotations are generally employed to provide additional
information about a resource (e.g., comments) or to link resources (e.g., to URLs). The
Web Annotation Data Model8 speci es the annotation model structure (id, type,
property, relationship) in JSON format. Moreover, RDF classes and ontologies
should be de ned and serialized according to the Web Annotation Vocabulary9.
Several annotation tools and recommender systems for linked data have been
developed so far. Tietz et al. present a system for Wordpress [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ] that recommends
DBpedia resources and visualizes the annotation process. Users can explore
background information and relationships between named entities. Vagliano et al.
provide a technical report [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ] on semantic annotation of user reviews using
DBpedia and Wikidata. Purwitasari et al. introduce an ontology-based annotation
8 https://www.w3.org/TR/annotation-model
9 https://www.w3.org/TR/annotation-vocab
recommender for learning material [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] using Latent Semantic Analysis (LSA)
and WordNet to determine the context of content categories, which are then
structured into an ontology model. Lastly, Wiesing et al. developed an RDF
annotation tool (KAT) speci c for STEM documents in XHTML format [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ].
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Annotation Recommendation</title>
        <p>
          To disambiguate and match mathematical expressions in Wikipedia articles to
Wikidata items [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ], the `AnnoMathTeX' formula and identi er annotation
recommender system10 was developed. The system is designed to suggest
Wikidata item name and QID candidates provided from several sources, such as the
arXiv11, Wikipedia, Wikidata, or the text that surrounds the formula. In the rst
evaluation, it could be shown that 78% of the identi er name recommendations
were accepted by the user. In additional experiments, the community acceptance
of the Wikipedia article link and Wikidata item seed edits was assessed [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. For
88% of the edited Wikipedia articles and 67% of the Wikidata items, the
contributions were accepted. Moreover, the annotation could be accelerated by a
speedup of factor 1.4 for formulae and 2.4 for identi ers. The `AnnoMathTeX'
system is ready to be integrated seamlessly into the Wikimedia user interfaces
via a `MathWikiLink' API.
        </p>
        <p>
          We presented the system with its applications at the Wikiworkshop21
(WWW21 conference) [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. Figure 2 shows the User Interface of the
`AnnoMath10 https://annomathtex.wm abs.org
11 https://arxiv.org
TeX' system at the start, where Wikipedia Wikitext or arXiv LATEX articles can
be selected, loaded, and deleted.
        </p>
        <p>If the user clicks on a formula or identi er in the loaded document ,
recommendations are displayed as shown in Figure 3 for the example formula F = m a,
which is seeded into Wikidata as the item `Newton's second law of motion'
(Q3268014).</p>
        <p>
          Figure 4 shows an example where the formula name recommendation is very
speci c within a concept hierarchy.
The purpose of the rst testing phase of the system was to elaborate on how
the mathematics knowledge contained in Wikipedia articles can be transferred to
Wikidata statements. For the annotation, we developed the following annotation
rules or guidelines:
{ Annotate identi ers rst, such that the formula name recommendation
retrieval from Wikidata via the `has part' properties is enabled;
{ Do not annotate identi er describing objects, such as `gas', `solid', `line'
instead of quantities or constants;
{ Ignore derivative d characters, such as in d=dt and all indices (superscript or
subscript);
{ Locally di erent meanings of the same identi er within an article should be
avoided (appeal to editors);
{ Ignore block-level formulae that are not relations (equations, inequations,
etc.) or do not have a single identi er right-hand side, e.g., Pi Ii = Pi ri2mi,
0 = :::, dE = :::. Also, ignore formulae in tables, and derivations;
{ Proper names (e.g., `Planck constant') must be capitalized according to the
conventions from `Content dictionary description' (DRMF) [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
        </p>
        <p>During the annotation process, we discovered the following issues:
{ It is not possible to parse equations with no spaces between identi ers, e.g.,
in the right-hand side of the LATEX string `L = rmv';
{ There are di erent common practices to denote vectors in LATEX, e.g., \vec
vs. \mathbf;
{ There are di erent common practices for properties in Wikidata to include
the semantics of the formula elements or identi er, e.g., `has part' (P527)
`calculated from' (P4934) - see the discussion in Section 3.3;
{ Sometimes two names are both commonly used to denote the same Formula
Concept, e.g., `M-sigma relation' (Q3424023) and `Faber{Jackson relation'
(Q1390162);
{ In case the Wikidata item for a Formula Concept was missing, and we had
to create it, we needed to reinsert the new QID into the annotations table
manually.</p>
        <p>In the future, the process of discovering new issues and requirements to
improve the system and extend the annotation guidelines will be continued.
Wikimedia users can collaboratively contribute to this joint endeavor.
3
3.1</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Mathematical Entity Linking</title>
      <sec id="sec-3-1">
        <title>Mathematical Entity Benchmarking</title>
        <p>
          The open-source and open access formula benchmark system MathMLben12 was
introduced to facilitate the conversion between di erent mathematical formats
such as LaTeX variations and Computer Algebra Systems (CAS) [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. Figure
5 shows the Graphical User Interface (GUI) of the system, displaying the
expression tree of an example formula. Each formula identi er can be annotated
with Wikidata QID macros. The annotation functionality was motivated by the
potential to de ne semantic relatedness for formulae by counting Wikidata links
12 https://mathmlben.wm abs.org
between them [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. The MathMLben database contains 375 expressions or
formulae (GoldIDs) from Wikipedia, the arXiv, and the Digital Library of
Mathematical Functions (DLMF). The content is ranging from individual symbols
to complex multi-line formulae. It additionally contains meta-information, such
as the source URL or document page it is retrieved from. Expressions 1 to 100
are random samples taken from the National Institute of Informatics Testbeds
and Community for Information access Research Project (NTCIR) 11/12 Math
Wikipedia Task [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]. Expressions 101 to 200 are random samples taken from the
NIST Digital Library of Mathematical Functions (DLMF) [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] available on the
website https://dlmf.nist.gov containing around 10.000 labeled LaTeX formulae
with semantic markup classi ed in 36 categories [
          <xref ref-type="bibr" rid="ref2 ref4">2, 4</xref>
          ]. Expressions 201 to 305
were selected from the NTCIR arXiv and NTCIR-12 Wikipedia dataset retrieval.
70 % of these formulae were taken from the arXiv and 30 % from a Wikipedia
dump. The remaining formulae were extracted from an annotation of 25 selected
Wikipedia articles from physics (classical mechanics) [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ].
        </p>
        <p>
          For each Gold ID entry or formula, there is an input eld for the Formula
Name, Formula Type (de nition, equation, relation or general formula),
Original Input TeX and manually Corrected TeX together with a Hyperlink to
the source. The Semantic LaTeX Input eld is used for the semantic
annotations, as a grounding for the generation of Content MathML with Wikidata
annotations by LaTeXML [
          <xref ref-type="bibr" rid="ref5 ref9">9, 5</xref>
          ]. The corrected TeX is rendered in real time by
Mathoid [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]. Moreover, an expression tree is displayed, rendered by our
visualization tool VMEXT [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. For each symbol in the tree, the assigned annotation is
shown as a yellow mouse-over infobox containing the Wikidata QID, name, and
description (if available). The system includes a user guide on how to access raw
data or contribute by extending or correcting the expression tree or (Wikidata)
annotations.
3.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Formula Concept Seeding and Retrieval</title>
        <p>
          In 2018, we rst introduced linking mathematical formula content to Wikidata,
both in MathML and LATEX markup [
          <xref ref-type="bibr" rid="ref14 ref19">19, 14</xref>
          ]. In 2019, we called out for a `Formula
Concept Discovery (FCD) and Formula Concept Recognition (FCR) challenge'
to elaborate automated Mathematical Entity Linking. For our FCD approach, we
could achieve a recall of 68% for retrieving equivalent representations of frequent
formulae and 72% for extracting the formula name (assigned to a Wikidata
item) from the surrounding text on the NTCIR arXiv dataset [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]. We de ned
a `Formula Concept' as a `labeled collection of mathematical formulae that are
equivalent but have di erent representations through notation, e.g., the use of
di erent identi er symbols or commutations' [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. For example, the formula E =
mc2 can be regarded as being one representation of the Formula Concept labeled
`mass-energy equivalence'. A di erent representation of this same concept with
di erent notation and rearrangement could be = =c2.
        </p>
        <p>The following snipped exempli es how Einstein's famous formula E = mc2,
the item `mass-energy equivalence' (Q35875) can be found via a SPARQL query.
Based on the snippet, a formula search engine on Wikidata can be implemented.
#R e t r i e v e a l l i t e m s with l a b e l , d e s c r i p t i o n ,
and formula , whose d e f i n i n g f o r m u l a p r o p e r t y ( P2534 )
c o n t a i n s t h e s t r i n g `E=mc^ 2 '</p>
        <p>SELECT ? item ? i t e m L a b e l ? i t e m D e s c r i p t i o n
? d e f i n i n g F o r m u l a
WHERE f
? item wdt : P2534 ? d e f i n i n g F o r m u l a ;
FILTER( c o n t a i n s ( ? d e f i n i n g F o r m u l a , `E=mc^ 2 ' @en ) )
SERVICE w i k i b a s e : l a b e l
g</p>
        <p>g</p>
        <p>Figure 6 illustrates how to make use of the `has part' (P527) property to get
all items with formula whose identi ers are annotated as `energy' (Q11379) and
`speed of light' (Q2111). Based on the snippet, a semantic formula search engine
on Wikidata can be implemented.
3.3</p>
      </sec>
      <sec id="sec-3-3">
        <title>Community Feedback on Wikidata and Wikipedia</title>
        <p>
          In [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] we presented the evaluation of our AnnoMathTeX formula and identi er
annotation recommender system on a selection of 25 Wikipedia articles from
physics. The linked formula concepts were seeded into Wikidata and persisted
in our formula benchmark system MathMLben (see Section 3.1).
SELECT ?item ?itemLabel ?itemDescription
WHERE {
?item wdt:P527 wd:Q11379.
?item wdt:P527 wd:Q2111
SERVICE wikibase:label
{ bd:serviceParam wikibase:language "en" .} }
        </p>
        <p>The formula linkings from the annotated Wikipedia articles were included
in the Wikitext via qid attribute of the &lt;math&gt; tag. After uploading the edited
articles to Wikipedia, the following issues were pointed out by the community:
{ One community member responded that Wikidata should be usable
independently of Wikipedia, not having to comply with the technical requirements
for the special page display.
{ It was pointed out that currently, for the Wikdata items that have a `de ning
formula' (P2534), the use of the `calculated from' (P4934) property is much
higher than `has part' (P527). The claim was that `has part' is only a relict
from the past, which will not be used anymore for newly populated items.
{ Studying some sample equations with `calculated from' properties, another
user found that for `Gauss's law for magnetism' (Q1195250) with the formula
r B = 0 calculated from `magnetic eld' B and `divergence' r does not
make sense. On the other hand, it was asked if `length' or `time' was indeed
a `part of' `acceleration' ? In summary, concerns about the general validity
of both properties were expressed.
{ Furthermore, the coexistence and di erent bene ts of the properties
`quantity symbol (string)' (P416), `quantity symbol (LaTeX)' (P7973), and `de
ning formula' (P2534) for the subexpression strings were discussed.
{ One user argued that using the latter, only two properties (P2234 and P527)
would be needed to develop applications, such as the special page, which is
planned to be displayed as popup in the future13.
{ It was argued that the property P2234 would be general enough to be suitable
for longer expressions, such as f(x) or \exp, which are not just symbols.
{ Another user replied that one should distinguish between a de nition
(typically using an `=' sign) and citation (possibly using di erent symbols, such
as m or for `mass') of a quantity.</p>
        <p>Less than half a day after the execution of the script, two Wikipedia editors
started a discussion on our user's talk page14, pointing out the following issues:
13 https://phabricator.wikimedia.org/T208758
14 https://en.wikipedia.org/wiki/User talk:PhilMINT
{ The `de ning formula' property of the corresponding Wikidata item is edited
and evolving independently from the formula strings linked in the Wikipedia
articles;
{ The Wikidata items need to be very speci c to account for a particular
formula. See for example `kinetic energy' (Q46276): T = 12 mv2 vs. `kinetic
energy of rotating body' (Q104145205): Er = 12I!2;
{ If a Wikidata item has two or more `de ning formula' properties, only the
rst is displayed in the `Special page' (even if the `has part' identi er
annotations refer to another). This is problematic, e.g., with `Hooke's law'
(Q170282), which currently has both F = kX and = E" as `de ning
formula';
{ Sometimes disambiguation is needed to distinguish the physics terms from
other word meanings, e.g., `work' (Q42213) with description: `energy
transferred to an object via the application of force on it through a displacement'
vs. `work' (Q6958747): `particular form of activity, sold by many people to
sustain themselves';
{ In the `Equations of motion' article, a one-line formula includes three sub
formulae with di erent meanings that would need three di erent QIDs;
{ It should be possible for a Wikipedia reader to edit the formula and
identi ers directly on the special page, such that the changes get transferred to
Wikidata;
{ The special pages and corresponding Wikidata items should have a `what
links here' link, providing a list of every page with a Wikilink to this page.
This way, dependencies can be analyzed, and editors warned.</p>
        <p>In summary, issues with the formula data representation in both Wikipedia
and Wikidata as well as their exchange communication were identi ed. A
subsequent discussion on the issue to nd an appropriate property for the identi er
semantics was started15.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Applications and Outlook</title>
      <p>In this nal section, we outline some applications of Mathematical Entity Linking
(with Wikidata) and conclude with an outlook to future work potential.
4.1</p>
      <sec id="sec-4-1">
        <title>Mathematical Question Answering</title>
        <p>
          Motivated by an increasing number of questions on Question Answering (QA)
platforms like Math Stack Exchange (MSE) [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], signifying a growing
information need to answer math-related questions, we developed a Mathematical
Question Answering (MathQA) system [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ]. Our open source and open data approach
retrieves mathematical formulae using their concept names or querying formula
identi er relationships from the Wikidata knowledge graph. Furthermore, we
15 https://www.wikidata.org/wiki/Wikidata:Property proposal/symbol represents
developed Unsupervised Formula Labeling (UFL) for semantic formula search
and question answering on the arXiv preprint repository and Wikipedia.
        </p>
        <p>
          Figure 7 shows the MathQA UI with an example relationship question. A
Question Parsing Module transforms natural language questions into a triple
representation. Subsequently, a Formula Retrieval Module queries the Wikidata
knowledge-base for the requested formula and presents the result to the user.
The user can subsequently choose values for the occurring variables and order
a calculation that is done by a Calculation Module. For the retrieved formula
match, the Wikidata source link is displayed and identi er names and constant
values (e.g., for the speed of light) are fetched from the respective items. The
system employs the formula properties that are discussed in Section 1.
Another application of MathEL can is document subject classi cation [
          <xref ref-type="bibr" rid="ref18 ref22">18, 22</xref>
          ],
which is essential for structuring (digital) libraries and allowing readers to search
for literature within a speci c eld. Supervised (automatic) or semi-supervised
(semi-automatic) Machine Learning algorithms can support human domain
expert classi ers by predicting subject classes for unclassi ed documents using
classi cation information of documents that are already labeled. While humans
can, in principle, explain their decisions, for machines, it is more di cult. This
shortcoming motivated explainable AI research to address the problem of
Machine Learning decisions being a black box [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. In 2016, `LIME' - a method for
`Local Interpretable Model-agnostic Explanations' was introduced to improve
human trust in a classi er [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. In 2019, `SHAP' - an approach to improve the
interpretability of tree-based models using game-theoretic Shapley values was
presented to enhance understanding global model structure based on combining
many local explanations [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. Both LIME and SHAP models are available
opensource and heavily employed. For the classi cation of natural language texts,
such as legal or medical documents, explainer approaches have already
successfully been applied. However, documents from Science, Technology, Engineering,
and Mathematics (STEM) disciplines are more di cult to tackle since they
contain a signi cant amount of mathematical formulae alongside text [
          <xref ref-type="bibr" rid="ref12 ref14">14, 12</xref>
          ].
        </p>
        <p>
          Mathematical Entity Linking can help to foster explainability for STEM
documents. We are currently working on rst approaches for STEM document
classi cation explainability using classical and mathematical Entity Linking [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ].
Mining a collection of documents from the arXiv preprint repository (NTCIR and
zbMATH dataset), we could show that mathematical entities have the potential
to provide high explainability as they are a crucial part of a STEM document.
Our full paper contribution has been submitted very recently and is currently
under review.
4.3
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>Conclusion and Future Work</title>
        <p>In this resource paper, we showed how Wikidata can be employed for the
semantic grounding of mathematical entities in documents. We introduced the
data model of mathematical statements and discussed the use of di erent
properties for the semantics of formula identi er parts. We presented some
example SPARQL queries to access the mathematical knowledge in Wikidata. Next,
we discussed how to obtain entity linking data via document annotation using
systems such as our `AnnoMathTeX' formula and identi er annotation
recommender system. We introduced guidelines for formula annotation and reported
issues occurring in the annotation process. Moreover, we presented
possibilities to benchmark Mathematical Entity Linking (MathEL) with Wikidata using
our `MathMLben' gold-standard UI. Next, we discussed community feedback
on Wikidata and Wikipedia on the usage of MathEL and the di erent
properties involved. Finally, we introduced two applications of MathEL: Mathematical
Question Answering (MathQA) and document classi cation explainability.</p>
        <p>
          Since mathematical Formula Concepts [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] link both mathematical and
natural language, MathEL can bridge two worlds and is thus a very valuable method
to foster the methodological understanding of mathematical knowledge.
Wikidata can help to achieve this by storing and linking both the concept name (with
QID) and the formula string, e.g., linking `mass-energy equivalence' (Q35875)
with E=mc^2. However, there are still numerous challenges to tackle. Di erent
symbols are used for constants and variables, such as m or for `mass'. Di erent
unit systems and substitutions are rendering some identi ers or terms implicit.
And other types of notational freedom (e.g., di erential vs. integral forms or
derivative notation) are making automated MathEL more di cult.
        </p>
        <p>Summarizing, our key ndings are 1) Wikidata is a valuable resource for
storing and retrieving mathematical knowledge; 2) The data model and
representation of mathematical statements still su ers from various issues, such
as disagreement on the use of speci c properties; 3) By creating a formula
semantics benchmark (MathMLben) and annotation recommender system
(AnnoMathTeX), we are able to foster reproducibility of MathIR experiments and
speed up the process of Wikidata knowledge-base and knowledge-graph
population; 4) The machine-interpretable mathematical knowledge can be exploited
by various MathIR AI systems, such as question answering or automated
mathematical reasoning.</p>
        <p>MathEL research is currently at an early stage, and there is still a lot of
questions and potential to explore. In the future, we will continue our research
on the discovery and recognition of Formula Concept as a basis for MathEL.
Moreover, we will promote applications, such as MathQA and STEM document
classi cation explainability, by extending our existing systems and approaches.
Furthermore, we will introduce new applications, such as our currently
developing `PhysWikiQuiz' physics question generation and interrogation system using
Wikidata. Finally, we will create and publish large MathEL benchmark datasets
and integrate annotation entity linking recommendations into the editing view
of Wikipedia articles.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Aizawa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kohlhase</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ounis</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          : NTCIR-11 math
          <article-title>-2 task overview</article-title>
          .
          <source>In: NTCIR. National Institute of Informatics (NII)</source>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Cohl</surname>
            ,
            <given-names>H.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McClain</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saunders</surname>
            ,
            <given-names>B.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          :
          <article-title>Digital repository of mathematical formulae</article-title>
          .
          <source>In: Conference on Intelligent Computer Mathematics (CICM)</source>
          , Coimbra, Portugal. pp.
          <volume>419</volume>
          {
          <issue>422</issue>
          (
          <year>2014</year>
          ). https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -08434-3 30
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Cohl</surname>
            ,
            <given-names>H.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Content dictionary description: select symbols from chapter 9 of the kls dataset in the drmf (</article-title>
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Cohl</surname>
            ,
            <given-names>H.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McClain</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saunders</surname>
            ,
            <given-names>B.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zou</surname>
            ,
            <given-names>C.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mohammed</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dano</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          :
          <article-title>Growing the digital repository of mathematical formulae with generic sources</article-title>
          . pp.
          <volume>280</volume>
          {
          <issue>287</issue>
          (
          <year>2015</year>
          ). https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -20615-8 18
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Ginev</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stamerjohanns</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kohlhase</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The latexml daemon: Editable math on the collaborative web</article-title>
          .
          <source>In: LWA</source>
          <year>2011</year>
          , Magdeburg, Germany. pp.
          <volume>255</volume>
          {
          <issue>256</issue>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Lozier</surname>
            ,
            <given-names>D.W.:</given-names>
          </string-name>
          <article-title>NIST digital library of mathematical functions</article-title>
          . Ann. Math. Artif. Intell.
          <volume>38</volume>
          (
          <issue>1-3</issue>
          ),
          <volume>105</volume>
          {
          <fpage>119</fpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Lundberg</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Erion</surname>
            ,
            <given-names>G.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>DeGrave</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prutkin</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nair</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Katz</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Himmelfarb</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bansal</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Explainable AI for trees: From local explanations to global understanding</article-title>
          .
          <source>CoRR</source>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Mahoney</surname>
            ,
            <given-names>C.J.</given-names>
          </string-name>
          , Zhang, J.,
          <string-name>
            <surname>Huber-Fli et</surname>
          </string-name>
          , N.,
          <string-name>
            <surname>Gronvall</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>A framework for explainable text classi cation in legal document review</article-title>
          .
          <source>In: IEEE BigData</source>
          . pp.
          <year>1858</year>
          {
          <year>1867</year>
          . IEEE (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>LaTeXML: A LATEX to XML converter</article-title>
          . http://dlmf.nist.gov/LaTeXML/, http://dlmf.nist.gov/LaTeXML/, accessed:
          <fpage>2018</fpage>
          -05-09
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Purwitasari</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yuniar</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yuhana</surname>
            ,
            <given-names>U.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Siahaan</surname>
            ,
            <given-names>D.O.</given-names>
          </string-name>
          :
          <article-title>Ontology-based annotation recommender for learning material using contextual analysis</article-title>
          .
          <source>In: Proceedings of the IETEC'11 Conference</source>
          , Kuala Lumpur, Malaysia. (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Ribeiro</surname>
            ,
            <given-names>M.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guestrin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>"why should I trust you?": Explaining the predictions of any classi er</article-title>
          .
          <source>In: KDD</source>
          . pp.
          <volume>1135</volume>
          {
          <fpage>1144</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Scharpf</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mackerracher</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Breitinger</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>AnnoMath TeX - a formula identi er annotation recommender system for STEM documents</article-title>
          .
          <source>In: RecSys</source>
          . pp.
          <volume>532</volume>
          {
          <fpage>533</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Scharpf</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cohl</surname>
            ,
            <given-names>H.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Towards formula concept discovery and recognition</article-title>
          .
          <source>In: BIRNDL@SIGIR. CEUR Workshop Proceedings</source>
          , vol.
          <volume>2414</volume>
          , pp.
          <volume>108</volume>
          {
          <fpage>115</fpage>
          .
          <string-name>
            <surname>CEUR-WS.org</surname>
          </string-name>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Scharpf</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Representing mathematical formulae in content mathml using wikidata</article-title>
          .
          <source>In: BIRNDL@SIGIR. CEUR Workshop Proceedings</source>
          , vol.
          <volume>2132</volume>
          , pp.
          <volume>46</volume>
          {
          <fpage>59</fpage>
          .
          <string-name>
            <surname>CEUR-WS.org</surname>
          </string-name>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Scharpf</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Fast linking of mathematical wikidata entities in wikipedia articles using annotation recommendation</article-title>
          .
          <source>In: Proceedings of the Web Conference (WWW)</source>
          <year>2021</year>
          . ACM / IW3C2 (April
          <year>2021</year>
          ). https://doi.org/10.1145/3442442.3452348
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Scharpf</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Towards explaining stem document classi cation using mathematical entity linking</article-title>
          .
          <source>arXiv preprint arXiv:2109.00954</source>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Scharpf</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Greiner-Petter</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ostendor</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teschke</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Arqmath lab: An incubator for semantic formula search in zbmath open? In: CLEF (Working Notes)</article-title>
          .
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <volume>2696</volume>
          .
          <string-name>
            <surname>CEUR-WS.org</surname>
          </string-name>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Scharpf</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Youssef</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hamborg</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meuschke</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Classi cation and clustering of arxiv documents, sections, and abstracts, comparing encodings of natural and mathematical language</article-title>
          .
          <source>In: JCDL</source>
          . pp.
          <volume>137</volume>
          {
          <fpage>146</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Greiner-Petter</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scharpf</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meuschke</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cohl</surname>
            ,
            <given-names>H.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Improving the representation and conversion of mathematical formulae by considering their textual context</article-title>
          .
          <source>In: JCDL</source>
          . pp.
          <volume>233</volume>
          {
          <fpage>242</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meuschke</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hepp</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cohl</surname>
            ,
            <given-names>H.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>VMEXT: A visualization tool for mathematical expression trees</article-title>
          .
          <source>In: CICM, Edinburgh</source>
          , UK. pp.
          <volume>340</volume>
          {
          <issue>355</issue>
          (
          <year>2017</year>
          ). https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -62075-6 24
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scharpf</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dudhat</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nagar</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hamborg</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gipp</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Introducing mathqa - A math-aware question answering system</article-title>
          .
          <source>Information Discovery and Delivery 42 - No. 4</source>
          ,
          <issue>214</issue>
          {
          <fpage>224</fpage>
          (
          <year>2019</year>
          ). https://doi.org/10.1108/IDD-06
          <string-name>
            <surname>-</surname>
          </string-name>
          2018-0022
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scharpf</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teschke</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          , Kuhnemund,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Breitinger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Gipp</surname>
          </string-name>
          ,
          <string-name>
            <surname>B.</surname>
          </string-name>
          :
          <article-title>Automsc: Automatic assignment of mathematics subject classi cation labels</article-title>
          .
          <source>In: CICM. Lecture Notes in Computer Science</source>
          , vol.
          <volume>12236</volume>
          , pp.
          <volume>237</volume>
          {
          <fpage>250</fpage>
          . Springer (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Schubotz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wicke</surname>
          </string-name>
          , G.:
          <article-title>Mathoid: Robust, scalable, fast and accessible math rendering for wikipedia</article-title>
          .
          <source>In: CICM. Lecture Notes in Computer Science</source>
          , vol.
          <volume>8543</volume>
          , pp.
          <volume>224</volume>
          {
          <fpage>235</fpage>
          . Springer (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Tietz</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Waitelonis</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Jager, J.,
          <string-name>
            <surname>Sack</surname>
          </string-name>
          , H.:
          <article-title>refer: a linked data based text annotation and recommender system for wordpress</article-title>
          .
          <source>In: International Semantic Web Conference (Posters &amp; Demos)</source>
          .
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <volume>1690</volume>
          . CEURWS.org (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Vagliano</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Monti</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scherp</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morisio</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Content recommendation through semantic annotation of user reviews and linked data. In: K-CAP</article-title>
          . pp.
          <volume>32</volume>
          :
          <issue>1</issue>
          {
          <issue>32</issue>
          :
          <article-title>4</article-title>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Wiesing</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schmoll</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>KAT: an annotation tool for STEM documents</article-title>
          . In: FM4M/MathUI/ThEdu/DP/WIP@CIKM.
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <volume>1785</volume>
          , pp.
          <volume>66</volume>
          {
          <fpage>72</fpage>
          .
          <string-name>
            <surname>CEUR-WS.org</surname>
          </string-name>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>