<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Evolving MathHub for Tetrapodal Math (Working Title)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Tom Wiesing</string-name>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <fpage>8</fpage>
      <lpage>12</lpage>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The STEM sciences – Science, Technology, Engineering and Mathematics – are moving to bigger and bigger
developments, ultimately facing problems such as the one-brain-barrier [
        <xref ref-type="bibr" rid="ref3">4</xref>
        ]. To overcome these limitations, we
must make use of computer support.
      </p>
      <p>In the following I will use mathematics as a working example. Of the STEM sciences it is one with the most
well-understood structure. I expect that the observations and results will generalize to all STEM sciences.</p>
      <p>
        We observe the following aspects of mathematical practice: (i) taking
existing mathematical knowledge and using it to make inferences; (ii) guiding
a reader by writing narrative texts about a specific topic; (iii)
generating a large set of information – such as return values of a particularly
interesting function – and tabulating it inside some form of database; and
(iv) making computations. We see these four as forming central aspects
of mathematics. Carette et al. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] arrange these in a tetrapodal structure and posit
that all four must be integrated by a mathematical ontology for effective
machine support. [Figure: the tetrapod, labeled Inference, Narration, Tabulation, Computation and Ontology.]
      </p>
      <p>To support each of these practices, different kinds of computer-provided mathematical services are required.
Services commonly work on objects ranging from low-level mathematical objects, such as numbers, via complex
objects, such as specific mathematical groups, to objects on a higher meta-level, such as mathematical
computations. Any particular service might treat specific objects as black boxes – knowing that something is an object but not
knowing any of its internal structure – or as white boxes – being fully aware of the semantics of the objects
and being able to infer all their properties. We call services of the former kind shallow mathematical
services and those of the latter kind deep mathematical services.</p>
      <p>Deep services also have to take context into account. For example, in certain cases an integer might only be
an integer, but in other cases the same integer might represent the value of a specific property of an object. This
leads to the orthogonal distinction between local services – those taking only the current document into account
– and global services – those taking an entire ontology of background knowledge or corpus of documents into
account.</p>
      <p>Consider on one hand a service that takes as input a simple mathematical term within a document, and then
navigates to the document defining this term. This service is global (because it takes the corpus as a whole
into account) and shallow (because it does not rely on the internal structure of the object). Consider on the
other hand a service that converts all physical quantity expressions (e.g. 10 km/h) from the imperial system to the
metric system. This service is local (because it only works on a particular expression) and deep (because for
each quantity expression it takes into account its internal structure consisting of value and units).</p>
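      <p>The two example services can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the corpus index, the document paths, the Quantity type and the function names definition_of and to_metric are hypothetical and are not part of MathHub or MMT.</p>
      <preformat>
```python
from dataclasses import dataclass

# Hypothetical corpus index: maps a term to the document that defines it.
# The paths are invented for illustration only.
CORPUS_INDEX = {
    "group": "algebra/groups.omdoc",
    "monoid": "algebra/monoids.omdoc",
}


def definition_of(term, index=CORPUS_INDEX):
    """Global and shallow: consults the whole corpus index,
    but treats the term as an opaque key."""
    return index.get(term)


@dataclass(frozen=True)
class Quantity:
    """A quantity expression with visible internal structure."""
    value: float
    unit: str


# Imperial unit mapped to (conversion factor, metric unit).
IMPERIAL_TO_METRIC = {
    "mi": (1.609344, "km"),
    "ft": (0.3048, "m"),
    "lb": (0.45359237, "kg"),
}


def to_metric(q):
    """Local and deep: needs only the expression itself,
    but inspects both its value and its unit."""
    if q.unit not in IMPERIAL_TO_METRIC:
        return q  # already metric or unknown: leave unchanged
    factor, metric_unit = IMPERIAL_TO_METRIC[q.unit]
    return Quantity(q.value * factor, metric_unit)
```
      </preformat>
      <p>In this sketch, definition_of never looks inside the term it is given but depends on an index over the whole corpus, whereas to_metric needs no context beyond the single expression but must understand how that expression is built.</p>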
      <p>Services that are shallow and local are easy to implement – requiring only application of standard software
engineering practices to quickly be usable on a researcher's machine – whereas deep and global services require
more infrastructure. To be able to support these more complex services, a highly semantic, highly structured
ontology is needed to bind the different mathematical aspects together.</p>
      <p>Copyright © by the paper’s authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).</p>
      <p>The classification of aspects of ‘doing mathematics’ and – based on that – the services sketched above give
a plausible, albeit theoretical, model for designing a system providing computer support for mathematics. To
validate this model, I want to build a concrete scalable implementation. This leads to the following research
questions:</p>
      <p>RQ1 Can an information system provide infrastructure to jointly support all four tetrapodal aspects of
mathematics?</p>
      <p>RQ2 Is it possible to support community-based workflows for authoring, maintaining, curating, and visualizing
tetrapodal mathematics?</p>
    </sec>
    <sec id="sec-2">
      <p>RQ3 What kind of ontology is needed for our model to work in practice?</p>
      <p>A positive answer to Question RQ1 – backed by an implementation – will demonstrate the feasibility of our
model. The aim of Question RQ2 is to guide any implementation towards practical applicability. In particular,
it is insufficient to build a system that is merely applicable to a specific subset of documents; instead, we
aim to make it broadly accessible to the mathematical community as a whole. We expect these
workflows to require an implementation that supports both deep and global services.</p>
      <p>Unlike the above, the answer to Question RQ3 is not yet clear. At first glance, one would expect any such
ontology to be fully formal, defining every part of mathematics in great detail. However, it might be possible to
rewrite mathematical services so that they do not need a complete formalization of every object in question. As an alternative,
a flexi-formalized ontology – defining only core concepts and providing human-readable descriptions for the rest
– might be sufficient<sup>1</sup>.</p>
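      <p>To make the idea of a flexi-formalized ontology more concrete, the following Python sketch models entries that always carry a human-readable description and only optionally a formal definition. Everything in it is an assumption made for illustration: the Concept type, the example entries, the function describe and the string used as a formal definition are hypothetical and do not reflect OMDoc or MMT syntax.</p>
      <preformat>
```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Concept:
    name: str
    description: str              # human-readable text, always present
    formal: Optional[str] = None  # formal definition, only for core concepts


# Hypothetical ontology: one formalized core concept, one informal one.
ONTOLOGY = [
    Concept(
        "group",
        "A monoid in which every element has an inverse.",
        formal="monoid(G) and forall x. exists y. x*y = e",  # illustrative only
    ),
    Concept(
        "sporadic group",
        "A finite simple group that belongs to none of the infinite families.",
    ),
]


def describe(concept):
    """Offer deep treatment where a formalization exists and
    fall back to shallow treatment of the description otherwise."""
    if concept.formal is not None:
        return ("deep", concept.formal)
    return ("shallow", concept.description)
```
      </preformat>
      <p>A service written against such entries can still offer something for every concept, which is the sense in which a complete formalization might not be needed.</p>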
      <p>
        Another possible requirement on the ontology might be to ensure that it is not system-specific. In particular,
is it possible for it to remain findable and accessible by other researchers, and to further system interoperability?
These desired properties are also known as FAIR [
        <xref ref-type="bibr" rid="ref8">9</xref>
        ].
      </p>
      <p>
        I want to base my system on MathHub, a portal for active mathematical documents, an archive for flexiformal
mathematics, and a frontend to mathematical databases initially conceived by Michael Kohlhase, Florian Rabe
and others in [
        <xref ref-type="bibr" rid="ref6">7</xref>
        ]. MathHub roughly consists of three parts: a) several archives of mathematical knowledge, each
consisting of a set of thematically related OMDoc [
        <xref ref-type="bibr" rid="ref4">5</xref>
        ] documents; b) an MMT backend [
        <xref ref-type="bibr" rid="ref7">8</xref>
        ] capable of providing
semantic services on these archives; and c) a Drupal [2] frontend presenting an interface to users.
      </p>
      <p>
        MathHub was initially implemented by Mihnea Iancu and others in the course of his PhD [
        <xref ref-type="bibr" rid="ref2">3</xref>
        ]. It provided
services for several of the above-defined aspects of mathematics. However, there are two major problems with this
implementation: a) it does not implement a tetrapodal system as a whole; and b) several technological problems
necessitate a redesign of parts of the system.
      </p>
      <p>To answer the research questions, I want to build on MathHub and develop the system further. On a
technical level, I want to overcome the two limitations above, and then focus on the following workflow-oriented
work packages that concretely instantiate the research questions:</p>
      <p>WP1 Which tetrapodal Active Document Services can be developed and efficiently integrated into MathHub to
make the system more user-friendly?</p>
      <p>WP2 Which processes are efficient at keeping MathHub content up to date and simultaneously scale to large
amounts of data?</p>
    </sec>
    <sec id="sec-3">
      <p>WP3 What kind of integrable search functionality is desirable for MathHub users?</p>
    </sec>
    <sec id="sec-4">
      <p>WP4 How can MathHub improve interoperability with external systems and services?</p>
      <p>WP1 addresses RQ1, WP2 and WP3 focus on RQ2, and WP4 focuses on the ontology in RQ3.</p>
      <p>[2] Drupal.org – Community plumbing. url: http://drupal.org (visited on 02/14/2015).</p>
      <p>
        <sup>1</sup> There are already some ideas of what such an ontology might look like in detail. We refer the interested reader to [
        <xref ref-type="bibr" rid="ref5">6</xref>
        ].
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Jacques</given-names>
            <surname>Carette</surname>
          </string-name>
          et al. “
          <article-title>Big Math and the One-Brain Barrier - A Position Paper and Architecture Proposal”</article-title>
          . submitted to Mathematical Intelligencer.
          <year>2019</year>
          . url: https://arxiv.org/abs/1904.10405.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Mihnea</given-names>
            <surname>Iancu</surname>
          </string-name>
          . “Towards Flexiformal Mathematics”.
          <source>PhD thesis</source>
          . Bremen, Germany: Jacobs University,
          <year>2017</year>
          . url: https://opus.jacobs-university.de/frontdoor/index/index/docId/721.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Michael</given-names>
            <surname>Kohlhase</surname>
          </string-name>
          . “
          <article-title>Mathematical Knowledge Management: Transcending the One-Brain-Barrier with Theory Graphs”</article-title>
          . In: EMS Newsletter (
          <year>June 2014</year>
          ), pp.
          <fpage>22</fpage>
          -
          <lpage>27</lpage>
          . url: https://kwarc.info/people/mkohlhase/papers/ems13.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Michael</given-names>
            <surname>Kohlhase</surname>
          </string-name>
          .
          <source>OMDoc - An open markup format for mathematical documents [Version 1.2]. LNAI 4180</source>
          . Springer Verlag, Aug.
          <year>2006</year>
          . url: http://omdoc.org/pubs/omdoc1.2.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Michael</given-names>
            <surname>Kohlhase</surname>
          </string-name>
          . “
          <article-title>The Flexiformalist Manifesto”</article-title>
          . In:
          <source>International Workshop on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC 2012)</source>
          . Ed. by Andrei Voronkov et al. Timisoara, Romania: IEEE Press,
          <year>2013</year>
          , pp.
          <fpage>30</fpage>
          -
          <lpage>36</lpage>
          . isbn: 978-1-4673-5026-6. url: http://kwarc.info/kohlhase/papers/synasc13.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Christoph</given-names>
            <surname>Lange</surname>
          </string-name>
          et al. “
          <article-title>The Planetary System: Executable Science, Technology, Engineering and Math Papers</article-title>
          ”. In:
          <source>The Semantic Web: Research and Applications (Part II). 8th Extended Semantic Web Conference (ESWC)</source>
          (Hersonissos, Crete, Greece, May 29-June 2,
          <year>2011</year>
          ). Ed. by Grigoris Antoniou et al.
          <source>LNCS 6644</source>
          . Heidelberg: Springer Verlag,
          <year>2011</year>
          , pp.
          <fpage>471</fpage>
          -
          <lpage>475</lpage>
          . isbn: 978-3-642-21033-4. arXiv: 1103.1482 [cs.DL].
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Florian</given-names>
            <surname>Rabe</surname>
          </string-name>
          and
          <string-name>
            <given-names>Michael</given-names>
            <surname>Kohlhase</surname>
          </string-name>
          . “
          <article-title>A Scalable Module System</article-title>
          ”. In:
          <source>Information &amp; Computation</source>
          <volume>0</volume>
          .230 (
          <year>2013</year>
          ), pp.
          <fpage>1</fpage>
          -
          <lpage>54</lpage>
          . url: http://kwarc.info/frabe/Research/mmt.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Mark D.</given-names>
            <surname>Wilkinson</surname>
          </string-name>
          et al. “
          <article-title>The FAIR Guiding Principles for scientific data management and stewardship”</article-title>
          .
          <source>In: Scientific Data</source>
          <volume>3</volume>
          (
          <year>2016</year>
          ). url: https://doi.org/10.1038/sdata.2016.18.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>