<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards a Census of Relational Data in Mathematics</article-title>
      </title-group>
      <abstract>
        <p>Research data are becoming ever more important in science as well as in the humanities. This is re ected in the various national and international initiatives that are aimed at developing and supporting data stewardship and the related area of knowledge management. It may come as a surprise to some that research data is experiencing a similar boom in mathematics. However, it could be argued that mathematics is lagging behind other disciplines in using the tools of the trade when it comes to data. This work-in-progress census aims to shed light on what a large class of mathematical datasets looks like. An increased understanding of data in mathematical research is an important step towards building better infrastructure for these data. The author would like to encourage authors and curators to contribute information about their datasets for future versions of this census.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Mathematicians have long been computing, collecting, and storing interesting,
often hard to obtain facts and used them as reference, source of examples and
counter examples, and generally to better understand the structure of objects
they study. The early examples were all obtained painstakingly by hand. One
such example is the computation of logarithm tables. Another example which
(at least partially) predates systematic use of computers, is the Foster census
of cubic symmetric graphs. This project was begun in 1930 and remarkably
contained nearly all cubic symmetric graphs of up to 512 vertices by the time
it was published in book form in 19881. More interesting early examples can be
found in a MathOver ow thread started by Gordon Royle [con].</p>
      <p>The book form was indeed the norm until the internet revolution. The Atlas
of Graphs [RW05] would probably have been published digitally if its creation
was shifted by only a few years forward. The rst two iterations of well known
On-Line Encyclopedia of Integer Sequences [OEIS] were published as books in
1973 and 1995.</p>
      <p>The circumstances that gave rise to the increasing importance of research
data in general have had a corresponding e ect on research data in
mathematics. With access to computing power, the complexity and size of the datasets
grew signi cantly. Mathematical datasets also found new uses, such as
algorithm benchmarking (an example here is the ARG database for benchmarking
of graph isomorphism algorithms [San+03]). This growth sparked a need for
tools to manage the data.</p>
      <p>In the scienti c community, we have seen the formation of FAIR
principles [Wil+16], which break down the vague concept of usefulness into properties
that form a basis for guidelines. The data should be ndable, accessible,
interoperable, and reusable. As stressed by the authors of the principles, these still
need to be adapted for the needs of speci c scienti c communities. In a work that
predates the FAIR principles (and even an earlier paper discussing similar ideas
about accessibility), Billey and Tenner [BT13] outlined a set of desirable
properties in a certain class of mathematical databases they call ngerprint databases
for theorems. Their properties had some overlap with the FAIR principles; in
particular, they require the databases (and their contents) to be citable via unique
identi ers. With Vidali, the author outlined some further recommendations for
mathematical databases as part of the work on the DiscreteZOO project [BV].
Data in mathematics is intrinsically linked with knowledge and as such,
managing if falls into the intersection of data stewardship and mathematical knowledge
management.</p>
      <p>Data in mathematics and the scope of this census Similar to data in general,
mathematical data appear in several forms. In ongoing joint work with Michael
Kohlhase and Florian Rabe [BKR], we propose a division of data in mathematics
into four categories (Figure 1).</p>
      <p>array
computation</p>
      <p>record
RELATIONAL</p>
      <p>SYMBOLIC
proof
model
metadata</p>
      <p>LINKED</p>
      <p>knowledge graph
MATHEMATICAL</p>
      <p>DATA</p>
      <p>verbalisations</p>
      <p>NARRATIVE
documents
notations</p>
      <p>Symbolic data (knowledge) are typically found in libraries such as the TPTP
Problem Library for Automated Theorem Proving [SS98]. In some sense, these
libraries are similar to corpora in linguistics. Linked data relates to data in library
science and ontologies in information science. A good example of narrative data
in mathematics is the repository of electronic preprints arXiv. Finally, relational
data are perhaps closest to what most people think of as data and are what this
census focuses on. Examples include lists of mathematical objects that could
be (or are) organised into a table, such as censuses of graphs, lists of integer
sequences, etc. This classi cation is not strict: for example, the OEIS ts most
neatly in the relational data type, but it includes references to theorems and
formulas for closed forms of the generating functions. The census mostly focuses
on the datasets themselves, however, we will also brie y mention some of the
systems that have been built for them.</p>
      <p>Limitations It turns out that nding out what datasets are out there, and what
they look like is challenging! With the exception of Billey and Tenner [BT13],
there is no literature about relational math datasets in general. Dataset
authors often describe a dataset in a paper. Such papers get lost in a
multitude of irrelevant results when searching for keywords such as \database" in
databases of mathematical literature like arXiv, MathSciNet and zbMATH. One
can nd some information on swMATH information service for mathematical
software [SWM] by browsing the types Data Collections and Services,
Webservices, and the special collection Math.Databases - MathDBS2. Another source
are computer algebra systems, which integrate some datasets (for example as
packages). Unfortunately, most datasets live only on their authors' websites and
are not indexed anywhere.</p>
      <p>As early work, the census is strongly in uenced by the author's area of
mathematics. The contents are skewed towards datasets that the author knew about
from the start, those discovered through word of mouth, and those datasets
mentioned on MathOver ow and swMATH. It is not exhaustive or nal in terms of
aspects examined, nor examples given. The di culty in obtaining information
is re ected in the uneven coverage of areas of mathematics and in the uneven
level of detail about speci c datasets. In particular, we have not yet collected
much information about datasets that are only available through computer
algebra systems. Similarly, we have focused on datasets that do not appear only
in commercial systems (such as Magma [BCP97] and Mathematica [Inc]).
Goals More work is needed before a more structured review can be attempted.
The work reported here is a necessary rst step, as it outlines the use of relational
data in mathematics. Within this larger picture, it aims to set up a foundation for
creating community guidelines for FAIR mathematics, and to serve as a reference
to anyone who needs to know how data in mathematics look like, such creators
of data frameworks for mathematics. Finally, this census aims to increase the
visibility of data in mathematics, and contribute towards better recognition of
the work that goes into constructing and collecting the datasets.
2 https://swmath.org/browse/types</p>
    </sec>
    <sec id="sec-2">
      <title>Description of Datasets and Systems</title>
      <p>This section aims to illustrate the diversity of the datasets via several aspects.
Throughout, we will use the word \dataset" quite loosely to encompass both
simple datasets, such as those containing a collection of objects or records with
the same structure, as well as organised collections of simple datasets.</p>
      <p>Relational data in mathematics are best characterised by the utilisation of
representation theorems that allow encoding mathematical objects as simple
data structures built from numbers, strings, lists and records. Such
representations can be quite far from the objects' semantic type. For example, polynomials
with integer coe cients can be encoded as lists of integers. Graphs can be
represented as adjacency or incidence matrices, or as adjacency or edge lists. These
can in turn be represented as arrays or strings (such as graph6 [McKb]) at the
database level. Further more, testing for graph isomorphism (not an uncommon
task in a database) is a hard problem in general and results such as canonical
forms [BL83] can be used in the encoding to help get around that di culty. Most
relational data appears as collections of concrete mathematical objects.
2.1</p>
      <p>Aspects
Structure Some datasets consist of simple lists of objects (such as Kohonen's
giant list of unlabeled lattices [FL]), while others are lists of records, each
consisting of an object, together with some of its mathematical invariants. Larger
projects can end up organising several simple datasets into a larger one, also
storing the interconnection. Arguably the largest such project is the L-functions
and Modular Forms Database (LMFDB [LM]) combines more than 30 datasets
that have arisen in the context of the Langlands program [Ber03], which explores
connections between number theory and geometry.</p>
      <p>Content organisation The contents of datasets obtained either by
(systematically) generating all objects that satisfy a given set of parameters, or by collecting
objects in some other way.</p>
      <p>An example of the former are some of the collections of highly symmetric
objects (such as graphs). Symmetric objects are quite rare compared to ones
without symmetry and obtaining a complete list of all objects up to a certain
size can take months.</p>
      <p>On the other hand, unsystematic collections of (rare or interesting) objects
and systematic collections of all objects of a speci c kind. The On-Line
Encyclopedia of Integer Sequences (OEIS [OEIS]) is an example of the former that
collects sequences of integers (such as the Fibonacci sequence 0, 1, 1, 2, 3, 5, 8,
13, etc.). Similarly, the stated goal of The House of Graphs [Bri+13] is \to nd a
workable de nition of 'interesting' and provide a searchable database of graphs
that conform to this de nition". Both of these provide a lot of information about
every object, including references to research papers.</p>
      <p>The generated collections typically have a small number of authors, while
the unsystematic collections tend to become a collaborative e ort.
Authorship The authorship varies widely.</p>
      <p>{ The majority of datasets has a single author or group of authors. These
datasets are often accompanied by a paper (or a small number of papers)
describing the mathematical background, generation, and contents.
{ Some datasets have a large number of contributors; these are typically the
unsystematic ones, for which a core group of authors contributed a
substantive part of the data, together with a large number of authors with
smaller contributions. In addition to the OEIS (with thousands of
contributors), the LMFDB (with a 100 contributors) and the House of Graphs,
Findstat [BSa14] (with 69 contributors) is such an example.
{ A somewhat special case are the combinatorial catalogues that can consist of
tens of lists of (combinatorial) objects. Some examples of these are catalogues
produced by McKay [McKa], Royle [Roy], and Wanless [Wan].</p>
      <p>Provenance The provenance of the dataset usually corresponds to its structure
and authorship. The datasets with a small number of authors are usually
produced via a small number of methods. We did not yet explore the provenance
of the larger datasets, especially the unsystematic ones. Important to note here
is that datasets can be built on top of other datasets. An example of this is the
Census of cubic vertex-transitive graphs [PSV13]. The authors split the graphs
into a few cases, each of which required a specialised method. One of these cases
was a dataset already generated in previous work.</p>
      <p>Infrastructure and Shareability The datasets are usually accessible either through
a website or indirectly through a computer algebra system. Exceptions to these
are especially older works, such as, the collection of graphs described in the
book Atlas of Graphs [RW05]. Many projects with a website also provide an
encyclopedic page for every object, and many researchers have commented that
this is an important feature.</p>
      <p>Especially the larger projects develop some infrastructure for the data,
possibly seeding it with the initial contents. The infrastructure then supports
contributions of objects (like the OEIS, the House of Graphs and Findstat), or lists of
objects. The latter, hosting lists of graphs, is the other stated goal of the House
of Graphs. Similarly, the Encyclopedia of Graphs [EG] is a rare example of an
online resource developed to help researchers nd and use data, without actually
producing any of its own datasets. It currently hosts about 30 datasets. Another
example is a more recent project DiscreteZOO [BV], which initially aimed to
support the community studying symmetric objects.</p>
      <p>An important example of a dataset that is only available via CAS is the Small
Groups Library [SGL] (in GAP, Magma). It relies on the system to compute a
signi cant part of the information on-the- y and thus only uses a little under a
bit per group. In a little under 80 MB the library stores enough information to
nd which of the over 400 million groups a group given by the user is isomorphic
to.</p>
      <p>A more typical situation is where a dataset is hosted with minimal
infrastructure on one of the authors' websites. The website is often browsable with
the browsing interface consisting of HTML tables. An illustrative example here
is the Census of edge transitive graphs [EET]. Wilson has the core information
about the census stored in a CAS, in which he has also written code that
produces the HTML for the website. Another such example is Michael Hartley's
atlas of abstract polytopes [AP]. Authors also often provide les with code for
the collection as an array in some computer algebra system.</p>
      <p>Metrics There is no standard measure for a size of a dataset. It is possible
to consider the compressed or uncompressed size on disk, the number or size
of objects, or the time (itself a problematic metric) it took to generate the
dataset, etc. The uncompressed size on disk can range from a few megabytes
to over a terabyte, or up to roughly 25 GB with heavy compression. There is
a small inverse correlation between the size (on disk or the number of objects)
and computational complexity of the process of generation. Kohonen's lattice
dataset appears to be a record holder with respect to size, with a few billions of
lattices.</p>
      <p>The number of users can be estimated through the number of citations, or the
number of citations of the corresponding paper for some projects. The number
of downloads would be interesting, but it appears that nearly no-one records it.
FAIR-related aspects For details about the FAIR principles, we refer the reader
to the GoFAIR website [GF].</p>
      <p>Metadata. Some details about a dataset are typically available on the same
website as the dataset, and most datasets have an accompanying paper.
Metadata are generally not structured and, with a few possible exceptions, do not
specify a license.</p>
      <p>Unique IDs and Findable. Many datasets have some sort of an unique ID for
all the objects. Some of the projects also provide some sort of a globally unique
ID (in the sense of a URL), but the persistence of it is bounded by the projects'
lifespan, as the URLs will expire if the website is decommissioned.</p>
      <p>ET [EET]
Findstat
OEIS</p>
      <p>C4[10, 1]
http://www.findstat.org/StatisticsDatabase/St000001/
https://oeis.org/A000045</p>
      <p>Accessible. While most datasets are in some way available online, the formats
are typically ad hoc. Almost all of the others are available through computer
algebra systems.</p>
      <p>Area of mathematics At least partly due to the author's home area, the
information collected so far has mostly been skewed towards combinatorics and
geometry. In addition to these, we have found datasets from number theory (LMFDB,
NFDB [NF]), group theory and algebra (the Small Groups Library [SGL], the
Graded Ring Database [GRD]), topology (the Knot Atlas [BNMa]), algebraic
geometry (the Toric Calabi-Yau Database [CY]), and probability
(Distributome [Dst]).
3</p>
    </sec>
    <sec id="sec-3">
      <title>Living census</title>
      <p>To facilitate the collection of information about data in mathematics, we set up
a database with a website frontend [Ber]. While it grew out of the necessity to
keep track of the information, it has at least two further goals. First, it aims
to make it easy for anyone to see what information has been collected so far.
Second, it aims to eventually make it easy to contribute information.</p>
      <p>The information about the datasets can be displayed a few di erent views
(with switching implemented through tabs): general information, information
about size, information pertaining to the FAIR principles, as well as some other
properties.</p>
      <p>The FAIR principles in particular are a little unwieldy to get an impression
about at a glance, which is why we devised simple diagrams (Figure 3) to aid
with that. The design of the diagrams is based o the fact that each of the four
principles ( ndable, accessible, interoperable, and reusable) is composed of 3-4
sub-principles, Findable (F1, F2, F3, and F4), Accessible (A1, A1.1, A1.2,
and A2), Interoperable (I1, I2, and I3), and Reusable (R1, R1.1, R1.2, and
R1.3) [Wil+16]. Each of these can be applied to (or not) to one of the three
layers of information about the dataset. These layers are not necessarily included
in the original FAIR principles, but it seems to be helpful to break information
down depending on whether it applies to
{ the dataset (D) itself (such as whether the dataset has its own globally
unique identi er or whether it is registered in a searchable resource),
{ the datum (A) (each of the objects needs its own globally unique identi er),
or
{ the metadata (M) (such as whether the metadata is accessible even after
the data are no longer available).</p>
      <p>The colour of each cell in the diagram corresponds to a value for a
subprinciplelayer pair: unknown (black), not considered (blank), mostly supported (green),
somewhat supported (yellow) and mostly unsupported (red).</p>
      <p>For example, let us consider F1 for FindStat [BSa14], the Combinatorial
Statistic Finder. The dataset (but not the metadata) is indexed in zbMATH
(Findable D, M). Each combinatorial statistic in the dataset has a unique
identi er, such as St0000813 and can be found through a search interface (Findable
A).
In some areas of mathematics, research products can consist of listings or
tabulations of complex mathematical objects and their properties. These datasets
can be later used by researchers to form or refute conjectures. The main result of
the work reported here are the beginnings of a census of existing mathematical
datasets of this kind, together with a classi cation of the entries along multiple
dimensions. As such, this work contributes to the eld of knowledge management
in areas of mathematics where research products can consist of such datasets,
such as experimental and computational mathematics.</p>
      <p>Currently, the census contains about 70 datasets from several areas of
mathematics. This includes links to dataset websites and author information for
3 http://www. ndstat.org/StatisticsDatabase/St000081/
(nearly) all of the datasets, as well as literature references, area of mathematics
and size-related information for many. Even this small sample shows large
variations in terms of structure, content organisation, provenance, infrastructure and
shareability, and size.</p>
      <p>Perhaps the most important immediate use for this census is as a
\market study" for a prototypal uni ed infrastructure for mathematical data,
MathDataHub [BKR19]. It serves as a source of use cases for the infrastructure, as
well as beginnings of a community of researchers that work with mathematical
data. Even in this initial stage, the census gives the developers of MathDataHub
some idea of the requirements for the system in terms of the ranges of dataset
size, complexity, etc.</p>
      <p>We will continue to gather information about the relational datasets in
mathematics in the living census website. One way to nd more datasets would be to
(in a way that is not yet clear) search for literature in all areas of mathematics
(but not computer science) with keywords \database", \atlas", \census" and
similar. Such a search currently does not appear to be supported by any of the
major databases of mathematical literature. Another large set of datasets that
has yet to be added to the census are datasets incorporated into the various
computer algebra systems.</p>
      <p>Finally, we plan to use the new information as a basis for a more structured
census.</p>
      <p>Acknowledgements The author gratefully acknowledges Tom Wiesing's help in
setting up the Django based living census website. The author would also
particularly like to thank Michael Kohlhase for suggesting the need for a census of
this type, as well as for regular constructive discussions. Finally, the author is
grateful to the many dataset authors who responded to questions about their
datasets and use of data. The work presented here was supported by the EU
grant Horizon 2020 ERI 676541 OpenDreamKit.
[AP]
[BCP97]
[Ber]
[Ber03]
[BKR]
[BKR19] Katja Bercic, Michael Kohlhase, and Florian Rabe. \Towards a
Unied Mathematical Data Infrastructure: Database and Interface
Generation". In: Intelligent Computer Mathematics (CICM) 2019. Ed.
by Cezary Kaliszyck et al. LNAI. in preparation. Springer, 2019,
pp. 28{43. url:
https://kwarc.info/kohlhase/papers/cicm19</p>
      <p>MDH.pdf.
[BL83] Laszlo Babai and Eugene M. Luks. \Canonical Labeling of Graphs".</p>
      <p>
        In: Proceedings of the Fifteenth Annual ACM Symposium on Theory
of Computing. STOC '83. New York, NY, USA: ACM, 1983, pp. 171{
183. isbn: 0-89791-099-0. doi: 10.1145/800061.808746.
[BNMa] Dror Bar-Natan, Scott Morrison, and et al. The Knot Atlas. url:
http://katlas.org
        <xref ref-type="bibr" rid="ref3">(visited on 06/18/2019)</xref>
        .
[Bri+13] Gunnar Brinkmann et al. \House of Graphs: a database of interesting
graphs". In: Discrete Appl. Math. 161.1-2 (2013), pp. 311{314. issn:
0166-218X. doi: 10.1016/j.dam.2012.07.018.
[BSa14] C. Berg, C. Stump, and al. FindStat: The Combinatorial Statistic
Finder. http://www.FindStat.org. [Online; accessed 31 August
2016]. 2014.
[BT13] Sara C. Billey and Bridget E. Tenner. \Fingerprint databases for
theorems". In: Notices Amer. Math. Soc. 60.8 (2013), pp. 1034{1039.
      </p>
      <p>
        issn: 0002-9920. doi: 10.1090/noti1029.
[BV] Katja Bercic and Janos Vidali. \DiscreteZOO: a Fingerprint Database
of Discrete Objects". accepted. url: https : / / arxiv . org / pdf /
1812.05921.pdf.
[con] MathOver ow contributors. What are some early examples of
creation of lists / catalogues of (particularly) combinatorial objects?
MathOver ow. url: https : / / mathoverflow . net / questions /
47044 / what - are - some - early - examples - of - creation - of
lists-catalogues-of-particularly (visited on 11/20/2018).
[CY] Toric Calabi-Yau Database. url: http://www.rossealtman.com/
index.html
        <xref ref-type="bibr" rid="ref3">(visited on 06/18/2019)</xref>
        .
[Dst] Distributome. url: http : / / www . distributome . org/
        <xref ref-type="bibr" rid="ref3">(visited on
06/18/2019)</xref>
        .
[EET] Steve Wilson and Primoz Potocnik. A Census of edge-transitive
tetravalent graphs. url: https://jan.ucc.nau.edu/~swilson/C4FullSite/
index.html
        <xref ref-type="bibr" rid="ref3">(visited on 01/23/2019)</xref>
        .
[EG] Encyclopedia of Graphs. url: http://atlas.gregas.eu
        <xref ref-type="bibr" rid="ref3">(visited on
01/24/2019)</xref>
        .
[FL] Jukka Kohonen. Lists of nite lattices (modular, semimodular, graded
and geometric). url: https://b2share.eudat.eu/records/dbb096da4e364b5e9e37b982431f
        <xref ref-type="bibr" rid="ref3">(visited on 01/25/2019)</xref>
        .
[GF] GoFAIR. url: https : / / www . go - fair . org / fair - principles/
        <xref ref-type="bibr" rid="ref3">(visited on 06/18/2019)</xref>
        .
[GRD] Graded Ring Database. url: http://www.grdb.co.uk/
        <xref ref-type="bibr" rid="ref3">(visited on
06/18/2019)</xref>
        .
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Michael</given-names>
            <surname>Hartley</surname>
          </string-name>
          . Abstract Polytopes. url: http://www.abstractpolytopes.com/atlas/index.html (visited on 01/23/
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Wieb</given-names>
            <surname>Bosma</surname>
          </string-name>
          , John Cannon, and Catherine Playoust. \
          <article-title>The Magma algebra system. I. The user language"</article-title>
          .
          <source>In: J. Symbolic Comput</source>
          .
          <volume>24</volume>
          .3-
          <fpage>4</fpage>
          (
          <year>1997</year>
          ).
          <source>Computational algebra and number theory (London</source>
          ,
          <year>1993</year>
          ), pp.
          <volume>235</volume>
          {
          <fpage>265</fpage>
          . issn:
          <fpage>0747</fpage>
          -
          <lpage>7171</lpage>
          . doi:
          <volume>10</volume>
          .1006/jsco.
          <year>1996</year>
          .
          <volume>0125</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          info/ (visited on 01/15/
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Steve</given-names>
            <surname>Bernstein</surname>
          </string-name>
          Joseph Gelbart, ed.
          <article-title>An Introduction to the Langlands Program</article-title>
          . Birkhauser,
          <year>2003</year>
          . isbn:
          <fpage>3</fpage>
          -
          <lpage>7643</lpage>
          -3211-5.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Katja</given-names>
            <surname>Bercic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Michael</given-names>
            <surname>Kohlhase</surname>
          </string-name>
          , and Florian Rabe. \
          <article-title>(Deep) FAIR Mathematics"</article-title>
          . submitted. url: https://kwarc.info/kohlhase/ submit/it19.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [Inc] Wolfram Research, Inc. Mathematica, Version
          <volume>12</volume>
          .0.
          <string-name>
            <surname>Champaign</surname>
          </string-name>
          , IL,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [LM]
          <article-title>The L-functions and Modular Forms Database</article-title>
          . url: http://www. lmfdb.
          <source>org (visited on 02/01/</source>
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>[McKa] Brendan McKay. Combinatorial Data</surname>
          </string-name>
          . url: http://users.cecs. anu.edu.au/~bdm/data/ (visited on 01/25/
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>[McKb] Brendan McKay</surname>
          </string-name>
          .
          <article-title>Description of graph6, sparse6 and digraph6 encodings</article-title>
          . url: http://users.cecs.anu.edu.au/~bdm/data/formats. txt (visited on 06/24/
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [NF] Number Fields. url: https://hobbes.la.asu.edu/NFDB/ (visited on 06/18/
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <source>[OEIS] The On-Line Encyclopedia of Integer Sequences</source>
          . url: http://oeis. org (visited on 05/28/
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [PSV13]
          <string-name>
            <given-names>Primoz</given-names>
            <surname>Potocnik</surname>
          </string-name>
          , Pablo Spiga, and Gabriel Verret. \
          <article-title>Cubic vertextransitive graphs on up to 1280 vertices"</article-title>
          .
          <source>In: J. Symbolic Comput</source>
          .
          <volume>50</volume>
          (
          <year>2013</year>
          ), pp.
          <volume>465</volume>
          {
          <fpage>477</fpage>
          . issn:
          <fpage>0747</fpage>
          -
          <lpage>7171</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.jsc.
          <year>2012</year>
          .
          <volume>09</volume>
          .002.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [Roy]
          <string-name>
            <given-names>Gordon</given-names>
            <surname>Royle</surname>
          </string-name>
          . Combinatorial Catalogues. url: http://staffhome. ecm.uwa.edu.au/~00013890/data.html (visited on 01/25/
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>[RW05] Ronald</surname>
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Read</surname>
            and
            <given-names>Robin J.</given-names>
          </string-name>
          <string-name>
            <surname>Wilson</surname>
          </string-name>
          . An Atlas of Graphs. New York, NY, USA: Oxford University Press, Inc.,
          <year>2005</year>
          . isbn:
          <volume>0198526504</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [San+03]
          <string-name>
            <surname>M. De Santo</surname>
          </string-name>
          et al. \
          <article-title>A large database of graphs and its use for benchmarking graph isomorphism algorithms"</article-title>
          .
          <source>In: Pattern Recognition Letters 24.8</source>
          (
          <year>2003</year>
          ).
          <article-title>Graph-based Representations in Pattern Recognition</article-title>
          , pp.
          <volume>1067</volume>
          {
          <fpage>1079</fpage>
          . issn:
          <fpage>0167</fpage>
          -
          <lpage>8655</lpage>
          . doi: https://doi. org/10.1016/S0167-
          <volume>8655</volume>
          (
          <issue>02</issue>
          )
          <fpage>00253</fpage>
          -
          <lpage>2</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [SGL]
          <article-title>The Small Groups Library</article-title>
          . url: http://www.icm.tu- bs.de/ag_ algebra/software/small/small.html (visited on 04/16/
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [SS98] G. Sutcli e and
          <string-name>
            <given-names>C.</given-names>
            <surname>Suttner</surname>
          </string-name>
          . \
          <source>The TPTP Problem Library: CNF Release v1.2.1"</source>
          . In
          <source>: Journal of Automated Reasoning 21.2</source>
          (
          <issue>1998</issue>
          ), pp.
          <volume>177</volume>
          {
          <fpage>203</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <source>[SWM] Mathematical Software { swMATH</source>
          . url: http://swmath.org (visited on 09/07/
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [Wan]
          <string-name>
            <given-names>Ian</given-names>
            <surname>Wanless</surname>
          </string-name>
          .
          <article-title>Combinatorial Data</article-title>
          . url: http://users.monash.edu. au/~iwanless/data/ (visited on 01/25/
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [Wil+16]
          <string-name>
            <surname>Mark</surname>
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Wilkinson</surname>
          </string-name>
          et al. \
          <article-title>The FAIR Guiding Principles for scienti c data management and stewardship"</article-title>
          .
          <source>In: Scienti c Data</source>
          <volume>3</volume>
          (
          <year>2016</year>
          ). url: https://doi.org/10.1038/sdata.
          <year>2016</year>
          .
          <volume>18</volume>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>