<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Towards a Census of Relational Data in Mathematics</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
				<date type="published" when="2019-08-23">August 23, 2019</date>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Katja</forename><surname>Berčič</surname></persName>
						</author>
						<title level="a" type="main">Towards a Census of Relational Data in Mathematics</title>
					</analytic>
					<monogr>
						<imprint>
							<date type="published" when="2019-08-23">August 23, 2019</date>
						</imprint>
					</monogr>
					<idno type="MD5">BB55353EB8ED382DC8CA8188961E0697</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T18:28+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Research data are becoming ever more important in science as well as in the humanities. This is reflected in the various national and international initiatives that are aimed at developing and supporting data stewardship and the related area of knowledge management. It may come as a surprise to some that research data is experiencing a similar boom in mathematics. However, it could be argued that mathematics is lagging behind other disciplines in using the tools of the trade when it comes to data. This work-in-progress census aims to shed light on what a large class of mathematical datasets looks like. An increased understanding of data in mathematical research is an important step towards building better infrastructure for these data. The author would like to encourage authors and curators to contribute information about their datasets for future versions of this census.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Mathematicians have long computed, collected, and stored interesting, often hard-to-obtain facts, and used them as a reference, as a source of examples and counterexamples, and generally to better understand the structure of the objects they study. The early examples were all obtained painstakingly by hand. One such example is the computation of logarithm tables. Another example, which (at least partially) predates the systematic use of computers, is the Foster census of cubic symmetric graphs. This project was begun in 1930 and, remarkably, contained nearly all cubic symmetric graphs on up to 512 vertices by the time it was published in book form in 1988. More interesting early examples can be found in a MathOverflow thread started by Gordon Royle <ref type="bibr">[con]</ref>.</p><p>The book form was indeed the norm until the internet revolution. The Atlas of Graphs <ref type="bibr" target="#b26">[RW05]</ref> would probably have been published digitally had it been created only a few years later. The first two iterations of the well-known On-Line Encyclopedia of Integer Sequences <ref type="bibr">[OEIS]</ref> were published as books in 1973 and 1995. The circumstances that gave rise to the increasing importance of research data in general have had a corresponding effect on research data in mathematics. With access to computing power, the complexity and size of the datasets grew significantly. Mathematical datasets also found new uses, such as algorithm benchmarking (an example here is the ARG database for benchmarking of graph isomorphism algorithms <ref type="bibr" target="#b27">[San+03]</ref>). 
This growth sparked a need for tools to manage the data.</p><p>In the scientific community, we have seen the formulation of the FAIR principles <ref type="bibr" target="#b32">[Wil+16]</ref>, which break down the vague concept of usefulness into properties that form a basis for guidelines. The data should be findable, accessible, interoperable, and reusable. As stressed by the authors of the principles, these still need to be adapted to the needs of specific scientific communities. In a work that predates the FAIR principles (and even an earlier paper discussing similar ideas about accessibility), <ref type="bibr">Billey and Tenner [BT13]</ref> outlined a set of desirable properties for a certain class of mathematical databases they call fingerprint databases for theorems. Their properties overlap somewhat with the FAIR principles; in particular, they require the databases (and their contents) to be citable via unique identifiers. With Vidali, the author outlined some further recommendations for mathematical databases as part of the work on the DiscreteZOO project <ref type="bibr">[BV]</ref>. Data in mathematics is intrinsically linked with knowledge and, as such, managing it falls into the intersection of data stewardship and mathematical knowledge management.</p><p>Data in mathematics and the scope of this census Similar to data in general, mathematical data appear in several forms. In ongoing joint work with Michael Kohlhase and Florian Rabe <ref type="bibr">[BKR]</ref>, we propose a division of data in mathematics into four categories (Figure <ref type="figure" target="#fig_0">1</ref>). Symbolic data (knowledge) are typically found in libraries such as the TPTP Problem Library for Automated Theorem Proving <ref type="bibr" target="#b29">[SS98]</ref>. In some sense, these libraries are similar to corpora in linguistics. Linked data relates to data in library science and ontologies in information science. 
A good example of narrative data in mathematics is the repository of electronic preprints arXiv. Finally, relational data are perhaps closest to what most people think of as data and are what this census focuses on. Examples include lists of mathematical objects that could be (or are) organised into a table, such as censuses of graphs, lists of integer sequences, etc. This classification is not strict: for example, the OEIS fits most neatly in the relational data type, but it includes references to theorems and formulas for closed forms of the generating functions. The census mostly focuses on the datasets themselves; however, we will also briefly mention some of the systems that have been built for them.</p><p>Limitations It turns out that finding out which datasets exist and what they look like is challenging. With the exception of <ref type="bibr">Billey and Tenner [BT13]</ref>, there is no literature about relational math datasets in general. Dataset authors often describe a dataset in a paper. Such papers get lost in a multitude of irrelevant results when searching for keywords such as "database" in databases of mathematical literature like arXiv, MathSciNet and zbMATH. One can find some information on the swMATH information service for mathematical software <ref type="bibr">[SWM]</ref> by browsing the types Data Collections and Services, Webservices, and the special collection Math.Databases -MathDBS<ref type="foot" target="#foot_0">2</ref>. Another source is computer algebra systems, which integrate some datasets (for example as packages). Unfortunately, most datasets live only on their authors' websites and are not indexed anywhere.</p><p>As early work, the census is strongly influenced by the author's area of mathematics. The contents are skewed towards datasets that the author knew about from the start, those discovered through word of mouth, and those datasets mentioned on MathOverflow and swMATH. 
It is neither exhaustive nor final, in terms of either the aspects examined or the examples given. The difficulty in obtaining information is reflected in the uneven coverage of areas of mathematics and in the uneven level of detail about specific datasets. In particular, we have not yet collected much information about datasets that are only available through computer algebra systems. Similarly, we have focused on datasets that do not appear only in commercial systems (such as Magma <ref type="bibr" target="#b1">[BCP97]</ref> and Mathematica <ref type="bibr">[Inc]</ref>).</p><p>Goals More work is needed before a more structured review can be attempted. The work reported here is a necessary first step, as it outlines the use of relational data in mathematics. Within this larger picture, it aims to set up a foundation for creating community guidelines for FAIR mathematics, and to serve as a reference for anyone who needs to know what data in mathematics look like, such as creators of data frameworks for mathematics. Finally, this census aims to increase the visibility of data in mathematics, and to contribute towards better recognition of the work that goes into constructing and collecting the datasets.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Description of Datasets and Systems</head><p>This section aims to illustrate the diversity of the datasets via several aspects. Throughout, we will use the word "dataset" quite loosely to encompass both simple datasets, such as those containing a collection of objects or records with the same structure, as well as organised collections of simple datasets.</p><p>Relational data in mathematics are best characterised by the use of representation theorems that allow encoding mathematical objects as simple data structures built from numbers, strings, lists and records. Such representations can be quite far from the objects' semantic type. For example, polynomials with integer coefficients can be encoded as lists of integers. Graphs can be represented as adjacency or incidence matrices, or as adjacency or edge lists. These can in turn be represented as arrays or strings (such as graph6 <ref type="bibr">[McKb]</ref>) at the database level. Furthermore, testing for graph isomorphism (not an uncommon task in a database) is a hard problem in general, and results such as canonical forms <ref type="bibr" target="#b6">[BL83]</ref> can be used in the encoding to help get around that difficulty. Most relational data appear as collections of concrete mathematical objects.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Aspects</head><p>Structure Some datasets consist of simple lists of objects (such as Kohonen's giant list of unlabeled lattices <ref type="bibr">[FL]</ref>), while others are lists of records, each consisting of an object together with some of its mathematical invariants. Larger projects can end up organising several simple datasets into a larger one, also storing the interconnections. Arguably the largest such project is the L-functions and Modular Forms Database (LMFDB <ref type="bibr">[LM]</ref>), which combines more than 30 datasets that have arisen in the context of the Langlands program <ref type="bibr" target="#b3">[Ber03]</ref>, which explores connections between number theory and geometry.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Content organisation</head><p>The contents of datasets are obtained either by (systematically) generating all objects that satisfy a given set of parameters, or by collecting objects in some other way.</p><p>Examples of the former are some of the collections of highly symmetric objects (such as graphs). Symmetric objects are quite rare compared to ones without symmetry, and obtaining a complete list of all objects up to a certain size can take months.</p><p>On the other hand, there are unsystematic collections of (rare or interesting) objects, in contrast to systematic collections of all objects of a specific kind. The On-Line Encyclopedia of Integer Sequences (OEIS <ref type="bibr">[OEIS]</ref>) is an example of an unsystematic collection; it collects sequences of integers (such as the Fibonacci sequence 0, 1, 1, 2, 3, 5, 8, 13, etc.). Similarly, the stated goal of The House of Graphs <ref type="bibr" target="#b8">[Bri+13]</ref> is "to find a workable definition of 'interesting' and provide a searchable database of graphs that conform to this definition". Both of these provide a lot of information about every object, including references to research papers.</p><p>The generated collections typically have a small number of authors, while the unsystematic collections tend to become a collaborative effort.</p><p>Authorship The authorship varies widely.</p><p>- The majority of datasets have a single author or group of authors. These datasets are often accompanied by a paper (or a small number of papers) describing the mathematical background, generation, and contents. - Some datasets have a large number of contributors; these are typically the unsystematic ones, for which a core group of authors contributed a substantial part of the data, together with a large number of authors with smaller contributions. 
Examples include the OEIS (with thousands of contributors), the LMFDB (with a hundred contributors), the House of Graphs, and FindStat <ref type="bibr" target="#b9">[BSa14]</ref> (with 69 contributors). - A somewhat special case is that of the combinatorial catalogues, which can consist of tens of lists of (combinatorial) objects. Some examples of these are the catalogues produced by McKay <ref type="bibr">[McKa]</ref>, Royle <ref type="bibr">[Roy]</ref>, and Wanless <ref type="bibr">[Wan]</ref>.</p><p>Provenance The provenance of a dataset usually corresponds to its structure and authorship. The datasets with a small number of authors are usually produced via a small number of methods. We have not yet explored the provenance of the larger datasets, especially the unsystematic ones. It is important to note that datasets can be built on top of other datasets. An example of this is the Census of cubic vertex-transitive graphs <ref type="bibr" target="#b24">[PSV13]</ref>. The authors split the graphs into a few cases, each of which required a specialised method. One of these cases was a dataset already generated in previous work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Infrastructure and Shareability</head><p>The datasets are usually accessible either through a website or indirectly through a computer algebra system (CAS). Exceptions are mostly older works, such as the collection of graphs described in the book Atlas of Graphs <ref type="bibr" target="#b26">[RW05]</ref>. Many projects with a website also provide an encyclopedic page for every object, and many researchers have commented that this is an important feature.</p><p>The larger projects in particular develop some infrastructure for the data, possibly seeding it with the initial contents. The infrastructure then supports contributions of objects (as in the OEIS, the House of Graphs and FindStat), or of lists of objects. The latter, hosting lists of graphs, is the other stated goal of the House of Graphs. Similarly, the Encyclopedia of Graphs <ref type="bibr">[EG]</ref> is a rare example of an online resource developed to help researchers find and use data, without actually producing any of its own datasets. It currently hosts about 30 datasets. Another example is the more recent project DiscreteZOO <ref type="bibr">[BV]</ref>, which initially aimed to support the community studying symmetric objects.</p><p>An important example of a dataset that is only available via a CAS is the Small Groups Library <ref type="bibr">[SGL]</ref> (in GAP, Magma). It relies on the system to compute a significant part of the information on-the-fly and thus only uses a little under a bit per group. In a little under 80 MB the library stores enough information to find which of the over 400 million groups a group given by the user is isomorphic to.</p><p>A more typical situation is where a dataset is hosted with minimal infrastructure on one of the authors' websites. The website is often browsable, with the browsing interface consisting of HTML tables. An illustrative example here is the Census of edge transitive graphs <ref type="bibr">[EET]</ref>. 
Wilson has the core information about the census stored in a CAS, in which he has also written code that produces the HTML for the website. Another such example is Michael Hartley's atlas of abstract polytopes <ref type="bibr">[AP]</ref>. Authors also often provide files with code for the collection as an array in some computer algebra system.</p><p>Metrics There is no standard measure for the size of a dataset. It is possible to consider the compressed or uncompressed size on disk, the number or size of objects, or the time (itself a problematic metric) it took to generate the dataset, etc. The uncompressed size on disk can range from a few megabytes to over a terabyte, or up to roughly 25 GB with heavy compression. There is a small inverse correlation between the size (on disk or the number of objects) and the computational complexity of the process of generation. Kohonen's lattice dataset appears to be a record holder with respect to size, with a few billion lattices.</p><p>The number of users can be estimated through the number of citations, or the number of citations of the corresponding paper for some projects. The number of downloads would be interesting, but it appears that almost no one records it.</p><p>FAIR-related aspects For details about the FAIR principles, we refer the reader to the GoFAIR website <ref type="bibr">[GF]</ref>.</p><p>Metadata. Some details about a dataset are typically available on the same website as the dataset, and most datasets have an accompanying paper. Metadata are generally not structured and, with a few possible exceptions, do not specify a license.</p><p>Unique IDs and Findable. Many datasets have some sort of unique ID for all the objects. Some of the projects also provide some sort of globally unique ID (in the sense of a URL), but its persistence is bounded by the project's lifespan, as the URLs will expire if the website is decommissioned.</p><p>Accessible. 
While most datasets are in some way available online, the formats are typically ad hoc. Almost all of the others are available through computer algebra systems.</p><p>Area of mathematics At least partly due to the author's home area, the information collected so far has mostly been skewed towards combinatorics and geometry. In addition to these, we have found datasets from number theory (LMFDB, NFDB [NF]), group theory and algebra (the Small Groups Library <ref type="bibr">[SGL]</ref>, the Graded Ring Database <ref type="bibr">[GRD]</ref>), topology (the Knot Atlas <ref type="bibr">[BNMa]</ref>), algebraic geometry (the Toric Calabi-Yau Database <ref type="bibr">[CY]</ref>), and probability (Distributome <ref type="bibr">[Dst]</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Living census</head><p>To facilitate the collection of information about data in mathematics, we set up a database with a website frontend <ref type="bibr">[Ber]</ref>. While it grew out of the necessity to keep track of the information, it has at least two further goals. First, it aims to make it easy for anyone to see what information has been collected so far. Second, it aims to eventually make it easy to contribute information.</p><p>The information about the datasets can be displayed in a few different views (with switching implemented through tabs): general information, information about size, information pertaining to the FAIR principles, as well as some other properties.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Fig. 2. The living census website</head><p>The FAIR principles in particular are a little unwieldy to take in at a glance, which is why we devised simple diagrams (Figure <ref type="figure" target="#fig_1">3</ref>) to aid with that. The design of the diagrams is based on the fact that each of the four principles (findable, accessible, interoperable, and reusable) is composed of 3-4 sub-principles: Findable (F1, F2, F3, and F4), Accessible (A1, A1.1, A1.2, and A2), Interoperable (I1, I2, and I3), and Reusable (R1, R1.1, R1.2, and R1.3) <ref type="bibr" target="#b32">[Wil+16]</ref>. Each of these can be applied (or not) to one of the three layers of information about the dataset. These layers are not necessarily included in the original FAIR principles, but it seems to be helpful to break information down depending on whether it applies to the dataset (D) itself (such as whether the dataset has its own globally unique identifier or whether it is registered in a searchable resource), the datum (A) (each of the objects needs its own globally unique identifier), or the metadata (M) (such as whether the metadata is accessible even after the data are no longer available).</p><p>The colour of each cell in the diagram corresponds to a value for a subprinciple-layer pair: unknown (black), not considered (blank), mostly supported (green), somewhat supported (yellow) and mostly unsupported (red). For example, let us consider F1 for FindStat <ref type="bibr" target="#b9">[BSa14]</ref>, the Combinatorial Statistic Finder. The dataset (but not the metadata) is indexed in zbMATH (Findable D, M). Each combinatorial statistic in the dataset has a unique identifier, such as St000081<ref type="foot" target="#foot_1">3</ref>, and can be found through a search interface (Findable A).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusions and Future Work</head><p>In some areas of mathematics, research products can consist of listings or tabulations of complex mathematical objects and their properties. These datasets can later be used by researchers to form or refute conjectures. The main result of the work reported here is the beginning of a census of existing mathematical datasets of this kind, together with a classification of the entries along multiple dimensions. As such, this work contributes to the field of knowledge management in areas of mathematics where research products can consist of such datasets, such as experimental and computational mathematics.</p><p>Currently, the census contains about 70 datasets from several areas of mathematics. This includes links to dataset websites and author information for (nearly) all of the datasets, as well as literature references, area of mathematics and size-related information for many. Even this small sample shows large variations in terms of structure, content organisation, provenance, infrastructure and shareability, and size.</p><p>Perhaps the most important immediate use for this census is as a "market study" for a prototype unified infrastructure for mathematical data, MathDataHub <ref type="bibr" target="#b5">[BKR19]</ref>. It serves as a source of use cases for the infrastructure, as well as the beginnings of a community of researchers who work with mathematical data. Even in this initial stage, the census gives the developers of MathDataHub some idea of the requirements for the system in terms of the ranges of dataset size, complexity, etc.</p><p>We will continue to gather information about the relational datasets in mathematics on the living census website. One way to find more datasets would be to (in a way that is not yet clear) search for literature in all areas of mathematics (but not computer science) with keywords "database", "atlas", "census" and similar. 
Such a search currently does not appear to be supported by any of the major databases of mathematical literature. Another large set of datasets that has yet to be added to the census are datasets incorporated into the various computer algebra systems.</p><p>Finally, we plan to use the new information as a basis for a more structured census.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Kinds of mathematical knowledge and data</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 3 .</head><label>3</label><figDesc>Fig. 3. An example of a diagram for Findable.</figDesc><graphic coords="8,273.18,380.16,69.00,63.60" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0"><head></head><label></label><figDesc></figDesc><graphic coords="7,134.77,342.91,345.83,217.18" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Examples of unique IDs</figDesc><table><row><cell>ET [EET]</cell><cell>C4[10, 1]</cell></row><row><cell>FindStat</cell><cell>http://www.findstat.org/StatisticsDatabase/St000001/</cell></row><row><cell>OEIS</cell><cell>https://oeis.org/A000045</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_0">https://swmath.org/browse/types</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_1">http://www.findstat.org/StatisticsDatabase/St000081/</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Acknowledgements The author gratefully acknowledges Tom Wiesing's help in setting up the Django-based living census website. The author would also particularly like to thank Michael Kohlhase for suggesting the need for a census of this type, as well as for regular constructive discussions. Finally, the author is grateful to the many dataset authors who responded to questions about their datasets and use of data. The work presented here was supported by the EU grant Horizon 2020 ERI 676541 OpenDreamKit.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">Michael</forename><surname>Hartley</surname></persName>
		</author>
		<ptr target="http://www.abstract-polytopes.com/atlas/index.html" />
		<title level="m">Abstract Polytopes</title>
				<imprint>
			<date type="published" when="2019-01-23">01/23/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">The Magma algebra system. I. The user language</title>
		<author>
			<persName><forename type="first">Wieb</forename><surname>Bosma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">John</forename><surname>Cannon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Catherine</forename><surname>Playoust</surname></persName>
		</author>
		<idno type="DOI">10.1006/jsco.1996.0125</idno>
	</analytic>
	<monogr>
		<title level="m">Computational algebra and number theory</title>
				<meeting><address><addrLine>London</addrLine></address></meeting>
		<imprint>
			<date type="published" when="1997">1997</date>
			<biblScope unit="volume">24</biblScope>
			<biblScope unit="page" from="235" to="265" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<author>
			<persName><forename type="first">Katja</forename><surname>Berčič</surname></persName>
		</author>
		<ptr target="https://mathdb.mathhub.info/" />
		<title level="m">Math Databases table</title>
				<imprint>
			<date type="published" when="2019">01/15/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">An Introduction to the Langlands Program</title>
		<editor>Joseph Bernstein, Steve Gelbart</editor>
		<imprint>
			<date type="published" when="2003">2003</date>
			<publisher>Birkhäuser</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<author>
			<persName><forename type="first">Katja</forename><surname>Berčič</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><surname>Kohlhase</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Florian</forename><surname>Rabe</surname></persName>
		</author>
		<ptr target="https://kwarc.info/kohlhase/submit/it19.pdf" />
		<title level="m">(Deep) FAIR Mathematics</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Towards a Unified Mathematical Data Infrastructure: Database and Interface Generation</title>
		<author>
			<persName><forename type="first">Katja</forename><surname>Berčič</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><surname>Kohlhase</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Florian</forename><surname>Rabe</surname></persName>
		</author>
		<ptr target="https://kwarc.info/kohlhase/papers/cicm19-MDH.pdf" />
	</analytic>
	<monogr>
		<title level="m">Intelligent Computer Mathematics (CICM)</title>
				<editor>
			<persName><forename type="first">Cezary</forename><surname>Kaliszyk</surname></persName>
		</editor>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="28" to="43" />
		</imprint>
	</monogr>
	<note>in preparation</note>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Canonical Labeling of Graphs</title>
		<author>
			<persName><forename type="first">László</forename><surname>Babai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Eugene</forename><forename type="middle">M</forename><surname>Luks</surname></persName>
		</author>
		<idno type="DOI">10.1145/800061.808746</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Fifteenth Annual ACM Symposium on Theory of Computing. STOC &apos;83</title>
				<meeting>the Fifteenth Annual ACM Symposium on Theory of Computing. STOC &apos;83<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="1983">1983</date>
			<biblScope unit="page" from="171" to="183" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">The Knot Atlas</title>
		<author>
			<persName><forename type="first">Dror</forename><surname>Bar-Natan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Scott</forename><surname>Morrison</surname></persName>
		</author>
		<ptr target="http://katlas.org" />
		<imprint>
			<date type="published" when="2019">06/18/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">House of Graphs: a database of interesting graphs</title>
		<author>
			<persName><forename type="first">Gunnar</forename><surname>Brinkmann</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.dam.2012.07.018</idno>
	</analytic>
	<monogr>
		<title level="j">Discrete Appl. Math</title>
		<idno type="ISSN">0166-218X</idno>
		<imprint>
			<biblScope unit="volume">161</biblScope>
			<biblScope unit="page" from="311" to="314" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<author>
			<persName><forename type="first">C</forename><surname>Berg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Stump</surname></persName>
		</author>
		<ptr target="http://www.FindStat.org" />
		<title level="m">FindStat: The Combinatorial Statistic Finder</title>
				<imprint>
			<date type="published" when="2014">31 August 2016. 2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Fingerprint databases for theorems</title>
		<author>
			<persName><forename type="first">Sara</forename><forename type="middle">C</forename><surname>Billey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bridget</forename><forename type="middle">E</forename><surname>Tenner</surname></persName>
		</author>
		<idno type="DOI">10.1090/noti1029</idno>
	</analytic>
	<monogr>
		<title level="j">Notices Amer. Math. Soc</title>
		<idno type="ISSN">0002-9920</idno>
		<imprint>
			<biblScope unit="volume">60</biblScope>
			<biblScope unit="issue">8</biblScope>
			<biblScope unit="page" from="1034" to="1039" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">DiscreteZOO: a Fingerprint Database of Discrete Objects</title>
		<author>
			<persName><forename type="first">Katja</forename><surname>Berčič</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Janoš</forename><surname>Vidali</surname></persName>
		</author>
		<ptr target="https://arxiv.org/pdf/1812.05921.pdf" />
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<ptr target="https://mathoverflow.net/questions/47044/what-are-some-early-examples-of-creation-of-lists-catalogues-of-particularly" />
		<title level="m">What are some early examples of creation of lists / catalogues of (particularly) combinatorial objects? MathOverflow</title>
				<imprint>
			<date type="published" when="2018-11-20">11/20/2018</date>
		</imprint>
	</monogr>
	<note>MathOverflow contributors</note>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<ptr target="http://www.rossealtman.com/index.html" />
		<title level="m">Toric Calabi-Yau Database</title>
				<imprint>
			<date type="published" when="2019-06-18">06/18/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<ptr target="http://www.distributome.org/" />
		<title level="m">Distributome</title>
				<imprint>
			<date type="published" when="2019-06-18">06/18/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<author>
			<persName><forename type="first">Steve</forename><surname>Wilson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Primož</forename><surname>Potočnik</surname></persName>
		</author>
		<ptr target="https://jan.ucc.nau.edu/~swilson/C4FullSite/index.html" />
		<title level="m">A Census of edge-transitive tetravalent graphs</title>
				<imprint>
			<date type="published" when="2019-01-23">01/23/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<ptr target="http://atlas.gregas.eu" />
		<title level="m">Encyclopedia of Graphs</title>
				<imprint>
			<date type="published" when="2019-01-24">01/24/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<author>
			<persName><forename type="first">Jukka</forename><surname>Kohonen</surname></persName>
		</author>
		<ptr target="https://b2share.eudat.eu/records/dbb096da4e364b5e9e37b982431f4" />
		<title level="m">Lists of finite lattices (modular, semimodular, graded and geometric)</title>
				<imprint>
			<date type="published" when="2019-01-25">01/25/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<title/>
		<author>
			<persName><surname>GO FAIR</surname></persName>
		</author>
		<ptr target="https://www.go-fair.org/fair-principles/" />
		<imprint>
			<date type="published" when="2019-06-18">06/18/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<ptr target="http://www.grdb.co.uk/" />
		<title level="m">Graded Ring Database</title>
				<imprint>
			<date type="published" when="2019-06-18">06/18/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<ptr target="http://www.lmfdb.org" />
		<title level="m">The L-functions and Modular Forms Database</title>
				<meeting><address><addrLine>Champaign, IL</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2016-02-01">02/01/2016</date>
		</imprint>
		<respStmt>
			<orgName>Wolfram Research, Inc.</orgName>
		</respStmt>
	</monogr>
	<note>Mathematica, Version 12.0, 2019</note>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<author>
			<persName><forename type="first">Brendan</forename><surname>McKay</surname></persName>
		</author>
		<ptr target="http://users.cecs.anu.edu.au/~bdm/data/" />
		<title level="m">Combinatorial Data</title>
				<imprint>
			<date type="published" when="2019-01-25">01/25/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<monogr>
		<title level="m" type="main">Description of graph6, sparse6 and digraph6 encodings</title>
		<author>
			<persName><forename type="first">Brendan</forename><surname>McKay</surname></persName>
		</author>
		<ptr target="http://users.cecs.anu.edu.au/~bdm/data/formats.txt" />
		<imprint>
			<date type="published" when="2019-06-24">06/24/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<ptr target="http://oeis.org" />
		<title level="m">The On-Line Encyclopedia of Integer Sequences</title>
				<imprint>
			<date type="published" when="2017-05-28">05/28/2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Cubic vertex-transitive graphs on up to 1280 vertices</title>
		<author>
			<persName><forename type="first">Primož</forename><surname>Potočnik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Pablo</forename><surname>Spiga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gabriel</forename><surname>Verret</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.jsc.2012.09.002</idno>
	</analytic>
	<monogr>
		<title level="j">J. Symbolic Comput</title>
		<idno type="ISSN">0747-7171</idno>
		<imprint>
			<biblScope unit="volume">50</biblScope>
			<biblScope unit="page" from="465" to="477" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<monogr>
		<author>
			<persName><forename type="first">Gordon</forename><surname>Royle</surname></persName>
		</author>
		<ptr target="http://staffhome.ecm.uwa.edu.au/~00013890/data.html" />
		<title level="m">Combinatorial Catalogues</title>
				<imprint>
			<date type="published" when="2019-01-25">01/25/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<monogr>
		<title level="m" type="main">An Atlas of Graphs</title>
		<author>
			<persName><forename type="first">Ronald</forename><forename type="middle">C</forename><surname>Read</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Robin</forename><forename type="middle">J</forename><surname>Wilson</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2005">2005</date>
			<publisher>Oxford University Press, Inc</publisher>
			<pubPlace>New York, NY, USA</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">A large database of graphs and its use for benchmarking graph isomorphism algorithms</title>
		<author>
			<persName><forename type="first">M</forename><surname>De Santo</surname></persName>
		</author>
		<idno type="DOI">10.1016/S0167-8655(02)00253-2</idno>
		<ptr target="https://doi.org/10.1016/S0167-8655(02)00253-2" />
	</analytic>
	<monogr>
		<title level="m">Graph-based Representations in Pattern Recognition</title>
				<imprint>
			<date type="published" when="2003">2003</date>
			<biblScope unit="page" from="1067" to="1079" />
		</imprint>
	</monogr>
	<note>Pattern Recognition Letters 24</note>
</biblStruct>

<biblStruct xml:id="b28">
	<monogr>
		<ptr target="http://www.icm.tu-bs.de/ag_algebra/software/small/small.html" />
		<title level="m">The Small Groups Library</title>
				<imprint>
			<date type="published" when="2019-04-16">04/16/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">The TPTP Problem Library: CNF Release v1.2.1</title>
		<author>
			<persName><forename type="first">G</forename><surname>Sutcliffe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Suttner</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Automated Reasoning</title>
		<imprint>
			<biblScope unit="volume">21</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="177" to="203" />
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<monogr>
		<ptr target="http://swmath.org" />
		<title level="m">Mathematical Software - swMATH</title>
				<imprint>
			<date type="published" when="2017-09-07">09/07/2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<monogr>
		<author>
			<persName><forename type="first">Ian</forename><surname>Wanless</surname></persName>
		</author>
		<ptr target="http://users.monash.edu.au/~iwanless/data/" />
		<title level="m">Combinatorial Data</title>
				<imprint>
			<date type="published" when="2019-01-25">01/25/2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">The FAIR Guiding Principles for scientific data management and stewardship</title>
		<author>
			<persName><forename type="first">Mark</forename><forename type="middle">D</forename><surname>Wilkinson</surname></persName>
		</author>
		<idno type="DOI">10.1038/sdata.2016.18</idno>
		<ptr target="https://doi.org/10.1038/sdata.2016.18" />
	</analytic>
	<monogr>
		<title level="j">Scientific Data</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
