<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Analysing the Evolution of Com munity-Driven (Sub-)Schemas within Wikidata</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Axel Polleres</string-name>
          <email>axel.polleres@wu.ac.at</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Ontology Evolution, Empirical Semantics, Domain-specific Communities, Wikidata</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>AGH University of Science and Technology</institution>
          ,
          <addr-line>Cracow</addr-line>
          ,
          <country country="PL">Poland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Cefriel - Politecnico di Milano</institution>
          ,
          <addr-line>Milan</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Bologna</institution>
          ,
          <addr-line>Bologna</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Vienna University of Economics and Business &amp; Complexity Science Hub Vienna</institution>
          ,
          <addr-line>Wien</addr-line>
          ,
          <country country="AT">Austria</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Vrije Universiteit</institution>
          ,
          <addr-line>Amsterdam</addr-line>
          ,
          <country country="NL">Netherlands</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Wikidata is a collaborative knowledge graph not structured according to predefined ontologies. Its schema evolves in a bottom-up approach defined by its users. In this paper, we propose a methodology to investigate how semantics develop in sub-schemas used by particular, domain-specific communities within the Wikidata knowledge graph: (i) we provide an approach to identify the domain sub-schema from a set of given classes and its related community, considered domain-specific; (ii) we propose an approach for analysing the such identified sub-schemas and communities, including their evolution over time. Finally, we suggest further possible analyses that would give better insights in (i) the communities themselves, (ii) the KG vocabulary accuracy, quality and its evolution over time according to domain areas, raising the potential of Wikidata improvement and its re-use by domain experts.</p>
      </abstract>
      <kwd-group>
        <kwd>Wikidata</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Since its initiation by the Wikimedia Foundation in 2012 [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], the Wikidata collaborative
knowledge graph has now a collection of almost 100 million items. Wikidata users, editors and
contributors, can describe and navigate through real-world concepts by querying the
knowledge graph based on its entities, properties and attributes [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In contrast with the usual approach
to knowledge graph (KG) engineering, Wikidata does not comply with a specific and predefined
ontology for the creation and editing of items. Wikidata relies on its community of volunteers
to support and expand its knowledge graph across diferent domains and expertise, as well as
provide a connection to the Linked Data Web [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The fact that the community is responsible
for the maintenance and expansion of the knowledge graph, makes a very interesting use case
in the context of collaborative ontology engineering [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        We would like to investigate the “empirical semantics” resulting from the creation, usage,
modification and adaptation of a (sub-)schema. The collaborative nature and availability of the
edit history of Wikidata provide a perfect proving ground for the study on empirical semantics
and its evolution over time. While the quality of the Wikidata ontology and its evolution over
time has already been investigated [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], a comparison considering the role of (sub-)communities
is – to the best of our knowledge - still missing. To this end, we lay the foundations to investigate
whether and how diferent domain-specific communities utilize distinct, diferent schemas, and
how such sub-schemas evolve over time, which in turn can be compared with each other as
well as with the evolution of the Wikidata ontology as a whole.
      </p>
      <p>Along these lines, we define the following concrete research questions:
1. How can domain-specific community schemas be defined and identified within the Wikidata</p>
      <p>KG?
2. Which patterns and metrics can be used to describe the empirical semantics adopted in a
community-driven schema?
3. What is the evolution of a community-driven schema over time?
4. How do diferent communities within Wikidata compare with respect to the metrics and
evolution of the schemas they use?</p>
      <p>In this position paper, we address these questions by proposing a structured approach based
on the analysis of the literature and a preliminary assessment of the existing challenges. An
overview of the approach is presented in Figure 1.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Preliminaries and Related Work</title>
      <p>
        Wikidata is a collaborative knowledge graph based on a schema defined by the relations
between the items in the graph (i.e. classes, properties and entities). To illustrate the taxonomy
of such relationships, the Wikidata graph has properties such as instance of (P31) and subclass
of (P279), allowing us to understand the structure that builds the graph and distinguishes
between which items are classes and which are entities. The Wikidata schema - made of classes
and properties, their definitions and restrictions - is collaboratively built by its users with a
bottom-up approach and does not follow a predefined ontology formalised through the Web
Ontology Language (OWL) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        The current version of the Wikidata graph is publicly accessible through the Wikidata SPARQL
query service1. Historical data about the edits made over time can be retrieved through the
dumps made available by the Wikimedia foundation2. Pellissier Tanon and Suchanek in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]
analyse the problem of querying the edit history of Wikidata, proposing the implementation
of a SPARQL endpoint to make data more accessible. A public endpoint, the History Query
Service3, is available and supports the analysis of the edit history of Wikidata until 2019. The
endpoint allows executing queries considering the global state of the graph after each revision,
the triples added and/or deleted at each revision, and the metadata about each revision (e.g., the
user performing the edit).
      </p>
      <p>
        The peculiar characteristics of the community-driven Wikidata knowledge graph are studied
considering many diferent perspectives. Piscopo and Simperl in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], reviewed the existing
literature identifying diferent quality dimensions that could be considered in the analysis of
Wikidata and its overall schema. The same authors in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] have also investigated user roles in
Wikidata, along with an analysis of the ontology engineering practices primarily followed by
these roles. The paper also proposes a definition for the identification of the Wikidata ontology
and a relevant set of metrics for analysing its quality and its evolution over time. We build upon
both these contributions in the definition of our methodology. Additionally, we propose to
consider other metrics to describe and analyze the schema of a community, such as the concept
of characteristic sets introduced by Neumann and Moerkotte [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] originally to support cardinality
estimation methods in RDF triplestore.
      </p>
      <p>Kartik et al. in [9] analysed the quality of the statements available in Wikidata as a way to
investigate current practices applied by the community. The analysis was based on removed
statements, deprecated statements and constraint violations. As opposed to our work, the
authors propose an analysis of the general quality of statements in Wikidata, while we focus on
an approach to investigate schemas considering specifically a domain- or community-specific
perspective. Indeed, as mentioned also in [10] as future work, the analysis of users and edits is
influenced by the set of topics and/or categories considered, i.e., by their domain of expertise.</p>
      <p>
        Diferent aspects are highlighted in the literature as relevant problems to be taken into
consideration in the definition of community-driven schema within the Wikidata knowledge
graph. In particular, various works have discussed the challenges of multi-linguality [11, 12]
as well as the role of bots within the Wikidata community, especially with respect to the
edits they make [
        <xref ref-type="bibr" rid="ref4">4, 13</xref>
        ]. Particularly, given that specific languages are a potential defining
attribute of specific (sub-)communities, multi-linguality is, therefore, also potentially relevant
to us. Likewise, the presence of bots could significantly skew the activity traces of real human
sub-communities of editors.
      </p>
      <p>1https://query.wikidata.org/
2https://dumps.wikimedia.org/
3https://www.wikidata.org/wiki/Wikidata:History_Query_Service</p>
    </sec>
    <sec id="sec-3">
      <title>3. Proposed approach</title>
      <p>This section describes the proposed approach for the identification and analysis of
communitydriven (sub-)schemas within the Wikidata knowledge graph. A more extensive and complete
visual representation of the approach, in addition to what is already shown in Figure 1, is
available on Zenodo4.</p>
      <sec id="sec-3-1">
        <title>3.1. The Wikidata Surface Schema</title>
        <p>
          As a first step, we define the Wikidata Surface Schema, i.e., the schema within the Wikidata KG
considered as input for the identification of community-driven (sub-)schemas. Following the
approach reported in [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], the Wikidata schema can be defined as the set of properties and the set
of items that are used as classes, i.e., those that are the object of instance of (P31), or subject/object
of subclass of (P279). The Wikidata direct claim graph contains only the truthy statements5,
defined as statements that have the best non-deprecated rank for a given property. These
statements can be identified through the Wikidata wikibase:directClaim special predicate,
or the wikibase:directClaimNormalized one if also properties from external vocabularies
should be considered. We define the Wikidata Surface Schema as the schema extracted by
applying the definition from [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] to the surface claim graph.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. (Sub-)Communities within Wikidata</title>
        <p>
          The broad definition of the Wikidata community encompasses all users contributing to the
development of the Wikidata knowledge graph. This definition can be narrowed to identify
diferent groups of users:
• based on the user role - this definition distinguishes communities depending on users’
role in the Wikidata community [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ],
• based on domains of knowledge - the community is defined as the set of users editing
entities belonging to a specific domain,
• based on languages - in this sense, the definition of community is a set of users
contributing to Wikidata in a particular language [11, 12].
        </p>
        <p>In this study, we focus on the definition of communities based on a specific domain of knowledge,
since it is a relevant aspect still not addressed by the literature.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. (Sub-)schema for a Wikidata (Sub-)Community</title>
        <p>We propose a set of definitions and a methodology to identify a certain community contributing
to a domain-specific part of the Wikidata knowledge graph, and the schema defined and adopted
by users belonging to the community.</p>
        <p>4https://doi.org/10.5281/zenodo.6961940
5https://www.mediawiki.org/wiki/Wikibase/Indexing/RDF_Dump_Format
1. Identify the domain. An initial vocabulary of Wikidata items associated with the
considered domain should be identified, e.g., a set of relevant classes selected by domain
experts. The core-community schema is a sub-schema of the Wikidata schema that
leverages the initial vocabulary by selecting all classes and properties related to the items
identified respectively through the instance of (P31), subclass of (P279) and subproperty
of (P1647) properties.
2. Identify the community. The community is identified as the set of users, excluding bots
6, that created at least K1-number classes and/or properties of the core-community schema.
A broader definition may be adopted, including also users that created at least K2-number
instances of the classes defined in the core-community schema. The value of K1 and K2
can be parameterized, the higher the parameter value, the greater the probability that the
group of filtered users has knowledge in the specific domain.
3. Identify the community-driven (sub-)schema. The community-driven schema can
be obtained from the Wikidata Surface Schema by extracting: (i) all the classes created
by users in the identified community, (ii) all the properties associated with at least one
instance of the obtained classes.</p>
        <p>
          The implementation of the approach requires the analysis of the Wikidata edit history
accessible programmatically through the described dumps and/or the dedicated endpoint described
in Section 2. The parametric nature of the approach is made necessary by the diferent roles
that users may assume in editing the Wikidata knowledge graph [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], indeed, it is necessary to
iflter out users making edits across diferent domains. The proposed approach assumes that the
usage of classes and instances in the Wikidata knowledge graph is proper, erroneous and/or
inconsistent modelling decisions adopted by users may require more advanced methods for
identifying a community and its (sub-)schema.
        </p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Analysis</title>
        <p>
          Both the identified (sub-)schema and the community should be analysed. The metrics proposed
by Piscopo and Simperl in [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] ofer a structured approach to evaluate both the ontology quality
and the role of users considering the evolution over time of the schema. The same set of
metrics can be assessed on Wikidata snapshots over time to detect patterns in its evolution.
In our approach, we propose the adoption of the same set of metrics but reducing the scope
of the schema and the community, from the whole Wikidata ontology and community to the
ones identified through the described approach. Furthermore, we describe a set of additional
analyses that can be performed to extract relevant insights in the context of community-driven
(sub-)schemas within Wikidata.
        </p>
        <p>
          1. Analyse the schema and the community. The features discussed in [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] about ontology
quality (e.g., number of classes, number of instances, chains of sub-classes relations) and
users (e.g., number of edits on diferent types of items, the proportion between the number
6Bots can be identified by a flag and by the word “bot” in the user’s name, as described by https://www.wikidata.
org/wiki/Wikidata:Bots. As not all the bots are identified, more accurate methods for their detection can be based
on other parameters, such as the users’ behaviour [14]
of edits and number of items edited) provide relevant metrics to analyse both the schema
and the community. Additional factors that we propose to investigate are related to the
analysis of the schema concerning its actual usage within the Wikidata knowledge graph.
A simpler analysis can consider the top-k used classes and properties by assessing which
elements of the extracted schema are more widely adopted by the community. A more
detailed analysis, can consider the concept of Characteristic sets [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] to detect patterns of
usage. Characteristic sets identifies the set of subjects characterized using the same set of
predicates within RDF datasets and allows for the identification of semantically similar
subjects. We argue that this approach can also be used to characterise and compare
patterns in the analysis of empirical semantics adopted by a community. In this way, it is
possible to analyse if there is consistency in the usage of the same schema structure to
describe a given class, or if there are significant variations. If a recurrent characteristic
set emerges, it can be then supposed that this empirical application of Wikidata schema
is preferred by the domain-specific community of interest.
2. Analyse the schema and the community over time. The same set of metrics defined
in the first step should be computed on diferent snapshots of the Wikidata KG to analyse
the evolution of the schema over time. Moreover, we also would like to investigate
how, considering a fixed community of users (i.e. the group of users as defined in point
2 of Section 3.3, considering the overall revisions made in the snapshots selected), the
identified (sub-)schema evolves over time. A first analysis should consider the quantitative
evolution of the schema by comparing numerical metrics on classes and properties. The
additional analysis proposed are: (i) top-k new classes and properties introduced in the
schema and their usage7, (ii) diachronic analysis of statements, e.g., considering what
were the first created properties and/or sub-classes for a specific class.
3. Analyse the schemas among communities. A comparison of the results obtained
considering diferent communities and their (sub-)schemas within the Wikidata
knowledge graph is needed to complement the approach. This type of analysis opens to a
comparison between structural ways used by diferent communities to express a certain
type of knowledge.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Discussion and Conclusions</title>
      <p>
        In this position paper, we sketched an approach to identify a community-driven schema
starting from a given (set of) domain (core classes and properties), as well as vice versa to
identify a community from a schema in a knowledge graph and perform analyses over such
community-driven schemata. We have proposed to re-use existing metrics defined by [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] on
the one hand and add new additional metrics that shall allow us to compare communities and
their schema usage and to detect the evolution of a community-driven schema over time. As a
result, we contribute to the current state-of-the-art by providing a methodology for the analysis
of the Wikidata schema considering the role of communities of users.
      </p>
      <p>7Can be extracted considering the maximum Q and P identifiers from the previous snapshot and selecting items
and properties with higher identifiers in the current snapshot</p>
      <p>
        Future works include the evaluation of the approach on a set of case studies, as well as a
more extensive definition of community-driven schemas. Such schemas, for example, could
be defined by using not only the instance of (P31) and subclass of (P279) properties, but also
the additional meta-modelling properties introduced in the Wikidata schema and analysed by
Haller et al. in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Moreover, we suggest a further thorough analysis of the domain community
by subdividing it into diferent roles/sub-communities such as (1) the group of editors and (2)
the group of contributors. This approach applying the study conducted by [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] to a
domainspecific community would potentially give interesting insights into the use of sub-vocabularies,
evolution and users dynamics of sub-communities, with the final goal to compare the diferent
communities coexisting in a knowledge graph. Furthermore, this approach would allow us to
have a better understanding of the domains represented in Wikidata. Firstly, it would be possible
to understand the accuracy of the domain vocabulary usage by comparing the initial vocabulary
considered relevant by a domain expert with the actual schema implemented. Secondly, it
allows an evaluation of the Wikidata quality according to domain areas, identifying, on one
hand, those domain schemas that need some improvements or, on the other hand, to rate the
domains that have the most excellent quality to enhance the reuse of their data by domain
experts.
RDF queries with multiple joins, in: 2011 IEEE 27th International Conference on Data
Engineering, 2011, pp. 984–994. doi:10.1109/ICDE.2011.5767868, iSSN: 2375-026X.
[9] K. Shenoy, et al., A study of the quality of Wikidata, Journal of Web Semantics 72 (2022)
100679. doi:10.1016/j.websem.2021.100679.
[10] C. Sarasua, et al., The Evolution of Power and Standard Wikidata s: Comparing
Editing Behavior over Time to Predict Lifespan and Volume of Edits, Computer Supported
Cooperative Work (CSCW) 28 (2019) 843–882. doi:10.1007/s10606-018-9344-y.
[11] G. Amaral, et al., Assessing the Quality of Sources in Wikidata Across Languages: A
Hybrid Approach, Journal of Data and Information Quality 13 (2021) 23:1–23:35. doi:10.
1145/3484828.
[12] L.-A. Kafee, Multilinguality in knowledge graphs, phd, University of Southampton, 2021.
      </p>
      <p>URL: https://eprints.soton.ac.uk/456783/.
[13] L. N. Zheng, C. M. Albano, N. M. Vora, F. Mai, J. V. Nickerson, The Roles Bots Play in
Wikipedia, Proceedings of the ACM on Human-Computer Interaction 3 (2019) 215:1–215:20.
doi:10.1145/3359317.
[14] A. Hall, L. Terveen, A. Halfaker, Bot detection in wikidata using behavioral and other
informal cues, Proc. ACM Hum. Comput. Interact. 2 (2018) 1–18.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>D.</given-names>
            <surname>Vrandečić</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Krötzsch</surname>
          </string-name>
          ,
          <article-title>Wikidata: a free collaborative knowledgebase</article-title>
          ,
          <source>Communications of the ACM</source>
          <volume>57</volume>
          (
          <year>2014</year>
          )
          <fpage>78</fpage>
          -
          <lpage>85</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>H.</given-names>
            <surname>Paulheim</surname>
          </string-name>
          ,
          <article-title>Knowledge graph refinement: A survey of approaches and evaluation methods</article-title>
          ,
          <source>Semantic web 8</source>
          (
          <year>2017</year>
          )
          <fpage>489</fpage>
          -
          <lpage>508</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Haller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Polleres</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Dobriy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferranti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. J. Rodríguez</given-names>
            <surname>Méndez</surname>
          </string-name>
          ,
          <article-title>An analysis of links in wikidata</article-title>
          ,
          <source>in: The Semantic Web</source>
          , Springer International Publishing, Cham,
          <year>2022</year>
          , pp.
          <fpage>21</fpage>
          -
          <lpage>38</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Piscopo</surname>
          </string-name>
          , E. Simperl, Who Models the World?
          <article-title>: Collaborative Ontology Creation and User Roles in Wikidata</article-title>
          ,
          <source>Proceedings of the ACM on Human-Computer Interaction</source>
          <volume>2</volume>
          (
          <year>2018</year>
          )
          <fpage>1</fpage>
          -
          <lpage>18</lpage>
          . doi:
          <volume>10</volume>
          .1145/3274410.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Haller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Polleres</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Dobriy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferranti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. J. Rodriguez</given-names>
            <surname>Mendez</surname>
          </string-name>
          ,
          <article-title>An Analysis of Links in Wikidata</article-title>
          ,
          <source>in: The Semantic Web, Lecture Notes in Computer Science</source>
          , Springer International Publishing, Cham,
          <year>2022</year>
          , pp.
          <fpage>21</fpage>
          -
          <lpage>38</lpage>
          . doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>031</fpage>
          - 06981-
          <issue>9</issue>
          _
          <fpage>2</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>T. Pellissier</given-names>
            <surname>Tanon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Suchanek</surname>
          </string-name>
          ,
          <article-title>Querying the Edit History of Wikidata, in: The Semantic Web: ESWC 2019 Satellite Events</article-title>
          , volume
          <volume>11762</volume>
          , Springer International Publishing, Cham,
          <year>2019</year>
          , pp.
          <fpage>161</fpage>
          -
          <lpage>166</lpage>
          . doi:
          <volume>10</volume>
          .1007/978- 3-
          <fpage>030</fpage>
          - 32327- 1_32, series Title: Lecture Notes in Computer Science.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Piscopo</surname>
          </string-name>
          , E. Simperl,
          <article-title>What we talk about when we talk about wikidata quality: a literature survey</article-title>
          ,
          <source>in: Proceedings of the 15th International Symposium on Open Collaboration</source>
          , OpenSym '19,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2019</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>11</lpage>
          . doi:
          <volume>10</volume>
          .1145/3306446.3340822.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>T.</given-names>
            <surname>Neumann</surname>
          </string-name>
          , G. Moerkotte,
          <article-title>Characteristic sets: Accurate cardinality estimation for</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>