<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Multilingual Labels for FoodOn</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Katherine Thornton</string-name>
          <email>katherine.thornton@yale.edu</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kenneth Seals-Nutt</string-name>
          <email>kenneth@seals-nutt.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mika Matsuzaki</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Food Composition, Nutri-informatics, Wikibase, Wikidata</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Johns Hopkins Bloomberg School of Public Health</institution>
          ,
          <addr-line>615 N Wolfe St, Baltimore, MD 21205</addr-line>
          ,
          <country country="US">United States</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>WikiFCD Collaborative</institution>
          ,
          <addr-line>New York, New York</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>WikiFCD Collaborative</institution>
          ,
          <addr-line>Olympia, WA</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Data sets involving food frequently are of interest across multiple cultural contexts. Many ontologies and vocabularies related to food are monolingual. Enriching ontologies and vocabularies with multilingual data is useful for extending access to a resource to wider audiences. The WikiFCD knowledge base of FAIR food composition data includes mappings to Wikidata and mappings to FoodOn, a widely-used ontology for food. In this paper we describe a sample of six food items from WikiFCD as the basis for an exploration of strategies for sourcing multilingual labels from projects of the Wikimedia Foundation. We present five subgraphs of data related to food items sourced from Wikidata, Wikipedias and WikiFCD. We describe the advantages and disadvantages of sourcing labels from each subgraph. Each subgraph can be quickly retrieved from the Wikidata Query Service SPARQL endpoint or the SPARQL endpoint of WikiFCD. These strategies could be adapted to other domains seeking to enrich their data with multilingual labels.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
ceur-ws.org</p>
    </sec>
    <sec id="sec-2">
      <title>Introduction</title>
      <p>
        WikiFCD is a knowledge base of food composition data. After importing more than three
hundred thousand food items and their associated food composition measurements from the
United States Department of Agriculture’s Food Data Central (FDC), we began to add data from
additional countries. We have added more than three thousand food items to the knowledge
base along with measurements of nutritional components of those foods from published sources
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The purpose of this knowledge base is to provide web access to published data about
food composition related to foods that are not found in FDC. For example, researchers have
documented gaps in coverage of plant foods and foods from international cuisines in FDC [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ].
WikiFCD contains food composition information for many plant foods that are not included
in FDC. Plant foods contain phytonutrients that support human health [
        <xref ref-type="bibr" rid="ref4 ref5 ref6">4, 5, 6</xref>
        ]. The nutrients
found in these minimally processed plant foods are within the range effectively utilized by
human metabolism, meaning we can digest and make use of these nutrients more easily [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
Providing the food composition data for these foods allows people who track their food intake
to get a more accurate estimate of their nutritional intake.
      </p>
      <p>
        We created this knowledge base using Wikibase, and it is now part of the ecosystem of
Wikibases related to Wikidata. The ecosystem of Wikibases includes all instances of Wikibase
that can be federated with the Wikidata Query Service SPARQL endpoint [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Wikibase is
software infrastructure for creating knowledge bases [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. An advantage to creating a knowledge
base is that it is possible to connect the data in the knowledge base with other data on the web.
We created mappings to Wikidata for some classes of items and for some properties in WikiFCD.
These mappings allow us to combine our data with subsets of data from Wikidata, increasing
the complexity and types of questions we can ask of our data. We have integrated the FoodOn
ontology into WikiFCD so that other projects that make use of FoodOn can also make use of
data from WikiFCD [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>
        The FoodOn mappings in WikiFCD allow us to connect these food items to their corresponding
food items in Wikidata. This connection, stored in WikiFCD, opens up several pathways to
source additional information about these food items. We can think of each of the potential
graphs of food-related data in Wikidata as Wikidata subsets [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. In this paper we describe
five subsets of data from WikiFCD and Wikidata that can be used to source labels for these
food items in languages other than English. Multilingual labels can then be used for search, in
application development, or for other purposes. For projects that make use of FoodOn, this data
set of food items could be an option for sourcing multilingual labels to expand the audiences
for other projects.
      </p>
      <p>
        Data in Wikidata and WikiFCD are FAIR data. The FORCE 11 community published the
FAIR data principles in 2014 to promote data publishing practices that would support open
science and open access [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. FAIR is an acronym for Findable, Accessible, Interoperable and
Reusable, four qualities of published data that promote data sharing. By populating WikiFCD
with data from published sources, we made food composition data findable and accessible on
the web via the WikiFCD SPARQL endpoint<sup>1</sup>. Every food item in Wikidata and in WikiFCD
has a QID, which resolves to a Uniform Resource Identifier (URI). This means that every piece of data in
our dataset has a machine-actionable URI. The mapping statements we added to our food items
that connect them to Wikidata serve as our bridges with the web of linked open data, making
our data interoperable with many additional datasets. Data in Wikidata and in WikiFCD are
available under the terms of the Creative Commons Zero license<sup>2</sup>. Our selection of CC0 as the
license means that anyone can freely reuse the data. These aspects of publishing data in the
Wikidata and WikiFCD knowledge bases fulfill the most complete degree of FAIRness, level F,
“FAIR data, Open Access, Functionally Linked”, as described in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. We welcome others who
are interested in reusing data from WikiFCD.
      </p>
      <sec id="sec-2-1">
        <p><sup>1</sup> https://wikifcd.wikibase.cloud/query/</p>
        <p><sup>2</sup> https://creativecommons.org/share-your-work/public-domain/cc0/</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>1. The WikiFCD Knowledge Base</title>
      <p>We implemented the WikiFCD knowledge base using Wikibase, an extension of the MediaWiki
software that was created for the Wikidata knowledge base<sup>3</sup>. We include several subsets of
data from Wikidata in WikiFCD in order to establish useful mappings between the WikiFCD
system and Wikidata itself. These mappings allow us to ask questions of the WikiFCD dataset
in combination with data from Wikidata, increasing the breadth and complexity of the queries.</p>
      <p>Each item in WikiFCD has a unique identifier, or QID. The QID for this food item is Q135853.
Most of the statements on the food items in WikiFCD provide information about the amount of
a nutrient in a 100 gram sample of the food item. Each statement also has room for a reference.
References record the source for the statement. In WikiFCD, most references are for a food
composition table.</p>
      <p>
        Researchers have already created knowledge graphs related to food. For example, FoodKG
is a knowledge graph of food data [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. WikiFCD centers food composition data for plant
foods, specifically unprocessed or minimally processed plant foods. One difference between
WikiFCD and FoodKG is that FoodKG does not include mappings to Wikidata. When FoodKG
was created there were not many food items in Wikidata. Another difference between FoodKG
and WikiFCD is that WikiFCD contains multilingual data.
      </p>
      <p>
        Open Food Facts is a database of food composition data that is open to community data
contribution and curation [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Open Food Facts makes use of the UPC codes on food
packaging. While WikiFCD does contain data related to some packaged foods, we emphasize fruits,
vegetables, grains and other foods that are often sold without packaging.
      </p>
    </sec>
    <sec id="sec-4">
      <title>2. Identifying Food Items in Wikidata</title>
      <p>We tried several candidate SPARQL queries for extracting a subset of food items from Wikidata.
Our aim was to find a subset of food items to match with FoodOn identifiers. We first wrote a
SPARQL query for food items in Wikidata with a corresponding article in English Wikipedia,
reasoning that the existence of an article might indicate the popularity of the food. Although
this produced a useful set of food items, the results also included food brands, which we would
have had to remove. We therefore rejected the article query results as unsuitable and tried
another approach. We wrote a SPARQL query for food items in Wikidata with a statement using the
property “USDA NDB number”, which is an identifier for a food item in the United States
Department of Agriculture National Nutrient Database. Because this identifier is only found on
food items in the Wikidata knowledge base, and not on food brands, we determined that this
would be a more suitable subset for matching with food items from FoodOn. The query for
Wikidata food items with a USDA NDB identifier returns one thousand three hundred and
ninety-five results. Because some of these food items have multiple USDA NDB identifiers, the
subgraph contains five hundred thirteen unique food items. This seemed like a suitable corpus
of food items for which we could identify FoodOn term matches.</p>
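      <p>The difference between the number of query results and the number of unique food items can be illustrated with a minimal sketch. It assumes that P1978 is the Wikidata property for “USDA NDB number” and that results arrive in the standard SPARQL JSON shape; the sample rows and NDB values are placeholders, not data from our subgraph.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from collections import defaultdict

# Assumption: P1978 is taken to be the Wikidata property "USDA NDB number".
NDB_QUERY = """
SELECT ?item ?ndb WHERE {
  ?item wdt:P1978 ?ndb .
}
"""


def group_by_item(bindings):
    """Collect the set of NDB numbers returned for each item URI."""
    grouped = defaultdict(set)
    for row in bindings:
        grouped[row["item"]["value"]].add(row["ndb"]["value"])
    return dict(grouped)


# Made-up rows in the SPARQL JSON results shape; the NDB values are placeholders.
sample_bindings = [
    {"item": {"value": "http://www.wikidata.org/entity/Q45989"}, "ndb": {"value": "00001"}},
    {"item": {"value": "http://www.wikidata.org/entity/Q45989"}, "ndb": {"value": "00002"}},
    {"item": {"value": "http://www.wikidata.org/entity/Q146212"}, "ndb": {"value": "00003"}},
]

grouped = group_by_item(sample_bindings)
print(len(sample_bindings), "rows,", len(grouped), "unique items")
```
]]></preformat>
      <p>Grouping by item URI in this way is one possible route from the row count to the unique item count; an item with several identifiers contributes several rows but only one key.</p>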
    </sec>
    <sec id="sec-5">
      <title>3. Matching Food Items to FoodOn</title>
      <p>
        When an organization wants to integrate FoodOn with their data, they need to map foods
from their data to FoodOn identifiers. Researchers have found the LexMapr application to
provide useful matches for food items and FoodOn identifiers [
        <xref ref-type="bibr" rid="ref16 ref17 ref18">16, 17, 18</xref>
        ]. LexMapr is a Python-based Django
application that accepts CSV files of food item labels as input and returns a file with information
about matches and potential matches of the submitted labels with FoodOn identifiers. We used the
LexMapr application<sup>4</sup> to provide candidate matches between food items from our NDB number
subset of Wikidata and FoodOn identifiers. The output file has the following columns: Sample_Desc,
Processed_Sample, Processed_Sample (With Scientific Name), Matched_Components,
Match_Status(Macro Level), and Third Party Classification, as seen in Figure 1.
      </p>
      <p>For some of the food items LexMapr generated full matches. For those items for which
LexMapr generated component matches, we manually reviewed the results. Some of the
component matches turned out to contain a usable FoodOn match and some did not.</p>
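      <p>The separation of full matches from items needing manual review could be sketched as follows. This is an illustration only: it assumes the LexMapr output is available as comma-separated text, and the status strings and sample rows are invented rather than taken from our results. Only the column names come from the output described above.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import csv
import io


def split_by_status(csv_text, status_column="Match_Status(Macro Level)"):
    """Return (full_matches, needs_review) lists of Sample_Desc values."""
    full_matches, needs_review = [], []
    for row in csv.DictReader(io.StringIO(csv_text)):
        status = row[status_column].strip().lower()
        # Assumption: full matches carry a status that starts with "full".
        target = full_matches if status.startswith("full") else needs_review
        target.append(row["Sample_Desc"])
    return full_matches, needs_review


# Hypothetical export: the rows and status strings are invented for illustration.
sample_csv = (
    "Sample_Desc,Match_Status(Macro Level)\n"
    "kale,Full Term Match\n"
    "apple pie filling,Component Match\n"
)

full_matches, needs_review = split_by_status(sample_csv)
```
]]></preformat>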
    </sec>
    <sec id="sec-6">
      <title>4. Adding Food Items Subset from Wikidata to WikiFCD</title>
      <p>
        We wrote a bot script using WikidataIntegrator, a Python library, to create new food items
in WikiFCD for the food items in the Wikidata subset. Numerous research groups use
WikidataIntegrator to add data to Wikidata and to Wikibases [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. We designed the bot script to add
statements with links back to the corresponding food items in Wikidata. This set of mappings is
useful for writing SPARQL queries to ask questions about the data in WikiFCD in combination
with the data from Wikidata.
      </p>
    </sec>
    <sec id="sec-7">
      <title>5. Leveraging Multilingual Content in Wikimedia Projects</title>
      <p>FoodOn provides food names in English as well as alternative names in English. The FoodOn
curation team would like to include multilingual content in order to make FoodOn more useful
for people working in other linguistic contexts. Multilingual data is expensive to create and
difficult to source. The purpose of this paper is to compare and contrast five strategies for
extracting multilingual labels for food items from projects of the Wikimedia Foundation.</p>
      <sec id="sec-7-1">
        <title><sup>4</sup> https://lexmapr.cidgoh.ca/user-guide/</title>
        <p>
          The mappings that we created between items in WikiFCD and Wikidata allow us to write federated
SPARQL queries that combine data from WikiFCD with data from Wikidata. Wikidata is a
multilingual knowledge base with support for more than three hundred human languages
[
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. We identify five graphs of multilingual data from the Wikidata knowledge base that
the FoodOn curation team may evaluate for suitability for integration into FoodOn. The five
graphs are structurally distinct, in that they are modeled differently in the knowledge base, and
they partially overlap. Because of this overlap, the subgraphs can be compared with one another
to provide an agreement score for various foods.
        </p>
        <p>In WikiFCD we curate multilingual labels for food items when we find them in food
composition table sources. An advantage of this approach is that all multilingual labels have provenance
information and include, at the statement level, references to the source from which we gathered
the data. Using our subgraph of food items in WikiFCD for which we have created mappings
to Wikidata, we then write SPARQL queries to extract multilingual labels for the food items
from Wikidata. An advantage to writing these SPARQL queries is that we can run them again
in the future and check if additional data has been added to Wikidata. With more than twelve
thousand active editors each month contributing to Wikidata, we anticipate that the number of
multilingual labels in Wikidata will increase over time.</p>
        <sec id="sec-7-1-1">
          <title>5.1. Article Names per Language Version of Wikipedia</title>
          <p>
            The Wikidata community has created items in Wikidata for each of the articles across the
different language versions of Wikipedia [
            <xref ref-type="bibr" rid="ref21">21</xref>
            ]. There are currently more than three hundred
active Wikipedias [
            <xref ref-type="bibr" rid="ref22">22</xref>
            ]. Wikidata contains information about mappings between items and the
sitelinks to all of the corresponding articles across the different language versions of Wikipedia.
If we look at a food item such as kale, the Wikidata item for which is Q45989, we can use a
SPARQL query on the Wikidata Query Service to find links to all Wikipedias that have an article
about kale. Currently sixty-two Wikipedias have an article about kale. Each of these Wikipedia
articles has a title, and from these article titles, we can get a sense of what kale is called in these
different languages.
          </p>
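          <p>A query of this kind might look like the following sketch. It relies on the prefixes that the Wikidata Query Service predefines (wd:, schema:, wikibase:) and on the way sitelinks are exposed there; the function names and the hand-written sample response are our own illustrations, not output copied from the service.</p>
          <preformat preformat-type="code"><![CDATA[
```python
def sitelink_query(qid):
    """Build a query for Wikipedia article titles linked to one Wikidata item."""
    return f"""
SELECT ?lang ?title WHERE {{
  ?article schema:about wd:{qid} ;
           schema:inLanguage ?lang ;
           schema:name ?title ;
           schema:isPartOf [ wikibase:wikiGroup "wikipedia" ] .
}}
ORDER BY ?lang
"""


def titles_by_language(results):
    """Map language code to article title from a SPARQL JSON result document."""
    return {
        row["lang"]["value"]: row["title"]["value"]
        for row in results["results"]["bindings"]
    }


# Hand-written sample response; a live query would return one row per Wikipedia.
sample = {"results": {"bindings": [
    {"lang": {"value": "de"}, "title": {"value": "Grünkohl"}},
    {"lang": {"value": "en"}, "title": {"value": "Kale"}},
]}}

query = sitelink_query("Q45989")
titles = titles_by_language(sample)
```
]]></preformat>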
        </sec>
        <sec id="sec-7-1-2">
          <title>5.2. Multilingual Labels from Wikidata Food Items</title>
          <p>
            Another option for finding multilingual data is to consult the labels on Wikidata items. The
Wikidata data model was designed with multilingual content in mind [
            <xref ref-type="bibr" rid="ref21">21</xref>
            ]. More than three
hundred human languages are supported in Wikidata [
            <xref ref-type="bibr" rid="ref20">20</xref>
            ]. If we return to the example of kale,
the Wikidata item for kale currently has labels in seventy-nine languages, a sample of which can
be seen in Figure 2. This means that there are seventeen additional labels beyond the number
of articles across the different Wikipedia language versions. We can compare these labels with
those we found from the Wikipedias to check for consistency and accuracy.
          </p>
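          <p>As a sketch of this comparison, the labels of an item can be requested with rdfs:label and the resulting language codes set against those of the sitelinks. The language codes below are illustrative only, and the helper names are hypothetical. Language codes used for Wikipedias and for Wikidata labels do not always coincide, so a real comparison may need a mapping step.</p>
          <preformat preformat-type="code"><![CDATA[
```python
def label_query(qid):
    """Build a query for all labels of one Wikidata item with their languages."""
    return f"""
SELECT ?lang ?label WHERE {{
  wd:{qid} rdfs:label ?label .
  BIND(LANG(?label) AS ?lang)
}}
"""


def languages_without_article(label_langs, sitelink_langs):
    """Languages that have a Wikidata label but no Wikipedia article title."""
    return sorted(set(label_langs) - set(sitelink_langs))


# Illustrative language codes only, not the actual data for kale.
label_langs = ["de", "en", "fo", "nv"]
sitelink_langs = ["de", "en"]
extra = languages_without_article(label_langs, sitelink_langs)
```
]]></preformat>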
        </sec>
        <sec id="sec-7-1-3">
          <title>5.3. Common Names from Taxon Items in Wikidata</title>
          <p>Wikidata has a property “taxon common name” (P1843) that can be used on taxon items to list
common names for the organism. The common names listed using this property are another
source of multilingual labels for food items. Statements using this property must also
indicate the language of the common name.
In this way we not only know the label, but also the language in which it is found.</p>
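          <p>The following sketch shows one way to retrieve these common names and group them by language. It assumes that the language is carried on each returned literal as the xml:lang field of the SPARQL JSON results; the sample names are illustrative and were not copied from Wikidata.</p>
          <preformat preformat-type="code"><![CDATA[
```python
from collections import defaultdict


def common_name_query(taxon_qid):
    """Build a query for 'taxon common name' (P1843) values on one taxon item."""
    return f"""
SELECT ?name WHERE {{
  wd:{taxon_qid} wdt:P1843 ?name .
}}
"""


def names_by_language(bindings):
    """Group common names by the language tag attached to each literal."""
    grouped = defaultdict(list)
    for row in bindings:
        name = row["name"]
        grouped[name.get("xml:lang", "und")].append(name["value"])
    return dict(grouped)


def languages_needing_review(grouped):
    """Languages with more than one candidate name, which need a manual check."""
    return sorted(lang for lang, names in grouped.items() if len(names) > 1)


# Illustrative bindings; the names are examples, not a copy of Wikidata content.
sample_bindings = [
    {"name": {"type": "literal", "xml:lang": "es", "value": "orégano"}},
    {"name": {"type": "literal", "xml:lang": "es", "value": "mejorana silvestre"}},
    {"name": {"type": "literal", "xml:lang": "de", "value": "Oregano"}},
]

grouped = names_by_language(sample_bindings)
review = languages_needing_review(grouped)
```
]]></preformat>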
        </sec>
        <sec id="sec-7-1-4">
          <title>5.4. Wikidata Lexemes</title>
          <p>
            Wikidata introduced support for creating and editing lexemes in 2018 [
            <xref ref-type="bibr" rid="ref23">23</xref>
            ]. The Wikidata
community creates lexemes, forms and senses in the L namespace [
            <xref ref-type="bibr" rid="ref24">24</xref>
            ]. Editors have already
added more than half a million lexical entries [25]. The words described in the L namespace
include words related to food. A fourth pathway for sourcing multilingual labels related to
foods is to leverage Wikidata’s lexeme data.
          </p>
          <p>Editors use the property ‘item for this sense’ (P5137) to connect senses to items in Wikidata.
In Figure 3 we see the result of searching for ‘oregano’ in the Ordia application [26]. In Figure 4
we see the lexeme ‘oregano’ (L324776) in English. Under the first sense, which has the identifier
‘L324776-S1’, we see a statement that uses ‘item for this sense’ (P5137) connecting the sense to
the Wikidata item for oregano.</p>
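          <p>A query that follows ‘item for this sense’ (P5137) from senses back to their lexemes might be sketched as below. It assumes the ontolex: and wikibase: prefixes predefined by the Wikidata Query Service; the helper names are hypothetical, and the sample binding simply restates the English lexeme mentioned above.</p>
          <preformat preformat-type="code"><![CDATA[
```python
from collections import defaultdict


def lexeme_sense_query(qid):
    """Build a query for lexemes with a sense linked to one item via P5137."""
    return f"""
SELECT ?lexeme ?lemma WHERE {{
  ?lexeme ontolex:sense ?sense ;
          wikibase:lemma ?lemma .
  ?sense wdt:P5137 wd:{qid} .
}}
"""


def lemmas_by_language(bindings):
    """Group (lexeme id, lemma) pairs by the language tag of the lemma."""
    grouped = defaultdict(list)
    for row in bindings:
        lexeme_id = row["lexeme"]["value"].rsplit("/", 1)[-1]
        lang = row["lemma"].get("xml:lang", "und")
        grouped[lang].append((lexeme_id, row["lemma"]["value"]))
    return dict(grouped)


# One hand-written binding in the SPARQL JSON results shape.
sample_bindings = [
    {"lexeme": {"value": "http://www.wikidata.org/entity/L324776"},
     "lemma": {"xml:lang": "en", "value": "oregano"}},
]

lemmas = lemmas_by_language(sample_bindings)
```
]]></preformat>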
        </sec>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>6. Food Items and Multilingual Labels</title>
      <p>We selected six food items as a demonstration data set: purslane, kale, apple, rice, oregano and
chocolate. We wrote SPARQL queries to describe subsets of Wikidata or WikiFCD for each
technique for sourcing multilingual labels described above. This sample allows us to compare
and contrast the suitability of each subgraph of multilingual labels, and to identify advantages
and disadvantages of the different techniques. We also created a web application that allows
users to browse the dataset<sup>5</sup>.</p>
      <p>In Table 1 we list the count of labels that we found in three of the subgraphs: Wikidata labels,
titles of articles in different language versions of Wikipedia, and lexeme senses. For the Wikidata
labels subgraphs we find that apple, rice, and chocolate have more labels than kale, oregano or
purslane. This distribution is likely due to the popularity of these foods. We anticipate that over
time more labels will be added for all of these food items. It is interesting that Wikidata editors
have not yet added labels from some of these language versions of Wikipedia to Wikidata.</p>
      <p><sup>5</sup> https://wikifcd.k2.services/multi-lingual-table</p>
      <p>[Table 1: counts of Wikidata labels, Wikipedia article titles, and lexeme senses for each food item: purslane, oregano, kale, apple, rice, chocolate.]</p>
      <p>
        In terms of the historical layers of information in Wikidata, creating items for all Wikipedia
articles and adding sitelinks between Wikipedias and Wikidata was a very early step in adding
data to Wikidata [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. The order in which data layers were added to Wikidata is likely a factor
in why there are fewer lexeme senses that have been connected to Wikidata items. The Lexeme
namespace was added to Wikidata in 2018, six years after Wikidata launched, so editors have
had less time to contribute data [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ].
      </p>
      <p>In Table 2 we see counts for the number of statements using the property “taxon common
name” on items for the taxa related to the food items from our sample set. The taxon to food
item relationships are: Portulaca oleracea / purslane, Origanum vulgare / oregano, Brassica
oleracea / kale, Malus domestica / apple, Oryza sativa / rice, and Theobroma cacao / chocolate.
We note that the relationships between taxa and food items are not consistently modeled in
Wikidata. For example, Brassica oleracea encompasses many cultivars of food items. We were
not able to find a closer match for kale in Wikidata at the time of writing.</p>
      <sec id="sec-8-1">
        <title>6.1. Wikidata Labels Subgraph</title>
        <p>
          An advantage of sourcing translations from the subgraph that consists of Wikidata food items
and their labels is that many editors see these labels. It is likely that this subgraph will grow the
most quickly out of the five discussed in this paper in the near term, due to the ease of adding
labels to the Wikidata knowledge base. Adding labels in additional languages is work that some
editors monitor [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. Labels, descriptions and aliases are parts of Wikidata items that receive
frequent editor attention [27]. A disadvantage of sourcing translations from this subgraph is
that labels do not have references, thus we do not know the provenance of the labels. There
is also some inconsistency in this subgraph, in that food items are sometimes conflated with
taxon items. For example, in Figure 5, we see that the Wikidata item for ‘kale’ is both a food
item (vegetable) and the taxon Brassica oleracea var. sabellica. It would be preferable to
have two separate Wikidata items, one for the food item and one for the taxon, as is the case for
most food items.
        </p>
        <p>While there are two taxon common names listed on the Wikidata item for ‘kale’ (Q45989), there
are more taxon common names related to ‘kale’ listed on the Wikidata item for ‘Brassica oleracea’
(Q146212), thus we chose to use the item ‘Brassica oleracea’ for the count of multilingual labels
in Table 2.</p>
        <p>A disadvantage of sourcing translations from the subgraph that consists of Wikidata food
items and their labels is that there are no references for labels. Labels, descriptions, and aliases
are structured differently than statements on items in Wikidata, so if we are curious about a
particular label, we cannot turn to a reference for more information. Another disadvantage is
that inconsistent data modeling practices across editors result in some food items also
describing taxa. There should be separate Wikidata items for food items and taxa. This will likely
be cleaned up by the Wikidata community, but it will require time. Looking at the labels for
‘apple’ from this subgraph, we find labels in Faroese, ‘súrepli’, and Navaho, ‘Bilasáana’, among
the 142 labels available in Wikidata.</p>
      </sec>
      <sec id="sec-8-2">
        <title>6.2. Article Titles from Sitelinks to Wikipedias</title>
        <p>Sourcing labels from the subgraph of food items and their sitelinks to different language versions
of Wikipedia relies on the connections between Wikidata items and Wikipedia articles. If the
food item is well-known, there may be articles in many different language versions of Wikipedia,
as seen in Figure 6. Using this subgraph has the advantage of precise food item matches to
articles about those foods, thus the article title is usually a reliable source for multilingual data.
Article titles are high-visibility in that they are frequently seen by readers, and thus errors are
corrected more quickly. Sourcing labels from this subgraph has the disadvantage that new article
titles are only added when new articles are written, which is not a fast process. This contrasts
with the ease of adding additional labels to Wikidata. Adding a label in Wikidata only requires
typing a string and pressing save, whereas more effort is required for an editor to create a new
article in Wikipedia, the title of which would then become an additional multilingual label
candidate.</p>
      </sec>
      <sec id="sec-8-3">
        <title>6.3. Taxon Common Name Subgraph</title>
        <p>Some of the advantages of sourcing translations from the subgraph that consists of Wikidata
taxon items and their taxon common names include the fact that editors can contribute references
on these statements, and that this subgraph is likely to grow over time. As of October 2022
there are more than 780,000 uses of P1843 ‘taxon common name’.</p>
        <p>Sourcing labels from the subgraph of Wikidata taxon items and their taxon common names
has the disadvantage that when multiple common names are provided per language, manual
review is required to determine which of these would be appropriate for consideration
by FoodOn. In the cases where multiple common names are provided for a language, it is
not always clear if the labels provided all refer to the same species. For example, in Figure 7
we see that several of the common names listed on the Wikidata item for Origanum vulgare are
in Spanish. Without further research it is not clear which of these might be the best fit for
FoodOn. Some of them could even be for specific subspecies of Origanum vulgare, and may
need to be moved to the subspecies items as they are added to Wikidata.</p>
        <p>We anticipate that more Wikidata editors will continue to contribute taxon common name
statements, and that this subgraph will continue to grow over time.</p>
      </sec>
      <sec id="sec-8-4">
        <title>6.4. Lexeme Senses Subgraph</title>
        <p>The advantages of sourcing multilingual labels from the subgraph of Wikidata Lexeme Senses
that connect to food items include the fact that there is room for references on these statements,
and that the subgraph is likely to grow as more editors contribute to the L namespace. In Figure
8, we can see that there are a small number of labels that we can source for ‘apple’ from the
L namespace. The Wikidata community creates Lexeme challenges periodically to encourage
participation in the L namespace. For example, a recent challenge related to vegetables resulted
in the addition of more of the connections between food items and lexeme senses<sup>6</sup>. With the
creation of the Abstract Wikipedia community, we believe it is likely that more editors will be
drawn to contribute to the L namespace so that more data from the lexeme namespace will be
available for reuse by Abstract Wikipedia [28].</p>
        <p><sup>6</sup> https://dicare.toolforge.org/lexemes/challenge.php?id=64</p>
      </sec>
      <sec id="sec-8-5">
        <title>6.5. WikiFCD Common Names Subgraph</title>
        <p>In WikiFCD we created a property, “common name” (P76), that we use to record common names
in multiple languages. We add these statements to WikiFCD whenever additional food item
names are provided in the source food composition tables. An advantage of sourcing labels
from WikiFCD is that all common name statements have a reference back to a publication.
For example, one of the food composition tables we integrated into WikiFCD is the SMILING
Food composition table for Vietnam. Food item labels are provided by the team of authors who
prepared the food composition measurements for each food item. In this food composition table
both the food item name in Vietnamese and the food item name in English are provided.</p>
        <p>Currently there are five hundred sixty-three food items in this subgraph. A disadvantage of
sourcing labels from WikiFCD is that the corpus grows more slowly than the Wikidata subgraphs
because a smaller number of people contribute to WikiFCD than Wikidata. We anticipate that
as more people contribute data to WikiFCD this subgraph will grow more quickly. We chose to
exclude some of the labels that the SPARQL query for common names from WikiFCD returned
because they were the names of dishes rather than food items. Some food composition tables
include data for dishes of multiple ingredients as well as individual food items in their datasets.
We also found that there were differences in practices among the authors of food composition
tables. For example, some authors provide taxon information at the species level and some
provide taxon information at the varietal or subspecies levels.</p>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>7. Creating a Dataset of Multilingual Labels for Food Items</title>
      <p>Anyone can reuse data from WikiFCD or Wikidata for any purpose. One or more of the
subgraphs described in this paper could be reused as a source of labels for any food-related
application that requires multilingual access terms for food items. Additionally these subsets
could be periodically monitored for updates from the communities. We anticipate that all of
these subgraphs will be extended by editors adding new information to these wikis. As Wikidata
editors notice errors, they will address them and improve them [29].</p>
      <p>We explored five distinct subgraphs in order to understand which of these would return
relevant label candidates. Another advantage of collecting labels from multiple subgraphs is
that we can cross-check labels with the results of another subgraph. For example, if we look at
the leafy green called ‘kale’ in English, for the Wikipedia article subset we find that the German
language article is titled ‘Grünkohl’. The Wikidata item has the German label ‘Grünkohl’. Seeing
the same label in different subsets increases our confidence that the label is accurate.</p>
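      <p>This kind of cross-check can be expressed as a simple count of how many subgraphs propose the same label for a language. The sketch below is one possible scoring, not a description of an implemented tool; the subgraph names are hypothetical keys, and the sample data restates the kale example above.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from collections import Counter


def agreement(labels_by_subgraph, lang):
    """Count how many subgraphs propose each case-folded label for a language."""
    counts = Counter()
    for labels in labels_by_subgraph.values():
        # A set ensures each subgraph votes at most once per label.
        for label in {text.casefold() for text in labels.get(lang, [])}:
            counts[label] += 1
    return counts


# Sample data restating the kale example; the subgraph keys are hypothetical.
kale = {
    "wikipedia_titles": {"de": ["Grünkohl"]},
    "wikidata_labels": {"de": ["Grünkohl"]},
    "lexeme_senses": {},
}

scores = agreement(kale, "de")
```
]]></preformat>
      <p>A label proposed by two or more subgraphs could then be treated as a stronger candidate than one that appears in a single subgraph.</p>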
      <p>Each of the food items in our sample data set had dozens of distinct language labels across
the sets of relevant subgraphs. In total we counted 173 distinct languages with labels for ‘apple’,
134 distinct languages with labels for ‘chocolate’, 68 distinct languages with labels for ‘kale’,
82 distinct languages with labels for ‘oregano’, 65 distinct languages with labels for ‘purslane’
and 151 distinct languages with labels for ‘rice’. Currently the labels in Wikidata are unevenly
distributed across the supported languages. We anticipate that as more editors join the Wikidata
community they will contribute many more labels in additional languages. Reusing multilingual
content from Wikidata in open scientific projects will increase the accessibility of data produced.
For domains such as food and human nutrition, multilingual data can be shared more widely
which could impact more people.</p>
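      <p>A count of distinct languages per food item, like the figures reported above, amounts to taking the union of language codes across the subgraphs. The following Python sketch shows one way to do this; the function name, the subgraph keys, and the sample labels are hypothetical and are not taken from our dataset.</p>
      <preformat><![CDATA[
```python
def count_label_languages(labels_by_subgraph):
    """Count the distinct language codes that carry a non-empty label in any subgraph."""
    languages = set()
    for labels in labels_by_subgraph.values():
        # Skip empty or whitespace-only labels so they do not inflate the count.
        languages.update(lang for lang, label in labels.items() if label.strip())
    return len(languages)


# Hypothetical labels for one food item; real inputs would come from the subgraphs.
oregano = {
    "wikidata_labels": {"en": "oregano", "de": "Oregano", "es": "orégano"},
    "wikipedia_titles": {"en": "Oregano", "it": "Origanum vulgare"},
    "wikifcd_labels": {"en": "oregano", "fr": ""},
}
language_count = count_label_languages(oregano)
```
]]></preformat>
      <p>A language is counted once no matter how many subgraphs supply a label for it, which is why the union is taken before counting.</p>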
    </sec>
    <sec id="sec-10">
      <title>8. Creating Wikibases for Interoperability</title>
      <p>We chose to create WikiFCD so that we could design a data model to accommodate food
composition data sourced from a diverse range of published sources. Our decision to use
Wikibase allowed us to provide web-based access to WikiFCD, meaning anyone can find this
data online. We had an explicit data model in mind, inspired by the structure of the published
food composition tables we drew from as our data sources. The data in WikiFCD are not all
appropriate for Wikidata. By connecting some items and some properties in WikiFCD to their
Wikidata correlates through mapping statements, we can write federated SPARQL queries that
treat the two resources as if they were a single knowledge base. The SPARQL endpoint for
WikiFCD supports federated queries that allow us to combine data from WikiFCD with data
from Wikidata.</p>
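      <p>As a rough illustration of such a federated query, the Python sketch below assembles a SPARQL query string that follows a mapping statement from a local item to a Wikidata item and then retrieves labels from Wikidata inside a SERVICE clause. Several details are assumptions made for this example: the mapping property IRI is a placeholder rather than the actual WikiFCD property, the mapped values are assumed to be Wikidata entity IRIs, and the function name is hypothetical. The sketch only builds the query text and does not contact any endpoint.</p>
      <preformat><![CDATA[
```python
WIKIDATA_ENDPOINT = "https://query.wikidata.org/sparql"  # public Wikidata SPARQL endpoint


def build_federated_label_query(mapping_property_iri, language, limit=10):
    """Build a SPARQL query that pairs local items with Wikidata labels.

    mapping_property_iri is the IRI of the local property that links an item
    to its Wikidata counterpart (a placeholder here, not the real WikiFCD property).
    """
    return f"""
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
SELECT ?item ?wikidataItem ?label WHERE {{
  ?item <{mapping_property_iri}> ?wikidataItem .
  SERVICE <{WIKIDATA_ENDPOINT}> {{
    ?wikidataItem rdfs:label ?label .
    FILTER(LANG(?label) = "{language}")
  }}
}}
LIMIT {int(limit)}
""".strip()


# Placeholder property IRI for illustration only.
query = build_federated_label_query("https://example.org/prop/direct/P0", "de")
```
]]></preformat>
      <p>The resulting string would be submitted to the local Wikibase query service, which delegates the part of the pattern inside the SERVICE clause to Wikidata and joins the results on the shared item IRI.</p>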
      <p>In the future, if certain subsets of WikiFCD are of interest to the Wikidata community, we
are well-positioned to quickly contribute data to Wikidata itself. We will be able to leverage
the mappings that we created between food items with FoodOn identifiers and appropriate
Wikidata items. As more and more people create Wikibase instances for specialized data, we all
have more data to combine with different subsets of Wikidata through federated queries.</p>
    </sec>
    <sec id="sec-11">
      <title>9. Conclusion</title>
      <p>The five subgraphs we describe in this paper were created by different communities of editors
across multiple projects of the Wikimedia Foundation and the ecosystem of Wikibases. Editors
add content to different language versions of Wikipedia, to Wikidata, and to WikiFCD. As
more people contribute multilingual content, all of these projects become more accessible to
additional language communities.</p>
      <p>Some decisions about the suitability of sourcing labels from any combination of the subgraphs
described in this paper will require manual review. In some cases there will be multiple candidate
labels per language, and a curator will need to evaluate them for suitability. The breadth of
languages covered by these label subgraphs and the fact that the data are free to reuse make this
an attractive source of multilingual content.</p>
      <p>Enriching WikiFCD with multilingual labels is a priority for our project because we want to
be sure that FAIR food composition data for a broad range of foods from diverse cuisines are
easily accessible on the web at no cost. As nutrition plays a part in maintaining health, people
who need data about foods that are not found in popular sources like FoodData Central will
be able to find food composition data in WikiFCD. We created an interactive webapp for our
sample dataset so that others can quickly compare the availability of labels across the subsets
we discussed (https://wikifcd.k2.services/multi-lingual-table).</p>
      <p>Developers of applications, ontologies, vocabularies, and other resources may need a
free-to-reuse source for multilingual content. As a multilingual knowledge base, Wikidata contains
labels in more than three hundred human languages. This means that anyone who needs to
source multilingual labels for words could explore Wikidata to see how complete the current
label inventory is. Depending on the domain, one or more of these subgraphs may contain
enough multilingual labels to increase coverage for a specific project or use case. The purpose
of demonstrating five distinct subgraphs in this paper is to emphasize that there is multilingual
content in different layers of the projects of the Wikimedia Foundation.</p>
    </sec>
    <sec id="sec-12">
      <title>Acknowledgments</title>
      <p>We thank the Joint Food Ontology Working Group for productive discussions about FoodOn.
We thank the Wikidata community for continuing to improve the Wikidata knowledge base.</p>
      <sec id="sec-12-1">
        <title>Linked Data in Linguistics (LDL-2020), 2020, pp. 82–86.</title>
        <p>[25] Ordia, Ordia statistics, 2022. URL: https://ordia.toolforge.org/statistics/, [Online; accessed
13-October-2022].</p>
        <p>[26] F. Å. Nielsen, Ordia: A web application for wikidata lexemes, in: European Semantic Web
Conference, Springer, 2019, pp. 141–146.</p>
        <p>[27] T. Pellissier Tanon, L.-A. Kaffee, Property label stability in wikidata: evolution and
convergence of schemas in collaborative knowledge bases, in: Companion Proceedings of
the The Web Conference 2018, 2018, pp. 1801–1803.</p>
        <p>[28] D. Vrandečić, Building a multilingual wikipedia, Communications of the ACM 64 (2021)
38–41.</p>
        <p>[29] K. Shenoy, F. Ilievski, D. Garijo, D. Schwabe, P. Szekely, A study of the quality of wikidata,
Journal of Web Semantics 72 (2022) 100679. URL:
https://www.sciencedirect.com/science/article/pii/S1570826821000536. doi:10.1016/j.websem.2021.100679.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>K.</given-names>
            <surname>Thornton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Seals-Nutt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Matsuzaki</surname>
          </string-name>
          ,
          <article-title>Introducing wikifcd: Many food composition tables in a single knowledge base</article-title>
          ,
          <source>in: CEUR Workshop Proceedings</source>
          , volume
          <volume>2969</volume>
          ,
          <publisher-name>CEUR-WS</publisher-name>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Tidball</surname>
          </string-name>
          ,
          <string-name>
            <surname>Tidball</surname>
          </string-name>
          ,
          <string-name>
            <surname>Curtis</surname>
          </string-name>
          ,
          <article-title>The absence of wild game and fish species from the usda national nutrient database for standard reference: Addressing information gaps in wild caught foods</article-title>
          <volume>53</volume>
          (
          <year>2014</year>
          ). doi:10.1080/03670244.2013.792077.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Durazzo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Camilli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Marconi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lisciani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Gabrielli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Gambelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Aguzzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lucarini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kiefer</surname>
          </string-name>
          , L. Marletta,
          <article-title>Nutritional composition and dietary intake of composite dishes traditionally consumed in italy</article-title>
          ,
          <source>Journal of Food Composition and Analysis</source>
          <volume>77</volume>
          (
          <year>2019</year>
          )
          <fpage>115</fpage>
          -
          <lpage>124</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>N.</given-names>
            <surname>Monjotin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Amiot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Fleurentin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Morel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Raynal</surname>
          </string-name>
          ,
          <article-title>Clinical evidence of the benefits of phytonutrients in human healthcare</article-title>
          ,
          <source>Nutrients</source>
          <volume>14</volume>
          (
          <year>2022</year>
          ). URL: https://www.mdpi.com/2072-6643/14/9/1712. doi:10.3390/nu14091712.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Gibbs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. P.</given-names>
            <surname>Cappuccio</surname>
          </string-name>
          ,
          <article-title>Plant-based dietary patterns for human and planetary health</article-title>
          ,
          <source>Nutrients</source>
          <volume>14</volume>
          (
          <year>2022</year>
          ). URL: https://www.mdpi.com/2072-6643/14/8/1614. doi:10.3390/nu14081614.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>H.</given-names>
            <surname>Mechchate</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>El Allam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>El Omari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>El Hachlafi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Shariati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Wilairatana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Mubarak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bouyahya</surname>
          </string-name>
          ,
          <article-title>Vegetables and their bioactive compounds as anti-aging drugs</article-title>
          ,
          <source>Molecules</source>
          <volume>27</volume>
          (
          <year>2022</year>
          ). URL: https://www.mdpi.com/1420-3049/27/7/2316. doi:10.3390/molecules27072316.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>G.</given-names>
            <surname>Menichetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.-L.</given-names>
            <surname>Barabási</surname>
          </string-name>
          ,
          <article-title>Nutrient concentrations in food display universal behaviour</article-title>
          ,
          <source>Nature Food</source>
          <volume>3</volume>
          (
          <year>2022</year>
          )
          <fpage>375</fpage>
          -
          <lpage>382</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Shimizu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hitzler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Sheill</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. G.</given-names>
            <surname>Estrecha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Foley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Tarr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Rehberger</surname>
          </string-name>
          ,
          <article-title>The enslaved dataset: A real-world complex ontology alignment benchmark using wikibase</article-title>
          ,
          <source>in: Proceedings of the 29th ACM International Conference on Information &amp; Knowledge Management</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>3197</fpage>
          -
          <lpage>3204</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>D.</given-names>
            <surname>Diefenbach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. D.</given-names>
            <surname>Wilde</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Alipio</surname>
          </string-name>
          ,
          <article-title>Wikibase as an infrastructure for knowledge graphs: The eu knowledge graph</article-title>
          , in: International Semantic Web Conference, Springer,
          <year>2021</year>
          , pp.
          <fpage>631</fpage>
          -
          <lpage>647</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>K.</given-names>
            <surname>Thornton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Seals-Nutt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Matsuzaki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Damion</surname>
          </string-name>
          ,
          <article-title>Reuse of the foodon ontology in a knowledge base of food composition data</article-title>
          ,
          <source>Semantic Web Journal</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>J. E.</given-names>
            <surname>Labra-Gayo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ammar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Brickley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. F.</given-names>
            <surname>Álvarez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. G.</given-names>
            <surname>Hevia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. J.</given-names>
            <surname>Gray</surname>
          </string-name>
          , E. Prud'hommeaux, D. Slater,
          <string-name>
            <given-names>H.</given-names>
            <surname>Solbrig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. A. H.</given-names>
            <surname>Beghaeiraveri</surname>
          </string-name>
          , et al.,
          <article-title>Knowledge graphs and wikidata subsetting</article-title>
          ,
          <source>BioHackathon Europe</source>
          <year>2020</year>
          (
          <year>2021</year>
          ). URL: https://biohackrxiv.org/wu9et/.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          FORCE11,
          <article-title>The fair data principles</article-title>
          (
          <year>2014</year>
          ). https://www.force11.org/group/fairgroup/fairprinciples.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>B.</given-names>
            <surname>Mons</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Neylon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Velterop</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumontier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. O. B.</given-names>
            <surname>da Silva Santos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. D.</given-names>
            <surname>Wilkinson</surname>
          </string-name>
          ,
          <article-title>Cloudy, increasingly fair; revisiting the fair data guiding principles for the european open science cloud</article-title>
          ,
          <source>Information Services &amp; Use</source>
          (
          <year>2017</year>
          )
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>S.</given-names>
            <surname>Haussmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Seneviratne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Ne'eman</surname>
          </string-name>
          , J. Codella,
          <string-name>
            <given-names>C.-H.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. L.</given-names>
            <surname>McGuinness</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Zaki</surname>
          </string-name>
          ,
          <article-title>Foodkg: a semantics-driven knowledge graph for food recommendation</article-title>
          , in: International Semantic Web Conference, Springer,
          <year>2019</year>
          , pp.
          <fpage>146</fpage>
          -
          <lpage>162</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>E.</given-names>
            <surname>Chazelas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Deschasaux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Srour</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Kesse-Guyot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Julia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Alles</surname>
          </string-name>
          , N. Druesne-Pecollo, P. Galan,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hercberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Latino-Martel</surname>
          </string-name>
          , et al.,
          <article-title>Food additives: distribution and co-occurrence in 126,000 food products of the french market</article-title>
          ,
          <source>Scientific reports</source>
          <volume>10</volume>
          (
          <year>2020</year>
          )
          <fpage>1</fpage>
          -
          <lpage>15</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>M.</given-names>
            <surname>Balkey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Batz</surname>
          </string-name>
          , G. Gopinath,
          <string-name>
            <given-names>G.</given-names>
            <surname>Gosal</surname>
          </string-name>
          , E. Griffiths,
          <string-name>
            <given-names>H.</given-names>
            <surname>Tate</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Timme</surname>
          </string-name>
          ,
          <article-title>(v) standardizing the isolation source metadata for the genomic epidemiology of foodborne pathogens using lexmapr</article-title>
          ,
          <source>IAFP 2021</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>D.</given-names>
            <surname>Dooley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Andres-Hernandez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Bordea</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Carmody</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Cavalieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Castellano-Escuder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Lachat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Mougin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Vitali</surname>
          </string-name>
          , et al.,
          <article-title>Obo foundry food ontology interconnectivity</article-title>
          ,
          <source>in: CEUR Workshop Proceedings</source>
          , volume
          <volume>2969</volume>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>J.</given-names>
            <surname>Pires</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. S.</given-names>
            <surname>Huisman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bonhoeffer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. P.</given-names>
            <surname>Van Boeckel</surname>
          </string-name>
          ,
          <article-title>Increase in antimicrobial resistance in escherichia coli in food animals between 1980 and 2018 assessed using genomes from public databases</article-title>
          ,
          <source>Journal of Antimicrobial Chemotherapy</source>
          <volume>77</volume>
          (
          <year>2022</year>
          )
          <fpage>646</fpage>
          -
          <lpage>655</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>A.</given-names>
            <surname>Waagmeester</surname>
          </string-name>
          , G. Stupp,
          <string-name>
            <given-names>S.</given-names>
            <surname>Burgstaller-Muehlbacher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. M.</given-names>
            <surname>Good</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Griffith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. L.</given-names>
            <surname>Griffith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hanspers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Hermjakob</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. S.</given-names>
            <surname>Hudson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hybiske</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Keating</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Manske</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mayers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Mietchen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Mitraka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Pico</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Putman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Timothy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Queralt-Rosinach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. M.</given-names>
            <surname>Schriml</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Shafee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Slenter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Stephan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Thornton</surname>
          </string-name>
          , G. Tsueng,
          <string-name>
            <given-names>R.</given-names>
            <surname>Tu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ul-Hasan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Willighagen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. I.</given-names>
            <surname>Su</surname>
          </string-name>
          ,
          <article-title>Wikidata as a knowledge graph for the life sciences</article-title>
          ,
          <source>Elife</source>
          <volume>9</volume>
          (
          <year>2020</year>
          )
          <elocation-id>e52614</elocation-id>
          . URL: https://doi.org/10.7554/ELIFE.52614.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>L.-A.</given-names>
            <surname>Kaffee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Piscopo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Vougiouklis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Simperl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Carr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Pintscher</surname>
          </string-name>
          ,
          <article-title>A Glimpse into Babel: An Analysis of Multilinguality in Wikidata</article-title>
          ,
          <source>in: Proceedings of the 13th International Symposium on Open Collaboration, OpenSym '17</source>
          ,
          <publisher-name>ACM</publisher-name>
          , New York, NY, USA,
          <year>2017</year>
          , pp.
          <fpage>14:1</fpage>
          -
          <lpage>14:5</lpage>
          . URL: https://doi.org/10.1145/3125433.3125465.
          doi:10.1145/3125433.3125465.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>D.</given-names>
            <surname>Vrandečić</surname>
          </string-name>
          ,
          <article-title>Wikidata: A new platform for collaborative data collection</article-title>
          ,
          <source>in: Proceedings of the 21st International Conference Companion on World Wide Web, ACM</source>
          ,
          <year>2012</year>
          , pp.
          <fpage>1063</fpage>
          -
          <lpage>1064</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>Meta</surname>
          </string-name>
          , List of Wikipedias, Meta,
          <source>discussion about Wikimedia projects</source>
          ,
          <year>2022</year>
          . URL: https://meta.wikimedia.org/w/index.php?title=List_of_Wikipedias&amp;oldid=23800107, [Online; accessed 8-October-2022].
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>B.</given-names>
            <surname>Cartoni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. C.</given-names>
            <surname>Aros</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Vrandečić</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lertpradit</surname>
          </string-name>
          ,
          <article-title>Introducing lexical masks: a new representation of lexical entries for better evaluation and exchange of lexicons</article-title>
          ,
          <source>in: Proceedings of the 12th Language Resources and Evaluation Conference</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>3046</fpage>
          -
          <lpage>3052</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>F.</given-names>
            <surname>Nielsen</surname>
          </string-name>
          , Lexemes in Wikidata: 2020 status,
          <source>in: Proceedings of the 7th Workshop on</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>