<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>CEA CTA CPA
SemTab</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>MammoTab: a giant and comprehensive dataset for Semantic Table Interpretation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mattia Marzocchi</string-name>
          <email>m.marzocchi@campus.unimib.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco Cremaschi</string-name>
          <email>marco.cremaschi@unimib.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Riccardo Pozzi</string-name>
          <email>riccardo.pozzi@unimib.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roberto Avogadro</string-name>
          <email>roberto.avogadro@unimib.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Matteo Palmonari</string-name>
          <email>matteo.palmonari@unimib.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Milan - Bicocca</institution>
          ,
          <addr-line>viale Sarca 336, Edificio U14, 20126, Milan</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <volume>4</volume>
      <abstract>
        <p>In this paper, we present MammoTab, a dataset composed of 1M Wikipedia tables extracted from over 20M Wikipedia pages and annotated through Wikidata. The lack of this kind of datasets in the stateof-the-art makes MammoTab a good resource for testing and training Semantic Table Interpretation approaches. The dataset has been designed to cover several key challenges, such as disambiguation, homonymy, and NIL-mentions. The dataset has been evaluated using MTab, one of the best approaches of the SemTab challenge.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Semantic Table Interpretation</kwd>
        <kwd>Tabular Data</kwd>
        <kwd>SemTab Challenge</kwd>
        <kwd>Knowledge Graph</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        international challenge1, now in its 4th version. The challenge consists of diferent rounds in
which groups of tables with diferent features and levels of dificulty have to be annotated. The
increased interest in the STI has led to the construction of several datasets (gold standards)
in the last decade. As it will be better described later, these datasets often include only a part
of the characteristics of Web tables (e.g., small tables with few mentions, easy to annotate
semantically [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]).
      </p>
      <p>As a consequence, we created a new dataset, MammoTab, composed of 980 254 tables
extracted from 21 149 260 Wikipedia pages and annotated through Wikidata. The number and the
diferent features of the tables make</p>
      <sec id="sec-1-1">
        <title>MammoTab a good resource for testing and/or training</title>
        <p>STI approaches. In particular, because of its dimension MammoTab is a useful tool to train
data-hungry models, which require a vast amount of data (e.g., entity linking systems based on
large language models).</p>
        <p>The rest of the paper is organised as follows. Section 2 will present a brief analysis of the
state-of-the-art related datasets used in the context of the STI. Subsequently, MammoTab is
described (Section 3), its characteristics are listed (Section 3.1), and the pipeline used for its
implementation is presented (Section 3.2). The evaluation is depicted in Section 4 through a
state-of-the-art STI approach.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Datasets</title>
      <p>Although several approaches deal with semantic annotations on tabular data, there are limited
Gold Standards (GSs) for assessing the quality of these annotations. The main ones are T2Dv2,
Limaye, Musicbrainz, IMDB, Taheryan, Tough Table and SemTab. Table 1 shows statistics for
the GSs.</p>
      <p>An excellent STI approach must consider and adequately balance the diferent features of
a table (or a set of tables). The annotation involves several key challenges: i) disambiguation:
the class of the entities described in a table are not known in advance, and those entities may
correspond to more than one class in the KG. ii) homonymy: this issue is related to the presence
of diferent entities with the same name and class. iii) matching: the mention in the table may
be syntactically diferent from the label of the entity in a KG (i.e., use of acronyms, aliases
and typos). iv) NIL-mentions: the approach must also consider strings that refer to entities
for which a representation has not yet been created within the KG, namely NIL-mentions. v)
literal and named-entity: in a table, there can be columns that contain named-entity mentions
(NE-column) and columns containing strings (L-column). vi) missing context: it is often easier
to extract the context from textual documents than from tables due to the amount of content
to be processed. For instance, the header (i.e., the first row of a table) which usually contains
descriptive attributes for the columns, may or may not be present. vii) amount of data: the
approach must consider large tables with many rows and columns, and tables with very few
mentions. viii) diferent domains : the tables within a set can belong to very general or specific
domains. MammoTab has been designed to cover all these cases, making it a resource for
evaluating or training STI approaches.</p>
      <p>
        The dataset is made up of tables automatically extracted from Wikipedia. Some pipelines
for extracting tables from Wikipedia have been presented in the state-of-the-art. Among these,
TabEL [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] proposes a STI approach and a dataset (WikiTable corpus) composed of 1.6M tables
from the November 2013 XML dump of English Wikipedia. The dataset focuses only on the
CEA task using YAGO as reference KG. It should be noted that the WikiTable is outdated and
the code has not been made available2. Another paper [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] proposes a dataset composed of
670 171 tables extracted from WikiTable corpus to which the Wikipedia page title, section title,
and table caption have been added to obtain a more comprehensive description. However, the
dataset has not been released, nor has the source code for its generation.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. The MammoTab Dataset</title>
      <p>The annotations inside MammoTab are based on Wikidata v. 20220520 and are provided
following the structure used by the SemTab challenge. One table is stored in one CSV file,
and each line corresponds to a table row. The target columns for annotation, CTA, and CEA
annotations are saved in CSV files. A JSON document has also been created for each Wikipedia
page with additional information about the tables extracted by that page (see Listing 1). We
released MammoTab in Zenodo3 following the FAIR Guiding Principles4. The dataset is released
under the Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) licence5. MammoTab contains
a variety of tables that allow to evaluate approaches considering the challenges previously
listed: in the dataset there are tables containing i) entities hard to disambiguate (e.g., table id
LBJJ1WGD - reactor Clinton)6, ii) cases of homonymy (e.g., table id MRBWAAOA - soccer player</p>
      <sec id="sec-3-1">
        <title>2websail-fe.cs.northwestern.edu/TabEL/</title>
        <p>3zenodo.org/deposit/7014472
4www.nature.com/articles/sdata201618
5creativecommons.org/licenses/by-sa/4.0/
6en.wikipedia.org/wiki/List_of_cancelled_nuclear_reactors_in_the_United_States
Michael Jordan)7, iii) aliases (table id JCND1XGG - Tom Riddle alias of Lord Voldemort)8, and
iv) NIL-mentions (e.g., table id MSTBGKPR - KKOP-LP Wildcat Broad. Inc)9.</p>
        <sec id="sec-3-1-1">
          <title>3.1. Dataset Profile</title>
          <p>The MammoTab tables were extracted from 21 149 260 Wikipedia pages using the XML dump10.
In these pages 2 803 424 tables were detected. Among these, the tables with at least three links
in the same column have been stored for a total amount of 980 254 tables. Some dataset statistics
are reported in Table 2.</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>3.2. Implementation</title>
          <p>The dataset was built through a pipeline implemented as a set of Python scripts, which are
available in a Git repository11.</p>
          <p>The pipeline consists of 10 steps: i) dump processing: each Wikipedia XML dump file (we
used the multiple bz2 stream dumps for easier parallelisation) is parsed using BeautifulSoup12
to extract each page; ii) tables identification : pages are scanned to find those that contain at least
one table (wikitext syntax: | class=wikitable |); iii) tables normalisation and cleaning: each cell
is normalised and cleaned up using custom and wikitextparser13 functions. Elements such as
subscripts, superscripts, elements of the wikitext syntax, images, Wikipedia help and project
pages links, and links to external pages are removed; iv) tables analysis: each table is analysed
to check for cells that contain links to Wikipedia pages (wikitext syntax: [[link]]). A cell
is considered as a mention of an entity only if the entire cell is a link. The remaining cells,
containing multiple links or additional words around the link, may also be mentions of entities,
but we consider them uncertain and mark them as UNKNOWN; v) tables storing: tables that
have at least three fully linked cells in a column are stored; vi) table header and table caption
detection: table header (wikitext syntax: !header) and table caption, if any, are stored and added
to the current table; vii) column analysis: each column is analysed and classified into Literal
columns (L-column) for datatype values (e.g., strings, numbers, dates, such as 4808, 10/04/1983),
7en.wikipedia.org/wiki/USA_Today_All-USA_high_school_football_team
8en.wikipedia.org/wiki/Christian_Coulson
9en.wikipedia.org/wiki/List_of_radio_stations_in_Nebraska
10dumps.wikimedia.org/enwiki/20220720/
11bitbucket.org/disco_unimib/mammotab/
12www.crummy.com/software/BeautifulSoup/
13github.com/5j9/wikitextparser
or Named-Entity columns (NE-column) if it contains links to Wikipedia pages; viii) entity linking
- CEA: for each Wikipedia link, the related Wikidata entity is extracted; ix) column annotation
CTA: column types are set by choosing the most specific entity class (according to Wikidata
subclass relationships) that is shared by most of the column rows. For columns with less than 5
rows, all cells must be instances of that class, while, for bigger columns, at least 60% of the cells
are instances of that class; x) NIL-identification : we mark as NIL the cells containing Wikipedia
red links14, which are those links referring to a page that does not exist.</p>
          <p>
            The Listing 1 shows an example of a JSON document used to manage the results of the process
described above on the Wikipedia page about “As Long as You Love Me ( Justin Bieber song)”15.
Listing 1: JSON document with the information relating to a Wikipedia page that contains at
least one table.
{"wiki_id": ’36115735’,
""ttaibtllee"s:":’A{s Long as You Love Me (Justin Bieber song)’,
"XXI7BFMW": {
"""chlaeipantdk"ie:or"n[":[:’[[’’’,’[P,’’rR’o’e’m,go,i’toM’inu’’os,,nia’’cl’D_r]ade,tolwee’n,also’eaFddo’a,rtm’eaIsts’fl,oa’nrLd"a_AbRseeclL’oo]rn]d,gs’a],s You Love Me"’,
[’’, ’’, ’Music_download’, ’Island_Records’],
...],
"text": [[’Region’, ’Date’, ’Format’, ’Label’],
[’United States’, ’June 11, 2012’, ’Digital Download’, ’Island Records’],
[’Canada’, ’June 11, 2012’, ’Digital Download’, ’Island Records’],
...],
"target_col": [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ],
"entity": [[’[’’,’,’’’’,,’’Q’6,47’3’5]6,4’, ’Q190585’],
          </p>
          <p>
            [’’, ’’, ’Q6473564’, ’Q190585’],
"types": [[.[[.[],.],][,[],],[[’Q]8,1[
            <xref ref-type="bibr" rid="ref9">9</xref>
            ]4]1,037’], [’Q18127’]],
[[], [], [’Q81941037’], [’Q18127’]],
...],
"col_types": [[], [], [[’Q81941037’, 0.8571428571428571]], [[’Q18127’, 0.2857142857142857]]],
"col_type_perfect": [’’, ’’, ’Q81941037’, ’’]}}}
          </p>
          <p>A re-run of the Python scripts, simply pointing a new
allows to obtain a new version of the dataset.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Evaluation</title>
      <p>
        Wikipedia XML dump file to process 16,
The experiments in this Section aim to demonstrate how the use of MammoTab allows
identifying the weaknesses of a STI approach, with particular reference to the key challenges reported in
Section 2. The Mtab [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] approach was considered since it won several versions of the SemTab
challenge. A sample of 5 000 tables was selected without NIL-mentions due to the limitations of
the Mtab17 free API. Table 3 reports the results obtained by the approach.
      </p>
      <p>
        The values obtained by MTab versus MammoTab are lower than the other datasets. A
substantial decrease in the F1-Score in the CTA can also be noted. The results show that in the
current version, the MammoTab tables are a valuable resource for testing STI approaches which
must be characterised by sophisticated mechanisms that consider many semantics aspects.
However, as done in other datasets [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], it is possible to add some noise (i.e., adding misspelt or
fake mentions) to increase the complexity of the annotation task.
      </p>
      <p>14en.wikipedia.org/wiki/Wikipedia:Red_link
15en.wikipedia.org/wiki/As_Long_as_You_Love_Me_( Justin_Bieber_song)
16dumps.wikimedia.org/backup-index.html
17mtab.app/mtab/docs</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Neumaier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Umbrich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. X.</given-names>
            <surname>Parreira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Polleres</surname>
          </string-name>
          <article-title>, Multi-level semantic labelling of numerical values</article-title>
          ,
          <source>in: The Semantic Web - ISWC 2016</source>
          , Springer International Publishing, Cham,
          <year>2016</year>
          , pp.
          <fpage>428</fpage>
          -
          <lpage>445</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kejriwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Knoblock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Szekely</surname>
          </string-name>
          ,
          <article-title>Knowledge graphs: Fundamentals, techniques, and applications</article-title>
          , MIT Press,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>V.</given-names>
            <surname>Cutrona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Efthymiou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Hassanzadeh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Jimenez-Ruiz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Sequeda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Srinivas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Abdelmageed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hulsebos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Oliveira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Pesquita</surname>
          </string-name>
          ,
          <source>Results of semtab</source>
          <year>2021</year>
          , in: 20th
          <source>International Semantic Web Conference</source>
          , volume
          <volume>3103</volume>
          , CEUR Workshop Proceedings,
          <year>2022</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , E. Meij,
          <string-name>
            <given-names>K.</given-names>
            <surname>Balog</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Reinanda</surname>
          </string-name>
          ,
          <article-title>Novel entity discovery from web tables</article-title>
          ,
          <source>in: Proceedings of The Web Conference</source>
          <year>2020</year>
          , WWW '20,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2020</year>
          , p.
          <fpage>1298</fpage>
          -
          <lpage>1308</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.</given-names>
            <surname>Ritze</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          ,
          <article-title>Matching web tables to dbpedia - a feature utility study</article-title>
          ,
          <source>in: Proceedings of the 20th International Conference on Extending Database Technology, EDBT</source>
          <year>2017</year>
          , Venice, Italy, March
          <volume>21</volume>
          -24,
          <year>2017</year>
          , OpenProceedings, Konstanz,
          <year>2017</year>
          , pp.
          <fpage>210</fpage>
          -
          <lpage>221</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>G.</given-names>
            <surname>Limaye</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sarawagi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chakrabarti</surname>
          </string-name>
          ,
          <article-title>Annotating and searching web tables using entities, types and relationships</article-title>
          ,
          <source>Proc. VLDB Endow</source>
          .
          <volume>3</volume>
          (
          <year>2010</year>
          )
          <fpage>1338</fpage>
          -
          <lpage>1347</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <article-title>Efective and eficient semantic table interpretation using tableminer+</article-title>
          ,
          <source>Semantic Web</source>
          <volume>8</volume>
          (
          <year>2017</year>
          )
          <fpage>921</fpage>
          -
          <lpage>957</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Taheriyan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Knoblock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Szekely</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Ambite</surname>
          </string-name>
          ,
          <article-title>Leveraging linked data to discover semantic relations within data sources</article-title>
          ,
          <source>in: The Semantic Web - ISWC</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>549</fpage>
          -
          <lpage>565</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>V.</given-names>
            <surname>Cutrona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Bianchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Jimenez-Ruiz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Palmonari</surname>
          </string-name>
          ,
          <article-title>Tough tables: Carefully evaluating entity linking for tabular data</article-title>
          ,
          <source>in: The Semantic Web - ISWC 2020, Lecture Notes in Computer Science</source>
          , Springer International Publishing,
          <year>2020</year>
          , pp.
          <fpage>328</fpage>
          -
          <lpage>343</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>C. S.</given-names>
            <surname>Bhagavatula</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Noraset</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Downey</surname>
          </string-name>
          , Tabel:
          <article-title>Entity linking in web tables</article-title>
          , in: M.
          <string-name>
            <surname>Arenas</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          <string-name>
            <surname>Corcho</surname>
            , E. Simperl,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Strohmaier</surname>
            , M. d'Aquin,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Srinivas</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Groth</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Dumontier</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Heflin</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Thirunarayan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Thirunarayan</surname>
          </string-name>
          , S. Staab (Eds.),
          <source>The Semantic Web - ISWC 2015</source>
          , Springer International Publishing, Cham,
          <year>2015</year>
          , pp.
          <fpage>425</fpage>
          -
          <lpage>441</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>X.</given-names>
            <surname>Deng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lees</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <article-title>Turl: Table understanding through representation learning</article-title>
          ,
          <source>SIGMOD Rec</source>
          .
          <volume>51</volume>
          (
          <year>2022</year>
          )
          <fpage>33</fpage>
          -
          <lpage>40</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>P.</given-names>
            <surname>Nguyen</surname>
          </string-name>
          , I. Yamada,
          <string-name>
            <given-names>N.</given-names>
            <surname>Kertkeidkachorn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ichise</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Takeda</surname>
          </string-name>
          ,
          <year>Semtab 2021</year>
          :
          <article-title>Tabular data annotation with mtab tool</article-title>
          ., in: SemTab@ ISWC,
          <year>2021</year>
          , pp.
          <fpage>92</fpage>
          -
          <lpage>101</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>