<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Evaluating Semantic Interoperability of Government Open Data Portals</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>National University of Kaohsiung</institution>
          ,
          <addr-line>Kaohsiung 811</addr-line>
          ,
          <country country="TW">Taiwan</country>
        </aff>
      </contrib-group>
      <fpage>0000</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>Open data emerged in the last decade as governments worldwide endeavor to publicize their owned data for government transparency, effectiveness, and social goodness. Many applications and use cases of open government data have been reported and revealed the potentials of them. Most of the users accessed these data via portals established by governments in different scales. Aspects such as usability, accessibility, and openness should be considered in building such portals to provide users data and services in high quality. In this study, we applied a set of measures covering the semantic aspects of open data portal design and maintenance to evaluate the quality of government open data portals. Results on several major national-level portals were reported.</p>
      </abstract>
      <kwd-group>
        <kwd>Government Open Data</kwd>
        <kwd>Open Data Portal</kwd>
        <kwd>Open Data Quality</kwd>
        <kwd>Portal Quality Evaluation</kwd>
        <kwd>Semantic Interoperability</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Drive by the trend of open data, authorities of the public sector release their data
actively or proactively and play the roles of data providers, producing enormous amount
of government open data (GOD). In this aspect, the governments can improve many
aspects of governance, such as transparency, accountability, innovative and intelligent
services, and government-citizen interactivity, through the use of open data. It is
essential and common to provide a portal, which is generally implemented as a Web service,
for accessing a large quantity of data maintained by their owners. Many governments
have established their open data portals as the volume of GOD increases drastically
recently. As a Web service, an open data portal should meet requirements in various
aspects to provide quality services. The arguments on the architecture and functions of
open data portals are still ongoing since we lack standard and good practices on these
subjects. Schemes for evaluating the quality of portals are thus required for data
providers in building their portals. The quality assessment should take account of several
aspects such as functionality, usability, accessibility, openness, and so forth. Several
schemes have been devised to tackle this problem (Máchová &amp; Lnénicka, 2017).</p>
      <p>Data semantics is important information that can be applied to many domains such
as information retrieval, e-commerce, knowledge acquisition, etc. An open data portal
that can provide datasets together with much semantic information will be helpful for
service providers and application developers. Thus, it is crucial to access the semantic
interoperability, which is defined here as the completeness and easiness of
implementation and access of the data semantics, of an open data portals. However, current
practices on open data evaluation did not pay much attention to this issue. In this work, we
will try to give an initial attempt on evaluating the semantic interoperability of open
data portals.</p>
      <p>
        Quality Assessment Measures for Open Data Portals
Open data portals are essential infrastructure for disseminating open government data.
Governments publish their data on these portals and allow individuals or organizations
to use them freely to create values. Currently, several measures and metrics have been
proposed to ensure the effectiveness of such portals (Máchová &amp; Lnénicka, 2017).
According to Máchová &amp; Lnénicka’s survey, the United Kingdoms ranks first, followed
by India and the United States. However, Taiwan was ranked 21 out of 89, which
contrasts with the result of Global Open Data Index1 and Open Data Barometer2
        <xref ref-type="bibr" rid="ref2">(Brown,
2017)</xref>
        .
      </p>
      <p>Unfortunately, the semantic interoperability of open data portals were rarely
discussed. To evaluate the semantic interoperability of the open data portals, we observed
that two major aspects should be addressed. First, the descriptive semantics of datasets
should be provided. These descriptive information could be stored as their metadata,
which could be easily conformed to existing standards. Second, the portals should
provide a way to allow users utilizing such data semantics. In this work, we identified 22
indices which could be used to evaluate the semantic interoperability of open data
portals. Table 1 lists the proposed indices. These indices were categorized into 2 classes,
namely ‘semantic description’ and ‘semantic operation’, which reflect the two aspects
mentioned above. We believe that these indices could reflect semantic information in
aspects of content and access to some extent. Evaluations on such indices may give an
insight into the semantic interoperability of the open data portals.
1 https://index.okfn.org/
2 https://opendatabarometer.org/</p>
      <p>Semantic
operation</p>
      <p>The datasets are tagged with keywords
Each dataset is accompanied by a reference to the
lan</p>
      <p>guage used
If a dataset refers to a specific range of time, its temporal</p>
      <p>coverage is specified
If a dataset refers to a specific geographical area, its spatial</p>
      <p>coverage is specified
The descriptive record contains a direct link to the URL</p>
      <p>of the data
The name and email address of the publisher of a dataset</p>
      <p>are provided
The frequency with which dataset is updated is provided
Each dataset is accompanied by a descriptive record
Metadata associated with each dataset is available in a</p>
      <p>standard format
Metadata describing the datasets is structured in a
stand</p>
      <p>ard way
Any vocabularies used within the dataset are identified</p>
      <p>and documented
Topics are linked to published vocabularies and
taxono</p>
      <p>mies
Each dataset is given a unique identifier
A checksum and/or signature is available to verify the
va</p>
      <p>lidity of each file
Data adheres to the defined syntax of any specified
vocab</p>
      <p>ularies
It is possible to query data and metadata in accordance</p>
      <p>with standards of the web of data (Linked Open Data)</p>
      <p>Data sources linked from a dataset are reported
3</p>
      <p>
        Evaluations on Semantic Interoperability of GOD Portals
For preliminary justification of the proposed metrics, we adopted the indices in Table
1 to evaluate some open data portals. In this study, six examiners were assembled to
assess the portals using the checklist during June 2015. These examiners are all
graduate students taking the open data related courses for at least two semesters. They were
also trained in how to apply the indices. We translated the indices into Chinese and
gave two-hour instructions on the meaning of each index. Examiners then assessed
every portal independently. For each index, the five-level Likert scale was adopted to
score the index where a score of 5 means that the function or requirement is fully
implemented or applicable and 1 means the function or requirement is missed entirely
or inapplicable. The scores for each index were then aggregated over all examiners as
its final score. We evaluated three GOD portals, namely US3, UK4, and Taiwan5, in the
preliminary experiments. The result is shown in Table 2. We only show the overall
result of the two categories due to space limitation. The result of the evaluation shows
that the US portal outperformed the other two in both categories. However, all portals
received lower ratings in ‘semantic operation’ category, reflecting the fact that
implementation of functionality and support for standards such as linked data
        <xref ref-type="bibr" rid="ref1">(Berners-Lee,
2006)</xref>
        for access and usage of semantic information were still immature for these
portals.
      </p>
      <p>Conclusions
In this study, we devised a set of criteria to assess the quality of three major government
open data portals on their semantic interoperability. Two aspects of semantic
interoperability were defined to measure the descriptive and operational nature of semantics
provided in the portals. A total of 22 indices were suggested. We believe that this is the
first attempt to assess the semantic interoperability of open data portals. Preliminary
experiments on three major government open data portals, namely US, UK, and
Taiwan, were conducted. The US portal topped the other two according to the result.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Berners-Lee</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>Linked Data</article-title>
          . Retrieved from http://www.w3.org/DesignIssues/LinkedData.html
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Brown</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Open Source Data: Just How Open Is Taiwan's Government? Retrieved from https://www</article-title>
          .ketagalanmedia.com/
          <year>2017</year>
          /07/27/open-source
          <article-title>-data-just-open-taiwansgovernment/</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Máchová</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Lnénicka</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Evaluating the Quality of Open Data Portals on the National Level</article-title>
          .
          <source>Journal of Theoretical and Applied Electronic Commerce Research</source>
          ,
          <volume>12</volume>
          (
          <issue>1</issue>
          ), pp.
          <fpage>21</fpage>
          -
          <lpage>41</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>