<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>proDataMarket: A Data Marketplace for Monetizing Linked Data</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dumitru Roman</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Javier Paniagua</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tatiana Tarasova</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Georgi Georgiev</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dina Sukhobok</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nikolay Nikolov</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Till Christopher Lech</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Ontotext AD, 47A Tsarigradsko Shosse</institution>
          ,
          <addr-line>Sofia 1124</addr-line>
          ,
          <country country="BG">Bulgaria</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>SINTEF</institution>
          ,
          <addr-line>Pb. 124 Blindern, 0314 Oslo</addr-line>
          ,
          <country country="NO">Norway</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>SpazioDati S.r.l.</institution>
          ,
          <addr-line>Via A. Olivetti 13, 38122, Trento (TN)</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Linked data has emerged as an interesting technology for publishing structured data on the Web but also as a powerful mechanism for integrating disparate data sources. Various tools and approaches have been developed in the semantic Web community to produce and consume linked data, however little attention has been paid to monetization of linked data. In this paper we introduce a data marketplace - proDataMarket - that enables data providers to generate, advertise, and sell linked data, and data consumers to purchase linked data on the marketplace. The marketplace was originally designed with a focus on geospatial linked data (targeting property-related data providers and consumers) but its capabilities are generic and can be used for data in various domains. This demo will highlight the capabilities offered to the providers and consumers of the data made available on the marketplace.</p>
      </abstract>
      <kwd-group>
        <kwd>data marketplace</kwd>
        <kwd>data publishing</kwd>
        <kwd>data consumption</kwd>
        <kwd>data monetization</kwd>
        <kwd>linked data</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The proDataMarket marketplace is a virtual space that connects providers of open and
proprietary data. It was originally designed as a platform for sharing and monetizing
linked property-related data (e.g., real-estate and related contextual data), though its
software components are generic and can be used for data in various domains.</p>
      <p>On one hand, the marketplace aims at making it easier for data providers to publish,
distribute and eventually reach out to potential consumers of their data. On the other
hand, it helps data consumers discover and easily access data published at the
marketplace. Consequently, the technical platform of the marketplace is composed of the tools,
services and infrastructure developed to support two types of users: producers and
consumers, each of which has a dedicated area on the marketplace. Fig. 1 gives an overview
of the marketplace services it provides to data producers and data consumers.</p>
      <p>A high level overview of the marketplace architecture is presented in Fig. 2.
Software components developed in the project were grouped either into Producer or
Consumer areas in the marketplace, depending on whether they realize services for data
producers or data consumers.</p>
      <p>User interfaces of the components (whenever present) are highlighted in light-green,
while grey boxes summarize all the important user operations enabled through the
components. Whenever the components were built on top of the existing products or
services, the names of the latter are given in parentheses. All the components communicate
with each other via a RESTful API. In the followings we briefly discuss the marketplace
services offered to the producers and consumers respectively, and end with an overview
of the planned demonstration.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Producer Services</title>
      <p>
        The Producer Services are available via a user interface of the DataGraft platform
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ][
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].1 DataGraft implements User Profile Management and Assets Management
operations, where assets can be data files, queries, transformations or SPARQL endpoints.
Data Transformation and Publication operations are provided via Grafterizer [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] – a
framework for data cleaning and knowledge graph generation.
      </p>
      <p>Data Augmentation allows data producers to enrich their data with contextual
indicators. This functionality is currently available via the API implemented as part of the
Amerigo Augmentation Engine developed by SpazioDati and deployed as a service.
This service can be used to enrich a dataset that contains geographical entities with
indicators that describe certain phenomena in the given area. The indicators are
computed from contextual databases, such as OpenStreetMap2 used by default or a custom
data source provided by the user.</p>
      <p>The data hosting payment services and associated user interfaces belong to the area
in the marketplace where the data producer can “reserve a place” in the market. In
particular, the system asks the data producer to provision a hosting space by, first,
requesting and authorizing payments for it and then paying for it on a subscription basis. The
data hosting component is currently based on Ontotext S4 triplestore as a service
solution.3
3</p>
    </sec>
    <sec id="sec-3">
      <title>Consumer Services</title>
      <p>The Consumer Services are exposed to the end users through the Consumer Portal4.
The Portal implements User Profile Management that regulates access to the data
available in the marketplace. Not registered users have access to free Open Data and preview
of proprietary datasets. Registration is required to purchase subscriptions and get access
to parts of or full proprietary datasets. Data catalogue enables search on datasets and
provides access to datasets’ metadata and available subscription options.
1 https://datagraft.io/
2 https://www.openstreetmap.org/
3 https://console.s4.ontotext.com/
4 https://store.prodatamarket.eu/</p>
      <p>Amerigo Data Visualisation Service5 allows users to explore data on a map through
available visualisations prepared by data producers (e.g., choropleth or category map).
The maps offer different types of interactive data filtering widgets, to facilitate
exploration of different types of data.</p>
      <p>Finally, the Data Payment component enables users to purchase data on the
marketplace. The component implements subscription-based data access and supports various
business models of different data producers. It communicates with the Data Pricing
Setup component on the Producer side, to obtain vendor-specific configurations for
each dataset.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Demonstration Outline</title>
      <p>The demonstration will focus on an end-to-end scenario covering data provisioning and
consumption on the marketplace, and will consist of two parts, one for the data
provider, and one for the data consumer:
 Data provider: Set up a database in the cloud (configuration, payment), populate the
database with data and create the queries through which the data will be served to
the marketplace; configure data visualization to be advertised on the marketplace;
configure payment/subscription options for the data; configure access to the dataset
page on the marketplace;
 Data consumer: Search for data on the marketplace; metadata browsing, visual data
exploration; data purchase.</p>
      <p>The demo scenario will be related to selling/buying the mass transportation score in a
given city, calculated per census cell (used as input to estimating value of real estate
properties in the given city).</p>
      <p>As of September 2017, the marketplace is publicly available via
http://prodatamarket.eu/. Some of the components of the marketplace (e.g., DataGraft, S4) are also
publicly available as separate components.</p>
      <p>Acknowledgements. The work in this paper is partly supported by the EC funded
project proDataMarket (Grant number: 644497).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Roman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , et all. DataGraft:
          <article-title>One-Stop-Shop for Open Data Management. To appear in the Semantic Web Journal (SWJ) - Interoperability, Usability, Applicability (published and printed by IOS Press</article-title>
          , ISSN:
          <fpage>1570</fpage>
          -
          <lpage>0844</lpage>
          ),
          <year>2017</year>
          , DOI: 10.3233/SW-170263.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Roman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , et all.
          <source>DataGraft: Simplifying Open Data Publishing. ESWC (Satellite Events)</source>
          <year>2016</year>
          :
          <fpage>101</fpage>
          -
          <lpage>106</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Sukhobok</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , et all.
          <article-title>Tabular data cleaning and linked data generation with Grafterizer</article-title>
          .
          <source>ESWC (Satellite Events)</source>
          <year>2016</year>
          :
          <fpage>134</fpage>
          -
          <lpage>139</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>5 Powered by CartoDB, https://github.com/CartoDB/cartodb.</mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>