<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>FarolApp: Live Linked Data on Light Pollution</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nandana Mihindukulasooriya</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Esteban Gonzalez</string-name>
          <email>egonzalez@fi.upm.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fernando Serena</string-name>
          <email>fserena@fi.upm.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Carlos Badenes</string-name>
          <email>cbadenes@fi.upm.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oscar Corcho?</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Ontology Engineering Group, Universidad Politecnica de Madrid</institution>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>FarolApp is a mobile web application that aims to increase the awareness of light pollution by generating illustrative maps for cities and by encouraging citizens and public administrations to provide street light information in an ubiquitous and interactive way using online street views. In addition to the maps, FarolApp builds on existing sources to generate and provide up-to-date data by crowdsourced user annotations. Generated data is available as dereferenceable Linked Data resources in several RDF formats and via a queryable SPARQL endpoint. We propose Live Linked Data, a new approach to publish data about city infrastructures trying to keep them synchronized leveraging the collaboration of citizens. The demo presented in this paper illustrates how FarolApp maintains continuously evolving Linked Data that re ect the current status of city street light infrastructures and use that data to generate light pollution maps.</p>
      </abstract>
      <kwd-group>
        <kwd>linked data</kwd>
        <kwd>evolution</kwd>
        <kwd>crowdsourcing</kwd>
        <kwd>light pollution</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Light pollution is one of the most unknown and rapidly increasing environmental
problems nowadays[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Arti cial lighting is the main cause of the excessive level
of light pollution, leading to several problems a ecting animal species, human
health and energy consumption. In cities, the street lighting infrastructures are
the biggest known polluters.
      </p>
      <p>A starting point to reduce light pollution is to increase public awareness
about this type of pollution by providing the relevant data in a human
consumable manner. Public institutions may implement wellness policies tailored to the
current situation and provide open data that may help getting a deeper
understanding of this problem. However, to be usable, open data does not only have to
be accessible, but also be o ered in a comprehensible way. Linked Data allows to
explicitly describe the meaning of data by using a vocabulary and linking them
to other data sources.</p>
      <p>Cities are living organisms in constant evolution. Live Linked Data, i.e.
continuously maintained and updated data, o ers a more realistic representation of
? This research is partially supported by the STARS4ALL project (H2020 - 688135),
the 4V project (TIN2013-46238-C4-2-R) and the FPI grant (BES-2014-068449.)
a city rather than traditional static RDF dumps. However, in order to do this,
someone has to be aware of the changes that occur in the city and re ect them
in the data. Citizen science projects could bring some light into this situation.</p>
      <p>FarolApp (http://farolapp.linkeddata.es/) is a Linked Data based
application whose goal is to increase the awareness of light pollution among citizens
and public administrations, presenting pollution maps and allowing citizens to
contribute to that information. Citizens can participate both on-site or remotely
using Google Street View. In this demo, we present the approach followed by
FarolApp and its architecture. A video1 also is available on the project website.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Approach</title>
      <p>Linked Data Generation FarolApp is seeded with existing heterogeneous
datasets. On the one hand, if the existing data is already in RDF, its
vocabulary is mapped to the FarolApp Vocabulary2 to create a new dataset by using
SPARQL construct queries. On the other hand, if the existing data is not in
RDF (e.g, CSV, JSON), it is transformed to RDF using external tools such as
OpenRe ne. Initially, FarolApp integrates data from ten di erent cities and the
seed data is available in datahub3.</p>
      <p>Furthermore, during the generation process the data is linked to other
relevant datasets. For instance, each street light is associated to the city and the
country where it is located based on its GPS coordinates and linked to the
corresponding entities in DBpedia. Provenance information are also provided to
attribute the data with their original data sources.</p>
      <p>Linked Data Publication The main interface of FarolApp is a web UI in
which the street light data is overlaid on a map. Furthermore, the app provides
pollution maps which are generated based on application data as well as an API
where information about street lights can be extracted. Also, this information is
published in Social Networks such as Twitter. All these representations use the
Linked Data created and maintained by FarolApp as their data source.</p>
      <p>In addition, FarolApp provides raw Linked Data as dereferenceable resources
(supporting several RDF formats such as RDF/XML, Turtle, JSON-LD using
HTTP content negotiation), RDF dumps, and via a SPARQL endpoint, enabling
other third-party applications to reuse its up-to-date information. For instance,
interlinking street light data with data about energy consumption, tra c
accidents, or street crimes may reveal interesting ndings.</p>
      <p>Linked Data Evolution With the aim of keeping the data inline with the
current state of the street lights infrastructure, it is necessary to get live information
from any kind of sensors that are able to communicate their observations.
Humans are the most reliable sensors in this scenario, mainly because there is no</p>
      <sec id="sec-2-1">
        <title>1 http://liveldp.github.io/demo 2 http://goo.gl/5n8zEs 3 https://datahub.io/dataset/farolapp-dataset</title>
        <p>
          common data source, neither a sensor network (e.g. an IoT platform) to date
suitable to build on for: i) identifying street lights whose data is not yet openly
available, ii) updating the most interesting attributes of all of them. Therefore,
this approach leverages crowdsourcing techniques [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] in order to extend, update
and ultimately evolve the initial datasets.
        </p>
        <p>Citizens can contribute by giving multiple value annotations for attributes
of street lights. The range and type of values depend on the nature of the
attribute. In some cases, values expected to be discrete to simplify annotations.
For instance, the height attribute, which is normally a real-value, is converted
to a categorical value such as low, medium and high.</p>
        <p>FarolApp follows a consensus-based approach to conclude a satisfactory level
of agreement from all annotations. Two main basic rules support consensus
detection: i) a minimum number of annotations is required, ii) the standard
deviation must be lower than an attribute-speci c threshold. When both rules are
satis ed, FarolApp transforms the agreed value into Linked Data format and
enriches the dataset making it evolve.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>High-level architecture</title>
      <p>
        The Farolapp architecture ( g. 1) follows a Staged Event-Driven Architecture
(SEDA) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] that decomposes the ow into a set of components connected by
an event-bus, o ering some important advantages over traditional architectures:
i) Dynamic processing ows created by the modules connected to queues, ii)
Isolated and distributed environments where the modules can be implemented
in di erent languages, iii) a leaky bucket approach handled by circular queues.
      </p>
      <p>Web/Mobile UIs allow users to navigate and annotate the street lights
information in an intuitive way using Google Maps. The REST API provides
JSONbased descriptions about individual or clusters of street lights. The Loader
bootstraps the application with the existing street light data. The Consensus Engine
detects and noti es about consensus from annotations of street lights' attributes.
The Publisher is in charge of publishing relevant events from FarolApp in social
media platforms such as Twitter. The Transcriber persists the updates generated
from the consensus engine based on the user annotations.</p>
      <p>In addition, FarolApp makes use of a Virtuoso server as a triplestore for
providing a SPARQL endpoint, RabbitMQ as a messaging broker, and Redis as
an in-memory cache for annotations.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Related Work</title>
      <p>
        Inspired by the DBpedia Live4 approach, which transforms information from
Wikipedia into Linked Data, FarolApp collects data from citizens, transforming
them into Live Linked Data. Other approaches, such as Zooniverse5 (a citizen
science platform for crowdsourcing scienti c research) or OpenStreetMap6 (OSM,
a collaborative platform to create a free editable map of the world) are not
directly available as semantically enriched Linked Data. LOSM[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], which allows to
query OSM data using SPARQL queries, is one step in that direction.
5
      </p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion and future work</title>
      <p>FarolApp leverages Live Linked Data and serves as a proof of concept of such
approach to provide up-to-date street light information and light pollution maps.</p>
      <p>As future works we plan to integrate with IoT platforms from smart cities
initiatives (e.g. photometers), keep track and o er historical data to study its
evolution, and use OSM both as a data source and publisher. Besides, the UI
will follow a more educational approach and provide multilingual support.</p>
      <sec id="sec-5-1">
        <title>4 http://live.dbpedia.org/</title>
        <p>5 https://www.zooniverse.org/
6 https://www.openstreetmap.org/</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Falchi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cinzano</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Elvidge</surname>
            ,
            <given-names>C.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Keith</surname>
            ,
            <given-names>D.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haim</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Limiting the impact of light pollution on human health, environment and stellar visibility</article-title>
          .
          <source>Journal of environmental management</source>
          <volume>92</volume>
          (
          <issue>10</issue>
          ) (
          <year>2011</year>
          )
          <volume>2714</volume>
          {
          <fpage>2722</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Estelles-Arolas</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gonzalez-Ladron-de Guevara</surname>
          </string-name>
          , F.:
          <article-title>Towards an integrated crowdsourcing de nition</article-title>
          .
          <source>Journal of Information Science</source>
          <volume>38</volume>
          (
          <issue>2</issue>
          ) (
          <year>2012</year>
          )
          <volume>189</volume>
          {
          <fpage>200</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Welsh</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The staged event-driven architecture for highly-concurrent server applications</article-title>
          . University of California, Berkeley (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Anelli</surname>
            ,
            <given-names>V.W.</given-names>
          </string-name>
          , Di Noia,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Galeone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Nocera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Rosati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Tomeo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Di Sciascio</surname>
          </string-name>
          , E.:
          <article-title>LOSM: a SPARQL endpoint to query Open Street Map</article-title>
          .
          <source>In: Proceedings of the ISWC 2015 Posters &amp; Demonstrations Track</source>
          , Bethlehem, USA (
          <year>October 2015</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>