<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>From Research Ob jects to Research Networks: Combining Spatial and Semantic Search</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sara La a</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lisa Staehli</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Geography, UCSB</institution>
          ,
          <addr-line>Santa Barbara, CA</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute of Cartography and Geoinformation, ETH Zurich</institution>
          ,
          <addr-line>Zurich</addr-line>
          ,
          <country country="CH">Switzerland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The spatial and semantic discovery of research objects extracted from sources available on the Web can be enabled with georeferenced and annotated metadata. Constraints on data retrieval are based on the types of queries and services that current repositories o er, which contribute to their limited usability. We address these constraints by illustrating a framework for a linked research network along with exemplary research questions, which demonstrate the added value that spatial and semantic annotation can contribute to information retrieval.</p>
      </abstract>
      <kwd-group>
        <kwd>annotation</kwd>
        <kwd>data discovery</kwd>
        <kwd>georeferencing</kwd>
        <kwd>linked data</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>The practice of publishing and sharing research data is broadly recognized as
bene cial across diverse elds of study. Most current open data management
systems either curate research institutional data across various domains, through a
university library for instance, or manage domain-speci c data, such as
biomedical observations, across multiple institutions. As federated query capabilities
across repositories are limited, search is either spatially restricted (e.g. to
published UCSB research data) or thematically restricted (e.g. to dendritic cell
research data). Initiatives promoting the open research data paradigm show the
same trend. They evolve either by governmental inducement (e.g. Horizon20203)
or by research communities motivated to share knowledge and data (e.g.
DCThera4 for biology or PANGAEA5 for environmental and earth sciences).</p>
      <p>
        Spatial data infrasturctures, such as DataONE6, span a wide range of datasets
but do not enable the discovery of related publications. Keyword-based search
engines, like Google Scholar, o er search for publications across repositories and
disciplines, but do not provide access to associated datasets. Approaches that
link publications by citations, such as the Citation Map [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], advance relations
between research objects by exposing similar work being conducted in a particular
research area, but they do not yet span disciplines.
      </p>
      <sec id="sec-1-1">
        <title>3 https://ec.europa.eu/programmes/horizon2020/ 4 http://dc-research.eu/ 5 https://www.pangaea.de/ 6 https://search.dataone.org/#data</title>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Framework</title>
      <p>A research network that enables both spatial and semantic discovery links
heterogeneous research objects available on the Web. Adding spatial descriptors to
research objects is a rst step in the realization of integrated data networks.
As location is an integrator of diverse contents [7], geo-semantic annotation
improves information retrieval and enables data integration across domains [6].</p>
      <p>Publications (hosted PDF les) and data (CSV les, geometry, imagery, etc.)
are research objects, conceptualized as nodes in a research network.
Geosemantic annotations can be used to link research objects to a location and
a spatial extent. Vocabularies like Dublin Core7 provide a light-weight means
for annotating and linking research objects, such as the term coverage, which
can describe spatial scope. One publication may share network edges with one
or more datasets, and conversely one dataset can be associated with multiple
publications. Developing linked data research networks can enable broader
discovery, horizontally across disciplines and vertically within topics.</p>
      <p>A research object consists of a spatial and a semantic component. Links
between research objects are the edges of the research network, representing either
7 http://dublincore.org/documents/2012/06/14/dcmi-terms/
a spatial or a semantic relation. Figure 1 shows the distinctions between
relations. Spatial relations between research objects occur whenever the nodes share
location, such as a study area extent or an institution. Semantic relations can
also include spatiality, (e.g., the same conference to which several publications
have been submitted). Moreover, semantic relations consist of a thematic and a
temporal dimension. Once relations between objects have been formalized, they
can be made explicit through annotation [6] following the linked data concept by
storing triples that link research objects. Linked research objects can be explored
within a spatial (similar location) or semantic (similar topic) neighborhood.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Use Cases</title>
      <p>Simple extension of metadata enables the combination of spatial and semantic
discovery for research objects. Sections 3.1 and 3.2 show exemplary research
questions that can be answered by geo-semantic annotation of research objects
along with existing search interfaces. Section 3.3 demonstrates queries that
necessitate combined spatial and semantic search across a research network.
3.1</p>
      <sec id="sec-3-1">
        <title>Local and Regional Scale</title>
        <p>On a local or regional scale, exemplary questions raised from the research
community focus on interdisciplinary data integration and reuse of existing datasets.
{ What research has already been done with similar data in a location?</p>
        <p>Smart Cities: air quality, tra c, housing conditions, social media, etc.
{ What kind of research has already been done in the same study area?</p>
        <p>Natural Reserves: Species distribution, soil and water samples, etc.</p>
        <sec id="sec-3-1-1">
          <title>8 http://discovery.ucsb.opendata.arcgis.com/ 9 http://www.nationalpark.ch/de/forschung/aktuelle-forschungsprojekte/</title>
          <p>3.2
On a global scale, researchers are interested in reproducibility and comparison
of phenomena across cultural or social contexts at multiple granularities.
{ How can my research question be applied to other datasets with a di erent
cultural or social context?</p>
          <p>Linguistic reasoning, navigation studies, social science, brain images, etc.
{ Can I reproduce my results or are there any signi cant di erences if I
use data from another spatial context? Where can I nd more contrasting
datasets and observations to extend my models?</p>
          <p>Image recognition, machine learning, social media data, climate models, etc.
3.3</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Networks</title>
        <p>The following queries can only be addressed by a research network that contains
relations of a spatial, thematic, or temporal nature.</p>
        <p>{ hasSameClimate: Where can I nd data that has been collected in the same
climate? What other research has been done under the same conditions?
{ contrasts, supports : Is there any research that shows a counterexample? Is
there a publication that supports my research question or results?
{ isFollowedBy, isOriginatedFrom : What research follows from this
publication/dataset? Which publication mentioned the research interest rst?
{ hasSimilarRegion, hasTemporalFrequency : Where are mountain regions in
which researchers monitor glaciers at least once every three years?
{ isNear : Where are anthropology research clusters located?</p>
        <p>Extracting georeferences from published research object metadata poses
challenges to this vision [6]. However, extending terms in ontologies, such as Prov:Location
10 http://frankenplace.com/
11 http://pangaea.de/
in PROV-O, is a rst step toward better spatial descriptions of research data.
Future challenges also include the use of spatio-temporal reasoning to parse,
disambiguate, and reason on queries, necessitating a combination of spatial discovery
and semantic analysis. The vision for a research network is best understood as
a compiler placed between a back-end database and front-end web applications.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Prospects</title>
      <p>
        Place-based search for research data is not a new notion [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and is expressly
supported by myriad Spatial Data Infrastructure initiatives, but the idea of
integrating semantic discovery of associated research objects, such as publications, is
novel. Some data management systems allow users to preview geographical data
and perform place-based search for publications, combining spatial search with
faceted browsing and keyword query. Publishing interfaces, such as DataONE,
must continue to advance this trend by enabling researchers to easily annotate
their observation metadata spatially, with footprints or coordinates, and
semantically, with controlled vocabulary keywords.
      </p>
      <p>On the front-end, interfaces for federated exploration across research
networks already exist, but require back-end development to achieve integration.
Additionally, front-end interfaces need to support spatial and semantic search
functionality for users without any prior knowledge nor experience with
geographic information systems. Data deposit and search tools can be made more
accessible for researchers from diverse disciplines, enabling them to connect with
an increasingly interdisciplinary user-base.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>The authors wish to thank Dr. Werner Kuhn and the UCSB Center for Spatial
Studies, along with the UCSB Library, for continued research support.
6. Janowicz, K., Scheider, S., and Adams, B. (2013). A geo-semantics yby. In
Reasoning web. Semantic technologies for intelligent data access (pp. 230-250). Springer
Berlin Heidelberg.
7. Kuhn, W. (2012). Core concepts of spatial information for transdisciplinary
research. International Journal of Geographical Information Science, 26(12),
22672276.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Adams</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>Collaborations: The rise of research networks</article-title>
          .
          <source>Nature</source>
          ,
          <volume>490</volume>
          (
          <issue>7420</issue>
          ),
          <fpage>335</fpage>
          -
          <lpage>336</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bechhofer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buchan</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Roure</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Missier</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ainsworth</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bhagat</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goble</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Why linked data is not enough for scientists</article-title>
          .
          <source>Future Generation Computer Systems</source>
          ,
          <volume>29</volume>
          (
          <issue>2</issue>
          ),
          <fpage>599611</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Field</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Working together to put molecules on the map</article-title>
          .
          <source>Nature</source>
          ,
          <volume>453</volume>
          (
          <issue>7198</issue>
          ),
          <fpage>978</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Goodchild</surname>
            <given-names>M. F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Anselin</surname>
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Appelbaum</surname>
            <given-names>R. P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harthorn</surname>
            <given-names>B. H.</given-names>
          </string-name>
          (
          <year>2000</year>
          )
          <article-title>: Toward Spatially Integrated Social Science</article-title>
          .
          <source>International Regional Science Review</source>
          ,
          <volume>23</volume>
          :
          <fpage>2</fpage>
          ,
          <fpage>139</fpage>
          -
          <lpage>159</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McKenzie</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gao</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Abdalla</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Janowicz</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>A Linked-Data-Driven Web Portal for Learning Analytics: Data Enrichment, Interactive Visualization, and Knowledge Discovery</article-title>
          .
          <source>In LAK Workshops.</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>