<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Link++: Adaptive Linking of Multiple Transportation Networks</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ali Masri</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Karine Zeitouni</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Zoubida Kedad</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Universite de Versailles Saint-Quentin-En-Yvelines, Laboratoire DAVID Versailles</institution>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>VEDECOM Versailles</institution>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The integration of heterogeneous transportation data and services is essential towards enabling multimodality. Many new services are emerging and gaining a lot of popularity but are still isolated from multimodal solutions such as ridesharing, bike sharing. These services tend to publish their data according to the principles of linked open data in order to allow this integration. However, existing data interlinking tools are not suitable to discover and create transportation connections. The discovery of transportation connections is dependent on the timetable and geospatial information of the entities. In addition, the representation of the connection needs to be rich enough to provide the required information for further processing. In this paper, we introduce Link++, a customizable interlinking tool that enables detection and generation of customized semantic connections between transportation datasets.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Multimodality is the integration of multiple modes of transportation data
and services. It allows us to form links between di erent modes and
services in order to provide richer and more optimized planning services.
What is interesting about transportation data is that these links are
translated to real physical links between the transportation entities. For
example, when we say that there is a relation between a railway stop and
a bus stop, we mean that there is a physical path at a speci c moment
that connects these two stops. Therefore, we give passengers the ability
to use this path as a transit service to switch from one mode to another.
This switch may be very important in many cases. It improves trip time,
maybe cost and for sure extends trip plan possibilities and options.
In general, existing approaches tend to solve the integration problem by
mapping the data they need into a uni ed model, then storing the
unied data into a repository supported by an API e.g. Google Transit3,
3 http://maps.google.com/landing/transit/index.html</p>
      <p>STIF4. However, they still do not take into consideration highly
evolving datasets such as car sharing, bike sharing, car pooling, etc. Such
services are highly dynamic and often do not have the notion of a xed
transportation stop. In turn, this makes the integration problem more
challenging and demanding special needs.</p>
      <p>Our goal is to nd a simple way for operators to identify nearby
transportation services by providing a connection portal enabling the
identication of connections between one transportation data source and
another. We are in a need of a homogeneous light-weighted representation
of transportation connections (transfer points from one stop to another)
and the means to discover them in a exible and customized manner.
With this representation we can link di erent types of transportation
services regardless the mode or service they o er. All what
transportation systems need to know is just how to handle these light connections
and use them to connect with the outer world, which is much simpler
than handling heterogeneous data and maintaining them.</p>
      <p>
        Enabling such solution requires access to transportation sources which
can be obtained from open data [
        <xref ref-type="bibr" rid="ref5">7, 5</xref>
        ]. Open data is gaining a great deal
of popularity and numerous transportation operators are using it to
publish their data on the web5;6;7. The main cause behind publishing the
data is to increase the market visibility for each service.
      </p>
      <p>Many solutions took bene t from this to provide rich data for smart cities
applications. They use linked data techniques and data interlinking tools
to provide extended information relevant to both transportation and
passenger pro le queries [13, 6]. These techniques address equivalence
detection between entities to establish links between data sources. This may
help in enriching data about entities. However, this is not always enough
in transportation data. Further complex relations are required to re ect
the nature of transportation connections.</p>
      <p>Beyond equivalence or sameAs links, we are interested in nding
connections between transportation data sources based on the geospatial and
time characteristics of the data which capture the reach-ability between
di erent transportation networks. Furthermore, using the given tools we
face two main limitations. The rst is the restriction to a prede ned set
of functions for composing linking rules, due to the lack of exibility of
existing systems in de ning custom functions. For instance, to calculate
information such as closeness of two transportation points of transfer
(bus stop, train station, etc), we can not de ne custom functions to
calculate walking distances, driving distances, etc. The user is forced to dig
into the code (if available) and modify it directly. The second limitation
4 http://www.stif.info
5 http://opendata.paris.fr/page/home/
6 http://www.strasbourg.eu/ma-situation/professionnel/open-data/donnees/
mobilite-transport-open-data
7 http://www.uitp.org/tags/open-data
is the representation of the generated output. Supporting complex
relations requires more complex output patterns.</p>
      <p>As an example, let us suppose that a link is established between two
transportation points of transfers. Existing tools can provide the
output BusStop1 nextTo TrainStation132 which does not give information
about the occurrence of this relation. They are next to each others but
how close are they? when the connection is available? and what are the
modes of transportation that we can use? etc.</p>
      <p>Based on what precedes, there is no way of creating connections that
are suitable to many complex matching tasks other than the standard
equivalence matching tools. Our main goal is to provide a system that
is exible and rich enough to allow users to de ne their own way of
connecting data sources. Users must be given the power to use custom
functions and de ne any form of output needed for their tasks.</p>
      <p>In this paper we introduce Link++, a system that uses connection
patterns, custom functions and rules to enable the generation of richer
connections to link data sources. These connections can be applied to
provide missing connections between transportation operators and services.
With this approach, we can generate rich semantic links between entities
to be published as open data in the sense of improving re-usability and
reducing the need for re-calculation. We evaluate our approach using a
real use case on connecting two transportation modes and checking how
this a ects the time of trips.</p>
    </sec>
    <sec id="sec-2">
      <title>The paper is structured as follows:</title>
      <p>2</p>
      <sec id="sec-2-1">
        <title>Background and Related Work</title>
        <p>
          With the introduction of LinkedData[
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] many approaches took chance
of leveraging and exposing their data to the web. In the category of
transportation data and smart cities [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ][13][6] did this integration by
following the linked data principles[
          <xref ref-type="bibr" rid="ref4">4</xref>
          ][
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] and they succeeded in connecting
data from di erent sources to produce applications with wider-scope
services. [10] tackled the problem of cataloging, exploring, integrating,
understanding, processing and transforming urban information. They
proposed an approach for incremental and continuous integration of static
and streaming data, based on Semantic Web technologies, while they
tested their system to a tra c diagnosis scenario.
        </p>
        <p>The GeoKnow[11] project and DataLift[16] platform came as a
solution to help transforming data from isolated silos into linked data. They
provided the necessary tools to transform, link, publish and query data
extracted from multiple di erent sources with di erent formats.
The way links are created for open data datasets are through data
interlinking. Data interlinking in general is a way to discover similarity
relationships between RDF data sources in a semi-automatic fashion. The
process of linking requires two datasets { source and target, a distance
measure and a threshold. A link between two entities of a dataset is
successfully assigned if a distance measure between them exceeds a selected
threshold. The main goal is to link similar instances { that are scattered
between di erent data sources { in order to expand the knowledge graph.
The MeLinDa survey[17] described data interlinking in more details and
highlighted the characteristics of the most popular approaches.
We can divide the link discovery frameworks into two categories: domain
speci c and universal ones. The domain speci c frameworks aim to
discover the links between knowledge bases of a particular domain. The
second category is designed to consider the linking tasks regarding the
knowledge base domain. Based on [17], table1 shows the most popular
interlinking tools with their properties. RKB-CRS [9], proposed an
ar</p>
        <sec id="sec-2-1-1">
          <title>Techniques</title>
        </sec>
        <sec id="sec-2-1-2">
          <title>Output</title>
        </sec>
        <sec id="sec-2-1-3">
          <title>Domain</title>
          <p>RKB-CRS[9]</p>
          <p>GNAT[15]
ODD-Linker[8]</p>
          <p>RDF-AI[18]</p>
          <p>Silk[20]
LIMES[12]</p>
          <p>
            Link++
chitecture for managing URI equivalences on the Web of Data by using
Consistent Reference Services. Their approach requires a JAVA program
to be written for each pair of datasets to integrate. In each program, the
coder selects the resources to compare and the their comparison function
using string similarity on the property values. GNAT[15] is an automatic
interlinking tool that works on music datasets described within the Music
Ontology[14]. It is implemented in prolog and based on similarity
aggregation algorithm to detect relations based on resource's neighbors in a
graph. ODD-Linker[8] proposed an extensible framework for interlinking
relational data with a high quality links. Linking rules are expressed in
LinQL that is later translated to SQL queries in order to compare and
identify links. LinQL supports many string matching algorithms,
synonyms, hyponyms and other conditions on attributes. RDF-AI[18] is a
dataset matching and fusion architecture. It takes two les, the datasets
to be linked and a set of XML les describing the linking process
(preprocessing, matching con guration, dataset structure, merge con
guration). A local copy of the datasets is needed, and the matching is based
on string similarity with an external resource (WordNet). Silk[20] is a
domain independent interlinking tool. Input datasets are inserted via a
SPARQL endpoint URI, a local copy, or a database access. Matching
con guration can be done either with a GUI toolbox or the Silk Link
Speci cation Language (Silk LSL). User speci es the properties to be
matched, the pre-processing functions and the matching technique to be
used. Matching function are combined via aggregation functions (MAX,
MIN, AVG). Silk provides a load of pre-processing function on Literals
and numeric data types. Many comparison functions are de ned
including string similarity, numerical distance, date-time. The only distance
function available for matching geospatial datasets, is the geometric
distance | based on the Euclidean distance. It takes the latitude and the
longitude of both entities and matches them according to a given
threshold. The good thing in this distance function is that the threshold is
well formatted, the user can de ne the minimal distance in meters or
kilo-meters. LIMES[12] is one of the tools provided by the GeoKnow[
            <xref ref-type="bibr" rid="ref1">1</xref>
            ]
project, it handles the matching in a very fast speed compared to other
link discovery frameworks. LIMES provide better distance functions for
geospatial data, thus we have more options to match. It supports
basic string metrics, numeric vectors such as Euclidean and Orthodromic
distance metrics and many other similarity metrics e.g. Hausdor , Sum
of minimum, Frchet, etc. Writing a linkage rule in LIMES is done via
XML, and no GUI is available to support the process. Although many
distance metrics are supported, these functions are only good for
geographical data. Geographical data is a subset of transportation data, so
more powerful functions are needed, those who enable richer linking
between transportation units or objects.
          </p>
          <p>
            Analyzing existing link discovery approaches shows that they are more
suitable to equivalence matching. They provide functions and
aggregations to detect sameAs, part-of or subClass relationships. These
approaches may be suitable in some cases for geospatial data (the GeoKnow
project [
            <xref ref-type="bibr" rid="ref1">1</xref>
            ] and LinkedGeoData [19]), but they are not su cient for
transportation data. Interlinking solutions must take into account both the
spatial and temporal characteristics of transportation data in addition to
the real-time state. Consider that we want to connect two transportation
data sources with the intention of discovering how we can reach one stop
from another. Doing so with existing tools limits us to equivalence
detection due to the provided functions and output format. What is required
is a more representative and semantic way to connect these sources [
            <xref ref-type="bibr" rid="ref3">3</xref>
            ]
showing how they can be connected from a transportation point of view.
As a conclusion, the output of an interlinking process mainly focuses on
detecting a set of owl:sameAs links. However, we need to have more
information in the generated links to enable better post-processing and
analysis and to reduce re-calculation costs (e.g., include information about
a connection status and the distance between two connected entities in
transportation links).
3
          </p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>Link++</title>
        <p>As we have discussed earlier, existing interlinking tools are not suitable
to connect transportation data sources. What is required is a more
customizable connection generation process to enable richer and more
exible connection representation. We introduce Link++ (shown in Figure
1), a system that enables exible connection discovery and customized
output de nition using connection patterns, custom functions and
linking rules. Connection patterns are templates for connection generation
used to de ne both the content and format of a linking process output.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>In general, the approach consists of two main phases:</title>
      <p>{ The de nition phase, where users de ne the connection patterns, the
required functions and linking rules.
{ The generation phase, where: 1) the de nitions are taken and applied
to the datasets 2) the rule is applied to the entities 3) and when valid,
a connection is created and stored in a repository.</p>
      <p>In a formal de nition, a linking task T requires the following input for
the process:
{ Input data sources D1 and D2 representing the datasets to be linked
{ O is the custom-de ned connection pattern
{ R is the linkage rule that de nes when a connection must be
generated
{ F is a set of functions required for the linking task
{ L is a set of pre-de ned libraries implementing the dependencies of</p>
      <p>F</p>
      <p>The following sections explain in detail the tasks required for an
interlinking process.</p>
      <p>Specifying Custom Functions and External Libraries
Enabling users to de ne their own functions is crucial for a complete
system that supports all required matching tasks. Thus, the rst task is
to enable users to write any functions to be used in their linking rules
or similarity calculations. Users can simply do it with Link++ with a
custom Java class. In addition, external libraries are needed to support
third-party code. It is supported and can be used in our system by
simply adding the JAR le to the code repository.</p>
      <p>The user speci ed functions play an important goal in the matching
tasks since they can form a linking rule, a similarity metric, a
transformation/preprocessing operations or any other function based on users'
needs. The functions are gathered in a JAVA le accompanied with the
used jar libraries, then they are compiled at run time and used when
needed. To reference each method the user simply address the function as
follows: "class-name"."method-name". For example: MyClass.walkingDistance.
3.2</p>
      <p>De ning a Linking Rule
A linking rule speci es the conditions required to generate a connection
between a given pair of entities. The main goal is to apply this rule to
each entity pair in order to seek for a match and create the speci ed
connection. De ning a rule requires a set of functions (similarity metrics
and preprocessing functions) previously de ned by the user. Each rule is
de ned with a root node that is either an aggregation or a comparison
operator and sub-nodes specifying any other function chained in a way
to suit the linking task.</p>
      <p>An aggregation operator combines the values of di erent operators/values
by applying the speci ed aggregation method, e.g., max, min, average,
etc. It is de ned by an aggregation function and a threshold. Each
function contains a set of parameters that can be speci ed from the given
data sources or directly by the user. The threshold de nes whether the
value of the operator must be evaluated as true or false in the linking rule.
Since data sources can be represented in di erent ways, we can use the
transformation operator to modify this representation. To this end, we
de ne a function that takes its parameters from the data sources or from
the composition of other transformation operators, e.g., lowercase,
uppercase, concatenation, round, ceiling, etc.</p>
      <p>Finally, the comparison operator is used to de ne the similarity (or the
relatedness) between two properties of the given data sources. A
comparison is valid between operators themselves or with other transformation
functions, and a threshold de nes whether the value is accepted or not
for the rule to be valid, e.g., distance, equality, etc.
Connections are the nal outputs of the interlinking task speci ed by
connection patterns. Therefore, it is important to be precise when de
ning a connection pattern. A pattern speci es the format of the generated
connections and the required information they must contain. In other
words, it represents a template that will be lled when a connection is
instantiated.</p>
      <p>A connection pattern is composed of a set of properties, where each
property is de ned by a function that calculates it. Function parameters can
be the inputs from the data sources or prede ned by the rule composer.
A connection pattern is freely chosen by a user according to the
interlinking task and the post-processing needs. The formal de nition of a
connection pattern O is as follows:
Let D1 and D2 be two data sources. Given V any data type and F a
set of custom functions required to generate the patterns, Pr is a set of
properties where each property is represented by a property name n, a
value v and a corresponding function f , which calculates the property
value during the generation process.</p>
      <p>P r = f(n; v; f )jn 2 String; v 2 V f 2 F g</p>
    </sec>
    <sec id="sec-4">
      <title>A connection pattern is formalized as:</title>
      <p>O = (d1; d2; pr)jd1 2 D1; d2 2 D2; pr</p>
      <p>P r
(1)
(2)
Both the connection pattern and the linking rule les are described in
XML les that conform a data type de nition (DTD); custom functions
are written using JAVA (users can write any JAVA le and use the
dened methods in his/her connection pattern or rule), and the output is
generated in RDF.</p>
      <p>We will give a demonstration case with a real scenario of de ning both
the linking rule and the connection pattern in the evaluation section. It
illustrates the con guration process and shows an instance of the XML
les (output pattern and rule).
3.4</p>
      <p>Connection Discovery Algorithm
Once the con guration step is completed, the connection discovery
process starts as described in the sequel. Algorithm 1 represents the
pseudocode of the implemented linking process. The algorithm iterates over each
pair of entities in the two data sources and evaluates the linking rule
between them. Based on the rule evaluation, the algorithm decides if a
connection must be created or not. If a rule is triggered, a new
connection is generated by evaluating the connection pattern and applying the
corresponding function of each property. The values are calculated by
the speci ed functions in the output pattern, and their parameters are
lled from the currently-compared entities. Here, we instantiate the
connection and ll in its information from the return values of the functions.
The connection is stored in a speci ed repository, and the algorithm
continues on the remaining pairs until all are treated. In the worst cases,
Data: D1, D2, O, R, F
Result: Discover the list of connections and add them to the connections store
/* iterate over the elements of D1 */
foreach e1 in D1 do
/* iterate over the elements of D2 */
foreach e2 in D2 do
/* evaluate the linking rule */
if evaluateRule(e1, e2, R) is true then
/* if the rule holds, create new connection based on the</p>
      <p>output pattern */
c createConnection(e1, e2, O);
/* calculate the value of each property in the pattern</p>
      <p>based on the specified function */
foreach p in c.properties do
f F.getFunction(p.getFunction);
value f.calculate(p.getProperties);
c.addProperty(p.name, value);
end
/* the connection is instantiated and ready to be added to
the connection store */
add c to connections store;
end
end
end</p>
      <p>Algorithm 1: Connection discovery algorithm.
the time complexity of the algorithm is O(n * m), where n and m are
the sizes of the input datasets. The storage complexity (in terms of data
pages) is the same as a nested loop join in databases that is equal to the
size of the smallest dataset + one page, which usually ts in memory.
This complexity may be reduced by using some pre- ltering techniques
that the system may o er in a future version; for instance, using a spatial
index to replace the inner loop by a search in an index (which reduces
the cost to log(n)). Then, the speci c rules and function de ned by the
user will be applied in a re nement phase automatically by the system.
Link++ is implemented and an executable version of it can be found
online via the link https://github.com/alimasri/link-plus-plus.git;
in addition to a video tutorial on: https://youtu.be/u2gr7Wa4eT4. A
screen-shot of the system is shown in Figure 2. It shows the
projectbased graphical user interface where users can add their data sources,
functions, linking rule and connection pattern. The output is shown in a
separate directory.
8 http://gtfs.s3.amazonaws.com/transilien-archiver_20160202_0115.zip
9 http://opendata.paris.fr/explore/dataset/stations_et_espaces_autolib_
de_la_metropole_parisienne/</p>
      <p>In the following, we describe the evaluation phases from preparing the
data, setting-up the system and visualizing the generated output.</p>
      <p>Data Preparation In this phase, the goal is to represent the timetable
information in a format compatible with our de nition of connection.
Instead of designing a network by a series of stops or other
representations, we want to represent it by a series of connections between stops.
Since SNCF is a public transportation company with data described in
timetables, the task here is to extract scheduled connections from the
given data.</p>
      <p>To this end, we have proposed an algorithm that transforms timetable
data from GTFS les into scheduled connections. The algorithm iterates
over the timetable information for each stop and creates a connection
that starts from a departure stop at a departure time and ends with an
arrival stop with the speci ed time. The process is repeated to a
predened date range to limit the number of connections created.</p>
      <p>In case of Autolib, we do not have timetable information, so we need a
way to discover the connections between its stops. Using our approach,
we can match Autolib's dataset with itself (in order to know when a
Autolib station is reachable from an another) to discover these
unscheduled connections between. Since the con guration task is common and
independent, the following section describes how to use our approach to
discover the unscheduled connections for Autolib-Autolib and
AutolibSNCF.</p>
      <p>Discovering New Connections Two tasks are required one for
Autolib-Autolib connections and one for Autolib-SNCF connections. In
this example, unscheduled connections are driving or walking
connections between Autolib-VELIB and Autolib-SNCF, respectively. We use
our approach to search for connections that match a prede ned criteria.
Since our approach works on RDF data, we have used the DataLift [16]
platform to transform both SNCF stops and VELIB CSV les into RDF
turtle formats. In the sequel, we describe in detail all of the required
tasks to achieve our goal.</p>
      <p>{ De ning custom functions: Our system is exible as it allows users
to create any custom function to be used in the linking task. Users
can use external dependencies, as well. In our example, we de ne
the functions getWalkingDistance, getWalkingTime,
getDrivingDistance and getDrivingTime. In a real scenario, we get this information
from a web service, such as Google's distance matrix API (https://
developers.google.com/maps/documentation/distance-matrix/).
However, due to the query limit, we have chosen to implement them
by local functions based on mathematical calculations (http://www.
movable-type.co.uk/scripts/latlong.html).
{ De ne the linking rules: Recall that the linking rule describes the
condition that triggers the creation of a connection. Two rules are
required, one for Autolib-Autolib and the other for Autolib-SNCF.
For the rst one, the condition of the de ned rule is the following:
\If a driving path exists within 200 km (the time before the
battery is totally discharged), create a connection". For Autolib-SNCF
connections, the rule is: \If a walking path exists from one stop to
another within one kilometer, create a connection".</p>
      <p>Rules are written in XML format, and the functions that calculate
the walking distance and time are referenced from the custom
functions le. We note that the parameters \200 km" and \1 km" are
given by the user who is responsible for the con guration. We set
these parameters as the maximum feasible scope for a person to
ride the car or walk from one station to another. Figure 3 shows an
example of how a rule can be de ned.
{ De ning the connection pattern: We de ne the output generated by
the system at each valid rule. We have chosen the following
properties to be represented in a connection pattern: source-id, target-id,
walking/driving distance and walking/driving time. This pattern is
the same for both tasks, and an example is shown in Figure 4.</p>
      <p>Executing these tasks with the above con guration enabled us to
enrich the network with discovering 535,966 internal connections between
Autolib car stations and 272 new connections between the two di erent
transportation modes SNCF and Autolib.</p>
      <p>Compared to the existing link discovery frameworks, our approach
succeeded in discovering links with richer representations and customized
properties that can be used for numerous tasks.
5</p>
      <sec id="sec-4-1">
        <title>Conclusion</title>
        <p>Open data made it possible for new companies and services to gain
visibility in the market. New transportation services are taking advantage
of this to provide the needed data to support multimodality. As we have
seen, existing open data interlinking tools are not suitable to support
transportation data linking which represents real connections in the
physical world. In this paper, we introduce Link++, a customizable
interlinking tool that enables detection and generation of customized semantic
connections between transportation datasets.</p>
        <p>Link++ provides the means for existing transportation systems to
access information about nearby services and integrate them. We have
introduced how Link++ de nes custom connections and used these
connections to expand a transportation network containing trains and car
sharing networks. In the future, our focus will target the dynamic part of
the connections since they are not always static and may be a ected by
external events. On the other hand, we aim to take pro le information
while generating connections which introduces the need of real-time
connection generation. Furthermore, we will consider optimizing the
Cartesian product in the discovery algorithm.
6. Sergio Consoli, Misael Mongiov , Diego Reforgiato Recupero, Silvio
Peroni, Aldo Gangemi, Andrea Giovanni Nuzzolese, and Valentina
Presutti. Producing linked data for smart cities: the case of catania.
7. Michael B Gurstein. Open data: Empowering the empowered or
e ective data use for everyone? First Monday, 16(2), 2011.
8. Oktie Hassanzadeh, Lipyeow Lim, Anastasios Kementsietsidis, and
Min Wang. A declarative framework for semantic link discovery over
relational data. In Proceedings of the 18th international conference
on World wide web, pages 1101{1102. ACM, 2009.
9. Afraz Ja ri, Hugh Glaser, and Ian Millard. Managing uri synonymity
to enable consistent reference on the semantic web. 2008.
10. Spyros Kotoulas, Vanessa Lopez, Raymond Lloyd, Marco Luca
Sbodio, Freddy Lecue, Martin Stephenson, Elizabeth Daly, Veli Bicer,
Aris Gkoulalas-Divanis, Giusy Di Lorenzo, et al. Spud semantic
processing of urban data. Web Semantics: Science, Services and Agents
on the World Wide Web, 24:11{17, 2014.
11. Jon Jay Le Grange, Jens Lehmann, Spiros Athanasiou, A
GarciaRojas, Giorgos Giannopoulos, Daniel Hladky, Robert Isele,
AC Ngonga Ngomo, M Ahmed Sherif, Claus Stadler, et al. The
geoknow generator: Managing geospatial data in the linked data web.</p>
        <p>Linking Geospatial Data, 2014.
12. Axel-Cyrille Ngonga Ngomo and Soren Auer. Limes-a time-e cient
approach for large-scale link discovery on the web of data.
integration, 15:3, 2011.
13. Julien Plu and Francois Schar e. Publishing and linking transport
data on the web: extended version. In Proceedings of the First
International Workshop on Open Data, pages 62{69. ACM, 2012.
14. Yves Raimond, Samer A Abdallah, Mark B Sandler, and Frederick
Giasson. The music ontology. In ISMIR, pages 417{422. Citeseer,
2007.
15. Yves Raimond, Christopher Sutton, and Mark B Sandler. Automatic
interlinking of music datasets on the semantic web. LDOW, 369,
2008.
16. Francois Schar e, Ghislain Atemezing, Raphael Troncy, Fabien
Gandon, Serena Villata, Benedicte Bucher, Faycal Hamdi, Laurent
Bihanic, Gabriel Kepeklian, Franck Cotton, et al. Enabling linked data
publication with the datalift platform. In Proc. AAAI workshop on
semantic cities, pages No{pagination, 2012.
17. Francois Schar e and Jero^me Euzenat. Melinda: an interlinking
framework for the web of data. arXiv preprint arXiv:1107.4502,
2011.
18. Francois Schar e, Yanbin Liu, and Chuguang Zhou. Rdf-ai: an
architecture for rdf datasets matching, fusion and interlink. In Proc.
IJCAI 2009 workshop on Identity, reference, and knowledge
representation (IR-KR), Pasadena (CA US), 2009.
19. Claus Stadler, Jens Lehmann, Konrad Ho ner, and Soren Auer.</p>
        <p>Linkedgeodata: A core for a web of spatial open data. Semantic
Web, 3(4):333{354, 2012.
20. Julius Volz, Christian Bizer, Martin Gaedke, and Georgi Kobilarov.</p>
        <p>Silk-a link discovery framework for the web of data. LDOW, 538,
2009.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Spiros</surname>
            <given-names>Athanasiou</given-names>
          </string-name>
          , Daniel Hladky, Giorgos Giannopoulos, Alejandra Garcia Rojas, and
          <string-name>
            <given-names>Jens</given-names>
            <surname>Lehmann</surname>
          </string-name>
          .
          <article-title>Geoknow: Making the web an exploratory place for geospatial knowledge</article-title>
          .
          <source>ERCIM News</source>
          ,
          <volume>96</volume>
          :
          <fpage>12</fpage>
          {
          <fpage>13</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. Soren Auer, Jens Lehmann, and
          <string-name>
            <given-names>Sebastian</given-names>
            <surname>Hellmann</surname>
          </string-name>
          . Linkedgeodata:
          <article-title>Adding a spatial dimension to the web of data</article-title>
          . Springer,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Montserrat</given-names>
            <surname>Batet</surname>
          </string-name>
          , Sebastien Harispe, Sylvie Ranwez,
          <string-name>
            <surname>David Sanchez</surname>
            ,
            <given-names>and Vincent</given-names>
          </string-name>
          <string-name>
            <surname>Ranwez</surname>
          </string-name>
          .
          <article-title>An information theoretic approach to improve semantic similarity assessments across multiple ontologies</article-title>
          .
          <source>Information Sciences</source>
          ,
          <volume>283</volume>
          :
          <fpage>197</fpage>
          {
          <fpage>210</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Christian</given-names>
            <surname>Bizer</surname>
          </string-name>
          .
          <article-title>Evolving the web into a global data space</article-title>
          .
          <source>In BNCOD</source>
          , volume
          <volume>7051</volume>
          , page 1,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Christian</given-names>
            <surname>Bizer</surname>
          </string-name>
          , Tom Heath, and
          <string-name>
            <surname>Tim</surname>
          </string-name>
          Berners-Lee.
          <article-title>Linked data-the story so far</article-title>
          .
          <source>Semantic Services, Interoperability and Web Applications: Emerging Concepts</source>
          , pages
          <volume>205</volume>
          {
          <fpage>227</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>