<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>IR Scientific Data: How to Semantically Represent and Enrich Them</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Toine Bogers</string-name>
          <email>toine@hum.aau.dk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Aalborg University Copenhagen</institution>
          ,
          <country country="DK">Denmark</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Georgeta Bordea Paul Buitelaar Insight Centre, National University of Ireland</institution>
          ,
          <addr-line>Galway</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Nicola Ferro Gianmaria Silvello University of Padua</institution>
          ,
          <addr-line>Padua</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>English. Experimental evaluation carried out in international large-scale campaigns is a fundamental pillar of the scientific and technological advancement of Information Retrieval (IR) systems. Such evaluation activities produce a large quantity of scientific and experimental data, which are the foundation for all the subsequent scientific production and development of new systems. We discuss how to annotate and interlink this data, by proposing a method for exposing experimental data as Linked Open Data (LOD) on the Web and as a basis for enriching and automatically connecting this data with expertise topics and expert profiles. In this context, a topiccentric approach for expert search is proposed, addressing the extraction of expertise topics, their semantic grounding with the LOD cloud, and their connection to IR experimental data.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Italiano. La valutazione
sperimentale condotta mediante campagne
internazionali su larga scala, e` un
pilastro fondante dello sviluppo scientifico e
dell’avanzamento tecnologico dei sistemi
di reperimento dell’informazione. Queste
attivita` di valutazione producono una
grande quantita` di dati sperimentali che
costituiscono la base per la conseguente
produzione scientifica e lo sviluppo di
nuovi sistemi. In questo lavoro, si
discute come annotare e collegare questi
dati, proponendo un metodo per esporre
i dati sperimentali come LOD nel Web
e per usare tali dati come base per
arricchirli. In questo contesto, viene
proposto un approccio centrato sui topic per
la ricerca di esperti, che affronta il
problema dell’estrazione dei topic e il
collegamento di questi con la “LOD cloud” e con
i dati sperimentali.</p>
    </sec>
    <sec id="sec-2">
      <title>1 Introduction</title>
      <p>
        The importance of research data is widely
recognized across all scientific fields as this data
constitutes a fundamental building block of science.
Recently, a great deal of attention was dedicated to
the nature of research data
        <xref ref-type="bibr" rid="ref5">(Borgman, 2015)</xref>
        and
how to describe, share, cite, and re-use them in
order to enable reproducibility in science and to
ease the creation of advanced services based on
them
        <xref ref-type="bibr" rid="ref12 ref12 ref13 ref7 ref7">(Ferro et al., 2016; Silvello and Ferro, 2016)</xref>
        .
      </p>
      <p>
        Nevertheless, in the field of Information
Retrieval (IR), where experimental evaluation based
on shared data collections and experiments has
always been central to the advancement of the
field
        <xref ref-type="bibr" rid="ref8">(Harman, 2011)</xref>
        , the Linked Open Data
(LOD) paradigm has not been adopted yet and
no models or common ontologies for data sharing
have been proposed. So despite the importance of
data to IR, the field does not share any clear ways
of exposing, enriching, and re-using experimental
data as LOD with the research community.
      </p>
      <p>Therefore, the main contributions of this paper
are to:
define an Resource Description Framework
(RDF) model of the scientific IR data with
the aim of enhancing their discoverability and
easing their connections with the scientific
production related to and based on them;
provide a methodology for automatically
enriching the data by exploiting relevant
external entities from the LOD cloud.
2</p>
    </sec>
    <sec id="sec-3">
      <title>Use Case: Discover, Understand and</title>
    </sec>
    <sec id="sec-4">
      <title>Re-use IR Experimental Data</title>
      <p>
        In this section, we discuss an example of the
outcomes of the semantic modeling and automatic
enrichment processes applied to the use case of
discovering, understanding and re-using the
experimental data. Figure 1 shows an RDF graph, which
provides a visual representation of how the
experimental data are enriched. In particular, we can see
the relationship between a contribution and an
author enriched by expertise topics, expert profiles
and connections to the LOD cloud, as supported
by the Distributed Information Retrieval
Evaluation Campaign Tool (DIRECT) system which
provides the conceptual model for representing and
enriching the data
        <xref ref-type="bibr" rid="ref1 ref2">(Agosti and Ferro, 2009; Agosti
et al., 2012)</xref>
        .
      </p>
      <p>In this instance, the author (Jussi Karlgren)
and the contribution (KarlgrenEtAl-CLEF2012)
are data derived from the evaluation workflow,
whereas all the other information are
automatically determined by the enrichment process. The
adopted methodology for expertise topics
extraction determined two main topics, “reputation
management” and “information retrieval”, which are
related to the KarlgrenEtAl-CLEF2012
contribution. We can see that KarlgrenEtAl-CLEF2012 is
featured by “reputation management” with a score
of 0:53 and by “information retrieval” with 0:42,
meaning that both these topics are subjects of the
contribution; the scores (normalized in the interval
[0; 1]) give a measure of how much this
contribution is about a specific topic and we can see that
in this case it is concerned a bit more with
reputation management than with information retrieval.
Furthermore, the backward-score gives us
additional information by measuring how much a
contribution is authoritative with respect to a scientific
topic. In Figure 1, we can see that
KarlgrenEtAlCLEF2012 is authoritative for reputation
management (backward-score of 0:87), whereas it is not a
very important reference for information retrieval
(backward-score of 0:23). Summing up, we can
say that if we consider the relation between a
contribution and an expertise topic, the score
indicates the pertinence of the expertise topic within
the contribution; whereas the backward score
indicates the pertinence of the contribution within the
expertise topic. The higher the backward score,
the more pertinent is the contribution for the given
topic.</p>
      <p>This information is confirmed by the expert
profile data; indeed, looking at the upper-left part of
Figure 1, the author Jussi Karlgren is considered
“an expert in” reputation management
(backwardscore of 0:84), even if it is not his main field of
expertise (score of 0:46).</p>
      <p>All of this automatically extracted
information enriches the experimental data enabling for a
higher degree of re-usability and
understandability of the data themselves. In this use case, we
can see that the expertise topics are connected via
an owl:sameAs property to external resources
belonging to the DBPedia1 linked open dataset.
These connections are automatically defined via
the semantic grounding methodology described
below and enable the experimental data to be
easily discovered on the Web. In the same way,
authors and contributions are connected to the
DBLP2 linked open dataset.</p>
      <p>
        In Figure 1 we can see how the contribution
(KarlgrenEtAl-CLEF2012) is related to the
experiment (profiling kthgavagai 1) on which it is
based. This experiment was submitted to the
RepLab 2012 of the evaluation campaign CLEF
2012. It is worthwhile to highlight that each
evaluation campaign in DIRECT is defined by the name
of the campaign (CLEF) and the year it took place
        <xref ref-type="bibr" rid="ref2">(e.g., 2012 in this instance)</xref>
        ; each evaluation
campaign is composed of one or more tasks identified
by a name
        <xref ref-type="bibr" rid="ref2">(e.g., RepLab 2012)</xref>
        and the
experiments are treated as submissions to the tasks. Each
experiment is described by a contribution which
reports the main information about the research
group which conducted the experiment, the
system they adopted, developed and any other useful
detail about the experiment.
      </p>
      <p>We can see that most of the reported
information are directly related to the contribution
and they allow us to explicitly connect the
research data with the scientific publications based
on them. Furthermore, the experiment is
evaluated from the “effectiveness” point of view by
using the “accuracy” measurement which has 0:77
score. Retaining and exposing this information as
LOD on the Web allow us to explicitly connect the</p>
      <sec id="sec-4-1">
        <title>1http://www.dbpedia.org/ 2http://dblp.l3s.de/</title>
        <p>dblp.l3s.de/
d2r/resource/
authors/
Jussi_nKarlgre owl:sameAs
dblp.l3s.de/d2r/
resource/
publications/
conf/clef/
Karlgren2SOEH1
owl:sameAs</p>
        <p>Jussi</p>
        <p>Karlgren
ims:has-source
Link ims:relation is-expert-in
ims:has-target</p>
        <p>Reputation</p>
        <p>Management
CKLaERrFelg2pr0eL1na2bEw-tAnl- ims:has-source</p>
        <p>2012
ims:score ims:backward-score</p>
        <p>0.46 0.84
swrc:has-author
ProfilingReputationofCorporate
EntitiesinSemanticSpace
CLEF
2012
ims:title
ims:refersTo
profiling
_kthgavagai
_1</p>
        <p>ims:evaluates
ims:submittedTo
ims:isPartOf</p>
        <p>RepLab
2012
ims:has-target</p>
        <p>feature
ims:relation</p>
        <p>Link
ims:has-source ims:score ims:backward-score</p>
        <p>0.53 0.87
Link</p>
        <p>ims:has-target
ims:score ims:backward-score
0.42 0.23
ims:relation Information</p>
        <p>Retrieval</p>
        <p>Accuracy
ims:isEvaluatedBy
Ef ectiveness</p>
        <p>ims:assignedTo
ims:measuredBy
0.77 ims:score Measure
owl:sameAs
dbpedia.or
g/resource/</p>
        <p>Inform_ation
owl:sameAs retrieval
results of the evaluation activities to the claims
reported by the contributions.</p>
        <p>
          The details of the full RDF model are reported
in
          <xref ref-type="bibr" rid="ref12 ref13">(Silvello et al., 2016)</xref>
          .
2.1
        </p>
        <p>Accessing the Experimental Data
The described RDF model has been realized by
the DIRECT system which allows for accessing
the experimental evaluation data enriched by the
expert profiles created by means of the techniques
that will be described in the next sections. This
system is called LOD-DIRECT and it is
available at the URL: http://lod-direct.dei.
unipd.it/.</p>
        <p>The data currently available include the
contributions produced by the Conference and Labs of
the Evaluation Forum (CLEF) evaluation
activities, the authors of the contributions, information
about CLEF tracks and tasks, provenance events
and the above described measures. Furthermore,
this data has been enriched with expert profiles and
expertise topics which are available as linked data
as well.</p>
        <p>At the time of writing, LOD-DIRECT allows
access to 2; 229 contributions, 2; 334 author
profiles and 2; 120 expertise topics. Overall, 1; 659
experts have been individuated and on average
there are 8 experts per expertise topics (an expert
can have more than one expertise of course).</p>
        <p>The URIs of the resources are constructed
following the pattern:
base-path/{resource-name}/
{id};{ns}
where,
base-path is
http://lod-direct.dei.unipd.it;
resource-name is the name of the
resource to be accessed as defined in the RDF
model presented above;
id is the identifier of the resource of interest;
ns is the namespace of the resource of
interest, this applies only for the namespace
identifiable resources.</p>
        <p>As an example, the URI corresponding
to the contribution resource shown in
Figure 1 with identifier
CLEF2012wn-RepLabKarlgrenEt2012b is:
http://lod-direct.dei.
unipd.it/contribution/
CLEF2012wn-RepLab-KarlgrenEt2012b
In this section we describe SOME methods for
semantically enriching experimental IR data
modelled as described above, by analysing
unstructured data available in scientific publications.
Figure 2 presents an overview of the semantic
enrichment of documents and authors based on term
and topical hierarchy extraction. First, we propose
a method to automatically extract expertise
topics from a domain-specific collection of
publications using an approach for term extraction. Then,
we present a preliminary approach for enriching
expertise topics by grounding them in the LOD
cloud.</p>
        <p>
          Topic-centric approaches for expert search
emphasize the extraction of keyphrases that can
succinctly describe expertise areas, also called
expertise topics, using term extraction techniques
          <xref ref-type="bibr" rid="ref3">(Bordea et al., 2012)</xref>
          . Expertise topics are extracted
from a domain-specific corpus using the
following approach. First, candidate expertise topics are
discovered from text using a syntactic description
for terms (i.e., nouns or noun phrases) and
contextual patterns that ensure that the candidates are
coherent within the domain. A domain model is
constructed using the method proposed in
          <xref ref-type="bibr" rid="ref4">(Bordea
et al., 2013)</xref>
          and then noun phrases that include
words from the domain model or that appear in
their immediate context are selected as candidates.
        </p>
        <p>These topics describe core concepts of the
domain such as search engine, IR system, and
retrieval task, as well as prominent subfields of the
domain including image retrieval, machine
translation, and question answering.</p>
        <p>Only the best 20 expertise topics are stored for
each document, ranking expertise topics based on
their overall score. In this way, each document
is enriched with keyphrases, taking into
consideration the quality of a term for the whole corpus
in combination with its relevance for a particular
document.</p>
        <p>
          Expertise topics can be used to provide links
between IR experimental data and other data sources.
These links play an important role in
crossontology question answering, large-scale
inference and data integration
          <xref ref-type="bibr" rid="ref11">(Ngonga Ngomo, 2012)</xref>
          .
Additional background knowledge, as found on
the LOD cloud, can inform expert search at
different stages.
        </p>
        <p>A first step in the direction of exploiting this
potential is to provide an entry point in the LOD
cloud through DBpedia3. Our goal is to associate
as many terms as possible with a concept from the
LOD cloud through DBpedia URIs—as shown in
the use-case above. Where available, concept
descriptions are collected as well and used in our
system.</p>
        <p>Two approaches for grounding expertise topics
on DBpedia have been evaluated. The first
approach matches a candidate DBpedia URI with an
expertise topic, using the string as it appears in the
corpus. The second approach makes use of the
lemmatised form of the expertise topic. In order
to evaluate our URI discovery approach, we build
a small gold standard dataset by manually
annotating 186 expertise topics with DBpedia URIs.
First of all, we note that about half of the
analysed expertise topics have a corresponding
concept in DBpedia. One of the main reasons for the
low coverage is that DBpedia is a general
knowledge datasource that has a limited coverage of
specialised technical domains.</p>
        <p>Although both approaches achieve similar
results in terms of F-score, the approach that makes
use of lemmatisation (A2) achieves better
precision, as can be seen in Table 1. Surprisingly, using
lemmatization achieves a lower recall but higher
precision but this might be due to the small size of
the dataset.</p>
        <p>Expert finding is the task of identifying the most</p>
      </sec>
      <sec id="sec-4-2">
        <title>3DBpedia: http://dbpedia.org/</title>
        <p>knowledgeable person for a given expertise topic.
In this task, several competent people have to be
ranked based on their relative expertise on a given
expertise topic. We compare several topic-centric
methods for expert finding with two
languagemodelling baselines.</p>
        <p>The results for the expert finding task are
presented in Table 2. The expert finding methods
evaluated in this section include Experience (E),
Relevance and Experience (RE) and Relevance,
Experience and Area Coverage (REC).</p>
        <p>
          Experience (E) is based on the idea that
documents written by a person can be used as an
indirect evidence of expertise, assuming that an expert
often mentions his areas of interest. Relevance and
Experience (RE) exploits the idea that expertise is
closely related to the notion of experience. The
assumption is that the more a person works on a
topic, the more knowledgeable they are. We
estimate the experience of a researcher on a given
topic by counting the number of publications that
have the topic assigned as a top ranked keyphrase.
Relevance and expertise measure different aspects
of expertise and can be combined to take
advantage of both features. In the case that the subtopics
of an expertise topic are known, we can evaluate
the expertise of a person based on their knowledge
of the more specialised fields. A previous study
showed that experts have increased knowledge at
more specific category levels than novices
          <xref ref-type="bibr" rid="ref14">(Tanaka
and Taylor, 1991)</xref>
          . We introduce a novel measure
for expertise called Area Coverage (REC) that
measures whether an expert has in depth
knowledge of an expertise topic, using an automatically
constructed topical hierarchy.
        </p>
        <p>
          The Area Coverage measure makes use of a
topical hierarchy. Therefore we automatically
construct a topical hierarchy for IR using the method
proposed in
          <xref ref-type="bibr" rid="ref10">(Hooper et al., 2012)</xref>
          . Figure 3
shows a small extract from this hierarchy that
correctly identifies “information retrieval” as the root
of the taxonomy as well as several subfields
including “digital libraries”, “interactive
information retrieval”, and “cross language information
retrieval”.
        </p>
        <p>
          The details on the algorithms and weighting
schemes for topic extraction, expert profiling, and
expert finding are reported in
          <xref ref-type="bibr" rid="ref12 ref13">(Silvello et al.,
2016)</xref>
          .
In this paper we discussed the data modelling and
the semantic enrichment of IR experimental data,
as produced by large-scale evaluation campaigns.
        </p>
        <p>In particular, the main results of the paper are:
an accurate RDF data model for
describing IR experimental data in detail,
available at http://ims.dei.unipd.it/
data/rdf/direct.3.10.ttl;
a dataset about CLEF contributions, extracted
expertise topics and related expert profiles;
the online accessible LOD DIRECT system,
available at http://lod-direct.dei.
unipd.it/, to access the above data in
different serialization formats, RDF+XML,
Turtle, N3, XML and JSON.</p>
        <p>Future work will concern the application of
these semantic modeling and automatic
enrichment techniques to other areas of the evaluation
workflow. For example, expert profiling and topic
extraction could be used to automatically improve
and enhance the descriptions of the single
experiments submitted to an evaluation campaign, which
are typically not very rich and often cryptic—for
example “second iteration with tuned parameters”
as description—and to automatically link
experiments to external resources, e.g., describing the
used components, such as stemmers or stop lists,
and systems. Finally, the RDF model defined
within DIRECT opens up the possibility of
integrating established Digital Library (DL)
methodologies for data access and management which
inCL
SW
IR</p>
        <p>Measure</p>
        <p>
          MAP
MRR
P@5
MAP
MRR
P@5
MAP
MRR
P@5
MAP
MRR
P@5
creasingly exploit the LOD paradigm
          <xref ref-type="bibr" rid="ref6 ref9">(Hennicke et
al., 2011; Di Buccio et al., 2013)</xref>
          . This would
enable broadening the scope and the connections
between IR evaluation and other related fields,
providing new paths for semantic enrichment of the
experimental data.
        </p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>M.</given-names>
            <surname>Agosti</surname>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Towards an Evaluation Infrastructure for DL Performance Evaluation</article-title>
          . In G. Tsakonas and C. Papatheodorou, editors,
          <source>Evaluation of Digital Libraries: An insight into useful applications and methods</source>
          , pages
          <fpage>93</fpage>
          -
          <lpage>120</lpage>
          . Chandos Publishing, Oxford, UK.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>M.</given-names>
            <surname>Agosti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. Di</given-names>
            <surname>Buccio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          , I. Masiero,
          <string-name>
            <given-names>S.</given-names>
            <surname>Peruzzo</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Silvello</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>DIRECTions: Design and Specification of an IR Evaluation Infrastructure</article-title>
          . In T. Catarci,
          <string-name>
            <given-names>P.</given-names>
            <surname>Forner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Hiemstra</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . Pen˜as, and G. Santucci, editors,
          <source>Information Access Evaluation</source>
          . Multilinguality, Multimodality, and
          <string-name>
            <given-names>Visual</given-names>
            <surname>Analytics</surname>
          </string-name>
          .
          <source>Proceedings of the Third International Conference of the CLEF Initiative (CLEF</source>
          <year>2012</year>
          ), pages
          <fpage>88</fpage>
          -
          <lpage>99</lpage>
          . Lecture Notes in Computer Science (LNCS) 7488, Springer, Heidelberg, Germany.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Georgeta</given-names>
            <surname>Bordea</surname>
          </string-name>
          , Sabrina Kirrane, Paul Buitelaar, and
          <string-name>
            <surname>Bianca</surname>
            <given-names>O</given-names>
          </string-name>
          <string-name>
            <surname>Pereira</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Expertise Mining for Enterprise Content Management</article-title>
          . In N. Calzolari,
          <string-name>
            <given-names>K.</given-names>
            <surname>Choukri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Declerck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. U.</given-names>
            <surname>Dogan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Maegaard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mariani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Odijk</surname>
          </string-name>
          , and S. Piperidis, editors,
          <source>Proc. of the Eighth Int. Conference on Language Resources and Evaluation (LREC-2012)</source>
          , pages
          <fpage>3495</fpage>
          -
          <lpage>3498</lpage>
          .
          <string-name>
            <given-names>European</given-names>
            <surname>Language Resources Association (ELRA).</surname>
          </string-name>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>G.</given-names>
            <surname>Bordea</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Polajnar</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Buitelaar</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Domain-Independent Term Extraction Through Domain Modelling</article-title>
          .
          <source>In 10th International Conference on Terminology and Artificial Intelligence.</source>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>C. L.</given-names>
            <surname>Borgman</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Big Data, Little Data, No Data</article-title>
          . MIT Press.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>E.</given-names>
            <surname>Di Buccio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. M.</given-names>
            <surname>Di Nunzio</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Silvello</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>A Curated and Evolving Linguistic Linked Dataset</article-title>
          .
          <source>Semantic Web</source>
          ,
          <volume>4</volume>
          (
          <issue>3</issue>
          ):
          <fpage>265</fpage>
          -
          <lpage>270</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Fuhr</surname>
          </string-name>
          , K. Ja¨rvelin, N. Kando,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lippold</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Zobel</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Increasing Reproducibility in IR: Findings from the Dagstuhl Seminar on “Reproducibility of Data-Oriented Experiments in e-Science”</article-title>
          .
          <source>SIGIR Forum</source>
          ,
          <volume>50</volume>
          (
          <issue>1</issue>
          ):
          <fpage>68</fpage>
          -
          <lpage>82</lpage>
          , June.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>D. K.</given-names>
            <surname>Harman</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Information Retrieval Evaluation</article-title>
          . Morgan &amp; Claypool Publishers, USA.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>S.</given-names>
            <surname>Hennicke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Olensky</surname>
          </string-name>
          , V. de Boer,
          <string-name>
            <given-names>A.</given-names>
            <surname>Isaac</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Wielemaker</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Conversion of EAD into EDM Linked Data</article-title>
          . In L. Prediu,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hennicke</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . Nu¨rnberger,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mitschick</surname>
          </string-name>
          , and S. Ross, editors,
          <source>Proc. 1st International Workshop on Semantic Digital Archives (SDA</source>
          <year>2011</year>
          ) http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>801</volume>
          /, pages
          <fpage>82</fpage>
          -
          <lpage>88</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Clare J. Hooper</surname>
            , Nicolas Marie, and
            <given-names>Evangelos</given-names>
          </string-name>
          <string-name>
            <surname>Kalampokis</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Dissecting the butterfly: representation of disciplines publishing at the web science conference series</article-title>
          . In Noshir S. Contractor, Brian Uzzi,
          <string-name>
            <given-names>Michael W.</given-names>
            <surname>Macy</surname>
          </string-name>
          , and Wolfgang Nejdl, editors,
          <source>WebSci</source>
          , pages
          <fpage>137</fpage>
          -
          <lpage>140</lpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Axel-Cyrille Ngonga Ngomo</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>On link discovery using a hybrid approach</article-title>
          .
          <source>Journal on Data Semantics</source>
          ,
          <volume>1</volume>
          (
          <issue>4</issue>
          ):
          <fpage>203</fpage>
          -
          <lpage>217</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>G.</given-names>
            <surname>Silvello</surname>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>“Data Citation is Coming”. Introduction to the Special Issue on Data Citation</article-title>
          .
          <source>Bulletin of IEEE Technical Committee on Digital Libraries (IEEE-TCDL)</source>
          ,
          <volume>12</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>5</lpage>
          , May.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>G.</given-names>
            <surname>Silvello</surname>
          </string-name>
          , G. Bordea,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Buitelaar</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Bogers</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Semantic Representation and Enrichment of Information Retrieval Experimental Data</article-title>
          .
          <source>International Journal on Digital Libraries (IJDL).</source>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>James W.</given-names>
            <surname>Tanaka</surname>
          </string-name>
          and
          <string-name>
            <given-names>Marjorie</given-names>
            <surname>Taylor</surname>
          </string-name>
          .
          <year>1991</year>
          .
          <article-title>Object categories and expertise: Is the basic level in the eye of the beholder?</article-title>
          <source>Cognitive Psychology</source>
          ,
          <volume>23</volume>
          (
          <issue>3</issue>
          ):
          <fpage>457</fpage>
          -
          <lpage>482</lpage>
          ,
          <year>July</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>