<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Knowledge discovery and enrichment from scholarly data for expert nding</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>LIPN - CNRS UMR 7030 - Universite Paris XIII 99 avenue Jean-Baptiste Clement</institution>
          ,
          <addr-line>93430 Villetaneuse</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <abstract>
        <p>With the generalisation of the digitalization of scienti c publications, scholarly data is now tackled with a big data perspective. In this context, interest about applications like expert nding, research recommandation systems or collaborators discovery grows and challenges related to knowledge discovery on large-scale scholarly data arise. The Unied Knowledge Platform (Plateforme de Connaissances Uni ees) project aims at developing an open-source platform valuing scholarly as well as business data. In that respect, we propose an approach and a methodology for discovering knowledge and enrich it from workings documents, more precisely scienti c publications, in the particular use case of expert nding. Compared to the state of the art, the originality of our approach lies in the combination of text mining as well as graph mining methods, more speci cally graph abstraction. We present an experiment with already published results on the 9-years acts of a French workshop on semantic information retrieval. In this experiment, we managed to obtain a graph mapping researchers who participated in the workshop. In this graph, researchers are linked together by co-publication relationships and described by their topics of publication. We were also able to detect dense communities of researchers with the help of graph abstraction. Based on these results and in the light of the state of the art, we discuss further research tracks.</p>
      </abstract>
      <kwd-group>
        <kwd>Scholarly data Graph mining Knowledge discovery Expert nding</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        An expertise is "an individual's skill, knowledge, aptitude or behaviour" [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The
task of expert nding consists in assessing individuals' expertises (i.e.
constructing their expert pro le [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]). This task has various applications in industries such
as nding employable and appropriate candidates or assigning an expert to a
task or a project for example. In academia, expert nding is also useful for
assigning a researcher to a program committee or a project expertise, or setting
up research projects, to name a few. According to the claim that an author of a
text is an expert of its content, text appears like a solid source of knowledge for
expert nding. More precisely, we focus on working documents (e.g CV, project
reports, etc.). Such documents contain crucial information about an individual's
expertises. In academia, they mainly consist in scholarly data.
      </p>
      <p>
        The PCU1 (Plateforme de Connaissances Uni ee, i.e Uni ed Knowledge
Platform) project's aim is to propose an open source industrial platform valuing
business (and scholarly, to an extent) data. With the recent explosion of
digitization of academic and technical documents, scholarly data has known such a rapid
growth [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] that we now talk about big scholarly data. In that respect, interest
in big scholarly data platforms has emerged [
        <xref ref-type="bibr" rid="ref12 ref17">17, 12</xref>
        ]. In this context, our aim is
to discover knowledge from text (i.e scienti c publications) for expert nding,
represent it automatically into graphs and enrich knowledge with the hypothesis
that new knowledge will emerge from the graph structure. Our research question
is the following: how precise and accurate knowledge can become thanks to the
knowledge enrichment process, and what methods are providing the best results
on working documents, or more speci cally, scienti c publications?
      </p>
      <p>To answer this question, we will enrich PCU with a semantic platform with
the aim of supporting experiments that will test our hypothesis and enable us to
answer this research question. This research question lies in the area of knowledge
extraction and at the interface between knowledge discovery, natural language
processing and arti cial intelligence, more precisely machine learning
(unsupervised methods) and graph mining.</p>
      <p>We will present a state of the art in section 2, de ne our approach in section
3, de ne our methodology in section 4, present results that we have reached so
far in section 5 re ect on the relevance of our approach and discuss further work
and research tracks in section 6.
2</p>
    </sec>
    <sec id="sec-2">
      <title>State of the art</title>
      <p>
        According to the literature, expert nding is closely related to the problem of
expert pro ling [
        <xref ref-type="bibr" rid="ref11 ref3">3, 11</xref>
        ], which implies identi cation of expertises and their
assignation to appropriate individuals owning them [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Initially, expert nding
systems were based on people assessing their own expertises by selecting prede ned
keywords [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], and the use of manually generated heuristics was predominant [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ].
For the sake of the automation of expert nding, textual sources of knowledge
were harvested [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. With the explosion of online data stored in digital libraries,
scholarly data [
        <xref ref-type="bibr" rid="ref10 ref18">18, 10</xref>
        ] became a solid source of knowledge for expert nding. In
the rest of this paper, we will consider scholarly data, more precisely scienti c
publications, as a relevant textual source of expert nding concerning scholars.
      </p>
      <p>
        From scienti c publications, classical methods of expert nding described in
the literature are based on information extraction methods [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] such as
metadata (title, authors, abstract, date of publication, etc.) as well as citations and
1 Plateforme de Connaissances Uni ee :
https://www.smile.eu/fr/publications/smilelab/pcu-plateforme-connaissances-uni ees
author information (co-authorship, authors' a liations) extraction. Open-source
systems for metadata, citations and author information extraction exist [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] but
still need improvement according to their error analysis. To identify underlying
expertises within scienti c publications, concept extraction methods are used [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]
by means of classical keyphrase extraction algorithms for example. Such
algorithms can be domain-dependant, thus rely on a model trained on an annotated
corpus [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] or take advantage of knowledge of the domain by investigating
relations between expertise topics extracted [
        <xref ref-type="bibr" rid="ref5 ref8">5, 8</xref>
        ]. Concerning the computer science
domain, an ontology has recently been released [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
      <p>
        Some scholarly data platforms have already been developed, such as
Rexplore [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Representation of knowledge extracted from text through a graph
is quite common. Rexplore takes advantage of this representation by
providing a semantic network of ne-grained research areas, linked by semantic
relations. As described in the literature, researchers have mostly used graph and
machine-learning techniques for expert nding [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. As suggested [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], issues
related to the identication and ranking of experts [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] can be avoided by "combining
content-based expertise indicators and social relationships". Combining machine
learning algorithms with graph mining methods for expert nding in order to
discover knowledge and enrich it is a challenging research question. Inspired from
the analysis of social networks, graph mining techniques have been applied to
the detection of frequent k-communities [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. With the claim that researchers
and expertises are represented through an attributed graph, detecting strongly
connected k-communities would be interesting to investigate for expert nding.
Moreover, the application of the recent hub-authority core theory [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] is also a
promising investigation trail for directed citation graphs. As far as we know, no
scholarly data platform takes advantage of the recent advances of graph mining
techniques, even if graph representation is common.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Proposed approach</title>
      <p>
        In the light of the state of the art, we propose an original approach for expert
nding consisting in combining text mining, more precisely machine learning
algorithms applied on text (i.e scienti c publications), with graph mining
methods. The graph mining methods are applied on the graph representing
knowledge extracted from the text. The machine learning algorithms considered are
keyphrases [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and semantic relationships [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] extraction algorithms. From the
text, the keyphrase extraction algorithm is initially applied, in order to extract
ne-grained topics or thematics of publication within scholarly data, more
precisely on full-text scienti c publications. The algorithm considered is based on
a model trained on an annotated corpus thus it is language-dependent and
applied to the domain of computer science. It is based on a conditional random
elds model trained with keyphrase candidated ltered with part-of-speech tag
sequences.
      </p>
      <p>
        Then, the semantic relationships extraction algorithm is applied on the
output of the keyphrase extraction algorithm. It is based on semantic similarity
between extracted keyphrases belonging to the same sentence. The semantic
similarity is measured thanks to most frequent patterns and clustering methods.
The algorithm is unsupervised, which enables the automatic extraction of
already known as well as brand new relationships between extracted keyphrases,
such as "is-a" between "information retrieval" and "task". It has been tested on
the ACL corpus [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The sequential application of these two algorithms enables
us to collect the knowledge required for building a graph representing knowledge
extracted from the text.
      </p>
      <p>
        The originality of our approach described in Figure 1 lies in the combination
of these algorithms with graph abstraction [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. From the representation of a
text as a graph, the idea is to focus on strongly connected vertices applying a
topological constraint, for example by searching for the k-core (i.e the largest
subgraph verifying a topological constraint such as "each vertex in the subgraph
has a k degree"). Several representations are possible with attributed graphs,
one of them being that vertices represent researchers and are labeled by topics
of publication. In Figure 1, experts on topic 1 are obtained by removing R5 (who
does not have 2-degree), then R4 for the same reason after R5's removal. Our
expectation considering strongly connected vertices is that it may bring out new
and interesting knowledge from the graph that is itself built from the
representation of text. In Figure 1, all experts on topic 1 obtained by 2-core abstraction
on the graph are also experts on topic t2, which is a new knowledge. This
hypothesis has been raised from the state of the art, more precisely from network
analysis for social sciences [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] and our claim is that interesting knowledge could
emerge for expert nding from the generalisation of such graph mining methods.
      </p>
    </sec>
    <sec id="sec-4">
      <title>Methodology</title>
      <p>
        To support our experiments and meet PCU project's purpose, we have developed
an open source semantic platform making the machine learning algorithms [
        <xref ref-type="bibr" rid="ref8 ref9">9, 8</xref>
        ]
that we selected available as a work ow. We also developed a Mathematica
workow as a complementary tool for our experiments, mostly for data manipulation
and displaying as well as for the automation of the connection between the
machine learning algorithms outputs and the input of MinerLC (i.e the software for
applying graph abstractions). We selected the following datasets : ACL corpus
written in English and the 9-years scienti c acts of the Recherche d'Information
SEmantique (RISE) workshops2, mostly written in French. We plan on building
a larger dataset, by collecting the scienti c publications of the members of our
lab. Our experiments consist in applying the machine learning algorithms on the
full-text of the scienti c publications. We also collect structured metadata such
as titles, authors or keywords thanks to CERMINE [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
      </p>
      <p>From the knowledge discovered, we build a graph G = (V,E) (V being the
vertices, i.e the objects considered and described by items, E the edges, i.e the
relations between the vertices) for each dataset. Our rst experiments consisted
in describing researchers (i.e vertices, objects) by topics of publication (i.e items)
and linking them together by relationships of co-publications (i.e edges). On this
graph, 2-core abstraction is applied in order to identify communities of strongly
connected researchers who published on a common set of topic of publication
with at least two other researchers that also belong to the community.
Further expriments will emerge, consisting in identifying the best way to represent
knowledge from scienti c publications, being describing researchers or
publications with topics of publication, dates of publication, locations of laboratories or
keywords, for example. The nature of the semantic relationships between objects
described could also be di erent, like co-citation relationships for example. Also,
more interesting topological constraints should be applied on the graph during
graph abstraction, such as "each vertex in the subgraph belongs to a star" for
example.</p>
      <p>To evaluate our work, no gold standard exists as far as we know. We should
evaluate the costs of building a gold standard with manually annotated corpus for
supporting our experiments as well as the possibility of conducting an evaluation
campaign in the domain of scholarly data. As we aim at enriching knowledge
extracted with the use of graph abstractions, an error analysis as well as a
comparison of knowledge extracted with and without (baseline) the application
of graph abstraction should be proposed. Indeed, we should be able to detect
communities straight from the graph, but such communities should be narrower
after the application of graph abstraction. As a matter of fact, our hypothesis
consisting in new knowledge emerging from graph abstraction would be veri ed,
as we would be able to detect core scienti c publications or core researchers of
a domain with more precision.
2 RISE : https://sites.google.com/site/frenchsemanticir/documents</p>
      <p>Zevio</p>
    </sec>
    <sec id="sec-5">
      <title>Results</title>
      <p>
        We presented the preliminary results obtained on our exp eriment on the 9-years
scienti c acts of Recherche d'Information SEmantique (RISE) workshops (from
2009 to 2017) during the 10th edition in 2018 [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. During this exp eriment, we
managed to obtain an attributed graph mapping the researchers of the
workshops, describ ed by their topics of publication and linked by coauthor
relationships. As scienti c publications in RISE are mainly written in French and the
keyphrase extraction algorithm is language-dep endant and trained for English,
the topics of publications were simply extracted from the keywords given by the
authors. We obtained a graph by applying text mining metho ds on the scienti c
acts of the workshops. We were able to detect dense communities of researchers
based on graph abstraction applied on this graph.
      </p>
      <p>{Ontologie}</p>
      <p>Ines
Bannour</p>
      <p>Adeline
Nazarenko</p>
      <p>Sylvie
Salot i
Nada
Mimouni
EmmAalenxuaenlHderervé</p>
      <p>Saidi
NiDneglnaiSngdarnéBdalraaVnaclhéorine
DLaiuvidSAkcahfiBlGeealybnricekla</p>
      <p>Csurka
RCouhqriusteFiatnlaisLeuca
BoiteLtimMianrgchesot i</p>
      <p>Chen</p>
      <p>Didier
Schwab</p>
      <p>Jibril
Frej</p>
      <p>Kian-Lam</p>
      <p>TanMAolhmaansnrai d
AbKdualraahmhCadaBJteeCharehnruei-ntvPeailePMerthueillhipepme</p>
      <p>Eric
Gaussier</p>
      <p>Loïc
Maisonnasse</p>
      <p>Farah
MoHharmaethdi
Mohsen
Gammoudi</p>
      <p>Catherine
Roussey</p>
      <p>Synda
Ouardani
Vanna
ChhuoStephan</p>
      <p>Bernard
VincenCthanet</p>
      <p>Jean-Pier e
Soulignac</p>
      <p>Haïfa</p>
      <p>Zargayouna
Sylvie
Calabret o</p>
      <p>Arnaud</p>
      <p>Renard
BJeenGaenuyy GSaeEmscluöhedel RamRBieuamtrpilceer
CaEpglayted-Zsigmond Har athi</p>
      <p>GMuaehfdaiz JeDane-scPlieesr e BMeartricn</p>
      <p>For example, we queried researchers who published on the sub ject of
"ontology" (i.e "ontologie" in French). We obtained a community of researchers based
on authors who published on this particular topic. With the application of 2-core
graph abstraction on this community, we managed to remove researchers from
the community b ecause of their lack of co-publication relationships with the
others within the workshops. Thus, we managed to identify a narrower community
of researchers strongly connected to each other, according to a degree 2, whose
memb ers would b e the core exp erts of the domain. The results are showed in
Figure 2.</p>
    </sec>
    <sec id="sec-6">
      <title>Discussion</title>
      <p>As our preliminary results seem to imply, our approach looks quite promising.
We already managed to obtain ner-grained communities of researchers
according to a given topic of publication, which seems to validate the relevance of our
approach. We run into di culties related to considerations such as the size of
the graph obtained and the lacking of the quality of semantic relationships
describing the objects (i.e vertices) of the graph. Indeed, the graph we obtained is
quite small (less than 50 vertices) and lacks items describing the objects and
relationships between objects. Such a graph is interesting for experiments and ease
of manipulation for showing examples, but we should consider large-scale data
or at least larger corpus. Also, resources for French-language are quite limited.</p>
      <p>Recommandations for future work would be supporting the semantic
interoperability of our graphs and opening to the semantic web by integrating the
Computer Science Ontology to PCU. We should enrich our vertices'
descriptions with the concepts of the ontology recognized in the full-text or abstract
of the scienti c publications thanks to semantic annotation, with the idea of a
bottom-up enrichment by generalization. For supporting multilingual processing,
a translation of the Computer Science Ontology in French as well as a training
of the keyphrase extraction algorithm in French would be useful, including in
the aim of meeting PCU's purpose, but costs of translation should be evaluated.
We should also consider conducting further experiments such as described in
the section 4, among other things describing objects with more than topics of
publication (dates of publication or locations of laboratories for example) and
nding the best parameters for graph abstraction for expert nding.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgements</title>
      <p>This thesis is supervised by Professor Thierry Charnois, Dr. Guillaume Santini
and Dr. Hafa Zargayouna. It is nanced by the FUI project PCU (Plateforme
de Connaissances Uni ees).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Al-Taie</surname>
            ,
            <given-names>M.Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kadry</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Obasa</surname>
            ,
            <given-names>A.I.</given-names>
          </string-name>
          :
          <article-title>Understanding Expert Finding Systems: Domains and Techniques</article-title>
          .
          <source>Social Network Analysis and Mining</source>
          <volume>8</volume>
          (
          <issue>1</issue>
          ),
          <volume>57</volume>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Angelova</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boeva</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tsiporkova</surname>
          </string-name>
          , E.:
          <article-title>Advanced Data-driven Techniques for Mining Expertise</article-title>
          .
          <source>In: 30th Annual Workshop of the Swedish Arti cial Intelligence Society SAIS</source>
          <year>2017</year>
          , May
          <volume>15</volume>
          {
          <fpage>16</fpage>
          ,
          <year>2017</year>
          , Karlskrona, Sweden. pp.
          <volume>45</volume>
          {
          <fpage>52</fpage>
          . No.
          <volume>137</volume>
          , Linkoping University Electronic Press (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Balog</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Rijke</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , et al.:
          <article-title>Determining Expert Pro les (With an Application to Expert Finding)</article-title>
          .
          <source>In: International Joint Conference on Arti cial Intelligence</source>
          . vol.
          <volume>7</volume>
          , pp.
          <volume>2657</volume>
          {
          <issue>2662</issue>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Bordea</surname>
          </string-name>
          , G.:
          <article-title>Concept Extraction Applied to the Task of Expert Finding</article-title>
          . In: Aroyo,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Antoniou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            , Hyvonen, E.,
            <surname>ten Teije</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Stuckenschmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Cabral</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Tudorache</surname>
          </string-name>
          , T. (eds.)
          <source>The Semantic Web: Research and Applications</source>
          . pp.
          <volume>451</volume>
          {
          <fpage>456</fpage>
          . Springer Berlin Heidelberg (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Bordea</surname>
          </string-name>
          , G.:
          <article-title>Domain Adaptive Extraction of Topical Hierarchies for Expertise Mining</article-title>
          .
          <source>Ph.D. thesis</source>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Draganidis</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mentzas</surname>
          </string-name>
          , G.:
          <article-title>Competency Based Management: a Review of Systems and Approaches</article-title>
          .
          <source>Information Management &amp; Computer Security</source>
          <volume>14</volume>
          (
          <issue>1</issue>
          ),
          <volume>51</volume>
          {
          <fpage>64</fpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Gabor</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tellier</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Charnois</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zargayouna</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buscaldi</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Detection et classi cation non supervisees de relations semantiques dans des articles scienti ques</article-title>
          .
          <source>In: JEP-TALN-RECITAL 2016. Actes de la conference conjointe JEPTALN-RECITAL</source>
          <year>2016</year>
          , vol.
          <volume>2</volume>
          . Paris, France (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Gabor</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zargayouna</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buscaldi</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tellier</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Charnois</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scienti c Literature</article-title>
          .
          <source>In: LREC 2016. Proceedings of the LREC 2016 Conference</source>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Hernandez</surname>
            ,
            <given-names>S.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buscaldi</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Charnois</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          : LIPN at SemEval-2017
          <source>Task</source>
          <volume>10</volume>
          :
          <article-title>Filtering Candidate Keyphrases from Scienti c Publications with Part-of-Speech Tag Sequences to Train a Sequence Labeling Model</article-title>
          .
          <source>In: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)</source>
          . pp.
          <volume>995</volume>
          {
          <issue>999</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Khan</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shakil</surname>
            ,
            <given-names>K.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alam</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>A Survey on Scholarly Data: From Big Data Perspective</article-title>
          .
          <source>Information Processing &amp; Management</source>
          <volume>53</volume>
          (
          <issue>4</issue>
          ),
          <volume>923</volume>
          {
          <fpage>944</fpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hong</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>A Survey on Expert Finding Techniques</article-title>
          .
          <source>Journal of Intelligent Information Systems</source>
          <volume>49</volume>
          (
          <issue>2</issue>
          ),
          <volume>255</volume>
          {279 (Oct
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Osborne</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mulholland</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Exploring Scholarly Data With Rexplore</article-title>
          . In: International Semantic Web Conference. pp.
          <volume>460</volume>
          {
          <fpage>477</fpage>
          . Springer (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Salatino</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thanapalasingam</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mannocci</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Osborne</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          : The Computer Science Ontology:
          <string-name>
            <given-names>A</given-names>
            <surname>Large-Scale Taxonomy</surname>
          </string-name>
          of Research Areas (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Soldano</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santini</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bouthinon</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Local Knowledge Discovery in Attributed Graphs</article-title>
          .
          <source>In: International Conference on Tools with Arti cial Intelligence (ICTAI)</source>
          . pp.
          <volume>250</volume>
          {
          <issue>257</issue>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Soldano</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santini</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bouthinon</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lazega</surname>
          </string-name>
          , E.:
          <article-title>Hub-Authority Cores and Attributed Directed Network Mining</article-title>
          .
          <source>In: Tools with Arti cial Intelligence (ICTAI)</source>
          ,
          <year>2017</year>
          IEEE 29th International Conference on. pp.
          <volume>1120</volume>
          {
          <fpage>1127</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Tkaczyk</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Szostek</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fedoryszak</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dendek</surname>
            ,
            <given-names>P.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bolikowski</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>CERMINE: Automatic Extraction of Structured Metadata from Scienti c Literature</article-title>
          .
          <source>International Journal on Document Analysis and Recognition (IJDAR) 18(4)</source>
          ,
          <volume>317</volume>
          {
          <fpage>335</fpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khabsa</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>H.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tuarob</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Choudhury</surname>
            ,
            <given-names>S.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ororbia</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mitra</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giles</surname>
            ,
            <given-names>C.L.</given-names>
          </string-name>
          :
          <article-title>Towards Building a Scholarly Big Data Platform: Challenges, Lessons and Opportunities</article-title>
          .
          <source>In: Proceedings of the 14th ACM/IEEE-CS Joint Conference on Digital Libraries</source>
          . pp.
          <volume>117</volume>
          {
          <fpage>126</fpage>
          . IEEE Press (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Xia</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bekele</surname>
            ,
            <given-names>T.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
          </string-name>
          , H.:
          <article-title>Big Scholarly Data: A Survey</article-title>
          .
          <source>IEEE Transactions on Big Data</source>
          <volume>3</volume>
          (
          <issue>1</issue>
          ),
          <volume>18</volume>
          {
          <fpage>35</fpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Yimam-Seid</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kobsa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Expert-Finding Systems for Organizations: Problem and Domain Analysis and the DEMOIR Approach</article-title>
          .
          <source>Journal of Organizational Computing and Electronic Commerce</source>
          <volume>13</volume>
          (
          <issue>1</issue>
          ),
          <volume>1</volume>
          {
          <fpage>24</fpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Zevio</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zargayouna</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santini</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Charnois</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Vers une cartographie automatique des thematiques et pro ls d'experts associes a une conference scienti que : 9 ans d'ateliers Recherche d'Information SEmantique (RISE)</article-title>
          . In: Actes de la dixieme edition de l'
          <source>atelier Recherche d'Information SEmantique (RISE)</source>
          . pp.
          <volume>6</volume>
          {
          <issue>13</issue>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>