<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Knowledge discovery and enrichment from scholarly data for expert finding</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Stella</forename><surname>Zevio</surname></persName>
							<email>zevio@lipn.univ-paris13.fr</email>
							<affiliation key="aff0">
								<orgName type="laboratory">LIPN -CNRS UMR 7030</orgName>
								<orgName type="institution">Université Paris XIII</orgName>
								<address>
									<addrLine>99 avenue Jean-Baptiste Clément</addrLine>
									<postCode>93430</postCode>
									<settlement>Villetaneuse</settlement>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Knowledge discovery and enrichment from scholarly data for expert finding</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">AFABE40BFF67139E91DFE399EF368A20</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T08:42+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Scholarly data</term>
					<term>Graph mining</term>
					<term>Knowledge discovery</term>
					<term>Expert finding</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>With the generalisation of the digitalization of scientific publications, scholarly data is now tackled with a big data perspective. In this context, interest about applications like expert finding, research recommandation systems or collaborators discovery grows and challenges related to knowledge discovery on large-scale scholarly data arise. The Unified Knowledge Platform (Plateforme de Connaissances Unifiées) project aims at developing an open-source platform valuing scholarly as well as business data. In that respect, we propose an approach and a methodology for discovering knowledge and enrich it from workings documents, more precisely scientific publications, in the particular use case of expert finding. Compared to the state of the art, the originality of our approach lies in the combination of text mining as well as graph mining methods, more specifically graph abstraction. We present an experiment with already published results on the 9-years acts of a French workshop on semantic information retrieval. In this experiment, we managed to obtain a graph mapping researchers who participated in the workshop. In this graph, researchers are linked together by co-publication relationships and described by their topics of publication. We were also able to detect dense communities of researchers with the help of graph abstraction. Based on these results and in the light of the state of the art, we discuss further research tracks.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>An expertise is "an individual's skill, knowledge, aptitude or behaviour" <ref type="bibr" target="#b5">[6]</ref>. The task of expert finding consists in assessing individuals' expertises (i.e. constructing their expert profile <ref type="bibr" target="#b4">[5]</ref>). This task has various applications in industries such as finding employable and appropriate candidates or assigning an expert to a task or a project for example. In academia, expert finding is also useful for assigning a researcher to a program committee or a project expertise, or setting up research projects, to name a few. According to the claim that an author of a text is an expert of its content, text appears like a solid source of knowledge for expert finding. More precisely, we focus on working documents (e.g CV, project reports, etc.). Such documents contain crucial information about an individual's expertises. In academia, they mainly consist in scholarly data.</p><p>The PCU<ref type="foot" target="#foot_0">1</ref> (Plateforme de Connaissances Unifiée, i.e Unified Knowledge Platform) project's aim is to propose an open source industrial platform valuing business (and scholarly, to an extent) data. With the recent explosion of digitization of academic and technical documents, scholarly data has known such a rapid growth <ref type="bibr" target="#b17">[18]</ref> that we now talk about big scholarly data. In that respect, interest in big scholarly data platforms has emerged <ref type="bibr" target="#b16">[17,</ref><ref type="bibr" target="#b11">12]</ref>. In this context, our aim is to discover knowledge from text (i.e scientific publications) for expert finding, represent it automatically into graphs and enrich knowledge with the hypothesis that new knowledge will emerge from the graph structure. Our research question is the following: how precise and accurate knowledge can become thanks to the knowledge enrichment process, and what methods are providing the best results on working documents, or more specifically, scientific publications?</p><p>To answer this question, we will enrich PCU with a semantic platform with the aim of supporting experiments that will test our hypothesis and enable us to answer this research question. This research question lies in the area of knowledge extraction and at the interface between knowledge discovery, natural language processing and artificial intelligence, more precisely machine learning (unsupervised methods) and graph mining.</p><p>We will present a state of the art in section 2, define our approach in section 3, define our methodology in section 4, present results that we have reached so far in section 5 reflect on the relevance of our approach and discuss further work and research tracks in section 6.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">State of the art</head><p>According to the literature, expert finding is closely related to the problem of expert profiling <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b10">11]</ref>, which implies identification of expertises and their assignation to appropriate individuals owning them <ref type="bibr" target="#b4">[5]</ref>. Initially, expert finding systems were based on people assessing their own expertises by selecting predefined keywords <ref type="bibr" target="#b1">[2]</ref>, and the use of manually generated heuristics was predominant <ref type="bibr" target="#b18">[19]</ref>. For the sake of the automation of expert finding, textual sources of knowledge were harvested <ref type="bibr" target="#b1">[2]</ref>. With the explosion of online data stored in digital libraries, scholarly data <ref type="bibr" target="#b17">[18,</ref><ref type="bibr" target="#b9">10]</ref> became a solid source of knowledge for expert finding. In the rest of this paper, we will consider scholarly data, more precisely scientific publications, as a relevant textual source of expert finding concerning scholars.</p><p>From scientific publications, classical methods of expert finding described in the literature are based on information extraction methods <ref type="bibr" target="#b9">[10]</ref> such as metadata (title, authors, abstract, date of publication, etc.) as well as citations and author information (co-authorship, authors' affiliations) extraction. Open-source systems for metadata, citations and author information extraction exist <ref type="bibr" target="#b15">[16]</ref> but still need improvement according to their error analysis. To identify underlying expertises within scientific publications, concept extraction methods are used <ref type="bibr" target="#b3">[4]</ref> by means of classical keyphrase extraction algorithms for example. Such algorithms can be domain-dependant, thus rely on a model trained on an annotated corpus <ref type="bibr" target="#b8">[9]</ref> or take advantage of knowledge of the domain by investigating relations between expertise topics extracted <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b7">8]</ref>. Concerning the computer science domain, an ontology has recently been released <ref type="bibr" target="#b12">[13]</ref>.</p><p>Some scholarly data platforms have already been developed, such as Rexplore <ref type="bibr" target="#b11">[12]</ref>. Representation of knowledge extracted from text through a graph is quite common. Rexplore takes advantage of this representation by providing a semantic network of fine-grained research areas, linked by semantic relations. As described in the literature, researchers have mostly used graph and machine-learning techniques for expert finding <ref type="bibr" target="#b0">[1]</ref>. As suggested <ref type="bibr" target="#b0">[1]</ref>, issues related to the identication and ranking of experts <ref type="bibr" target="#b1">[2]</ref> can be avoided by "combining content-based expertise indicators and social relationships". Combining machine learning algorithms with graph mining methods for expert finding in order to discover knowledge and enrich it is a challenging research question. Inspired from the analysis of social networks, graph mining techniques have been applied to the detection of frequent k-communities <ref type="bibr" target="#b13">[14]</ref>. With the claim that researchers and expertises are represented through an attributed graph, detecting strongly connected k-communities would be interesting to investigate for expert finding. Moreover, the application of the recent hub-authority core theory <ref type="bibr" target="#b14">[15]</ref> is also a promising investigation trail for directed citation graphs. As far as we know, no scholarly data platform takes advantage of the recent advances of graph mining techniques, even if graph representation is common.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Proposed approach</head><p>In the light of the state of the art, we propose an original approach for expert finding consisting in combining text mining, more precisely machine learning algorithms applied on text (i.e scientific publications), with graph mining methods. The graph mining methods are applied on the graph representing knowledge extracted from the text. The machine learning algorithms considered are keyphrases <ref type="bibr" target="#b8">[9]</ref> and semantic relationships <ref type="bibr" target="#b7">[8]</ref> extraction algorithms. From the text, the keyphrase extraction algorithm is initially applied, in order to extract fine-grained topics or thematics of publication within scholarly data, more precisely on full-text scientific publications. The algorithm considered is based on a model trained on an annotated corpus thus it is language-dependent and applied to the domain of computer science. It is based on a conditional random fields model trained with keyphrase candidated filtered with part-of-speech tag sequences.</p><p>Then, the semantic relationships extraction algorithm is applied on the output of the keyphrase extraction algorithm. It is based on semantic similarity between extracted keyphrases belonging to the same sentence. The semantic similarity is measured thanks to most frequent patterns and clustering methods. The algorithm is unsupervised, which enables the automatic extraction of already known as well as brand new relationships between extracted keyphrases, such as "is-a" between "information retrieval" and "task". It has been tested on the ACL corpus <ref type="bibr" target="#b6">[7]</ref>. The sequential application of these two algorithms enables us to collect the knowledge required for building a graph representing knowledge extracted from the text.</p><p>The originality of our approach described in Figure <ref type="figure" target="#fig_0">1</ref> lies in the combination of these algorithms with graph abstraction <ref type="bibr" target="#b13">[14]</ref>. From the representation of a text as a graph, the idea is to focus on strongly connected vertices applying a topological constraint, for example by searching for the k-core (i.e the largest subgraph verifying a topological constraint such as "each vertex in the subgraph has a k degree"). Several representations are possible with attributed graphs, one of them being that vertices represent researchers and are labeled by topics of publication. In Figure <ref type="figure" target="#fig_0">1</ref>, experts on topic 1 are obtained by removing R5 (who does not have 2-degree), then R4 for the same reason after R5's removal. Our expectation considering strongly connected vertices is that it may bring out new and interesting knowledge from the graph that is itself built from the representation of text. In Figure <ref type="figure" target="#fig_0">1</ref>, all experts on topic 1 obtained by 2-core abstraction on the graph are also experts on topic t2, which is a new knowledge. This hypothesis has been raised from the state of the art, more precisely from network analysis for social sciences <ref type="bibr" target="#b14">[15]</ref> and our claim is that interesting knowledge could emerge for expert finding from the generalisation of such graph mining methods. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Methodology</head><p>To support our experiments and meet PCU project's purpose, we have developed an open source semantic platform making the machine learning algorithms <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b7">8]</ref> that we selected available as a workflow. We also developed a Mathematica workflow as a complementary tool for our experiments, mostly for data manipulation and displaying as well as for the automation of the connection between the machine learning algorithms outputs and the input of MinerLC (i.e the software for applying graph abstractions). We selected the following datasets : ACL corpus written in English and the 9-years scientific acts of the Recherche d'Information SEmantique (RISE) workshops<ref type="foot" target="#foot_1">2</ref> , mostly written in French. We plan on building a larger dataset, by collecting the scientific publications of the members of our lab. Our experiments consist in applying the machine learning algorithms on the full-text of the scientific publications. We also collect structured metadata such as titles, authors or keywords thanks to CERMINE <ref type="bibr" target="#b15">[16]</ref>.</p><p>From the knowledge discovered, we build a graph G = (V,E) (V being the vertices, i.e the objects considered and described by items, E the edges, i.e the relations between the vertices) for each dataset. Our first experiments consisted in describing researchers (i.e vertices, objects) by topics of publication (i.e items) and linking them together by relationships of co-publications (i.e edges). On this graph, 2-core abstraction is applied in order to identify communities of strongly connected researchers who published on a common set of topic of publication with at least two other researchers that also belong to the community. Further expriments will emerge, consisting in identifying the best way to represent knowledge from scientific publications, being describing researchers or publications with topics of publication, dates of publication, locations of laboratories or keywords, for example. The nature of the semantic relationships between objects described could also be different, like co-citation relationships for example. Also, more interesting topological constraints should be applied on the graph during graph abstraction, such as "each vertex in the subgraph belongs to a star" for example.</p><p>To evaluate our work, no gold standard exists as far as we know. We should evaluate the costs of building a gold standard with manually annotated corpus for supporting our experiments as well as the possibility of conducting an evaluation campaign in the domain of scholarly data. As we aim at enriching knowledge extracted with the use of graph abstractions, an error analysis as well as a comparison of knowledge extracted with and without (baseline) the application of graph abstraction should be proposed. Indeed, we should be able to detect communities straight from the graph, but such communities should be narrower after the application of graph abstraction. As a matter of fact, our hypothesis consisting in new knowledge emerging from graph abstraction would be verified, as we would be able to detect core scientific publications or core researchers of a domain with more precision.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>S. Zevio</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Results</head><p>We presented the preliminary results obtained on our experiment on the 9-years scientific acts of Recherche d'Information SEmantique (RISE) workshops (from 2009 to 2017) during the 10th edition in 2018 <ref type="bibr" target="#b19">[20]</ref>. During this experiment, we managed to obtain an attributed graph mapping the researchers of the workshops, described by their topics of publication and linked by coauthor relationships. As scientific publications in RISE are mainly written in French and the keyphrase extraction algorithm is language-dependant and trained for English, the topics of publications were simply extracted from the keywords given by the authors. We obtained a graph by applying text mining methods on the scientific acts of the We were able to detect dense communities of researchers based on graph abstraction applied on this graph. Fig. <ref type="figure">2</ref>. Community of researchers who participated in RISE workshops <ref type="bibr">(2009)</ref><ref type="bibr">(2010)</ref><ref type="bibr">(2011)</ref><ref type="bibr">(2012)</ref><ref type="bibr">(2013)</ref><ref type="bibr">(2014)</ref><ref type="bibr">(2015)</ref><ref type="bibr">(2016)</ref><ref type="bibr">(2017)</ref>. In red : community of researchers obtained by the application of a graph abstraction with topological constraint of 2-degree on the topic of "ontologie" (i.e ontology, in French)</p><p>For example, we queried researchers who published on the subject of "ontology" (i.e "ontologie" in French). We obtained a community of researchers based on authors who published on this particular topic. With the application of 2-core graph abstraction on this community, we managed to remove researchers from the community because of their lack of co-publication relationships with the others within the workshops. Thus, we managed to identify a narrower community of researchers strongly connected to each other, according to a degree 2, whose members would be the core experts of the domain. The results are showed in Figure <ref type="figure">2</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Discussion</head><p>As our preliminary results seem to imply, our approach looks quite promising. We already managed to obtain finer-grained communities of researchers according to a given topic of publication, which seems to validate the relevance of our approach. We run into difficulties related to considerations such as the size of the graph obtained and the lacking of the quality of semantic relationships describing the objects (i.e vertices) of the graph. Indeed, the graph we obtained is quite small (less than 50 vertices) and lacks items describing the objects and relationships between objects. Such a graph is interesting for experiments and ease of manipulation for showing examples, but we should consider large-scale data or at least larger corpus. Also, resources for French-language are quite limited.</p><p>Recommandations for future work would be supporting the semantic interoperability of our graphs and opening to the semantic web by integrating the Computer Science Ontology to PCU. We should enrich our vertices' descriptions with the concepts of the ontology recognized in the full-text or abstract of the scientific publications thanks to semantic annotation, with the idea of a bottom-up enrichment by generalization. For supporting multilingual processing, a translation of the Computer Science Ontology in French as well as a training of the keyphrase extraction algorithm in French would be useful, including in the aim of meeting PCU's purpose, but costs of translation should be evaluated. We should also consider conducting further experiments such as described in the section 4, among other things describing objects with more than topics of publication (dates of publication or locations of laboratories for example) and finding the best parameters for graph abstraction for expert finding.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Our approach combining data and graph mining for expert finding from scientific publications</figDesc><graphic coords="4,132.81,441.09,357.58,198.01" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">Plateforme de Connaissances Unifiée : https://www.smile.eu/fr/publications/smilelab/pcu-plateforme-connaissances-unifiees</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">RISE : https://sites.google.com/site/frenchsemanticir/documents</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>This thesis is supervised by Professor Thierry Charnois, Dr. Guillaume Santini and Dr. Haïfa Zargayouna. It is financed by the FUI project PCU (Plateforme de Connaissances Unifiées).</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Understanding Expert Finding Systems: Domains and Techniques</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">Z</forename><surname>Al-Taie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Kadry</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">I</forename><surname>Obasa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Social Network Analysis and Mining</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page">57</biblScope>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Advanced Data-driven Techniques for Mining Expertise</title>
		<author>
			<persName><forename type="first">M</forename><surname>Angelova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Boeva</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Tsiporkova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">30th Annual Workshop of the Swedish Artificial Intelligence Society SAIS 2017</title>
				<meeting><address><addrLine>Karlskrona, Sweden</addrLine></address></meeting>
		<imprint>
			<publisher>Linköping University Electronic Press</publisher>
			<date type="published" when="2017">May 15-16, 2017. 2017</date>
			<biblScope unit="page" from="45" to="52" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Determining Expert Profiles (With an Application to Expert Finding)</title>
		<author>
			<persName><forename type="first">K</forename><surname>Balog</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>De Rijke</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Joint Conference on Artificial Intelligence</title>
				<imprint>
			<date type="published" when="2007">2007</date>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page" from="2657" to="2662" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Concept Extraction Applied to the Task of Expert Finding</title>
		<author>
			<persName><forename type="first">G</forename><surname>Bordea</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The Semantic Web: Research and Applications</title>
				<editor>
			<persName><forename type="first">L</forename><surname>Aroyo</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Antoniou</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">E</forename><surname>Hyvönen</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Ten Teije</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Stuckenschmidt</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><surname>Cabral</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Tudorache</surname></persName>
		</editor>
		<meeting><address><addrLine>Berlin Heidelberg</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="451" to="456" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Domain Adaptive Extraction of Topical Hierarchies for Expertise Mining</title>
		<author>
			<persName><forename type="first">G</forename><surname>Bordea</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
	<note type="report_type">Ph.D. thesis</note>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Competency Based Management: a Review of Systems and Approaches</title>
		<author>
			<persName><forename type="first">F</forename><surname>Draganidis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Mentzas</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information Management &amp; Computer Security</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="51" to="64" />
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Détection et classification non supervisées de relations sémantiques dans des articles scientifiques</title>
		<author>
			<persName><forename type="first">K</forename><surname>Gábor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Tellier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Charnois</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Zargayouna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Buscaldi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">-TALN-RECITAL 2016</title>
		<title level="s">Actes de la conférence conjointe JEP</title>
		<meeting><address><addrLine>Paris, France</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2016">2016. 2016</date>
			<biblScope unit="volume">2</biblScope>
		</imprint>
	</monogr>
	<note>JEP-TALN-RECITAL</note>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Semantic Annotation of the ACL Anthology Corpus for the Automatic Analysis of Scientific Literature</title>
		<author>
			<persName><forename type="first">K</forename><surname>Gábor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Zargayouna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Buscaldi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Tellier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Charnois</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the LREC 2016 Conference</title>
				<meeting>the LREC 2016 Conference</meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
	<note>LREC 2016</note>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">LIPN at SemEval-2017 Task 10: Filtering Candidate Keyphrases from Scientific Publications with Part-of-Speech Tag Sequences to Train a Sequence Labeling Model</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">D</forename><surname>Hernandez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Buscaldi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Charnois</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 11th International Workshop on Semantic Evaluation</title>
				<meeting>the 11th International Workshop on Semantic Evaluation<address><addrLine>SemEval-</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017">2017. 2017</date>
			<biblScope unit="page" from="995" to="999" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">A Survey on Scholarly Data: From Big Data Perspective</title>
		<author>
			<persName><forename type="first">S</forename><surname>Khan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">A</forename><surname>Shakil</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Alam</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information Processing &amp; Management</title>
		<imprint>
			<biblScope unit="volume">53</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="923" to="944" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">A Survey on Expert Finding Techniques</title>
		<author>
			<persName><forename type="first">S</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Hong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Li</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Intelligent Information Systems</title>
		<imprint>
			<biblScope unit="volume">49</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="255" to="279" />
			<date type="published" when="2017-10">Oct 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Exploring Scholarly Data With Rexplore</title>
		<author>
			<persName><forename type="first">F</forename><surname>Osborne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Motta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Mulholland</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Semantic Web Conference</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="460" to="477" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">A</forename><surname>Salatino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Thanapalasingam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Mannocci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Osborne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Motta</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Local Knowledge Discovery in Attributed Graphs</title>
		<author>
			<persName><forename type="first">H</forename><surname>Soldano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Santini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Bouthinon</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Tools with Artificial Intelligence (ICTAI)</title>
				<imprint>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="250" to="257" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Hub-Authority Cores and Attributed Directed Network Mining</title>
		<author>
			<persName><forename type="first">H</forename><surname>Soldano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Santini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Bouthinon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Lazega</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Tools with Artificial Intelligence (ICTAI), 2017 IEEE 29th International Conference on</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="1120" to="1127" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">CERMINE: Automatic Extraction of Structured Metadata from Scientific Literature</title>
		<author>
			<persName><forename type="first">D</forename><surname>Tkaczyk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Szostek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Fedoryszak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">J</forename><surname>Dendek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Bolikowski</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal on Document Analysis and Recognition (IJDAR)</title>
		<imprint>
			<biblScope unit="volume">18</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="317" to="335" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Towards Building a Scholarly Big Data Platform: Challenges, Lessons and Opportunities</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Khabsa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Williams</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">H</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Tuarob</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">R</forename><surname>Choudhury</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ororbia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Mitra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">L</forename><surname>Giles</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 14th ACM/IEEE-CS Joint Conference on Digital Libraries</title>
				<meeting>the 14th ACM/IEEE-CS Joint Conference on Digital Libraries</meeting>
		<imprint>
			<publisher>IEEE Press</publisher>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="117" to="126" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Big Scholarly Data: A Survey</title>
		<author>
			<persName><forename type="first">F</forename><surname>Xia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">M</forename><surname>Bekele</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Liu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Transactions on Big Data</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="18" to="35" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Expert-Finding Systems for Organizations: Problem and Domain Analysis and the DEMOIR Approach</title>
		<author>
			<persName><forename type="first">D</forename><surname>Yimam-Seid</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Kobsa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Organizational Computing and Electronic Commerce</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="1" to="24" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">experts associés à une conférence scientifique : 9 ans d&apos;ateliers Recherche d&apos;Information SEmantique (RISE)</title>
		<author>
			<persName><forename type="first">S</forename><surname>Zevio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Zargayouna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Santini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Charnois</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Actes de la dixième édition de l&apos;atelier Recherche d&apos;Information SEmantique (RISE)</title>
				<meeting>s de la dixième édition de l&apos;atelier Recherche d&apos;Information SEmantique (RISE)</meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="6" to="13" />
		</imprint>
	</monogr>
	<note>Vers une cartographie automatique des thématiques et profils d&apos;</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
