<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">New Ontology and Knowledge Graph for University Curriculum Recommendation</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Nicolas</forename><surname>Hubert</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Université de Lorraine</orgName>
								<address>
									<settlement>ERPI</settlement>
									<country key="FR">France</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="institution" key="instit1">Université de Lorraine</orgName>
								<orgName type="institution" key="instit2">CNRS</orgName>
								<orgName type="institution" key="instit3">LORIA</orgName>
								<address>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Armelle</forename><surname>Brun</surname></persName>
							<affiliation key="aff1">
								<orgName type="institution" key="instit1">Université de Lorraine</orgName>
								<orgName type="institution" key="instit2">CNRS</orgName>
								<orgName type="institution" key="instit3">LORIA</orgName>
								<address>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Davy</forename><surname>Monticolo</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Université de Lorraine</orgName>
								<address>
									<settlement>ERPI</settlement>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">New Ontology and Knowledge Graph for University Curriculum Recommendation</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">5F2C06A1817A2E793A69C7377489A928</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T03:14+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Knowledge Graph</term>
					<term>Education</term>
					<term>Ontology</term>
					<term>Recommender System</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Education is a complex domain where students must make their curricula choices carefully. To model the intricacies of the educational domain, ontologies have successfully been leveraged in the past. However, no available ontology or dataset directly addresses the critical transition from high school to university from a decision-making perspective. Therefore, the contribution of our on-going work is twofold. Firstly, we introduce EducOnto -an ontology that aims at modeling university curricula and students' profiles. Secondly, we introduce EduKG -a knowledge graph inheriting the semantics of EducOnto and instantiated with data about French students and curricula.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Higher education is an intricate domain: it features a lot of distinct concepts, relationships and structural constraints. Whenever a given application domain is too complex to be grasped directly, ontologies established themselves as the reference approach to model the richness of this domain. As such, they have naturally been leveraged in education, where there is a real need for structuring information <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b1">2]</ref>. However, there is a lack of ontologies specifically concerned with the critical period ranging from the end of high school to the first years of university, in which students commit for their future.</p><p>Besides, owing to the vast amount of provided information on university curricula, students can find it tough to make the most appropriate decision regarding their educational pathways <ref type="bibr" target="#b2">[3]</ref>. Recommender Systems (RSs) have successfully been deployed to reduce the overhead associated with decision-making by sifting through massive amounts of choices and suggesting only the most relevant items to the active user <ref type="bibr" target="#b3">[4]</ref>. RSs have proved to be useful for recommending curricula to students <ref type="bibr" target="#b0">[1]</ref>. Nevertheless, most educational datasets related to curriculum recommendation are university-specific and remain private <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b5">6]</ref>.</p><p>International Semantic Web Conference (ISWC) 2022: Posters, Demos, and Industry Tracks, October 23-27, 2022 * Corresponding author. Envelope nicolas.hubert@univ-lorraine.fr (N. Hubert); armelle.brun@loria.fr (A. Brun); davy.monticolo@univ-lorraine.fr (D. Monticolo) GLOBE https://nicolas-hbt.github.io/ (N. Hubert) Orcid 0000-0002-4682-422X (N. Hubert); 0000-0002-9876-6906 (A. Brun); 0000-0002-4244-684X (D. Monticolo) Hence, facing the need for public resources related to curriculum recommendation, we present an on-going work related to the development and provision of both a new ontology and a new dataset, referred to as EducOnto and EduKG, respectively. More specifically, EducOnto focuses on modeling the pivotal period bridging the last year of high school and the first years of university, while EduKG is a Knowledge Graph (KG) encompassing students' profiles and academic training information. EduKG is built on the basis of EducOnto, from which it inherits its semantics and structural constraints.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">EducOnto: An Ontology for High School to University</head><p>In this section, we elaborate on the methodology used to design EducOnto. We follow best practices guidelines presented in <ref type="bibr" target="#b6">[7]</ref>. We distinguish three main phases that will respectively form the following paragraphs: (1) Domain and purpose definitions; (2) Ontology building process; (3) Ontology evaluation. EducOnto is publicly available at the following link: purl.org/educonto.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Domain and Purpose Definitions</head><p>In the educational domain, several ontologies are publicly available<ref type="foot" target="#foot_0">1</ref> . However, none of them directly models the period ranging from the end of high school to the first years of university. EducOnto aims at modeling such a period. Together with education experts, we defined competency questions whose purpose is to frame the scope of the ontology and act as requirements that EducOnto should meet. The exhaustive list of competency questions is made available <ref type="foot" target="#foot_1">2</ref> . Let us introduce two of them as an example:</p><p>CQ1. What are the recommended academic backgrounds for a specific university curriculum? CQ2. What is the most popular university curriculum following a given high school major?</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Ontology Building Process</head><p>With education experts, we first enumerate all the terms related to our application domain and the expected purpose of EducOnto. Two main concepts stand out: student and curriculum. Both concepts are related to other intermediate concepts such as major and specialty. Then, adopting a top-down development process, we end up with 71 classes and 30 object properties. A general overview of EducOnto that contains the main classes and properties is presented in Figure <ref type="figure" target="#fig_0">1</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Ontology Evaluation</head><p>EducOnto can be considered as successfully designed if it helps to fulfill downstream tasks related to university curricula information retrieval and recommendation. According to <ref type="bibr" target="#b7">[8]</ref>, we rely on a task-based evaluation. To do so, we first take a general use case, after which we seek to answer the previously mentioned competency questions by querying EducOnto using SPARQL. The answers to the queries are retrieved from the data contained in EduKG. Below, we  present the query for CQ1. More precisely, we retrieve the recommended high school majors to start a Bachelor's degree in Computer Science (hereinafter referred to as curriculum:lg_cs). </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">Directions for Future Work</head><p>We will enrich EducOnto by linking it with other ontologies. As previously said, few ontologies model the period from the end of high school to the first years of university. However, some ontologies may be partially reused, e.g. EduProgression Ontology<ref type="foot" target="#foot_2">3</ref> .</p><p>In addition, we intend to develop collaborative work with other European institutions and education experts to broaden the scope of EducOnto to countries belonging to the European Higher Education Area<ref type="foot" target="#foot_3">4</ref> .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">EduKG: An Educational Knowledge Graph</head><p>In this section, we detail the whole pipeline for the construction of EduKG. EduKG is publicly available at the following link: purl.org/edukg.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Data Collection</head><p>Data were collected in tabular format through a self-administered online survey whose content has been developed with the education experts who helped define the competency questions (see Section 2.1). Several French high schools and universities were asked to disseminate the survey among their students. A wide spectrum of geographical areas was targeted to ensure representativeness among survey respondents. From January 2022 to April 2022, 3,583 respondents have fully answered the survey and are part of EduKG.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Construction and Description</head><p>Before constructing EduKG, some modeling choices were made, such as:</p><p>• Students rated their curriculum on a 0 to 5 scale. Ratings were binarized so as to lead to the predicates likedCurriculum (rating ≥ 3) and dislikedCurriculum (rating &lt; 3); • Students were allowed to provide keywords related to their educational interests. Because they were given a freeform text field to fill, consistency among respondents was needed.</p><p>For instance, keywords that appeared too little were removed, and too precise keywords were replaced by broader concepts; • Publicly available datasets provided by government agencies were used to enrich the knowledge graph with additional curriculum information.</p><p>Then we use Python 3.9 to create RDF triples meeting the conditions specified by EducOnto. All these triples forms our knowledge graph EduKG. A comprehensive view of EduKG is provided in Table <ref type="table" target="#tab_1">1</ref>: the number of distinct entities for each intermediate class (Table <ref type="table" target="#tab_1">1</ref>.a), information about the number of Student-Curriculum interactions and the relative sparsity of EduKG (Table <ref type="table" target="#tab_1">1</ref>.b) and the total amount of entities, relations and triplets of EduKG (Table <ref type="table" target="#tab_1">1</ref>.c).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">Directions for Future Work</head><p>We will enrich EduKG by mapping EduKG keywords to DBpedia or Wikidata entities. This would give more genericity to EduKG and facilitate its use in downstream tasks. Similarly to EducOnto, we intend to extend EduKG by adding instances from other European countries, although its feasability will depend on privacy-related and data-sharing concerns. On a technical side, a public SPARQL endpoint will be provided, so that information can be retrieved by running SPARQL queries against EduKG.</p><p>Finally, recall that the ultimate goal for building EduKG was to provide a public dataset for curriculum recommendation. Thus, the natural next step of this work is to design a RS that specifically relies on EduKG, by refining the work initiated in <ref type="bibr" target="#b8">[9]</ref>.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: A conceptual overview of EducOnto. The lower, middle and upper parts of the diagram depict the classes and properties related to the User Profile, High School and University, respectively.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Listing 1 :</head><label>1</label><figDesc>SPARQL query to address CQ1P R E F I X e d : &lt; h t t p : / / p u r l . o r g / e d u c o n t o / &gt; P R E F I X m a j o r : &lt; h t t p : / / p u r l . o r g / e d u k g / m a j o r / &gt; P R E F I X c u r r i c u l u m : &lt; h t t p : / / p u r l . o r g / e d u k g / c u r r i c u l u m / &gt; P R E F I X r d f : &lt; h t t p : / / w w w . w 3 . o r g / 1 9 9 9 / 0 2 / 2 2 -r d f -s y n t a x -n s # &gt; S E L E C T ? m a j o r W H E R E { ? m a j o r r d f : t y p e / r d f s : s u b C l a s s O f * e d : H i g h S c h o o l M a j o r . c u r r i c u l u m : l g _ c s e d : r e c o m m e n d s H i g h S c h o o l M a j o r ? m a j o r . }</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 1</head><label>1</label><figDesc>Statistics of EduKG</figDesc><table><row><cell>(a)</cell><cell>#Fields Of Study 12</cell><cell>#Keywords 321</cell><cell>#School Subjects #Majors #Specialties 15 92 13</cell></row><row><cell>(b)</cell><cell cols="2">#Users (Students) #Items (Curricula) 3,583 286</cell><cell>#Interactions 7,021</cell></row><row><cell>(c)</cell><cell>#Entities 5,452</cell><cell>#Relations 27</cell><cell>#KG Triples 36,301</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://lov.linkeddata.es/dataset/lov/vocabs?&amp;tag=Academy</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">https://nicolas-hbt.github.io/educ-ontokg/competencyquestions/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">https://lov.linkeddata.es/dataset/lov/vocabs/edupro</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">https://en.wikipedia.org/wiki/Bologna_Process</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This work is supported by the AILES PIA3 project (https://www.projetailes.com/) that seeks to support French students in their educational choices. We thank all the project partners: the University of Reims Champagne-Ardenne, the University of Lorraine, the Technological University of Troyes and the two Academies of Reims and Nancy-Metz.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Ontology-based personalized course recommendation framework</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">E</forename><surname>Ibrahim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">L</forename><surname>Ndzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Al-Maliki</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page" from="5180" to="5199" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Educor: An educational and career-oriented recommendation ontology</title>
		<author>
			<persName><forename type="first">E</forename><surname>Ilkou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Abu-Rasheed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Tavakoli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Hakimov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Kismihók</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Auer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Nejdl</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">LNCS</title>
		<imprint>
			<biblScope unit="page" from="546" to="562" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Vers un système de recommandation explicable pour l&apos;orientation scolaire</title>
		<author>
			<persName><forename type="first">N</forename><surname>Hubert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Brun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Monticolo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Workshop EXPLAIN&apos;AI -EGC</title>
				<meeting><address><addrLine>Blois, France</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Introduction to Recommender Systems Handbook</title>
		<author>
			<persName><forename type="first">F</forename><surname>Ricci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Rokach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Shapira</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2011">2011</date>
			<publisher>Springer US</publisher>
			<biblScope unit="page" from="1" to="35" />
			<pubPlace>Boston, MA</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<author>
			<persName><forename type="first">B</forename><surname>Ma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Taniguchi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Konomi</surname></persName>
		</author>
		<title level="m">Course recommendation for university environment</title>
				<imprint>
			<publisher>EDM</publisher>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Designing for serendipity in a university course recommendation system</title>
		<author>
			<persName><forename type="first">Z</forename><forename type="middle">A</forename><surname>Pardos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Jiang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. 10th ACM LAK</title>
				<meeting>10th ACM LAK</meeting>
		<imprint>
			<publisher>Association for Computing Machinery</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="350" to="359" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">Ontology development 101: A guide to creating your first ontology</title>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">F</forename><surname>Noy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">L</forename><surname>Mcguinness</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Strategies for the evaluation of ontology learning</title>
		<author>
			<persName><forename type="first">K</forename><surname>Dellschaft</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Staab</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the Conf. on Ontology Learning and Population</title>
				<meeting>of the Conf. on Ontology Learning and Population</meeting>
		<imprint>
			<publisher>IOS Press</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="253" to="272" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">New Strategies for Learning Knowledge Graph Embeddings: the Recommendation Case</title>
		<author>
			<persName><forename type="first">N</forename><surname>Hubert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Monnin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Brun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Monticolo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">EKAW -23rd International Conf. on Knowledge Engineering and Knowledge Management</title>
				<meeting><address><addrLine>Bolzano, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
