<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">WebEduRank: an Educational Ranking Principle of Web Resources for Teaching</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Alessandro</forename><surname>Marani</surname></persName>
							<email>alessandro.marani@griffithuni.edu.au</email>
							<affiliation key="aff0">
								<orgName type="department">School of Information and Communication Technology</orgName>
								<orgName type="institution">Griffith University</orgName>
								<address>
									<addrLine>170 Kessels Road</addrLine>
									<postCode>4111</postCode>
									<settlement>Nathan</settlement>
									<region>QLD</region>
									<country key="AU">Australia</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">WebEduRank: an Educational Ranking Principle of Web Resources for Teaching</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">6EC92F1ED3626DF18785EE3703FC4F3B</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T17:42+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Educational ranking principle</term>
					<term>teaching resources retrieval</term>
					<term>teaching context</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The seeking of teaching resources on the web is a very common practice among instructors. Some on-line systems aim to support instructors in this complex and delicate task. In fact, quality of teaching is without doubt an important factor for learning, and an important aspect of teaching is the delivery of quality learning material to students. Remarkable results have been achieved in the reuse and recommendation of teaching resources, producing the corner stone for Information Retrieval (IR) in education. Our research aims to connect those findings with web resources, overcoming the current limitation of analysing resources from educational repositories only. Firstly, this study addresses the problem of identifying those attributes of a teaching context that actually carry on a piece of information useful for rating web resources for teaching. Then, the project works on the design of WebEduRank, a ranking principle for rating webresources according to the teaching requirements and needs of an instructor. So far, the research has already identified the attributes that are expected to positively inform the WebEduRank. Using these attributes, the design of the WebEduRankis now completed and the methodology for its validation is already devised and implemented.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Learning Object Repositories (LOR) are a very reliable source of Learning Objects (LO), where the recommendation approaches are facilitated by the metadata that those objects come with <ref type="bibr" target="#b2">[3]</ref>. A number of issues in annotating such metadata, in both a manual and an automatic manner, are well known and some solutions have been proposed <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b9">10]</ref>. However, there is still the limitation due to the low amount of resources hosted by such repositories, and low completeness of annotation of LOs metadata in LORs <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b11">12]</ref>.</p><p>Focusing on the applications of IR techniques for the retrieval of resources for teaching, it would be interesting to do not limit this search in resources hosted by LORs or other educational datasets, but to explore also the web. In practice, we are talking of going out on the web where we cannot rely on the fact that we have educational metadata like in LORs, and we do not even know if they actually are educational resources. This research mainly aims to propose a method, called WebEduRank, for ranking or re-ranking web resources according to their suitability for a teaching context. The desired goal for the novel ranking principle is to provide a higher ranking of those web resources that present characteristics of interest for a teaching context.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Recommendation of Learning Objects</head><p>Learning Object is currently the most relevant approach for the description of educational characteristics of digital content, and it is adopted by most of the recommender systems of educational resources <ref type="bibr" target="#b2">[3]</ref>. A Learning Object is a multimedia content provided with instructional information for the reuse of the content. This instructional information, called metadata, should be able to describe the educational material. However, Learning Object metadata has some semantic issues that makes information processing hard for computers <ref type="bibr" target="#b9">[10]</ref>. These issues have to be carefully considered for the purposes of this research.</p><p>Learning Object has been widely used by recommender systems for achieving different recommendation goals. Some studies focused on recommending Learning Objects according to course structure and learning styles, among other aspects <ref type="bibr" target="#b12">[13]</ref>. Hence, Learning Object metadata resulted useful in this field, but the current popular schema of Learning Object metadata are not enough for describing the educational traits of resources, especially depicting how they can be used to achieve some learning (or teaching) goals <ref type="bibr" target="#b0">[1]</ref>.</p><p>The metadata of Learning Objects can also facilitate the process of query expansion for a more effective retrieval of resources <ref type="bibr" target="#b5">[6]</ref>, but yet there is concern about the semantic ambiguity and completeness of annotated Learning Object metadata <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b9">10,</ref><ref type="bibr" target="#b11">12]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Attributes of teaching context for informing WebEduRank</head><p>This section aims to present the process for designing a teaching context towards the proposal of WebEduRank. The teaching context has the delicate role of representing the teaching requirements and experiences of instructors. This task is very important for proposing a ranking principle of web resources able to take into account the educational context of a profiled instructor. In order to achieve this goal, the teaching context should exhaustively represent the teaching requirements of an instructor that the web resources have to comply with.</p><p>From an analysis of a review study of instructors' knowledge involved in the process of teaching (called teaching knowledge), we have found that most of the contributions refer to some important categories of teaching knowledge such as Content Knowledge, Pedagogical Knowledge and Pedagogical Content Knowledge among others <ref type="bibr" target="#b14">[15]</ref>. Also, the review of some significant studies for analysing the completeness rate and usefulness of some metadata in Learning Object Repositories can provide further attributes to be included in the Instructor Profile. <ref type="bibr" target="#b10">[11]</ref> and <ref type="bibr" target="#b11">[12]</ref> report some statistics on the completeness of Learning Object metadata in two different repositories. The first study analyses Learning Objects in GLOBE, an important repository, whilst the second one considers those Learning Objects used in a small repository of only agricultural materials called Organic.Edunet.</p><p>After the review of literature about the teaching knowledge and Learning Object, a first draft of the teaching context is presented. We suggest a teaching context TC of a course formed by Prerequisite Knowledge (PK), Concept Name (CN), Course Title (CT), Education Level (ED-LEV), Difficulty (DIFF), Starting Knowledge (SK), Target Knowledge (TK).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Evaluation of the attributes for informing the WebEduRank</head><p>This evaluation is only a first refining of the teaching context, so that the WebEdu-Rank can be focused on only those aspects that are likely to be useful for our purposes. Then, the final evaluation of the WebEduRank is the final evidence whether or not such aspects have been useful and exhaustive. The evaluation has been carried out by surveying instructors about the usefulness of the attributes of the proposed teaching context for ranking web resources for teaching.</p><p>The research question that piloted the formulation and structure of the questionnaire is the following: RQ 1 : What information of the proposed Teaching Context is perceived useful by instructors for informing the WebEduRank for the retrieval of web resources for teaching?</p><p>For addressing the research question, the questionnaire simply focused on presenting the attributes of the Teaching Context to the participants and asking them the perceived usefulness and comprehensiveness of such information for the process of ranking teaching resources. So far, a total of 33 responses have been collected; most of the participants have been recruited by e-mail. Such number of participants already allows us to conduct some statistical analysis with a good degree of significance.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Structure of the questionnaire</head><p>We approached the research question with a questionnaire of 11 items divided into three constructs as follow:</p><p>-General questions (3).</p><p>-Attitude in using the Internet for seeking teaching material <ref type="bibr" target="#b5">(6)</ref>.</p><p>-Perceived usefulness of the attributes of the teaching context <ref type="bibr" target="#b1">(2)</ref>.</p><p>The first construct provides some general information about the sample such as age and teaching experience. We assume that instructors with different age, teaching experience and place where they teach consider a teaching context similarly. Hence, this first construct is only for describing the sample and for further analysis of the results.</p><p>Instead, the second construct shows the current attitude, and so practical experience, of instructors in using current on-line systems for seeking teaching resources. This aspect should also tell us if the sample massively use the Internet for such purposes, or it is just a beginner user in this regard. We believe that answers provided by instructors that massively use the Internet for teaching can provide us with more reliable and, more importantly, experienced feedback. In this construct, there are six 5-point Likert scale questions.</p><p>Finally, the last construct is related to the perceived usefulness of the attributes in the proposed teaching context for seeking teaching resources on the web. This evaluation is based on what information instructors usually consider when they look for resources on the web; information used in different phases of the search, from formulating the query to browsing the results. Two items belong to this construct. The first item is a multiplechoice question where instructors are asked to select those attributes that they think useful when searching teaching resources on the web for a specific concept. Participants have to choose at least one option, where also the 'none of the above' option is available.</p><p>There is an open field for additional considerations by participants. The second question is a matrix of 5-points Likert scales, where each row represents one attribute of the proposed Teaching Context; participants are required to assign a relevance score to each attribute. Also for this question, there is an open field for additional comments from the participants.</p><p>Data analysis For analysing the results of the questionnaire, statistics for measuring the perceived usefulness of respondents and repeatability of the results is used. The questions involved in this task are 5-points Likert scales, so we can use statistical hypotheses test methods for rejecting or not the null hypothesis that an attribute in the Teaching Context is believed useless by the population of instructors, not only the sample of this study. The null hypothesis can be stated as follows:</p><formula xml:id="formula_0">H 0 : µ relevance 3.0<label>(1)</label></formula><p>If H 0 is rejected, it means that the research hypothesis, namely the true mean relevance for the population is greater than 3.0, is retained. A value greater than 3.0 in a 5-points Likert scale is a positive response to the relevance of the attribute. The null hypothesis is tested for all the attributes in the Teaching Context.</p><p>The application of t for this study is two-fold: i) retain or reject H 0 for the population, and ii) compute a confidence interval for the population mean of the relevance of all the attributes in the Teaching Context.</p><p>Relevance of the attributes of the Teaching Context This section presents the perceived usefulness of the attributes of the Teaching Context according to the surveyed instructors (N = 33). We use t distributions for testing H 0 for all the attributes of the Teaching Context. For each attribute, it is reported whether or not H 0 (it means the attribute is not relevant) is retained or rejected.</p><p>Two questions of the questionnaire are focused on the relevance of the attributes of the Teaching Context; only the most important one for the purposes of the WebEdu-Rank is presented here.</p><p>By means of 5-point Likert scales, participants evaluated the relevance of all the attributes of the Teaching Context, in order to capture to what extent the attributes of the proposed Teaching Context actually represent important aspects of a teaching context, independently of the specific retrieval task. For each attribute, all the participants are required to express the relevance. Looking at the results achieved so far, other than the attributes country and course title, all the attributes got a very positive outcome.</p><p>In order to reject H 0 for the degree of freedom equal to 32 (N-1), t must be greater than 2.738 at .01 significance level. All the attributes but course title and country rejected H 0 , presenting a t-value significantly over 2.738 and p-values smaller than .05.</p><p>course title is a borderline case where H 0 is retained but the p-value is just over .05 and the lower bound of the confidence interval mean for the population is 2.995, slightly lower than the desired value of 3.0. Totally different is the situation for the attribute country, where the lower bound of the population mean is 2.348, remarkably lower than 3.0. For this reason, only the attribute country is removed.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">The WebEduRank</head><p>The WebEduRank aims to rate web pages according to some aspects of the teaching of an instructor. The rating should reflect the suitability of a web resource for teaching a certain concept in a particular teaching context or situation. A first note is that the proposed principle is not much focused on the ranking of the resources according to a topic. Google is already able to do it very well and with a remarkable performance. The main problem that the WebEduRank is trying to solve is the ordering of a set of web resources according to their suitability for teaching in a certain context. At this stage of the research, we consider as input a set of web resources that is assumed to be about the topic of a query, for example the results of a Google interrogation. The novel contribution of the WebEduRank would be the re-ranking of the web resources about a topic according to the teaching context of an instructor. This task is not simple at all, mainly because of the problematic structure of a web resource and the detection of educational attributes by text-analysis only. The structure of web resources, sometimes even unstructured, is a very well-known problem, challenging many activities around the text-analysis of web pages including information retrieval. However, this issue is better addressed by studies related to semantic web and web crawling, which are not research areas of this project.</p><p>Instead, the detection of educational attributes by text-analysis is a main problem to be addressed in this research. In order to achieve this goal, the WebEduRank has to be able to i) capture educational traits of web resources, and then ii) use such traits for ranking them accordingly to the teaching context of the instructor. These two problems are addressed in this research, where the traits to be extracted from web resources are those included in the Instructor Profile presented by this thesis.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">Attributes of the Instructor Profile for informing the WebEduRank</head><p>According to the finding in Section 3, during the search of teaching resources on the web, an instructor knows that he is looking for resources about a concept (CN) for a particular course (CT), when some prerequisite knowledge (PK) is assumed to be already learnt by the students. Implicitly, the resources have to comply with the education level (ED-LEV) of the class with a certain level of difficulty (DIFF). The course may also be a bit different in terms of concepts taught in it because of the starting knowledge (SK) of a class. Finally, the instructor knows what is the target knowledge (TK) of the course. Some of these elements, such as PK, SK and TK can be derived by analysing the concept map of the teaching context, where the SK can be represented by source concepts (concepts with no prerequisites) in the map, and the TK can consist of the leaves of the concept map.</p><p>The proposed WebEduRank should replicate the aforementioned behaviour of instructors when ranking web resources for teaching. The Teaching Context here presented can provide the WebEduRank with such information, now the problem is how to use these attributes for analysing web resources accordingly to the process previously described. The purpose is to use these data for ranking the resources, not for formulating or expanding the user query. It would be interesting to deeply explore also these aspects, but they are out of the reach of this Ph.D. project. Only a shallow investigation has been conducted within this research, and the results are presented in <ref type="bibr" target="#b6">[7]</ref>. The attributes for informing the WebEduRank is a first important finding of this research towards the design of the WebEduRank itslef.</p><p>The next challenge is in which parts of a web page the principle should look for that information. We can expect that some information of the teaching context is more likely to be found in the links of a web page (e.g. prerequisite knowledge or concepts) than the body or the title. Usually, for ranking web pages the body is the part of the page that is mostly analysed beside to some metadata if available. However, within the body of a web page we can find different tags that may contain different kind of information. For example, links (expressed by the html tag a) can have as text the name of other related concepts. So, we want to distinguish the text that comes from the following four parts of a web page: title, body, links and highlights. We believe that these parts of a web page can contain different attributes of the Teaching Context, reflecting a more informative information than just finding the information in the whole body-text. The identification of those four parts of the web pages is based on associating some HTML tags that are usually used for expressing those parts of a web page. Table <ref type="table">1</ref> shows the HTML tags that are extracted from a web page and whose text is used for composing the four texts of the web page that the WebEduRank will analyse.  <ref type="table">1</ref>. For each of the four parts of a web page analysed by the WebEduRank, it is reported the HTML tags that are used for composing their texts.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">The Expectancy Appearance Matrix</head><p>The last step for the design of the WebEduRank is defining the way of how the attributes of the Teaching Context are matched or searched into the four texts representing the web page. As anticipated earlier, we expect that the attributes can appear differently, we can say with a certain likelihood, into the four components of the web page defined in this research. In this stage, we have 7 attributes of the Teaching Context considered over the rating process of WebEduRank. We propose the Expectancy Appearance Matrix (EAM) for representing the expectation that an attribute appears into a component of the web page. Formally, EAM is a 4x7 matrix, where the rows are the four components of the web pages analysed by the WebEduRank, and the columns are the attributes of the Teaching Context. The element of a i j ∈ EAM is the likelihood that the j-th attribute is found in the i-th part of the web page.The events of finding or not an attribute in a certain part of the web page are independent of each other, because an attribute can appear in a section of the web page without any correlation with the appearance in other sections. In this regard, the definition of the elements of the EAM raises a new challenge, allowing the tuning of the WebEduRank for discovering the most appropriate values for each element of the matrix. The analysis of the dataset DAJEE <ref type="bibr" target="#b3">[4]</ref> can provide some insights about the values to be assigned to the elements of the EAM.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3">Formulation of the WebEduRank</head><p>The idea behind the WebEduRank is to find a match between the text of a web resource and the text of the attributes of the Teaching Context, being the values of the Teaching Context are texts for describing the different aspects of the teaching. The matching of the texts is performed dividing the web page into four sections providing different information about the content of the web page. The WebEduRank basically analyses the frequency of the terms of the values of the attributes of the Teaching Context with a weighting system based on the values in the EAM. As introduced in Section 4.2, the columns EAM indicate the expectancy that an attribute of the Teaching Context of an instructor appears in the four sections of the web page. Hence, the frequency of the terms of an attribute j-th into a section i-th of the web page is weighted by the expectancy value a i j reported by the EAM. In practice, each column of the EAM is a vector stating the importance of considering an attribute of the Teaching Context along the four sections of a web page.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Evaluation of the WebEduRank</head><p>This section presents the experiment for validating the WebEduRank. Part of this experiment consists of collecting teaching contexts from instructors. Then, instructors are asked to provide the rating of a set of web resources according to their suitability for teaching a concept in the context defined by them. This preliminary phase has to be conducted because the teaching context used for informing the WebEduRank is a novel approach, so there are no datasets that we can use for our purposes. For this reason, we need instructors for defining a collection of teaching contexts and web resources so that we can run a number of experiments for tuning and validating our new proposal against current practice and baseline methods. In addition, we do not have the capability of Google in indexing most of the web pages on the web, so the data collected are about web pages retrieved by Google after a query submitted by instructors. In this way, the WebEduRank proposes a ranking of the same resources presented by Google, overcoming the issue of gathering web pages to be rated by instructors in a particular teaching context.</p><p>The goal of this evaluation is two-fold: tuning the WebEduRank with different settings of the EAM and proving its improvement in the ranking of web resources for teaching compared to some baselines. As said in the previous section, the EAM is a critical aspect of the WebEduRank, so it needs to be carefully tuned for achieving the best performance that the WebEduRank can actually reach. For this reason, we can have a number of different settings of the ranking principle to be evaluated against themselves first, and then against other approaches. Therefore, we can formulate the following research question: RQ 2 : Given a set of web resources about a topic, can the WebEduRank offer an educational ranking of the web resources better than current practices?</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1">Methodology</head><p>For addressing the research question, the experiments need a set of data on which the benchmark systems and our proposal can be run. Usually, web search results produced by an IR system are compared to what the users of those systems actually believe useful and relevant for their query. In this regard, it has been pointed out a small but yet important difference between usefulness and relevance of an object for the web search <ref type="bibr" target="#b8">[9]</ref>. In our case, we have to make sure that external assessors rate the usefulness of web resources retrieved by Google more than their relevance to the query. The best way for avoiding this issue is providing external assessors with the highest level of knowledge and awareness of the purpose of the web search. Hence, it is recommended to i) clearly describe the contextual information about the web search, ii) use very informative query, not too short or ambiguous, iii) state clearly the purpose and need of the user <ref type="bibr" target="#b8">[9]</ref>.</p><p>In this study we need to undertake the data collection about the usefulness of some web resources for teaching of a concept in a certain teaching context; this is the purpose (or user need) in our case. For this reason, we ask instructors to i) define a teaching context of their interest, which includes a concept map, ii) formulate a query for retrieving web resources for a concept of the concept map using Google, and iii) rate the retrieved resources according to their usefulness for teaching the concept in the given teaching context. In practice, instructors are the external assessors of the web pages presented by Google, where the same assessors define the teaching context and formulate the query. We believe that this protocol for the data collection cover the three points highlighted by <ref type="bibr" target="#b8">[9]</ref> for actually getting the assessment of usefulness, not just relevance, of web pages.</p><p>Once the data collection phase is completed, we can finally run a number of experiments for validating our proposal. Traditionally, when evaluating an IR system the rating of resources provided by the external assessors are considered the most reliable and truthful ratings. A number of metrics are usually involved for measuring how close to or far from the assessors' ratings is the ranking or order of the resources produced by the novel IR system.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2">Structure of the experiments</head><p>A first experiment is used for comparing the performance of the different tuning of the WebEduRank and some baselines. The goal of this experiment is to have evidence of the improved ranking produced by the novel approach against some simple baseline methods. The baseline methods consist of two scoring approaches: i) the documents are scored based only on the terms in the query provided by the instructors during the data collection phase, ii) plain WebEduRank where TF-IDF score of the documents is applied using a query composed by the values of the attributes of the Teaching Context. More details about the benchmark systems and baseline are reported in the next subsection. For this first experiment, prediction-accuracy metrics are more suitable for electing the best approach. In particular, Root Mean Squared Error (RMSE) and Mean Average Precision (MAP) <ref type="bibr" target="#b13">[14]</ref> are used. The first metric measures how close to actual users' ratings are the ratings predicted by the subject approaches as follows:</p><formula xml:id="formula_1">RMSE = √ 1 |items| ∑ i∈items ( r i − r i ) 2 (2)</formula><p>where items is the set of web resources rated by instructors during the phase of data collection, r i is the rating for the resource i-th given by the user and r i is the rating for the i-th object predicted by the system. The latter metric simply computes the mean of the average precision of the results presented by the subject system. In this regard, a resource is considered relevant when it has been given a rate of 3 out of 5 by the external assessors. During the collection data phase, the first ten results presented by Google have been rated by the assessors, so we decide to average the top-3 precision measure of the result sets. The results are ordered by descent rating as produced by the subject system. It is expected that a configuration of the WebEduRank performs better than the baselines according to these two metrics. The best tuning of the WebEduRank will be tested against the benchmark systems in the next experiment.</p><p>During the second experiment, the WebEduRank is tested against Google which is pointed out to be the most popular system for retrieving teaching resources <ref type="bibr" target="#b7">[8]</ref>. The WebEduRank should benefit of the information provided by the Teaching Context for predicting more accurately the usefulness of a web resource for teaching. In this experiment, we use position-based and prediction-accuracy metrics for comparing the two approaches. About the prediction-accuracy, we can use only the MAP metric because from Google we have only the ranking of the resources, not the ratings. Among the position-based metrics we choose the Discounted Cumulative Gain (DCG) <ref type="bibr" target="#b4">[5]</ref> and the Average Precision Correlation Coefficient (τ AP ) <ref type="bibr" target="#b15">[16]</ref>. Both metrics work analysing the position of the presented resources by the subject systems against their actual usefulness.</p><p>The DCG metric tells us to what extent the produced ranking reflects the relevance of the resources, by discounting the gain for users of recommending relevant resources at lower positions. The formula of DCG given a list of items J is the following <ref type="bibr" target="#b4">[5]</ref>:</p><formula xml:id="formula_2">DCG = CG[ j] max(1,log 2 j)</formula><p>where CG[ j] expresses the cumulative gain that the user has from being recommended the item at position j within the list of items J. The gain values are cumulative, so, for each J, it is built a Gain vector (G) of the 10 presented items (ten is the number of resources rated for each query). The element G[ j] is the user-relevance of the item returned in position j. Then, the Cumulated Gain (CG) is simply the sum of the gains from position 1 to j, recursively defined as follows:</p><formula xml:id="formula_3">CG[ j] = { G[1], i f j = 1 CG[ j − 1] + G[ j], otherwise<label>(3)</label></formula><p>Similarly, τ AP works on the positions of the items but with a different approach. It measures the differences in the ordering of resources as provided by the user and the one produced by the subject system. This particular metric is more sensible of differences happening in the highest positions of the list than the ones in the bottom of the list. Hence, τ AP is more strict when resources at the top positions of the user ranking appear differently or appear at the lowest positions in the order generated by the subject method. The formula of this metric is <ref type="bibr" target="#b15">[16]</ref>:</p><formula xml:id="formula_4">τ AP = 2 J − 1 J ∑ i=2 ( C(i) i − 1 ) − 1<label>(4)</label></formula><p>where C(i) is the number of correctly ranked items above position i.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3">Benchmark system and baselines</head><p>The only benchmark system used in this study is Google. We do not need to spend many words for justifying this decision, being Google the most reliable search engine also for teaching. However, before testing our proposal against Google, we have to show some improvements that WebEduRank can offer compared to some baseline approaches. Also, this comparison allows us to test the WebEduRank with different tuning of the EAM. In this study, two baseline approaches are used. The first one does not use any information from the Teaching Context and it is used with the aim of motivating the usage of the Teaching Context for informing the WebEduRank. The second approach is pretty similar to the method of the WebEduRank, but without using the EAM. Both methods use the same score function for predicting the rating of the items, based on the TF-IDF score of a document given the query. The only difference is in the formulation of the query; the first approach uses the same query used by instructors during the collection data phase (no usage of Teaching Context), while the second approach builds the query using the attributes of the Teaching Context by following the structure of the fifth query presented in Table <ref type="table">2</ref> of <ref type="bibr" target="#b6">[7]</ref>. Both approaches do not divide the web page into different parts, but only the body-text is analysed. Given the query text q and the body-text text, the rating is computed as follows:</p><formula xml:id="formula_5">score(q,text) = ∑ t∈q √ f requency(t in text) • id f (t) 2<label>(5)</label></formula><p>where id f (t) is defined as follows:</p><formula xml:id="formula_6">id f (t) = 1 + log numDocs docFreq(t)+1</formula><p>This score method of documents given a query is taken from the TFIDFSimilarity class of Apache Lucene<ref type="foot" target="#foot_0">1</ref> , after removing the normalisation and boosting factors.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Potential Advances</head><p>A successful proposal of a ranking principle of web resources for educational users (i.e. students and instructors) would be a very important improvement in the field of recommender systems in Technology Advanced Learning (TEL). As discussed along this contribution, the predominant limitation of current solutions in TEL against Google or other generic search engines is the number of resources and data that these systems can handle. The proposal of this doctoral work will put a first important step towards the consideration of web resources into the recommendation process of teaching resources. As part of the proposal of this research, the definition of the teaching context, which is expected to positively inform the proposed WebEduRank, may finally concentrate the proposal of new recommendation approaches on those aspects that are actually important for such task. Finally, the impact of building such teaching context should be evaluated. After a first analysis of the data about MOOCs from Coursera offered by the DAJEE dataset <ref type="bibr" target="#b3">[4]</ref>, it is anticipated that most of the elements of the proposed Teaching Context, among other data, can extracted out of the information available in DAJEE (and so in MOOCs).</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Relevance of the attributes of the proposed Teaching Context for an effective and exhaustive description of a real teaching context.</figDesc><graphic coords="5,134.77,116.90,368.63,134.29" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://lucene.apache.org/core/4 0 0/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Recommending learning objects according to a teachers&apos; contex model</title>
		<author>
			<persName><forename type="first">J</forename><surname>Bozo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Alarcón</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Iribarra</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Sustaining TEL: From innovation to learning and practice</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="470" to="475" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Linked education: interlinking educational resources and the web of data</title>
		<author>
			<persName><forename type="first">S</forename><surname>Dietze</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">Q</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Giordano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Kaldoudi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Dovrolis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Taibi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 27th Annual ACM Symposium on Applied Computing</title>
				<meeting>the 27th Annual ACM Symposium on Applied Computing</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2012">2012</date>
			<biblScope unit="page" from="366" to="371" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Panorama of recommender systems to support learning</title>
		<author>
			<persName><forename type="first">H</forename><surname>Drachsler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Verbert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><forename type="middle">C</forename><surname>Santos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Manouselis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Recommender systems handbook</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="421" to="451" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Dajee: A dataset of joint educational entities for information retrieval in technology enhanced learning</title>
		<author>
			<persName><forename type="first">V</forename><surname>Estivill-Castro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Limongelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lombardi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Marani</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval</title>
				<meeting>the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="681" to="684" />
		</imprint>
	</monogr>
	<note>SIGIR &apos;16</note>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Cumulated gain-based evaluation of ir techniques</title>
		<author>
			<persName><forename type="first">K</forename><surname>Järvelin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Kekäläinen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Transactions on Information Systems (TOIS)</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="422" to="446" />
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">A recommendation module to help teachers build courses through the moodle learning management system</title>
		<author>
			<persName><forename type="first">C</forename><surname>Limongelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lombardi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Marani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Sciarrone</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Temperini</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">New Review of Hypermedia and Multimedia</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="issue">1-2</biblScope>
			<biblScope unit="page" from="58" to="82" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">A comparative framework to evaluate recommender systems in technology enhanced learning: a case study</title>
		<author>
			<persName><forename type="first">M</forename><surname>Lombardi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Marani</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Artificial Intelligence and Its Applications</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="155" to="170" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Sharing teaching and learning resources: perceptions of a university&apos;s faculty members</title>
		<author>
			<persName><forename type="first">S</forename><surname>Maloney</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Moss</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Keating</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Kotsanas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Morgan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Medical education</title>
		<imprint>
			<biblScope unit="volume">47</biblScope>
			<biblScope unit="issue">8</biblScope>
			<biblScope unit="page" from="811" to="819" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">When does relevance mean usefulness and user satisfaction in web search?</title>
		<author>
			<persName><forename type="first">J</forename><surname>Mao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">Y</forename><surname>Nie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Song</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Sun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Luo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval</title>
				<meeting>the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="463" to="472" />
		</imprint>
	</monogr>
	<note>SIGIR &apos;16</note>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Metadata management and sharing in multimedia open learning environment (mole)</title>
		<author>
			<persName><forename type="first">M</forename><surname>Mylonakis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Arapi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Pappas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Moumoutzis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Christodoulakis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Metadata and Semantic Research</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="275" to="286" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">On the use of learning object metadata: The globe experience</title>
		<author>
			<persName><forename type="first">X</forename><surname>Ochoa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Klerkx</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Vandeputte</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Duval</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Towards ubiquitous learning</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="271" to="284" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Metadata quality in learning object repositories: a case study</title>
		<author>
			<persName><forename type="first">N</forename><surname>Palavitsinis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Manouselis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Sanchez-Alonso</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Electronic Library</title>
		<imprint>
			<biblScope unit="volume">32</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="62" to="82" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Broa: An agent-based model to recommend relevant learning objects from repository federations adapted to learner profile</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">A</forename><surname>Rodríguez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Tabares</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">D D</forename><surname>Mendez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">A O</forename><surname>Carranza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">M</forename><surname>Vicari</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IJIMAI</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="6" to="11" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Evaluating recommendation systems</title>
		<author>
			<persName><forename type="first">G</forename><surname>Shani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Gunawardana</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Recommender systems handbook</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="257" to="297" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Technological pedagogical content knowledge-a review of the literature</title>
		<author>
			<persName><forename type="first">J</forename><surname>Voogt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Fisser</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Pareja Roblin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Tondeur</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Van Braak</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Computer Assisted Learning</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="109" to="121" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">A new rank correlation coefficient for information retrieval</title>
		<author>
			<persName><forename type="first">E</forename><surname>Yilmaz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Aslam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Robertson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval</title>
				<meeting>the 31st annual international ACM SIGIR conference on Research and development in information retrieval</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="587" to="594" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
