=Paper=
{{Paper
|id=Vol-3335/MSW_paper5
|storemode=property
|title=ISCR - Integrative Intelligent Semantically Driven
Scheme for Online Course Recommendation
|pdfUrl=https://ceur-ws.org/Vol-3335/MSW_Paper5.pdf
|volume=Vol-3335
|authors=Harshal Sharma,Gerard Deepak
}}
==ISCR - Integrative Intelligent Semantically Driven
Scheme for Online Course Recommendation==
<pdf width="1500px">https://ceur-ws.org/Vol-3335/MSW_Paper5.pdf</pdf>
<pre>
ISCR - Integrative Intelligent Semantically Driven
Scheme for Online Course Recommendation
Harshal Sharma1,† , Gerard Deepak2,†
1
Birla Institute of Technology and Science, Pilani , Vidya Vihar, Pilani, Rajasthan, India
2
 Department of Computer Science and Engineering, Manipal Institute of Technology Bengaluru, Manipal Academy of
Higher Education, Manipal, India


                                         Abstract
                                         The easy access of Internet has caused an exponential increase in the availability of courses on a variety
                                         of platforms. The increasing popularity and demand of these online courses subsequently create a need
                                         for better and more efficient course recommendation systems. The development of WEB 3.0 has also
                                         added to this need as the current course recommendation systems are not semantically compliant. The
                                         system is user knowledge-centric. It uses Decision Trees to classify the dataset. The semantic similarities
                                         have been computed with the help of normalized compression distance, normalized google distance and
                                         Gini-index. The performance has been evaluated and compared with other approaches like OPCR, CRFL,
                                         CRQCA and hierarchical clustering plus Jaccard similarity. A clear observation has been made that the
                                         proposed, knowledge centric ISCR course recommendation system performs superiorly and attains an
                                         average Recall, and F-measure of 98.48

                                         Keywords
                                         Course Recommendation, Decision Trees, RDF Synthesis, Semantically Driven, Semantic Similarity,


1. Introduction
The advancement of the Internet has had a significant impact on all aspects of human lives,
especially in the field of education. With the increasing accessibility of the Internet, education
is shifting towards an online mode. Many platforms like Coursera, Udemy, Alison, edX, etc.,
provide a multitude of online courses and have a vast user base. This has prompted an exceptional
expansion in the quantity of these courses, making it progressively troublesome to express the
user’s individual needs accurately. With this arises a need for a course recommendation system.
   A course recommendation system analyses the user’s history and preferences to present a list
of courses that they might find interesting. E-learning has significantly increased information
consumption, and such systems are fairly accurate in suggesting relevant courses to users.
Such systems are beneficial as people often don’t exactly know what they want and might
find interesting. Besides courses recommendation these recommendation systems are used
by many organizations for a variety of purposes. For example, the main page or the feed

International Workshop on Semantic IoT (IWSIoT-2022) and Multilingual Semantic Web (IWMSW-2022) and Deep
Learning for Question Answering (IWDLQ-2022)
∗
    Corresponding author.
Envelope-Open gerard.deepak.christuni@gmail.com (G. Deepak)
Orcid 0000-0002-0877-7063 (G. Deepak)
                                       © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
    CEUR
    Workshop
    Proceedings
                  http://ceur-ws.org
                  ISSN 1613-0073
                                       CEUR Workshop Proceedings (CEUR-WS.org)
on apps like Facebook, Instagram, Google, Amazon etc. are different for different users as
the data presented to the user on these pages is according to the user’s preferences. This
idea has been in use for many years, and there have been many advancements in this field.
The current models work on content-based filtering, collaborative filtering, and knowledge
graphs. But there is always scope for improvement, especially with the developments in Web 3.0.

   Motivation: Web 3.0 or frequently referred to as the semantic web is the third generation
in the evolution of the World Wide Web and of internet services like websites and other
applications. Web 3.0 focuses on making the data available on the Internet machine readable so
as to make a semantically compliant and data-driven Web. The advancement of technology and
the tremendous increase in the amount of data generated go hand in hand. Therefore, we need
a robust recommendation system for the increasing number of courses and users. The semantic
web is a knowledge-centric and data-driven web. Furthermore, the current recommendation
models are not semantic web compliant as it is highly cohesive. This makes the development of
better and more accurate models crucial.

  Contribution: A semantically driven; knowledge centric system is proposed for online
course recommendation systems. The user generates an input query which is then subject to
pre-processing. Query pre-processing involves lemmatization, tokenization, stop word removal,
and NER (Named Entity Recognition). Furthermore, on obtaining the individual query words,
the RDF (Resource Description Framework) synthesis takes place using the RDF distiller. Entity
enrichment has been done using the Wikidata API. Classification of the dataset is done using
decision Trees. Semantic similarity has been computed using normalized compression distance,
normalized google distance and Gini index.

  Organization: The publication’s structure is as follows: Section 2 comprises is comprised of
the related works around the subject of research. Section 3 contains of the architecture for the
proposed framework. Section 4 consists of the implementation of the proposed model along
with the performance evaluation and results. Section 5 consists of the conclusion.


2. Related Works
Zhang et al., [1] described a model for a course recommendation system for MOOCs
[Massive Open Online Courses]. MCRS is based on distributed computational framework. Its
basic algorithm is distributed association rule mining, which is rooted on improvement of
Apriori algorithm. Ibrahim et al., [2] described system for recommending courses which are
personalized and ontology-based and hence the name OPCR. It combines content-based filtering
with collaborative-based filtering. Furthermore, it uses an ontology mapping technique. Lin et
al., [3] proposed an adaptive course recommendation in MOOCs. They presented Dynamic
Attention and hierarchical Reinforcement Learning [DARL] mechanism which improves the
course recommendation in terms of adaptivity. They presented a dynamic attention mechanism
that can be used to monitor alterations in user preferences.
   Pardos et al., [4] designed for serendipity in course recommendation in a university. One of
their models was based on descriptions mentioned in course catalogs and another set by course
enrollment histories. Zhang et al., [5] proposed a new approach of learning for recommendation
of courses in MOOCs using Hierarchical Reinforcement Learning. The attention mechanism
performs poorly because the additional courses the user is interested in dilute the effects of
contributory courses. To address this, they presented hierarchical reinforcement learning
algorithm to revisit and make changes to the user’s profile and based on this, enhance the model.
Agorista et al., [6] proposed an approach towards Course Recommendation using Markov
Chain Framework for. It uses a random walk-based methodology to record the connections
between several courses in chronological order.

   Jing et al., [7] described course recommendation in MOOCs. The course recommendation
algorithm uses collaborative filtering based on user’s interests, demographic profiles and course
requirements relations. Bhumichitr et al., [8] presented a recommender system for university
elective courses. It utilizes collaborative based recommendation using Alternating Least Square
[ALS] and Pearson Correlation Coefficient. Then, based on a dataset of students’ academic
records, it compares their performance. Chang et al., [9] described a hybrid system for course
recommendations. It integrates collaborative filtering and artificial immune systems. The
testing parameters used are confusion matrix analysis and average error. Ibrahim et al., [10]
presented a novel approach and built an ontology-based personalised course recommendation
framework, an ontology-based hybrid-filtering system [OPCR]. Sulaiman et al., [11] proposed
employing fuzzy logic to create a course recommendation system. using the Mamdani fuzzy
inference system as a foundation, it applied the fuzzy rules technique to determine each related
student’s skill and interest level [CRFL]. Gulzar et al., [12] presented a model for recommending
courses to scholars depending on their requirements and domains of interest using N-gram
classification technique [CRQCA]. In [14-18] several ontological frameworks which also include
knowledge graphs and a conceptual knowledge model in support of the literature of the proposed
framework have been discussed.
3. Proposed System Architecture


                               Figure 1: Proposed ISCR model

  Fig. 1 depicts the proposed system architecture for knowledge driven course recommendation
model. The user’s input query, which goes through pre-processing, serves as the model’s
driving force. Lemmatization, tokenization, stop word removal, and NER (Named Entity
Recognition) are all components of query pre-processing . Tokenization involves separating
individual words from sentences and obtaining the individual tokens for the separated words.
Lemmatization has been done to derive the base form of the word from its inflectional form.
Stop word removal focuses on eliminating stop words like of, the, and, other articles. Named
entity recognition involves marking the entities by recognizing them. Standard Python Natural
Language Toolkit (NLTK) based libraries are used for lemmatization, stop word removal,
tokenization, and NER.

  Furthermore, on obtaining the individual query words, the RDF (Resource Description
Framework) synthesis takes places. RDF synthesis takes places using the RDF distiller,
which is a triadic format of knowledge obtained from the World-Wide Web (WWW). RDF
consists of subject, object and predicate. However, the RDF is not used as it is. Only
the subject and the object are obtained. The reason pertaining to this is that the subject
and the object both are either in the form of a term or sentence while the predicate
can be a URL or a sentence or a word. Owing to the heterogeneity of the predicate,
only the subject and the object are used. Nonetheless, subject and object when occur to-
gether can imbibe stronger level of semantics. As a result, RDF subject and object alone are used.

  E-books and other learning materials are stored in a repository. The learning material
involves e-books pertaining to subjects such as, history, psychology, civics, journalism,
environmental science, ecology etc. All the e-books constituting these domains are crawled
and stored in a localized repository which are then passed to extract the taxonomy from the
indexes of the e-books. These indexes are formalized domain-wise and are subjected to the
Wikidata API for further knowledge enrichment. In other words, entity enrichment takes place
by subjecting the indexes to the Wikidata API and a knowledge map for individual domains or
subjects is formalized. Wikidata is an open and free, multilingual knowledge graph built and
managed by the Wikimedia Foundation, which is readable and editable by both humans and
machines. For many initiatives, Wikidata serves as a central repository for their structured
data. Upon obtaining the individual knowledge map for the domains, the dataset is classified
using the features obtained from the domain-based knowledge maps. The classification is done
using Decision Trees.

   Classification Decision Trees
These types of trees classify the data using yes or no questions. For example: - if we want to
classify heart disease patients, we can do so by judging the sample space, here the people we
want to classify based on whether or not they suffer from chest pain. If a person answers yes,
then they are classified into having heart disease, if they answer No, then they are classified into
not having heart disease. The same can be done by using various other factors that influence
the subject, here heart disease.

   Regression Trees
In contrast to classification trees these types of trees use numeric values to classify a dataset.
For example: - testing the dosage of a new drug developed to cure coronavirus disease. Given
the data for the percentage effectiveness of the drug upon taking a certain dose, we can make a
decision (regression) tree by dividing the given data into categories of dosage. Then, we use
these numeric values to find the percentage effectiveness of the drug at any given dosage, like
is the dosage greater than 10mg? If yes, then is it lesser than 15mg, and finally we answer by
taking the mean of the percentage effectiveness of the drug in the given range of doses taken.

   An ordinary decision tree is depicted inverted.. The top of the decision tree is called the Root
Node. Every factor or question that splits the data is called an Internal Node or Node. Nodes
which are not divided further are called Leaf nodes or leaves. Decision Trees are a simple and
effective algorithm. Often combined version, using both forms of decision trees are used, so as
to increase the accuracy.

  Based on the domain to which the query belongs to, the query enriched RDF subject object
entities have been used to compute the semantic similarity with the domain specific knowledge
map. The semantic similarity has been computed using the normalized compression distance,
normalized google distance and Gini index is also computed. Normalized compression distance
and normalized google distance are used with a threshold of 0.5.


                                         |𝐶 (𝑋 𝑌)| − min(|𝐶 (𝑋)| , |𝐶 (𝑌)|)
                        𝑁 𝐶𝐷 (𝑋 , 𝑌) =                                                         (1)
                                               max(|𝐶 (𝑋)| , |𝐶 (𝑌)|)
   Equation (1) defines the Normalized Compression Distance (NCD). E (X, Y) is the information
distance between two string X and Y, which is equal to the duration of the shortest program.
which converts X into Y and vice versa in some fixed programming language. E (X, Y) is roughly
equivalent to C(XY), which is the outcome of compressing a file made up of X concatenated
with Y using a particular compression method C.


                                     𝑚𝑎𝑥 [log 𝑓 (𝑥) , log 𝑓 (𝑦)] − log 𝑓 (𝑥, 𝑦)
                     𝑁 𝐺𝐷 (𝑥, 𝑦) =                                                             (2)
                                       log 𝑁 − min [log 𝑓 (𝑥), log 𝑓 (𝑦)]


   Equation (2) defines the Normalized Google Distance (NGD), where N is the number of
singleton search items on the pages multiplied by the total number of web pages searched by
Google. The number of results for the search terms x and y are represented by f(x) and f(y),
respectively. The number of web pages where both x and y are present at the same time is
represented by f(x, y).It is desired to maximize the number of instances at this phase, but it is
also crucial to maintain relevancy. Thus, to maintain relevancy and maximize the number of
instances, we enhance the space where the semantic similarity is computed using these two
measures by setting threshold of 0.5. Along with this Gini index is also computed with a step
deviation of 0.2. the reason behind using two semantic similarity measures and Gini index is to
enhance the number of relevant entities in the term space, and moreover, to keep the rate of
relevancy high.

  Furthermore, we are obtaining the most relevant entities to the query and these entities are
again used to calculate semantic similarity with instances classified with the decision trees. At
this stage only the normalized google distance with a threshold of 0.75 is used. The reason
behind this is that the relevancy has been computed already and has reached a refined stage.
Hence, a single semantic similarity measure with a high threshold is sufficient. Moreover, this
will also reduce the computational complexity and make the process of recommending courses
computationally inexpensive.

  Finally, the courses are ranked and recommended in the increasing order of the normalized
google distance to the user. Also, further clicks from the user are recorded, and this process
continues recursively until the user is satisfied and if there are no further user clicks then the
entire recommendation stops.
4. Implementation and Performance Evaluation and Results
The experimentations on the proposed Integrative Semantically driven Course Recommendation
(ISCR) architecture were executed on Kaggle’s Coursera Course dataset. This dataset contains
data of 890 courses. It also contains detailed description of the courses in 6 columns like rating,
difficulty, certificate, number of students enrolled etc. This is primarily the main dataset on
which the experimentations have been done. However, to enhance the relevance with respect
to domain, several courses pertaining to history, psychology, civics, journalism, environmental
science, ecology etc. were crawled. It is seen that around 142 courses regarding history, 14
courses regarding civics, 78 courses regarding psychology, 24 courses regarding environmental
science, 8 courses regarding ecology were crawled and incorporated in the framework. Out
of the added the mentioned numbers were true positives and nearly an equal number of
false positives were also found. Around 50 percent of the courses added were labelled as
neutral as they were neither true positives nor false positives but were relevant to subjects. As
a result, these courses were annotated and categorized according to the Coursera Course Dataset.

   The performance of the proposed ISCR framework for course recommendation which is
semantically inclined and integrative in nature is compared with the help of precision recall
accuracy, f measure percentages and false discovery rate (FDR) as potential metrics. The reason
for using precision, recall accuracy and f-measure percentages is because it quantifies the
percentage of the relevance of results and false discovery rate is used as the preferred metric. It
also computes the number of false positives captured by the framework. Furthermore, standard
formulation for precision recall, accuracy, f measure and false discovery rate have been used.
The experimentations were conducted for 4471 queries on the customized dataset described
later below. To compare the performance of the proposed ISCR, it is base-lined with OPCR,
CRFL, CRQCA models. It is also benchmarked with the combination of hierarchical clustering
and Jaccard’s similarity.

Table 1
Comparison of Performance of the proposed ISCR with other approaches
       Model        Avg Precision%    Avg Recall %   Avg Accuracy %    Avg F-Measure %     FDR
     OPCR [2]            90.23           93.18           91.71               91.68         0.10
     CRFL [11]           90.37           94.69            92.53              92.47         0.10
    CRQCA [12]           87.63           90.22            88.93              88.90         0.12
       HC+JS             85.31           88.12            86.72              86.69         0.15
   Proposed ISCR         96.07           98.48           97.275              97.26         0.04

  From Table 1 it is indicated where the proposed ISCR yields the highest precision of 96.07%,
highest average recall percentage of 98.48%, highest average accuracy of 97.275%, highest
average f measure percentage of 97.26%, with the lowest FDR of 0.04%. It is clearly observable
that OPCR yields an average precision of 90.23%, an average recall of 93.18%, an average
accuracy of 91.705%, an average f measure of 91.68% with an FDR of 1-0.90 = 0.1. The CRFL
model yields an average precision of 90.37%, an average recall of 94.69%, an average accuracy
of 92.53%, average f measure of 92.47% with an FDR of 0.1. The CRQCA model yields an
average precision of 87.63%, an average recall of 90.22%, an average accuracy of 88.925%,
an average f measure of 88.9% with an FDR of 0.13. The HC+JS model, the combination of
hierarchical clustering and Jaccard’s similarity yields an average precision of 85.31%, an average
recall of 88.12%, an average accuracy of 86.715%, an average f measure of 86.69% with an
FDR of 0.15. The reason why the proposed ISCR model yields the highest average precision
percentage, highest average recall percentage, highest average accuracy, highest average
F-measure percentage and lowest FDR is because it is integrative and hybridized in nature, it is
semantically driven, it is driven by RDF. The model ensures RDF synthesis for the query terms
and perfect taxonomy, which is domain specific is used and knowledge is enriched using the
Wikidata API. Since it is RDF driven, the lateral semantics is very high because the subject and
object co-occurrent is used together. Most importantly the dataset is classified using decision
trees from which the features are derived from an initially formalized knowledge graph which
is conceived based on the taxonomy as well as the RDF synthesizer. Most importantly the
relevance computation mechanism is very strong in terms of semantic similarity computation
using normalized compression distance, normalized google distance and the Gini Index. Owing
to all these factors and since it is driven by the query and RDF synthesis is carried out along
with the incorporation of a perfect taxonomy from the domains based on the standard E-books,
the proposed model is much better than the baseline models. Moreover, the semantic similarity
computation scheme which is to compute the relevance is very stringent and strong as it uses
NCP, NGP and the Gini-index.

   The reason why the OPCR model lacks, although it uses a perfect ontological model is that
ontologies are shallow in nature and apart from being shallow, they are static. So hierarchical
ontology semantic similarity model alone is used along with collaborative and content-based
filtering. The combination of contest-based filtering, collaborative filtering along with the
hierarchical ontology similarity also become insufficient because the ontology itself is static.
So even though the schemes for relevance computation are slightly stronger, owing to shallow
nature of the ontology this model does not perform as expected. The CRFL model which
uses the fuzzy logic approach for course recommendation uses fuzzy rules and techniques for
fuzzification. Although it considers rules, it does not work well and moreover there is an absence
of ontologies which is static knowledge or auxiliary knowledge into the model. So as a result,
this model also lacks and fuzzy based computation is always approximate computation and is
not concrete and the relevance computation mechanism is not strong. This is the reason why the
CRFL model lacks compared to the proposed model. The CRQCA model also lacks as it uses a
query classification approach but however, the N-gram is alone used for classification and there
is some amount of auxiliary knowledge in the model but the entire learning takes place based
on the dataset depending on the domain of the courses. Due to these reasons N-gram becomes
quite shallow and learning happens from the dataset which creates a lag in the performance of
the CRQCA model. Finally, the hybridization of hierarchical clustering with Jaccard’s similarity
also does not perform well. Although hierarchical clustering ensures good strategy clustering
and Jaccard’s similarity provides high relevance computation mechanism, the model lacks in
its accumulation of auxiliary knowledge. As a result, the hierarchical clustering plus Jaccard’s
similarity model lacks when compared to the proposed model and even compared to other
models it lacks abruptly.
                     Figure 1: Precision % vs Number of Recommendations


   Fig. 2 represents precision % vs number of recommendations curve. From the figure we can
infer that ISCR yields the highest precision in the hierarchy of precision with the number of
recommendations distribution curve. The next immediate position is taken by the OPCR model.
The next position is occupied by CRFL, followed by CRQCA and the lower most position is
occupied by the hierarchical clustering plus Jaccard’s similarity.


5. Conclusion
A semantically driven and knowledge centric model is proposed to recommend online courses.
The model is based on RDF synthesis and also uses Decision Tress and various methods for
computing semantic similarity. It also uses Wikidata API for knowledge enrichment. The
proposed ISCR model achieved an average accuracy of 97.275% with an average precision of
96.07%. It is evident from the results that the proposed ISCR model outperforms the OPCR,
CRFL, CRQCA and Hierarchical Clustering plus Jaccard Similarity models in every aspect. This
makes it a better, efficient and semantically compliant model for online course recommendation.
  References
[1] Zhang, H., Huang, T., Lv, Z., Liu, S., Zhou, Z. (2018). MCRS: A course recommendation
system for MOOCs. Multimedia Tools and Applications, 77(6), 7051-7069.
[2] Ibrahim, M. E., Yang, Y., Ndzi, D. L., Yang, G., Al-Maliki, M. (2018). Ontology-based person-
alized course recommendation framework. IEEE Access, 7, 5180-5199.
[3] Lin, Y., Feng, S., Lin, F., Zeng, W., Liu, Y., Wu, P. (2021). Adaptive course recommendation
in MOOCs. Knowledge-Based Systems, 224, 107085.
[4] Pardos, Z. A., Jiang, W. (2020, March). Designing for serendipity in a university course
recommendation system. In Proceedings of the tenth international conference on learning
analytics knowledge (pp. 350-359).
[5] Zhang, J., Hao, B., Chen, B., Li, C., Chen, H., Sun, J. (2019, July). Hierarchical reinforcement
learning for course recommendation in MOOCs. In Proceedings of the AAAI conference on
artificial intelligence (Vol. 33, No. 01, pp. 435-442).
[6] Polyzou, A., Nikolakopoulos, A. N., Karypis, G. (2019). Scholars Walk: A Markov Chain
Framework for Course Recommendation. International Educational Data Mining Society.
[7] Jing, X., Tang, J. (2017, August). Guess you like: course recommendation in MOOCs. In
Proceedings of the international conference on web intelligence (pp. 783-789).
[8] Bhumichitr, K., Channarukul, S., Saejiem, N., Jiamthapthaksin, R., Nongpong, K. (2017,
July). Recommender Systems for university elective course recommendation. In 2017 14th
international joint conference on computer science and software engineering (JCSSE) (pp. 1-5).
IEEE.
[9] Chang, P. C., Lin, C. H., Chen, M. H. (2016). A hybrid course recommendation system by
integrating collaborative filtering and artificial immune systems. Algorithms, 9(3), 47.
[10] M. E. Ibrahim, Y. Yang, D. L. Ndzi, G. Yang and M. Al-Maliki, ”Ontology-Based Personal-
ized Course Recommendation Framework,” in IEEE Access, vol. 7, pp. 5180-5199, 2019, doi:
10.1109/ACCESS.2018.2889635.
[11] Sulaiman, M. S., Tamizi, A. A., Shamsudin, M. R., Azmi, A. (2020). Course recommendation
system using fuzzy logic approach. Indonesian Journal of Electrical Engineering and Computer
Science (IJEECS), 17(1), 365-371.
[12] Gulzar, Z., Leema, A. A. (2018). Course recommendation based on query classification
approach. International Journal of Web-Based Learning and Teaching Technologies (IJWLTT),
13(3), 69-83.
[13] M. Anirudh, Gerard Deepak, A. Santhanavijayan. ”Chapter 31 SIITR: A Semantic Infused
Intelligent Approach for Tag Recommendation”, Springer Science and Business Media LLC, 2022
[14] Gupta, S., Tiwari, S., Ortiz-Rodriguez, F., Panchal, R. (2021). KG4ASTRA: question answer-
ing over Indian missiles knowledge graph. Soft Computing, 25(22), 13841-13855.
[15] Tiwari, S., Abraham, A. (2020). Semantic assessment of smart healthcare ontology. Inter-
national Journal of Web Information Systems, 16(4), 475-491.
[16] Gaurav, D., Tiwari, S. M., Goyal, A., Gandhi, N., Abraham, A. (2020). Machine intelligence-
based algorithms for spam filtering on document labeling. Soft Computing, 24(13), 9625-9638.
[17] Rai, C., Sivastava, A., Tiwari, S., Abhishek, K. (2021). Towards a Conceptual Modelling of
Ontologies. Emerging Technologies in Data Mining and Information Security: Proceedings of
IEMIS 2020, Volume 1, 1286, 39.
[18] Tiwari, S., Al-Aswadi, F. N., Gaurav, D. (2021). Recent trends in knowledge graphs: theory
and practice. Soft Computing, 25(13), 8337-8355.

</pre>