=Paper=
{{Paper
|id=Vol-1688/paper-01
|storemode=property
|title=MoocRec.com : Massive Open Online Courses Recommender System
|pdfUrl=https://ceur-ws.org/Vol-1688/paper-01.pdf
|volume=Vol-1688
|authors=Panagiotis Symeonidis,Dimitrios Malakoudis
|dblpUrl=https://dblp.org/rec/conf/recsys/SymeonidisM16
}}
==MoocRec.com : Massive Open Online Courses Recommender System==
MoocRec.com : Massive Open Online Courses
Recommender System
Panagiotis Symeonidis Dimitrios Malakoudis
Department of Informatics Department of Informatics
Aristotle University Aristotle University
Thessaloniki, 54124, Greece Thessaloniki, 54124, Greece
psymeon@gmail.com dmalakoudis@gmail.com
ABSTRACT
Massive open online courses (MOOCs) have recently gained
a huge users’ attention on the Web. They are considered
as a highly promising form of teaching from leading uni-
versities such as Stanford and Berkeley. MoocRec.com is
a web site that recommends courses to users so that, they
can acquire those skills, that are expected from their ideal
job posting. MoocRec’s recommendation engine is based on
Matrix Factorization (MF) model combined with Collabo-
rative Filtering (CF) algorithm, which exploits information
from external resources (i.e., users’ skills, courses’ character-
istics, etc.) to predict course trends and to perform rating
predictions according to them.
Categories and Subject Descriptors Figure 1: Users and Courses in the 2-D space.
H.3.3 [Information Search and Retrieval]: Information
Filtering
As shown in Figure 1, courses/users that are placed in
close distance, are the most suitable/similar to each other.
1. INTRODUCTION As shown, women prefer literature courses, whereas men
Massive Open Online Courses (MOOCs) platforms offer choose the technical ones. Specifically, the course “English
thousands of different courses and each course’s registra- Grammar and Style” can be recommended to Maria and
tion/enrolment can be in the hundreds of thousand students. Irene, whereas “From Java to Android” course is more suit-
It would be very useful, if someone could be recommended able to John. Please notice that matrix decomposition has
a course to acquire those skills, that are expected from his also revealed a second separation, which takes place among
ideal job description. people’s preference, towards practical and theoretical types
MoocRec.com is a web site that provides to users rec- of courses. In MoocRec.com, we predict users’ ratings over
ommendations of MOOCs. Firstly, users provide some in- courses based on matrix factorization (MF) technique, which
formation about their studies and their dream job. Then, exploits information from several external resources/matri-
MoocRec.com recommends to them related courses, to ac- ces.
quire the required skills for getting their dream job. The The rest of this paper is organized as follows. Section 2
heart of the recommendation engine of MoocRec.com is ma- summarizes the related work, Section 3 summarizes the sys-
trix decomposition over a user-course rating matrix R to re- tem’s architecture. Section 4 describes our recommendation
duce its dimensions and remove noise from data. To do this, engine. Finally, Section 5 concludes this paper.
we preserve a small number of k latent features (i.e., dimen-
sions) with the objective to reveal the mainstream users’
preferences. For example, in Figure 1, we plot users and
2. RELATED WORK
courses, assuming that k has been tuned to 2. Furnas et al. [1] proposed Singular Value Decomposition
(SVD) in Information Retrieval research field. More specif-
ically, SVD captures latent associations between the terms
and the documents. SVD is a well-known factorization tech-
nique that factors a matrix into three matrices. An instance
of SVD, known as UV-decomposition, searches for two ma-
trices (U and V ), whose their multiplication gives an approx-
imation of the original matrix R. A significant improvement
on the prediction accuracy of classic MF algorithm may be
obtained through the incorporation of implicit feedback into
the MF model [2, 3, 4].
Extensions of classic MF algorithm have been successfully example, course “Data Analysis for Life Sciences 6: High-
applied for recommendations in MOOC domain to address performance Computing for Reproducible Genomics” is rec-
the problem of high students’ drop-out rates from online ommended because it is related to Life Sciences and Ge-
courses. To reduce the high students’ drop-out rates, they nomics. Our recommendation algorithm, places first the
provide recommendations of useful forum threads to stu- courses which provide the greatest number of skills to the
dents based on their blog history inside a MOOC discussion target user. Moreover, courses that start soon or are self-
forum. For instance, Yang et al. [5] have designed a latent paced have priority towards others.
feature model to describe student behaviors inside a MOOC
forum.
3. SYSTEM’S ARCHITECTURE
Figure 2 introduces the architecture of the MoocRec sys-
tem. It consists of the web site, the recommendation engine,
the database and the web crawler.
Recommender Web Site
System User monitoring system
Rating system
Advanced matrix
factorization User profile based
recommender recommender
Content based Skill based Alerts
recommender recommender Search engine
Database Web Crawler
Users Alerts Searches MOOC Figure 3: Recommendation appearance and reason-
providers and social
Courses Watch lists
networks.
ing.
Skills User actions
Jobs Studies
5. CONCLUSIONS
In this paper, we proposed MoocRec.com, which exploits
Figure 2: MoocRec System Architecture information from external resources (i.e., users’ skills, courses’
characteristics, etc.) to provide course recommendations. In
future, we want to test experimentally our system to check
MoocRec.com integrates a search engine for MOOCs that
its accuracy effectiveness in terms of accurate recommenda-
are automatically retrieved using web content mining tech-
tions.
niques from MOOC providers such as edX and Coursera.
It also incorporates a MOOC recommender system, which
provides a target user with personalized content according References
to the skills he wishes to acquire. Therefore, we use content- [1] G. Furnas, S. Deerwester, and S. et al. Dumais. Informa-
based filtering in order to connect course’s description with tion retrieval using a singular value decomposition model
the desired skills. We also embed the matrix factorization of latent semantic structure. In Proc. ACM SIGIR Conf.,
technique and recommend MOOCs to a target user based pages 465–480, 1988.
on previous choices of other users.
Moreover, using alerts a registered user may be notified [2] Yehuda Koren. Factorization meets the neighborhood: a
of MOOCs when they become available. He may also rate multifaceted collaborative filtering model. In Proceeding
courses either explicitly or implicitly (log history). That is, of the 14th ACM SIGKDD international conference on
our monitoring system records the user’s actions and helps Knowledge discovery and data mining, pages 426–434.
in constructing his profile. ACM, 2008.
The web crawler scans edX and Coursera web sites and [3] Yehuda Koren and Robert M. Bell. Advances in Collabo-
enters all the MOOCs found in a MySQL database, In ad- rative Filtering. In Francesco Ricci, Lior Rokach, Bracha
dition, an additional web crawler mines from LinkedIn the Shapira, and Paul B. Kantor, editors, Recommender Sys-
skills which are connected to the top-rated jobs. Therefore, tems Handbook, pages 145–186. Springer, 2011.
the database holds information that correlates jobs to skills
and skills to courses, which makes course and job recom- [4] Arkadiusz Paterek. Improving regularized singular value
mendation possible. decomposition for collaborative filtering. In Proceedings
of KDD cup and workshop, volume 2007, pages 5–8,
2007.
4. RECOMMENDATION ENGINE
Figure 3 illustrates an example of the course recommen- [5] Diyi Yang, Mario Piergallini, Iris Howley, and Carolyn
dations which are provided by our system. An important Rose. Forum thread recommendation for massive open
characteristic of our recommendations is that the user is in- online courses. In Proceedings of 7th International Con-
formed about the reason he was recommended a course. For ference on Educational Data Mining, 2014.