=Paper= {{Paper |id=Vol-1841/E01_26 |storemode=property |title=Assessing the Effectiveness of Self-Regulated Learning in MOOCs Using Macro-level Behavioural Sequence Data |pdfUrl=https://ceur-ws.org/Vol-1841/E01_26.pdf |volume=Vol-1841 |authors=Lan Min,Lu Jingyan |dblpUrl=https://dblp.org/rec/conf/emoocs/LanL17 }} ==Assessing the Effectiveness of Self-Regulated Learning in MOOCs Using Macro-level Behavioural Sequence Data== https://ceur-ws.org/Vol-1841/E01_26.pdf

Proceedings of EMOOCs 2017:
Work in Progress Papers of the Experience and Research Tracks and Position Papers of the Policy Track

Assessing the Effectiveness of Self-Regulated Learning in
MOOCs using Macro-level Behavioural Sequence Data
Lan Min1, Lu Jingyan2
1,2
The University of Hong Kong, Faculty of Education, Hong Kong, China
minlan@hku.hk, jingyan@hku.hk

Abstract. Learners are demanded more self-regulatory capability to carry on
effective online learning. Online course instructors attempt to stimulate online
learners’ effective self-regulated learning (SRL) to support effective learning
and enhance achievement. Knowing how the online learners learn in SRL loop
will contribute to the effective course design and scaffolding. In this study, the
learners from an edX MOOC were differentiated into more effective self-
regulated learners (SRLers) and less effective SRLers based on the criteria of
three SRL phases behavioural sequence patterns. The clickstream data of 5764
learners was analyzed on macro-level behavioural learning sequence through n-
gram algorithm. Persistence and grade were compared among the different
types of learners. The results showed us that more effective SRLers persisted
longer and performed better than less effective SRLers on a significant level.

Keywords: Self-regulated Learning, Behavioural Sequence Pattern, MOOC,
Persistence, Learning Achievement

1 Introduction
Recently, massive open online courses (MOOCs) offer people more opportunities to
access the free educational resources. Learners are required more self-regulation
capability to self-learning with fewer instructors’ assistance. Various emerging
problems on learning and teaching in MOOCs challenge the MOOCs development.
For instance, the prominent high attrition phenomenon [1] leads the persistence issue
to become one of the controversial topics for MOOC study. According to Wigfield,
Klauda, and Cambria [2], persistence is a key behavioural indicator of self-regulatory
capacity in the monitoring and control phases of self-regulation. Most previous
persistence research in SRL field was conducted in face to face situation using a self-
reported questionnaire or experiment [3-4]. In some e-learning settings, for instance,
Azevedo and his colleagues [5] developed a tool named Metatutor to assess and
convey SRL. These studies suggested that the adaptive scaffolding based on learners’
navigation path was necessary to fostering learners’ learning and use of key SRL
processes. However, these fine-grained scaffoldings stressed on time-sensitivity that
remains a challenge for researchers [5], because micro-level click actions from whole
course learning behaviour perspectives generated uncountable learners’ types, which
are inapplicable for scaffolding from SRL aspects. When it comes to the natural
learning processes in the online environment, to accurately detect, track, and model
students’ SRL processes will remain a challenge as well [6-7]. In this study, through

1
Proceedings of EMOOCs 2017:
Work in Progress Papers of the Experience and Research Tracks and Position Papers of the Policy Track

exploring the clickstream data in an edX MOOC, based on the edX platform feature
and course design feature, the learning processes were detected to see how the SRL
behavioural sequences patterns reflect persistence and achievement.

2 Related Work
Studies show that self-regulatory processes are different among students and the
differences lead to the variance in achievement [8-9]. Schunk [10-11] states that self-
regulated learning (SRL) is an effective means to improve achievement. According to
self-regulated learning theory, students are self-regulated to the degree they are
metacognitively, motivationally, and behaviorally active participants in their learning
processes [12]. These metacognitive processes include goal setting, self-monitoring,
and self-evaluative feedback loops. Motivational feelings and beliefs refer to self-
regulated learners’ display of personal initiative, perseverance, and adaptive skills.
Researchers may be unable to observe the learners’ psychological states or cognitive
thinking. However, behaviorally, self-regulation refers to specific beneficial actions,
such as record keeping, environmental structuring, and help-seeking, which are
observable [12].
In traditional studies, SRL measuring included self-reported questionnaires,
structured interviews, teacher ratings, think aloud methods, error detection tasks, trace
methodologies, and observations [13]. Questionnaires and interviews are two most
widely-used measurements in MOOC studies on SRL topics. Kizilcec et al.’s studies
[3-4] provided us with some implications of how learners perform in MOOC context,
but, in terms of research method, as Winne and Perry [13] indicated, self-report
measures do not necessarily give a reliable picture of the self-regulation tactics
students actually engage in.
The fine-grained log data that were recorded by online platform give a chance to
apply trace methodology and learning analytics on every or groups of participants. In
a MOOC video study, researchers suggested that the clickstream data organized by
using n-gram algorithm method may help people understand how the online learners
process information [14]. According to topic modeling and n-gram analysis,
researchers construct the behavioural actions sequences into several categories to
classify how these learners interact with the online videos and process information
[14]. However, viewing learners’ behaviours as a whole, their online actions on other
pages or activities also reflect their cognitive thinking. Moreover, instructional
designers promise that learners who follow an underlying SRL behavioural sequence
pattern may have a better learning achievement. We assume that the more effective
SRL learners would have a relatively regular macro-level learning sequence. In other
words, when the actual online learning behaviours are calibrated with or similar to the
ideal learning sequences, they may get on the right track of SRL loop and learn better
and longer in an online course.
According to social cognitive SRL model from Zimmerman [15], the three-stage
learning process consist of forethought and planning phase, performance monitoring
phase and self-reflection phase. The more effective SRLers who experience the
complete three-stage loop are assumed to learn better than those who do not

2
Proceedings of EMOOCs 2017:
Work in Progress Papers of the Experience and Research Tracks and Position Papers of the Policy Track

experience the completed SRL loop. From this perspective, we can distinguish
effective form from ineffective forms self-regulation from the quality and quantity of
one’s self-regulatory behavioural sequence processes [15].

3 Research Questions
1. What types of learners do we have in MOOC based on macro-level SRL
behavioural sequence patterns?
2. What is the relationship between the macro-level SRL behavioural sequences
and persistence in MOOC?
3. What is the relationship between the macro-level SRL behavioural sequences
and grade in MOOC?

4 Method

4.1 Course Context
The MOOC, Epidemics, offered by The University of Hong Kong on edX was used in
this study. It composed of ten weeks of learning materials that were broken up into
four themes. There were 4-10 video lectures in each week (most videos were 5-10
minutes). Quiz was the only assessment in this MOOC. According to the navigational
bar, this MOOC has five blocks: Home (H), Course (C), Course Details (CD),
Discussion (D) and Progress (P). The learners’ click action on each block will create
one macro-level behaviour record. Under Course block, as the video watching and
quiz taking are two main activities, they are also defined as two macro-level activities.

4.2 Data Collection
Data Cleaning

Raw data were collected from 5764 learners who did at least one action in this
MOOC during the first running time. It was organized based on three items: user id
code, time, and event. Non-learners’ behaviour records (e.g. system self-generated
records and instructors’ records) were removed. Then, the micro-level behaviours
were clustered into a macro-level behaviour if the same macro-level behaviour
category was in a linear sequence based on the time series. For instance, in a time
duration 07:00:00 to 07:20:00, all the micro-level behaviours for doing the quiz that
included the actions of saving the answer, checking the answer and showing the
correct answer were marked as “Q”. Table 1 illustrates the ID-based macro-level
behaviour sequence list.

3
Proceedings of EMOOCs 2017:
Work in Progress Papers of the Experience and Research Tracks and Position Papers of the Policy Track

Table 1. Learner macro-level behaviour sequence list

ID Learning Sequence
3465 H CD C V D C H V CD D C V C V Q C V
5317 H CD D P C V C V C V Q C V
H C V C V C V Q V Q V C Q C V Q C V Q V Q V Q V Q C V C V H P H CD C H C
5879
V C V C ...

Data Analysis

In this study, the n-gram is the contiguous sequence of n items from a learner’s online
learning clickstream record on the macro-level behaviours. For instance, the
behavioural sequences “H-CD-C-V-Q-D-P” would be composed of 2-grams (e.g. H-
CD, CD-C, C-V, V-Q, Q-D, D-P), 3-grams (e.g. H-CD-C, CD-C-V, C-V-Q, V-Q-D,
Q-D-P) and 4-grams (e.g. H-CD-C-V, CD-C-V-Q, C-V-Q-D, V-Q-D-P).
Researchers predict some sequences that may reflect the SRL behaviours. During
the forethought phase, learners prepare work before performance phase on their
studying [15]. In an online setting, any navigational button that provides the
course information may attract learners’ attention to preview/overview the
course as a pre-action. In this study, Course details (CD) offers the information
about the course learning outcomes, course syllabus, course assessing types and grade
criteria, which the learners may go through before doing activities in the performance
phase. In performance phase, self-control and self-monitoring occur in the learning
process that involves learner’s attention and willpower [15]. In the relatively fixed
online setting, the behaviours in performance phase mainly happen during the
activities. The activities are mainly in the Course page (C), which include video
lecture (V), quiz (Q), and discussion (D). Learners carried out different task strategies
to monitor their learning process. Many reasonable sequences patterns can be the
combination of C, V, Q and D. Self-reflection phase happens in the final stage when
learners review their performance toward final goals [15-16]. Any course design that
helps learners review their performance or match to the final goals can be
potentially lead to actions in self-reflection phase. Progress page (P) helps the
learners check their learning progress is going on. Since the quiz is the only
assessment in this course, a possible self-reflection behavioural sequence pattern may
be the Q-P and V-Q-P or Q-P-CD because after taking the quiz the learner may
review their performance on P. Above all, three SRL behavioural sequence patterns
were proposed in Table 2. Each learner will have a count record of sequence pattern
in each of the three SRL phases. More effective SRLers are those who did action in
each SRL phase, while less effective SRLers may missed one or two phases.
Persistence is defined as the weeks the learner stays in the course (10 weeks in
total). Achievement is the grade (1 point in total). One-way ANOVA will be
conducted to observe the significant differences among the more effective SRLers
and less effective SRLers on persistence and achievement. In the relationship between
the persistence/the grade and three SRL phases, multiple regression will be applied.

4
Proceedings of EMOOCs 2017:
Work in Progress Papers of the Experience and Research Tracks and Position Papers of the Policy Track

Table 2. Three SRL Behavioural Sequence Patterns

Phase Sequences
Forethought Phase H-CD, CD-C, H-CD-C, H-C-CD-D-P (or H-CD-D-P-C or C-H-CD-
D-P)
Performance Phase C-V, V-Q, C-V-Q, C-D, V-D, C-V-D, Q-D, C-Q-D, V-Q-D, C-V-Q-D
Self-reflection Phase P-CD, Q-P, Q-P-CD, V-Q-P, V-Q-P-CD

5 Result

5.1 Macro-level SRL Behavioural Sequences Overview (RQ1)
According to the definitions of more effective SRLer and less effective SRLer
mentioned above, it is found that the number of less effective-SRLers was nearly four
times of that of more effective SRLers (See Figure 1).
5000
4000
3000
2000
1000
0
MEF-SRLer LEF-SRLer
Number 1185 4579

Fig. 1. Number of More Effective-SRLers (MEF-SRLer) and Less Effective-SRLers
(LEF-SRLer) (N=5764)

The less effective SRLers were further subdivided based on the phase(s) they
missed (See Figure 2). About 45% less effective SRLers missed both the forethought
and self-reflection phases, and around 10% and 35% less effective SRLers missed
forethought phase and self-reflection phase respectively. There were even about 10%
less effective SRLers missed the performance phase based on the SRL behaviours
criteria. These learners are regarded as browsers (“Others” in the figure).

2500
2000
1500
1000
500
0
LEF-SRL_mf LEF-SRL_msr LEF-SRL_mf&sr Others
Number 471 1615 2034 459
*LEF-SRL_mf: only miss forethought phase; LEF-SRL_msr: only miss self-
reflection phase; LEF-SRL_mf&sr: miss forethought phase and self-reflection phase;
Others: those who assess to the course but may never do any course-related activities

Fig. 2. Types of less effective SRLers (LEF-SRLers)

5
Proceedings of EMOOCs 2017:
Work in Progress Papers of the Experience and Research Tracks and Position Papers of the Policy Track

5.2 Persistence and SRL Behavioural Sequence Patterns (RQ2)
As shown in Table 3, more effective SRLers had the highest mean value of
persistence and others had the lowest. With the one/two SRL phase(s) missing, the
persistence decreases. Especially when the self-reflection phase was missing, the
persistence accelerated dropped slightly. One-way ANOVA analysis and Tukey test
showed that the persistence of the different types of learners was different
significantly (F = 886.146, p-value < 0.001***).

Table 3. Persistence and More Effective-SRLers (MEF-SRLer) and Less Effective-
SRLers (LEF-SRLer)

MEF- LEF- LEF- LEF-
Others
Persistence SRLer SRL_mf SRL_msr SRL_mf&sr
(n=459)
(n=1185) (n=471) (n=1615) (n=2034)
Mean 6.528 5.461 2.975 2.197 1.242
Std 2.966 3.000 2.384 1.816 0.684
Min 1 1 1 1 1
25% 4 3 1 1 1
50% 7 5 2 2 1
75% 9 8 4 3 1
Max 10 10 10 10 9

We found that forethought phase and performance phase had a small positive
effect on persistence through ordinary least square (OLS) multiple regression, but
they all reached the significant level on statistics [Adj. R2 (0.480), Prob (F-statistics <
0.001), the coefficients of the three independent values (b-forethought = 0.0482, r <
0.01**, b-performance = 0.0170, r < 0.01**, b-reflection = -0.0008, r < 0.878)]. Self-
reflection had a little negative effect on persistence, but it did not reach the significant
level on statistics.

5.3 Grade and SRL Behavioural sequence patterns (RQ3)
Table 4 shows that the more effective SRLers had the highest average grade, and
others hardly got mark. With one or two SRL phase(s) missing, the average grade
dropped. Especially when the self-reflection phase was missing, the grade accelerated
dropped slightly. One-way ANOVA analysis and Tukey test showed that the grade of
the different types of learners was different significantly (F = 972.546, p-value <
0.01**).
Interestingly, there were 15 less effective SRLers who got the full mark (1 point),
while only 8 more effective SRLers got the full mark. These less effective SRLers
hardly did actions in the forethought phase. Their learning behaviours focused on
Course page, video lecture, and quiz. More specifically, they regularly learn in the
course with the sequence pattern “C-V-Q”.

6
Proceedings of EMOOCs 2017:
Work in Progress Papers of the Experience and Research Tracks and Position Papers of the Policy Track

Table 4. Grade and More Effective SRLers (MEF-SRLer) and Less Effective SRLers
(LEF-SRLer)

LEF- LEF- LEF-
MEF-SRLer Others
Grade SRL_mf SRL_msr SRL_mf&sr
(n=1185) (n=459)
(n=471) (n=1615) (n=2034)
Mean 0.538 0.410 0.120 0.051 0.002
Std 0.35 0.357 0.238 0.144 0.041
Min 0 0 0 0 0
25% 0.16 0 0 0 0
50% 0.67 0.41 0 0 0
75% 0.85 0.76 0.12 0.01 0
Max 1 1 1 1 0.87

We found that forethought phase had a little negative effect on grade through OLS
regression, but it did not reach the significant level. Performance phase had a small
positive effect on grade, and it reached the significant level on statistics. Self-
reflection had a little negative effect on persistence, and it reached the significant
level on statistics [Adj. R2 (0.544), Prob (F-statistics < 0.001), the coefficients of the
three independent values (b-forethought = -0.0009, r = 0.639, b-performance =
0.0023, r < 0.05*, b-reflection = -0.0013, r < 0.05*)].

6 Discussion

6.1 Macro-Level SRL Behavioural Sequence Patterns
Good instructional design is believed to support and stimulate SRL [17]. On the one
hand, the course design should be close to the natural SRL process that supports
learners’ essential capability to self-direct; on the other hand, the more support on the
SRL strategies should facilitate different people to construct a personalised learning
model for their SRL strategies. Then, if the course was designed with more pages of
activities for preparation before performance phase and reflection after performance
phase, the learners may pay more attention to the forethought phase and self-
reflection phase. In this study, from the course design perspective, this MOOC offers
relative concise course design that participants can follow the designed steps to learn
one by one easily. Only if the participant persist on the learning activities, they may
get a good result. However, those defined less effective SRLers may still have used
other strategies that did not leave action records online in these two phases (e.g. time
manage on schedule and taking note after learning for reflection).

6.2 Persistence, Grade and SRL
In general, the more effective SRLers were significantly persisting longer and
achieving higher than the less effective SRLers. To outward seeming, with the
forethought phase and self-reflection phase missing in online learning, both
persistence and grade performance dropped dramatically. The decreasing phenomena

7
Proceedings of EMOOCs 2017:
Work in Progress Papers of the Experience and Research Tracks and Position Papers of the Policy Track

especially happened when they missed self-reflection phase. As mentioned above, the
self-reflection phase was defined as reviewing progress page after doing the quiz or
before reviewing course detail for assessing information again. Before the less
effective SRL_msr learners did something on the progress page, they have already
made the decision between keep learning in this course and other potential events. In
other words, when we see a learner missing the reflection phase in this course, he/she
may have already logged out of the course at that time. That is why the decreasing of
persistence and grade was steeper than those just miss the forethought phase.
Regarding the multiple regression results, how much the learners did in each phase
had little or no effect on both persistence and grade. For the full mark less effective
SRLers, these learners may have a clear purpose in this course, such as getting a
certificate (performance goal), because their learning sequence pattern was regularly
on “C-V-Q...C-V-Q”. And their understanding of these contents was well. However,
we did not know if they have reviewed the videos or checked other learning resources
before taking the quiz (micro-level). Therefore, exploring the micro-level behaviours
is necessary to explain the learning performance in future studies.

7 Conclusion
More effective SRLers and less effective SRLers were differentiated based on the
three SRL phases behavioural sequence patterns according to Zimmerman’s three
stages of SRL. The finding showed that more effective SRLers outperform less
effective SRLers on both persistence and grade significantly. Compared to the
Zimmerman’s SRL model, the concrete SRL behavioural sequence patterns were
constructed through n-gram method. However, the definition of the SRL behaviours
in this study was on narrow sense, which means more patterns can be involved in
other situations. In future studies, micro-level behavioural sequence patterns studies
can be explored.

8 Limitations and Future Studies
In online learning setting, participants’ learning process is the result of both internal
and external factors, including people’s motivation, emotion, psychological and
cognitive thinking, and online learning environment. In this study, we only focused
on the macro-level behavioural sequences from the superficial perspective based on a
fixed online learning environment design. According to Zimmerman’s three stages
model [15], SRL is a cycle. It is difficult to separate SRL from subsequent three
stages activities. Furthermore, n-gram algorithm helps us capture the most frequent
behavioural sequences from the long clickstream based on the pre-defined patterns.
However, it will not automatically help us identify different types of behaviours. In
future, two aspects can be explored: 1) supplemented internal factors research to
validate the impact of SRL behavioural sequences on achievement; 2) design-based
MOOC experiment to see what kind of course design or interface navigational design
can facilitate more effective SRL behavioural sequences that lead to higher
achievement, persistence and engagement.

8
Proceedings of EMOOCs 2017:
Work in Progress Papers of the Experience and Research Tracks and Position Papers of the Policy Track

References
1. Koller, D., Ng, A., Do, C., & Chen, Z.: Retention and intention in massive open online
courses: In depth. Educause review, 48(3), 62-63 (2013).
2. Wigfield, A., Klauda, S. L., & Cambria, J.: Influences on the Development of Academic
Self Regulatory Processes. In B. J. Zimmerman & D. H. Schunk (Eds.), Handbook of self-
regulation of learning and performance (pp. 40). New York: Taylor & Francis (2011).
3. Kizilcec, R. F., Pérez-Sanagustín, M., & Maldonado, J. J.. Self-regulated learning
strategies predict learner behavior and goal attainment in Massive Open Online Courses.
Computers & Education, 104, 18-33 (2017).
4. Kizilcec, R. F., Pérez-Sanagustín, M., & Maldonado, J. J.. Recommending self-regulated
learning strategies does not improve performance in a MOOC. In Proceedings of the Third
(2016) ACM Conference on Learning@ Scale (pp. 101-104). ACM (2016, April).
5. Azevedo, R., Johnson, A., Chauncey, A., & Graesser, A.: Use of Hypermedia to Assess
and Convey Self-Regulated Learning In B. J. Zimmerman & D. H. Schunk (Eds.),
Handbook of self-regulation of learning and performance (pp. 115). New York: Taylor &
Francis Group (2011).
6. Azevedo, R. Theoretical, methodological, and analytical challenges in the research on
metacognition and self-regulation: A commentary. Metacognition & Learning, 4(1), 87-95
(2009).
7. Azevedo, R., Moos, D. C., Johnson, A. M., & Chauncey, A. D.: Measuring cognitive and
metacognitive regulatory processes during hypermedia learning: Issues and challenges.
Educational Psychologist, 45(4), 210-223 (2010).
8. Zimmerman, B. J., & Martinez-Pons, M.: Development of a structured interview for
assessing student use of self-regulated learning strategies. American Educational Research
Journal, 23(4), 614-628 (1986).
9. Zimmerman, B. J., & Martinez-Pons, M.: Construct validation of a strategy model of
student self regulated learning. Journal of Educational Psychology, 80(3), 284 (1988).
10. Schunk, D. H.: Modeling and attributional effects on children's achievement: A self-
efficacy analysis. Journal of Educational Psychology, 73(1), 93-105 (1981).
11. Schunk, D. H.: Sequential attributional feedback and children's achievement behaviors.
Journal of Educational Psychology, 76(6), 1159-1169 (1984).
12. Zimmerman, B. J.: Motivational Sources and Outcomes of Self-Regulated Learning and
Performance. In B. J. Zimmerman & D. H. Schunk (Eds.), Handbook of self-regulation of
learning and performance (pp. 49). New York: Taylor & Francis Group (2011).
13. Winne, P. H., & Perry, N. E. (2000). Measuring self-regulated learning. In M. Boekaerts,
P. R. Pintrich, & M. Zeidner (Eds.), Handbook of self-regulation (pp. 531-566). San Diego,
CA, US: Academic Press.
14. Sinha, T., Jermann, P., Li, N., & Dillenbourg, P.: Your click decides your fate: Inferring
information processing and attrition behavior from mooc video clickstream interactions.
arXiv preprint arXiv:1407.7131, 1-12 (2014).
15. Zimmerman, B. J.: Attaining Self-Regulation: A Social Cognitive Perspective. In M.
Boekaerts, P. R. Pintrich, & M. Zeidner (Eds.), Handbook of self-regulation (pp. 13-39).
California: An Imprint of Elsevier (2000).
16. Phan, T., McNeil, S. G., & Robin, B. R. Students’ patterns of engagement and course
performance in a Massive Open Online Course. Computers & Education, 95, 36-44.
doi:10.1016/j.compedu.2015.11.015 (2016)
17. Jeske, D., Backhaus, J., & Stamov Roßnagel, C.: Self- regulation during e-learning: using
behavioural evidence from navigation log files. Journal of Computer Assisted Learning,
30 (3), 272-284 (2014).