=Paper= {{Paper |id=Vol-2868/article_4 |storemode=property |title=Idiographic Learning Analytics: A single student (N=1) approach using psychological networks |pdfUrl=https://ceur-ws.org/Vol-2868/article_4.pdf |volume=Vol-2868 |authors=Mohammed Saqr,Sonsoles López-Pernas }} ==Idiographic Learning Analytics: A single student (N=1) approach using psychological networks== https://ceur-ws.org/Vol-2868/article_4.pdf
Idiographic Learning Analytics:
A single student (N=1) approach using psychological networks
Mohammed Saqr 1,2, Sonsoles López-Pernas 3
1
  KTH Royal Institute of Technology, Stockholm, Sweden
2
  University of Eastern Finland, Joensuu, Finland
3
  Universidad Politécnica de Madrid, Madrid, Spain


                Abstract
                Recent findings in the field of learning analytics have brought to our attention that conclusions
                drawn from cross-sectional group-level data may not capture the dynamic processes that unfold
                within each individual learner. In this light, idiographic methods have started to gain grounds
                in many fields as a possible solution to examine students’ behavior at the individual level by
                using several data points from each learner to create person-specific insights. In this study, we
                introduce such novel methods to the learning analytics field by exploring the possible potentials
                that one can gain from zooming in on the fine-grained dynamics of a single student.
                Specifically, we make use of Gaussian Graphical Models —an emerging trend in network
                science— to analyze a single student's dispositions and devise insights specific to him/her. The
                results of our study revealed that the student under examination may be in need to learn better
                self-regulation techniques regarding reflection and planning.
                Keywords 1
                Graphical Gaussian Models, Idiographic Learning Analytics, Network Science, Psychological
                Networks

1. Introduction

    The growing field of learning analytics (LA) has drawn the attention of academics, researchers, and
administrators who aspire to understand and optimize teaching and learning [1]. Over ten years of
findings have brought immense insights to our attention. One of the most important lessons that we
have learned is that context matters: models obtained in one context are barely transferable to other
contexts [2]. Researchers have failed to replicate the results of predictive models (e.g., for estimating
student performance) across multiple learning settings due to the remarkable diversity in the data
generated by students’ learning activates, the obtained predictors, as well as the levels of statistical
significance [3,4]. These inconsistencies have made the efforts towards offering adaptive learning or
personalizing support an arduous endeavor. Researchers have called for using the high resolution data
generated by students to generate personalized insights [5]. However, analyzing cross-sectional (i.e.,
group-level) data to generate personalized recommendations does not mean that each individual person
will conform to the group average, and consequently, such insights generated by averaging over a group
are hardly transmutable to every individual person [6]. Furthermore, cross-sectional group-level data
fail to account for the dynamic processes (e.g., cognition and communication) that unfold within the
individual. Obviously, a single cross-sectional timepoint is hardly useful to explain a dynamic
phenomenon occurring over multiple time points [7].
     On this basis, idiographic methods have started to gain grounds as a possible solution to examine
behavior at the individual level in other fields. Idiographic methods use several data points from an
individual to create person-specific insights. Being derived on the person level, such analyses account
for the individual factors while being able to explain dynamic phenomena [8–10]. Winne et al. (2017)

Proceedings of the NetSciLA21 workshop, April 12, 2021
EMAIL: mmas3@kth.se (A. 1); sonsoles.lopez.pernas@upm.es (A. 2)
ORCID: 0000-0001-5881-3109 (A. 1); 0000-0002-9621-1392 (A. 2)
             ©️ 2021 Copyright for this paper by its authors.
             Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
             CEUR Workshop Proceedings (CEUR-WS.org)
argued that high resolution data enable individual (i.e., idiographic) learner analytics, so that learners
can gather own data and “interpret results to decide whether and how to adapt study tactics and learning
strategies”. Dawson et al. (2019) examined a large sample of students and tried early interventions
aiming at prevention of dropouts. Their findings pointed to no effect on the retention outcome. The
authors concluded that more data about individual differences are needed to better understand the
retention process as well as to design relevant personalized interventions. A recent massive scale study
that has examined a large sample of students (around 250,000) have found small benefit of a group-
based behavioral intervention despite the massive dataset. Authors concluded that the field needs
efficient interventions tailored to the individual and course context. Thus, education researchers need
to explore such individual-based approaches [7].
    This study builds on the aforementioned insights and takes inspiration from the emerging fields of
idiographic psychology and precision medicine, which have developed methods and standards for such
methods of analysis [7,8,11]. In doing so, we explore the possible potentials that one can gain from
zooming in on the fine-grained dynamics of a single student. We explore a person-specific data
collection method as well as person-specific analysis and recommendations. Using data from a single
student over 30 days, we analyze his/her dispositions and devise insights specific to him/her. Our
approach is based on the emerging trend in network science, in particular, Gaussian Graphical Models
(GGM) [10,12]. Our research question is as follows: What insights can idiographic learning analytics
reveal about students’ self-regulation and learning dispositions?

2. Background
2.1. The cognitive process as a networked system

    Representing elements of the cognitive and social processes as a network is an established research
method. Such representation has afforded researchers a way to visualize the structure of these processes
to measure the magnitude of association between their elements, and to devise statistical indices that
allow a precise interpretation of the resultant graphs [13]. In education, research on networks spans
three decades. Networks have been used to visualize the patterns of interactions in collaborative groups,
to study the roles students play in the collaboration, to rank students’ activities, or to predict
performance to mention a few examples [14–17]. While such methods have contributed enormously to
our understanding of the learning process with their repertoire of powerful visualizations methods, there
is a need for harnessing the power of other methods to extend our understanding different phenomena.

2.2.    Gaussian Graphical Models

    Recent advances in network sciences have led to the remarkable growth of probabilistic network
models, often referred to as GGM [10]. GGM map the dynamic relationships between the elements of
the cognitive or sociological phenomena we seek to understand as a complex system through the
estimation of a network where the nodes are variables and the edges are the partial correlation
coefficients between these variables [10,18–21]. Similar to multiple regression, partial correlations
estimate the correlations after controlling for all other variables in the network, thus eliminating the
possible effect of confounding variables [19]. This is particularly useful when there are multiple
dependencies, i.e., consider an example when a researcher finds a positive correlation between coffee
consumption and academic performance, such a correlation may simply be an unmeasured confounding
factor (e.g., study time that leads to more coffee drinking). Thus, in GGM networks, two nodes are
connected —if and only if— there is a covariance between these nodes that cannot be explained by any
other variable in the network [10,12,18]. The resulting networks show only the significant relationships,
the strength of such relationships, the sign (positive or negative), as well as the mediation pathways.
Such rigorous network models offer “hypothesis generating structures, which may reflect potential
causal effects to be further examined” [18]. As such, GGM offer several advantages that overcome the
shortcomings of existing methods in terms of rigorous inferential statistics, ability to control for
confounding factors, modelling the temporal evolution of the studied process. Moreover, there is a
diverse and large community working on refining and improving GGM methods.
2.3.    Graphical Vector Autoregression

    An extension of GGM methods has allowed for the modeling of temporal processes, i.e., how a
variable predicts another in the next time window. The abundance of intensive time-stamped data (time-
series) has led to the existence of enough observations of individual subjects across short periods (e.g.,
experience sampling methods, observational data and physiological data), i.e., an individual can be
studied as a unique case (N=1) [10,22]. Such time-series data are amenable to multivariate time-series
analysis, commonly known as known as vector autoregression (VAR) [10]. VAR estimates a directed
network (in contrast to undirected in GGM): the nodes are variables (e.g., motivation, behavior or
attitude) and the link between them are temporal relationships (a variable predicts another in the next
time window) [10]. This is commonly represented by drawing a directed arrow from the node that
represents the variable (e.g., motivation) to the variable that it predicts in the next time window of
measurement (e.g., engagement). An example is presented in Figure 1, which shows a temporal network
generated from a fictional individual dataset about hourly eating and exercise habits. The graph
illustrates that running predicts rest thereafter and that comfort predicts eating (weak prediction, see the
thin line). The loop around comfort means that comfort at one hour predicts that the person will be at
comfort the next hour; probably breaking the eating habits may entail keeping occupied with activities.
As shown, a temporal network predicts if a variable (an element of the studied phenomena) predicts
another in the next time window. Such type of network is used to explain within-subject covariation or
potential causal pathways.




Figure 1: A fictional temporal network of four behaviors. The circles are variables. Blue lines are
positive partial correlations. The thickness of the line is proportional to the magnitude of the
correlation. The direction of the arrow points to the direction of the temporal correlation.

3. Methods

    The study included a single student who signed an informed consent for an anonymous version of
the responses to be used for research purposes. The student was attending a course over a duration of a
month. The student had to respond to ten questions representing common dispositions and self-
regulation (SRL) that are commonly employed in learning analytics [23–25]. The questions covered the
following constructs: Expectancy value (Vlu), Motivation (Mtv), Stress as negative affect (Str), Hope
and enthusiasm as positive affect (Hop), SRL Planning (Pln), SRL Engagement with task (Tsk), SRL
Reflection and evaluation (Rfc), External Regulation by assignments (Asg), Socializing (Soc),
Challenging learning tasks (Chl).
    The survey data was detrended using the method described in [26] to make the data close to
stationary. Since our interest was to study the interplay between the student’s different dispositions, we
used the VAR model. VAR models have been established in the study of psychological phenomena,
shedding light on the temporal progression, individual aspects and dynamics of psychological processes
within individuals [10,26,27]. To understand the sequential temporal dependencies, we created a
temporal network by estimating a Graphical VAR model [26]. The temporal network captures what will
happen next as an effect of what is happening now (lag-1 or cross-lagged effects), e.g., if the person is
motivated now, the person is going to work on the task on the next step. To account for multiple
comparisons, the model was regularized using graphical least absolute shrinkage and selection operator
(GLASSO). Using GLASSO algorithm for estimating GGM networks has been shown to retrieve the
true structure of the network [26].

4. Results
    The results of the temporal network showed interesting results about the involved student (Figure 2
and Table 1). After controlling for all other variables in the network, the positive affect (feeling hope)
was the most predictive variable of engagement in a task in the next day, shown as a thick arrow between
the Hop and Tsk nodes in Figure 2 indicating the strong association. Motivation was also strongly
predictive of engagement with the task after controlling for all other variables, i.e., independent of
feeling hopeful, socializing, etc. The challenging nature of the task was also predictive of engagement
for the student, as well as stress, indicating that a bit of a challenge may help some students engage and
work on the learning activities. The expected value and relevance of the task was also predictive of the
student’s engagement with the task, emphasizing the need for creating more relevant and authentic
learning tasks.

Table 1
Values of the VAR partial correlations
                Tsk      Vlu      Mtv        Str     Hop        Pln      Rfc       Asg      Soc       Chl
 Tsk           0.00     -0.02     0.00     0.00      0.00      0.00     0.00      0.01     -0.01     0.00
 Vlu           0.16     0.00      0.03     0.00      0.00     -0.02     0.00      0.04     0.10      0.08
 Mtv           0.27     0.03      0.00     0.03      0.00     -0.07     0.03      0.00     0.28      0.00
 Str           0.17     -0.07     0.00     0.00      0.05     -0.01     0.00     -0.08     0.12      0.03
 Hop           0.29     0.06      0.00     0.00      0.00     -0.05     0.05     -0.02     0.06      0.13
 Pln          -0.06     -0.07     0.00     -0.06     0.00      0.00     0.00      0.00     -0.04    -0.09
 Rfc          -0.22     0.01      0.00     0.00      0.00      0.00     0.05      0.01     0.00      0.00
 Asg          -0.17     0.05      0.00     -0.09    -0.03      0.00     0.05      0.18     -0.17     0.01
 Soc          -0.02     0.04     -0.01     -0.03    -0.02      0.02    -0.01      0.00     0.01      0.03
 Chl           0.20     0.00      0.09     0.00      0.04      0.00    -0.09      0.00     0.00     -0.06

   Working on the assignment was negatively predictive of engagement with learning tasks, as the
student focused more on finishing the submissions. Such results also indicate that external regulation
may be counterproductive for some students. Similarly, reflection was negatively predictive of
engagement with the task the next day, which raises the question of the nature of reflection the student
has. Planning was also weekly negatively associated with engagement with the task. These negative
associations for assignment, reflection and planning are indicative of poor self-regulation practices by
the student. In fact, the student had to repeat one of the assignments as it was not fulfilling the required
guidelines and was incomplete. He also scored below the 50th percentile in the two most important
course assignments. There is room for improvement here, by helping the student learn optimal self-
regulation practices. There was a negative association between motivation and planning, while strong
positive association with socialization. Stress and assignment negatively influenced each other: the
more stress the student was under, the less he/she worked on the assignments, and the more work on
assignments the less stressed was the student, as expected. The results are detailed in Table 1. Please
note that, since partial correlations do control for other variables, their values are not to be interpreted
in the same way, as they tend to be lower.




Figure 2: Temporal network for the student

5. Discussion and conclusions

    In this study, we have used psychological network methods in the form of GGM and graphical VARs
to study a single student disposition during a course. Such idiographic method offers several advantages
over cross-sectional group level analysis. Being focused on a single student, the insights generated are
more relevant and actionable, i.e., precisely personalized, paving the path for precision education. These
methods also offer several advantages regarding controlling for confounders, deleting spurious
correlations and regularization which requires high magnitude significant correlation, offering a good
level of rigorousness [10,26,27]. The study has shown that the student under examination may be in
need to learn better self-regulation techniques regarding reflection and planning based on his own
responses. However, the value of such targeted intervention is yet to be investigated.
    The implication of our study can be the applicability of the approach in several scenarios and
contexts. Researchers who wish to apply personalized learning analytics can use such methods to design
personalized intervention for their students. We believe there is an opportunity that may change the
deserves attention and efforts from the research community to extend, improve and build on such
methods. Our methods are not without limitations. The idea that the data have to be collected on a daily
basis makes it sometimes difficult to collect data without some gaps, non-compliance, or missing
values. The rate of data collection can be tricky: we have used a lag of a single day, but we do not know
for sure if that lag was optimal. The timing of the data collection is another factor: whether data should
be collected before or after the working day is still an open question. Similarly, how frequently data
should be collected, what factors are to be included in the study, and how long we should collect the
data are aspects in need of further investigation. The collection of data comes always with problems
and risks of privacy and ethical concerns [28,29], in idiographic approach where much data is collected
it can pose a risk which needs to be mitigated [30].
6. References

[1] G. Siemens, Learning Analytics: The Emergence of a Discipline, American Behavioral Scientist.
     57 (2013) 1380–1400. https://doi.org/10.1177/0002764213498851.
[2] D. Gašević, S. Dawson, T. Rogers, D. Gasevic, Learning analytics should not promote one size
     fits all: The effects of instructional conditions in predicting academic success, The Internet and
     Higher Education. 28 (2016) 68–84. https://doi.org/10.1016/j.iheduc.2015.10.002.
[3] R. Conijn, C. Snijders, A. Kleingeld, U. Matzat, Predicting Student Performance from LMS Data:
     A Comparison of 17 Blended Courses Using Moodle LMS, IEEE Transactions on Learning
     Technologies. 10 (2017) 17–29. https://doi.org/10.1109/TLT.2016.2616312.
[4] S. Dawson, S. Joksimovic, O. Poquet, G. Siemens, Increasing the Impact of Learning Analytics,
     in: Proceedings of the 9th International Conference on Learning Analytics & Knowledge, ACM,
     New York, NY, USA, 2019: pp. 446–455. https://doi.org/10.1145/3303772.3303784.
[5] P.H. Winne, J.C. Nesbit, F. Popowich, nStudy: A System for Researching Information Problem
     Solving,      Technology,        Knowledge       and     Learning.      22     (2017)       369–376.
     https://doi.org/10.1007/s10758-017-9327-y.
[6] A.J. Fisher, J.D. Medaglia, B.F. Jeronimus, Lack of group-to-individual generalizability is a threat
     to human subjects research, Proceedings of the National Academy of Sciences of the United States
     of America. 115 (2018) E6106–E6115. https://doi.org/10.1073/pnas.1711978115.
[7] A.M. Beltz, A.G.C. Wright, B.N. Sprague, P.C.M. Molenaar, Bridging the Nomothetic and
     Idiographic Approaches to the Analysis of Clinical Data, Assessment. 23 (2016) 447–458.
     https://doi.org/10.1177/1073191116648209.
[8] P.C.M. Molenaar, C.G. Campbell, The New Person-Specific Paradigm in Psychology, Current
     Directions in Psychological Science. 18 (2009) 112–117. https://doi.org/10.1111/j.1467-
     8721.2009.01619.x.
[9] J.T. Lamiell, Toward an idiothetic psychology of personality, American Psychologist. 36 (1981)
     276–289. https://doi.org/10.1037/0003-066X.36.3.276.
[10] S. Epskamp, L.J. Waldorp, R. Mõttus, D. Borsboom, The Gaussian Graphical Model in Cross-
     Sectional and Time-Series Data, Multivariate Behavioral Research. 53 (2018) 453–480.
     https://doi.org/10.1080/00273171.2018.1454823.
[11] G. Costantini, S. Epskamp, D. Borsboom, M. Perugini, R. Mõttus, L.J. Waldorp, A.O.J. Cramer,
     State of the aRt personality research: A tutorial on network analysis of personality data in R,
     Journal of Research in Personality. 54 (2015) 13–29. https://doi.org/10.1016/j.jrp.2014.07.003.
[12] M. Saqr, O. Viberg, W. Peeters, Using Psychological Networks to Reveal the Interplay between
     Foreign Language Students’ Self-Regulated Learning Tactics, in: Proceedings of the 2020
     STELLA Symposium, 2021: pp. 12–23.
[13] M. Dado, D. Bodemer, A review of methodological applications of social network analysis in
     computer-supported collaborative learning, Educational Research Review. 22 (2017) 159–180.
     https://doi.org/10.1016/j.edurev.2017.08.005.
[14] M. Saqr, J. Nouri, U. Fors, Time to focus on the temporal dimension of learning: a learning
     analytics study of the temporal patterns of students’ interactions and self-regulation, International
     Journal        of      Technology         Enhanced        Learning.       11       (2019)        398.
     https://doi.org/10.1504/IJTEL.2019.102549.
[15] B. Chen, O. Poquet, Socio-temporal dynamics in peer interaction events, in: Proceedings of the
     Tenth International Conference on Learning Analytics & Knowledge, ACM, New York, NY, USA,
     2020: pp. 203–208. https://doi.org/10.1145/3375462.3375535.
[16] B. Chen, T. Huang, It is about timing: Network prestige in asynchronous online discussions,
     Journal of Computer Assisted Learning. 35 (2019) 503–515. https://doi.org/10.1111/jcal.12355.
[17] I. Halatchliyski, T. Hecking, T. Göhnert, H.U. Hoppe, Analyzing the Flow of Ideas and Profiles of
     Contributors in an Open Learning Community, Proceedings of the Third International Conference
     on Learning Analytics and Knowledge - LAK ’13. 1 (2013) 66–74.
     https://doi.org/10.1145/2460296.2460311.
[18] D. Hevey, Network analysis: A brief overview and tutorial, Health Psychology and Behavioral
     Medicine. 6 (2018) 301–328. https://doi.org/10.1080/21642850.2018.1521283.
[19] R. Artner, P.P. Wellingerhof, G. Lafit, T. Loossens, W. Vanpaemel, F. Tuerlinckx, The shape of
     partial correlation matrices, Communications in Statistics - Theory and Methods. 0 (2020) 1–18.
     https://doi.org/10.1080/03610926.2020.1811338.
[20] M. Hamilton, J. Clarke-Midura, J.F. Shumway, V.R. Lee, An Emerging Technology Report on
     Computational Toys in Early Childhood, Technology, Knowledge and Learning. (2019).
     https://doi.org/10.1007/s10758-019-09423-8.
[21] D. Borsboom, A network theory of mental disorders, World Psychiatry. 16 (2017) 5–13.
     https://doi.org/10.1002/wps.20375.
[22] P.C.M. Molenaar, A Manifesto on Psychology as Idiographic Science: Bringing the Person Back
     Into Scientific Psychology, This Time Forever, Measurement: Interdisciplinary Research &
     Perspective. 2 (2004) 201–218. https://doi.org/10.1207/s15366359mea0204_1.
[23] D. Tempelaar, B. Rienties, J. Mittelmeier, Q. Nguyen, Student profiling in a dispositional learning
     analytics application using formative assessment, Computers in Human Behavior. 78 (2018) 408–
     420. https://doi.org/10.1016/j.chb.2017.08.010.
[24] D. Tempelaar, B. Rienties, Q. Nguyen, Investigating learning strategies in a dispositional learning
     analytics context: The case of worked examples, ACM International Conference Proceeding
     Series. (2018) 201–205. https://doi.org/10.1145/3170358.3170385.
[25] D. Tempelaar, How Dispositional learning analytics helps understanding the worked-example
     principle, in: Proceedings 14th International Conference on Cognition and Exploratory Learning
     in Digital Age (CELDA 2017), 2017: pp. 117–124.
[26] S. Epskamp, C.D. van Borkulo, D.C. van der Veen, M.N. Servaas, A.M. Isvoranu, H. Riese, A.O.J.
     Cramer, Personalized Network Modeling in Psychopathology: The Importance of
     Contemporaneous and Temporal Connections, Clinical Psychological Science. 6 (2018) 416–427.
     https://doi.org/10.1177/2167702617744325.
[27] A.J. Fisher, J.W. Reeves, G. Lawyer, J.D. Medaglia, J.A. Rubel, Exploring the idiographic
     dynamics of mood and anxiety via network analysis, Journal of Abnormal Psychology. 126 (2017)
     1044–1056. https://doi.org/10.1037/abn0000311.
[28] M. Saqr, Big data and the emerging ethical challenges., International Journal of Health Sciences.
     11 (2017) 1–2.
[29] A. Munoz-Arcentales, S. López-Pernas, A. Pozo, Á. Alonso, J. Salvachúa, G. Huecas, An
     Architecture for Providing Data Usage and Access Control in Data Sharing Ecosystems,
     Proceedings of the 6th International Symposium on Emerging Information, Communication and
     Networks (EICN 2019). 160 (2019) 590–597. https://doi.org/10.1016/j.procs.2019.11.042.
[30] S. López-Pernas, M. Saqr, Idiographic Learning Analytics: A Within-Person Ethical Perspective,
     in: Companion Proceedings 11th International Conference on Learning Analytics & Knowledge
     (LAK21), 2021: pp. 310–315.