<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Modeling Trajectories to Understand the Delayed Completion of Sequential Curricula Undergraduate Programs</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Renato</forename><surname>Boegeholz</surname></persName>
							<email>renato.boegeholz@uach.cl</email>
							<affiliation key="aff0">
								<orgName type="department">Instituto de Informatica</orgName>
								<orgName type="institution">Universidad Austral de Chile</orgName>
								<address>
									<country key="CL">Chile</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Julio</forename><surname>Guerra</surname></persName>
							<email>jguerra@inf.uach.cl</email>
							<affiliation key="aff0">
								<orgName type="department">Instituto de Informatica</orgName>
								<orgName type="institution">Universidad Austral de Chile</orgName>
								<address>
									<country key="CL">Chile</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Eliana</forename><surname>Scheihing</surname></persName>
							<email>escheihi@inf.uach.cl</email>
							<affiliation key="aff0">
								<orgName type="department">Instituto de Informatica</orgName>
								<orgName type="institution">Universidad Austral de Chile</orgName>
								<address>
									<country key="CL">Chile</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Modeling Trajectories to Understand the Delayed Completion of Sequential Curricula Undergraduate Programs</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">C7095B886BD31C8AADB1B7DA8B24A76B</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T02:04+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Taking more time than expected to complete university degree programs is a global and known problem, and in Chile, has relevance because pressure exists to complete degrees on time. In this work, we explore academic delay in higher education programs, in particular an engineering program, and its relation with academic information summarizing the trajectory of students along with the academic program. Academic information is represented by semester-by-semester features that reflect different aspects such as performance, workload, and difficulty. Exploratory analyses of these variables reveal two orthogonal groups: performance and workload; then used to build models predicting the relative delay of a student at her 8th term at the program relative to the expected completion at 8th term. To further explore the trajectory of delay and analyze how the delay relates to other academic aspects such as term by term performance or workload, a sequential model was built. Results show different patterns of behaviors across different levels of delay in the 8th term. The methods and results of this research can be used by educational institutions or the government to support its decisions about the use of resources and attrition rates reduction.</p><p>Keywords: academic analytics • curricular analytics • learning analytics, • educational data mining • academic trajectories • time to degree.</p><p>1 The ratio between the average time it takes for students to graduate; and the theoretical duration of the study program</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Taking more time than expected to complete university degree programs is a global and known problem. The average time to degree 1 is 1.36 <ref type="bibr" target="#b0">[1]</ref> in Latin America, and 1.31 in Chile, a number that has not experienced a relevant variation in the last 10 years. This statistic not only means that obtaining a degree takes on average about 31% more time than expected but also reveals that career delays are a serious problem with economic considerations: this situation causes in Chile an additional expense for families and the government estimated at US$ 500 million <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b2">3]</ref>. Delay in completing a degree program may have multiple causes such as the failure and repetition of subjects, temporary suspension of studies, or assuming less course load than expected according to the curricular plan <ref type="bibr" target="#b3">[4]</ref>. Special importance is given to academic reasons behind academic delay because pressure exists to complete degrees on time: high education in Chile is financed by the student (or her family) with the help of scholarships or other funding benefits that require good performance and usually do not tolerate delays <ref type="bibr" target="#b4">[5]</ref>.</p><p>In this work, we explore academic delay in higher education programs, in particular an engineering program, and its relation with academic information summarizing the trajectory of students along with the academic program. While similar work focused in performance variables such as grades <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b16">17,</ref><ref type="bibr" target="#b18">19]</ref>, we include other relevant aspects of the academic information such as the course load taken by the student each term, the difficulty associated with the courses taken, and repetition of courses which academic situation may determine the risk of punitive actions (for example, elimination of the study program by failing a subject more than twice, or failing, in the same semester, more than two subjects), among others. The first research question of this work is:</p><p>1) What is the relation between delay in obtaining the degree and academic information along the trajectory of the student?</p><p>We build prediction models on academic delay considering different features that can describe academic trajectories. As mentioned before, we seek to represent such trajectories in terms of different academic information spanning performance, course workload, and difficulty. These academic factors are relevant not only because they could predict academic delay, but because they could characterize the trajectories in an actionable manner, that is, further analysis of such trajectories could bring insights that could support counseling practices and curricular re-design actions. Thus, we state a second research question:</p><p>2) Can the academic trajectories relate to delay be characterized in a manner that provides information about student behavior?</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Related Work</head><p>Researchers have used different approaches to analyze academic information and typically centered around performance measures. Based on grading data and dismissing the background characteristics of the students, <ref type="bibr" target="#b8">[9]</ref> identified curriculum subjects that can serve as effective indicators of academic performance. Using X-means <ref type="bibr" target="#b6">[7]</ref> to group students yearly, <ref type="bibr" target="#b8">[9]</ref> discovered typical progress patterns and evaluated the predictive capacity of the explanatory subjects. Considering the results in an entrance self-assessment test and the academic performance of the first year, <ref type="bibr" target="#b10">[11]</ref> used K-means <ref type="bibr" target="#b9">[10]</ref> to group students and follow their performance trajectories in the following 2nd and 3rd year, measuring the influence of the first year behavior in the progression of the curriculum <ref type="bibr" target="#b10">[11]</ref>. <ref type="bibr" target="#b5">[6]</ref> explored multiple personal and social factors that can affect the academic performance of university students and, using the Grade Point Average (GPA) as an explained variable through decision trees <ref type="bibr" target="#b13">[14]</ref> proposed a qualitative model to classify and predict it <ref type="bibr" target="#b5">[6]</ref>.</p><p>Dropout has also been investigated. Aiming to relieve dropout, <ref type="bibr" target="#b15">[16]</ref> used recommendation systems techniques to predict the grades that students will obtain in future subjects. The predictions were made using personalized multiple linear regressions (PLMR) <ref type="bibr" target="#b14">[15]</ref> on student participation data in both traditional classes and Massive Open Online Courses (MOOC) <ref type="bibr" target="#b15">[16]</ref>. <ref type="bibr" target="#b12">[13]</ref> examined demographic variables as family characteristics; pre-university and university academic performance factors; and the participation or not in recovery courses, to predict the persistence of the students in the study programs. Using as explanatory variables the scores of the ACT standardized test for college admission, the average of grades in high school, the average grades of the first semester of the university; and using analysis of variance (ANOVA), Pearson product-moment correlations, and multiple regression analysis <ref type="bibr" target="#b11">[12]</ref> showed that students who were academically prepared to take college-level courses were more likely to persist than students assigned to mandatory recovery courses <ref type="bibr" target="#b12">[13]</ref>.</p><p>Researchers have also focused on performing analyses of students'trajectories to inform the design of curricula. <ref type="bibr" target="#b16">[17]</ref> used student performance data from a specific program to perform an analysis of the curriculum design of the program. In particular, they modeled the difficulty of each subject as its contribution (negative or positive) to the students' GPA and then contrasting this measure to a survey of student perception. Using the same performance data they also performed a dropout and enrollment path analysis <ref type="bibr" target="#b16">[17]</ref>. Important to consider is that most of the previous work has been carried related to a flexible course-credit systems where the degrees are obtained as the sum of core and optional approved subjects <ref type="bibr" target="#b7">[8,</ref><ref type="bibr" target="#b10">11,</ref><ref type="bibr" target="#b12">13,</ref><ref type="bibr" target="#b18">19]</ref>. This context differs from our sequential-non-flexible curricula, where the flow of subjects to take is pre-defined for all the terms of the program.</p><p>The representation of the academic trajectories of the students is not trivial, and it is necessary to consider the temporal dependence of the variables under study. Following this idea, <ref type="bibr" target="#b18">[19]</ref> used frequent pattern mining <ref type="bibr" target="#b17">[18]</ref> to reveal academic trajectories to understand the sequences of subjects to take that could improve student performance. In <ref type="bibr" target="#b19">[20]</ref>, a sequential data model is proposed that explicitly captures the temporal dependencies of the academic performance characteristics to form fingerprints or signatures that are constructed allowing different analytical interpretations and the development of predictive models for the risk of academic delay.</p><p>As presented, several of the proposed models -all of the regression type-considered the performance of the students as explanatory variables, without taking into account the dependence that exists in the performance of a student in var-ious subjects within the same semester, as well as in successive semesters. Our work distinguished from previous work in several aspects: in its multivariate nature considers in addition to academic performance aspects such as the academic workload and the difficulty of the subjects in each semester; incorporating the temporal dimension in the modeling of the students' trajectories; and exploring such trajectories in the context of a curriculum with a sequential structure, where about 90% of the courses to be completed are mandatory.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Academic Features and Data Descriptions</head><p>Academic information at the level of the degree program includes data of courses taken, passed, failed (with their grades), and dropped in each term of the academic life of the student. We combine these records with historic data and the expected curricular progress<ref type="foot" target="#foot_0">2</ref> to generate a series of academic features in each term of the student academic life. These features represent different dimensions related to academic performance, academic workload (courseload), the relative difficulty of the courses, and the consistency between the courses taken and their theoretical order in the curriculum. Nine features for each term of each student were computed and are defined in Table <ref type="table" target="#tab_0">1</ref>. More details about the definition of the variables can be found in Appendix A.</p><p>We understand by academic trajectory to all this information organized in a term by term sequence. To allow comparisons between trajectories, we considered only the activity of the first 8 semesters (8 terms) for each student, counted from their first enrollment. These 8 semesters also represent a milestone of the study program because contain the required subjects to obtain the Licenciatura degree <ref type="foot" target="#foot_1">3</ref> . Considering this, the explained variable will be the delay in the 8th term (DELAY8), which measures how far is the student of having completed all courses of the first 8 terms of the plan in her first 8 terms of academic life. A student who passed all planned courses of the first 8 semesters of the program in her first 8 semesters of academic life, has delay zero.</p><p>To reduce inconsistencies in the comparisons of the trajectories (because of the dependency between the features and the structure of the program curricula), it was decided to analyze only one study program: Engineering in Computer Science. For the period of available data, this program implements three curriculum versions (2008, 2010, and 2015) each with 11 semesters of duration and an average of 6 subjects per semester. To study the performance of the students during their first 8 semesters, only whose admitted between 2008 and 2015 were DIF F IC A Difficulty of the semester, as an additive measure ("Alpha difficulty").</p><p>[0, max(HF Rj)SU Bi] DIF F IC B Difficulty of the semester, as a geometric measure ("Beta difficulty").</p><p>[0, 1]</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>DISP AR</head><p>The disparity of subjects enrolled in the semester: the difference, in semesters, between the highest level subject and the lowest level subject.</p><p>[0, 1]</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>DELAY</head><p>Measurement of the delay between the theoretical and actual (average) semester of the student given their date of admission.</p><p>[0, 1]</p><p>considered. The resulting data set was composed of 14,199 records of academic activity of 365 different students.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Data Modeling</head><p>We are interested in modeling the behavior of the 8th-semester delay (DELAY8) as a function of the other eight features defined for each semester. We limit the scope of the independent variables (the features) to the first 4 semesters because of two reasons. First, the idea of predicting delay (and also predicting dropout) gain relevance if a prediction can be done early, thus it seems reasonable to predict a delay in the 8th semester with information from the four first terms. Second, the four initial semesters correspond to the "Bachillerato" milestone in the engineering programs of the university, where the foundational courses of math and physics are concentrated and which have the higher failure rates, thus are strongly related with academic failure or success. The analyses will be performed in three steps. The first step is to perform an exploratory data analysis (EDA) for all the variables. We summarize the main characteristics of each variable (mean, median, quantiles, and range) together with their box plots, following with correlation matrix and principal component analysis (PCA) to gain an understanding of the structure of the set of variables and identify the most significant variables which can explain the academic delay.</p><p>The second step includes building predictive models using two supervised algorithms, linear regression (LR) and support vector machine (SVM) on the delay at 8th semester with other features of the first 4 semesters, such as difficulty and workload as predictors. These analyses target research question 1.</p><p>The third step is to characterize academic trajectories in terms of the relation of the term features and delay. An adaptation of the sequential model proposed by <ref type="bibr" target="#b19">[20]</ref> is implemented. In our case, this model is built as follows. Each student trajectory is represented as a sequence of 4 nodes (semesters 1 to 4), where each node is a single value representing a delay score. To compute the i semester's delay score, first all students are clustered using k-means on their i semester's features. Then the score of the i semesters is the average delay of all students in the cluster (cluster members).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Results and Discussion</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">Exploratory Data Analysis</head><p>We explored the behavior of all the features in their semester-by-semester progression. Box plots and summary statistics for the variables can be found in Appendix B. Figure <ref type="figure" target="#fig_0">2</ref> presented here as a sample shows box plots of the features GPA (not cumulative weighted grade average) and DIFFIC A (an additive measure of difficulty) for the first 8 semesters.</p><p>From the analysis can be observed that the DELAY variable shows distributions with increasing medians and variability through the terms, as students on average accumulate more delay as they stay more terms. It can be observed that the distributions of the average grade (GPA) have medians that increase slightly but consistently through the terms. PASSRATE shows the lowest median in the 2nd semester with a value of 0.5. In the following semesters, the values of the medians increase progressively meanwhile the median passing rate for the 1st semester is much higher than the rest, with a value of 0.7. Dispersion is similar in all semesters.</p><p>In the case of the academic workload (WKLOAD), the median distribution in semesters 2nd to 4th is below the academic workload defined by the curriculum and it approaches that value in semester 5. The dispersion of these distributions increases between the 6th and 8th semesters. The relative difficulty, both DIFFIC A, and DIFFIC B, show medians that descend as students progress in their semesters, with quite similar dispersion. The disparity (DIS-PAR) shows only two median values: 0.143 for semesters 2 through 5; and 0.286, for semesters 6 through 8. Dispersion appears biased (positively or negatively) for all semesters, except for semester 8 in which symmetry is appreciated.</p><p>Correlation matrices and principal component analysis (PCA) were obtained to explore options for reducing the set of variables. The results of the analyzes for semester 3 are described in Figure <ref type="figure" target="#fig_1">4</ref>. Biplots and correlograms for all the features can be found in Appendix B. In all semesters it is observed that the delay is negatively correlated with the variables of performance, curricular progress, and academic load, namely: GPA, PASSRATE, PROGRE, AVG, and WKLOAD. On the other hand, it has a very weak positive correlation with the measures of difficulty of the subjects (DIFFIC A and DIFFIC B) and weak negative with the measure of disparity (DIS). Two main components of the PCA shows two groups of variables: i) those that are more related to the individual performance of the students: AVG, PASSRATE, PROGRE and GPA, and ii) those which are related to the characteristics of the study program: WKLOAD, DIFFIC A, and DIFFIC B. It is interesting the distinct components represented by performance and workload. To represent each of the groups in the subsequent analyzes of this work, we selected GPA and DIFFIC A respectively based on a better degree of interpretation than they may have compared to the other features. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">Prediction Models</head><p>To analyze the predictive capacity of the explanatory variables, two families of models where built. The SGD i models use predictors GP A i and DIF F IC A i adding them incrementally from first to fourth semester. In this way, SGD1 implements:</p><formula xml:id="formula_0">DELAY 8 ∼ GP A 1 + DIF F IC A 1 SGD2 implements: DELAY 8 ∼ GP A 1 + DIF F IC A 1 + GP A 2 + DIF F IC A 2</formula><p>and so on. The SAi models use as predictors all the features in each semester in the same incremental way. Thus, SA1 implements:</p><formula xml:id="formula_1">DELAY 8 ∼ GP A 1 +P ASSRAT E 1 +W KLOAD 1 +DIF F IC A 1 + DIF F IC B 1 + DISP AR 1 + F IRST IM E 1 + P ROGRE 1 SA2 implements: DELAY 8 ∼ GP A 1 +P ASSRAT E 1 +W KLOAD 1 +DIF F IC A 1 + DIF F IC B 1 +DISP AR 1 +F IRST IM E 1 +P ROGRE 1 +GP A 2 +P ASSRAT E 2 + W KLOAD 2 + DIF F IC A 2 + DIF F IC B 2 + DISP AR 2 + F IRST IM E 2 + P ROGRE 2</formula><p>and so forth. Table <ref type="table" target="#tab_1">2</ref> summarizes the results obtained. Regressions were built using a Generalized Linear Model (GLM) with Gaussian errors and an identity link function. The corrected Akaike Information Criterion (AICc) of every model and the difference with the lower value was computed. We can observe that this indicator declines, as expected, with the inclusion of the variables of the consecutive semesters. An important finding is that the predictive power in the first two semesters is quite similar between the model with two variables and the model with all variables. In contrast, by including semesters 3 and 4, the predictive power of the complete model is much greater. In the case of SVM, the RSME, RSq, and MAE indicators are calculated and ratify the observed with AICc. In particular, with the complete model until the fourth semester, an RSq of 0.805 is obtained which is high.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3">Characterization of the Delay Trajectories</head><p>Trajectories were built for each student as a sequence of 4 values, one per each of the 4 first terms, each representing the average delay of all students who have similar academic features in a semester. To do this, we performed a clustering at each semester with all its features. Each term, the student falls into one cluster, from which the average delay marks his/her delay trajectory. The number of clusters obtained varied from 3 to 4.</p><p>Figure <ref type="figure" target="#fig_5">5a</ref> shows the trajectories for all students who completed 8 semesters. To understand the delay behavior along time, the trajectories were presented into 3 groups: those students who reached their 8th semester with 2 semesters of delay or less (DELAY 8 ≤ 0.29), students who reached the 8th semester with a delay between 2 and 4 semesters (0.29 &lt; DELAY 8 ≤ 0.57), and students who have a delay of more than 4 semesters (DELAY 8 &gt; 0.57).</p><p>Figures <ref type="figure" target="#fig_5">5b to 5d</ref> shows the trajectories for every one of those groups. It can be observed -as in Figure <ref type="figure" target="#fig_5">5a</ref>-that for the first semester, the prediction of delay for all students is 2 or 4 semesters.</p><p>Most of the students who complete their 8th semester with a delay lower or equal than 2 semesters (Figure <ref type="figure" target="#fig_5">5b</ref>), were projected with 2 semesters or less of delay throughout the entire program.</p><p>On the other hand, students who finished the 8th semester with a delay greater than 4 semesters (Figure <ref type="figure" target="#fig_5">5d</ref>), maintained similar forecasts during the 4 semesters under study.</p><p>It can be seen that in both groups, the less and more delayed, the proportion of "good" and "bad" results for their 1st semester is quite similar, which would make the 1st semester a not reliable indicator of the final delay result.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Conclusions</head><p>In this research paper, we applied statistical and data mining methods to understand the different behaviors in the progression of students across a sequential curriculum program related to delay in obtaining a degree. First, features that summarize different aspects of the academic records information such as performance, workload, and course difficulty were built for each term a student stays in the academic program. A measure of the delay was custom-made in relation to the expected progress at term 8th. Second, we performed an exploratory data analysis of the features which revealed two different sets of variables that appeared orthogonal within the two principal components of a PCA: a group with all performance variables, and a group with workload and difficulty. The weighted average grade (GPA) and one measure of difficulty (DIFFIC A) were selected to represent these groups. Third, the predictive capacity of the features was explored, revealing that the selected two variables can predict the 8th-semester delay close enough as a model using all predictors. Fourth, delay sequences were modeled and represented as trajectories showing distinguishable groups of students'behavior. The evidence provided shows that the delay is not strictly determined by the students'performance during their first semester. Low initial performances can follow a path of progressive improvement and reduce their potential delay while ends the program. In conclusion, this work provides valuable insight into a better understanding of the dynamics of the progress in a sequential curricular program, potentially contributing to the decision-making of institutions, directors, and students.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>CT S j</head><p>= The number of CTS credits of the subject j. CT S Avg = The average number of CTS credits per semester of the study program. HF R j = The historical failure rate of the subject j.               </p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 2 :</head><label>2</label><figDesc>Fig. 2: Variability in the samples across the 8 semesters for (a) The not cumulative weighted grade average (GPA) and (b) The alpha difficulty (DIFFIC A).</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 4 :</head><label>4</label><figDesc>Fig. 4: (a) Biplot and (b) correlogram of the 3rd-semester variables.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Fig. 6 :</head><label>6</label><figDesc>Fig. 6: Delay trajectories for (a) All the students. (b) Students with an 8th-semester delay less or equal than 2 semesters. (c) Students with an 8th-semester delay between 2 and 4 semesters. (d) Students with an 8th-semester delay greater than 4 semesters.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Fig. B. 3 :</head><label>3</label><figDesc>Fig. B.3: Variability in samples for the first-time-enrolled subjects rate (FIRSTIME), over the 8 semesters.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Fig. B. 4 :</head><label>4</label><figDesc>Fig. B.4: Variability in the samples for the academic workload (WKLOAD), throughout the 8 semesters.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Fig. B. 5 :</head><label>5</label><figDesc>Fig. B.5: Variability in the samples for the alpha difficulty (DIFFIC A), throughout the 8 semesters.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_6"><head>Fig. B. 7 :</head><label>7</label><figDesc>Fig. B.7: Variability in the samples for the disparity in the semester (DISPAR), throughout the 8 semesters.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_7"><head>Fig. B. 8 :</head><label>8</label><figDesc>Fig. B.8: Variability in the samples for the delay related to the curriculum (DELAY), throughout the 8 semesters.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_8"><head>Fig. B. 11 :</head><label>11</label><figDesc>Fig. B.11: Biplot and correlogram for semester 1.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_9"><head>Fig. B. 13 :</head><label>13</label><figDesc>Fig. B.13: Biplot and correlogram for semester 2.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_10"><head>Fig. B. 15 :</head><label>15</label><figDesc>Fig. B.15: Biplot and correlogram for semester 3.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_11"><head>Fig. B. 17 :</head><label>17</label><figDesc>Fig. B.17: Biplot and correlogram for semester 4.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_12"><head>Fig. B. 19 :</head><label>19</label><figDesc>Fig. B.19: Biplot and correlogram for semester 5.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_13"><head>Fig. B. 21 :</head><label>21</label><figDesc>Fig. B.21: Biplot and correlogram for semester 6.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_14"><head>Fig. B. 23 :</head><label>23</label><figDesc>Fig. B.23: Biplot and correlogram for semester 7.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_15"><head>Fig. B. 25 :</head><label>25</label><figDesc>Fig. B.25: Biplot and correlogram for semester 8.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 :</head><label>1</label><figDesc>Description and possible values of the features built for this study. Definition of the variables can be found in Appendix A.</figDesc><table><row><cell>Feature</cell><cell>Description</cell><cell>Possible val-</cell></row><row><cell></cell><cell></cell><cell>ues</cell></row><row><cell>GP A</cell><cell cols="2">Not cumulative weighted grade average for the semester. [1.0, 7.0] with 4.0</cell></row><row><cell></cell><cell></cell><cell>the passing grade</cell></row><row><cell cols="2">P ASSRAT E Passing rate of the semester (the ratio of passed subjects</cell><cell>[0, 1]</cell></row><row><cell></cell><cell>to passed plus failed subjects).</cell><cell></cell></row><row><cell cols="2">F IRST IM E The proportion of subjects enrolled for the very first time</cell><cell>[0, 1]</cell></row><row><cell></cell><cell>in the semester.</cell><cell></cell></row><row><cell>P ROGRE</cell><cell>Contribution of subjects passed in the semester to the</cell><cell>[0, 1]</cell></row><row><cell></cell><cell>total of subjects required to obtain the degree.</cell><cell></cell></row><row><cell>W KLOAD</cell><cell>Academic workload rate for the semester measured in</cell><cell></cell></row><row><cell></cell><cell>CST to the average semester CST of the program.</cell><cell></cell></row></table><note>[0, LENP rog ]</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 :</head><label>2</label><figDesc>Summary measures of the predictive capacity of the models. K indicates the number of predictors in the model.</figDesc><table><row><cell></cell><cell></cell><cell>Regression</cell><cell></cell><cell>SVM</cell><cell></cell><cell></cell></row><row><cell>Model</cell><cell>K</cell><cell>AICc</cell><cell>∆AICc</cell><cell>RMSE</cell><cell>RSq</cell><cell>MAE</cell></row><row><cell>SGD1</cell><cell>2</cell><cell>-123.430</cell><cell>98.913</cell><cell>0.154</cell><cell>0.588</cell><cell>0.127</cell></row><row><cell>SGD2</cell><cell>4</cell><cell>-146.596</cell><cell>75.746</cell><cell>0.142</cell><cell>0.645</cell><cell>0.114</cell></row><row><cell>SGD3</cell><cell>6</cell><cell>-161.323</cell><cell>61.020</cell><cell>0.139</cell><cell>0.679</cell><cell>0.116</cell></row><row><cell>SGD4</cell><cell>8</cell><cell>-177.462</cell><cell>44.881</cell><cell>0.137</cell><cell>0.688</cell><cell>0.115</cell></row><row><cell>SA1</cell><cell>8</cell><cell>-118.020</cell><cell>104.322</cell><cell>0.153</cell><cell>0.562</cell><cell>0.127</cell></row><row><cell>SA2</cell><cell>16</cell><cell>-153.047</cell><cell>69.296</cell><cell>0.140</cell><cell>0.657</cell><cell>0.117</cell></row><row><cell>SA3</cell><cell>24</cell><cell>-180.135</cell><cell>42.208</cell><cell>0.122</cell><cell>0.738</cell><cell>0.102</cell></row><row><cell>SA4</cell><cell>32</cell><cell>-222.343</cell><cell>0</cell><cell>0.106</cell><cell>0.805</cell><cell>0.088</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table of</head><label>of</label><figDesc>Fig. B.4: Academic workload (WKLOAD)</figDesc><table><row><cell cols="8">Semester N Mean St. Dev. Min Pctl(25) Median Pctl(75) Max</cell></row><row><cell>1</cell><cell>365 1.036</cell><cell>0.108</cell><cell>0.131</cell><cell>1.067</cell><cell>1.067</cell><cell>1.067</cell><cell>1.067</cell></row><row><cell>2</cell><cell>337 0.770</cell><cell>0.226</cell><cell>0.000</cell><cell>0.667</cell><cell>0.800</cell><cell>0.984</cell><cell>1.133</cell></row><row><cell>3</cell><cell>280 0.754</cell><cell>0.196</cell><cell>0.000</cell><cell>0.656</cell><cell>0.767</cell><cell>0.875</cell><cell>1.167</cell></row><row><cell>4</cell><cell>241 0.713</cell><cell>0.251</cell><cell>0.000</cell><cell>0.600</cell><cell>0.733</cell><cell>0.885</cell><cell>1.148</cell></row><row><cell>5</cell><cell>204 0.797</cell><cell>0.253</cell><cell>0.000</cell><cell>0.667</cell><cell>0.867</cell><cell>0.942</cell><cell>1.333</cell></row><row><cell>6</cell><cell>191 0.832</cell><cell>0.316</cell><cell>0.000</cell><cell>0.633</cell><cell>0.918</cell><cell>1.049</cell><cell>1.333</cell></row><row><cell>7</cell><cell>164 0.827</cell><cell>0.203</cell><cell>0.000</cell><cell>0.700</cell><cell>0.900</cell><cell>0.938</cell><cell>1.167</cell></row><row><cell>8</cell><cell>158 0.926</cell><cell>0.289</cell><cell>0.000</cell><cell>0.738</cell><cell>0.967</cell><cell>1.167</cell><cell>1.500</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table of</head><label>of</label><figDesc></figDesc><table><row><cell></cell><cell></cell><cell cols="5">Fig. B.5: Difficulty (alpha) (DIFFIC A)</cell><cell></cell></row><row><cell cols="8">Semester N Mean St. Dev. Min Pctl(25) Median Pctl(75) Max</cell></row><row><cell>1</cell><cell>365 2.009</cell><cell>0.331</cell><cell>0.053</cell><cell>1.822</cell><cell>2.248</cell><cell>2.248</cell><cell>2.248</cell></row><row><cell>2</cell><cell>324 1.914</cell><cell>0.386</cell><cell>0.465</cell><cell>1.737</cell><cell>2.005</cell><cell>2.209</cell><cell>2.615</cell></row><row><cell>3</cell><cell>277 1.741</cell><cell>0.340</cell><cell>0.742</cell><cell>1.488</cell><cell>1.830</cell><cell>2.030</cell><cell>2.610</cell></row><row><cell>4</cell><cell>235 1.552</cell><cell>0.490</cell><cell>0.155</cell><cell>1.357</cell><cell>1.627</cell><cell>1.876</cell><cell>2.484</cell></row><row><cell>5</cell><cell>201 1.490</cell><cell>0.472</cell><cell>0.096</cell><cell>1.151</cell><cell>1.552</cell><cell>1.846</cell><cell>2.484</cell></row><row><cell>6</cell><cell>184 1.336</cell><cell>0.487</cell><cell>0.107</cell><cell>1.018</cell><cell>1.285</cell><cell>1.747</cell><cell>2.450</cell></row><row><cell>7</cell><cell>163 1.307</cell><cell>0.509</cell><cell>0.450</cell><cell>0.805</cell><cell>1.297</cell><cell>1.772</cell><cell>2.415</cell></row><row><cell>8</cell><cell>154 1.068</cell><cell>0.436</cell><cell>0.400</cell><cell>0.747</cell><cell>1.014</cell><cell>1.299</cell><cell>2.443</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_0">In Chile most of higher education programs have a semi-flexible curricular plan, where the study sequence in pre-defined term by term.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_1">In Chile, "Licenciatura" is similar to a bachelor's degree. To obtain this grade is necessary to complete 8 semesters of subjects that are part of an academic major. This degree allows you to continue an academic career. To be qualified for professional practice there are necessary between 2</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_2">and 4 semesters of additional subjects.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Acknowledgments</head><p>Work funded by Universidad Austral de Chile and the LALA project (grant no. 586120-EPP-1-2017-1-ES-EPPKA2-CBHE-JP). This project has been funded with support from the European Commission. This publication reflects only the views of the authors, and the Commission cannot be held responsible for any use which may be made of the information contained therein.</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Appendix A Definition of Explanatory Variables</head><p>The following equations describe how the features were built for each semester i of the student's stay and every subject j enrolled in that semester:</p><p>(1)</p><p>Where:</p><p>LEN P rog = The total number of semesters of the study program. SU B P rog = The total number of subjects to be completed in the program to obtain the degree.</p><p>CT S P rog = The total number of CTS credits of the study program. SU B i = The number of subjects enrolled in semester i. SU B1T i = The number of subjects enrolled in the semester i for the very first time. P ASS i = The number of passed subjects in semester i. F AIL i = The number of failed subjects in semester i. GRA j = The final grade obtained by the student in the subject j. SEM j = Semester in which the subject j is located within the study program.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>AV G i</head><p>= Average semester in which the student is, with   </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Appendix B Exploratory Data Visualization</head></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">At a Crossroads: Higher Education in Latin America and the Caribbean</title>
		<author>
			<persName><forename type="first">M</forename><surname>Ferreyra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Avitabile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Botero</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Paz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Urzúa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Directions in Development</title>
				<meeting><address><addrLine>Washington, DC</addrLine></address></meeting>
		<imprint>
			<publisher>World Bank</publisher>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
	</analytic>
	<monogr>
		<title level="m">SIES Servicio de Información de Educacion Superior del Ministerio de Educación de Chile</title>
		<title level="s">Informe Duracion Real y Sobreduracion de las carreras de Educación Superior</title>
		<imprint>
			<date type="published" when="2013">2018. 2013-. 2017</date>
		</imprint>
	</monogr>
	<note>Spanish</note>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m">Estimación del gasto fiscal y familiar para financiar la sobre-duración de los estudiantes en las carreras: caso chileno (Spanish)</title>
				<imprint>
			<publisher>Aequalis Foro de Educación Superior</publisher>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<author>
			<persName><forename type="first">E</forename><surname>Himmel</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Modelo de análisis de la deserción estudiantil en la educación superior (Spanish)</title>
				<imprint>
			<date type="published" when="2002">2002. 2018</date>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="page" from="91" to="108" />
		</imprint>
		<respStmt>
			<orgName>Facultad de Educación/ Pontificia Universidad Católica de Chile</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Factores asociados al logro cognitivo de los estudiantes de America Latina y el Caribe (Spanish)</title>
		<author>
			<persName><forename type="first">E</forename><surname>Trevino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Valdes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Castro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Costilla</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Pardo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Donoso Rivas</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2010">2010</date>
			<publisher>OREALC/UNESCO</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Educational data mining and students&apos; performance prediction</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">A</forename><surname>Saa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of Advanced Computer Science and Applications</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="212" to="220" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">X-means: Extending k-means with efficient estimation of the number of clusters</title>
		<author>
			<persName><forename type="first">D</forename><surname>Pelleg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">W</forename><surname>Moore</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Icml</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="727" to="734" />
			<date type="published" when="2000">2000</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Pathways to completion: Patterns of progression through a university degree</title>
		<author>
			<persName><forename type="first">R</forename><surname>Robinson</surname></persName>
		</author>
		<idno type="DOI">10.1023/B:HIGH.0000009803.70418.9c</idno>
	</analytic>
	<monogr>
		<title level="j">Higher Education</title>
		<imprint>
			<biblScope unit="volume">47</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="1" to="20" />
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Analyzing undergraduate students&apos; performance using educational data mining</title>
		<author>
			<persName><forename type="first">R</forename><surname>Asif</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Merceron</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">A</forename><surname>Ali</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">G</forename><surname>Haider</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computers &amp; Education</title>
		<imprint>
			<biblScope unit="volume">113</biblScope>
			<biblScope unit="page" from="177" to="194" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Some methods for classification and analysis of multivariate observations</title>
		<author>
			<persName><forename type="first">J</forename><surname>Macqueen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the fifth Berkeley symposium on mathematical statistics and probability</title>
				<meeting>the fifth Berkeley symposium on mathematical statistics and probability</meeting>
		<imprint>
			<date type="published" when="1967">1967</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="281" to="297" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">The Influence of First Year Behaviour in the Progressions of University Students. Computers Supported Education. CSEDU</title>
		<author>
			<persName><forename type="first">R</forename><surname>Campagni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Merlini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">C</forename><surname>Verri</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Communications in Computer and Information Science</title>
		<imprint>
			<biblScope unit="volume">865</biblScope>
			<date type="published" when="2017">2018. 2017</date>
			<publisher>Springer</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">Designing experiments and analyzing data: a model comparison perspective</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">E</forename><surname>Maxwell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">D</forename><surname>Delaney</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2004">2004</date>
			<publisher>Lawrence Erlbaum Associates, Inc</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Factors influencing college persistence for first-time students</title>
		<author>
			<persName><forename type="first">S</forename><surname>Stewart</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">H</forename><surname>Lim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Kim</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Developmental Education</title>
		<imprint>
			<biblScope unit="page" from="12" to="20" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Induction of decision trees</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">R</forename><surname>Quinlan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Machine learning</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="81" to="106" />
			<date type="published" when="1986">1986</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">H</forename><surname>Kutner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">J</forename><surname>Nachtsheim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Neter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Li</surname></persName>
		</author>
		<title level="m">Applied linear statistical models</title>
				<meeting><address><addrLine>Boston</addrLine></address></meeting>
		<imprint>
			<publisher>McGraw-Hill Irwin</publisher>
			<date type="published" when="2005">2005</date>
			<biblScope unit="volume">5</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Predicting student performance using personalized analytics</title>
		<author>
			<persName><forename type="first">A</forename><surname>Elbadrawy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Polyzou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Ren</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Sweeney</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Karypis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Rangwala</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computer</title>
		<imprint>
			<biblScope unit="volume">49</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="61" to="69" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Curricular Design Analysis: A Data-Driven Perspective</title>
		<author>
			<persName><forename type="first">G</forename><surname>Mendez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Ochoa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Chiluiza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>De Wever</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Learning Analytics</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="84" to="119" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Mining association rules between sets of items in large databases</title>
		<author>
			<persName><forename type="first">R</forename><surname>Agrawal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Imieliński</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Swami</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Acm sigmod record</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="207" to="216" />
			<date type="published" when="1993">1993</date>
			<publisher>ACM</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Identifying Course Trajectories of High Achieving Engineering Students through Data Analytics</title>
		<author>
			<persName><forename type="first">O</forename><surname>Almatrafi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Johri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Rangwala</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Lester</surname></persName>
		</author>
		<idno type="DOI">10.18260/p.25519</idno>
	</analytic>
	<monogr>
		<title level="m">ASEE Annual Conference and Exposition</title>
				<meeting><address><addrLine>New Orleans, Louisiana</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">A Sequence Data Model for Analyzing Temporal Patterns of Student Data</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Mahzoon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">L</forename><surname>Maher</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Eltayeby</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Dou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Grace</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Learning Analytics</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="55" to="74" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">PM2: a Process Mining Project Methodology</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">L</forename><surname>Van Eck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Lu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">J J</forename><surname>Leemans</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">M P</forename><surname>Van Der Aalst</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">CAiSE 2015</title>
				<imprint>
			<date type="published" when="2015">2015. 9097</date>
			<biblScope unit="page" from="297" to="313" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
