<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>An early warning dropout model in higher education degree programs: A case study in Ecuador</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Vanessa Heredia-Jimenez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alberto Jimenez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Margarita Ortiz-Rojas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jon Imaz Mar n</string-name>
          <email>jimaz@pa.uc3m.es</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pedro Manuel Moreno-Marcos</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pedro J. Mun~oz-Merino</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Carlos Delgado Kloos</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Escuela Superior Politecnica del Litoral, ESPOL, Information Technology Center</institution>
          ,
          <addr-line>Guayaquil</addr-line>
          ,
          <country country="EC">Ecuador</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Universidad Carlos III de Madrid</institution>
          ,
          <addr-line>Madrid, Espan~a</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Worldwide, a signi cant concern of universities is to reduce academic dropout rate. Several initiatives have been made to avoid this problem; however, it is essential to recognize at-risk students as soon as possible. In this paper, we propose a new predictive model that can identify the earliest moment of dropping out of a student of any semester in any undergraduate course. Unlike most available models, our solution is based on academic information alone, and our evidence suggests that by ignoring socio-demographics or pre-college entry information, we obtain more reliable predictions, even when a student has only one academic semester nished. Therefore, our prediction can be used as part of an academic counseling tool providing the performance factors that could in uence a student to leave the institution. With this, the counselors can identify those students and take better decisions to guide them and nally, minimize the dropout in the institution. As a case study, we used the students' data of all undergraduate programs from 2000 until 2019 from a public high education university in Ecuador.</p>
      </abstract>
      <kwd-group>
        <kwd>Data mining</kwd>
        <kwd>Dropout prediction</kwd>
        <kwd>Early detection</kwd>
        <kwd>Algo- rithm</kwd>
        <kwd>Learning analytics</kwd>
        <kwd>Higher education</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Worldwide, one main concern in all Higher Education Institutions (HEI) is
student dropout [
        <xref ref-type="bibr" rid="ref12 ref19">12, 19</xref>
        ]. For instance, according to the Organization for Economic
Cooperation and Development (OECD), the dropout rate is about 24% [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
      </p>
      <p>Copyright c 2020 for this paper by its authors. Use permitted under Creative
Commons License Attribution 4.0 International (CC BY 4.0).</p>
      <p>
        In the speci c case of Latin America, the World Bank reports a similar rate
than the OECD with a 22% dropout rate [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Some proposals to solve this
situation have been, among others, o ering students an introductory
orientation phase [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ], o ering nancial aids or scholarships [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], and use ICT in their
University Tutorials Systems [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Nonetheless, the overarching strategies and the
funding programs on academic success do not su ciently reduce the academic
dropout. It is vital to recognize at-risk students before dropping out.
      </p>
      <p>
        The available data about students' academic performance, creates an
opportunity to identify what factors a ect student dropout and take preventive
measures to solve this problem. This is the reason why studies have been done using
prediction algorithms to solve this situation [
        <xref ref-type="bibr" rid="ref21 ref6">6,21</xref>
        ]. In the case of Latin America,
there are not many studies in this area. However, when found, these are con ned
to: a) identify attrition through descriptive statistics [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], b) study only the rst
years of student study [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], c) focus on a speci c undergraduate program [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] or a
courses [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] and, d) use a di erent type of variables such as academic and
sociodemographic data, which results in over- tting or biased results due to the large
number of variables [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. Moreover, the studies do not specify the input
characteristics, causing that the models can not be replicated. This study addresses
the problems above by proposing a predictive module considering the academic
history of students throughout their undergraduate programs with a strong
emphasis on analyzing which input variables have the most substantial in uence on
predicting dropout. A case study is presented in a Latin American institution
of higher education where the model was tested, scaling it to all undergraduate
programs. This model will help as input for the Early Warning Systems (EWS)
dashboards to be developed in the Learning Analytics area.
      </p>
      <p>The rest of the paper is organized as follows. Section 2 provides a brief
overview of the relevant literature. Section 3 describes the data used in our
predictive model and summarises our work. Section 4 shows the experimental
results from the study. The discussion about our work is presented in Section
5. Finally, Section 6 draws conclusions from this analysis, and suggests possible
directions for future work in this area.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>When it comes to input variables to predict dropout, there are di erent models
and input features found in the literature, some focused on analyzing a single
type of data, while others mix di erent groups of data.</p>
      <p>
        Regarding a single type of data, for example, in studio [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ], the authors
used psychosocial variables with a statistics model on 690 students, and the
results ranged from 11.8% to 22% of the variance. The studio of [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], focused on
analyzing academic data such as the rst partial grade and the percentage of
nonattendance accumulated of 171 students. Unfortunately, the prediction rate is no
mentioned. The study of [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] combined intelligence, personality, and motivational
predictors in 137 students and the results showed 33% of the variance in GPA
and 30% of the variance in time to graduation.
      </p>
      <p>
        In terms of mixing di erent groups of data, in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], the authors used
demographic, pre-college entry information, and complete transcript records of 69,116
students, generating a total of more than 700 features and had an accuracy of
66%. In [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], the authors used the Cox proportional hazards model with di erent
groups of variables, such as demography, family history, nancial information,
high school, college enrollment and semester credits of 11,121 students. The
precision of its model ranged between 71% and 82%. In [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], the attributes used
were about the student's behavior in the rst part of his undergraduate
program of 498 students, ignoring classical attributes. The precision of their models
ranged between 74% and 78%. Another predictive model, as well as an EWS,
is presented by [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], which only uses academic performance and data gathered of
LMS, the precision ranges where from 79% to 92%.
      </p>
      <p>While all of the above models are accurate in their contexts, they are not
scalable because they use small samples, only use the rst academic years, or
focus on speci c elds or undergraduate programs. Moreover, a mix of di erent
variables, such as psychological, demographic and academic, could cause biased
results or less accuracy prediction. Besides, certain models cannot be replicated
because the variables are not speci ed but rather mentioned in a general way.
Therefore, a model that can be scalable and replicable is needed to understand
the student's path from beginning to end and consider variables that cause an
impact on learning performance.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Case Study</title>
      <p>The study took place in Escuela Superior Politecnica del Litoral, ESPOL, a
public engineering-oriented university located in Ecuador. The model was
implemented in Spyder3, an open-source platform, and we used Python V3.7 as
programming language and made use of the Scikit-learn library, an open-source
library implements many machine learning algorithms. The proposed model and
each of its phases is described below.
3.1</p>
      <sec id="sec-3-1">
        <title>Data set</title>
        <p>The information was provided by the Information Technology and Systems
Management4. For the model, we considered the bachelor's degree in Industry
Engineering, which has 753 students enrolled from 2000 until the rst semester
of 2019. This is one of the undergraduate programs with the most signi cant
number of students enrolled during this period. The years selected were used as
boundaries to allow ve years of full studies from the time the students began
their rst academic semester until obtaining a professional degree.</p>
        <sec id="sec-3-1-1">
          <title>3 https://www.spyder-ide.org/</title>
        </sec>
        <sec id="sec-3-1-2">
          <title>4 https://www.serviciosti.espol.edu.ec/</title>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Pre-processing</title>
        <p>This phase focused on the information acquired, with the purpose of generating
consistent data that could be used for the algorithm. The data was categorized
as follows: socio-demographic data of the student (gender, marital status,
employment status, city of residence, among others), data about the undergraduate
program (number of total credits, code of the course, among others) and data on
the student's performance (courses taken, state of the course, number of times
the course has been taken, GPA, credits of the course, among others). During
the pre-processing phase, cleaning, transformation and integration procedures
were performed due to the origin of the information.</p>
        <p>A recognition process was carried out, in which the academic terms were
appropriately identi ed because the period of study varied depending on the term.
Therefore, the periods were the rst and second term, also called ordinary
periods, and third term also called extraordinary period. For the predictive model,
ordinary periods were selected as they had the same duration. Subsequently,
we enumerated the semesters in which students had been studying with their
respective academic record, in order to execute the analysis per semester.</p>
        <p>After the pre-processing, the most representative academic performance
variables were calculated and used as inputs for the model, which are detailed in
Table 1. Besides, Section 4.1 explains the reasons why the model uses academic
information alone, and discards socio-demographics information. Other
indicators as undergraduate program performance indicators per semester were
calculated in order to obtain additional information to compare the results gathered
from the prediction.
V3 aNfutemrbfaerilionfgttimheesseocfotnhdirtdimenerollment in a subject
V4 The period in years that the student takes to return</p>
        <p>to study
V5 GPA average of all subjects
V6 tGaPkAenaivnertahgeecoufrrseunbtjeucntdsergraduate program</p>
        <p>GPA average of subjects with number of credits greater
V7 than zero, penalized for the number of times</p>
        <p>the subjects are taken by the student
V8 Ratio of the approved subjects taken by the student
V9 Ratio of the failed subjects taken by the student
V10 Ratio of the canceled subjects taken by the student</p>
      </sec>
      <sec id="sec-3-3">
        <title>Desertion alert algorithm solution</title>
        <p>
          For this preliminary work, we de ne the concept of academic dropout when a
student has not been enrolled in any course for some time since his/her last
registration. Based on ESPOL's internal guidelines and regulations5, the period
is ve years. Therefore, those students have a dropout value equal to one. On the
other hand, students who graduated had a dropout value equals to zero. Also,
students with 90% of their undergraduate degrees completed are assigned with
dropout equal to zero. This percentage based on other studies [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ], which indicated
that people with this advanced level abandon their undergraduate degree for
reasons outside their academic performance. Thus, dropout is considered as a
single, binary outcome feature. As a classi cation problem, we used the Random
Forest Classi er (RFC) algorithm [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. This is a well-known tree-based learning
algorithm, known for its low tendency to over-training and its high accuracy.
        </p>
        <p>
          After pre-processing, the process started with the dropout assignment equal
to one or zero, according to the de nition of academic dropout indicated above.
Subsequently, the semester selection was made cumulatively. This means,
selecting the academic history of all students in their rst semester and
entering this data into the model; then, selecting the students in their rst and
second semesters and entering this data into the model; and so on, until the
fth semester, which represents two and a half years of studies. After the fth
semester, the evaluation of all students who had more than ve semesters in
their undergraduate programs was carried out. In Ecuador, according to [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], the
average of student dropout occurs between the second and third year of the
studies.
        </p>
        <p>Every group of semesters was used to calculate the academic performance
indicators, and consequently, de ne the set of data that was taken for the
training and testing. The training data was built with the group of students whose
dropout was previously de ned, and the algorithm was trained; while, for testing
data, the set of students who did not have a dropout de ned was considered.</p>
        <p>In general, high probability values mean that the student has a high
probability of dropping out of the undergraduate program, while low values indicate a
high probability of nishing the degree. Fig. 1 depicts the steps that were carried
out to produce meaningful information.
4
4.1</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Results</title>
      <sec id="sec-4-1">
        <title>One degree</title>
        <p>For data acquired from the bachelor's degree in Industry Engineering, we
applied a correlation matrix to analyze how the di erent variables were a ected
within the proposed model. As consequence, the socio-demographic variables
were discarded, such as gender, married status, school type, and work status,
because this variables had a correlation coe cient lower than 0.1; therefore, were</p>
        <sec id="sec-4-1-1">
          <title>5 http://reglamentos.espol.edu.ec/</title>
          <p>not included into the model. Additionally, we use the RFC with the following
parameters: number of trees equal to 300, depth of trees equal to 6, minimum
number of samples required to split equal to 6, minimum number of samples
required to be at a leaf node equal to 7. These parameters were chosen based on
the results in accuracy when the model was applied to our testing set with an
accuracy of more than 95%.</p>
          <p>
            In order to evaluate the e ectiveness of the proposed algorithm, accuracy,
area under curve (AUC), F1-score, recall and precision were used as evaluation
criteria. Accuracy is the proportion of correct prediction of dropout. Precision
is the proportion of dropout learners predicted correctly by the classi er. Recall
is the proportion of dropout learners predicted correctly by the classi er in all
real dropout learners. F1-score is the harmonic mean of precision and recall [
            <xref ref-type="bibr" rid="ref10">10</xref>
            ].
Table 2 presents the results of the evaluation criteria per semester of the
predictive model based on the variables described in Section 3.2, for the undergraduate
program selected.
It was decided to scale the model for all undergraduate programs that are
dictated in ESPOL. The data selected from 2000 until the rst semester of 2019
has a total of 29,983 students. Due to the time of the selected data, there were
some changes in denomination codes of undergraduate programs. Nevertheless,
the essential structure of the undergraduate programs had been preserved. With
the purpose of scalability, a process of unifying undergraduate program codes
was carried out to avoid the loss of information.
          </p>
          <p>A total of 65 undergraduate programs were predicted within the model. The
model predicted the dropout degree by degree, and as explained before, every
group of semesters was used to calculate the academic performance indicators,
and consequently, de ne the set of data that was taken for the training and
testing. In this way, in parallel to testing our proposal, we proved that
sociodemographic data are unnecessary for dropout prediction, and eliminate typical
biases from the learning process. A summary of the ve evaluation criteria of the
model for the 10 undergraduate programs with the largest of number of students
enrolled are shown in Table 3.
Our proposed model was based on academic data alone and the analysis of the
academic semesters of students throughout the student's undergraduate
program. The initial results indicated a strong signal in the use of data purely
academic for predicting student dropout, with an accuracy of more than 95%.
In addition, the experimental results proved the e ectiveness and scalability of
our algorithm due to the di erent numbers of students enrolled.</p>
          <p>
            Unfortunately, these results can not be compared with similar studies that
have only used academic data as well because of the following reasons. First, the
prediction model focuses on the rst semesters only [
            <xref ref-type="bibr" rid="ref1">1</xref>
            ], while ours studies the
student throughout his/her undergraduate program. Second, the variables are
not the same. For instance, in [
            <xref ref-type="bibr" rid="ref15">15</xref>
            ], the authors used three academic variables,
but the studio considered the fees deadline as academic indicator, while we used
in our model data purely academic. Third, we tested our model in di erent
undergraduate programs. Meanwhile in [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ] used one undergraduate program.
          </p>
          <p>
            We also acknowledge the limitation that prevents us from generating a
better model. These limitations arise from the data gathered. Although, it is true
we only focus on the use of academic data for the model, it should be noted
the importance of the students' nancial status, because this factor is one of
the motivation behind students' decisions to stop their studies [
            <xref ref-type="bibr" rid="ref20">20</xref>
            ]. Thus, our
limitation is because the students' nancial status was calculated in ESPOL
using di ered range values for students from 2000 until 2014 and for students
from 2015 onwards. Another limitation is when students withdraw for a period
of time and their undergraduate programs change the assigned code, so when
they return to their studies, they are registered into the database with the new
undergraduate program code, but with all their academic records as a single
semester. The portion of this student population with more than 10 subjects
taken was 6%. This limitation was addressed considering as maximum average
7 subjects taken per semester.
6
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion and future work</title>
      <p>Dropout prediction is an essential prerequisite to make interventions of at-risk
students. To address this issue, we proposed a predictive model, which used
academic data alone and analyzed the information semester by semester to obtain
more reliable predictions. Besides, it was able to identify the earliest moment of
dropping out of a student. To the best of our knowledge, this is the rst proposal
of a predictive model able to predict even for students who have just nished
their rst semester.</p>
      <p>Our model was also tested with real data of ESPOL and was also applied
to other large data sets of the di erent undergraduate programs. Experimental
results showed that our model has better results than existing proposals in terms
of input features and accuracy. Therefore, we have proven that our model allows
to predict with su ciently high accuracy but, at the same time, can predict the
intention of dropping out early enough, so that some corrective actions can be
attempted.</p>
      <p>Additionally, the fact that the variables that worked are speci cally
mentioned can help other universities to replicate this model or adapt it to their
needs.</p>
      <p>Our future work aims to make technical improvements to our prediction
model and use other machine learning algorithms to compare them as other
researches do. Furthermore, we plan to use this model in postgraduate programs
and in academic data sets from other types of universities where attrition rates
tend to be much higher. Additionally, we are currently working on a
visualization prototype to integrate the results of the predictive model into the Academic
Counseling System. Using visualization techniques with explanatory models,
allows counselors to have the opportunity to identify those students who have the
possibility of an early dropout, and be able to adapt a recommendation strategy
to minimize the risk of failure of a student. Lastly, we plan to analyze and
visualize which input variables have the most substantial in uence on predicting
dropout.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgements</title>
      <p>Work partially funded by the LALA project (grant
no.586120-EPP-1-2017-1-ESEPPKA2-CBHE-JP). This project has been funded with support from the
European Commission. This work has also been partially funded by the Madrid
Regional Government through the e-Madrid-CM Project under Grant
S2018/TCS4307, a project which is co-funded by the European Structural Funds (FSE and
FEDER). This work has also been partially funded by the FEDER/Ministerio de
Ciencia, Innovacion y Universidades{Agencia Estatal de Investigacion, through
the Smartlet Project under Grant TIN2017-85179-C3-1-R. This publication
reects only the views of the authors, and the funders cannot be held responsible
for any use which may be made of the information contained therein.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Abu-Oda</surname>
            ,
            <given-names>G.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>El-Halees</surname>
            ,
            <given-names>A.M.</given-names>
          </string-name>
          :
          <article-title>Data mining in higher education: university student dropout case study. Data mining in higher education: university student dropout case study 5(1) (</article-title>
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Alvarrac</surname>
            <given-names>n</given-names>
          </string-name>
          , P.,
          <string-name>
            <surname>Daniel</surname>
          </string-name>
          , J.:
          <article-title>Identi cacion del per l de egreso correspondiente a la licenciatura de la carrera de laboratorio cl nico e histotecnologico de la universidad central del ecuador periodo 2017-2022 (</article-title>
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Ameri</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fard</surname>
            ,
            <given-names>M.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chinnam</surname>
            ,
            <given-names>R.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reddy</surname>
            ,
            <given-names>C.K.</given-names>
          </string-name>
          :
          <article-title>Survival analysis based framework for early prediction of student dropouts</article-title>
          .
          <source>In: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management</source>
          . pp.
          <volume>903</volume>
          {
          <fpage>912</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Aulck</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Velagapudi</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blumenstock</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>West</surname>
          </string-name>
          , J.:
          <article-title>Predicting student dropout in higher education</article-title>
          .
          <source>arXiv preprint arXiv:1606.06364</source>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Baneres</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodriguez-Gonzalez</surname>
            ,
            <given-names>M.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Serra</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>An early feedback prediction system for learners at-risk within a rst-year higher education course</article-title>
          .
          <source>IEEE Transactions on Learning Technologies</source>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Barbu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vilanova</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Lopez</given-names>
            <surname>Vicario</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Pereira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.J.</given-names>
            ,
            <surname>Alves</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Podpora</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Angel</given-names>
            <surname>Prada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Moran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Torreburno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Marin</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          , et al.:
          <article-title>Data mining tool for academic data exploitation: literature review and rst architecture proposal. Projecto SPEET-Student Pro le for Enhancing Engineering Tutoring (</article-title>
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Breiman</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Random forests</article-title>
          .
          <source>Machine learning 45(1)</source>
          ,
          <volume>5</volume>
          {
          <fpage>32</fpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Catalano</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Murcia</given-names>
            <surname>Perez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.E.</given-names>
            ,
            <surname>Escudero</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Gerlo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.D.</given-names>
            ,
            <surname>Pantoja</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          :
          <article-title>In uence of a tutorial system based on the use of ict in the decrease of dropout and academic failureof rst-year students of the veterinary career of the juan agust n maza university</article-title>
          . In: I Jornadas de Inclusion de
          <article-title>Tecnolog as Digitales en la Educacion Veterinaria (La Plata,</article-title>
          <year>2018</year>
          ) (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Cevallos</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Cuadernos del contrato social por la educacion</article-title>
          .
          <source>Cuaderno</source>
          <volume>10</volume>
          ,
          <issue>34</issue>
          {
          <fpage>46</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Feng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Mooc dropout prediction using a hybrid algorithm based on decision tree and extreme learning machine</article-title>
          .
          <source>Mathematical Problems in Engineering</source>
          <year>2019</year>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Chen</surname>
          </string-name>
          , R.:
          <article-title>Financial aid and student dropout in higher education: A heterogeneous research approach</article-title>
          . In: Higher education, pp.
          <volume>209</volume>
          {
          <fpage>239</fpage>
          . Springer (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Chen</surname>
          </string-name>
          , R.:
          <article-title>Institutional characteristics and college student dropout risks: A multilevel event history analysis</article-title>
          .
          <source>Research in Higher education 53(5)</source>
          ,
          <volume>487</volume>
          {
          <fpage>505</fpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Dalipi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Imran</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kastrati</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>Mooc dropout prediction using machine learning techniques: Review and research challenges</article-title>
          .
          <source>In: 2018 IEEE Global Engineering Education Conference (EDUCON)</source>
          . pp.
          <volume>1007</volume>
          {
          <fpage>1014</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Ferreyra</surname>
            ,
            <given-names>M.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Avitabile</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Botero</surname>
            <given-names>Alvarez</given-names>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Haimovich</given-names>
            <surname>Paz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Urzua</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          :
          <article-title>At a crossroads : Higher education in latin america and the caribbean (</article-title>
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Jimenez</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paoletti</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sanchez</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sciavicco</surname>
          </string-name>
          , G.:
          <article-title>Predicting the risk of academic dropout with temporal multi-objective optimization</article-title>
          .
          <source>IEEE Transactions on Learning Technologies</source>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Kappe</surname>
          </string-name>
          , R., van der Flier, H.:
          <article-title>Predicting academic success in higher education: what's more important than being smart?</article-title>
          <source>European Journal of Psychology of Education</source>
          <volume>27</volume>
          (
          <issue>4</issue>
          ),
          <volume>605</volume>
          {
          <fpage>619</fpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Murphy</surname>
            ,
            <given-names>J.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Murphy</surname>
            ,
            <given-names>S.A.</given-names>
          </string-name>
          :
          <article-title>Get ready, get in, get through: Factors that in uence latino college student success</article-title>
          .
          <source>Journal of Latinos and Education</source>
          <volume>17</volume>
          (
          <issue>1</issue>
          ),
          <volume>3</volume>
          {
          <fpage>17</fpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <source>OECD: Education at a Glance</source>
          <year>2019</year>
          (
          <year>2019</year>
          ). https://doi.org/https://doi.org/https://doi.org/10.1787/f8d7880d-en, https://www.oecd-ilibrary.org/content/publication/f8d7880d-en
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Paura</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Cause analysis of students' dropout rate in higher education study program</article-title>
          .
          <source>Procedia - Social and Behavioral Sciences</source>
          <volume>109</volume>
          ,
          <volume>1282</volume>
          {
          <volume>1286</volume>
          (01
          <year>2014</year>
          ). https://doi.org/10.1016/j.sbspro.
          <year>2013</year>
          .
          <volume>12</volume>
          .625
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <given-names>Sinchi</given-names>
            <surname>Nacipucha</surname>
          </string-name>
          ,
          <string-name>
            <surname>E.R.</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Gomez</given-names>
            <surname>Ceballos</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.P.</surname>
          </string-name>
          :
          <article-title>Access and desertion in universities. nancing alternatives</article-title>
          .
          <source>ALTERIDAD. Revista de Educacion</source>
          <volume>13</volume>
          (
          <issue>2</issue>
          ),
          <volume>274</volume>
          {
          <fpage>287</fpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Suganya</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Narayani</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Analysis of students dropout forecasting using data mining</article-title>
          .
          <source>In: 3rd Internaational Conference on Lastest Trends in Engineering, Science, Humanities and Management</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Ting</surname>
            ,
            <given-names>S.M.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Man</surname>
          </string-name>
          , R.:
          <article-title>Predicting academic success of rst-year engineering students from standardized test scores and psychosocial variables</article-title>
          .
          <source>International Journal of Engineering Education</source>
          <volume>17</volume>
          (
          <issue>1</issue>
          ),
          <volume>75</volume>
          {
          <fpage>80</fpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Viloria</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lezama</surname>
            ,
            <given-names>O.B.P.</given-names>
          </string-name>
          :
          <article-title>Mixture structural equation models for classifying university student dropout in latin america</article-title>
          .
          <source>Procedia Computer Science</source>
          <volume>160</volume>
          ,
          <issue>629</issue>
          {
          <fpage>634</fpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <given-names>Viloria</given-names>
            <surname>Silva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.J.</given-names>
            ,
            <surname>Parody</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          :
          <article-title>Methodology for obtaining a predictive model academic performance of students from rst partial note and percentage of absence (</article-title>
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Vossensteyn</surname>
            ,
            <given-names>J.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kottmann</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jongbloed</surname>
            ,
            <given-names>B.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaiser</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cremonini</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stensaker</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hovdhaugen</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wollscheid</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Dropout and completion in higher education in europe: Main report (</article-title>
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>