<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>O. Mendez);</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Machine Learning Classification Model for Detecting Academic Risk in the Development of Projective Spatial Thinking</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Oscar Mendez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hector Florez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Universidad Distrital Francisco Jose de Caldas</institution>
          ,
          <addr-line>Bogota</addr-line>
          ,
          <country country="CO">Colombia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>This study developed and evaluated a Machine Learning classification model for early detection of academic risk in the development of projective spatial thinking. Academic and sociodemographic variables from 153 students in basic and secondary education were analyzed using a Gradient Boosting Classifier algorithm. The model was trained to identify predictors of low performance and classify students according to their risk level. The results showed a good predictive performance (ROC AUC = 0.866). We conclude that the classification system is an efective tool for teachers, facilitating proactive identification of at-risk students and enabling the design of early, personalized pedagogical interventions to strengthen this fundamental cognitive skill.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Spatial Thinking</kwd>
        <kwd>Performance Prediction</kwd>
        <kwd>Machine Learning</kwd>
        <kwd>Educational Data Mining</kwd>
        <kwd>Sociodemographic Factors</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The development of spatial thinking is a fundamental cognitive skill for academic success in various
disciplines; however, there is evidence of persistent low performance in this area among primary and
secondary school students [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Traditional pedagogical approaches, including the use of Information
and Communication Technologies (ICT), often prove insuficient as they cannot process the complex
network of academic and sociodemographic factors that influence each student’s performance [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. This
limitation hinders the creation of efective and personalized intervention strategies.
      </p>
      <p>
        Faced with this challenge, Educational Data Mining and Machine Learning (ML) techniques emerge
as a powerful approach, capable of analyzing large datasets to identify patterns and make predictions
[
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ]. Previous research has demonstrated the feasibility of using ML models for the early detection of
students at risk of dropping out or general low performance [
        <xref ref-type="bibr" rid="ref3 ref5">3, 5</xref>
        ]. Nevertheless, there is a gap in the
application of these models to specific cognitive domains such as spatial thinking.
      </p>
      <p>
        The development of models capable of early detecting students at risk of low performance has
proven to be an efective application for predicting dropout and student success. Building on this,
the present article addresses a gap in the literature by detailing the development and evaluation of a
Machine Learning classification model specifically designed for the early detection of academic risk
in the development of the projective space of spatial thinking. The study’s objective is to answer the
following research question: To what extent do sociodemographic and academic variables, analyzed
with Machine Learning techniques, allow for the prediction of risk in the development of projective
spatial thinking [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]?
      </p>
      <p>This paper is structured as follows. Section 2 describes the theoretical framework related to this study.
Section 3 presents the methodology used to develop the work. Section 4 explains the results obtained
from the work. Section 5 presents a discussion of the most insightful elements. Section 6 presents the
future work. Some recommendations are ofered in section 7. Finally, Section 8 concludes the work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Theoretical Framework</title>
      <sec id="sec-2-1">
        <title>2.1. Spatial Thinking as a Psychological Construct</title>
        <p>
          Spatial thinking is defined as the set of cognitive processes through which an individual mentally
constructs, represents, and manipulates objects in space, as well as their relationships and transformations
[
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. Far from being an innate ability, psychogenetic psychology, led by Jean Piaget, posits that the
notion of space is a construct that is progressively developed through the subject’s interaction with and
action upon the physical world, and not from mere perception [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
        <p>
          The Piagetian theory, which underpins this study, distinguishes a hierarchy in the development of
spatial relations, each with direct implications for the student’s capacity for abstraction:
• Topological Relations: These constitute the first understanding of space, focused on qualitative
properties such as proximity, separation, order, and enclosure. At this stage, the child understands
basic concepts of the object itself, without yet considering metrics or external perspectives [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ].
• Projective Relations: These represent a more advanced stage where the subject is able to
coordinate diferent points of view. This implies the ability to anticipate how an object is perceived
from a perspective diferent from one’s own, while preserving its fundamental properties. The
analysis of the development of this space is the central focus of our research, as it is a prerequisite
for formal geometry [
          <xref ref-type="bibr" rid="ref10 ref9">9, 10</xref>
          ].
• Euclidean Relations: These culminate the development with the ability to establish a
coordinated and metric reference system, allowing for the conservation of distances, angles, and surfaces.
Here, thinking becomes quantitative, and the foundations for abstract geometric reasoning are
established.
        </p>
        <p>This cognitive development does not occur in a vacuum; it is profoundly influenced by the student’s
physical, social, and cultural environment. The quality of pedagogical strategies and the richness of
interactive experiences with the environment are decisive for the student to successfully transition
through these stages.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Data Mining and Machine Learning in Education</title>
        <p>
          The prediction of academic performance is a complex task due to the multiplicity of factors that afect
it, from the family environment to pedagogical strategies. Educational Data Mining (EDM) emerges as
a research field that applies computational methods to explore large volumes of educational data and
discover patterns that would otherwise remain hidden [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
        </p>
        <p>
          Within EDM, Machine Learning (ML) ofers a set of techniques for building predictive models from
data [
          <xref ref-type="bibr" rid="ref12 ref13">12, 13</xref>
          ]. For this work, the problem is approached as a supervised classification task, where
an algorithm "learns" from a set of labeled data (students with known performance) to be able to
classify new cases [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. This study develops a model for the early detection of students at risk of
low performance, an application that has proven efective in in the development of projective spatial
thinking [
          <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
          ].
        </p>
        <p>
          To ensure a systematic and reproducible process, this research adopted the CRISP-DM (Cross-Industry
Standard Process for Data Mining) methodological framework [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. As illustrated in Figure 1, this
model proposes an iterative life cycle for data mining projects.
        </p>
        <p>The phases of CRISP-DM, followed in this work, are:
1. Business Understanding, where the objectives are defined
2. Data Understanding, for an initial exploratory analysis
3. Data Preparation, which includes cleaning and transformation
4. Modeling, where ML algorithms are selected and trained
5. Evaluation, to measure the model’s performance
6. Deployment, which involves integrating the model into a functional system</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>This research adopted a quantitative, correlational approach with the objective of determining the
relationship between academic and sociodemographic variables and the prediction of performance in
projective spatial thinking. The entire analysis and modeling process was structured following the
CRISP-DM (Cross-Industry Standard Process for Data Mining) methodological framework. Figure 2
visually summarizes the complete workflow, from data collection to the final evaluation of the model.</p>
      <sec id="sec-3-1">
        <title>3.1. Population and Sample</title>
        <p>The study was conducted at the Institución Educativa Departamental General Carlos Albán, located
in Albán, Cundinamarca (Colombia). The total population consisted of 599 students from urban and
rural sectors. For this research, a non-probabilistic convenience sample of 153 students who voluntarily
participated during the 2019 academic year was selected. The sample was distributed across the eighth,
ninth, tenth, and eleventh grades, with ages ranging from 12 to 18 years.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Data Collection Instruments</title>
        <sec id="sec-3-2-1">
          <title>Two main instruments were used for data collection:</title>
          <p>• Mock Test: A 15-question spatial reasoning test, adapted from the 2010 entrance examination of
the National University of Colombia, was administered. This instrument was designed to assess
the students’ level of development in projective space.</p>
          <p>• Sociodemographic Factors Survey: A survey composed of 33 groups of questions was designed
and administered to characterize the sociodemographic conditions of the students. The survey
variables were based on the dictionary of associated factors from the Colombian Institute for the
Evaluation of Education (ICFES).</p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Analysis and Modeling Process (CRISP-DM)</title>
        <p>The data processing and the construction of the predictive model were carried out in several stages,
using the Python programming language and specialized libraries such as Pandas, Scikit-learn, and
PyMC3.
3.3.1. Data Preparation
The initial dataset, composed of 119 variables, underwent an exploratory analysis. Missing values were
imputed using the mean for numerical data and the mode for categorical data. Given the limited size of
the original sample, a statistical simulation based on the Monte Carlo method was applied to generate a
more robust synthetic dataset, preserving the original distributions of the variables.
3.3.2. Feature Selection
To optimize the model’s performance and avoid overfitting, a feature selection process was carried
out. A Random Forest Classifier algorithm was used to evaluate and rank the predictive importance of
each variable. A final subset of 63 variables was selected, which together explained 95% of the model’s
predictive capacity, a practice recommended in the literature for optimizing classifiers.
3.3.3. Model Building and Evaluation
Several supervised classification algorithms were evaluated, with the Gradient Boosting Classifier
ofering the best performance for this problem. The dataset was split into a training set (80%) and a test
set (20%) to validate the model. The final performance was measured using the ROC (Receiver Operating
Characteristic) curve and the area under it (AUC), a standard metric for evaluating the discriminative
ability of a binary classifier.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <sec id="sec-4-1">
        <title>4.1. Diagnostic Performance in Spatial Thinking</title>
        <p>
          The administration of the mock test to the sample of 153 students revealed a widespread low performance
in the development of projective spatial thinking. The results indicated that a high percentage of students
across all evaluated grades fell into the "Low Level" category. Specifically, it was found that 88% of
eighthgrade students, 76% of ninth-grade, 98% of tenth-grade, and 84% of eleventh-grade students obtained
an insuficient score. These findings empirically validated the existence of a significant educational
problem in the area of study and provided the baseline performance data for training the model [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Feature Analysis and Model Selection</title>
        <p>To build a robust model, a feature importance analysis was first conducted to identify the variables with
the greatest predictive power. As shown in Figure 3, the most influential variables were not academic
but rather sociodemographic and related to personal habits. The top five predictors were the time the
student spends reading, the number of siblings, the number of people in the household, the time spent
browsing the internet, and the Sisbén score.</p>
        <p>Subsequently, four classification algorithms were evaluated to select the most suitable one. Table
1 summarizes the performance of each. The Gradient Boosting Classifier was selected as the final
model due to its superior performance (AUC of 0.86) on the test dataset.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Performance and Evaluation of the Final Model</title>
        <p>The final Gradient Boosting Classifier model was optimized by tuning its hyperparameters, determining
that 100 estimators ofered the best balance between performance and overfitting (see Figure 4). The
ifnal evaluation of the optimized model yielded an AUC = 0.86, as observed in the ROC Curve in Figure
5, confirming a high discriminatory power.</p>
        <p>The confusion matrix (Figure 6) details this performance on the 5060 samples of the test set. The
model correctly identified 1626 students at risk (True Positives) and 2354 students not at risk (True
Negatives). The classification report (Table 2) summarizes the key metrics, highlighting a Recall of 0.71
for the "Risk" class and an overall Accuracy of 79%.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Discussion</title>
      <p>The results of this research demonstrate that it is feasible to predict the risk of academic performance
in projective spatial thinking with a high degree of certainty (AUC = 0.86) using a Machine Learning
model. Beyond the technical validation of the model, the findings ofer a deep insight into the factors
that modulate cognitive development in primary and secondary school students.</p>
      <p>
        A first significant finding emerges when contrasting the students’ low performance on the diagnostic
test with Piaget’s theoretical framework [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Although the students were in an age range where
they theoretically should have consolidated concrete operations, the majority showed deficiencies in
projective tasks. This indicates that the development of spatial skills is not an automatic maturational
process linked solely to age, but is strongly conditioned by the environment and the stimulation received,
suggesting that educational and contextual experience plays a more critical role than might be assumed.
      </p>
      <p>
        The model’s feature importance analysis (Figure 3) strongly reinforces this idea. The model did not
identify previous grades as the strongest predictors, but rather sociodemographic variables and personal
habits such as time spent reading, the number of people in the household, and the Sisbén score. This
result aligns with studies that have demonstrated the profound influence of cultural and economic
capital on academic performance [
        <xref ref-type="bibr" rid="ref14 ref15">14, 15</xref>
        ], and suggests that the student’s environmental conditions are
a determining factor in the development of complex cognitive skills.
      </p>
      <p>
        Finally, the model’s performance as a classification tool has direct practical implications for teaching.
The classifier’s ability to identify students "At Risk" (Recall = 0.71) and those "Not at Risk" (Recall = 0.85),
as detailed in Table 2, makes it an efective early warning system [
        <xref ref-type="bibr" rid="ref16 ref17">16, 17</xref>
        ]. Although the model is not
infallible—the 665 cases of False Negatives (Figure 6) represent at-risk students who were not detected
and deserve special attention—it does provide a first layer of analysis that allows educators to focus
their resources and design personalized pedagogical interventions. Instead of applying homogeneous
strategies, teachers can use the model’s results to nurture the potential of high-achieving students and,
crucially, to ofer focused support to those most likely to face dificulties.
      </p>
    </sec>
    <sec id="sec-6">
      <title>6. Future Work</title>
      <p>It is essential to acknowledge a methodological limitation of this study: the use of synthetic data
generated from a small original sample to achieve suficient volume for model training. Although this
technique allowed for the construction of a robust classifier and the validation of the hypothesis, it
introduces a potential bias, as the generated data inherits the characteristics and distribution of the
initial sample.</p>
      <p>Therefore, while the model has shown high potential, the generalization of these results should
be considered with caution. The reproducibility of the use case in other populations or educational
contexts will require empirical validation with a larger and more diverse organic dataset. Future lines
of research should focus on applying and retraining this model with data collected on a larger scale to
confirm and expand upon the findings presented here.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Recommendations</title>
      <p>
        Based on the findings and limitations identified, the following recommendations are proposed:
• Prioritize Organic Data in Future Iterations: The most important recommendation is that
future replications of this study prioritize the use of non-synthetic data. To validate and generalize
the findings, it is essential to train and test the model on a larger and more diverse organic dataset,
thereby eliminating the bias inherent in the simulation technique used.
• Adopt a "Data Culture": It is recommended that educational institutions foster an organizational
culture that values data as a strategic asset. This involves the systematic and ethical collection of
information to use predictive models like the one presented here and thus design personalized
and proactive pedagogical strategies.
• Deepen Causal Analysis: Future research should go beyond prediction and delve into the causal
analysis of the most influential variables. Understanding why "time spent reading" is such a
strong predictor, for example, could generate valuable pedagogical knowledge.
• Integration into Educational Systems: Work should be done to integrate the predictive model
into an interactive module within school information systems (SIS) or learning management
systems (LMS). This would allow teachers and counselors to consult risk predictions in real-time,
transforming the model from a research artifact into a daily support tool for decision-making
[
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
• Creation of an Alert and Manual Review Module: It is recommended to develop an automated
alert module. This system should establish a clear policy for cases where a student’s predicted
risk score exceeds a predefined threshold. Upon triggering an alert, the system should facilitate a
protocol for a detailed manual review by the teacher, allowing for a human-centric intervention
that complements the model’s automated diagnosis.
      </p>
    </sec>
    <sec id="sec-8">
      <title>8. Conclusions</title>
      <p>This study afirmatively answers the research question, demonstrating that the application of
Machine Learning techniques is an efective tool for predicting the risk of academic performance in the
development of projective spatial thinking. The classification model validated the hypothesis that
sociodemographic variables and personal habits are highly significant predictors. However, it is
imperative to contextualize these good results, acknowledging that they were achieved using a dataset
augmented through synthetic simulation. While this was a necessary methodological strategy for the
analysis in this use case due to the small size of the original sample, it introduces a potential bias that
must be considered.</p>
      <p>Therefore, it is also concluded that the success of such projects depends on robust interdisciplinary
collaboration, where education experts guide the interpretation of the context and technical results.
The model should not be seen as an end in itself, but as a starting point for deeper inquiry.</p>
      <p>Finally, it is crucial to address the ethical dimension raised by the high predictive power of
socioeconomic variables, such as the Sisbén score. While efective for the model, their uncritical use risks
reinforcing systemic biases and potentially stigmatizing students from vulnerable backgrounds.
Therefore, it is emphatically concluded that the model must not be used as an automatic labeling tool. Instead,
it should function as an early warning system that serves as a starting point for a teacher’s inquiry.
Its role is to complement, not replace, the professional judgment of the educator, who provides the
indispensable human context and understanding required for any fair and efective intervention.</p>
    </sec>
    <sec id="sec-9">
      <title>Declaration on Generative AI</title>
      <sec id="sec-9-1">
        <title>The authors have not employed any Generative AI tools.</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Ó. A.</given-names>
            <surname>Méndez Aguirre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. G.</given-names>
            <surname>López</surname>
          </string-name>
          <string-name>
            <surname>Martínez</surname>
          </string-name>
          ,
          <article-title>Técnicas de Machine Learning para la predicción de desempeño académico en el Desarrollo del espacio proyectivo del Pensamiento Espacial</article-title>
          , Universidad Pedagógica Nacional, Bogotá,
          <string-name>
            <surname>D.C.</surname>
          </string-name>
          ,
          <year>2019</year>
          . Master Thesis.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>P. H.</given-names>
            <surname>Santoso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Setiaji</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kurniawan</surname>
          </string-name>
          , Wahyudi,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bahri</surname>
          </string-name>
          , Fathurrahman,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kusuma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. U.</given-names>
            <surname>Wusqo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. D.</given-names>
            <surname>Muldayanti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. D.</given-names>
            <surname>Kurniawan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Syahbrudin</surname>
          </string-name>
          ,
          <article-title>Students' performance dataset for using machine learning technique</article-title>
          <source>in physics education research</source>
          ,
          <source>Scientific Data</source>
          <volume>12</volume>
          (
          <year>2025</year>
          ).
          <source>doi: 10.1038/ s41597-025-04913-0.</source>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Ma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Jia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Wei</surname>
          </string-name>
          , W. Zhang,
          <article-title>Constructing a predictive model of negative academic emotions in high school students based on machine learning methods</article-title>
          ,
          <source>Scientific Reports</source>
          <volume>15</volume>
          (
          <year>2025</year>
          )
          <fpage>19183</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lawson</surname>
          </string-name>
          ,
          <article-title>Data Science in Higher Education: A Step-by-Step Introduction to Machine Learning for Institutional Researchers</article-title>
          ,
          <source>CreateSpace Independent Publishing Platform</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>O.</given-names>
            <surname>Mendez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Florez</surname>
          </string-name>
          ,
          <article-title>Applying the flipped classroom model using a vle for foreign languages learning</article-title>
          ,
          <source>in: International Conference on Applied Informatics</source>
          , Springer,
          <year>2018</year>
          , pp.
          <fpage>215</fpage>
          -
          <lpage>227</lpage>
          . doi:https://doi.org/10.1007/978-3-
          <fpage>030</fpage>
          -01535-0_
          <fpage>16</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Mukuka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. K.</given-names>
            <surname>Alex</surname>
          </string-name>
          ,
          <article-title>Student teachers' knowledge of school-level geometry: Implications for teaching and learning</article-title>
          .,
          <source>European Journal of Educational Research</source>
          <volume>13</volume>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Piaget</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Inhelder</surname>
          </string-name>
          ,
          <source>Psicología del niño, Morata</source>
          , Madrid,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J.</given-names>
            <surname>Piaget</surname>
          </string-name>
          ,
          <source>La representación del Espacio en el Niño</source>
          , Presses Universitaires de France (PUF), Paris,
          <year>1947</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J.</given-names>
            <surname>Piaget</surname>
          </string-name>
          , La Geometría Espontánea en el Niño, Presses Universitaires de France (PUF), Paris,
          <year>1948</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Abuhasanein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rejeb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Jemai</surname>
          </string-name>
          ,
          <article-title>Challenges to learning mathematical concepts among sixthgrade students in primary education: A teachers' perspective</article-title>
          ,
          <source>International Electronic Journal of Mathematics Education</source>
          <volume>20</volume>
          (
          <year>2025</year>
          )
          <article-title>em0818</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>G.</given-names>
            <surname>Rabinowitz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. S.</given-names>
            <surname>Moore</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Weckel</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Chafee</surname>
          </string-name>
          ,
          <article-title>Study of an efective machine learning-integrated science curriculum for high school youth in an informal learning setting</article-title>
          ,
          <source>International Journal of STEM Education</source>
          <volume>12</volume>
          (
          <year>2025</year>
          ).
          <source>doi:10.1186/s40594-025-00543-5.</source>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A.</given-names>
            <surname>Géron</surname>
          </string-name>
          ,
          <article-title>Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd</article-title>
          <string-name>
            <surname>Edition</surname>
            ,
            <given-names>O</given-names>
          </string-name>
          <string-name>
            <surname>'Reilly Media</surname>
          </string-name>
          , Inc.,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. F.</given-names>
            <surname>Colvin</surname>
          </string-name>
          ,
          <article-title>Performance prediction using educational data mining techniques: a comparative study</article-title>
          ,
          <source>Discover Education</source>
          <volume>4</volume>
          (
          <year>2025</year>
          )
          <fpage>1</fpage>
          -
          <lpage>14</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Velázquez Narváez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>González</surname>
          </string-name>
          <string-name>
            <surname>Medina</surname>
          </string-name>
          ,
          <article-title>Factores institucionales y socioeconómicos en el rendimiento académico: Un estudio longitudinal en universidades públicas mexicanas (</article-title>
          <year>2018</year>
          -2023), Revista de la
          <source>Educación Superior</source>
          <volume>53</volume>
          (
          <year>2024</year>
          )
          <fpage>45</fpage>
          -
          <lpage>68</lpage>
          . doi:
          <volume>10</volume>
          .33412/rev.educ.sup.
          <year>2024</year>
          .
          <volume>210</volume>
          .3.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15] V.
          <article-title>autores, Predicting academic performance in public universities using hybrid ai models</article-title>
          ,
          <source>IEEE Transactions on Learning Technologies</source>
          <volume>17</volume>
          (
          <year>2024</year>
          )
          <fpage>102</fpage>
          -
          <lpage>115</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>A.</given-names>
            <surname>Villar</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          R. V. de Andrade,
          <article-title>Supervised machine learning algorithms for predicting student dropout and academic success: a comparative study</article-title>
          ,
          <source>Springer Nature Computer Science</source>
          <volume>4</volume>
          (
          <year>2024</year>
          ). doi:https://doi.org/10.1007/s44163-023-00079-z.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>D.</given-names>
            <surname>Perez Bravo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Zanata Alves</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Ensina</surname>
          </string-name>
          , L. E. Soares de Oliveira,
          <article-title>Evaluating strategies to predict student dropout of a bachelor's degree in computer science</article-title>
          ,
          <source>Anais do Symposium on Knowledge Discovery, Mining and Learning (KDMiLe) 11</source>
          (
          <year>2024</year>
          )
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          . doi:
          <volume>10</volume>
          .5753/kdmile.
          <year>2023</year>
          .
          <volume>232763</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>O.</given-names>
            <surname>Mendez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Camargo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Florez</surname>
          </string-name>
          ,
          <article-title>Machine learning operations applied to development and model provisioning</article-title>
          ,
          <source>in: International Conference on Applied Informatics</source>
          , Springer,
          <year>2024</year>
          , pp.
          <fpage>73</fpage>
          -
          <lpage>88</lpage>
          . doi:https://doi.org/10.1007/978-3-
          <fpage>031</fpage>
          -75144-
          <issue>8</issue>
          _
          <fpage>6</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>