<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Ben Guerir, Morocco
* Corresponding author.
$ mespinoza@ecotec.edu.ec (M. Espinoza-Mina); acolina@ecotec.edu.ec (A. Colina-Vargas)</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Predictive Analysis of Missing Persons in Ecuador (2014-2024)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marcos Espinoza-Mina</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alejandra Colina-Vargas</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Universidad Ecotec</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Samborondón</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ecuador</string-name>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0003</lpage>
      <abstract>
        <p>The growing issue of missing persons in Ecuador demands a data-driven approach to strengthen institutional prevention and search strategies. This study develops a predictive analysis framework using an oficial dataset of 68,072 missing persons records from 2014 to 2024. Applying the CRISP-DM methodology, we implemented classification and anomaly detection models to identify patterns and risk factors. To address a severe class imbalance, the SMOTE technique was applied, resulting in a robust XGBoost classifier. The model achieved an overall accuracy of 88.1% and, more importantly, a substantial improvement in the detection of minority outcomes, with recall values of 0.28 for "Missing" and 0.33 for "Deceased." Beyond predictive performance, the SHapley Additive exPlanations (SHAP) analysis conclusively identified distinct risk profiles for each outcome: short reporting delays strongly predict a "Found" outcome, while long delays increase the likelihood of a case remaining "Missing." For the "Deceased" class, older age emerged as the dominant predictor. This study establishes a fundamental quantitative baseline for the analysis of missing persons, demonstrating that machine learning can generate actionable intelligence for resource prioritization, especially when augmented with interpretability techniques and anomaly detection.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Predictive Analytics</kwd>
        <kwd>Missing Persons</kwd>
        <kwd>Ecuador</kwd>
        <kwd>Risk Factors</kwd>
        <kwd>Model Interpretability</kwd>
        <kwd>Class Imbalance</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The disappearance of people is a global problem that causes sufering and uncertainty, afecting millions
of people worldwide, including children [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. This phenomenon, often linked to human traficking,
violates human rights and has serious consequences for health [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Historically, forced disappearances
have been used for intimidation, and the IACHR (Inter-American Commission on Human Rights) has
expressed concern about their persistence in Latin America and the prevailing impunity [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The
inability to know the whereabouts of a loved one causes distress to family and friends [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        In Ecuador, the increase in missing persons over the last five years has revealed deficiencies in security,
justice, and human rights [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ]. Adolescents, young people, and children are the most vulnerable [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
Although women account for 56.7% of the cases, men constitute 80% of the unresolved or fatal ones [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
Pichincha and Guayas concentrate 46% of the cases [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Despite a resolution rate of 94.0% of cases being
found, 3.1% ending in deceased persons, and 2.8% remaining unresolved, the persistence of these last
categories represents the central challenge [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The average distance between the place of disappearance
and discovery is 50.69 km, with provincial disparities in resolution rates, such as Pichincha (91.02%)
versus Esmeraldas (61.28%) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        Families of missing persons sufer a devastating psychological and economic impact, with 70.6%
unable to resume their routines. There is low social awareness (47.1%) in Ecuador about this issue
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Public institutions, such as the Prosecutor’s Ofice, the Ministry of Government, and the National
Directorate of Crimes Against Life, Violent Deaths, Disappearances, Extortion, and Kidnapping
(DINASED), are perceived as having low performance. The 24-hour delay in reporting is criticized by 64.7%
of respondents, who highlight the importance of the first few hours. An uneven state response has been
observed, with greater speed in high-profile media cases [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The main causes are family and social
problems [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. This underscores the need for robust information systems, standardized search protocols,
and early warnings [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        Artificial Intelligence (AI) and Machine Learning (ML) are promising solutions for public safety
management [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. AI, through predictive analysis, can improve the eficiency and decision-making in
the public sector [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. In the context of disappearances, AI and ML are advancing in the study and
prediction of social problems [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Deep learning techniques can ofer more accurate predictions [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ],
and models like Random Forest have improved the recall rate in location prediction [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. ML can also
optimize humanitarian aid [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. However, the implementation of AI faces challenges such as algorithmic
bias and data privacy, requiring solid ethical frameworks and a human-centered design [
        <xref ref-type="bibr" rid="ref14 ref15">14, 15</xref>
        ].
      </p>
      <p>
        This study addresses the challenge of missing persons in Ecuador using a machine learning framework
on historical data from 2014 to 2024. Critically, we tackle the inherent class imbalance of disappearing
data using the SMOTE technique to improve the prediction of minority outcomes. Furthermore, we
employ SHAP (SHapley Additive exPlanations ) interpretability analysis to identify the key risk factors
driving each prediction (’Found’, ’Missing’, or ’Deceased’). The objective is to generate data-driven
insights to help authorities optimize search protocols and prioritize high-risk cases [
        <xref ref-type="bibr" rid="ref16 ref9">9, 16</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. State of the art</title>
      <sec id="sec-2-1">
        <title>2.1. Definition and types of disappearances</title>
        <p>
          Disappearances in Ecuador are a serious social problem that requires efective solutions, with a notable
increase in cases in recent years [
          <xref ref-type="bibr" rid="ref17 ref5">5, 17</xref>
          ]. Ecuadorian legislation, through the Organic Law on Action in
Cases of Disappeared and Missing Persons (2020), classifies disappearances as voluntary (by personal
decision) and involuntary (caused by third parties and subject to police investigation) [
          <xref ref-type="bibr" rid="ref4 ref5">5, 4</xref>
          ]. Since June
2020, involuntary disappearance has been an autonomous crime in the Comprehensive Organic Penal
Code (COIP) [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ].
        </p>
        <p>
          On the other hand, forced disappearance is defined in the COIP (article 84) as an act under state
command, executed by agents or armed groups [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. The Inter-American Convention on Forced
Disappearance of Persons (1994) characterizes it as the deprivation of liberty by state agents or their
collaborators, followed by a lack of information on the person’s whereabouts [
          <xref ref-type="bibr" rid="ref18 ref3">18, 3</xref>
          ]. This act is a crime
against humanity and violates fundamental rights [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Historical context and prevalence</title>
        <p>
          Forced disappearance originated in the Second World War to intimidate and conceal the fate of detainees,
arriving in Latin America in the 1960s [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. In Mexico, the phenomenon grew with the "War on Drugs"
(2006-2012), generating a concept of "disappeared person" that exceeds the traditional definition of
forced disappearance [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. In Ecuador, cases of forced, voluntary, and involuntary disappearances have
been documented [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. The Truth Commission of Ecuador registered 17 forced disappearances between
1984 and 2008 [
          <xref ref-type="bibr" rid="ref18 ref3">18, 3</xref>
          ]. Between 1970 and 2017, the country recorded 42,484 missing persons, with an
average of 500 reports per month. Quito was the most afected city (34% of cases), and between 2014
and 2017, 59% of the victims were women [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. The role of the State and legal framework</title>
        <p>
          The Ecuadorian State must protect the rights of victims and their families [
          <xref ref-type="bibr" rid="ref1 ref20">1, 20</xref>
          ], establishing a
normative framework with international declarations and conventions, in addition to national laws
such as the Constitution and the COIP [
          <xref ref-type="bibr" rid="ref4 ref7">4, 7</xref>
          ]. State responsibility for damages caused by its agents has
evolved, recognizing the obligation to repair for actions or omissions [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. Ecuador’s 2008 Constitution
guarantees victims of crimes special protection and comprehensive reparation, including the right to
truth, restitution, compensation, rehabilitation, and non-repetition [
          <xref ref-type="bibr" rid="ref3 ref7">7, 3</xref>
          ]. The right to truth implies the
State’s obligation to investigate, judge, and sanction those responsible, ensuring access to information
[
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
        </p>
        <p>
          The Attorney General’s Ofice and the National Police, through the DINASED (created in 2013), are
the main entities in charge of locating missing persons and recovering remains [
          <xref ref-type="bibr" rid="ref3 ref4">4, 3</xref>
          ]. Between 2014 and
June 2021, DINASED located 37,258 people. The Council of the Judiciary has implemented a Protocol
for Action in the Search, Investigation, and Location of Missing Persons, establishing procedures for
the National Police [
          <xref ref-type="bibr" rid="ref3 ref4">4, 3</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-4">
        <title>2.4. Imprescriptibility and statute of limitations for crimes</title>
        <p>
          Article 80 of the Ecuadorian Constitution declares that legal actions and penalties for crimes such as
genocide, crimes against humanity, war crimes, forced disappearance, and aggression are imprescriptible,
prohibiting their amnesty. It establishes criminal responsibility for both perpetrators and those who
ordered the crime [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. Forced disappearance is imprescriptible because it is a grave violation of human
rights and a crime against humanity [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ].
        </p>
        <p>
          In contrast, the COIP (Art. 417, num. 3, literal d) indicates that the statute of limitations for the
common crime of disappearance begins when the person appears or when there are elements to charge
the crime. This creates a potential constitutional conflict if it is considered imprescriptible without
being included in the Art. 80. The statute of limitations for criminal ofenses seeks to guarantee legal
certainty, limit criminal action, and prevent indefinite prosecution, contributing to legal stability [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-5">
        <title>2.5. Impact on families and society</title>
        <p>
          The disappearance of a person causes immense sufering to their families, who face uncertainty, constant
grief, and economic and health problems. Organizations like ASFADEC (Association of Families of
Disappeared Persons in Ecuador), founded in 2012, provide support [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. In Mexico, the concept of
"disappeared alive", a term mainly used in the Mexican context, refers to individuals without an oficial
record or civil existence, who are vulnerable to dangers like human traficking or slavery. In this context,
the search is vital to grant them "civil existence" and make visible that "there are lives that matter" [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-6">
        <title>2.6. Challenges in investigation and search</title>
        <p>
          Families of missing persons often perceive deficiencies in public institutions (the Prosecutor’s Ofice,
Ministry of Government, DINASED), pointing to a lack of support, knowledge, and prejudice. The
24-hour waiting period to file a report is seen as a critical obstacle, as the first few hours are vital.
Frequent changes of prosecutors and investigators, together with disrespectful treatment, re-victimize
the complainants and violate their rights [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. Traditional media often do not prioritize missing persons
cases, which forces families to seek alternative channels [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. While media exposure can help gather
information, it can also negatively influence judicial decisions and the search for the truth [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-7">
        <title>2.7. Tools and strategies for search and visibility</title>
        <p>
          The dissemination of information about missing persons through social media is fast and efective,
especially for minors. Ecuador has implemented programs like the "Alerta Emilia," supported by the
International Centre for Missing &amp; Exploited Children (ICMEC), highlighting social collaboration in the
recovery of children [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ].
        </p>
        <p>
          Complementing this dissemination, the analysis of large volumes of data and predictive models are
crucial [
          <xref ref-type="bibr" rid="ref1 ref23">1, 23</xref>
          ]. Neutrosophic multicriteria analysis is used to handle data uncertainty [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], and platforms
like datosabiertos.gob.ec are relevant sources [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. Additionally, Deep Learning models, such as TextRNN,
predict the locations of missing persons, improving accuracy with the inclusion of oral information [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
To expand sources and improve data quality, databases with economic indicators, social problems, and
humanitarian organizations are used [
          <xref ref-type="bibr" rid="ref1 ref20 ref24">1, 20, 24</xref>
          ].
        </p>
        <p>
          A practical example is the QSSC (Quito Smart Safe City) prototype, a distributed mobile system with
IoT, Crowdsensing, and cloud computing for alerting and gathering evidence. Simulations in Quito
showed an average resolution time of 34.2 minutes, underscoring the importance of citizen collaboration
[
          <xref ref-type="bibr" rid="ref17">17</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-8">
        <title>2.8. International cooperation and human rights</title>
        <p>
          The Inter-American Commission on Human Rights (IACHR) has expressed concern about the persistence
of forced disappearances in Latin America, highlighting the lack of investigation and impunity. The
Inter-American Court of Human Rights (IACtHR) emphasizes the state’s obligation to investigate human
rights violations, as negligence violates the American Convention on Human Rights (ACHR) [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
        </p>
        <p>
          Cases such as Barrios Altos and La Cantuta in Peru illustrate systematic extrajudicial executions and
forced disappearances. The IACtHR’s rulings have been key to invalidating amnesty laws and revoking
pardons, reiterating the state’s obligation to investigate, prosecute, punish, and provide reparations. The
control of conventionality, implemented by the IACtHR and national judges, ensures the conformity of
domestic laws with the ACHR and international law [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ].
        </p>
        <p>
          This overview highlights the complexity of disappearances [
          <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
          ] and state deficiencies, characterized
by a lack of support, knowledge, and prejudice. Ineficiency is evident in the absence of protocols,
frequent staf turnover in prosecutorial and investigative bodies, and the disrespectful treatment that
re-victimizes complainants. Despite this, the resilience of families is notable, with organizations like
ASFADEC providing support [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. Efective cooperation and the use of technology are crucial [
          <xref ref-type="bibr" rid="ref17 ref6">6, 17</xref>
          ].
Prototypes like QSSC demonstrate that citizen collaboration can significantly reduce case resolution
time [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ].
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>
        This study was developed following the CRISP-DM (Cross-Industry Standard Process for Data Mining)
model, a systematic and rigorous approach. All phases, from business understanding to implementation,
were executed using Python to ensure traceability, reproducibility, and eficiency in data processing,
analysis, and modeling [
        <xref ref-type="bibr" rid="ref25 ref26 ref27">25, 26, 27</xref>
        ].
      </p>
      <sec id="sec-3-1">
        <title>3.1. Business understanding</title>
        <p>
          The disappearance of people in Ecuador is a complex problem with a high social, institutional, and
humanitarian impact [
          <xref ref-type="bibr" rid="ref17 ref6 ref7 ref8">7, 8, 6, 17</xref>
          ]. Its visibility has increased due to the rise in cases and pressure from
organizations, media, and families [
          <xref ref-type="bibr" rid="ref18 ref5">5, 18</xref>
          ]. This study aims to generate data-driven intelligence to
support decision-making in the prevention, search, and resolution of cases [
          <xref ref-type="bibr" rid="ref1 ref28 ref8">1, 28, 8</xref>
          ].
        </p>
        <p>
          The problem is broken down into several key sub-problems: Lack of predictive capability to anticipate
the evolution of a case [
          <xref ref-type="bibr" rid="ref1 ref23">1, 23</xref>
          ]. Limited understanding of the factors that influence the duration and
outcome of disappearances [
          <xref ref-type="bibr" rid="ref1 ref2 ref29">1, 29, 2</xref>
          ].The need to identify patterns to distinguish between routine
disappearances and atypical cases [
          <xref ref-type="bibr" rid="ref1 ref12 ref2 ref23">1, 2, 12, 23</xref>
          ]. The absence of analytical tools to optimize resources in
investigation and search eforts [
          <xref ref-type="bibr" rid="ref11 ref23 ref24 ref25">11, 25, 24, 23</xref>
          ].
        </p>
        <p>The research questions guiding the study are: Is it possible to accurately predict the resolution status
(FOUND, MISSING, or DECEASED) of a case based on the initial report? What demographic (age,
gender, ethnicity, nationality), geographic (area, province, canton), and contextual (date, motive, police
sub-circuit) factors are most relevant to the duration and resolution of the case? How can atypical or
anomalous cases requiring specialized attention be identified?</p>
        <p>
          The impact of this study will benefit multiple institutional stakeholders [
          <xref ref-type="bibr" rid="ref23 ref30 ref9">9, 30, 23</xref>
          ]. For law
enforcement, predictive models will ofer early warnings, resource prioritization, and risk profiles [
          <xref ref-type="bibr" rid="ref1 ref2 ref30">1, 2, 30</xref>
          ].
For public policy makers, the findings can inform decisions on prevention, budget allocation, and
protocol design [
          <xref ref-type="bibr" rid="ref2 ref27 ref28 ref6 ref8 ref9">28, 2, 9, 8, 6, 27</xref>
          ]. For civil society and families, data-driven models can improve
transparency, eficiency, and institutional trust, contributing to a more timely and efective response
[
          <xref ref-type="bibr" rid="ref10 ref16 ref17 ref28 ref5 ref6 ref8 ref9">10, 5, 28, 9, 16, 8, 6, 17</xref>
          ], to this urgent social phenomenon [
          <xref ref-type="bibr" rid="ref17 ref8">8, 17</xref>
          ].
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Data understanding</title>
        <p>
          The dataset was obtained from the Open Data portal of Ecuador, specifically from the Ministry of the
Interior [
          <xref ref-type="bibr" rid="ref22 ref6 ref7 ref8">7, 22, 8, 6</xref>
          ]. It includes historical records of missing persons in Ecuador between 2014 and
2024, which guarantees its institutional validity. The main database contains variables such as: zona,
provincia, cantón, distrito, circuito, subcircuito, sexo, nacionalidad, edad_aproximada, rango_edad,
etnia, fecha_localizacion, motivo_desaparicion, motivo_desaparicion_observada, situacion_actual,
dias_solucion, latitud_desaparicion, longitud_desaparicion, fecha_denuncia y fecha_desaparicion. These
variables cover geographic, sociodemographic, and temporal dimensions, facilitating a multifactorial
analysis and serving as a robust foundation for descriptive, inferential, and predictive studies.
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Data preparation</title>
        <p>
          Data preparation is crucial in data science, as it transforms raw data to maximize the performance of
machine learning algorithms [
          <xref ref-type="bibr" rid="ref23 ref31 ref32 ref33">31, 32, 33, 23</xref>
          ]. In this study, the data underwent a series of systematic
transformations (cleaning, structuring, and enrichment) using Python scripts to ensure the replicability
and traceability of the workflow [
          <xref ref-type="bibr" rid="ref25 ref27 ref32 ref33 ref34 ref35">25, 34, 35, 32, 33, 27</xref>
          ].
        </p>
        <sec id="sec-3-3-1">
          <title>3.3.1. Initial cleaning and field name translation</title>
          <p>A Python script was developed to standardize the dataset, systematically translating column names
from Spanish to English based on the oficial data dictionary. The headers of the input file (1.csv), which
had an irregular structure with column names in the second row, were reconfigured, and irrelevant rows
and columns were removed. Then, a mapping dictionary was applied to translate key fields such as zona,
sexo, edad, motivo_desaparicion, and estado_desaparecido to zone, gender, age, disappearance_motive,
and current_status, respectively. The resulting file, 1_translated.csv, has standardized field names for
compatibility with international analysis and modeling tools while preserving data integrity.</p>
        </sec>
        <sec id="sec-3-3-2">
          <title>3.3.2. Mapping and standardization of categorical values</title>
          <p>After standardizing column names to English, the values of the categorical variables were homogenized.
Minor inconsistencies were identified in the original Spanish values (e.g., "ENCONTRADO" and
"ENCONTRADA"). To ensure consistency and prepare the data for international analysis and modeling, a
comprehensive mapping was implemented.</p>
          <p>First, an inspection script (03-mapeo-cat.py) extracted and counted the unique values of categorical
columns such as gender, ethnicity, nationality, current_status, age_range, and the disappearance motives
(disappearance_motive, observed_motive). This analysis allowed for the creation of a "translation
dictionary" (a Python dictionary named translation_maps).</p>
          <p>Subsequently, this dictionary was applied to translate and standardize categories; for example,
’HOMBRE’ and ’MUJER’ were mapped to ’Male’ and ’Female’, and ’MESTIZO/A’ and ’INDIGENA’ were
mapped to ’Mestizo’ and ’Indigenous’. This process also handled null values, assigning them ’N/A’ (Not
Applicable/Not Available) for consistent handling.</p>
        </sec>
        <sec id="sec-3-3-3">
          <title>3.3.3. Integrated pipeline for exploratory processing and analysis</title>
          <p>An automated Python pipeline (full_pipeline_with_export.py) was developed to integrate cleaning,
exploratory analysis, feature engineering, and final preparation for modeling. The process began
with cleaning and transforming data types, ensuring the dataset’s structural integrity by converting
geographical coordinates to floats and standardizing missing data to np.nan. Date columns were
converted to datetime objects for temporal analysis.</p>
          <p>Next, an Exploratory Data Analysis (EDA) was performed to validate the data quality and
quantitatively understand the phenomenon. Descriptive analysis of numerical variables showed an average age
of disappearance of 22.9 years. The high completeness of the dataset, with only 2.8% of missing data
in key columns like location_date and days_to_resolution, justified not applying complex imputation.
Frequency distributions of the categorical variables revealed important demographic findings: The most
afected age group is Adolescents (34,495 cases; 50.7%), followed by Adults (26,710; 39.2%), highlighting
the vulnerability of the young population (Figure 1). A notable gender disparity was found, with 43,090
reported cases of women compared to 24,982 of men. Regarding case resolution, 94.0% were "Found,"
3.1% "Deceased," and 2.8% remain "Missing."</p>
          <p>The spatial and temporal dimensions were investigated through visualizations. The analysis of the
annual time series of reports (Figure 2) shows a growing trend of cases over the decade, with a peak in
2023. For the spatial analysis, an interactive map (Figure 3) identified clusters of high incidence in the
country’s densest urban areas.</p>
          <p>The feature engineering phase enriched the dataset with predictive variables. The
disappearance_duration_days (time elapsed until location) and report_delay_days (delay in reporting the case)
were calculated. Temporal features such as day_of_week and disappearance_quarter were extracted to
capture cyclical patterns.</p>
          <p>Finally, the dataset was prepared for modeling by transforming all features into a numerical
format. One-Hot Encoding was applied to nominal categorical variables (gender, ethnicity) and ordinal
encoding to age_range. All numerical features underwent standard scaling (StandardScaler) to
normalize their distributions, thus optimizing the performance of the machine learning algorithms. The
processed DataFrame was exported to 2_data_processed_for_modeling.csv, which served as the direct
and standardized input for the Modeling phase.</p>
        </sec>
        <sec id="sec-3-3-4">
          <title>3.3.4. Handling Class Imbalance with SMOTE</title>
          <p>The exploratory data analysis revealed a severe class imbalance, with over 94% of cases belonging to the
"Found" category. To address this issue, which hinders the model’s ability to learn from minority classes
(’Missing’ and ’Deceased’), the SMOTE (Synthetic Minority Over-sampling Technique) was applied.
This technique was implemented only on the training dataset to prevent data leakage, generating
synthetic samples for the minority classes to create a balanced class distribution. This step is crucial for
training a robust classifier capable of identifying high-risk cases.</p>
        </sec>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Modeling</title>
        <p>
          The modeling phase focused on two computational tasks: classification and anomaly detection [
          <xref ref-type="bibr" rid="ref14 ref2 ref25 ref27 ref32">14, 25,
2, 32, 27</xref>
          ]. Multiple algorithms were evaluated for each task. The processed dataset was split into a
70% training set and a 30% test set [
          <xref ref-type="bibr" rid="ref1 ref11 ref12 ref2 ref33 ref36 ref37">1, 11, 36, 2, 37, 12, 33</xref>
          ], using stratification on the target variable for
classification to ensure a proportional representation of all classes [
          <xref ref-type="bibr" rid="ref2 ref25 ref32">25, 2, 32</xref>
          ].
        </p>
        <sec id="sec-3-4-1">
          <title>3.4.1. Classification models: Predicting the final status</title>
          <p>The main objective was to predict the final status of a case: "Found," "Missing," or "Deceased." After
applying SMOTE to the training data, Random Forest, XGBoost, and a Multilayer Perceptron (MLP)
were evaluated. The XGBoost model was selected as it ofered the best balance between performance
and interpretability, achieving a macro-averaged recall of 0.51. This indicates a much more balanced
and practically useful model, correctly identifying 28% of "Missing" cases and 33% of "Deceased" cases
in the test set.</p>
          <p>While the overall accuracy is slightly lower than in the unbalanced model, the macro average recall
(which measures the average recall across all classes) improved dramatically, from 0.36 to 0.51 for the
XGBoost model. This indicates a much more balanced and practically useful model. The XGBoost model
correctly identified 28% of "Missing" cases and 33% of "Deceased" cases in the test set, a substantial
improvement over the pre-correction performance.</p>
        </sec>
        <sec id="sec-3-4-2">
          <title>3.4.2. Anomaly detection</title>
          <p>Anomaly detection was performed using an Isolation Forest model to identify atypical cases. A
descriptive analysis of these anomalies revealed that they are predominantly driven by family-related motives
(representing 55% of the anomalies), frequently involve individuals with disabilities or illnesses, and
exhibit highly variable and prolonged resolution times, with a maximum recorded delay of 1812 days.</p>
        </sec>
        <sec id="sec-3-4-3">
          <title>3.4.3. Model selection and storage</title>
          <p>The XGBoost Classifier was selected as the best-performing model due to its superior balance in recall
across all classes after the application of SMOTE to address the severe class imbalance. This improved
model was saved as best_classification_model.joblib for the subsequent evaluation phase.</p>
        </sec>
      </sec>
      <sec id="sec-3-5">
        <title>3.5. Evaluation</title>
        <p>The evaluation phase focused on measuring the performance of the selected XGBoost model and
interpreting the factors driving its predictions. The performance of the model was evaluated on the test
set, yielding an overall accuracy of 88.1%. While this metric is slightly lower than that of the unbalanced
model, a more granular analysis demonstrates the model’s superior practical utility. The Confusion
Matrix and detailed Classification Report confirmed the model’s enhanced capability to detect critical
minority classes, achieving a recall of 0.28 for "Missing" and 0.33 for "Deceased". This represents a
substantial improvement in identifying high-risk cases, directly addressing the primary limitation of
the initial models.</p>
        <p>Additionally, the model’s distinction capability was evaluated using Receiver Operating Characteristic
(ROC) Curves and the calculation of the Area Under the Curve (AUC), using a one-vs-rest approach.
This technique provides a performance measure that is insensitive to class imbalance, ofering a more
reliable view of the classifier’s discriminative efectiveness.</p>
        <p>For model interpretability, the SHAP technique was employed. This advanced method quantified the
contribution of each feature to the model’s predictions. The analysis was expanded to all three outcome
classes (’Found’, ’Missing’, and ’Deceased’) to identify the distinct factors influencing each status. This
comprehensive approach allowed for a deeper understanding of how variables like reporting delay
and age impact the likelihood of each specific outcome, providing valuable insights for operational
decision-making. The detailed results of these evaluations (including the Confusion Matrix, ROC Curves,
and SHAP plots) will be presented in Section 4</p>
        <p>These findings have direct operational implications. The identified risk factors, particularly reporting
delays and age, can be embedded into institutional early-warning dashboards, enabling law enforcement
to prioritize high-risk cases in real time. By integrating the predictive model into decision-support
systems, agencies such as DINASED could generate alerts that guide resource allocation, accelerate
search protocols, and ultimately improve the efectiveness and timeliness of responses to disappearances.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <p>This section presents the empirical findings of the study, derived from the exploratory analysis and the
execution of predictive models on the dataset of missing persons in Ecuador between 2014 and 2024.
The results are presented through a descriptive analysis, the evaluation of model performance, and an
interpretation of the most influential factors.</p>
      <sec id="sec-4-1">
        <title>4.1. Findings from Exploratory Data Analysis (EDA)</title>
        <p>The initial analysis of the 68,072 records revealed significant demographic, temporal, and geographic
patterns.</p>
        <sec id="sec-4-1-1">
          <title>4.1.1. Demographic profile</title>
          <p>The gender analysis shows a marked preponderance of cases involving women, who account for
63.3% of the total (43,090 cases), compared to 36.7% for men (24,982 cases). Regarding the age group,
adolescents (12-17 years old) are the most vulnerable, accumulating 50.7% of the reports (34,495 cases),
followed by adults (26,710 cases, 39.2%). The average age of a missing person is approximately 23 years.
The predominant ethnicity is Mestizo, representing 86.7% (59,028 cases) of the records with ethnic
information. Regarding nationality, most cases correspond to Ecuadorian citizens (96.6%), although
there is a significant presence of Venezuelans (1,156 cases) and Colombians (640 cases), reflecting
migratory dynamics.</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>4.1.2. Motives and resolution status</title>
          <p>The most common disappearance motive is "Family Reasons," accounting for 69.6% of cases. It is crucial
to note that the vast majority of cases (94.1%) are resolved with the person being "Found." However,
2.8% of cases (1,907 records) remained in a "Missing" status at the time of data extraction, representing
the core of the persistent disappearances problem.</p>
        </sec>
        <sec id="sec-4-1-3">
          <title>4.1.3. Temporal and geospatial patterns</title>
          <p>The temporal analysis (Figure 4) shows a fluctuating dynamic in the number of annual reports, with
the highest levels between 2017 and 2019, exceeding 10,000 reports in that period. Since 2020, there
has been a significant decrease, followed by a partial recovery, without reaching the initial peaks. The
monthly analysis did not reveal a clear seasonal pattern.</p>
          <p>The geospatial analysis indicates a non-uniform distribution. The provinces of Pichincha and Guayas
are the main "hot spots," with the highest density of incidents, which is consistent with their high
population density.</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Predictive model performance after addressing class imbalance</title>
        <p>To overcome the severe class imbalance identified in the EDA, the SMOTE technique was applied to
the training data. The re-evaluated XGBoost classifier was selected as the best-performing model,
achieving an overall accuracy of 88.14% on the test set. The macro-averaged metrics were a precision
of 0.44, a recall of 0.51, and an F1-score of 0.46. While the overall accuracy is slightly lower than that
of the unbalanced model, the significant increase in macro-averaged recall demonstrates a superior
ability to identify the critical minority classes. The confusion matrix (Figure 5) visually confirms this
enhancement.</p>
        <p>The model now correctly identifies 28% of "Missing" cases (158 instances) and 33% of "Deceased" cases
(212 instances), a substantial improvement that makes the model practically useful for risk assessment.
Furthermore, the model’s discriminative ability is confirmed by the high Area Under the Curve (AUC)
scores for all classes, as shown in the ROC curves (Figure 6).</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Model interpretability and risk factor analysis (SHAP)</title>
        <p>To understand the key factors driving the model’s predictions, a SHAP analysis was conducted for all
three outcome classes. This approach reveals distinct risk profiles for each status by quantifying the
impact of each feature on the final prediction (Figure 7).</p>
        <p>The SHAP analysis reveals distinct risk profiles for each outcome. The variable report_delay_days
(a) Found</p>
        <p>(b) Missing
(c) Deceased
emerges as a critical predictor with a dual role: low values strongly predict a ’Found’ outcome, indicating
that cases reported quickly are more likely to be resolved successfully. Conversely, high values for the
same feature are the strongest predictor that a case will remain ’Missing’. For the ’Deceased’ class,
however, the dominant factor shifts to approximate_age, where older individuals who go missing
have a markedly higher probability of a fatal outcome. These findings provide actionable intelligence,
underscoring the critical importance of immediate reporting and suggesting that age should be a primary
factor in assessing the risk level of a missing person’s case.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4. Analysis of atypical cases</title>
        <p>The Isolation Forest algorithm was applied to the dataset, identifying 671 records (0.99%) as atypical. A
descriptive analysis of these cases revealed distinct characteristics compared to the general population.
The most frequent motive among anomalies was “Family Reasons,” and these cases exhibited highly
variable and prolonged resolution times, with a maximum recorded delay of 1,812 days. This suggests
that while the primary model captures general trends, these anomalous cases represent unique dynamics
or potential data entry issues that warrant separate, targeted investigation.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions</title>
      <p>This study addressed the complex phenomenon of missing persons in Ecuador (2014–2024), developing
a predictive framework to support institutional decision-making. The main contribution lies in the
creation and evaluation of a comprehensive analytical pipeline, from data cleaning and enrichment
to the development of classification and anomaly detection models. It generated the first predictive
models for this context, quantifying underlying patterns and identifying both the potential and the
limitations of using administrative data to model a complex social problem.</p>
      <p>The findings are multifaceted. On one hand, the exploratory analysis of 68,072 records confirmed key
demographic and temporal patterns: adolescents (50.7%) and women (63.3%) are the most vulnerable
groups, with most cases concentrated in Pichincha and Guayas. Although 94.0% of cases are resolved
with the person found, a persistent 2.8% remain unresolved, representing the central challenge. On
the other hand, the results of predictive modeling reveal progress and challenges. After addressing
the severe class imbalance with the SMOTE technique, the XGBoost classifier achieved an accuracy of
88.1% and, more importantly, a substantial improvement in the detection of minority outcomes, with
recall values of 0.28 for “Missing” and 0.33 for “Deceased.” These results demonstrate that predictive
analytics can support risk assessment and provide actionable intelligence, even when accuracy alone
may appear limited.</p>
      <p>Beyond predictive performance, the study highlights valuable practical implications. The SHAP
interpretability analysis revealed distinct risk profiles: short reporting delays strongly predict a “Found”
outcome, long delays increase the likelihood of remaining “Missing,” and older age is the dominant
predictor of a “Deceased” outcome. These findings can be operationalized into early-warning dashboards
that prioritize high-risk cases in real time, guiding resource allocation, accelerating search protocols,
and improving the timeliness of institutional responses. Likewise, anomaly detection using Isolation
Forest identified a small but significant set of atypical cases that difer from the general population,
often involving extreme ages, uncommon motives, or unusually prolonged resolution times, flagging
records that merit specialized investigation. For public policy makers, these results underscore the
importance of awareness campaigns on the immediate reporting of disappearances, especially involving
minors, and the need to strengthen institutional protocols for exceptional cases.</p>
      <p>Nevertheless, this study also faced limitations. Although class imbalance was mitigated with SMOTE,
recall levels for the most critical outcomes remain modest, indicating the need for further
improvements through ensemble methods or cost-sensitive learning. In addition, regression models predicting
resolution time proved unfeasible, reflecting the insuficiency of the current feature space. The dataset,
while oficial and authoritative, lacks contextual variables such as investigative efort, socioeconomic
conditions, media coverage, or a history of violence, factors that likely exert a decisive influence on the
duration and outcome of cases.</p>
      <p>In conclusion, this study establishes a fundamental quantitative baseline for the analysis of missing
persons in Ecuador. It demonstrates that while directly predicting the most tragic outcomes remains a
formidable challenge, machine learning approaches, especially those incorporating interpretability and
anomaly detection, can extract valuable and actionable insights even from imperfect data. The potential
of these tools to transform institutional responses is undeniable, but their full realization will depend
on a sustained commitment to improving data collection, integrating richer contextual variables, and
deploying predictive models into operational systems. This work is not an endpoint but a call to action
to strengthen the analytical capabilities of the State and provide faster, more efective responses to one
of the country’s most urgent social problems.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Future work</title>
      <p>Based on the findings and limitations of this study, future research will focus on three strategic
axes. First, although the implementation of SMOTE significantly improved the detection of minority
outcomes (“Missing” and “Deceased”), recall levels remain modest. Future work should therefore explore
complementary strategies such as cost-sensitive learning, ensemble methods, or temporal validation
schemes to further strengthen model robustness. Second, to overcome the infeasibility of predicting
resolution time, it will be essential to enrich the current dataset with external contextual variables, such
as indicators of investigative efort, media coverage, and socioeconomic factors, which could explain a
greater proportion of the variance. Third, the analysis of existing data can be deepened by adopting a
mixed-methods approach, combining quantitative modeling with qualitative evaluation of anomalous
cases, and by applying Natural Language Processing (NLP) techniques to extract latent features from
text fields. These combined eforts aim to develop a more robust predictive framework with greater
operational utility for institutions responsible for the search and prevention of disappearances in
Ecuador.</p>
    </sec>
    <sec id="sec-7">
      <title>Declaration on Generative AI</title>
      <p>The authors have not employed any Generative AI tools.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>D.</given-names>
            <surname>Backer</surname>
          </string-name>
          , T. Billing,
          <article-title>Forecasting the prevalence of child acute malnutrition using environmental and conflict conditions as leading indicators</article-title>
          ,
          <source>World Development</source>
          <volume>176</volume>
          (
          <year>2024</year>
          )
          <fpage>106484</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Luo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Xiong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Guan</surname>
          </string-name>
          ,
          <article-title>Predicting the locations of missing persons in china by using ngo data and deep learning techniques</article-title>
          ,
          <source>International Journal of Digital Earth</source>
          <volume>17</volume>
          (
          <year>2024</year>
          )
          <fpage>2304076</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>G. R. T.</given-names>
            <surname>Paredes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. E. R.</given-names>
            <surname>Barreto</surname>
          </string-name>
          ,
          <article-title>El rol del estado frente a las desapariciones forzadas de personas en ecuador</article-title>
          ,
          <source>Revista Metropolitana de Ciencias Aplicadas</source>
          <volume>5</volume>
          (
          <year>2022</year>
          )
          <fpage>229</fpage>
          -
          <lpage>238</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M. A. P.</given-names>
            <surname>Baldeón</surname>
          </string-name>
          , Proceso de investigación de personas desaparecidas en ecuador,
          <source>INNOVACIÓN &amp; SABER</source>
          <volume>2</volume>
          (
          <year>2021</year>
          )
          <fpage>68</fpage>
          -
          <lpage>75</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>H. R. A.</given-names>
            <surname>Ortiz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. A. P.</given-names>
            <surname>Llerena</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. E. V.</given-names>
            <surname>López</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A. C.</given-names>
            <surname>Pulloquinga</surname>
          </string-name>
          ,
          <article-title>Las desapariciones en ecuador más allá de estadísticas, familias resilientes, inacción estatal</article-title>
          , Revista Científica y Arbitrada de Ciencias Sociales y Trabajo Social: Tejedora. ISSN:
          <fpage>2697</fpage>
          -
          <lpage>3626</lpage>
          7 (
          <year>2024</year>
          )
          <fpage>166</fpage>
          -
          <lpage>182</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J. Vera</given-names>
            <surname>Macias</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. E. Alvarado</given-names>
            <surname>Veliz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Arguello</given-names>
            <surname>Arguello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. J.</given-names>
            <surname>Cevallos</surname>
          </string-name>
          <string-name>
            <surname>Torres</surname>
          </string-name>
          ,
          <article-title>Neutrosophic multicriteria analysis of missing people in ecuador: An open data approach for the period 2024</article-title>
          .,
          <string-name>
            <given-names>Neutrosophic</given-names>
            <surname>Computing</surname>
          </string-name>
          &amp;
          <source>Machine Learning</source>
          <volume>37</volume>
          (
          <year>2025</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M. J. Castillo</given-names>
            <surname>Román</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. C.</given-names>
            <surname>Espinosa Torres</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>González Arias</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. C.</given-names>
            <surname>Giler Zambrano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. L. Gallegos</given-names>
            <surname>Pilligua</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. F.</given-names>
            <surname>Robayo Cabrera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Quiroz</surname>
          </string-name>
          <string-name>
            <surname>Martinez</surname>
          </string-name>
          ,
          <article-title>Examination of disappeared individuals in ecuador using orange data mining</article-title>
          .,
          <source>Neutrosophic Computing &amp; Machine Learning</source>
          <volume>37</volume>
          (
          <year>2025</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>E. X.</given-names>
            <surname>Quito Recalde</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Pando Farez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. A.</given-names>
            <surname>Riofrio Núñez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. D.</given-names>
            <surname>Ramírez Carrión</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L. Pauta</given-names>
            <surname>Riera</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
          </string-name>
          <article-title>Rumbaut Rangel, Patterns and factors determinants in cases of missing persons in ecuador</article-title>
          .,
          <source>Neutrosophic Computing &amp; Machine Learning</source>
          <volume>37</volume>
          (
          <year>2025</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>G.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Construction and path of urban public safety governance and crisis management optimization model integrating artificial intelligence technology</article-title>
          ,
          <source>Sustainability</source>
          <volume>15</volume>
          (
          <year>2023</year>
          )
          <fpage>7487</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>C.</given-names>
            <surname>Aldemir</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Uçma</surname>
          </string-name>
          <string-name>
            <surname>Uysal</surname>
          </string-name>
          ,
          <article-title>Artificial intelligence for financial accountability and governance in the public sector: Strategic opportunities and challenges</article-title>
          ,
          <source>Administrative Sciences</source>
          <volume>15</volume>
          (
          <year>2025</year>
          )
          <fpage>58</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>G. M.</given-names>
            <surname>Campedelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bartulovic</surname>
          </string-name>
          ,
          <string-name>
            <surname>K. M. Carley</surname>
          </string-name>
          ,
          <article-title>Learning future terrorist targets through temporal meta-graphs</article-title>
          ,
          <source>Scientific reports 11</source>
          (
          <year>2021</year>
          )
          <fpage>8533</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Qu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Tran</surname>
          </string-name>
          , W. Ma,
          <article-title>Deep learning approach to biogeographical ancestry inference</article-title>
          ,
          <source>Procedia Computer Science</source>
          <volume>159</volume>
          (
          <year>2019</year>
          )
          <fpage>552</fpage>
          -
          <lpage>561</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>E.</given-names>
            <surname>Aiken</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bellue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Karlan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Udry</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. E.</given-names>
            <surname>Blumenstock</surname>
          </string-name>
          ,
          <article-title>Machine learning and phone data can improve targeting of humanitarian aid</article-title>
          ,
          <source>Nature</source>
          <volume>603</volume>
          (
          <year>2022</year>
          )
          <fpage>864</fpage>
          -
          <lpage>870</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>I. D.</given-names>
            <surname>Apostolopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. P.</given-names>
            <surname>Groumpos</surname>
          </string-name>
          ,
          <article-title>Fuzzy cognitive maps: Their role in explainable artificial intelligence</article-title>
          ,
          <source>Applied Sciences</source>
          <volume>13</volume>
          (
          <year>2023</year>
          )
          <fpage>3412</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>A.</given-names>
            <surname>Karatzogianni</surname>
          </string-name>
          ,
          <article-title>Research design for an integrated artificial intelligence ethical framework</article-title>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>C. F.</given-names>
            <surname>Luo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Bhambhoria</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dahan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <article-title>Prototype-based interpretability for legal citation prediction</article-title>
          ,
          <source>arXiv preprint arXiv:2305.16490</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>A.</given-names>
            <surname>Zambrano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Ortiz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Calderón</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zambrano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Urquiza-Aguiar</surname>
          </string-name>
          ,
          <article-title>Sistema de localización de personas desaparecidas basado en iot y cloud computing (caso de estudio: Quito)</article-title>
          ,
          <string-name>
            <surname>Revista Ibérica De Sistemas e Tecnologias De Informação</surname>
          </string-name>
          (
          <year>2019</year>
          )
          <fpage>82</fpage>
          -
          <lpage>94</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>L. L. E.</given-names>
            <surname>Berru</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. Y. V.</given-names>
            <surname>Loaiza</surname>
          </string-name>
          , Prescripción de acción en el delito de desaparición de
          <article-title>personas en el código orgánico integral penal</article-title>
          ,
          <source>Revista Científica Cultura, Comunicación y Desarrollo</source>
          <volume>8</volume>
          (
          <year>2023</year>
          )
          <fpage>250</fpage>
          -
          <lpage>260</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>G.</given-names>
            <surname>Gatti</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Irazuzta</surname>
          </string-name>
          ,
          <article-title>Diario de la desaparición mexicana. entre el precedente y el exceso</article-title>
          ,
          <source>Disparidades. Revista de Antropología</source>
          <volume>74</volume>
          (
          <year>2019</year>
          )
          <fpage>e019</fpage>
          -
          <lpage>e019</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>A.</given-names>
            <surname>Beduschi</surname>
          </string-name>
          ,
          <article-title>Harnessing the potential of artificial intelligence for humanitarian action: Opportunities and risks</article-title>
          ,
          <source>International Review of the Red Cross</source>
          <volume>104</volume>
          (
          <year>2022</year>
          )
          <fpage>1149</fpage>
          -
          <lpage>1169</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>A.</given-names>
            <surname>Serranò</surname>
          </string-name>
          ,
          <article-title>El derecho de acceso a la justicia de las víctimas de violaciones de derechos humanos en perú: la corte interamericana de derechos humanos ante el indulto de alberto fujimori</article-title>
          , Anuario iberoamericano de justicia constitucional (
          <year>2021</year>
          )
          <fpage>485</fpage>
          -
          <lpage>511</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>M. P. D. Masache</surname>
          </string-name>
          ,
          <article-title>La vulneración del derecho a la verdad por exposición mediática en la investigación de casos de personas naturales reportadas como desaparecidas</article-title>
          ,
          <source>Debate Jurídico Ecuador</source>
          <volume>4</volume>
          (
          <year>2021</year>
          )
          <fpage>110</fpage>
          -
          <lpage>121</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>T.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , M. Lauras, G. Zacharewicz,
          <string-name>
            <given-names>S.</given-names>
            <surname>Rabah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Benaben</surname>
          </string-name>
          ,
          <article-title>Coupling simulation and machine learning for predictive analytics in supply chain management</article-title>
          ,
          <source>International Journal of Production Research</source>
          <volume>62</volume>
          (
          <year>2024</year>
          )
          <fpage>8397</fpage>
          -
          <lpage>8414</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kjaerum</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. S.</given-names>
            <surname>Madsen</surname>
          </string-name>
          ,
          <article-title>Pushing the boundaries of anticipatory action using machine learning</article-title>
          ,
          <source>Data &amp; Policy</source>
          <volume>7</volume>
          (
          <year>2025</year>
          )
          <article-title>e8</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>C.</given-names>
            <surname>Christensen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Wagner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Langhals</surname>
          </string-name>
          ,
          <article-title>Year-independent prediction of food insecurity using classical and neural network machine learning methods</article-title>
          ,
          <source>Ai</source>
          <volume>2</volume>
          (
          <year>2021</year>
          )
          <fpage>244</fpage>
          -
          <lpage>260</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>A.</given-names>
            <surname>Fischmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. Z.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <article-title>Emotion regulation and cheap talk as signaling strategies: Evidence from crowdfunding for ukraine</article-title>
          ,
          <source>Telematics and Informatics</source>
          <volume>87</volume>
          (
          <year>2024</year>
          )
          <fpage>102086</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>H.</given-names>
            <surname>Wilde</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. L.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nguyen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Kimpel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Sidgwick</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. De Unanue</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Veronese</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Mateen</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Ghani</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Vollmer</surname>
          </string-name>
          ,
          <article-title>A recommendation and risk classification system for connecting rough sleepers to essential outreach services</article-title>
          ,
          <source>Data &amp; Policy</source>
          <volume>3</volume>
          (
          <year>2021</year>
          )
          <article-title>e2</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>C. B. Casagran</surname>
          </string-name>
          , G. Stavropoulos,
          <article-title>Developing ai predictive migration tools to enhance humanitarian support: The case of eumigratool, Data &amp; policy 6 (2024) e64</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>A.</given-names>
            <surname>Berman</surname>
          </string-name>
          ,
          <string-name>
            <surname>K. de Fine Licht</surname>
          </string-name>
          , V. Carlsson,
          <article-title>Trustworthy ai in the public sector: An empirical analysis of a swedish labor market decision-support system</article-title>
          ,
          <source>Technology in Society 76</source>
          (
          <year>2024</year>
          )
          <fpage>102471</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>W.</given-names>
            <surname>Zhai</surname>
          </string-name>
          ,
          <article-title>A multi-level analytic framework for disaster situational awareness using twitter data</article-title>
          ,
          <source>Computational Urban Science</source>
          <volume>2</volume>
          (
          <year>2022</year>
          )
          <fpage>23</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>S.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. G.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <article-title>Attribute of big data analytics quality afecting business performance</article-title>
          ,
          <source>Journal of Social Computing</source>
          <volume>4</volume>
          (
          <year>2023</year>
          )
          <fpage>357</fpage>
          -
          <lpage>381</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>V.</given-names>
            <surname>Shevchenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lukashevich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Taniushkina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bulkin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Grinis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Kovalev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Narozhnaia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Sotiriadi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Krenke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Maximov</surname>
          </string-name>
          ,
          <article-title>Climate change impact on agricultural land suitability: An interpretable machine learning-based eurasia case study</article-title>
          ,
          <source>IEEE Access 12</source>
          (
          <year>2024</year>
          )
          <fpage>15748</fpage>
          -
          <lpage>15763</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>L. K.</given-names>
            <surname>Topham</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Khan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Al-Jumeily</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Waraich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. J.</given-names>
            <surname>Hussain</surname>
          </string-name>
          ,
          <article-title>Gait identification using limb joint movement and deep machine learning</article-title>
          ,
          <source>IEEE access 10</source>
          (
          <year>2022</year>
          )
          <fpage>100113</fpage>
          -
          <lpage>100127</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>S.</given-names>
            <surname>Majumdar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Awasthi</surname>
          </string-name>
          ,
          <article-title>From vulnerability to resilience: securing public safety gps and location services with smart radio, blockchain, and ai-driven adaptability</article-title>
          ,
          <source>Electronics</source>
          <volume>14</volume>
          (
          <year>2025</year>
          )
          <fpage>1207</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>S. H.</given-names>
            <surname>Mohammed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Al-Jumaily</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. S. J.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. P. G.</given-names>
            <surname>Jiménez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. S.</given-names>
            <surname>Jaber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y. S.</given-names>
            <surname>Hussein</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. M. A. K. Al-Najjar</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Al-Jumeily</surname>
          </string-name>
          ,
          <article-title>A review on the evaluation of feature selection using machine learning for cyber-attack detection in smart grid</article-title>
          ,
          <source>Ieee Access</source>
          <volume>12</volume>
          (
          <year>2024</year>
          )
          <fpage>44023</fpage>
          -
          <lpage>44042</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>F.</given-names>
            <surname>Checchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Frison</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Warsame</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. T.</given-names>
            <surname>Abebe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Achen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. A.</given-names>
            <surname>Ategbo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Ayoya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Kassim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Ndiaye</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Nyawo</surname>
          </string-name>
          ,
          <article-title>Can we predict the burden of acute malnutrition in crisis-afected countries? ifndings from somalia and south sudan</article-title>
          ,
          <source>BMC nutrition 8</source>
          (
          <year>2022</year>
          )
          <fpage>92</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kooshki Forooshani</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. Van Den Homberg</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Kalimeri</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Kaltenbrunner</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Mejova</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Milano</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Ndirangu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Paolotti</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Teklesadik</surname>
            ,
            <given-names>M. L.</given-names>
          </string-name>
          <string-name>
            <surname>Turner</surname>
          </string-name>
          ,
          <article-title>Towards a global impact-based forecasting model for tropical cyclones</article-title>
          ,
          <source>Natural Hazards and Earth System Sciences</source>
          <volume>24</volume>
          (
          <year>2024</year>
          )
          <fpage>309</fpage>
          -
          <lpage>329</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>