<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Web-Oriented Application for Student Attendance Accounting with a Module for Automatic Parsing of Class Schedules*</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Yurii Huk</string-name>
          <email>yuriiguk529@gmail.com</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Libor Dostalek</string-name>
          <email>libor.dostalek@fit.cvut.cz</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jan Owedyk</string-name>
          <email>j.owedyk@kpsw.edu.pl</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hamlet Harutyunyan</string-name>
          <email>h.harutyunyan@wunu.edu.ua</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andriy Yushko</string-name>
          <email>a.yushko@wunu.edu.ua</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Systems, Czech Technical University in Prague.</institution>
          ,
          <addr-line>Prague</addr-line>
          ,
          <country country="CZ">Czech Republic</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Fundamental Disciplines Yerevan Educational and Scientific Institute,</institution>
          ,
          <addr-line>Yerevan</addr-line>
          ,
          <country>Republic of Armenia</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Department of Informatics Kujawy and Pomorze University in Bydgoszcz</institution>
          ,
          <addr-line>Bydgoszcz</addr-line>
          ,
          <country country="PL">Poland</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>West Ukrainian National University</institution>
          ,
          <addr-line>11 Lvivska Str., Ternopil, 46009</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The automation of academic schedule parsing and student attendance tracking is crucial for modern educational institutions. Traditional methods relying on rule-based Excel parsing are prone to errors and lack adaptability to unstructured formats. This study presents a comparative analysis of different methods for parsing academic schedules, focusing on the development of an automated system that utilizes AI-driven approaches for intelligent data extraction. A comparative analysis of these methodologies provides insights into their practical applicability and suggests optimal strategies for integrating AI into education.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Artificial intelligence</kwd>
        <kwd>machine learning</kwd>
        <kwd>schedule parsing</kwd>
        <kwd>deep learning</kwd>
        <kwd>data extraction</kwd>
        <kwd>natural language processing 1</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The rapid development of artificial intelligence (AI) and machine learning (ML) has led to their
widespread integration into various domains, including education. The integration of artificial
intelligence (AI) into educational institutions has become increasingly essential, as it enhances
administrative efficiency and minimizes human error in routine processes. One of the most critical
aspects of academic management is student attendance tracking, which requires accurate and timely
extraction of class schedule data. Traditional methods rely heavily on manual input or rule-based
algorithms for parsing structured Excel tables, which often fail when dealing with unstructured or
scanned schedule formats. As educational institutions continue to digitize their workflows, there is a
growing need for intelligent systems capable of automating schedule parsing and attendance tracking
with high accuracy and adaptability.</p>
      <p>This study explores three distinct methods for extracting and structuring class schedules: (1) a
conventional rule-based Excel parsing approach, (2) a computer vision-driven solution using
TensorFlow for image recognition, and (3) an AI-powered natural language processing (NLP)
technique that processes textual data. The primary objective is to compare the effectiveness of these
approaches in handling different schedule formats, optimizing automation, and reducing the
dependency on manual corrections. The research focuses on developing a web-based system that
leverages AI to transform schedule data into a structured format suitable for automated attendance
tracking.</p>
      <p>AI-driven solutions offer the potential to significantly improve data processing reliability and
efficiency. The application of deep learning models trained on schedule images can enhance text
recognition, making it possible to extract meaningful information even from scanned or poorly
formatted documents. Additionally, NLP techniques allow for intelligent interpretation of textual
data, enabling systems to understand variations in schedule formats and structure them into
standardized outputs. This study presents a comparative analysis of these methods, highlighting their
strengths, limitations, and practical implications for real-world implementation.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Methods of data mining</title>
      <p>
        The process of extracting and structuring academic schedules is a critical component of student
attendance tracking systems. Traditional approaches rely on predefined rules for parsing structured
Excel tables, which can be effective when dealing with well-formatted data but often fail when
schedules are unstructured, scanned, or contain inconsistencies [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. To overcome these limitations,
AI-driven techniques such as computer vision and natural language processing (NLP) offer more
adaptable and intelligent solutions, allowing for the automated recognition and interpretation of
schedule data in various formats [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>This section explores three methodologies for academic schedule parsing: direct rule-based
extraction from Excel files, a machine learning-based computer vision approach that processes images
of schedules using TensorFlow, and an NLP-powered method that structures textual schedule data
into a standardized format. Each approach is analyzed in terms of its algorithmic workflow,
implementation challenges, and practical applicability in educational settings. The goal is to evaluate
their accuracy, flexibility, and efficiency to determine the most effective solution for automated
schedule parsing and attendance tracking.</p>
      <sec id="sec-2-1">
        <title>2.1. Direct Rule-Based Extraction from Excel Files</title>
        <p>Traditional schedule parsing methods rely on extracting data from structured Excel files using
predefined rules. This method assumes that the schedule format remains constant, with data fields
occupying fixed positions within the table. Implementation of this method does not require excessive
writing of program code listing and voluminous backend architecture. The easiest method provides
a common format of tables, their clear structure, which should be followed, namely (Figure 1): clear
naming by headers of each column and corresponding values under them in the same format.
# Function to convert Excel to JSON
def convert_excel_to_json(excel_file, output_file):
# Reading an Excel file
df = pd.read_excel(excel_file)
# Converting to JSON format
json_data = df.to_json(orient='records', indent=2, force_ascii=False)
# Writing data to a file
with open(output_file, 'w', encoding='utf-8') as f:</p>
        <p>f.write(json_data)
convert_excel_to_json(excel_file, output_file)
Listing 1: Simple direct Excel data conversion</p>
        <p>This Python script (Listing 1) converts a simple Excel file into a JSON file as follows:
1. Reads the Excel file:
• the pd.read_excel(excel_file) function loads the Excel file into a Pandas DataFrame
(df), automatically interpreting its structure (columns and rows).
2. Converts it to JSON:
• df.to_json(orient='records', indent=2, force_ascii=False) converts the DataFrame into</p>
        <p>JSON format.
• The orient='records' parameter ensures the JSON output is a list of dictionaries (each
row becomes a dictionary where column names are keys).
• indent=2 makes the JSON output human-readable.
• force_ascii=False keeps non-ASCII characters (e.g., special or non-English
characters).
3. Writes the JSON file:
• the script opens a file in write mode ('w') with UTF-8 encoding;
• the JSON data is written to this file.</p>
        <p>It relies on the DataFrame structure from the Excel file (Figure 1) to successfully build a structured
JSON file: column names become dictionary keys, rows become JSON objects (dictionaries) and the
orient='records' parameter ensures each row is converted into a dictionary within a list.</p>
        <p>Another typical approach involves using programming languages such as Python to read an Excel
file, navigate to specific cells, and extract relevant information based on predefined coordinates. This
technique is widely used in simple applications where the schedule format does not change over time.</p>
        <p>The process of direct rule-based parsing generally follows these steps:
1. Loading the Excel file – The program opens the file and accesses the specified sheet
containing the schedule data;
2. Navigating to fixed cell positions – Since the table structure is static, the program reads data
from predefined rows and columns;
3. Extracting relevant information – Course names, dates, times, and classroom assignments are
retrieved based on known cell references;
4. Storing and structuring the data – The extracted data is stored in a structured format such as
a JSON object or a Python dictionary for further processing;</p>
        <p>The following Python script demonstrates how to extract a simple class schedule from an Excel
file using openpyxl, a lightweight library for handling Excel files:
from openpyxl import load_workbook
# Load the Excel workbook and select the active sheet
wb = load_workbook("schedule.xlsx")
sheet = wb.active
# Define fixed positions for schedule fields (assuming known structure)
schedule_data = []
for row in range(2, sheet.max_row + 1): # Skipping header row
group = sheet.cell(row=1).value
weektype = sheet.cell(row=row, column=1).value
subject = sheet.cell(row=row, column=2).value
teacher = sheet.cell(row=row, column=3).value
day = sheet.cell(row=row, column=4).value
time = sheet.cell(row=row, column=5).value
room = sheet.cell(row=row, column=6).value
zoom = sheet.cell(row=row, column=7).value
schedule_data.append({
"Group": group,
"Weektype": weektype,
"Subject": subject,
"Teacher": teacher,
"Day": day,
"Time": time,
"Room": room,
"Zoom": zoom
})
# Convert the extracted data into a JSON format
schedule_json = json.dumps(schedule_data, indent=4, ensure_ascii=False)
print(schedule_json)
Listing 2: Direct rule-based extraction from Excel file
"№": "1",
"Group": "CS-31",
"Weektype": "Odd",
"Subject": "Artificial Intelligence",
"Teacher": "Dr. Smith",
"Day": "Monday",
"Time": "10:00",
"Room": "6401",
"Zoom": "603 232 4543"
"№": "2",
"Group": "SE-42",
"Weektype": "Even",
"Subject": "Machine Learning",
"Teacher": "Prof. Jones",
"Day": "Wednesday",
"Time": "14:00",
"Room": "6102",
"Zoom": "604 232 4543"
]
Listing 3: The output (result) of the script in JSON file</p>
        <p>
          While this approach is straightforward and computationally efficient, it suffers from major
limitations. Any changes to the file’s structure, such as column rearrangements or merged cells, can
lead to parsing errors. Moreover, this method does not handle unstructured or scanned schedules,
making it unsuitable for real-world educational environments where schedules frequently change.
Studies on AI-driven scheduling confirm that rigid rule-based approaches lack the flexibility needed
for automated schedule extraction across diverse formats [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Computer Vision-Based Recognition Using TensorFlow</title>
        <p>
          Computer Vision encompasses techniques for the automated extraction, interpretation, and analysis
of meaningful information from individual images or sequences of images [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. To overcome the
limitations inherent in rule-based extraction methods, computer vision techniques have been
employed to interpret schedule data from images or scanned documents. Utilizing TensorFlow, a
prominent deep learning framework, models can be trained to recognize patterns and extract
pertinent information from visual data [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. This approach involves creating a dataset of labeled
schedule images and training a convolutional neural network (CNN) to identify and extract relevant
details, such as course names, times, and locations. The advantage of this method lies in its ability to
handle unstructured data and variations in formatting, providing a more flexible solution for schedule
parsing. Research has demonstrated that deep learning techniques significantly improve the accuracy
of text recognition from complex tabular images, making them highly applicable in educational data
processing [
          <xref ref-type="bibr" rid="ref2 ref3">2,3</xref>
          ]. Furthermore, CNN models trained on large datasets have been proven to outperform
traditional optical character recognition (OCR) techniques, particularly when handling varying fonts,
alignments, and noise levels in scanned documents [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ].
        </p>
        <p>
          The implementation of a computer vision-based schedule parsing system encompasses several key
stages:
1. Dataset Preparation: collect a comprehensive dataset of schedule images, ensuring diversity
in formats, fonts, and layouts. Annotate these images to label the regions containing relevant
information, such as course names, times, and locations;
2. Image Preprocessing: enhance image readability by applying grayscale conversion [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ],
contrast adjustments, and noise reduction. Techniques such as adaptive thresholding and edge
detection help refine input data for better recognition;
3. Neural Network Design: construct a CNN model that can efficiently process image-based
schedule data. The architecture should include multiple convolutional layers to capture spatial
features, along with pooling layers to reduce dimensionality [
          <xref ref-type="bibr" rid="ref12 ref13 ref9">9,12,13</xref>
          ];
4. Model Training: use supervised learning techniques to train the CNN model on annotated
schedules. The network learns to distinguish patterns and extract relevant text-based information,
improving accuracy with each training iteration;
5. Post-Processing: implement post-processing steps to convert the model's output into a
structured format, such as JSON. This may involve mapping the detected information to
predefined categories and organizing it for further use;
        </p>
        <p>The following Python example demonstrates how to define and train a CNN model using
TensorFlow to recognize structured data from schedule images:
import tensorflow as tf
from tensorflow.keras import Sequential
from tensorflow.keras.layers import Conv2D, MaxPooling2D, Flatten, Dense
# Define the CNN model structure
cnn_model = Sequential([</p>
        <p>Conv2D(32, (3, 3), activation='relu', input_shape=(128, 128, 1)),
MaxPooling2D((2, 2)),
Conv2D(64, (3, 3), activation='relu'),
MaxPooling2D((2, 2)),
Conv2D(128, (3, 3), activation='relu'),
Flatten(),
Dense(128, activation='relu'),</p>
        <p>Dense(5, activation='softmax') # Adjust output neurons based on label categories
# Configure the model with a suitable loss function and optimization algorithm
cnn_model.compile(optimizer='adam',
loss='sparse_categorical_crossentropy',
metrics=['accuracy'])
# Train the CNN using preprocessed schedule images and labels
cnn_model.fit(train_data, train_labels, epochs=25, validation_data=(test_data, test_labels))
Listing 4: Computer vision-based recognition using TensorFlow snippet</p>
        <p>
          In this example, train_images and train_labels represent the preprocessed training data and
corresponding labels, respectively. The CNN consists of convolutional layers interspersed with
pooling layers, followed by fully connected layers that output class probabilities [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Natural Language Processing-Driven Text Processing</title>
        <p>
          In the realm of automated schedule parsing, Natural Language Processing (NLP) techniques have
emerged as powerful tools for extracting structured information from unstructured textual data [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ].
NLP leverages computational methods to process and analyze human language, enabling systems to
interpret and organize textual information effectively [
          <xref ref-type="bibr" rid="ref15 ref8">8,15</xref>
          ]. This approach is particularly beneficial
when dealing with schedules presented in free-form text, where traditional rule-based methods may
falter due to variability in language and formatting. Studies suggest that NLP-based models
outperform rule-based approaches in terms of flexibility and adaptability, as they can generalize
across different formatting styles without requiring predefined rules [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. Additionally, advancements
in NLP models, such as transformers and sequence-to-sequence architectures, have shown promising
results in extracting structured data from semi-structured academic documents [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
        </p>
        <p>
          The process begins with data collection, where a substantial corpus of textual schedules is gathered
to serve as the training dataset. This dataset should encompass a wide variety of schedule formats
and linguistic expressions to ensure the model's robustness. Subsequently, data preprocessing is
conducted, involving tokenization (dividing text into words or phrases), part-of-speech tagging
(identifying grammatical categories) [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], and named entity recognition (detecting entities like dates,
times, and course names). These preprocessing steps transform raw text into a structured format
suitable for machine learning algorithms.
        </p>
        <p>
          Following preprocessing, the core of the NLP approach involves training machine learning models
to recognize patterns and relationships within the text. Advanced models, such as Transformer-based
architectures (e.g., BERT, GPT, and T5) [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], have demonstrated remarkable proficiency in
understanding context and semantics in natural language, making them well-suited for this task [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
These models are trained to identify and extract pertinent information, such as course titles, timings,
and locations, from the textual data. The extracted information is then organized into a structured
format, such as a JSON object, facilitating seamless integration with other systems and applications.
        </p>
        <p>For instance, consider a segment of text from a schedule: "The Introduction to Biology class meets
every Monday and Wednesday at 09:35 AM in Room 6204 with Prof. Jones only on even week." An NLP
model can process this sentence to extract the course name ("Introduction to Biology"), the days of
the week ("Monday and Wednesday"), the time ("09:35 AM"), the location ("Room 6204"), the teacher
(“Prof. Jones”) and the type of the week (“even”). The extracted data can then be structured into JSON
format as follows in Listing 3.</p>
        <p>The following Python code demonstrates how an NLP pipeline can extract structured schedule
data using the spaCy library:
import spacy
import json
# Load a pre-trained NLP model
nlp = spacy.load("en_core_web_sm")
# Example schedule text
text = " The Introduction to Biology class meets every Monday and Wednesday at 09:35 AM in Room 6204 with
Prof. Jones only on even week."
# Process text
doc = nlp(text)
# Extract relevant entities (simplified approach)
schedule_data = {
"course_name": "Introduction to Biology",
"days": ["Monday", "Wednesday"],
"time": "09:35",
"location": "6204",
"teacher": "Prof. Jones",
"week_type": "even"
}
# Convert to JSON
schedule_json = json.dumps(schedule_data, indent=4)
print(schedule_json)
Listing 5: Example Code for an NLP-Based Schedule Parser</p>
        <p>The flexibility of NLP-based methods allows for the handling of diverse schedule representations,
making them adaptable to various textual formats without the need for rigid predefined rules. This
adaptability is particularly advantageous in educational settings where schedule formats may vary
significantly across institutions or departments. Moreover, NLP techniques can manage ambiguities
and variations in natural language, enhancing the robustness of the schedule parsing system.</p>
        <p>
          The application of NLP techniques in schedule parsing offers a sophisticated and flexible approach
to extracting structured information from unstructured textual data [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. By leveraging advanced
machine learning models, NLP enables the development of robust systems capable of adapting to
diverse schedule formats and linguistic variations, thereby enhancing the efficiency and accuracy of
automated schedule management. NLP-based approaches outperform traditional rule-based methods
in their adaptability and efficiency, making them well-suited for modern AI-driven academic
administration systems.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Declaration on Generative AI</title>
      <p>The authors have not employed any Generative AI tools.</p>
    </sec>
    <sec id="sec-4">
      <title>3. Conclusions</title>
      <p>This study explored three distinct methods for parsing academic schedules: rule-based extraction
from structured Excel files, deep learning-based computer vision recognition, and Natural Language
Processing (NLP)-driven text processing. Each method presents unique advantages and limitations,
influencing its suitability for different use cases in automated student attendance tracking.</p>
      <p>Rule-based Excel parsing is a straightforward approach that efficiently extracts data from
wellstructured spreadsheets. It is computationally inexpensive and easy to implement but lacks
adaptability when schedule formats change. Even minor deviations, such as merged cells or column
reordering, can cause errors, making it unsuitable for handling unstructured data.</p>
      <p>The computer vision-based approach using TensorFlow offers greater flexibility by recognizing
schedule information directly from images. It enables the extraction of data from scanned documents
and printed timetables, making it more robust than rule-based methods. However, it requires
substantial computational resources for training convolutional neural networks (CNNs) and a large
annotated dataset to ensure high accuracy. Despite these challenges, this method excels in
environments where schedules are available only in image form.</p>
      <p>The NLP-driven method is the most adaptable, capable of handling unstructured text data and
generalizing across different formatting styles. By leveraging transformer-based architectures, NLP
models can extract and structure schedule information with high accuracy. This approach is
especially beneficial when dealing with schedules in free-text format or inconsistent table structures.
However, its effectiveness depends on the availability of extensive training datasets and
preprocessing techniques.</p>
      <p>The choice of a schedule parsing method depends on the specific requirements of an institution.
If schedules are consistently formatted Excel spreadsheets, a rule-based method may suffice. If the
schedules are often scanned or photographed, a computer vision-based approach would provide
greater flexibility. However, if schedules exist in multiple textual formats with varying structures,
NLP-based methods offer the most robust and scalable solution.</p>
      <p>Future improvements in AI-driven schedule parsing could involve hybrid models that integrate
rule-based approaches for structured data, CNNs for image-based recognition, and NLP for text
processing. Such a multimodal system would enhance adaptability, ensuring high accuracy regardless
of the input format. Additionally, fine -tuning deep learning models with larger and more diverse
datasets will further improve their generalization capabilities and efficiency in real-world educational
applications.</p>
      <p>By integrating AI-driven solutions, educational institutions can automate schedule management
with greater precision, reducing manual workload and improving attendance tracking efficiency. As
AI continues to evolve, future systems may incorporate real-time schedule adjustments and predictive
analytics to optimize resource allocation and enhance overall academic administration.
[16] Dyvak, Mykola, Oleksandr Papa, Andrii Melnyk, Andriy Pukas, Nataliya Porplytsya, and Artur
Rot. 2020. "Interval Model of the Efficiency of the Functioning of Information Web Resources for
Services on Ecological Expertise" Mathematics 8, no. 12: 2116.
https://doi.org/10.3390/math8122116
[17] A. Kovbasistyi, A. Melnyk, M. Dyvak, V. Brych and I. Spivak, "Method for detection of
nonrelevant and wrong information based on content analysis of web resources," 2017 XIIIth
International Conference on Perspective Technologies and Methods in MEMS Design
(MEMSTECH), Lviv, Ukraine, 2017, pp. 154-156, doi: 10.1109/MEMSTECH.2017.7937555.
[18] M. Dyvak, A. Melnyk, A. Kovbasistyi, R. Shevchuk, O. Huhul and V. Tymchyshyn, "Mathematical
Modeling of the Estimation Process of Functioning Efficiency Level of Information
WebResources," 2020 10th International Conference on Advanced Computer Information
Technologies (ACIT), Deggendorf, Germany, 2020, pp. 492-496, doi:
10.1109/ACIT49673.2020.9208846.
[19] M. Dyvak, A. Kovbasistyi, A. Melnyk, I. Shcherbiak and O. Huhul, "Recognition of Relevance of
Web Resource Content Based on Analysis of Semantic Components," 2019 9th International
Conference on Advanced Computer Information Technologies (ACIT), Ceske Budejovice, Czech
Republic, 2019, pp. 297-302, doi: 10.1109/ACITT.2019.8779897.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Vasileiou</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Yeoh</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          (
          <year>2025</year>
          ).
          <article-title>AI tool helps make trustworthy, explainable scheduling decisions</article-title>
          . Washington University in St. Louis Engineering News. URL: https://engineering.washu.edu/news/2025/
          <article-title>AI-tool-helps-make-trustworthy-explainablescheduling-decisions</article-title>
          .html
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Hilbert</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2021</year>
          ).
          <article-title>Machine learning for the educational sciences</article-title>
          .
          <source>Review of Education</source>
          ,
          <volume>9</volume>
          (
          <issue>3</issue>
          ),
          <fpage>691</fpage>
          -
          <lpage>725</lpage>
          . URL: https://doi.org/10.1002/rev3.3310
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xia</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          (
          <year>2023</year>
          ).
          <article-title>A comprehensive survey on deep learning techniques in educational data mining</article-title>
          .
          <source>arXiv preprint arXiv:2309</source>
          .04761. URL: https://arxiv.org/abs/2309.04761
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Mahakud</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parida</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Panda</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maity</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sahoo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Sharma</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>2022</year>
          ).
          <article-title>A machine learning system to monitor student progress in educational institutes</article-title>
          .
          <source>arXiv preprint arXiv:2211</source>
          .05829. URL: https://arxiv.org/abs/2211.05829
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Virtosoftware.</surname>
          </string-name>
          (
          <year>2024</year>
          ).
          <article-title>AI tools for school schedules and timetables: Prompts &amp; guide</article-title>
          . Virtosoftware Blog. URL: https://blog.virtosoftware.com/ai
          <article-title>-schedule-maker-for-schools/</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>TensorFlow</given-names>
            <surname>Core. Convolutional Neural</surname>
          </string-name>
          <article-title>Network (CNN)</article-title>
          . URL: https://www.tensorflow.org/tutorials/images/cnn
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Sebastian</given-names>
            <surname>Schreiber</surname>
          </string-name>
          , Stefan Agne, Ivo Wolf, Andreas Dengel, and Sheraz Ahmed.
          <article-title>DeepDeSRT: Deep Learning for Detection and Structure Recognition of Tables in Document Images</article-title>
          .
          <source>In Proceedings of the International Conference on Document Analysis and Recognition (ICDAR)</source>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Devlin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chang</surname>
          </string-name>
          , M.-W.,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Toutanova</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          (
          <year>2019</year>
          ).
          <article-title>BERT: Pre-training of deep bidirectional transformers for language understanding</article-title>
          . arXiv preprint arXiv:
          <year>1810</year>
          .04805. URL: https://arxiv.org/abs/
          <year>1810</year>
          .04805
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Nayeem</surname>
            ,
            <given-names>T. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motaharuzzaman</surname>
            ,
            <given-names>S. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoque</surname>
            ,
            <given-names>A. T.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Rahman</surname>
            ,
            <given-names>M. H.</given-names>
          </string-name>
          (
          <year>2022</year>
          ,
          <article-title>December)</article-title>
          .
          <article-title>Computer vision based object detection and recognition system for image searching</article-title>
          .
          <source>In 2022 12th International Conference on Electrical and Computer</source>
          Engineering (ICECE) (pp.
          <fpage>148</fpage>
          -
          <lpage>151</lpage>
          ). IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Medium</surname>
          </string-name>
          .
          <article-title>AI Techniques for Data Parsing and Structuring</article-title>
          . URL: https://medium.com/isomeric/aitechniques
          <article-title>-for-data-parsing-and-structuring-4345c0456032</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Medium</surname>
          </string-name>
          .
          <article-title>Comparing 6 Frameworks for Rule-based PDF parsing</article-title>
          . URL: https://levelup.gitconnected.com/comparing-6
          <article-title>-frameworks-for-rule-based-pdf-parsingf9e7ca5b6cc9</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>Wenzhi</given-names>
          </string-name>
          &amp; Du,
          <string-name>
            <surname>Shihong.</surname>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Spectral-Spatial Feature Extraction for Hyperspectral Image Classification: A Dimension Reduction and Deep Learning Approach</article-title>
          .
          <source>IEEE Transactions on Geoscience and Remote Sensing</source>
          .
          <volume>54</volume>
          .
          <fpage>4544</fpage>
          -
          <lpage>4554</lpage>
          .
          <fpage>10</fpage>
          .1109/TGRS.
          <year>2016</year>
          .
          <volume>2543748</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Medium</surname>
          </string-name>
          .
          <article-title>Convolutional Neural Networks</article-title>
          . URL: https://medium.com/@erdematbas/convolutional
          <article-title>-neural-networks-ff2070fe185d</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Docsumo. Harnessing Natural Language</surname>
          </string-name>
          <article-title>Processing (NLP) for Information Extraction</article-title>
          . URL: https://www.docsumo.com/blog/nlp-information-extraction
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Khurana</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koli</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khatter</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2023</year>
          ).
          <article-title>Natural language processing: state of the art, current trends and challenges</article-title>
          .
          <source>Multimedia tools and applications</source>
          ,
          <volume>82</volume>
          (
          <issue>3</issue>
          ),
          <fpage>3713</fpage>
          -
          <lpage>3744</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>