University of Split and University of Malta (Team AB&DPV) at the CLEF 2024 SimpleText Track: Scientific Text Made Simpler Through the Use of Artificial Intelligence

Notebook for the SimpleText Lab at CLEF 2024 by Team AB&DPV

Antonia Bartulović1*† and Dóra Paula Varadi2*†

1 University of Split, Ul. Ruđera Boškovića 31, 21000 Split, Croatia
2 University of Malta, Msida MSD 2080, Malta

Abstract
This paper describes the participation of Team AB&DPV in the CLEF 2024 SimpleText track, which aims to simplify scientific texts. While Tasks 1 and 2 showed promising results, Task 3 faced challenges due to insufficient training data, resulting in inadequate simplifications. Future work will refine the methods and explore alternative tools to improve simplification outcomes. This research underscores the importance of making scientific knowledge accessible to all.

Keywords
CLEF, SimpleText, Natural Language Processing, Automatic Simplification

1. Introduction

1.1. Introduction and overview

The simplification of scientific texts is crucial for making scientific knowledge accessible to a broader audience, including non-experts and those with limited literacy skills. The CLEF 2024 SimpleText Track [1, 2, 3, 4] addresses this challenge by organizing three specific tasks aimed at improving access to scientific texts:
• Task 1: What is in (or out)? Selecting passages to include in a simplified summary [5].
o This lexical simplification focuses on replacing complex words and phrases with simpler alternatives without changing the overall meaning of the text.
• Task 2: What is unclear? Difficult concept identification and explanation (definitions, abbreviation deciphering, context, applications...) [6].
o This syntactic simplification aims to restructure sentences to make them easier to read and understand, often by breaking down complex sentences into simpler, shorter ones.
• Task 3: Rewrite this! Given a query, simplify passages from scientific abstracts [7].
o This full text simplification integrates both lexical and syntactic simplification to produce comprehensively simplified versions of scientific texts.

The research covers all three tasks, which together present a comprehensive challenge in simplifying scientific texts. By addressing these tasks, we aim to improve both the vocabulary and the structure of sentences, enhancing readability while preserving the integrity of the scientific information. This paper details the approach taken, which employs advanced natural language processing techniques to tackle these tasks. The current state-of-the-art in text simplification [8] is also discussed, referencing key studies.

1 CLEF 2024: Conference and Labs of the Evaluation Forum, September 09–12, 2024, Grenoble, France
∗ Corresponding author.
† These authors contributed equally.
antonia.bartulovic.00@fesb.hr (A. Bartulović); dora.varadi.21@um.edu.mt (D. P. Varadi)
© 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings (ceur-ws.org), ISSN 1613-0073.

This research is motivated by the goal of enabling everyone to access information, which is a fundamental right. Simplifying scientific texts not only aids education and communication within the scientific community but also ensures that important scientific findings are accessible to the general public. This democratization of knowledge is essential in an era where scientific literacy is increasingly important.

1.2. State-of-the-Art Overview

The field of text simplification has seen significant advancements in recent years, particularly with the advent of transformer-based models and deep learning techniques. Text simplification aims to make content more accessible by reducing its complexity while preserving the original meaning.
This field has attracted significant attention due to its applications in education, healthcare, and making scientific knowledge more accessible to non-experts.

1.2.1. Neural Network Approaches

Recent methods leverage deep neural networks for automatic text simplification. For instance, the transformer model, introduced by Vaswani et al. in 2017, has been adapted for text simplification tasks. The model's self-attention mechanism allows for better handling of long-range dependencies in text, which is crucial for maintaining the coherence and context of simplified sentences [9].

1.2.2. Pre-trained Language Models

Large pre-trained language models such as BERT by Devlin et al. [10] and GPT-3 by Brown et al. [11] have demonstrated significant improvements in natural language understanding tasks, including text simplification. These models are fine-tuned on simplification datasets to enhance their performance, as seen in the work of Zhang and Lapata (2017) [12], who used a sequence-to-sequence architecture trained with deep reinforcement learning to simplify texts effectively.

1.2.3. Lexical and Syntactic Simplification

Traditional approaches to text simplification focus on lexical and syntactic transformations. Woodsend and Lapata (2011) developed a model that combines quasi-synchronous grammar with integer linear programming to learn simplification rewrites, while Narayan and Gardent (2014) explored a hybrid approach combining deep semantics with machine translation. These methods aim to replace complex words and restructure sentences to enhance readability [13, 14].

1.2.4. Evaluation Metrics

Evaluating text simplification systems remains a challenge due to the subjective nature of readability and simplicity. Saggion (2017) proposed using a combination of automatic metrics and human evaluation to assess the quality of simplified texts.
Common metrics include SARI (System output Against References and against the Input sentence), which measures the goodness of words added, deleted, and kept by the simplification system [15].

2. Approach

2.1. Data Description

The data used in this research for the CLEF 2024 SimpleText track [4, 1] consists of scientific texts from various domains, provided by the task organizers. The dataset includes documents that require simplification at different levels: lexical, syntactic, and full text. The specifics of the data used for each task are outlined below, referencing the guidelines provided by the CLEF SimpleText track organizers.

Task 1:
• Source: The dataset for lexical simplification includes scientific texts with complex words identified by domain experts.
• Annotations: Each complex word is annotated with simpler alternatives, assisting in training models to recognize and replace difficult words while maintaining context.
• Format: Data is provided in a tab-separated format with columns for the original sentence, the complex word, and the list of simpler alternatives.

Task 2:
• Source: The dataset for syntactic simplification includes sentences from scientific texts that are structurally complex.
• Annotations: Each sentence is paired with a syntactically simplified version, created by professional annotators to ensure the simplified sentences retain the original meaning.
• Format: Data is provided in a CSV format with columns for the original sentence and the simplified sentence.

Task 3:
• Source: The dataset for full text simplification comprises entire paragraphs or sections from scientific articles that require both lexical and syntactic simplification.
• Annotations: Each paragraph is accompanied by a simplified version, annotated to reflect both lexical and syntactic changes, ensuring that the simplified text is easier to read and understand while preserving the scientific content.
• Format: Data is provided in a JSON format, with fields for the original text and the simplified text.

2.2. Methodology

The code was run locally using Python version 3.8.19, because certain packages failed to build with higher versions.

2.2.1. Data Preprocessing

Several preprocessing steps were performed to prepare the data for the models:
Tokenization: Texts were tokenized into sentences and words using standard natural language processing (NLP) libraries.
Normalization: All texts were normalized to lower case, and punctuation was standardized to ensure consistency across the dataset.
Filtering: Sentences or paragraphs that did not meet certain quality criteria (e.g., very short or very long sentences) were filtered out to maintain a manageable and high-quality dataset.

2.2.2. Training and Validation Split

The dataset was split into training and validation sets to evaluate the performance of the models. An 80-20 split was used [16], with 80% of the data used for training and 20% reserved for validation. This split ensures that the models are tested on a diverse set of examples, providing a robust evaluation of their simplification capabilities.

2.2.3. Task 1

The approach taken for Task 1, which involves lexical simplification, uses a combination of information retrieval and readability assessment techniques to identify and simplify complex passages from scientific texts. The approach proceeds as follows:
Query Execution: Predefined queries are executed to retrieve relevant scientific abstracts. For each query, an HTTP GET request is made to a search API [17] endpoint to retrieve documents from the database, with a maximum of 100 documents requested per query.
Retrieval and Scoring: The response from the API is parsed to extract the list of document hits. The scores of the retrieved documents are normalized by calculating the minimum and maximum scores and then applying min-max normalization to each document score.
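The score normalization step described above can be sketched in plain Python (a minimal sketch; the field names in the hit dictionaries are illustrative assumptions, not the exact API response format):

```python
def min_max_normalize(scores):
    """Scale a list of scores into [0, 1] using min-max normalization."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        # All scores identical: return neutral values to avoid division by zero.
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]

# Example: normalize the relevance scores of retrieved document hits.
hits = [
    {"doc_id": "d1", "score": 12.4},
    {"doc_id": "d2", "score": 7.1},
    {"doc_id": "d3", "score": 9.8},
]
normalized = min_max_normalize([h["score"] for h in hits])
for hit, rel_score in zip(hits, normalized):
    hit["rel_score"] = rel_score  # normalized relevance score in [0, 1]
```

The same helper can later be reused for the FKGL readability scores, which keeps both quantities on the same [0, 1] scale.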
Readability Assessment: For each document, the Flesch-Kincaid Grade Level (FKGL) [18] of the abstract is calculated to estimate the grade level required to understand the text. A dictionary is created for each passage, containing attributes such as `run_id`, `topic_id`, `query_id`, `doc_id`, `rel_score` (normalized relevance score), and the `passage` itself (the first 1000 characters of the abstract).
Readability Reranking: The passages, along with their normalized document scores and readability scores, are stored in a list. Each passage entry includes the query ID, document ID, normalized relevance score, and the extracted passage content.
Readability Normalization: The FKGL scores are normalized with the same min-max procedure as the document scores. This step ensures that the readability scores are on the same scale as the document scores, facilitating a fair comparison and combination of these metrics.
Combination Score: The normalized FKGL score is assigned as the combination score (`comb_score`) for each passage. This score is intended for further processing, such as ranking passages based on their readability. The results were saved into a JSON file.
This methodology ensures that the retrieved passages are not only relevant to the query but also assessed for readability, facilitating the task of lexical simplification. Before the results are submitted, the run ID for each result is updated to reflect the specific task and methodology used. This step ensures that the results are properly labelled and can be easily identified during evaluation.
The approach for Task 1 thus involves a systematic process of document retrieval, readability assessment, and reranking based on the FKGL scores. By normalizing both the document relevance scores and the readability scores, the method ensures that the passages are ranked effectively.
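The FKGL computation at the heart of this reranking can be sketched in plain Python using the standard Flesch-Kincaid formula (the paper used a library implementation; the naive vowel-group syllable counter below is a simplifying assumption for the sketch):

```python
import re

def count_syllables(word):
    """Naive syllable estimate: count groups of consecutive vowels (minimum 1)."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))

def fkgl(text):
    """Flesch-Kincaid Grade Level:
    0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59
    """
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+", text)
    syllables = sum(count_syllables(w) for w in words)
    return (0.39 * (len(words) / len(sentences))
            + 11.8 * (syllables / len(words))
            - 15.59)
```

Lower FKGL values correspond to text readable at a lower grade level; in the pipeline above, these raw scores are then min-max normalized and stored as `comb_score`.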
The final output consists of the most readable passages, prioritized for their simplicity and accessibility. This method makes complex scientific texts easier to read, thereby increasing their accessibility to a broader audience.

2.2.4. Task 2

Our work on Task 2 is divided into two subtasks:
• Subtask 1: Extracting and merging relevant terms, definitions, and explanations.
• Subtask 2: Preprocessing text, computing BLEU scores for similarity, and selecting the best definitions and explanations.

Task 2 Subtask 1: Extracting and Merging Relevant Data
Data Loading: The necessary datasets are loaded into DataFrames using `pandas` [19], including definitions, explanations, generated definitions, documents, and terms.
Data Merging: The `documents` and `terms` DataFrames are merged to associate terms with their respective documents. Unnecessary columns are dropped, and new columns are added to standardize the data format.

Task 2 Subtask 2: Text Preprocessing, BLEU Score Computation, and Best Definition Selection
Text Preprocessing: The `preprocess` function standardizes the text using TensorFlow [20] by converting it to lowercase, removing punctuation, tokenizing, removing stop words, and stemming the tokens.
BLEU Score Computation [21]: The `compute_bleu` function calculates the BLEU score between a reference and a candidate definition to measure their similarity. The BLEU score is used to compare different definitions and explanations for the same term.
Best Definition Selection: The `select_best_definition` function selects the best definition or explanation from a list by computing BLEU scores between all pairs and choosing the one with the highest score.
Term Matching: The `find_matching_term` function finds the best matching term in the dataset using fuzzy matching [22] if the exact term is not found.
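The BLEU-based definition selection and fuzzy term matching described above can be sketched as follows. This is a minimal stand-in, not the exact implementation: the simplified smoothed BLEU below approximates the library version, and `difflib.get_close_matches` from the standard library stands in for the `thefuzz` package used in the paper:

```python
import math
from collections import Counter
from difflib import get_close_matches  # stdlib stand-in for thefuzz

def ngrams(tokens, n):
    """Count the n-grams of a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def bleu(reference, candidate, max_n=4):
    """Simplified sentence-level BLEU with add-one smoothing and brevity penalty."""
    ref, cand = reference.split(), candidate.split()
    log_precision = 0.0
    for n in range(1, max_n + 1):
        ref_counts, cand_counts = ngrams(ref, n), ngrams(cand, n)
        overlap = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        total = max(1, sum(cand_counts.values()))
        log_precision += math.log((overlap + 1) / (total + 1)) / max_n
    brevity = min(1.0, math.exp(1 - len(ref) / max(1, len(cand))))
    return brevity * math.exp(log_precision)

def select_best_definition(definitions):
    """Pick the definition most similar (by average BLEU) to the others;
    ties go to the shorter definition, as in the paper."""
    def avg_bleu(d):
        others = [o for o in definitions if o is not d]
        return sum(bleu(o, d) for o in others) / max(1, len(others))
    return max(definitions, key=lambda d: (avg_bleu(d), -len(d)))

def find_matching_term(term, known_terms):
    """Exact match first, then a fuzzy fallback for near-misses."""
    if term in known_terms:
        return term
    matches = get_close_matches(term, known_terms, n=1, cutoff=0.6)
    return matches[0] if matches else None
```

A typical call is `find_matching_term("tokenizaton", terms)` to recover the intended term despite a typo, followed by `select_best_definition` over that term's candidate definitions.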
Integrating Results: The terms, definitions, and explanations are grouped, and the best definition and explanation for each term are selected using the `select_best_definition` function. If multiple definitions or explanations are present, the one with the highest BLEU score is chosen. If the BLEU scores are identical, the shorter definition is selected.
Assigning Definitions and Explanations: A function is developed to find the best matching term from the definitions and explanations dataset. This ensures that even if a term is not an exact match, the closest possible term is used. The processed data is used to update the term-explanations DataFrame with the best definitions and explanations. This involves iterating through each term and applying the best match based on the BLEU score and other criteria.
Creating the Final Output: The results are formatted into a list of dictionaries, each containing the run ID, manual flag, sentence ID, term, difficulty, and the selected definition and explanation (if applicable). This structured format is suitable for submission.
The approach for Task 2 leverages both manually curated and automatically generated data to provide comprehensive definitions and explanations for scientific terms. By using BLEU scores to select the best definitions, the method ensures that the explanations are not only accurate but also succinct. This process enhances the readability and understandability of scientific texts, making them more accessible to a broader audience.

2.2.5. Task 3

Task 3 focuses on assessing and simplifying both sentence-level and document-level texts using deep learning models. This involves training models to simplify sentences and abstracts, followed by evaluating the readability and quality of the simplified texts.

Task 3 Subtask 1: Sentence-level Simplification
Data Preparation: Load and merge the source and reference sentences from the training dataset.
Tokenizers are created to transform the text data into numerical sequences. One tokenizer is used for the source (complex sentences) and another for the target (simplified sentences). The tokenizers are fitted on the respective text data to build a vocabulary and convert the text into sequences of integers. The sizes of the vocabularies for both source and target texts are determined. Since sentence lengths vary, the sequences are padded to a fixed length to ensure uniform input for the model.
Model Architecture: A neural network model is defined, consisting of an embedding layer and Bidirectional Long Short-Term Memory (LSTM) layers [23] that capture both forward and backward dependencies in the text. The model includes dropout layers to prevent overfitting and a TimeDistributed dense layer for the final output. The model is compiled with the Adam optimizer and the sparse categorical cross-entropy loss function, then trained on the prepared data over multiple epochs, during which it learns to map complex sentences to their simplified versions.
Training: The model is trained for 20 epochs with a batch size of 32 and a validation split of 10%.
Testing: The trained model is used to predict the simplified sentences for the test dataset. Predictions are decoded to obtain the simplified sentences in text form.
Evaluation: The readability of the simplified sentences is evaluated using BLEU scores and other readability metrics.

Task 3 Subtask 2: Document-level Simplification
Data Preparation: Load and merge the source and reference abstracts from the training dataset. Tokenize the abstracts to convert them into sequences of integers and pad the sequences to ensure uniform input length for the model.
Model Architecture: A sequential model with LSTM layers and an embedding layer is used for document-level simplification.
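The tokenization and padding used in both subtasks can be sketched in plain Python (a minimal stand-in for what the Keras `Tokenizer` and `pad_sequences` utilities do; the vocabulary indices and the padding length below are illustrative assumptions):

```python
def build_vocab(sentences):
    """Map each word to an integer id; 0 is reserved for padding/unknown."""
    vocab = {}
    for sentence in sentences:
        for word in sentence.lower().split():
            vocab.setdefault(word, len(vocab) + 1)
    return vocab

def texts_to_padded_sequences(sentences, vocab, max_len):
    """Convert sentences to integer sequences, truncated or zero-padded to max_len."""
    sequences = []
    for sentence in sentences:
        seq = [vocab.get(w, 0) for w in sentence.lower().split()][:max_len]
        sequences.append(seq + [0] * (max_len - len(seq)))
    return sequences

source = ["The transformer model uses attention", "Attention helps the model"]
vocab = build_vocab(source)
padded = texts_to_padded_sequences(source, vocab, max_len=6)
```

In the actual pipeline, separate vocabularies are built for the source and target sentences, and the resulting padded integer matrices are fed to the embedding layer of the models described above.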
The model is trained similarly to the sentence-level model but with longer input sequences.
Training and Testing: The model is trained and tested in a manner similar to Subtask 3.1, but with document-level inputs and outputs.

3. Results

3.1. Task 1

Given that the combined score calculation was based on the Flesch-Kincaid Grade Level, a metric designed for measuring readability, the results are somewhat relevant. It would have been more beneficial to use multiple metrics and compare their effectiveness; however, time constraints precluded such experimentation. FKGL, which considers word length and sentence length, was a logical choice for the task and provided sufficient results. The “score” field comes from Elasticsearch’s own calculations, which also serve as an effective scoring system for query relevancy.

Table 1: Relevance Results Evaluated on Test qrels
runid: AB_DPV_SimpleText_task1_results_FKGL
MRR: 0.6173 | Precision@10: 0.3733 | Precision@20: 0.2900 | NDCG@10: 0.2818 | NDCG@20: 0.2442 | Bpref: 0.1966 | MAP: 0.1078

Table 2: Average Scores
Average rel_score: 0.2313 | Average comb_score: 0.3396

3.2. Task 2

Task 2 consisted of three subtasks; Subtask 2.3 was not completed due to a lack of data. Subtasks 2.1 and 2.2, however, proved to be interesting in their own right. The solution was not implemented on test data. A proper implementation would use natural language processing to extract difficult terms from passages, followed by generating definitions for them or retrieving them from sources such as Wikipedia. As it stands, the definition extraction was conducted solely through fuzzy searching of difficult terms in premade sources, yielding a satisfactory result, although it was not fully implemented on the test data and some definitions are not perfect.
Figure 1: Example of difficulty “e” (easy)
Figure 2: Example of difficulty “m” (medium)
Figure 3: Example of difficulty “d” (difficult) with Definition and Explanation
Figure 4: Example of difficulty “d” (difficult) with Faulty Definition and Explanation

3.3. Task 3

The goal of this task was to simplify sentences and documents. The chosen method yielded inadequate and often illegible simplifications. Training a model to predict simplified sentences from the input using LSTM layers proved insufficient, as the provided training data was inadequate for such an approach. The model was unable to discern the underlying meaning between words, despite ample training time, leading to output consisting of seemingly random words. A more effective approach might have been to investigate other text simplification tools, such as large language models (LLMs) like LLaMA, or to employ an entirely different implementation.
The training process of the model for the document-level simplification task yielded significant insights into its performance. The training performance scores provide a more comprehensive understanding of the model's learning behaviour and potential areas for improvement.

Table 3: Performance scores
Epoch  Time (s)  s/step  Training Loss  Validation Loss
1/20   18  3  8.0658  7.8402
2/20   19  4  7.1849  6.1243
3/20   19  4  5.5109  4.5786
4/20   20  4  3.9234  3.0568
5/20   19  4  2.5129  1.9095
6/20   20  4  1.5586  1.2988
7/20   20  4  1.1272  1.1248
8/20   20  4  1.0204  1.1042
9/20   20  4  1.0073  1.1072
10/20  15  3  1.0043  1.1060
11/20  15  3  0.9976  1.1002
12/20  16  3  0.9898  1.0941
13/20  14  3  0.9837  1.0908
14/20  15  3  0.9813  1.0891
15/20  15  3  0.9795  1.0870
16/20  16  3  0.9772  1.0850
17/20  16  3  0.9749  1.0838
18/20  17  3  0.9719  1.0821
19/20  16  3  0.9690  1.0796
20/20  15  3  0.9668  1.0770

Overall, the training and validation losses declined consistently, indicating that the model was learning from the data.
The slight gap between the training and validation losses suggests some degree of overfitting, which might be addressed with techniques such as dropout or regularization.

Figure 5: Example of Faulty Simplification Output

4. Conclusions

This working paper presented the approach and findings of Team AB&DPV for the CLEF 2024 SimpleText Track, focusing on three tasks aimed at simplifying scientific texts.
Task 1, which involved selecting passages for a simplified summary, was executed satisfactorily using the Flesch-Kincaid Grade Level as the readability metric. While FKGL provided somewhat relevant results, future work could investigate additional metrics to enhance the combined score calculation.
Task 2, which aimed to identify and explain difficult concepts, was partially completed. Subtasks 2.1 and 2.2 yielded interesting results by using fuzzy searching for definition extraction, though they were not fully implemented on test data. Future work should focus on fully implementing these subtasks using natural language processing techniques to extract difficult terms and to generate or fetch definitions from reliable sources.
Task 3, which sought to simplify sentences and documents, was the most challenging and yielded inadequate results. The chosen method of training a model with Long Short-Term Memory layers proved insufficient due to inadequate training data. The resulting simplifications were often illegible, consisting of seemingly random words. Future efforts should reconsider this approach, exploring other text simplification tools and avoiding training custom models on insufficient data. Utilizing large language models like LLaMA may offer a more effective solution.
Overall, this research underscores the importance of selecting appropriate methods and metrics for text simplification tasks.
Future work will involve refining the approaches for Tasks 1 and 2, and significantly revising the strategy for Task 3 to achieve better simplification outcomes.

References

[1] L. Ermakova, T. Miller, A.-G. Bosser, V. M. Palma-Preciado, G. Sidorov and A. Jatowt, “Overview of CLEF 2024 SimpleText Track on Improving Access to Scientific Texts,” in Experimental IR Meets Multilinguality, Multimodality, and Interaction. Proceedings of the Fifteenth International Conference of the CLEF Association (CLEF 2024), 2024.
[2] L. Ermakova, E. SanJuan, S. Huet, O. Augereau, H. Azarbonyad and J. Kamps, “CLEF 2023 SimpleText Track: What Happens if General Users Search Scientific Texts?,” in Advances in Information Retrieval: 45th European Conference on Information Retrieval, ECIR 2023, Dublin, Ireland, April 2–6, 2023, Proceedings, Part III, 2023.
[3] L. Ermakova, E. SanJuan, S. Huet, H. Azarbonyad, G. M. Di Nunzio, F. Vezzani, J. D’Souza, S. Kabongo, H. Babaei Giglou, Y. Zhang, S. Auer and J. Kamps, “CLEF 2024 SimpleText Track: Improving Access to Scientific Texts for Everyone,” in Advances in Information Retrieval: 46th European Conference on Information Retrieval, ECIR 2024, Glasgow, UK, March 24–28, 2024, Proceedings, Part VI, 2024.
[4] L. Ermakova, E. SanJuan, S. Huet, O. Augereau, H. Azarbonyad and J. Kamps, “Overview of SimpleText - CLEF-2023 Track on Automatic Simplification of Scientific Texts,” in Proceedings of the Fourteenth International Conference of the CLEF Association (CLEF 2023), 2023.
[5] E. SanJuan et al., “Overview of the CLEF 2024 SimpleText Task 1: Retrieve passages to include in a simplified summary,” in Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024.
[6] G. M. Di Nunzio et al., “Overview of the CLEF 2024 SimpleText Task 2: Identify and explain difficult concepts,” in Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024.
[7] L. Ermakova et
al., “Overview of the CLEF 2024 SimpleText Task 3: Simplify scientific text,” in Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024.
[8] J. D'Souza et al., “Overview of the CLEF 2024 SimpleText Task 4: Track the state-of-the-art in scholarly publications,” in Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024.
[9] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser and I. Polosukhin, “Attention is all you need,” Advances in Neural Information Processing Systems, pp. 5998-6008, 2017.
[10] J. Devlin, M.-W. Chang, K. Lee and K. Toutanova, “BERT: Pre-training of deep bidirectional transformers for language understanding,” in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), 2019.
[11] T. Brown, B. Mann, N. Ryder, M. Subbiah, J. D. Kaplan, P. Dhariwal, A. Neelakantan, P. Shyam, G. Sastry, A. Askell et al., “Language models are few-shot learners,” arXiv preprint arXiv:2005.14165, 2020.
[12] X. Zhang and M. Lapata, “Sentence simplification with deep reinforcement learning,” in Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017.
[13] K. Woodsend and M. Lapata, “Learning to simplify sentences with quasi-synchronous grammar and integer linear programming,” in Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011.
[14] S. Narayan and C. Gardent, “Hybrid simplification using deep semantics and machine translation,” in Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2014.
[15] H. Saggion, “Automatic text simplification,” Synthesis Lectures on Human Language Technologies, vol. 10, no. 1, pp. 1-137, 2017.
[16] “Split Data Training and Testing Set,” [Online].
Available: https://www.askpython.com/python/examples/split-data-training-and-testing-set.
[17] “API - Textstat,” [Online]. Available: https://github.com/textstat/textstat.
[18] “Flesch-Kincaid Grade Level,” [Online]. Available: https://en.wikipedia.org/wiki/Flesch%E2%80%93Kincaid_readability_tests.
[19] “Pandas,” [Online]. Available: https://pandas.pydata.org/.
[20] “TensorFlow,” [Online]. Available: https://www.tensorflow.org/.
[21] “BLEU,” [Online]. Available: https://en.wikipedia.org/wiki/BLEU.
[22] “The fuzz,” [Online]. Available: https://github.com/seatgeek/thefuzz.
[23] “Complete Guide to RNN, LSTM, and Bidirectional LSTM,” [Online]. Available: https://dagshub.com/blog/rnn-lstm-bidirectional-lstm/.