<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Transformer Based Approach for Text-to-Picto Generation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Avaneesh Koushik</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jithu Morrison S</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>P Mirunalini</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jothir Aditya R K</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science and Engineering, Sri Sivasubramaniya Nadar College of Engineering</institution>
          ,
          <addr-line>Tamil Nadu</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>This study aims to develop a Text to Pictogram translation system which is used to convert a French text into its corresponding pictogram terms. The proposed system demonstrates the efectiveness of a transformer-based model in translating French text into meaningful pictogram sentences. Google-T5 is utilized and further fine-tuned on a custom dataset of French text to predict corresponding pictogram terms in French. The model underwent ifne-tuning across multiple epochs to optimize performance. Additionally, the trained model was iteratively ifne-tuned to enhance its translation capabilities. Metrics like PictoER score, BLEU score and METEOR score were used to assess the model's performance. The proposed model achived a PictoER score of 13.9, BLEU score of 74.3 and a METEOR score 87.0.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Text to Pictogram</kwd>
        <kwd>French text generation</kwd>
        <kwd>Transformers model</kwd>
        <kwd>Google-T5</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>those with language impairments, thereby contributing to the advancement of computational methods
in assistive technologies and enhancing the quality of life for individuals with aphasia.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Background</title>
      <p>
        Text-to-pictogram translation is a task which involves translation of natural language text into text
with words for which appropriate pictograms are available. A unified approach to transfer learning in
NLP tasks can be achieved by considering every text processing problem as a “text-to-text” problem,
i.e. taking text as input and producing new text as output [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. This approach was utilised for the
proposed model as the dataset for this task comprises of input and output texts. Pretrained models have
significantly better performance over the original T5 models [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        A shallow linguistic analysis approach can be used to perform linguistic analysis for text to picto
conversion [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Shallow linguistic analysis involves processing of basic linguistic units like tokenization
and POS tagging, without performing deep semantic analysis. Transformers are powerful tools
that helps in building more complex and efective models for sequence-based tasks. Transformer
architectures have facilitated building higher-capacity models and pre-training has made it possible to
efectively utilize this capacity for a wide variety of tasks [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The original system of text-to-picto aimed
at people with an intellectual disability can be extended to various other interesting applications [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        Encoder-decoder models perform better than other models on textual similarity tasks [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Google-T5
was found to be the better than various other approaches like LSTM and CNN for other text related
tasks like hate speech detection [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] and it was found to be suitable for Question-Answer Generation [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]
both of which are tasks involving pattern detection in text.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Approach</title>
      <p>Google T5 or Text-to-Text Transfer Transformer is an encoder-decoder model which was pre-trained on
a multi-task mixture of both unsupervised and supervised tasks. It is known to work well in tasks which
require out of the box thinking. This task involves converting French text in various everyday contexts
into words which are simpler and have a corresponding pictogram available. The main objective of this
approach is to develop a Text-to-Picto translation system using the T5 model.</p>
      <sec id="sec-3-1">
        <title>3.1. Dataset</title>
        <p>
          The dataset that has been used was built from the TCOF corpus, and is stored in JSON format. TCOF
contains interactions between adults, adults and children, and children themselves, covering a wide
range of topics including debates, everyday situations, and medical consultations. This type of text is
representative of the interactions we observe between caregivers (families, medical staf) and individuals
who rely on pictograms due to language impairments [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. Each entry in the dataset contains multiple
data points, including an identifier labelled as "id" which is a unique identifier for the source, target
pair, the source text which is an oral transcription of a sentence spoken in French labelled as "src", the
target sequence of simplified pictogram terms "tgt", and a list that assigns a pictogram identifier to
each term in the target sequence labelled as "pictos".
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Data Preprocessing</title>
        <sec id="sec-3-2-1">
          <title>3.2.1. Tokenization</title>
          <p>On further analysis of the dataset, it was found that the data contained 24270 lines of French text
with appropriate target text. The average size of the source lines is 54.6 words and the average size of
the target text is 53.8 pictogram words.</p>
          <p>Tokenization is important for preparing the data for model training. Here, both the target pictogram
sequence and the source French text are tokenized using the pre-trained tokenizer from the google-T5
model. The text is split into individual tokens and converted to a numerical representation. Padding
and truncating limit the text sequences to have a maximum size of 256 tokens.</p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Model Selection</title>
        <p>The T5 model can be adjusted for particular tasks and comes pre-trained on a large data corpus. Here,
the model is adjusted to produce the simpler pictogram terms from oral transcriptions in French. The
model is inherently trained in solving text-to-text tasks and hence proves to be eficient for this task.
The "t5-base" variant of Google’s T5 model is utilised for this task. t5-base is a snapshot of the T5 model
taken after it was trained with 220 million parameters. This makes it flexible and easy to train for the
given French text data.</p>
        <sec id="sec-3-3-1">
          <title>3.3.1. Self-Attention mechanism</title>
          <p>The self-attention mechanism enables the model to identify long-range links and dependencies in the
input sequence. To be more precise, the T5 model calculates attention scores between every pair of
tokens in the input sequence using self-attention layers. By assigning a diferent weight (attention
score) to each token based on relevance to the next token, it makes every token in the sequence able to
pay attention to every other token.</p>
        </sec>
        <sec id="sec-3-3-2">
          <title>3.3.2. Encoder-decoder mechanism</title>
          <p>The T5 model is based on the traditional transformer architecture comprising of an encoder-decoder
structure. The input sequence is processed by the encoder, which also outputs contextual embeddings.
The output sequence is subsequently generated by the decoder using these embeddings.</p>
        </sec>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Methodology</title>
        <p>The dataset is loaded from the file and the source text (src) and target sequence (tgt) are extracted for
every element and stored as a list, following which it is tokenized. This is then converted to key-value
pairs and fed as input to the model. TThe T5 model contains several training arguments, such as the
batch size, which is set to 16 and indicates the number of training instances processed in each iteration,
and the number of training epochs, which are set to 3, 5, and 6 (where the model is initially trained for
3 epochs and then further trained on the already trained model). Additionally, the save steps are set
to 1,000, meaning that model checkpoints are saved every 1,000 steps to ensure training progress is
recorded. The learning rate of the optimizer algorithm—such as Adam—is chosen in order to efectively
update the model parameters in light of the training data. The goal of this optimization is to increase
model performance by minimizing the loss function. The model is trained using the hyperparameters
mentioned in Table 1</p>
        <p>The sequence of pictogram terms generated by the proposed model is converted to the corresponding
pictos sequence using the resources provided by the task organisers.</p>
        <p>For example: If the input source text is: "il y a un moment donné elle nous avait dit essayez de pas
dire de mots français pendant le truc.", the model generates the following sequence of pictogram terms:
"il_y_a un instant donner passé elle nous dire essayer de dire non de mot français pendant le truc".
Figure 2 shows the pictos sequence corresponding to the above generated sequence of pictogram terms.</p>
        <sec id="sec-3-4-1">
          <title>3.4.1. Resources Used</title>
          <p>Pandas is used in the project to manipulate data, including loading data from JSON files and structuring
it into dataframes. The main deep learning framework, PyTorch, makes it easier to apply and train
the T5 model, which creates pictogram sequences from French text. A cloud-based Jupyter notebook
environment with GPU-accelerated resources for quicker model training and inference is ofered by
Google Colab. Together, these tools improve productivity and eficiency by streamlining the development
and experimentation process.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results and Discussion</title>
      <p>
        The parameters used for evaluating the model are the Picto-term Error Rate (PictoER) [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ],
BLEU [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] and METEOR [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. The model was trained for various epochs to test the improvement in
learning. These results of the proposed model for diferent epochs were tabulated in the following Table
2
      </p>
      <p>Comparing the outcomes from varying the epochs while training yields insightful observations.
The error rate decreased from 18 to 17 when training for 5 epochs, suggesting some improvement
in performance. When initially trained for 3 epochs and fine-tuned for an additional 3 epochs, the
pictoer_score drops significantly from 17.5 to 13.9, suggesting improved generalization and
performance on unseen data. This is also reflected in the BLEU score which measures the precision
of n-grams and the METEOR score which focuses on word order, both of which show considerable
improvement. This significant improvement underscores the efectiveness of fine-tuning in refining
model parameters and enhancing its ability to capture underlying patterns in the data.</p>
      <p>The results suggest that the model may not have been able to reach its maximum potential during
the first training period. Rather, the model’s representations were gradually improved through the
repeated training process, which improved generalization and decreases the error rate. These findings
highlight the importance of iterative training strategies and the need for careful experimentation to
achieve optimal results. This shows that one may continuously enhance the model’s performance and
guarantee its flexibility to a variety of datasets and applications by iteratively fine-tuning it.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>In conclusion, this research demonstrates the efectiveness of advanced transformer models, specifically
the Google-T5, for the task of translating French text into pictogram sequences. Through iterative
ifne-tuning, the model consistently improved in accuracy, demonstrating its ability to handle intricate
aspects of language. This was evaluated using metrics like PictoER, BLEU, and METEOR scores. This
research emphasizes how transformer-based techniques can improve accessibility and communication
for people who use augmentative and alternative forms of communication.
Researchers may enhance the model’s performance and guarantee its flexibility to a variety of datasets
by iteratively fine-tuning it. In order to further advance the fields of assistive technology and natural
language processing, future studies could explore expanding this strategy to other languages and
improving the model’s adaptability to diverse linguistic contexts.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>B.</given-names>
            <surname>Ionescu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Müller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Drăgulinescu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Rückert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Ben</given-names>
            <surname>Abacha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Garcıa Seco de Herrera</surname>
          </string-name>
          , L. Bloch,
          <string-name>
            <given-names>R.</given-names>
            <surname>Brüngel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Idrissi-Yaghir</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Schäfer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. S.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. M.</given-names>
            <surname>Pakull</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Damm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Bracke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Friedrich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Andrei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Prokopchuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Karpenka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Radzhabov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Kovalev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Macaire</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Schwab</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Lecouteux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Esperança-Rodier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Yim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Fu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Yetisgen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Xia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. A.</given-names>
            <surname>Hicks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Riegler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Thambawita</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Storås</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Halvorsen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Heinrich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kiesel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          , Overview of ImageCLEF 2024:
          <article-title>Multimedia retrieval in medical applications, in: Experimental IR Meets Multilinguality</article-title>
          , Multimodality, and
          <string-name>
            <surname>Interaction</surname>
          </string-name>
          ,
          <source>Proceedings of the 15th International Conference of the CLEF Association (CLEF</source>
          <year>2024</year>
          ), Springer Lecture Notes in Computer Science LNCS, Grenoble, France,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>C.</given-names>
            <surname>Rafel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shazeer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Roberts</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Narang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Matena</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. J.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <article-title>Exploring the limits of transfer learning with a unified text-to-text transformer</article-title>
          ,
          <source>Journal of machine learning research 21</source>
          (
          <year>2020</year>
          )
          <fpage>1</fpage>
          -
          <lpage>67</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D.</given-names>
            <surname>Carmo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Piau</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Campiotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Nogueira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Lotufo</surname>
          </string-name>
          ,
          <article-title>Ptt5: Pretraining and validating the t5 model on brazilian portuguese data</article-title>
          , arXiv preprint arXiv:
          <year>2008</year>
          .
          <volume>09144</volume>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>V.</given-names>
            <surname>Vandeghinste</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. S. L.</given-names>
            <surname>Sevens</surname>
          </string-name>
          ,
          <string-name>
            <surname>F. Van Eynde</surname>
          </string-name>
          ,
          <article-title>Translating text into pictographs</article-title>
          ,
          <source>Natural Language Engineering</source>
          <volume>23</volume>
          (
          <year>2017</year>
          )
          <fpage>217</fpage>
          -
          <lpage>244</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>T.</given-names>
            <surname>Wolf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Debut</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Sanh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Chaumond</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Delangue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Moi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cistac</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Rault</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Louf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Funtowicz</surname>
          </string-name>
          , et al.,
          <article-title>Transformers: State-of-the-art natural language processing</article-title>
          ,
          <source>in: Proceedings of the 2020 conference on empirical methods in natural language processing: system demonstrations</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>38</fpage>
          -
          <lpage>45</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Norré</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Vandeghinste</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bouillon</surname>
          </string-name>
          , T. François,
          <article-title>Extending a text-to-pictograph system to french and to arasaac</article-title>
          ,
          <source>in: Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP</source>
          <year>2021</year>
          ),
          <year>2021</year>
          , pp.
          <fpage>1050</fpage>
          -
          <lpage>1059</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Ni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. H.</given-names>
            <surname>Abrego</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Constant</surname>
          </string-name>
          , J. Ma, K. B.
          <string-name>
            <surname>Hall</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Cer</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Yang</surname>
          </string-name>
          , Sentence-t5:
          <article-title>Scalable sentence encoders from pre-trained text-to-text models</article-title>
          ,
          <source>arXiv preprint arXiv:2108.08877</source>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>T.</given-names>
            <surname>Adewumi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. S.</given-names>
            <surname>Sabry</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Abid</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Liwicki</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Liwicki, T5 for hate speech, augmented data, and ensemble</article-title>
          ,
          <source>Sci</source>
          <volume>5</volume>
          (
          <year>2023</year>
          ). URL: https://www.mdpi.com/2413-4155/5/4/37. doi:
          <volume>10</volume>
          .3390/sci5040037.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Chauhan</surname>
          </string-name>
          ,
          <string-name>
            <surname>P. Kumar C.</surname>
          </string-name>
          ,
          <article-title>Learning enhancement using question-answer generation for e-book using contrastive fine-tuned t5</article-title>
          , in: P. P.
          <string-name>
            <surname>Roy</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Agarwal</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Krishna</surname>
            <given-names>Reddy</given-names>
          </string-name>
          , R. Uday Kiran (Eds.),
          <source>Big Data Analytics</source>
          , Springer Nature Switzerland, Cham,
          <year>2022</year>
          , pp.
          <fpage>68</fpage>
          -
          <lpage>87</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>V.</given-names>
            <surname>André</surname>
          </string-name>
          , E. Canut,
          <article-title>Mise à disposition de corpus oraux interactifs : le projet tcof (traitement de corpus oraux en français), Pratiques</article-title>
          . Linguistique, littérature, didactique
          <volume>147</volume>
          -
          <fpage>148</fpage>
          (
          <year>2010</year>
          )
          <fpage>35</fpage>
          -
          <lpage>51</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Woodard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. T.</given-names>
            <surname>Nelson</surname>
          </string-name>
          ,
          <article-title>An information theoretic measure of speech recognition performance</article-title>
          ,
          <source>in: Workshop on standardisation for speech I/O technology, Naval Air Development Center</source>
          , Warminster, PA,
          <year>1982</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>K.</given-names>
            <surname>Papineni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Roukos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ward</surname>
          </string-name>
          , W.-J. Zhu,
          <article-title>Bleu: a method for automatic evaluation of machine translation, in: Proceedings of the 40th annual meeting of the Association for Computational Linguistics</article-title>
          , Association for Computational Linguistics,
          <year>2002</year>
          , pp.
          <fpage>311</fpage>
          -
          <lpage>318</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S.</given-names>
            <surname>Banerjee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lavie</surname>
          </string-name>
          ,
          <string-name>
            <surname>Meteor:</surname>
          </string-name>
          <article-title>An automatic metric for mt evaluation with improved correlation with human judgments</article-title>
          ,
          <source>in: Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, Association for Computational Linguistics</source>
          ,
          <year>2005</year>
          , pp.
          <fpage>65</fpage>
          -
          <lpage>72</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>