<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Embedding-To-Embedding Method Based on Autoencoder for Solving Sentence Analogies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Weihao Mao</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yves Lepage</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Graduate School of Information, Production and Systems, Waseda University</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>We propose a method for solving sentence analogies using an embedding-to-embedding method. The method involves the pretraining of an autoencoder with a denoising decoder that generates sentence embeddings and reconstructs sentences. To generate solutions to analogical equations in the sentence embedding space, we introduce a network architecture that learns analogy properties from the dataset instead of relying on predefined formulas. The embeddings of the solutions are then decoded back into sentences using the decoder of the pretrained autoencoder. We conduct experiments on a set of semantico-formal analogies and purely-formal analogies datasets in English, French, and German. The results show that our method achieves state-of-the-art performance in most cases and to some extent provides evidence of the limitations of the 3CosAdd formula in handling longer sentences.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Sentence analogy</kwd>
        <kwd>Sentence embedding</kwd>
        <kwd>Autoencoder</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>please tell us about : please tell me about :: what do you expect : what do you expect
it. it. us to do? me to do?
he never saw his : theer naegvaeirn.saw his sis- :: thheenreavgearinsa.w his fa- : he never saw his
brother again. mother again.
semantic changes, such as the brother corresponding to the sister and the father corresponding
to the mother. The ratio in this example contains some gender-related information. We refer to
such examples as semantic analogies.</p>
      <p>The above two important concepts indicate that we can deduce the fourth term based on any
three terms of the quadruplet. This property has led to the gradual application of analogy to
some natural language processing related tasks, such as natural language inference, question
answering, and machine translation, especially EBMT (Example-based machine translation).</p>
      <p>
        The application of analogies in natural language processing mainly involves two tasks that
need to be addressed. The first one is analogy detection, i.e., determining whether a quadruple ,
, , and  constitute an analogy. Since the concept of analogy still lacks a standard definition,
we mainly refer to the analogy property proposed by [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], which is taken as an analogy if it
satisfies the two properties of symmetry of conformity and exchange of means. Because we can
reason out eight equivalent forms of an analogy based on these two properties. It is worth
mentioning that such assumption provides a relatively strict definition for analogies, especially
for sentence analogies. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] introduces internal reversal as a substitution for the exchange of
means mentioned above, allowing for more quadruples to meet the definition of this analogy at
the sentence level.
      </p>
      <p>The second primary task is analogy solving, the process of giving , , and  in a quadratic
group to obtain . That means we need to find the solution to the analogical equation:
 :  ::  :</p>
      <p>⇒  = ?</p>
      <p>Currently, in recent years, methods mainly rely on vector representations of sentences in
embedding space. The approach involves using the parallelogram rule (if  −  = −  , then
 =  −  +  ) to find four vectors that satisfy the analogy property and simultaneously
ifnd the solution of the analogical equation in the embedding space.</p>
      <p>
        After obtaining the embeddings of the solutions in the embedding space, a commonly used
approach is to employ retrieval-based methods. These methods involve providing a set of
candidate sentences and retrieving the most similar sentence to the target based on metrics like
cosine similarity. One example of such a method is the 3CosAdd method [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ]. These methods
typically require the embedding space to exhibit good linearity properties and rely on specific
formulas. They are unable to learn the analogy properties from the dataset itself. However [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]
trains a decoder to map the embeddings of the solutions of analogical equations back to their
corresponding sentences, which allows the model to generate results beyond the limitations
of specific candidate sentences. We refer to these methods as generation-based methods. In
generation-based methods, the model learns to generate sentences based on the given analogical
equations, providing more flexibility in producing diverse and contextually appropriate outputs.
      </p>
      <p>
        Inspired by the work of [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], we design a generative method based on an autoencoder to
address sentence analogies. More precisely, the main contributions of this paper are as follows:
i We have designed a more stable autoencoder architecture to reconstruct the solutions of
analogical equations from the embedding space back into sentences.
ii We propose a novel model that does not rely on predefined formulas to solve analogical
equations in the sentence embedding space. The entire network architecture is more
lfexible and applicable to all encoder-decoder structures.
iii We have achieved promising results in the generation-based approach and, to some
extent, demonstrated that the efectiveness of the 3CosAdd formula decreases for longer
sentences.
      </p>
      <p>In the remaining sections of this paper, we first introduce the related work in solving analogies,
particularly sentence analogies, in Section 2. In Section 3, we describe the main approach we
adopt, namely the embedding-to-embedding method. In Section 4, we present the experiments
and results. In Section 5, we provide an overview of the contributions of this paper and propose
further directions for future research.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related work</title>
      <p>
        In this paper, we primarily focus on solving sentence analogies, which involve deriving an
unknown sentence  given known sentence analogies , , and . However, we can still draw
inspiration from recent word analogy tasks. As mentioned in Section 1, some retrieval-based
methods like 3CosAdd rely on predefined formulas and expected properties of the embedding
space. Their goal is not to learn the properties of analogies from existing actual data so as to
solve analogy. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] used a simple network architecture called ANNr that consists of only linear
fully connected layers to learn the embeddings of words , , and  to  in the embedding
space, rather than relying on predefined formulas. The model has achieved state-of-the-art
performance on word analogy tasks in 11 diferent languages. This demonstrates that even
without relying on traditional formulas such as  =  −  +  , but instead learning
relevant properties from the dataset, one can achieve good results.
      </p>
      <p>
        Unlike word analogy tasks, sentence analogies are more diverse and complex in terms of
vocabulary, syntax, and semantics, making them more challenging to solve. However, a sentence
can still be seen as a whole composed of multiple words. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] proposed a method that decomposes
sentence analogies into multiple sets of word analogies based on the editing traces between
sentences. The optimal solutions of multiple sets of word analogies are then concatenated to
form the solution for the sentence analogy. Indeed, this work has also resulted in the creation
of a sentence semantico-formal analogy dataset.
      </p>
      <p>
        [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] proposed a Vec2Seq model to learn the mapping from sentence vectors to corresponding
sentences, thus addressing the limitation of retrieval-based approaches that can only select the
best sentence from candidate sentences. This led to the idea of a generation-based solution. They
ifrst employed a simple sum operation of FastText [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] word vectors in corresponding dimensions
to represent the entire sentence vector. Then, they trained a decoder to reconstruct the sentence
Add Gaussian
      </p>
      <p>noise
sentence
embedding
sentence
decoder
~
+
encoder
sentence</p>
      <p>sentence embedding space
A B</p>
      <p>Ratio C</p>
      <p>Output
predicted D</p>
      <p>Ratio
Extraction
Network</p>
      <p>Conformity
Mapping
Network</p>
      <p>Offset network for solving analogies
from the sentence vector. Additionally, they designed a simple linear fully connected network
FCN to learn the mapping of analogical equation solutions in the embedding space. They tested
diferent ways of combining vectors in semantico-formal analogy dataset, and the calculation
formula  =  −  +  in 3CosAdd ultimately achieved the best performance.</p>
      <p>
        Inspired by work of [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] proposed a character-level autoencoder to reconstruct words and
address word analogy problems. This method achieved 99% accuracy on word reconstruction
tasks in multiple languages and showed promising results in solving word analogy tasks.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Proposed approach</title>
      <p>
        Similarly as in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], we propose an internally denoising autoencoder architecture to achieve the
generation of sentence vectors from word vector sequences and a more stable decoding process.
Additionally, we introduce an ofset network structure to learn the mapping from three known
vectors to a solution of analogical equations in the sentence embedding space. As this approach
operates in the sentence embedding space, it is referred to as an "embedding-to-embedding"
method. The entire method architecture is illustrated in Figure 1.
      </p>
      <sec id="sec-3-1">
        <title>3.1. Pre-training an auto-encoder</title>
        <p>
          The method used in [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] for generating sentence vectors from word vector sequences involves
simply adding up the corresponding dimensions of all word vectors to form the sentence vector.
This method, starting from pre-trained word vectors, can produce decent decoding results
even with a small amount of training data. Additionally, the simple addition of corresponding
dimensions is quite efective for certain specific tasks. However, the sentence embeddings
generated by this simple summation method tend to lose sequential information and some
semantic information. Structurally, this method is not conducive to sentence reconstruction.
Therefore, taking inspiration from that method, we also start from pre-trained word vectors and
retain its decoder part. However, we incorporate a bidirectional LSTM model as an encoder to
process the word vector sequence and form an autoencoder structure. Subsequently, we adopt
the method mentioned in [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] to obtain sentence embeddings, which involves concatenating the
last hidden state and cell state of the encoder as the resulting sentence embedding.
        </p>
        <p>Additionally, because the task of the decoder is to decode embeddings that satisfy the
constraints of analogical equations, there may be slight deviations in the numerical values of the
generated embeddings, whether produced by neural networks or predefined formulas,
compared to the true reference embeddings. This can cause the decoder to struggle in correctly
decoding these embeddings. Therefore, during the training process of the autoencoder, we
introduce a certain proportion of Gaussian noise to the sentence embeddings generated by the
encoder, aiming to train the decoder to produce accurate sentences. This approach enhances the
decoder’s robustness to small perturbations along the target embedding manifold, expands the
range of manifolds the decoder can correctly decode, and mitigates overfitting to some extent.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Embedding-to-embedding method for solving analogies</title>
        <p>After completing the pre-training of the autoencoder, we can obtain sentence embeddings using
the well-trained encoder. Within the generated embedding space, we propose an Ofset network
structure to learn predicting embeddings that satisfy the constraints of analogical equations.
This neural network is based on two important concepts of analogies: conformity and ratio, and
it is divided into two parts: the ratio extraction network and the conformity mapping network.</p>
        <p>The ratio extraction network, the first part of the Ofset network, learns the ratio relationship
in the analogy by taking the embeddings of sentences A and B as inputs.</p>
        <p>The conformity mapping network, the second part, learns to map the ratio and the embedding
of sentence C to obtain the embedding of sentence D.</p>
        <p>These two parts of the network have a simple structure, consisting of only one layer of
convolutional network and one fully connected layer. To some extent, this network structure
achieves the ofset of embedding C by ensuring the conformity of the ratio between two binary
tuples in the analogy. Hence, we refer to it as the Ofset network . Our expectation is that it
can learn the properties of analogies from the dataset and solve analogies without relying on
predefined formulas.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Experiments</title>
      <sec id="sec-4-1">
        <title>4.1. Evaluation metrics</title>
        <p>
          In our experiments, we want the generated sentences and the reference sentences to be as
similar as possible. So we use BLEU [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] to evaluate the similarity of two sentences. BLEU
scores are between 0 and 100. The higher the score, the more similar the two sentences are. We
also use the Levenshtein distance to evaluate the degree of diference between two sentences.
In addition, the accuracy rate is the ratio of the number of perfectly predicted sentences to the
total number of reference sentences.
4.2. Data
For the pre-training of the auto-encoder, we randomly extracted 85,000 English, French, and
German sentences from the Tatoeba1 corpus. The average length of English sentences is 6.5,
while for French and German, it is 8.7. We split into 80%, 10%, 10% for training, validation
and testing. In order to evaluate our method for solving sentence analogies, we conducted
tests on the semantico-formal analogy dataset proposed in [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], which contains 5,607 sentence
analogies. Additionally, to further assess the performance of our model in solving sentence
formal analogies, we utilized the Nlg package proposed in [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] to extract purely formal analogies
from Tatoeba in the three languages. Statistics on the data are presented in Table 1.
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>4.3. Setups</title>
        <p>
          For decoding sentence embeddings, we keep the decoder part of the autoencoder consistent
with the decoder in [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. After obtaining word vector sequences using pre-trained FastText word
embeddings, we employ two approaches to obtain sentence embeddings, i.e., simple summation:
adding the word vectors corresponding to each dimension together. encoder of autoencoder:
using a bidirectional LSTM to obtain sentence embeddings. During training, we employed
cross-entropy as the loss function and utilized the Adam optimizer with a learning rate to 0.001.
In training the sentence embeddings decoder, we set the maximum iteration count to 1000 and
used an early stopping mechanism, which means that training stops if there is no improvement
after 15 iterations. However, for solving sentence analogies, we set the tolerance count for early
stopping to 50. Additionally, when training the model for solving sentence analogies, we froze
the parameters of the autoencoder, meaning that we did not fine-tune the embedding model.
• sum-FCN : Using the FCN network proposed in [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] in conjunction with the formula from
3CosAdd to process embeddings as inputs to solve analogies and obtaining sentence
embeddings by simple summation.
• enc-FCN : Obtaining sentence embeddings using an encoder and solving analogies using
the FCN network in conjunction with the formula from 3CosAdd to process embeddings
as inputs.
• enc-Ofset : Obtaining sentence embeddings using an encoder and solving analogies using
our Ofset network.
• enc-ANNr: Obtaining sentence embeddings using an encoder and solving analogies with
the ANNr network used in [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
        </p>
        <p>During training, we employed MSE as the loss function.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.4. Performance in decoding sentence embeddings</title>
        <p>During the pre-training of the autoencoder, we set the ratio of Gaussian noise added to the
sentence embeddings as 0.1. Additionally, the dimension of the sentence embeddings was set
to 300. The results on the three languages are shown in Table 2. In terms of accuracy, using
sentence embeddings generated by the encoder of the autoencoder outperforms the simple
summation approach by nearly 30% in all three languages. For English sentences, which are
shorter with a smaller vocabulary, the decoding accuracy reaches 91.1%. Additionally, the
0.1 Levenshtein distance indicates that, on average, less than one word is incorrect when
decoding English sentences. As for French and German, which have longer sentence lengths
and vocabulary sizes two to three times larger than that of English, the decoding performance
decreases slightly, but the decoding accuracy using encoder-generated sentence embeddings
still surpasses the simple summation approach by a considerable margin more than 30%.
en
fr
de
en
fr</p>
        <p>de
100
80
EU 60
L
B
40
20
0 5
EU 60
L
B
100
80
40
20</p>
        <p>Additionally, we investigated the impact of sentence length on the decoding of sentence
embeddings. As shown in Figure 2, we can observe that both methods experience a gradual
decrease in decoding performance as sentence length increases. However, the approach of using
encoder-derived sentence embeddings exhibits relatively more stability. Furthermore, due to
the larger vocabulary in German, the decoding performance is relatively lower compared to the
other two languages.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.5. Performance in solving sentence analogies</title>
        <sec id="sec-4-4-1">
          <title>4.5.1. Semantic-formal analogies</title>
          <p>For the performance on the semantic-formal analogy set, we first evaluated the performance of
our pre-trained autoencoder in decoding the embeddings of the fourth item sentence D in the
analogy. Both accuracy and BLEU score reached 100, indicating that all the target reference
sentences were perfectly reconstructed. Then, as described in Subsection 4.2, we tested the
performance of diferent method combinations on this dataset. As shown in Table 3, from the
perspective of obtaining sentence embeddings, using the encoder to obtain sentence embeddings
performs similarly to the simple summing method, with only a 1-point diference in BLEU score.
This is because the average length of English sentences in this dataset is relatively short, and
both methods show similar performance in decoding sentence embeddings. However, from the
perspective of solving analogies, the FCN network using the 3CosAdd formula outperforms
the Ofset network and ANNr. This indirectly indicates that methods relying on predefined
formulas are more efective than learning analogy properties from a dataset when the data size
is limited, sentences are short, and analogies are relatively simple in form.</p>
        </sec>
        <sec id="sec-4-4-2">
          <title>4.5.2. Purely formal analogies</title>
          <p>For purely formal analogies, Table 4 presents the performance of diferent models in the three
languages.</p>
          <p>Considering the languages, although French and German have a larger vocabulary, the overall
impact is mainly determined by the average sentence length. Since French has the longest
average sentence length, followed by German, and English has the shortest, the performance of
diferent models is generally lower in French compared to the other two languages. Especially,
due to the extremely short average sentence length in English, when using the FCN network
to solve analogies, the method of obtaining sentence embeddings using the encoder and the
simple summing method show similar performance, with the simple summing method even
outperforming it. In contrast, the performance trends of diferent models in French and German
are roughly similar. First, the method of obtaining sentence embeddings using the encoder
outperforms the simple summing method. Second, the FCN network performs better than the
Ofset network.</p>
        </sec>
        <sec id="sec-4-4-3">
          <title>4.5.3. Performance on longer sentences</title>
          <p>It is worth mentioning that French has a longer average sentence length, with the longest
sentence reaching around 10 words. The performance of the Ofset network is almost on par
with the FCN network, with only a slight diference of around 1 in both BLEU score and accuracy.
This suggests that when the average sentence length becomes longer, the Ofset network, which
learns analogy properties from the dataset, may perform well. Therefore, we further conducted
tests on the French dataset by selecting sentences with a length of 10 or more, and the results
are shown in Figure 3. We observe that when the sentence length exceeds 10, the Ofset network
performs better than FCN. We infer that when dealing with longer sentences, methods that learn
analogy properties from the dataset are more reliable than using predefined formulas such as
3CosAdd. This could be because the application of the 3CosAdd formula in analogies of longer
average sentence lengths requires the sentence embedding space to have more pronounced
linear properties. On the other hand, learning from the dataset allows for lower expectations in
the embedding space having linear properties, especially when there is a larger amount of data
available.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>We proposed an auto-encoder architecture that internally removes noise to generates sentence
embeddings and reconstructs sentences, achieving high accuracy in decoding sentence
embeddings. Building upon this, we devised an embedding-to-embedding method and a model that
learns analogies from datasets in the sentence embedding space instead of relying on predefined
formulas. Our experiments demonstrated that this approach performs better than a model
relying on the 3CosAdd formula, especially in cases where the sentence length is longer.</p>
      <p>Our method for analogy solving is a generation-based approach. It is still limited by the
drawback of LSTM decoders in handling long sentences. In the future, we need to explore more
advanced encoder-decoder architectures that are better suited for decoding longer sentences, as
well as generating more meaningful sentence embeddings specifically designed for analogies.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This research has been partially supported by a JSPS grant Kiban C n° 21K12038 entitled «
Theoretically founded algorithms for the automatic production of test sets in NLP."</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lepage</surname>
          </string-name>
          ,
          <article-title>Languages of analogical strings</article-title>
          ,
          <source>in: Proceedings of the 18th International Conference on Computational Linguistics (COLING</source>
          <year>2000</year>
          ), volume
          <volume>1</volume>
          ,
          <string-name>
            <surname>Saarbrücken</surname>
          </string-name>
          ,
          <year>2000</year>
          , pp.
          <fpage>488</fpage>
          -
          <lpage>494</lpage>
          . URL: https://aclanthology.org/C00-1071.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>S.</given-names>
            <surname>Afantenos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Prade</surname>
          </string-name>
          , G. Richard,
          <article-title>Theoretical study and empirical investigation of sentence analogies</article-title>
          ,
          <source>in: IJCAI-ECAI Workshop: Workshop on the Interactions between Analogical Reasoning and Machine Learning (IAMRL</source>
          <year>2022</year>
          )
          <article-title>@ IJCAI-ECAI 2022</article-title>
          , volume
          <volume>3174</volume>
          , CEUR-WS. org,
          <year>2022</year>
          , pp.
          <fpage>15</fpage>
          -
          <lpage>28</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T.</given-names>
            <surname>Mikolov</surname>
          </string-name>
          , W.-t. Yih, G. Zweig,
          <article-title>Linguistic regularities in continuous space word representations, in: Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics</article-title>
          , Atlanta, Georgia,
          <year>2013</year>
          , pp.
          <fpage>746</fpage>
          -
          <lpage>751</lpage>
          . URL: https://aclanthology.org/N13-1090.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>O.</given-names>
            <surname>Levy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Goldberg</surname>
          </string-name>
          ,
          <article-title>Dependency-based word embeddings</article-title>
          ,
          <source>in: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume</source>
          <volume>2</volume>
          :
          <string-name>
            <surname>Short</surname>
            <given-names>Papers)</given-names>
          </string-name>
          ,
          <source>Association for Computational Linguistics</source>
          , Baltimore, Maryland,
          <year>2014</year>
          , pp.
          <fpage>302</fpage>
          -
          <lpage>308</lpage>
          . URL: https://aclanthology.org/P14-2050. doi:doi: 10.3115/v1/
          <fpage>P14</fpage>
          -2050.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>L.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lepage</surname>
          </string-name>
          ,
          <article-title>Vector-to-sequence models for sentence analogies</article-title>
          ,
          <source>in: 2020 International Conference on Advanced Computer Science and Information Systems (ICACSIS)</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>441</fpage>
          -
          <lpage>446</lpage>
          . doi:doi: 10.1109/ICACSIS51025.
          <year>2020</year>
          .
          <volume>9263191</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>E.</given-names>
            <surname>Marquer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Alsaidi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Decker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.-A.</given-names>
            <surname>Murena</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Couceiro</surname>
          </string-name>
          ,
          <article-title>A deep learning approach to solving morphological analogies</article-title>
          , in: M. T. Keane, N. Wiratunga (Eds.),
          <source>Case-Based Reasoning Research and Development</source>
          , Springer International Publishing, Cham,
          <year>2022</year>
          , pp.
          <fpage>159</fpage>
          -
          <lpage>174</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lepage</surname>
          </string-name>
          ,
          <article-title>Semantico-formal resolution of analogies between sentences</article-title>
          ,
          <source>in: the 9th Language and Technology Conference (LTC</source>
          <year>2019</year>
          ),
          <year>2019</year>
          , p.
          <fpage>57</fpage>
          -
          <lpage>61</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>P.</given-names>
            <surname>Bojanowski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Grave</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Joulin</surname>
          </string-name>
          , T. Mikolov,
          <article-title>Enriching word vectors with subword information, Transactions of the Association for Computational Linguistics 5 (</article-title>
          <year>2017</year>
          )
          <fpage>135</fpage>
          -
          <lpage>146</lpage>
          . URL: https://aclanthology.org/Q17-1010. doi:doi: 10.1162/tacl_a_
          <fpage>00051</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>K.</given-names>
            <surname>Chan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. P.</given-names>
            <surname>Kaszefski-Yaschuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Saran</surname>
          </string-name>
          , E. Marquer,
          <string-name>
            <given-names>M.</given-names>
            <surname>Couceiro</surname>
          </string-name>
          , Solving Morphological Analogies Through Generation,
          <source>in: IJCAI-ECAI Workshop on the Interactions between Analogical Reasoning and Machine Learning (IARML@IJCAI-ECAI 2022)</source>
          , volume
          <volume>3174</volume>
          <source>of Proceedings of the IJCAI-ECAI Workshop on the Interactions between Analogical Reasoning and Machine Learning (IARML@IJCAI-ECAI</source>
          <year>2022</year>
          ),
          <article-title>Miguel Couceiro</article-title>
          and
          <string-name>
            <surname>Pierre-Alexandre</surname>
            <given-names>Murena</given-names>
          </string-name>
          , Vienna, Austria,
          <year>2022</year>
          , pp.
          <fpage>29</fpage>
          -
          <lpage>39</lpage>
          . URL: https://hal.inria.fr/hal-03674913.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>K.</given-names>
            <surname>Papineni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Roukos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ward</surname>
          </string-name>
          , W.-J. Zhu,
          <article-title>Bleu: a method for automatic evaluation of machine translation, in: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics</article-title>
          , Philadelphia, Pennsylvania, USA,
          <year>2002</year>
          , pp.
          <fpage>311</fpage>
          -
          <lpage>318</lpage>
          . URL: https://aclanthology.org/P02-1040. doi:doi: 10.3115/1073083.1073135.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>R.</given-names>
            <surname>Fam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lepage</surname>
          </string-name>
          ,
          <article-title>Tools for the production of analogical grids and a resource of n-gram analogical grids in 11 languages</article-title>
          , in
          <source>: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC</source>
          <year>2018</year>
          ),
          <article-title>European Language Resources Association (ELRA), Miyazaki</article-title>
          , Japan,
          <year>2018</year>
          , pp.
          <fpage>1060</fpage>
          -
          <lpage>1066</lpage>
          . URL: https://aclanthology.org/L18-1171.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>