<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>What if the whole is greater than the sum of the parts? Modelling Complex (Multiword) Expressions</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Aline Villavicencio</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, University of Shefield Regent Court (DCS) 211 Portobello Shefield</institution>
          ,
          <addr-line>S1 4DP</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Multiword Expressions (MWEs) such as idioms (make ends meet), light verb constructions (give a sigh), verb particle constructions (shake up) and noun compounds (loan shark), are an integral part of the mental lexicon of native speakers often used to express complex ideas in a simple and conventionalised way accepted by a given linguistic community. As they may display a wealth of idiosyncrasies, from lexical, syntactic and semantic to statistical, they have represented a real challenge for natural language processing. However, their accurate integration has the potential for improving the precision, naturalness and fluency of downstream tasks like text simplification. In this paper I discuss some advances in the identification and modelling of MWEs, concentrating on techniques for identifying their degree of idiomaticity and approximating their meaning. One of the challenges is that their interpretation often needs more knowledge than can be gathered from their individual components and their combinations to diferentiate combinations whose meaning can be (partly) inferred from their parts (as apple juice: juice made of apples) from those that cannot (as dark horse: an unknown candidate who unexpectedly succeeds). In particular, the paper presents results obtained with the use of contextualised word representation models, which have been successfully used for capturing diferent word usages, and therefore could provide an attractive alternative for representing idiomaticity in language.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Multiword Expressions</kwd>
        <kwd>Idiomaticity</kwd>
        <kwd>Lexical Simplification</kwd>
        <kwd>Contextualised Word Embeddings</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Human language is a powerful means for communicating ideas, and concepts, requests and
desires, for transmitting folklore, tales, history and scientific knowledge, on both an individual
and a global scale. It is expressive allowing complex ideas to be transmitted both formally
and informally, in scientific and in colloquial settings; it is creative and dynamic incorporating
new concepts, words and usages; flexible and ambiguous allowing for irony, jokes, idioms and
metaphors (e.g. a blanket of snow, French kiss, rocket science). However, such a large, rich and
complex system may also challenge humans and create barriers for a clear understanding of the
message to be communicated. For instance, a lack of vocabulary or of a particular word usage
may hinder the full understanding of a message. This is a common situation for children learning
a language [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], for adult native speakers learning a new domain, and often faced by second
language learners [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Health factors have also been linked with an impact in language abilities,
including in clinical conditions like aphasia [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and dyslexia [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and autism [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Socio-economic
conditions and lack of access to schooling may also result in low language abilities and even
illiteracy [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], which, in many contexts, may afect a large part of the population, as, for instance,
in Brazil, where according to social indicators, in 2012 6% of the Brazilian population aged 15 or
over were fully illiterate, among 27% who were functionally illiterate.1 Therefore, making texts
accessible for diferent target groups can be viewed as a crucial task not only for leading to a
better understanding, but also for ensuring a better quality of life.
      </p>
      <p>
        In fact, there have been considerable research eforts into how to simplify texts [
        <xref ref-type="bibr" rid="ref10 ref7 ref8 ref9">7, 8, 9, 10</xref>
        ],
analysing their readability levels [
        <xref ref-type="bibr" rid="ref11 ref12 ref13">11, 12, 13</xref>
        ], creating resources [
        <xref ref-type="bibr" rid="ref14 ref15 ref16">14, 15, 16</xref>
        ], proposing new
techniques [
        <xref ref-type="bibr" rid="ref14 ref17">17, 14</xref>
        ] and analysing the suitability of evaluation frameworks [
        <xref ref-type="bibr" rid="ref11 ref15 ref18">11, 15, 18</xref>
        ]. These
have been applied for languages that in addition to English, include Spanish [
        <xref ref-type="bibr" rid="ref19 ref20">19, 20</xref>
        ], Portuguese
[
        <xref ref-type="bibr" rid="ref12 ref21 ref22 ref6">6, 12, 21, 22</xref>
        ], Italian [23] and Basque [24].
      </p>
      <p>
        One approach to text simplification targets lexical substitutions [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], where words identified
as dificult are replaced by easier synonyms, according to a given simplicity measure, such as
word frequency, polysemy and length. However, in addition to considering words individually,
lexical substitution needs to take into account the occurrence of Multiword Expressions (MWEs)
in sentences, including compound nouns (like loan shark2) and idioms (like let the cat out of the
bag3) since their meanings may not be related to the meanings of their individual components,
and replacing one of their components in isolation may result in a nonsensical simplification
or even in loss of the original meaning (e.g. loan fish instead of loan shark). In particular,
MWEs are also challenging since they display many idiosyncrasies, from lexical to semantic
and statistical [25], and these may have an efect on the readability of a sentence, and are
particularly problematic for non-native speakers of a language since these idiosyncrasies are
often unpredictable, and may lead to markedness (e.g. strong cofee vs. ?powerful cofee).
      </p>
      <p>Indeed, the positive impact of explicitly handling MWEs has been discussed for a number
of tasks and applications like machine translation [26], information retrieval [27] and parsing
[28], as discussed by Constant et al [29]. Although text simplification has received considerable
attention, studies focusing on MWEs are still scarce and there is a need for further investigation
into potential processing overheads involved in MWEs (if any) for diferent groups, and how
they can best be handled for text simplification. In what follows, I discuss some of the challenges
of MWEs and some techniques for detecting their idiomaticity.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Multiword Expressions and Idiomaticity</title>
      <p>MWEs have been defined as word sequences, not necessarily adjacent, that are recurrent, act as
a single unit at some level of linguistic analysis [30], and whose interpretation crosses word
boundaries [31]. MWEs include, besides compound nouns (cheese knife, software engineering)
1http://www.ipm.org.br/
2Meaning a person who lends money at very high interest rates.
3Meaning to reveal a secret.
and idioms (make ends meet, kick the bucket), verb-particle constructions (break down, carry on),
determinerless PPs (in hospital), collocations (strong cofee, heave baggage ), support verbs (give
sigh, take shower), among others. The importance of MWEs for a simplification system can be
gauged from estimates about their frequency in language: for Biber et al. [32] they correspond
to between 30% and 45% of spoken English and 21% of academic prose, while for Jackendof [ 33]
they have the same order of magnitude as the number of single words in a speaker’s mental
lexicon. They have also been found to have faster processing times compared to non-MWEs
(compositional novel sequences) [34, 35].</p>
      <p>Their computational treatment involves steps like discovering their occurrence in sentences,
determining how idiomatic they are (as a type in general and as an MWE token in a particular
sentence), and approximating their meaning. For MWE discovery, methods for determining
if a given sequence of words forms an MWE or not have often been based on the statistical
markedness of their recurrence, using association and entropic measures calculated from corpus
counts [36], and on morphosyntactic patterns commonly associated to MWEs [37, 38]. These
have been applied to a variety of MWE types and languages and have been extensively discussed
[39, 36, 40].</p>
      <p>
        For determining the meaning of a combination of words, one common strategy is to derive
it from the meanings of the parts. However, for idiomatic cases this approach will lead to
an unrelated meaning being derived. This diference between the meaning of the MWE and
the meanings of the components can be used as the basis for determining how idiomatic a
given MWE is [
        <xref ref-type="bibr" rid="ref23">41, 42, 43, 44</xref>
        ]. The assumption is that the closer the meaning of the MWE is
to the meaning of the components, the more compositional it is, and the more they difer, the
more idiomatic the MWE is. This approach has been used with word embeddings, using the
proximity in a multidimensional space between the embedding of the MWE and the embedding
of the components combined by operations like vector addition as semantic proximity. The
results obtained with static word embeddings like word2vec [
        <xref ref-type="bibr" rid="ref24">45</xref>
        ] and GloVe [
        <xref ref-type="bibr" rid="ref25">46</xref>
        ] for MWE type
idiomaticity detection for instance for Noun Compounds in languages like English, French and
Portuguese have been strongly correlated with human judgments about idiomaticity [
        <xref ref-type="bibr" rid="ref23">44</xref>
        ].
      </p>
      <p>
        For token idiomaticity detection, the challenge is to decide if a potentially ambiguous MWE
is used literally or idiomatically in a given sentence (e.g. big fish literally as a large aquatic
animal or idiomatically as an important person). For this task, contextualised word embeddings
like BERT [
        <xref ref-type="bibr" rid="ref26">47</xref>
        ] and ELMo [
        <xref ref-type="bibr" rid="ref27">48</xref>
        ] may be an attractive alternative, as they seem to represent
words more accurately than static word embeddings like GloVe, being able to encode diferent
usages of a given word that appear to form clusters that seem related to its various senses
[
        <xref ref-type="bibr" rid="ref28">49</xref>
        ]. However, analyses of whether and to what extent idiomaticity in MWEs is accurately
incorporated by word representation models have recently reported mixed results. On the one
hand, in an analysis of diferent classifiers initialised with contextualised and non-contextualised
embeddings evaluated in five tasks related to lexical composition (including the literality of
NCs) contextualised models, especially BERT, obtained the best performance across all tasks
[
        <xref ref-type="bibr" rid="ref29">50</xref>
        ]. On other analyses, static models like word2vec had better performance than contextualised
models [
        <xref ref-type="bibr" rid="ref30 ref31">51, 52</xref>
        ]. These mixed results suggest that a controlled evaluation setup may be needed
to obtain comparable results across models and languages.
      </p>
      <p>
        In recent work a set of probing tasks were defined to assess the representation of noun
compounds (NCs) in vector space models in two languages, English and French. The probes
took into account the NCs and their paraphrases in contexts that involved minimal modifications
and were used to compare the idiomatic and literal representations of a given NC [
        <xref ref-type="bibr" rid="ref32">53</xref>
        ]. The goal
was to assess if word embeddings with diferent levels of contextualisation would be sensitive to
changes in idiomaticity caused by diferent paraphrases in naturalistic sentences and in context
neutral sentences.
      </p>
      <p>
        A second set of probes was defined to assess if the models were more sensitive to idiomaticity
at type or at token level [54], and whether contextualised models could reach the performance
obtained by static embeddings for type idiomaticity detection [
        <xref ref-type="bibr" rid="ref23">44</xref>
        ]. The results obtained in
these evaluations suggest that these models are still not sensitive enough to detect idiomaticity
accurately. Moreover, even if contextualised embeddings outperform static models in many
tasks, they still have lower performance for idiomaticity detection. Further analyses that take
into account more fine-grained sense distinctions for MWEs, allowing for polysemy in literal
and in idiomatic senses, have revealed that fine-tuning improves performance, both for MWE
discovery in sentences and for sense identification [ 55]. This analysis is part of SEMEVAL 2022
Task4, and includes datasets for English, Portuguese and Galician.
      </p>
      <p>For downstream tasks like text simplification, these results mean that systems that adopt
state-of-the-art pre-trained models as they are, will lack accuracy when faced with MWEs,
interpreting an idiomatic MWE like eager beaver in a sentence such as When she first started
working she was a real eager beaver. as more similar to anxious castor than to its actual synonym
of hardworking person.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Conclusions</title>
      <p>In this paper I discussed MWEs and some of the challenges they pose for tasks like text
simplification. In particular I concentrated on recent work on idiomaticity detection using
state-of-the-art language models. The results obtained suggest that there are still advances to
be made for more accurate representation of MWE idiomaticity. However, given the prevalence
of MWE in natural languages, and their role in both general and technical domains, approaches
for text simplification, especially lexical simplification, need to take them into account for more
precise and understandable results. Moreover, additional research on the impact of MWEs
for diferent groups of speakers can shed light into any additional complexity, given their
advantageous processing for native speakers, and their often opaque meaning for non-native
speakers.</p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgments</title>
      <p>I want to thank the many co-authors who have collaborated in this work, including Marco Idiart,
Carlos Ramisch, Carolina Scarton, Harish Tayyar Madabushi, Tiago Vieira, Marcos Garcia,
Rodrigo Wilkens, Leonardo Zilio, Renata Ramisch and Silvio Cordeiro (in no particular order).
This research has been partly funded by EPSRC project MIA.</p>
      <p>4https://sites.google.com/view/semeval2022task2-idiomaticity
sio, Using eye-tracking data to predict the readability of Brazilian Portuguese sentences in
single-task, multi-task and sequential transfer learning approaches, in: Proceedings of
the 28th International Conference on Computational Linguistics, International Committee
on Computational Linguistics, Barcelona, Spain (Online), 2020, pp. 5821ś5831. URL: https:
//aclanthology.org/2020.coling-main.512. doi:10.18653/v1/2020.coling-main.512.
[23] G. Barlacchi, S. Tonelli, ERNESTA: A sentence simplification tool for children’s
stories in italian, in: A. F. Gelbukh (Ed.), Computational Linguistics and Intelligent Text
Processing - 14th International Conference, CICLing 2013, Samos, Greece, March 24-30,
2013, Proceedings, Part II, volume 7817 of Lecture Notes in Computer Science, Springer,
2013, pp. 476ś487. URL: https://doi.org/10.1007/978-3-642-37256-8_39. doi:10.1007/
978-3-642-37256-8\_39.
[24] I. Gonzalez-Dios, M. J. Aranzabe, A. D. de Ilarraza, The corpus of basque simplified
texts (CBST), Lang. Resour. Evaluation 52 (2018) 217ś247. URL: https://doi.org/10.1007/
s10579-017-9407-6. doi:10.1007/s10579-017-9407-6.
[25] T. Baldwin, S. N. Kim, Multiword expressions, in: N. Indurkhya, F. J. Damerau (Eds.),
Handbook of Natural Language Processing, Second Edition, Chapman and Hall/CRC, 2010,
pp. 267ś292. URL: http://www.crcnetbase.com/doi/abs/10.1201/9781420085938-c12.
[26] M. Carpuat, M. T. Diab, Task-based evaluation of multiword expressions: a pilot study
in statistical machine translation, in: Human Language Technologies: Conference of
the North American Chapter of the Association of Computational Linguistics,
Proceedings, June 2-4, 2010, Los Angeles, California, USA, The Association for Computational
Linguistics, 2010, pp. 242ś245. URL: https://aclanthology.org/N10-1029/.
[27] O. Acosta, A. Villavicencio, V. Moreira, Identification and treatment of multiword
expressions applied to information retrieval, in: Proceedings of the Workshop on
Multiword Expressions: from Parsing and Generation to the Real World, Association
for Computational Linguistics, Portland, Oregon, USA, 2011, pp. 101ś109. URL: https:
//aclanthology.org/W11-0815.
[28] M. Constant, J. Nivre, A transition-based system for joint lexical and syntactic analysis, in:
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics
(Volume 1: Long Papers), Association for Computational Linguistics, Berlin, Germany, 2016,
pp. 161ś171. URL: https://aclanthology.org/P16-1016. doi:10.18653/v1/P16-1016.
[29] M. Constant, G. Eryigˇit, J. Monti, L. van der Plas, C. Ramisch, M. Rosner, A. Todirascu,
Survey: Multiword expression processing: A Survey, Computational Linguistics 43 (2017)
837ś892. URL: https://aclanthology.org/J17-4005. doi:10.1162/COLI_a_00302.
[30] N. Calzolari, C. J. Fillmore, R. Grishman, N. Ide, A. Lenci, C. MacLeod, A. Zampolli, Towards
best practice for multiword expressions in computational lexicons., in: Proceedings of
the Third International Conference on Language Resources and Evaluation (LREC-2002),
European Language Resources Association (ELRA), Las Palmas, Canary Islands - Spain,
2002. URL: http://www.lrec-conf.org/proceedings/lrec2002/pdf/259.pdf, aCL Anthology
Identifier: L02-1259.
[31] I. A. Sag, T. Baldwin, F. Bond, A. A. Copestake, D. Flickinger, Multiword expressions: A pain
in the neck for NLP, in: A. F. Gelbukh (Ed.), Computational Linguistics and Intelligent Text
Processing, Third International Conference, CICLing 2002, Mexico City, Mexico, February
17-23, 2002, Proceedings, volume 2276 of Lecture Notes in Computer Science, Springer, 2002,
pp. 1ś15. URL: https://doi.org/10.1007/3-540-45715-1_1. doi:10.1007/3-540-45715-1\
_1.
[32] D. Biber, S. Johansson, G. Leech, S. Conrad, E. Finegan, Longman Grammar of Spoken and</p>
      <p>Written English, Pearson Education, 1999.
[33] R. Jackendof, The Architecture of the Language Faculty, volume 28, MIT Press, 1997.
[34] M. H. Christiansen, I. Arnon, E. Lieven, A. Wray, Multiword sequences as building
blocks for language: Insights into first and second language learning, in: M. Knauf,
M. Pauen, N. Sebanz, I. Wachsmuth (Eds.), Proceedings of the 35th Annual Meeting of
the Cognitive Science Society, CogSci 2013, Berlin, Germany, July 31 - August 3, 2013,
cognitivesciencesociety.org, 2013. URL: https://mindmodeling.org/cogsci2013/papers/0029/
index.html.
[35] A. Siyanova-Chanturia, K. Conklin, N. Schmitt, Adding more fuel to the fire: An
eyetracking study of idiom processing by native and non-native speakers, Second Language
Research 27 (2011) 251ś272. URL: https://doi.org/10.1177/0267658310382068. doi:10.1177/
0267658310382068. arXiv:https://doi.org/10.1177/0267658310382068.
[36] P. Pecina, Lexical association measures and collocation extraction, Lang. Resour.
Evaluation 44 (2010) 137ś158. URL: https://doi.org/10.1007/s10579-009-9101-4. doi:10.1007/
s10579-009-9101-4.
[37] J. S. Justeson, S. M. Katz, Technical terminology: some linguistic properties and an
algorithm for identification in text, Nat. Lang. Eng. 1 (1995) 9ś27. URL: https://doi.org/10.
1017/S1351324900000048. doi:10.1017/S1351324900000048.
[38] T. Baldwin, Deep lexical acquisition of verb-particle constructions, Comput. Speech
Lang. 19 (2005) 398ś414. URL: https://doi.org/10.1016/j.csl.2005.02.004. doi:10.1016/j.
csl.2005.02.004.
[39] C. Ramisch, Multiword Expressions Acquisition - A Generic and Open Framework, Theory
and Applications of Natural Language Processing, Springer, 2015. URL: https://doi.org/10.
1007/978-3-319-09207-2. doi:10.1007/978-3-319-09207-2.
[40] N. Schneider, D. Hovy, A. Johannsen, M. Carpuat, SemEval-2016 task 10: Detecting minimal
semantic units and their meanings (DiMSUM), in: Proceedings of the 10th International
Workshop on Semantic Evaluation (SemEval-2016), Association for Computational
Linguistics, San Diego, California, 2016, pp. 546ś559. URL: https://aclanthology.org/S16-1084.
doi:10.18653/v1/S16-1084.
[41] J. Mitchell, M. Lapata, Composition in distributional models of semantics, Cogn. Sci. 34
(2010) 1388ś1429. URL: https://doi.org/10.1111/j.1551-6709.2010.01106.x. doi:10.1111/j.
1551-6709.2010.01106.x.
[42] S. Reddy, D. McCarthy, S. Manandhar, An empirical study on compositionality in
compound nouns, in: Fifth International Joint Conference on Natural Language Processing,
IJCNLP 2011, Chiang Mai, Thailand, November 8-13, 2011, The Association for Computer
Linguistics, 2011, pp. 210ś218. URL: https://aclanthology.org/I11-1024/.
[43] M. Farahmand, J. Henderson, Modeling the non-substitutability of multiword expressions
with distributional semantics and a log-linear model, in: Proceedings of the 12th Workshop
on Multiword Expressions, Association for Computational Linguistics, Berlin, Germany,
2016, pp. 61ś66. URL: https://www.aclweb.org/anthology/W16-1809. doi:10.18653/v1/
W16-1809.
iomaticity in vector space models, in: Proceedings of the 16th Conference of the
European Chapter of the Association for Computational Linguistics: Main Volume,
Association for Computational Linguistics, Online, 2021, pp. 3551ś3564. URL: https:
//aclanthology.org/2021.eacl-main.310.
[54] M. Garcia, T. Kramer Vieira, C. Scarton, M. Idiart, A. Villavicencio, Assessing the
representations of idiomaticity in vector models with a noun compound dataset labeled at
type and token levels, in: Proceedings of the 59th Annual Meeting of the Association
for Computational Linguistics and the 11th International Joint Conference on Natural
Language Processing (Volume 1: Long Papers), Association for Computational Linguistics,
Online, 2021, pp. 2730ś2741. URL: https://aclanthology.org/2021.acl-long.212.
[55] H. Tayyar Madabushi, E. Gow-Smith, C. Scarton, A. Villavicencio, Astitchintransformers:
Dataset and methods for the exploration of idiomaticity in pre-trained language models, in:
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing.,
Association for Computational Linguistics, 2021.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>J. De Belder</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.-F. Moens</surname>
          </string-name>
          ,
          <article-title>Text simplification for children</article-title>
          , ACM; New York,
          <year>2010</year>
          , pp.
          <fpage>19ś26</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G. H.</given-names>
            <surname>Paetzold</surname>
          </string-name>
          ,
          <article-title>Lexical simplification for non-native english speakers</article-title>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Carroll</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Minnen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Pearce</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Canning</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Devlin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tait</surname>
          </string-name>
          ,
          <article-title>Simplifying text for language-impaired readers</article-title>
          ,
          <source>in: EACL</source>
          <year>1999</year>
          ,
          <article-title>9th Conference of the European Chapter of the Association for Computational Linguistics</article-title>
          , June 8-12,
          <year>1999</year>
          , University of Bergen, Bergen, Norway, The Association for Computer Linguistics,
          <year>1999</year>
          , pp.
          <fpage>269ś270</fpage>
          . URL: https://aclanthology.org/E99-1042/.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>L.</given-names>
            <surname>Rello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Baeza-Yates</surname>
          </string-name>
          ,
          <article-title>The efect of font type on screen readability by people with dyslexia</article-title>
          ,
          <source>ACM Trans. Access. Comput. 8</source>
          (
          <year>2016</year>
          )
          <volume>15</volume>
          :1ś15:
          <fpage>33</fpage>
          . URL: https://doi.org/10.1145/2897736. doi:
          <volume>10</volume>
          .1145/2897736.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>R.</given-names>
            <surname>Evans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Orasan</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Dornescu</surname>
          </string-name>
          ,
          <article-title>An evaluation of syntactic simplification rules for people with autism</article-title>
          , in: S.
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Siddharthan</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          . Nenkova (Eds.),
          <source>Proceedings of the 3rd Workshop on Predicting and Improving Text Readability for Target Reader Populations, PITR@EACL</source>
          <year>2014</year>
          , Gothenburg, Sweden, April
          <volume>27</volume>
          ,
          <year>2014</year>
          , Association for Computational Linguistics,
          <year>2014</year>
          , pp.
          <fpage>131ś140</fpage>
          . URL: https://doi.org/10.3115/v1/
          <fpage>W14</fpage>
          -1215. doi:
          <volume>10</volume>
          .3115/v1/
          <fpage>W14</fpage>
          -1215.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Aluísio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Gasperin</surname>
          </string-name>
          ,
          <article-title>Fostering digital inclusion and accessibility: The PorSimples project for simplification of Portuguese texts</article-title>
          ,
          <source>in: Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas, Association for Computational Linguistics</source>
          , Los Angeles, California,
          <year>2010</year>
          , pp.
          <fpage>46ś53</fpage>
          . URL: https://aclanthology.org/W10-1607.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Siddharthan</surname>
          </string-name>
          ,
          <article-title>An architecture for a text simplification system</article-title>
          ,
          <source>in: 2002 Language Engineering Conference (LEC</source>
          <year>2002</year>
          ),
          <fpage>13</fpage>
          -
          <lpage>15</lpage>
          December 2002, Hyderabad, India, IEEE Computer Society,
          <year>2002</year>
          , p.
          <fpage>64</fpage>
          . URL: https://doi.org/10.1109/LEC.
          <year>2002</year>
          .
          <volume>1182292</volume>
          . doi:
          <volume>10</volume>
          .1109/LEC.
          <year>2002</year>
          .
          <volume>1182292</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Shardlow</surname>
          </string-name>
          ,
          <article-title>A comparison of techniques to automatically identify complex words, in: 51st Annual Meeting of the Association for Computational Linguistics</article-title>
          ,
          <source>ACL 2013, Proceedings of the Student Research Workshop, 4-9 August</source>
          <year>2013</year>
          , Sofia, Bulgaria, The Association for Computer Linguistics,
          <year>2013</year>
          , pp.
          <fpage>103ś109</fpage>
          . URL: https://aclanthology.org/P13-3015/.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Štajner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Saggion</surname>
          </string-name>
          ,
          <article-title>Data-driven text simplification</article-title>
          ,
          <source>in: Proceedings of the 27th International Conference on Computational Linguistics: Tutorial Abstracts</source>
          , Association for Computational Linguistics, Santa Fe, New Mexico, USA,
          <year>2018</year>
          , pp.
          <fpage>19ś23</fpage>
          . URL: https: //aclanthology.org/C18-3005.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>F.</given-names>
            <surname>Alva-Manchego</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Scarton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Specia</surname>
          </string-name>
          ,
          <article-title>Data-driven sentence simplification: Survey and benchmark</article-title>
          ,
          <source>Computational Linguistics</source>
          <volume>46</volume>
          (
          <year>2020</year>
          )
          <article-title>135ś187</article-title>
          . URL: https://aclanthology.org/
          <year>2020</year>
          .cl-
          <volume>1</volume>
          .4. doi:
          <volume>10</volume>
          .1162/coli_a_
          <fpage>00370</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S.</given-names>
            <surname>Štajner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mitkov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Saggion</surname>
          </string-name>
          ,
          <article-title>One step closer to automatic evaluation of text simplification systems</article-title>
          ,
          <source>in: Proceedings of the 3rd Workshop</source>
          on Predicting and
          <article-title>Improving Text Readability for Target Reader Populations (PITR), Association for Computational Linguistics</article-title>
          , Gothenburg, Sweden,
          <year>2014</year>
          , pp.
          <fpage>1ś10</fpage>
          . URL: https://aclanthology.org/W14-1201. doi:
          <volume>10</volume>
          .3115/v1/
          <fpage>W14</fpage>
          -1201.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Wagner</surname>
          </string-name>
          <string-name>
            <surname>Filho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Wilkens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Villavicencio</surname>
          </string-name>
          ,
          <article-title>Automatic construction of large readability corpora</article-title>
          ,
          <source>in: Proceedings of the Workshop on Computational Linguistics for Linguistic Complexity (CL4LC)</source>
          ,
          <source>The COLING 2016 Organizing Committee</source>
          , Osaka, Japan,
          <year>2016</year>
          , pp.
          <fpage>164ś173</fpage>
          . URL: https://aclanthology.org/W16-4119.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Maddela</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <article-title>A word-complexity lexicon and a neural readability ranking model for lexical simplification</article-title>
          ,
          <source>in: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing</source>
          , Association for Computational Linguistics, Brussels, Belgium,
          <year>2018</year>
          , pp.
          <fpage>3749ś3760</fpage>
          . URL: https://aclanthology.org/D18-1410. doi:
          <volume>10</volume>
          .18653/ v1/
          <fpage>D18</fpage>
          -1410.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>W.</given-names>
            <surname>Coster</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kauchak</surname>
          </string-name>
          ,
          <article-title>Simple english wikipedia: A new text simplification task, in: The 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies</article-title>
          ,
          <source>Proceedings of the Conference</source>
          ,
          <volume>19</volume>
          -
          <fpage>24</fpage>
          June,
          <year>2011</year>
          , Portland, Oregon, USA - Short
          <string-name>
            <surname>Papers</surname>
          </string-name>
          , The Association for Computer Linguistics,
          <year>2011</year>
          , pp.
          <fpage>665ś669</fpage>
          . URL: https://aclanthology.org/P11-2117/.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>F.</given-names>
            <surname>Alva-Manchego</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Martin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bordes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Scarton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Sagot</surname>
          </string-name>
          , L. Specia,
          <article-title>ASSET: A dataset for tuning and evaluation of sentence simplification models with multiple rewriting transformations, in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics</article-title>
          , Online,
          <year>2020</year>
          , pp.
          <fpage>4668ś4679</fpage>
          . URL: https://aclanthology.org/
          <year>2020</year>
          .acl-main.
          <volume>424</volume>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <year>2020</year>
          .acl-main.
          <volume>424</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>G.</given-names>
            <surname>Paetzold</surname>
          </string-name>
          , L. Specia, SemEval
          <year>2016</year>
          task 11:
          <article-title>Complex word identification</article-title>
          ,
          <source>in: Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)</source>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , San Diego, California,
          <year>2016</year>
          , pp.
          <fpage>560ś569</fpage>
          . URL: https:// aclanthology.org/S16-1085. doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>S16</fpage>
          -1085.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>K.</given-names>
            <surname>Woodsend</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lapata</surname>
          </string-name>
          , Wikisimple:
          <article-title>Automatic simplification of wikipedia articles</article-title>
          , in: W. Burgard, D. Roth (Eds.),
          <source>Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence</source>
          ,
          <source>AAAI</source>
          <year>2011</year>
          , San Francisco, California, USA,
          <year>August</year>
          7-
          <issue>11</issue>
          ,
          <year>2011</year>
          , AAAI Press,
          <year>2011</year>
          . URL: http://www.aaai.org/ocs/index.php/AAAI/AAAI11/paper/view/3505.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>F.</given-names>
            <surname>Alva-Manchego</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Scarton</surname>
          </string-name>
          ,
          <string-name>
            <surname>L. Specia,</surname>
          </string-name>
          <article-title>The (Un)Suitability of Automatic Evaluation Metrics for Text Simplification</article-title>
          , Computational
          <string-name>
            <surname>Linguistics</surname>
          </string-name>
          (
          <year>2021</year>
          ). arXiv:https://direct.mit.edu/coli/article/doi/10.1162/coli
          <volume>00418</volume>
          /106930/ ℎ   −  −
          <string-name>
            <surname />
          </string-name>
          −  −  −   .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>S.</given-names>
            <surname>Štajner</surname>
          </string-name>
          , I. Calixto,
          <string-name>
            <given-names>H.</given-names>
            <surname>Saggion</surname>
          </string-name>
          ,
          <article-title>Automatic text simplification for Spanish: Comparative evaluation of various simplification strategies</article-title>
          ,
          <source>in: Proceedings of the International Conference Recent Advances in Natural Language Processing</source>
          , INCOMA Ltd. Shoumen,
          <string-name>
            <surname>BULGARIA</surname>
          </string-name>
          , Hissar, Bulgaria,
          <year>2015</year>
          , pp.
          <fpage>618ś626</fpage>
          . URL: https://aclanthology.org/R15-1080.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>H.</given-names>
            <surname>Saggion</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Štajner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bott</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mille</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Rello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Drndarevic</surname>
          </string-name>
          ,
          <article-title>Making it simplext: Implementation and evaluation of a text simplification system for spanish</article-title>
          ,
          <source>ACM Trans. Access. Comput. 6</source>
          (
          <year>2015</year>
          ). URL: https://doi.org/10.1145/2738046. doi:
          <volume>10</volume>
          .1145/2738046.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>N.</given-names>
            <surname>Hartmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. H.</given-names>
            <surname>Paetzold</surname>
          </string-name>
          , S. Aluísio,
          <article-title>SIMPLEX-PB 2.0: A reliable dataset for lexical simplification in Brazilian Portuguese</article-title>
          ,
          <source>in: Proceedings of the The Fourth Widening Natural Language Processing Workshop</source>
          , Association for Computational Linguistics, Seattle, USA,
          <year>2020</year>
          , pp.
          <fpage>18ś22</fpage>
          . URL: https://aclanthology.org/
          <year>2020</year>
          .winlp-
          <volume>1</volume>
          .6. doi:
          <volume>10</volume>
          .18653/v1/
          <year>2020</year>
          . winlp-
          <volume>1</volume>
          .6.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>S.</given-names>
            <surname>Evaldo Leal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Munguba Vieira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. dos Santos</given-names>
            <surname>Rodrigues</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. Nogueira</given-names>
            <surname>Teixeira</surname>
          </string-name>
          , S. Aluí-
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>S.</given-names>
            <surname>Cordeiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Villavicencio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Idiart</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Ramisch</surname>
          </string-name>
          ,
          <article-title>Unsupervised compositionality prediction of nominal compounds</article-title>
          ,
          <source>Computational Linguistics</source>
          <volume>45</volume>
          (
          <year>2019</year>
          )
          <article-title>1ś57</article-title>
          . URL: https://www.aclweb.org/anthology/J19-1001. doi:
          <volume>10</volume>
          .1162/coli_a_
          <fpage>00341</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [45]
          <string-name>
            <given-names>T.</given-names>
            <surname>Mikolov</surname>
          </string-name>
          , I. Sutskever,
          <string-name>
            <given-names>K.</given-names>
            <surname>Chen</surname>
          </string-name>
          , G. Corrado,
          <string-name>
            <given-names>J.</given-names>
            <surname>Dean</surname>
          </string-name>
          ,
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          ,
          <source>CoRR abs/1310</source>
          .4546 (
          <year>2013</year>
          ). URL: http: //arxiv.org/abs/1310.4546. arXiv:
          <volume>1310</volume>
          .
          <fpage>4546</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [46]
          <string-name>
            <given-names>J.</given-names>
            <surname>Pennington</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Socher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Manning</surname>
          </string-name>
          , Glove:
          <article-title>Global vectors for word representation</article-title>
          , in: A.
          <string-name>
            <surname>Moschitti</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Pang</surname>
          </string-name>
          , W. Daelemans (Eds.),
          <source>Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014, October 25-29</source>
          ,
          <year>2014</year>
          , Doha,
          <string-name>
            <surname>Qatar,</surname>
          </string-name>
          <article-title>A meeting of SIGDAT, a Special Interest Group of the ACL</article-title>
          , ACL,
          <year>2014</year>
          , pp.
          <fpage>1532ś1543</fpage>
          . URL: https://doi.org/10.3115/v1/d14-
          <lpage>1162</lpage>
          . doi:
          <volume>10</volume>
          .3115/v1/d14-
          <fpage>1162</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [47]
          <string-name>
            <given-names>J.</given-names>
            <surname>Devlin</surname>
          </string-name>
          , M.-
          <string-name>
            <given-names>W.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Toutanova</surname>
          </string-name>
          , BERT:
          <article-title>Pre-training of deep bidirectional transformers for language understanding</article-title>
          ,
          <source>in: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</source>
          , Volume
          <volume>1</volume>
          (Long and Short Papers),
          <source>Association for Computational Linguistics</source>
          , Minneapolis, Minnesota,
          <year>2019</year>
          , pp.
          <fpage>4171ś4186</fpage>
          . URL: https://aclanthology.org/ N19-1423. doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>N19</fpage>
          -1423.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [48]
          <string-name>
            <given-names>M.</given-names>
            <surname>Peters</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Neumann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Iyyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gardner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Clark</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zettlemoyer</surname>
          </string-name>
          ,
          <article-title>Deep contextualized word representations</article-title>
          ,
          <source>in: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</source>
          , Volume
          <volume>1</volume>
          (
          <string-name>
            <surname>Long</surname>
            <given-names>Papers)</given-names>
          </string-name>
          ,
          <source>Association for Computational Linguistics</source>
          , New Orleans, Louisiana,
          <year>2018</year>
          , pp.
          <fpage>2227ś2237</fpage>
          . URL: https://www.aclweb.org/anthology/N18-1202. doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>N18</fpage>
          -1202.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [49]
          <string-name>
            <given-names>T.</given-names>
            <surname>Schuster</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Ram</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Barzilay</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Globerson</surname>
          </string-name>
          ,
          <article-title>Cross-lingual alignment of contextual word embeddings, with applications to zero-shot dependency parsing, in: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Association for Computational Linguistics</article-title>
          , Minneapolis, Minnesota,
          <year>2019</year>
          , pp.
          <fpage>1599ś1613</fpage>
          . URL: https: //aclanthology.org/N19-1162. doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>N19</fpage>
          -1162.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [50]
          <string-name>
            <given-names>V.</given-names>
            <surname>Shwartz</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Dagan</surname>
          </string-name>
          ,
          <article-title>Still a pain in the neck: Evaluating text representations on lexical composition, Transactions of the Association for Computational Linguistics 7 (</article-title>
          <year>2019</year>
          )
          <article-title>403ś419</article-title>
          . URL: https://aclanthology.org/Q19-1027. doi:
          <volume>10</volume>
          .1162/tacl_a_
          <fpage>00277</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [51]
          <string-name>
            <given-names>N.</given-names>
            <surname>Nandakumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Baldwin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Salehi</surname>
          </string-name>
          ,
          <article-title>How Well Do Embedding Models Capture Noncompositionality? A View from Multiword Expressions</article-title>
          ,
          <source>in: Proceedings of the 3rd Workshop on Evaluating Vector Space Representations for NLP</source>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , Minneapolis, USA,
          <year>2019</year>
          , pp.
          <fpage>27ś34</fpage>
          . URL: https://www.aclweb.org/ anthology/W19-2004. doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>W19</fpage>
          -2004.
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [52]
          <string-name>
            <given-names>M.</given-names>
            <surname>King</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Cook</surname>
          </string-name>
          ,
          <article-title>Leveraging distributed representations and lexico-syntactic fixedness for token-level prediction of the idiomaticity of english verb-noun combinations</article-title>
          , in: I. Gurevych, Y. Miyao (Eds.),
          <article-title>Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics</article-title>
          ,
          <string-name>
            <surname>ACL</surname>
          </string-name>
          <year>2018</year>
          , Melbourne, Australia,
          <source>July 15-20</source>
          ,
          <year>2018</year>
          , Volume
          <volume>2</volume>
          :
          <string-name>
            <given-names>Short</given-names>
            <surname>Papers</surname>
          </string-name>
          ,
          <source>Association for Computational Linguistics</source>
          ,
          <year>2018</year>
          , pp.
          <fpage>345ś350</fpage>
          . URL: https: //www.aclweb.org/anthology/P18-2055/. doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>P18</fpage>
          -2055.
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [53]
          <string-name>
            <given-names>M.</given-names>
            <surname>Garcia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. Kramer</given-names>
            <surname>Vieira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Scarton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Idiart</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Villavicencio</surname>
          </string-name>
          , Probing for id-
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>