<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Comparing Ref-Vectors and word embeddings in a verb semantic similarity task</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Andrea Amelio Ravelli[</string-name>
          <email>andreaamelio.ravelli@unifi.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lorenzo Gr</string-name>
          <email>lorenzo.gregori@unifi.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>gori[</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Florence</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>This work introduces reference vectors (Ref-Vectors), a new kind of word vectors in which the semantics is determined by the property of words to refer to world entities (i.e. objects or events), rather than by contextual information retrieved in a corpus. Ref-Vectors are here compared with state-of-the-art word embeddings in a verb semantic similarity task. The SimVerb-3500 dataset has been used as a benchmark to verify the presence of a statistical correlation between the semantic similarity derived by human judgments and those measured with RefVectors and verb embeddings. Results show that Ref-Vector similarities are closer to human judgments, proving that, within the action domain, these vectors capture verb semantics better than word embeddings.</p>
      </abstract>
      <kwd-group>
        <kwd>Ref-Vectors</kwd>
        <kwd>Verb semantics</kwd>
        <kwd>Word embedding</kwd>
        <kwd>Semantic representation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Natural Language Understanding is a key point in Arti cial Intelligence and
Robotics, especially when it deals with human-machine interaction, such as
instruction given to arti cial agents or active learning from observation of human
actions and activities. In these scenarios, enabling arti cial agents to interpret
the semantics and, most of all, the pragmatics behind human speech acts is of
paramount importance. Moreover, the verb class is crucial for language
processing and understanding, given that it is around verbs that the human language
builds sentences.</p>
      <p>As an example, consider two of the possible interpretation of a sentence such
as John pushes the glass : (1) apply a continuous and controlled force to move an
object from position A to position B; (2) shove an object away from its position.
If we are in a kitchen, in front of a table set with cutlery, plates and glasses, no
human will interpret the sentence with (2). This naive example shows how much
important is contextual understanding of action verbs.</p>
      <p>
        In this contribution we present a novel semantic representation of action verbs
through what we will call reference vectors (Ref-vectors ). Ref-vectors encode
the verb possibility to refer to di erent types of action. The reference values are
extracted from the IMAGACT ontology of action, a multilingual and multimodal
ontology built on human competence (sec.3.1). We compared the resulting verb
embeddings with other popular word embeddings (namely, word2vec, fastText,
GloVe) on the task of word similarity, and benchmarked the results against
simVerb-3500 human similarity judgments [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Works</title>
      <p>
        Vector Space Models, also known as Distributional Semantic Models or Word
Space Models [
        <xref ref-type="bibr" rid="ref21 ref25 ref7">25, 7, 21</xref>
        ], are widely used in NLP to represent the meaning of
a word. Each word corresponds to a vector whose dimensions are scores of
cooccurrence with other words of the lexicon in a corpus.
      </p>
      <p>
        Word embeddings (also known as neural embeddings ) can be considered the
evolution of classical vector models. They are still based on the distributional
hypothesis (the semantic of each word is strictly related to word occurrences in
a corpus), but are built by means of a neural network that learn to predict words
in contexts (e.g. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]). The literature on the topic is huge and many di erent
models with di erent features and parameters have been developed through
the years ([
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]), although only few of them exploit non-contextual features. An
important example is the Luminoso system by Speer and Lowry-Duda ([
        <xref ref-type="bibr" rid="ref24">24</xref>
        ])
where neural embeddings are enriched with vectors based on common sense
knowledge extracted from ConceptNet ([
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]).
      </p>
      <p>
        In this work we also use non-contextual information, but without combining
it to traditional word embeddings. Our space is not a co-occurrence matrix but
rather a co-referentiality matrix: it encodes the ability of two or more verbs to
refer to the same action concepts, i.e. local equivalence [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. Co-referentiality
matrices have been successfully used to represent typological closeness of
multilingual data [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] and for action types induction [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        Action concepts in IMAGACT are instantiated as videos depicting actions.
The combination of linguistic and visual features to perform a more accurate
classi cation of actions has been widely used in recent years [
        <xref ref-type="bibr" rid="ref16 ref22 ref6">22, 6, 16</xref>
        ], with the
development of techniques based on the integration of NLP and computer vision.
Within this perspective, our work could be fruitfully exploited to build complex
models for action understanding grounded on human knowledge.
      </p>
      <p>
        We evaluated the performance of the di erent representation models by
means of a verb similarity task, using the SimVerb-3500 dataset [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] as
benchmark, which has been previously used in similar works on verbs similarity [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
3
3.1
      </p>
    </sec>
    <sec id="sec-3">
      <title>Action vectors for action verbs</title>
      <sec id="sec-3-1">
        <title>IMAGACT</title>
        <p>
          IMAGACT1 [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] is a multimodal and multilingual ontology of action that
provides a video-based translation and disambiguation framework for action verbs.
        </p>
        <sec id="sec-3-1-1">
          <title>1 http://www.imagact.it</title>
          <p>
            The resource consists in a ne-grained categorization of action concepts, each
represented by one or more visual prototypes in the form of recorded videos and
3D animations. IMAGACT currently contains 1,010 scenes which encompass the
action concepts most commonly referred to in everyday language usage. Action
concepts have been gathered through an information bootstrapping from Italian
and English spontaneous spoken corpora, and the occurrences of verbs
referring to physical actions have been manually annotated [
            <xref ref-type="bibr" rid="ref13">13</xref>
            ]. Metaphorical and
phraseological usages have been excluded from the annotation process, in order
to collect exclusively occurrences of physical actions.
          </p>
          <p>
            The database continuously evolves and at present contains 12 fully-mapped
languages and 17 which are underway. The insertion of new languages is obtained
through competence-based extension (CBE) [
            <xref ref-type="bibr" rid="ref12">12</xref>
            ] by mother-tongue speakers,
using a method of ostensive de nitions inspired by Wittgenstein [
            <xref ref-type="bibr" rid="ref26">26</xref>
            ]. The
informants are asked to watch each video, to list all the verbs in their language that
correctly apply to the depicted action, and to provide a caption describing the
event for every listed verb as an example of use.
          </p>
          <p>The visual representations convey action information in a cross-linguistic
environment, o ering the possibility to model a conceptualization avoiding bias
from monolingual-centric approaches.
3.2</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Dataset</title>
        <p>The IMAGACT database contains 9189 verbs from 12 languages (Table 1), which
have been manually assigned by native speakers to 1010 scenes. It is important to
notice that the task has been performed on the whole set of 1,010 scenes for each
language and the di erences between the number of verbs depend on linguistic
factors: some examples of verb-rich languages are (a) Polish and Serbian, in
which perfective and imperfective forms are lemmatized as di erent dictionary
entries, (b) German, that have particle verb compositionality, (c) Spanish and
Portuguese, for which verbs belong to both American and European varieties.</p>
        <p>
          Judgments of applicability of a verb to a video scene rely on the semantic
competence of annotators. An evaluation of CBE assignments has been made
for Arabic and Greek [
          <xref ref-type="bibr" rid="ref14 ref15">15, 14</xref>
          ]; results are summarized in Table 2.
3.3
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>Creating Ref-vectors</title>
        <p>From the IMAGACT database, we derived our dataset as a binary matrix
C9189 1010 with one row per verb (in each language) and one column per video
prototype. Matrix values are the assignments of verbs to videos:
Ci;j =
(1 if verb i refers to action j</p>
        <p>0 else
In this way, the matrix C encodes referential properties of verbs.</p>
        <p>In order to provide an exploitable vector representation, an approximated
matrix C0 has been created from C, by using Singular Value Decomposition
(SVD) for dimensionality reduction.</p>
        <p>Language Verbs
Arabic (Syria) 571
Danish 646
English 662
German 990
Greek 638
Hindi 470
Italian 646
Japanese 736
Polish 1,193
Portuguese 805
Serbian 1,096
Spanish 736</p>
        <p>TOTAL 9189
Table 1. Number of verbs per language</p>
        <p>SVD is a widely used technique in distributional semantics to reduce the
feature space. The application of SVD to our dataset allowed us to remove the
matrix sparsity and to obtain a xed-size feature space, that is independent of
the number of action videos.</p>
        <p>Finally, the output C0 is a dense matrix 9189 300.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Evaluation and results</title>
      <p>We compared our co-referentiality vectors to state-of-the-art word embeddings
in a verb semantic similarity task. For the evaluation, we considered the full set
of 220 English verbs that are shared by the IMAGACT and the SimVerb-3500
dataset. We sampled the comparison dataset (Comp-DS) by taking those pairs
of verbs for which similarity scores were present in SimVerb-3500, obtaining 624
verb pairs. Data are reported in Table 3.</p>
      <p>Verb semantic similarity has been automatically estimated for each verb pair
in the Comp-DS by computing the cosine similarity between the related
Refvectors. Then, the correlation between automatic and human judgments about
verb pair similarity has been determined through the Spearman's rank
correlation coe cient2. The result is a positive correlation of 0.212 (Table 5). This
number highlights a low correlation, but it is not informative without a
comparison with other semantic vectors.
2 We applied the Spearman's rank, because data are non-parametric: Shapiro-Wilk
normality test reports W = 0:9578 and p = 1:984e 12.</p>
      <p>
        To this aim, we considered 6 state-of-the-art word embedding, created with
3 algorithms - word2vec3 [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], fastText4 [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and GloVe5 [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] - trained on two
big corpora - English Wikipedia6 (2017 dump) and English GigaWord7 ( fth
edition) [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. In our experiments, we used lemmatized word embeddings, instead
of token-speci c representation, in order to obtain vectors that are comparable
with SimVerb-3500's verb pairs.
      </p>
      <p>Algorithm Corpus Lemmas Window Dimensions
Word2Vec English Wikipedia 2017 296,630 5 300
Word2Vec Gigaword 5th Ed. 261,794 5 300
FastText Skipgram English Wikipedia 2017 273,930 5 300
FastText Skipgram Gigaword 5th Ed. 262,269 5 300
Global Vectors English Wikipedia 2017 273,930 5 300
Global Vectors Gigaword 5th Ed. 262,269 5 300
Table 4. Numbers of the lemmatized word embeddings used for comparison.</p>
      <p>The previous procedure has been repeated by using these embeddings
instead of Ref-vectors: cosine similarity has been measured between each pair of
the Comp-DS, by using di erent embeddings. The Spearman's rank correlation
coe cient with the Comp-DS is reported in Table 5. Data show that Ref-vectors
are closer to human judgments in estimating verb semantic similarity (0.212),
with the exception of word2vec trained on the GigaW corpus that obtained
almost the same degree of correlation (0.211). All the other verb embeddings
considered report a lower correlation with Comp-DS.</p>
      <p>The same analysis has been conducted considering semantic classes.
SimVerb3500, and thus Comp-DS, contains the annotation of the semantic relation
between the two verbs: the pairs can be synonyms, hyper-hyponyms, cohyponyms,
antonyms or not related. We used this information to measure the correlation of
vector similarity with Comp-DS in verb pairs for each semantic relation. Table 6</p>
      <sec id="sec-4-1">
        <title>3 https://code.google.com/archive/p/word2vec/</title>
        <p>4 https://fasttext.cc
5 https://nlp.stanford.edu/projects/glove/
6 https://archive.org/details/enwiki-20170920
7 https://catalog.ldc.upenn.edu/LDC2011T07
shows that Ref-vectors have a stronger correlation (0.26 and 0.35) with human
judgments with two classes of related pairs, i.e. hyper-hyponyms and synonyms.
Their results are rather poor instead with antonyms and for non semantically
related pairs. This suggests that Ref-vectors are better at capturing semantic
similarity rather than semantic relatedness. Antonyms, indeed, cannot be
considered as semantically similar, since they have opposite meanings. Moreover, if
we do not consider pairs that are not semantically related, the general results of
Ref-vectors outperform those of the other systems (Table 7).
In this paper we have introduced a novel model to represent action verbs
semantics based on their referential properties, rather than standard corpus
cooccurrences. We compared our model to state-of-the-art word embeddings
systems in a verb semantic similarity task. We have shown that our referential
vectors correlate better to human judgments from the SimVerb-3500 dataset
in presence of speci c types of semantic relations. These results suggest that
di erent types of embeddings capture di erent types of relations, like semantic
similarity or simple relatedness. They bring new interesting directions into
semantic modeling, a eld in which research has focused in the last years mainly on
corpus co-occurrences data. We believe that merging referential and corpus data
into semantic modeling can improve language processing and understanding, and
foster Natural Language Understanding in Arti cial Intelligence towards a more
contextually informed dimension.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bengio</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ducharme</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vincent</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jauvin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>A neural probabilistic language model</article-title>
          .
          <source>Journal of machine learning research 3(Feb)</source>
          ,
          <volume>1137</volume>
          {
          <fpage>1155</fpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Blundell</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sadrzadeh</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jezek</surname>
          </string-name>
          , E.:
          <article-title>Experimental Results on Exploiting Predicate-Argument Structure for Verb Similarity in Distributional Semantics</article-title>
          . In: Dobnik,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Lappin</surname>
          </string-name>
          , S. (eds.)
          <source>Conference on Logic and Machine Learning in Natural Language (LaML</source>
          <year>2017</year>
          ).
          <source>Gothenburg</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bojanowski</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grave</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joulin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Enriching Word Vectors with Subword Information. Transactions of the Association for Computational Linguistics (TACL) 5(1</article-title>
          ),
          <volume>135</volume>
          {146 (Dec
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Gerz</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vulic</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hill</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reichart</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Korhonen</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>SimVerb-3500: A LargeScale Evaluation Set of Verb Similarity</article-title>
          .
          <source>EMNLP</source>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Gregori</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Varvara</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ravelli</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          :
          <article-title>Action type induction from multilingual lexical features</article-title>
          .
          <source>Procesamiento del Lenguaje Natural</source>
          <volume>63</volume>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Hahn</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rehg</surname>
            ,
            <given-names>J.M.:</given-names>
          </string-name>
          <article-title>Action2Vec: A Crossmodal Embedding Approach to Action Learning</article-title>
          . arXiv.org p.
          <source>arXiv:1901.00484 (Jan</source>
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Lenci</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Distributional models of word meaning</article-title>
          .
          <source>Annual Review of Linguistics</source>
          <volume>4</volume>
          (
          <issue>1</issue>
          ),
          <volume>151</volume>
          {
          <fpage>171</fpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Lenci</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Distributional models of word meaning</article-title>
          .
          <source>Annual review of Linguistics</source>
          <volume>4</volume>
          ,
          <issue>151</issue>
          {
          <fpage>171</fpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>E cient estimation of word representations in vector space</article-title>
          .
          <source>arXiv preprint arXiv:1301.3781</source>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corrado</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>E cient Estimation of Word Representations in Vector Space</article-title>
          .
          <source>ICLR cs.CL</source>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Moneglia</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brown</surname>
          </string-name>
          , S.,
          <string-name>
            <surname>Frontini</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gagliardi</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khan</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Monachini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Panunzi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>The imagact visual ontology. an extendable multilingual infrastructure for the representation of lexical encoding of action</article-title>
          . In: Calzolari,
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>Choukri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Declerck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Loftsson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Maegaard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Mariani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Moreno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Odijk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Piperidis</surname>
          </string-name>
          , S. (eds.)
          <source>Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)</source>
          .
          <source>European Language Resources Association (ELRA)</source>
          , Reykjavik, Iceland (may
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Moneglia</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brown</surname>
          </string-name>
          , S.,
          <string-name>
            <surname>Kar</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kumar</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ojha</surname>
            ,
            <given-names>A.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mello</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Niharika</surname>
            , Jha,
            <given-names>G.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ray</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sharma</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Mapping Indian Languages onto the IMAGACT Visual Ontology of Action</article-title>
          . In: Jha,
          <string-name>
            <given-names>G.N.</given-names>
            ,
            <surname>Bali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            , L, S.,
            <surname>Banerjee</surname>
          </string-name>
          , E. (eds.)
          <source>Proceedings of WILDRE2 - 2nd Workshop on Indian Language Data: Resources and Evaluation at LREC'14. European Language Resources Association (ELRA)</source>
          , Reykjavik, Iceland (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Moneglia</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frontini</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gagliardi</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Russo</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Panunzi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Monachini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Imagact: deriving an action ontology from spoken corpora</article-title>
          .
          <source>Proceedings of the Eighth Joint ACL-ISO Workshop on Interoperable Semantic Annotation (isa-8</source>
          ) pp.
          <volume>42</volume>
          {
          <issue>47</issue>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Mouyiaris</surname>
            ,
            <given-names>A.:</given-names>
          </string-name>
          <article-title>I verbi d'azione del greco nell'ontologia IMAGACT</article-title>
          .
          <source>Master's thesis</source>
          , University of Florence (forthcoming)
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Mutlak</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>I verbi di azione dell'arabo standard nell'ontologia dell'azione IMAGACT</article-title>
          .
          <source>Ph.D. thesis</source>
          , University of Florence (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Naha</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Beyond verbs: Understanding actions in videos with text</article-title>
          .
          <source>In: Pattern Recognition (ICPR)</source>
          ,
          <year>2016</year>
          23rd International Conference on. pp.
          <year>1833</year>
          {
          <year>1838</year>
          . IEEE (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Panunzi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moneglia</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gregori</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Action identi cation and local equivalence of action verbs: the annotation framework of the imagact ontology</article-title>
          . In: Pustejovsky, J.,
          <source>van der Sluis, I. (eds.) Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC</source>
          <year>2018</year>
          ).
          <source>European Language Resources Association (ELRA)</source>
          , Paris, France (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Parker</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gra</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kong</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maeda</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <string-name>
            <surname>English Gigaword Fifth Edition. Linguistic Data</surname>
            <given-names>Consortium</given-names>
          </string-name>
          , LDC2011T07
          <volume>12</volume>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Pennington</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Socher</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manning</surname>
          </string-name>
          , C.D.: GloVe:
          <article-title>Global vectors for word representation</article-title>
          .
          <source>In: Conference on Empirical Methods in Natural Language Processing</source>
          . pp.
          <volume>1532</volume>
          {
          <fpage>1543</fpage>
          . Stanford University, Palo Alto, United
          <string-name>
            <surname>States</surname>
          </string-name>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Ryzhova</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kyuseva</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paperno</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Typology of adjectives benchmark for compositional distributional models</article-title>
          .
          <source>In: Proceedings of the 10th Language Resources and Evaluation Conference</source>
          . pp.
          <volume>1253</volume>
          {
          <issue>1257</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Sahlgren</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The Word-Space Model</article-title>
          .
          <source>Ph.D. thesis</source>
          , Stockholm University (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Silberer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ferrari</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lapata</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Models of semantic representation with visual attributes</article-title>
          .
          <source>In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)</source>
          . pp.
          <volume>572</volume>
          {
          <fpage>582</fpage>
          .
          <article-title>Association for Computational Linguistics (</article-title>
          <year>2013</year>
          ), http://aclweb.org/anthology/P13-1056
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Speer</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Havasi</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Conceptnet 5.5: An open multilingual graph of general knowledge</article-title>
          .
          <source>In: Thirty-First AAAI Conference on Arti cial Intelligence</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Speer</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lowry-Duda</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          : Conceptnet at semeval
          <article-title>-2017 task 2: Extending word embeddings with multilingual relational knowledge</article-title>
          .
          <source>arXiv preprint arXiv:1704.03560</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Turney</surname>
            ,
            <given-names>P.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pantel</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>From frequency to meaning: Vector space models of semantics</article-title>
          .
          <source>Journal of arti cial intelligence research 37</source>
          ,
          <volume>141</volume>
          {
          <fpage>188</fpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Wittgenstein</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Philosophische Untersuchungen</article-title>
          . Suhrkamp Verlag (
          <year>1953</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>