<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>INSEARCH A platform for Enterprise Semantic Search</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Diego De Cao</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Valerio Storch</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Danilo Croce</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roberto Basili</string-name>
          <email>basilig@info.uniroma2.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Enterprise Engineering University of Roma</institution>
          ,
          <addr-line>Tor Vergata 00133 Roma</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper discusses the system targeted in the INSEARCH EU project. It embodies most of the state-of-the-art techniques for Enterprise Semantic Search: highly accurate lexical semantics, semantic web tools, collaborative knowledge management and personalization. An advanced information retrieval system has been developed integrating robust semantic technologies and industry-standard software architectures for proactive search as well as personalized domain-speci c classi cation and ranking functionalities.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>order to determine the core functionalities in the targeted system, an analysis
involving 90 SMEs has been performed during the INSEARCH project to
understand the process of searching within the innovation process. Most of the
SMEs (92% of 90 interviewed SMEs) declared to make use of market and/or
technology information when planning a technological innovation. Such
informations are used to collect novel information for innovative ideas, performing
prior art investigation, acquiring knowledge for technical planning or just gather
inspiration and ideas. This search targets product and processes and it is mainly
performed on scienti c Web Sites and Competitors web site.</p>
      <p>
        In these scenarios, keyword-based search related to product types and
functions of the products are still used to retrieve information related to
innovation processes. Search is mostly performed through iterative searches,
evaluating search results through the very rst lines of documents/web sites. Overall,
the most requested knowledge extraction features are related to nding patterns
within documents to propose possible innovation or customer requirements. This
requirements are in line with the INSEARCH proposed approach of making
usage of a TRIZ based methodology [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], to abstract functionalities from the speci c
innovation case under study and search for information through speci c patterns
(the TRIZ based Object-Action-Tool patterns) that could propose to SMEs
possible technology innovations for the system under study.
      </p>
      <p>In this paper the overall INSEARCH framework and its corresponding
distributed system will be described, focusing on the advantage of integrating in a
systematic fashion the bene ts of analytical natural language processing tools,
the adaptivity supported by inductive methods as well as the robustness
characterizing advanced document management architectures built over
interoperability standards in the Semantic Web (such as the iQser GIN Server). In the rest
of the paper, section 2 discusses the di erent involved paradigms used to
support semantic search. The overall architecture is presented in Section 3 that also
show some typical user interactions with the system. Finally, section 4 derives
the conclusions.
2
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>Integrating Ontological and Lexical Knowledge</title>
      <sec id="sec-2-1">
        <title>Modeling Knowledge for Enterprise Semantic Search</title>
        <p>
          Ontologies correspond to semantic data models that are shared across large user
communities. The targeted enterprise or networked enterprises in INSEARCH
are a typical expression of such communities where semantics can be produced,
reused and validated in a shared (i.e. collaborative) manner. However, while
knowledge representation languages are very useful to express machine readable
models, the interactive and user-driven nature of most of the task focused by
INSEARCH emphasize the role of natural language as the true user-friendly
knowledge exchange language. Natural languages naturally support all the expressions
used by producers and consumers of information and their own semantics is rich
enough to provide strong basis for most of the meaningful inferences needed
in INSEARCH. Document classi cation aiming at recognizing the interests of a
user in accessing a text (e.g. a patent) requires a strongly linguistic basis as texts
are mostly free and unstructured, as in [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. In retrieval, against user queries,
document ranking functions are inherently based on lexical preferences models,
whose traditional TF-IDF models are just shallow surrogates. Moreover, the rich
nature of the patterns targeted by INSEARCH (e.g. Object-Action-Tool triple
foreseen by the TRIZ methodology) is strongly linguistic, as the same
information is usually expressed in text with a huge freedom, and as for the language
variability itself. Consider as an example that if a tool like a packing machine is
adopted for the manufacturing of co ee boxes, several sentences can make
reference to them, e.g. packing machine applied to co ee, co ee is packed through
dedicated machines or dedicated machines are used to pack small co ee boxes of
10 inch.
        </p>
        <p>
          Organizing knowledge through the SKOS concept scheme. Users are
able to access, create or re ne descriptions of a domain in the form of \tree of
topics", or simply topic-trees (modeled as SKOS [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] concept schemes) which
will support their contextual search throughout the system. These topics act
as collectors for documents which expose all those textual contents that can be
naturally associated to their de nition. They are under all aspects a controlled
hierarchical vocabulary of tags o ered to a community of users. Behind every
tag a large term vocabulary is used in order to exploit the corresponding topic
semantics during search activities. Topic-document associations may be
discovered through information push by the mass: users inside a community contribute
their bookmarks to the system. On the other hand, it can be achieved by the
system itself, by machine learning from the above information, automatically
creating topic associations for massive amount of documents which are gathered
through the multichannel multimodal document discovery and acquisition
component, as discussed in [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. Examples of SKOS topic for the speci c domain
of the Innovation Engineering domain are reported in Fig. 1. Main SKOS
concepts are Research and Intellectual Properties (organizing scienti c
papers or patents) and Tecnology. The latter can be speci ed with the concept
biotecnology or material and so on. Apart from their role of document
containers, topics may be described by enriching them with annotations, comments
and multiple lexicalizations for the various languages supported by INSEARCH,
so that their usage is informally clari ed to human users, possibly enforcing their
consistent adoption across the community.
        </p>
        <p>
          User Management. In INSEARCH, standard models and technologies of the
RDF [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] family have been adopted to allow each user to view his own SKOS
ontology. It requires to model the information associated to user management,
domain modeling and user data. The three di erent aspects have been physically
modularized by partitioning the triples content, and each of these partitions is
in turn divided into smaller segments to further account for speci c data
organization requirements such as provenance and access privileges. The partitions are
obtained through the use of RDF named graphs, so that, whenever appropriate,
the knowledge server may bene t of a single shared data space, or is able
conversely to manage each partition (or set of partitions) as a separate dataset. The
two main categories of users access these partitions in INSEARCH: companies
and employees. Companies act like user-groups, collecting standard users
(employees) under a common hat and possibly providing shared information spaces
(e.g. domain models or reference information) which will be inherited by all of
them. Each employee shares with his colleagues common data provided by the
company, while at the same time he can be o ered a personalized opportunity
or a restricted access.
        </p>
        <p>
          Semantic Bookmarking. In such a scenario, it is crucial to populate the SKOS
ontology, thus providing examples for the document categorization process,
allowing to link novel documents to existing (or user-de ned) SKOS concepts.
Semantic Turkey (ST) [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] was born as a tool for semantic bookmarking and
annotation, thought for supporting people doing extensive searches on the web,
and needing to keep track of: results found, queries performed and so on.
Today ST is a fully edged Semantic Platform for Knowledge Management and
Acquisition supporting all of W3C standards for Knowledge Representation (i.e.
RDF/RDFS/OWL SKOS and SKOS-XL extension). It is possible to extend it,
in order to produce completely new applications based on the underlying
knowledge services. The underlying framework allows access to RDF (and all modeling
vocabularies already mentioned) through Java API, client/server AJAX
communication (proprietary format, no Web service) and client-side Javascript API
(hiding TCP/HTTP details). The ST o ers among the others functionalities for
editing a reference (domain) ontology (i.e. a SKOS-compliant topic taxonomy),
bookmarking pages according to the taxonomy as well as organizing query
results according to the hierarchical structure the SKOS taxonomy. Users may surf
the web with a standards compliant web browser, associating information found
on web documents to concepts from the current knowledge organization
systems (KOS). The core framework of ST has been totally reused in INSEARCH
without speci c customization. However, novel dedicated services have been
developed and plugged, anking the main ones, to meet the speci c INSEARCH
requirements (see also the discussion in next section on architecture). In
particular, the annotation mechanism is merged into the multiuser environment of
the INSEARCH platform, so that the system may exploit contributions from
di erent users, whenever the power of mass-contribution is exploitable.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Robust Modeling of Lexical Information</title>
        <p>
          Computational models of natural language semantics have been traditionally
based on symbolic logic representations naturally accounting for the meaning of
sentences, through the notion of compositionality (as the Montague's approach
in [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] or [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]). While formally well de ned, logic-based approaches have
limitations in the treatment of ambiguity, vagueness and other cognitive aspects
such as uncertainty, intrinsically connected to natural language communication.
These problems inspired recently research on distributional models of
lexical semantics (e.g. Firth [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] or Schutze [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]). In line with Wittgenstein's later
philosophy, these latter characterize lexical meanings in terms of their context
of use [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. Distributional models, as recently surveyed in [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ], rely on the
notion of Word Space, inspired by Information Retrieval, and manage semantic
uncertainty through mathematical notion grounded in probability theory and
linear algebra. Points in normed vector space represent semantic concepts, such
as words or topics, and can be learned from corpora, in such a way that similar,
or related, concepts are near to one another in the space. Methods for
constructing representations for phrases or sentences through vector composition have
recently received a wide attention in literature (e.g. [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]). While, vector-based
models typically represent isolated words and ignore grammatical structure [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ],
the so-called compositional distributional semantics (DCS) has been
recently introduced and still object of rich on-going research (e.g. [
          <xref ref-type="bibr" rid="ref11 ref5">11, 5</xref>
          ], [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]).
Notice that several applications, such as the one targeted by INSEARCH, are
tight to structured concepts, that are more complex than simple words. An
example are the TRIZ inspired Object-Action-Tool (OAT) triples that describe
Object (s) that receive(s) an Action from Tool (s), such as those written in
sentences like \: : : [the co ee]Object in small quantities [is prepared ]Action by the
[packing machine itself ]T ool : : : " or \: : : for [preparing ]Action [the co ee]Object
by extraction with [hot water ]T ool, : : : ".
        </p>
        <p>Here physical entities (such as co ee or hot water ) play the role of Objects
or T ools according to the textual contexts they are mentioned in. Compositional
models based on distributional analysis provide lexical semantic information that
is consistent both with the meaning assignment typical of human subjects to
words and to their sentential or phrasal contexts. It should support synonymy
and similarity judgments on phrases, rather than only on single words. The
objective should be assigning high values of similarity to expressions, such as
\: : : buy a car : : : " vs. \: : : purchase an automobile : : : ", while lower values to
overlapping expressions such as \: : : buy a car : : : " vs. \: : : buying time : : : ".
Distributional compositional semantics methods provide models to de ne: (1)
ways to represent lexical vectors v and o, for words v; o occurring in a phrase
(r; v; o) (where r is a syntactic relation, such as verb-direct object), and (2)
metrics for comparing di erent phrases according to the basic representations,
i.e. the vectors v, o.</p>
        <p>
          While a large literature already exist (e.g. [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]) the user can nd more details
about the solution adopted in INSEARCH in [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. Compositional distributional
semantic models are used to guide the user modeling of ontological concepts of
interest (such as the SKOS topics), feed the document categorization process (that
is sensitive to OAT patterns through vector based representation of their
composition), concept spotting in text as well as query completion in INSEARCH.
The adopted methods are discussed in [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] and [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>The INSEARCH architecture</title>
      <p>The INSEARCH overall architecture is designed as a set of interacting services
whose overall logic is integrated within the iQser GIN Server for information
ecosystems. The comprehensive logical view of the system is depicted in Fig. 2.</p>
      <p>The core GIN services are in the main central box. External Analyzers are
shown on the left, as they are responsible for text and language processing or, as
in the case of the Content vectorization module, for the semantic enrichment of
input documents. GIN speci c APIs are responsible for interfacing heterogenous
content providers and managing other speci c data gathering processes (e.g.
speci c crawlers). Client Connector APIs are made available by GIN for a variety
of user level functionalities, such as User Management, Semantic Bookmarking
or Contextual searches that are managed via appropriate GIN interface(s). At
the client level in fact, the basic search features from web sources and patents,
are extended with:
{ Navigation in linked search results and Recommendations for uploaded or
pre-de ned contents through bookmarks or SKOS topics of interest.
Recommendations are strongly driven by the semantically linked content,
established by the core analysis features of the GIN server.
{ Semantic bookmarking is supported allowing sophisticated content
management, including the upload of documents, the triggering of web crawling
stages, the de nition and lexicalization of interests, topics and concepts
described in SKOS. Interesting information items are used for upgrading
recommendations, topics and concepts and prepare contextual searches.
{ Personalization allows user management functions at the granularity of
companies as well as people.</p>
      <p>On the backend side, we emphasize that the current server supports the
integration with Alfresco3 as the document and content management system,
whereas the de ned interests are also managed as Alfresco's content. While the
integration of Web sources is already supported by a dedicated crawler, also
patents are targeted with an interface to the patent content provider WIPO4.</p>
      <p>Contextual Semantic search is also supported through vector space
methods. Vectorization is applied to incoming documents with an expansion of
traditional bag-of-word models based on topic models and Latent Semantic Analysis
(as discussed in Section 2.2). Moreover, the available vector semantics supports
distributional compositional functions that model the representation and
inferences regarding TRIZ-like OAT patterns, so that natural language processing
and querying based on domain speci c patterns are consistently realized. Basic
feature extraction services and morphosyntactic analyzers (such as
lemmatization and part of speech tagging) are already in place as external GIN analyzers.</p>
      <p>The main functionalities currently integrated in INSEARCH are thus:
{ Website monitoring: Observe changes in given pages/domains, which are
added by the user and implemented as bookmarklets
{ Assisted Search: such as in Query completion, e.g. support the user in the
designing proper queries about company's products or markets .
{ Document analysis: Intelligent Document Analysis is applied to asses their
relevance to high-level topics prede ned by the user in the SKOS taxonomy.
Relevance to individual topics is provided through automatic classi cation
driven by weighted membership scores of results with respect to individual
topics.
{ Patent and scienti c paper search: Search for patents and/or scienti c
papers in existing databases (e.g. European patent o ce) is supported.
{ OAT-Pattern analysis: TRIZ-inspired Object-Action-Tool (OAT) triples
are searched in documents: these patterns play the role of suggestions for
tools, which provide a certain function speci ed by the object and the action.
3 http://www.alfresco.com/
4 http://www.wipo.int/portal/index.html.en</p>
      <p>{ Adaptivity: The system tracks user behaviors and adjusts incrementally
its own relevance judgments for the topics and categories of interest.
3.1</p>
      <sec id="sec-3-1">
        <title>Typical user interactions</title>
        <p>The system has been recently deployed in its full functional version and
provides a unique opportunity to evaluate its application to realistic data sets and
industrial processes. The INSEARCH users will be able to quantitatively and
qualitatively evaluate the impact of its semantic capabilities, its collaborative
features as well as the overall usability of the personalized search environment
in a systematic manner.</p>
        <p>
          The front end of the INSEARCH system is shown in an interactive contextual
search use-case in Fig. 3 and 4. The main tabs made available here are related
to the Domains, Search, Alerting and Tools functionalities. In Domains
the user can interact with and re ne his own SKOS topics as well as interests
and preferences, as shown in Fig. 1. Alerting supports the visualization of
the results of Web Monitoring activities: here returned URLs, documents or
other texts are conceptually organized around the SKOS concepts thanks to
the automatic classi cation targeted to the ontology categories, made available
through the Rocchio Classi ers, as discussed in [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. In Tools most of the
installation and con guration activities can be carried out.
        </p>
        <p>In the Search tab, contextual search and query completion is o ered to the
user. In Fig. 3 the suggestions related to the ambiguous keyword \plant " early
provided by the user are shown, where nouns like \generator " and \battery " (as
well verbs like \generator " and \battery ") are the proper continuation of the
query, given the underlying domain, i.e electrical power. The completion is
di erent when a topic such as biotecnology is selected, as shown in Fig. 4.</p>
        <p>The di erent completion is made available by the lexicalization of each concept:
these lexical preferences are projected in an underlying Word Space (discussed in
Section 2.2) that provides the geometrical representation of all words appearing
in the indexed documents. Given the vectors representing all query terms and
the lexical preferences of the selected SKOS concepts, the most similar (i.e.
nearest) words are selected and proposed for the completion. This adaptivity is
achieved also to provide novel information to the nal users. In the front-end
interface, a list of news is proposed. These are continually downloaded from the
web and retrieved using the lexical preferences speci ed by the user during his
own registration as well as the selected SKOS concepts. Notice that news are
sensitive to the di erent SKOS concepts during the session, as in Fig. 3 and 4.</p>
        <p>Once the query is submitted, documents are retrieved, automatically
classied and clustered with respect to the existing SKOS concepts, as in Fig. 5. This
clustering phase allows users to browse documents exploring their relatedness
to speci c SKOS concepts, such as electrical power or research. The user
interface also allows to implement a relevance feedback strategy to improve the
quality and adaptivity of text classi ers by simply clicking over the \thumbs up"
or \thumbs down" icons. They allow to accept or reject each concept/document
association, that re ects the underlying text classi cation. When the user
accepts a classi cation, the Rocchio classi er associated with the corresponding
concept is incrementally fed with the document, that becomes a positive
example. On the contrary, the selected document is provided as a negative example,
by clicking on the \thumbs down" icon.</p>
        <p>Finally, the Object-Action-Tool (OAT) pattern-based search is shown in Fig.
6. The user is allowed to retrieve documents specifying speci c actions (pack ),
objects (co ee boxes ) or tools (dedicated machine). During the data-gathering
phase, the OAT pattern extraction module (see Fig. 2) extracts all patterns from
the documents, by exploiting a set of pre-de ned morphosyntactic patterns, such
as Subject-Verb-Object. The extracted OAT patterns are used during the
indexing phase, thus enabling semi-structured queries through (possible
incomplete) OAT patterns. Fig. 6 summarizes a session where the user is interested in
documents related to the action control and object nuclear ssion. Initially the
system suggests a set of possible tools, such as method, system or product. The
user can select one or more tools to browse the related documents.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>In the innovation process, the search of external information represents a crucial
activity for the most of Small and Medium Sized Enterprises. In this paper the
system targeted in the INSEARCH EU project is discussed. It embodies most of
the state-of-the-art techniques for Enterprise Semantic Search: highly accurate
lexical semantics, semantic web tools, collaborative knowledge management and
personalization. The outcome is an advanced integration of analytical natural
language analysis tools, robust adaptive methods and semantic document
management systems relying over the Semantic Web standards. The knowledge bases
personalization as well as the semantic nature of the recommending
functionalities (e.g. query completion, contextual search and Object-Action-Tool
triplebased search) will be evaluated in systematic benchmarking activities, carried
at the enterprise premises, within realistic and representative scenarios.
Acknowledgment The authors would like to thank all the partners of the
INSEARCH consortium as they made this research possible. In particular, we thank
Armando Stellato and Daniele Previtali from UNITOR, Jorg Wurzer from iQSer,
Paolo Salvatore from CiaoTech, Sebastian Dunninger, Stefan Huber from
Kusftein, Antje Schlaf from INFAI, Mirko Clavaresi from Innovation Engineering,
Cesare Rapparini from ICA and Hank Koops from Compano.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Altshuller</surname>
          </string-name>
          , G.:
          <article-title>40 principles, TRIZ keys to technical innovation</article-title>
          . No.
          <article-title>1 in Triz tools</article-title>
          ,
          <source>Technical Innovation Center</source>
          , Worcester, Mass., 1. ed edn. (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Annesi</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Storch</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Basili</surname>
          </string-name>
          , R.:
          <article-title>Space projections as distributional models for semantic composition</article-title>
          . In: Gelbukh,
          <string-name>
            <surname>A.F</surname>
          </string-name>
          . (ed.)
          <source>CICLing (1)</source>
          . LNCS, vol.
          <volume>7181</volume>
          , pp.
          <volume>323</volume>
          {
          <fpage>335</fpage>
          . Springer (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>B.</given-names>
            <surname>Coecke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.S.</given-names>
            ,
            <surname>Clark</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          :
          <article-title>Mathematical foundations for a compositional distributed model of meaning</article-title>
          .
          <source>Lambek Festschirft, Linguistic Analysis</source>
          <volume>36</volume>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Baeza-Yates</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ciaramita</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mika</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zaragoza</surname>
          </string-name>
          , H.:
          <article-title>Towards semantic search</article-title>
          .
          <source>Natural Language and Information</source>
          Systems pp.
          <volume>4</volume>
          {
          <issue>11</issue>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Baroni</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zamparelli</surname>
          </string-name>
          , R.:
          <article-title>Nouns are vectors, adjectives are matrices: representing adjective-noun constructions in semantic space</article-title>
          .
          <source>In: Proceedings of EMNLP 2010</source>
          . pp.
          <volume>1183</volume>
          {
          <fpage>1193</fpage>
          .
          <string-name>
            <surname>Stroudsburg</surname>
          </string-name>
          , PA, USA (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Basili</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giannone</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Cao</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Learning domain-speci c framenets from texts</article-title>
          .
          <source>In: Proceedings of the ECAI Workshop on Ontology Learning and Population. ECAI, ECAI</source>
          , Patras,
          <source>Greece (July</source>
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Cocchi</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bohm</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Deliverable 2.2: Analysis of functional and market information</article-title>
          .
          <source>TECH-IT-EASY</source>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Firth</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>A synopsis of linguistic theory 1930-1955</article-title>
          . In:
          <article-title>Studies in Linguistic Analysis</article-title>
          .
          <source>Philological Society</source>
          , Oxford (
          <year>1957</year>
          ), reprinted in Palmer, F. (ed.
          <year>1968</year>
          ) Selected Papers of
          <string-name>
            <given-names>J. R.</given-names>
            <surname>Firth</surname>
          </string-name>
          , Longman, Harlow.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Grefenstette</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sadrzadeh</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Experimental support for a categorical compositional distributional model of meaning</article-title>
          .
          <source>CoRR abs/1106</source>
          .4058 (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Klyne</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carroll</surname>
            ,
            <given-names>J.J.: Resource</given-names>
          </string-name>
          <string-name>
            <surname>Description</surname>
          </string-name>
          <article-title>Framework (RDF): Concepts and Abstract Syntax (</article-title>
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11. Mitchell, J.,
          <string-name>
            <surname>Lapata</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Vector-based models of semantic composition</article-title>
          .
          <source>In: In Proceedings of ACL-08: HLT</source>
          . pp.
          <volume>236</volume>
          {
          <issue>244</issue>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Montague</surname>
          </string-name>
          , R.: Formal Philosophy: Selected Papers of Richard Montague. Yale University Press (
          <year>1974</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Moschitti</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Basili</surname>
          </string-name>
          , R.:
          <article-title>Complex linguistic features for text classi cation: a comprehensive study</article-title>
          .
          <source>In: Proc. of the ECIR</source>
          . pp.
          <volume>181</volume>
          {
          <fpage>196</fpage>
          . Springer Verlag (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Pazienza</surname>
            ,
            <given-names>M.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scarpato</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stellato</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Turbati</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Semantic turkey: A browser-integrated environment for knowledge acquisition and management</article-title>
          .
          <source>Semantic Web journal 3(2)</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15. Schutze, H.:
          <article-title>Automatic Word Sense Discrimination</article-title>
          .
          <source>Computational Linguistics</source>
          <volume>24</volume>
          ,
          <issue>97</issue>
          {
          <fpage>124</fpage>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Turney</surname>
            ,
            <given-names>P.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pantel</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>From frequency to meaning: Vector space models of semantics</article-title>
          .
          <source>Journal of arti cial intelligence research 37</source>
          ,
          <volume>141</volume>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Wittgenstein</surname>
            ,
            <given-names>L.: Philosophical</given-names>
          </string-name>
          <string-name>
            <surname>Investigations. Blackwells</surname>
          </string-name>
          , Oxford (
          <year>1953</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>World Wide Web Consortium: SKOS Simple Knowledge Organization System Reference</surname>
          </string-name>
          (
          <year>Aug 2009</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>