<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>June</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>PaloAnalytics pro ject concept, scope and outcomes: an opportunity for culture</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Knowledge</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Uncertainty Research Laboratory</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of the Peloponnese Tripolis</institution>
          ,
          <addr-line>Greece 221 31</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of the Peloponnese</institution>
          ,
          <addr-line>Tripolis, Greece 221 31</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>1821</year>
      </pub-date>
      <volume>9</volume>
      <issue>2019</issue>
      <abstract>
        <p>This paper describes the national funded project entitled PaloAnalytics, which develops an innovative platform that allows companies and organizations, that operate in several countries, to monitor and analyze, in depth, the markets' interest to their products and successfully plan their marketing and communication strategy, with data and insights collected from all the local media, and focuses on its application to cultural spaces and museums. In this notion, we examine the e ect that this project can have in cultural spaces or companies related to arts and culture. PaloAnalytics platform allows organizations to investigate the impact of their products on consumers across di erent countries and this is achieved with the analysis of content from sites, blogs, social networks and open data. This implies that cultural organization can bene t by adopting the implemented services, so that the can recognize and analyze their audience, their online marketing campaigns as well as examine the impact of their messages and the spread of their messages on the Internet. In this paper, we brie y describe the project and discuss on the impact on cultural related organizations.</p>
      </abstract>
      <kwd-group>
        <kwd>big data</kwd>
        <kwd>data monitoring</kwd>
        <kwd>trending topics</kwd>
        <kwd>in uencers</kwd>
        <kwd>info graphics</kwd>
        <kwd>data visualization</kwd>
        <kwd>deep learning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The data that is generated daily in the world of the internet is vast. The amount
of information is such that it is impossible for companies and organizations to
fetch, analyze and learn from all the data produced. In this scope, PaloAnalytics
is a project that aims to perform the procedures of collecting, analyzing and
extracting useful information from di erent sources of the internet, web pages,
news portals, open sources, and social media. The procedure of collecting and
analyzing information from diverse sources is not something new, and has
attracted research during the last 20 years [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. It resides to the area of Data
Mining [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] and it focuses on Big Data analysis, which is based on multiple
custom Data Warehouses [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. In this notion, we present a project that intends
to employ resources from all these sectors in order to produce its nal results; in
depth analysis of social media and web data in order to support organizations
and companies.
      </p>
      <p>Market research has proven that companies and organizations are in strong
need for an holistic market monitoring and analysis service in several countries
and not solely the country of their origin; or at least they are convinced that they
can perform much better if they have such a tool. Besides, the competition of such
companies and organizations is usually international. Furthermore, it clear that
when analyzing data in an international environment each local information can
easily a ect the whole organization, but it is usually di cult to become a part of
the organization's international policy. In general, it seems possible that data can
be collected and analyzed in some extent locally but is usually not transferred
as knowledge to the international level. In fact, for such organizations it would
be extremely useful to utilize a unique language for all the data analyzed and it
seems that the English language is acceptable and consistent. A number of tools
have been developed including Mention 3 and Brandwatch 4 in order to collect
and analyze data internationally but they have some major disadvantages. They
focus mainly on social media and target experienced users, while in parallel
they do not provide translations of reports from local languages in a universal
language. Furthermore, they do not o er a homogeneous overall picture for all
the countries that are of interest for a business.</p>
      <p>In this ground, we introduce PaloAnalytics project, which intends to focus
on the basic challenges that organizations face and includes the ability to have
a universal monitoring tool, with links and interconnections between data
collected and analyzed from a number of di erent sources and di erent languages.
In this way the project will be the ideal solution for international companies (or
companies willing to become international) and companies that their
international competition a ect their local business. An ideal solution, through which
the organization will be able to get information out of large sets of data.</p>
      <p>The proposed design and implementation, introduces a series of software
modules that will
{ analyze multilingual content posted on news sites, social networks and open
data
{ extract knowledge and information about products and companies, including
product characteristics
{ analyze sources, their in uence and trends
3 https://mention.com - Mention: Scour the web, social media, and more for powerful
market insights
4 https://www.brandwatch.com/ - Brandwatch: Know what your customers think
{ help in assessing the image of the business and its products as well as its
competitors
{ visualize the knowledge in order to easily understand the analyzed
information</p>
      <p>These procedures describe how this project can be used by any type of
company. Research has shown that cultural spaces and organizations have started
to take seriously the world of the Internet and the Social Media. Consequently,
they nd it attractive to spread their messages through these mediums, as it is
expected to reach a larger and global audience, they can make serious debates
and conversations, and, generally, have an alternative active role in order to
challenge the mass culture. Having the aforementioned as a base, it is evident
that the project can help all these organizations have a holistic presence in social
media and the internet; a presence that can be expected to be international.</p>
      <p>The rest of the paper is structured as follows: Section 2 presents the
methodology of the project, while section 3 discusses the system architecture. In section
?? a detailed description of each component is presented, providing more
emphasis on the Trending Topics software module and its results. Section 4 de nes
the expected outcomes of the proposed system and the nal section presents a
discussion on the project.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Methodology of the project</title>
      <p>Due to the large number of di erent modules, the high complexity of their
implementation and the importance in precision of their algorithmic procedures,
an advanced methodology is employed. As such, the Rational Uni ed Process is
used. It is a software engineering procedure that ensures producing high
quality software and achieving end user needs within a speci c timetable and cost.
Two cycles of project evolution are followed, one that leads to the basic
implementation and is longer, while a shorter one will be done in order to perform
re nements. Both of the cycles will go through the same steps of development.
During the rst cycle the implementation will be ensured, while the second cycle
will focus on the quality of the outcomes. The cycle phases include:
{ Inception Phase
{ Elaboration Phase
{ Construction Phase
{ Transition Phase</p>
      <p>During the inception phase a general description of the key requirements
of the project is done; key points and the basic constraints are de ned and
the system use cases are de ned in brief. An initial business case including the
business framework, the success criteria and nancial forecasting is the ones
that lead to the project plan and to a draft business model. While analyzing the
information, during the processing phase, the use case model was completed, and
the nal requirements were recorded. The architecture reached its nal form and
the project's development plan was nalized. Currently the project is under the
rst construction phase, where modules are implemented and starting to be
integrate into the PaloAnalytics platform. Upon completion of the rst phase of
implementations an overall system functionality, performance and usability test
will be done.</p>
      <p>The development of the platform follows a bottom-up approach, based on the
proposed architecture as presented in gure 2), starting from data collection that
will directly lead to data aggregation services which will be used individually. On
the produced data, multilingual content analysis' services are employed, while in
parallel, at this stage, business intelligence extracting solutions are applied. The
availability of the proposed services will be both on Web and Mobile application
enabling increased penetration into the business community. Each service is built
supporting endpoint integration in order to be available for use as an individual
component even for third party systems, external to PaloAnalytics platform.</p>
      <p>This will develop a complete development stack, that is based on multilingual
content from news sites, open data sources and social media. The services of this
stack are expected to attract third-party businesses companies, public bodies
and researchers who will develop new management modes of business data from
the sources incorporated by PaloAnalytics platform and will set up new business
models on them, multiplying the bene ts for the companies and organizations</p>
      <p>Fig. 2: Proposed architecture
while maximizing the in uence of the proposed solutions for the scienti c and
business community.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Architecture</title>
      <p>According to the architecture presented in gure 2 the proposed system is divided
into several components and modules enabling in this way individual design and
integration. The system, though, can be separated into four major components:
{ data entrance point / data storage
{ deep data analysis
{ semantics and metadata analysis
{ point of presentation</p>
      <p>Each of the major components consists of a number of modules in order to
successfully achieve its scope. Furthermore, each component will o er services
for direct data extraction and usage by third party systems.
3.1</p>
      <sec id="sec-3-1">
        <title>Entry point</title>
        <p>The entry point of the system is the component that is responsible for collecting
and storing data from the several di erent sources (social media, news and open
data). The data storage is built enabling several interfaces to be connected in
order to fetch and store data. In general, it follows a hybrid scheme including
both an SQL and a noSQL database.</p>
        <p>The system acts as a data warehouse, including modules for data extraction,
data transformation as well as data loading. The extraction of data is done
from several di erent sources including news websites, blogging platforms, social
media - focusing on text based ones - and open data sources. The data collected is
transformed in order to formulate similar objects with speci c uni ed structure.
The uni ed structure of each unique object includes a unique identi er, title,
body, source, timestamp and author.</p>
        <p>The aforementioned is the main object of the system and described the main
form of data collected. A number of metadata and objects analyzing in depth
each object is used including detailed information about the source, the author,
accompanying multimedia and more. Figure 3 presents a generic schema of the
database infrastructure that is used in order to support the essential for the
system storage.</p>
        <p>
          The data collected are stored on both an SQL-like storage environment as
well as a noSQL environment. The hybrid scheme will help for storing elements
for fast access in the noSQL nodes and collection of all the collected data in an
SQL based structure for better interconnection between them and permanent
storage of data with historic metadata [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. Furthermore, a time-series database
is used in order to keep track of the records that are stored in the database,
including information about the source or the author. The latter is extremely
useful when de ning the rate of update for each source or the frequency of
posting for authors and their relation to period and time.
The system core contains all the key elements and services of the system. It
consists the basis upon which the complete system is designed and implemented.
Each of the modules formulating the system core can act as an autonomous
system providing endpoints for independent usage. These endpoints can also be
used by the system internally in order to perform the physical interconnection
between the di erent services.
        </p>
        <p>
          The system core includes the following elements:
{ Named Entity Recognition (NER), which is a module for
recognizing entities in bunches of text. A machine learning mechanism based on
OpenNLP 5, a set of language features and a set of annotated documents
for nding candidate NERs, enhanced by the use of dictionaries is used [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
{ Breaking news detection, which is a component for recognizing important
news topics. This is usually based on the number of similar articles produced
in a period of time, but it should be considered that not all news topics
are increased in numbers in the same manner. As such, machine learning
algorithms are employed that are able to recognize breaking news based on
the growth rate in time [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
{ Clustering, which is responsible for nding interconnections between the
di erent entities. It should be noted that the objects collected derive from
several di erent sources and the scope of this module is to create physical
interconnections between objects having identical meaning. According to the
de nition of the object (without any attached metadata) the main scope of
the clustering procedure is to interconnect conceptually two objects.
Furthermore, as the system intends to operate regardless of the language of
origin, the interconnection of the object should be language agnostic.
5 OpenNLP: a machine learning based toolkit for the processing of natural language
text. https://opennlp.apache.org/
{ Classi cation, which is a module for automatic categorization of objects to
prede ned categories. As the categories of the system are prede ned, due to
the fact that Palo is used as a news aggregation service, the categorization is
done in several primal categories. The current mechanism will be enhanced
in order to enable multilevel categorization including two di erent levels [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ].
{ Sentiment Analysis, which is responsible for extracting the polarity of the
objects. A machine learning algorithm will be employed in order to replace
a currently used algorithm based on the bag of words method [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
{ Summarization, which is responsible for extracting summaries out of the
clusters of objects. As the clustering procedure evolves in time, the
summarization procedure must adapt to changes that are done to the size of the
cluster in time.
{ Trending topics detection and enrichment, which is responsible for
analyzing social media and open sources in order to detect topics that are
trending and enrich them accordingly in order to detect their trends to other
countries and languages.
3.3
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>High level analysis</title>
        <p>
          The high level data analysis of the system includes a number of components that
combine the outcomes of the deep data analysis and they include:
{ Discovering social media in uencers [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ], [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]
{ Applying cross-border analytics [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]
{ Performing network analysis
{ Exploring semantic means of the web [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]
{ Simulating web and social media campaigns and measuring their impact
3.4
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>Frontend</title>
        <p>The system frontend consists of both web and mobile applications that utilize
the data collected and analyzed in order to present reports, visualize data and
make it easy to explore the combined information.</p>
        <p>The web and mobile applications will have a public part that will make
parts of the collected available to public. This is a news aggregation service
including rich media format of data as well as interconnection of information
and multilingual content. The same is for the mobile application which ca be
formulated in order to enhance portability and usability of the presented content.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Expected outcomes and opportunities for cultural organizations</title>
      <p>The design and development of the proposed system consists of a new and
innovative product for the international market, which is expected to be the attraction
for many companies and organizations primarily organizations that operate
internationally. The absence of specialized competitive products in this eld o ers
a signi cant advantage and allows it to be a leading player in the Greek market,
which is the country of origin, and to penetrate the emerging and demanding
international market of high-volume data analysis technology by providing
innovative services and products.</p>
      <p>All the aforementioned, is expected to provide a new dynamic in the eld
of application development in the referred emerging sectors. This is achieved
by using state-of-the-art technologies and methodologies together with the
extensive knowledge in the eld by the partnership. At the same time, within
the framework of the proposed project, the know-how acquired in the areas of
large volume analysis is fully exploited, thereby enhancing the company's policy
towards the increased use of cutting-edge technologies, as well as the
partnerships' research background. Finally, we should consider the valuable know-how
acquired by all the participating bodies during the implementation of the
proposed project through the two research organizations, which will be done by
the research and development in order to achieve the desired objectives. The
know-how to be transferred will improve all the organizations' and especially
the company's scienti c potential by increasing its knowledge and expertise and
consequently the company's capabilities for future support as well as developing
new applications and undertaking new research projects in the context of its
activities.</p>
      <p>Focusing on cultural related organizations it is possible to nd opportunities
that these venues never had. It is important to note that cultural spaces have
recognized the important role of technology and online synchronous and
asynchronous communication, and are willing to utilize modern and edge-cutting
technological features in order both to enhance the experience of the visitors
as well as attract a broader audience. In this scope, it is extremely di cult for
people related to arts and culture perform an advanced step towards analyzing
the impact of their presence and marketing procedures on the internet.
PaloAnalytics project can play the role of the companion when it comes to their
online presence. The project can help recognize the supporters and fans, can
measure the impact of the online marketing strategy, can keep a record of other
spaces' impact or connection and can help towards the improvement of the online
presence.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Discussion</title>
      <p>We presented the project PaloAnalytics, which is reaching its rst year of
undergoing. During this period the rst crucial steps have been made, including the
de nition of the system use-cases, the formulation of the system architecture,
the set-up of the system infranstructure, as well as the design and initiation
of the rst system components. Furthermore, the business-case is completed
and the implementation of the rst sub-systems is almost nalized. The
infrastructure of the system is set-up and the means of integration are de ned. An
interesting feature of the project is the participation of two research laboratories
from two di erent institutions in Greece, which will join their research teams
to produce the results of the project. In order to achieve the objectives of the
project, cutting-edge technology and algorithms are used, which means that the
participants will join forces towards the research.</p>
      <p>Despite the fact that the actual outcome of the project is minimal compared
to the algorithmic procedures that lead to it, a number of related research elds
will be explored during the design and implementation of the components. First
of all, data mining algorithms will be researched in order to produce the optimal
solution for fetching data. Furthermore, the infrastructure that stores the data
is the basis of the system and as such its design and integration is part of a
research and development procedure. On the other hand, a number of algorithms
and techniques including deep machine learning will be investigation in order
to achieve procedures listing: clustering of data (including text objects deriving
from social media), summarization of clusters, named entity recognition,
sentiment analysis, aspect mining and breaking news de nition. Furthermore, apart
from the core algorithms, a number of \high level" procedures are required in
order to achieve the complete set of project scopes. These include in uencers
mining, semantic web, network analysis, campaign impact, swot analysis and
more, which are based on the metadata that accompany the information
collected and processed.</p>
      <p>It should be noted, that all the aforementioned are not just part of a
research procedure; meaning that the research should not stand on the feasibility
and soundness of the results. The system is a production based environment
targeting large business and organizations, which can even test and formulate the
procedures and the use-case scenarios. It lies on the ground of applied research
and it is expected that all the implemented solutions will be able to endure large
volumes of data, users and demanding procedures.</p>
      <p>As far as the role that the system can play for cultural organizations it is
clearly de ned as an important one. Speci cally we de ned the system as a
valuable companion that can totally alter the procedures of online marketing
strategies and social media interactions. The system can be used to examine the
behavior of the users towards exhibitions and presentations as well as towards
individual cultural objects. The project can be the beginning of a new era in
cultural informatics, acting as a novel pioneer procedure, that can involve edge
cutting technologies directly on the relation of the organizations and visitors
introducing a new way of mass culture.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgment</title>
      <p>This research has been co nanced by the European Union and Greek national
funds through the Operational Program Competitiveness, Entrepreneurship and
Innovation, under the call RESEARCH CREATE INNOVATE (project code:
T1EDK-03470)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Castellano</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kessous</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Caridakis</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>Emotion recognition through multiple modalities: face, body gesture, speech. In A ect and emotion in humancomputer interaction</article-title>
          (pp.
          <fpage>92</fpage>
          -
          <lpage>103</lpage>
          ). Springer, Berlin, Heidelberg.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Caridakis</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karpouzis</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wallace</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kessous</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Amir</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Multimodal users a ective state analysis in naturalistic interaction</article-title>
          .
          <source>Journal on Multimodal User Interfaces</source>
          ,
          <volume>3</volume>
          (
          <issue>1-2</issue>
          ),
          <fpage>49</fpage>
          -
          <lpage>66</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Vlachostergiou</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Caridakis</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kollias</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>Investigating context awareness of a ective computing systems: a critical approach</article-title>
          .
          <source>Procedia Computer Science</source>
          ,
          <volume>39</volume>
          ,
          <fpage>91</fpage>
          -
          <lpage>98</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Varlamis</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tsirakis</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Poulopoulos</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tsantilas</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2014</year>
          ,
          <article-title>October)</article-title>
          .
          <article-title>An automatic wrapper generation process for large scale crawling of news websites</article-title>
          .
          <source>In Proceedings of the 18th Panhellenic Conference on Informatics</source>
          (pp.
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          ). ACM.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Makrynioti</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grivas</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sardianos</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tsirakis</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Varlamis</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vassalos</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Poulopoulos</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Tsantilas</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>PaloPro: a platform for knowledge extraction from big social data and the news</article-title>
          .
          <source>International Journal of Big Data Intelligence</source>
          ,
          <volume>4</volume>
          (
          <issue>1</issue>
          ),
          <fpage>3</fpage>
          -
          <lpage>22</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Varlamis</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hilliard</surname>
            ,
            <given-names>D. F.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Finding in uential sources and breaking news in news media using graph analysis techniques</article-title>
          .
          <source>International Journal of Web Engineering and Technology</source>
          ,
          <volume>12</volume>
          (
          <issue>2</issue>
          ),
          <fpage>143</fpage>
          -
          <lpage>164</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Tsirakis</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Poulopoulos</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tsantilas</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Varlamis</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Large scale opinion mining for social, news and blog data</article-title>
          .
          <source>Journal of Systems and Software</source>
          ,
          <volume>127</volume>
          ,
          <fpage>237</fpage>
          -
          <lpage>248</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Margaris</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vassilakis</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Georgiadis</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Query personalization using social network information and collaborative ltering techniques</article-title>
          .
          <source>Future Generation Computer Systems</source>
          ,
          <volume>78</volume>
          ,
          <fpage>440</fpage>
          -
          <lpage>450</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Bampatzia</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bravo-Quezada</surname>
            ,
            <given-names>O. G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Antoniou</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nores</surname>
            ,
            <given-names>M. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wallace</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lepouras</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vassilakis</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2016</year>
          ,
          <article-title>September)</article-title>
          .
          <article-title>The use of semantics in the CrossCult H2020 project</article-title>
          .
          <source>In Semanitic Keyword-based Search on Structured Data Sources</source>
          (pp.
          <fpage>190</fpage>
          -
          <lpage>195</lpage>
          ). Springer, Cham.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Levy</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rajaraman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ordille</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>1996</year>
          ).
          <article-title>Querying heterogeneous information sources using source descriptions</article-title>
          .
          <source>Stanford InfoLab.</source>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Hand</surname>
            ,
            <given-names>D. J.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <source>Data Mining. Encyclopedia of Environmetrics</source>
          ,
          <volume>2</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Fayyad</surname>
            ,
            <given-names>U. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piatetsky-Shapiro</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smyth</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Uthurusamy</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          (
          <year>1996</year>
          ).
          <article-title>Advances in knowledge discovery and data mining</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>McAfee</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brynjolfsson</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davenport</surname>
            ,
            <given-names>T. H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patil</surname>
            ,
            <given-names>D. J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barton</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>Big data: the management revolution</article-title>
          .
          <source>Harvard business review</source>
          ,
          <volume>90</volume>
          (
          <issue>10</issue>
          ),
          <fpage>60</fpage>
          -
          <lpage>68</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Ghoshal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Swietojanski</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Renals</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2013</year>
          , May).
          <article-title>Multilingual training of deep neural networks</article-title>
          .
          <source>In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing</source>
          (pp.
          <fpage>7319</fpage>
          -
          <lpage>7323</lpage>
          ). IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Hinton</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deng</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dahl</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mohamed</surname>
            ,
            <given-names>A. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jaitly</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , ... and
          <string-name>
            <surname>Sainath</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>Deep neural networks for acoustic modeling in speech recognition</article-title>
          .
          <source>IEEE Signal processing magazine</source>
          ,
          <volume>29</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Violos</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tserpes</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Varlamis</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Varvarigou</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>Text classi cation using the n-gram graph representation model over high frequency data stream</article-title>
          .
          <source>Frontiers in Applied Mathematics and Statistics, section Mathematics of Computation and Data Science Journal. doi: 10</source>
          .3389/fams.
          <year>2018</year>
          .00041
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>