=Paper=
{{Paper
|id=Vol-3102/paper7
|storemode=property
|title=CMS Optimisation with Deep Learning Techniques
|pdfUrl=https://ceur-ws.org/Vol-3102/paper7.pdf
|volume=Vol-3102
|authors=Alberto Schiaffino,Matteo Reina,Ricardo A. Matamoros A.,Francesco Epifania,Francesco Ruggeri,Ignazio Maria Castrignano,Luca Marconi
|dblpUrl=https://dblp.org/rec/conf/aiia/SchiaffinoRAERC21
}}
==CMS Optimisation with Deep Learning Techniques==
<pdf width="1500px">https://ceur-ws.org/Vol-3102/paper7.pdf</pdf>
<pre>
       CMS Optimization with Deep Learning
                  Techniques

          Alberto Schiaffino2 , Matteo Reina2 , Ricardo A.Matamoros
           A.1[0000−0002−1957−2530] , Francesco Epifania1 , Francesco
       1[0000−0002−5827−2565]
Ruggeri                       , Ignazio Maria Castrignano1[0000−0003−1123−2520] ,
                   and Luca Marconi1[0000−0002−0236−6159]
                             1
                             Social Things SRL, Italy
   {ricardo.matamoros, francesco.epifania, francesco.ruggeri, ignazio.castrignano,
                       luca.marconi}@socialthingum.com,
                              2
                                Engitel S.p.a., Italy
              {alberto.schiaffino, matteo.reina}@engitel.com


      Abstract. This study stems from the observation of the growing de-
      mand, interest and incredible potential offered by Artificial Intelligence
      and Machine Learning techniques in the fields of content website or e-
      commerce.
      In particular, we will study the Content Management System (CMS), a
      system capable of creating and maintaining websites, forums and appli-
      cations for customers. These systems have an intrinsic predisposition to
      provide services based on Machine Learning (ML), thus being able to
      add to the solutions offered by the most innovative ML systems.
      These include Sentiment Analysis, the Recommender System and the
      more innovative Chatbot .
      Integrating them into a website or web platform very often leads to sig-
      nificantly improved results for the user-experience offered to the user.
      This study aims to analyse the new artificial intelligence techniques and
      how to implement them in a company. Reporting also a case of Study
      based on a real agency.

      Keywords: Machine Learning · Content Management Systems · Artifi-
      cial Intelligence · Sentiment Analysis · Natural Language Processing


                      Copyright © 2021 for this paper by its authors. Use
permitted under Creative Commons License Attribution 4.0 International (CC
BY 4.0).
1   Context description

The objective of the study is to create a usable Platform-as-a-Service (PaaS) ex-
tension that makes available and integrates different Machine Learning services
so that the company can offer these solutions to its customers.

To this end, an infrastructure will be created to show an interface through
which CMS can request different ML services as needed. In particular, CMS
will only have to indicate which specific ML module it wants to use within a
site (whether a Sentiment Analysis module [2], items recommendations [1], ...)
and some parameters; the infrastructure will interpret the request and select the
correct module to be queried.
For some of these, there are already various solutions offered by providers such
as Microsoft (Azure), Google (Google Cloud Platform) and Amazon (AWS); in
these cases, the services will be studied and tested to see which ones meet the
requirements.
Where the offered solutions are not adequate, the necessary ML models will be
built from scratch, starting from the state of the art and improving it with more
recent techniques. In particular, given the central importance in websites and the
absence of an adequate service, the item recommendation system will be studied.

The realisation of the project involves the design and development of two com-
ponents: the architectural infrastructure and the ML-based services.


2   Technology implementation

By analysing the time required for the various phases of the implementation,
we have arrived at significant estimates. The first macro-phase is aimed at the
realisation of the infrastructure that integrates the services and makes them
easily usable.
The specific sub-phases, with their corresponding time estimates, are:
Numbered (ordered) lists are easy to create:

1. Infrastructure design and development (1 month)
2. Design and development of procedures for data life cycle management (0.5
   month)
3. Design and development of the communication interfaces with the Content
   Management System (0.5 months)

The second macro-phase is aimed at the implementation of basic services with
ML. The specific sub-phases are:

1. Study of the state of the art of existing ML services (0.5 months)
2. Integration of ML models already offered by external services (1.5 months)
3. State of the art of Recommender Systems (RS) (0.5 months)
4. Model evaluation and testing (1.0 months)
The aim is to create a working prototype with at least TRL 5, to verify and
validate the effectiveness of the technologies applied to the reference context
before moving on to the next phase.


3     State-of-the-art innovation

The implementation of these technologies within a company produces first and
foremost Product Innovation, as it proposes a radical enhancement of the propos-
ing company’s current solution. This will allow the company to open up to new
markets, to meet demands that it could not previously satisfy.
But the main value of these technologies is that they will enable important sim-
plifications and reduction of implementation times in the business processes of
customers using the Content Management System.
The CMS enables:

 – more timely, effective and efficient data collection;
 – a complete processing of data and information through a wide variety of
   tools;
 – overall integration between them
 – the provision of new services that did not previously exist

Therefore, the implementation will affect: the enhancement of management ac-
tivities, optimisation and monitoring of business processes through artificial in-
telligence and Machine Learning techniques. In fact, the multiple synergies that
it stimulates within the processes, allow a restructuring of the interaction be-
tween technology and organization, thus digital transformation in a broad sense.


3.1    Unique and complete system

The artificial intelligence-based services offered by some providers, such as Mi-
crosoft, Google and Amazon, are ’black box’, they are neither unified nor inte-
grated, nor are the solutions complete with all services.
The proposed infrastructure, on the other hand, is aimed at a comprehensive and
dynamic management of the whole range of tools and services and the necessary
data.
In particular, the infrastructure will deal with:

 – management of the data life cycle
 – creation and updating of models
 – communication between the infrastructure and the system

The architecture of the infrastructure will make it possible to implement the full
spectrum of services required, either by developing them from scratch or by ex-
ploiting those already offered by the providers, thus enabling interchangeability
and the choice of the most performing ones.
3.2   Study and implementation of new ML models


Particular attention was paid to different Recommender System models. It starts
with a state-of-the-art study of the most popular algorithms currently imple-
mented to try out more recent techniques based on neural networks. In this way
it is possible to test and evaluate different solutions, describing which ones and
in which contexts they provide better results.
Some services are related to all or some of the following fields:

 – Recommender systems: item recommendations customised for the user
 – Sentiment Analysis: extraction of opinion from a text [9][20]
 – Content Moderator: collection of user contributions (comments, generic text
   and image content) and their automatic moderation to filter out inappropri-
   ate or out of scope content [3][4]
 – Text analysis for SEO [12][19]
 – Chatbot or advanced search to refine the content searched for the user [7]
   [14] [16]

In this way, a significant improvement in the state of the art will be possible,
which will have a significant impact on the competitive advantage of the company
and of the customers who adopt its system.


Fig. 1. In the figure we see at a very high level the link between the various services
used in this context.
4     Economic impact

4.1   How to finance it

The breakdown of costs according to the actions envisaged in the project is
indicatively as follows:

 1. Activities to identify the Artificial Intelligence services useful to complete
    the CMS platform. (6% of costs)
 2. Design and development of the infrastructure with CMS interface. (30% of
    costs)
 3. Integration and implementation of basic services with ML.(52% of costs)
 4. User Interface validation activities.(12% of costs)


4.2   Impact on Enterprise

A second macro-result that can be highlighted, from the point of view of the
effects on productivity and on the increase in revenues, is that relating to the
management of data, including unstructured data: the possibility for the infras-
tructure to jointly manage heterogeneous and also unstructured data and obtain
significant insights, thanks also to economies of scale and scope and knowledge,
allows in the medium term a reduction in organisational costs for data process-
ing and knowledge management in the broadest sense [13] [15]. These potential
productivity gains from optimised knowledge management [17] and the parallel
decreases in organisational costs are then also transferred to the organisational
structure of customers, who see significantly improved performance from a data
analytics perspective.
And that’s not all: the numerous possibilities for creating and selling customised
services, as well as customised data and information based on the needs of cus-
tomers, means that differentiated pricing logics can be configured, making in-
formation available on request according to the logic of the ”Data as a Service”
paradigm.


4.3   Impact on the Environment

As far as product innovation is concerned, the proposed system is characterised
first and foremost by the potential environmental benefits that can be obtained
by integrating services into a single infrastructure. This entails numerous energy-
saving factors deriving from the univocal and omni-comprehensive management
of the entire data life cycle.
As can be seen from the study carried out applying this technology to e-learning
courses. The resources recommended by the system will be improved in quality
and quantity through external sources as they have done in their work by Garg
et al. [8].
In particular, the user will analyse the data by means of a single architectural
structure integrating the different services, which are variously complementary
from the point of view of the obtainable insides: consequently, the use of the
energy resources of the Content Management System will be structured in a
progressively better way according to the reorganisation of the production pro-
cesses stimulated adaptively by the use of the platform itself. In this way, the
content provided to the user will dynamically adapt to his needs in a more spe-
cific and precise manner and navigation within the system will be facilitated,
with significant savings in time and energy.
Following this, the improved performance of the system will progressively stim-
ulate further savings, thanks to Process Intelligence, through the highlighting of
organisational bottlenecks at each stage of the process and data life cycle. Ulti-
mately, the resulting environmental, social and economic benefits will be percep-
tible from the point of view of both the company and the customers themselves,
and therefore of the entire ecosystem that uses the services generated by the
CMS.

In addition, since the chosen architecture is that of a Platform-as-a-Service,
all processing linked to the Machine Learning algorithms will be carried out on
Cloud machines optimised for this type of processing, reducing consumption in
terms of energy used and heat produced, as we seen in the work of Rajabion,
Lila, et al.. [18]
Finally, during the entire life cycle of the project, the choice of the Cloud implies
further important benefits for the entire system from the point of view of envi-
ronmental impact and in terms of saving on the costs of energy for local data
and materials for server hardware.


5   Case of Study

Over the last few years Engitel has developed its own Content Management Sys-
tem (CMS) called Spin&GO (SG) with which it creates and maintains websites,
forums and applications for its customers. Spin&GO is increasingly a Made in
Italy excellence. Engitel has already participated in the project called SPAC3,
as part of the projects funded by the Lombardy region as Bando Smart Cities
2014 and the Easy 4.0 project.
The system currently used by Engitel, Spin&Go, does not offer any solution
based on Artificial Intelligence; the advantages that can be achieved through
these techniques are by now well known to all and in this case they mainly con-
cern the quality of the experience provided to the user while navigating the site
or using the application: from the suggestion of personalised content based on
the user, to the automation of content insertion procedures, to the automatic
recognition of multimedia content.

The SPIN&GO BRAINY project therefore opens the way to the use of AI ap-
plications and environments within the CMS and the Engitel processes.
The use of such services therefore leads to a higher quality of the products pro-
vided by Engitel, consequently with an increase in turnover for the company.
Engitel turned to Social Things which has all the complementary competences
in the Artificial Intelligence field necessary to realise SPIN&GO BRAINY.

Therefore, the SPIN&GO BRAINY project allows the opening of the propos-
ing company to new markets and new clients (who could currently use other
providers), through the provision of Machine Learning services: this significantly
expands the offer and enriches the value proposition of Engitel itself.

On the one hand, it is a unique system especially at the Italian level and unified
of different synergic or complementary ML services, which allows for an easy
integration within websites and applications.
Therefore, the SPIN&GO BRAINY project fully falls within the macro-area of
Intelligent Business Process Management, as defined by Gartner [22]: that is, the
enhancement of management activities, optimisation and monitoring of business
processes by means of Artificial Intelligence and Machine Learning techniques.

Studying the economic impact on the company we note that:
Engitel’s revenue over the last 4 years has been roughly as follows:

 – Year 2017: 3,795,448€.
 – Year 2018: 4.207.526€.
 – Year 2019: 3.716.874€.
 – Year 2020: 3.743.917€.

The Net Profit after tax (in thousands of Euros) amounted to:

 – Year 2017: 685,508€.
 – Year 2018: 656,868€.
 – Year 2019: 430,321€.
 – Year 2020: 451,196€.

    From the above figures, Engitel is able to tackle this project with its own
means. The financial and budget parameters recorded in the last fours years and
the forecasts make this possible, as the investment of approximately €40,000
represents approximately 7% of the average net profit of the last three years and
the company is not in debt.
The price differentiation resulting from customisation and the user-centred offer
will enable Engitel and its customers to progressively increase revenues.
In the case of Engitel, it will be necessary to recruit highly specialised technical
staff to manage the enhanced CMS: therefore, figures will be needed to deal with
system integration and system administration, for the overall management of the
platform, but also data scientists and full-stack developers to design specific ML
modules. It is estimated that there will be an increase of about 10 people in
three years.

In the case of Social Things, an additional two engineers specialised in data
science and AI will be needed to design the algorithms and modules, while an
additional full-stack developer will take care of the infrastructure design.

In fact, Engitel currently manages each process and the resulting services for
specific customers in an individual and totally differentiated way. For each web
service offered, there are specific resources allocated, resulting in energy redun-
dancies due to waste and lack of synergies.
By means of the SPIN&GO BRAINY platform, from the point of view of pro-
duction processes, the optimised management of knowledge deriving from the
integration of synergic services will instead allow an overall saving in terms of
energy and production efficiency.


6   Implementation details

In this work we plan to use Microsoft cloud tools for the delivery of the ML
services provided. As we can observe below, in the figure 2, the architecture
and data flow of this proposed work had a simple and intuitive representation,
specifically we will use:

 – Azure Data Factory and Azure Synapse Analytics: automating the extrac-
   tion, loading and transformation (ELT) workflow
 – Azure Databricks and Azure SQL Server Analysis Services: for analytics and
   batch data management.
 – Azure API management: for publishing APIs for internal and external con-
   sumers

The importance of cloud services lies in the improvement of software scalability
as it allows the optimisation of continuous integration and release processes of
functionalities, both in the creation and update phases. In particular, the services
for the hierarchical orchestration of functionalities introduce greater flexibility
in terms of structural independence by providing the possibility to efficiently
implement the configuration of microservices for the CMS.
The 4 proposed services will be used through cloud APIs according to a set of
characteristics:

 – on-demand and self-service
 – high accessibility and flexibility
 – dynamic grouping of resources by adapting the allocation to physical storage
   and computing devices according to requests
 – service measurement and monitoring and control to ensure transparency of
   service use.

The choice of Microsoft Azure cloud infrastructure relates to a number of tech-
nological and operational aspects, but also to recognitions received in the API
sphere. In fact, Microsoft was listed as a leader in Gartner’s 2021 Magic Quad-
rant report for Comprehensive API Lifecycle Management.
                         Fig. 2. Azure API architecture


7   Conclusions and Future Developments


In this study we have analysed the implementation of new technologies based
on Machine Learning to Content Management Systems.
The problem addressed in this work, besides being complex because it concerns
a vast domain of active search, presents other problems related to the need to
apply it on a real case, i.e. on a company that offers such services.
Has we can see in the work of Bonhard et al. [6] it is possible to significantly
improve the quality of recommendations by exploiting the knowledge derived
from users profile and their social network pages.
The case study analysed in this paper shows how Engitel through its CMS called
SPIN&GO offers the indicated web services and by creating instead a PaaS that
integrates new Machine Learning technologies it is able to guarantee the en-
hancement of management activities, optimisation and monitoring of business
processes through Artificial Intelligence and Machine Learning techniques.
The actual state-of-the-art envisages the use of deep learning technique in the
area of Natural Language Processing, in particular the best performing model
now are based on Recurrent Neural Networks and/or reinforcement learning as
in the work of Subramanyam et al. [21] in which are described the the model
based on the self-supervised learning and transfer learning [10].
Another research line propose as approach to NLP models based on attentional
mechanism as introduced in the paper of Hu and Dichao [5].
References
1. Bahdanau, Dzmitry, Kyunghyun Cho, and Yoshua Bengio. ”Neural machine trans-
   lation by jointly learning to align and translate.” arXiv preprint arXiv:1409.0473
   (2014).
2. Hussein, Doaa Mohey El-Din Mohamed. ”A survey on sentiment analysis chal-
   lenges.” Journal of King Saud University-Engineering Sciences 30.4 (2018): 330-338.
3. Gorwa, Robert, Reuben Binns, and Christian Katzenbach. ”Algorithmic content
   moderation: Technical and political challenges in the automation of platform gov-
   ernance.” Big Data & Society 7.1 (2020): 2053951719897945.
4. Gillespie, Tarleton. ”Content moderation, AI, and the question of scale.” Big Data
   & Society 7.2 (2020): 2053951720943234.
5. Bonhard, Philip, and Martina Angela Sasse. ”’Knowing me, knowing you’—Using
   profiles and social networking to improve recommender systems.” BT Technology
   Journal 24.3 (2006): 84-98.
6. Subramanyam Kalyan, Katikapalli, Ajit Rajasekharan, and Sivanesan Sangeetha.
   ”AMMUS: A Survey of Transformer-based Pretrained Models in Natural Language
   Processing.” arXiv e-prints (2021): arXiv-2108.
7. Nuruzzaman, Mohammad, and Omar Khadeer Hussain. ”A survey on chatbot imple-
   mentation in customer service industry through deep neural networks.” 2018 IEEE
   15th International Conference on e-Business Engineering (ICEBE). IEEE, 2018.
8. Garg, Vishal, and Ritu Tiwari. ”Hybrid massive open online course (MOOC) rec-
   ommendation system using machine learning.” International Conference on Recent
   Trends in Engineering, Science & Technology-(ICRTEST 2016). IET, 2016.
9. Young, Tom, et al.. ”Recent trends in deep learning based natural language pro-
   cessing.” ieee Computational intelligenCe magazine 13.3 (2018): 55-75.
10. Hu, Dichao. ”An introductory survey on attention mechanisms in NLP problems.”
   Proceedings of SAI Intelligent Systems Conference. Springer, Cham, 2019.
11. Alyafeai, Zaid, Maged Saeed AlShaibani, and Irfan Ahmad. ”A survey on transfer
   learning in natural language processing.” arXiv preprint arXiv:2007.04239 (2020).
12. Vedantam, Vamsi Krishna. ”The Survey: Advances in Natural Language Processing
   using Deep Learning.” Turkish Journal of Computer and Mathematics Education
   (TURCOMAT) 12.4 (2021): 1035-1040.
13. Zaslavsky, Arkady, Charith Perera, and Dimitrios Georgakopoulos. ”Sensing as a
   service and big data.” arXiv preprint arXiv:1301.0159 (2013).
14. Bala, K., et al.. ”Chat-bot for college management system using AI.” International
   Research Journal of Engineering and Technology 4.11 (2017): 2030-2033.
15. O’Leary, Daniel E. ”Using AI in knowledge management: Knowledge bases and
   ontologies.” IEEE Intelligent Systems and Their Applications 13.3 (1998): 34-39.
16. Lalwani, Tarun, et al.. ”Implementation of a Chatbot System using AI and NLP.”
   International Journal of Innovative Research in Computer Science & Technology
   (IJIRCST) Volume-6, Issue-3 (2018).
17. Mendes Rodrigues, Eduarda, and Natasa Milic-Frayling. ”Socializing or knowledge
   sharing? Characterizing social intent in community question answering.” Proceed-
   ings of the 18th ACM conference on Information and knowledge management. 2009.
18. Rajabion, Lila, et al.. ”A new model for evaluating the effect of cloud computing
   on the e-learning development.” Journal of Workplace Learning (2019).
19. Gjorgjevska, Emilija, and Georgina Mirceva. ”Content Engineering for State-of-
   the-art SEO Digital Strategies by Using NLP and ML.” 2021 3rd International
   Congress on Human-Computer Interaction, Optimization and Robotic Applications
   (HORA). IEEE, 2021.
20. Lee, Gichang, et al.. ”Sentiment classification with word localization based on
   weakly supervised learning with a convolutional neural network.” Knowledge-Based
   Systems 152 (2018): 70-82.
21. Lee, Joohong, Sangwoo Seo, and Yong Suk Choi. ”Semantic relation classifica-
   tion via bidirectional lstm networks with entity-aware attention using latent entity
   typing.” Symmetry 11.6 (2019): 785.
22. Wang, John, et al.. ”Cloud Computing, Intelligent Business Process Management
   and Artificial Intelligence.” international journal of data analysis and information
   systems 9 (2017): 1-12.

</pre>