<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Model for Libraries: The CSBNO Consortium Approach</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Anna Maria Tammaro</string-name>
          <email>annamaria.tammaro@unipr.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michele Tomaiuolo</string-name>
          <email>michele.tomaiuolo@unipr.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Monica Mordonini</string-name>
          <email>monica.mordonini@unipr.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mattia Pellegrino</string-name>
          <email>mattia.pellegrino@unipr.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Riccardo Demicelis</string-name>
          <email>riccardo.demicelis@csbno.net</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CSBNO Consortium</institution>
          ,
          <addr-line>Milan</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Parma</institution>
          ,
          <addr-line>Parma</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The CSBNO Consortium surveyed its library communities during the lockdown and at reopening, to learn about their wishes and expectations of the library. Sentiment analysis could improve the analysis of these data by integrating the community's perception of the library into service design. The framework and methodology of the research are described in three planned phases: selection and loading of training data, text processing, and creating a model. The research is in its initial phase, and three characteristics will be analyzed: Information access, Library space, and Affect service. The findings will support CSBNO in promoting innovative libraries by actively engaging with participatory communities.</p>
        <p>Keywords: Sentiment analysis for libraries; User studies; Participatory approach</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The CSBNO (Culture Socialità Biblioteche Network Operativo) Consortium manages 60 libraries in the Milan area and coordinates the transformation of libraries and the innovation of services so that they support the changing needs of society. CSBNO collaborates with other innovative European libraries in the NEWCOMER project, funded by ERASMUS+, which promotes a vision of innovative libraries that improve their communities. The NEWCOMER project partners intend to promote innovative libraries by actively engaging with users in a participatory approach. A Manifesto is shared by all NEWCOMER project partners.</p>
    </sec>
    <sec id="sec-2">
      <title>1.1. How to get to know the library community?</title>
      <p>At the beginning of the Covid-19 pandemic in Italy, during the first lockdown from March to May
2020, CSBNO tried to stay connected with its library communities, informing them that the libraries, although
closed, continued to provide services. The greatest difficulty for CSBNO was changing the service model
from face-to-face services to remote services while understanding the communities' wishes and expectations
of the library. For data collection, librarians from the CSBNO Consortium made more than 30,000 telephone calls
to members classified as active or inactive. Active members are
defined as users enrolled in CSBNO libraries starting from 31 December 2017 who still use the loan
service; among these, those aged between 25 and 65 years were selected. Inactive members are defined as users enrolled
in CSBNO libraries starting from 31 December 2017 who no longer use the loan service.</p>
      <sec id="sec-2-1">
        <p>Two datasets were collected:</p>
        <p>1. telephone replies received from inactive library members;</p>
        <p>2. telephone replies received from active members.</p>
        <p>2022 Copyright for this paper by its authors.</p>
      </sec>
      <sec id="sec-2-2">
        <p>The first dataset contains responses collected during the lockdown from inactive members, who had
been active in the past.</p>
        <p>The second dataset contains responses collected during the lockdown from active members, who use
the library loan service.</p>
        <p>At the end of the lockdown in May 2020, the libraries of the CSBNO Consortium participated in a
national satisfaction survey called “Library for you” on users' perception of the library. The aim
of the national survey was to analyze satisfaction with libraries upon reopening. The
questionnaire, administered soon after the lockdown, allowed respondents to answer qualitative
questions about the level of service and to leave open comments, which provide additional data for
understanding community opinions. The answers from the communities of the CSBNO libraries were
extracted.</p>
        <p>The third dataset contains the responses to the national satisfaction survey from the CSBNO library
communities.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>2. Aims and objectives</title>
      <p>The CSBNO Consortium intends to build a sentiment analysis model as a tool for exploring community
expectations and wishes, on which to base a participatory approach to service design. The aim of the
research is to establish a data mining model that performs sentiment analysis on qualitative comments
collected by libraries. The objective is to test a new analytical method for understanding the data
collected from the community and for comparing them year by year.</p>
      <p>
        The feedback mechanism most used by libraries in Italy is the survey that collects data with a
questionnaire, such as “The Library for You” survey. However, this form of data collection has the drawback
that it is administered only to active users. To overcome this limitation, sentiment analysis, or opinion
mining, can apply data mining programs to other text datasets, such as the telephone replies of inactive members. As the name suggests, sentiment analysis
involves the analysis and identification of positive and negative opinions and emotions within a given
text [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Building such a model today will allow future library surveys conducted by the CSBNO Consortium to
be analyzed quickly and effectively, providing an accurate assessment of users' overall perception of
specific areas of the library.
      </p>
    </sec>
    <sec id="sec-4">
      <title>2.1. Sentiment analysis for libraries</title>
      <p>Sentiment analysis for libraries has not yet been studied in Italy. The international library community
has applied sentiment analysis to three kinds of sources: social media, free-text questionnaire answers,
and other corpora.</p>
      <p>
        A study that is important for this research was carried out by Canadian libraries, which collected
the free-text responses to the LibQUAL+ questionnaire [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The characteristics analyzed by the Canadian
libraries were:
• Information control: access to information, promotion, skills and bibliographic guides;
• Library space: approach to physical or digital space;
• Affect service: negative and positive sentiment for service.
      </p>
      <p>The first two characteristics capture the feelings about the two fundamental services of the library,
namely access to a collection and physical space. The third adds the emotional perception that the library
in general arouses in users.</p>
    </sec>
    <sec id="sec-5">
      <title>3. Methodology</title>
      <p>To compare respondent feedback over time, the comments of the three
datasets collected by CSBNO will be analyzed to track their sentiment and the topics they relate to. To
manage such a significant amount of data, computer-aided data mining tools will be used to
conduct sentiment analysis on the comments of each dataset. The framework for the
sentiment analysis model involves three steps: selection and loading of training data, text processing, and
creating a model. Two students have been involved in the project.</p>
    </sec>
    <sec id="sec-6">
      <title>3.1. Selection and loading of training data</title>
      <p>The pre-tagged training data are selected and loaded into the program. To create a model, both the text
elements and their corresponding sentiment labels must be selected.</p>
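      <p>As a minimal sketch of this loading step, assuming (hypothetically) that the pre-tagged comments are stored in a CSV file with one text column and one label column, the data could be read as shown here. The column names, the label set and the two sample comments are illustrative assumptions and are not taken from the CSBNO datasets.</p>
      <preformat>
```python
import csv
import io

# Hypothetical sample standing in for a pre-tagged CSV export.
SAMPLE_CSV = """text,label
Il personale è sempre gentile,positive
Gli orari di apertura sono troppo brevi,negative
"""

# Assumed label set; the actual tagging scheme may differ.
ALLOWED_LABELS = {"positive", "negative", "neutral"}


def load_training_data(handle):
    """Read (text, label) pairs, skipping rows with a missing or unknown label."""
    pairs = []
    for row in csv.DictReader(handle):
        text = (row.get("text") or "").strip()
        label = (row.get("label") or "").strip().lower()
        if text and label in ALLOWED_LABELS:
            pairs.append((text, label))
    return pairs


training_data = load_training_data(io.StringIO(SAMPLE_CSV))
print(training_data)
```
      </preformat>
      <p>Rows without text or with an unexpected label are skipped, so that only complete text and sentiment pairs reach the later steps.</p>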
    </sec>
    <sec id="sec-7">
      <title>3.2. Text processing</title>
      <p>The text considered by the CSBNO Consortium is in Italian. Text preprocessing eliminates minor
language differences, such as lowercase versus uppercase letters, pluralization, and tenses, using
common stemming and stop-word removal techniques, so that an accurate text analysis model can be created. However,
since some models use the grammatical structure of the text, the original plain text is also kept in the dataset
for possible use in the following steps of the analysis. Once preprocessing is finished, the training corpus is used to
create positive, negative and neutral feature vectors that capture the polarized elements
characterizing the text of each comment. These feature vectors are saved for future use.</p>
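      <p>The preprocessing described above can be illustrated with a small sketch. It is not the pipeline used in the project: the stop-word list is a tiny hypothetical sample, and the function toy_stem is a deliberately naive stand-in for an established Italian stemmer. The original text is kept next to the processed tokens, as the paragraph above requires.</p>
      <preformat>
```python
import re

# Tiny illustrative stop-word list; a real list would be much longer.
STOP_WORDS = {"il", "la", "i", "le", "di", "e", "è", "un", "una", "sono", "molto"}

# Final vowels removed by the toy stemmer (an assumption, not a real algorithm).
SUFFIXES = ("i", "e", "a", "o")


def toy_stem(word):
    """Strip one final vowel from words longer than three letters."""
    if len(word) > 3 and word.endswith(SUFFIXES):
        return word[:-1]
    return word


def preprocess(comment):
    """Lowercase, tokenize, drop stop words, and stem the remaining tokens."""
    tokens = re.findall(r"[a-zàèéìòù]+", comment.lower())
    return [toy_stem(t) for t in tokens if t not in STOP_WORDS]


original = "I bibliotecari sono molto gentili"
# Keep the original plain text alongside the processed tokens.
record = {"text": original, "tokens": preprocess(original)}
print(record)
```
      </preformat>
      <p>With this toy stemmer, singular and plural forms such as "libro" and "libri" are reduced to the same token, which is the effect that stemming is meant to achieve.</p>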
    </sec>
    <sec id="sec-8">
      <title>3.3. Creating a model</title>
      <p>Using these feature vectors, the program trains a classification algorithm to
separate unseen comments into positive, negative, or neutral, for sentiment analysis. As an
orthogonal task, the selected comments will also be classified according to their topic. This further
classification will provide a deeper and more complete understanding of the collected opinions. To
verify the models, they are tested on pre-tagged test data to measure the precision,
recall and accuracy of the classification. The model is saved for future use.</p>
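      <p>For reference, the three measures mentioned above can be computed as in the following sketch. The gold and predicted labels are invented for illustration, and precision and recall are computed here for a single target class, which is one common convention among several.</p>
      <preformat>
```python
def evaluate(gold, predicted, target):
    """Return precision and recall for one class, plus overall accuracy."""
    pairs = list(zip(gold, predicted))
    true_pos = sum(1 for g, p in pairs if g == target and p == target)
    false_pos = sum(1 for g, p in pairs if g != target and p == target)
    false_neg = sum(1 for g, p in pairs if g == target and p != target)
    correct = sum(1 for g, p in pairs if g == p)
    precision = true_pos / (true_pos + false_pos) if true_pos + false_pos else 0.0
    recall = true_pos / (true_pos + false_neg) if true_pos + false_neg else 0.0
    accuracy = correct / len(pairs) if pairs else 0.0
    return precision, recall, accuracy


# Hypothetical pre-tagged test labels and model predictions.
gold = ["positive", "negative", "neutral", "positive", "negative"]
predicted = ["positive", "positive", "neutral", "negative", "negative"]
precision, recall, accuracy = evaluate(gold, predicted, "positive")
print(precision, recall, accuracy)
```
      </preformat>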
      <p>In simplified terms, the most traditional approaches to sentiment analysis work by providing the
algorithm with a so-called “bag of words”, which allows it to recognize the words and groups of words
that humans use to express positive and negative opinions. The process is a form of supervised machine
learning: pre-tagged datasets are used as training examples to “teach” the computer and create the basis
for the classification of future unlabeled information. By providing pre-tagged “positive” (good, polite,
excellent, etc.) and “negative” (terrible, shoddy, rude, etc.) words, the data mining software can
establish a model that will be applied to future comments to determine their polarity, that is, whether they express
a positive or negative feeling. With the same approach, it is also possible to classify a text according to
its specific topic, in a task of topic detection. Alternatively, it is possible to use clustering algorithms to
group together texts with similar features, in an unsupervised scenario.</p>
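      <p>The supervised bag-of-words approach can be sketched with a small multinomial naive Bayes classifier written in plain Python. This is only an illustration under stated assumptions: the four training comments are hypothetical, they are assumed to be already tokenized, and Laplace smoothing is one possible choice for handling unseen words.</p>
      <preformat>
```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(samples):
    """Count class frequencies and per-class word frequencies."""
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocabulary = set()
    for tokens, label in samples:
        class_counts[label] += 1
        word_counts[label].update(tokens)
        vocabulary.update(tokens)
    return class_counts, word_counts, vocabulary


def classify(tokens, model):
    """Return the label with the highest log-probability (Laplace smoothing)."""
    class_counts, word_counts, vocabulary = model
    total = sum(class_counts.values())
    scores = {}
    for label, count in class_counts.items():
        score = math.log(count / total)
        denominator = sum(word_counts[label].values()) + len(vocabulary)
        for token in tokens:
            score += math.log((word_counts[label][token] + 1) / denominator)
        scores[label] = score
    return max(scores, key=scores.get)


# Hypothetical pre-tagged comments, already tokenized.
training = [
    (["staff", "polite", "excellent"], "positive"),
    (["good", "service", "polite"], "positive"),
    (["rude", "staff", "terrible"], "negative"),
    (["shoddy", "service", "rude"], "negative"),
]
model = train_naive_bayes(training)
print(classify(["polite", "staff"], model))
print(classify(["terrible", "service"], model))
```
      </preformat>
      <p>The same functions could be reused for topic detection by replacing the sentiment labels with topic labels.</p>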
      <p>
        In this work, the most representative and consolidated techniques of sentiment analysis and topic
detection will be compared. In particular, leading algorithms from different families will be considered,
including those based on some notion of geometric distance between samples (e.g. k-nearest neighbors and support vector machines) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ],
decision trees (random forests and XGBoost) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], probability and statistics (naive Bayes) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], and perceptrons and small neural
networks [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. These algorithms, or their combinations [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], have shown good accuracy on
many different datasets of small and medium size [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
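      <p>As one illustration of the distance-based family, the following sketch implements a k-nearest-neighbors classifier over word-count vectors, using cosine distance. The choice of cosine distance, the value of k and the training comments are assumptions made for the example and are not prescribed by the methodology.</p>
      <preformat>
```python
import math
from collections import Counter


def cosine_distance(a, b):
    """Cosine distance between two sparse word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def knn_predict(tokens, training, k=3):
    """Majority label among the k training samples closest to the query."""
    query = Counter(tokens)
    ranked = sorted(
        training, key=lambda item: cosine_distance(query, Counter(item[0]))
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical pre-tagged comments, already tokenized.
training = [
    (["polite", "staff"], "positive"),
    (["excellent", "service"], "positive"),
    (["good", "polite", "service"], "positive"),
    (["rude", "staff"], "negative"),
    (["terrible", "service"], "negative"),
    (["shoddy", "rude", "service"], "negative"),
]
print(knn_predict(["polite", "excellent", "staff"], training))
print(knn_predict(["rude", "terrible"], training))
```
      </preformat>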
      <p>
        However, some newer algorithms have improved the accuracy over larger datasets, exploiting
so-called deep neural network architectures, together with more advanced techniques for building the
feature vectors of the training set. In fact, the traditional vectorization, based on the bag-of-words
algorithm, creates a dataset with a very large number of features and requires an accurate and sensible
phase of feature selection to obtain the best results. Instead, techniques of word embedding and
dense representations [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] are able to map each word into a multidimensional space, where semantically
related words are represented as points at a short distance from each other. The vector representing each sample is
calculated on the basis of the positions of its words in this multidimensional space. Moreover, deep neural
networks have shown impressive results in many applications, including sentiment analysis
(BERT) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. However, these networks are characterized by a very large number of parameters that have to
be learned, requiring training samples on the scale of Big Data.
      </p>
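      <p>The idea of dense representations can be illustrated with a toy example. The three-dimensional vectors here are invented for the purpose; real embeddings are learned from large corpora and have hundreds of dimensions. Averaging the word vectors is only one simple way, assumed here, of computing the vector of a whole sample.</p>
      <preformat>
```python
import math

# Hypothetical 3-dimensional embeddings, invented for illustration.
EMBEDDINGS = {
    "polite": [0.9, 0.1, 0.0],
    "kind": [0.8, 0.2, 0.1],
    "rude": [-0.7, 0.3, 0.1],
    "staff": [0.0, 0.9, 0.2],
}


def embed(tokens):
    """Average the vectors of known tokens into one dense sample vector."""
    vectors = [EMBEDDINGS[t] for t in tokens if t in EMBEDDINGS]
    if not vectors:
        return [0.0, 0.0, 0.0]
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def cosine_similarity(a, b):
    """Cosine of the angle between two dense vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


related = cosine_similarity(EMBEDDINGS["polite"], EMBEDDINGS["kind"])
opposed = cosine_similarity(EMBEDDINGS["polite"], EMBEDDINGS["rude"])
sample_vector = embed(["polite", "staff"])
print(related, opposed, sample_vector)
```
      </preformat>
      <p>In this toy space, "polite" lies closer to "kind" than to "rude", which mirrors the property of semantically related words being represented by nearby points.</p>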
      <p>In the present work, these new techniques will also be used, exploiting pre-trained models and
additional phases of transfer learning and fine-tuning to adapt the models to the particular task at
hand. These additional steps usually require much smaller datasets than those used to train the whole
model.</p>
      <p>The research is in its initial step. We intend to analyze a training set of comments, randomly
selected from the responses of the three datasets collected. This training set will be manually reviewed
by the two students and labeled as having a positive or negative feeling.</p>
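      <p>A random selection of this kind could be sketched as shown here. The number of comments per dataset, the fixed seed and the sample comments are hypothetical; drawing the same number of comments from each dataset is an assumption of the example, since the sampling scheme is not specified in this paper.</p>
      <preformat>
```python
import random


def sample_for_labeling(datasets, per_dataset, seed=42):
    """Draw the same number of comments at random from each dataset."""
    rng = random.Random(seed)  # fixed seed so the selection is reproducible
    selected = []
    for name, comments in datasets.items():
        size = min(per_dataset, len(comments))
        for comment in rng.sample(comments, size):
            # The label is left empty until a human reviewer assigns it.
            selected.append({"dataset": name, "text": comment, "label": None})
    return selected


# Hypothetical placeholders for the three collected datasets.
datasets = {
    "inactive": ["comment a", "comment b", "comment c"],
    "active": ["comment d", "comment e", "comment f"],
    "survey": ["comment g", "comment h"],
}
to_label = sample_for_labeling(datasets, per_dataset=2)
print(len(to_label))
```
      </preformat>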
      <p>Using the data mining platform, these training sets of comments will provide the framework for
creating data-specific positive and negative word vectors to power the sentiment analysis model. An
additional process is planned to isolate individual topics within the larger comments, allowing
for more nuanced sentiment analysis.</p>
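      <p>One simple way to isolate topics within a longer comment is keyword matching at the sentence level, sketched here. The keyword lists are hypothetical and only loosely follow the three characteristics named earlier; the planned process may well use a different technique.</p>
      <preformat>
```python
import re

# Hypothetical keyword lists for the three characteristics.
TOPIC_KEYWORDS = {
    "information control": {"catalogue", "loan", "ebook", "books"},
    "library space": {"room", "seats", "building", "wifi"},
    "affect service": {"staff", "librarian", "polite", "rude"},
}


def split_by_topic(comment):
    """Map each topic to the sentences of the comment that mention it."""
    topics = {}
    for sentence in re.split(r"[.!?;]+", comment):
        words = set(re.findall(r"\w+", sentence.lower()))
        for topic, keywords in TOPIC_KEYWORDS.items():
            if words.intersection(keywords):
                topics.setdefault(topic, []).append(sentence.strip())
    return topics


comment = "The staff is polite. The reading room has too few seats."
by_topic = split_by_topic(comment)
print(by_topic)
```
      </preformat>
      <p>Each isolated sentence could then be passed separately to the sentiment classifier, so that one comment can carry a different polarity for each topic.</p>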
    </sec>
    <sec id="sec-9">
      <title>4. Conclusions</title>
      <p>The sentiment analysis model provides a complementary tool for analyzing the quantitative and
qualitative results of simple satisfaction surveys of active and inactive users of library services.
Applying sentiment analysis could facilitate a participatory approach with
communities, allowing a simple and efficient year-by-year analysis of open comments. The CSBNO
Consortium expects the sentiment analysis process to provide the means to isolate specific topics based
on specified keywords, allowing individual institutions to tailor results for more in-depth analysis.</p>
    </sec>
    <sec id="sec-10">
      <title>5. References</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Akhtar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ghosal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ekbal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bhattacharyya</surname>
          </string-name>
          ,
          <article-title>A multilayer perceptron based ensemble technique for fine-grained financial sentiment analysis</article-title>
          , in
          <source>Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing</source>
          (
          <year>2017</year>
          ), pp.
          <fpage>540</fpage>
          -
          <lpage>546</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G.</given-names>
            <surname>Angiani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Cagnoni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Chuzhikova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Fornacciari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mordonini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tomaiuolo</surname>
          </string-name>
          ,
          <article-title>Flat and hierarchical classifiers for detecting emotion in tweets</article-title>
          , in
          <source>Conference of the Italian Association for Artificial Intelligence</source>
          (
          <year>2016</year>
          ), pp.
          <fpage>51</fpage>
          -
          <lpage>64</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R. H.</given-names>
            <surname>Hama Aziz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Dimililer</surname>
          </string-name>
          ,
          <article-title>SentiXGboost: enhanced sentiment analysis in social media posts with ensemble XGBoost classifier</article-title>
          , in
          <source>Journal of the Chinese Institute of Engineers</source>
          (
          <year>2021</year>
          ),
          <volume>44</volume>
          (
          <issue>6</issue>
          ), pp.
          <fpage>562</fpage>
          -
          <lpage>572</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Huq</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rahman</surname>
          </string-name>
          ,
          <article-title>Sentiment analysis on Twitter data using KNN and SVM</article-title>
          , in
          <source>International Journal of Advanced Computer Science and Applications</source>
          (
          <year>2017</year>
          ),
          <volume>8</volume>
          (
          <issue>6</issue>
          ), pp.
          <fpage>19</fpage>
          -
          <lpage>25</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Laksono</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. R.</given-names>
            <surname>Sungkono</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Sarno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. S.</given-names>
            <surname>Wahyuni</surname>
          </string-name>
          ,
          <article-title>Sentiment analysis of restaurant customer reviews on tripadvisor using naïve bayes</article-title>
          , in
          <source>IEEE 2019 12th International Conference on Information &amp; Communication Technology and System</source>
          (
          <year>2019</year>
          ), pp.
          <fpage>49</fpage>
          -
          <lpage>54</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Moore</surname>
          </string-name>
          ,
          <article-title>Constructing a sentiment analysis model for LibQUAL+ comments</article-title>
          , in
          <source>Performance Measurement and Metrics</source>
          (
          <year>2017</year>
          ),
          <volume>18</volume>
          (
          <issue>1</issue>
          ), pp.
          <fpage>78</fpage>
          -
          <lpage>87</lpage>
          . https://doi.org/10.1108/PMM-07-2016-0031
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>C.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Qiu</surname>
          </string-name>
          ,
          <article-title>Utilizing BERT for aspect-based sentiment analysis via constructing auxiliary sentences</article-title>
          (
          <year>2019</year>
          ), arXiv preprint arXiv:1903.09588.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Tomaiuolo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Lombardo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mordonini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Cagnoni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Poggi</surname>
          </string-name>
          ,
          <article-title>A survey on troll detection</article-title>
          , in
          <source>Future Internet</source>
          (
          <year>2020</year>
          ),
          <volume>12</volume>
          (
          <issue>2</issue>
          ), p.
          <fpage>31</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T.</given-names>
            <surname>Wilson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wiebe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hoffmann</surname>
          </string-name>
          ,
          <article-title>Recognizing contextual polarity in phrase-level sentiment analysis</article-title>
          , in
          <source>Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing</source>
          ,
          <publisher-name>Association for Computational Linguistics</publisher-name>
          (
          <year>2005</year>
          ) pp.
          <fpage>347</fpage>
          -
          <lpage>354</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Yadav</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. K.</given-names>
            <surname>Vishwakarma</surname>
          </string-name>
          ,
          <article-title>Sentiment analysis using deep learning architectures: a review</article-title>
          , in
          <source>Artificial Intelligence Review</source>
          (
          <year>2020</year>
          ),
          <volume>53</volume>
          (
          <issue>6</issue>
          ), pp.
          <fpage>4335</fpage>
          -
          <lpage>4385</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>