<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Tool to Analyze the Reading Behavior of the Users in a Mobile Digital Publishing Platform?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ludovico Boratto</string-name>
          <email>ludovico.boratto@acm.org</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mattia Cadeddu</string-name>
          <email>mattia.cadeddu@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Salvatore Carta</string-name>
          <email>salvatore@unica.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gianni Deplano</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fabio Mereu</string-name>
          <email>fabio.mereug@applixgroup.com</email>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Applix srl</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Dipartimento di Matematica e Informatica, Universita di Cagliari</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In their daily activities, users interact multiple times with mobile applications. This generates huge amounts of data related to these interactions that, when ltered and analyzed, would give insights on the behavior of the users while using an application. In this paper, we consider a real-world mobile digital publishing platform, named Viewerplus, which enables a digital, augmented fruition of content from traditional magazines. The objective is to develop a tool that allows the human editors to analyze the reading behavior of the users, by providing analytics that show how the users read magazine issues (i.e., how they browse an issue and move inside the app, which portions of an issue are most frequently read and which frequency, and which topics are of interest for the users during a reading session). The tool has been developed by employing a dataset extracted from the reading sessions of a magazine of an important international publisher. In this work we also employ the dataset to present a preliminary study of the user reading behavior.</p>
      </abstract>
      <kwd-group>
        <kwd>Reading Behavior</kwd>
        <kwd>Mobile Application</kwd>
        <kwd>Data Analysis</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        In order to access information, we interact with di erent types of devices, from
personal computers, to mobile phones, to tablets. These interactions take
various forms and the usage of mobile applications is certainly the most di used
nowadays. The vast amounts of data implicitly generated by the users during the
interactions might lead to useful information on the behavior of the users while
using the applications. In [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], the authors highlight that user behavior on mobile
applications is analyzed from three main perspectives, i.e., (i) data usage, (ii)
mobility patterns, and (iii) application usage. In this paper we will focus on the
rst and third types of behavior, by analyzing both the usage and the content
browsed by the users of a real-world mobile application, named Viewerplus3,
which serves as a magazine reader and provides the users with a digital and
augmented fruition of content. More speci cally, we will analyze how the users
browse the issues of a magazine while reading it, and which topics characterize
their interest.
      </p>
      <p>
        The analysis of the reading behavior of the users is an aspect that is gaining
more and more interest nowadays. In their survey, Okoli et al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] highlighted
that less than 1% of the studies focused on the readers of Wikipedia. However,
in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] it was highlighted that reading can be considered as a form of participation,
and in their recent study Lehmann et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] stated that the reading activity of
the users can provide insights to the human editors. Indeed, by understanding
the reading behavior of the users, human editors can tailor the structure of a
product such as a magazine, and improve aspects like the content organization
or the placing of the ads.
      </p>
      <p>
        In the recently mentioned study, the authors focus on the user preferences
and reading behavior on Wikipedia [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. According our knowledge, no study on the
reading behavior in mobile content fruition applications has been performed, and
the two application domains present substantial di erences. When analyzing the
reading behavior in a mobile content fruition application, it should be noted that
the browsing of a magazine in form of a pdf le inside an application presents
many di erences with respect to web browsing. Indeed, the type of browsing
we are considering is usually sequential (users usually move from one page to
the next), while web pages usually contain links and this is not the case of a
magazine issue, which reports the printed version in a digital le. Moreover, the
web pages usually form a hierarchy, and this is not the case in our scenario (users
employ the mobile application to read pdf les, which is structured as a sequence
of pages).
      </p>
      <p>In order to allow the human editors of a magazine to analyze the reading
behavior of the users, in this paper we present Reader Behavior, a Java tool that
analyzes the interactions of the users with the Viewerplus mobile application.
The tool presents analytics on how the users browsed a speci c issue, which
portions have been read more and with which frequency, and which topics are
more interesting for the users.</p>
      <p>The scienti c contributions of this paper are the following:
{ we study for the rst time in the literature the reading behavior of the users
in mobile applications;
{ we present a tool that gives the human editors the possibility to dynamically
explore the reading of di erent magazine issues, by selecting them and seeing
how users read their content;
{ we perform a preliminary study of the users' reading behavior, based on a
real-world dataset extracted from the reading sessions of a magazine
published by a famous international publisher.</p>
      <p>The rest of the paper is organized as follows: we rst present related work
(Section 2), followed by a description of the Viewerplus mobile application
(Sec</p>
    </sec>
    <sec id="sec-2">
      <title>3 http://www.viewerplus.com/</title>
      <p>tion 3); next, we present the tool developed to analyze the reading behavior
of the users along with a preliminary analysis of the results obtained
considering the reading sessions of a magazine (Section 4); the paper ends with some
concluding remarks and by presenting future work (Section 5).
2</p>
      <sec id="sec-2-1">
        <title>Related Work</title>
        <p>
          The reading behavior of the users in Web environments has been studied from
several perspectives. In [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], the readers of Wikipedia are analyzed and the authors
found out that the most read article are not the most edited ones, and they
identi ed four patterns that describe how the articles are read. Castillo et al. [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]
analyzed the life cycle of online news stories and discovered that the number of
visits of a news article and the activity on Twitter and Facebook decay after
a short time; moreover, the reactions on the social networks can be employed
to predict the future visits an article will receive. Zhang and Ma [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] analyzed
the correlations between users' educational level and their reading behavior,
and found out that higher educated people pay for academic papers, while the
other users prefer online literature. In [
          <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
          ], systems to analyze the web reading
behavior of the users by employing eye tracking systems were presented.
        </p>
        <p>
          Regarding the analysis of the user behavior in mobile applications, some
studies analyze the motivation behind their use. In [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], Church and Oliveira
compare the use of Whatsapp with respect to traditional SMS, and the results
show that WhatsApp is usually employed because of the reduced cost, the social
interactions it can o er, and its immediacy, while SMS is considered more
reliable and privacy preserving. In [
          <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
          ] the factors that lead to user engagement
are studied, and those that emerged as the most important are the perceived
enjoyment and usefulness of an application.
        </p>
        <p>
          The patterns in the usage of mobile applications were also studied. Xu et
al. [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] found out that 20% of the applications are local (e.g., radio stations),
that some applications co-occur in a smartphone (i.e., they can be treated as a
bundle), and that diurnal patterns of di erent genres of applications can be
signi cantly di erent. In [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ], Tossell et al. identi ed behavioral patterns associated
with browsing, native internet applications' use, and physical locations.
        </p>
        <p>
          The search behavior of the users has been investigated in [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], which
discovered that mobile information access is characterized in 94% of the sessions by
browsing content, that 8% of the users are involved in search activities, and that
these users have a much richer online behavior than the browsing-only
counterpart. In [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] it was highlighted that 70% of mobile information access happens
in a stationary place (e.g., at home or at work).
        </p>
        <p>
          The geospatial dynamics of mobile application usage were mostly analyzed
with clustering algorithms. In [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ], the authors clustered cell locations and
performed an analysis of the cells belonging to di erent clusters, nding that the
byte, packet, ow, and user distributions across di erent geographical regions
are signi cantly di erent. Keralapura et al. [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] performed a co-clustering of
users and websites, discovering that the browsing behavior of most users can be
classi ed as either homogeneous in terms of interests and characterized by short
sessions, or heterogeneous with very long sessions.
        </p>
        <p>As this analysis showed, no study in the literature is devoted at analyzing
the reading behavior of the users in mobile applications, and the problem we are
studying is novel.
3</p>
      </sec>
      <sec id="sec-2-2">
        <title>Viewerplus: a Mobile Digital Publishing Platform</title>
        <p>In this section we set out the mobile application employed in our study and
developed by Applix, called Viewerplus4, by providing an overview of its core
features, speci cally designed to address the needs of users during their reading
activities. For the purpose of this work, it is important to note that Viewerplus
is not a prototype, but a full-featured application used by thousands people
everyday, freely available for Android- and iOS-powered devices, and available in
the main digital distribution platforms, such as Apple's App Store, Google Play,
Amazon Marketplace, and Samsung Galaxy Apps. Vierwerplus is the leading
application for the visualization and digital fruition of magazine periodicals,
and it is employed by the main Italian editorial groups.</p>
        <p>The application allows users to browse magazine issues in a mobile device,
by interacting with a pdf le through several types of interactions and features
(e.g., zoom, underline, page saving, bookmark). The interaction is made possible
both o ine (users can read a magazine issue without being connected to the
Internet) and online, thanks to push noti cations and the possibility to access
to multimedia content. Indeed, the application supports integrations to include
photos, audios, videos, links to external pages, and ads. Moreover, users can
share excerpts of what they are viewing or reading on the main social media
platforms. Thanks to these online features, Viewerplus is also largely employed
by companies who want to provide their customers with their latest catalogue.</p>
        <p>The monitoring of the users activity inside the application is made possible
by a suite developed by Xorovo, named APP-BI5, which tracks the interactions
of the users with the application, and extracts analytics that can be employed
for di erent purposes, such as business intelligence.</p>
        <p>With this work, we aim to extend the functionalities o ered by APP-BI,
by introducing the concept of reading session and by analyzing in detail the
behavior of the users while reading magazine issues (e.g., which portions are
read and with which frequency, which pages are read together based on the
reading sessions). In conclusion, we would also like to point out that we will
focus on the functionalities o ered by Viewerplus as a reader. This means that
we will consider a scenario where a user can browse a pdf with a magazine
issue and no link, multimedia, or online content is available. Indeed, a user can
move through the pages of a pdf by reading a magazine and by exploring its
content with classic gestures that allow her to interact with the device and the
application (i.e., scroll, tap, zoom, swipe, etc.).</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>4 http://www.viewerplus.com/ 5 http://www.app-bi.com/</title>
      <sec id="sec-3-1">
        <title>Reading Behavior Analysis</title>
        <p>Here, we will present Reader Behavior, a Java tool developed to analyze and
automatically describe how users behave while reading magazine issues. This
section starts with a description of the collected dataset (Section 4.1), the data
processing performed to extract the reading sessions employed in our analysis
(Section 4.2), ending with a presentation of the tool developed to support the
human editors at analyzing the reading behavior and with a preliminary
presentation of our ndings (Section 4.3).
4.1</p>
        <sec id="sec-3-1-1">
          <title>Data Collection</title>
          <p>In order to build the tool and analyze the user behavior, we analyzed the
interactions of the users with the application, considering a magazine of a
widelyrenowned publisher. APP-BI keeps track of di erent types of events, but not all
of them are related to the reading behavior of the users (e.g., the purchase of an
issue).</p>
          <p>For this study, we collected the data related to the visualization of a page.
Such events are tracked if a user visits a page for at least two seconds (this value
was studied internally by the APP-BI development team and set as the optimal
one). Each record contains the following attributes: &lt;deviceID, issueID, time,
duration, pageID, pageNumber&gt;, where deviceID is employed to monitor the
behavior of a user that employs the same device, issueID identi es the
magazine issue, time indicates the timestamp in which the event started, duration is
the number of seconds that the user spent on the page, pageID is an absolute
identi er of the page number, and pageN umber indicates the number of the
page indicated in the pdf of the magazine issue. Note that having an
anonymous deviceID, in order to monitor the activities performed inside a device,
helps us analyzing the behavior by respecting the privacy of the users: indeed,
the identity of a user is not tracked by APP-BI and no personal information is
disclosed.</p>
          <p>We monitored the interactions of the users with the application between
01/04/2014 and 04/06/2015, recording 10994 events of this type, which involve
110 di erent magazine issues.</p>
          <p>We would like to point out that no metadata was made available, so we had no
table of contents that linked the articles to the pages, and no separation between
the title of an article and its text. All this information had to be automatically
extracted by us in order to analyze the user behavior, and we will describe this
process in the following subsection.
4.2</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>Data Preparation and Processing</title>
          <p>In order to have a more structured data representation and to link the collected
events to the content of a magazine issue, we performed three steps to divide
the events into reading sessions, get the text of each page in a magazine issue,
and automatically extract the topics of the magazine.</p>
          <p>
            Reading sessions de nition In [
            <xref ref-type="bibr" rid="ref18">18</xref>
            ], the authors de ne a browsing session
as all the activities that occur in less than 30 minutes between an activity and
the following. This de nition was also employed in [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ], to de ne the reading
sessions of the users in Wikipedia. In order to characterize the reading behavior
of the users, we also adopted this de nition, and considered as reading session
all the events that involve the same user and for which less than 30 minutes
passed between the end of an event (time + duration) and the beginning of the
following.
          </p>
          <p>Text extraction Given the pdf le of a magazine issue, we used Apache's
PDFBox6 to parse it and get as output the text.</p>
          <p>
            Page topics extraction Given the text of each issue, we automatically
extracted the topics that characterize the magazine. This was made thanks to
Latent Dirichlet Allocation (LDA), which is usually employed for this purpose
(i.e., extract the topics from a set of documents), by employing a Java
implementation of the algorithm made available in the MALLET framework [
            <xref ref-type="bibr" rid="ref19">19</xref>
            ].
          </p>
          <p>The framework received as input a text corpus with the content of all the
110 issues in the dataset and the number of topics to extract, and produced a
set of topics. After a set of experiments (not reported to facilitate the reading
of the paper), we extracted seven topics. This choice was made since having a
lower number of topics led to having keywords that belong to di erent domains
in the same topic, while having a number of topics higher than seven meant that
keywords that belong to the same domain were split into two topics.</p>
          <p>Out of the seven detected topics, two were characterized by keywords that
occur in all the issues (i.e., the details of the publisher, and common keywords
that appear in an issue such as \number", \price", and \data"). These two topics
were removed, and we manually assigned the following labels to the remaining
ve, according to the keywords extracted through LDA:
1. family life;
2. tv;
3. lifestyle;
4. health and self-care;
5. cinema.</p>
          <p>The choice to extract the topics for the whole magazine was made since a
magazine's articles are usually about the same topics (a magazine is usually
directed toward a speci c user target), to facilitate the manual labelling of the
topics given the keywords extracted by LDA, and to be able to compare the
reading behavior on di erent issues (e.g., the interest generated by the \cinema"
articles published in an issue with respect to those published in another).</p>
          <p>Given these ve topics, we processed each page of each issue through
MALLET, and extracted a vector whose elements indicate the relevance of each topic
for the page.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>6 https://pdfbox.apache.org/</title>
      <p>4.3</p>
      <sec id="sec-4-1">
        <title>A Tool to Analyze the Reading Behavior of the Users</title>
        <p>Reader Behavior o ers three main types of features to human editors:
1. Co-readings graph. By selecting a magazine issue, the tool shows a graph
that contains a node for each page, and an undirected weighted edge that
connects two pages that have been read one after the other. A human editor
has the possibility to interact with the graph and visualize only the edges
whose weight is above a certain threshold, in order to isolate the most read
subgraphs.
2. Visualization of the interest toward the pages. Given a magazine issue,
the tool shows each page as a box, whose color is based on the frequency
with which the page was read. This allows the human editors to analyze how
the readings are distributed and which pages caused more interest on the
users.</p>
      </sec>
      <sec id="sec-4-2">
        <title>3. Clustering of the pages that have been read together. For each maga</title>
        <p>zine issue, we perform a clustering to put together the pages that have been
read in the same sessions. Thanks to this feature, the human editors are
allowed to re-organize the content of future issue, by having an automatic
description of what users have read together.</p>
        <p>In the following, we will describe the details of each feature.</p>
        <p>Co-readings graph The rst feature o ers the possibility to human editors to
select a magazine issue and visualize a graph that contains a node for each page
in the issue, and an undirected weighted edge that connects two pages that have
been read one after the other in the same reading session of a user; the weight
represents the number of times the two pages have been read one after the other.
A screenshot of the feature is shown in Fig. 1.</p>
        <p>As it can be seen, a human editor is provided with the possibility to interact
with the graph, in order to see only the edges whose weight is above a certain
threshold (in the gure, the threshold is 75). This type of dynamic interaction
with the graph o ers the possibility to analyze in real time how the graph is split
into subgraphs and which components are strongly connected (each subgraph
represents a subset of pages for which users have shown the same interest).</p>
        <p>We are currently working on an automatic description of each subgraph in
terms of the topics that characterize its pages. A preliminary analysis shows
that pages that have been read together and with a similar frequency are also
characterized by the same topics. This is visually indicated by the colors in the
nodes, which are homogeneous in each subgraph (i.e, users tend to read together
pages related to the same topics).</p>
      </sec>
      <sec id="sec-4-3">
        <title>Visualization of the interest toward the pages With this feature, the</title>
        <p>tool visualizes all the pages of a magazine issue. For each page we consider the
number of times it has been read and split these values based on the quartiles.
This allows us to obtain four data quarters, which indicate how the interest
toward the pages of that issue is distributed.</p>
        <p>The tool visualizes the issue in a unique representation, and each page is
represented as a box whose color is given by the data quarter associated with
the number of times the page has been read. To give a clear di erentiation of
the data quarters, we chose four vivid colors; the 25% of less read pages ( rst
quarter) is represented with a green color, the 25% of pages under the median
(second quarter) has a cyan color, the 25% above the median (third quarter) is
given a violet color, and 25% of most read pages (fourth quarter) has a red color.</p>
        <p>Fig. 2 shows a representative example of an issue. With representative, we
mean that it depicts the usual distribution of the page readings if considering
di erent issues. Indeed, the rst part of the issue is usually the most read (red
boxes), alternated and followed by the violet boxes that represent the third
quarter. The cyan boxes that can be occasionally met even in the rst half of
the issue represent pages with ads, which have been automatically detected since
they are the ones with no text. The less read pages can be found at the end of the
issue, represented in green. The fact that the advertising pages do not represent
the less considered quarter shows the e ectiveness of placing ads in between
pages that are of interest for the users.</p>
      </sec>
      <sec id="sec-4-4">
        <title>Clustering of the pages that have been read together The last feature we</title>
        <p>present is a clustering of the pages in a magazine issue, based on the sessions in
which they have been read. Each page is represented by a binary vector, whose
elements represent the session IDs and contain 1 if the page was read in the
corresponding session, 0 otherwise.</p>
        <p>To estimate the number of clusters a priori, the tool employs a technique
called canopy. This is a fast approximate clustering technique, used to divide
the input set of points into overlapping clusters, known as canopies. Although
this algorithm may not give accurate and precise clusters, it can detect the
optimal number of clusters extremely quickly (i.e., with a single pass over the
data). For this reason, the tool runs the algorithm as a pre-processing step to
automatically nd the optimal number of clusters k, which is given as input to
the k-means clustering algorithm, along with the vector representation of the
pages, to generate the clusters.</p>
        <p>Once the clusters have been detected, the output is given as a set of pages
that are in a cluster, plus an automatically-generated description of the cluster
in terms of topics, where the relevance of the topic for the cluster is indicated.
Let relevancet;p indicate the relevance of a topic t for a page p, and distancep;c
indicate the distance of a page p from the centroid of the cluster c in which the
page is. The relevance of a topic t for a cluster c is built as follows:
relevancet;c =</p>
        <p>X relevancet;p
p2c distancep;c</p>
        <p>Thanks to this formula, the highest is the di erence in the reading behavior
of a page p with respect to the the others in the cluster c, the lower is the
weight assigned to the topic t for that cluster (the value distancep;c is seen as
an indication of \cohesion" between the page and the rest of the cluster).</p>
        <p>In order to give a relative value to the relevance of a topic for a cluster and
give the human editor the perception of the reading behavior in a cluster, we
normalize the relevance of each topic with a value between 0 and 1, as follows:
relevancet;c =</p>
        <p>relevancet;c
max(relevancec)
min(relevancec)
min(relevancec)
where min(relevancec) and max(relevancec) respectively indicate the
minimum and maximum relevance values obtained by a topic in a cluster c. Trivially,
1 is the score assigned to the most relevant topic, and 0 is the score assigned to
the least relevant topic.</p>
        <p>Due to space constraints, we will not show a screenshot of this feature, but we
will provide an example of the description of an issue, whose pages can be split
into three clusters based on the readings sessions. The automatic descriptions
generated by the tool are the following:
1. lifestyle (1.00), family life (0.99), health and self-care (0.31), cinema (0.00),
tv (0.00)
2. family life (1.00), lifestyle (0.56), health and self-care (0.24), cinema (0.05),
tv (0.00)
3. lifestyle (1.00), family life (0.84), health and self-care (0.32), cinema (0.03),
tv (0.00)</p>
        <p>Apart from the content in terms of pages of these three clusters (which would
not be interesting for this purpose of this paper), we can see that based on the
reading sessions, the interest of the users varies signi cantly from cluster to
cluster. Indeed, in the rst cluster, the pages that are characterized by lifestyle
and family life have been read with similar interest in the same sessions, while
health and self-care generated a much lower interest, and the users showed the
lowest interest for cinema- or tv-related topics. The second cluster of pages is
instead much more centered toward pages related to family life, while lifestyle
has been half as relevant in those reading sessions; health and self-care is a topic
that generates little interest, and cinema and tv still represent the topics that
generate less interest in the users. Finally, the third cluster shows the same
ranking as the rst one, but with di erent weights.</p>
        <p>This feature gives insights to the human editor on both the ways in which the
users read a magazine issue (pages are grouped based on the reading sessions)
and on the topics that characterize these sessions with their associated relevance.</p>
        <p>It should be noted that the combined use of the features provided by our tool
can be very helpful to the human editors. For example, if given these clusters a
human editor wanted to increase the relevance of cinema and tv pages, knowing
that they are usually placed at the end of an issue (i.e., the less read portion),
the articles related to these topics could be moved into a section of the magazine
that appears earlier in future issues.
5</p>
        <sec id="sec-4-4-1">
          <title>Conclusions and Future Work</title>
          <p>In this paper we presented Reader Behavior, a Java tool developed to analyze the
reading behavior of the users, based on their interactions with a mobile digital
publishing platform, named Viewerplus.</p>
          <p>Our proposal takes the data collected during the browsing of a magazine's
issues, extracts the reading sessions of the users, and provides visual and
descriptive features of how the users read a given magazine issue. The objective is
to provide the human editors with tools that allow them to get to know their
customers more and improve the service they provide to them.</p>
          <p>At the moment, the tool provides three features that describe the reading
behavior of the users from di erent perspectives. Future work will extend the
tool with additional features, like the automatic description of the subgraphs in
the \co-readings graph", or the possibility to click on a box with the interest
toward a page, in order to show which topics characterize that page. Moreover,
we will employ more real-world datasets and try to develop metrics to describe
the reading behavior not only in terms of single magazine issues, but by giving
a global view on how a magazine is read.</p>
        </sec>
        <sec id="sec-4-4-2">
          <title>Acknowledgments</title>
          <p>The authors would like to thank Gianluca Zuddas, Andrea Aresu, Giacomo
Piseddu, Paolo Tanzi, Davide Melis, Corrado Alvau, and Francesco Argiolas,
for their contribution in this research work.</p>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qiao</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
          </string-name>
          , F., Cheng, G.:
          <article-title>Characterizing user behavior in mobile internet</article-title>
          .
          <source>IEEE Trans. Emerging Topics Comput</source>
          .
          <volume>3</volume>
          (
          <year>2015</year>
          )
          <volume>95</volume>
          {
          <fpage>106</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Okoli</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mehdi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mesgari</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nielsen</surname>
            ,
            <given-names>F.A.</given-names>
          </string-name>
          , Lanamaki, A.:
          <article-title>The people's encyclopedia under the gaze of the sages: A systematic review of scholarly research on Wikipedia (</article-title>
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Antin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cheshire</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Readers are not free-riders: Reading as a form of participation on wikipedia</article-title>
          .
          <source>In: Proceedings of the 2010 ACM Conference on Computer Supported Cooperative Work. CSCW '10</source>
          , New York, NY, USA, ACM (
          <year>2010</year>
          )
          <volume>127</volume>
          {
          <fpage>130</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <article-title>Muller-</article-title>
          <string-name>
            <surname>Birn</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laniado</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lalmas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaltenbrunner</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Reader preferences and behavior on wikipedia</article-title>
          .
          <source>In: Proceedings of the 25th ACM Conference on Hypertext and Social Media. HT '14</source>
          , New York, NY, USA, ACM (
          <year>2014</year>
          )
          <volume>88</volume>
          {
          <fpage>97</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Castillo</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>El-Haddad</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pfe</surname>
            <given-names>er</given-names>
          </string-name>
          , J.,
          <string-name>
            <surname>Stempeck</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Characterizing the life cycle of online news stories using social media reactions</article-title>
          .
          <source>In: Proceedings of the 17th ACM Conference on Computer Supported Cooperative Work &amp;#38; Social Computing. CSCW '14</source>
          , New York, NY, USA, ACM (
          <year>2014</year>
          )
          <volume>211</volume>
          {
          <fpage>223</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ma</surname>
          </string-name>
          , W.:
          <article-title>Correlation analysis between users' educational level and mobile reading behavior</article-title>
          .
          <source>Library Hi Tech</source>
          <volume>29</volume>
          (
          <year>2011</year>
          )
          <volume>424</volume>
          {
          <fpage>435</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Beymer</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Russell</surname>
            ,
            <given-names>D.M.:</given-names>
          </string-name>
          <article-title>Webgazeanalyzer: A system for capturing and analyzing web reading behavior using eye gaze</article-title>
          .
          <source>In: CHI '05 Extended Abstracts on Human Factors in Computing Systems. CHI EA '05</source>
          , New York, NY, USA, ACM (
          <year>2005</year>
          )
          <year>1913</year>
          {
          <fpage>1916</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Granka</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joachims</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gay</surname>
          </string-name>
          , G.:
          <article-title>Eye-tracking analysis of user behavior in www search</article-title>
          .
          <source>In: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR '04</source>
          , New York, NY, USA, ACM (
          <year>2004</year>
          )
          <volume>478</volume>
          {
          <fpage>479</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Church</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>de</surname>
            <given-names>Oliveira</given-names>
          </string-name>
          , R.:
          <article-title>What's up with whatsapp?: Comparing mobile instant messaging behaviors with traditional sms</article-title>
          .
          <source>In: Proceedings of the 15th International Conference on Human-computer Interaction with Mobile Devices and Services</source>
          .
          <source>MobileHCI '13</source>
          , New York, NY, USA, ACM (
          <year>2013</year>
          )
          <volume>352</volume>
          {
          <fpage>361</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>Y.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>D.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wachter</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>A study of mobile user engagement (moen): Engagement motivations, perceived value, satisfaction, and continued engagement intention</article-title>
          .
          <source>Decis. Support Syst</source>
          .
          <volume>56</volume>
          (
          <year>2013</year>
          )
          <volume>361</volume>
          {
          <fpage>370</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Verkasalo</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lopez-Nicolas</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Molina-Castillo</surname>
            ,
            <given-names>F.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bouwman</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>Analysis of users and non-users of smartphone applications</article-title>
          .
          <source>Telemat. Inf</source>
          .
          <volume>27</volume>
          (
          <year>2010</year>
          )
          <volume>242</volume>
          {
          <fpage>255</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Erman</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gerber</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mao</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Venkataraman</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Identifying diverse usage behaviors of smartphone apps</article-title>
          .
          <source>In: Proceedings of the 2011 ACM SIGCOMM Conference on Internet Measurement Conference. IMC '11</source>
          , New York, NY, USA, ACM (
          <year>2011</year>
          )
          <volume>329</volume>
          {
          <fpage>344</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Tossell</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kortum</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rahmati</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shepard</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhong</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Characterizing web use on smartphones</article-title>
          .
          <source>In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. CHI '12</source>
          , New York, NY, USA, ACM (
          <year>2012</year>
          )
          <volume>2769</volume>
          {
          <fpage>2778</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Church</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smyth</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cotter</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bradley</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Mobile information access: A study of emerging search behavior on the mobile internet</article-title>
          .
          <source>ACM Trans. Web</source>
          <volume>1</volume>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Church</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oliver</surname>
          </string-name>
          , N.:
          <article-title>Understanding mobile web and mobile search use in today's dynamic mobile landscape</article-title>
          .
          <source>In: Proceedings of the 13th International Conference on Human Computer Interaction with Mobile Devices and Services</source>
          .
          <source>MobileHCI '11</source>
          , New York, NY, USA, ACM (
          <year>2011</year>
          )
          <volume>67</volume>
          {
          <fpage>76</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Sha</surname>
            <given-names>q</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>M.Z.</given-names>
            ,
            <surname>Ji</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.X.</given-names>
            ,
            <surname>Pang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          :
          <article-title>Characterizing geospatial dynamics of application usage in a 3g cellular data network</article-title>
          . In Greenberg,
          <string-name>
            <given-names>A.G.</given-names>
            ,
            <surname>Sohraby</surname>
          </string-name>
          , K., eds.
          <source>: Proceedings of the IEEE INFOCOM</source>
          <year>2012</year>
          ,
          <article-title>Orlando</article-title>
          , FL, USA, March
          <volume>25</volume>
          -30,
          <year>2012</year>
          , IEEE (
          <year>2012</year>
          )
          <volume>1341</volume>
          {
          <fpage>1349</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Keralapura</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nucci</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>Z.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gao</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Pro ling users in a 3g network using hourglass co-clustering</article-title>
          .
          <source>In: Proceedings of the Sixteenth Annual International Conference on Mobile Computing and Networking</source>
          .
          <source>MobiCom '10</source>
          , New York, NY, USA, ACM (
          <year>2010</year>
          )
          <volume>341</volume>
          {
          <fpage>352</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Catledge</surname>
            ,
            <given-names>L.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pitkow</surname>
            ,
            <given-names>J.E.</given-names>
          </string-name>
          :
          <article-title>Characterizing browsing strategies in the world-wide web</article-title>
          .
          <source>In: Proceedings of the Third International World-Wide Web Conference on Technology, Tools and Applications</source>
          , New York, NY, USA, Elsevier North-Holland, Inc. (
          <year>1995</year>
          )
          <volume>1065</volume>
          {
          <fpage>1073</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>McCallum</surname>
            ,
            <given-names>A.K.</given-names>
          </string-name>
          :
          <article-title>Mallet: A machine learning for language toolkit</article-title>
          . http://mallet.cs.umass.edu (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>