<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Is Readability a Valuable Signal for Hashtag Recommendations?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>CCS Concepts</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Hashtag Recommendation; Readability</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Human-centered computing</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Ion Madrazo Azpiazu Computer Science Dept. Boise State University Boise</institution>
          ,
          <addr-line>Idaho</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Maria Soledad Pera Computer Science Dept. Boise State University Boise</institution>
          ,
          <addr-line>Idaho</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We present an initial study examining the bene ts of incorporating readability indicators in social network-related tasks. In order to do so, we introduce TweetRead, a readability assessment tool speci cally designed for Twitter and use it to inform the hashtag prediction process, highlighting the importance of a readability signal in recommendation tasks.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>Readability is a measure of the ease with which a text can
be read. Usually represented by a number, it is an indicator
used by teachers to classify and nd appropriate resources
for students. Several studies have demonstrated the bene ts
of using readability indicators in educational-related
applications, such as book recommendation, text simpli cation, or
automatic translation. However, applying readability
indicators outside this environment remains relatively unexplored.
Social networks could bene t from readability assessment.
Twitter is a social network where users and texts are the
main focus. For this reason, it is natural to think that for
Twitter the ease with which a tweet can be understood by
a user may a ect his interest in it, and therefore in uence
actions taken, such as re-tweeting, giving a like or replying
to the tweet.</p>
      <p>
        The authors of [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] examined the degree to which the age
of a user, a feature strongly correlated with readability,
inuences who people follow on Twitter, and demonstrated
that Twitter users have a higher chance to follow people
of similar age. Using standard readability measures in text
from Twitter, which constrains tweets to be of at most 140
characters in length, is not a trivial task. The lack of
structure and shortness of those texts make standard natural
Copyright held by the authors.
language analysis techniques ine cient. With that in mind,
we developed TweetRead, a novel readability assessment tool
speci cally designed for tweets. TweetRead takes advantage
of social information, such as hashtags or mentions, for
predicting the text complexity levels of tweets. Furthermore, in
order to highlight the usefulness of such a tool in social
networking environments, we developed a simple, yet e ective,
hashtag recommendation strategy that takes advantage of
TweetRead-generated complexity levels of tweets to inform
the hashtag recommendation process.
2.
      </p>
    </sec>
    <sec id="sec-2">
      <title>TWEETREAD</title>
      <p>
        TweetRead's goal is to estimate readability of any given
tweet T . TweetRead is based on a logistic regression
technique1 that fuses simple indicators describing T from di erent
perspectives and determines its text complexity. The
indicators considered by TweetRead include: (i) T 's readability
level, estimated using F lesch2 [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], (ii) T 's similarity with
respect to word distributions generated from a large Twitter
corpora C labeled by age groups, (iii) average readability of
each hashtag h in T , computed based on the average
readability levels estimated using Flesch of tweets in C that include
h, (iv) average readability level of users mentioned on T ,
estimated using Flesch on tweets written by mentioned users,
and (v) frequency of mentions, emoticons, and hashtags in T .
      </p>
      <p>
        Unlike traditional readability formulas that tend to map
readability levels with school grades, to tailor TweetRead to
the Twittersphere, we consider six levels of text complexity
following Levinston's [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] adult development stages.
3.
      </p>
    </sec>
    <sec id="sec-3">
      <title>HASHTAG RECOMMENDATION</title>
      <p>
        Hashtags are character strings used to represent concepts
on Twitter, starting with a # symbol. They are a core
Twitter feature and serve classi cation and search purposes.
Their unrestricted nature, however, creates di culties,
including the fact that the same concept can be represented by
di erent hashtags, hindering the search process of a concept
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. For example, tweets related to the Monaco Formula
1 Grand Prix can be searched using #monacoGP,
#monacoF1GP or #monacoF1 retrieving di erent results. Hashtag
recommendation aims at identifying suitable hashtags a user
can include in his tweet to reduce the space of tags generated
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and facilitate the ease with which he and other users can
locate the corresponding tweet.
      </p>
      <p>
        Given that (i) the scope of this paper is to validate the
importance of considering a text complexity signal to enhance
1We empirically veri ed that among numerous supervised
techniques, logistic regression was the most promising one.
2Flesch estimates the readability of a text/tweet t, by
examining its length and the average length of terms in t.
a recommendation task and (ii) multiple and increasingly
complex systems have been developed for hashtag
recommendation [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], we base our study on an existing framework for
hashtag recommendation presented in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Given a tweet T ,
the proposed framework identi es existing hashtags to
recommend by following two major steps: (1) generate candidate
hashtags by recommending hashtags present in similar tweets,
using tf-idf based cosine similarity and (2) rank hashtags
from retrieved candidate tweets using di erent strategies.
The strategies presented in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] include:
      </p>
      <p>Similarity. Prioritizes hashtags included on tweets
that have the closes similarity to T , as estimated using
the well-known tf-idf and cosine similarity measure.
Global popularity. Prioritizes hashtags based on
their respective frequency of occurrence on Twitter.
Local popularity. Prioritizes hashtags based on their
frequencies of occurrence among the tweets retrieved
in response to T .</p>
      <p>We enhance the proposed strategies by taking advantage
of TweetRead, as follows:</p>
      <p>TweetRead. Prioritizes candidate hashtags that have
the same or similar text complexity (estimated using
TweetRead) with respect to T .</p>
      <p>PopularityTweetRead. Prioritizes hashtags based
on their frequencies of occurrence among tweets whose
readability level is estimated to match T 's.</p>
      <p>SimilarityTweetRead. Prioritizes candidate
hashtags based on their respective ranking scores computed
using Similarity only on tweets whose readability level
is estimated to match T 's .</p>
    </sec>
    <sec id="sec-4">
      <title>4. INITIAL ASSESSMENT</title>
      <p>In this section, we discuss an initial evaluation on
TweetRead, as well as its applicability for suggesting hashtags.</p>
      <p>
        TweetRead. Given that readability of social content is
an unexplored area, benchmark datasets that can be used
for evaluation purposes are unavailable. For this reason, we
built our own dataset. We initially gathered 172M tweets
over an 8-month period using Twitter streaming API. For
the purpose of this experiment we assume that the age of
people exactly corresponds to their readability level, and that
each tweet written by a user will have the same readability
level as its author. With that in mind, we followed the
framework presented in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], which examines patterns such
as \happy xth birthday", for determining the age of Twitter
users. In doing so, we eliminated from our dataset, users
(and their corresponding tweets) from whom age could not
be determined. Thereafter, we grouped labeled tweets into
6 age groups, which translates into a uniformly distributed
dataset of 22k tweets with their corresponding readability
levels. We followed a 10-cross-fold validation strategy and
measured the accuracy of the predicted readability levels
with respect to the ground truth. As shown in Table 1,
TweetRead signi cantly outperforms the baselines considered
for this assessment: Flesch [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and Spache [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], which are two
well-known, traditional readability measures. The reported
results demonstrate the need for readability strategies that
examine information beyond standard text analysis, if they
are meant to be successfully used in the social networking
context.
      </p>
      <p>
        Hashtag recommendation. For evaluating the
strategies for hashtag recommendation presented in Section 3, we
used the aforementioned dataset. We treated the hashtag
of each corresponding tweet as the ground truth. In other
words, for each tweet T , we generated the corresponding
top-N hashtag recommendations and considered relevant the
ones matching the hashtags in T . As in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], we used the recall
measure to evaluate performance and determine to which
extend the correct hashtags were recommended within the
top N generated suggestions. As shown in Figure 1, even if
readability on its own is not a su cient factor to suggest
hashtags, when combined in-tandem with other content-based
and/or popularity strategies, it leads to the improvement of
the overall hashtag recommendation process.
5.
      </p>
    </sec>
    <sec id="sec-5">
      <title>CONCLUSION AND FUTURE WORK</title>
      <p>In this paper, we presented TweetRead, a novel readability
assessment tool speci cally designed to predict the readability
of tweets. We also discussed the initial study conducted
to demonstrate the bene t of using a readability signal in
the hashtag recommendation task, which yielded promising
results. In the future, we plan to explore other applications of
readability in social networks, such as user recommendation,
advertisement targeting or re-tweet prediction. We will also
explore techniques to further enhance TweetRead and adapt
it to other social networks beyond Twitter.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>R.</given-names>
            <surname>Flesch</surname>
          </string-name>
          .
          <article-title>A new readability yardstick</article-title>
          .
          <source>Journal of Applied Psychology</source>
          ,
          <volume>32</volume>
          (
          <issue>3</issue>
          ):
          <fpage>221</fpage>
          ,
          <year>1948</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>F.</given-names>
            <surname>Godin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Slavkovikj</surname>
          </string-name>
          , W. De Neve,
          <string-name>
            <given-names>B.</given-names>
            <surname>Schrauwen</surname>
          </string-name>
          , and R. Van de Walle.
          <article-title>Using topic models for twitter hashtag recommendation</article-title>
          .
          <source>In WWW</source>
          , pages
          <volume>593</volume>
          {
          <fpage>596</fpage>
          . ACM,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D. J.</given-names>
            <surname>Levinson</surname>
          </string-name>
          .
          <article-title>A conception of adult development</article-title>
          .
          <source>American psychologist</source>
          ,
          <volume>41</volume>
          (
          <issue>1</issue>
          ):
          <fpage>3</fpage>
          ,
          <year>1986</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>G.</given-names>
            <surname>Spache</surname>
          </string-name>
          .
          <article-title>A new readability formula for primary-grade reading materials</article-title>
          .
          <source>The Elementary School Journal</source>
          ,
          <volume>53</volume>
          (
          <issue>7</issue>
          ):
          <volume>410</volume>
          {
          <fpage>413</fpage>
          ,
          <year>1953</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>E.</given-names>
            <surname>Zangerle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Gassler</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Specht. Recommending</surname>
          </string-name>
          #
          <article-title>-tags in twitter</article-title>
          .
          <source>In SASWeb</source>
          <year>2011</year>
          , volume
          <volume>730</volume>
          , pages
          <fpage>67</fpage>
          {
          <fpage>78</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Liu</surname>
          </string-name>
          .
          <article-title>Your age is no secret: Inferring microbloggers' ages via content and interaction analysis</article-title>
          .
          <source>In AAAI ICWSM</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>