<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Topical Community Detection: an Embedding User and Content Similarity Method</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Thi Bich Ngoc Hoang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institut de Recherche en Informatique de Toulouse, UMR5505 CNRS, Universite ́ de Toulouse</institution>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Economics, the University of Danang</institution>
          ,
          <country country="VN">Vietnam</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Community detection aims at partitioning a network into subgroups of densely connected nodes. While many approaches focus on community detection based on users' relationships, the latter may be not effectively enough since some communities may be topic dependent. In this paper, we propose a method that detects communities by considering users and their topics. More specifically, our approach combines cues extracted from the users' exchanges and the ones extracted from their posts. The data collection and the evaluation measures we intend to apply our method on are also presented in this paper. Yet, evaluation is not included in this paper.</p>
      </abstract>
      <kwd-group>
        <kwd>Social Media Analysis</kwd>
        <kwd>Community Detection</kwd>
        <kwd>Embedding User</kwd>
        <kwd>Content Similarity</kwd>
        <kwd>Twitter</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Community detection aims at partitioning a network into subgroups of densely
connected users [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] with similar interest, background or purpose. Community detection is
an important topic to discover the complex structure of social networks, and is applied
in several fields such as biology, sociology and computer science.
      </p>
      <p>
        It has been widely applied in several domains such as influence analysis [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ],
bibliometry [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ], network security [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], and criminology [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>It is also applied in social network analysis. A social network can be represented
as a network composed of nodes and links, where the nodes denote the users, and the
edges denote the relationships between these users.</p>
      <p>In that context, community detection aims at grouping similar users into clusters,
where users within a group tend to be more similar as compared to nodes outside the
group.</p>
      <p>
        Since community is originally determined based on linkage structure, previous
community detection methods tend to purely consider the network’s topology [
        <xref ref-type="bibr" rid="ref1 ref21 ref9">1,9,21</xref>
        ].
However, this information is not satisfactory in accurately defining the community
membership because the topology is often sparse and noisy [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ]. Other studies
considered only the content to identify groups of users [
        <xref ref-type="bibr" rid="ref22 ref24">22,24</xref>
        ] but the results are not highly
“Copyright c 2020 for this paper by its authors. Use permitted under Creative Commons
License Attribution 4.0 International (CC BY 4.0).”
convincing since the inappropriate content attributes could miss-lead the process of
community detection or some communities may be topic dependent. A few studies
investigated combining users’ link and users’ content [
        <xref ref-type="bibr" rid="ref15 ref20">15,20</xref>
        ]. The authors consider each
user as a node and the user’s post content as the attributes of the nodes under the form of
keywords. The authors then use a single assignment clustering method to detect
communities. The applicability of these methods is limited: as each node can belong to a
single community only, these methods cannot detect overlapping communities.
      </p>
      <p>In this paper, we take on a method to identify communities in social networks that
considers both users’ interaction and their message content while using light computing.
The users’ interaction is defined based on the retweet action while the users’ messages
are considered in term of the semantic similarity. We integrate these two factors to
detect communities. Our current work is to evaluate this method, thus results could not
be included in this paper.</p>
      <p>The rest of the paper is organized as follows: Section 2 presents the related work.
Section 3 describes three proposed approaches to identify communities in social
networks. Section 4 presents the experiment scenario, the data, and evaluation measures
that could be used to evaluate this scenario. Finally, Section 5 concludes this paper.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related work</title>
      <p>
        Most of recent work in social community detection focused on users’ interaction [
        <xref ref-type="bibr" rid="ref1 ref21 ref9">1,9,21</xref>
        ]
while very few of the studies consider the users’ message content [
        <xref ref-type="bibr" rid="ref22 ref24">22,24</xref>
        ] or combine
these two factors to identify groups in social networks [
        <xref ref-type="bibr" rid="ref15 ref19 ref20">15,19,20</xref>
        ]. We only report
studies that consider these both factors since they are comparatively few and new, and are
the more related to our work.
      </p>
      <p>
        Ruan et al. [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] extract communities by combing link strength and content
similarity in graph structure. Link strength is measured based on whether the link is likely to
reside within a community with high probability while content similarity is estimated
through cosine similarity. They first create content edges among nodes, then sample the
union of link edges and content edges with bias, retaining only edges that are relevant in
local neighbor hood. Finally, they partition the simplified graph into clusters. Also
integrating topology and content, Qui et al. [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] introduced a method of identifying social
communities under the framework of non-negative matrix factorization. Their method
uses adjacency to represent the network connectivity, then associates the
corresponding semantic description of each community by adding an attribute to each node. This
attribute corresponds to a keyword extracted from the content . The authors assume
that the description for the same community should be semantically similar while the
description among different communities should be different.
      </p>
      <p>
        zhao et al. [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] proposed an approach to identify the topical opinion leader in
social community question answering by combining the topic sensitive influence and the
topical knowledge expertise. To measure the true topical influence of users, the authors
incorporated the network structure, the topic interest similarity and the topical
knowledge to measure the true topical influence. In their method, they infer each user’s topic
interest and knowledge authority from past posts. They confirmed the existence of
homophily which implies that a user follows another having the similar topic of interest.
To measure the topical knowledge expertise, the authors employ the topic-relevant
metrics that accounts for knowledge capacity, satisfaction and contribution.
      </p>
      <p>
        Surian et al. [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] identified communities on Twitter network by either considering
the tweet content or considering the follow relationship among users. They used Latent
Dirichlet Allocation (LDA) method to infer the topics from tweets and used Louvain
method to detect communities based on following interaction. They then measured the
alignment between topics and communities for the users who were part of the largest
connected component. They concluded that there are clear differences in the distribution
of topics across communities defined by the follower network. The authors considered
only the largest community detected by the two methods; thus their conclusion may not
be relevant for the other communities.
      </p>
      <p>Different from the above studies, we suggest to identify communities in social
network by considering both the embedding user (retweet relationship among users) and
the semantic similarity of the users’ message content.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Proposed method</title>
      <p>In this paper, we address the problem of community detection in social networks by
considering both the embedding users relationship and embedding users’ message content.</p>
      <p>
        In this approach, each user is considered as a node. We first detect users who
resent a post from other users and consider these interactions as relationships between
users (edges between nodes). In social networks, most of users re-post messages from
their friends (which appear on their timeline) when they agree with the message
content or find the messages interesting. These re-posting messages may be extensions of
or same as the original ones. The re-posts not only show indirect user relationships but
also correspond to high semantic similarity in their content. We thus hypothesize that
using the re-post interactions can help detecting communities with similar interest or
background. In the next step, we use a community detection method to detect
communities. Several community detection methods can be used in this purpose such as
InfoMap [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], Label Propagation [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], Leading Eigenvector [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], Louvain [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ],
Spinglass [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], or Walktrap [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
      </p>
      <p>Whatever the social media considered, each user, either the user who writes an
original message or the one who resent it, is a node. The approach is then implemented
in the two following steps:
– We identify edges based on the resent relationship. If the user A resent a user B’s
post then we add an edge between the user A and the user B.
– We use traditional community detection methods to detect communities for above
identified nodes and edges.</p>
      <p>The result will be the number of communities and the members of each community.
The approach will be evaluated using the data set and metrics that are described in the
next section.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Evaluation framework</title>
      <p>To evaluate the proposed approach, we developed an evaluation scenario that we will
run on a tweet data set that we built. We will make this data set available to the research
community on demand. Moreover, we will use usual metrics for community detection
evaluation as presented in this section.
4.1</p>
      <sec id="sec-4-1">
        <title>Data set</title>
        <p>
          The data set we will use includes 20,000 retweets extracted from the 1 percent of tweets
collected during the second week of January 2017 by IRIT, France within a spam
detection project [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]. Each tweet in this data set is composed of several pieces of
information regarding a twitter’s post such as the author of the tweet, the content of
the tweets and other objects. These 20; 000 retweets in our collection were created by
30; 271 users.
4.2
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>Evaluation measures</title>
        <p>
          In community detection, algorithms are compared either on their efficiency (time taken
to partition the network) or effectiveness (how relevant the extracted communities are)
or both. With regard to effectiveness, various measures are used [
          <xref ref-type="bibr" rid="ref11 ref3 ref6">11,3,6</xref>
          ]; among this
we will apply two well-known and widely used measures which are Modularity and
Normalized Mutual Information as well as the newly defined f -divergence-based
metric [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
        <p>
          Modularity [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] is used to measure the difference of fraction of the edges that fall
within communities and expected number of edges in a random graph:
Modularity =
1
        </p>
        <p>å (Axy
2M xy
dxdy )d(cx; cy)
2M
where x and y are nodes, M is the number of edges in the network, dx and dy are
the degrees of x and y respectively; d(cx,cy) equal to 1 when x and y belong to the same
community and 0 in the other case.</p>
        <p>
          Normalized Mutual Information (NMI) [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] is the measure used to evaluate the
similarity between two partitions X,Y. The measure is:
        </p>
        <p>NMI(X ;Y ) =</p>
        <p>2 åic=X1 åcjY=1 Ni jlog( NNii: NjN: j )
åic=X1 Ni:log( NNi: ) + åcjY=1 N: jlog( NN: j )
where Nij is the number of nodes in the community i (in X) that appear in the
partition j (in Y); cx and cy are the number of communities in X and the number of
communities in Y respectively; Ni. is the sum over row i of matrix Nij; N.j is the sum
over column j.</p>
        <p>
          IRIT, UMR CNRS 5505, Universite´ de Toulouse, France
Accordingly, if the communities in X match with the communities in Y then NMI
index is equal to 1; if the communities in X are totally different from the communities
in Y then the NMI index is 0; otherwise the amount will be in the range from 0 to 1.
f -divergence based metric [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]
        </p>
        <p>MDc2 (X ;Y ) = 1</p>
        <p>
          The results of our method will be compared to the state of art [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ] described in the
related work section 2. For this, we will first re-implement the state of the art methods
and applied them to our data set to obtain fair baselines.
5
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>In this paper, we propose a new approach to detect communities in social network
considering both embedding users and messages content. We described the approach as
well as the experiment scenario to evaluate the method on a real tweet collection. We
have also described the evaluation metrics we will use to evaluate our approach and
compare it to related work.</p>
      <p>We are implementing the evaluation scenario and expect the result yields in the very
near future. We are also designing variants of the proposed approach.</p>
      <p>Acknowledgement. This work has been performed in the context of the
PREVISION project, which has received funding from the European Union’s Horizon 2020
research and innovation programme (H2020-SU-SEC-2018 ) under grant agreement No
833115. The paper reflects only the authors’ view and the Commission is not
responsible for any use that may be made of the information it contains (https://cordis.
europa.eu/project/id/833115).</p>
      <p>Ethical issue. While detecting communities from social media raises ethical issues,
they are beyond the scope of this paper.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Ball</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karrer</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Newman</surname>
            ,
            <given-names>M.E.</given-names>
          </string-name>
          :
          <article-title>Efficient and principled method for detecting communities in networks</article-title>
          .
          <source>Physical Review E</source>
          <volume>84</volume>
          (
          <issue>3</issue>
          ),
          <volume>036103</volume>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Blondel</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guillaume</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lambiotte</surname>
          </string-name>
          , R.:
          <article-title>Fast unfolding of communities in large networks</article-title>
          .
          <source>Journal of Statistical Mechanics: Theory and Experiment</source>
          <year>2008</year>
          (
          <year>October 2008</year>
          ). https://doi.org/10.1088/
          <fpage>1742</fpage>
          -
          <lpage>5468</lpage>
          /
          <year>2008</year>
          /10/P10008
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Danon</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Diaz-Guilera</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Duch</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Arenas</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Comparing community structure identification</article-title>
          .
          <source>Journal of Statistical Mechanics: Theory and Experiment</source>
          <year>2005</year>
          (
          <volume>09</volume>
          ),
          <source>P09008</source>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Fortunato</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Community detection in graphs</article-title>
          .
          <source>Physics Reports</source>
          <volume>486</volume>
          ,
          <fpage>75</fpage>
          -
          <lpage>174</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Gao</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ma</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
          </string-name>
          , A.Y.,
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>H.H.</given-names>
          </string-name>
          , et al.:
          <article-title>Community detection in degree-corrected block models</article-title>
          .
          <source>The Annals of Statistics</source>
          <volume>46</volume>
          (
          <issue>5</issue>
          ),
          <fpage>2153</fpage>
          -
          <lpage>2185</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Haroutunian</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mkhitaryan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mothe</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>f-Divergence Measures for Evaluation in Community Detection (regular paper)</article-title>
          .
          <source>In: Workshop on Collaborative Technologies and Data Science in Smart City Applications (CODASSCA</source>
          <year>2018</year>
          ). pp.
          <fpage>137</fpage>
          -
          <lpage>145</lpage>
          . AUA NEWSROOM (American University of Armenia, affiliated with the University of California)), http://newsroom.aua.am/ (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Haroutunian</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mkhitaryan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mothe</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>A New Information-Theoretical Distance Measure for Evaluating Community Detection Algorithms</article-title>
          .
          <source>Journal of Universal Computer Science</source>
          <volume>25</volume>
          (
          <issue>8</issue>
          ),
          <fpage>887</fpage>
          -
          <lpage>903</lpage>
          (
          <year>2019</year>
          ), http://www.jucs.org/jucs_25_8/a_new_information_ theoretical/jucs_25_
          <fpage>08</fpage>
          _
          <fpage>0887</fpage>
          _0903_haroutunian.pdf
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. Karatas¸, A., S¸ ahin, S.:
          <article-title>Application areas of community detection: A review</article-title>
          .
          <source>In: 2018 International Congress on Big Data, Deep Learning and Fighting Cyber Terrorism (IBIGDELFT)</source>
          . pp.
          <fpage>65</fpage>
          -
          <lpage>70</lpage>
          . IEEE (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Karrer</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Newman</surname>
            ,
            <given-names>M.E.</given-names>
          </string-name>
          :
          <article-title>Stochastic blockmodels and community structure in networks</article-title>
          .
          <source>Physical review E</source>
          <volume>83</volume>
          (
          <issue>1</issue>
          ),
          <volume>016107</volume>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          , Cheng,
          <string-name>
            <given-names>X.</given-names>
            ,
            <surname>Su</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          :
          <article-title>Community-based seeds selection algorithm for location aware influence maximization</article-title>
          .
          <source>Neurocomputing</source>
          <volume>275</volume>
          ,
          <fpage>1601</fpage>
          -
          <lpage>1613</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Mothe</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mkhitaryan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haroutunian</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Community detection: Comparison of state of the art algorithms</article-title>
          .
          <source>In: 2017 Computer Science and Information Technologies (CSIT)</source>
          . pp.
          <fpage>125</fpage>
          -
          <lpage>129</lpage>
          . IEEE (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Newman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Finding community structure in networks using the eigenvectors of matrices</article-title>
          .
          <source>Phys. Rev. E</source>
          <volume>74</volume>
          (
          <issue>036104</issue>
          ) (
          <year>September 2006</year>
          ). https://doi.org/10.1103/PhysRevE.74.036104
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Newman</surname>
            ,
            <given-names>M.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Girvan</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Finding and evaluating community structure in networks</article-title>
          .
          <source>Physical review E</source>
          <volume>69</volume>
          (
          <issue>2</issue>
          ),
          <volume>026113</volume>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Pons</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Latapy</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Computing communities in large networks using random walks</article-title>
          .
          <source>Lecture Notes in Computer Science</source>
          , Springer 3733 (
          <year>2005</year>
          ). https://doi.org/10.1007/1156959631
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Qin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jin</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lei</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gabrys</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Musial-Gabrys</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Adaptive community detection incorporating topology and content in social networks</article-title>
          .
          <source>Knowledge-Based Systems 161</source>
          ,
          <fpage>342</fpage>
          -
          <lpage>356</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Raghavan</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Albert</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kumara</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Near linear time algorithm to detect community structures in large-scale networks</article-title>
          .
          <source>Physical Review E</source>
          <volume>76</volume>
          (
          <issue>036106</issue>
          ) (
          <year>September 2007</year>
          ). https://doi.org/10.1103/PhysRevE.76.036106
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Reichardt</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bornholdt</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Statistical mechanics of community detection</article-title>
          .
          <source>Phys. Rev. E</source>
          <volume>74</volume>
          (
          <issue>016110</issue>
          ) (
          <year>March 2006</year>
          ). https://doi.org/10.1103/PhysRevE.74.016110
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Rosvall</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bergstrom</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Maps of random walks on complex networks reveal community structure</article-title>
          .
          <source>PNAS</source>
          <volume>105</volume>
          (
          <issue>4</issue>
          ),
          <fpage>1118</fpage>
          -
          <lpage>1123</lpage>
          (
          <year>July 2007</year>
          ). https://doi.org/10.1073/pnas.0706851105
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Ruan</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fuhry</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parthasarathy</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Efficient community detection in large networks using content and links</article-title>
          .
          <source>In: Proceedings of the 22nd international conference on World Wide Web</source>
          . pp.
          <fpage>1089</fpage>
          -
          <lpage>1098</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Surian</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nguyen</surname>
            ,
            <given-names>D.Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kennedy</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , Johnson,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Coiera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Dunn</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.G.</surname>
          </string-name>
          :
          <article-title>Characterizing twitter discussions about hpv vaccines using topic modeling and community detection</article-title>
          .
          <source>Journal of medical Internet research</source>
          <volume>18</volume>
          (
          <issue>8</issue>
          ),
          <year>e232</year>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cui</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pei</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Community preserving network embedding</article-title>
          .
          <source>In: Thirty-first AAAI conference on artificial intelligence</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jin</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cao</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
          </string-name>
          , W.:
          <article-title>Semantic community identification in large attribute networks</article-title>
          .
          <source>In: Thirtieth AAAI Conference on Artificial Intelligence</source>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Washha</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qaroush</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sedes</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Leveraging time for spammers detection on twitter</article-title>
          .
          <source>In: Proceedings of the 8th International Conference on Management of Digital EcoSystems</source>
          . pp.
          <fpage>109</fpage>
          -
          <lpage>116</lpage>
          . ACM (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McAuley</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leskovec</surname>
          </string-name>
          , J.:
          <article-title>Community detection in networks with node attributes</article-title>
          .
          <source>In: 2013 IEEE 13th International Conference on Data Mining</source>
          . pp.
          <fpage>1151</fpage>
          -
          <lpage>1156</lpage>
          . IEEE (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jin</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chi</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>Combining link and content for community detection: a discriminative approach</article-title>
          .
          <source>In: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining</source>
          . pp.
          <fpage>927</fpage>
          -
          <lpage>936</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Zhang</surname>
          </string-name>
          , G.,
          <string-name>
            <surname>Jin</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gao</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jiao</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fogelman-Soulie´</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          :
          <article-title>Finding communities with hierarchical semantics by distinguishing general and specialized topics</article-title>
          .
          <source>In: Proceedings of the 27th International Joint Conference on Artificial Intelligence</source>
          . pp.
          <fpage>3648</fpage>
          -
          <lpage>3654</lpage>
          . AAAI Press (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          :
          <article-title>Identifying topical opinion leaders in social community question answering</article-title>
          .
          <source>In: International Conference on Database Systems for Advanced Applications</source>
          . pp.
          <fpage>372</fpage>
          -
          <lpage>387</lpage>
          . Springer (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>