<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Differential Analysis on Performance of Scientific Collaborations with the Evolution of Entity Popularity</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Fang Tan†</string-name>
          <email>cathytf@163.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tongyang Zhang</string-name>
          <email>tzhang39@163.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jian Xu </string-name>
          <email>issxj@mail.sysu.edu.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Information Management, Sun Yat-sen University</institution>
          ,
          <addr-line>Guangzhou Guangdong</addr-line>
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <fpage>71</fpage>
      <lpage>73</lpage>
      <abstract>
        <p>In order to investigate the impact of research topic selection time on output performance of scientific collaborations, the aim of this study is to develop a differential analysis framework of scientific collaboration performance at different stages of entity popularity. The framework consists of three main sections: (1) data acquisition and processing; (2) stage division of entity popularity; (3) differential analysis on performance of scientific collaborations at different stages of entities popularity. Our findings show that the popularity stage that research topics are going through can play a role in the collaboration output performance.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Computing in government • Information systems → Information
retrieval
The ultimate success of scientific collaborations depends on a
number of factors, among which the importance of identifying
promising research topics as a key success factor should not be
underestimated [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The selection of a promising research topic can
not only help the scientific collaboration develop a reputation for
having an acute sense of active research domain, but also encourage
the process of scientific discovery scientists and promote the
development of the whole research field [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. However, research on
selecting topics behavior of scientists seldom consider about the
timing issue of topic selection, and most of them only focus on the
development law that underlie an individual's behavior. With more
and more collaborative research studies, scientific collaborations
have gradually replaced individuals as the mainstream research unit.
The study regards bio-entities as research topics to analyze the
evolution of topic popularity in biomedicine related research from
the perspective of entitymetrics. Through analyzing the effect
mechanisms of entity popularity on performance of scientific
collaborations, we provide a theoretical reference for relevant
decision makers in research topic selection and the management of
scientific research project.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Methodology</title>
      <p>The framework of differential analysis on performance of scientific
collaborations in different stages of entity popularity is shown in
Figure 1.
In Figure 1, three functional modules of the analysis framework and
concrete work done in modules are as follow.</p>
      <p>
        Data acquisition and processing. By using BERT [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and Bio
BERT [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], we collect 317 Gene/Protein entities from the title and
abstract of 1,899,671 articles between 1988 and 2017 in PubMed
with author names disambiguated. All cited information of articles
is obtained from Web of Science (WOS).
      </p>
      <p>
        Stage division of entity popularity. After the normalization
processing of entity frequency, we deal with the division of entity
popularity stages based on the model tree proposed by Ma [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] As
seen in Figure 2, k is less than or equal to -0.05 when the entity’s
popularity stage is descending (short for “descending stage”); k is
greater than or equal to 0.05 when the entity’s popularity stage is
ascending (short for “ascending stage”).
      </p>
      <p>Copyright 2021 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).</p>
      <p>Differential analysis on performance of scientific collaborations
in various stages of entity popularity. Through comparing statistics
on the number of published periodical articles and citations of
teams of each scientific collaboration in the ascending and
descending stages, the impact of entity popularity stage that a
collaboration is going through when selecting a research topic on
its performance and the mechanism behind is discussed. For
scientific collaborations studying entities in various popularity
stages, we recognize authors of the same article as a research team.</p>
      <p>Teams in which the number of authors is less than three or over 10,
and groups in which ages of authors are all over 45 are ruled out.</p>
      <p>
        As long as the target entities appear in the title of abstract of an
article, we consider the team has studied the entities. The index of
normalized citations [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] can be calculated as below.
      </p>
      <p>SC=(PC-AC)/SDC #(1)
where, PC denotes the absolute citation of a certain article, AC
denotes the average absolute citation of all articles in the
publication year of the article, SDC denotes the standard deviation
of the absolute citation of all articles in the same year, and SC
denotes the normalized citation (SC).</p>
    </sec>
    <sec id="sec-3">
      <title>3 Preliminary Results</title>
    </sec>
    <sec id="sec-4">
      <title>3.1 Overview of Experimental Data</title>
      <p>Figure 2 shows the evolution of the number of articles of teams in
the ascending and descending stages by year. After the 21st century,
the number of teams conducting research in the ascending stage of
entities popularity has experienced rapid growth while the number
of teams on the other side shows a slightly declining trend.
As shown in Figure 3, either the overall average normalized
citations or the annual average normalized citations of teams in the
ascending stage remains significantly higher than that of teams in
the descending stage. For research outputs of teams in the
ascending stage, the earlier the published year, the more normalized
citations compared to teams in the descending stage. It illustrates
that team research in the ascending stage is more likely to have a
far-reaching academic influence than that in the descending stage.</p>
    </sec>
    <sec id="sec-5">
      <title>CONCLUSION AND FUTURE WORK</title>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>The study proposes a preliminary research design, applying the idea
of entitymetrics to team research performance and engaged in
preliminary differential analysis.</p>
      <p>Results show that: The popularity stage that research topics are
going through can play a role in the research performance of
scientific collaborations
Compared with the descending stage, the ascending stage puts more
positive impacts on collaboration research performance.
For different scientific collaboration modes, the study can be used
as a reference in choosing research topics. For instance, when
selecting topics for research, authors can implement the strategy of
choosing a topic the popularity of which is in its ascending stage as
a way to moderate the negative influence of descending stage on
collaborative performance.</p>
    </sec>
    <sec id="sec-7">
      <title>4.2 Future work</title>
      <p>In the future, differences in other aspects such as personnel
composition of a team will be considered in the study. Furthermore,
an important question waiting to be answered is how sustentation
funding, one of the most important external resources to encourage
team research, distributes in the two types of teams? Therefore, we
will further study other aspects of characteristics of teams at
different stages of topic selection in the future.</p>
    </sec>
    <sec id="sec-8">
      <title>ACKNOWLEDGMENTS</title>
      <p>This work is supported by National Social Science Fund of China
[18BTQ076].</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Ran</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <article-title>Arash Baghaei Lakeh and</article-title>
          et al.
          <year>2020</year>
          .
          <article-title>Examining the characteristics of impactful research topics: A case of three decades of HIV-AIDS research</article-title>
          .
          <source>Journal of Informetrics</source>
          ,
          <volume>15</volume>
          (
          <issue>1</issue>
          ):
          <fpage>101122</fpage>
          . DOI: https://doi.org/10.1016/j.joi.
          <year>2020</year>
          .
          <volume>101122</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Sten</given-names>
            <surname>Andler</surname>
          </string-name>
          .
          <year>1979</year>
          .
          <article-title>Predicate path expressions</article-title>
          .
          <source>In Proceedings of the 6th. ACM SIGACT-SIGPLAN Symposium on Principles of Programming Languages (POPL '79)</source>
          . ACM Press, New York, NY,
          <fpage>226</fpage>
          -
          <lpage>236</lpage>
          . DOI: https://doi.org/10.1145/567752.567774.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Ian</given-names>
            <surname>Editor</surname>
          </string-name>
          (Ed.).
          <year>2007</year>
          .
          <article-title>The title of book one (1st</article-title>
          . ed.).
          <source>The name of the series one</source>
          , Vol.
          <volume>9</volume>
          . University of Chicago Press, Chicago. DOI: https://doi.org/10.1007/3- 540-09237-4.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>David</given-names>
            <surname>Kosiur</surname>
          </string-name>
          .
          <year>2001</year>
          .
          <article-title>Understanding Policy-Based Networking (2nd</article-title>
          . ed.). Wiley, New York, NY.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Guan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>The impact of collaboration and knowledge networks on citations</article-title>
          .
          <source>Journal of Informetrics</source>
          ,
          <volume>11</volume>
          (
          <issue>2</issue>
          ):
          <fpage>407</fpage>
          -
          <lpage>422</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>