<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Relationship Between Paper Authorship Roles and Novelty from a Gender Perspective: Evidence from 81,137</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jiaqi Zeng</string-name>
          <email>zengjq@njust.edu.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yi Zhao</string-name>
          <email>yizhao93@njust.edu.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chengzhi Zhang⋆</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Information Management, Nanjing University of Science and Technology 210094 Nanjing</institution>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <fpage>69</fpage>
      <lpage>75</lpage>
      <abstract>
        <p>The division of labor within a research paper plays a crucial role in fostering efficient collaboration and knowledge innovation. The authors' engagement in different contributions influences the integration of specialized knowledge, the formation of diverse perspectives, and the stimulation of creativity, which in turn impact the novelty level of the paper. However, previous studies have lacked depth in exploring the relationship between paper division of labor and novelty, and have overlooked potential gender differences. This study, based on 81,137 papers from PLOS ONE, investigates the correlation between authors' contributions engagement, contributions engagement of authors of different genders, and paper novelty. The results show that, in the Writing-original draft preparation, Writing-review &amp; editing, Methodology, and Software, a higher proportion of author participation is associated with a higher likelihood of the paper achieving greater novelty. In terms of gender differences, women are more likely to participate in the Investigation, Data curation, Formal analysis, and Writing-original draft preparation, while men tend to play a more prominent role in Supervision, Resources, Funding acquisition, Conceptualization, and Software. Furthermore, the study shows that, regardless of gender, a greater proportion of participation in the Writing-original draft preparation, Writing-review &amp; editing, and Software is significantly associated with higher paper novelty. However, only for male authors, a greater proportion of participation in Methodology, Visualization, and Funding acquisition is associated with higher paper novelty.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Author division of labor</kwd>
        <kwd>Novelty</kwd>
        <kwd>Gender differences</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Scientific collaboration is defined as “a concerted effort by researchers to achieve a common goal of
generating new scientific knowledge” [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. It is widely recognized that scientific collaboration has a
positive impact on academic success [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Scientific collaboration relies heavily on the division of
labor, which effectively integrates scholars’ unique expertise, skills, and research experience,
fostering innovative thinking. Understanding how to achieve high-quality scientific collaboration
through division of labor is of great significance [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Novelty assessment is a crucial aspect of
academic quality evaluation. Previous studies have evaluated paper novelty from multiple
perspectives, using methods such as citation analysis, entity analysis, and semantic analysis [
        <xref ref-type="bibr" rid="ref4 ref5 ref6">4, 5,
6</xref>
        ]. However, these studies primarily focus on novelty assessment and lack in-depth exploration of
the underlying factors influencing paper novelty. Scientific division of labor and collaboration, by
bringing together diverse perspectives and expertise, provide the potential for generating
highnovelty research outcomes [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ]. Contribution engagement, as a crucial indicator of the degree of
author involvement and effort in each contribution, has not been adequately explored in terms of
its impact on paper novelty. Therefore, this paper aims to empirically investigate the potential
relationship between paper division of labor and paper novelty. This study is of great significance
for scientific team formation, promoting a reasonable division of labor in papers, and driving
scientific innovation. We aim to answer the following research question:
      </p>
      <p>RQ1: Is author's participation in different research contributions in papers correlated with paper
novelty?</p>
      <p>
        Previous research on author division of labor based on author contribution statements has
revealed significant differences in division of labor among scholars of different genders. Haeussler
et al., based on data from 12,964 papers from PLOS ONE, found that women are more likely to
participate in experimental rather than conceptual activities [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Research on gender disparities in
novelty, such as that conducted by Liu et al., has found that biomedical doctoral dissertations
written by women exhibit lower scientific novelty compared to those written by men [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. While
previous studies have documented significant differences in division of labor between male and
female scholars, and have observed differences in the novelty of their publications, none have
delved into the question of whether gender disparities in division of labor influence the novelty of
research papers. Understanding whether the division contribution engagement of authors of
different genders is related to paper novelty is essential for investigating gender differences in
academia and uncovering gender-related factors in the creation of high-novelty papers. Therefore,
this paper further explores the following research question:
      </p>
      <p>RQ2: Is the research contribution engagement of authors of different genders correlated with
paper novelty?</p>
    </sec>
    <sec id="sec-2">
      <title>2. Methodology</title>
      <p>
        PLOS ONE is a multidisciplinary open access journal that supports the development of knowledge
dissemination. Therefore, this study selects articles published in PLOS ONE as the research data,
retrieving a total of 124,688 papers published between 2016 and 2024 from the PLOS ONE journal
website0. Focusing on gender differences in the division within collaborations, we exclude 1,860
single-authored papers. Author gender identification is conducted using the Genderize.io tool0, and
papers with incomplete gender identification are removed (11,985 papers), leaving 110,843 papers
for analysis. Lin et al. developed the SciSciNet dataset, which encompasses over 134 million
scientific publications and millions of external links related to funding and public uses, providing
metrics such as paper novelty [10]. We use the paper DOIs to match PLOS ONE papers with
SciSciNet records, obtaining the tail novelty metric (Atyp_10pct_Z), developed by Uzzi et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
This metric measures novelty based on the commonality of co-cited journal pairs in the references.
A lower Atyp_10pct_Z value indicates higher paper novelty. We successfully match 81,137 papers
from 2016 to 2021.
      </p>
      <p>Since 2016, PLOS ONE has adopted the CRediT contribution taxonomy, encompassing 14
research contributions 0. Based on 81,137 PLOS ONE papers, this study investigates the relationship
between the author participation rate in each contribution (as shown in Formula 1), the male
author participation rate (as shown in Formula 2), the female author participation rate (as shown in
Formula 3), in relation to the novelty of the paper (Atyp_10pct_Z). The regression model controls
for team size (Teamsize), the proportion of contribution categories involved in each paper
(All_Contribution_per, as shown in Formula 4), the publication year (Fixed year), the proportion of
female/male authors in the paper (Per_f/Per_m).</p>
      <p>P_i=Contribution_i_autho rs (1)</p>
      <p>Total_authors
P_m_i=Contribution_i_male_authors (2)</p>
      <p>Total_authors
P_f_i=Contribution_i_female_authors (3)</p>
      <p>Total_authors
All_Contribution_per=CRediT_contributions (4)
14
Where i refers to one of the 14 contributions defined in the CRediT taxonomy: 'Conceptualization',
'Data curation', 'Formal analysis', 'Funding acquisition', 'Investigation', 'Methodology', 'Project
Administration', 'Resources', 'Software', 'Supervision', 'Validation', 'Visualization', 'Writing-original
0 https://journals.plos.org/plosone/
0 https://genderize.io/
0 https://credit.niso.org/
draft preparation', and 'Writing-review &amp; editing'. Contribution_i_authors represents the number of
authors contributing to i per paper. Total_authors represents the total number of authors per paper.
Contribution_i_male_authors and Contribution_i_female_authors signify the respective counts of
male and female authors contributing to i per paper, and CRediT_contributions represent the
contribution categories of CRediT involved in each paper.
3. Results
This study investigates whether gender-based preferences exist in the division of labor among
authors of PLOS ONE publications. It explores the relationship between author contribution
engagement and paper novelty, specifically examining whether this relationship differs between
male and female authors.</p>
      <sec id="sec-2-1">
        <title>3.1. Gender Differences in Author Contribution Engagement</title>
        <p>The 81,137 papers from PLOS ONE involve a total of 534,898 authors, with 208,733 female authors
(39%) and 326,165 male authors (61%). Figure 1 presents the author participation rates in
contributions and the difference in participation rates between genders. In Figure 1, All represents
the proportion of authors participating in i to the total number of authors in all papers. F-M
represents the difference in participation rate between male and female authors. It is calculated as
the proportion of female participation in i (i.e., the total number of female authors performing i in
all papers / the total number of female authors) minus the proportion of male participation in i (i.e.,
the total number of male authors performing i in all papers / the total number of male authors).
Here, i belongs to the 14 contributions of CRediT.</p>
        <p>All</p>
        <p>F-M
Resources, Data curation, Investigation, Validation, and Supervision are associated with lower
novelty.</p>
      </sec>
      <sec id="sec-2-2">
        <title>3.2. The Relationship Between Author Contribution Engagement and Paper</title>
      </sec>
      <sec id="sec-2-3">
        <title>Novelty: A Gendered Analysis</title>
        <p>To address RQ2, two additional multiple linear regression analyses (Models 2 &amp; 3 in Table 1) are
conducted, focusing specifically on the relationship between the participation rates of female and
male authors in various contribution roles and paper novelty. The regression results, presented in
Table 1, reveal that a greater proportion of female author participation in Writing-original draft
preparation, Writing-review &amp; editing, and Software contributions is significantly associated with
higher paper novelty. Conversely, a greater proportion of female author participation in Resources,
Data curation, Formal analysis, and Investigation is associated with lower paper novelty. For male
authors, a greater proportion of participation in Visualization, Writing-original draft preparation,
Writing-review &amp; editing, Methodology, Software, and Funding acquisition is significantly
associated with higher paper novelty. However, a greater proportion of male author participation
in Resources, Data curation, Investigation, Validation, and Supervision is associated with lower
paper novelty.</p>
        <p>The findings of this section not only confirm the correlation between author participation rates
in different contribution roles and paper novelty, but also reveal the differences in the relationship
between the participation rates of authors of different genders and paper novelty. Comparing the
relationship between male and female author participation rates and paper novelty, it is found that
only for males, participation in Visualization, Methodology, and Funding acquisition contributions
is positively correlated with paper novelty. This finding provides a new research perspective for
exploring the potential relationship between author gender and innovative outputs in scientific
collaboration.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>4. Conclusion and future works</title>
      <p>This study investigates the relationship between gender differences in author division of labor and
paper novelty. Our findings indicate that women are more likely than men to participate in
contribution roles related to analysis, data curation, and writing. Men, on the other hand, are more
likely to participate in contribution roles involving Supervision, Funding acquisition, and
Conceptualization of the paper. The results show that papers with a higher proportion of authors</p>
      <p>VARIABLES
Per_m
Teamsize
All_Contribution_per
P_m_conceptualization
P_m_resources
P_m_visualization
P_m_ writing-original
draft preparation
P_m_writing-review &amp;
editing
P_m_data curation
P_m_formal analysis
P_m_investigation
P_m_methodology
P_m_software
P_m_validation
P_m_funding acquisition
P_m_project
administration
P_m_supervision
Fixed year
Constant
Observations
R-squared
participating in the Writing-original draft preparation, Writing-review &amp; editing, Methodology,
and Software contributions are more likely to exhibit higher novelty. A greater proportion of both
male and female authors’ participation in Writing-original draft preparation, Writing-review &amp;
editing, and Software contributions is significantly associated with higher paper novelty.
Additionally, for male authors, a greater proportion of participation in Visualization, Methodology,
and Funding acquisition is significantly associated with higher paper novelty. The findings of this
study provide reference suggestions for optimizing the division of labor in research teams to
achieve the production of highly novel papers.</p>
      <p>
        This study’s data is confined to PLOS ONE publications. Future research could extend this
analysis to other academic journals, investigating the relationship between author contribution
engagement and paper novelty across different publication venues. Additionally, this study
employs the Atyp_10pct_Z metric, developed by Uzzi et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], to assess paper novelty. Future
research could explore this relationship using alternative novelty metrics. Furthermore, while this
study identifies a correlation between author contribution engagement and paper novelty, and
observes differences between genders, future research could delve into the causal relationships
underlying this connection between division of labor engagement and paper novelty.
      </p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgements</title>
      <p>The paper is presented at the second Workshop on “Innovation Measurement for Scientific
Communication (IMSC) in the Era of Big Data” at 2024 ACM/IEEE Joint Conference on Digital
Libraries (JCDL). This work was partially supported by the National Natural Science Foundation of
China (No. 72074113).</p>
    </sec>
    <sec id="sec-5">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used GPT-4 in order to correct grammatical
errors, typos, and other writing mistakes. After using this tool, the authors reviewed and edited the
content as needed and takes full responsibility for the publication’s content.
[10] Z. Lin, Y. Yin, L. Liu, D. Wang, SciSciNet: A large-scale open data lake for the science of
science research, Scientific Data, 10 (2023) 315.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.S.</given-names>
            <surname>Katz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.R.</given-names>
            <surname>Martin</surname>
          </string-name>
          , What is research collaboration? , Research Policy,
          <volume>26</volume>
          (
          <year>1997</year>
          )
          <fpage>1</fpage>
          -
          <lpage>18</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>H.</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Ao</surname>
          </string-name>
          , Y. Cheng,
          <article-title>The continuity and citation impact of scientific collaboration with different gender composition</article-title>
          ,
          <source>Journal of Informetrics</source>
          ,
          <volume>16</volume>
          (
          <year>2022</year>
          )
          <fpage>101248</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.Y.</given-names>
            <surname>Ahn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Ding</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , D. Ma,
          <article-title>Co‐contributorship network and division of labor in individual scientific collaborations</article-title>
          ,
          <source>Journal of the Association for Information Science and Technology</source>
          ,
          <volume>71</volume>
          (
          <year>2020</year>
          )
          <fpage>1162</fpage>
          -
          <lpage>1178</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>B.</given-names>
            <surname>Uzzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mukherjee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stringer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <article-title>Atypical combinations and scientific impact</article-title>
          ,
          <source>Science</source>
          ,
          <volume>342</volume>
          (
          <year>2013</year>
          )
          <fpage>468</fpage>
          -
          <lpage>472</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S.</given-names>
            <surname>Shibayama</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Yin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Matsumoto</surname>
          </string-name>
          ,
          <article-title>Measuring novelty in science with word embedding</article-title>
          ,
          <source>PloS one</source>
          ,
          <volume>16</volume>
          (
          <year>2021</year>
          )
          <article-title>e0254034</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Luo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Combination of research questions and methods: A new measurement of scientific novelty</article-title>
          ,
          <source>Journal of Informetrics</source>
          ,
          <volume>16</volume>
          (
          <year>2022</year>
          )
          <fpage>101282</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>C.</given-names>
            <surname>Haeussler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Sauermann</surname>
          </string-name>
          ,
          <article-title>Division of labor in collaborative knowledge production: The role of team size and interdisciplinarity</article-title>
          , Research Policy,
          <volume>49</volume>
          (
          <year>2020</year>
          )
          <fpage>103987</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Franceschet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Costantini</surname>
          </string-name>
          ,
          <article-title>The effect of scholar collaboration on impact and quality of academic papers</article-title>
          ,
          <source>Journal of informetrics</source>
          ,
          <volume>4</volume>
          (
          <year>2010</year>
          )
          <fpage>540</fpage>
          -
          <lpage>553</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. J.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Ding</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y. Bu,</surname>
          </string-name>
          <article-title>The prominent and heterogeneous gender disparities in scientific novelty: Evidence from biomedical doctoral theses</article-title>
          ,
          <source>Information Processing &amp; Management</source>
          ,
          <volume>61</volume>
          (
          <year>2024</year>
          )
          <fpage>103743</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>