<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>On the E ect of Discussions on Pull Request Decisions</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>In: D. Di Nucci, C. De Roover (eds.): Proceedings of the 18th Belgium-Netherlands Software Evolution Workshop</institution>
          ,
          <addr-line>Brussels, Belgium, 28-11-2019, published at</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Mehdi Golzadeh, Alexandre Decan, Tom Mens Software Engineering Lab, University of Mons Mons</institution>
          ,
          <country country="BE">Belgium</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>This research is supported by the joint FNRS / FWO Excellence of Science project SECO-ASSIST and FNRS PDR T.</institution>
          <addr-line>0017.18</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2015</year>
      </pub-date>
      <abstract>
        <p>Open-source software relies on contributions from di erent types of contributors. Online collaborative development platforms, such as GitHub, usually provide explicit support for these contributions through the mechanism of pull requests, allowing project members and external contributors to discuss and evaluate the submitted code. These discussions can play an important role in the decision-making process leading to the acceptance or rejection of a pull request. We empirically examine in this paper 183K pull requests and their discussions, for almost 4.8K GitHub repositories for the Cargo ecosystem. We investigate the prevalence of such discussions, their participants and their size in terms of messages and durations, and study how these aspects relate to pull request decisions.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Index terms| collaborative development, pull
requests, discussions, software repository mining,
empirical analysis1
Today's open source software development is
increasingly relying on third-party contributors. Developers
contribute to di erent projects on online distributed
development platforms like GitHub. The
collaborative nature of software development it an inherently
social phenomenon [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. GitHub embraces this social
nature by extending the traditional git work ow with
collaboration mechanisms such as pull requests (PR)
and commenting. The pull-based development
process [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] constitutes the primary means for integrating
code from thousands of developers. It allows
developers to participate in many projects without having
direct commit access. The primary advantage of a PR
is the decoupling of the development e ort from the
decision to merge the result to the project's codebase.
It helps developers to avoid frequent merge con icts
with other contributors.
      </p>
      <p>
        Through a built-in commenting mechanism, project
integrators can review the code submitted in a PR, and
ask contributors to improve their code, add
documentation and tests before deciding to integrate it [
        <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
        ].
Therefore, the history of commenting activity on a PR
(including all pull request comments and pull request
review comments) provides a valuable source of
information. It enables analysis of who was involved in the
discussion about a PR (e.g. the PR creator, project
integrators, or other contributors). The discussions
that take place between the author of the PR and the
project integrators may play a key role in the ultimate
decision to merge the PR into the code base, if the
concerns raised by the project integrators were properly
addressed or discussed carefully by the PR author.
      </p>
      <p>While many studies have focused on the importance
of having successful PRs [6{9], there is much less
research on understanding the e ect of the presence of
discussions on the decision to accept or reject a PR.
Our research aims to empirically study the relation
between the PR commenting history and the nal PR
decision. As preliminary steps, we focus in this paper
on three research questions:</p>
      <p>RQ1 How prevalent are discussions in PRs? helps
us to determine whether the research goal is
worthwhile to pursue: if there is only a limited number
of PRs with discussions, then we will not be able to
draw statistically signi cant conclusions on their
relation with PR decisions. We show that most PRs
have at least a few comments and a few participants
involved in their discussions, and that the presence of
a discussion is related to the decision. In RQ2 Who
is involved in PR discussions? we identify and group
participants based on their role in a PR. We report
about their combined presence in discussions and
exhibit a relation between a PR decision and the
participants that are involved in its discussion. Finally, in
RQ3 How long are discussions? we measure discussion
length in terms of time and of number of comments
and show how they relate to a PR decision.</p>
      <p>The remainder of this paper is organized as follows.
Section 2 provides the necessary background of studies
related to PRs and comments. Section 3 presents the
data extraction and methodology. Section 4 presents
the preliminary results for the above research
questions. Section 5 discusses the threats to validity of our
study. Section 6 summarises the main ndings and
outlines future work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Background</title>
      <p>
        Distributed software development on shared online
GitHub repositories is very frequently following a
pullbased development process [3{5]. Any contributor can
create forks of a repository, update them locally by
contributing code changes and, whenever ready,
request to have these changes merged back into the main
branch by submitting a PR [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. This pull-based
software development model o ers a distributed
collaboration mechanism that allows developers to contribute
code in a way that makes code changes trackable
and reviewable by version control systems. This
review mechanism has the additional e ect of increasing
awareness of all changes and allows the developer
community to form an opinion about the proposed changes
and the ultimate merge decision [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Many
empirical studies have targeted pull requests from di erent
points of view, including evaluation of PRs through
discussion [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], factors in uencing acceptance or
rejection [
        <xref ref-type="bibr" rid="ref12 ref13 ref8 ref9">8, 9, 12, 13</xref>
        ] and, predicting potential future
contributors [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
      </p>
      <p>
        Moreover, there are studies which analyze the
content of PR to recommend core member to review,
analyze, evaluate and integrate PRs [15{19], recommend
PRs with high priority [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], study the e ect of
geographical location of contributors on evaluation of
PRs [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], and gender bias in PR acceptance or
rejection [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. Some studies targeted code reviews to
study the reasons and impact of confusion in code
reviews [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ], linguistic aspects of code review
comments [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], the impact of continuous integration on
code reviews [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ], the challenges faced by code change
authors and reviewers [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ], how developers perceive
code review quality [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ], how presence of bots and the
e ect of organization and developer pro les on the PR
decision [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Methodology</title>
      <p>
        To carry out our empirical investigation, we need a
dataset containing a large number of repositories and
PRs. The dataset should exclude git repositories that
have been created merely for experimental or personal
reasons, or that only show sporadic traces of activity
and contributions [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. Registries of reusable software
packages (e.g., npm for JavaScript, Cargo for Rust,
or PyPI for Python) are good candidates to nd such
repositories, as they typically host thousands of active
software projects, and as one can expect most of them
to have an associated git repository.
      </p>
      <p>We selected the Cargo package registry for the Rust
programming language, because it contains tens of
thousands of projects, and a large majority of them
(nearly 85%) is being developed on GitHub. As both
Cargo and Rust are quite recent (Rust was introduced
in 2011), they contain a large number of repositories,
even after ltering out those that are inactive in terms
of contributions and discussions related to these
contributions.</p>
      <p>
        We relied on libraries.io data dump to extract the
metadata for more than 15K Cargo packages [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ]. We
ltered out 1,571 packages that did not have any
associated git repository and 413 packages whose
repository is not hosted on GitHub. Not all git
repositories were still available at the time we extracted the
data, and our nal list of repositories is composed of
9,954 candidates. For each of these repositories, we
retrieved using GitHub API its complete list of PRs
and, for each PR, all related comments and PR review
comments. We found that 5,210 repositories did not
have any PRs, hence only 4,744 repositories were
retained for further analysis, accounting for more than
188K PRs.
      </p>
      <p>As our goal is to study the relation between
discussions and PR decisions, we decided to remove all PRs
for which no decision was (yet) taken. Such PRs
represent a small fraction of our dataset (around 2.6%). Our
nal dataset contains more than 183K PRs, submitted
by 13,623 contributors and accounting for nearly 1M
comments.</p>
      <p>For each PR in this dataset, we have access to its
creation date, its decision date, its decision, the
person that made that decision, the author of the PR,
and all the comments that were made, including PR
review comments. It is important to note that the very
rst comment visible in a PR corresponds to the PR
description, and is not considered as a PR comment
in this paper, following the distinction also made by
GitHub. For each comment, we retrieved its creation
date and its owner. We distinguish between four
categories of owners:
1. author corresponds to the contributor submitting
the PR;
2. integrator refers to the person having accepted or
rejected a previous PR in the same project;
3. decider refers to the integrator who accepted or
rejected the PR currently under consideration;
and
4. other corresponds to any other participant (e.g.,
users, bots, external contributors).
4</p>
    </sec>
    <sec id="sec-4">
      <title>Research Results</title>
      <p>RQ1 How prevalent are discussions in PRs?
With this rst research question, we aim to get
insights into the prevalence of discussions in PRs. For
each PR in the dataset, we computed its number of
comments, its number of distinct participants and its
number of comment exchanges between one of the
integrators and the author, i.e., the number of times there
is one comment from an integrator followed by an
answer from the PR author. Fig. 1 shows the proportion
of PRs having at least a given number of comments,
participants, and comment exchanges.
comments
participants
comment exchanges
1.0</p>
      <p>We observe that while 48.8% of all PRs have at least
two comments and 42.4% of all PRs have at least two
participants, only 31.9% of them have comment
exchanges. We also observe that all curves exhibit power
law behaviour: the proportion of PRs is exponentially
decreasing as the required number of comments,
participants or exchanges increases. For instance, around
80% of all PRs have less than 8 comments, 3
participants and 2 comment exchanges.</p>
      <p>Since the presence of comments, participants
and/or comment exchanges could a ect the acceptance
or rejection of a PR, we computed the proportion of
accepted (resp. rejected) PRs that have at least one
comment (has comments ), at least two participants
(has participants ) and at least one comment exchange
(has exchange). Fig. 2 reports on these proportions.
Note that by de nition a comment exchange implies at
least 2 participants, hence we have has exchange =)
has participants =) has comments.
has comments
has participants
has exchange
Accepted</p>
      <p>While we observe that a majority of PRs
(regardless of their decision) have comments, proportionally
more PRs have comments for rejected PRs (72.5%)
than for accepted ones (62.4%). Similar observations
can be made for the other criteria, suggesting a
relation between PR acceptance and the presence of a
comment/participant.</p>
      <sec id="sec-4-1">
        <title>RQ2 Who is involved in PR discussions?</title>
        <p>This research question focuses on the participants that
are involved in PR discussions. We distinguish
between four categories of participants, as explained in
Section 3. For each PR, each participant involved in
the discussion was classi ed in author, integrator,
decider or other. Fig. 3 shows the proportion of PR
discussions in function of the presence of categories of
participants.</p>
        <p>We observe that the author of a
PR is involved in most discussions
(64%=6+12+3+3+3+4+20+13), as is the case
for deciders (62%=11+9+20+12+3+4+1+2)
and integrators (57%=6+9+1+1+3+4+20+13).
Other participants are involved in only 23%
(=2+1+4+3+3+3+1+6) of the discussions. We
observe that the most frequent combinations of
participants involve the author and some integrator/decider.
For instance, the pair composed of author/integrator
is the most frequent one (40%=13+20+4+3) followed
by the pair author/integrator (39%=20+12+4+3).
24% (=20+4) of the discussions involve the author,
an integrator and the decider. 29% (=6+6+11+6) of
all cases involve a single participant only.</p>
        <p>Similar to what was done for RQ1, we grouped PRs
according to their decision, and we computed the
proportion of PRs with respect to the presence of
participants of each category. Fig. 4 reports on these
proportions.
1.0</p>
        <p>We observe some interesting di erences between
accepted and rejected PRs mainly based on the presence
of authors and integrators. 51.4% of rejected PRs
involve the author of that PR and 49.6% involve an
integrator, while for accepted PRs only 39.1% involve
the author and 34.3% involve an integrator. While
integrators are proportionally more involved in rejected
than accepted PRs, the opposite is true when it comes
to the decider of a PR: a decider is involved in 42.6%
of accepted PRs but \only" in 22.0% of the rejected
ones. Finally, when considering all other participants
there is only a slight di erence between accepted PRs
(14.4%) and rejected PRs (17.4%).</p>
      </sec>
      <sec id="sec-4-2">
        <title>RQ3 How long are discussions?</title>
        <p>The last research question focuses on the length of
discussions in terms of number of comments and time
between the rst and last comment. We computed these
two characteristics for discussions having at least 2
comments. These account for 49% of all PRs
considered so far. The results are reported in Fig. 5,
combining a scatter plot and two density plots (one for each
considered characteristic).</p>
        <p>We observe from the density plots that most
discussions have a few comments and last for a short period
of time. For instance, the median number of
comments is 5 and the median duration is 0.7 days. We
observe from the scatter plot a di erence between
discussions in accepted and rejected PRs, both for the
0
20
40</p>
        <p>
          60 80 100 120 140
number of comments
number of comments and the duration. We
statistically compared these distributions by means of
MannWhitney-U tests. The null hypothesis was rejected in
both cases (p &lt; 0:001), indicating a statistically
signi cant di erence between these distributions.
However, we found this di erence to be negligible (Cli 's
delta jdj = 0:025) for the number of comments [
          <xref ref-type="bibr" rid="ref30 ref31">30,31</xref>
          ],
and small (jdj = 0:219) for the duration of these
discussions, indicating a higher duration in rejected PRs
than in accepted ones. For instance, the median
duration is 1.69 days for rejected PRs and 0.6 for accepted
ones.
        </p>
        <p>The two regression lines superposed on the scatter
plot re ect the average time between comments (i.e.,
the ratio between duration and comments). We
computed this ratio for all considered discussions, and we
statistically compared their distributions for accepted
and rejected PRs using a Mann-Whitney-U test. We
found a statistically signi cant di erence between the
two distributions (p &lt; 0:001) and a small e ect size
(jdj = 0:258), indicating a higher discussion ratio in
accepted PRs than in rejected PRs. For instance, the
median average time between comments is 0.08 for
accepted PRs, and 0.26 for rejected PRs.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Threats to Validity</title>
      <p>
        Since our analyses are based on data from git
repositories on GitHub, our results may be exposed to the
usual threats related to mining data from GitHub such
as \a large portion of repositories are not for
software development" and \two thirds of projects are
personal" [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. However, given that our dataset is
composed of git repositories related to Cargo projects, it is
unlikely to be a ected by such threats. On the other
hand, the selection bias induced by our dataset
being exclusively based on repositories related to Cargo
projects is a threat to external validity [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ], since the
results and conclusions cannot be generalized outside
the scope of this study.
      </p>
      <p>
        The main threat to construct validity is that \most
pull requests appear as non-merged even if they are
actually merged" [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ], potentially leading to an
overestimation of the number of rejected PRs to the
detriment of accepted ones. Fully addressing this threat
is not possible, but we could rely on heuristics to
detect whether PR commits are actually part of the main
branch. Such heuristics are likely to change the gures
reported in this paper, but are unlikely to a ect the
ndings we obtained. Indeed, even if some PRs were
wrongly identi ed as non-merged (=rejected), we
already exhibited di erences in PR discussions between
accepted and rejected PRs.
      </p>
      <p>Another threat to construct validity stems from the
presence of bots and contributors with multiple
identities. We mitigated the problem of multiple
identities by relying on GitHub usernames to identify
contributors instead of the \author" eld values. We did
not consider the presence of bots in this work. This
may have led to an overestimation of the number of
comments and participants, but our ndings should
not be signi cantly a ected, assuming that bots
represent only a fraction of the considered comments. In
our future work, we will study heuristics to detect bot
comments in order to take them into account in our
analyses.</p>
      <p>Finally, the lack of distinction between the di erent
types of comments in our dataset represents a threat
to internal validity. Not all comments are equal, but
have been treated as such in this work. We did not
di erentiate based on the size or content of the
comments. Similarly, we did not distinguish between PR
comments and PR review comments, even if they do
not serve the same purpose. Making such distinctions
can potentially lead to di erent results, and will be
explored in future work to gain additional insights.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>In this preliminary research, we empirically studied
183K PRs and their discussions, accounting for around
1M comments. We showed that discussions are
prevalent in PRs and there are proportionally more
comments, participants and comment exchanges for
rejected PRs than for accepted ones. We identi ed and
grouped participants based on their role in a PR, and
showed that a majority of discussions involved the
author, the decider or one of the integrators. We showed
that the presence of these participants is related to PR
decisions.</p>
      <p>Finally, we considered discussion length in terms
of duration and number of comments. We observed
that most discussions have only a few comments and
do not last for long. While we have not found large
di erences between accepted and rejected PRs based
on their number of comments, we found that
discussions in rejected PRs are longer, and that discussions
in accepted PRs are more intense.</p>
      <p>This paper is part of a broader study and our
intention is to gain a deeper understanding of the dynamics
and patterns of discussions in pull requests, and their
impact on PR decisions. Our goal is to provide
techniques and tools to allow the community to perform
better. Reducing the time to make decisions for pull
requests can help the community to encourage better
contributions by reducing the time required to reject
contributions of insu cient quality or relevance, and
by reducing the time to review and accept positive
contributions. Moreover, based on the insights obtained
during this study we aim to develop techniques to
increase the productivity of contributions in terms of
code quality and contribution time.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Laura</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Dabbish</surname>
            , H. Colleen Stuart, Jason Tsay, and
            <given-names>James D.</given-names>
          </string-name>
          <string-name>
            <surname>Herbsleb</surname>
          </string-name>
          .
          <article-title>Social coding in GitHub: transparency and collaboration in an open software repository</article-title>
          .
          <source>In Int'l Conf. Computer Supported Cooperative Work</source>
          , pages
          <volume>1277</volume>
          {
          <fpage>1286</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Tom</given-names>
            <surname>Mens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Marcelo</given-names>
            <surname>Cataldo</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Daniela</given-names>
            <surname>Damian</surname>
          </string-name>
          .
          <article-title>The social developer: The future of software development</article-title>
          .
          <source>IEEE Software</source>
          ,
          <volume>36</volume>
          , January{
          <year>February 2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Georgios</given-names>
            <surname>Gousios</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Martin</given-names>
            <surname>Pinzger</surname>
          </string-name>
          , and Arie van Deursen.
          <article-title>An exploratory study of the pull-based software development model</article-title>
          .
          <source>In International Conference on Software Engineering</source>
          , pages
          <volume>345</volume>
          {
          <fpage>355</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>G.</given-names>
            <surname>Gousios</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Zaidman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Storey</surname>
          </string-name>
          , and Arie van Deursen.
          <article-title>Work practices and challenges in pullbased development: The integrator's perspective</article-title>
          .
          <source>In International Conference on Software Engineering</source>
          , volume
          <volume>1</volume>
          , pages
          <fpage>358</fpage>
          {
          <fpage>368</fpage>
          . IEEE, May
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Georgios</given-names>
            <surname>Gousios</surname>
          </string-name>
          ,
          <string-name>
            <surname>Margaret-Anne Storey</surname>
            , and
            <given-names>Alberto</given-names>
          </string-name>
          <string-name>
            <surname>Bacchelli</surname>
          </string-name>
          .
          <article-title>Work practices and challenges in pull-based development: The contributor's perspective</article-title>
          .
          <source>In International Conference on Software Engineering</source>
          , pages
          <volume>285</volume>
          {
          <fpage>296</fpage>
          . ACM,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Jason</given-names>
            <surname>Tsay</surname>
          </string-name>
          , Laura Dabbish, and
          <string-name>
            <given-names>James</given-names>
            <surname>Herbsleb</surname>
          </string-name>
          .
          <article-title>Let's talk about it: Evaluating contributions through discussion in github</article-title>
          .
          <source>In Proceedings of the 22Nd ACM SIGSOFT International Symposium on Foundations of Software Engineering, FSE</source>
          <year>2014</year>
          , pages
          <fpage>144</fpage>
          {
          <fpage>154</fpage>
          , New York, NY, USA,
          <year>2014</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Olga</given-names>
            <surname>Baysal</surname>
          </string-name>
          , Oleksii Kononenko, Reid Holmes, and
          <string-name>
            <given-names>Michael W.</given-names>
            <surname>Godfrey</surname>
          </string-name>
          .
          <article-title>Investigating technical and non-technical factors in uencing modern code review</article-title>
          .
          <source>Empirical Software Engineering</source>
          ,
          <volume>21</volume>
          (
          <issue>3</issue>
          ):
          <volume>932</volume>
          {
          <fpage>959</fpage>
          ,
          <string-name>
            <surname>Jun</surname>
          </string-name>
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Mohammad</given-names>
            <surname>Masudur Rahman and Chanchal</surname>
          </string-name>
          <string-name>
            <given-names>K.</given-names>
            <surname>Roy</surname>
          </string-name>
          .
          <article-title>An insight into the pull requests of GitHub</article-title>
          .
          <source>In Working Conference on Mining Software Repositories</source>
          , pages
          <volume>364</volume>
          {
          <fpage>367</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Di</given-names>
            <surname>Chen</surname>
          </string-name>
          , Kathryn T. Stolee, and
          <string-name>
            <given-names>Tim</given-names>
            <surname>Menzies</surname>
          </string-name>
          .
          <article-title>Replication can improve prior results: A github study of pull request acceptance</article-title>
          .
          <source>In Proceedings of the 27th International Conference on Program Comprehension, ICPC '19</source>
          , pages
          <fpage>179</fpage>
          {
          <fpage>190</fpage>
          ,
          <string-name>
            <surname>Piscataway</surname>
          </string-name>
          , NJ, USA,
          <year>2019</year>
          . IEEE Press.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Filkov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Devanbu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Vasilescu</surname>
          </string-name>
          .
          <article-title>Wait for it: Determinants of pull request evaluation latency on GitHub</article-title>
          .
          <source>In Working Conference on Mining Software Repositories</source>
          , pages
          <volume>367</volume>
          {
          <fpage>371</fpage>
          , May
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Jason</surname>
            <given-names>Tsay</given-names>
          </string-name>
          , Laura Dabbish, and
          <string-name>
            <given-names>James</given-names>
            <surname>Herbsleb</surname>
          </string-name>
          .
          <article-title>In uence of social and technical factors for evaluating contribution in GitHub</article-title>
          .
          <source>In International Conference on Software Engineering</source>
          , pages
          <volume>356</volume>
          {
          <fpage>366</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Igor</surname>
            <given-names>Steinmacher</given-names>
          </string-name>
          , Gustavo Pinto, Igor Scaliante Wiese, and
          <string-name>
            <surname>Marco</surname>
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Gerosa</surname>
          </string-name>
          .
          <article-title>Almost there: A study on quasi-contributors in open source software projects</article-title>
          .
          <source>In Proceedings of the 40th International Conference on Software Engineering</source>
          , ICSE '
          <volume>18</volume>
          , pages
          <fpage>256</fpage>
          {
          <fpage>266</fpage>
          , New York, NY, USA,
          <year>2018</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Wessel</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Steinmacher</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Wiese</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Gerosa</surname>
          </string-name>
          .
          <article-title>Should i stale or should i close? an analysis of a bot that closes abandoned issues and pull requests</article-title>
          .
          <source>In 2019 IEEE/ACM 1st International Workshop on Bots in Software Engineering (BotSE)</source>
          , pages
          <fpage>38</fpage>
          {
          <fpage>42</fpage>
          , May
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Damien</surname>
            <given-names>Legay</given-names>
          </string-name>
          , Alexandre Decan, and Tom Mens.
          <article-title>On the impact of pull request decisions on future contributions</article-title>
          . arXiv e-prints, page arXiv:
          <year>1812</year>
          .06269,
          <year>Dec 2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Yin</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C. X.</given-names>
            <surname>Ling</surname>
          </string-name>
          .
          <article-title>Reviewer recommender of pull-requests in GitHub</article-title>
          .
          <source>In International Conference on Software Maintenance and Evolution</source>
          , pages
          <volume>609</volume>
          {
          <fpage>612</fpage>
          . IEEE, Sep.
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Yin</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C. X.</given-names>
            <surname>Ling</surname>
          </string-name>
          .
          <article-title>Who should review this pull-request: Reviewer recommendation to expedite crowd collaboration</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>In</surname>
          </string-name>
          Asia-Paci c Software Engineering Conference, volume
          <volume>1</volume>
          , pages
          <fpage>335</fpage>
          {
          <fpage>342</fpage>
          ,
          <string-name>
            <surname>Dec</surname>
          </string-name>
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>Jing</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-H</given-names>
            <surname>He</surname>
          </string-name>
          , and
          <string-name>
            <given-names>X.-Y</given-names>
            <surname>Chen</surname>
          </string-name>
          .
          <article-title>Coredevrec: Automatic core member recommendation for contribution evaluation</article-title>
          .
          <source>Journal of Computer Science and Technology</source>
          ,
          <volume>30</volume>
          :
          <fpage>998</fpage>
          {
          <fpage>1016</fpage>
          , 09
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Manoel Limeira de Lima Junior</surname>
            , Daricelio Moreira Soares, Alexandre Plastino, and
            <given-names>Leonardo</given-names>
          </string-name>
          <string-name>
            <surname>Murta</surname>
          </string-name>
          .
          <article-title>Automatic assignment of integrators to pull requests: The importance of selecting appropriate attributes</article-title>
          .
          <source>Journal of Systems and Software</source>
          ,
          <volume>144</volume>
          :
          <fpage>181</fpage>
          {
          <fpage>196</fpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20] E. v. d. Veen,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>Gousios, and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Zaidman</surname>
          </string-name>
          .
          <article-title>Automatically prioritizing pull requests</article-title>
          .
          <source>In Working Conference on Mining Software Repositories</source>
          , pages
          <volume>357</volume>
          {
          <fpage>361</fpage>
          . IEEE, May
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Ayushi</surname>
            <given-names>Rastogi</given-names>
          </string-name>
          , Nachiappan Nagappan, Georgios Gousios, and Andre van der Hoek.
          <article-title>Relationship between geographical location and evaluation of developer contributions in github</article-title>
          .
          <source>In International Symposium on Empirical Software Engineering and Measurement. ACM</source>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>Josh</surname>
            <given-names>Terrell</given-names>
          </string-name>
          , Andrew Ko nk, Justin Middleton, Clarissa Rainear, Emerson Murphy-Hill, and
          <string-name>
            <given-names>Chris</given-names>
            <surname>Parnin</surname>
          </string-name>
          .
          <article-title>Gender bias in open source: Pull request acceptance of women versus men</article-title>
          .
          <source>01</source>
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <surname>Felipe</surname>
            <given-names>Ebert</given-names>
          </string-name>
          , Fernando Castor, Nicole Novielli, and
          <string-name>
            <given-names>Alexander</given-names>
            <surname>Serebrenik</surname>
          </string-name>
          .
          <article-title>Confusion in code reviews: Reasons, impacts, and coping strategies</article-title>
          .
          <source>pages 49{60</source>
          , 02
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>Vasiliki</given-names>
            <surname>Efstathiou</surname>
          </string-name>
          and
          <string-name>
            <given-names>Diomidis</given-names>
            <surname>Spinellis</surname>
          </string-name>
          .
          <article-title>Code review comments: Language matters</article-title>
          . CoRR, abs/
          <year>1803</year>
          .02205,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>M. M. Rahman</surname>
            and
            <given-names>C. K.</given-names>
          </string-name>
          <string-name>
            <surname>Roy</surname>
          </string-name>
          .
          <article-title>Impact of continuous integration on code reviews</article-title>
          .
          <source>In 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR)</source>
          , pages
          <fpage>499</fpage>
          {
          <fpage>502</fpage>
          , May
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>L.</given-names>
            <surname>MacLeod</surname>
          </string-name>
          , M. Greiler,
          <string-name>
            <given-names>M.</given-names>
            <surname>Storey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bird</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Czerwonka</surname>
          </string-name>
          .
          <article-title>Code reviewing in the trenches: Challenges and best practices</article-title>
          .
          <source>IEEE Software</source>
          ,
          <volume>35</volume>
          (
          <issue>4</issue>
          ):
          <volume>34</volume>
          {
          <fpage>42</fpage>
          ,
          <year>July 2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>O.</given-names>
            <surname>Kononenko</surname>
          </string-name>
          , O. Baysal, and
          <string-name>
            <given-names>M. W.</given-names>
            <surname>Godfrey</surname>
          </string-name>
          .
          <article-title>Code review quality: How developers see it</article-title>
          .
          <source>In 2016 IEEE/ACM 38th International Conference on Software Engineering (ICSE)</source>
          , pages
          <fpage>1028</fpage>
          {
          <fpage>1038</fpage>
          , May
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>Eirini</surname>
            <given-names>Kalliamvakou</given-names>
          </string-name>
          , Georgios Gousios, Kelly Blincoe, Leif Singer, Daniel M. German, and
          <string-name>
            <given-names>Daniela</given-names>
            <surname>Damian</surname>
          </string-name>
          .
          <article-title>The promises and perils of mining GitHub</article-title>
          .
          <source>In Int'l Conf. Mining Software Repositories</source>
          , pages
          <volume>92</volume>
          {
          <fpage>101</fpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>Jeremy</given-names>
            <surname>Katz</surname>
          </string-name>
          . Libraries.
          <article-title>io open source repository and dependency metadata (version 1.4.0) [data set]</article-title>
          . http://doi.org/10.5281/zenodo.2536573,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>N.</given-names>
            <surname>Cli</surname>
          </string-name>
          . Dominance statistics:
          <article-title>Ordinal analyses to answer ordinal questions</article-title>
          .
          <source>Psychological Bulletin</source>
          ,
          <volume>114</volume>
          (
          <issue>3</issue>
          ):
          <volume>494</volume>
          {
          <fpage>509</fpage>
          ,
          <year>1993</year>
          . cited By 364.
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <surname>Jeanine</surname>
            <given-names>Romano</given-names>
          </string-name>
          , Je rey
          <string-name>
            <given-names>D</given-names>
            <surname>Kromrey</surname>
          </string-name>
          , Jesse Coraggio, Je Skowronek, and
          <string-name>
            <given-names>Linda</given-names>
            <surname>Devine</surname>
          </string-name>
          .
          <article-title>Exploring methods for evaluating group di erences on the NSSE and other surveys: Are the t-test and Cohen's d indices the most appropriate choices? In Annual Meeting of the Southern Association for Institutional Research</article-title>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>C.</given-names>
            <surname>Wohlin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Runeson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Host</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Ohlsson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Regnell</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Wesslen</surname>
          </string-name>
          . Experimentation in Software Engineering - An
          <string-name>
            <surname>Introduction</surname>
          </string-name>
          . Kluwer,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>