<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Seville, Spain
* Corresponding author.
†These authors contributed equally.
$ oscar.diaz@ehu.eus (O. Díaz); xavier.franch@upc.edu (X. Franch); xabier.garmendiad@ehu.eus (X. Garmendia)</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Accumulation and Evolution of Design Knowledge in Research Software</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Oscar Díaz</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xavier Franch</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xabier Garmendia</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Universitat Politecnica de Catalunya, Barcelona</institution>
          ,
          <addr-line>Catalunya</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of the Basque Country (UPV/EHU)</institution>
          ,
          <addr-line>San Sebastián</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0003</lpage>
      <abstract>
        <p>This project argues that while research software (RS) is increasingly recognized as a pillar of research, its potential for transferability beyond initial use cases is poorly explored. It highlights that software developed for specific scientific problems often remains isolated, limiting knowledge accumulation and evolution across projects. The project focuses specifically on peer-to-peer knowledge transfer rather than more common asymmetric collaboration models (such as open-source or institutional sustainability initiatives) by creating a framework that considers transferability a socio-technical problem. Here, we explore the social perspective by using manuscript bylines as indicators of collaborative profiles. We propose a structural model examining three key dimensions: work-shared efort, role-played efort, and cognitive-distance efort. This work contributes to the ongoing discussion of research assessment methods while providing practical tools for understanding collaborative dynamics in research.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Research Collaboration</kwd>
        <kwd>Research Software</kwd>
        <kwd>Structural Equation Modelling</kwd>
        <kwd>Byline</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Research Software (RS) is given increasing recognition as one of the pillars of research. If this software
pillar is not up, then research reproducibility might be jeopardized. This raises issues on software
sustainability, i.e., the capacity of the software to endure. This criticality increases when software is not
a means but a research end in itself. While in vaccine development the software might be a means to
ifnd the vaccine, the digital society has an increasing number of areas where software is the vaccine, i.e.,
where RS is the very intervention to address the problem (hereafter RS will denote this design-ingrained
software). Because RS is developed in response to a specific scientific problem, it is often developed
by individuals who did not anticipate that their software would be used by others or reused in other
projects. Yet, some RS has a significant potential for transferability to broader application, which
expands the initial use in an individual research project a potential that is currently not fully explored.
While reproducibility looks backward regarding what has been done, transferability looks forward by
providing eventual transferee researchers with evidence that the research findings could be applicable
to other contexts, situations, times, or populations. Transferability looks at whether and by which
means it is possible to achieve success in the target context than reproducing the efects of the original
context. In doing so, design knowledge is advanced. Specifically, this project considers three pathways
for advancement: confidence, projectability, and fitness.</p>
      <p>Confidence refers to the extent of the evaluation comprehensiveness (e.g., rigor, width). RS with
higher evaluation confidence will be used with less risk than RS with lower evaluation confidence,
hence encouraging transferability. Second, projectability, i.e., the extent of the problem-analysis
comprehensiveness in terms of the context that frames the existing software. Low projectability would
indicate a very specific context with restrictive goals. In contrast, high projectability would indicate more
general characterization that increases the chances of the target context to be accommodated within
the primary context. Finally, fitness addresses the extent of the solution-analysis comprehensiveness, in
terms of being operational for users to be applied to (and solve) the real-world problem in focus. Fitness
not only relates to the extent of the coding efort, but of the understanding of the design principles
behind the RS, and hence, the maturity of the theory underlying the RS. Accordingly, the lower the
iftness of the RS, the greater the efort remaining in order to apply the RS to a new problem.</p>
      <p>These pathways reflect distinct approaches to knowledge accumulation. Investigating and
conceptualizing these approaches is the main objective of the SUSTRA project. SUSTRA (acronym for
"Accumulation and Evolution of Design Knowledge in Research Software: From SUStainability to
TRAnsferability", see Table 1 for details) is a 3-year Spanish-funded research project that tackles three
research questions:
• How could software be engineered for transferability vs. traditional reusability/portability
approaches?
• Which sort of transferability scenarios can be considered as test cases for transferability
interventions?
• How to increase trust while reducing friction between participating peers?</p>
    </sec>
    <sec id="sec-2">
      <title>2. Background: Research Software as the depositories of Design</title>
    </sec>
    <sec id="sec-3">
      <title>Knowledge</title>
      <p>
        Research Software (RS) is developed in academia for research purposes [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. If RS is weak, research
reproducibility sufers, raising concerns about software sustainability—its ability to endure and add
value. Diferent research types imply diferent RS types. R. Wieringa distinguishes ‘knowledge-seeking’
research, which aims to understand the world, from ‘solution-seeking’ research, which seeks to change
it via interventions like software [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Knowledge-seeking studies use tools like NVivo or R scripts
for analysis (e.g., debugging, smartphone use). If the issue is problematic (e.g., costly debugging,
nomophobia), solution-seeking studies aim to address it through software interventions. The problem
might be the same (costly debugging, nomophobia), yet the solution often needs to be tuned to the
context (adults vs. teenagers).
      </p>
      <p>
        Hence, in solution-seeking research, RS does not exist in isolation but must address the context
within which their utility is needed and demonstrated. Hence, software is not an ancillary companion,
but the focal point of research from which Design Principles are distilled. Since Design Principles are
not always directly actionable, their embodiments (i.e. the designed-ingrained software) are the way
to impact the world and measure the extent the ingrained Design Principles are useful in accomplishing
‘the human purpose’. Design Principles generalized ‘the active ingredients’ which are conjectured to be
the drivers of ‘the change in the world’. Without such a generalization efort, solution-seeking studies
can quickly be indistinguishable from mere software development. Generalization is key. Some RS has
a significant potential for broader application, which expands beyond the initial intended use in an
individual research project—a potential which is currently poorly explored. Indeed, DSR distinguishes
between fitness-for-use (i.e., the ability of the design artifact to perform in the current application
context with the current set of goals in the problem space) from fitness-for-evolution (i.e., the ability
of the solution to adapt to changes in the problem space over time). Unfortunately, this potential is
hardly realized. In 2020, Von Brocke et al observe,“to date, most studies focus on a single DSR project,
aiming at deriving design knowledge within this project, while knowledge accumulation and evolution
across projects is rarely considered as an antecedent or contribution of the project” [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In the same
vein but this time talking about artifact resulting from top-rank conference, Timperley et al. remark
“among other challenges, mismatched expectations, misaligned incentives, and poor communication
between the creators, users, and reviewers of artifacts lead to suboptimal outcomes and experiences for
all involved, and prevent the full potential of those artifacts from being realized” [3]. This leads to this
project’s hypothesis:
      </p>
      <p>Accumulation and Evolution of Design Knowledge calls for a RS-based Model of
Transferability where technical and social factors are considered</p>
      <sec id="sec-3-1">
        <title>This leads to this project’s design questions [4]:</title>
        <p>How to design a Model of Transferability that satisfies both technical and social factors
in order to inform researchers accumulate and evolve design knowledge in RS as the IT
artifact in DSR projects?</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>3. A Model of Transferability</title>
      <p>We borrow the term ’transferability’ from qualitative studies. It describes the extent to which the
outcomes of a successful intervention evaluated in a Primary Context can be achieved in a Target Context
(see Figure 1). While replicability looks backwards by regarding what has been done, transferability looks
forward by providing potential transferee researchers with “thick" (detailed, complete) descriptions so
that someone wishing to reuse the knowledge in another context can be clear about how the target
context is similar (and diferent) from the originating (researched) context. As the same phenomenon/a
is/are found to apply in diferent contexts, the generality of the knowledge is given evidence and
accumulated.</p>
      <p>Figure 1 depicts the Model of Transferability resulting from the SUSTRA project. The aim is to
transfer an intervention from a Primary Context to a Target Context. This model suggests that the
reflections on transferability should focus more on whether and by which means it is possible to achieve
intervention success in the Target Context than on “reproducing” the efects of the Primary Context,
because contextual influences in the Target Context usually difer from influences in the Primary Context.
Notice the diference with the notion of ‘usability’. If usability is the degree to which a software artifact
is able to be used to meet end-user goals, then transferability is the extent to which software is able
to be used to meet research goals, i.e., to advance knowledge. Hence, a software might exhibit high
usability but low transferability, and the other way around. Diferences stem from both the stakeholders
(end-users vs. researchers) and the goals (usage vs. knowledge advancement). This model includes
three main constructs: the transferability’s object (’the what’), the transferability’s agents (’the who’),
and the transferability’s aim (’the why’).</p>
      <p>The object of transferability. Our model considers RS as a ‘holistic intervention’ (see Figure
2), i.e., the transfer is not limited to the software per se but encompasses a careful description of the
context and evaluation procedure where the software somehow demonstrates its utility. A context
might be described along Stol and Fitzgerald’s ABC framework for framing SE research: Actors (A) (e.g.,
software professionals, software systems, and their users); their behavior (B) (e.g., coordination among
developers; developer productivity; and the context (C) of a specific system or organization [ 5]. The
Primary Context symbolizes the form in which evidence was gained and is available. In order to decide
on the transfer of the intervention, the transferee researcher needs to contrast the conditions of the
Primary Context vs. his or her own context, i.e., the Target Context. The transferee researcher should
anticipate changes and reactions in the target population and the environment, which may, in turn,
lead to adaptations and further development of the RS.</p>
      <p>
        The aims of transferability. RS is software developed by researchers for researchers. Consequently,
RS difers from commercial software in both the stakeholders (researchers) and the aim (research).
In commercial software, software reuse pursues gains in quality, cost or productivity [6]. Certainly,
these are also valid points for RS. Yet, RS’s ultimate goal is to advance knowledge. The stakeholders
are researchers, not end users. No matter the quality, cost or productivity, design artifacts are not
considered successful unless they serve to sustain ‘new (design) knowledge’. The question arises about
how knowledge can be advanced. Three possible dimensions are possible to increase the confidence,
the projectability or the fitness of the RS [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>The agents of transferability. Our tentative Model of Transferability involves ‘equal-footing peers’
(see Figure 2 ). This scenario departs from Open-Source Software where collaboration is asymmetric
between an existing community and an interested practitioner. Also, our approach difers from the
Software Sustainability Institute initiative. Launched in the UK (the SSI institute) and soon followed
by the US (the URSSI institute) and Australia (the AUSSI institute), these governmental organizations
disseminate best practices for RS sustainability, and build community around them. Here, collaboration
is also asymmetrical. By contrast, we investigate peer-to-peer collaboration in a research context.</p>
      <p>In this setting, the research manuscript serves as a primary representation of the collaborative
efort. Cheong and Corbitt assert that many studies have highlighted a strong positive link between
scientific collaboration and the practice of co-authorship [ 7]. In the same vein, Oh et al. construct
knowledge networks based on co-authorship patterns extracted from four major IS journals [8]. Similarly,
Isfandyari-Moghaddam et al. resort to authorship to derive networks representing collaboration patterns
at diferent levels, including individual, institutional, and country-wise collaborations [ 9]. This suggests
that the manuscript’s byline (i.e., the line of text on a manuscript that lists the authors’ names) could
be used to establish the collaboration profile. Recognising the key role of this concept, the SUSTRA
project includes a work package for formulating a model of byline trustworthiness. The rest of this
report focuses on this work package.</p>
    </sec>
    <sec id="sec-5">
      <title>4. A Model of Byline Trustworthiness for Collaborative Assessment</title>
      <p>The byline brings an afordable method to compute the collaborative profile in the aforementioned
scenarios. Firstly, it is consistent and verifiable, allowing other researchers, committees, and peers to
replicate the results if they have access to the same data set. Secondly, it is cost-efective, which is
important as stakeholders may not have the resources or time to conduct surveys and arrange
questionnaires. Additionally, the sample size (linking collaboration events with publications) is significant for
long careers and statistically more meaningful than case studies. Lastly, collecting bibliographical data
is non-intrusive and non-reactive, meaning it does not afect the collaboration process [10].</p>
      <p>Specifically, the work package addresses the following research question:</p>
      <p>RQ1: What level of trust does the Information Systems community grant to the byline for
collaboration-based decisions taken?
Measuring trustworthiness directly can be challenging because trust is a complex, multifaceted, and
subjective concept influenced by social and cultural norms. This complexity makes it dificult to develop
a single measurement applicable across diferent cultural settings. Indeed, the variation in authorship
strategies across disciplines and the subjective nature of interpreting such data underscore the necessity
of using proxies and indirect measures to gauge trustworthiness. Specifically, provided the byline
reflects the collaborative efort, we can use proxies based on efort. These proxies include the amount
of work shared by the researcher, the researcher’s role in the project, and the level of familiarity an
author has with their co-authors.These proxies ofer significant practical advantages for assessing
collaboration through bylines. Most notably, they rely on readily available bibliometric data that can be
systematically extracted from databases like DBLP, making them cost-efective and scalable. Unlike
alternative approaches such as surveys or interviews, these proxies don’t require direct interaction with
researchers or time-consuming manual data collection. The measurements can be automated, allowing
for consistent application across large datasets and time periods. The proxies are also particularly
convenient for diferent stakeholders. For promotion committees, they provide quantifiable metrics
that can be easily incorporated into evaluation processes. For researchers seeking collaborators, these
measures ofer quick insights into potential partners’ collaborative patterns. The data’s public nature
ensures transparency and verifiability, allowing all parties to access and validate the same information.</p>
      <p>Based on these premises, we create a Structural Model that aims to indirectly measure the
trustworthiness of the byline through these proxies. Figure 3 depicts this model.</p>
      <sec id="sec-5-1">
        <title>4.1. The Work-Shared Efort Perception</title>
        <p>Rationale The extent and nature of the work share assigned to an individual directly impact their
contribution and involvement in the collaborative process. An individual tasked with a substantial
portion of the research, such as data collection, analysis, or writing significant sections of the publication,
will inherently have a deeper engagement and a more pronounced role in the collaboration. Conversely, a
smaller or more peripheral work share might limit the individual’s opportunity to contribute significantly,
potentially afecting the overall collaborative dynamics.</p>
        <p>H1: The work-share efort perception is associated with the byline trustworthiness perception.
Measurement model The number of authors allows for diferent insights: few-author publication
ofers evidence of real contribution, yet too many few-author publication might be a symptom of limited
inter-disciplinary, and low collaboration efort. Specifically, we aim to assess work-share in terms of
the personal workload and impact on the time to complete the research. This results in three items:
WSE01, WSE02 and WSE03 (see Figure 3).</p>
        <p>Rationale The role of an individual defines her responsibilities, authority, and influence over the
project’s direction. Key roles, such as the principal investigator or lead author, involve not only
substantial work but also decision-making authority, coordination of team eforts, and often, the
responsibility for integrating the contributions of all team members into a cohesive output. Diferent
roles may also involve varying degrees of interaction with other team members, influencing how
collaboration occurs. For instance, a role that acts as a bridge between diferent parts of the team (like a
project manager or coordinator) necessitates a high level of collaborative efort.</p>
        <p>H2: The role-played efort perception is associated with the byline trustworthiness perception.
Measurement model The order of co-authors in a publication can follow diferent criteria that vary
according to the diversity of structural dynamics of the field. That said, a common agreement exists
that the authors’ position matters. After reviewing 41 articles, Hilario et al conclude that ‘there is an
association between the order of authors in the byline, the type of activity performed by the author,
and the regularity of their participation in the fundamental stages of the development of the paper’
[11]. On these grounds, Figure 3 includes RPE01, RPE02 and RPE03 items.</p>
      </sec>
      <sec id="sec-5-2">
        <title>4.2. The Cognitive-distance Efort</title>
        <p>Rationale This proxy refers to the diferences in cognitive and knowledge frameworks among team
members. It encompasses variations in perspectives, expertise, problem-solving approaches, and mental
models. It can be a double-edged sword. Too little cognitive distance can lead to homogeneity of
thought, reducing creativity and potential innovation. Too much can hinder efective communication
and consensus-building.</p>
        <p>H3: The cognitive-distance efort perception is associated with the byline trustworthiness
perception.</p>
        <p>Measurement model We conceptualize intimacy as the degree to which an author is knowledgeable
about their co-authors. While high awareness is often associated with positive outcomes [12], having too
many interpersonal relationships can gradually decrease the likelihood of generating new knowledge
[13]. In this context, knowledge leads to selectivity in choosing partners, as individuals tend to prefer
working with trusted partners rather than taking the risks associated with collaborating with unfamiliar
individuals. However, this preference for the ‘comfort zone’ can hinder the contribution of new insights.
Figure 3 includes three items: CDE01, CDE02 and CDE03.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>5. Conclusion</title>
      <p>As research software increasingly serves not just as a means but as the primary intervention itself, the
need for efective knowledge transfer becomes paramount. This issue is not only a technical problem
but also a social challenge. The SUSTRA project explores the social dimensions of research collaboration
through byline analysis, identifying work-shared efort, role-played efort, and cognitive-distance efort
as key factors in understanding collaboration dynamics.</p>
      <p>That said, we recognize that reducing complex collaborative relationships to quantifiable metrics
risks overlooking the qualitative aspects of research partnerships that often drive innovation. Our hope
is that this dual focus on technical transferability and social collaboration provides a comprehensive
framework for advancing knowledge accumulation in software-driven research. By conceptualizing
research software as a baton in the relay race of knowledge advancement, we can better support the
scientific community’s collective progress toward addressing increasingly complex societal challenges.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>Research supported by MCIN/AEI/10.13039/501100011033/FEDER, UE and the “European Union
NextGenerationEU/PRTR” under contract PID2021-125438OB-I00. Xabier Garmendia enjoys a grant
from the University of the Basque Country - PIF20/236.</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <sec id="sec-8-1">
        <title>The author(s) have not employed any Generative AI tools.</title>
        <p>[3] C. S. Timperley, L. Herckis, C. Le Goues, M. Hilton, Understanding and improving artifact
sharing in software engineering research, Empirical Software Engineering (EMSE) 26 (2021) 1–41.
doi:10.1007/s10664-021-09971-3.
[4] R. J. Wieringa, Research goals and research questions, in: Design science methodology for
information systems and software engineering, Springer, 2014, pp. 13–23.
[5] K. J. Stol, B. Fitzgerald, The abc of software engineering research, ACM Transactions on Software</p>
        <p>Engineering and Methodology (TOSEM) 27 (2018). doi:10.1145/3241743.
[6] S. Apel, D. Batory, C. Kästner, G. Saake, Feature-Oriented Product Lines, Springer Berlin Heidelberg,
2013.
[7] F. Cheong, B. J. Corbitt, A social network analysis of the co-authorship network of the pacific
asia conference on information systems from 1993 to 2008 (2009). URL: https://aisel.aisnet.org/
pacis2009/23.
[8] W. Oh, J. N. Choi, K. Kim, Coauthorship dynamics and knowledge capital: The patterns of
crossdisciplinary collaboration in information systems research, Journal of Management Information
Systems 22 (2005) 266–292.
[9] A. Isfandyari-Moghaddam, M. K. Saberi, S. Tahmasebi-Limoni, et al., Global scientific collaboration:
A social network analysis and data mining of the co-authorship networks, Journal of Information
Science 49 (2023) 1126–1141.
[10] E. E. Tarkang, M. Kweku, F. B. Zotor, Publication practices and responsible authorship: a review
article, Journal of public health in Africa 8 (2017).
[11] C. M. Hilário, M. C. C. Grácio, D. Martínez-Ávila, D. Wolfram, Authorship order as an indicator of
similarity between article discourse and author citation identity in informetrics, Scientometrics
128 (2023) 5389–5410.
[12] D. Lavie, The competitive advantage of interconnected firms: An extension of the resource-based
view, Academy of Management Review 31 (2006) 638–658.
[13] D. Lavie, I. Drori, Collaborating for knowledge creation and application: The case of
nanotechnology research programs, Organization Science 23 (2012) 704–724.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Hettrick</surname>
          </string-name>
          ,
          <source>Research software sustainability: Report on a Knowledge Exchange workshop</source>
          ,
          <source>Technical Report, Knowledge Exchange</source>
          ,
          <year>2016</year>
          . URL: https://digitalcommons.unl.edu/scholcom/6/.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Vom</surname>
          </string-name>
          <string-name>
            <surname>Brocke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Winter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hevner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Maedche</surname>
          </string-name>
          ,
          <article-title>Accumulation and evolution of design knowledge in design science research: A journey through time and space</article-title>
          ,
          <source>Journal of the Association for Information Systems</source>
          <volume>21</volume>
          (
          <year>2020</year>
          )
          <article-title>9</article-title>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>