<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>ORCID:</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Social Aspects of Machine Learning Model Evaluation: Model Interpretation and Justification from ML-practitioners' Perspective</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Victoria Zakharova</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alena Suvorova</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>HSE University</institution>
          ,
          <addr-line>3A Kantemirovskaya Street, St Petersburg, 194100</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0003</lpage>
      <abstract>
        <p>Machine Learning (ML) is now widely applied in various spheres of life. Experts from different domains are becoming involved in decision-making based on complex machine learning models, which has increased interest in research on model explainability. However, little is known about how ML practitioners describe and justify their models to others. This work aims to fill this research gap by examining how data specialists evaluate machine learning models and how they communicate results to third parties. To explore this, a qualitative research design is proposed and semi-structured interviews with ML practitioners are conducted. The decision-making process is explored from a sociological perspective in which data specialists are considered actors who construct knowledge rather than passively receive it. The expected result of this work is to reveal the role of data specialists in model explanation and justification and to describe methods they could use to explain complex models to domain experts with non-technical backgrounds.</p>
      </abstract>
      <kwd-group>
        <kwd>Machine Learning</kwd>
        <kwd>Algorithm Evaluation</kwd>
        <kwd>Knowledge Sharing</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Digitalization promotes innovation and facilitates globalization. At the same time, the ongoing
digital transformation gives rise to new tasks and to new methods for solving them. These methods
are rarely clear to a wide audience but are accepted because they address urgent issues
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. This tendency is noticeable in applied domains, where medium-sized companies, large
corporations, and small start-ups turn to non-traditional digital solutions to present the unique value of
their work, strengthen their competitiveness, and stand out among other market
players. Data-driven approaches have gained recognition in customer-oriented settings, which are
thought to affect society and its characteristics, with far-reaching effects [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. For
example, the banking sphere has changed with the introduction of chatbots based on
machine learning algorithms, which answer clients more quickly or send personalized notifications and
are already used in industries such as retail [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], healthcare [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], and insurance [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        As one consequence, motivated by the growing demand for analytical expertise in
the labour market, some people follow the trend and take on the role of problem-solvers to deal with
present-day challenges [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. As these people expand their knowledge and move into the data science
sphere, their roles become diverse and are rarely clearly defined. Moreover, specialists
have to collaborate with each other to reach common goals such as releasing new
digital products or upgrading existing infrastructure with advanced algorithms. To simplify,
three roles can be distinguished: model builders, model breakers, and model consumers [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. At the
early stage of technological development and of knowledge formation about it, the first two
roles are taken by actors who are interested in facilitating innovation and who make
data-based decisions, introducing state-of-the-art project results to the wider public. Specifically, data
specialists not only have to develop and evaluate statistical models, such as machine learning ones, but
also have to come to an agreement with stakeholders who do not work directly with mathematical
algorithms yet are the ones who can ensure that a project is promoted and supported further.
      </p>
      <p>
        The motivation for this research is the recent increase in scientific papers that
emphasize the importance of machine learning practitioners’ expertise in dealing with algorithms.
These papers promote transparency analysis as an integral part of working with algorithms and thereby draw
attention to the need to clarify data- and machine-learning-based solutions so that
all involved parties can understand them [
        <xref ref-type="bibr" rid="ref10 ref7 ref8 ref9">7, 8, 9, 10</xref>
        ]. As for its potential contribution, this
work attempts to present a theoretical justification of the model evaluation process that takes into account
the practices of model interpretation and of sharing knowledge with other involved actors. It is supported by
qualitative data provided by machine learning practitioners and views the case from their perspective.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Research and Problem Statement</title>
      <p>
        Presently, there are plenty of research papers demonstrating scientific interest in data-driven
innovations from managerial, economic, and social perspectives. One of the research themes is
the study of the working process of data practitioners. In particular, these studies
focus on the data-oriented skill set [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], data science role division [
        <xref ref-type="bibr" rid="ref12 ref13 ref14">12, 13, 14</xref>
        ], team collaboration
[
        <xref ref-type="bibr" rid="ref15 ref16">15, 16</xref>
        ], and tools and practices within the workflow in practical settings [
        <xref ref-type="bibr" rid="ref17 ref18 ref19">17, 18, 19</xref>
        ]. In
addition, other studies focus on the role of explanation in decision-making, revealing that data specialists
tend to trust algorithms too much and make decisions in a biased manner [
        <xref ref-type="bibr" rid="ref20 ref21">20, 21</xref>
        ]. However, little is
known about how data specialists who implement complex models (i.e., machine learning ones) evaluate
models in non-academic settings, and how they convey the obtained information to others involved
directly or indirectly in their work. Several studies related to this issue have investigated
practitioners’ work, but they are empirical rather than theoretically
grounded [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], and experts’ needs and opinions about interpretability are rarely reported.
      </p>
      <p>
        Thus, this research aims to study the practices (i.e., practical actions based on real-life
working experience) of data science specialists, with a focus on the model evaluation stage and on
communicating their knowledge about model quality and other characteristics to third parties.
The following research questions are proposed: How do specialists perform model evaluation, and what
do they pay attention to? How do data practitioners explain complex statistical models to people
without a deep understanding of data science principles and ideas? The relevance of studying this issue
stems from the idea that data experts are the first to interact with algorithms and have the specific
knowledge needed to understand them, and that their decisions are the starting point for putting algorithms into
production, which might have a significant impact on society over time [
        <xref ref-type="bibr" rid="ref22 ref23">22, 23</xref>
        ].
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Research Design</title>
      <p>
        In this research, the description of the practical work of specialists is
supported with empirical data collected via semi-structured interviews with practitioners working in
different business spheres with data-intensive applications. An interview-based approach is used to
understand the experience, positions, attitudes, and opinions of industry practitioners, who are
direct guides to the world of technology [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. A varied sample, in other words interviewing
practitioners from various domains, is considered suitable for identifying common
(domain-independent) patterns and discrepancies, and for explaining the performed actions and formed
viewpoints with the help of shared real-life experience. As for the sampling technique, convenience and snowball
sampling were used, and, as a result, 16 interviews with 11 men and 5 women have been
conducted. The main criteria for recruiting participants were that they had at least one year of
practical experience in the industry and that they used machine learning algorithms for
problem-solving in their work.
      </p>
      <p>
        The obtained results will be analyzed using thematic qualitative analysis in order to
explore the general case from the perspective of the applied theoretical framework. In detail, this work
is intended to be grounded in theory so that its results are justified by a theoretically informed interpretation of the empirical data.
As for the theory, the concept of “worlds” introduced by Boltanski and Thévenot in 2006 [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] is chosen
to elaborate on data practitioners’ work. According to this concept, there are several “worlds”, or ways of
thinking, that describe how people and objects coexist while being guided by their own interests,
intentions, and perceptions of particular issues. These “worlds”, which are prone to experience conflicts,
reach compromises, and collaborate on justification, are the following: inspired, fame, domestic, civic,
market, and industrial. Given that data scientists generally work in a business
setting, it seems straightforward that these practitioners have to work not only with each other but also with
managers and stakeholders, who are more likely to belong to other “worlds”, especially the market
one.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Plans and Preliminary Results</title>
      <p>Further plans for this research are mainly focused on data analysis to obtain justified answers to the
research questions. Preliminary findings emphasize the difficulty of bringing several “worlds” into contact.
Specifically, data practitioners evaluate models with mathematical metrics that are
understandable to them, and then they have to consider the interests of others, such as managers,
who are more likely to be concerned with financial payoffs, and stakeholders, who decide whether
to invest in an ML-based project. The situation becomes more complicated when there is a
need to review the models in a social context (e.g., whether obscene content that was unblocked by
mistake is causing moral injury to users). In addition, data practitioners support the idea that one of the
managerial purposes is to sell projects by winning over superiors. Moreover, managers sometimes attempt
to take part in market tenders by offering technical solutions that data specialists can hardly implement.
In general, these insights strengthen the idea that there is a strong need for effective
communication between the “worlds” to inform each party about the capabilities of the others, in particular by
converting mathematical metrics into business ones to demonstrate efficiency and potential benefits
in a justified way.</p>
      <p>As for interpretable machine learning methods (which appear to be one of the most debated topics
in data science communities), several practitioners mentioned the usefulness of such tools for revealing
model transparency with a certain degree of confidence. There were cases when these tools helped them to
decide which model would be better in terms of its algorithm, or to walk through a project case
step by step, making it easier to present the work to experts from the other “worlds”.
Others pointed out that they did not use interpretable machine learning methods in their project
workflow because they considered them not worth the effort: rigorous explanations are required by stakeholders but
are expensive in terms of computation and time.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Acknowledgements</title>
      <p>The work is supported by the Russian Science Foundation grant (project No. 19-71-00064).</p>
    </sec>
    <sec id="sec-6">
      <title>6. References</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.J.</given-names>
            <surname>Kassema</surname>
          </string-name>
          ,
          <article-title>Products and Services Improvement through Innovation and Creativity: Case of IT Business Sector</article-title>
          . Social Science Research Network, Rochester, NY,
          <year>2019</year>
          . https://doi.org/10.2139/ssrn.3485811.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Mugrauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pers</surname>
          </string-name>
          ,
          <article-title>Marketing managers in the age of AI: A multiple-case study of B2C firms</article-title>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T.</given-names>
            <surname>Calle-Jimenez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Orellana-Alvear</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Prado-Imbacuan</surname>
          </string-name>
          ,
          <article-title>GIS and User Experience in Decision Support for Retail Type Organizations</article-title>
          .
          <source>In: 2019 International Conference on Information Systems and Software Technologies (ICI2ST)</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>156</fpage>
          -
          <lpage>161</lpage>
          . https://doi.org/10.1109/ICI2ST.2019.00029.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>L.</given-names>
            <surname>Syed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jabeen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Manimala</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.A.</given-names>
            <surname>Elsayed</surname>
          </string-name>
          ,
          <article-title>Data Science Algorithms and Techniques for Smart Healthcare Using IoT and Big Data Analytics</article-title>
          . In: M.K. Mishra, B.S.P. Mishra, Y.S. Patel, R. Misra (eds.)
          <source>Smart Techniques for a Smarter Planet: Towards Smarter Algorithms</source>
          , Springer International Publishing, Cham,
          <year>2019</year>
          , pp.
          <fpage>211</fpage>
          -
          <lpage>241</lpage>
          . https://doi.org/10.1007/978-3-030-03131-2_11.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Ramasubramanian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Shivam</surname>
          </string-name>
          ,
          <article-title>Building an Enterprise Chatbot: Work with Protected Enterprise Data Using Open Source Frameworks</article-title>
          . Apress,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Kampakis</surname>
          </string-name>
          ,
          <article-title>Problem Solving</article-title>
          . In: S. Kampakis (ed.)
          <source>The Decision Maker's Handbook to Data Science: A Guide for Non-Technical Executives, Managers, and Founders</source>
          . Apress, Berkeley, CA,
          <year>2020</year>
          , pp.
          <fpage>89</fpage>
          -
          <lpage>95</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.R.</given-names>
            <surname>Hong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hullman</surname>
          </string-name>
          , E. Bertini,
          <article-title>Human Factors in Model Interpretability: Industry Practices, Challenges, and Needs</article-title>
          .
          <source>Proc. ACM Hum.-Comput. Interact. 4</source>
          ,
          <fpage>1</fpage>
          -
          <lpage>26</lpage>
          ,
          <year>2020</year>
          . https://doi.org/10.1145/3392878.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>C.</given-names>
            <surname>Molnar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Casalicchio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Bischl</surname>
          </string-name>
          ,
          <article-title>Interpretable Machine Learning: A Brief History, State-of-the-Art and Challenges</article-title>
          . arXiv:2010.09337 [cs, stat],
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>W.J.</given-names>
            <surname>Murdoch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Kumbier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Abbasi-Asl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <article-title>Interpretable machine learning: definitions, methods, and applications</article-title>
          .
          <source>Proc Natl Acad Sci USA</source>
          .
          <volume>116</volume>
          ,
          <year>2019</year>
          , pp.
          <fpage>22071</fpage>
          -
          <lpage>22080</lpage>
          . https://doi.org/10.1073/pnas.1900654116.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>H.</given-names>
            <surname>Suresh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.R.</given-names>
            <surname>Gomez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.K.</given-names>
            <surname>Nam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Satyanarayan</surname>
          </string-name>
          ,
          <article-title>Beyond Expertise and Roles: A Framework to Characterize the Stakeholders of Interpretable Machine Learning and their Needs</article-title>
          . arXiv:2101.09824 [cs],
          <year>2021</year>
          . https://doi.org/10.1145/3411764.3445088.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>T.</given-names>
            <surname>Stadelmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Stockinger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Heinatz Bürki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Braschler</surname>
          </string-name>
          ,
          <article-title>Data Scientists</article-title>
          . In: M. Braschler, T. Stadelmann, K. Stockinger (eds.)
          <source>Applied Data Science: Lessons Learned for the Data-Driven Business</source>
          , Springer International Publishing, Cham,
          <year>2019</year>
          , pp.
          <fpage>31</fpage>
          -
          <lpage>45</lpage>
          . https://doi.org/10.1007/978-3-030-11821-1_3.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>S.</given-names>
            <surname>Baškarada</surname>
          </string-name>
          , A. Koronios,
          <article-title>Unicorn data scientist: the rarest of breeds</article-title>
          .
          <source>Program</source>
          ,
          <volume>51</volume>
          ,
          <year>2017</year>
          , pp.
          <fpage>65</fpage>
          -
          <lpage>74</lpage>
          . https://doi.org/10.1108/PROG-07-2016-0053.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Zimmermann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>DeLine</surname>
          </string-name>
          , A. Begel,
          <article-title>Data Scientists in Software Teams: State of the Art and Challenges</article-title>
          .
          <source>IEEE Transactions on Software Engineering. 44</source>
          ,
          <year>2018</year>
          ,
          <fpage>1024</fpage>
          -
          <lpage>1038</lpage>
          . https://doi.org/10.1109/TSE.2017.2754374.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J.S.</given-names>
            <surname>Saltz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.W.</given-names>
            <surname>Grady</surname>
          </string-name>
          ,
          <article-title>The ambiguity of data science team roles and the need for a data science workforce framework</article-title>
          .
          <source>In: 2017 IEEE International Conference on Big Data (Big Data)</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>2355</fpage>
          -
          <lpage>2361</lpage>
          . https://doi.org/10.1109/BigData.2017.8258190.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>A.Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mittal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Brooks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Oney</surname>
          </string-name>
          ,
          <article-title>How Data Scientists Use Computational Notebooks for Real-Time Collaboration</article-title>
          .
          <source>Proc. ACM Hum.-Comput. Interact., 3</source>
          ,
          <year>2019</year>
          , 39:1-39:30. https://doi.org/10.1145/3359141.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>A.X.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Muller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>How do Data Science Workers Collaborate? Roles, Workflows, and Tools</article-title>
          .
          <source>Proc. ACM Hum.-Comput. Interact., 4</source>
          ,
          <year>2020</year>
          , 022:1-022:23. https://doi.org/10.1145/3392826.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>N.</given-names>
            <surname>Boukhelifa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.-E.</given-names>
            <surname>Perrin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Huron</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Eagan</surname>
          </string-name>
          ,
          <article-title>How Data Workers Cope with Uncertainty: A Task Characterisation Study</article-title>
          .
          <source>In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems</source>
          , Association for Computing Machinery, New York, NY, USA,
          <year>2017</year>
          , pp.
          <fpage>3645</fpage>
          -
          <lpage>3656</lpage>
          . https://doi.org/10.1145/3025453.3025738.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>A.</given-names>
            <surname>Crisan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Fiore-Gartland</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tory</surname>
          </string-name>
          ,
          <article-title>Passing the Data Baton: A Retrospective Analysis on Data Science Work and Workers</article-title>
          .
          <source>IEEE Transactions on Visualization and Computer Graphics</source>
          ,
          <volume>27</volume>
          ,
          <year>2021</year>
          ,
          <fpage>1860</fpage>
          -
          <lpage>1870</lpage>
          . https://doi.org/10.1109/TVCG.2020.3030340.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>P.</given-names>
            <surname>Pereira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Cunha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.P.</given-names>
            <surname>Fernandes</surname>
          </string-name>
          ,
          <article-title>On Understanding Data Scientists</article-title>
          .
          <source>In: 2020 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC)</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>5</lpage>
          . https://doi.org/10.1109/VL/HCC50065.2020.9127269.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>D.-A.</given-names>
            <surname>Ho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Beyan</surname>
          </string-name>
          ,
          <article-title>Biases in Data Science Lifecycle</article-title>
          . arXiv:2009.09795 [cs],
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>H.</given-names>
            <surname>Kaur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Nori</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jenkins</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Caruana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wallach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Wortman</given-names>
            <surname>Vaughan</surname>
          </string-name>
          ,
          <article-title>Interpreting Interpretability: Understanding Data Scientists' Use of Interpretability Tools for Machine Learning</article-title>
          .
          <source>In: Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems</source>
          . Association for Computing Machinery
          , New York, NY, USA,
          <year>2020</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>14</lpage>
          . https://doi.org/10.1145/3313831.3376219.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>U.</given-names>
            <surname>Garczarek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Steuer</surname>
          </string-name>
          ,
          <article-title>Approaching Ethical Guidelines for Data Scientists</article-title>
          . In:
          <string-name>
            <surname>Bauer</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ickstadt</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lübke</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Szepannek</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trautmann</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Vichi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (eds.)
          <source>Applications in Statistical Computing: From Music Data Analysis to Industrial Quality Improvement</source>
          , Springer International Publishing, Cham,
          <year>2019</year>
          , pp.
          <fpage>151</fpage>
          -
          <lpage>169</lpage>
          . https://doi.org/10.1007/978-3-030-25147-5_10.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>S.</given-names>
            <surname>Passi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.J.</given-names>
            <surname>Jackson</surname>
          </string-name>
          ,
          <article-title>Trust in Data Science: Collaboration, Translation, and Accountability in Corporate Data Science Projects</article-title>
          .
          <source>Proc. ACM Hum.-Comput. Interact.</source>
          ,
          <volume>2</volume>
          ,
          <year>2018</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>28</lpage>
          . https://doi.org/10.1145/3274405.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>N.</given-names>
            <surname>Seaver</surname>
          </string-name>
          ,
          <article-title>Algorithms as culture: Some tactics for the ethnography of algorithmic systems</article-title>
          .
          <source>Big Data &amp; Society</source>
          ,
          <volume>4</volume>
          ,
          <elocation-id>2053951717738104</elocation-id>
          ,
          <year>2017</year>
          . https://doi.org/10.1177/2053951717738104.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>L.</given-names>
            <surname>Boltanski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Thévenot</surname>
          </string-name>
          ,
          <source>On Justification: Economies of Worth</source>
          . Princeton University Press, Princeton,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>