<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Tools/frameworks that support development process of AI-based software: validations in white literature</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Tugba</forename><surname>Gurgen Erdogan</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Computer Engineering Department</orgName>
								<orgName type="institution">Hacettepe University</orgName>
								<address>
									<addrLine>Beytepe Campus</addrLine>
									<postCode>06800</postCode>
									<settlement>Ankara</settlement>
									<country key="TR">Turkey</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Haluk</forename><surname>Altunel</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Computer Engineering Department</orgName>
								<orgName type="institution">Bilkent University</orgName>
								<address>
									<postCode>06800</postCode>
									<settlement>Bilkent</settlement>
									<country key="TR">Turkey</country>
								</address>
							</affiliation>
							<affiliation key="aff2">
								<orgName type="institution">Softtech Inc Hacettepe Teknokent</orgName>
								<address>
									<addrLine>6.arge, Beytepe</addrLine>
									<postCode>06800</postCode>
									<settlement>Ankara</settlement>
									<country key="TR">Turkey</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ayca</forename><forename type="middle">Kolukısa</forename><surname>Tarhan</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Computer Engineering Department</orgName>
								<orgName type="institution">Hacettepe University</orgName>
								<address>
									<addrLine>Beytepe Campus</addrLine>
									<postCode>06800</postCode>
									<settlement>Ankara</settlement>
									<country key="TR">Turkey</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Tools/frameworks that support development process of AI-based software: validations in white literature</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">8041F92080A10CEC3B719215020B7FB0</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T23:17+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Tool</term>
					<term>framework</term>
					<term>artificial intelligence</term>
					<term>machine learning</term>
					<term>software development</term>
					<term>development process</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Context: Artificial Intelligence (AI)-based software has gained increasing interest, especially in the last decade, due to advancements in underlying technologies and demands in varying business domains. With the proliferation to develop such software, there appears a need for developing methods and supporting tools/frameworks. Purpose: In this paper, we focus on tools/frameworks to automate AI-based software development process, from a holistic view. We review the scientific studies that were empirically validated and also evaluate their proposals with respect to basic characteristics including theme, research methods, types, domains, and a number of cases in empirical validations. Method: We elicit relevant studies (with the contribution type of tool or framework) from a larger set of primary studies identified by a systematic literature review on AI-based software development process. We select 14 primary studies in this context and analyze them with respect to the purposes of the proposals. Results: We review tools/frameworks that support AI-based software development process under four headings: software system development process, the development process of fair software, model development process, and model deployment and operation processes. We observe that domains of empirical validation are diverse while the number of empirical cases applied for validation is limited. Also, only half of the primary studies provide links to their proposals as open-source, which is very important for the repeatability of the empirical validations.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Almost every day, we hear about intelligent systems such as self-driving cars and unmanned aerial vehicles that come to life with artificial intelligence. In addition and in parallel to the era of digitalization, the demand for smart systems that analyze existing data and turn it into an advantage for institutions and customers in many business areas such as trade, health, finance, production, education, etc. is increasing. Accordingly, the growing scale of Internet-based systems and innovations in social platforms, autonomous systems and cloud computing, and IWSM-MENSURA <ref type="bibr">'22:</ref> The Joint Conference of the 31th International Workshop on Software Measurement (IWSM) and the 16th International Conference on Software Process and Product Measurement (MENSURA) <ref type="bibr">Sep 28-30, 2022</ref>, Izmir, Turkey * Corresponding author. tugba@cs.hacettepe.edu.tr (T. G. Erdogan); altunel@bilkent.edu.tr (H. Altunel); atarhan@cs.hacettepe.edu.tr (A. K. Tarhan) 0000-0003-1491-8739 (T. G. Erdogan); 0000-0003-1103-3644 (H. Altunel); 0000-0003-1466-9605 (A. K. Tarhan) the advancement of artificial intelligence (AI) and machine learning (ML) techniques in the last decade, have created a new specialty for the software industry: AI-based software system development.</p><p>Several studies have reported challenges about AI-based software development, more frequently for ML software <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b1">2,</ref><ref type="bibr" target="#b2">3]</ref> or deep learning (DL) software <ref type="bibr" target="#b3">[4]</ref>, and more specifically for development phases such as requirements <ref type="bibr" target="#b4">[5]</ref> or testing <ref type="bibr" target="#b5">[6]</ref>. In addition, large companies such as Google and Microsoft have shared their experiences of AI-based (or data-centric) software development in recent years. For example, Google reported that machine learning software used in everyday life will likely require ongoing maintenance costs <ref type="bibr" target="#b6">[7]</ref>. Microsoft reported about the challenges AI teams face in managing data, component abstraction, and reusing software, noting that customizing and reusing AI-based software requires different methods and skills than those applied in traditional software development <ref type="bibr" target="#b7">[8]</ref>. Consequently, development methods and supporting technologies must be adapted to meet the specific challenges of AI-based software engineering.</p><p>As a response to the need stated above, in this paper, we focus on and review in detail the studies in scientific (or white) literature, which propose the tools/frameworks to support the development process of AI-based software. We select 14 studies that have been empirically validated from a larger set of primary studies identified by a systematic literature review (SLR) on AI-based software development process. We review the studies by grouping them according to their purposes of proposals and evaluate their proposals with respect to basic characteristics including theme, research methods and types, domains and number of cases in empirical validations, contexts of empirical validations, and open links to the proposed tools/frameworks.</p><p>The rest of this paper is organized as follows. Section 2 provides a summary of the related studies. Section 3 overviews the study search and selection processes, and potential validity threats. Section 4 reviews the primary studies in detail, and Section 5 evaluates them with respect to basic characteristics of validation. Finally, Section 6 concludes the paper.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related Work</head><p>A list of related secondary studies, which include primary studies on tools/frameworks for AI-based software development, are provided in Table <ref type="table">1</ref>. As seen from the table, there is no study that solely investigates, reviews in detail, and evaluates the tools/frameworks which support the AI-based software development process. Therefore, ours is the first study that is intended for this specific purpose.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Methodology</head><p>First, we have applied guidelines to conduct systematic literature reviews in software engineering <ref type="bibr" target="#b16">[17]</ref>, and have searched and identified 85 primary studies that propose development models or processes for AI-based software. In search and study selection, we have not focused on specific phases of software development such as requirements or design; rather, we took a holistic view for the software development process.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 1</head><p>Related secondary studies Ref. Focus <ref type="bibr" target="#b8">[9]</ref> This survey study presents a comprehensive overview of recent and cutting-edge AI software, including open-source frameworks and libraries, which support the implementation of ML and DL algorithms. However, it does not specifically focus on the development process of these algorithms. <ref type="bibr" target="#b9">[10]</ref> This study mines both academic and grey literature, identifies 29 engineering best practices for ML applications, and conducts a survey among 300+ practitioners to determine the degree of adoption of these practices. Although it refers to a number of tools in both white and grey literature, its focus is not on tools/frameworks that support the development process, and it does not include a detailed review. <ref type="bibr" target="#b10">[11]</ref> This survey describes major research efforts where ML systems have been deployed at the edge of computer networks, focusing on the operational aspects including compression techniques, tools, frameworks, and hardware used in successful applications of intelligent edge systems. Yet, it does not focus on tools/frameworks to support the development or deployment processes. <ref type="bibr" target="#b11">[12]</ref> This study summarizes secondary data from non-grey and grey academic literature. It identifies, analyzes, and synthesizes the challenges of ML-enabled software development, and re-frames the traditional SE development process to engineer the ML software development process. However, it does not specifically review the primary studies that propose tools/frameworks to support the development process. <ref type="bibr" target="#b12">[13]</ref> This study presents the results from a systematic mapping study on the life cycle of AI models. It yields 405 primary studies, mapped in 5 different research topics. It reports that only a minority of publications focus on data management and model production problems, and that more studies should address the AI life cycle from a holistic perspective. This study does not have a specific focus on tools/frameworks that support the life cycle of AI models. <ref type="bibr" target="#b2">[3]</ref> This study identifies and synthesizes the current state of software engineering (SE) research for engineering ML systems by a systematic literature review of 141 primary studies. It highlights that SE aspects do not have a mature set of tools and techniques. It reviews in detail the primary studies under the knowledge areas of SWEBOK <ref type="bibr" target="#b13">[14]</ref>, including the ones that propose tools/frameworks in each area. However, only few of these primary studies support the development process of ML software. <ref type="bibr" target="#b14">[15]</ref> The study surveys existing efforts for DevOps, and categorizes them according to the life cycle stages that they contribute. It outlines directions for future work in quality-aware DevOps, in particular at AI for DevOps and DevOps for AI software. In terms of tools/frameworks, it refers to only one primary study which proposes a software architecture solution that can ensure the continuous development of computer vision applications. <ref type="bibr" target="#b15">[16]</ref> To synthesize knowledge on SE approaches for building, operating, and maintaining AI-based systems, this study presents results from a systematic mapping of 248 primary studies. It classifies their subjects to SWEBOK's knowledge areas, including the "Software Engineering Process" area and its supporting tools/frameworks. However, it only refers to these studies in the classification scheme and does not include a detailed review.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Research Questions</head><p>Research questions (RQs) of this review study are described below: </p><formula xml:id="formula_0">RQ1. What</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Search strategy and study selection</head><p>Search sentences that were run in digital libraries as of January 2022 are listed below:</p><p>1. ("artificial intelligence software" OR "AI software" OR "machine learning software" OR "ML software") AND ("development approach" OR "development process" OR "development model" OR "development life-cycle") 2. ("artificial intelligence applications" OR "AI applications" OR "machine learning applications" OR "ML applications") AND ("development approach" OR "development process" OR "development model" OR "development life-cycle") 3. ("artificial intelligence software" OR "AI software" OR "machine learning software" OR "ML software") AND ("software development" OR "application development") AND ("approach" OR "process" OR "model" OR "life-cycle") 4. ("artificial intelligence applications" OR "AI applications" OR "machine learning applications" OR "ML applications") AND ("software development" OR "application development") AND (approach OR process OR model OR life-cycle) 5. ("artificial intelligence software" OR "AI software" OR "machine learning software" OR "ML software") AND (maturity OR capability) AND ("software development" OR "software engineering") AND (approach OR process OR model OR life-cycle) 6. ("artificial intelligence applications" OR "AI applications" OR "machine learning applications" OR "ML applications") AND (maturity OR capability) AND ("software development" OR "software engineering") AND (approach OR process OR model OR life-cycle) 7. ("artificial intelligence software" OR "AI software" OR "machine learning software" OR "ML software") AND (challenges OR difficulties OR problems) AND ("software development" OR "software engineering") 8. ("artificial intelligence applications" OR "AI applications" OR "machine learning applications" OR "ML applications") AND (challenges OR difficulties OR problems) AND ("software development" OR "software engineering")</p><p>The number of studies retrieved from the digital libraries by these search sentences are given in Table <ref type="table" target="#tab_1">2</ref>. While identifying and selecting the primary studies, we have eliminated studies applying inclusion and exclusion criteria in the order of the digital libraries shown in the table. The inclusion criteria are: I1) papers focus on the holistic view of software development process rather than specific phases of software development; I2) papers are in the area of both software engineering and AI. The exclusion criteria are: E1) publication language being other than English; E2) full-text not being available; E3) books and theses; E4) duplicate results from different search methods, and E5) papers are explicitly short (e.g., less than six pages).</p><p>Until now, we have identified contributions made and research methods <ref type="bibr" target="#b17">[18]</ref> employed in the primary studies, which are listed in Table <ref type="table" target="#tab_2">3</ref>. Full details can be found in the study's transparent online spreadsheet (https://tinyurl.com/ytnxhhuy). Here, we should note that we are still in the data extraction stage for the larger pool, and present the initial results from it on the basis of supporting tools/frameworks in this paper. (1) 5 (excluded in this paper),</p><p>In order to construct the study pool in line with the purpose of this paper, we have selected 14 primary studies that have a contribution type of tool or framework and a research method type of validation or evaluation. In validation research, techniques investigated are novel and have not yet been implemented in practice (used for example experiments, i.e. work done in the lab) while in evaluation research, techniques are implemented in practice (i.e. solution implementation) and an evaluation of the technique is conducted in terms of benefits and drawbacks (i.e. implementation evaluation) <ref type="bibr" target="#b18">[19]</ref>. The reason for selecting a subset of primary studies with the stated characteristics is that, as we already mentioned in the introduction section, we would like to identify, review and evaluate the empirically validated tools/frameworks that support the development process of AI-based software.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">Potential threats to validity</head><p>A number of validity threats are of concern for this study as adopted from <ref type="bibr" target="#b19">[20,</ref><ref type="bibr" target="#b20">21]</ref> in terms of construct, internal, external and conclusion validity. The guideline for conducting SLRs has been followed and a research protocol has been developed and reviewed to cope with construct validity. One threat of this type regarding the applied protocol can be excluding the studies for the specific phases of software development, such as requirements and design. However, we have done this purposefully, since the subject area is quite broad and we have considered the software development process as a whole. With this strategy applied, we have not only retrieved the studies that propose development models or processes as a primary contribution but also the studies that embed such contributions secondarily into their AI-based software development efforts. Another threat of construct validity is related to the research method (i.e. SLR) that we have employed in this study. Including the grey literature in addition to the white literature could have elicited further proposals of tools/frameworks for the AI-based software development process. We claim this threat so that it can be considered while reading our findings and also be addressed in future studies. In terms of internal validity, we have applied independent voting among authors for inclusion/exclusion of the studies and for quality assessment of initially included studies. Additionally, we have carried out pilot data extraction and have conducted regular meetings to ensure consistency and commonality during data extraction. Regarding external validity, we admit that the results we obtained are valid only for the primary study pool that we have created, and it cannot be generalized in other contexts. Finally, in terms of conclusion validity, we have made several meetings to analyze and synthesize the results of data extraction. Since we have used a standardized format (i.e. Google Sheet) in data analysis, we believe other researchers with the same study pool would come up with similar results.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Review of Tools/Frameworks</head><p>In this section, we review the selected primary studies on tools/frameworks that support AI-based software development process by grouping the studies under four headings. We performed thematic synthesis <ref type="bibr" target="#b21">[22]</ref> to group the studies as: software system development process, development process of fair software, model development process, and model deployment and operation processes. The studies falling under these headings are elaborated in the following subsections.</p><p>Full details with respect to the research questions raised can be found in the study's online spreadsheet (https://tinyurl.com/ytnxhhuy) in the following columns, respectively: RQ1: Type (AD); RQ2: Purpose (AO), theme (AR), domain (AV); RQ3: Link to download (AQ); RQ4: Validation type (Z, AA), research methods (AB-AK), context (AV).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Tools/Frameworks That Support Software System Development Process</head><p>The primary studies grouped under this heading are summarized in Table <ref type="table">4</ref>, each with its title, purpose and scope. As seen from the table, these three studies have different concerns, which indicates that there is a diversity of the studies in this field. The first one is related to health informatics, and we know there is a trend in applying AI techniques in this domain to reveal important insights about healthcare data and healthcare processes. The second one is about deep learning-based project development using the principles of MDE (Model Driven Engineering) to solve the challenges to build the deep learning-based project development, particularly for neural networks which have complicated architecture and dependencies. The third one is seeking solutions for the scalability and performance problems for the development of conversational AI software like mobile software, and proposes a framework inspiring Cell theory and BDI (Belief-Desire-Intention) software model. These proposals can be considered as example solutions to common problems in AI software development, and as the trending topics for the researchers.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 4</head><p>Studies that support software system development process</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Study, Year</head><p>Purpose and Scope P1 <ref type="bibr" target="#b22">[23]</ref>, 2020 -It investigates the interaction between software engineering and machine learning within the context of health systems.</p><p>-It proposes SEMLHI framework (the framework and methodology of software engineering for ML in health informatics) that includes four modules: software, machine learning, machine learning algorithms, and health informatics data.</p><p>-The SEMLHI methodology includes seven phases: designing, implementing, maintaining and defining workflows; structuring information; ensuring security and privacy; performance testing and evaluation; and releasing the software applications. -CellS framework has five important components: anima, delibera, cell, plan and ligand.</p><p>-It is used in a resources-constrained mobile system for conversational AI software like a software dialogue system.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Tools/Frameworks That Support Development Process of Fair Software</head><p>The primary studies grouped under this heading are summarized in Table <ref type="table">5</ref>. It is apparent from the table that most of the studies focus on bias mitigation in ML software. Measurement of bias is handled in a maturity framework from the organizational level by applying ethical guidelines and standards, and fairness measurement is handled through a scalable measurement system for computing fairness metrics. It is obvious that ethical AI activities should be addressed with bias detection, bias/fair measurement, and bias mitigation methods starting from the initial stages of the AI development life cycle.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 5</head><p>Studies that support development process of fair software</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Study, Year</head><p>Purpose and Scope P3 <ref type="bibr" target="#b25">[26]</ref>, 2019 -It presents a maturity framework based on ethical principles and best practices to evaluate the organization capability in order to effectively govern AI bias.</p><p>-It provides a measurement of bias governance capability maturity of the organizations by separating the scoring in two sections as "consideration" and "action". P8 <ref type="bibr" target="#b26">[27]</ref>, 2021 -It introduces Fairea, a model behaviour mutation approach for benchmarking and quantitatively evaluating bias mitigation methods for ML software.</p><p>-It includes three bias mitigation approaches: pre-processing, in-processing and postprocessing.</p><p>-There are three primary steps: baseline creation with model behaviour mutation, bias mitigation effectiveness region division, and quantitative evaluation of trade-off effectiveness. P9 <ref type="bibr" target="#b27">[28]</ref>, 2020 -It proposes a method named Fairway for bias detection and bias mitigation in binary classification models to remove ethical bias from training data and trained models.</p><p>-This handy tool combines two bias mitigation approaches: pre-processing (before model training) and in-processing (while model training). P11 <ref type="bibr" target="#b28">[29]</ref>, 2020 -It presents LinkedIn Fairness Toolkit (LiFT), a framework for scalable computation of fairness metrics as part of ML systems.</p><p>-It highlights the key requirements in deployed settings, and presents the design of a fairness measurement system. -LiFT comprises bias measurement and mitigation components that can be integrated into different stages of an ML training and serving system.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3.">Tools/Frameworks To Support Model Development Process</head><p>The primary studies grouped under this heading are summarized in Table <ref type="table" target="#tab_4">6</ref>. As seen from the table, even though introduced models have different specific targets, they all have the common focus on data contribution in model development. The first model focuses on the data scientists' cognitive workflows that include data collection. The second model introduces a validation framework for data. The last model provides an open source library with different data modalities. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.4.">Tools/Frameworks That Support Model Deployment and Operation Processes</head><p>The primary studies grouped under this heading are summarized Table <ref type="table" target="#tab_5">7</ref>. It is apparent from the table that all studies for model deployment and operation take software, data and its machine learning models into account together. That means pipeline and deployment of software are not independent of data and machine learning models. In addition to that, the use cases are limited and only selected models are used. They need to be applied in different domains with real use cases to get wider insight for further usage areas.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Evaluation of Tools/Frameworks</head><p>After grouping and reviewing the primary studies in the previous section, we further evaluate their proposals with respect to basic characteristics of empirical validations, which include research method and type, domain and number of empirical cases, context of empirical cases, and link to proposed tool/framework. Table <ref type="table">8</ref> presents these characteristics for all the included primary studies in columns.</p><p>As seen from the Table <ref type="table">8</ref>, the evaluation type of research ( <ref type="formula">8</ref>) is slightly more than validation type <ref type="bibr" target="#b5">(6)</ref> in the primary studies. Case study research (in eight studies) and lab experiments (in six studies) are the most frequently used research methods. While domains of empirical validation are diverse, the number of empirical cases applied for validation is limited. Contexts of empirical validation are various in alignment with the diversity of the domains. Finally, we see that only half <ref type="bibr" target="#b6">(7)</ref> of the primary studies provide links to their proposals as open source, which is very important for the repeatability of the empirical validations as well as for further validation studies by other researchers or practitioners. -The infrastructure is CHASE-CI for managing fast GPU appliances for machine learning and storage managed through Kubernetes on the high-speed.</p><p>-It presents a software containerization approach and libraries for turning into a distributed computer for big data analysis.</p><p>-It explains a use case in earth science phenomena with object segmentation workflow.</p><p>In terms of the quantity of the cases included in empirical validation, the following studies come to the fore, only the first one employing validation type research: P2 <ref type="bibr" target="#b32">[33]</ref> (for model deployment and operation processes) with seven cases from software intensive embedded system domain, P9 <ref type="bibr" target="#b27">[28]</ref> (for development process of fair software) with five cases regarding five datasets used from UC Irvine ML Repository, and P13 <ref type="bibr" target="#b31">[32]</ref> (for model development process) with five cases from mixed domains including telemetry, healthcare, energy, water supply, and data science.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Conclusion</head><p>In this study, we reviewed tools/frameworks within white literature in a systematic way for tools/frameworks that support the AI-based software development process from a holistic perspective. We selected and analyzed 14 primary studies and we grouped them under four headings: software system development process, the development process of fair software, model development process, and model deployment and operation processes. Within these four groups, the scopes of the studies are seen as diversified, and data and model dependency of the tools or frameworks is observed such as P1 proposing a framework for health informatics whereas P5 is focusing on mobile systems. Additionally, data and machine learning models have effects on the software development process, and only half of the studies include the links to their proposals. On the other hand, more than half of the studies are presented with evaluations while the rest of them use validation type research.</p><p>These findings underline the fact that the AI-based software development process is an emerging field of research and needs further investigation. Furthermore, some of the proposed tools or frameworks need to be sharpened and to be validated in different domains in various industrial contexts. For researchers and practitioners, the next step can be the comparison of new tools/frameworks with existing models in industry in terms of performance and quality metrics. Additionally, a comparison of the tools/frameworks that support traditional software development processes may provide valuable findings in the maturation of the overall software development in the AI/ML area. As the last point, practitioners in industry can test the seven of the tools/frameworks with their access links given in Table <ref type="table">8</ref>, evaluate their real-life performances, and share their results publicly as the feedback to the community as well as the researchers who introduced them.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>-</head><label></label><figDesc>It supports deep learning based project development. -It proposes an ML-oriented artifact model inspiring Model Driven Development (MDE) and the evaluation of the proposed concepts are combined into a Maven based build infrastructure. -A Maven plugin is developed to automate build which provides install and deploy goals for all kinds of archives of the proposed artifact model. P5[25], 2021 -It presents the CellS framework to improve smart software development on multicore mobile processor systems.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head></head><label></label><figDesc>is the type of the proposed tool/framework? (plugin, library, toolkit, etc.) RQ2. For what purpose, theme (data, model, maturity, etc.) or domain (e.g., health) has the tool/framework been proposed? RQ3. Does the proposal include a tool? If yes, is the tool accessed publicly or is it open-source? If yes, what is its url? RQ4. Has the proposal been validated (with weak or strong empirical study)? If yes, what are the research methods (case study, survey etc.) used in and the context (scope, etc.) of validation?</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>Number of primary studies retrieved and selected</figDesc><table><row><cell>Sentence</cell><cell></cell><cell cols="4">Number of initially retrieved studies</cell><cell></cell><cell>Number of selected</cell><cell>Number of selected pri-</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>studies for</cell><cell>mary stud-</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>AI-based</cell><cell>ies to review</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>software de-</cell><cell>in this paper</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>velopment</cell><cell></cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>process</cell><cell></cell></row><row><cell></cell><cell>Web</cell><cell cols="2">Scopus IEEE</cell><cell>Springer</cell><cell>ACM</cell><cell>Google</cell><cell></cell><cell></cell></row><row><cell></cell><cell>of Sci-</cell><cell></cell><cell>Xplore</cell><cell>Link</cell><cell>DL</cell><cell>Scholar</cell><cell></cell><cell></cell></row><row><cell></cell><cell>ence</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>S1</cell><cell>5</cell><cell>116</cell><cell>706</cell><cell>196</cell><cell>128</cell><cell>3690</cell><cell>17</cell><cell>2</cell></row><row><cell>S2</cell><cell>26</cell><cell>52</cell><cell>302</cell><cell>141</cell><cell>259</cell><cell>7760</cell><cell>20</cell><cell>3</cell></row><row><cell>S3</cell><cell>22</cell><cell>33</cell><cell>237</cell><cell>345</cell><cell>215</cell><cell>6190</cell><cell>12</cell><cell>2</cell></row><row><cell>S4</cell><cell>33</cell><cell>85</cell><cell>27</cell><cell>915</cell><cell>408</cell><cell>10100</cell><cell>4</cell><cell>1</cell></row><row><cell>S5</cell><cell>5</cell><cell>8</cell><cell>44</cell><cell>395</cell><cell>232</cell><cell>3930</cell><cell>13</cell><cell>5</cell></row><row><cell>S6</cell><cell>4</cell><cell>9</cell><cell>8</cell><cell>1343</cell><cell>428</cell><cell>7540</cell><cell>4</cell><cell>0</cell></row><row><cell>S7</cell><cell>66</cell><cell>43</cell><cell>762</cell><cell>997</cell><cell>357</cell><cell>8370</cell><cell>10</cell><cell>1</cell></row><row><cell>S8</cell><cell>34</cell><cell>81</cell><cell>90</cell><cell>4065</cell><cell>609</cell><cell>14800</cell><cell>5</cell><cell>0</cell></row><row><cell>Total</cell><cell>15</cell><cell>8</cell><cell>15</cell><cell>4</cell><cell>38</cell><cell>5</cell><cell>85</cell><cell>14</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3</head><label>3</label><figDesc>Types of contributions and research methods used in classifying primary studies</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Types of contributions Types of research No.of studies per research method in this paper</head><label></label><figDesc></figDesc><table><row><cell>Tool/Framework</cell><cell>(1) Solution proposal,</cell></row><row><cell>Tasks/Phases</cell><cell>(2) Validation research,</cell></row><row><cell>Workflow</cell><cell>(3) Evaluation research,</cell></row><row><cell>Taxonomy</cell><cell>(4) Experience paper.</cell></row><row><cell>Guideline</cell><cell></cell></row><row><cell>List of challenges</cell><cell></cell></row><row><cell>Solutions-to challenges</cell><cell></cell></row><row><cell>Language</cell><cell></cell></row><row><cell>Lessons learned</cell><cell></cell></row><row><cell>List of best practices</cell><cell></cell></row><row><cell>Empirical study only</cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 6</head><label>6</label><figDesc>Studies that support model development process It provides guidance to develop, and metrics to evaluate. -It comes up with Machine Learning (ML) Bazaar as open source libraries for components to create general-purpose, multi-task, end-to-end AutoML systems that provide solutions to different data modalities (image, text, graph, tabular, relational, etc.) and problem types (classification, regression, anomaly detection, graph matching, etc.).</figDesc><table><row><cell cols="2">Study Purpose and Scope</cell></row><row><cell>P6[30],</cell><cell>-It focuses on observing and analyzing data scientists' cognitive workflows as they</cell></row><row><cell>2021</cell><cell>develop predictive models.</cell></row><row><cell></cell><cell>-It proposes DSWorkFlow that covers data collection,workflow reconstruction, and</cell></row><row><cell></cell><cell>feature extraction stages.</cell></row><row><cell>P12[31],</cell><cell>-It helps in systematizing the adoption of data validation processes and tools in industrial</cell></row><row><cell>2021</cell><cell>ML projects.</cell></row><row><cell></cell><cell>-It introduces a data validation framework (DVF) that includes validation process,</cell></row><row><cell></cell><cell>validation artifacts, data validation types, data validation tool setup, and feedback and</cell></row><row><cell></cell><cell>mitigation strategy.</cell></row><row><cell>P13[32],</cell><cell>-</cell></row><row><cell>2020</cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table 7</head><label>7</label><figDesc>Studies that support model deployment-operation processes</figDesc><table><row><cell>Study,</cell><cell>Purpose and Scope</cell></row><row><cell>Year</cell><cell></cell></row><row><cell>P2[33],</cell><cell>-It proposes a framework for selecting a certain architecture.</cell></row><row><cell>2020</cell><cell>-It outlines the trade-off between device cost, model performance, and data privacy in</cell></row><row><cell></cell><cell>selection of architecture alternatives.</cell></row><row><cell></cell><cell>-Total of five different architectural alternatives exist from centralized cloud to fully</cell></row><row><cell></cell><cell>decentralized edge architectures. It explains seven case companies in the embedded</cell></row><row><cell></cell><cell>system domain.</cell></row><row><cell>P7[34],</cell><cell>-It presents a framework for supporting collaborative data science development and</cell></row><row><cell>2021</cell><cell>open-source feature engineering.</cell></row><row><cell></cell><cell>-The framework's name is Ballet that includes feature definition, feature engineering</cell></row><row><cell></cell><cell>pipeline, feature API validation, ML performance validation, and project management</cell></row><row><cell></cell><cell>capabilities.</cell></row><row><cell></cell><cell>-It presents two use cases, first one is in disease prediction and the other one is in income</cell></row><row><cell></cell><cell>prediction.</cell></row><row><cell>P10[35],</cell><cell></cell></row><row><cell>2020</cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_6"><head>-</head><label></label><figDesc>It introduces a development environment namely Gestalt that allows developers to implement a classification pipeline, analyzes data as it moves through that pipeline, and supports easy transition between implementation and experiment.-It explains 2 use cases: sentiment analysis, gesture recognition.-It underlines the contributions of flexibility and visualization.</figDesc><table><row><cell>P14[36],</cell><cell>-It introduces a cognitive hardware and software composed infrastructure.</cell></row><row><cell>2019</cell><cell></cell></row></table></figure>
		</body>
		<back>
			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0" />			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">A taxonomy of software engineering challenges for machine learning systems: An empirical investigation</title>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">E</forename><surname>Lwakatare</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Raj</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bosch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">H</forename><surname>Olsson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Crnkovic</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Agile Software Development</title>
				<meeting><address><addrLine>Cham</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="227" to="243" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Sofware engneering challenges for machine learning applications: A literature review</title>
		<author>
			<persName><forename type="first">F</forename><surname>Kumeno</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Intelligent Decision Technologies</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="page" from="463" to="476" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">A software engineering perspective on engineering machine learning systems: State of the art and challenges</title>
		<author>
			<persName><forename type="first">G</forename><surname>Giray</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Systems and Software</title>
		<imprint>
			<biblScope unit="volume">180</biblScope>
			<biblScope unit="page">111031</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Software engineering challenges of deep learning</title>
		<author>
			<persName><forename type="first">A</forename><surname>Arpteg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Brinne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Crnkovic-Friis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bosch</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">44th Euromicro Conference on Software Engineering and Advanced Applications (SEAA), IEEE</title>
				<imprint>
			<date type="published" when="2018">2018. 2018</date>
			<biblScope unit="page" from="50" to="59" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Car, Requirements engineering challenges in building ai-based complex systems</title>
		<author>
			<persName><forename type="first">H</forename><surname>Belani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Vukovic</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">2019 IEEE 27th International Requirements Engineering Conference Workshops (REW), IEEE</title>
				<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="252" to="255" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">Software testing: Issues and challenges of artificial intelligence &amp; machine learning</title>
		<author>
			<persName><forename type="first">K</forename><surname>Sugali</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2021">2021</date>
			<publisher>IJAIA</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Hidden technical debt in machine learning systems</title>
		<author>
			<persName><forename type="first">D</forename><surname>Sculley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Holt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Golovin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Davydov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Phillips</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Ebner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Chaudhary</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Young</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J.-F</forename><surname>Crespo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Dennison</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Advances in neural information processing systems</title>
		<imprint>
			<biblScope unit="volume">28</biblScope>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Software engineering for machine learning: A case study</title>
		<author>
			<persName><forename type="first">S</forename><surname>Amershi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Begel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Bird</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Deline</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Gall</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Kamar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Nagappan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Nushi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Zimmermann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP)</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2019">2019. 2019</date>
			<biblScope unit="page" from="291" to="300" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Machine learning and deep learning frameworks and libraries for large-scale data mining: a survey</title>
		<author>
			<persName><forename type="first">G</forename><surname>Nguyen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Dlugolinsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bobák</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Tran</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Á</forename><surname>López</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>García</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Heredia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Malík</surname></persName>
		</author>
		<author>
			<persName><surname>Hluchỳ</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Artificial Intelligence Review</title>
		<imprint>
			<biblScope unit="volume">52</biblScope>
			<biblScope unit="page" from="77" to="124" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Adoption and effects of software engineering best practices in machine learning</title>
		<author>
			<persName><forename type="first">A</forename><surname>Serban</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Van Der Blom</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Hoos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Visser</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 14th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM)</title>
				<meeting>the 14th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM)</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="1" to="12" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Machine learning at the network edge: A survey</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">S</forename><surname>Murshed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Murphy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Hou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Khan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Ananthanarayanan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Hussain</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Computing Surveys (CSUR)</title>
		<imprint>
			<biblScope unit="volume">54</biblScope>
			<biblScope unit="page" from="1" to="37" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Software engineering for data mining (ml-enabled) software applications</title>
		<author>
			<persName><forename type="first">S</forename><surname>Saeed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">M</forename><surname>Abubakar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Karabatak</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">2021 9th International Symposium on Digital Forensics and Security (ISDFS), IEEE</title>
				<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="1" to="9" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Systematic mapping study on the machine learning lifecycle</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Xie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Cruz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Heck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">S</forename><surname>Rellermeyer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE/ACM 1st Workshop on AI Engineering-Software Engineering for AI (WAIN)</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2021">2021. 2021</date>
			<biblScope unit="page" from="70" to="73" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Abran</surname></persName>
		</author>
		<title level="m">Swebok, Guide to the Software Engineering Body of Knowledge</title>
				<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Quality-aware devops research: Where do we stand?</title>
		<author>
			<persName><forename type="first">A</forename><surname>Alnafessah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">U</forename><surname>Gias</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Zhu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Casale</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Filieri</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="page" from="44476" to="44489" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Software engineering for ai-based systems: a survey</title>
		<author>
			<persName><forename type="first">S</forename><surname>Martínez-Fernández</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bogner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Franch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Oriol</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Siebert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Trendowicz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Vollmer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Wagner</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Transactions on Software Engineering and Methodology (TOSEM)</title>
		<imprint>
			<biblScope unit="volume">31</biblScope>
			<biblScope unit="page" from="1" to="59" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<author>
			<persName><forename type="first">B</forename><surname>Kitchenham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Charters</surname></persName>
		</author>
		<title level="m">Guidelines for performing systematic literature reviews in software engineering</title>
				<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Requirements engineering paper classification and evaluation criteria: a proposal and a discussion</title>
		<author>
			<persName><forename type="first">R</forename><surname>Wieringa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Maiden</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Mead</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Rolland</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Requirements engineering</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page" from="102" to="107" />
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Systematic mapping studies in software engineering</title>
		<author>
			<persName><forename type="first">K</forename><surname>Petersen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Feldt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mujtaba</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Mattsson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">12th International Conference on Evaluation and Assessment in Software Engineering (EASE)</title>
				<imprint>
			<date type="published" when="2008">2008</date>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="page" from="1" to="10" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<author>
			<persName><forename type="first">C</forename><surname>Wohlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Runeson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Höst</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">C</forename><surname>Ohlsson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Regnell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Wesslén</surname></persName>
		</author>
		<title level="m">Experimentation in software engineering</title>
				<imprint>
			<publisher>Springer Science &amp; Business Media</publisher>
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">A map of threats to validity of systematic literature reviews in software engineering</title>
		<author>
			<persName><forename type="first">X</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Jin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Huang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">2016 23rd Asia-Pacific Software Engineering Conference (APSEC)</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="153" to="160" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Recommended steps for thematic synthesis in software engineering</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">S</forename><surname>Cruzes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Dyba</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">2011 international symposium on empirical software engineering and measurement</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="275" to="284" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">A novel software engineering approach toward using machine learning for improving the efficiency of health systems</title>
		<author>
			<persName><forename type="first">M</forename><surname>Moreb</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">A</forename><surname>Mohammed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Bayat</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page" from="23169" to="23178" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Artifact and reference models for generative machine learning frameworks and build systems</title>
		<author>
			<persName><forename type="first">A</forename><surname>Atouani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">C</forename><surname>Kirchhof</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Kusmenko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Rumpe</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 20th ACM SIGPLAN International Conference on Generative Programming: Concepts and Experiences</title>
				<meeting>the 20th ACM SIGPLAN International Conference on Generative Programming: Concepts and Experiences</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="55" to="68" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Cells: A cell-inspired efficient software framework for ai-enabled application on resources-constrained mobile system</title>
		<author>
			<persName><forename type="first">C.-H</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-C</forename><surname>Wu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Electronics</title>
		<imprint>
			<biblScope unit="volume">10</biblScope>
			<biblScope unit="page">568</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">An instrument to evaluate the maturity of bias governance capability in artificial intelligence projects</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">L</forename><surname>Coates</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Martin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IBM Journal of Research and Development</title>
		<imprint>
			<biblScope unit="volume">63</biblScope>
			<biblScope unit="page" from="7" to="8" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Fairea: A model behaviour mutation approach to benchmarking bias mitigation methods</title>
		<author>
			<persName><forename type="first">M</forename><surname>Hort</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">M</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Sarro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Harman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering</title>
				<meeting>the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="994" to="1006" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">Fairway: A way to build fair ml software</title>
		<author>
			<persName><forename type="first">J</forename><surname>Chakraborty</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Majumder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Menzies</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering</title>
				<meeting>the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="654" to="665" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">Lift: A scalable framework for measuring fairness in ml applications</title>
		<author>
			<persName><forename type="first">S</forename><surname>Vasudevan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Kenthapadi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 29th ACM International Conference on Information &amp; Knowledge Management</title>
				<meeting>the 29th ACM International Conference on Information &amp; Knowledge Management</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="2773" to="2780" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">Dsworkflow: A framework for capturing data scientists&apos; workflows</title>
		<author>
			<persName><forename type="first">M</forename><surname>Mash</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Rosenthal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Simmons</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems</title>
				<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="1" to="7" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">On the experiences of adopting automated data validation in an industrial machine learning project</title>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">E</forename><surname>Lwakatare</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Rånge</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Crnkovic</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bosch</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE/ACM 43rd International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP)</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2021">2021. 2021</date>
			<biblScope unit="page" from="248" to="257" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">The machine learning bazaar: Harnessing the ml ecosystem for effective system development</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Sala</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">M</forename><surname>Kanter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Veeramachaneni</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data</title>
				<meeting>the 2020 ACM SIGMOD International Conference on Management of Data</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="785" to="800" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">Ai deployment architecture: Multi-case study for key factor identification</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">M</forename><surname>John</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">H</forename><surname>Olsson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bosch</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">2020 27th Asia-Pacific Software Engineering Conference (APSEC)</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="395" to="404" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b33">
	<analytic>
		<title level="a" type="main">Enabling collaborative data science development with the ballet framework</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Cito</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Veeramachaneni</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Proceedings of the ACM on Human-Computer Interaction</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="1" to="39" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<analytic>
		<title level="a" type="main">Gestalt: integrated support for implementation and analysis in machine learning</title>
		<author>
			<persName><forename type="first">K</forename><surname>Patel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Bancroft</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">M</forename><surname>Drucker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Fogarty</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J</forename><surname>Ko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Landay</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 23nd annual ACM symposium on User interface software and technology</title>
				<meeting>the 23nd annual ACM symposium on User interface software and technology</meeting>
		<imprint>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="37" to="46" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b35">
	<analytic>
		<title level="a" type="main">Workflow-driven distributed machine learning in chase-ci: A cognitive hardware and software ecosystem community infrastructure</title>
		<author>
			<persName><forename type="first">I</forename><surname>Altintas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Marcus</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Nealey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">L</forename><surname>Sellars</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Graham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Mishin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Polizzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Crawl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Defanti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Smarr</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2019">2019. 2019</date>
			<biblScope unit="page" from="865" to="873" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
