=Paper=
{{Paper
|id=Vol-3745/paper21
|storemode=property
|title=Relationship between Team Diversity and Innovation Performance in Interdisciplinary Research Teams within the Field of Artificial Intelligence: Decision Tree Analysis
|pdfUrl=https://ceur-ws.org/Vol-3745/paper21.pdf
|volume=Vol-3745
|authors=Junwan Liu,Chenchen Huang,Shuo Xu
|dblpUrl=https://dblp.org/rec/conf/eeke/LiuHX24
}}
==Relationship between Team Diversity and Innovation Performance in Interdisciplinary Research Teams within the Field of Artificial Intelligence: Decision Tree Analysis==
Relationship between Team Diversity and Innovation Performance in Interdisciplinary Research Teams within the Field of Artificial Intelligence: Decision Tree Analysis Junwan Liu1,*, Chenchen Huang1, and Shuo Xu1 1 College of Economics and Management, Beijing University of Technology, Beijing, 100124, China Abstract Interdisciplinary research teams are crucial in solving complex problems by providing creative solutions that single- discipline teams cannot achieve. Previous studies have primarily focused on the linear relationship between independent variables and team innovation performance, neglecting the non-linear aspect. To address this gap, this paper examines the non-linear relationship between diverse factors and the innovation performance of interdisciplinary research teams in artificial intelligence. By utilizing the Classification and Regression Tree (CART) model, the study reveals that activity diversity and interdisciplinary research team innovation performance exhibit a U-shaped relationship in terms of “novelty” innovation performance. Furthermore, this relationship is influenced by research interest diversity. Specifically, low research interest diversity leads to low innovation performance as activity diversity increases. Meanwhile, research interest diversity emerges as the most critical factor impacting innovation performance. The importance of member diversity, institutional diversity, and activity diversity on innovation performance should not be ignored. Through decision tree analysis, this paper extends research on the multifactor combination, complex nonlinear relationships, and multipath influence mechanism of team diversity on interdisciplinary research teams’ innovation performance. Keywords Interdisciplinary research team, Team diversity, Innovation performance, Classification and regression tree (CART) model diversity and task-related diversity[6]. Demographic diversity 1. Introduction* encompasses variations in team members’ demographic attributes, Growing globalization and intense market competition have such as age, gender, and institutional backgrounds[7]. Task- transformed the scientific model and increased the number of related diversity pertains to the diverse qualities that team specialized research teams. In order to enhance the efficiency and members bring to their academic or professional pursuits, quality of scientific research, more research teams are including workplace functions, knowledge, and education[8]. transitioning from single teams to diversified teams[1]. Creating successful teams with demographic and task-related Interdisciplinary research has become a necessary choice in this diversity is not a straightforward process of simply combining context[2], allowing teams to draw upon a wide range of individuals from different disciplines. Horwitz et al.[6] discovered disciplines and expertise to address complex scientific questions. that while demographic diversity did not significantly impact team The diversity of team members from various disciplines and performance, task-related diversity positively influenced it. backgrounds contributes to greater knowledge and innovative Diverse teams struggle with issues such as gender differences, results[3]. Thus, effective interdisciplinary collaboration is crucial team conflict, and collaboration[9]. Given these contradictory for achieving scientific and innovative breakthroughs[4]. findings, our focus is on investigating the impact of both Previous research highlights the significance of diversity as a demographic diversity and task-related diversity on the innovation crucial factor influencing the success of interdisciplinary research performance of interdisciplinary research teams, while analyzing teams[5]. Diversity can be broadly categorized as demographic the varying importance of different diversity factors. Existing studies have primarily focused on exploring the linear relationship between independent variables and team innovation performance, Joint Workshop of the 5th Extraction and Evaluation of Knowledge overlooking the nonlinear aspect. The nonlinear relationship Entities from Scientific Documents and the 4th AI + Informetrics (EEKE- between these characteristics and team innovation performance, AII2024), April 23~24, 2024, Changchun, China and Online * Corresponding author. especially in the context of demographic diversity and task-related EMAIL: liujunwan@bjut.edu.cn (Junwan Liu) diversity in interdisciplinary research teams, remains unclear. © Copyright 2024 for this paper by its authors. Use permitted under Creative This paper aims to investigate the impact and decision-making Commons License Attribution 4.0 International (CC BY 4.0). mechanisms of team diversity on the innovation performance of interdisciplinary research teams. Firstly, team innovation CEUR ceur-ws.org Workshop ISSN 1613-0073 Proceedings 134 performance is divided into novelty and impact[10]. Second, we This paper focuses on empirical research in the field of will investigate the influence of team diversity on the artificial intelligence (AI). The dataset is derived from the interdisciplinary research team’s innovation performance in terms information of the most influential scholar award winners on the of both the demographic diversity and task-related diversity of AMiner website (https://www.aminer.cn/ai2000) 2023 AI 2000 team members. From a social categorization perspective, we annual list. There are three reasons for selecting these scholars as assume that gender diversity, national diversity, and institutional the subjects of the study. Firstly, AI research is inherently diversity are included in the demographic diversity in this context. interdisciplinary[11]. Second, since its launch in 2006, the Meanwhile, relying on the informational decision-making AMiner platform has already been used by many researchers[12]. perspective, we hypothesize that task-related diversity includes Third, since 2017, the AMiner platform has been publishing the sociability diversity, activity diversity, research interest diversity, annual AI 2000 most influential scholar list. The purpose of this and member diversity. Specifically, we address the following list is to annually rank the 2000 scholars who are expected to be research questions in this paper: RQ1: What is the complex highly cited in the field of AI over the next ten years (2020-2029). relationship structure among demographic diversity, task-related diversity, and the innovation performance of interdisciplinary 2.2 Team Recognition research teams? RQ2: What combinations of characteristics Firstly, we obtain the dataset from the AMiner website, which promote high levels of team innovation performance? RQ3: What covers information on the most influential scholars in the 2023 AI diversity characteristics should researchers focus on to enhance 2000 annual list. The dataset consists of 195 selected scholars and the innovation performance of interdisciplinary research teams? their collaborators, with five of these scholars receiving awards in two or more subfields. The public information of the selected 2. Data and methods scholars is obtained from their personal websites and academic This paper investigates the impact of team diversity on social networking platforms. The papers of selected scholars are interdisciplinary research teams’ innovation performance. The downloaded from the Web of Science database. Finally, 25,285 basic process is shown in Figure 1. Firstly, raw data is processed papers are submitted by 195 selected scholars. to form authors’ collaborative relationship data and measure each Second, we conduct community detection on the author co- author’s collaborative tie strength. Second, stable collaborative occurrence network for each selected scholar using the Louvain relationships are identified using a pre-set threshold (super tie). algorithm. This algorithm divides the nodes in the network into Then, members of interdisciplinary research teams are identified, different communities based on modularity metrics, which assess and the diversity index of each team is measured. Again, team the collaborative relationships between the nodes. The algorithm innovation performance is divided into novelty and impact to be identifies strong connections within the same community and measured. Finally, by using team diversity as the conditional sparser connections between different communities. In the attribute and innovation performance as the decision attribute, the network, each co-author is represented as a node, and edges impact of team diversity on the interdisciplinary research team’s represent collaborations between selected scholars and co-authors innovation performance is explored using the CART model. who have published papers together. Figure 2 shows the author Data Processing co-occurrence network for Silver, D selected scholars. Different Paper publication Collaborative tie colors in the figure represent various communities determined by data strength Collaborative Strong tie Interdisciplinary Research modularity, with the community to which Silver and other Author Networks Teams collaborative Super tie selected scholars belong shown in purple. relationship Team Diversity Innovation gender institutional national sociability performance diversity diversity diversity diversity activity member research interest diversity Novelty Impact diversity diversity CART Data Training Decision tree classification result x1 CART algorithm to obtain decision rules Decision tree classification result x2 What kind of diversity has an impact on the interdisciplinary research teams' innovation performance? ······ ······ Figure 2: Collaboration Network of Silver, D scholar Figure 1: Research Framework Next, we calculate the collaborative tie strength among the nodes in each community to filter out the core collaborators of 2.1 Data collection each selected scholar. Collaborative tie strength, also known as a “super tie”, has been extensively studied in scientific collaborative 135 networks[13]. It represents a long-term and stable collaboration, 35 similar to life partners, characterized by high intensity, close ties, 30 and long durations[14]. To identify the core collaborators among 25 Team Number the 195 selected scholars, we calculate the super tie for each 20 community. Specifically, when a member’s collaborative tie 15 strength exceeds his or her community’s super tie threshold, that 10 member is referred to as a super tie collaborator and core team member. Equations 1 and 2 demonstrate the formula for 5 calculating the super tie[14]. Additionally, we employ the 0 2 3 4 5 6 7 8 9 10 11 12 13 14 17 18 22 23 28 33 39 Anderson-Darling test to examine the distribution of collaborative Team Size intensity 𝐾𝑖𝑗 /𝐾𝑖 among the members. Our analysis indicates that the statistical distribution 𝑃(𝐾𝑖𝑗 ) of all members’ collaborative Figure 3: Size Distribution of Interdisciplinary Research Teams intensity conforms to an exponential distribution, with the average Table 1 collaborative strength of members being 2.83. Distribution of Members from Different Disciplinary Fields 〈𝐾𝑖 〉 = 𝑆𝑖−1 ∑𝑆𝑗=1 𝑖 𝐾𝑖𝑗 (1) Discipline Percentage of members 𝐾𝑖𝑐 = (〈𝐾𝑖 〉 − 1) ln 𝑆𝑖 (2) Computer and information science 71.18% Where the collaborative tie strength 𝐾𝑖𝑗 is defined as the Electrical engineering, electronic 21.80% engineering, information engineering cumulative number of papers co-authored by the selected scholar Environmental engineering 2.03% in community 𝑖 and scholar 𝑗 over the time between their first and Nano-technology 1.52% last paper. 𝑆𝑖 represents the number of different co-authors of Physical science 1.09% selected scholars in the community 𝑖. 〈𝐾𝑖 〉 represents the average Health science 0.94% collaborative tie strength 𝐾𝑖𝑗 . Each scholar 𝑗 with 𝐾𝑖𝑗 > 𝐾𝑖𝑐 is Clinical medicine 0.51% Mathematics 0.36% labelled as a super tie collaborator of community 𝑖. Materials engineering 0.22% Eventually, we identify 195 research teams and their 1,217 Medical engineering 0.22% core members. Figure 3 shows the distribution of team sizes for Basic medicine 0.14% 195 teams. The largest team size is 57 members, and 165 teams are smaller than 10 members, which represents 85% of all teams. 2.3 Variables Previous research defines interdisciplinary teams as groups of scientists from different disciplines who collaborate to address 2.3.1 Dependent variables complex problems[15]. To verify the interdisciplinarity of these We calculate the degree of team novelty using the novelty index teams, we utilize a method that maps member affiliations to proposed by Lee et al.[10]. This index measures the novelty of a disciplinary classifications[16] to more accurately determine the team’s paper based on the rarity of prior citation pairs. The disciplinary backgrounds of the members. Specifically, we extract calculation involves two steps. secondary institutions from each member’s address, retain the Complete the first step of the operation on the paper level. (1) disciplinary terms in the secondary institution names, and match List all paired reference combinations for each paper. (2) Record these terms to the discipline field in the OECD classification the corresponding journal pairs. (3) Aggregate the pairs of journal scheme. In this way, each member’s institution can be precisely combinations published from year t-2 to year t as 𝑈𝑡 set. The time matched to his or her research discipline. The results indicate that window from year t-2 to year t is chosen to ensure data robustness. 165 teams have members from two different disciplinary We then calculate the commonness value using Equation 3[10]. (4) backgrounds, 27 teams have members from three different This equation assigns each paper a range of commonness values. disciplinary backgrounds, and 3 teams have members from four The commonness values of each paper are ranked, and the 10th different disciplinary backgrounds, thus reinforcing that the 195 percentile is taken as the commonness value of the paper. Using teams in this study are interdisciplinary research teams. Table 1 the 10th percentile instead of the minimum value helps reduce demonstrates the distribution of members from different noise and increase the reliability of the measure. (5) The disciplinary fields. Specifically, 71.18% of the members are from commonness value is transformed using a natural logarithm to the field of computer and information science, 21.80% are from obtain an approximately normally distributed variable. The final the fields of electrical engineering, electronic engineering, and novelty value for that paper is obtained by adding a negative sign. information engineering, while other fields encompass 𝑜𝑏𝑠𝑒𝑟𝑣𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑝𝑎𝑖𝑟𝑠𝑖𝑗𝑡 𝑁𝑖𝑗𝑡 environmental engineering, nanotechnology, and physical Commonness𝑖𝑗𝑡 = = 𝑁𝑖𝑡 𝑁𝑗𝑡 = 𝑒𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑝𝑎𝑖𝑟𝑠𝑖𝑗𝑡 × ×𝑁𝑡 𝑁𝑡 𝑁𝑡 sciences, among others. To explore the factors influencing the 𝑁𝑖𝑗𝑡 ×𝑁𝑡 interdisciplinary research teams’ innovation performance, we (3) 𝑁𝑖𝑡 ×𝑁𝑗𝑡 download each team member’s papers from the Web of Science Where 𝑁𝑖𝑗𝑡 is the number of occurrences of journal pairs (i, j) database and collect 91,025 papers from all teams. in 𝑈𝑡 set. 𝑁𝑖𝑡 is the number of journal pairs in 𝑈𝑡 set that contain 136 journal 𝑖. 𝑁𝑗𝑡 is the number of journal pairs in 𝑈𝑡 set that contain To assess the diversity and differences in a team’s overall journal 𝑗, and 𝑁𝑡 is the number of all journal pairs in 𝑈𝑡 set. sociability and activity, we utilize Equation 4 to calculate the Calculating novelty at the team level is the next stage. The diversity of these two evaluation indicators. number of paper publications by each team is counted and divided The research interest diversity of scholar is based on the by the team size to calculate the team’s novelty value. breadth of the field of interest. Using the topic model, we identify We measure a team’s impact using forward citations[10]. each scholar’s field of study and assign their papers to relevant High-impact papers are defined as those in the top 1% of citation topics. The 𝑃𝐴 (t) topic distribution is obtained by Equation 7, and distribution. This definition follows Uzzi et al.[17] and considers the research interest diversity is defined as the threshold of the the use of short citation time windows can lead to the incorrect distribution of the 𝑃𝐴 (𝑡), which is calculated by Equation 8. #𝑝𝑎𝑝𝑒𝑟𝑠 𝑜𝑓 𝐴 𝑏𝑒𝑙𝑜𝑛𝑔 𝑡𝑜 𝑡𝑜𝑝𝑖𝑐 𝑡 identification of highly cited papers[18]. First, the process of 𝑃𝐴 (t) = (7) #𝑎𝑙𝑙 𝑝𝑎𝑝𝑒𝑟 𝑜𝑓 𝐴 identifying high-impact papers is completed at the paper level. (1) Rank all papers from highest to lowest citation count. (2) By using research interest diversity(A) = − ∑ 𝑃𝐴 (𝑡)𝑙𝑜𝑔𝑃𝐴 (𝑡) a five-year moving window, we define papers in the top 1% of the 𝑡∈𝑎𝑙𝑙 𝑡𝑜𝑝𝑖𝑐 𝑜𝑓 𝐴 rankings from year t-5 to year t as high-impact papers. (3) We use (8) a dummy variable to indicate whether each paper is a high-impact Member diversity refers to the extent of diversity in publication, assigning a value of 1 if it is and 0 otherwise. Next, collaborative relationships among team members. A low diversity the number of high-impact papers per team is counted and divided indicates frequent collaboration with the same co-authors, while a by the team size to finally obtain the team’s impact value. high diversity reflects collaboration with diverse co-authors, as calculated by Equation 9[23]. 𝑟𝑒𝑙𝑎𝑡𝑖𝑜𝑛𝑖 2.3.2 Independent variables member diversity = (9) 𝑐𝑜𝑎𝑢𝑡ℎ𝑜𝑟𝑖 (𝑛𝑜𝑛−𝑑𝑢𝑝𝑙𝑖𝑐𝑎𝑡𝑒) Gender diversity refers to the subjective or objective similarities Where 𝑟𝑒𝑙𝑎𝑡𝑖𝑜𝑛𝑖 refers to the number of collaborations in and differences between team members in terms of gender[19]. published journal articles, and 𝑐𝑜𝑎𝑢𝑡ℎ𝑜𝑟𝑖 (𝑛𝑜𝑛 − 𝑑𝑢𝑝𝑙𝑖𝑐𝑎𝑡𝑒) National diversity is defined as having team members from represents the total number of non-replicating co-authors. Table 2 different national backgrounds, which introduces sociological is the examples of co-authors’ member diversity of member A. categorization and the potential for diverse cognitive Table 2 perspectives[20]. Institutional diversity refers to the presence of a Examples of Co-authors’ Member Diversity of Member A variety of members in different institutions[21]. All of the above Article 1 Article 2 demographic diversity indicators mentioned above are measured Author A, B, C, D A, C, D using the Simpson index, which is calculated using Equation 4. Relation A-B, A-C, A-D A-C, A-D 𝑛 𝐻 = 1 − ∑𝑖=1 𝑃𝑖2 (4) Coauthor(non-duplicate) A, B, C, D Member diversity 5/4=1.25 Where 𝑛 is the total number of categories, 𝑃𝑖 is the percentage of members of the group 𝑖. The higher the 𝐻 value, the greater is the value of the diversity. 2.4 Data Characteristics and Analysis Using the AMiner platform, we algorithmically obtain data on Table 3 provides the results of the exploratory analysis, including sociability, activity, and research interest diversity indices to averages, mediums, minimum, and maximum values for assess the academic proficiency of members. Definitions and calculated indicators. Figure 4 and Figure 5 provide correlation formulas for these indicators are provided[22]. plots between diversity indicators and interdisciplinary research The sociability index is derived from considering both the teams’ innovation performance. There is no strong correlation number of scholars’ collaborators and their collaborative papers, between population diversity, task-related diversity and as shown in Equation 5. innovation performance. 𝑠𝑜𝑐𝑖𝑎𝑏𝑖𝑙𝑖𝑡𝑦(𝐴) = 1 + ∑𝑒𝑎𝑐ℎ 𝑐𝑜𝑎𝑢𝑡ℎ𝑜𝑟(𝑐) 𝑙 𝑛(#𝑐𝑜𝑝𝑎𝑝𝑒𝑟𝑐 ) (5) Where the #𝑐𝑜𝑝𝑎𝑝𝑒𝑟𝑐 is the number of papers co-authored between scholar and co-authors. The activity index measures a scholar’s frequency and number of recent publications, along with the significance of each paper, as shown in Equation 6. activity(A) = ∑𝑒𝑎𝑐ℎ 𝑦𝑒𝑎𝑟(𝑛)𝑖𝑛 𝑟𝑒𝑐𝑒𝑛𝑡 𝑁 𝑦𝑒𝑎𝑟𝑠 IS(𝐺𝑛 ) × weight(n) (6) Where, in 𝑛 years (𝑛 belongs to near 𝑁 years), 𝐺𝑛 is a group of papers published by scholars in n years, weight(n) = 𝛼 this year−n , and the following principles are applied to the values of 𝑛 and 𝛼: Figure 4: Correlation between Diversity Indicators and Team’s if the current month is in the first half of the year (month-0.40 > -0.31 - Low 29.00% 61.00% handling complex relationships. These models rely on the least Novelty - - - - > -0.40 <= -0.31 - High 12.00% 79.00% squares method, which requires rigorous hypothesis testing and - - - - <= -0.40 - - High 14.00% 66.00% variable control[26]. In the context of understanding how - <= -0.72 - - - <= 0.47 <= -0.08 Low 9.00% 78.00% diversity impacts the teams’ innovation performance, it is crucial - > -0.72 - - - <= 0.47 <= -0.08 High 31.00% 60.00% to consider multiple independent variables and potential interactions among them. Incorrect selection of control variables - - - - - > 0.47 <= -0.08 Low 7.00% 86.00% Impact or omission of important variables may lead to biased results in - - - - - <= -0.29 > -0.08 High 8.00% 93.00% regression analysis. In contrast, the CART model offers a more - - - - - (-0.29,0.08] > -0.08 Low 5.00% 70.00% flexible and robust solution[27]. It constructs a decision tree using - - - - - (0.08,0.47] > -0.08 High 19.00% 74.00% recursive binary splitting, dividing data subsets into smaller Firstly, in the “novelty”, interdisciplinary research teams have subsets[24]. This nonparametric approach eliminates the need for a higher proportion of high innovation performance ratings. In the 138 “impact”, there is a higher proportion of interdisciplinary teams to prioritize the diversity of institutions represented within the with a low innovation performance rating than in the “novelty” teams. innovation performance. Secondly, activity diversity and member diversity split as root nodes, which are key factors affecting the novelty and impact of a team, respectively. Third, the confidence coefficients for most of the decision rules are above 60%, indicating that the weight of the sample size supporting the ≤ > current decision rule in the leaf node’s sample size is 60% or more. This suggests that the results are highly interpretable. Figure 6 shows that there are a total of three rules to determine whether a team has high or low innovation performance. The CART model divides data into approximately two branches based on whether the most important feature “activity diversity” is less than or equal to -0.40. The results show that with lower activity diversity, team members can focus more on their original thinking, enhancing team innovation performance without the need to be concerned about publication frequency and quantity. When activity diversity is higher, an increase in research interest diversity contributes to teams achieving high levels of innovation Figure 7: Decision Tree for Team’s Impact performance. Horizontal comparison reveals that activity diversity is a crucial factor influencing innovation performance. Interdisciplinary research teams with low activity diversity are not 3.2 Feature Importance Analysis influenced by research interest diversity, whereas interdisciplinary The model’s final characteristic importance for explanatory research teams with high activity diversity are impacted by variables is presented in Figure 8. Among the factors affecting a research interest diversity. team’s novelty, research interest diversity has the highest characteristic importance of 0.97, while activity diversity has a characteristic importance of 0.03. Among the factors affecting a team’s impact, research interest diversity has the highest characteristic importance of 0.48. Member diversity and institutional diversity closely followed with 0.31 and 0.22, respectively. This result shows that research interest diversity is ≤ > most strongly associated with interdisciplinary research teams' innovation performance. This shows that team members' diverse expertise backgrounds enable knowledge integration and reconfiguration, which are crucial elements in innovation performance[21]. 0.03 Novelty AD 0 Impact MD 0 0.31 SD 0 0 0.97 Figure 6: Decision Tree for Team’s Novelty RID 0.48 Figure 7 shows six rules to determine whether a team has high ND 0 0 or low innovation performance. The CART model divides the data 0 ID 0.22 into two branches based on whether the most important feature 0 “member diversity” is less than or equal to -0.08. The results GD 0 reveal that interdisciplinary research teams with member diversity more than -0.08 need to control research interest diversity to 0.0 0.2 0.4 0.6 0.8 1.0 achieve high innovation performance. For interdisciplinary Figure 8: Characteristic Importance of Explanatory Variables research teams with member diversity smaller than -0.08, institutional diversity is a key influence on innovation 4. Conclusion performance. Horizontal comparison shows that member diversity is a key factor influencing innovation performance. This paper finds a U-shaped relationship between activity Interdisciplinary research teams with low member diversity tend diversity and team innovation performance in “novelty” innovation performance. However, this relationship is impacted by research interests diversity. Specifically, interdisciplinary 139 teams with low activity diversity are able to improve their [4] Salazar, M., Lant, T. (2018). Facilitating innovation in interdisciplinary teams: The role of leaders and integrative communication. Informing Science, 21, innovation performance independently of research interest 157. diversity. In contrast, low research interest diversity leads to low [5] Benoliel, P., Somech, A. (2015). The role of leader boundary activities in enhancing interdisciplinary team effectiveness. Small Group Research, 46(1), innovation performance when activity diversity increases. In 83-124. terms of “impact” innovation performance, increasing member [6] Horwitz, S. K., Horwitz, I. B. (2007). The effects of team diversity on team diversity and managing the range of research interests can be outcomes: A meta-analytic review of team demography. Journal of management, 33(6), 987-1015. beneficial. In addition, interdisciplinary research teams with low [7] Lee, D. S., Lee, K. C., Seo, Y. W. (2015). An analysis of shared leadership, member diversity need to focus on the institutional diversity of diversity, and team creativity in an e-learning environment. Computers in Human Behavior, 42, 47-56. team members, as institutional diversity has a positive impact on [8] Jackson, S. E., Joshi, A., Erhardt, N. L. (2003). Recent research on team and the team’s effectiveness. organizational diversity: SWOT analysis and implications. Journal of In the evaluation of various factors, research interest diversity Management, 29(6), 801-830. [9] Abramo, G., D’Angelo, C. A., Di Costa, F. (2018). The effects of gender, age emerges as the most significant determinant of innovation and academic rank on research diversification. Scientometrics, 114(2), 373- performance in interdisciplinary research teams. This implies a 387. [10] Lee, Y. N., Walsh, J. P., Wang, J. (2015). Creativity in scientific teams: close association between research interest diversity and the Unpacking novelty and impact. Research policy, 44(3), 684-697. team’s ability to innovate. Researchers with distinct research [11] Zhuang, Y., Cai, M., Li, X., Luo, X., Yang, Q., Wu, F. (2020). The next breakthroughs of artificial intelligence: The interdisciplinary nature of AI. themes bring a diverse range of knowledge and contribute to the Engineering, 6(3), 245. reconfiguration of knowledge by identifying and integrating [12] Wu, J., Ou, G., Liu, X., Dong, K. (2022). How does academic education insights from different fields[30]. background affect top researchers’ performance? Evidence from the field of artificial intelligence. Journal of Informetrics, 16(2), 101292. Managers should consider research team diversity when [13] Li, Y., Li, N., Guo, J., Li, J., Harris, T. B. (2018). A network view of advice- developing it. To create a healthy innovation environment, they giving and individual creativity in teams: A brokerage-driven, socially perpetuated phenomenon. Academy of Management Journal, 61(6), 2210- should pay attention to the heterogeneity of different 2229. organizational and disciplinary backgrounds to which team [14] Petersen, A. M. (2015). Quantifying the impact of weak, strong, and super members belong and strive to optimize the level of knowledge ties in scientific careers. Proceedings of the National Academy of Sciences, 112(34), E4671-E4680. diversity in the team. Furthermore, managers should encourage [15] Wagner, C. S., Roessner, J. D., Bobb, K., Klein, J. T., Boyack, K. W., Keyton, and promote activity diversity. Meanwhile, the costs of too much J., Rafols, I., Börner, K. (2011). Approaches to understanding and measuring interdisciplinary scientific research (IDR): A review of the literature. Journal research interest diversity need to be noted to avoid the of informetrics, 5(1), 14-26. phenomenon of “too much of a good thing”. This is because [16] Liu, X., Bu, Y., Li, M., Li, J. (2024). Monodisciplinary collaboration disrupts knowledge diversity among collaborative members in different science more than multidisciplinary collaboration. Journal of the Association for Information Science and Technology, 75(1), 59-78. research fields is often considered a double-edged sword. As [17] Uzzi, B., Mukherjee, S., Stringer, M., Jones, B. (2013). Atypical Wang et al have found, increasing knowledge diversity leads to a combinations and scientific impact. Science, 342(6157), 468-472. [18] Wang, J. (2014). Unpacking the Matthew effect in citations. Journal of decline in social influence after a certain peak[21]. Informetrics, 8(2), 329-339. Our study has limitations. Firstly, it focuses solely on the [19] Van Knippenberg, D., Schippers, M. C. (2007). Work group diversity. Annu. Rev. Psychol., 58, 515-541. AMiner platform in AI, limiting its scope and generalizability. [20] Ayub, N., Jehn, K. A. (2006). National diversity and conflict in multinational Future studies should broaden the research fields. Secondly, the workgroups: The moderating effect of nationalism. International Journal of sample of 195 interdisciplinary teams may not fully reflect AI Conflict Management, 17(3), 181-202. [21] Wang, G., Gan, Y., Yang, H. (2022). The inverted U-shaped relationship team diversity and complexity, potentially suffering from between knowledge diversity of researchers and societal impact. Scientific sampling error. A more representative sample is needed. Thirdly, Reports, 12(1), 18585. [22] Zhuo, L. I. N., Haohai, H. U. A. N. G. (2022). Top Experts Identification and while we focused on diversity within teams, future research could Evaluation of International Cooperation on Artificial Intelligence in China. explore diversity in other research team activities. Lastly, our Journal of Library and Information Science in Agriculture, 2022,34(1): 86-95. team recognition method overlooks member turnover dynamics. [23] Liao, C. H. (2011). How to improve research quality? Examining the impacts of collaboration intensity and member diversity in collaboration networks. Future studies should introduce a dynamic analysis of member Scientometrics, 86(3), 747-761. flow for more accurate core member identification. [24] Breiman, L. (1984). Classification and regression trees (1st ed.). Routledge. [25] Aboseif, E., Hanna, A. S. (2023). Defining the Success Status of Construction Projects Based on Quantitative Performance Metrics Thresholds. Journal of ACKNOWLEDGMENTS Management in Engineering, 39(2), 04022073. [26] Shmueli, G. (2010). To explain or to predict? Statistical Science, 25(3), 289- This work was supported partially by the National Natural 310. [27] Klusowski, J. M., Tian, P. M. (2024). Large scale prediction with decision Science Foundation of China (Grant No. 72174016). Our gratitude trees. Journal of the American Statistical Association, 119(545), 525-537. also goes to the anonymous reviewers and the editor for their [28] Pahmi, S., Saepudin, S., Maesarah, N., Solehudin, U. I. (2018). valuable comments. Implementation of CART (classification and regression trees) algorithm for determining factors affecting employee performance. In 2018 International Conference on Computing, Engineering, and Design (ICCED) (pp. 57-62). REFERENCES IEEE. [29] Chen, W., Xie, X., Wang, J., Pradhan, B., Hong, H., Bui, D. T., Duan, Z., Ma, [1] Dong, Y., Ma, H., Tang, J., Wang, K. (2018). Collaboration diversity and J. (2017). A comparative study of logistic model tree, random forest, and scientific impact. arXiv preprint arXiv,1806.03694. classification and regression tree models for spatial prediction of landslide [2] Van Noorden, R. (2015). Interdisciplinary research by the numbers. Nature, susceptibility. Catena, 151, 147-160. 525(7569), 306-307. [30] Zhang, L., Li, X. (2016). How to reduce the negative impacts of knowledge [3] Taylor, A., Greve, H. R. (2006). Superman or the fantastic four? Knowledge heterogeneity in engineering design team: Exploring the role of knowledge combination and experience in innovative teams. Academy of management reuse. International Journal of Project Management, 34(7), 1138-1149. journal, 49(4), 723-740. 140