=Paper= {{Paper |id=Vol-3745/paper21 |storemode=property |title=Relationship between Team Diversity and Innovation Performance in Interdisciplinary Research Teams within the Field of Artificial Intelligence: Decision Tree Analysis |pdfUrl=https://ceur-ws.org/Vol-3745/paper21.pdf |volume=Vol-3745 |authors=Junwan Liu,Chenchen Huang,Shuo Xu |dblpUrl=https://dblp.org/rec/conf/eeke/LiuHX24 }} ==Relationship between Team Diversity and Innovation Performance in Interdisciplinary Research Teams within the Field of Artificial Intelligence: Decision Tree Analysis== https://ceur-ws.org/Vol-3745/paper21.pdf

Relationship between Team Diversity and Innovation Performance in
Interdisciplinary Research Teams within the Field of Artificial
Intelligence: Decision Tree Analysis
Junwan Liu1,*, Chenchen Huang1, and Shuo Xu1

1 College of Economics and Management, Beijing University of Technology, Beijing, 100124, China

Abstract
Interdisciplinary research teams are crucial in solving complex problems by providing creative solutions that single-
discipline teams cannot achieve. Previous studies have primarily focused on the linear relationship between independent
variables and team innovation performance, neglecting the non-linear aspect. To address this gap, this paper examines the
non-linear relationship between diverse factors and the innovation performance of interdisciplinary research teams in
artificial intelligence. By utilizing the Classification and Regression Tree (CART) model, the study reveals that activity
diversity and interdisciplinary research team innovation performance exhibit a U-shaped relationship in terms of “novelty”
innovation performance. Furthermore, this relationship is influenced by research interest diversity. Specifically, low
research interest diversity leads to low innovation performance as activity diversity increases. Meanwhile, research interest
diversity emerges as the most critical factor impacting innovation performance. The importance of member diversity,
institutional diversity, and activity diversity on innovation performance should not be ignored. Through decision tree
analysis, this paper extends research on the multifactor combination, complex nonlinear relationships, and multipath
influence mechanism of team diversity on interdisciplinary research teams’ innovation performance.

Keywords
Interdisciplinary research team, Team diversity, Innovation performance, Classification and regression tree (CART) model

diversity and task-related diversity[6]. Demographic diversity
1. Introduction* encompasses variations in team members’ demographic attributes,
Growing globalization and intense market competition have such as age, gender, and institutional backgrounds[7]. Task-
transformed the scientific model and increased the number of related diversity pertains to the diverse qualities that team
specialized research teams. In order to enhance the efficiency and members bring to their academic or professional pursuits,
quality of scientific research, more research teams are including workplace functions, knowledge, and education[8].
transitioning from single teams to diversified teams[1]. Creating successful teams with demographic and task-related
Interdisciplinary research has become a necessary choice in this diversity is not a straightforward process of simply combining
context[2], allowing teams to draw upon a wide range of individuals from different disciplines. Horwitz et al.[6] discovered
disciplines and expertise to address complex scientific questions. that while demographic diversity did not significantly impact team
The diversity of team members from various disciplines and performance, task-related diversity positively influenced it.
backgrounds contributes to greater knowledge and innovative Diverse teams struggle with issues such as gender differences,
results[3]. Thus, effective interdisciplinary collaboration is crucial team conflict, and collaboration[9]. Given these contradictory
for achieving scientific and innovative breakthroughs[4]. findings, our focus is on investigating the impact of both
Previous research highlights the significance of diversity as a demographic diversity and task-related diversity on the innovation
crucial factor influencing the success of interdisciplinary research performance of interdisciplinary research teams, while analyzing
teams[5]. Diversity can be broadly categorized as demographic the varying importance of different diversity factors. Existing
studies have primarily focused on exploring the linear relationship
between independent variables and team innovation performance,
Joint Workshop of the 5th Extraction and Evaluation of Knowledge overlooking the nonlinear aspect. The nonlinear relationship
Entities from Scientific Documents and the 4th AI + Informetrics (EEKE- between these characteristics and team innovation performance,
AII2024), April 23~24, 2024, Changchun, China and Online
* Corresponding author. especially in the context of demographic diversity and task-related
EMAIL: liujunwan@bjut.edu.cn (Junwan Liu) diversity in interdisciplinary research teams, remains unclear.
© Copyright 2024 for this paper by its authors. Use permitted under Creative This paper aims to investigate the impact and decision-making
Commons License Attribution 4.0 International (CC BY 4.0).
mechanisms of team diversity on the innovation performance of
interdisciplinary research teams. Firstly, team innovation

CEUR
ceur-ws.org
Workshop ISSN 1613-0073
Proceedings

134
performance is divided into novelty and impact[10]. Second, we This paper focuses on empirical research in the field of
will investigate the influence of team diversity on the artificial intelligence (AI). The dataset is derived from the
interdisciplinary research team’s innovation performance in terms information of the most influential scholar award winners on the
of both the demographic diversity and task-related diversity of AMiner website (https://www.aminer.cn/ai2000) 2023 AI 2000
team members. From a social categorization perspective, we annual list. There are three reasons for selecting these scholars as
assume that gender diversity, national diversity, and institutional the subjects of the study. Firstly, AI research is inherently
diversity are included in the demographic diversity in this context. interdisciplinary[11]. Second, since its launch in 2006, the
Meanwhile, relying on the informational decision-making AMiner platform has already been used by many researchers[12].
perspective, we hypothesize that task-related diversity includes Third, since 2017, the AMiner platform has been publishing the
sociability diversity, activity diversity, research interest diversity, annual AI 2000 most influential scholar list. The purpose of this
and member diversity. Specifically, we address the following list is to annually rank the 2000 scholars who are expected to be
research questions in this paper: RQ1: What is the complex highly cited in the field of AI over the next ten years (2020-2029).
relationship structure among demographic diversity, task-related
diversity, and the innovation performance of interdisciplinary 2.2 Team Recognition
research teams? RQ2: What combinations of characteristics
Firstly, we obtain the dataset from the AMiner website, which
promote high levels of team innovation performance? RQ3: What
covers information on the most influential scholars in the 2023 AI
diversity characteristics should researchers focus on to enhance
2000 annual list. The dataset consists of 195 selected scholars and
the innovation performance of interdisciplinary research teams?
their collaborators, with five of these scholars receiving awards in
two or more subfields. The public information of the selected
2. Data and methods scholars is obtained from their personal websites and academic
This paper investigates the impact of team diversity on social networking platforms. The papers of selected scholars are
interdisciplinary research teams’ innovation performance. The downloaded from the Web of Science database. Finally, 25,285
basic process is shown in Figure 1. Firstly, raw data is processed papers are submitted by 195 selected scholars.
to form authors’ collaborative relationship data and measure each Second, we conduct community detection on the author co-
author’s collaborative tie strength. Second, stable collaborative occurrence network for each selected scholar using the Louvain
relationships are identified using a pre-set threshold (super tie). algorithm. This algorithm divides the nodes in the network into
Then, members of interdisciplinary research teams are identified, different communities based on modularity metrics, which assess
and the diversity index of each team is measured. Again, team the collaborative relationships between the nodes. The algorithm
innovation performance is divided into novelty and impact to be identifies strong connections within the same community and
measured. Finally, by using team diversity as the conditional sparser connections between different communities. In the
attribute and innovation performance as the decision attribute, the network, each co-author is represented as a node, and edges
impact of team diversity on the interdisciplinary research team’s represent collaborations between selected scholars and co-authors
innovation performance is explored using the CART model. who have published papers together. Figure 2 shows the author
Data Processing co-occurrence network for Silver, D selected scholars. Different
Paper publication Collaborative tie colors in the figure represent various communities determined by
data strength
Collaborative
Strong tie
Interdisciplinary Research modularity, with the community to which Silver and other
Author Networks Teams
collaborative Super tie
selected scholars belong shown in purple.
relationship
Team
Diversity

Innovation
gender institutional national sociability performance
diversity diversity diversity diversity
activity member
research interest diversity Novelty Impact
diversity diversity

CART Data Training

Decision tree
classification result x1 CART algorithm to obtain decision rules

Decision tree
classification result x2 What kind of diversity has an impact
on the interdisciplinary research
teams' innovation performance?
······ ······
Figure 2: Collaboration Network of Silver, D scholar
Figure 1: Research Framework
Next, we calculate the collaborative tie strength among the
nodes in each community to filter out the core collaborators of
2.1 Data collection each selected scholar. Collaborative tie strength, also known as a
“super tie”, has been extensively studied in scientific collaborative

135
networks[13]. It represents a long-term and stable collaboration, 35
similar to life partners, characterized by high intensity, close ties, 30
and long durations[14]. To identify the core collaborators among 25

Team Number
the 195 selected scholars, we calculate the super tie for each 20
community. Specifically, when a member’s collaborative tie
15
strength exceeds his or her community’s super tie threshold, that
10
member is referred to as a super tie collaborator and core team
member. Equations 1 and 2 demonstrate the formula for 5
calculating the super tie[14]. Additionally, we employ the 0
2 3 4 5 6 7 8 9 10 11 12 13 14 17 18 22 23 28 33 39
Anderson-Darling test to examine the distribution of collaborative
Team Size
intensity 𝐾𝑖𝑗 /𝐾𝑖 among the members. Our analysis indicates that
the statistical distribution 𝑃(𝐾𝑖𝑗 ) of all members’ collaborative Figure 3: Size Distribution of Interdisciplinary Research Teams
intensity conforms to an exponential distribution, with the average Table 1
collaborative strength of members being 2.83. Distribution of Members from Different Disciplinary Fields
〈𝐾𝑖 〉 = 𝑆𝑖−1 ∑𝑆𝑗=1
𝑖
𝐾𝑖𝑗 (1) Discipline Percentage of members
𝐾𝑖𝑐 = (〈𝐾𝑖 〉 − 1) ln 𝑆𝑖 (2) Computer and information science 71.18%
Where the collaborative tie strength 𝐾𝑖𝑗 is defined as the Electrical engineering, electronic
21.80%
engineering, information engineering
cumulative number of papers co-authored by the selected scholar Environmental engineering 2.03%
in community 𝑖 and scholar 𝑗 over the time between their first and Nano-technology 1.52%
last paper. 𝑆𝑖 represents the number of different co-authors of Physical science 1.09%
selected scholars in the community 𝑖. 〈𝐾𝑖 〉 represents the average Health science 0.94%
collaborative tie strength 𝐾𝑖𝑗 . Each scholar 𝑗 with 𝐾𝑖𝑗 > 𝐾𝑖𝑐 is Clinical medicine 0.51%
Mathematics 0.36%
labelled as a super tie collaborator of community 𝑖.
Materials engineering 0.22%
Eventually, we identify 195 research teams and their 1,217 Medical engineering 0.22%
core members. Figure 3 shows the distribution of team sizes for Basic medicine 0.14%
195 teams. The largest team size is 57 members, and 165 teams
are smaller than 10 members, which represents 85% of all teams. 2.3 Variables
Previous research defines interdisciplinary teams as groups of
scientists from different disciplines who collaborate to address 2.3.1 Dependent variables
complex problems[15]. To verify the interdisciplinarity of these We calculate the degree of team novelty using the novelty index
teams, we utilize a method that maps member affiliations to proposed by Lee et al.[10]. This index measures the novelty of a
disciplinary classifications[16] to more accurately determine the team’s paper based on the rarity of prior citation pairs. The
disciplinary backgrounds of the members. Specifically, we extract
calculation involves two steps.
secondary institutions from each member’s address, retain the
Complete the first step of the operation on the paper level. (1)
disciplinary terms in the secondary institution names, and match
List all paired reference combinations for each paper. (2) Record
these terms to the discipline field in the OECD classification the corresponding journal pairs. (3) Aggregate the pairs of journal
scheme. In this way, each member’s institution can be precisely
combinations published from year t-2 to year t as 𝑈𝑡 set. The time
matched to his or her research discipline. The results indicate that
window from year t-2 to year t is chosen to ensure data robustness.
165 teams have members from two different disciplinary
We then calculate the commonness value using Equation 3[10]. (4)
backgrounds, 27 teams have members from three different
This equation assigns each paper a range of commonness values.
disciplinary backgrounds, and 3 teams have members from four
The commonness values of each paper are ranked, and the 10th
different disciplinary backgrounds, thus reinforcing that the 195
percentile is taken as the commonness value of the paper. Using
teams in this study are interdisciplinary research teams. Table 1
the 10th percentile instead of the minimum value helps reduce
demonstrates the distribution of members from different
noise and increase the reliability of the measure. (5) The
disciplinary fields. Specifically, 71.18% of the members are from
commonness value is transformed using a natural logarithm to
the field of computer and information science, 21.80% are from
obtain an approximately normally distributed variable. The final
the fields of electrical engineering, electronic engineering, and
novelty value for that paper is obtained by adding a negative sign.
information engineering, while other fields encompass 𝑜𝑏𝑠𝑒𝑟𝑣𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑝𝑎𝑖𝑟𝑠𝑖𝑗𝑡 𝑁𝑖𝑗𝑡
environmental engineering, nanotechnology, and physical Commonness𝑖𝑗𝑡 = = 𝑁𝑖𝑡 𝑁𝑗𝑡 =
𝑒𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑝𝑎𝑖𝑟𝑠𝑖𝑗𝑡 × ×𝑁𝑡
𝑁𝑡 𝑁𝑡
sciences, among others. To explore the factors influencing the 𝑁𝑖𝑗𝑡 ×𝑁𝑡
interdisciplinary research teams’ innovation performance, we (3)
𝑁𝑖𝑡 ×𝑁𝑗𝑡
download each team member’s papers from the Web of Science Where 𝑁𝑖𝑗𝑡 is the number of occurrences of journal pairs (i, j)
database and collect 91,025 papers from all teams. in 𝑈𝑡 set. 𝑁𝑖𝑡 is the number of journal pairs in 𝑈𝑡 set that contain

136
journal 𝑖. 𝑁𝑗𝑡 is the number of journal pairs in 𝑈𝑡 set that contain To assess the diversity and differences in a team’s overall
journal 𝑗, and 𝑁𝑡 is the number of all journal pairs in 𝑈𝑡 set. sociability and activity, we utilize Equation 4 to calculate the
Calculating novelty at the team level is the next stage. The diversity of these two evaluation indicators.
number of paper publications by each team is counted and divided The research interest diversity of scholar is based on the
by the team size to calculate the team’s novelty value. breadth of the field of interest. Using the topic model, we identify
We measure a team’s impact using forward citations[10]. each scholar’s field of study and assign their papers to relevant
High-impact papers are defined as those in the top 1% of citation topics. The 𝑃𝐴 (t) topic distribution is obtained by Equation 7, and
distribution. This definition follows Uzzi et al.[17] and considers the research interest diversity is defined as the threshold of the
the use of short citation time windows can lead to the incorrect distribution of the 𝑃𝐴 (𝑡), which is calculated by Equation 8.
#𝑝𝑎𝑝𝑒𝑟𝑠 𝑜𝑓 𝐴 𝑏𝑒𝑙𝑜𝑛𝑔 𝑡𝑜 𝑡𝑜𝑝𝑖𝑐 𝑡
identification of highly cited papers[18]. First, the process of 𝑃𝐴 (t) = (7)
#𝑎𝑙𝑙 𝑝𝑎𝑝𝑒𝑟 𝑜𝑓 𝐴
identifying high-impact papers is completed at the paper level. (1)
Rank all papers from highest to lowest citation count. (2) By using research interest diversity(A) = − ∑ 𝑃𝐴 (𝑡)𝑙𝑜𝑔𝑃𝐴 (𝑡)
a five-year moving window, we define papers in the top 1% of the 𝑡∈𝑎𝑙𝑙 𝑡𝑜𝑝𝑖𝑐 𝑜𝑓 𝐴

rankings from year t-5 to year t as high-impact papers. (3) We use (8)
a dummy variable to indicate whether each paper is a high-impact Member diversity refers to the extent of diversity in
publication, assigning a value of 1 if it is and 0 otherwise. Next, collaborative relationships among team members. A low diversity
the number of high-impact papers per team is counted and divided indicates frequent collaboration with the same co-authors, while a
by the team size to finally obtain the team’s impact value. high diversity reflects collaboration with diverse co-authors, as
calculated by Equation 9[23].
𝑟𝑒𝑙𝑎𝑡𝑖𝑜𝑛𝑖
2.3.2 Independent variables member diversity = (9)
𝑐𝑜𝑎𝑢𝑡ℎ𝑜𝑟𝑖 (𝑛𝑜𝑛−𝑑𝑢𝑝𝑙𝑖𝑐𝑎𝑡𝑒)
Gender diversity refers to the subjective or objective similarities Where 𝑟𝑒𝑙𝑎𝑡𝑖𝑜𝑛𝑖 refers to the number of collaborations in
and differences between team members in terms of gender[19]. published journal articles, and 𝑐𝑜𝑎𝑢𝑡ℎ𝑜𝑟𝑖 (𝑛𝑜𝑛 − 𝑑𝑢𝑝𝑙𝑖𝑐𝑎𝑡𝑒)
National diversity is defined as having team members from represents the total number of non-replicating co-authors. Table 2
different national backgrounds, which introduces sociological is the examples of co-authors’ member diversity of member A.
categorization and the potential for diverse cognitive Table 2
perspectives[20]. Institutional diversity refers to the presence of a Examples of Co-authors’ Member Diversity of Member A
variety of members in different institutions[21]. All of the above Article 1 Article 2
demographic diversity indicators mentioned above are measured Author A, B, C, D A, C, D
using the Simpson index, which is calculated using Equation 4. Relation A-B, A-C, A-D A-C, A-D
𝑛
𝐻 = 1 − ∑𝑖=1 𝑃𝑖2 (4) Coauthor(non-duplicate) A, B, C, D
Member diversity 5/4=1.25
Where 𝑛 is the total number of categories, 𝑃𝑖 is the percentage
of members of the group 𝑖. The higher the 𝐻 value, the greater is
the value of the diversity. 2.4 Data Characteristics and Analysis
Using the AMiner platform, we algorithmically obtain data on Table 3 provides the results of the exploratory analysis, including
sociability, activity, and research interest diversity indices to averages, mediums, minimum, and maximum values for
assess the academic proficiency of members. Definitions and calculated indicators. Figure 4 and Figure 5 provide correlation
formulas for these indicators are provided[22]. plots between diversity indicators and interdisciplinary research
The sociability index is derived from considering both the teams’ innovation performance. There is no strong correlation
number of scholars’ collaborators and their collaborative papers, between population diversity, task-related diversity and
as shown in Equation 5. innovation performance.
𝑠𝑜𝑐𝑖𝑎𝑏𝑖𝑙𝑖𝑡𝑦(𝐴) = 1 + ∑𝑒𝑎𝑐ℎ 𝑐𝑜𝑎𝑢𝑡ℎ𝑜𝑟(𝑐) 𝑙 𝑛(#𝑐𝑜𝑝𝑎𝑝𝑒𝑟𝑐 ) (5)
Where the #𝑐𝑜𝑝𝑎𝑝𝑒𝑟𝑐 is the number of papers co-authored
between scholar and co-authors.
The activity index measures a scholar’s frequency and number
of recent publications, along with the significance of each paper,
as shown in Equation 6.
activity(A) = ∑𝑒𝑎𝑐ℎ 𝑦𝑒𝑎𝑟(𝑛)𝑖𝑛 𝑟𝑒𝑐𝑒𝑛𝑡 𝑁 𝑦𝑒𝑎𝑟𝑠 IS(𝐺𝑛 ) × weight(n)
(6)
Where, in 𝑛 years (𝑛 belongs to near 𝑁 years), 𝐺𝑛 is a group of
papers published by scholars in n years, weight(n) = 𝛼 this year−n ,
and the following principles are applied to the values of 𝑛 and 𝛼: Figure 4: Correlation between Diversity Indicators and Team’s
if the current month is in the first half of the year (month -0.40 > -0.31 - Low 29.00% 61.00%
handling complex relationships. These models rely on the least
Novelty - - - - > -0.40 <= -0.31 - High 12.00% 79.00%
squares method, which requires rigorous hypothesis testing and
- - - - <= -0.40 - - High 14.00% 66.00%
variable control[26]. In the context of understanding how
- <= -0.72 - - - <= 0.47 <= -0.08 Low 9.00% 78.00%
diversity impacts the teams’ innovation performance, it is crucial
- > -0.72 - - - <= 0.47 <= -0.08 High 31.00% 60.00%
to consider multiple independent variables and potential
interactions among them. Incorrect selection of control variables - - - - - > 0.47 <= -0.08 Low 7.00% 86.00%
Impact
or omission of important variables may lead to biased results in - - - - - <= -0.29 > -0.08 High 8.00% 93.00%
regression analysis. In contrast, the CART model offers a more - - - - - (-0.29,0.08] > -0.08 Low 5.00% 70.00%
flexible and robust solution[27]. It constructs a decision tree using - - - - - (0.08,0.47] > -0.08 High 19.00% 74.00%
recursive binary splitting, dividing data subsets into smaller Firstly, in the “novelty”, interdisciplinary research teams have
subsets[24]. This nonparametric approach eliminates the need for a higher proportion of high innovation performance ratings. In the

138
“impact”, there is a higher proportion of interdisciplinary teams to prioritize the diversity of institutions represented within the
with a low innovation performance rating than in the “novelty” teams.
innovation performance. Secondly, activity diversity and member
diversity split as root nodes, which are key factors affecting the
novelty and impact of a team, respectively. Third, the confidence
coefficients for most of the decision rules are above 60%,
indicating that the weight of the sample size supporting the ≤ >
current decision rule in the leaf node’s sample size is 60% or more.
This suggests that the results are highly interpretable.
Figure 6 shows that there are a total of three rules to determine
whether a team has high or low innovation performance. The
CART model divides data into approximately two branches based
on whether the most important feature “activity diversity” is less
than or equal to -0.40. The results show that with lower activity
diversity, team members can focus more on their original thinking,
enhancing team innovation performance without the need to be
concerned about publication frequency and quantity. When
activity diversity is higher, an increase in research interest
diversity contributes to teams achieving high levels of innovation
Figure 7: Decision Tree for Team’s Impact
performance. Horizontal comparison reveals that activity diversity
is a crucial factor influencing innovation performance.
Interdisciplinary research teams with low activity diversity are not
3.2 Feature Importance Analysis
influenced by research interest diversity, whereas interdisciplinary The model’s final characteristic importance for explanatory
research teams with high activity diversity are impacted by variables is presented in Figure 8. Among the factors affecting a
research interest diversity. team’s novelty, research interest diversity has the highest
characteristic importance of 0.97, while activity diversity has a
characteristic importance of 0.03. Among the factors affecting a
team’s impact, research interest diversity has the highest
characteristic importance of 0.48. Member diversity and
institutional diversity closely followed with 0.31 and 0.22,
respectively. This result shows that research interest diversity is
≤ >
most strongly associated with interdisciplinary research teams'
innovation performance. This shows that team members' diverse
expertise backgrounds enable knowledge integration and
reconfiguration, which are crucial elements in innovation
performance[21].

0.03 Novelty
AD 0 Impact
MD 0
0.31
SD 0
0
0.97
Figure 6: Decision Tree for Team’s Novelty RID 0.48
Figure 7 shows six rules to determine whether a team has high ND 0
0
or low innovation performance. The CART model divides the data 0
ID 0.22
into two branches based on whether the most important feature 0
“member diversity” is less than or equal to -0.08. The results GD 0
reveal that interdisciplinary research teams with member diversity
more than -0.08 need to control research interest diversity to 0.0 0.2 0.4 0.6 0.8 1.0
achieve high innovation performance. For interdisciplinary Figure 8: Characteristic Importance of Explanatory Variables
research teams with member diversity smaller than -0.08,
institutional diversity is a key influence on innovation 4. Conclusion
performance. Horizontal comparison shows that member diversity
is a key factor influencing innovation performance. This paper finds a U-shaped relationship between activity
Interdisciplinary research teams with low member diversity tend diversity and team innovation performance in “novelty”
innovation performance. However, this relationship is impacted
by research interests diversity. Specifically, interdisciplinary

139
teams with low activity diversity are able to improve their [4] Salazar, M., Lant, T. (2018). Facilitating innovation in interdisciplinary teams:
The role of leaders and integrative communication. Informing Science, 21,
innovation performance independently of research interest 157.
diversity. In contrast, low research interest diversity leads to low [5] Benoliel, P., Somech, A. (2015). The role of leader boundary activities in
enhancing interdisciplinary team effectiveness. Small Group Research, 46(1),
innovation performance when activity diversity increases. In 83-124.
terms of “impact” innovation performance, increasing member [6] Horwitz, S. K., Horwitz, I. B. (2007). The effects of team diversity on team
diversity and managing the range of research interests can be outcomes: A meta-analytic review of team demography. Journal of
management, 33(6), 987-1015.
beneficial. In addition, interdisciplinary research teams with low [7] Lee, D. S., Lee, K. C., Seo, Y. W. (2015). An analysis of shared leadership,
member diversity need to focus on the institutional diversity of diversity, and team creativity in an e-learning environment. Computers in
Human Behavior, 42, 47-56.
team members, as institutional diversity has a positive impact on [8] Jackson, S. E., Joshi, A., Erhardt, N. L. (2003). Recent research on team and
the team’s effectiveness. organizational diversity: SWOT analysis and implications. Journal of
In the evaluation of various factors, research interest diversity Management, 29(6), 801-830.
[9] Abramo, G., D’Angelo, C. A., Di Costa, F. (2018). The effects of gender, age
emerges as the most significant determinant of innovation and academic rank on research diversification. Scientometrics, 114(2), 373-
performance in interdisciplinary research teams. This implies a 387.
[10] Lee, Y. N., Walsh, J. P., Wang, J. (2015). Creativity in scientific teams:
close association between research interest diversity and the Unpacking novelty and impact. Research policy, 44(3), 684-697.
team’s ability to innovate. Researchers with distinct research [11] Zhuang, Y., Cai, M., Li, X., Luo, X., Yang, Q., Wu, F. (2020). The next
breakthroughs of artificial intelligence: The interdisciplinary nature of AI.
themes bring a diverse range of knowledge and contribute to the Engineering, 6(3), 245.
reconfiguration of knowledge by identifying and integrating [12] Wu, J., Ou, G., Liu, X., Dong, K. (2022). How does academic education
insights from different fields[30]. background affect top researchers’ performance? Evidence from the field of
artificial intelligence. Journal of Informetrics, 16(2), 101292.
Managers should consider research team diversity when [13] Li, Y., Li, N., Guo, J., Li, J., Harris, T. B. (2018). A network view of advice-
developing it. To create a healthy innovation environment, they giving and individual creativity in teams: A brokerage-driven, socially
perpetuated phenomenon. Academy of Management Journal, 61(6), 2210-
should pay attention to the heterogeneity of different 2229.
organizational and disciplinary backgrounds to which team [14] Petersen, A. M. (2015). Quantifying the impact of weak, strong, and super
members belong and strive to optimize the level of knowledge ties in scientific careers. Proceedings of the National Academy of Sciences,
112(34), E4671-E4680.
diversity in the team. Furthermore, managers should encourage [15] Wagner, C. S., Roessner, J. D., Bobb, K., Klein, J. T., Boyack, K. W., Keyton,
and promote activity diversity. Meanwhile, the costs of too much J., Rafols, I., Börner, K. (2011). Approaches to understanding and measuring
interdisciplinary scientific research (IDR): A review of the literature. Journal
research interest diversity need to be noted to avoid the of informetrics, 5(1), 14-26.
phenomenon of “too much of a good thing”. This is because [16] Liu, X., Bu, Y., Li, M., Li, J. (2024). Monodisciplinary collaboration disrupts
knowledge diversity among collaborative members in different science more than multidisciplinary collaboration. Journal of the Association
for Information Science and Technology, 75(1), 59-78.
research fields is often considered a double-edged sword. As [17] Uzzi, B., Mukherjee, S., Stringer, M., Jones, B. (2013). Atypical
Wang et al have found, increasing knowledge diversity leads to a combinations and scientific impact. Science, 342(6157), 468-472.
[18] Wang, J. (2014). Unpacking the Matthew effect in citations. Journal of
decline in social influence after a certain peak[21]. Informetrics, 8(2), 329-339.
Our study has limitations. Firstly, it focuses solely on the [19] Van Knippenberg, D., Schippers, M. C. (2007). Work group diversity. Annu.
Rev. Psychol., 58, 515-541.
AMiner platform in AI, limiting its scope and generalizability. [20] Ayub, N., Jehn, K. A. (2006). National diversity and conflict in multinational
Future studies should broaden the research fields. Secondly, the workgroups: The moderating effect of nationalism. International Journal of
sample of 195 interdisciplinary teams may not fully reflect AI Conflict Management, 17(3), 181-202.
[21] Wang, G., Gan, Y., Yang, H. (2022). The inverted U-shaped relationship
team diversity and complexity, potentially suffering from between knowledge diversity of researchers and societal impact. Scientific
sampling error. A more representative sample is needed. Thirdly, Reports, 12(1), 18585.
[22] Zhuo, L. I. N., Haohai, H. U. A. N. G. (2022). Top Experts Identification and
while we focused on diversity within teams, future research could Evaluation of International Cooperation on Artificial Intelligence in China.
explore diversity in other research team activities. Lastly, our Journal of Library and Information Science in Agriculture, 2022,34(1): 86-95.
team recognition method overlooks member turnover dynamics. [23] Liao, C. H. (2011). How to improve research quality? Examining the impacts
of collaboration intensity and member diversity in collaboration networks.
Future studies should introduce a dynamic analysis of member Scientometrics, 86(3), 747-761.
flow for more accurate core member identification. [24] Breiman, L. (1984). Classification and regression trees (1st ed.). Routledge.
[25] Aboseif, E., Hanna, A. S. (2023). Defining the Success Status of Construction
Projects Based on Quantitative Performance Metrics Thresholds. Journal of
ACKNOWLEDGMENTS Management in Engineering, 39(2), 04022073.
[26] Shmueli, G. (2010). To explain or to predict? Statistical Science, 25(3), 289-
This work was supported partially by the National Natural 310.
[27] Klusowski, J. M., Tian, P. M. (2024). Large scale prediction with decision
Science Foundation of China (Grant No. 72174016). Our gratitude trees. Journal of the American Statistical Association, 119(545), 525-537.
also goes to the anonymous reviewers and the editor for their [28] Pahmi, S., Saepudin, S., Maesarah, N., Solehudin, U. I. (2018).
valuable comments. Implementation of CART (classification and regression trees) algorithm for
determining factors affecting employee performance. In 2018 International
Conference on Computing, Engineering, and Design (ICCED) (pp. 57-62).
REFERENCES IEEE.
[29] Chen, W., Xie, X., Wang, J., Pradhan, B., Hong, H., Bui, D. T., Duan, Z., Ma,
[1] Dong, Y., Ma, H., Tang, J., Wang, K. (2018). Collaboration diversity and J. (2017). A comparative study of logistic model tree, random forest, and
scientific impact. arXiv preprint arXiv,1806.03694. classification and regression tree models for spatial prediction of landslide
[2] Van Noorden, R. (2015). Interdisciplinary research by the numbers. Nature, susceptibility. Catena, 151, 147-160.
525(7569), 306-307. [30] Zhang, L., Li, X. (2016). How to reduce the negative impacts of knowledge
[3] Taylor, A., Greve, H. R. (2006). Superman or the fantastic four? Knowledge heterogeneity in engineering design team: Exploring the role of knowledge
combination and experience in innovative teams. Academy of management reuse. International Journal of Project Management, 34(7), 1138-1149.
journal, 49(4), 723-740.

140