<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>O. Pavlova);</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Analysis of key parameters for choosing a kind of sport based on human morphofunctional indicators using statistical and machine learning methods⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Olga Pavlova</string-name>
          <email>pavlovao@khmnu.edu.ua</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vitalii Alekseiko</string-name>
          <email>vitalii.alekseiko@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Valeriia Shvaiko</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nikita Vusatyi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Houda El</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bouhissi</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roman Gakh</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Khmelnytskyi National University</institution>
          ,
          <addr-line>Institutska str., 11, Khmelnytskyi, 29016</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>LIMED Laboratory, Faculty of Exact Sciences,University of Bejaia</institution>
          ,
          <addr-line>06000, Bejaia</addr-line>
          ,
          <country country="DZ">Algeria</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>West Ukrainian National University</institution>
          ,
          <addr-line>11 Lvivska st., Ternopil, 46001</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0003</lpage>
      <abstract>
        <p>The work analyzes the influence of human morphofunctional indicators on the predisposition to a certain kind of sport. The existing decision support systems in the sports sphere are considered. The role of artificial intelligence for parameter analytics is determined. Statistical analysis of the data was performed. The statistical significance of each parameter was established using the t-test. The correlation value between sports, as well as between morphofunctional indicators using the Pearson and Spearman methodologies, is calculated for a more comprehensive understanding of the relationships. The importance of morphofunctional indicators for choosing a kind of sport is investigated using machine learning technologies, namely the random forest method is chosen.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Kind of sport</kwd>
        <kwd>correlation</kwd>
        <kwd>data analysis</kwd>
        <kwd>artificial intelligence</kwd>
        <kwd>morphofunctional indicators 1</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>In today’s virtualized world, more and more people are striving to find the optimal sport for
themselves, which will not only contribute to physical development, but also correspond to
their individual characteristics and capabilities. However, the human factor often causes false
decisions: people choose activities that do not meet their physiological characteristics or do not
take into account potential risks, which can lead to injuries or loss of motivation. As a result,
they quickly stop training without achieving the desired results.</p>
      <p>There are many approaches to choosing a sport [1, 2] according to a person’s preferences,
but most studies are focused on the general choice of group or individual sports, as well as the
availability of training in a particular region. However, in addition to these undoubtedly
important parameters, it is also advisable to take into account the morphofunctional
characteristics of a person.</p>
      <p>The aim of the work is to identify the correlation between morphofunctional indicators and
sports, as well as to establish the importance of each indicator.</p>
      <p>At this stage, artificial intelligence (AI) can play a significant role, which is a powerful tool
for selecting a sport basedon human morphofunctional indicators. Using large amounts of data,
the system can analyze the physical characteristics of professional athletes in various sports
and forms the necessary parameters and standards on the basis of this. This allows to make an
objective and scientifically based choice of optimal activity for a particular person.</p>
      <p>In addition, AI is able to analyze individual user data such as height, weight, body
composition, endurance level, etc., and offer the most suitable sport. The system based on AI
can also assess potential risks, anticipate possible health problems, and make recommendations
about exertion, training intensity, and safety techniques. This approach not only increases the
effectiveness of sports, but also contributes to the long-term involvement of people in an active
lifestyle, helping them to avoid overloads and injuries.</p>
      <p>Thus, the introduction of AI in the field of sports opens up new opportunities for a
personalized approach to the choice of physical activities. This allows everyone to find the best
option for maintaining health, developing physical abilities and achieving sports goals.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Literature Review</title>
      <p>In our previous works [ 3, 4] the indicators-based decision support system and method for
choosing kind of sport were proposed. Also, the indicators and values of their impact on
different kinds of sport were determined. Moreover, the analysis of the relevant studies
describing using AI for decision support in choosing kind of sport was conducted.</p>
      <p>The article [5] demonstrates the potential of AI to improve sports performance analysis and
decision-making during training and competition. The study [6] provides a practical perspective
on the application of AI in real-world health tracking systems that can be integrated into sports
applications. The research [7] describes the integration of AI to facilitate more accurate analysis
and prediction of sports training results using mobile sensors and deep neural networks. The
article [8] discusses current trends and practical solutions in sports technologies based on the
use of artificial intelligence. The research [9] analyzes which technological innovations
influence the development of sports applications and how AI contributes to their improvement.
The article [10] demonstrates the ability to detect and classify different types of sports activities
from live video streams using convolutional neural networks. The study [11] describes the
development of a mobile application for personalized coaching of runners that uses AI to
analyze data from sensors and biometric indicators to optimize the training process. The article
[12] identifies AI's impact on China's sports industry. The study [13] is devoted to the legal
aspects of implementing AI systems in the sports sector under the European Union legislation.</p>
      <p>The research [14] analyzes morphofunctional indicators of an elite Chilean mountain
runner. It was provided laboratory experiments to improve training strategy. The article [15]
reveals medical point of view. This study focuses on analyzing the effects of isometric and
isotonic exercise training on the morpho-functional parameters of the right ventricle in
Olympic athletes. Also it was analyzed machine learning approaches [16, 17, 18] and its usage
t =</p>
      <p>X1 − X2</p>
      <p>,
s2 s2
1 + n22
n1
Where:
X1, X2 – sample means of groups 1 and 2;
s12, s22 – sample variances of groups 1 and 2;
n1, n2 – sample sizes of groups 1 and 2.</p>
      <p>The larger the values, the greater the difference between the two groups.</p>
      <p>Pearson and Spearman methodologies were used to determine the correlation between
parameters. Comparing the two approaches allows for a more comprehensive assessment due
to the possibility of analyzing both linear and monotonic dependencies.</p>
      <p>Pearson's correlation coefficient determines the linear relationship between two variables:
for health monitoring [19]. However, the considered studies do not solve the problem of
morphofunctional indicators’ analysis and choosing a kind of sport on basis of them.</p>
      <p>On the basis of provided study it was formulated hypothesis accordingly to the topic of the
research: There is a significant difference between the indicators’ means of two kind of sports.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>The study used expert data from specialists in the field of physical education and sports. A
detailed description of the data used is given in previous works [3, 4].</p>
      <p>
        The t-test is used to test a statistical hypothesis, which is used to compare the means of two
groups and determine whether the difference is statistically significant. The formula 1 describes
if the difference between the means of two groups is significant [20].
(
        <xref ref-type="bibr" rid="ref1">1</xref>
        )
(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )
(
        <xref ref-type="bibr" rid="ref3">3</xref>
        )
r =
      </p>
      <p>∑(Xi − X)(Yi − Y)
(Xi − X)2 ∙ (Yi − Y)2
,
Where:
Xi, Yi – individual data points;
X, Y – the mean values of X and Y;</p>
      <p>
        In formula (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) the numerator represents the covariance between X and Y, and the
denominator normalizes the values by their standard deviations. The r value 1 is represent
perfect positive correlation, 0 – no correlation and –1 – perfect negative correlation.
      </p>
      <p>The Spearman correlation measures the monotonic relationship between two variables by
ranking the data before calculation:
ρ = 1 −
Where:
di – the difference between the ranks of corresponding values Xi and Yi;
n – the number of data points.</p>
      <p>Similar to Pearson correlation, the ρ value 1 is represent perfect increasing monotonic
relationship, 0 – no monotonic relationship and –1 – perfect decreasing monotonic relationship.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Results and Discussions</title>
      <p>All morphofunctional indicators used in the research are presented in Table 1 with description.
For better visualization on plots it is used numbering. Also Table 1 shows T -test results. All
indicators have extremely small p-values (&lt; 0.05), meaning they are statistically significant.</p>
      <p>The smallest p-value is observed for indicator 14. This indicates an extremely strong
difference between the groups. Indicators 7 and 12 also have very small p-values, indicating
clear group differences. Indicator 14 has the highest t-statistic (80.00). Indicators 7, 10, and 12
also have high t-statistics. Indicators 4 and 2 have the lowest t -statistics, but still show
significant differences. Thus, all indicators show statistically significant differences between the
two groups.</p>
      <p>Relationships between kind of sports on the basis of important for them morphofunctional
human indicators are shown in Figure 1 as a adjency matrix heatmapT.o check for the presence
of relationships, a binary classification was used, where 0 means the absence of such indicators,
and 1 indicates their presence.</p>
      <p>Figure 3 shows a histogram of indicators. X-axis represents the range of values for a
particular indicator and Y-axis represents the number of observations.</p>
      <p>The boxplot also shows outliers. It means values that are significantly different from the
majority of the data. It can be seen from the chart, that Indicators 6 and 9 have larger outliers
compared to the other indicators. Indicators with taller squares and longer whiskers (e.g.
indicators 5, 6, and 9) have higher variability. Indicators with shorter rectangles (e.g. indicator
10) have lower variability. It means the data is more consistent. If the median is closer to the
bottom of the rectangle, the data is positively skewed. If the median is closer to the top, the data
is negatively skewed. Thus, for some indicators, there is a skew. It means that the data is
asymmetrically distributed.</p>
      <p>The correlation matrix is presented in Figure 4. It displays the results of the Pearson
correlation. According to this approach, linear relationships between indicators are measured
in the range from –1 to 1, where 1 reflects a perfect positive correlation, 0 indicates no
correlation, and –1 indicates a perfect negative correlation.</p>
      <p>The Spearman correlation matrix is presented in Figure 5. It measures monotonic relationships,
which, however, do not necessarily have to be linear.</p>
      <p>The results of the study showed that some indicators have strong positive and negative
relationships. Thus, indicators 7 and 14 demonstrate a strong negative correlation (–0.84),
Indicator 3 and 4 demonstrate a strong positive correlation (0.76). Most correlations are weak
or moderate.</p>
      <p>Spearman correlation provides a slightly different view of the relationships. Stronger
correlations are observed between some indicators. The «Sport» column suggests which
indicators contribute more to the classification. Indicator 14 has a high Spearman correlation
with sport (0.87), which means that it can strongly influence the classification.</p>
      <p>Pearson correlation gives more accurate results under linear relationships. If the data has
strong outliers or nonlinear trends, Spearman correlation is more reliable. Spearman correlation
also helps to rank indicators that influence the choice of sports.</p>
      <p>It was provided Random Forest Classification to identify the most important indicators for
determining the most suitable kind of sport on the basis of morphofunctional indicators. Results
are presented in Figure 6. Thus indicators 6, 5, 9 and 11 havemore significant impact then other.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions</title>
      <p>As a result of the conducted research, it was possible to establish dependencies between sports,
based on morphofunctional indicators, which play a key role for them. Correlations between
morphofunctional indicators were also established, which contributes to a comprehensive
understanding of the criteria on the basis of which the choice of sport is made.</p>
      <p>The conducted data analysis allowed us to confirm the hypothesis that there is a difference
between the parameters for different sports.</p>
      <p>The analysis of the importance of indicators for making a decision about the predisposition
to a sport, performed using AI technologies, allows us to capture hidden patterns and more
accurately select sports.</p>
      <p>The further efforts of the authors will be focused on approaches that provides intellectual
analysis of human morphofunctional indicators and choosing a kind of sport based on them
using AI technologies.</p>
    </sec>
    <sec id="sec-6">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used Grammarly in order to: grammar and
spelling check; DeepL Translate in order to: some phrases translation into English. After using
these tools/services, the authors reviewed and edited the content as needed and take full
responsibility for the publication’s content.
[5] Akan, S., &amp; Varlı, S. (2022). Use of deep learning in soccer videos analysis: survey.</p>
      <p>
        Multimedia Systems, 29(
        <xref ref-type="bibr" rid="ref3">3</xref>
        ), 897–915. https://doi.org/10.1007/s00530-022-01027-0.
[6] psico-smart.com. (28 August, 2024). Integration of artificial intelligence in health and
wellness tracking systems.
https://psico-smart.com/en/blogs/blog-integration-of-artificialintelligence-in-health-and-wellness-tracking-systems-163046.
[7] Lu, L., Xi, Y., &amp; Zhang, X. (2023). Application of mobile sensors based on deep neural
networks in sports psychological detection and prevention. Preventive Medicine, 173,
107613. https://doi.org/10.1016/j.ypmed.2023.107613.
[8] Harper, G. (2025, February 7). AI in Sports: A Complete Guide. TopDevelopers.co.
      </p>
      <p>https://www.topdevelopers.co/blog/ai-in-sports/.
[9] Seçkin, A. Ç., Ateş, B., &amp; Seçkin, M. (2023). Review on Wearable technology in Sports:
Concepts, challenges and opportunities. Applied Sciences, 13(18), 10399.
https://doi.org/10.3390/app131810399.
[10] Peppas, K., Tsolakis, A. C., Krinidis, S., &amp; Tzovaras, D. (2020). Real-Time physical activity
recognition on smart mobile devices using convolutional neural networks. Applied
Sciences, 10(23), 8482. https://doi.org/10.3390/app10238482.
[11] Pashaie, S., Mohammadi, S., &amp; Golmohammadi, H. (2024). Unlocking athlete potential: The
evolution of coaching strategies through artificial intelligence. Proceedings of the
Institution of Mechanical Engineers Part P Journal of Sports Engineering and Technology.
https://doi.org/10.1177/17543371241300889.
[12] Lu, L., Yang, S., &amp; Li, Q. (2024). The interaction of digital economy, artificial intelligence
and sports industry development --based on China PVAR analysis of provincial panel data.</p>
      <p>
        Heliyon, 10(
        <xref ref-type="bibr" rid="ref4">4</xref>
        ), e25688. https://doi.org/10.1016/j.heliyon.2024.e25688.
[13] A. Orlando, "AI for Sport in the EU Legal Framework," 2022 IEEE International Workshop
on Sport, Technology and Research (STAR), Trento – Cavalese, Italy, 2022, pp. 100-105,
doi: 10.1109/STAR53492.2022.9860029.
[14] C. Nieto-Jiménez, E. Baez-San Martin, D. Rojas-Valverde, R. Yáñez-Sepúlveda, &amp; M. Tuesta
(2025). Morpho-functional characterization of an elite Chilean mountain runner: Insights
from a high-performance case study. Journal of Human Sport and Exercise , 20(
        <xref ref-type="bibr" rid="ref2">2</xref>
        ), 409-418.
https://doi.org/10.55860/7xs51s84
[15] G. Di Gioia, D. Polito, S. P. Crispino, V. Maestrini, A. Nenna, A. Segreti, M. R. Squeo, E.
      </p>
      <p>
        Lemme, &amp; A. Pelliccia (2023). Influence of isometric versus isotonic exercise training on
right ventricular morpho‐functional parameters in Olympic athletes. Echocardiography,
41(
        <xref ref-type="bibr" rid="ref1">1</xref>
        ). https://doi.org/10.1111/echo.15731
[16] M. Sirola, &amp; J. E. Hulsund (2021). Machine-Learning Methods in Prognosis of Ageing
Phenomena in Nuclear Power Plant Components. International Journal of Computing,
20(
        <xref ref-type="bibr" rid="ref1">1</xref>
        ), 11-21. https://doi.org/10.47839/ijc.20.1.2086
[17] Roman Lynnyk, Victoria Vysotska, Yurii Matseliukh, Yevhen Burov, Lyubomyr Demkiv,
Andrij Zaverbnyj, Anatoliy Sachenko, Inna Shylinska, Iryna Yevseyeva, Oksana Bihun.
DDOS Attacks Analysis Based on Machine Learning in Challenges of Global Changes.
CEUR Workshop Proceedings (CEUR-WS.org) MoMLeT+DS 2020 Modern Machine
Learning Technologies and Data Science Workshop 2020, pp. 159-171. ISSN 1613-0073.
[18] H. Lipyanina, V. Maksymovych, A. Sachenko, T. Lendyuk, A. Fomenko, I. Kit, (2020).
      </p>
      <p>
        Assessing the Investment Risk of Virtual IT Company Based on Machine Learning. In:
Babichev, S., Peleshko, D., Vynokurova, O. (eds) Data Stream Mining &amp; Processing. DSMP
2020. Communications in Computer and Information Science, vol 1158. Springer, Cham.
https://doi.org/10.1007/978-3-030-61656-4_11
[19] S. Shafi Bhat, V. Selvam, &amp; G. A. Ansari (2023). Predicting Life Style of Early Diabetes
Mellitus using Machine Learning Technique. International Journal of Computing, 22(
        <xref ref-type="bibr" rid="ref3">3</xref>
        ),
345-351. https://doi.org/10.47839/ijc.22.3.3230
[20] Hayes, A. (2024, October 4). T-Test: What it is with multiple formulas and when to use
them. Investopedia. https://www.investopedia.com/terms/t/t-test.asp.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>[1] Choosing the right sport for you (for teens)</article-title>
          .
          <source>(January</source>
          <year>2021</year>
          ). https://kidshealth.org/en/teens/find-sports.html
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Venture</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2022</year>
          , May 10).
          <article-title>What kind of sport are you into? 5 great tips on how to choose the right one</article-title>
          .
          <source>Troy Media</source>
          .
          <article-title>Re-inventing How Canadians Stay in Touch</article-title>
          . https://troymedia.com/joint-venture/
          <article-title>what-kind-of-sport-are-you-into-5-great-tips-onhow-to-choose-the-right-one/</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>O.</given-names>
            <surname>Pavlova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Soltyk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Shvaiko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ilchyshyna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. El</given-names>
            <surname>Bouhissi</surname>
          </string-name>
          .
          <article-title>Human Morphofunctional Indicators-Based Decision Support System for Choosing Kind of Sport</article-title>
          .
          <source>MoMLeT+DS</source>
          <year>2023</year>
          .
          <volume>322</volume>
          -
          <fpage>333</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T.</given-names>
            <surname>Hovorushchenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Pavlova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Shvaiko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kuzmin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Kulbachnyi</surname>
          </string-name>
          .
          <article-title>Method and system for choosing kinds of sport based on human morphofunctional indicators</article-title>
          .
          <source>ITTAP</source>
          <year>2023</year>
          .
          <volume>661</volume>
          -
          <fpage>671</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>