<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>August</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>A survey of attention mechanisms for wearable sensor-based human activity recognition</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Xin Wang</string-name>
          <email>xinwang@zut.edu.cn</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yan Wang</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yingrui Geng</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hongnian Yu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hongmei Yang</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xiaoxu Wen</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aihui Wang</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Computing, Engineering and the Built Environment, Edinburgh Napier University</institution>
          ,
          <addr-line>Edinburgh EH10 5DT</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>School of Electric and Information, Zhongyuan University of Technology</institution>
          ,
          <addr-line>Zhengzhou 450007</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>2</volume>
      <fpage>8</fpage>
      <lpage>29</lpage>
      <abstract>
        <p>Attention mechanisms, widely used in many fields such as computer vision (CV) and natural language processing (NLP), enable deep learning networks to extract more important information from the input, thereby improving performance and eficiency. Recently, attention mechanisms are introduced to wearable sensor-based human activity recognition (WSHAR) for learning more robust feature representations. This paper investigates the attention mechanisms in WSHAR with a special focus on the principles of computing attention and the targets on which the attention works in a network and future directions. The aim is to provide readers with a clearer understanding of attention mechanisms in WSAHR and motivate more diverse work in the future.</p>
      </abstract>
      <kwd-group>
        <kwd>Attention mechanisms</kwd>
        <kwd>deep learning</kwd>
        <kwd>human activity recognition</kwd>
        <kwd>wearable sensors</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Recently, attention mechanisms have become enormously popular in deep learning [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1, 2, 3</xref>
        ].
Researchers apply attention mechanisms to networks, allowing models to dynamically focus
on key parts of the input to perform specific tasks more efectively. The basic principle of
the attention mechanism is to weigh the input information so that parts of input with higher
weights are considered more relevant to the task and have a greater impact on the model’s
decisions [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Attention mechanisms were first introduced into the encoder-decoder network
for natural language processing (NLP) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], enabling the decoder to selectively access the input
sequence parts that are important to the context. Vaswani et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] performed an innovative
development on attention mechanisms, where they relied solely on a self-attention mechanism
to model the global dependencies of the input. The mechanism overcame the dificulties such as
performance degradation and computational ineficiency caused by recurrent neural networks
The 5th International Symposium on Advanced Technologies and Applications in the Internet of Things (ATAIT 2023),
(A. Wang)
CEUR
(RNNs) as the length of the input sequence increased, and showed state-of-the-art results on NLP
tasks when it was proposed. Another mainstream application scenario of attention mechanisms
is in computer vision (CV) for image classification [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ], anomaly detection [9, 10] or semantic
segmentation [11, 12]. Attention mechanisms in CV can efectively solve the problem of image
information overload, which contributes to saving computational resources [13].
      </p>
      <p>In the past decade, with the advances in Artificial Intelligence (AI) and sensor technologies,
human activity recognition (HAR) is gradually playing an important role in many cross-cutting
areas, such as smart healthcare [14], motion monitoring [15] and human-robot interaction [16].
Depending on the type of sensors arranged, HAR can be classified into three categories: 1) vision
sensor-based HAR (VSHAR), 2) ambient sensor-based HAR (ASHAR) and 3) wearable
sensorbased HAR (WSHAR) [17]. The WSHAR systems are more portable, require less computational
resources. They overcome the limitation of working only in specified areas which cameras and
ambient sensors have, thus becoming a research hotspot in the field of HAR recently. Learning
robust feature representations from raw wearable sensor data is critical for WSHAR tasks.
As an efective and powerful performance-enhancing network, attention mechanisms have
also been introduced into WSHAR to learn more valuable features from raw signals [18, 19].
Applying attention mechanisms to raw data from diferent wearable sensors enables the model
to focus on the data that contribute more to activity recognition, avoiding the efect of noise
caused by individual faulty sensors. For example, Mahmud et al. [20] proposed the sub-module
named Sensor Modality Attention to weigh the data captured from diferent sensor modalities
according to their varying contribution levels. The weighted representations obtained showed
better performance than the raw data. Combining attention mechanisms with some typical
neural networks, such as convolutional neural networks (CNNs) and long short-term memory
network (LSTM), can also help improve the performance in handling HAR tasks. Sun et al. [21]
introduced an attention layer into the LSTM to automatically capture the important temporal
dependencies. Singh et al. [22] utilized self-attention to select and learn important time points
of spatial-temporal features captured by the combination of CNN and LSTM, which achieved a
significant enhancement over the existing methods.</p>
      <p>Although several papers have reviewed the attention mechanisms or their applications in
natural language processing [23, 24] and computer vision [25, 26], few can be found in WSHAR.
The use of attention mechanisms in WSHAR tasks difers in the calculation principle and the
target that the attention works on in a network. This paper thus provides a concise review and
discussion on the applications of attention mechanisms in WSHAR. The main contributions of
this work are as follows.</p>
      <p>• Identifying the attention mechanisms in WSHAR according to the principles of computing
attention and presenting the theoretical explanations accordingly;
• Exploring the applications of attention mechanisms in WSHAR tasks according to the
targets that the attention works on in a network, and mining the corresponding motivations
and reasons behind;
• Providing the future directions of attention mechanisms in WSHAR, i.e., exploring the
feasibility of introducing the cross-attention mechanism to perform WSHAR tasks.</p>
      <p>The rest of the paper is organized as follows: Section 2 provides theoretical principles of
the generic attention mechanisms in WSHAR. Section 3 identifies and summarizes the three
attention methods in WSHAR tasks. Section 4 discusses the future directions of attention-based
WSHAR. Finally, the conclusion is presented in Section 5.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Attention mechanism theoretical principles</title>
      <p>Thanks to the high compatibility, and excellent performance in feature and model optimization,
attention mechanisms have been acquiring great success in WSHAR and a large number of
attention-based HAR models have emerged in a short period of time. According to the principles
of computing attention, there are two basic attention mechanisms in WSHAR: weight-based
attention and self-attention. Before looking into the attention methods in WSHAR, we detail
the two types of attention mechanisms in this section.</p>
      <sec id="sec-2-1">
        <title>2.1. Weight-based attention</title>
        <p>Extracting important features in the input elements helps further enhance the performance
of HAR models. The weight-based attention learns the importance of the input elements. It
assigns diferent weight coeficients to each element, thus increasing the proportion of important
elements in feature extraction and reducing or eliminating unimportant elements. As illustrated
in Figure 1, the input elements   are first fed into a specific network for features learning
to obtain the attention representations   . Then a softmax operation is performed on   to
produce the attention scores   . Finally, the obtained attention scores   are fused with the input
elements to derive the weighted attention output. Usually, there are two ways of fusion, one is
to directly fuse the attention scores with the original input elements   to obtain the weighted
input vectors   , and the other is to apply the attention scores to the attention representations  
to get the weighted attention representations   .</p>
        <p>xi
ai</p>
        <p> i
where  and  in (1) are the parameters learned by the network when linear transformations
are applied to   , and  denotes the nonlinear activation function that help extract the nonlinear
features of inputs.</p>
        <p>The weight-based attention has various variants in HAR due to diferent element objects,
action ranges and network structures. According to the action element object, it can be divided
into spatial attention, temporal attention, etc. The networks designed to capture attention
representations are diverse, such as fully connected layers, CNNs, RNNs.</p>
        <p>
          Self-attention is a mechanism proposed by Vaswani et al. [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] in Transformer. Unlike
weightbased attention, self-attention contains both attention score calculation and feature extraction,
i.e., self-attention enables the extraction of features from inputs independently, as shown in
  = softmax (  ) = 
  =   ⋅  
  =   ⋅  
(2)
(3)
(4)
(5)
transformations to the input  to obtain the transformation matrices Query ( ), Key ( ) and
Value ( ).   ,   ,   and   are the elements of  ,  ,  and  , respectively.   ,   and   are the
trainable parameter matrices. Secondly, comparing   with each   , which means conducting
matrix operations on  and the transpose of  to get the scores as follows:
where   represents the dimension of  and the scaled operation leads to having more stable
gradients. Then the results are normalized by softmax to obtain the attention scores. Finally,
 =
 ⋅
        </p>
        <p>√ 
multiply the attention scores with  to give the final output  . The entire formula for
selfattention can be expressed as:
 (,  ,  ) =
softmax (</p>
        <p>) ⋅ 
 ⋅</p>
        <p>√</p>
        <p>
          The multi-head attention proposed in [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] performs multiple self-attention operations on
the input in parallel. Then it stitches the output of each self-attention together to extract the
information obtained by multiple self-attention through an additional parameter matrix   , as
defined in ( 7).
        </p>
        <p>(,  ,  ) =</p>
        <p>⋅ Concat ( 1,  2, … ,  ℎ)
where   indicates the output of the  -th self-attention and ℎ is the number of self-attention.
Multi-head attention enables the extraction of more valuable features from diferent views,
which enhances model’s performance and thus is widely used in WSHAR.
(6)
(7)</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Attention methods in WSHAR</title>
      <p>Section 2 detials the attention mechanisms based on computational principles. Both attentions
can enhance the information processing ability of neural networks. Factors such as the target
objects or the binding networks contribute to the diversity of applications of both attention
mechanisms in WSHAR. In this section, we review the attention methods in HAR according
to the targets that the attention works on in a network and give explanations of the function
of each attention applied in a specific WSHAR task, with a summary of the related works in</p>
      <sec id="sec-3-1">
        <title>3.1. Temporal attention</title>
        <p>
          Activity data from wearable sensors are time series signals with high time dependence [
          <xref ref-type="bibr" rid="ref13">41</xref>
          ].
Extracting temporal features from raw sensor data is crucial to improve the recognition
performance. Although RNNs and their variants, such as LSTM and gated recurrent unit (GRU),
specialize in learning the sequential dependencies [
          <xref ref-type="bibr" rid="ref14 ref15 ref16">42, 43, 44</xref>
          ], they treat all time steps of sensor
data equally. It means that noise or unimportant signals are also fed into models for training,
which may lead to a degradation of recognition accuracy. The temporal attention mechanism
weights the time steps in the input or obtained representations to make the temporal
models mentioned above that focus on the information extraction on important time steps [
          <xref ref-type="bibr" rid="ref17">45</xref>
          ].
Therefore, a model with temporal attention is able to learn more robust temporal features from
the raw sensor data or representations and exhibits high eficiency [
          <xref ref-type="bibr" rid="ref18">46</xref>
          ]. Figure 3 presents the
generic structure of temporal attention in WSHAR, where  and  represent the number of
samples and sensor channels, respectively.
        </p>
        <p>Haque et al. [27] introduced a temporal attention named hierarchical context-based attention
to the two-layer GRU model to learn the hierarchy of temporal features. The attention generates
diverse importance for temporal features learned bu GRU, efectively capturing the context
of relevant time steps. Similarly, Betancourt et al. [29] added a two-layer self-attention after
an LSTM layer to improve the model’s performance. Self-attention layer compares each time</p>
        <sec id="sec-3-1-1">
          <title>Category</title>
        </sec>
        <sec id="sec-3-1-2">
          <title>Temporal attention</title>
        </sec>
        <sec id="sec-3-1-3">
          <title>Spatial attention</title>
        </sec>
        <sec id="sec-3-1-4">
          <title>Temporal &amp; spatial attention</title>
        </sec>
        <sec id="sec-3-1-5">
          <title>Method description</title>
        </sec>
        <sec id="sec-3-1-6">
          <title>Conducting weight-based attention on the temporal features generated by GRU</title>
        </sec>
        <sec id="sec-3-1-7">
          <title>Conducting self-attention or weight-based attention on the temporal features learned by LSTM</title>
        </sec>
        <sec id="sec-3-1-8">
          <title>Performing self-attention on the representations of ConvLSTM model from the temporal dimension</title>
        </sec>
        <sec id="sec-3-1-9">
          <title>Executing weight-based attention on the spatial features generated by diferent CNN three times in succession</title>
        </sec>
        <sec id="sec-3-1-10">
          <title>Performing weight-based attention on the 3D spatial features learned by CNN</title>
        </sec>
        <sec id="sec-3-1-11">
          <title>Executing self-attention on the temporal features generated by</title>
        </sec>
        <sec id="sec-3-1-12">
          <title>GRU and spatial features generated by CNN, respectively</title>
        </sec>
        <sec id="sec-3-1-13">
          <title>Conducting weight-based attention on the spatial or temporal features generated by CNN</title>
        </sec>
        <sec id="sec-3-1-14">
          <title>Executing weight-based attention on the input and model-generated representations from the spatial dimension or the temporal dimension, respectively</title>
          <p>Temporal Attention</p>
          <p>Temporal Attention</p>
          <p>Reference
Inputs
step in the representations learned by the last LSTM layer with all time steps to determine the
more relevant time steps. The obtained relevance helps to extract robust time dependencies.
The LSTM network with the attention module improved the accuracy by 4.0% and 4.2% on the
UCI-HAR and MTUT-HAR datasets, respectively. Singh et al. [22] used self-attention to further
learn the spatial-temporal representations generated by CNN and LSTM. The ConvLSTM model
with self-attention shows better performance on all six publicly available HAR datasets than
those without self-attention.</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Spatial attention</title>
        <p>
          Wearable sensors such as accelerometers, gyroscopes or magnetometers capture diferent
information about the same activity and bring diferent contributions to activity recognition [
          <xref ref-type="bibr" rid="ref19">47</xref>
          ].
The involvement of diferent body parts during the execution of an activity is diverse, so the
contribution of data collected from diferent positions to activity recognition also varies [
          <xref ref-type="bibr" rid="ref20">48</xref>
          ].
Hence, activity data from wearable sensors are not only rich in time dependencies but can
have complex spatial features. Spatial attention assigns varying weights to data from
diferent sensors or wearing positions in the spatial dimension according to their importance for
activity recognition. Thus spatial attention-based models can learn more robust spatial feature
representations, as shown in Figure 4.
        </p>
        <p>Spatial Attention</p>
        <p>Spatial Attention
s
e
s
s
a
l
c
y
tiit
v
c
A</p>
        <p>Wang et al. [33] proposed an attention-based CNN architecture that separately adds attention
submodules after the third, fourth and fith CNN layers, respectively. The attention submodule
weighs local features of the total spatial features learned by a CNN to enhance the noticeable
parts and weaken the less significant parts. The CNN model with attention submodules shows
a more eficient backpropagation and improves recognition accuracy. Sarkar et al. [ 34] applied
continuous wavelet transform to convert the time series from sensors with the size of  ×  to 2D
frequency-time domain scalograms with the size of  ×  ×  , here  is the number of timestamps
and  denotes the number of sensor channels. Then they conducted the spatial attention to  to
improve the features maps in CNN, thus enhancing CNN’s ability to learn deeper features.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Temporal &amp; spatial attention</title>
        <p>Temporal and spatial attention mechanisms have demonstrated their particular strengths in
WSHAR tasks. Based on their advantages, the temporal &amp; spatial attention can extract more
robust spatial-temporal features from the raw sensor data or feature representations learned by
the model, as shown in Figure 5.</p>
        <p>Temporal Attention</p>
        <p>Spatial Attention</p>
        <p>Temporal Attention
Spatial Attention
k
r
o
w
t
e
n
lti
a
a
p</p>
        <p>S
Inputs</p>
        <p>
          Ma et al. [35] proposed the AttnSense model that adds diferent attention mechanisms after
CNN and GRU, respectively. The attention subnet after CNN takes self-attention to give
varying weights for diferent sensor modalities from the spatial dimension, aiming to prioritize
the important modalities. The attention subnet after GRU uses self-attention to increase the
feature extraction of time steps with important contributions, while weakening the impact of
unimportant time steps. The comparison results on three publicly available HAR datasets show
that the model with two attention subnets achieves higher accuracy than using either one or
neither. Gao et al. [36] introduced channel attention and temporal attention in the DanHAR
model for feature re-extraction on the representations generated by CNN from the spatial and
temporal dimensions, respectively. The channel attention they proposed for HAR time series
signals is essentially a kind of spatial attention that uses max-pooling to combine the important
representations generated through multiple filters in the convolutional layer. In the temporal
attention, they applied global average-pooling and max-pooling to aggregate important global
contextual information. Similarly, Zheng [
          <xref ref-type="bibr" rid="ref11">39</xref>
          ] also used spatial &amp; temporal attention in the
LGSTNet model, with the diference that Zheng applies temporal attention to the original input
windows and spatial attention to the representations generated by 2D CNN. The probabilities
generated by temporal and spatial attention can enhance the contributions of the important
segments of the local spatial-temporal features. The ablation experiment results indicate that
the attention mechanisms can improve the recognition performance of the LGSTNet model,
especially in recognizing some similar activities.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Future directions</title>
      <p>
        The studies mentioned above have demonstrated the potential of the self-attention mechanism
in human activity recognition [29, 22, 35]. The Q, K and V in WSHAR tasks are almost from the
same data source, which is efective in capturing sequential features and global information.
Recently, some studies have improved self-attention to propose a mechanism called
crossattention, where Q and K come from the same data source and V comes from another data source.
Thus, cross-attention enables to learn the inter-dependencies of diferent data sources. Lin et
al. [
        <xref ref-type="bibr" rid="ref21">49</xref>
        ] proposed a cross-attention mechanism that alternates attention to information within
image slices to acquire local information and captures attention information between local image
slices to obtain global information. Their model improves the recognition rate in vision tasks and
reduces the computational efort of the self-attention mechanism in the standard Transformer.
Bhatti et al. [
        <xref ref-type="bibr" rid="ref22">50</xref>
        ] established attention-based cross-modality for connecting diferent locations
of wearable information for emotion recognition.
      </p>
      <p>For multi-position wearable HAR, how to fuse the data provided by multiple heterogeneous
sensors and learn the data relationships within and between sensors is also one of the main
challenges. The sensor data from diferent wearing positions are either similar or diferent.
Therefore, introducing a cross-attention mechanism to establish the interaction of sensor
information from diferent wearing positions can be further explored.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>This paper summarizes attention mechanisms used in wearable sensor-based human activity
recognition. Firstly, based on the principles of computing attention, we divide the attention
mechanisms commonly used in WSHAR into two categories, i.e., weight-based attention and
self-attention, and theoretically demonstrate their principles. Secondly, targeting the objects
on which the attention mechanism acts, we survey and discuss the applications of attention
mechanisms in WSHAR in terms of the attention methods: temporal attention, spatial attention
and spatial &amp; temporal attention. Finally, we point out that introducing the cross-attention
mechanisms to WSHAR can benefit learning global and cross correlations in related works.
Future work should consider running the codes of works mentioned above and further mine
the attention principles in specific WSHAR tasks.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This work was supported by Key Technologies R &amp; D Programs of Henan (No. 222102210016
and No. 232102211020), Henan Provincial Foreign Experts Program (No. GZS2022012) and
Zhongyuan University of Technology, Research Team Development Project on Machine
Intelligence and High-Dimensional Data Analysis (No. K2022TD001).
mechanisms in deep neural networks for image classification and object detection, Pattern
Recognition 123 (2022) 108411. doi:10.1016/j.patcog.2021.108411.
[9] S. Chang, Y. Li, S. Shen, J. Feng, Z. Zhou, Contrastive attention for video anomaly detection,</p>
      <p>IEEE Transactions on Multimedia 24 (2021) 4067–4076. doi:10.1109/TMM.2021.3112814.
[10] Q. Li, R. Yang, F. Xiao, B. Bhanu, F. Zhang, Attention-based anomaly detection in
multiview surveillance videos, Knowledge-Based Systems 252 (2022) 109348. doi:10.1016/j.
knosys.2022.109348.
[11] M. Wu, C. Zhang, J. Liu, L. Zhou, X. Li, Towards accurate high resolution satellite image
semantic segmentation, IEEE Access 7 (2019) 55609–55619. doi:10.1109/ACCESS.2019.
2913442.
[12] R. Li, S. Zheng, C. Duan, J. Su, C. Zhang, Multistage attention resu-net for semantic
segmentation of fine-resolution remote sensing images, IEEE Geoscience and Remote
Sensing Letters 19 (2021) 1–5. doi:10.1109/LGRS.2021.3063381.
[13] Z. Niu, G. Zhong, H. Yu, A review on the attention mechanism of deep learning,
Neurocomputing 452 (2021) 48–62. doi:10.1016/j.neucom.2021.03.091.
[14] F. Serpush, M. B. Menhaj, B. Masoumi, B. Karasfi, Wearable sensor-based human activity
recognition in the smart healthcare system, Computational intelligence and neuroscience
2022 (2022). doi:10.1155/2022/1391906.
[15] Z. Wang, J. Wang, H. Zhao, S. Qiu, J. Li, F. Gao, X. Shi, Using wearable sensors to capture
posture of the human lumbar spine in competitive swimming, IEEE Transactions on
Human-Machine Systems 49 (2019) 194–205. doi:10.1109/THMS.2019.2892318.
[16] A. Anagnostis, L. Benos, D. Tsaopoulos, A. Tagarakis, N. Tsolakis, D. Bochtis, Human
activity recognition through recurrent neural networks for human–robot interaction in
agriculture, Applied Sciences 11 (2021) 2188. doi:10.3390/app11052188.
[17] Y. Wang, S. Cang, H. Yu, A survey on wearable sensor modality centred human activity
recognition in health care, Expert Systems with Applications 137 (2019) 167–190. doi:10.
1016/j.eswa.2019.04.057.
[18] S. Chaudhari, V. Mithal, G. Polatkan, R. Ramanath, An attentive survey of attention
models, ACM Transactions on Intelligent Systems and Technology (TIST) 12 (2021) 1–32.
doi:10.1145/3465055.
[19] K. Chen, L. Yao, D. Zhang, X. Wang, X. Chang, F. Nie, A semisupervised recurrent
convolutional attention model for human activity recognition, IEEE transactions on neural
networks and learning systems 31 (2019) 1747–1756. doi:10.1109/TNNLS.2019.2927224.
[20] S. Mahmud, M. Tanjid Hasan Tonmoy, K. Kumar Bhaumik, A. Mahbubur Rahman, M.
Ashraful Amin, M. Shoyaib, M. Asif Hossain Khan, A. Ahsan Ali, Human activity recognition
from wearable sensor data using self-attention, in: Twenty-fourth European Conference on
Artificial Intelligence (ECAI), IOS Press, 2020, pp. 1332–1339. doi: 10.3233/FAIA200236.
[21] B. Sun, M. Liu, R. Zheng, S. Zhang, Attention-based lstm network for wearable human
activity recognition, in: 2019 Chinese Control Conference (CCC), 2019, pp. 8677–8682.
doi:10.23919/ChiCC.2019.8865360.
[22] S. P. Singh, M. K. Sharma, A. Lay-Ekuakille, D. Gangwar, S. Gupta, Deep convlstm with
self-attention for human activity decoding using wearable sensors, IEEE Sensors Journal
21 (2020) 8575–8582. doi:10.1109/JSEN.2020.3045135.
[23] D. Hu, An introductory survey on attention mechanisms in nlp problems, in:
Intelligent Systems and Applications: Proceedings of the 2019 Intelligent Systems Conference
(IntelliSys) Volume 2, Springer, 2020, pp. 432–448. doi:10.1007/978-3-030-29513-4_31.
[24] N. Zhang, J. Kim, A survey on attention mechanism in nlp, in: 2023 International
Conference on Electronics, Information, and Communication (ICEIC), IEEE, 2023, pp. 1–4.
doi:10.1109/ICEIC57457.2023.10049971.
[25] M.-H. Guo, T.-X. Xu, J.-J. Liu, Z.-N. Liu, P.-T. Jiang, T.-J. Mu, S.-H. Zhang, R. R. Martin, M.-M.</p>
      <p>Cheng, S.-M. Hu, Attention mechanisms in computer vision: A survey, Computational
Visual Media 8 (2022) 331–368. doi:10.1007/s41095-022-0271-y.
[26] X. Yang, An overview of the attention mechanisms in computer vision, in: Journal of
Physics: Conference Series, volume 1693, IOP Publishing, 2020, p. 012173. doi:10.1088/
1742-6596/1693/1/012173.
[27] M. N. Haque, M. T. H. Tonmoy, S. Mahmud, A. A. Ali, M. A. H. Khan, M. Shoyaib, Gru-based
attention mechanism for human activity recognition, in: 2019 1st International Conference
on Advances in Science, Engineering and Robotics Technology (ICASERT), IEEE, 2019, pp.
1–6. doi:10.1109/ICASERT.2019.8934659.
[28] T. R. Mim, M. Amatullah, S. Afreen, M. A. Yousuf, S. Uddin, S. A. Alyami, K. F. Hasan,
M. A. Moni, Gru-inc: An inception-attention based approach using gru for human activity
recognition, Expert Systems with Applications 216 (2023) 119419. doi:10.1016/j.eswa.
2022.119419.
[29] C. Betancourt, W.-H. Chen, C.-W. Kuan, Self-attention networks for human activity
recognition using wearable devices, in: 2020 IEEE international conference on systems,
man, and cybernetics (SMC), IEEE, 2020, pp. 1194–1199. doi:10.1109/SMC42975.2020.
9283381.
[30] L. Liu, J. He, K. Ren, J. Lungu, Y. Hou, R. Dong, An information gain-based model and
an attention-based rnn for wearable human activity recognition, Entropy 23 (2021) 1635.
doi:10.3390/e23121635.
[31] X. Yin, Z. Liu, D. Liu, X. Ren, A novel cnn-based bi-lstm parallel model with attention
mechanism for human activity recognition with noisy data, Scientific Reports 12 (2022)
1–11. doi:10.1038/s41598-022-11880-8.
[32] M. A. Khatun, M. A. Yousuf, S. Ahmed, M. Z. Uddin, S. A. Alyami, S. Al-Ashhab, H. F.</p>
      <p>Akhdar, A. Khan, A. Azad, M. A. Moni, Deep cnn-lstm with self-attention model for human
activity recognition using wearable sensor, IEEE Journal of Translational Engineering in
Health and Medicine 10 (2022) 1–16. doi:10.1109/JTEHM.2022.3177710.
[33] K. Wang, J. He, L. Zhang, Attention-based convolutional neural network for weakly
labeled human activities’ recognition with wearable sensors, IEEE Sensors Journal 19
(2019) 7598–7604. doi:10.1109/JSEN.2019.2917225.
[34] A. Sarkar, S. S. Hossain, R. Sarkar, Human activity recognition from sensor data using
spatial attention-aided cnn with genetic algorithm, Neural Computing and Applications
35 (2023) 5165–5191. doi:10.1007/s00521-022-07911-0.
[35] H. Ma, W. Li, X. Zhang, S. Gao, S. Lu, Attnsense: multi-level attention mechanism
for multimodal human activity recognition, in: Proceedings of the 28th International
Joint Conference on Artificial Intelligence, 2019, pp. 3109–3115. doi: 10.5555/3367471.
3367473.
[36] W. Gao, L. Zhang, Q. Teng, J. He, H. Wu, Danhar: Dual attention network for multimodal</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>H.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Dai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>So</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q. V.</given-names>
            <surname>Le</surname>
          </string-name>
          , Pay attention to mlps,
          <source>Advances in Neural Information Processing Systems</source>
          <volume>34</volume>
          (
          <year>2021</year>
          )
          <fpage>9204</fpage>
          -
          <lpage>9215</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>M.-H. Guo</surname>
            ,
            <given-names>Z.-N.</given-names>
          </string-name>
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>T.-J.</given-names>
          </string-name>
          <string-name>
            <surname>Mu</surname>
            ,
            <given-names>S.-M.</given-names>
          </string-name>
          <string-name>
            <surname>Hu</surname>
          </string-name>
          ,
          <article-title>Beyond self-attention: External attention using two linear layers for visual tasks</article-title>
          ,
          <source>IEEE Transactions on Pattern Analysis and Machine Intelligence</source>
          <volume>45</volume>
          (
          <year>2022</year>
          )
          <fpage>5436</fpage>
          -
          <lpage>5447</lpage>
          . doi:
          <volume>10</volume>
          .1109/TPAMI.
          <year>2022</year>
          .
          <volume>3211006</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Jing</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <article-title>Attention based spatiotemporal graph attention networks for trafic flow forecasting</article-title>
          ,
          <source>Information Sciences 607</source>
          (
          <year>2022</year>
          )
          <fpage>869</fpage>
          -
          <lpage>883</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.ins.
          <year>2022</year>
          .
          <volume>05</volume>
          .127.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Serrano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. A.</given-names>
            <surname>Smith</surname>
          </string-name>
          , Is attention interpretable?,
          <source>in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>2931</fpage>
          -
          <lpage>2951</lpage>
          . doi:
          <volume>10</volume>
          . 18653/v1/
          <fpage>P19</fpage>
          - 1282.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.</given-names>
            <surname>Bahdanau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Cho</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y. Bengio,</surname>
          </string-name>
          <article-title>Neural machine translation by jointly learning to align and translate</article-title>
          ,
          <source>arXiv preprint arXiv:1409.0473</source>
          (
          <year>2014</year>
          ). doi:
          <volume>10</volume>
          .48550/arXiv.1409.0473.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Vaswani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shazeer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Parmar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Uszkoreit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. N.</given-names>
            <surname>Gomez</surname>
          </string-name>
          , Ł. Kaiser,
          <string-name>
            <surname>I. Polosukhin</surname>
          </string-name>
          ,
          <article-title>Attention is all you need</article-title>
          ,
          <source>Advances in neural information processing systems</source>
          <volume>30</volume>
          (
          <year>2017</year>
          ). URL: https://dl.acm.org/doi/10.5555/3295222.3295349.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Peng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <article-title>Object-part attention model for fine-grained image classification</article-title>
          ,
          <source>IEEE Transactions on Image Processing</source>
          <volume>27</volume>
          (
          <year>2017</year>
          )
          <fpage>1487</fpage>
          -
          <lpage>1500</lpage>
          . doi:
          <volume>10</volume>
          .1109/TIP.
          <year>2017</year>
          .
          <volume>2774041</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Obeso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Benois-Pineau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. S. G.</given-names>
            <surname>Vázquez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Á. R.</given-names>
            <surname>Acosta</surname>
          </string-name>
          ,
          <article-title>Visual vs internal attention human activity recognition using wearable sensors</article-title>
          ,
          <source>Applied Soft Computing</source>
          <volume>111</volume>
          (
          <year>2021</year>
          )
          <article-title>107728</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.asoc.
          <year>2021</year>
          .
          <volume>107728</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Teng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Min</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Song</surname>
          </string-name>
          ,
          <article-title>Triple cross-domain attention on human activity recognition using wearable sensors</article-title>
          ,
          <source>IEEE Transactions on Emerging Topics in Computational Intelligence</source>
          <volume>6</volume>
          (
          <year>2022</year>
          )
          <fpage>1167</fpage>
          -
          <lpage>1176</lpage>
          . doi:
          <volume>10</volume>
          .1109/TETCI.
          <year>2021</year>
          .
          <volume>3136642</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>C.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y. Zhang,</surname>
          </string-name>
          <article-title>Bi-stan: bilinear spatial-temporal attention network for wearable human activity recognition</article-title>
          ,
          <source>International Journal of Machine Learning and Cybernetics</source>
          (
          <year>2023</year>
          )
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          . doi:
          <volume>10</volume>
          .1007/s13042-023-01781-1.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>G.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <article-title>A novel attention-based convolution neural network for human activity recognition</article-title>
          ,
          <source>IEEE Sensors Journal</source>
          <volume>21</volume>
          (
          <year>2021</year>
          )
          <fpage>27015</fpage>
          -
          <lpage>27025</lpage>
          . doi:
          <volume>10</volume>
          .1109/JSEN.
          <year>2021</year>
          .
          <volume>3122258</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>M.</given-names>
            <surname>Zeng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. J.</given-names>
            <surname>Mengshoel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Langseth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Lane</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <article-title>Understanding and improving recurrent networks for human activity recognition by continuous attention</article-title>
          ,
          <source>in: Proceedings of the 2018 ACM international symposium on wearable computers</source>
          ,
          <year>2018</year>
          , pp.
          <fpage>56</fpage>
          -
          <lpage>63</lpage>
          . doi:
          <volume>10</volume>
          .1145/3267242.3267286.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Min</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <article-title>Multiscale deep feature learning for human activity recognition using wearable sensors</article-title>
          ,
          <source>IEEE Transactions on Industrial Electronics</source>
          <volume>70</volume>
          (
          <year>2022</year>
          )
          <fpage>2106</fpage>
          -
          <lpage>2116</lpage>
          . doi:
          <volume>10</volume>
          .1109/TIE.
          <year>2022</year>
          .
          <volume>3161812</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [42]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Chevalier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <surname>Z. Zhang,</surname>
          </string-name>
          <article-title>Deep residual bidir-lstm for human activity recognition using wearable sensors</article-title>
          ,
          <source>Mathematical Problems in Engineering</source>
          <year>2018</year>
          (
          <year>2018</year>
          )
          <fpage>1</fpage>
          -
          <lpage>13</lpage>
          . doi:
          <volume>10</volume>
          .1155/
          <year>2018</year>
          /7316954.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [43]
          <string-name>
            <given-names>L.</given-names>
            <surname>Wang</surname>
          </string-name>
          , R. Liu,
          <article-title>Human activity recognition based on wearable sensor using hierarchical deep lstm networks</article-title>
          ,
          <source>Circuits, Systems, and Signal Processing</source>
          <volume>39</volume>
          (
          <year>2020</year>
          )
          <fpage>837</fpage>
          -
          <lpage>856</lpage>
          . doi:
          <volume>10</volume>
          . 1007/s00034-019-01116-y.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>S.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <article-title>Deep bidirectional gru network for human activity recognition using wearable inertial sensors</article-title>
          ,
          <source>in: 2022 3rd International Conference on Electronic Communication and Artificial Intelligence (IWECAI)</source>
          , IEEE,
          <year>2022</year>
          , pp.
          <fpage>238</fpage>
          -
          <lpage>242</lpage>
          . doi:
          <volume>10</volume>
          . 1109/IWECAI55315.
          <year>2022</year>
          .
          <volume>00054</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [45]
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Hamad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kimura</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. L.</given-names>
            <surname>Woo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <article-title>Dilated causal convolution with multi-head self attention for sensor human activity recognition</article-title>
          ,
          <source>Neural Computing and Applications</source>
          <volume>33</volume>
          (
          <year>2021</year>
          )
          <fpage>13705</fpage>
          -
          <lpage>13722</lpage>
          . doi:
          <volume>10</volume>
          .1007/s00521-021-06007-5.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [46]
          <string-name>
            <given-names>J.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Yin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <article-title>Gru with dual attentions for sensor-based human activity recognition</article-title>
          ,
          <source>Electronics</source>
          <volume>11</volume>
          (
          <year>2022</year>
          )
          <article-title>1797</article-title>
          . doi:
          <volume>10</volume>
          .3390/electronics11111797.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [47]
          <string-name>
            <given-names>K.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Yao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Yu</surname>
          </string-name>
          , Y. Liu,
          <article-title>Deep learning for sensor-based human activity recognition: Overview, challenges, and opportunities</article-title>
          ,
          <source>ACM Computing Surveys (CSUR) 54</source>
          (
          <year>2021</year>
          )
          <fpage>1</fpage>
          -
          <lpage>40</lpage>
          . doi:
          <volume>10</volume>
          .1145/3447744.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [48]
          <string-name>
            <given-names>T.</given-names>
            <surname>Sztyler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Stuckenschmidt</surname>
          </string-name>
          , W. Petrich,
          <article-title>Position-aware activity recognition with wearable devices</article-title>
          ,
          <source>Pervasive and mobile computing 38</source>
          (
          <year>2017</year>
          )
          <fpage>281</fpage>
          -
          <lpage>295</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.pmcj.
          <year>2017</year>
          .
          <volume>01</volume>
          .008.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [49]
          <string-name>
            <given-names>H.</given-names>
            <surname>Lin</surname>
          </string-name>
          , X. Cheng, X.
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Shen</surname>
          </string-name>
          ,
          <article-title>Cat: Cross attention in vision transformer</article-title>
          ,
          <source>in: 2022 IEEE International Conference on Multimedia and Expo (ICME)</source>
          , IEEE,
          <year>2022</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          . doi:
          <volume>10</volume>
          .1109/ICME52920.
          <year>2022</year>
          .
          <volume>9859720</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [50]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bhatti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Behinaein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hungler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Etemad</surname>
          </string-name>
          , Attx:
          <article-title>Attentive cross-connections for fusion of wearable signals in emotion recognition</article-title>
          ,
          <source>arXiv preprint arXiv:2206.04625</source>
          (
          <year>2022</year>
          ). doi:
          <volume>10</volume>
          .48550/arXiv.2206.04625.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>