<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Gait-Robust Heading Estimation Using Horizontal Acceleration for Smartphone-based PDR</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kazuma Kano</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Takuto Yoshida</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shin Katayama</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kenta Urano</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Takuro Yonezawa</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nobuo Kawaguchi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Graduate School of Engineering, Nagoya University</institution>
          ,
          <addr-line>Nagoya</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This study tackles heading estimation for Pedestrian Dead Reckoning (PDR) with smartphones. In dealing with changes in the holding posture of smartphones, it works to consider the relationship between sensor orientation and heading. However, the existing methods lack robustness to various gaits, such as sideways and backward walking. Therefore, we propose a novel method considering various spatiotemporal features of horizontal acceleration with deep learning. The proposed method calculates horizontal acceleration in the global coordinate system from measured acceleration, gravitational acceleration, and rotation vector. Then, it inputs the horizontal acceleration over a certain period into a deep neural network model and predicts the unit vector directed to the mean heading during that period. We created a dataset covering multiple gaits and evaluated the method using four models: Convolutional Neural Network (CNN), Bidirectional Long Short-Term Memory (BiLSTM), DualCNN-LSTM, and DualCNNTransformer. Consequently, we found that the proposed method was more robust to gaits than the existing methods, with the DualCNN-LSTM and DualCNN-Transformer models achieving the highest accuracy.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;dataset</kwd>
        <kwd>deep learning</kwd>
        <kwd>indoor localizaton</kwd>
        <kwd>indoor positioning</kwd>
        <kwd>pedestrian dead reckoning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <sec id="sec-1-1">
        <title>Measure acceleration, gravitational acceleration, and rotation vector</title>
      </sec>
      <sec id="sec-1-2">
        <title>Calculate horizontal acceleration in global coordinate system</title>
        <sec id="sec-1-2-1">
          <title>Horizontal acceleration in global coordinate system over a certain period</title>
        </sec>
        <sec id="sec-1-2-2">
          <title>Unit vector directed to mean heading</title>
        </sec>
      </sec>
      <sec id="sec-1-3">
        <title>Estimate headings with deep learning</title>
        <p>CNN</p>
        <sec id="sec-1-3-1">
          <title>BiLSTM</title>
        </sec>
        <sec id="sec-1-3-2">
          <title>DualCNN-LSTM</title>
        </sec>
        <sec id="sec-1-3-3">
          <title>DualCNN-Transformer</title>
          <p>
            Walking speed estimation methods include combining step detection and stride estimation[
            <xref ref-type="bibr" rid="ref2 ref3">2,
3</xref>
            ] and directly computing walking speeds[
            <xref ref-type="bibr" rid="ref4">4, 5</xref>
            ]. These methods have attained adequately high
accuracy. On the other hand, heading estimation methods are divided into just regarding sensor
rotations around the vertical axis as the heading changes[6] and considering the relationship
between the sensor orientations and actual headings. The latter methods can deal with changes
in the holding posture of smartphones during positioning. However, the existing methods still
have problems, such as a lack of robustness against gait diferences.
          </p>
          <p>Therefore, we propose a data-driven approach to estimate headings based on horizontal
acceleration in the Global Coordinate System (GCS). We aim to improve gait robustness by
considering various spatiotemporal features with deep learning. Fig. 1 illustrates an overview
of the proposed method. First, we calculate horizontal acceleration in GCS from acceleration,
gravitational acceleration, and rotation vector measured by a smartphone. Then, we input
the horizontal acceleration in GCS over a certain period into a deep neural network model to
predict the unit vector directed to the mean heading during that period. This study explores
four models: Convolutional Neural Network (CNN), Bidirectional Long Short-Term Memory
(BiLSTM), DualCNN-LSTM, and DualCNN-Transformer. We created a dataset supporting
multiple gaits and evaluated the estimation accuracy. The result showed that the proposed
method improved gait robustness compared to the existing and especially achieved the highest
accuracy when using the DualCNN-LSTM and DualCNN-Transformer models.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>2.1. Heading Estimation Methods Based on Horizontal Acceleration in Global</p>
      <p>Coordinate System
Horizontal acceleration tends to spread along heading directions because people repeatedly
accelerate and decelerate in walking. Deng et al. proposed Rotation Matrix and Principal
Component Analysis (RMPCA), which estimates heading as the first principal component of
horizontal acceleration obtained via PCA[7]. Ban et al. estimated heading as the mean vector
of horizontal acceleration while the magnitude of acceleration without gravity component
exceeded a certain threshold[8]. For convenience, we refer to this method as Horizontal
Acceleration Mean (HAM) method. Both methods preprocess acceleration by transforming it
into GCS with the origin at the sensor’s position. Here, GCS refers to a right-handed coordinate
system defined as follows:
• X-axis is horizontal and points east at the origin.
• Y-axis is horizontal and points north at the origin.</p>
      <p>• Z-axis is vertical and points upward at the origin.</p>
      <p>
        Transforming into GCS in advance helps identify the problems at sensor orientation estimation
and heading estimation. It also enables independent heading estimation at each time step,
avoiding the accumulation of heading errors even during prolonged positioning. However,
these methods lack gait robustness as they do not account for temporal information or other
spatial features.
2.2. Pedestrian Dead Reckoning Methods Using Deep Learning
Deep learning, which can automatically choose and consider various features, seems efective
in improving gait robustness. Chen et al. proposed IONet, which uses BiLSTM to estimate
trajectories and demonstrated improved positioning accuracy compared to conventional PDR
methods[9]. IONet trains the model to output moved distances and heading displacements
from acceleration and angular velocity in Sensor Coordinate System (SCS). Then, it sequentially
integrates the displacements to estimate the headings. Chen et al. also attempted to reduce the
computational complexity by applying WaveNet[10]. Kawaguchi et al. proposed
DualCNNLSTM as a model for walking speed estimation, designed to handle various gaits such as fast
walking and stepping in place[
        <xref ref-type="bibr" rid="ref4">4, 5</xref>
        ]. DualCNN-LSTM has two paths consisting of convolutional
layers with diferent kernel sizes connected in parallel to an LSTM layer. It is presumed to
accommodate various gaits by extracting a wide range of features from short-term to long-term.
Although many studies have applied deep learning to PDR, they have not suficiently discussed
the gait robustness of heading estimation. In addition, previous studies usually input sensor
measurement data into models before the coordinate transformation, making it impossible to
separate errors in sensor orientation estimation and heading estimation.
2.3. Activity Recognition Methods Using Deep Learning
Human Activity Recognition (HAR) is another representative research field involving sensor
measurement data analysis. Ha et al. used CNN to recognize actions in car assembly lines and
daily life based on acceleration measured by multiple sensors and multimodal data[11]. Zhao et
al. applied BiLSTM and improved classification accuracy compared with plain LSTM[ 12]. Shavit
et al. applied Transformer, a state-of-the-art model in research fields such as natural language
processing and computer vision, to HAR and improved classification accuracy compared to
simple CNN[13].
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>We use deep neural network models to estimate headings from horizontal acceleration in
GCS. Fig. 2 describes the training and prediction processes of the proposed method. In the
ofline phase, we train models following the solid blue arrows. First, measure acceleration,
gravitational acceleration, and rotation vector with a smartphone while measuring the trajectory
with surveying equipment. Next, align their timestamps and resample at 100 (Hz). Then,
calculate horizontal acceleration in GCS and denoise them before inputting to the model. Finally,
train the model by backpropagating the losses between the estimated headings and ground
truth computed from the actual trajectory. In the online phase, we predict headings following
the dashed orange arrows.
3.1. Deriviation of Horizontal Acceleration
Horizontal acceleration in GCS is calculated from acceleration and gravitational acceleration in
SCS and rotation vector. Firstly, project the acceleration  onto the gravitational acceleration
 to obtain the vertical component of acceleration . Subtracting  from  gives horizontal
acceleration ℎ . The superscript ’’ stands for SCS.</p>
      <p>=
ℎ =  − 
︂(  ·  )︂
 · 

Secondly, transform the horizontal acceleration ℎ into GCS using the rotation vector. The
rotation vector corresponds to the smartphone orientations in GCS with the origin at the
smartphone’s position. Applying the rotation operation  represented by the rotation vector to
the horizontal acceleration ℎ in SCS yields horizontal acceleration ℎ in GCS. The superscript
’’ stands for GCS.</p>
      <p>ℎ =  (ℎ )
Finally, apply the Gaussian filter with a standard deviation  = 1.1 to remove noises. The
ifltered horizontal acceleration is calculated as the convolution of the Gaussian kernel and
original horizontal acceleration ℎ along the time dimension. We obtain each component of
(1)
(2)
(3)</p>
      <sec id="sec-3-1">
        <title>Sensor measurement data for estimation</title>
      </sec>
      <sec id="sec-3-2">
        <title>Smartphone</title>
      </sec>
      <sec id="sec-3-3">
        <title>Acceleration in SCS</title>
      </sec>
      <sec id="sec-3-4">
        <title>Gravitational acceleration in SCS</title>
      </sec>
      <sec id="sec-3-5">
        <title>Position data for ground truth</title>
      </sec>
      <sec id="sec-3-6">
        <title>Surveying instrument</title>
      </sec>
      <sec id="sec-3-7">
        <title>Rotation</title>
        <p>vector</p>
      </sec>
      <sec id="sec-3-8">
        <title>Trajectory in GCS</title>
      </sec>
      <sec id="sec-3-9">
        <title>Time synchronization</title>
      </sec>
      <sec id="sec-3-10">
        <title>Resampling</title>
      </sec>
      <sec id="sec-3-11">
        <title>Horizontal acceleration calculation</title>
      </sec>
      <sec id="sec-3-12">
        <title>Heading calculation</title>
      </sec>
      <sec id="sec-3-13">
        <title>Horizontal acceleration in SCS</title>
      </sec>
      <sec id="sec-3-14">
        <title>Coordinate transformation</title>
      </sec>
      <sec id="sec-3-15">
        <title>Horizontal acceleration in GCS</title>
      </sec>
      <sec id="sec-3-16">
        <title>Denoising</title>
      </sec>
      <sec id="sec-3-17">
        <title>Deep neural network model</title>
      </sec>
      <sec id="sec-3-18">
        <title>Estimated heading in GCS</title>
      </sec>
      <sec id="sec-3-19">
        <title>Loss</title>
      </sec>
      <sec id="sec-3-20">
        <title>Loss calculation</title>
      </sec>
      <sec id="sec-3-21">
        <title>True heading in GCS</title>
      </sec>
      <sec id="sec-3-22">
        <title>Training</title>
      </sec>
      <sec id="sec-3-23">
        <title>Prediction</title>
        <p>the filtered horizontal acceleration at time
^,
^
,
=
=
√
√
1
2
1
2


∑︁
=−</p>
        <p>∑︁
=− 
,−  exp
,−  exp
︂(
︂(
−
−

2 )︂
2 2

2 )︂
2 2
where 
,− 
and</p>
        <p>,− 

−</p>
        <p>denote each component of the original horizontal acceleration at time
. We round the kernel radius  to 3 because the exponential term is negligibly small.
(4)
(5)
3.2. Heading Estimation Model
3.2.1. Input and Output
At ention (d ，nhead )</p>
        <p>Dropout (p )
LayerNorm
FC-1 (ch )</p>
        <p>GeLU
Dropout (p )
FC-2 (ch )
Dropout (p )
LayerNorm
The model receives the horizontal acceleration in GCS over a certain period and outputs the unit
vector directed to the mean heading during that period. Note that the trajectories measured by
surveying equipment and the headings computed from them may contain errors. We intend to
alleviate their influence by taking the average. Additionally, estimating the heading as a vector
rather than an angle helps eliminate discontinuity at the ± 180 (deg) boundary and stabilize
the learning process[14]. The x and y components of the vector correspond to the cosine and
sine of the angle, respectively. In this paper, we empirically set the input length to four seconds.
Since resampled at 100 (Hz), the input size is 400 × 2, and the output size is 1 × 2.
3.2.2. Network Architecture
This study considers four models: CNN, BiLSTM, DualCNN-LSTM, and DualCNN-Transformer.
Fig. 3 displays the model architectures. The CNN model has three stacked convolutional
layers, extracting short-term features at the shallow layers and long-term at the deep. The
BiLSTM model extracts features across the entire data in the BiLSTM layers. The concatenated
forward and backward hidden states of the second BiLSTM layer are fed into the fully connected
layer. The DualCNN-LSTM model extracts various-timescale features in two diferent-sized
convolutional paths and captures how those features lie throughout the data in the LSTM layer.
The inputs of two paths are shared, and their outputs are concatenated and passed to the LSTM
layer. The DualCNN-Transformer model extracts features in two diferent-sized convolutional
paths and considers the relationship among them by self-attention. The concatenated outputs
of two paths with learnable positional encoding added are passed to the Transformer encoder.
The output of the Transformer encoder is averaged over the time dimension and then fed into
the fully connected layer. We normalize the final output to have a norm of 1 for the CNN
and DualCNN-LSTM models. On the other hand, we do not normalize for the BiLSTM and
DualCNN-Transformer models because it hinders their learning processes.
3.2.3. Training and Hyperparameters
We perform estimation and update the weights at 10 (Hz) during training. In other words, the
window to slice input data slides by ten samples at a time. We employ Mean Squared Error
(MSE) as a loss function and Adam with a learning rate of 0.001 as an optimizer. The batch
size is set to 512. We train models for 100 epochs and use the weights at the epoch with the
smallest validation loss. We determined the hyperparameters by grid search from the candidate
values listed in Tables 1, 2, 3, and 4. Parameters written in bold indicate what resulted in the
smallest validation loss. Note that we constrained some parameter combinations due to the
large search space. For the DualCNN-LSTM model, the kernel sizes  and , the dropout
proportions  and , for Conv-s and Conv-l, respectively, satisfy the following relationship:
2 − 1 =</p>
        <p>= 
2ℎ = 2ℎ =  = 4ℎ</p>
        <p>2 − 1 = 
For the DualCNN-Transformer model as well, the numbers of out channels ℎ of Conv-s and
ℎ of Conv-l, the dimension  of Attention, the number of out channels ℎ of FC-3, and
the kernel sizes  of Conv-s and  of Conv-l satisfy the following relationship:</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Evaluation</title>
      <p>4.1. Dataset
We created a dataset supporting multiple gaits for evaluation. We collected acceleration,
gravitational acceleration, rotation vector, angular velocity, and geomagnetic field with a smartphone
(Google Pixel 4, Android 10). These sensor measurement data can be retrieved via Android
Sensor Framework API and are recorded in SCS. The gravitational acceleration and rotation
vector are internally computed from the acceleration, angular velocity, and geomagnetic field.
We simultaneously measured 3D position data in GCS using a laser surveying instrument
(TOPCON GT-1205). We instructed the subjects to hold the smartphone in front of their chest
and walk along markers we placed beforehand.
(6)
(7)
(8)
(9)
-20
-10
-20
-30
-20Position (m)</p>
      <p>-10
(a) Rectangle
0
-30
0</p>
      <p>-5
-20Position (m-1)0
(b) Hourglass</p>
      <p>0Position (m)5
(c) Straight line
10</p>
      <p>The subjects are eight males aged 21 to 23 years. The gaits consist of forwards, right sideways,
left sideways, and backwards. The walking courses have three shapes: rectangle, hourglass, and
straight line. Fig. 5 exhibits the instances of trajectories and walking postures. Their colors
indicate the temporal transition, meaning the subjects moved from purple to yellow. The arrows
represent the smartphone orientations at each time computed from the rotation vector. The side
lengths of the rectangular and hourglass-shaped courses range from 20 to 31 (m). The subjects
circulate the route by a single gait in the rectangular and hourglass-shaped courses. These data
include cornering movements (i.e., heading changes during walking). On the other hand, the
subjects change their gaits every 5 (m) and change their headings by 180 (deg) every 15 (m) in
the straight-line courses. These data include gait changes and heading changes by 180 (deg).
Total walking distance and recording time are 8543 (m) and 120 minutes, respectively.
4.2. Experimental Conditions
We evaluate the heading estimation accuracy of the proposed method for each model. We also
evaluate and compare RMPCA[7] and HAM[8], based on horizontal acceleration in GCS, and
IONet[9], based on deep learning. First of all, we split the dataset into three subsets for training,
validation, and testing. However, we allocated all data of the hourglass-shaped courses for
training due to limited data available. Then, we conducted grid search using the training and
validation subsets to determine the hyperparameters. Finally, we performed estimation at 100
(Hz) on the testing subset and evaluated the results.</p>
      <p>The proposed method trains the model to output the mean heading over a certain period.
However, we treat it as the estimated heading at the central time during evaluation. We used
our implementation of IONet based on the original paper because it is not publicly available.
IONet sequentially calculates headings by integrating estimated heading displacements, so we
provided the ground truth as the initial heading. The evaluation metrics are MSE of heading
cosine and sine. The MSE value  takes a minimum of 0 when the estimated headings ^ and
the ground truth   coincide for all time 1 ≤  ≤  and a maximum of 2 when they difer by</p>
      <p>cos(^) − cos( ))︁ 2 + ︁( sin(^) − sin( ))︁ 2)︂
4.3. Results and Discussions
Table 5 summarizes MSE for each course shape and gait. Fig. 6 presents the estimated headings
for some test data by the proposed method on the left and the comparative methods on the
right. Fig. 7 shows the corresponding trajectories based on the actual walking speeds computed
from the ground truth trajectories. To begin with, we discuss the efects of model structure on
estimation accuracy and gait robustness on the basis of the evaluation results of the proposed
method. The CNN model was very accurate for forward walking but not stable for other
gaits. It seems overly optimized for forwards, the most common gait in the dataset. Besides,
feature timescales should difer depending on gaits, and the present CNN model may not
adequately perceive diferences in gaits. The receptive field of the third convolutional layer is
37, indicating that this model estimates headings by combining approximately 0.4 seconds of
features. Extending the receptive field by increasing the kernel strides or the number of layers
may improve accuracy for various gaits. The BiLSTM model could estimate heading correctly to
some extent for all gaits but had lower accuracy compared to other models. A possible reason is
that this model cannot suficiently grasp local features due to the absence of feature extraction
at convolutional layers.</p>
      <p>The DualCNN-LSTM model achieved high accuracy overall and estimated headings robustly
for every gait. It seems to successfully consider a wide range of features through the
diferentsized convolutional layers and the LSTM layer. The DualCNN-Transformer model achieved
high accuracy as well. Incidentally, the quantity of its weight parameters is less than half of
that in the DualCNN-LSTM model due to the lower number of channels in convolutional layers.
From these observations, we have concluded that extracting local features, in which models like
CNN excel, is crucial for boosting estimation accuracy. At the same time, extracting long-term
features, for which models like LSTM and Transformer are suitable, improves robustness to
All</p>
      <p>BiLSTM
DualCNN-Transformer
30</p>
      <p>Time (s)
10
20
40
50
60
0
10
20</p>
      <p>30
Time (s)
40
50
60
(b) Rectangle sideways
10
20
40
50
60
0
10
20
40
50
60
diferent gaits.</p>
      <p>Next, we examine each comparative method. RMPCA was satisfactory accurate for forward
walking but experienced a significant decrease in accuracy for sideways and backwards. It tends
to cause errors of approximately ± 90 (deg) for sideways and ± 180 (deg) for backwards, as
(a) Rectangle forwards
(c) Rectangle backwards
(d) Straight line</p>
      <p>30
Time (s)</p>
      <p>30
Time (s)</p>
      <p>15
Time (s)</p>
      <p>BiLSTM
DualCNN-Transformer
20
-40
20
-40
0
seen in Fig. 6(b)(c). The principal component of horizontal acceleration is seemingly associated
with the anterior-posterior direction of the body rather than the heading direction. HAM lost
accuracy in backward walking. Additionally, focusing on the cornering movements, HAM
occasionally estimated them as the opposite turns, shown around 15 seconds in Fig. 6(a) and Fig.
7(a). It is presumed that HAM cannot recognize heading changes since it does not consider the
temporal order of input data. IONet precisely tracked headings during the spans with minimal
heading changes, but the errors increased at the cornering movements, as shown in Fig. 6 and
7. This behavior is attributed to its sequentially heading estimation, accumulating errors over
time. The model tends to estimate heading displacements smaller, so increasing the proportion
of data with heading changes may improve accuracy. Moreover, IONet often failed to detect
the heading changes of approximately 180 (deg) on the straight-line courses. This problem
could be solved by training the model to output heading displacements as vectors, similar to
the proposed method.</p>
      <p>Through this experiment, we confirmed that the proposed method improved gait robustness
compared to RMPCA and HAM. We also found the DualCNN-LSTM and DualCNN-Transformer
models, which consider features with a wide range of timescales, were especially efective.
Furthermore, we observed the advantages of the proposed method over IONet not only in the
separation of orientation estimation and heading estimation and no need for initial heading but
also suitability for long-duration positioning and robustness to heading changes of 180 (deg).
However, there is room for improvement in estimation accuracy for forward walking, which
was not much diferent from the existing methods.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Summary</title>
      <p>This paper focused on heading estimation for PDR with smartphones. The existing methods
have limitations regarding gait robustness and calculation interpretability. Therefore, we
proposed an approach that inputs horizontal acceleration in GCS, calculated from acceleration,
gravitational acceleration, and rotation vector, into a deep neural network model to estimate
headings. Additionally, we created a dataset consisting of forward, sideways, and backward
walking to evaluate gait robustness. We prepared four models: CNN, BiLSTM, DualCNN-LSTM,
and DualCNN-Transformer, and evaluated the heading estimation accuracy in comparison with
RMPCA, HAM, and IONet. As a result, the proposed method outperformed in gait
robustness, especially when using the DualCNN-LSTM and DualCNN-Transformer models. Further
improvement in estimation accuracy and expansion of the dataset are future challenges.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This work is partially supported by JSPS KAKENHI (JP22K18422), NEDO (JPNP23003), NICT
(22609), and TRUSCO Nakayama Corporation.
[5] N. Kawaguchi, J. Nozaki, T. Yoshida, K. Hiroi, T. Yonezawa, K. Kaji, End-to-end
walking speed estimation method for smartphone pdr using dualcnn-lstm., in: IPIN (Short
Papers/Work-in-Progress Papers), 2019, pp. 463–470.
[6] W. Kang, Y. Han, Smartpdr: Smartphone-based pedestrian dead reckoning for indoor
localization, IEEE Sensors Journal 15 (2015) 2906–2916. doi:10.1109/JSEN.2014.2382568.
[7] Z.-A. Deng, G. Wang, Y. Hu, D. Wu, Heading estimation for indoor pedestrian navigation
using a smartphone in the pocket, Sensors 15 (2015) 21518–21536. URL: https://www.mdpi.
com/1424-8220/15/9/21518. doi:10.3390/s150921518.
[8] R. Ban, K. Kaji, K. Hiroi, N. Kawaguchi, Indoor positioning method integrating pedestrian
dead reckoning with magnetic field and wifi fingerprints, in: 2015 Eighth International
Conference on Mobile Computing and Ubiquitous Networking (ICMU), 2015, pp. 167–172.
doi:10.1109/ICMU.2015.7061061.
[9] C. Chen, X. Lu, A. Markham, N. Trigoni, Ionet: Learning to cure the curse of drift in
inertial odometry, Proceedings of the AAAI Conference on Artificial Intelligence 32 (2018).
URL: https://ojs.aaai.org/index.php/AAAI/article/view/12102. doi:10.1609/aaai.v32i1.
12102.
[10] C. Chen, P. Zhao, C. X. Lu, W. Wang, A. Markham, N. Trigoni, Deep-learning-based
pedestrian inertial navigation: Methods, data set, and on-device inference, IEEE Internet
of Things Journal 7 (2020) 4431–4441. doi:10.1109/JIOT.2020.2966773.
[11] S. Ha, J.-M. Yun, S. Choi, Multi-modal convolutional neural networks for activity
recognition, in: 2015 IEEE International Conference on Systems, Man, and Cybernetics, 2015, pp.
3017–3022. doi:10.1109/SMC.2015.525.
[12] Y. Zhao, R. Yang, G. Chevalier, X. Xu, Z. Zhang, Deep residual bidir-lstm for human activity
recognition using wearable sensors, Mathematical Problems in Engineering 2018 (2018)
7316954.
[13] Y. Shavit, I. Klein, Boosting inertial-based human activity recognition with transformers,</p>
      <p>IEEE Access 9 (2021) 53540–53547. doi:10.1109/ACCESS.2021.3070646.
[14] Q. Wang, H. Luo, L. Ye, A. Men, F. Zhao, Y. Huang, C. Ou, Pedestrian heading estimation
based on spatial transformer networks and hierarchical lstm, IEEE Access 7 (2019) 162309–
162322. doi:10.1109/ACCESS.2019.2950728.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>R.</given-names>
            <surname>Harle</surname>
          </string-name>
          ,
          <article-title>A survey of indoor inertial positioning systems for pedestrians</article-title>
          ,
          <source>IEEE Communications Surveys &amp; Tutorials</source>
          <volume>15</volume>
          (
          <year>2013</year>
          )
          <fpage>1281</fpage>
          -
          <lpage>1293</lpage>
          . doi:
          <volume>10</volume>
          .1109/SURV.
          <year>2012</year>
          .
          <volume>121912</volume>
          . 00075.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Alzantot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Youssef</surname>
          </string-name>
          , Uptime:
          <article-title>Ubiquitous pedestrian tracking using mobile phones</article-title>
          ,
          <source>in: 2012 IEEE Wireless Communications and Networking Conference (WCNC)</source>
          ,
          <year>2012</year>
          , pp.
          <fpage>3204</fpage>
          -
          <lpage>3209</lpage>
          . doi:
          <volume>10</volume>
          .1109/WCNC.
          <year>2012</year>
          .
          <volume>6214359</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>I.</given-names>
            <surname>Klein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Asraf</surname>
          </string-name>
          ,
          <article-title>Stepnet-deep learning approaches for step length estimation</article-title>
          ,
          <source>IEEE Access 8</source>
          (
          <year>2020</year>
          )
          <fpage>85706</fpage>
          -
          <lpage>85713</lpage>
          . doi:
          <volume>10</volume>
          .1109/ACCESS.
          <year>2020</year>
          .
          <volume>2993534</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T.</given-names>
            <surname>Yoshida</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Nozaki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Urano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hiroi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Yonezawa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Kawaguchi</surname>
          </string-name>
          ,
          <article-title>Gait dependency of smartphone walking speed estimation using deep learning (poster)</article-title>
          ,
          <source>in: Proceedings of the 17th Annual International Conference on Mobile Systems</source>
          , Applications, and Services, MobiSys '19,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2019</year>
          , pp.
          <fpage>641</fpage>
          -
          <lpage>642</lpage>
          . URL: https://doi.org/10.1145/3307334.3328667. doi:
          <volume>10</volume>
          .1145/3307334.3328667.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>