<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>REM Estimation Based on Combination of Multi-Timescale Estimations and Automatic Adjustment of Personal Bio-vibration Data of Mattress Sensor</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Iko Nakari</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Naoya Matsuda</string-name>
          <email>matsuda.naoyag@cas.lab.uec.ac.jp1</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Keiki Takadama</string-name>
          <email>keiki@inf.uec.ac.jp2</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>The University of Electro-Communications 1-5-1 Chofugaoka</institution>
          ,
          <addr-line>Chofu, Tokyo</addr-line>
          ,
          <country country="JP">Japan</country>
          <addr-line>182-8585</addr-line>
        </aff>
      </contrib-group>
      <fpage>74</fpage>
      <lpage>80</lpage>
      <abstract>
        <p>This paper proposes the novel REM estimation method based on the combination of REM estimations with multitimescale logarithmic spectrums calculated from overnight bio-vibration data acquired from mattress sensor. Concretely, this paper learns each Random Forests for multiple scale spectrums, and counts the number of REM estimation in the length of the window, and estimates REM if the counted number exceeds the threshold. The threshold is automatically determined based on the REM estimation ratio to the total sleep length for each person to consider individual differences. Through the human subject experiments, the following implications have been revealed: (1) the combination of RFs learned with each scale spectrum improves the Precision and Recall of REM estimation, and Accuracy, Precision, Recall and Specificity are 80.2%, 51.4%, 47.0% and 48.5%, respectively; and (2) the automatic adjustment of the threshold can be flexibly adapted to data with large individual differences without the need to retrain the model.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        According to the survey conducted by the Ministry of
Health, Labour and Welfare, it is estimated that about one in
three Japanese adults feel sleepy during the day at least three
times a week. In addition to that, Japan has the shortest sleep
time among the OECD member countries (Organization for
Economic Cooperation and Development 2019), which
suggests that many people in Japan are sleep-deprived. The
accumulation of sleep deprivation (especially4-6 hours of
sleep) leads to a state of sleep debt. In the state of sleep
debt, the ability to think and make decisions is equivalent
to staying up all night
        <xref ref-type="bibr" rid="ref9">(Van Dongen et al. 2003)</xref>
        , and it is a
factor in the increased risk of industrial and traffic accidents.
It also decreases immune function and increases the risk of
developing lifestyle-related diseases such as depression and
dementia
        <xref ref-type="bibr" rid="ref4 ref5">(Mullington et al. 2009; Holingue et al. 2018)</xref>
        . For
individuals to stay healthy and for the government to reduce
health care costs, these sleep problems should be solved as
soon as possible.
      </p>
      <p>
        To solve these sleep problems, it is important to increase
the amount of sleep time, but many people suffer from
the problem of poor sleep quality even if they sleep for
a long time. For the facts, it is necessary to understand
sleep quality. The standard method for measuring sleep
quality (sleep stage) is to evaluate biological data acquired by
___________________________________
In T. Kido, K. Takadama (Eds.), Proceedings of the AAAI 2022 Spring Symposium
“How Fair is Fair? Achieving Wellbeing AI”, Stanford University, Palo Alto, California,
USA, March 21–23, 2022. Copyright © 2022 for this paper by its authors. Use permitted
under Creative Commons License Attribution 4.0 International (CC BY 4.0).
Polysomnography (PSG) test based on the Rechtschaffen
&amp; Kales (R&amp;K) method
        <xref ref-type="bibr" rid="ref7">(Rechtschaffen and Kales 1968)</xref>
        .
However, the PSG test is a highly restrictive method and
requires a person to attach multiple electrodes to his/her head
and body, which burdens physical and mental on his/her and
prevents obtaining data of sleep as usual. To address the
problems, the demand for sleep stage estimation methods
by simple sensors (such as mattress sensors) has increased
as an alternative to the PSG test. For example, Watanabe
developed a mattress sensor and focused on the relation
between heart rate variability and sleep stage
        <xref ref-type="bibr" rid="ref12">(Watanabe and
Watanabe 2004)</xref>
        . The accuracies of the method are reported
as follows: 42.8% in three stages (NREM/REM/WAKE)
estimation; 82.6% in NREM estimation; 70.5% in WAKE
estimation; and 38.3% in REM estimation. As the results show,
the accuracy of the method is not high, especially the
accuracy of REM sleep estimation. This is because that REM
sleep estimation is mainly based on rapid eye movements in
the R&amp;K method, and mattress sensors cannot measure eye
movements. Even though REM sleep has other
characteristics (i.e., unstable heart and respiration rate) acquired from
the mattress sensor, it is hard to estimate REM sleep because
of the following points. (1) The characteristics that appear
in REM sleep appear intermittently rather than all the time
during REM sleep. (2) The heart rate gets unstable by body
movements. (3) The heart rate is easily affected by
individual differences and daily physical condition.
      </p>
      <p>
        To tackle the problems, it is necessary to estimate REM
sleep from a new perspective, including physiological
characteristics. However, since we do not know what to
focus on, machine learning (ML) is a good way to estimate
REM sleep from a new perspective. In this study, Random
Forests
        <xref ref-type="bibr" rid="ref1">(Breiman 2001)</xref>
        is employed for the ML model
because it is easier to analyze what the model learned from the
data than deep learning (Goodfellow et al. 2016), which is
widely employed because of its high prediction accuracy. It
is essential for analyzing models easier because it leads to
the interpretability of the model in the future. However, it
cannot deal with the problem (1) mentioned before by
applying the ML to sleep stage estimation because each epoch
(30 seconds) is estimated without considering before/after
the corresponding epoch. Due to this, it is difficult to
estimate REM sleep when the characteristics do not appear at
the epoch. In addition to that, the ML is not good at learning
data with individual differences.
      </p>
      <p>To overcome these problems, this paper aims to improve
the accuracy of REM sleep estimation and proposes the
novel REM sleep estimation method with a mattress sensor
that can consider before/after the corresponding epoch and
be automatically adjusted the REM sleep estimation
threshold for each person. Concretely, this paper employs TANITA
sleep scan SL511 (Japan) as the mattress sensor for
acquiring bio-vibration data, and prepares several RFs for learning
multi-timescale logarithmic spectrums. It is combined that
the REM sleep estimations by each RF for the final output
of estimation, and the estimation sensitivity is automatically
adjusted based on the REM sleep estimation ratio out of all
epochs in an overnight sleep.</p>
      <p>This paper is organized as follows. The next section
describes the sleep mechanism especially REM sleep. Section
3 describes the related works of non-contact sleep stage
estimation and RF which is the main ML method in our
proposed method and Section 4 proposes our multiple scales
REM estimation method. The experiment is conducted in
Section 5 and the result are analyzed in Sections 6. Finally,
our conclusion is given in Section 7.</p>
    </sec>
    <sec id="sec-2">
      <title>Sleep Mechanism</title>
      <sec id="sec-2-1">
        <title>Sleep Stage</title>
        <p>The sleep stage is an indicator of the depth of sleep defined
by the R&amp;K method. The depth of sleep is classified into six
stages in each epoch (30 seconds), i.e., WAKE, REM,
NonREM1 (N1), N2, N3, and N4 (N4 is often included in N3).
The proportion of each sleep stage in healthy young adults
per night is as follows: WAKE is 1-5%; REM is 15-25%;
N1 is 5-20%; N2 is 45-75%; and N3 is 10-22% (depending
on age and physical condition on that day). In order to
determine the sleep stage, the R&amp;K method needs biological
data such as electroencephalography (EEG),
electrooculogram (EOG), and electromyogram (EMG) acquired by the
PSG test. Figure 1 shows the example of the overnight sleep
stages, where the vertical axis indicates the sleep stage and
the horizontal axis indicates the time. As shown in Figure 1,
the structure of the sleep stage in a healthy person repeats
deep sleep (N3 sleep) and shallow sleep (above N3 sleep)
alternately, and the regular sleep repeats this cycle (about
90 to 120 minutes) three to five times a night. Each cycle is
connected by about 20 to 30 minutes of REM sleep, and this
cycle is called the ultradian rhythm.</p>
      </sec>
      <sec id="sec-2-2">
        <title>Characteristics of REM Sleep</title>
        <p>The physiological characteristics of REM sleep are as
follows:
• rapid eye movement;
• decreased skeletal muscle activity;
• increased or unstable heart rate and respiratory rate;
• changes in autonomic function.</p>
        <p>In particular, REM sleep is determined by focusing on “rapid
eye movement” and “decreased skeletal muscle activity” in
the R&amp;K method. This is because that the two
characteristics are clearly expressed in the biological data acquired by
WAKE</p>
        <p>REM
NREM1
NREM2
NREM3
0
90
180
270
360 Time (min)
the PSG test and are easy to evaluate. Note that these
characteristics occur intermittently, rather than continuously.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Related Works</title>
      <sec id="sec-3-1">
        <title>Sleep Stage Estimation by mattress sensor</title>
        <p>
          Watanabe et al. tried to extract the relation between the
change in the heart rate and sleep stages through the
frequency band containing the multiple biological rhythms of a
human to build a foundation of sleep stage estimation from
heart rate variability (HRV)
          <xref ref-type="bibr" rid="ref10">(Watanabe and Watanabe 2001)</xref>
          .
They focused on two biological rhythms that the ultradian
rhythm and the circadian rhythm, which is an approximate
25 hours cycle. From their study, the relations between the
frequency of HRV and sleep stage have been revealed, and
they built a sleep stage estimation method based on the heart
rate data acquired from the air mattress sensor
          <xref ref-type="bibr" rid="ref12">(Watanabe
and Watanabe 2004)</xref>
          .
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Random Forests</title>
        <p>
          This study employs Random Forests (RF)
          <xref ref-type="bibr" rid="ref1">(Breiman 2001)</xref>
          .
The RF model repeats random sampling from training data,
randomly construct decision trees with different conditional
branches, and classify them by majority rule of those results.
In this research, Gini impurity is the splitting condition, it
becomes low when all the samples contained in each node of
the decision tree are the same. RF processing is as follows:
1. Generate bootstrapped sample (Sj ) from training data set
(S).
2. One-third of the original data is called Out-Of-Bug
(OOB), and it is used for constructing decision tree. Each
node processing is as follows:
(a) Extract mtry features randomly with not allowing
duplicate value.
(b) Choose the feature that minimizes Gini impurity, and
divide nodes.
        </p>
        <sec id="sec-3-2-1">
          <title>3. Repeat 1. to 2. Ntree times.</title>
          <p>Where Ntree is the number of decision trees to be
constructed. In the classification problem, it is recommended
to use the square root of the total number of features for the
variable mtry, which used to divide the nodes of decision
trees.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Proposed Method: Multi-Timescale REM</title>
    </sec>
    <sec id="sec-5">
      <title>Estimation</title>
      <p>The proposed method, Multi-Timescale REM estimation
starts from learning several RFs with each scale
biovibration data, then combines REM predictions by each RF,
(a) power spectrum
(b) logarithmic spectrum</p>
      <sec id="sec-5-1">
        <title>Input data</title>
        <p>
          To extract characteristics of bio-vibration data by the ML,
the frequency analysis is applied for decomposing each
vibrations (i.e., heartbeats, respiration and body movement) to
frequencies. This process is conducted as follows.
1. Applying the Fast Fourier Transform (FFT)
          <xref ref-type="bibr" rid="ref2">(Cooley and
Tukey 1965)</xref>
          to the bio-vibration data in a L-second
window to convert the data to a power spectrum (note that
the sampling frequency of the mattress sensor is 16Hz,
and data size is L 16). In this study, window size (L) is
set as next for capturing several scales of REM. L = f32,
64, 128, 256g. According to the sampling theorem
          <xref ref-type="bibr" rid="ref8">(Shannon 1949)</xref>
          , the frequency that can be analyzed by FFT
is up to 8Hz, so that the data size of power spectrum is
L 8, and the frequency resolution is 1/L Hz. Figure 2(a)
shows the example of power spectrum (L = 64)
calculated from bio-vibration data, where the vertical axis
indicates the density of power spectrum and the horizontal
axis indicates the frequency. In particular, the frequency
band between 0.1Hz and 0.3Hz is related to the
respiration, and the frequency band between 0.6Hz and 1.5Hz is
related to the heartbeats. Regarding the BM, the
larger/smaller BM, the higher/lower density of the power
spectrum. However, as shown in Figure 2(a), it is difficult to
understand the shape of the power spectrum above 1Hz
because of the high density of frequencies below 1Hz.
2. In order to make it easier to understand above 1Hz and
for RF to learn, power spectrum is converted into a
logarithmic spectrum (log10). Figure 2(b) shows the example
of the logarithmic spectrum converted from the power
spectrum of Figure 2(a), where the vertical axis indicates
        </p>
        <p>RF
32 sec.</p>
        <p>RF
64 sec.
the density (logarithmic value) of the spectrum and the
horizontal axis indicates the frequency. Furthermore, the
density of each frequency in the logarithmic spectrum is
normalized to 0, 1 based on the value of the density of
the overall frequency.
3. This logarithmic spectrum is calculated per 30-second
(stride size is 30-second) and labeled with the correct
sleep stage (REM/Not-REM) determine by R&amp;K method
for RF to learn. Figure 3 shows the example of strides
(window size is 128 seconds) and how to label sleep stage
to the spectrum. When labeling sleep stage to the
spectrum, bio-vibration data often have multiple sleep stages,
so that, in this study, the sleep stage which is labeled to
the spectrum is determined by a majority vote of the
proportions occupied by those sleep stages. Note that, when
using RF for REM prediction (not learning phase), the
logarithmic spectrum is not labeled with the correct sleep
stage, and the output of the prediction is for first epoch.
The number of the input data that can be calculated from
one subject (in case of seven hours of sleep) is about 840.</p>
      </sec>
      <sec id="sec-5-2">
        <title>REM estimation based on multiple scales spectrum</title>
        <p>Figure 4 shows the overview of the proposed
MultiTimescale REM Estimation. The flow of the method is as
follows: (1) preparing RFs for a number of scales (each
window size spectrum), and learning each scale spectrum
by each RF; (2) combining the number of REM predictions
by each RF in each window (note that, this window is
different from window size of spectrum); (3) exploring
optimal threshold for REM estimation from overnight data, and
REM sleep is detected when the number of REM predictions
in a window counted in (2) exceeds the threshold.</p>
      </sec>
      <sec id="sec-5-3">
        <title>Combining REM predictions by each RF: Our method</title>
        <p>outputs four REM predictions for each epoch from RF of 32
sec., 64 sec., 128 sec. and 256 sec., and the REM Prediction
Count (PC) is counted for each epoch as shown in top of
Figure 4(2). Since REM sleep do not occur singly (one epoch)
Algorithm 1: Exploring optimal threshold
but in clusters (successive epochs), in order to consider the
state before/after the epoch wanted to estimate, the method
prepare a window to count PC for Ne epochs before/after the
epoch, which called Windowed Prediction Count (WPC) as
shown in bottom of Figure 4(2).</p>
      </sec>
      <sec id="sec-5-4">
        <title>Automatic adjustment of REM estimation threshold:</title>
        <p>The finally output of the proposed REM estimation for an
epoch is determined by the value of WPC. If the WPC
exceeded a certain threshold, then the epoch is detected as
REM sleep, as shown in the Figure 4(3). It has a
proportional relationship between the size of the threshold and the
REM estimation ratio, which means the ratio of estimation
out of all epochs in an overnight sleep (without considering
correct and incorrect answers). According to the
physiological characteristics of sleep, the proportion of REM sleep per
overnight sleep is about 20%, so that the proposed method
explores the threshold which the REM estimation ratio is
about 20% for each person to avoid excessive or negative
estimation.</p>
        <p>The algorithm of exploring optimal threshold is described
in Algorithm 1 The algorithm counts the number of epoch
detected as REM sleep by a threshold in overnight sleep, and
calculate the REM estimation ratio, while the REM sleep
count is equal to 0 (see line 6 to 13). Then, the optimal
threshold is extracted as the previous threshold where the
REM estimation ratio exceeds 25% (see line 15 to 20).</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Experiments</title>
      <p>To investigate the effectiveness of the proposed
MultiTimescale REM Estimation, this paper conducted the
human subject experiment of the nine of healthy subjects.
The performance of the REM estimation is compared with
RFs learned with each window size of logarithmic
spectrum. The information of subjects is summarized in Table 1.
The column “ID (Age)” indicates the ID of the subject and
age of that. The columns from “WAKE” to “N34” indicate
the number of epochs in each sleep stage (WAKE, REM,
NREM1, NREM2 and NREM34), and the column “Total”
indicates the total number of epochs in one night. The
average number of epochs (30 seconds) of sleep is 664 130.
As evaluation criteria, this study employs five evaluation
indicators, Accuracy, Precision, Recall, F-measure and
Specificity of REM estimation. In addition, this study evaluates
the REM estimation ratio to see if REM estimations are
being made at an appropriate frequency.
setup
The electrodes were attached to the body and head of each
subject to acquire EEG, EOG and EMG, and mattress
sensor was placed under the mattress in the bed to acquire
biovibration data in one night. After sleep, the correct sleep
stages for each subject were determined according to the
R&amp;K method based on the data measured by PSG (helped
by medical specialist), and the bio-vibration data measured
by mattress sensor is converted to logarithmic spectrums of
several scales (i.e., window sizes are L = f32, 64, 128, 256,
512g) of which are labeled with the correct sleep stage in
each epoch.</p>
      <p>The logarithmic spectrum of each scale is learned with
different RFs from each other. The training data is
generated from the eight subjects, and the validation data is the
other subjects. The ratio of REM sleep and not-REM sleep
of training data is 1:3 because REM sleep accounts for 20%
of one night sleep and to prevent excessive REM estimation.
The data which have large BM are excluded because it
affect the shape of the spectrum and difficult to learn with RF.
The parameters of RF are set as follows: (i) the maximum
depth of decision tree is 10; (ii) the number of decision tree
is 50; (iii) the number of the features employed to construct
the decision tree is 16, 23, 32, 46 and 64 for window size 32,
64, 128, 256 and 512 respectively. The window size Ne for
counting WPC is set as 3.</p>
      <sec id="sec-6-1">
        <title>Results</title>
        <sec id="sec-6-1-1">
          <title>Type of RF</title>
          <p>32 sec. window
64 sec. window
128 sec. window
256 sec. window
512 sec. window</p>
          <p>Proposed 1
(same TH (= 2))</p>
          <p>Proposed 2
(consider individual)</p>
          <p>W3
2R
N112
N304</p>
          <p>0:00:00
columns are the evaluation indicators. Each value of
indicators are expressed as mean value of nine subjects standard
deviation value of those, and the value is a percentage. As
shown in Table 2, the Accuracy, Precision, Recall and
Fmeasure are getting high rate as window size of spectrum
getting wide, but when window size of spectrum is too wide
(i.e., 512 sec.), these evaluation indicators get worsen than
any other results. The reason why the value of Specificity in
512 seconds is most largest than any other results is that the
REM estimation ratio of the RF leaned with 512 seconds is
small (i.e., the REM estimation is passive).</p>
          <p>Focusing on the results of two proposed methods, the
values of Accuracy, Recall and F-measure outperform any other
results (i.e., single RF leaned with each window size of
spectrum). The values of Precision and Specificity are not the
best among the result but these values are close to the best.
In addition, the REM estimation ratio is larger than any other
results, and the ratio is close to the average ratio of REM
sleep per one night (about 20%). The standard deviation
of each evaluation indicators are smaller for the proposed
method 2, which considers individual differences, than for
the proposed method 1, which does not considers individual
differences, and this fact suggests the results are stable for
each subject.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Discussion</title>
      <sec id="sec-7-1">
        <title>How the combination of multiple RF results contributes?</title>
        <p>NREM34), the horizontal axis indicates the time, the blue
line indicates the correct sleep stage, and orange line, gray
line, yellow line and green line are REM estimation result of
32, 64, 128 and 256 seconds respectively. As shown in
Figure 5, the areas of the actual REM sleep marked by red
cycles tend to have a concentration of REM estimation by four
types of RF. On the other hand, the actual not REM sleep
areas tend to have few REM estimations. Based on the
proposed method, the number of REM estimations by each RF
in a window interval of 3 epochs (1.5 minutes) before/after
is shown in Figure 6. In Figure 6, the left and right
vertical axes indicate the number of windowed REM prediction
count (WPC) and sleep stage, respectively, the horizontal
axis indicates the time, and the orange and blue lines indicate
the WPC and correct sleep stage, respectively. As shown in
Figure 6, the WPC tends to be high in the areas of the actual
REM sleep, and it tends to be low in that of actual not REM
sleep. The proposed method (the combination of multiple
RF results) exploits this tendency and estimates REM sleep
when the WPC exceeds a certain threshold. The threshold
for REM estimation in the proposed method was determined
by the sensitivity analysis of REM estimation threshold as
shown in the Figure 8, where the vertical axis indicates the
percentage, horizontal axis indicates the threshold, and blue,
orange, gray, yellow, purple and green lines indicate
Accuracy, Precision, Recall, F-measure, Specificity and REM
estimation ratio, respectively. From the figure, the smaller the
threshold value, the higher the REM estimation ratio, and
the better the estimation of the actual REM sleep (as shown
in the Figure 8, Recall), while the larger the threshold value,
0:30:14
1:00:29
2
R
1N12
PC15
W
10
5
0
0:00:00
1 2
Accuracy
F-mesure
4 5
Precision
Specificity</p>
        <p>Recall</p>
        <p>REM detection ratio
3
6
7
8
9
10
the lower the REM estimation ratio and the better the
estimation of actual not-REM sleep (as shown in the Figure 8,
Specificity). In this study, the threshold value (= 2) was
chosen based on the Precision and Recall are almost equal and
F-measure is the largest. However, this threshold is
susceptible to individual differences (e.g., age and physical
condition on the day) and must be carefully determined for each
subject.</p>
      </sec>
      <sec id="sec-7-2">
        <title>Consideration of individual differences in the proposed method</title>
        <p>As mentioned above, this section discusses the importance
of the threshold setting in the proposed method with subject
“I” who have to set a significantly different threshold
compared to other subjects. Figure 7 shows the number of WPC
of subject I, where the left and right vertical axes indicate the
number of REM estimations and sleep stage, respectively,
the horizontal axis indicates the time, and the orange and
blue lines indicate the WPC and correct sleep stage,
respectively. Compared to the results of subject “E” in Figure 6, the
WPC of subject “I” is excessive, and it is not desirable to set
a threshold of the same value. To deal with the individual
differences the proposed method focused on the
physiological characteristics about sleep that REM sleep accounts for
about 20% of the total sleep in one night.</p>
        <p>Table 3 shows the thresholds and results that were
automatically adjusted for each subject so that the REM
estimation ratio to the overall sleep time is about 20%. In the
Table 3, the column “ID (Age)” indicates the subject ID and
age, the columns “TH” and “REM estimation ratio” indicate
the threshold (the value is an integer) for REM estimation
and REM estimation ratio (the value is percentage) based
on the threshold. The other columns indicate each
evaluation indicators (the value is a percentage). Each threshold
is set to a value close to 20% without the REM estimation
ratio exceeding 25%. As shown in Table 3, the thresholds
for all subjects, except subject “I”, are set between 1 and 4.
If the threshold value of 4 is given to the subject “I” like
the other subjects, the REM estimation ratio will be 86.1%
and the Accuracy, Precision, Recall, F-measure and
Specificity will be 35.9%, 25.7%, 99.4%,40.9% and 17.7%,
respectively. This situation should be avoided in real
applications, and the proposed method makes the REM estimation
with fewer wrong estimation by the condition that the keep
the REM estimation ratio around 20%.</p>
        <p>In general, to improve the accuracy of the estimation for
such data, it is necessary to retrain the model by
collecting similar data or devising new input features, which are
difficult tasks and take a long time to do. By contrast, the
proposed method does not need to do these things to
improve the accuracy, and the only thing needed to do is set
the REM estimation threshold. In addition, the threshold can
be automatically determined based on the REM estimation
ratio so that the proposed method makes it easy to adapt to
individual differences. Therefore, the differences in the
automatically determined thresholds, as shown in column TH
of Table 3, represent individual differences. Since the heart
rate is increased or unstable during REM sleep, it can be
inferred, for example, that if the value of the threshold is high,
heart rate of overnight may be higher or more unstable than
an average person.</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Conclusion</title>
      <p>This paper proposed the novel REM estimation method that
combination of multiple RF learned with different timescale
of spectrums and investigates its effectiveness through
comparison of the REM estimation by single RF learned with
each scale spectrums. Concretely, the proposed method
learns several RFs with each scale spectrums, then counts
the number of REM estimation in the length of the window
and estimates REM sleep if the counted number exceeds the
threshold. Furthermore, the threshold is automatically
determined for each person based on the REM estimation
ratio to the overall sleep time for considering individual
differences. The results of the human subject experiments, the
Accuracy, Precision, Recall and Specificity of the REM
estimation are 80.2( 5:5)%, 51.4( 15:0)%, 47.0( 15:5)% and
48.5( 13:7)%, respectively. Through experiments, the
following implications have been revealed: (1) the combination
of RFs learned with multiple window sizes spectrum
separately improves the Precision of REM estimation and Recall
of that, rather than RF learned with only a particular window
size spectrum; (2) the automatic adjustment of the threshold
based on the REM estimation ratio to the total sleep length
can be flexibly adapted to data with large individual
differences without the need to retrain the model.</p>
      <p>The future task is following: (1) to investigate the validity
of the combination of multiple RFs; (2) to validate whether
it is effective for other sleep stages.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Breiman</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <year>2001</year>
          .
          <article-title>Random forests</article-title>
          .
          <source>Machine learning</source>
          ,
          <volume>45</volume>
          (
          <issue>1</issue>
          ):
          <fpage>5</fpage>
          -
          <lpage>32</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Cooley</surname>
            ,
            <given-names>J. W.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Tukey</surname>
            ,
            <given-names>J. W.</given-names>
          </string-name>
          <year>1965</year>
          .
          <article-title>An algorithm for the machine calculation of complex Fourier series</article-title>
          .
          <source>Mathematics of computation</source>
          ,
          <volume>19</volume>
          (
          <issue>90</issue>
          ):
          <fpage>297</fpage>
          -
          <lpage>301</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          2016.
          <article-title>Deep learning</article-title>
          , volume
          <volume>1</volume>
          . MIT press Cambridge.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Holingue</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Wennberg</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Berger</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Polotsky</surname>
          </string-name>
          , V. Y.; and
          <string-name>
            <surname>Spira</surname>
            ,
            <given-names>A. P.</given-names>
          </string-name>
          <year>2018</year>
          .
          <article-title>Disturbed sleep and diabetes: A potential nexus of dementia risk</article-title>
          .
          <source>Metabolism</source>
          ,
          <volume>84</volume>
          :
          <fpage>85</fpage>
          -
          <lpage>93</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Mullington</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Haack</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Toth</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Serrador</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Meier-Ewert</surname>
            ,
            <given-names>H. K.</given-names>
          </string-name>
          <year>2009</year>
          .
          <article-title>Cardiovascular, inflammatory, and metabolic consequences of sleep deprivation</article-title>
          .
          <source>Progress in cardiovascular diseases</source>
          ,
          <volume>51</volume>
          (
          <issue>4</issue>
          ):
          <fpage>294</fpage>
          -
          <lpage>302</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>2019. GENDER</surname>
            <given-names>EQUALITY</given-names>
          </string-name>
          ,
          <article-title>Gender Data Portal</article-title>
          . https: //www.oecd.org/gender/data/.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Rechtschaffen</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Kales</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>1968</year>
          .
          <article-title>A Manual of Standardized Terminology, Techniques and Scoring System for Sleep Stages of Human Subjects</article-title>
          . Washington DC.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Shannon</surname>
            ,
            <given-names>C. E.</given-names>
          </string-name>
          <year>1949</year>
          .
          <article-title>Communication in the presence of noise</article-title>
          .
          <source>Proceedings of the IRE</source>
          ,
          <volume>37</volume>
          (
          <issue>1</issue>
          ):
          <fpage>10</fpage>
          -
          <lpage>21</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Van Dongen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ; Maislin,
          <string-name>
            <given-names>G.</given-names>
            ;
            <surname>Mullington</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            ; and
            <surname>Dinges</surname>
          </string-name>
          ,
          <string-name>
            <surname>D. F.</surname>
          </string-name>
          <year>2003</year>
          .
          <article-title>The cumulative cost of additional wakefulness: dose-response effects on neurobehavioral functions and sleep physiology from chronic sleep restriction and total sleep deprivation</article-title>
          .
          <source>Sleep</source>
          ,
          <volume>26</volume>
          (
          <issue>2</issue>
          ):
          <fpage>117</fpage>
          -
          <lpage>126</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Watanabe</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Watanabe</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <year>2001</year>
          .
          <article-title>Estimation of the sleep stages by the non-restrictive air mattress sensor relation between the change in the heart rate and sleep stages</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <source>Transactions of the Society of Instrument and Control Engineers</source>
          ,
          <volume>37</volume>
          (
          <issue>9</issue>
          ):
          <fpage>821</fpage>
          -
          <lpage>828</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Watanabe</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Watanabe</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <year>2004</year>
          .
          <article-title>Noncontact method for sleep stage estimation</article-title>
          .
          <source>IEEE Transactions on biomedical engineering</source>
          ,
          <volume>51</volume>
          (
          <issue>10</issue>
          ):
          <fpage>1735</fpage>
          -
          <lpage>1748</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>