<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Novel Approach of Cognitive Base Station with Dynamic Spectrum Management For High-speed Rail</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Qingting Wu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yiming Wang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Zhijie Yin</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hongyu Deng</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cheng Wuy</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Urban Rail Transportation, Soochow University</institution>
          ,
          <addr-line>Suzhou</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The characteristic of fast movement in high-speed rail seriously affects the stability of vehicular wireless communication. Applying cognitive technology to individual users often brings frequent channel switch and inefficient blind learning. To address these issues this paper proposes a novel concept of Cognitive Base Station (CBS), which has the capability of forecasting spectrum holes and assigning spectrum to individuals. We then give the model of cognitive base station and evaluate the performance in our simulation platform within high-speed rail environment. The experiment results further prove that the model can significantly improve the performance of vehicular communication. Project supported by the National Nature Science Foundation of China (No. 61471252) and the Natural Science Foundation of Jiangsu Province (No. BK20130303). yCorresponding Author: cwu@suda.edu.cn</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>With the development of era, the demand for rail transit is
rapidly increasing. When travelling on train, the passengers
always hope to enjoy better communication quality and faster
data access service. European Rail Traffic Management
System (ERTMS) is a revolution in railways to guarantee the
communication, which is consist of European Train Control
System (ETCS) and a mobile-communications network
optimized for railways called GSM-R.</p>
      <p>GSM-R is the Global System for Mobile
CommunicationsRailway in the worldwide and is dedicated to provide the
bidirectional radio bearer for the train signaling systems, which
operates in a 4MHz band (876-880 MHz for uplink and
921925 MHz for downlink) [Sniady and Soler, 2012]. It is
possible to divide the authorized band into 19 channels of 200KHz
width in each GSM-R group. The rail line is covered with
GSM-R groups and each consists of many GSM-R cells. A
single GSM-R cell can use only few of the channels in a round
robin manner, because the same channel cannot be reused by
neighboring cells due to interference. Each cell is equipped
with a base station. The base station is made up of building
baseband unit (BBU) and radio remote unit (RRU). RRU is
always deployed outside along the railway and BBU is
inside. One BBU is connected to multiple RRUs. BBU and RRU
are used to process baseband signal and radio frequency
signal, respectively. To ensure the communication between RRU
and passengers, two vehicular stations (VS) are installed on
the top and final carriages of the train. The network
architecture is illustrated in Fig. 1 [isheng Zhao et al., 2013], [Tian
et al., 2012]. The GSM-R system consists of base transceiver
stations (BTS) along the railway lines and embedded
GSMR mobiles connected to antennas on the roof of the trains.
The train has to be permanently connected to the trains
control center. This connection has a high priority level, and if
the modem connection is lost, the train stops automatically
[Dudoyer et al., 2012].</p>
      <p>However, under the circumstance of high-speed railway
[Zhang et al., 2012], vehicular communication often shows
unstable, even sometime dreadful [Ai et al., 2014].
Usually, when the speed is up to 350 kilometers per hour, there
unavoidably arises some issues, such as Doppler shifts, fast
cell switching and the penetration loss [Zhou and Ai, 2014].
The Doppler shifts results from the relative motion between
a vehicle and a base station. Doppler Effect becomes another
pivotal factor degrading system performance, which
increases randomness of received signal [Liu et al., 2011], [Li and
Zhao, 2012], [Dybala and Radkowski, 2013]. The high speed
operation of the train leads to fast cell switching. As a train
moves across the footprint of the satellite beam, the
receiving signal level may vary, especially towards the edge of the
beam, which significantly impacts service rates even causing
service drops [Li et al., 2013], [Alkayal and Saada, 2013].
The fully enclosed body structure with good sealing
property of the high-speed train results in penetration loss.
Typically, the terminals inside the train connect to the base
stations along the railway tracks via wireless links, in which the
large penetration loss will directly degrade the
communication link quality and decrease the cell coverage [Zhu et al.,
2013], [Liu et al., 2012]. Furthermore, Federal
Communications Commission (FCC) released the investigation on the
usage of spectrum In 2003. It suggested that the authorized
band in 3 6GHz range is less than 0:5% utilized on
average. And so is the band below 3GHz, which is less than
35% [Commission and others, 2003]. Just based on these
viewpoints, it is necessary to introduce a novel architecture
for high-speed vehicular communication to address the issues
RRU</p>
      <p>train
R</p>
      <p>CR
CR</p>
      <p>CR</p>
      <p>R</p>
      <p>CR
CR</p>
      <p>CR
vs
from individual user’s high-speed movement along the rails
and the inefficiency in the spectrum usage.</p>
      <p>In recent years, a lot of researchers used cognitive radio
(CR) to improve the performance of wireless communication.
The basic idea of CR networks is that the unlicensed devices
(also called cognitive radio users or secondary users) need to
vacate the spectrum band once detect the licensed devices
(also known as primary users). Simon HayKin defined the CR as
an intelligent wireless communication system that is aware of
its environment and uses the methodology of
under-standingby-building to learn from the environment and adapt to
statistical variations in the input stimuli [Haykin, 2005]. Letaief
presented a cognitive space-time-frequency coding technique
that can opportunistically adjust its coding structure by
adapting itself to the dynamic spectrum environment [Letaief and
Zhang, 2009]. Soyeon Kim proposed a CR operational
algorithm for mobile cellular systems, which was applicable to the
multiple secondary user environment [Kim and Sung, 2014].
These results proved CR technology can significantly reduce
interference to licensed users, while maintaining a high
probability of successful transmissions in a cognitive radio (CR)
ad hoc network.</p>
      <p>There are few publications about applying CR to the field
of urban rail transit. Wu proposed a wireless cognitive model
for high-speed individuals’ spectrum management and show a
small performance improvement in wireless communication
[Wu et al., 2015]. Although using cognitive radio in
highspeed-railway has improved the performance, there are still
so many issues that are open to address:
(1) Most of the cognitive radio users usually sense in the
same environment and each user is independent. So they
compete each other for the spectrum resources, which
leads to blind learning and frequent conflicts.
(2) The rail transit contains a large number of CR
users. While every user sense the environment, the
system works with heavy workload and high computational
complexity.
(3) The operations of mutual competition and cooperation
between the CR users interfere with not only primary
users, but also themselves and their neighbors.
(4) Spectrum holes in each base station are different. It
would inevitably occur spectrum handoff.</p>
      <p>For addressing the above issues, we try to propose a novel
model of cognitive base station in the paper. Our proposed
CBS attempts to use the authorized bands for railway without
interrupting PUs. The CBS model should satisfy the
following conditions:
(1) The CBS can forecast spectrum holes according to its
experience and assign spectrum to individuals within its
range of coverage. In this way, the computational
complexity of the entire network can be reduced.
(2) The rail transit runs daily over a fixed route according
to its timetable. The CBS can take the advantage of
these characteristics, cooperate with each other to
forecast spectrum holes on the whole route.</p>
      <p>This paper is organized as follow. We first introduce the
concept of cognitive base station and its mathematical
model in Section 2. Section 3 then applies the novel CBS model</p>
    </sec>
    <sec id="sec-2">
      <title>Radio</title>
    </sec>
    <sec id="sec-3">
      <title>Environment</title>
    </sec>
    <sec id="sec-4">
      <title>Spectrum</title>
    </sec>
    <sec id="sec-5">
      <title>Mobility</title>
    </sec>
    <sec id="sec-6">
      <title>Spectrum</title>
    </sec>
    <sec id="sec-7">
      <title>Decision</title>
    </sec>
    <sec id="sec-8">
      <title>Spectrum</title>
    </sec>
    <sec id="sec-9">
      <title>Sharing</title>
    </sec>
    <sec id="sec-10">
      <title>Spectrum</title>
    </sec>
    <sec id="sec-11">
      <title>Sensing</title>
      <p>with RL into the scenario of high-speed rail, and propose the
cooperation mechanism of multiple CBS agents. The
experimental simulation results are given in Section 4. We conclude
this paper in Conclusion.
2</p>
      <sec id="sec-11-1">
        <title>Cognitive Base Station Model</title>
        <p>Our proposed CBS is deployed along the railway, which
works as a spectrum assigner. It learns from feedback
received through interactions with an external environment and
assigns spectrum to the passengers in the range of coverage.
We consider each CBS to be an agent, which has four
spectrum management functions: spectrum sensing, spectrum
mobility, spectrum decision and spectrum sharing [Chkirbene
and Hamdi, 2015], [Lee and Akyildiz, 2012]. Fig. 2 gives
the steps of the cognitive cycle within the framework of CBS,
which is formed by the spectrum-aware operations. Each
CBS agent uses reinforcement learning to operate spectrum
management. All of the agents can sense the environment, obtain
its own current state about spectrum usage, and communicate
with each other for the purpose of cooperation. They then
make decision according to its own state and the whole
network situation, then use spectrum mobility to choose actions.
Finally, these CBS agents continue to send its new state to the
other neighbor CBS agents.</p>
        <p>We assume that our cognitive radio network along
highspeed rail consists of a collection of CBS agents and CR user
agents. Each CBS agent has its own PUs and available
spectrums. The CBS agents undertake decisions on choosing the
spectrum independently of the CR user agents in the range.
A choice of spectrum by the CBS agent i is essentially the
choice of the frequency represented by f i 2 F . The CR
user agents continuously monitor the spectrum that the CBS
agent choose in each slot time. We assume perfect sensing,
in which, the CBS agents correctly infer the presence of the
PUs if the former lies within the PUs’ transmission range.</p>
        <p>Long-term Awareness of Spectrum Usage
Characterizing the spectrum bands based on their
activity, and in particular, learning about the utilization of the
channel is a key function of the CR users. Online
learning algorithms must be developed that allow the CBS
agents to continuously gather information about its radio
environment, and construct a utilization function. Apart
from simply classifying the spectrum as busy or
available, it is beneficial if a probability distribution of the
anticipated transmission/silent durations of the PUs can
be derived. We propose a tightly integrated
reinforcement learning equipped link layer protocol to schedule
the transmissions between CBS agents and CR user
agents over time.</p>
        <p>End-to-End Learning
Distributed networks rely on multihop forwarding of
packets between a source-destination pair. Each CBS
agent on this path learns of its own spectrum environment
over time, and this information can be leveraged at the
start and end points of the path to make optimal decisions
regarding the spectrum choices and routing options. As
an example, spectrum switching costs locally at a node
affects end-to-end delays. While spectrum
characteristics can be locally inferred, the specific choice of the
spectrum at each link to minimize intra-path switching
must be undertaken at the end points of the path. We
explore ways to share this learning and spectrum
awareness obtained by a node between its local neighbors, and
subsequently over multiple hops to the destination. The
cost of this learning and the benefits are investigated as
part of this project.
3
3.1</p>
      </sec>
      <sec id="sec-11-2">
        <title>SPECTRUM MANAGEMENT BASED</title>
      </sec>
      <sec id="sec-11-3">
        <title>COGNITIVE BASE STATION</title>
        <sec id="sec-11-3-1">
          <title>The Q-Learning</title>
          <p>Reinforcement learning, which is inspired by psychological
learning theory from biology [Waltz and Fu, 1965], enables
the agent to learn behavior through trail-and error interactions
with a dynamic environment [Sutton and Barto, 1998]. The
classical reinforcement algorithm is Q-Learning, the process
of which is as follows [Puterman, 1994]. On each step of
interaction the agent chooses an action according to the
external environment based on its current state. As a result, the
action changes the environment and receives a reward. The
agent need to develop a policy, that maximizes the long-run
measure of reinforcement.</p>
          <p>The classic reinforcement learning algorithm is
formulated as follows. At each time t, the agent perceives its current
state st 2 S and the set of possible actions Ast . The agent
chooses an action a 2 Ast and receives from the
environment a new state st+1 and a reward rt+1. Based on these
interactions, the reinforcement learning agent must develop a
policy : S ! A which maximizes the long-term reward
R = Pt rt for MDPs, where 0 1 is a
discounting factor for subsequent rewards. The long-term reward is
PU agents</p>
          <p>CBS agent</p>
          <p>PU agents</p>
          <p>PU agents</p>
          <p>PU agents</p>
          <p>CBS agent
CR
agent
CBS agent</p>
          <p>CBS agent
the expected accumulated reward that the agent expects to
receive in the future under the policy, which can be specified
by a value function. In this way, the Q-learning can
calculate an update to its expected discounted reward, Q(st; at) as
follows:</p>
          <p>Q(st; at)</p>
          <p>Q(st; at) +
where is the discount factor such that 0 &lt; 1. The agent
stores the state-action values in a table Q [Wu et al., 2010],
[Jiang et al., 2011], [Bkassiny et al., 2013].</p>
          <p>Recently the reinforcement learning has attracted
increasing interest in the machine learning and artificial intelligence
communities. Kadam etc. applied the Q-Learning into
routing data in Wireless Sensor Network scenario to route data
efficiently from one source to multiple mobile sinks [Kadam
and Srivastava, 2012]. It turned out that the algorithm can
extend the network lifetime.</p>
        </sec>
        <sec id="sec-11-3-2">
          <title>3.2 Application to Cognitive Base Station</title>
          <p>We illustrate the high-speed railway environment with CBS
agents along the way in Fig. 3 . We further model a
cognitive radio network as consisting of a set of Cognitive Base
Stations, denoted CBS, a set of primary users, denoted P U ,
and a set of available frequencies, denoted SP . We assume
that the topological structure of a given network is fixed.</p>
          <p>Spectrum holes vary due to the behavior of PUs, which
causes the change of environment. CBS agents can perceive
the states within the environment. The state of an CBS agent
is the current spectrum of its transmission. The state of the
multi-agent system includes the state of every CBS agent. We
therefore define the state of the system at time t, denoted st,
as</p>
          <p>st = (s~p)t
, where s~p is a vector of spectrums across all agents. Here
spi are the spectrum on the ith agent and spi 2 S~P .
Normally, if there are m spectrums, we can using the index
~
to specify these spectrums. In this way, we have SP =
fSP 1; SP 2; :::; SP mg.</p>
          <p>At a particular time and a particular state, the CBS will take
action according to learning results to either switch channel
or transmit. At time t we define at = k, where k is the action
that CBS chooses at time t and
k 2 fswitch to channel1; switch to channel2;
:::; switch to channelm; transmit datag:
Once the CBS agent has detected any active PU, it would
take action to channel switching. We use the Q table to
store state-action values. At time t, the state is spt and the
action is k, then we can calculate the value Q(spt; k) by the
above Q-learning formulas. If PU is detected, the CBS agent
would switch to the other available spectrum with the largest
Q-value.</p>
          <p>The reward is the estimate for spectrum usage availablity
on a CBS agent. The different network situation results in
different rewards as follows.</p>
          <p>CR-PU interference: If a PU’s activity occurs in the
spectrum shared by any CR user, and in the slot same
selected for transmission, then a high penalty of 15 is
assigned. The intuitive meaning of this is as follows: We
can avoid the collisions among the CR users using the
mediation from the CBS agents. However, the
concurrent use of the spectrum with a PU goes against the
principle of protection of the licensed devices, and hence,
must be strictly avoided.</p>
          <p>Successful Transmission: If none of the above
conditions are observed to be true in the given transmission slot,
then packet is successfully transmitted from the sender
to receiver, and a reward of +5 is assigned, which is
found experimentally to give the best results.</p>
          <p>Initial state and
reward</p>
          <p>Yes
Assign -15</p>
          <p>reward
Change state
Is PU on？</p>
          <p>No</p>
          <p>Assign +5
reward</p>
          <p>Once detected the primary user, a harsh punishment will be
given. Otherwise, a positive reward will be assigned. Fig. 4
illustrates the proposed process, and Algorithm 1 describes
our algorithm for implementing the Q-learning on CBS agent.
4
4.1</p>
        </sec>
      </sec>
      <sec id="sec-11-4">
        <title>EXPERIMENTAL SIMULATION</title>
        <sec id="sec-11-4-1">
          <title>Experimental Design</title>
          <p>In this section, we describe preliminary results from applying
our reinforcement learning based approach to the cognitive
radio model. To detect the PUs correctly is the necessary
prerequisite. The overall aim of our proposed learning based
approach is to allow the CBS agents to decide on an optimal
choice of spectrum so that (i) PUs are not affected, and (ii) CR
users share the spectrum in a fair manner. These two rules are
to simulate the public’s behaviors in Urban Rail Transit
Environment. That is, those bands that are frequently occupied
by licensed users are rarely utilized because of open areas or
relatively closed environment, and the public can
opportunistically use band resources with a same probability.</p>
          <p>Our novel CBS network simulator within the framework
of high-speed rail has been designed to investigate the effect
of the proposed reinforcement learning technique on the
network operation. The implemented ns-2 model is composed of
several modifications to the physical, link and network layers
in the form of stand-alone C++ modules. The PU Activity
Block describes the activity of PUs based on the on-off
model, including their transmission range, location, and spectrum
band of use. The Channel Block contains a channel table
with the background noise, capacity, and occupancy status.
The Spectrum Sensing Block implements the energy-based
sensing functionalities, and if a PU is detected, the Spectrum
Management Block is notified. This, in turn causes the device
to switch to the next available channel, and also alert the
upper layers of the change of frequency. The Spectrum Sharing
Block coordinates the distributed channel access, and
calculates the interference at any given node due to the ongoing
Algorithm 1 Pseudo code of Q-learning on CBS
Main()
Initialize state st and action at and their Q~ value;
repeat</p>
          <p>Q-learning(st, at, Q~ )
until all episodes are traversed
Q-with-Kanerva(st, at, Q~ )
repeat</p>
          <p>Take action st, observe reward rt, get next state st+1
Get Q(stat) from the Q-table;
for all actions a* under new state st+1 do</p>
          <p>Generate the state-action pair st+1at+1 from state
st+1 and action a*</p>
          <p>Get Q(st+1at+1) from the Q-table;
end for
= r + maxQ(st+1at+1) Q(stat)</p>
          <p>Q~ =
Q~ = Q~ + Q~
st = st+1
if random probability " then
for all actions a* under current state st do</p>
          <p>at = argmaxaQ(stat)
end for
else</p>
          <p>at = random action
end if
until st is terminal
transmissions in the network. The Cross Layer Repository
facilitates the information sharing between the different
protocol stack layers.</p>
          <p>We conduct our experiment in the following scenario: there
are 2 trains which take on 21 passengers for each and 5 CBS
agents aside the railway. The average speed of train is 10m=s.
We have 10 primary users in the range of each CBS. The
activity of primary users is based on ON-OFF model and each
primary user is assigned the spectrum randomly from 5
spectrums (small network) or10 spectrums (large network) . The
CBS agent senses the spectrum holes per 0:1 second and
assigns available spectrum to CR user agent. The simulation
parameters are summarized in Table 1.
4.2</p>
        </sec>
        <sec id="sec-11-4-2">
          <title>Experimental results</title>
          <p>We compare the performance of our CBS with reinforcement
learning (CBS-RL) scheme with the CBS with Round-Robin
scheme (CBS-RR), which is a typical way in GSM-R
system. The Round-robin (RR) scheme employs the principle
that once a spectrum is not available, the agent switches to
next channel in equal portions and in circular order, handling
all switches without priority (also known as cyclic executive).</p>
          <p>This method is simple, easy to implement, and
starvationfree. In our RL-based scheme, the exploration rate is set to
0:2, which we found experimentally to give the best results.</p>
          <p>The initial learning rate is set to 0:8, and it is decreased by
a scaling factor of 0:995 after each time slot.</p>
          <p>Figure 5(a) shows an example about the distribution of
Chan.4
Chan.3
Chan.2
Chan.1
0
0
0
0
0
14
12
s10
e
h
c
it 8
w
S 
l
e 6
n
n
a
hC4
2
0
100
100
100
100
100
200
200
200
200
200
300
300
300
300
300
400 500
The Number of Epoch
400 500
The Number of Epoch
400 500
The Number of Epoch
400 500
The Number of Epoch
400 500
The Number of Epoch
600
600
600
600
600
700
700
700
700
700
800
800
800
800
800</p>
          <p>Channel5
Channel4
Channel3
Channel2
Channel1
900
900
900
900
900
(a) An example about the distribution of spectrums occupancy on CBS with 5 spectrums.
(b) Average rewards for 5 spectrum bands</p>
          <p>(c) Average rewards for 10 spectrum bands
25
20
s
e
h
itc15
w
S 
l
e
n10
n
a
h
C
5
0
CBS-­‐RR
CBS-­‐RL
CBS-­‐RR
CBS-­‐RL
Epoch
Epoch
(d) Cumulative number of channel switching for 5 spectrum bands (e) Cumulative number of channel switching for 10 spectrum bands
spectrums occupancy on the CBS with 5 spectrums.
Spectrums occupancy on CBS follows the ON-OFF model: the
ON mode is in the normal distribution with the parameter</p>
          <p>= 25, and the OFF mode is in the exponential
distribution with the parameter . the value of which is randomly
generated.</p>
          <p>Figure 5(b) and 5(c) show the average rewards received by
CBS agent across all spectrums using the CBS-RL scheme.</p>
          <p>The result in Figure 5(b) shows that after learning over 1000
epochs, Channel 5 receives the largest positive reward of
approximately +5:5, while Channel 1, 2, 3 and 4 gets a reward
of approximately 11:8, +0:7, 5:1 and +3:3. The results
indicate that our approach pushes the CBS agents to
gradually achieve higher positive rewards and choose more suitable
spectrum for their transmission. The results also indicate that
the reward tends to be suitable to the distribution of spectrums
occupancy. A similar trend is observed in Figure 5(c), with
Channel 10 receiving the highest average reward of
approximately +5:2.</p>
          <p>Figure 5(d) and 5(e) show the cumulative number of
channel switching using CBS-RL and CBS-RR schemes. The
result in Figure 5(d) shows the average number of channel
switches for the small topology. We observe that after
learning, the CBS-RL scheme tends to decrease number of channel
switching to 5, while CBS-RR keeps the channel switches
to approximately 12. For the large topology in Figure 5(e),
the CBS-RL scheme reduces the channel switches to 6, while
CBS-RR keeps the channel switches approximately 23. The
results indicate that our proposed CBS-RL approach can keep
the channel switches lower than the CBS-RR approach and
converge to an optimal solution.
5</p>
        </sec>
      </sec>
      <sec id="sec-11-5">
        <title>CONCLUSIONS</title>
        <p>To address the issues of frequent channel switches and
inefficient blind learning in high-speed rail, we propose a novel
concept of Cognitive Base Station, which has the capability
of forecasting spectrum holes and assigning spectrum to
individuals. Our simulation results prove that after autonomous
learning, the CBS-RL scheme can forecast spectrum holes.
In this way, our proposed model can significantly improve
the performance of vehicular communication, which can
decrease cell-switching and unsuccessful transmission.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <source>[Ai</source>
          et al.,
          <year>2014</year>
          ]
          <string-name>
            <given-names>Bo</given-names>
            <surname>Ai</surname>
          </string-name>
          , Xiang Cheng, Thomas Kurner, Zhangdui Zhong, Ke Guan, Ruisi He,
          <string-name>
            <surname>Lei Xiong</surname>
          </string-name>
          , David W Matolak, David G Michelson, and
          <string-name>
            <surname>Cesar</surname>
          </string-name>
          Briso-Rodriguez.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <article-title>Challenges toward wireless communications for highspeed railway</article-title>
          .
          <source>Intelligent Transportation Systems</source>
          , IEEE Transactions on,
          <volume>15</volume>
          (
          <issue>5</issue>
          ):
          <fpage>2143</fpage>
          -
          <lpage>2158</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <source>[Alkayal and Saada</source>
          , 2013]
          <article-title>Fisal Alkayal and Johnny Bou Saada</article-title>
          .
          <article-title>Compact three phase inverter in silicon carbide technology for auxiliary converter used in railway applications</article-title>
          .
          <source>In Power Electronics and Applications (EPE)</source>
          ,
          <year>2013</year>
          15th European Conference on, pages
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          . IEEE,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [Bkassiny et al.,
          <year>2013</year>
          ]
          <string-name>
            <given-names>Mario</given-names>
            <surname>Bkassiny</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Yang</given-names>
            <surname>Li</surname>
          </string-name>
          , and
          <article-title>Sudharman K Jayaweera. A survey on machine-learning techniques in cognitive radios</article-title>
          .
          <source>Communications Surveys &amp; Tutorials</source>
          , IEEE,
          <volume>15</volume>
          (
          <issue>3</issue>
          ):
          <fpage>1136</fpage>
          -
          <lpage>1159</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <source>[Chkirbene and Hamdi</source>
          , 2015]
          <string-name>
            <given-names>Zina</given-names>
            <surname>Chkirbene</surname>
          </string-name>
          and
          <string-name>
            <given-names>Noureddine</given-names>
            <surname>Hamdi</surname>
          </string-name>
          .
          <article-title>A survey on spectrum management in cognitive radio networks</article-title>
          .
          <source>International Journal of Wireless and Mobile Computing</source>
          ,
          <volume>8</volume>
          (
          <issue>2</issue>
          ):
          <fpage>153</fpage>
          -
          <lpage>165</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <source>[Commission and others</source>
          , 2003]
          <string-name>
            <given-names>Federal</given-names>
            <surname>Communications</surname>
          </string-name>
          Commission et al.
          <article-title>Facilitating opportunities for flexible, efficient, and reliable spectrum use employing cognitive radio technologies</article-title>
          . Et docket, (
          <volume>03</volume>
          -108):
          <fpage>05</fpage>
          -
          <lpage>57</lpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [Dudoyer et al.,
          <year>2012</year>
          ]
          <string-name>
            <given-names>Stephen</given-names>
            <surname>Dudoyer</surname>
          </string-name>
          , Virginie Deniau, Ricardo Adriano, MN Ben Slimen, Jean Rioult, Benoˆıt Meyniel, and
          <string-name>
            <given-names>Marion</given-names>
            <surname>Berbineau</surname>
          </string-name>
          .
          <article-title>Study of the susceptibility of the gsm-r communications face to the electromagnetic interferences of the rail environment</article-title>
          .
          <source>Electromagnetic Compatibility</source>
          , IEEE Transactions on,
          <volume>54</volume>
          (
          <issue>3</issue>
          ):
          <fpage>667</fpage>
          -
          <lpage>676</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <source>[Dybala and Radkowski</source>
          , 2013]
          <string-name>
            <given-names>Jacek</given-names>
            <surname>Dybala</surname>
          </string-name>
          and
          <string-name>
            <given-names>Stanislaw</given-names>
            <surname>Radkowski</surname>
          </string-name>
          .
          <article-title>Reduction of doppler effect for the needs of wayside condition monitoring system of railway vehicles</article-title>
          .
          <source>Mechanical Systems and Signal Processing</source>
          ,
          <volume>38</volume>
          (
          <issue>1</issue>
          ):
          <fpage>125</fpage>
          -
          <lpage>136</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <source>[Haykin</source>
          , 2005]
          <string-name>
            <given-names>Simon</given-names>
            <surname>Haykin</surname>
          </string-name>
          .
          <article-title>Cognitive radio: brainempowered wireless communications</article-title>
          .
          <source>Selected Areas in Communications, IEEE Journal on</source>
          ,
          <volume>23</volume>
          (
          <issue>2</issue>
          ):
          <fpage>201</fpage>
          -
          <lpage>220</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <article-title>[isheng Zhao et al</article-title>
          .,
          <year>2013</year>
          ] isheng Zhao,
          <string-name>
            <given-names>Xi</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Yi</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>and Hong</given-names>
            <surname>Ji</surname>
          </string-name>
          .
          <article-title>Resource allocation for high-speed railway downlink mimo-ofdm system using quantum-behaved particle swarm optimization</article-title>
          .
          <source>In Communications (ICC)</source>
          ,
          <year>2013</year>
          IEEE International Conference on, pages
          <fpage>2343</fpage>
          -
          <lpage>2347</lpage>
          . IEEE,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [Jiang et al.,
          <year>2011</year>
          ]
          <string-name>
            <given-names>Tianzi</given-names>
            <surname>Jiang</surname>
          </string-name>
          , David Grace, and
          <string-name>
            <given-names>Paul D</given-names>
            <surname>Mitchell</surname>
          </string-name>
          .
          <article-title>Efficient exploration in reinforcement learningbased cognitive radio spectrum sharing</article-title>
          .
          <source>Communications, IET</source>
          ,
          <volume>5</volume>
          (
          <issue>10</issue>
          ):
          <fpage>1309</fpage>
          -
          <lpage>1317</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <source>[Kadam and Srivastava</source>
          , 2012]
          <string-name>
            <given-names>Kaveri</given-names>
            <surname>Kadam</surname>
          </string-name>
          and
          <string-name>
            <given-names>Navin</given-names>
            <surname>Srivastava</surname>
          </string-name>
          .
          <article-title>Application of machine learning (reinforcement learning) for routing in wireless sensor networks (wsns)</article-title>
          .
          <source>In Physics and Technology of Sensors (ISPTS)</source>
          ,
          <year>2012</year>
          1st International Symposium on, pages
          <fpage>349</fpage>
          -
          <lpage>352</lpage>
          . IEEE,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <source>[Kim and Sung</source>
          , 2014]
          <string-name>
            <given-names>Soyeon</given-names>
            <surname>Kim</surname>
          </string-name>
          and
          <string-name>
            <given-names>Wonjin</given-names>
            <surname>Sung</surname>
          </string-name>
          .
          <article-title>Operational algorithm for wireless communication systems using cognitive radio</article-title>
          .
          <source>In Communication, Networks and Satellite (COMNETSAT)</source>
          ,
          <year>2014</year>
          IEEE International Conference on, pages
          <fpage>29</fpage>
          -
          <lpage>33</lpage>
          . IEEE,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <source>[Lee and Akyildiz</source>
          , 2012]
          <article-title>Won-Yeol Lee and Ian F Akyildiz</article-title>
          .
          <article-title>Spectrum-aware mobility management in cognitive radio cellular networks</article-title>
          .
          <source>Mobile Computing</source>
          , IEEE Transactions on,
          <volume>11</volume>
          (
          <issue>4</issue>
          ):
          <fpage>529</fpage>
          -
          <lpage>542</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <source>[Letaief and Zhang</source>
          , 2009]
          <article-title>Khaled Ben Letaief and Wei Zhang. Cooperative communications for cognitive radio networks</article-title>
          .
          <source>Proceedings of the IEEE</source>
          ,
          <volume>97</volume>
          (
          <issue>5</issue>
          ):
          <fpage>878</fpage>
          -
          <lpage>893</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <source>[Li and Zhao</source>
          , 2012]
          <string-name>
            <given-names>Jinxing</given-names>
            <surname>Li</surname>
          </string-name>
          and
          <string-name>
            <given-names>Youping</given-names>
            <surname>Zhao</surname>
          </string-name>
          .
          <article-title>Radio environment map-based cognitive doppler spread compensation algorithms for high-speed rail broadband mobile communications</article-title>
          .
          <source>EURASIP Journal on Wireless Communications and Networking</source>
          ,
          <year>2012</year>
          (1):
          <fpage>1</fpage>
          -
          <lpage>18</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>[Li</surname>
          </string-name>
          et al.,
          <year>2013</year>
          ]
          <string-name>
            <given-names>Ying</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <surname>Lei</surname>
            <given-names>Lei</given-names>
          </string-name>
          , Zhangdui Zhong, and
          <string-name>
            <given-names>Siyu</given-names>
            <surname>Lin</surname>
          </string-name>
          .
          <article-title>Performance analysis for high-speed railway communication network using stochastic network calculus</article-title>
          .
          <source>In Wireless, Mobile and Multimedia Networks (ICWMMN</source>
          <year>2013</year>
          ), 5th IET International Conference on, pages
          <fpage>100</fpage>
          -
          <lpage>105</lpage>
          . IET,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [Liu et al.,
          <year>2011</year>
          ] Qiuyan Liu,
          <string-name>
            <given-names>Miao</given-names>
            <surname>Wang</surname>
          </string-name>
          , and Zhangdui Zhong.
          <article-title>Statistics of capacity analysis in high speed railway communication systems</article-title>
          .
          <source>Tamkang Journal of Science and Engineering</source>
          ,
          <volume>14</volume>
          (
          <issue>3</issue>
          ):
          <fpage>209</fpage>
          -
          <lpage>215</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [Liu et al.,
          <year>2012</year>
          ] Liu Liu, Cheng Tao, Jiahui Qiu, Houjin Chen,
          <string-name>
            <given-names>Li</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Weihui</given-names>
            <surname>Dong</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Yao</given-names>
            <surname>Yuan</surname>
          </string-name>
          .
          <article-title>Position-based modeling for wireless channel on high-speed railway under a viaduct at 2.35 ghz</article-title>
          . Selected Areas in Communications,
          <source>IEEE Journal on</source>
          ,
          <volume>30</volume>
          (
          <issue>4</issue>
          ):
          <fpage>834</fpage>
          -
          <lpage>845</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <source>[Puterman</source>
          , 1994]
          <string-name>
            <given-names>M. L.</given-names>
            <surname>Puterman</surname>
          </string-name>
          .
          <article-title>Markov decision processes</article-title>
          . In Wiley,
          <year>1994</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <source>[Sniady and Soler</source>
          , 2012]
          <string-name>
            <given-names>Aleksander</given-names>
            <surname>Sniady</surname>
          </string-name>
          and Jose? Soler.
          <article-title>An overview of gsm-r technology and its shortcomings</article-title>
          .
          <source>In ITS Telecommunications (ITST)</source>
          ,
          <year>2012</year>
          12th International Conference on, pages
          <fpage>626</fpage>
          -
          <lpage>629</lpage>
          . IEEE,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <source>[Sutton and Barto</source>
          , 1998]
          <string-name>
            <given-names>R.</given-names>
            <surname>Sutton</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Barto</surname>
          </string-name>
          .
          <source>Reinforcement Learning: An Introduction. Bradford Books</source>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [Tian et al.,
          <year>2012</year>
          ]
          <string-name>
            <given-names>Lin</given-names>
            <surname>Tian</surname>
          </string-name>
          ,
          <string-name>
            <surname>Juan Li</surname>
            ,
            <given-names>Yi</given-names>
          </string-name>
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Jinglin</given-names>
          </string-name>
          <string-name>
            <surname>Shi</surname>
            , and
            <given-names>Jihua</given-names>
          </string-name>
          <string-name>
            <surname>Zhou</surname>
          </string-name>
          .
          <article-title>Seamless dual-link handover scheme in broadband wireless communication systems for highspeed rail</article-title>
          . Selected Areas in Communications,
          <source>IEEE Journal on</source>
          ,
          <volume>30</volume>
          (
          <issue>4</issue>
          ):
          <fpage>708</fpage>
          -
          <lpage>718</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <source>[Waltz and Fu</source>
          , 1965]
          <string-name>
            <given-names>M. D.</given-names>
            <surname>Waltz</surname>
          </string-name>
          and
          <string-name>
            <given-names>K. S.</given-names>
            <surname>Fu</surname>
          </string-name>
          .
          <article-title>A heuristic approach to reinforcment learning control systems</article-title>
          .
          <source>In IEEE Transactions on Automatic Control</source>
          ,
          <volume>10</volume>
          :
          <fpage>390</fpage>
          -
          <lpage>398</lpage>
          .,
          <year>1965</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <surname>[Wu</surname>
          </string-name>
          et al.,
          <year>2010</year>
          ] Cheng Wu, Kaushik Chowdhury, Marco Di Felice, and
          <string-name>
            <given-names>Waleed</given-names>
            <surname>Meleis</surname>
          </string-name>
          .
          <article-title>Spectrum management of cognitive radio using multi-agent reinforcement learning</article-title>
          .
          <source>In Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: Industry track</source>
          , pages
          <fpage>1705</fpage>
          -
          <lpage>1712</lpage>
          . International Foundation for Autonomous Agents and
          <string-name>
            <given-names>Multiagent</given-names>
            <surname>Systems</surname>
          </string-name>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <surname>[Wu</surname>
          </string-name>
          et al.,
          <year>2015</year>
          ] Cheng Wu, Yiming Wang,
          <string-name>
            <surname>Xiang Qiang</surname>
          </string-name>
          , and Zhaoyang Zhang.
          <article-title>Adaptive spectrum management of cognitive radio in intelligent transportation system</article-title>
          .
          <source>In Applied Mechanics and Materials</source>
          , volume
          <volume>743</volume>
          , pages
          <fpage>765</fpage>
          -
          <lpage>773</lpage>
          . Trans Tech Publ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [Zhang et al.,
          <year>2012</year>
          ]
          <string-name>
            <given-names>Jiayi</given-names>
            <surname>Zhang</surname>
          </string-name>
          , Zhenhui Tan, Xiaoxi Yua,
          <string-name>
            <given-names>Haibo</given-names>
            <surname>Wang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Linwen</given-names>
            <surname>Zhang</surname>
          </string-name>
          .
          <article-title>Review of public broadband access systems for high-speed railways and key technologies</article-title>
          .
          <source>Journal of the China Railway Society</source>
          ,
          <volume>34</volume>
          (
          <issue>1</issue>
          ):
          <fpage>46</fpage>
          -
          <lpage>53</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <source>[Zhou and Ai</source>
          , 2014]
          <string-name>
            <given-names>Yuzhe</given-names>
            <surname>Zhou</surname>
          </string-name>
          and
          <string-name>
            <given-names>Bo</given-names>
            <surname>Ai</surname>
          </string-name>
          .
          <article-title>Quality of service improvement for high-speed railway communications</article-title>
          .
          <source>Communications, China</source>
          ,
          <volume>11</volume>
          (
          <issue>11</issue>
          ):
          <fpage>156</fpage>
          -
          <lpage>167</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [Zhu et al.,
          <year>2013</year>
          ]
          <string-name>
            <given-names>Xiangqian</given-names>
            <surname>Zhu</surname>
          </string-name>
          , Shanzhi Chen, Haijing Hu, Xin Su, and
          <string-name>
            <given-names>Yan</given-names>
            <surname>Shi</surname>
          </string-name>
          .
          <article-title>Tdd-based mobile communication solutions for high-speed railway scenarios</article-title>
          .
          <source>Wireless Communications</source>
          , IEEE,
          <volume>20</volume>
          (
          <issue>6</issue>
          ):
          <fpage>22</fpage>
          -
          <lpage>29</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>