<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>International Workshop on Data Mining and Knowledge Engineering, October</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Data Recovery Algorithms Based on the Nearest Neighbor Method for Predicting Traffic Flows</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Arthur Zhigalov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Irina Bolodurina</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lubov Zabrodina</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Orenburg State University</institution>
          ,
          <addr-line>Prospekt Pobedy, 13, Orenburg, 460018</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2020</year>
      </pub-date>
      <volume>1</volume>
      <fpage>5</fpage>
      <lpage>16</lpage>
      <abstract>
        <p>This research examines the problem of short-term forecasting of traffic flows in conditions of incomplete information. For the task of forecasting, a mathematical model has been compiled, which makes it possible to compare the main characteristics of the traffic flow with the value of the forecast - the speed of the flow. Within the framework of this work, various forecasting algorithms have been investigated, which make it possible to analyze the influence of the number of recorded observations and similar records, as well as allowing modifications to increase the forecast accuracy. Also, a data recovery approach has been developed to eliminate noisy and missing data. Experiments of short-term forecasting were carried out on an open dataset from Yandex and confirmed the importance of the stage of preprocessing and analysis of the effectiveness of recovery algorithms. After applying the proposed approach to data processing and recovery, the predicted value turned out to be more accurate on average by 2 km / h. The results of the study showed that the combined model with weighing showed a more accurate result compared to simple models, the average error was 9.81 km / h. Among the possible directions for further research, one can single out the search for several combined methods that have equivalent algorithms using neural networks.</p>
      </abstract>
      <kwd-group>
        <kwd>1 2 data recovery algorithm</kwd>
        <kwd>short-term forecasting</kwd>
        <kwd>modeling of transport networks</kwd>
        <kwd>data preprocessing</kwd>
        <kwd>nearest neighbors method</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <sec id="sec-1-1">
        <title>With the rapid growth of the number of vehicles and the progress of urbanization, the number of</title>
        <p>traffic congestions in large cities increases every year, resulting in low efficiency of transport
networks and wasted time, wasted fuel, and excessive air pollution. Also, intelligent traffic
management is a key issue, which is also an important tool to guide scientific decision making in
traffic management. Traffic forecasting is a challenging task in intelligent traffic management.</p>
      </sec>
      <sec id="sec-1-2">
        <title>The goal is to anticipate changes over time in the number of vehicles at observation points (eg intersections or stations). The time interval is usually set to 5, 15, or 30 minutes. The forecasting task is associated with the processing of current data and the construction of a forecasting model for traffic flows.</title>
        <p>Because traffic data has sharp nonlinearities resulting from transitions from free flow to failure,
and then to congested traffic, predicting traffic flow is challenging due to rapidly changing traffic
conditions. Also, processing the data before building the model can significantly affect the result of
the prediction algorithm. This is due to many factors, such as speed noise, missed travel times, traffic
accidents (data outliers), etc. Such phenomena affect the accuracy of the constructed forecast. In this
regard, the purpose of this work is to develop a data recovery algorithm for solving the problem of
short-term forecasting of traffic flow.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Related works</title>
      <sec id="sec-2-1">
        <title>The problem of predicting traffic flows has been studied in several studies both from the point of view of small road sections and for large-scale road networks.</title>
      </sec>
      <sec id="sec-2-2">
        <title>In the framework of the research conducted by Y. Han and F. Moutarde [1], the change in global</title>
        <p>congestion configurations in networks (spatial congestion configurations of the entire network) is
considered. As part of the development of a unified data mining system, the authors presented a
description of the most typical time patterns of network traffic status and a long-term forecast of
large-scale traffic dynamics. The experiments have shown the effectiveness of clustering and
forecasting based on the results of tensor factorization.</p>
      </sec>
      <sec id="sec-2-3">
        <title>In [2], A. Ladino et al. proposed a real-time forecasting technique for dynamic travel time based on merging individual forecasts obtained from clustered time series. The resulting prediction error along the horizon of future values within the cross-validation algorithm has the smallest value if there is a priori knowledge about the cluster.</title>
      </sec>
      <sec id="sec-2-4">
        <title>The authors of the study [3] developed a deep learning model for predicting traffic flows to solve the problem of non-linearity, recovery, and congestion. In some cases, when the traffic mode changes abruptly, deep learning provides fairly accurate short-term traffic forecasts. However, the results obtained are not generalized to global large-scale networks.</title>
      </sec>
      <sec id="sec-2-5">
        <title>The application of deep learning to short-term real-time traffic speed prediction is also discussed in</title>
        <p>the study [4]. The experiments have shown that the performance of the speed prediction model
increases when the missing values are taken into account and restored in the multi-view approach.</p>
      </sec>
      <sec id="sec-2-6">
        <title>Statistical problems in processing a data stream with different quality are presented in more detail in [5]. The authors of the study analyzed algorithms for converting missing or fuzzy data, estimated speed, and made a forecast of driving time on motorway networks. The results obtained can be used in conjunction with other short-term forecasting algorithms.</title>
      </sec>
      <sec id="sec-2-7">
        <title>The study [6] presents a system for predicting traffic congestion using the K-nearest neighbor</title>
        <p>algorithm. However, traffic congestion modeling is based on the example of a specific road, based on
its spatial and temporal data, and requires generalization to large-scale networks.</p>
      </sec>
      <sec id="sec-2-8">
        <title>In the framework of [7], an improved K-nearest neighbor approach is presented in conjunction with the method of ranking local minima of distances, which in most cases exceeds competing algorithms in terms of forecast accuracy.</title>
      </sec>
      <sec id="sec-2-9">
        <title>A modification of the k-nearest neighbor method is also presented in the study [8] and is based on a three-level nonparametric regression algorithm. According to experiments, the proposed algorithm has improved predictive ability, as well as significantly increased the accuracy and performance of short-term forecasting of traffic flows in real time.</title>
      </sec>
      <sec id="sec-2-10">
        <title>The authors of the study [9] conducted a comparative analysis of the average and weighted nonparametric regression model of k-means, and also evaluated the reliability of the predicted result. The experiments confirmed the feasibility of using the methods in short-term traffic flow forecasting and showed a fairly high accuracy.</title>
      </sec>
      <sec id="sec-2-11">
        <title>In [10], attention is paid to the problem of predicting the state of short-term motion based on spatio-temporal characteristics in conditions with damaged or missing data. Critical sections of the road had the strongest impact on the accuracy of the forecast, and therefore, the authors determined the need to use a spatial-temporal correlation algorithm.</title>
      </sec>
      <sec id="sec-2-12">
        <title>Thus, the research has shown that data processing before building the model can significantly affect the accuracy of the prediction algorithm. In this regard, the task of research and analysis of data recovery algorithms for solving the problem of short-term forecasting is relevant.</title>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Generalized statement of the problem of forecasting traffic flows</title>
      <p>The transport network is represented as a graph 
= ( ,  ), 

∈ 
representing road
segments and a set of vertexes</p>
      <p>representing intersections. The following values can be used as the
predicted traffic flows value on the edges of the road graph: flow density, average time of vehicle
passing along the road section. In this paper, the average speed of a vehicle passing a road section will
be used for conducting a numerical experiment. Let’s assume that data processing has been completed
where   is number of observations on the  -th edge;   
are observed time points;    ∈  + is speed
at a time</p>
      <p>
        . Taking into account the entered designations, a formal statement of the problem of
getting a forecast for a given parameter of the transport network’s traffic flows: Using the road graph
= ( ,  ) and current data on the traffic flows value in the form (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ), we need to calculate the
estimation of the traffic flows parameters for all edges of the transport network:
and there is an observation database in the form of sets of traffic flows values for all sections of the
road network in the form:
where  is forecast horizon;  ∗ is current time;  ( )is function for predicting the traffic flows value
vi on edge ui at time  ∗ +  . When building a model for short-term forecasting of traffic flow, it is


necessary to:
forecast value;
linear regression
define a model that displays the values of known observations of the traffic flows value in the
determine the parameters of the algorithm of the mathematical model of traffic flows based on
real data from the observation database.
      </p>
      <p>A prediction model based on information about nearest neighbors using

=   ,   = (   (</p>
      <p>),    ),
  ( ∗ +  ) =  ( )
{   ( ∗ +  ) ∈  +</p>
      <p>,
∀  ∗ ≥</p>
      <p>N
i=1


where  
  is the amount of traffic flow on the road   at time   . Then in matrix form the dependency</p>
      <p>When building a model for predicting traffic flows, appropriate databases are needed, which often
have data gaps and noise in practice. The missing information can be filled in when performing local
forecasting for the road network graph in sections with gaps. The idea is to use information about
traffic flow on neighboring edges to predict this value based on similar incidents in the past. The
function of restoring the value of the transport flow, depending on the values on neighboring edges,
can be represented as:
vd,t =  (vdr1,t , vdr2,t , . . . , vdrN,t ),</p>
      <p>r
where   , is speed on the road  in time  of the day  ;  is some recovery function. We will use
linear regression as a function. We will look for the type of function  as a linear dependence on the
speeds on adjacent roads:
 (vdr1,t , vdr2,t , . . . , vdrN,t )= b1vdr1,t + b2vdr2,t + . . . + bN vdrN,t + ε = X N i = ∑ biv di,t + ε,
r
where   are linear regression coefficients; ε is the approximation error function  .</p>
      <p>
        The goal is to find the coefficients   . These coefficients can be found using the least squares
method. Let there are a history of observing the road, as well as a vector of points 
=
( 1,  2, . . . ,   )where   indicates a point in time in the past. Vector of transport flow values at time
points ti on edge for which the forecast will be built we denote by   . For each edge there is a certain
number of adjacent edges  . Let there are a matrix of the form:
(
        <xref ref-type="bibr" rid="ref1">1</xref>
        )
(
        <xref ref-type="bibr" rid="ref2">2</xref>
        )
(
        <xref ref-type="bibr" rid="ref3">3</xref>
        )
(
        <xref ref-type="bibr" rid="ref4">4</xref>
        )
(
        <xref ref-type="bibr" rid="ref5">5</xref>
        )
(
        <xref ref-type="bibr" rid="ref6">6</xref>
        )
… ) ,
  =
      </p>
      <p>+  ,
looks like this:</p>
      <p>
        = (ε1, . . ., ε ).
deviations is minimal:
where  is vector of linear regression coefficients  = ( 1, . . . ,   );  is the vector of forecast errors
It is necessary to select the coefficients  = ( 1, . . . ,   ), so that the sum of the squares of

(
        <xref ref-type="bibr" rid="ref4">4</xref>
        ).
conditions:
history.
      </p>
    </sec>
    <sec id="sec-4">
      <title>5. Forecasting models</title>
      <p>where   ( )is linear combination of traffic flow values and coefficient vector at a time  .</p>
      <p>
        To find the coefficient vector we transform the matrix form (
        <xref ref-type="bibr" rid="ref7">7</xref>
        ) to a system of equations, which in
matrix form has the form
      </p>
      <sec id="sec-4-1">
        <title>1. training takes place at those time intervals</title>
        <p>=  1,  2, . . . ,   for which all values of traffic
flows values on adjacent edges are known at given time points;</p>
        <p>2. trecovery occurs at those points where the traffic flows value is unknown at the time   +1 but
on neighboring edges all traffic flows values at the predicted time are present in the observation</p>
      </sec>
      <sec id="sec-4-2">
        <title>The solution of this system of equations allows us to obtain coefficients for the linear dependence To restore using the least squares method, select sections of the road graph that meet the following (7) (8)</title>
        <p>
          (
          <xref ref-type="bibr" rid="ref9">9</xref>
          )
(11)
(12)
(13)
 , =

1
        </p>
        <p>( 1( 1)+  2( 2)+ ⋯ +   (  )).</p>
        <p>For unequal models, we will take into account their coefficients. In this method, the ensemble of
models  is defined as the weighted sum of</p>
        <p>weak models (combined model II):

=  1 1( )+  2 2( )+ ⋯ +   
 ( ).
where   is coefficient with which the weak model is included in the ensemble.</p>
        <p>When predicting the amount of traffic flows on the road  at some point in time t we can use
where   , is speed on the road  at time  of day  .
observations of this section. Then the base model I is represented in following form:
 
 ,
=</p>
        <p>(  −1, − , … ,   −1, −1, … ,   −1, +1, … ,   −1, + ,   −2, − , … ),</p>
        <p>To take into account the situation with a small number of observations we will take the average
value of the traffic flows value for all observations on the transport network and shift it in its
direction. Then the base model II is represented as follows:
 
 ,
=

∗ 

+ 
+ 
∗ 
,</p>
        <p>
          (
          <xref ref-type="bibr" rid="ref10">10</xref>
          )
is number of
where
        </p>
        <p>
          alue are traffic flows values obtained from the formula (
          <xref ref-type="bibr" rid="ref9">9</xref>
          ); 
observations used in the base model I; 
is the value of the traffic flows at a given time
 for all roads and all previous days;  is the free parameter of the model.
        </p>
        <p>To account for differences in traffic flow characteristics, it is possible take into account
observations that are most similar to the current behavior of the traffic flows at a given time. For this
purpose we enter the similarity metric expressed by the formula:
 (  ,  )= (
1
| |
∑</p>
        <p>∑
 ∈  ∈
|  , −  
| |
  , |)
−1
,
known   , and    , .
where   , is speed on the road  at time  of day  ;   - day number  in the history of observations;
 - many roads in the transport network;  is a set of moments in time for which both dimensions are</p>
      </sec>
      <sec id="sec-4-3">
        <title>Then the base model II is modified so that it takes into account the found similarity values. This model is designated as the basic model III.</title>
      </sec>
      <sec id="sec-4-4">
        <title>There are examples of constructing an ensemble of traffic flows forecasting models. Let’s build combined models. Let there be a vector of prediction models</title>
        <p>= ( 1,  2, . . . ,   ) consisting of 
weak models. Then the constructed model for predicting traffic flows will have the form (combined
model I):</p>
        <p>To find linear regression coefficients  1,  2, . . . ,   predictions  1,  2, . . . ,   , are made for one of
the previous days,  ∗ is the forecast vector of the combined model. Then the Least Absolute</p>
      </sec>
      <sec id="sec-4-5">
        <title>Deviation problem is solved:</title>
      </sec>
      <sec id="sec-4-6">
        <title>Model (14) is designated as a combined model II.</title>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>6. Simulation results</title>
      <sec id="sec-5-1">
        <title>To build the model, we used an open source of Internet data from Yandex. The source data</title>
        <p>consists of two parts: a description of the city’s street network and observation data for these streets.
The observations cover 31 days for the Moscow road graph. Intersections in Moscow correspond to
the vertices of the graph, and street segments correspond to arcs (a two-way street corresponds to two
multidirectional arcs).
of the road network graph:</p>
      </sec>
      <sec id="sec-5-2">
        <title>The data are heterogeneous, for some roads a large number of speed observations are known, for others, on the contrary, less. Figure 1 shows the distribution of the number of observations in sections (14) (15)</title>
        <p>The metric of the average deviation of the forecast values from the observed measurements can
serve as an estimation measure (15).</p>
        <p>1
 =</p>
        <p>∑ | ∗ −  ̂| ,
where  is total number of plots;  ̂ is predicted speed on the section;  ∗ is observed speed on the site.</p>
      </sec>
      <sec id="sec-5-3">
        <title>The results of evaluating algorithms using the metric (15) are shown in figure 2 before data recovery.</title>
      </sec>
      <sec id="sec-5-4">
        <title>The results of evaluating algorithms using the metric (15) are shown in figure 3 after data recovery.</title>
        <p>Based on the results obtained, the following conclusions can be made:
1. The approach with data recovery using information about the nearest neighbors allowed
increasing the number of observations, the number of data increased by about 1.5 times. Forecasts
based on the new data were more accurate.</p>
        <p>2. The basic model III makes a significant contribution to the combined model. In this case, the
coefficient  3 = 0, 92. The other models account for only 0,08.</p>
      </sec>
      <sec id="sec-5-5">
        <title>3. The model based on average observations of the road makes an average error of 12.31 km/h,</title>
        <p>the best model (combined model II) gives an average error of 9.81 km/h, i.e. the improvement is on
average 3 km/h.</p>
      </sec>
      <sec id="sec-5-6">
        <title>4. Combining prediction models by learning from different samples using sampling (combined model I) does not improve, since the models are not equivalent.</title>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>7. Conclusion</title>
      <sec id="sec-6-1">
        <title>In this study, we consider the problem of short-term forecasting in conditions of incomplete</title>
        <p>information, the purpose of which is to develop an adaptive mathematical model for predicting traffic
flows with the required accuracy. A comparative analysis and selection of the optimal TP model are
carried out, as well as algorithms for predicting the flow rate of the transport network are studied.</p>
      </sec>
      <sec id="sec-6-2">
        <title>Methods for building traffic forecasts based on previous measurements have shown that</title>
        <p>processing and clearing raw data is an important step before predicting traffic flow. So according to
the processed data, the results were more accurate by an average of 2.3 km/h.</p>
      </sec>
      <sec id="sec-6-3">
        <title>The combined model with weighing showed a more accurate result compared to simple models, with an average error of 9.81 km/h. When combining models, the average approach does not improve the model, since the presented algorithms are not equivalent.</title>
      </sec>
      <sec id="sec-6-4">
        <title>Among the possible directions for further research, we can highlight the search for several combined methods with equivalent algorithms using neural networks.</title>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <sec id="sec-7-1">
        <title>The study was carried out with the financial support of the Russian Foundation for Basic Research in the framework of scientific projects No. 20-57-53019, 20-07-01065, as well as a grant from the President of the Russian Federation for state support of leading scientific schools of the Russian Federation (NSh-2502.2020.9).</title>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Ladino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Alain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Canudas</surname>
          </string-name>
          and
          <string-name>
            <given-names>H.</given-names>
            <surname>Fourati</surname>
          </string-name>
          .
          <article-title>"A real time forecasting tool for dynamic travel time from clustered time series" Transportation research</article-title>
          . Part C, Emerging Technologies
          <volume>80</volume>
          (
          <year>2017</year>
          ):
          <fpage>216</fpage>
          -
          <lpage>238</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>H.</given-names>
            <surname>Yufei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fabien</surname>
          </string-name>
          .
          <article-title>"Analysis of Large-scale Traffic Dynamics using Non-negative Tensor Factorization" ITS World Congress (</article-title>
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Dong</surname>
          </string-name>
          .
          <article-title>"An Improved K-nearest Neighbor Model for Short-term Traffic Flow Prediction"</article-title>
          <source>Procedia - Social and Behavioral Sciences</source>
          <volume>96</volume>
          (
          <year>2013</year>
          ):
          <fpage>653</fpage>
          -
          <lpage>662</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>N.</given-names>
            <surname>Polson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Sokolov</surname>
          </string-name>
          .
          <article-title>"Deep learning for short-term traffic flow prediction" Transportation Research Part C: Emerging Technologies 79 (</article-title>
          <year>2017</year>
          ):
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>P.</given-names>
            <surname>Bickel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kwon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Rice</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Measuring</given-names>
            <surname>Traffic</surname>
          </string-name>
          .
          <source>Statist Statistical Science</source>
          <volume>4</volume>
          (
          <year>2007</year>
          ):
          <fpage>581</fpage>
          -
          <lpage>597</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>R.</given-names>
            <surname>Rishab</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Shreyas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rajashree</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Rohith</surname>
          </string-name>
          .
          <article-title>"Traffic Congestion Prediction System using K-Nearest Neighbour Algorithm"</article-title>
          <source>International Research Journal of Engineering and Technology</source>
          <volume>7</volume>
          (
          <year>2020</year>
          ):
          <fpage>2628</fpage>
          -
          <lpage>2630</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Gutha</surname>
          </string-name>
          .
          <article-title>"A deep learning approach to real-time short-term traffic speed prediction with spatialtemporal feature prediction" Electronic Theses and Dissertations (</article-title>
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>X.</given-names>
            <surname>Pang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Huang</surname>
          </string-name>
          .
          <article-title>"A Short-Term Traffic Flow Forecasting Method Based on a Three-Layer K-Nearest Neighbor Non-Parametric Regression Algorithm"</article-title>
          <source>Journal of Transportation Technologies</source>
          <volume>6</volume>
          (
          <year>2016</year>
          ):
          <fpage>200</fpage>
          -
          <lpage>206</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Gang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Yunpeng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Haiyang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Yilong</surname>
          </string-name>
          ,
          <string-name>
            <surname>X.</surname>
          </string-name>
          <article-title>Jindong "ShortTerm Traffic State Prediction Based on the Spatiotemporal Features of Critical Road"</article-title>
          <source>Sections Sensors</source>
          <volume>18</volume>
          (
          <year>2018</year>
          ):
          <fpage>2287</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Su</surname>
          </string-name>
          .
          <article-title>"Short-term traffic volume forecasting: A k-nearest neighbor approach enhanced by constrained linearly sewing principle component algorithm" Transportation Research Part C: Emerging Technologies 43 (</article-title>
          <year>2014</year>
          ):
          <fpage>143</fpage>
          -
          <lpage>157</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>