<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A weighted sparse-input neural network technique applied to identify important features for vortex-induced vibration</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Leixin Ma</string-name>
          <email>leixinma@mit.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Themistocles L. Resvanis</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>J. Kim Vandiver</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Mechanical Engineering, Massachusetts Institute of Technology</institution>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Flow-induced vibration depends on a large number of parameters or features. On the one hand, the number of candidate physical features may be too big to construct an interpretable and transferrable model. On the other hand, failure to account for key dependence among features may oversimplify the model. Feature selection is found to be able to reduce the dimension of the physical problem by identifying the most important features for a certain prediction task. In this paper, a weighted sparse-input neural network (WSPINN) is proposed, where the prior physical knowledge is leveraged to constrain the neural network optimization. The effectiveness of this approach is evaluated when applied to the vortex-induced vibration of a long flexible cylinder with Reynolds number from 104 to 105. The important physical features affecting the flexible cylinders' crossflow vibration amplitude are identified.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Vortex-induced vibration (VIV) is a multi-physics problem
associated with a number of features (or variables) that
characterize either the structure or the flow individually or
their interaction. As flow passes around a cylinder, the
wake becomes unstable. The periodically shed vortices
induce unsteady forces on the cylinder which lead to VIV.
Moreover, the VIV of long cylinders in ocean currents may
vary from single mode dominated, narrow-band random
vibration to multi-mode response, characterized by
broadband random vibration. Different current profiles may
cause structural vibration with standing waves or travelling
wave patterns
        <xref ref-type="bibr" rid="ref1 ref23">(Bourguet et.al. 2011; Vandiver et.al. 2018)</xref>
        .
The complexity of the nonlinear fluid-structure interaction
process, especially for the VIV of long, flexible cylinders
in high Reynolds numbers fluid flows, precludes exact
analytical solutions and CFD simulations are not yet up to
the task.
      </p>
      <p>Copyright © 2020, for this paper by its authors. Use permitted under
Creative Commons License Attribution 4.0 International (CC BY 4.0).</p>
      <p>
        To identify the key mechanisms and the governing
dimensionless parameters behind the complicated
fluidstructure interaction process, extensive investigations have
been made through structural response measurement, flow
visualization and various force modeling techniques
        <xref ref-type="bibr" rid="ref17">(Sarpkaya 2004)</xref>
        . VIV research over the past decades has
revealed that Strouhal number, Reynolds number, mass
ratio, damping parameter etc. are all relevant VIV features
        <xref ref-type="bibr" rid="ref21 ref23 ref6 ref9">(Vandiver 1993, Govardhan and Williamson 2006,
Vandiver et.al. 2018)</xref>
        . However, if one is only interested in
predicting a certain quantity of interest, such as cylinder’s
vibration amplitude in the crossflow direction, some of
these candidate features may be redundant or unimportant.
      </p>
      <p>
        Feature selection algorithms are intended to extract the
most important features out of the full set of candidate
features with the goal of keeping prediction accuracy at a
desirable level, but with a reduced set of features. A
preanalysis of a features’ importance can be conducted by
examining the statistical correlations among the features.
However, the statistical analysis often fails to consider the
complicated interactions among the physical input
parameters (features). To solve this problem, the importance of
each feature subset can be assessed according to their
prediction accuracy using a learning machine, such as deep
neural network (DNN). Several learning machine-based
feature selection approaches have already been developed
to iteratively search the optimal feature subset that gives
similar prediction accuracy as the full feature set, but they
can be computationally expensive especially when the
number of input features becomes very large
        <xref ref-type="bibr" rid="ref7">(Guyon
2003)</xref>
        .
      </p>
      <p>
        To efficiently identify important features in a learning
machine, several regularization techniques are introduced
to the machine learning process.
        <xref ref-type="bibr" rid="ref15">Rudy et. al. (2017</xref>
        )
developed a sequential threshold ridge regression, which helped
discover governing partial differential equations of a
system from measured time series. Inspired by the
effectiveness of the group lasso regularization in linear regression,
        <xref ref-type="bibr" rid="ref4">Feng (2017)</xref>
        and
        <xref ref-type="bibr" rid="ref18">Scardapane (2017)</xref>
        developed a
sparseinput neural network by imposing group lasso
regularization on the weight groups connecting each input neuron.
The effectiveness of the approach was demonstrated
through theoretical derivations and empirical evidence.
      </p>
      <p>
        However, for physical problems, some of the system’s
properties may be known in advance or can be obtained
from the governing physical laws and dimensional analysis
        <xref ref-type="bibr" rid="ref19">(Sonin, 2001)</xref>
        . Studies have shown that incorporating the
prior physical knowledge can help build more interpretable
machine learning models
        <xref ref-type="bibr" rid="ref25">(Ye et. al 2018)</xref>
        .
      </p>
      <p>
        In this paper, the sparse-input neural network proposed
by
        <xref ref-type="bibr" rid="ref4">Feng (2017)</xref>
        and
        <xref ref-type="bibr" rid="ref18">Scardapane (2017)</xref>
        is modified to
efficiently identify the important features on top of prior
physical information. Comparison with searching all
combinations of additional features shows its effectiveness in
building compact predictive models, while maintaining prior
physical information. The method was applied to the VIV
response amplitude prediction problem at dominant
vibration frequencies. On top of the Reynolds number and
damping parameter, the in-line-cross-flow coupling and
modal participation are found to be important global VIV
features.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Weighted sparse-input neural network (WSPINN) incorporating prior physical knowledge</title>
      <p>
        We consider a fully connected DNN with P input features
x ∈ RP in the input layer and M neurons in the first hidden
layer that predict a certain target output y ∈ R1 . The weight
connecting the pth input feature and mth neuron in the first
hidden layer is denoted as wpm. Figure 1 shows an example
of the DNN with P=3 and M=4. The sparse-input neural
network
        <xref ref-type="bibr" rid="ref18 ref4">(Feng 2017; Scardapane 2017)</xref>
        aims at
accomplishing two tasks simultaneously: On the one hand, it
minimizes L ( y, yˆ ) , which is the prediction error (or loss)
between the predicted yˆ and the measured y. Meanwhile,
it tries to constrain the number of input features to the
DNN to be no greater than k. To implement this constraint,
we need to group the weights outgoing from the same input
feature together, and then limit the number of non-zero
weight groups to be no larger than k. Hence, the
mathematical expression for the optimization objective can be
expressed as,
w 0


={ p :Wp ≠ 0} ≤ k = p :


      </p>
      <p>
        M 2 
∑ ( wpm ) ≠ 0 ≤ k
m=1 
Where w 0 is the l0 norm of weight vector w. |…| is the
cardinality of the weight groups; p and m are the index for
min L ( y, yˆ )
w
subject to
(1)
the input feature and the neuron in the first hidden layer,
respectively. The magnitude of weight group for feature p
is measured by W . Since the l0 norm is non-convex and
p
non-differentiable, l1 norm, which calculates the sum of
absolute values of the vectors, is often used as a convex
proxy
        <xref ref-type="bibr" rid="ref20">(Tibshirani, 1996)</xref>
        . It can be shown geometrically
that l1 norm is the closest convex approximation for l0
norm
        <xref ref-type="bibr" rid="ref14">(Rosasco, 2010)</xref>
        . Following the convex
approximation, we obtain Equation (1),
min  ∑ L ( y(n) , yˆ(n) ) + λ ∑ ∑ ( wpm )2 
      </p>
      <p> 1 N M
w  N n =1 p∈P m =1
(2)
The second term in Equation (2) introduces bias term for
prediction. The hyperparameter λ is known as the group
lasso penalty, which adjusts the sparsity of the input
features versus the prediction accuracy. When λ grows, the
neural network will try to minimize the sum of the weight
groups, and therefore more weight groups are likely to
shrink to near 0. The input features with nonzero weight
groups are the remaining features that contribute to the
prediction. In this way, the model can be built out of fewer
input features, but the prediction accuracy may decrease
due to the loss of information.</p>
      <p>
        However, for many physical problems, some of the
features are known to be important in advance, which are
termed as prior knowledge. In this case, the objective is to
select a small number of additional features that will
complement the input features that are considered prior
knowledge and lead to predictions of acceptable accuracy.
Since the conventional sparse-input neural network cannot
tell the difference between prior knowledge and additional
features, the optimization objective in Equation (2) needs
to be modified as follows,
min L ( y, yˆ ) + spλ ∑ ∑M ( wpm )2 + saλ ∑ ∑M ( wpm )2  (3)
w  p∈Pp m =1 p∈Pa m =1
Where Pp denotes the feature set representing the prior
knowledge, and Pa denotes the set of all the additional
features. The parameters sp and sa are the weights assigned to
the prior knowledge and additional features, respectively.
These weights represent the level of confidence on the
feature’ importance for prediction
        <xref ref-type="bibr" rid="ref9">(Lian, 2018)</xref>
        . The
conventional sparse-input neural network in Equation (2) is a
special case for the weighted formulation in Equation (3),
where sa=sp=1, which assumes equal confidence for all the
features’ importance. For the prior knowledge (i.e., feature
set known to be important), we’d like to prevent the
algorithm from minimizing their weight groups to near 0, hence
sp/sa should be set close to 0.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Relevant features for long flexible cylinders subjected to vortex-induced vibrations</title>
    </sec>
    <sec id="sec-4">
      <title>Flexible cylinder VIV modeling</title>
      <p>Figure 2 is a sketch of a tensioned elastic cylinder under a
linearly sheared current profile U(z) distributed along axis
z, which causes the cylinders’ vibration in both the inline
(IL) and crossflow (CF) directions with respect to the
incoming current. The vibration of the elastic cylinder can be
approximated as a tensioned Euler-Bernoulli beam. The
equation of motion in crossflow direction and inline
direction can be expressed as,
m ( z ) ∂2 y + cs ( z ) ∂∂yt − P ( z, t ) ∂2 y + EI ( z ) ∂4 y =Fcf(4)
m ( z )
∂t 2
∂2 x
∂t 2
+ cs ( z )
∂x
∂t
∂z2
∂2 x
∂z2
∂z4
∂4 x
∂z4
− P ( z, t )
+ EI ( z )
=Fil
(5)</p>
      <p>
        Where x and y are the displacement in inline and
crossflow direction. m(z) is the cylinder’s mass per unit length,
P(z,t) is tension of the vibrating cylinder, EI(z) represents
bending stiffness. cs is the structural damping coefficient
per unit length, Fcf and Fil are the vortex induced forces on
the cylinder. The loading transfers energy from fluid to the
structure in a well-defined region with length Lin, which is
the “power-in” region. Outside this region, the vortex
loading dissipates energy by transferring energy from the
structure to the fluid through hydrodynamic damping
coefficient ch(z). The location of the “power-in” region can be
identified from structural vibration measurements in
experiments or simulation
        <xref ref-type="bibr" rid="ref12">(Rao 2015)</xref>
        . Under steady-state,
narrow-banded vibration, the total power dissipation in the
flexible pipe can be normalized to an equivalent damping
coefficient ce
        <xref ref-type="bibr" rid="ref23">(Vandiver et.al. 2018)</xref>
        .
The VIV loading is the result of nonlinear interaction
between vortex shedding and structural vibration via
complicated feedback mechanisms that depend on the
structural properties, the current profile and the structure’s motion
at every instant Hence, the parameterization for VIV force
in the “power-in” region may involve,
      </p>
      <p>Fcf = f (U ( z ) , x ( z, t ) , y ( z, t ) , ρ ,µ , L, L in , D, m ( z ) ,
(6)
(7)
(8)
cs,cf ( z ) , ch,cf ( z ) , P ( z, t ) , EI ( z ))
Fil = f (U ( z ) , x ( z, t ) , y ( z, t ) , ρ ,µ , L, L in , D, m ( z ) ,</p>
      <p>cs,il ( z ) , ch,il ( z ) , P ( z, t ) , EI ( z ))
Where ρ , µ , L and D are fluid density and dynamic
viscosity, cylinder’s length and diameter, respectively.</p>
      <p>If the spatiotemporal root-mean-square (rms) amplitude
of crossflow vibration Arms,cf in the power-in region is the
target output, then from Equations (4)-(7), the predictive
model can be expressed as,</p>
      <p>Arms,cf = f (U ( z ) , x ( z, t ) , y ( z, t ) , ρ ,µ , L, L in , D, m ( z ) ,</p>
      <p>cs,cf ( z ) , ch,cf ( z ) , P ( z, t ) , EI ( z ))</p>
      <p>It can be observed that Equations (6)-(8) involves
spatial-temporal distribution of structural response and system
properties, which will be further simplified and represented
by some global VIV features.</p>
    </sec>
    <sec id="sec-5">
      <title>Spatial-temporal analysis for typical VIV</title>
      <p>
        VIV measurements from the 2011 Shell experiments on a
38-m-long cylinder
        <xref ref-type="bibr" rid="ref10 ref11">(Lie et al 2013)</xref>
        were studied in this
investigation.
      </p>
      <p>The measured crossflow displacements in a linearly
sheared current are presented in Figure 3. The top figures
are the CF response time series at two locations within the
“power-in” region, while their corresponding wavelet
analysis is shown at the bottom. The vibration is found to
be narrow-banded with the dominant frequency  
drifting in time. Given the dominant vibration frequency and
structural properties, the corresponding wavenumber k
can be estimated by the dispersion relationship.</p>
      <p>Meanwhile, Figure 4 shows the corresponding
spatialtemporal distribution of crossflow displacement for the
same test condition. The response is nonstationary, with a
mixture of standing wave and travelling wave components.
To better capture the temporal variation of the vibration
signal, a moving window analysis is conducted. The
vibration signal is windowed into overlapping time frames over
each 3 vibration cycles, with 75% overlap.</p>
      <p>
        Complex proper orthogonal decomposition (POD) is
conducted on the crossflow displacement in each
spatialtemporal window in the “power-in” region to decompose
the displacement in each window into several orthogonal
complex modes
        <xref ref-type="bibr" rid="ref3">(Feeny 2008)</xref>
        . The ratio between the
modal energy of the dominant POD mode and the total energy
is defined as κ , which suggests the dominance of the
principal mode. Additionally, by comparing the real and
imaginary component of the dominant complex mode, the
travelling wave index α can be defined, with α = 1 for
travelling waves, and α = 0 for standing waves
        <xref ref-type="bibr" rid="ref3">(Feeny 2008)</xref>
        .
The middle and bottom of Figure 4 shows the temporal
variation of the travelling wave index and the modal
dominance factor analyzed in the power-in region, which
suggests that the VIV process is single POD mode dominated,
but the mode may vary from standing to travelling waves.
Analysis from inline vibration also shows similar
spatialtemporal distribution.
For a homogeneous, tensioned cylinder in uniform or
linearly sheared current undergoing narrow-banded VIV,
Equation (8) can be approximated by the following
relevant global quantities,
=f(Urms , ∆U , Arms,il ,ωcf ,ωil , kcf , kil ,α cf ,α il ,
      </p>
      <p>
        κ cf ,κ il , ρ ,µ , L, L in , D, m, cs , ce,cf , ce,il , P0 , P, EI )
Where Urms and ∆U / L are the spatial root-mean-square
and the shear gradient of the current profile, respectively
within the power-in region. Arms,cf and Arms,il are the
spatiotemporal rms for the crossflow and inline VIV amplitude
in the power-in region. cs and ce are respectively, the
structural damping coefficient and the equivalent rigid cylinder
damping coefficients that will lead to the same power
dissipation as discussed by
        <xref ref-type="bibr" rid="ref23">(Vandiver et.al. 2018)</xref>
        . P0 and P
are the initial tension before VIV and the mean tension
during the VIV process, respectively.
      </p>
      <p>
        Non-dimensionalizing Equation (9) gives,
Ac*f = f (Re,β , Ai*l ,Vrcf ,Vril , Lkcf ,α cf ,α il ,κ cf ,κ il ,
(9)
L / L in , L / D, m*ζ , cc*f , ci*l , P0 L2 EI
, P
( EIkc2f ))
(10)
Where Ac*f = Arms,cf / D , Ai*l = Arms,il / D are the
dimensionless crossflow and inline response amplitude, respectively.
Re = ρUrms D / µ is Reynolds number; β =D/ ( Urms )( ∆U / L)
is known as the shear parameter; Vrcf = 2π U / (ωcf D ) ,
Vril = 2π U / (ωil D ) are the crossflow and inline reduced
velocities, respectively; m ζ = 4mζ / (πρ D2 ) is known as
*
the mass damping parameter in the VIV literature, which
historically has been thought to be important in controlling
rigid cylinder’s VIV amplitude. cc*f = 2ce,cf ωcf / ρU r2ms and
ci*l = 2ce,ilωil / ρU r2ms are the dimensionless forms of the
equivalent damping parameter in the crossflow and inline
directions, respectively
        <xref ref-type="bibr" rid="ref23">(Vandiver et.al. 2018)</xref>
        .
      </p>
      <p>Although Equation (10) suggests that crossflow response
prediction in the “power-in” region may require
considering the effect of all the 17 dimensionless variables, it is
likely that the dimension of the input features can be
further reduced due to redundancy or correlation between
features or irrelevance to the prediction target. We are
interested in finding a smaller and more manageable subset
of parameters that are ultimately the most important out of
the full set when it comes to determining the CF response
amplitude. The motivation behind this is our interest to
understand what causes the CF response variability that is
observed in the temporal domain. At the very least, we
would like to start associating changes to certain
parameters with that variability that is often observed but is too
complicated to understand.</p>
    </sec>
    <sec id="sec-6">
      <title>Feature selection for flexible cylinder VIV</title>
    </sec>
    <sec id="sec-7">
      <title>Dataset description</title>
      <p>
        The dataset is from a set of experiments conducted by
Shell Oil Co. in 2011 at Marintek. The vibration of
38meter-long cylinders under various current profiles were
measured. The test matrix included two pipes with
different diameters (30-mm and 80-mm) but of the same
bending stiffness. The cylinders were tested in uniform and
linearly sheared current profiles with the maximum flow
speed, Umax, ranging from 0.5 m/s to 2.5 m/s. This resulted
in the Reynolds number Re ranging from 1.0 ×104 –
20×104. The dataset also included cases where the 80-mm
pipe was covered with strakes over 50% of its length. The
pipe tests were conducted in uniform flows with Umax
varying from 0.5 m/s to 1.5 m/s. The strakes dissipated
vibration energy and limited the power-in region to Lin = 0.5L ,
50% of the cylinder’s length. Detailed descriptions of the
experiments can be found in
        <xref ref-type="bibr" rid="ref10">Lie (2013)</xref>
        and
        <xref ref-type="bibr" rid="ref12">Rao (2015)</xref>
        .
      </p>
      <p>
        The structural damping ratio ζ in the experiment was
around 0.5%
        <xref ref-type="bibr" rid="ref23">(Vandiver et.al. 2018)</xref>
        . The cross flow
reduced velocity Vrcf varies in a narrow range from 6 to 9
and Vrcf / Vril ≈ 2 .
      </p>
    </sec>
    <sec id="sec-8">
      <title>Deep neural network setup</title>
      <p>
        The deep neural network was constructed using two hidden
layers. Each hidden layer had twenty neurons using a
sigmoid activation function. The total number of data points
was around 6000. 70% of the experimental data were used
as the training data, while the rest was used as the test data.
The input variables x were standardized to keep the
features at the same scale, while the output variables y were
normalized to values between 0 to 1. The mean absolute
percentage error (MAPE) was chosen as the loss function
between prediction and measurement L ( y, yˆ ) . The neural
network optimization was conducted via FTRL algorithm
        <xref ref-type="bibr" rid="ref11">(McMahan 2013)</xref>
        . During neural network training, the
batch size was 128 and learning rate was 0.01. The sa and
sp in Equation (3) were fixed to be 1 and 0.02, respectively.
After the optimization, we remove the input features whose
magnitude of the weight groups have shrunk to near 0 from
prediction model. In this paper, the magnitude of a weight
group is considered to be near 0 when it’s value is less than
5% of the maximum magnitude among all of the input
features.
      </p>
    </sec>
    <sec id="sec-9">
      <title>Prior physical knowledge for VIV</title>
      <p>
        Experimental studies on small spring-mounted rigid
cylinders show that the response amplitude increases with
increasing Reynolds number in the range 103 to 104 and
decreases as the dimensionless damping increases
        <xref ref-type="bibr" rid="ref22 ref6">(Govardhan &amp; Williamson 2006, Vandiver 2012)</xref>
        .
      </p>
      <p>
        Similarly, studies on long flexible cylinders have shown
that the Reynolds number and the dimensionless damping
continue to play important roles on the VIV response
amplitude
        <xref ref-type="bibr" rid="ref12 ref13">(Resvanis 2012, Rao 2015)</xref>
        but as discussed earlier,
the large number of potentially relevant parameters and the
response variability result in scatter in the data.
      </p>
      <p>Because it is known that Reynolds number Re and
dimensionless damping parameter cc*f are important, these
two parameters are designated to be used as prior
knowledge. The shear parameter β which is ideally suited
to differentiating between uniform or sheared flows was
the third parameter that was chosen as prior knowledge
before starting the feature selections process.</p>
      <p>Feature selection
knowledge
on
top
of
prior
physical
The feature selection procedure was conducted by
increasing the hyperparameter λ from 0.01 until all the input
features except the prior knowledge shrank to 0. Figure 5
demonstrates how varying the value of λ determines the
number of features chosen by the proposed algorithm. In
the figure the retained features are indicated by the
presence of a black bar at each λ value tested.</p>
      <p>The prediction error varies with the retained features in
the prediction model, which is presented in the bottom part
of Figure 5. At each number of features, a brute force
approach that searches all the possible combinations of the
additional features is also carried out. The error obtained
from the WSPINN is compared with hundreds runs of
DNN predictions using combinatorically searched features
in addition to the 3 features representing prior knowledge.
The comparison suggests that the WSPINN is able to find
the feature subsets that gives smallest prediction error
among all the feature combinations. Besides, it can be
observed that there could be multiple combinations of
features that give similar prediction accuracy. For example,
both the additional features Ai*l ,κ cf and Ai*l ,κ il gives
prediction error around 13%. This suggests the correlations and
interactions among some of the VIV features.</p>
      <p>After balancing the prediction accuracy with the sparsity
of input features, we find that the feature subset containing
5 features: Re, β , cc*f , Ai*l ,κ cf gives 13% MAPE, which is
close to 10.6% MAPE using all 17 features.</p>
      <p>We have also applied the WSPINN algorithm to other
VIV related problems, such as the prediction for rigid
cylinder’s VIV amplitude and flexible cylinders’ VIV
amplitude at higher harmonics etc. Because of space limitations
we cannot demonstrate this here. Moreover, since this
paper only studied the important parameters for VIV sheared
and uniform current profiles, the importance of the features
may be different for more complicated current profiles.</p>
    </sec>
    <sec id="sec-10">
      <title>Physical insight interpretation</title>
      <p>The importance of the identified features for flexible
cylinder VIV can be examined by systematically varying the
ranges of input features to the constructed neural network
models. Figure 6 and Figure 7 show the effect of varying
Re, cc*f while constraining the other variables in the
prediction model to characteristic values most often observed in
the Shell experiments. The black dots are the experimental
measurements within 20% from the referenced values and
are included to demonstrate that the prediction model
(contours) did in fact have data in that vicinity.</p>
      <p>The results demonstrate that increasing Reynolds
number tends to increase the spatiotemporal CF RMS
amplitude. This Reynolds number effect is obvious in the
uniform flow data which typically has small dimensionless
damping values ( cc*f &lt;0.3-0.4). While the Reynolds number
effect is virtually non-existent when looking at the sheared
flow cases with damping parameters ( cc*f &gt;0.4).</p>
      <p>
        Figure 8 shows the effect of varying Ai*l and κ cf while
constraining the other variables. It can be found that the
crossflow response tends to increase with inline response.
Such a relationship has also been observed in
springmounted rigid cylinder’s VIV experiments
        <xref ref-type="bibr" rid="ref2">(Dahl 2008)</xref>
        ,
where the fluctuating inline force increased with crossflow
motion. Finally, the prediction model suggests that as the
mode-participation factor increases so does the CF
response amplitude. Note that both standing wave and
travelling wave response can result in high mode-participation
factors and in this situation, the factor primarily
characterizes whether all points on the flexible cylinder are
responding in a similar manner (spanwise coherence).
      </p>
    </sec>
    <sec id="sec-11">
      <title>Special properties of the approach compared to other machine learning</title>
      <p>1. Direct and learning task dependent dimension reduction
in the original feature space while retaining the prior
information in the model.</p>
      <p>The WSPINN is one of the dimension reduction
approaches. However, different from widely used PCA or
auto-encoders, WSPINN seeks to reduce the dimension
directly in the original input feature space. Moreover,
through machine learning prediction, WSPINN is able to
identify most important input feature with respect to the
target output. The much smaller constraints placed on the
prior knowledge also allows the prior knowledge to retain
in the prediction model to improve prediction and also
identify additional important features.
2. High prediction accuracy due to the universal
approximation property of the DNN (Hornik, 1993)</p>
      <p>The WSPINN is a feature selection approach
embedded in DNN, which is able to predict nonlinear
inputoutput relationships accurately. For instance, for the
crossflow VIV amplitude prediction, the prediction accuracy
from the DNN and linear regression given Re, β , cc*f , A* ,κ cf
il
are 13% and 25%, respectively. However, training DNN
with WSPINN requires several rounds of iterations to
optimize the weights in each layer, hence it was found to be
more computationally expensive than most of the other
machine learning methods We consider the computational
cost acceptable since our intention is not to create a fast
predictive tool but rather to use machine learning to reduce
the dimensionality of the problem as we try to understand
the importance of each of the many governing parameters.</p>
    </sec>
    <sec id="sec-12">
      <title>Conclusion</title>
      <p>In this paper, we modify and propose changes to a
sparseinput neural network so it can efficiently select additional
features which can complement a subset of features known
to be important in advance (i.e. prior knowledge). The
algorithm was applied to the experimental results from
vortex-induced vibration of flexible cylinders. The
complicated spatiotemporal response measurements of the
continuous system are reduced to an equivalent 2 Degree of
Freedom system. The proposed algorithm is then used to
investigate the role of Reynolds number, damping parameter,
and shear parameter (3 parameters for which we have prior
knowledge), as well as 14 other parameters that the
dimensional analysis indicated might be important. The
algorithm was able to reduce the 14 additional parameters to
just 2 additional parameters on top of the prior knowledge.
We found that this feature selection technique is much
more efficient than a brute force combinatorial search.</p>
    </sec>
    <sec id="sec-13">
      <title>Acknowledgement</title>
      <p>This research has been sponsored by the members of the
SHEAR7 Joint Industry Project: BP, Chevron,
ExxonMobil, Petrobras, SBM Offshore, Shell International
Exploration and Production, Equinor, &amp; Technip USA.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Bourguet</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karniadakis</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Triantafyllou</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <year>2011</year>
          .
          <article-title>Vortexinduced vibrations of a long flexible cylinder in shear flow</article-title>
          .
          <source>Journal of Fluid Mechanics</source>
          <volume>677</volume>
          :
          <fpage>342</fpage>
          -
          <lpage>382</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Dahl</surname>
            ,
            <given-names>J. J. M.</given-names>
          </string-name>
          <year>2008</year>
          .
          <article-title>Vortex-induced vibration of a circular cylinder with combined in-line and cross-flow motion (Doctoral dissertation</article-title>
          , Massachusetts Institute of Technology).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Feeny</surname>
            ,
            <given-names>B.F.</given-names>
          </string-name>
          ,
          <year>2008</year>
          .
          <article-title>A complex orthogonal decomposition for wave motion analysis</article-title>
          .
          <source>Journal of Sound and Vibration</source>
          ,
          <volume>310</volume>
          (
          <issue>1</issue>
          -
          <fpage>2</fpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Feng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Simon</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <year>2017</year>
          .
          <article-title>Sparse-input neural networks for high-dimensional nonparametric regression and classification</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <article-title>arXiv preprint</article-title>
          .
          <source>arXiv:1711</source>
          .
          <fpage>07592</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Govardhan</surname>
            ,
            <given-names>R.N.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Williamson</surname>
            ,
            <given-names>C.H.K.</given-names>
          </string-name>
          ,
          <year>2006</year>
          .
          <article-title>Defining the 'modified Griffin plot'in vortex-induced vibration: revealing the effect of Reynolds number using controlled damping</article-title>
          .
          <source>Journal of fluid mechanics. 561</source>
          ,
          <fpage>147</fpage>
          -
          <lpage>180</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Guyon</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Elisseeff</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <year>2003</year>
          .
          <article-title>An introduction to variable and feature selection</article-title>
          .
          <source>Journal of machine learning research</source>
          ,
          <volume>3</volume>
          (Mar).
          <fpage>1157</fpage>
          -
          <lpage>1182</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Hornik</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <year>1993</year>
          .
          <article-title>Some new results on neural network approximation</article-title>
          .
          <source>Neural networks</source>
          ,
          <volume>6</volume>
          (
          <issue>8</issue>
          ),
          <fpage>1069</fpage>
          -
          <lpage>1072</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Lian</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Lau</surname>
            ,
            <given-names>V.K.</given-names>
          </string-name>
          ,
          <year>2018</year>
          .
          <article-title>Weighted LASSO for sparse recovery with statistical prior support information</article-title>
          .
          <source>IEEE Transactions on Signal Processing</source>
          ,
          <volume>66</volume>
          (
          <issue>6</issue>
          ).
          <fpage>1607</fpage>
          -
          <lpage>1618</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Lie</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          et.al.,
          <year>2013</year>
          ,
          <string-name>
            <surname>August.</surname>
          </string-name>
          <article-title>Comprehensive riser VIV model tests in uniform and sheared flow</article-title>
          .
          <source>In ASME 2012 31st International Conference on Ocean, Offshore and Arctic Engineering</source>
          .
          <fpage>923</fpage>
          -
          <lpage>930</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>McMahan</surname>
            ,
            <given-names>H.B.</given-names>
          </string-name>
          et.al.,
          <year>2013</year>
          ,
          <string-name>
            <surname>August.</surname>
          </string-name>
          <article-title>Ad click prediction: a view from the trenches</article-title>
          .
          <source>In Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining</source>
          .
          <volume>1222</volume>
          -
          <fpage>1230</fpage>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Rao</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <year>2015</year>
          .
          <article-title>The flow of power in the vortex-induced vibration of flexible cylinders</article-title>
          .
          <source>Ph. D. Dissertation</source>
          , Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, MA.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Resvanis</surname>
            ,
            <given-names>T.L.</given-names>
          </string-name>
          et.al.,
          <year>2012</year>
          ,
          <string-name>
            <surname>July.</surname>
          </string-name>
          <article-title>Reynolds number effects on the vortex-induced vibration of flexible marine risers</article-title>
          .
          <source>In ASME 2012 31st International Conference on Ocean, Offshore and Arctic Engineering</source>
          .
          <fpage>751</fpage>
          -
          <lpage>760</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Rosasco</surname>
          </string-name>
          ,
          <year>2010</year>
          .
          <article-title>Statistical Learning Theory and Applications</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Rudy</surname>
            ,
            <given-names>S.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brunton</surname>
            ,
            <given-names>S.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Proctor</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Kutz</surname>
            ,
            <given-names>J.N.</given-names>
          </string-name>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <article-title>Data-driven discovery of partial differential equations</article-title>
          .
          <source>Science Advances</source>
          ,
          <volume>3</volume>
          (
          <issue>4</issue>
          ), p.
          <fpage>e1602614</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Sarpkaya</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <year>2004</year>
          .
          <article-title>A critical review of the intrinsic nature of vortex-induced vibrations</article-title>
          .
          <source>Journal of fluids and structures</source>
          ,
          <volume>19</volume>
          (
          <issue>4</issue>
          ):
          <fpage>389</fpage>
          -
          <lpage>447</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Scardapane</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Comminiello</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hussain</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Uncini</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <year>2017</year>
          .
          <article-title>Group sparse regularization for deep neural networks</article-title>
          .
          <source>Neurocomputing</source>
          ,
          <volume>241</volume>
          :
          <fpage>81</fpage>
          -
          <lpage>89</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Sonin</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <year>2001</year>
          .
          <article-title>Dimensional analysis</article-title>
          .
          <source>Technical report</source>
          , Massachusetts Institute of Technology.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Tibshirani</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <year>1996</year>
          .
          <article-title>Regression shrinkage and selection via the lasso</article-title>
          .
          <source>Journal of the Royal Statistical Society: Series B (Methodological)</source>
          ,
          <volume>58</volume>
          (
          <issue>1</issue>
          ), pp.
          <fpage>267</fpage>
          -
          <lpage>288</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Vandiver</surname>
            ,
            <given-names>J.K.</given-names>
          </string-name>
          ,
          <year>1993</year>
          .
          <article-title>Dimensionless parameters important to the prediction of vortex-induced vibration of long, flexible cylinders in ocean currents</article-title>
          .
          <source>Journal of Fluids and Structures</source>
          ,
          <volume>7</volume>
          (
          <issue>5</issue>
          ),
          <fpage>423</fpage>
          -
          <lpage>455</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Vandiver</surname>
            ,
            <given-names>J.K.</given-names>
          </string-name>
          ,
          <year>2012</year>
          .
          <article-title>Damping parameters for flow-induced vibration</article-title>
          .
          <source>Journal of fluids and structures</source>
          ,
          <volume>35</volume>
          .
          <fpage>105</fpage>
          -
          <lpage>119</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Vandiver</surname>
            ,
            <given-names>J.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ma</surname>
          </string-name>
          , L. and
          <string-name>
            <surname>Rao</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <year>2018</year>
          .
          <article-title>Revealing the effects of damping on the flow-induced vibration of flexible cylinders</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <source>Journal of Sound and Vibration</source>
          ,
          <volume>433</volume>
          :
          <fpage>29</fpage>
          -
          <lpage>54</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <surname>Ye</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davidson</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Gupta</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <year>2018</year>
          .
          <article-title>Interpretable intuitive physics model</article-title>
          .
          <source>In Proceedings of the European Conference on Computer Vision (ECCV)</source>
          .
          <volume>87</volume>
          -
          <fpage>102</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>