=Paper=
{{Paper
|id=Vol-2783/paper09
|storemode=property
|title=Online Parameter Estimation and Optimal Input Design
|pdfUrl=https://ceur-ws.org/Vol-2783/paper09.pdf
|volume=Vol-2783
|authors=Barzin Hosseini,Johannes Diepolder,Florian Holzapfel
|dblpUrl=https://dblp.org/rec/conf/birthday/HosseiniDH20
}}
==Online Parameter Estimation and Optimal Input Design==
Online Parameter Estimation and Optimal Input Design ? ?? Barzin Hosseini[0000−0002−5767−0115] , Johannes Diepolder[0000−0002−5962−6126] , and Florian Holzapfel[0000−0003−4733−9186] Institute of Flight System Dynamics, Technische Universität München, Garching bei München, Germany {barzin.hosseini,johannes.diepolder,florian.holzapfel}@tum.de Abstract. In flight system identification, flight test data is used to es- timate model parameters. The aircraft is excited during the flight with a set of inputs (maneuvers) to generate the data for parameter estimation. Different approaches exist to design the inputs which result in an ade- quate information content in the acquired data during the flight tests. In most cases, inputs are applied to the aircraft which are designed based on a priori knowledge about the system. In this paper we investigate a method to update the model parameters during the flight and update the inputs onboard the aircraft. This method can increase the achieved parameter accuracy and is expected to reduce the required time for the flight test program, and therefore, also the program cost. Keywords: Parameter estimation · Optimal input design · Optimal con- trol · System identification. 1 Introduction In flight vehicle system identification, a mathematical model is deducted from recorded flight data using a series of test maneuvers. If a fixed model structure is assumed, the system identification task narrows down to a parameter estimation problem. It has been shown that the type of inputs applied to an aircraft has a significant influence on the accuracy of the parameter estimates [11, 12]. In this context, it is desirable to apply inputs to the dynamic system which are expected to maximize the information content in the flight test data. As such, optimal inputs can help to reduce the flight test time while resulting in more accurate parameter estimates. This reduces the overall cost of a system identification project, specially when flight tests are expensive to perform and flight time is limited on the flight vehicle under investigation (e.g. cruise missiles). This problem can be formulated as an optimal control problem with a special cost ? This work is supported by the German Federal Ministry for Economic Affairs and Energy as part of the LuFo program (grant-ID 20Q1719D). ?? Copyright c 2020 for this paper by its authors. Use permitted under Creative Com- mons License Attribution 4.0 International (CC BY 4.0). Online parameter estimation and optimal input design 129 function. However, an initial model (with initial values for the parameters) is required to pose and solve the optimal control problem. Since determining the parameter values in the model is the primary goal of the flight tests and the subsequent parameter estimation, accurate parameter values often do not exist when designing the experiments. Different approaches have been suggested to solve this problem, such as a robust approach to design the optimal inputs, in which the uncertainty of the optimal inputs can be considered in the input design process [5]. In this paper, we investigate the application of online parameter estimation to adapt the optimal inputs during the runtime of the experiment. Besides flight testing for system identification and parameter estimation, in- direct adaptive control is also an important use case for onboard adaption of optimal input design for parameter estimation. In such cases, the controller is adjusted based on the change in the system parameters. Therefore, maneuvers are pursued which lead to more accurate parameter estimates and therefore a better performing controller. Many previous studies such as [8] have investigated the topic of onboard optimal input design in connection with indirect adaptive control. The problem discussed in this paper can be formulated as a Dual-Mode Batch-to-Batch Optimization as presented in [4]. Based on this approach, it is possible to assess our solution method and quantify the possible loss of optimal- ity. This is however beyond the scope of the present study and will be investigated in future work. 2 Problem Definition The problem in parameter estimation is to find the parameter values in a fixed model structure θ̂ = θ̃(Z, u) . (1) In the above equation, θ̂ is the vector of the parameter estimates, θ̃ is the estimator and Z and u are the measured outputs and inputs. In this paper, we limit our study to linear systems of the following form ẋ = Ax + Bu, (2) y = Cx + Du, (3) z(ti ) = y(ti ) + v(ti ), i = 1, 2, . . . , N, (4) where x is the vector of the system states, y is the output vector, u is the input vector and A, B, C and D are the system-, input-, output- and feed-through matrices. Moreover, v is a zero-mean Gaussian white noise sequence with the covariance matrix R. It can be shown that the Fisher information matrix M provides an indica- tion about the information content in the gathered flight test data during the experiment 130 B. Hosseini et al. N T X ∂y(ti ) −1 ∂y(ti ) M= R . (5) i=1 ∂θ ∂θ The inverse of the Fisher information matrix, the Dispersion matrix D = M −1 is the theoretical lower bound for the covariance matrix of the parameter estimates if an unbiased estimator is used [7, 12]. The square roots of the diag- onal elements of the Dispersion matrix D are called Cramér-Rao bounds and are the theoretical lower bounds for parameter estimate standard deviations. If an asymptotically efficient estimator is used, the Cramér-Rao bounds are an estimate for the parameter estimate standard deviations. Since the Cramér-Rao bounds and the Dispersion matrix do not depend on the measurements, an initial model and initial values for parameters can be used to design inputs that minimize the remaining uncertainty in the parameters. In this study, we choose to minimize the trace of the Dispersion matrix which is the sum of the estimate for the parameter variances under consideration of practical state and input constraints. The optimal control problem (OCP) can be written as tr M −1 minimize u subject to ẋ = f (u, x, θ) , x ∈ [xlb , xub ] , (6) u ∈ [ulb , uub ] , θ = θ0 , t ∈ [0, T ], where T is the duration of the maneuver. Since the estimation of the parameter values is the ultimate goal of the flight test campaign, no precise values of the parameter estimates are available before the flight tests. A common approach is to perform flight tests with suboptimal inputs or optimal inputs based on inaccurate parameters, record the data and perform offline parameter estimation. The new parameter values are used to de- sign new inputs which are used for subsequent flight tests. This becomes a costly iterative process. Robust optimal control methods have been used in [5] to solve this uncertain optimal control problem. However, as the aircraft is excited over time, more information becomes available about the aircraft dynamics which can then be used to estimate more accurate parameter values. The new param- eter values can be used to adjust the maneuvers onboard the aircraft. To reduce the required time between the maneuvers, we suggest estimating the parameters online while new measurements become available. 3 Solution Method The solution method applied in this study consists of an Extended Kalman Filter (EKF) for the online parameter estimation. The approaches for designing the optimal inputs are based on [6]. Online parameter estimation and optimal input design 131 3.1 Online Parameter Estimation Different methods exist for online flight vehicle parameter estimation. These methods allow to update the parameter values onboard the aircraft as new mea- surements become available rather than recording the measurements over a time period and analyzing them in batches. Online parameter estimation methods can be formulated in time and frequency domain as described in [10, 12]. In this study we use an EKF for estimating the states and parameters in realtime. The implementation in this study is based on [12]. The state vector of the dynamic system (2) is augmented with the system parameters as follows x xa = . (7) θ The dynamics of the original system states is governed by (2). Since we assume the parameters to be constant, their dynamics is governed by θ̇ = 0. (8) The governing equations for the augmented system become ẋa = Aa xa + B a u, (9) y = C a xa + Du, (10) z(ti ) = y(ti ) + v(ti ), i = 1, 2, . . . , N, (11) (12) with the initial conditions E[xa (0)] = x̄a,0 , (13) E{[xa (0) − x̄a,0 ][xa (0) − x̄a,0 ]} = P a,0 , (14) and A(θ) 0 B(θ) x̄ , C a = B(θ) 0 , x̄a,0 = 0 , Aa = , Ba = (15) 0 0 0 θ where x̄a,0 is the mean value of the augmented states at the initial time point and P a,0 is its respective covariance matrix. The dynamic system described above is not linear anymore since the new aug- mented states which include the parameters are elements of the system matrix A and are multiplied with the state vector x. Therefore, the equations governing the nonlinear system (9) can be written as 132 B. Hosseini et al. ẋa = f (xa , u) , (16) y = g(xa ) , (17) z(ti ) = y(ti ) + v(ti ), i = 1, 2, . . . , N. (18) The parameters of the system (2) are now states of the augmented sys- tem (16) and can be estimated via an EKF as described in [12]. In addition to the parameter values, an estimate for the parameter covariance matrix is also computed by the Kalman filter. 3.2 Optimal Input Design As mentioned above, optimal inputs for parameter estimation are determined by solving the optimal control problem described in (6). The cost function of the mentioned problem is non-convex and previous studies in [6] have shown that an initial guess near the optimal point is required to achieve a solution via the direct method for optimal control (using gradient based optimization). The two stage optimization method of [6] is applied to solve the optimal control problem stated in (6). First, a dynamic programming method with a relatively rough dis- cretization is applied to generate an initial guess for the optimal control and the respective system response. In a second step the direct method for optimal con- trol (see [1]) is initialized with the results of the first step and applied to find the final solution. In this case the original optimal control problem (6) is transcribed into a nonlinear programming problem and solved with a gradient based solver. We use an optimal control and parameter estimation toolbox, Falcon.m [13], de- veloped at the Institute of Flight System Dynamics of TU Munich for the second optimization stage. The details of the solution methodology and implementation for designing optimal inputs based on an initial model can be found in [6]. 3.3 Onboard Adjustment of the Inputs As stated before, the optimal inputs computed offline are based on an initial model with an initial set of parameters. As shown in Fig. 1 an optimal maneuver is designed offline with the initial values of the parameters. This maneuver is injected during the flight. An EKF updates the parameter values during the optimization as new data becomes available. After each maneuver, the gradient based optimization is repeated with the updated parameter values to adapt the optimal inputs. Thereby, the optimal input generated with the last set of parameters and the corresponding system states are used to initialize the optimization with the direct method for optimal control. Similarly, each EKF run for online parameter estimation is initialized by previous estimates for the parameters, their covariance matrix and the state parameter correlations. In the first parameter estimation run, we initialize the covariance matrix of the parameters by a diagonal matrix with a large value Online parameter estimation and optimal input design 133 Pre-Flight In-Flight Dynamic Online Parameter Onboard Initial Model Estimation Programming Maneuver Maneuver Adaption Direct Method Injection for Optimal Check Control Not Met Iteration Met End Criteria Fig. 1. Algorithmic steps of the pre-flight and in-flight maneuver optimization. (here 105 ) on its diagonal. No correlation is assumed between the parameters and the states of the system in the initial covariance matrix of the augmented states (for the first maneuver). The initial state covariance matrix is considered as a tuning parameter and should be chosen considering the accuracy of the aircraft’s navigation system and controller. 4 Numerical Results A short period model of an aircraft is used to illustrate the proposed method in this study. The linear model was featured in multiple previous studies such as [2, 6, 9, 11]. The model is described as α̇ Z 1 α Z = α + η η, (19) q̇ M α Mq q Mη where α is the angle of attack, q the pitch rate, and η the elevator deflection. The output equation is y1 (t) 1 0 α(t) = . (20) y2 (t) 0 1 q(t) The measurement equations of the system are: ym1 (i) 1 0 y1 (ti ) v1 (ti ) = + , i = 1, 2, . . . , N, (21) ym2 (i) 0 1 y2 (ti ) v2 (ti ) v1 (ti ) v(i) = . (22) v2 (ti ) In the above equation, v(ti ) is the i-th realization of a zero mean white Gaussian random process with the following covariance matrix 2.0 0.0 R= . (23) 0.0 1.0 134 B. Hosseini et al. The model parameter values are provided in Table 1. The parameters Mα , Mq , and Mη are unknown. We design the inputs such that the information con- tent with respect to these parameter is maximized (see (6)). Other parameters are considered to be known in this example. Significantly different initial pa- rameter values (provided in Table 1) have been used to design the first optimal input signal (prior to the experiment). Four parameter estimation and input de- sign iterations (described in Section 3.3) are performed during the test run-time. After each maneuver optimization, new parameter values are provided by the EKF based on virtual measurement data generated from the model described in (19)–(23). A sampling rate of 100 Hz is assumed and the run-time of each maneuver is 4 s. The discrete time step for the solution of the optimal control problem (6) with the direct method for optimal control is set to 0.05 s. We perform 4 iterations of parameter estimation and input optimization during the flight. The following input and state bounds are considered to design the inputs η ∈ [−12.5◦ , 12.5◦ ] , (24) ◦ ◦ α ∈ [−10 , 10 ] , (25) q ∈ −12◦ s−1 , 12◦ s−1 . (26) It can be seen in Fig. 2 that the structure of the optimal inputs remain roughly consistent as the parameter values get updated. This is in agreement with using the optimal inputs from the previous set of parameters as an initial guess for the maneuver optimization in the next step. Figure 3 shows the outputs of the dynamic system corresponding to the inputs of Fig. 2 at each iteration. The change of the parameters and the respective estimate for their standard deviations after each iteration are shown in Fig. 4. It can be seen that the final values of the standard deviations in Fig. 4 are consistently lower than the Cramér-Rao bounds in Table 1. This is due to the fact that the Kalman filter uses the available data in all of the five maneuvers, whereas only one maneuver run has been considered in Table 1. It is also worth mentioning that no process noise has been considered in this study. The initial state covariance matrix for each EKF run is set to 0.2 0.0 P0 = . (27) 0.0 0.1 The achieved parameter Cramér-Rao bounds are consistently lower than the values determined for the same model in previous studies such as [5, 11]. This is mostly due to the higher sampling rate in this study and the fact that only three (main) model parameters have been selected for parameter estimation and therefore also optimal input design. The results show that the proposed method in this study achieves a similar accuracy in the parameter estimates to methods that assume a more accurate initial guess for designing the optimal inputs. Online parameter estimation and optimal input design 135 Table 1. Parameter values and optimization results for the linear short period example. True Initial Final Cramér-Rao Parameter parameter parameter parameter bounds value value value Zα −0.737 −0.737 −0.737 − Mα −0.562 −0.112 −0.543 0.0244 Mq −1.588 −0.318 −1.621 0.0335 Zη 0.005 0.005 0.005 − Mη −1.66 −2.324 −1.687 0.0194 10 0 η [◦ ] Offline −10 iter 1 iter 2 iter 3 iter 4 −20 0 0.5 1 1.5 2 2.5 3 3.5 4 t [s] Fig. 2. Optimal elevator deflection. 4.1 Parameter Estimation Two parameter estimation experiments are performed to investigate the im- provement in the accuracy of the parameter estimates if the optimal inputs are adjusted during the experiment run-time, as suggested in this study. The param- eter estimation experiments are performed using a special implementation of the Maximum Likelihood estimation method in the output error form as described in [3]. The same initial guesses are set for parameters as in Table 1. The results of the parameter estimation experiment are provided in Table 2. It can be seen that the optimal input adapted during the runtime of the experiment results in a significant improvement in parameter accuracy. Figure 5 visualizes the absolute value of the correlation matrices after each of the parameter estimation experi- ments. The correlation between the parameters Mq and Mη is higher when the adjusted inputs after optimization are in use. This may be contributed by the fact that only the diagonal elements of the dispersion matrix D are explicitly considered in the cost function. 136 B. Hosseini et al. 0 α [◦ ] −5 −10 0 0.5 1 1.5 2 2.5 3 3.5 4 10 q [◦ s−1 ] 0 −10 0 0.5 1 1.5 2 2.5 3 3.5 4 t [s] Fig. 3. Response of the linear short period model to the optimal inputs with the available set of parameters at each iteration. Table 2. Estimated parameter values and their standard deviations (Std Devs) for the linear short period example. Estimated Estimated Parameter Parameter parameter parameter std devs with std devs with Parameter with the with the the pre-flight the adapted pre-flight adapted optimal input input optimal input input Mα (−0.562) −0.61 0.0947 −0.562 0.0255 Mq (−1.588) −1.585 0.0585 −1.603 0.0347 Mη (−1.66) −1.688 0.0273 −1.668 0.02 5 Conclusion and Outlook In this study we show that adaption of optimal inputs for parameter estimation during the experiment run-time significantly improves the results of a subse- quent parameter estimation. However, this is a very complex process and can be difficult to successfully implement and apply to a real aircraft. First and fore- most, the application of the proposed method in real-world systems is limited by the existence of process noise and deficiencies regarding the model structure, such as unmodeled nonlinearities. Among the implementation aspects, the low computing power and memory capabilities of common flight control computers can be considered challenges associated with this approach. Furthermore, since the parameter values used to design the optimal inputs are not accurate, state bounds are not guaranteed to be satisfied during the flight. Online parameter estimation and optimal input design 137 0 Estimated Parameter 0.2 Parameter −0.2 True Parameter Value 0.15 Standard Deviation σ(Mα ) Mα 0.1 −0.4 0.05 −0.6 0 0 1 2 3 4 5 1 2 3 4 5 −2 ·10 8 −0.5 6 σ(Mq ) Mq −1 4 2 −1.5 0 0 1 2 3 4 5 1 2 3 4 5 −2 ·10 4 −1.6 −1.8 3 σ(Mη ) Mη −2 2 −2.2 1 0 1 2 3 4 5 1 2 3 4 5 iter iter Fig. 4. Parameter values and their standard deviations at each iteration. (a) Offline Optimization (b) Onboard Optimization Fig. 5. Absolute value of the correlation matrices. 138 B. Hosseini et al. To address this issue, more conservative state bounds can be considered for the first maneuvers and the bounds can be expanded as more accurate parameter estimates become available. The additional efforts required to implement this method can however be advantageous in use cases where flight test time is very expensive and no good initial guess for the system model exists. In the case of linear system identifica- tion, changing the trim point of the aircraft during the flight also changes the values of the linear system’s matrix coefficients. This method can be applied to adapt the optimal inputs of the aircraft to the new trim point during the flight. It might be possible, especially in the case of nonlinear systems, that signifi- cantly different parameter values result in significantly different maneuvers and therefore cause convergence problems in the short time between two maneuvers. This can be solved by updating the maneuvers online, as new parameter values become available. Online methods for optimal input design are currently being investigated by authors. Furthermore, it is planned to test the algorithm and its capabilities in a real-world experiment using an unmanned aircraft. Utiliza- tion of other online parameter estimation algorithms (e.g. the frequency domain method as discussed in [10]) will also be investigated in future work. 6 Acknowledgement This work is supported by the German Federal Ministry for Economic Affairs and Energy as part of the LuFo program (grant-ID 20Q1719D). References 1. Betts, J.T.: Practical methods for optimal control and estimation using nonlinear programming. The Boeing Company, Seattle, Washington (2010) 2. Chen, R.T.N.: Input design for aircraft parameter identification: using time optimal control formulation. AGARD CP-172 (1974) 3. Göttlicher, C.: System identification for aerial applications using optimal control methods. Ph.D. thesis, Chair of Flight System Dynamics, Technical University of Munich (2018) Online parameter estimation and optimal input design 139 4. Gros, S.: Dual-mode batch-to-batch optimization as a Markov decision pro- cess. Industrial & Engineering Chemistry Research 58(30), 13780–13791 (2019). https://doi.org/10.1021/acs.iecr.8b06471 5. Hosseini, B., Botkin, N., Diepolder, J., Holzapfel, F.: Robust optimal in- put design for flight vehicle system identification. In: AIAA SciTech, Or- lando. American Institute of Aeronatics and Astronautics, p. 0290 (2020). https://doi.org/10.2514/6.2020-0290 6. Hosseini, B., Göttlicher, C., Holzapfel, F.: Optimal input design for flight vehicle system identification via dynamic programming and the direct method for optimal control. In: AIAA SciTech Forum, San Diego. American Institute of Aeronatics and Astronautics, p. 1319 (2019). https://doi.org/10.2514/6.2019-1319 7. Jategaonkar, R.V.: Flight vehicle system identification: a time domain method- ology. American Institute of Aeronautics and Astronautics, Inc., Braunschweig (2006) 8. Królikowski, A.: Application of input-signal design in system identification for adaptive control. International Journal of Systems Science 17(2), 305–318 (1986). https://doi.org/10.1080/00207728608926805 9. Mehra, R.K.: Optimal inputs for linear system identification. IEEE Transactions on Automatic Control AC-19(3), 192–200 (1974). https://doi.org/10.1109/TAC.1974.1100554 10. Morelli, E.: Autonomous real-time global aerodynamic modeling in the frequency domain. In: AIAA SciTech, Orlando. American Institute of Aeronatics and Astro- nautics, p. 0761 (2020). https://doi.org/10.2514/6.2020-0761 11. Morelli, E.A.: Practical input optimization for aircraft parameter estimation ex- periments. ScD Dissertation, The George Washington Univeristy, Joint Institute for Advancement of Flight Sciences, Hampton, VA (1990) 12. Morelli, E.A., Klein, V.: Aircraft system identification: theory and practice, 2nd edition. Sunflyte Enterprises, Williamsburg, Virginia (2016) 13. Rieck, M., Bittner, M., Grüter, B., Diepolder, J., Piprek, P.: Falcon.m user guide (2018), www.falcon-m.com