Abstract

This paper considers the problem of optimizing a missile autopilot. In particular, the paper investigates the application of an online learning technique to learn and optimize the gains of a three-loop topology autopilot for a planar missile modeled with nonlinear dynamics and nonlinear aerodynamics forces and moments. The classical autopilot for a missile is based on a three-loop topology, where each loop consists of tunable proportional gains. An adaptive three-loop autopilot is constructed by augmenting the classical autopilot’s fixed-gain controllers with a learning-based controller, which is recursively optimized using retrospective cost optimization. Numerical simulations show that online learning improves the tracking performance of the classical autopilot in both nominal and off-nominal interception scenarios.

1 Introduction↩︎

The design of autopilots for missiles is a challenging control problem due to several factors, including nonlinear dynamics, nonminimum phase behavior due to nose-mounted gyro sensor and tail-fin actuation, and uncertain aerodynamic loading. Furthermore, in a typical mission, a missile undergoes a range of Mach numbers and aggressive maneuvers, which results in strongly varying aerodynamic loading. Model-based control of missiles is thus impractical. Instead, a common classical missile autopilot is based on a three-loop topology that uses the IMU data, including the acceleration and angular velocity measurements, with flight-scheduled gains. The gain-scheduled controller consists of gains scheduled to several operating conditions. However, designing the controller gains for each operating condition is expensive and time-consuming.

Several nonlinear adaptive control techniques have been investigated for missile autopilot design. Model-based nonlinear control techniques such as adaptive backstepping is explored in [1], \({\mathcal{L}}_1\) control is explored in [2], MRAC is explored in [3], input-output linearization is explored in [4], [5], and Lyapunov-based approaches are explored in [6], [7]. In this paper, we consider the retrospective cost adaptive control (RCAC) to recursively optimize the gains of a classical three-loop autopilot. RCAC has been previously investigated for the missile control application. In particular, an auto-regressive-moving-average adaptive control law was used to generate pitch rate commands in [8], whereas RCAC was used to improve the missile’s ability to intercept a target under sensor failures in [9]. However, this paper investigates the ability of RCAC to tune the gains of the three-loop autopilot. In particular, we augment the fixed-gain controllers in the three-loop autopilot with adaptive controllers in parallel. This architecture allows RCAC to both improve the autopilot’s tracking performance as well as automatically tune the gains of the autopilot from measured data instead of models.

The paper is organized as follows. Section 2 briefly reviews the nonlinear dynamics of a missile and the classical three-loop autopilot topology. Section 3 presents the adaptive augmentation of the three-loop autopilot. Three scenarios are presented in Section 4 to investigate the performance of the adaptive autopilot in nominal and off-nominal conditions. Finally, the paper concludes with a summary in Section 5.

2 Missile Dynamics and Control↩︎

This section briefly reviews the equations of motion of a missile and the classical three-loop autopilot.

2.1 Longitudinal Dynamics↩︎

Let \(\rm F_A\) be an inertial frame and let \(\rm F_B\) be a frame fixed to the missile body as shown in 1.

Figure 1: Free body diagram of a missile.

The frame \(\rm F_B\) is obtained by rotating it about \({\hat{\jmath}}_{\rm A}\) by the pitch angle \(\theta,\)

and thus \(\overset{\rightharpoonup}{\omega}_{\rm B/A} = \dot{\theta} \hat{\jmath}_{\rm B}.\) Let \({\rm c}\) denote the center of mass of the missile and let \(w\) denote a point with zero inertial acceleration. Let \(\rm F_C\) be a frame such that \({\hat{\imath}}_{\rm C}= \hat{v}_{{\rm c}/w/{\rm A}}\) and \({\hat{\jmath}}_{\rm C}= {\hat{\jmath}}_{\rm B},\) where \(\hat{v}_{{\rm c}/w/{\rm A}}\) is the unit vector along the velocity \(\overset{\rightharpoonup}{v}_{{\rm c}/w/{\rm A}}\) of the missile. Therefore, \(\overset{\rightharpoonup}{v}_{{\rm c}/w/{\rm A}} = V {\hat{\imath}}_{\rm C},\) where \(V\) is the magnitude of the missile velocity. Note that the frame \(\rm F_C\) is obtained by rotating it about \({\hat{\jmath}}_{\rm A}\) by the flight path angle \(\gamma.\)

The angle of attack is thus \(\alpha \stackrel{\triangle}{=}\theta - \gamma.\)

The equations of motion of a missile are \[\begin{align} \dot{M} &= \frac{T}{ma} \cos (\alpha) - \frac{g}{a} \sin (\gamma) - \frac{\rho a M^2 S}{2m} \big[C_{\rm N}\sin (\alpha) + C_{\rm A}\cos (\alpha) \big] , \tag{1}\\ \dot{\gamma} &= \frac{T}{maM} \sin (\alpha) - \frac{g}{aM}\cos (\gamma) + \frac{\rho a M S}{2m} \big[C_{\rm N}\cos (\alpha) - C_{\rm A}\sin (\alpha) \big] , \tag{2} \\ \dot{\theta} &= q, \tag{3} \\ \dot{q} &= \frac{\rho a^2 M^2 S d}{2I_y} C_{\rm M} ,\\ \dot{h} &= Ma\sin(\gamma), \\ \dot{X} &= Ma\cos(\gamma), \tag{4} \end{align}\] where \(m\) is the mass of the missile, \(a\) is the speed of sound, \(M\) is the Mach number, \(T\) is the thrust applied to the missile along \({\hat{\imath}}_{\rm B},\) \(I_y\) is the moment of inertia, \(q\) is the pitch rate, \(h\) is the altitude, and \(X\) is the downrange. The derivation of the equations of motion 1 4 and the vehicle parameters used in this study are detailed in Appendix A. The aerodynamic coefficients \(C_{\rm N}, C_{\rm A},\) and \(C_{\rm M}\) are nonlinear functions of the angle of attack \(\alpha\), fin deflection angle \(\delta,\) and the pitch rate \(q,\) as shown in 7 9 .

2.2 Three-loop Autopilot↩︎

This section briefly reviews the three-loop autopilot. Several multi-loop autopilot topologies for normal-acceleration tracking are described in more detail in [10]. The control architecture for normal acceleration tracking is shown in Figure 2. In practice, the normal acceleration reference \(a_{z,\rm ref}\) is given by the guidance law. The three-loop autopilot uses the reference normal acceleration and the measurements of the normal acceleration and the pitch rate to compute the fin deflection command \(u\).

Figure 2: Control architecture to track normal acceleration commands.

In particular, the control signal \(u\) generated by the three-loop autopilot is \[\begin{align} u = K_q q + \int ( K_\theta q + K_{\rm a}(a_{z, \rm ref} - K_{{\rm a}_z} a_z) ) {\rm d}t, \label{eq:u953LA} \end{align}\tag{5}\] where \(K_{\rm q}, K_\theta, K_{\rm a},\) and \(K_{{\rm a}_z}\) are the tunable proportional gains. In this work, as considered in [10], we set \(K_{\rm q}= 0.464, K_\theta = 15.62474, K_{\rm a}= 0.2446459,\) and \(K_{{\rm a}_z} = 0.9278,\) \(a_{z, \rm ref}\) is the normal acceleration command, \(a_{z}\) is the sensed acceleration at the IMU given by \[\begin{align} a_z \stackrel{\triangle}{=} a_{z, \rm CG} - \dot{q} d_{\rm IMU}, \end{align}\] where \(d_{\rm IMU}=0.5 \;\rm m\) is the distance from the center of gravity to the IMU, and the gravity-corrected normal acceleration of the center of gravity \(a_{z, \rm CG}\) is \[\begin{align} a_{z, \rm CG} = \dfrac{\rho a^2 M^2 S}{2{m}} C_{\rm N}. \end{align}\] The IMU provides the output vector \(y\) that contains the measurements of the normal acceleration \(a_z\) and pitch rate \(q.\) The implementation of the control law 5 is shown in Figure 3. In the rest of the paper, the three-loop autopilot with fixed gains is denoted by F-TLA.

Figure 3: Classical three loop controller topology [10].

3 Adaptive TLA↩︎

In the adaptive augmentaion, shown in Figure 4, the input signal computed by the three-loop autopilot is modified as \[\begin{align} u(t) = u_{\rm TLA }(t) + u_{\rm a}(t), \end{align}\] where \(u_{\rm TLA }(t)\) is the control signal computed using 5 and \(u_{\rm a}(t)\) is the control signal computed by the adaptive controller. In particular, the control signal \(u_{\rm a}(t)\) is computed by the RCAC algorithm. Since RCAC is a discrete-time algorithm that updates the control signal at a fixed timestep, the control \(u_{\rm a}(t)\) in between the control updates is held constant. Letting \(T_{\rm s}\) denote the timestep, for \(t \in (k T_{\rm s}, (k+1) T_{\rm s}),\) the control \(u_{\rm a}(t)\) is thus given by \[\begin{align} u_{\rm a}(t) = u_{k}, \end{align}\] where \(k\) is the iteration number and \(u_k\) is the control signal computed by RCAC. In particular, an adaptive dense structured controller, described in [11], is chosen so that the regressor matrix is given by \[\begin{align} {\phi_k} = \begin{bmatrix} u_{k-1} & \cdots & u_{k-n_c}& z_{k-1}& \cdots & z_{k-n_c} \gamma_k \end{bmatrix}, \end{align}\] where \(z_k \stackrel{\triangle}{=}a_{z,\rm ref} - a_z,\) \(\gamma_k \stackrel{\triangle}{=}\sum_{i=1}^k z_i\) is the accumulated error, and the adaptive controller gain \(\theta_k \in {\mathbb{R}}^{2 n_c+1}\) is updated by retrospective cost optimization. The adaptive control signal \(u_k\) is thus an adaptive weighting of the regressor matrix, in the form of \(u_k = \Phi_k \theta_k.\) The adaptive weight \(\theta_k\) is determined at each time step \(k\) by minimizing the retrospective cost function \[\begin{align} J(k) &\stackrel{\triangle}{=} \sum_{i=1}^{k} \lambda^{k-i} [\hat{z}^{\mathrm{T}}_{i}R_z\hat{z}_{i}+(\phi_{i}\hat{\theta})^{\mathrm{T}}R_{u}(\phi_{i}\hat{\theta})] + \lambda^{k}(\hat{\theta}-\theta _{0})^{\mathrm{T}}R_{\theta}(\hat{\theta}-\theta_{0}), \end{align}\] where \(\hat{z}_i \stackrel{\triangle}{=}z_i + \phi_{{\rm f},i} \hat{\theta} - u_{{\rm f}, i}\) is the retrospective performance, \(\phi_{{\rm f}, i} \stackrel{\triangle}{=}G_{\rm f}({\boldsymbol{\textrm{q}}}) \phi_i\) and \(u_{{\rm f}, i} \stackrel{\triangle}{=}G_{\rm f}({\boldsymbol{\textrm{q}}}) u_i\) are the filtered regressor and the input, \({\boldsymbol{\textrm{q}}}\) is the forward-shift operator, \(\lambda\) is a forgetting factor, \(R_z\), \(R_u\), and \(R_\theta\) are hyperparameters, and \(G_f\) is a filter. The RCAC algorithm to compute \(u_k\) is described in [12]–[14].

As shown in [15], the cost function that is retrospectively minimized in RCAC includes a filter, \(G_{\rm f}\), which should consist of any nonminimum phase zeros of the system. In this work, we update the filter \(G_{\rm f}\) with the real nonminimum phase zeros of the linearized missile dynamics 1 4 , computed at the current state. Thus, the filter \(G_{\rm f}\) is time varying and is updated at every step. In the rest of the paper, the three-loop autopilot with the adaptive augmentation is denoted by A-TLA.

Figure 4: Adaptive three loop controller topology.

4 Numerical Simulation↩︎

This section investigates the performance improvements with the adaptive augmentation of the TLA. In particular, the performance of the A-TLA is investigated in the case of an arbitrary normal acceleration command and an interception mission scenario, described in Appendix B. The missile dynamics and the TLA described in previous sections are simulated with Matlab’s ode45 routine. In all scenarios considered in this paper, the initial Mach number of the missile is 2.5, the initial flight path angle is \(45\) degrees, the initial pitch angle is \(45\) degrees, the initial pitch rate is \(0\) \(\rm rad/s,\) the initial altitude is \(3,500\) \(\rm m,\) a constant thrust of \(3800\) newtons, and the fin deflection angle and its rate are \(0\) degrees and \(0\) \(\rm rad/s,\) respectively.

4.1 RCAC Hyperparameter Optimization↩︎

The RCAC hyperparameter \(R_z\) is held constant at unity, while \(R_u\) and \(R_\theta\) are numerically optimized using a particle swarm optimziation (PSO) framework. In a PSO, the parameter vector to optimize is treated as a particle with a position and velocity. A swarm of particles are evaluated and each particle’s motion is influenced by the best local and global parameters previously found. The objective of PSO is thus to find the set of parameters that minimize a cost function. The cost function used to tune the RCAC hyperparameters in the PSO framework is \[\begin{align} J_{\rm PSO} = \frac{1}{N} \sum_{i=1}^N \frac{2}{g}|z_i| + 0.2 {\rm max}({|\dot{q_i}|-\dot{q}_{\rm max}},0), \end{align}\] where \(\dot{q}_{\rm max} = 3\) \(\rm degrees/s^2,\) where \(N\) is the length of the simulation, and the data \(z_i\) and \(u_i\) are generated by the closed-loop response to the command \(a_{z, \rm ref} = 0.5g -8g {\rm sign}(\sin(0.3t)).\)

The controller optimized by RCAC is a \(4\)th-order ARMA controller with a built-in integrator. Various parameterizations that can be used in the RCAC framework are described in [16]. Note that, in our preliminary work, the cost function is chosen by trial and error. In the particle swarm, \(R_u\) is restricted between \(0\) and \(20\) and \(R_\theta\) is restricted between \(10^0\) and \(10^{15}.\) The RCAC hyperparameters after swarm optimization are \((R_u, R_\theta) = (0.25427, 10^{14.398}).\) In all numerical experiments described next in the paper, the RCAC hyperparameters are fixed.

Next, the performance improvements due to the adaptive augmentation are investigated. The acceleration reference is a constant step of magnitude \(10g.\) To demonstrate the performance recovery with the adaptive augmentation, we manually degrade the F-TLA by scaling all of its gains by \(\alpha_{\rm TLA} \in {\mathbb{R}}\) Figure 5 shows the closed-loop step response with the F-TLA and the A-TLA for various scaling factor values \(\alpha_{\rm TLA}.\) Note that the performance of the F-TLA and the A-TLA is similar in the case where \(\alpha_{\rm TLA} = 1.\) On the other hand, the degraded F-TLA’s step response performance degrades, as expected. However, the A-TLA performs similarly for a wide range of \(\alpha_{\rm TLA},\) thus demonstrating the performance improvements due to the adaptive augmentation.

Figure 5: The evolution of the normal acceleration for the F-TLA and A-TLA to a 10 g step command at various scalings of the gains in the TLA .

4.2 Harmonic Response↩︎

Next, we set \(a_{z, \rm ref} = 10 g \sin(t).\) Figure 6 shows the normal acceleration tracking responsE with the F-TLA and the A-TLA, using the same legend and \(\alpha_{\rm TLA}\) scalings decribed previously.

Figure 6: The evolution of the normal acceleration for the F-TLA and A-TLA to a 10 g sinusoidal command.

4.3 Interception Response↩︎

Next, we consider a missile mission where the objective of the missile is to intercept a target. To achieve interception, the normal acceleration commands are generated using a proportional guidance law based on the pursuit-evasion dynamics described in Appendix B. We consider the evader model described in [17] as the target. The target’s initial speed is 0.85 Mach, its flight path angle is 15 degrees, with an altitude of 4 km, and 1 km of horizontal distance from the missile.

The missile thrust is assumed to be \[\begin{align} T(t) = \begin{cases} 15,000 {\rm N}, & t \in (0, 10), \\ 2,000 {\rm N}, & t \in (10, 20), \\ 0 {\rm N}, & t > 20. \end{cases} \end{align}\] The missile’s initial speed is Mach 0.5, its initial flight path angle is 0 degrees, and its initial altitude is 3 km. The hyperparameters used in the RCAC are the same as those used in previous numerical simulations. Figure 7 shows the trajectory of the evader (in solid black), the trajectory of an ideal pursuer (in solid blue), the trajectory of the missile with the A-TLA (in solid green), and the trajectory of the missile with the nominally tuned F-TLA (in dashed red). Note that the ideal pursuer is assumed to be a point mass whose normal acceleration is instantaneously equal to the normal acceleration commanded by the guidance law. Furthermore, note that, in the case of a nominally tuned F-TLA, the performance of the A-TLA is similar to that of the F-TLA.

Figure 8 shows the commanded normal acceleration, the response of the missile with the F-TLA and A-TLA. and the gains of the adaptive controller updated by the RCAC algorithm. Note that, in the case of a nominally tuned F-TLA, the normal acceleration response of the A-TLA is similar to that of the F-TLA. Furthermore, note that the large increase in the normal acceleration command is due to the missile’s terminal behaviour.

Figure 7: Interception trajectories with the nominally tuned F-TLA and A-TLA.

Figure 8: Normal acceleration command and response with the nominally tuned F-TLA and A-TLA. The third subfigure shows the gains updated by RCAC.

Next, to investigate the performance recovery due to the adaptive augmentation, we degrade the F-TLA by scaling all of the F-TLA gains by the scaler \(\alpha_{\rm TLA} = 0.2.\) Figure 9 shows the trajectory of the evader (in solid black), the trajectory of an ideal pursuer (in solid blue), the trajectory of the missile with the A-TLA (in solid green), and the trajectory of the missile with the nominally tuned F-TLA (in dashed red). Note that, in the case of off-nominal F-TLA, the interception performance degrades substantially, however, A-TLA compensates for the degraded F-TLA and recovers the performance.

Figure 10 shows the commanded normal acceleration and the response of the missile with the F-TLA and A-TLA. Note that, in the case of off-nominal F-TLA, the normal acceleration response degrades. However, A-TLA compensates for the degraded F-TLA and generates the required normal acceleration to recover performance.

Figure 9: Interception trajectories with the off-nominal F-TLA and A-TLA. The F-TLA is degraded by scaling the nominal fixed gains by a scalar factor \alpha{\rm TLA} = 0.2. — Figure 9: Interception trajectories with the off-nominal F-TLA and A-TLA. The F-TLA is degraded by scaling the nominal fixed gains by a scalar factor \(\alpha{\rm TLA} = 0.2.\)

Figure 10: Normal acceleration command and response with the off-nominal F-TLA and A-TLA. Note that the A-TLA adjusts the controller gains, shown in the third subfigure, to maintain performance.

Next, to investigate the robustness of the A-TLA, we vary various parameters of the nonlinear missile dynamics. In particular, we consider two scenarios, where first, we vary the fin-deflection coefficients \(d_{\rm N}\) in 7 and \(d_{\rm M}\) in 9 , and second, we vary all the coefficients which multiply the angle of attack in 7 and 9 . In each case, the coefficient to be scaled is multiplied by the scalar factor \(\alpha_{\rm X}.\) In this work, we set \(\alpha_{\rm X}\) to five equispaced values between \(0.2\) and \(2.\) The interception scenario described in the previous section is considered to investigate the robustness of the the A-TLA. Figure 11 shows the trajectory tracking response of the missile with the F-TLA in dashed lines and with the A-TLA in solid lines. Note that in each case, the A-TLA compensates for the variation in the missile dynamics and maintains the performance close to the ideal pursuer trajectory, whereas, the performance of the F-TLA degrades substantially in terms of time of flight and miss distance.

Figure 11: Interception trajectories with the F-TLA (dashed lines) and A-TLA (solid lines) in the scenarios where a) the fin-deflection coefficient is scaled, b) the anlge of attack coefficients are scaled, Note that the A-TLA maintains performance close to the ideal pursuer for a wider range of parameters.

5 Conclusions↩︎

This paper investigated the application of a data-driven output feedback adaptive controller to improve the performance of the classical three-loop autopilot in off-nominal conditions. In particular, an adaptive proportional-integral controller, optimized by the retrospective cost adaptive control algorithm, augments a nominally tuned, fixed-gain three-loop autopilot. The particle swarm optimization framework was used to optimize the hyperparameters of the adaptive control algorithm. The particle-swarm optimized adaptive algorithm was then used to numerically investigate the performance of the adaptive three-loop autopilot in the nominal and off-nominal scenarios. Numerical simulations showed that adaptive augmentation maintains the missile’s performance close to the ideal pursuer’s trajectory in off-nominal scenarios, where the performance with the fixed-gain three-loop autopilot significantly degrades.

Appendix A: Missile Longitudinal Dynamics↩︎

As shown in Figure 1, \(\overset{\rightharpoonup}{v}_{{\rm c}/w/{\rm A}} = V {\hat{\imath}}_{\rm C},\) and thus \[\begin{align} \overset{\rightharpoonup}{a}_{{\rm c}/w/{\rm A}} &= \dot{V} {\hat{\imath}}_{\rm C} + V \stackrel{{\rm A}\bullet}{ {\hat{\imath}}_{\rm C}} = \dot{V} {\hat{\imath}}_{\rm C} - V \dot{\gamma} {\hat{k}}_{\rm C}. \end{align}\] The total force on the missile is \[\begin{align} \overset{\rightharpoonup}{f}_{\rm B} &= m g {\hat{k}}_{{\rm A}} - f_{\rm n} {\hat{k}}_{{\rm B}} - f_{\rm a} {\hat{\imath}}_{{\rm B}} + T {\hat{\imath}}_{\rm B} \nonumber\\ &= \big( - m g\sin (\gamma) - f_{\rm n}\sin (\alpha) + (T-f_{\rm a}) \cos (\alpha) \big) {\hat{\imath}}_{\rm C} + \big( m g\cos (\gamma) -f_{\rm n}\cos (\alpha) - (T-f_{\rm a}) \sin (\alpha) \big) {\hat{k}}_{\rm C}, \end{align}\] where \(m\) is the mass of the missile, \(g\) is the acceleration due to gravity, \(f_{\rm n}\) and \(f_{\rm a}\) are the normal and axial aerodynamic forces on the missile, and \(T\) is the thrust, and the total moment relative to \({\rm c}\) is \[\begin{align} \overset{\rightharpoonup}{M}_{\rm B/c} = {\mathcal{M}} {\hat{\jmath}}_{\rm B}, \end{align}\] where the aerodynamic forces and moments are parameterized by \[\begin{align} f_{\rm n} &= \frac{1}{2} \rho S V^2 C_{\rm N}, \quad f_{\rm a} = \frac{1}{2} \rho S V^2 C_{\rm A}, \quad {\mathcal{M}} = \frac{1}{2} \rho S V^2 d C_{\rm M}, \label{eq:aero95eq} \end{align}\tag{6}\] where \(d\) is the chord length, \(S\) is the reference surface area, and \(\rho\) is the air density calculated by the International Standard Atmosphere model. Furthermore, the aerodynamic coefficients \(C_{\rm N}, C_{\rm A},\) and \(C_{{\rm M}}\) are parameterized by \[\begin{align} C_{\rm N} &= a_{\rm N}\alpha^3 + b_{\rm N}\alpha|\alpha| + c_{\rm N}(2 - {M}/{3})\alpha + d_{\rm N}\delta, \tag{7} \\ C_{\rm A} &= a_{\rm A}, \tag{8} \\ C_{{\rm M}} &= a_{\rm M}\alpha^3 +b_{\rm M}\alpha|\alpha| + c_{\rm M}({8M}/{3} - 7)\alpha + d_{\rm M}\delta + e_{\rm M}q, \tag{9} \end{align}\] where \(\delta\) is the fin deflection angle. The transfer function from the fin deflection angle command \(u\) to the fin deflection angle \(\delta\) is \[\begin{align} G_{\delta u}(s) = \frac{\omega_a^2}{s^2 + 2 \zeta \omega_a + \omega_a^2}. \end{align}\] where \(\zeta\) is the damping ratio and \(\omega_a\) is the natural frequency of the actuator.

Resolving the force \(\overset{\rightharpoonup}{f}_{\rm B}\) and the inertial acceleration \(\stackrel{{\rm A}\bullet}{\overset{\rightharpoonup}{v}}_{{\rm c}/w/{\rm A}}\) in \(\rm F_{C}\), and using the Newton’s second law, \(m \stackrel{{\rm A}\bullet}{\overset{\rightharpoonup}{v}}_{{\rm c}/w/{\rm A}} = \overset{\rightharpoonup}{f}_{\rm B},\) yields \[\begin{align} m \dot{V} &= - m g\sin (\gamma) - f_{\rm n}\sin (\alpha) + (T-f_{\rm a}) \cos (\alpha) , \tag{10}\\ -mV \dot{\gamma} &= m g\cos (\gamma) -f_{\rm n}\cos (\alpha) - (T-f_{\rm a}) \sin (\alpha). \tag{11} \end{align}\] Similarly, resolving the moment \(\overset{\rightharpoonup}{M}_{\rm B/c}\) in \(\rm F_{C}\) and using the Euler’s equation yields \[\begin{align} I_y \ddot \theta = {\mathcal{M}}. \label{eq:32moment32qdot} \end{align}\tag{12}\] which can be rewritten in the state-space form as \[\begin{align} \dot{\theta} &= q, \quad \dot{q} = \frac{\rho V^2 S d}{2I_y} C_{\rm M}, \label{qdot32eq} \end{align}\tag{13}\] where \(q\) is the pitch rate.

For the simulations presented in this work, the values of the parameters parameterizing the aerodynamic coefficient are given in Table 1, and the physical properties of the missile are given in Table 2.

Table 1: Parameter values.
Parameter	Value	Parameter	Value
\(a_{\rm N}\)	-19.373	\(a_{\rm M}\)	40.440
\(b_{\rm N}\)	31.023	\(b_{\rm M}\)	-64.015
\(c_{\rm N}\)	9.717	\(c_{\rm M}\)	2.922
\(d_{\rm N}\)	1.948	\(d_{\rm M}\)	-11.803
\(a_A\)	0.3005	\(e_m\)	-1.719
\(\omega_a\)	\(150\) \(\rm rad/s\)	\(\zeta\)	0.7

Table 2: Physical properties of the missile.
Parameter	Value	Parameter	Value
Mass \(m\)	204.0227 kg	\(S\)	\(0.0409\) \(\rm m^2\)
\(d\)	\(0.2286\) \(\rm m\)	\(I_y\)	\(247.4336\) \(\rm kg \;m^2\)
\(d_{\rm imu}\)	\(0.5\) m

Appendix B: Interception Guidance↩︎

This appendix describes the derivation of the normal acceleration command using proportional guidance. A planar interception geometry for a pursuer \({\rm P}\) and an evader \({\rm E}\) is shown in Figure 12 [18].

Figure 12: Interception geometry for the pursuer and the evader.

The interception dynamics for the pursuer and the evader is \[\begin{align} \dot{V}_{\rm P} &= \frac{T_{\rm P}-D_{\rm P}}{m_{\rm P}} - g \sin \gamma_{\rm P}, \\ \dot{V}_{\rm E} &= \frac{T_{\rm E}-D_{\rm E}}{m_{\rm E}} - g \sin \gamma_{\rm E}, \\ \dot{\gamma}_{\rm P} &= -\frac{1}{V_{\rm P}} \left( \frac{T_{z,{\rm P}}}{m_{\rm P}} + g \cos \gamma_{\rm P}\right), \label{eq:dot95gamma95P} \\ \dot{\gamma}_{\rm E} &= -\frac{1}{V_{\rm E}} \left(n_{z,{\rm E}} + g \cos \gamma_{\rm E}\right), \end{align}\tag{14}\] where \(V_{\rm P}, V_{\rm E}\) are the velocities, \(\gamma_{\rm P},\) \(\gamma_{\rm E}\) are the flight-path angles, \(T_{\rm P}, T_{\rm E}\) are the thrust, \(D_{\rm P}, D_{\rm E}\) are the drag, and \(n_{z,{\rm P}}, n_{z,{\rm E}}\) are the normal accelerations of the pursuer and evader, respectively. The line of sight angle \(\beta\) satisfies \[\begin{align} \dot{\beta} = \frac{1}{R} \left( V_{\rm P}\sin(\beta - \gamma_{\rm P}) - V_{\rm E}\sin(\beta - \gamma_{\rm E}) \right) \label{eq:dotbeta} \end{align}\tag{15}\]

The proportional guidance law is \[\begin{align} \dot{\gamma}_{\rm P} = \lambda \dot{\beta}. \label{eq:prop95guid} \end{align}\tag{16}\] Substituting 14 and 15 in 16 yields the normal acceleration command \[\begin{align} n_{z,{\rm P}} &= -\frac{V_{\rm P}}{R} \left( V_{\rm P}\sin(\beta - \gamma_{\rm P}) - V_{\rm E}\sin(\beta - \gamma_{\rm E}) \right) - g \cos \gamma_{\rm P}. \end{align}\]

References↩︎

[1]

Kim, S.-H., Kim, Y.-S., and Song, C., “A robust adaptive nonlinear control approach to missile autopilot design,”Control Engineering Practice, Vol. 12, No. 2, 2004, pp. 149–154. , https://www.sciencedirect.com/science/article/pii/S0967066103000169.

[2]

Erdos, D., Shima, T., Kharisov, E., and Hovakimyan, N., “L1 adaptive control integrated missile autopilot and guidance,”AIAA guidance, navigation, and control conference, 2012, p. 4465.

[3]

Ouda, A. N., “A robust adaptive control approach to missile autopilot design,”International Journal of Dynamics and Control, Vol. 6, No. 3, 2018, pp. 1239–1271. , https://doi.org/10.1007/s40435-017-0352-4.

[4]

Tsourdos, A., and White, B. A., “Adaptive flight control design for nonlinear missile,”Control Engineering Practice, Vol. 13, No. 3, 2005, pp. 373–382. , https://www.sciencedirect.com/science/article/pii/S0967066104001157, aerospace IFAC 2002.

[5]

Lee, C.-H., Jun, B.-E., Lee, J.-I., and Tahk, M.-J., “Nonlinear missile autopilot design via three loop topology and time-delay adaptation scheme,”2013 13th International Conference on Control, Automation and Systems (ICCAS 2013), 2013, pp. 50–54. .

[6]

Fu, L.-C., Chang, W.-D., Yang, J.-H., and Kuo, T.-S., “Adaptive robust bank-to-turn missile autopilot design using neural networks,”Journal of Guidance, Control, and Dynamics, Vol. 20, No. 2, 1997, pp. 346–354.

[7]

Hou, M., Liang, X., and Duan, G., “Adaptive block dynamic surface control for integrated missile guidance and autopilot,”Chinese Journal of Aeronautics, Vol. 26, No. 3, 2013, pp. 741–750. , https://www.sciencedirect.com/science/article/pii/S1000936113000861.

[8]

Sobolic, F. M., Cruz, G., and Bernstein, D. S., “An inner-loop/outer-loop architecture for an adaptive missile autopilot,”2015 American Control Conference (ACC), 2015, pp. 850–855. .

[9]

Fuentes, R., Hoagg, J., Anderton, B., D’Amato, A., and Bernstein, D., “Investigation of cumulative retrospective cost adaptive control for missile application,”AIAA Guidance, Navigation, and Control Conference, 2010, p. 7577.

[10]

Mracek, C., and Ridgely, D., “Missile longitudinal autopilots: connections between optimal control and classical topologies,”AIAA guidance, navigation, and control conference and exhibit, 2005, p. 6381.

[11]

Goel, A., Islam, S. A. U., and Bernstein, D. S., “Adaptive Control of MIMO Systems Using Sparsely Parameterized Controllers,”2020 American Control Conference (ACC), IEEE, 2020, pp. 5340–5345.

[12]

Kamaldar, M., Islam, S. A. U., Sanjeevini, S., Goel, A., Hoagg, J. B., and Bernstein, D. S., “Adaptive digital PID control of first-order-lag-plus-dead-time dynamics with sensor, actuator, and feedback nonlinearities,”Advanced Control for Applications, Vol. 1, No. 1, 2019, p. e20. .

[13]

Oveissi, P., Trivedi, A., Goel, A., Tumuklu, O., Hanquist, K. M., Farahmandi, A., and Philbrick, D., “Learning-based Adaptive Thrust Regulation of Solid Fuel Ramjet,”AIAA SCITECH 2023 Forum, 2023, p. 2533.

[14]

Oveissi, P., Goel, A., Tumuklu, O., and Hanquist, K. M., “Adaptive Combustion Regulation in Solid Fuel Ramjet,”AIAA SCITECH 2024 Forum, 2024, p. 0743.

[15]

Rahman, Y., Xie, A., and Bernstein, D. S., “Retrospective cost adaptive control: Pole placement, frequency response, and connections with LQG control,”IEEE Control Systems Magazine, Vol. 37, No. 5, 2017, pp. 28–69.

[16]

Goel, A., U. Islam, S. A., and Bernstein, D. S., “Adaptive Control of MIMO Systems Using Sparsely Parameterized Controllers,”2020 American Control Conference (ACC), 2020, pp. 5340–5345. .

[17]

Islam, S. A. U., and Bernstein, D., “Minimum Time-of-Flight Interceptor Guidance Using Real-Time-Implementable Model-Predictive Guidance,”AIAA SCITECH 2022 Forum, 2022, p. 1377.

[18]

Kabamba, P. T., and Girard, A. R., Fundamentals of Aerospace navigation and guidance, Cambridge University Press, 2014.

Graduate Research Assistant, Department of Mechanical Engineering, University of Maryland, Baltimore County, 1000 Hilltop Circle, Baltimore, MD 21250.↩︎
Postdoctoral Research Associate, Department of Aerospace & Mechanical Engineering, University of Arizona, 1130 N. Mountain Avenue, Tucson, AZ 85721. AIAA Member.↩︎
Assistant Professor, Department of Aerospace & Mechanical Engineering, University of Arizona, 1130 N. Mountain Avenue, Tucson, AZ 85721. AIAA Senior Member.↩︎
Assistant Professor, Department of Mechanical Engineering, University of Maryland, Baltimore County, 1000 Hilltop Circle, Baltimore, MD 21250.↩︎
Undergraduate Student, Department of Mechanical Engineering, 1000 Hilltop Circle, Baltimore, MD 21250.↩︎
Graduate Student, Department of Mechanical Engineering, 1000 Hilltop Circle, Baltimore, MD 21250.↩︎
Principal Staff, Johns Hopkins University Applied Physics Laboratory, Laurel, MD, 20723.↩︎
Assistant Professor, Department of Mechanical Engineering, 1000 Hilltop Circle, Baltimore, MD 21250.↩︎

Thrust Regulation in a Solid Fuel Ramjet using
Dynamic Mode Adaptive Control
Swarm-optimized Adaptive Augmentation of Missile Autopilot

Abstract

1 Introduction↩︎

2 Missile Dynamics and Control↩︎

2.1 Longitudinal Dynamics↩︎

2.2 Three-loop Autopilot↩︎

3 Adaptive TLA↩︎

4 Numerical Simulation↩︎

4.1 RCAC Hyperparameter Optimization↩︎

4.2 Harmonic Response↩︎

4.3 Interception Response↩︎

5 Conclusions↩︎

Appendix A: Missile Longitudinal Dynamics↩︎

Appendix B: Interception Guidance↩︎

References↩︎

Subjects

Updated on Academus

Thrust Regulation in a Solid Fuel Ramjet using Dynamic Mode Adaptive ControlSwarm-optimized Adaptive Augmentation of Missile Autopilot

Abstract

1 Introduction↩︎

2 Missile Dynamics and Control↩︎

2.1 Longitudinal Dynamics↩︎

2.2 Three-loop Autopilot↩︎

3 Adaptive TLA↩︎

4 Numerical Simulation↩︎

4.1 RCAC Hyperparameter Optimization↩︎

4.2 Harmonic Response↩︎

4.3 Interception Response↩︎

5 Conclusions↩︎

Appendix A: Missile Longitudinal Dynamics↩︎

Appendix B: Interception Guidance↩︎

References↩︎

Subjects

Updated on Academus

Thrust Regulation in a Solid Fuel Ramjet using
Dynamic Mode Adaptive Control
Swarm-optimized Adaptive Augmentation of Missile Autopilot