Fourth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (4th-CASAM-N): I. Mathematical Framework

Cacuci, Dan Gabriel

doi:10.3390/jne3010004

Open AccessArticle

Fourth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (4th-CASAM-N): I. Mathematical Framework

by

Dan Gabriel Cacuci

Center for Nuclear Science and Energy, University of South Carolina, Columbia, SC 29208, USA

J. Nucl. Eng. 2022, 3(1), 37-71; https://doi.org/10.3390/jne3010004

Submission received: 7 January 2022 / Revised: 15 February 2022 / Accepted: 16 February 2022 / Published: 23 February 2022

(This article belongs to the Special Issue Feature Paper Special Issue for Editorial Board Members (EBMs) of Journal of Nuclear Engineering)

Download Versions Notes

Abstract

:

This work presents the fourth-order comprehensive sensitivity analysis methodology for nonlinear systems (abbreviated as “4th-CASAM-N”) for exactly and efficiently computing the first-, second-, third-, and fourth-order functional derivatives (customarily called “sensitivities”) of physical system responses (i.e., “system performance parameters”) to the system’s (or model) parameters. The qualifier “comprehensive” indicates that the 4th-CASAM-N methodology enables the exact and efficient computation not only of response sensitivities with respect to the customary model parameters (including computational input data, correlations, initial and/or boundary conditions) but also with respect to imprecisely known material boundaries, caused by manufacturing tolerances, of the system under consideration. The 4th-CASAM-N methodology presented in this work enables the hitherto very difficult, if not intractable, exact computation of all of the first-, second-, third-, and fourth-order response sensitivities for large-scale systems involving many parameters, as usually encountered in practice. Notably, the implementation of the 4th-CASAM-N requires very little additional effort beyond the construction of the adjoint sensitivity system needed for computing the first-order sensitivities. The application of the principles underlying the 4th-CASAM-N to an illustrative paradigm nonlinear heat conduction model will be presented in an accompanying work.

Keywords:

fourth-order adjoint sensitivity analysis methodology; exact expressions for efficient computation of first-, second-, third-, and fourth-order response sensitivities; large-scale systems

1. Introduction

The computational model of a physical system comprises the following conceptual components: (a) a well-posed system of that relate the system’s independent variables and parameters to the system’s state (i.e., dependent) variables; (b) probability distributions, moments thereof, inequality and/or equality constraints that define the range of variations of the system’s parameters; and (c) one or several quantities, customarily referred to as system responses (or objective functions, or indices of performance), which are computed using the mathematical model. This works presents a new, general-purpose methodology for computing exactly and efficiently functional derivatives (called “sensitivities”) of results (“system responses”), predicted by nonlinear mathematical models of systems (physical, engineering, biological) involving imprecisely known (i.e., uncertain) parameters, including input data, correlations, initial and/or boundary conditions, as well as manufacturing tolerances that affect domain of the model’s definition in phase space. This new method is called the “fourth-order comprehensive sensitivity analysis methodology for nonlinear systems” (abbreviated as “4th-CASAM-N”) since it enables the hitherto very difficult, if not intractable, exact computation of all of the first-, second-, third-, and fourth-order response sensitivities for large-scale systems involving many parameters, as is usually encountered in practice. The foundation for the material presented in this work is provided by the first-order adjoint sensitivity analysis procedure for nonlinear systems that was originally formulated in a general, functional analytic framework by Cacuci [1,2].

The aims and means of sensitivity theory/analysis are occasionally confused with the aims and means of optimization theory. The algorithms underlying optimization theory compute the values at which the first-order derivatives of a response with respect to the state functions and/or parameters vanish. In contradistinction, sensitivity theory/analysis aims at computing the response sensitivities to parameters at the nominal values for the model’s parameters and state functions. Therefore, although both sensitivity analysis and optimization algorithms evaluate first-order (and occasionally higher-order) response derivatives with respect to model parameters, these algorithms serve different purposes and conceptually differ from each other.

Response sensitivities to model parameters, which are computed using the methods of sensitivity analysis, are needed in many activities, including: (i) determining the effects of parameter variations on the system’s behavior; (ii) ranking the importance of model parameters in influencing the system response under consideration; (iii) quantifying uncertainties induced in responses by parameter uncertainties (e.g., by using the method of “propagation of uncertainties”); (iv) prioritizing possible improvements for the system under consideration and possibly reducing conservatism and redundancy; (v) validating the model under consideration by comparison to experiments, while taking into account both experimental and model uncertainties; and (vi) performing “predictive modeling” (which includes data assimilation and model calibration) for the purpose of obtaining best-estimate predicted results with reduced predicted uncertainties, using, e.g., the methodologies presented in [3,4,5] and in references therein.

First-order response sensitivities can be computed by using either statistical or deterministic methods; a comparative review of the most popular of these methods was presented in [6,7]. It is known that sensitivities cannot be computed exactly using statistical methods; this can only be performed with deterministic methods. Furthermore, for a system comprising

T P

parameters, the computation of the first-order sensitivities by statistical methods requires

O (T P)

large-scale computations while the adjoint sensitivity analysis originally developed by Cacuci [1,2] requires a single large-scale computation per response.

Since nonlinear operators do not admit bona-fide adjoint operators (only a linearized form of a nonlinear operator admits an adjoint operator), responses of nonlinear models can depend only on the forward functions. In contradistinction, model responses for linear systems may involve the solutions of both the forward and the adjoint linear models that correspond to the respective physical system. Hence, responses for linear systems cannot always be treated as particular cases of nonlinear systems but there is a need to develop a dedicated sensitivity analysis methodology for response-coupled forward and adjoint linear systems. The general methodology for computing arbitrarily high-order sensitivities for response-coupled linear forward/adjoint systems was developed by Cacuci [8,9,10,11,12]. The overwhelming impact of the higher-order (i.e., second-, third- and fourth-order) sensitivities on the model response was illustrated by Fang and Cacuci, in [13] and references therein, by means of an OECD/NEA reactor physics benchmark modeled by the linear neutron transport equation and comprising 21,976 uncertain model parameters.

The general mathematical framework for adjoint sensitivity analysis of nonlinear systems was extended by Cacuci to second-order [14] and third-order [15] sensitivities. The extension of the adjoint sensitivity analysis methodology to include the computation of sensitivities of model responses with respect to imprecisely known domain boundaries was presented in [16]. By extending all of the previous theoretical developments of the adjoint sensitivity analysis methodology, this work presents the theoretical framework for the fourth-order comprehensive sensitivity analysis methodology for nonlinear systems (abbreviated as “4th-CASAM-N”). This work is structured as follows. Section 2 presents the mathematical formulation of the that would define the computational model of a generic nonlinear physical system, including the definition of a generic response which depends on the model’s state variables and parameters, which are considered to be uncertain (i.e., not known precisely). Besides initial conditions and correlations, the model parameters are also considered to include geometrical parameters that describe the system’s boundaries and internal interfaces. Section 3 presents the 4th-CASAM-N methodology while Section 4 summarizes the salient features of this novel methodology. The application of the principles underlying the 4th-CASAM-N is illustrated in an accompanying work [17] by means of a paradigm nonlinear heat conduction model.

2. Generic Mathematical Modeling of a Nonlinear Physical System

As already mentioned, the computational model of a physical system comprises that relate the system’s independent variables and parameters to the system’s state variables. The model parameters usually stem from processes that are external to the system under consideration and are seldom, if ever, known precisely. The known characteristics of the model parameters may include their nominal (expected/mean) values and, possibly, higher-order moments or cumulants (i.e., variance/covariances, skewness, kurtosis), which are usually determined from experimental data and/or processes external to the physical system under consideration. Occasionally, only inequality and/or equality constraints that delimit the ranges of the system’s parameters are known. Without loss of generality, the imprecisely known model parameters can be considered to be real-valued scalar quantities. These model parameters will be denoted as

α_{1}

,…,

α_{T P}

, where

T P

denotes the “total number of imprecisely known parameters” underlying the model under consideration. For subsequent developments, it is convenient to consider that these parameters are components of a “vector of parameters” denoted as

α ≜ {(α_{1}, \dots, α_{T P})}^{†} \in E_{α} \in ℝ^{T P}

, where

E_{α}

is also a normed linear space and where

ℝ^{T P}

denotes the

T P

-dimensional subset of the set of real scalars. The components of the

T P

-dimensional column vector

α \in ℝ^{T P}

are considered to include imprecisely known geometrical parameters that characterize the physical system’s boundaries in the phase space of the model’s independent variables. Matrices will be denoted using capital bold letters while vectors will be denoted using either capital or lower-case bold letters. The symbol “

≜

” will be used to denote “is defined as” or “is by definition equal to”. Transposition will be indicated by a dagger

(†)

superscript.

The generic nonlinear model is considered to comprise

T I

independent variables which will be denoted as

x_{i}, i = 1, \dots, T I

, and which are considered to be components of a

T I

-dimensional column vector denoted as

x ≜ {(x_{1}, \dots, x_{T I})}^{†} \in ℝ^{T I}

, where the sub/superscript “TI” denotes the “total number of independent variables”. The vector

x \in ℝ^{T I}

of independent variables is considered to be defined on a phase-space domain which will be denoted as

Ω (α)

and which is defined as follows:

Ω (α) ≜ \{- \infty \leq λ_{i} (α) \leq x_{i} \leq ω_{i} (α) \leq \infty; i = 1, \dots, T I\}

. The lower boundary point of an independent variable is denoted as

λ_{i} (α)

and the corresponding upper boundary point is denoted as

ω_{i} (α)

. A typical example of boundaries that depend on both geometrical parameters and material properties are the “boundaries facing vacuum” in models based on diffusion theory, where conditions are imposed on the “extrapolated boundary” of the respective spatial domain. The “extrapolated boundary” depends both on the imprecisely known physical dimensions of the problem’s domain and also on the medium’s properties, such as atomic number densities and microscopic transport cross sections. The boundary of

Ω (α)

, which will be denoted as

\partial Ω (α)

, comprises the set of all of the endpoints

λ_{i} (α), ω_{i} (α), i = 1, \dots, T I,

of the respective intervals on which the components of

x

are defined, i.e.,

\partial Ω (α) ≜ \{λ_{i} (α) \cup ω_{i} (α), i = 1, \dots, T I\}

.

A nonlinear physical system can be generally modeled by means of coupled which can be represented in operator form as follows:

N [u (x), α] = Q (x, α), x \in Ω_{x} (α) .

(1)

The quantities which appear in Equation (1) are defined as follows:

1.

u (x) ≜ {[u_{1} (x), \dots, u_{T D} (x)]}^{†}

is a

T D

-dimensional column vector of dependent variables; the abbreviation “

T D

” denotes “total number of dependent variables”. The functions

u_{i} (x), i = 1, \dots, T D

denote the system’s “dependent variables” (also called “state functions”);

u (x) \in E_{u}

, where

E_{u}

is a normed linear space over the scalar field

F

of real numbers.

2.

N [u (x); α] ≜ {[N_{1} (u; α), \dots, N_{T D} (u; α)]}^{†}

denotes a

T D

-dimensional column vector. The components

N_{i} (u; α), i = 1, \dots, T D

are operators (including differential, difference, integral, distributions, and/or finite or infinite matrices) acting (usually) nonlinearly on the dependent variables

u (x)

, the independent variables

x

, and the model parameters

α

. The mapping

N (u; α)

is defined on the combined domains of the model’s parameters and state functions, i.e.,

N : D \subset E \to E_{Q}

, where

D = D_{u} \oplus D_{α}

,

D_{u} \subset E_{u}

,

D_{α} \subset E_{α}

,

E = E_{u} \oplus E_{α}

.

3.

Q (x, α) ≜ {[q_{1} (x; α), \dots ., q_{T D} (x; α)]}^{†}

is a

T D

-dimensional column vector which represents inhomogeneous source terms, which usually depend nonlinearly on the uncertain parameters

α

. The vector

Q (x, α)

is defined on a normed linear space denoted as

E_{Q}

, i.e.,

Q \in E_{Q}

.

The equalities in this work are considered to hold in the weak (“distributional”) sense. The right sides of Equation (1) and of other various to be derived in this work may contain “generalized functions/functionals”, particularly Dirac distributions and derivatives thereof.

Boundary and/or initial conditions must also be provided if differential operators appear in Equation (1). In operator form, these boundaries and/or initial conditions are represented as follows:

B [u (x); α; x] - C (x, α) = 0, x \in \partial Ω_{x} (α) .

(2)

where the column vector

0

has

T D

components, all of which are identically zero, i.e.,

0 ≜ [ζ_{1}, \dots, ζ_{T D}]; ζ_{i} \equiv 0, i = 1, \dots, T D .

(3)

In Equation (2), the components

B_{i} (u; α), i = 1, \dots, T D

of

B (u; α) ≜ {[B_{1} (u; α), \dots, B_{T D} (u; α)]}^{†}

are nonlinear operators in

u (x)

and

α

, which are defined on the boundary

\partial Ω_{x} (α)

of the model’s domain

Ω_{x} (α)

. The components

C_{i} (x; α), i = 1, \dots, T D

of

C (x, α) ≜ {[C_{1} (x; α), \dots, C_{T D} (x; α)]}^{†}

comprise inhomogeneous boundary sources which are nonlinear functions of

α

.

Solving Equations (1) and (2) at the nominal parameter values, denoted as

α^{0} ≜ {[α_{1}^{0}, \dots, α_{i}^{0}, .., α_{T P}^{0}]}^{†}

, provides the “nominal solution”

u^{0} (x)

, i.e., the vectors

u^{0} (x)

and

α^{0}

satisfy the following:

N [u^{0} (x); α^{0}] = Q (x, α^{0}), x \in Ω_{x},

(4)

B [u^{0} (x); α^{0}; x] - C (x, α^{0}) = 0, x \in \partial Ω_{x} (α^{0}) .

(5)

The superscript “0” will be used throughout this work to denote “nominal values”.

The results computed using a mathematical model are customarily called “model responses” (or “system responses” or “objective functions” or “indices of performance”). In general, a function-valued (i.e., operator-type) response

R [u (x); α]

can be represented by a spectral expansion in multidimensional orthogonal polynomials or Fourier series of the form:

R [u (x); α] = \sum_{m_{1}} \dots \sum_{m_{T I}} c_{m_{1} \dots m_{T I}} [u (x); α] P_{m_{1}} (x_{1}) P_{m_{2}} (x_{2}) \dots P_{m_{T I}} (x_{T I}),

(6)

where the quantities

P_{m_{i}} (x_{i})

,

i = 1, \dots, T I

denote the corresponding spectral functions (e.g., orthogonal polynomials or Fourier exponential/trigonometric functions) and where the spectral Fourier coefficients

c_{m_{1} \dots m_{T I}} [u (x); α]

are defined as follows:

c_{m_{1} \dots m_{T I}} [u (x); α] ≜ \int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{i} (α)}^{ω_{i} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} R [u (x); α] P_{m_{1}} (x_{1}) \dots P_{m_{i}} (x_{i}) \dots P_{m_{T I}} (x_{T I}) d x_{1} \dots d x_{i} \dots d x_{T I}

(7)

The coefficients

c_{m_{1} \dots m_{T I}} [u (x); α]

can themselves be considered as “model responses” since the spectral polynomials

P_{m_{i}} (x_{i})

are perfectly well known while the expansion coefficients will contain all of the dependencies (directly or indirectly—through the state functions) of the respective response on the imprecisely known model parameters. This way, the sensitivity analysis of an operator-valued response

R [u (x); α]

can be reduced to the sensitivity analysis of the scalar-valued responses

c_{m_{1} \dots m_{T I}} [u (x); α]

.

A measurement of a physical quantity that depends on the model’s state functions and parameters can be considered to be a response denoted as

R_{p} [u (x); α]

, which is to be evaluated at

x = x^{p} (α)

, where

x^{p} (α) ≜ {[x_{1}^{p} (α), \dots x_{k}^{p} (α), \dots, x_{T I}^{p} (α)]}^{†}

denotes the location in the phase space of the specific “measurement point”. Such a measurement (or measurement-like) response can be represented mathematically as follows:

R_{p} [u (x); α] ≜ \int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} F [u (x); α; x] δ [x_{1} - x_{1}^{p} (α)] \dots δ [x_{T I} - x_{T I}^{p} (α)] d x_{1} \dots d x_{T I}

(8)

where the function

F [u (x); α; x]

denotes the mathematical dependence of the measurement device on the model’s dependent variable(s), and where the quantity

δ [x_{i} - x_{i}^{p} (α)]

denotes the Dirac delta functional. The measurement’s location in phase space,

x^{p} (α)

, may itself be afflicted by measurement (experimental) uncertainties. Hence, it is convenient to consider the components of

x^{p} (α)

to be included among the components of the vector

α

of model parameters, even though

x^{p} (α)

appears only in the definition of the response but does not appear in Equations (1) and (2), which mathematically define the physical model. Thus, the physical “system” is defined to comprise both the system’s computational model and the system’s response. In most cases, the coordinates

x_{k}^{p} (α)

,

k = 1, \dots, T I

will be independent (albeit uncertain) model parameters, in which case

\partial x_{k}^{p} (α) / \partial α_{n} = 1, i f α_{n} \equiv x_{k}^{p}

and

\partial x_{k}^{p} (α) / \partial α_{n} = 0, i f α_{n} \neq x_{k}^{p}

.

The representations shown in Equations (6)–(8) indicate that model responses can be fundamentally analyzed by considering the following generic integral representation:

R [u (x); α] ≜ \int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} S [u (x); α; x] d x_{1} \dots d x_{T I}

(9)

where

S [u (x); α]

is suitably differentiable nonlinear function of

u (x)

and of

α

. It is important to note that the components of

α

not only include parameters that appear in the defining the computational model per se, i.e., in Equations (1) and (2), but also include parameters that specifically occur only in the definition of the response under consideration.

It is also important to note that the system’s definition domain,

Ω (α)

, in phase space is considered to be imprecisely known, subject to uncertainties in the components of the vector of model parameters

α

. Therefore, the system domain’s boundary,

\partial Ω (α)

, as well as the model response

R [u (x); α]

, will be affected by the boundary uncertainties that affect the endpoints

λ_{i} (α), ω_{i} (α), i = 1, \dots, T I

. Such boundary uncertainties stem most often from manufacturing uncertainties.

3. The Fourth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (4th-CASAM-N)

The starting point for building the mathematical framework of the novel 4th-CASAM-N is provided by the mathematical framework of the “first-order comprehensive adjoint sensitivity analysis methodology for nonlinear systems” presented in [16] which considered that the domain’s boundaries are also subject to uncertainties, thus generalizing all previous work on first-order adjoint sensitivity analysis. The 1st-CASAM-N is briefly reviewed in Section 3.1. Section 3.2 presents the 2nd-CASAM-N, which generalizes the material presented in [9,14,15]. Section 3.3 presents the 3rd-CASAM-N, comprising original material that provides the basis for the development of the 4th-CASAM, which is presented in Section 3.4.

3.1. The First-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (1st-CASAM-N)

The model and boundary parameters

α

are considered to be uncertain quantities, having unknown true values. The nominal (or mean) parameter vales

α^{0}

are considered to be known, and these will differ from the true values by quantities denoted as

δ α ≜ (δ α_{1}, \dots, δ α_{T P})

, where

δ α_{i} ≜ α_{i} - α_{i}^{0}

. Since the forward state functions

u (x)

are related to the model and boundary parameters

α

through Equations (1) and (2), it follows that the variations

δ α

in the model and boundary parameters will cause corresponding variations

v^{(1)} (x) ≜ {[δ u_{1} (x), \dots, δ u_{T D} (x)]}^{†}

around the nominal solution

u^{0} (x)

in the forward state functions. In turn, the variations

δ α

and

v^{(1)} (x)

will induce variations in the system’s response. Cacuci [1,2] has shown that the most general definition of the sensitivity of an operator-valued model response

R (e)

, where

e ≜ (α, u) \in E

, to variations

h ≜ (δ α, v^{(1)})

in the model parameters and state functions in a neighborhood around the nominal functions and parameter values

e^{0} ≜ (α^{0}, u^{0}) \in E

, is given by the first-order Gateaux (G) variation, which will be denoted as

δ R (e^{0}; h)

and is defined as follows:

δ R (e^{0}; h) ≜ {\{\frac{d}{d ε} [R (e^{0} + ε h)]\}}_{ε = 0} ≜ \lim_{ε \to 0} \frac{R (e^{0} + ε h) - R (e^{0})}{ε}

(10)

for a scalar

ε \in F

and for all (i.e., arbitrary) vectors

h \in E = E_{α} \times E_{u}

in a neighborhood

(e^{0} + ε h)

around

e^{0} = (α^{0}, u^{0}) \in E

. The G variation

δ R (e^{0}; h)

is an operator defined on the same domain as

R (e)

and has the same range as

R (e)

. The G variation

δ R (e^{0}; h)

satisfies the relation:

R (e^{0} + ε h) - R (e^{0}) = δ R (e^{0}; h) + Δ (h)

, with

\lim_{ε \to 0} [Δ (ε h)] / ε = 0

. The existence of the G variation

δ R (e^{0}; h)

does not guarantee its numerical computability. Numerical methods most often require that

δ R (e^{0}; h)

is linear in

h ≜ (δ α; v^{(1)})

in a neighborhood

(e^{0} + ε h)

around

e^{0} = (u^{0}, α^{0}) \in E

. Formally, the necessary and sufficient conditions for the G variation

δ R (e^{0}; h)

of a nonlinear operator

R (e)

to be linear and continuous in

h

in a neighborhood

(e^{0} + ε h)

around

e^{0} = (α^{0}, u^{0})

and therefore admit a total first-order G derivative, are as follows:

(i): $R (e)$ satisfies a weak Lipschitz condition at $e^{0}$ , namely:

$‖R (e^{0} + ε h) - R (e^{0})‖ \leq k ‖ε (e^{0})‖, k < \infty$

(11)
(ii): $R (e)$ satisfies the following condition

$R (e^{0} + ε h_{1} + ε h_{2}) - R (e^{0} + ε h_{1}) - R (e^{0} + ε h_{2}) + R (e^{0}) = o (ε); h_{1}, h_{2} \in E; ε \in F .$

(12)

In practice, the relation provided in Equations (11) and (12) are seldom used directly since the computation of the expression on the right side of Equation (10) reveals if the respective expression is linear (or not) in

h ≜ (δ α, v^{(1)})

and, hence, in

v^{(1)} (x)

. Numerical methods (e.g., Newton’s method, and variants thereof) for solving Equations (1) and (2) also require the existence of the first-order G derivatives of original model, in which case the components of the operators which appear in these must also satisfy the conditions provided in Equations (11) and (12). Therefore, the conditions provided in Equations (11) and (12) are henceforth considered to be satisfied by the operators which underly the physical system modeled by Equations (1) and (2), as well as by the model responses.

When the first-order G differential

δ R (e^{0}; h)

satisfies the conditions provided in Equations (11) and (12), it can be written in the following form:

δ R (e^{0}; h) \equiv {\{δ R [u (x); α; v^{(1)} (x); δ α]\}}_{α^{0}} = {\{\frac{\partial R (u; α)}{\partial u}\}}_{α^{0}} v^{(1)} (x) + {\{\frac{\partial R (u; α)}{\partial α}\}}_{α^{0}} δ α,

(13)

where

\frac{\partial []}{\partial u} v^{(1)} (x) ≜ \sum_{i = 1}^{T D} \frac{\partial []}{\partial u_{i} (x)} δ u_{i} (x)

(14)

\frac{\partial []}{\partial α} δ α ≜ \sum_{i = 1}^{T P} \frac{\partial []}{\partial α_{i}} δ α_{i}

(15)

This, the quantities

\partial R (u; α) / \partial u

and

\partial R (u; α) / \partial α

in Equation (13) denote the partial G derivatives of

R (e)

with respect to

u

and

α

, evaluated at the nominal parameter values (and hence also nominal values of the state functions). The notation

{\{\}}_{α^{0}}

will be used in this work to indicate that the quantity enclosed within the bracket is to be evaluated at the respective nominal parameter and state functions values. The quantity

{\{\partial R (u; α) / \partial α\}}_{α^{0}} δ α

is called the “direct-effect term” because it arises directly from parameter variations

δ α

. The direct-effect term can be computed once the nominal values

e^{0} = (u^{0}, α^{0})

are available. The quantity

{\{\partial R (u; α) / \partial u\}}_{α^{0}} v^{(1)} (x)

is called the “indirect-effect term” because it arises indirectly, through the variations in the state functions (which are caused through the model by parameter variations). The indirect-effect term can be quantified only after having determined the variations

v^{(1)} (x)

in terms of the variations

δ α

.

In particular, the first-order Gateaux differential

{\{δ R [u (x); α; v^{(1)} (x); δ α]\}}_{α^{0}}

of the generic response

R (u, α)

defined in Equation (9) has the following expression:

\begin{array}{l} {\{δ R [u (x); α; v^{(1)} (x); δ α]\}}_{α^{0}} \\ ≜ \{\frac{d}{d ε} \int_{λ_{1} (α^{0} + ε δ α)}^{ω_{1} (α^{0} + ε δ α)} \dots \int_{λ_{T I} (α^{0} + ε δ α)}^{ω_{T I} (α^{0} + ε δ α)} S (u^{0} + ε v^{(1)}, α^{0} + ε δ α; x) d x_{1} \dots d x_{T I}\} \\ = {\{δ R [u (x); α; δ α]\}}_{d i r} + {\{δ R [u (x); α; v^{(1)} (x)]\}}_{i n d}, \end{array}

(16)

where the “indirect-effect” term

{\{δ R [u (x); α; v^{(1)} (x)]\}}_{i n d}

comprises all dependencies on

v^{(1)} (x)

and is defined as follows:

{\{δ R [u (x); α; v^{(1)} (x)]\}}_{i n d} ≜ \int_{λ_{1} (α^{0})}^{ω_{1} (α^{0})} \dots \int_{λ_{T I} (α^{0})}^{ω_{T I} (α^{0})} {\{\frac{\partial S (u; α; x)}{\partial u}\}}_{α^{0}} v^{(1)} (x) d x_{1} \dots d x_{T I},

(17)

and where the “direct-effect” term

{\{δ R [u (x); α; δ α]\}}_{d i r}

comprises all dependencies on

δ α

and is defined as follows:

{\{δ R [u (x); α; δ α]\}}_{d i r} ≜ \sum_{j_{1} = 1}^{T P} {\{R^{(1)} [j_{1}; u (x); α]\}}_{d i r} δ α_{j_{1}},

(18)

with

\begin{array}{l} {\{R^{(1)} [j_{1}; u (x); α]\}}_{d i r} ≜ {\{\int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} \frac{\partial S (u; α; α)}{\partial α_{j_{1}}} d x_{1} … d x_{T I}\}}_{α^{0}} \\ + \sum_{j = 1}^{T I} {\{\int_{λ_{1} (α)}^{ω_{1} (α)} … \int_{λ_{j - 1} (α)}^{ω_{j - 1} (α)} \int_{λ_{j + 1} (α)}^{ω_{j + 1} (α)} … \int_{λ_{T I} (α)}^{ω_{T I} (α)} S [u (x_{1}, ., ω_{j} (α), ., x_{N_{x}}); α] \frac{\partial ω_{j} (α)}{\partial α_{j_{1}}} d x_{1} … d x_{T I}\}}_{α^{0}} \\ - \sum_{j = 1}^{T I} {\{\int_{λ_{1} (α)}^{ω_{1} (α)} … \int_{λ_{j - 1} (α)}^{ω_{j - 1} (α)} \int_{λ_{j + 1} (α)}^{ω_{j + 1} (α)} … \int_{λ_{T I} (α)}^{ω_{T I} (α)} S [u (x_{1}, ., λ_{j} (α), ., x_{N_{x}}); α] \frac{\partial λ_{j} (α)}{\partial α_{j_{1}}} d x_{1} … d x_{T I}\}}_{α^{0}} . \end{array}

(19)

The first-order relationship between the vectors

v^{(1)} (x)

and

δ α

is determined by taking the G differentials of Equations (1) and (2). Thus, applying the definition of the G differential to Equations (1) and (2) yields the following:

{\{\frac{d}{d ε} N [u^{0} + ε v^{(1)} (x); α^{0} + ε δ α]\}}_{ε = 0} = {\{\frac{d}{d ε} Q (x, α^{0} + ε δ α)\}}_{ε = 0},

(20)

{\{\frac{d}{d ε} B [u^{0} + ε v^{(1)} (x); α^{0} + ε δ α]\}}_{ε = 0} - {\{\frac{d}{d ε} C (x, α^{0} + ε δ α)\}}_{ε = 0} = 0 .

(21)

Carrying out the differentiations with respect to

ε

in Equations (20) and (21), and setting

ε = 0

in the resulting expressions yields the following:

{\{V^{(1)} (u; α) v^{(1)} (x)\}}_{α^{0}} = {\{q^{(1)} (u; α; δ α)\}}_{α^{0}}, x \in Ω_{x},

(22)

{\{b_{V}^{(1)} (u; α; v^{(1)}; δ α)\}}_{α^{0}} = 0, x \in \partial Ω_{x} (α^{0}) .

(23)

In Equations (22) and (23), the superscript “(1)” indicates “1st-Level” and the various quantities which appear in these are defined as follows:

V^{(1)} (u; α) ≜ \{\frac{\partial N (u; α)}{\partial u}\} ≜ (\begin{matrix} \frac{\partial N_{1}}{\partial u_{1}} & \dots & \frac{\partial N_{1}}{\partial u_{T D}} \\ ⋮ & ⋱ & ⋮ \\ \frac{\partial N_{T D}}{\partial u_{1}} & \dots & \frac{\partial N_{T D}}{\partial u_{T D}} \end{matrix});

(24)

q^{(1)} (u; α; δ α) ≜ \frac{\partial [Q (α) - N (u; α)]}{\partial α} δ α ≜ \sum_{j_{1} = 1}^{T P} s_{V}^{(1)} (j_{1}; u; α) δ α_{j_{1}};

(25)

s_{V}^{(1)} (j_{1}; u; α) ≜ \frac{\partial [Q (α) - N (u; α)]}{\partial α_{j_{1}}};

(26)

{\{b_{V}^{(1)} (u; α; v^{(1)}; δ α)\}}_{α^{0}} ≜ {\{\frac{\partial B (u; α)}{\partial u} v^{(1)}\}}_{α^{0}} + \sum_{j_{1} = 1}^{T P} b_{V}^{(1)} (j_{1}; u; α) δ α_{j_{1}};

(27)

b_{V}^{(1)} (j_{1}; u; α) ≜ \frac{\partial [B (u; α) - C (α)]}{\partial α_{j_{1}}} .

(28)

The system comprising Equations (22) and (23) is called the “1st-Level Variational Sensitivity System” (1st-LVSS). In order to determine the solutions of the 1st-LVSS that would correspond to every parameter variation

δ α_{j_{1}}

,

j_{1} = 1, \dots, T P

, the 1st-LVSS would need to be solved

T P

times, with distinct right sides for each

δ α_{j_{1}}

, as follows:

{\{V^{(1)} (u; α) v^{(1)} (j_{1}; x)\}}_{α^{0}} = {\{s_{V}^{(1)} (j_{1}; u; α)\}}_{α^{0}}; j_{1} = 1, \dots, T P; x \in Ω_{x},

(29)

{\{\frac{\partial B (u; α)}{\partial u} v^{(1)} (j_{1}; x)\}}_{α^{0}} = b_{V}^{(1)} (j_{1}; α; u); j_{1} = 1, \dots, T P; x \in \partial Ω_{x} (α^{0}) .

(30)

Subsequently, the solutions

v^{(1)} (j_{1}; x)

could be used, in turn, in Equation (17) to compute the indirect-effect term corresponding to each parameter variation

δ α_{j_{1}}

,

j_{1} = 1, \dots, T P

, to obtain the following contribution from the indirect-effect term to the respective partial sensitivity of the response with respect to the parameter

α_{j_{1}}

:

{\{δ R [j_{1}; u (x); α; v^{(1)} (j_{1}; x)]\}}_{i n d} ≜ {\{\int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} [\frac{\partial S (u; α; x)}{\partial u} v^{(1)} (j_{1}; x)] d x_{1} \dots d x_{T I}\}}_{α^{0}} .

(31)

Adding the contribution from the indirect-effect term obtained in Equation (31) to the contribution from the direct-effect term provided in Equation (18) yields the following expression for the sensitivity (i.e., partial G derivative)

\partial R [u (x); α] / \partial α_{j_{1}}

of the response

R [u (x); α]

to the parameter

α_{j_{1}}

,

j_{1} = 1, \dots, T P

:

\frac{\partial R [u (x); α]}{\partial α_{j_{1}}} = {\{R^{(1)} [j_{1}; u (x); α]\}}_{d i r} + {\{δ R [j_{1}; u (x); α; v^{(1)} (j_{1}; x)]\}}_{i n d} .

(32)

The quantities

\partial R [u (x); α] / \partial α_{j_{1}}

are independent of the parameter variations

δ α_{i_{1}}

and represent the first-order partial sensitivities (first-order partial G derivatives) of the response

R (e)

with respect to each of the model parameters

α_{j_{1}}

,

j_{1} = 1, \dots, T P

, evaluated at the nominal values

e^{0} = (α^{0}, u^{0})

. Computing the response sensitivities by using the solutions

v^{(1)} (j_{1}; x)

,

j_{1} = 1, \dots, T P

, of the 1st-LVSS requires

T P

large-scale forward computations in order to determine the functions

v^{(1)} (j_{1}; x)

,

j_{1} = 1, \dots, T P

. Since most problems of practical interest are characterized by many parameters (i.e.,

α

has many components) and comparatively few responses, it becomes prohibitively expensive to solve repeatedly the 1st-LVSS in order to determine the functions

v^{(1)} (j_{1}; x)

,

j_{1} = 1, \dots, T P

. Even though the 1st-LVSS contains first-order parameter and state-functions variations, it is called “first-level” (rather than “first-order”) in anticipation of determining second-order sensitivities, which will use “second-level” forward and adjoint systems. These “second-level” systems will not be called “second-order” because they will not contain second-order parameter and/or state-function variations, but will also contain only first-order variations, even though they will be used for determining second-order sensitivities. Similar terminology, i.e., “third-level” (as opposed to “third-order”) forward/adjoint systems, will be used for determining the third-order sensitivities, and so on.

In most practical situations, the number of model parameters significantly exceeds the number of functional responses of interest, i.e.,

T R ≪ T P

, so it would be advantageous to perform just

T R

(rather than

T P

) computations. The goal of the “1st-order comprehensive adjoint sensitivity analysis methodology for nonlinear systems (1st-CASAM-N)” is to compute exactly and efficiently the “indirect effect term” defined in Equation (17) without needing to compute explicitly the vectors

v^{(1)} (j_{1}; x)

,

j_{1} = 1, \dots, T P

. The qualifier “comprehensive” is meant to indicate that that the 1st-CASAM-N considers that the internal and external boundaries

\partial Ω_{x} (α)

of the phase-space domain depend on the uncertain model parameters

α

and are thereby imprecisely known, subject to uncertainties. Thus, the 1st-CASAM-N represents a generalization of the pioneering works by Cacuci [1,2] that conceived the “adjoint sensitivity analysis methodology”, in which the domain boundary was considered to be perfectly well known, free of uncertainties. The fundamental ideas underlying the 1st-CASAM-N are as conceived by Cacuci [1,2], aiming at eliminating the appearance of the vectors

v^{(1)} (j_{1}; x)

from the expression of the indirect-effect term defined in Equation (17). This elimination is achieved by expressing the right side of Equation (17) in terms of the solutions of the “1st-Level Adjoint Sensitivity System (1st-LASS)”, the construction of which requires the introduction of adjoint operators. Adjoint operators can be defined in Banach spaces but are most useful in Hilbert spaces. Since real Hilbert spaces provide the natural mathematical setting for computational purposes, the derivations presented in this section are set in real (as opposed to complex) Hilbert spaces, without affecting the generality of the concepts presented herein. Thus, the spaces

E_{u}

and

E_{Q}

are henceforth considered to be self-dual Hilbert spaces and will be denoted as

H_{1} (Ω_{x})

. The inner product of two vectors

u^{(a)} (x) \in H_{1}

and

u^{(b)} (x) \in H_{1}

will be denoted as

{〈u^{(a)}, u^{(b)}〉}_{1}

, and is defined as follows:

{〈u^{(a)}, u^{(b)}〉}_{1} ≜ {\{\int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} [u^{(a)} (x) \cdot u^{(b)} (x)] d x_{1} \dots d x_{T I}\}}_{α^{0}}

(33)

where the dot indicates the “scalar product of two vectors” defined as follows:

u^{(a)} (x) \cdot u^{(b)} (x) ≜ \sum_{i = 1}^{T D} u_{i}^{(a)} (x) u_{i}^{(b)} (x)

. It is important to note that the inner product defined in Equation (33) is continuous in

α

, i.e., it holds at any value particular value of

α

, including at the nominal parameter values

α^{0}

.

The construction of the 1st-LASS commences by noting that the vector

v^{(1)} (x)

itself is independent of the index

j_{1} = 1, \dots, T P

, since

v^{(1)} (x) = \sum_{j_{1} = 1}^{T P} v^{(1)} (j_{1}; x) .

(34)

The next step is to form the inner product of Equation (22) with a vector

a^{(1)} (x) ≜ [a_{1}^{(1)} (x), \dots, a_{T D}^{(1)} (x)] \in H_{1}

, where the superscript “(1)” indicates “1st-Level”:

{\{{〈a^{(1)} (x), V^{(1)} (u; α) v^{(1)} (x)〉}_{1}\}}_{α^{0}} = {\{{〈a^{(1)} (x), q^{(1)} (u; α; δ α)〉}_{1}\}}_{α^{0}} .

(35)

Using the definition of the adjoint operator in

H_{1} (Ω_{x})

, the left side of Equation (35) is transformed as follows:

{\{{〈a^{(1)}, V^{(1)} (u; α) v^{(1)}〉}_{1}\}}_{α^{0}} = {\{{〈A^{(1)} (u; α) a^{(1)}, v^{(1)}〉}_{1}\}}_{α^{0}} + {\{{[P^{(1)} (u; α; a^{(1)}; v^{(1)})]}_{\partial Ω_{x}}\}}_{α^{0}},

(36)

where

{[P^{(1)} (u; α; a^{(1)}; v^{(1)})]}_{\partial Ω_{x}}

denotes the associated bilinear concomitant evaluated on the space/time domain’s boundary

\partial Ω_{x} (α^{0})

and where

A^{(1)} (u; α)

is the operator adjoint to

V^{(1)} (u; α)

, i.e.,

A^{(1)} (u; α) ≜ {[V^{(1)} (u; α)]}^{*}

. The symbol

{[]}^{*}

will be used in this work to indicate “adjoint” operator. In certain situations, it might be computationally advantageous to include certain boundary components of

{[P^{(1)} (u; α; a^{(1)}; v^{(1)})]}_{\partial Ω_{x}}

into the components of

A^{(1)} (u; α)

.

The domain of

A^{(1)} (u; α)

is determined by selecting appropriate adjoint boundary and/or initial conditions, which will be denoted in operator form as:

{\{b_{A}^{(1)} (u; a^{(1)}; α)\}}_{α^{0}} = 0, x \in \partial Ω_{x} (α^{0}) .

(37)

The above boundary conditions for

A^{(1)} (u; α)

are usually inhomogeneous, i.e.,

b_{A}^{(1)} (0; 0; α) \neq 0

, and are obtained by imposing the following requirements:

1. They must be independent of unknown values of

v^{(1)} (x)

and

δ α

;

2. The substitution of the boundary and/or initial conditions represented by Equations (23) and (37) into the expression of

{\{{[P^{(1)} (u; α; a^{(1)}; v^{(1)})]}_{\partial Ω_{x}}\}}_{α^{0}}

must cause all terms containing unknown values of

v^{(1)} (x)

to vanish.

Constructing the adjoint initial and/or boundary conditions for

A^{(1)} (u; α)

as described above and implementing them together with the variational boundary and/or initial conditions represented by Equation (23) into Equation (35) reduces the bilinear concomitant

{\{{[P^{(1)} (u; α; a^{(1)}; v^{(1)})]}_{\partial Ω_{x}}\}}_{α^{0}}

to a quantity that will contain boundary terms involving only known values of

δ α

,

α^{0}

,

u^{0}

, and

ψ^{(1)}

; this quantity will be denoted by

{\{{[{\hat{P}}^{(1)} (u; α; a^{(1)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}}

. In general, the boundary terms represented by

{\{{[{\hat{P}}^{(1)} (u; α; a^{(1)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}}

do not vanish automatically. In certain cases, however,

{\{{[{\hat{P}}^{(1)} (u; α; a^{(1)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}}

may vanish automatically or it may be forced to vanish by considering appropriately constructed extensions of the adjoint operator

A^{(1)} (α, u)

; however, such extensions are seldom needed in practice. Since

{\{{[{\hat{P}}^{(1)} (u; α; a^{(1)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}}

is linear in

δ α

, it can be expressed in the following form:

{[{\hat{P}}^{(1)} (u; α; a^{(1)}; δ α)]}_{\partial Ω_{x}} = \sum_{j_{1} = 1}^{T P} [\partial {\hat{P}}^{(1)} (u; α; a^{(1)}) / \partial α_{j_{1}}] δ α_{j_{1}} .

Implementing the forward and adjoint boundary and/or initial conditions given in Equations (23) and (37) into Equation (36) transforms the later into the following relation:

{\{{〈A^{(1)} (u; α) a^{(1)}, v^{(1)}〉}_{1}\}}_{α^{0}} = {\{{〈a^{(1)}, V^{(1)} (u; α) v^{(1)}〉}_{1}\}}_{α^{0}} - {\{{[{\hat{P}}^{(1)} (u; α; a^{(1)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}},

(38)

Replacing the quantity

V^{(1)} (α; u) v^{(1)}

in the first term on the right side of Equation (38) by the right side of Equation (22) yields the following relation:

{\{{〈A^{(1)} (u; α) a^{(1)}, v^{(1)}〉}_{1}\}}_{α^{0}} = {\{{〈a^{(1)}, q^{(1)} (u; α; δ α)〉}_{1}\}}_{α^{0}} - {\{{[{\hat{P}}^{(1)} (u; α; a^{(1)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}} .

(39)

The definition of the function

a^{(1)} (x)

will now be completed by requiring that the left side of Equation (39) is the same as the indirect-effect term defined in Equation (17), which is achieved by imposing the following relationship:

{\{A^{(1)} (u; α) a^{(1)} (x)\}}_{α^{0}} = {\{\partial S (u; α) / \partial u\}}_{α^{0}} ≜ s_{A}^{(1)} [u (x); α], x \in Ω_{x},

(40)

while satisfying the adjoint boundary conditions represented by Equation (37). The subscript “A” attached to the source term on the right side of Equation (40) indicates “adjoint”. Since the source

s_{A}^{(1)} [u (x); α]

may contain distributions (e.g., Dirac delta functions and derivatives thereof), the equality in Equation (40) is considered to hold in the weak sense. The well-known Riesz representation theorem ensures that the relationship in Equation (40) holds uniquely.

The results obtained in Equations (38)–(40) are now replaced in Equation (17) to obtain the following expression of the indirect-effect term as a function of

a^{(1)} (x)

:

\begin{array}{l} {\{δ R [u (x); α; v^{(1)} (x)]\}}_{i n d} = {\{{〈a^{(1)}, q^{(1)} (u; α; δ α)〉}_{1}\}}_{α^{0}} - {\{{[{\hat{P}}^{(1)} (u; α; a^{(1)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}} \\ \equiv {\{δ R [u (x); α; a^{(1)} (x); δ α]\}}_{i n d} ≜ \sum_{j_{1} = 1}^{T P} {\{R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]\}}_{i n d} δ α_{j_{1}}, \end{array}

(41)

where, for each

j_{1} = 1, \dots, T P

, the contribution of the indirect-effect term to the sensitivity of the response with respect to the parameter

α_{j_{1}}

is given by

\begin{array}{l} {\{R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]\}}_{i n d} ≜ {\{〈 a^{(1)} (x), s_{V}^{(1)} {(j_{1}; u; α) 〉}_{1} - \partial {\hat{P}}^{(1)} (u; α; a^{(1)}) / \partial α_{j_{1}}\}}_{α^{0}} \\ = {\{\int_{λ_{1} (α)}^{ω_{1} (α)} … \int_{λ_{T I} (α)}^{ω_{T I} (α)} a^{(1)} (x) \cdot \frac{\partial [Q (α) - N (u; α)]}{\partial α_{j_{1}}} d x_{1} … d x_{T I} - \frac{\partial {\hat{P}}^{(1)} (u; α; a^{(1)})}{\partial α_{j_{1}}}\}}_{α^{0}} . \end{array}

(42)

As the identity on the right side of Equation (41) indicates, the desired elimination of all unknown values of

v^{(1)} (x)

from the expression of the indirect-effect term

{\{δ R [u (x); α; v^{(1)} (x)]\}}_{i n d}

is accomplished. Instead of depending on

v^{(1)} (x)

, the indirect-effect term

{\{δ R [u (x); α; v^{(1)} (x)]\}}_{i n d}

now depends on the adjoint function

a^{(1)} (x) \in H_{1}

.

Replacing in Equation (16) the result obtained in Equation (41) together with the expression provided (18) for the direct-effect term yields the following expression

\begin{array}{l} {\{δ R [u (x); α; v^{(1)} (x); δ α]\}}_{α^{0}} = {\{δ R [u (x); α; δ α]\}}_{d i r} + {\{{〈a^{(1)}, q^{(1)} (u; α; δ α)〉}_{1}\}}_{α^{0}} \\ - {\{{[{\hat{P}}^{(1)} (u; α; a^{(1)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}} . \end{array}

(43)

The expressions of the first-order response sensitivities

{\{\partial R [u (x); α] / \partial α_{j_{1}}\}}_{α^{0}}

of the response

R [u (x); α]

with respect to the parameters

α_{j_{1}}

are obtained identifying in Equation (43) the quantities that multiply the respective parameter variations

δ α_{j_{1}}

,

j_{1} = 1, \dots, T P

. This identification yields the following expressions for the first-order response sensitivities

{\{\partial R [u (x); α] / \partial α_{j_{1}}\}}_{α^{0}}

,

j_{1} = 1, \dots, T P

, computed at the model’s nominal parameter and state function values:

\begin{array}{l} {\{\partial R [u (x); α] / \partial α_{j_{1}}\}}_{α^{0}} ≜ {\{R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]\}}_{i n d} + {\{R^{(1)} [j_{1}; u (x); α]\}}_{d i r} \\ = {\{\int_{λ_{1} (α)}^{ω_{1} (α)} … \int_{λ_{T I} (α)}^{ω_{T I} (α)} a^{(1)} (x) \cdot \frac{\partial [Q (α) - N (u; α)]}{\partial α_{j_{1}}} d x_{1} … d x_{T I} - \frac{\partial {\hat{P}}^{(1)} (u; α; a^{(1)})}{\partial α_{j_{1}}}\}}_{α^{0}} \\ + {\{\int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} \frac{\partial S (u; α; α)}{\partial α_{j_{1}}} d x_{1} \dots d x_{T I}\}}_{α^{0}} \\ + \sum_{j = 1}^{T I} {\{\int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{j - 1} (α)}^{ω_{j - 1} (α)} \int_{λ_{j + 1} (α)}^{ω_{j + 1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} S [u (x_{1}, ., ω_{j} (α), ., x_{N_{x}}); α] \frac{\partial ω_{j} (α)}{\partial α_{j_{1}}} d x_{1} \dots d x_{T I}\}}_{α^{0}} \\ - \sum_{j = 1}^{T I} {\{\int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{j - 1} (α)}^{ω_{j - 1} (α)} \int_{λ_{j + 1} (α)}^{ω_{j + 1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} S [u (x_{1}, ., λ_{j} (α), ., x_{N_{x}}); α] \frac{\partial λ_{j} (α)}{\partial α_{j_{1}}} d x_{1} \dots d x_{T I}\}}_{α^{0}} . \\ ≜ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]; j_{1} = 1, \dots, T P . \end{array}

(44)

As indicated by Equation (44), each of the first-order sensitivities

R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]

of the response

R [u (x); α]

with respect to the model parameters

α_{j_{1}}

(including boundary and initial conditions) can be computed inexpensively after having obtained the function

a^{(1)} (x) \in H_{1}

, using just quadrature formulas to evaluate the various inner products involving

a^{(1)} (x) \in H_{1}

in the expression of the indirect-effect term obtained in Equation (42). The function

a^{(1)} (x) \in H_{1}

is obtained by solving numerically Equations (37) and (40), which is the only large-scale computation needed for obtaining all of the first-order sensitivities. Equations (37) and (40) will be called the first-level adjoint sensitivity system (1st-LASS), and its solution,

a^{(1)} (x) \in H_{1} (Ω_{x})

, will be called the first-level adjoint function. It is very important to note that the 1st-LASS is independent of parameter variation

δ α_{j_{1}}

,

j_{1} = 1, \dots, T P

, and therefore needs to be solved only once, regardless of the number of model parameters under consideration. Furthermore, since Equation (40) is linear in

a^{(1)} (x)

ψ_{1, i_{1}}^{(2)} (x)

, solving it requires less computational effort than solving the original Equation (1), which is nonlinear in

u (x)

.

3.2. The Second-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (2nd-CASAM-N)

The 2nd-CASAM-N relies on the same fundamental concepts as introduced in [14], but the 2nd-CASAM-N also enables the computation of response sensitivities with respect to imprecisely known domain boundaries, thus including all possible types of uncertain parameters. Fundamentally, the second-order sensitivities are defined as the “1st-order sensitivities of the 1st-order sensitivities”. This definition stems from the inductive definition of the second-order total G differential of correspondingly differentiable function, which is also defined inductively as “the total 1st-order differential of the 1st-order total differential” of a function.

The computation of the second-order sensitivities

R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]

requires the prior computation of the original forward function

u (x)

and the first-level adjoint function

a^{(1)} (x)

. For each

j_{1} = 1, \dots, T P

, the first-order sensitivities

R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]

will be assumed to satisfy the conditions stated in Equations (11) and (12). Under these conditions, the first-order total G differential

{\{δ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; v^{(1)} (x); δ a^{(1)} (x); δ α]\}}_{α^{0}}

of

R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]

will exist in a neighborhood around the nominal values of the parameters and state functions and will have, by definition, the following expression, for each

j_{1} = 1, \dots, T P

:

\begin{array}{l} {\{δ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; v^{(1)} (x); δ a^{(1)} (x); δ α]\}}_{α^{0}} \\ ≜ {\{\frac{d}{d ε} δ R^{(1)} [j_{1}; u (x) + ε v^{(1)} (x); a^{(1)} (x) + ε δ a^{(1)} (x); α + ε δ α]\}}_{ε = 0} \\ ≜ {\{δ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; δ α]\}}_{d i r} + {\{δ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; v^{(1)} (x); δ a^{(1)} (x)]\}}_{i n d}, \end{array}

(45)

where

\begin{array}{l} {\{δ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; δ α]\}}_{d i r} ≜ {\{\frac{\partial R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]}{\partial α} δ α\}}_{α^{0}} \\ = \sum_{j_{2} = 1}^{T P} \frac{\partial R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]}{\partial α_{j_{2}}} δ α_{j_{2}}, \end{array}

(46)

and

\begin{array}{l} {\{δ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; v^{(1)} (x); δ a^{(1)} (x)]\}}_{i n d} \\ ≜ \sum_{i = 1}^{T D} \frac{\partial R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]}{\partial u_{i} (x)} δ u_{i} (x) + \sum_{i = 1}^{T D} \frac{\partial R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]}{\partial a_{i}^{(1)} (x)} δ a_{i}^{(1)} (x) \\ ≜ {\{\partial R^{(1)} [j_{1}; u (x); a^{(1)} (x); α] / \partial u\}}_{α^{0}} v^{(1)} (x) \\ + {\{\partial R^{(1)} [j_{1}; u (x); a^{(1)} (x); α] / \partial a^{(1)}\}}_{α^{0}} δ a^{(1)} (x) . \end{array}

(47)

In Equation (45), the quantities

{\{δ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; v^{(1)} (x); δ a^{(1)} (x)]\}}_{i n d}

and

{\{δ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; δ α]\}}_{d i r}

denote, respectively, the indirect-effect term and the direct-effect term. The direct-effect term

{\{δ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; δ α]\}}_{d i r}

comprises all dependencies on the vector

δ α

of parameter variations and, in view of Equation (44), has the following expression obtained by using Equation (44):

\begin{array}{l} \frac{\partial R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]}{\partial α_{j_{2}}} = \frac{\partial}{\partial α_{j_{2}}} \{\int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} \frac{\partial S (u; α; α)}{\partial α_{j_{1}}} d x_{1} … d x_{T I}\} \\ + \frac{\partial}{\partial α_{j_{2}}} \{\int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} a^{(1)} (x) \cdot \frac{\partial [Q (α) - N (u; α)]}{\partial α_{j_{1}}} d x_{1} \dots d x_{T I} - \frac{\partial {\hat{P}}^{(1)} (u; α; a^{(1)})}{\partial α_{j_{1}}}\} \\ + \sum_{j = 1}^{T I} \frac{\partial}{\partial α_{j_{2}}} \{\int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{j - 1} (α)}^{ω_{j - 1} (α)} \int_{λ_{j + 1} (α)}^{ω_{j + 1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} S [u (x_{1}, ., ω_{j} (α), ., x_{N_{x}}); α] \frac{\partial ω_{j} (α)}{\partial α_{j_{1}}} d x_{1} \dots d x_{T I}\} \\ - \sum_{j = 1}^{T I} \frac{\partial}{\partial α_{j_{2}}} \{\int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{j - 1} (α)}^{ω_{j - 1} (α)} \int_{λ_{j + 1} (α)}^{ω_{j + 1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} S [u (x_{1}, ., λ_{j} (α), ., x_{N_{x}}); α] \frac{\partial λ_{j} (α)}{\partial α_{j_{1}}} d x_{1} \dots d x_{T I}\}, \\ f o r j_{1}, j_{2} = 1, \dots, T P . \end{array}

(48)

The indirect-effect term

{\{δ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; v^{(1)} (x); δ a^{(1)} (x)]\}}_{i n d}

comprises all dependencies on the vectors

v^{(1)} (x)

and

δ a^{(1)} (x)

of variations in the state functions

u (x)

and

a^{(1)} (x)

, respectively. The components of the indirect-effect term have the following expressions obtained by using Equation (44):

\begin{array}{l} \frac{\partial R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]}{\partial u_{i}} = \int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} \frac{\partial^{2} S (u; α; α)}{\partial u_{i} \partial α_{j_{1}}} d x_{1} \dots d x_{T I} \\ + \int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} a^{(1)} (x) \cdot \frac{\partial [Q (α) - \partial N (u; α) / \partial u_{i}]}{\partial α_{j_{1}}} d x_{1} \dots d x_{T I} - \frac{\partial^{2} {\hat{P}}^{(1)} (u; α; a^{(1)})}{\partial u_{i} \partial α_{j_{1}}} \\ + \sum_{j = 1}^{T I} \int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{j - 1} (α)}^{ω_{j - 1} (α)} \int_{λ_{j + 1} (α)}^{ω_{j + 1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} \frac{\partial S [u (x_{1}, ., ω_{j} (α), ., x_{N_{x}}); α]}{\partial u_{i}} \frac{\partial ω_{j} (α)}{\partial α_{j_{1}}} d x_{1} \dots d x_{T I} \\ - \sum_{j = 1}^{T I} \int_{λ_{1} (α)}^{ω_{1} (α)} … \int_{λ_{j - 1} (α)}^{ω_{j - 1} (α)} \int_{λ_{j + 1} (α)}^{ω_{j + 1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} \frac{\partial S [u (x_{1}, ., λ_{j} (α), ., x_{N_{x}}); α]}{\partial u_{i}} \frac{\partial λ_{j} (α)}{\partial α_{j_{1}}} d x_{1} \dots d x_{T I}, \\ f o r j_{1} = 1, \dots, T P; i = 1, \dots, T D; \end{array}

(49)

and

\begin{array}{l} \frac{\partial R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]}{\partial a_{i}^{(1)}} = \frac{\partial}{\partial a_{i}^{(1)}} \{\int_{λ_{1} (α)}^{ω_{1} (α)} \dots \int_{λ_{T I} (α)}^{ω_{T I} (α)} a^{(1)} (x) \cdot \frac{\partial [Q (α) - N (u; α)]}{\partial α_{j_{1}}} d x_{1} \dots d x_{T I}\}, \\ f o r j_{1} = 1, \dots, T P; i = 1, \dots, T D . \end{array}

(50)

The direct-effect term

{\{δ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; δ α]\}}_{d i r}

can be computed immediately while the indirect-effect term

{\{δ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; v^{(1)} (x); δ a^{(1)} (x)]\}}_{i n d}

can be computed only after having determined the vectors

v^{(1)} (x)

and

δ a^{(1)} (x)

. The vector

v^{(1)} (x)

is the solution of the 1st-LVSS defined by Equations (22) and (23). On the other hand, the vector

δ a^{(1)} (x)

is the solution of the G differentiated 1st-LASS. Thus, taking the G differential of Equations (37) and (40) yields the following system of equations for

δ a^{(1)} (x)

:

{\{V_{21}^{(2)} (u; a^{(1)}; α) v^{(1)} (x) + V_{22}^{(2)} (u; α) δ a^{(1)} (x)\}}_{α^{0}} = {\{q_{2}^{(2)} (u; a^{(1)}; α; δ α)\}}_{α^{0}}, x \in Ω_{x},

(51)

{\{δ b_{A}^{(1)} (u; a^{(1)}; α; v^{(1)}; δ a^{(1)}; δ α)\}}_{α^{0}} = 0, x \in \partial Ω_{x} (α^{0}),

(52)

where

V_{21}^{(2)} (u; a^{(1)}; α) ≜ \frac{\partial [A^{(1)} (u; α) a^{(1)}]}{\partial u} - \frac{s^{(1)} [u (x); α]}{\partial u};

(53)

V_{22}^{(2)} (u; α) ≜ A^{(1)} (u; α)

(54)

q_{2}^{(2)} (u; α; a^{(1)}; δ α) ≜ \frac{s^{(1)} [u (x); α]}{\partial α} δ α - \frac{\partial [A^{(1)} (u; α) a^{(1)} (x)]}{\partial α} δ α;

(55)

δ b_{A}^{(1)} (u; a^{(1)}; α) ≜ \frac{\partial b_{A}^{(1)} (u; a^{(1)}; α)}{\partial u} v^{(1)} (x) + \frac{\partial b_{A}^{(1)} (u; a^{(1)}; α)}{\partial a^{(1)}} δ a^{(1)} (x) + \frac{\partial b_{A}^{(1)} (u; a^{(1)}; α)}{\partial α} δ α

(56)

Although Equations (51) and (52) are coupled to the 1st-LVSS, they can be solved sequentially, after having obtained the solution

v^{(1)} (x)

of the 1st-LVSS. Formally, the functions

v^{(1)} (x)

and

δ a^{(1)} (x)

are obtained by solving the following second-level variational sensitivity system (2nd-LVSS), which is obtained by concatenating the 1st-LVSS to Equations (51) and (52):

{\{V M^{(2)} [2 \times 2; U^{(2)} (2; x); α] V^{(2)} (2; x)\}}_{α^{0}} = {\{Q_{V}^{(2)} [2; U^{(2)} (2; x); α; δ α]\}}_{α^{0}}, x \in Ω_{x},

(57)

{\{B_{V}^{(2)} [2; U^{(2)} (2; x); V^{(2)} (2; x); α; δ α]\}}_{α^{0}} = 0 [2], 0 [2] ≜ {[0, 0]}^{†}, x \in \partial Ω_{x} (α^{0})

(58)

The argument “2” which appears in the list of arguments of the vector

U^{(2)} (2; x)

and the “variational vector”

V^{(2)} (2; x)

in Equations (57) and (58) indicates that each of these vectors is a two-block column vector, with each block comprising a column vector of dimension

T D

, defined as follows:

U^{(2)} (2; x) ≜ (\begin{matrix} u^{(1)} (x) \\ a^{(1)} (x) \end{matrix}); V^{(2)} (2; x) ≜ δ U^{(2)} (2; x) ≜ (\begin{matrix} v^{(2)} (1; x) \\ v^{(2)} (2; x) \end{matrix}) ≜ (\begin{matrix} v^{(1)} (x) \\ δ a^{(1)} (x) \end{matrix}) .

(59)

Thus, the (column) block vector

U^{(2)} (2; x)

has a total of

2 \times T D

components; evidently, the (column) block vector

V^{(2)} (2; x)

also has a total of

2 \times T D

components. In the relatively simple case regarding the components of either the vector

U^{(2)} (2; x)

or the vector

V^{(2)} (2; x)

, the numbers “1” and “2” could also be used as subscripts, but such a subscript notation would become unwieldy for the higher-level (adjoint) functions, which will be introduced in the sections to follow below. The superscript “(2)” which appears in the notation of the vectors

U^{(2)} (2; x)

and

V^{(2)} (2; x)

indicates “2nd-level”. Henceforth, such “higher-level” (i.e., level higher than first) variational and adjoint functions/vectors will be denoted using bold capital letters. The argument “2” in the expression

0 [2] ≜ {[0, 0]}^{†}

indicates that the quantity

0 [2]

is a two-block column vector comprising two vectors, each of which has

T D

components, all of which are zero-valued, as defined in Equation (3). Thus, the column vector

0 [2]

has a total of

2 \times T D

components, all of which are identically zero.

To distinguish block vectors from block matrices, two capital bold letter are used (and will henceforth be used) to denote block matrices, as in the case of “the second-level” “variational matrix”

V M^{(2)} [2 \times 2; u^{(2)} (x); α]

. The “2nd-level” is indicated by the superscript “(2)”. Subsequently in this work, levels higher than second will also be indicated by a corresponding superscript attached to the appropriate block vectors and/or block matrices. The argument “

2 \times 2

”, which appears in the list of arguments of

V M^{(2)} [2 \times 2; u^{(2)} (x); α]

, indicates that this matrix is a

2 \times 2

-dimensional block matrix comprising four matrices, each with of dimensions

T D \times T D

, having the following structure:

V M^{(2)} [2 \times 2; U^{(2)} (2; x); α] ≜ (\begin{matrix} V^{(1)} & 0 \\ V_{21}^{(2)} & V_{22}^{(2)} \end{matrix}) .

(60)

Thus, the matrix

V M^{(2)} [2 \times 2; u^{(2)} (x); α]

has a total of

{(2 \times T D)}^{2}

components (or elements). The other quantities which appear in Equations (57) and (58) are also two-block vectors, with the same structure as

V^{(2)} (2; x)

, and are defined as follows:

Q_{V}^{(2)} [2; U^{(2)} (2; x); α; δ α] ≜ (\begin{matrix} q_{V}^{(2)} (1; U^{(2)} (2; x); α; δ α) \\ q_{V}^{(2)} (2; U^{(2)} (2; x); α; δ α) \end{matrix}) ≜ (\begin{matrix} q^{(1)} (u; α; δ α) \\ q_{2}^{(2)} (u; a^{(1)}; α; δ α) \end{matrix});

(61)

\begin{array}{l} B_{V}^{(2)} [2; U^{(2)} (2; x); V^{(2)} (2; x); α; δ α] ≜ (\begin{matrix} b_{V}^{(2)} [1; U^{(2)} (2; x); V^{(2)} (2; x); α; δ α] \\ b_{V}^{(2)} [2; U^{(2)} (2; x); V^{(2)} (2; x); α; δ α] \end{matrix}) \\ ≜ (\begin{matrix} b_{V}^{(1)} (u^{(1)}; α; δ u^{(1)}; δ α) \\ δ b_{A}^{(1)} [U^{(2)} (2; x); V^{(2)} (2; x); α; δ α] \end{matrix}) . \end{array}

(62)

Solving the 2nd-LVSS requires

T P^{2}

large-scale computations, which is unrealistic to perform for large-scale systems comprising many parameters. The 2nd-CASAM-N circumvents the need for solving the 2nd-LVSS by deriving an alternative expression for the indirect-effect term defined in Equation (47), in which the function

V^{(2)} (2; x)

is replaced by a second-level adjoint function which is independent of variations in the model parameter and state functions. This second-level adjoint function will satisfy a second-level adjoint sensitivity system (2nd-LASS), which will be constructed by using the 2nd-LVSS as the starting point, following the same principles outlined in Section 3.1 The 2nd-LASS will be constructed in a Hilbert space which will be denoted as

H_{2} (Ω_{x})

and which comprises as elements block vectors of the same form as

V^{(2)} (2; x)

. Thus, a generic vector in

H_{2} (Ω_{x})

, denoted as

Ψ^{(2)} (2; x) ≜ {[ψ^{(2)} (1; x), ψ^{(2)} (2; x)]}^{†} \in H_{2} (Ω_{x})

, comprises two components of the form

ψ^{(2)} (1; x) ≜ {[ψ_{1}^{(2)} (1; x), \dots, ψ_{T D}^{(2)} (1; x)]}^{†} \in H_{1} (Ω_{x})

and

ψ^{(2)} (2; x) ≜ {[ψ_{1}^{(2)} (2; x), \dots, ψ_{T D}^{(2)} (2; x)]}^{†} \in H_{1} (Ω_{x})

, each of which are

T D

-dimensional column vectors; hence,

Ψ^{(2)} (2; x)

is a

(2 \times T D)

-dimensional column vector.

The inner product of two vectors

Ψ^{(2)} (2; x) ≜ {[ψ^{(2)} (1; x), ψ^{(2)} (2; x)]}^{†} \in H_{2} (Ω_{x})

and

Φ^{(2)} (x) ≜ {[φ^{(2)} (1; x), φ^{(2)} (2; x)]}^{†} \in H_{2} (Ω_{x})

in the Hilbert space

H_{2} (Ω_{x})

will be denoted as

{〈Ψ^{(2)} (2; x), Φ^{(2)} (x)〉}_{2}

and defined as follows:

{〈Ψ^{(2)} (2; x), Φ^{(2)} (x)〉}_{2} ≜ \sum_{i = 1}^{2} {〈ψ^{(2)} (i; x), φ^{(2)} (i; x)〉}_{1}

(63)

The inner product defined in Equation (63) is continuous in

α

in a neighborhood of

α^{0}

. Using the definition of the inner product defined in Equation (63), construct the inner product of Equation (57) with a vector

A^{(2)} (2; x) ≜ {[a^{(2)} (1; x), a^{(2)} (2; x)]}^{†} \in H_{2} (Ω_{x})

to obtain the following relation:

\begin{array}{l} {\{{〈A^{(2)} (2; x), V M^{(2)} [2 \times 2; u^{(2)} (x); α] V^{(2)} (2; x)〉}_{2}\}}_{α^{0}} \\ = {\{{〈A^{(2)} (2; x), Q_{V}^{(2)} [2; u^{(2)} (x); α; δ α]〉}_{2}\}}_{α^{0}} . \end{array}

(64)

The inner product on the left side of Equation (64) is now further transformed by using the definition of the adjoint operator to obtain the following relation:

\begin{array}{l} {\{{〈A^{(2)} (2; x), V M^{(2)} [2 \times 2; U^{(2)} (2; x); α] V^{(2)} (2; x)〉}_{2}\}}_{α^{0}} \\ = {\{{〈V^{(2)} (2; x), A M^{(2)} [2 \times 2; U^{(2)} (2; x); α] A^{(2)} (2; x)〉}_{2}\}}_{α^{0}} \\ + {\{{[P^{(2)} (U^{(2)}; A^{(2)}; V^{(2)}; α)]}_{\partial Ω_{x}}\}}_{α^{0}}, \end{array}

(65)

where the adjoint matrix-valued operator

A M^{(2)} [2 \times 2; u^{(2)} (x); α]

is defined as follows:

A M^{(2)} [2 \times 2; U^{(2)} (2; x); α] ≜ {[V M^{(2)} (2 \times 2; U^{(2)} (2; x); α)]}^{*} = (\begin{matrix} {[V^{(1) *}]}^{†} & {[V_{21}^{(2) *}]}^{†} \\ 0 & {[V_{22}^{(2) *}]}^{†} \end{matrix})

(66)

The matrix

A M^{(2)} [2 \times 2; u^{(2)} (x); α]

comprises

(2 \times 2)

block matrices, each with dimensions

T D^{2}

, thus comprising a total of

(2 \times 2) T D^{2}

components (or elements).

In Equation (66), the quantity

{\{{[P^{(2)} (U^{(2)}; A^{(2)}; V^{(2)}; α)]}_{\partial Ω_{x}}\}}_{α^{0}}

denotes the corresponding bilinear concomitant on the domain’s boundary, evaluated at the nominal values for the parameters and respective state functions. The definition domain of the adjoint (matrix-valued) operator

A M^{(2)} [2 \times 2; U^{(2)} (2; x); α]

is specified by requiring the function

A^{(2)} (2; x) ≜ {[a^{(2)} (1; x), a^{(2)} (2; x)]}^{†} \in H_{2} (Ω_{x})

to satisfy adjoint boundary/initial conditions denoted as follows:

{\{B_{A}^{(2)} [2; U^{(2)} (2; x); A^{(2)} (2; x); α]\}}_{α^{0}} = 0 [2], x \in \partial Ω_{x} (α^{0}) .

(67)

The second-level adjoint boundary/initial conditions represented by Equation (67) are determined by requiring that: (a) they must be independent of unknown values of

V^{(2)} (2; x)

; and (b) the substitution of the boundary and/or initial conditions represented by Equations (58) and (67) into the expression of

{\{{[P^{(2)} (U^{(2)}; A^{(2)}; V^{(2)}; α)]}_{\partial Ω_{x}}\}}_{α^{0}}

must cause all terms containing unknown values of

V^{(2)} (2; x)

to vanish.

Implementing the second-level (forward and adjoint) boundary/initial conditions, namely Equations (58) and (67) into Equation (65), will transform the later into the following form:

\begin{array}{l} {\{{〈V^{(2)} (2; x), A M^{(2)} [2 \times 2; U^{(2)} (2; x); α] A^{(2)} (2; x)〉}_{2}\}}_{α^{0}} \\ = {\{{〈A^{(2)} (2; x), V M^{(2)} [2 \times 2; U^{(2)} (2; x); α] V^{(2)} (2; x)〉}_{2}\}}_{α^{0}} \\ - {\{{[{\hat{P}}^{(2)} (U^{(2)}; A^{(2)}; α; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}}, \end{array}

(68)

where

{\{{[{\hat{P}}^{(2)} (U^{(2)}; A^{(2)}; α; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}}

denotes residual boundary terms which do not depend on

V^{(2)} (2; x)

but may not have vanished automatically. The right side of Equation (64) is now used to replace the vector

V M^{(2)} [2 \times 2; U^{(2)} (2; x); α] V^{(2)} (2; x)

in the first term on the right side of Equation (68), thereby obtaining the following relation:

\begin{array}{l} {\{{〈V^{(2)} (2; x), A M^{(2)} [2 \times 2; U^{(2)} (2; x); α] A^{(2)} (2; x)〉}_{2}\}}_{α^{0}} \\ = {\{{〈A^{(2)} (2; x), Q_{V}^{(2)} [2; U^{(2)} (2; x); α; δ α]〉}_{2}\}}_{α^{0}} - {\{{[{\hat{P}}^{(2)} (U^{(2)}; A^{(2)}; α; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}} . \end{array}

(69)

The definition of the second-level adjoint function

A^{(2)} (2; x) ≜ {[a^{(2)} (1; x), a^{(2)} (2; x)]}^{†} \in H_{2} (Ω_{x})

is now completed by requiring that both the left side of Equation (69) and the right side of Equation (47) represent the “indirect-effect term”

{\{δ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; v^{(1)} (x)]\}}_{i n d}

. As shown in Equation (45), there are a total of

T P

indirect-effect terms, each one corresponding to a first-order sensitivity

R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]

,

j_{1} = 1, \dots, T P

. Hence, there will be a total of

T P

second-level adjoint functions of the form

A^{(2)} (2; j_{1}; x) ≜ {[a^{(2)} (1; j_{1}; x), a^{(2)} (2; j_{1}; x)]}^{†} \in H_{2} (Ω_{x})

,

j_{1} = 1, \dots, T P

, with each such adjoint function corresponding to a specific

j_{1}

-dependent indirect-effect term. The left side of Equation (69) will be identical to the right side of Equation (47) by requiring that the following relation is satisfied for each

j_{1} = 1, \dots, T P

by the second-level adjoint functions (block vectors)

A^{(2)} (2; j_{1}; x) ≜ {[a^{(2)} (1; j_{1}; x), a^{(2)} (2; j_{1}; x)]}^{†} \in H_{2} (Ω_{x})

:

\begin{array}{l} {\{A M^{(2)} [2 \times 2; U^{(2)} (2; x); α] A^{(2)} (2; j_{1}; x)\}}_{α^{0}} \\ = {\{Q_{A}^{(2)} [2; j_{1}; U^{(2)} (2; x); α]\}}_{α^{0}}, j_{1} = 1, \dots, T P, x \in Ω_{x}, \end{array}

(70)

where

\begin{array}{l} Q_{A}^{(2)} [2; j_{1}; U^{(2)} (2; x); α] ≜ (\begin{matrix} q_{A}^{(2)} (1; j_{1}; U^{(2)}; α) \\ q_{A}^{(2)} (2; j_{1}; U^{(2)}; α) \end{matrix}) \\ ≜ (\begin{matrix} \partial R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; v^{(1)} (x)] / \partial u \\ \partial R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; v^{(1)} (x)] / \partial a^{(1)} \end{matrix}), j_{1} = 1, \dots, T P . \end{array}

(71)

The boundary conditions to be satisfied by each of the second-level adjoint functions

A^{(2)} (2; j_{1}; x) ≜ {[a^{(2)} (1; j_{1}; x), a^{(2)} (2; j_{1}; x)]}^{†} \in H_{2} (Ω_{x})

are as represented by Equation (67), namely:

{\{B_{A}^{(2)} [2; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α]\}}_{α^{0}} = 0 [2]; j_{1} = 1, \dots, T P; x \in \partial Ω_{x} (α^{0}) .

(72)

Since the source

Q_{A}^{(2)} [2; j_{1}; U^{(2)} (2; x); α]

may contain distributions (e.g., Dirac delta functions and derivatives thereof), the equality in Equation (70) is considered to hold in the weak sense. The well-known Riesz representation theorem ensures that the relationship in Equation (70) holds uniquely.

The system of equations represented by Equations (70)–(72) will be called the second-level adjoint sensitivity system (2nd-LASS) and its solution,

A^{(2)} (2; j_{1}; x) ≜ {[a^{(2)} (1; j_{1}; x), a^{(2)} (2; j_{1}; x)]}^{†}

, will be called the second-level adjoint function. The 2nd-LASS is independent of parameter variations

δ α

and variations

V^{(2)} (2; x)

in the respective state functions. It is also important to note that the

{(2 \times T D)}^{2}

-dimensional matrix

A M^{(2)} [2 \times 2; U^{(2)} (2; x); α]

is independent of the index

j_{1}

. Only the source term

Q_{A}^{(2)} [2; j_{1}; U^{(2)} (2; x); α]

depends on the index

j_{1}

. Therefore, the same solver can be used to invert the

A M^{(2)} [2 \times 2; U^{(2)} (2; x); α]

and numerically solve the 2nd-LASS for each

j_{1}

-dependent source

Q_{A}^{(2)} [2; j_{1}; U^{(2)} (2; x); α]

in order to obtain the corresponding

j_{1}

-dependent

2 \times T D

-dimensional second-level adjoint function (column vector)

A^{(2)} (2; j_{1}; x) ≜ {[a^{(2)} (1; j_{1}; x), a^{(2)} (2; j_{1}; x)]}^{†}

. Computationally, it would be efficient to store, if possible, the inverse matrix

{\{A M^{(2)} [2 \times 2; U^{(2)} (2; x); α]\}}^{- 1}

, in order to multiply the inverse matrix

{\{A M^{(2)} [2 \times 2; U^{(2)} (2; x); α]\}}^{- 1}

directly with the corresponding source term

Q_{A}^{(2)} [2; j_{1}; U^{(2)} (2; x); α]

, for each index

j_{1}

, in order to obtain the corresponding

j_{1}

-dependent

2 \times T D

-dimensional second-level adjoint function

A^{(2)} (2; j_{1}; x) ≜ {[a^{(2)} (1; j_{1}; x), a^{(2)} (2; j_{1}; x)]}^{†}

. The two components

a^{(2)} (1; j_{1}; x)

and

a^{(2)} (2; j_{1}; x)

of the second-level adjoint function

A^{(2)} (2; j_{1}; x) ≜ {[a^{(2)} (1; j_{1}; x), a^{(2)} (2; j_{1}; x)]}^{†}

are distinguished from each other by the use of the numbers “1” and, respectively, “2” in the respective list of arguments. In this particularly simple case, the numbers “1” and “2” could also be used as subscripts, in the customary notation for vector components, but such a use would not lend itself to generalizations because the subscript notation would become unwieldy for the higher-level adjoint functions, which will be introduced in the sections that follow below.

Using the underlying the 2nd-LASS and Equation (68) in Equation (47) yields the following expression for the indirect-effect term, for each

j_{1} = 1, \dots, T P

:

\begin{array}{l} {\{δ R^{(1)} [j_{1}; u (x); a^{(1)} (x); α; v^{(1)} (x); δ a^{(1)} (x)]\}}_{i n d} \\ = {\{A^{(2)} (2; j_{1}; x), Q_{V}^{(2)} {[2; U^{(2)} (2; x); α; δ α]}_{2}\}}_{α^{0}} \\ - {\{{[{\hat{P}}^{(2)} [U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; δ α]]}_{\partial Ω_{x}}\}}_{α^{0}} \\ \equiv {\{δ R^{(1)} [j_{1}; u^{(2)} (x); A^{(2)} (2; j_{1}; x); α; δ α]\}}_{i n d} . \end{array}

(73)

As the identity in Equation (73) indicates, the dependence of the indirect-effect term on the functions

v^{(1)} (x)

and

δ a^{(1)} (x)

is replaced by the dependence on the second-level adjoint function

A^{(2)} (2; j_{1}; x)

, for each

j_{1} = 1, \dots, T P

.

Replacing the expression obtained in Equation (73) for the indirect-effect term together with the expression for the direct-effect term provided in Equation (46) yields the following expression for the total differential defined by Equation (45):

\begin{array}{l} {\{δ R^{(1)} [j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; δ α]\}}_{α^{0}} = {\{\frac{\partial R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]}{\partial α} δ α\}}_{α^{0}} \\ + {\{{〈A^{(2)} (2; j_{1}; x), Q_{V}^{(2)} [2; U^{(2)} (2; x); α; δ α]〉}_{2}\}}_{α^{0}} - {\{{[{\hat{P}}^{(2)} (U^{(2)}; A^{(2)}; α; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}} . \end{array}

(74)

Using Equations (25), (46), and (55) in Equation (74) yields the following component form for the total differential expressed by Equation (74):

\begin{array}{l} {\{δ R^{(1)} [j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; δ α]\}}_{α^{0}} \\ = \sum_{j_{2} = 1}^{T P} {\{R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α]\}}_{α^{0}} δ α_{j_{2}}, j_{1} = 1, \dots, T P, \end{array}

(75)

where the quantity

{\{R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α]\}}_{α^{0}}

denotes the second-order sensitivity of the generic scalar-valued response

R [u (x); α]

with respect to the parameters

α_{j_{1}}

and

α_{j_{2}}

computed at the nominal values of the parameters and respective state functions. The expression of the second-order sensitivity

R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α]

of the response

R [u (x); α]

with respect to two model parameters

α_{j_{1}}

and

α_{j_{2}}

has the following expression:

\begin{array}{l} F o r j_{1}, j_{2} = 1, \dots, T P : R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α] ≜ \frac{\partial R^{(1)} [j_{1}; u (x); a^{(1)} (x); α]}{\partial α_{j_{2}}} \\ - \frac{{\{\partial {\hat{P}}^{(2)} [U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α]\}}_{\partial Ω_{x}}}{\partial α_{j_{2}}} + {〈a^{(2)} (1; j_{1}; x), \frac{\partial [Q (α) - N (u; α)]}{\partial α_{j_{2}}}〉}_{1} \\ + {〈a^{(2)} (2; j_{1}; x), \frac{s^{(1)} [u (x); α]}{\partial α_{j_{2}}} - \frac{\partial [A^{(1)} (u; α) a^{(1)} (x)]}{\partial α_{j_{2}}}〉}_{1} ≜ \frac{\partial^{2} R [u (x); α]}{\partial α_{j_{2}} \partial α_{j_{1}}} . \end{array}

(76)

As Equations (70) and (72) indicate, solving the 2nd-LASS once provides the second-level adjoint function

a^{(2)} (j_{1}; x)

, for each index

j_{1} = 1, \dots, T P

, which enables the exact and efficient computation of the G differential

δ R^{(1)} [j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; δ α]

for each index

j_{1} = 1, \dots, T P

. Notably, the G differential

δ R^{(1)} [j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; δ α]

comprises one complete row (running on the index

j_{1} = 1, \dots, T P

) of mixed second-order partial sensitivities

R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α] = \partial^{2} R / \partial α_{j_{2}} \partial α_{j_{1}}

. Thus, the exact computation of all of the partial second-order sensitivities,

{\{\partial^{2} R (α, u) / \partial α_{j} \partial α_{i}\}}_{(α^{0})},

i, j = 1, \dots, T P,

requires at most

T P

large-scale (adjoint) computations using the 2nd-LASS, rather than at least

O (T P^{2})

large-scale computations, which would be required by forward methods.

Since the adjoint matrix

A M^{(2)} [2 \times 2; U^{(2)} (2; x); α]

is block diagonal, solving the 2nd-LASS is equivalent to solving two 1st-LASS, with two different source terms. Thus, the “solvers” and the computer program used for solving the 1st-LASS can also be used for solving the 2nd-LASS. The 2nd-LASS was designated as the “second-level” rather than the “second-order” adjoint sensitivity system, since the 2nd-LASS does not involve any explicit second-order G derivatives of the operators underlying the original system but involves the inversion of the same operators that needed to be inverted for solving the 1st-LASS.

It is important to note that if the 2nd-LASS is solved

T P

-times, the second-order mixed sensitivities

R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α] \equiv \partial^{2} R / \partial α_{j_{2}} \partial α_{j_{1}}

will be computed twice, in two different ways, in terms of two distinct second-level adjoint functions. Consequently, the symmetry property

\partial^{2} R [u (x); α] / \partial α_{j_{2}} \partial α_{j_{1}} = \partial^{2} R [u (x); α] / \partial α_{j_{1}} \partial α_{j_{2}}

enjoyed by the second-order sensitivities provides an intrinsic (numerical) verification that the components of the second-level adjoint function

A^{(2)} (2; j_{1}; x)

, as well as the first-level adjoint function

a^{(1)} (x)

are computed accurately.

The structure of the 2nd-LASS enables full flexibility for prioritizing the computation of the second-order sensitivities. The computation of the second-order sensitivities would logically be prioritized based on the relative magnitudes of the first-order sensitivities: the largest relative first-order response sensitivity should have the highest priority for computing the corresponding second-order mixed sensitivities; then, the second largest relative first-order response sensitivity should be considered next, and so on. The unimportant second-order sensitivities can be deliberately neglected while knowing the error incurred by neglecting them. Computing second-order sensitivities that correspond to vanishing first-order sensitivities may also be of interest, since vanishing first-order sensitivities may indicate critical points of the response in the phase space of model parameters.

3.3. The Third-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (3rd-CASAM-N)

The second-order sensitivities

R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α] \equiv \partial^{2} R / \partial α_{j_{2}} \partial α_{j_{1}}

will be assumed to satisfy the conditions stated in Equations (11) and (12) for each

j_{1}, j_{2} = 1, \dots, T P

, so that the first-order total G differential of

R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α]

will exist and will be linear in the variations

V^{(2)} (2; x)

and

δ A^{(2)} (2; j_{1}; x)

in a neighborhood around the nominal values of the parameters and the respective state functions. By definition, the first-order total G differential of

R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α]

, which will be denoted as

{\{δ R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; V^{(2)} (2; x); δ A^{(2)} (2; j_{1}; x); δ α]\}}_{α^{0}}

, is given by the following expression:

\begin{array}{l} {\{δ R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; V^{(2)} (2; x); δ A^{(2)} (2; j_{1}; x); δ α]\}}_{α^{0}} \\ ≜ {\{\frac{d}{d ε} δ R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x) + ε V^{(2)} (2; x); A^{(2)} (2; j_{1}; x) + ε δ A^{(2)} (2; j_{1}; x); α + ε δ α]\}}_{ε = 0} \\ ≜ {\{δ R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; δ α]\}}_{d i r} \\ + {\{δ R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; V^{(2)} (2; x); δ A^{(2)} (2; j_{1}; x)]\}}_{i n d} . \end{array}

(77)

In Equation (77), the quantity

{\{δ R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; δ α]\}}_{d i r}

denotes the “direct-effect term”, which comprises all of the dependencies on the vector

δ α

of parameter variations and has the following expression:

\begin{array}{l} {\{δ R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; δ α]\}}_{d i r} \\ ≜ {\{\frac{\partial R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α]}{\partial α} δ α\}}_{α^{0}} \\ = \sum_{j_{3} = 1}^{T P} {\{\frac{\partial R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α]}{\partial α_{j_{3}}}\}}_{α^{0}} δ α_{j_{3}} . \end{array}

(78)

In Equation (77), the quantity

{\{δ R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; V^{(2)} (2; x); δ A^{(2)} (2; j_{1}; x)]\}}_{i n d}

denotes the “indirect-effect term” which comprises all of the dependencies on the vectors

V^{(2)} (2; x)

and

δ A^{(2)} (2; j_{1}; x)

of variations in the state functions

U^{(2)} (2; x)

and

A^{(2)} (2; j_{1}; x)

; this indirect-effect term is defined as follows:

\begin{array}{l} {\{δ R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; V^{(2)} (2; x); δ A^{(2)} (2; j_{1}; x)]\}}_{i n d} \\ ≜ {\{\frac{\partial R^{(2)} [j_{2}; j_{1}; U^{(2)}; A^{(2)}; α]}{\partial U^{(2)} (2; x)}\}}_{α^{0}} V^{(2)} (2; x) + {\{\frac{\partial R^{(2)} [j_{2}; j_{1}; U^{(2)}; A^{(2)}; α]}{\partial A^{(2)} (2; j_{1}; x)}\}}_{α^{0}} δ A^{(2)} (2; j_{1}; x), \end{array}

(79)

where

\begin{array}{l} {\{\frac{\partial R^{(2)} [j_{2}; j_{1}; U^{(2)}; A^{(2)}; α]}{\partial U^{(2)} (2; x)}\}}_{α^{0}} V^{(2)} (2; x) ≜ {\{\frac{\partial R^{(2)} [j_{2}; j_{1}; U^{(2)}; A^{(2)}; α]}{\partial u (x)}\}}_{α^{0}} v^{(1)} (x) \\ + {\{\frac{\partial R^{(2)} [j_{2}; j_{1}; U^{(2)}; A^{(2)}; α]}{\partial a^{(1)} (x)}\}}_{α^{0}} δ a^{(1)} (x), \end{array}

(80)

and

\frac{\partial R^{(2)} [j_{2}; j_{1}; u^{(2)}; a^{(2)}; α]}{\partial A^{(2)} (2; j_{1}; x)} δ A^{(2)} (2; j_{1}; x) ≜ \sum_{i = 1}^{2} \frac{\partial R^{(2)} [j_{2}; j_{1}; u^{(2)}; a^{(2)}; α]}{\partial a^{(2)} (i; j_{1}; x)} δ a^{(2)} (i; j_{1}; x) .

(81)

The direct-effect term

{\{δ R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; δ α]\}}_{d i r}

can be computed immediately; however,

{\{δ R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; V^{(2)} (2; x); δ A^{(2)} (2; j_{1}; x)]\}}_{i n d}

can be computed only after having determined the vectors

V^{(2)} (2; x)

and

δ A^{(2)} (2; j_{1}; x)

. The vector

V^{(2)} (2; x)

is the solution of the 2nd-LVSS defined by Equations (57) and (58). The vector

δ A^{(2)} (2; j_{1}; x)

is the solution of the G-differentiated 2nd-LASS. Taking the G differential of Equations (70) and (72) yields the following system of equations for

δ A^{(2)} (2; j_{1}; x)

:

\begin{array}{l} \frac{\partial \{A M^{(2)} [2 \times 2; U^{(2)} (2; x); α] A^{(2)} (2; j_{1}; x)\}}{\partial U^{(2)} (2; x)} V^{(2)} (2; x) - \frac{\partial Q_{A}^{(2)} [2; j_{1}; u^{(2)} (x); α]}{\partial U^{(2)} (2; x)} V^{(2)} (2; x) \\ + A M^{(2)} [2 \times 2; U^{(2)} (2; x); α] δ A^{(2)} (2; j_{1}; x) = \frac{\partial Q_{A}^{(2)} [2; j_{1}; u^{(2)} (x); α]}{\partial α} \partial α \\ - \frac{\partial \{A M^{(2)} [2 \times 2; U^{(2)} (2; x); α] A^{(2)} (2; j_{1}; x)\}}{\partial α} \partial α, j_{1} = 1, \dots, T P; x \in Ω_{x}, \end{array}

(82)

{\{δ B_{A}^{(2)} [2; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α]\}}_{α^{0}} = 0 [2]; j_{1} = 1, \dots, T P; x \in \partial Ω_{x} (α^{0}) .

(83)

The quantities which appear in Equation (83) are evaluated at the nominal values of the parameters and respective state functions, but the notation

{\{\}}_{α^{0}}

, which indicates this evaluation, is omitted in order to simplify the notation.

Concatenating Equations (82) and (83) with the 2nd-LVSS represented by Equations (57) and (58) yields the following system of, which will be called the “3rd-order variational sensitivity system” (3rd-LVSS), for determining the vectors

V^{(2)} (2; x)

and

δ A^{(2)} (2; j_{1}; x)

:

\begin{array}{l} {\{V M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] V^{(3)} (4; j_{1}; x)\}}_{α^{0}} \\ = {\{Q_{V}^{(3)} [4; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α]\}}_{α^{0}}, x \in Ω_{x}, \end{array}

(84)

{\{B_{V}^{(3)} [4; U^{(3)} (4; j_{1}; x); V^{(3)} (4; j_{1}; x); α; δ α]\}}_{α^{0}} = 0 [4]; 0 [4] ≜ {[0, 0, 0, 0]}^{†}, x \in \partial Ω_{x} (α^{0})

(85)

where

\begin{array}{l} V M^{(3)} (4 \times 4; U^{(3)}; α) ≜ (\begin{matrix} V M^{(2)} (2 \times 2) & 0 [2 \times 2] \\ V M_{21}^{(3)} (2 \times 2) & V M_{22}^{(3)} (2 \times 2) \end{matrix}); \\ U^{(3)} (4; j_{1}; x) ≜ (\begin{matrix} U^{(2)} (2; x) \\ A^{(2)} (2; j_{1}; x) \end{matrix}); \end{array}

(86)

\begin{array}{l} V^{(3)} (4; j_{1}; x) ≜ δ U^{(3)} (4; j_{1}; x) = (\begin{matrix} V^{(2)} (2; x) \\ δ A^{(2)} (2; j_{1}; x) \end{matrix}) \\ = {[v^{(1)} (x), δ a^{(1)} (x), δ a^{(2)} (1; j_{1}; x), δ a^{(2)} (2; j_{1}; x)]}^{†}; \end{array}

(87)

\begin{array}{l} V M_{21}^{(3)} (2 \times 2; x) ≜ \frac{\partial \{A M^{(2)} [2 \times 2; U^{(2)} (2; x); α] A^{(2)} (2; j_{1}; x)\}}{\partial U^{(2)} (2; x)} \\ - \frac{\partial Q_{A}^{(2)} [2; j_{1}; u^{(2)} (x); α]}{\partial U^{(2)} (2; x)}; \end{array}

(88)

V M_{22}^{(3)} (2 \times 2; x) ≜ A M^{(2)} [2 \times 2; U^{(2)} (2; x); α]; 0 [2 \times 2] ≜ (\begin{matrix} 0 & 0 \\ 0 & 0 \end{matrix});

(89)

\begin{array}{l} Q_{V}^{(3)} [4; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α] ≜ (\begin{matrix} Q_{V}^{(2)} [2; U^{(2)} (2; x); α; δ α] \\ Q_{2}^{(3)} [2; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α] \end{matrix}) \\ ≜ {\{q_{V}^{(3)} [1; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α], \dots, q_{V}^{(3)} [4; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α]\}}^{†}; \end{array}

(90)

\begin{array}{l} Q_{2}^{(3)} [2; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α] ≜ \frac{\partial Q_{A}^{(2)} [2; j_{1}; u^{(2)} (x); α]}{\partial α} \partial α \\ - \frac{\partial \{A M^{(2)} [2 \times 2; U^{(2)} (2; x); α] A^{(2)} (2; j_{1}; x)\}}{\partial α} \partial α; \end{array}

(91)

B_{V}^{(3)} [4; U^{(3)} (4; j_{1}; x); V^{(3)} (4; j_{1}; x); α; δ α] ≜ (\begin{matrix} B_{V}^{(2)} [2; U^{(2)} (2; x); V^{(2)} (2; x); α; δ α] \\ δ B_{A}^{(2)} [2; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α] \end{matrix}) .

(92)

For subsequent reference, it is important to note that the quantities

q_{V}^{(3)} [i; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α]

are linear in the parameter variations

δ α_{i}

,

i = 1, \dots, T P

, and can therefore be written in the following form:

q_{V}^{(3)} [i; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α] ≜ \sum_{j_{3} = 1}^{T P} s_{V}^{(3)} [i; j_{3}; j_{1}; U^{(3)} (4; j_{1}; x); α] δ α_{j_{3}}

(93)

The variational matrix

V M^{(3)} (4 \times 4; U^{(3)}; α)

comprises

(4 \times 4)

block matrices, each comprising

T D^{2}

components/elements; thus, the matrix

V M^{(3)} (4 \times 4; U^{(3)}; α)

comprises a total of

(4 \times 4) T D^{2}

components/elements. Each of the vectors

V^{(3)} (4; j_{1}; x)

,

Q_{V}^{(3)} [4; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α]

and

B_{V}^{(3)} [4; U^{(3)} (2; j_{1}; x); V^{(3)} (4; j_{1}; x); α; δ α]

comprise four

T D

-dimensional vectors, as shown in their respective definitions; thus, each of the vectors

V^{(3)} (4; j_{1}; x)

,

Q_{V}^{(3)} [4; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α]

, and

B_{V}^{(3)} [4; U^{(3)} (2; j_{1}; x); V^{(3)} (4; j_{1}; x); α; δ α]

comprise

(4 \times T D)

components/elements.

Solving the 3rd-LVSS would require

T P^{3}

large-scale computations, which is unrealistic for large-scale systems comprising many parameters. The 3rd-CASAM-N circumvents the need for solving the 3rd-LVSS by deriving an alternative expression for the indirect-effect term defined in Equation (81), in which the function

V^{(3)} (4; j_{1}; x)

is replaced by a third-level adjoint function which is independent of parameter variations. This third-level adjoint function will be the solution of a third-level adjoint sensitivity system (3rd-LASS) which will be constructed by applying the same principles as those used for constructing the 1st-LASS and the 2nd-LASS. The Hilbert space appropriate for constructing the 3rd-LASS will be denoted as

H_{3} (Ω_{x})

and comprise element block vectors of the same form as

V^{(3)} (4; j_{1}; x)

. Thus, a generic block vector in

H_{3} (Ω_{x})

, denoted as

Ψ^{(3)} (4; x) ≜ {[ψ^{(3)} (1; x), ψ^{(3)} (2; x), ψ^{(3)} (3; x), ψ^{(3)} (4; x)]}^{†} \in H_{3} (Ω_{x})

, comprises four

T D

-dimensional vector components of the form

ψ^{(3)} (i; x) ≜ {[ψ_{1}^{(3)} (i; x), \dots, ψ_{T D}^{(3)} (i; x)]}^{†} \in H_{1} (Ω_{x})

,

i = 1, 2, 3, 4

, where each of these four components is a

T D

-dimensional column vector. The inner product of two vectors

Ψ^{(3)} (4; x) \in H_{3} (Ω_{x})

and

Φ^{(3)} (4; x) \in H_{3} (Ω_{x})

in the Hilbert space

H_{3} (Ω_{x})

will be denoted as

{〈Ψ^{(3)} (4; x), Φ^{(3)} (4; x)〉}_{3}

and defined as follows:

{〈Ψ^{(3)} (4; x), Φ^{(3)} (4; x)〉}_{3} ≜ \sum_{i = 1}^{4} {〈ψ^{(3)} (i; x), φ^{(3)} (i; x)〉}_{1}

(94)

The inner product defined in Equation (94) is continuous in

α

, in a neighborhood around

α^{0}

. Using the definition of the inner product defined in Equation (94), the inner product of Equation (84) is constructed with a vector

A^{(3)} (4; x) ≜ {[a^{(3)} (1; x), a^{(3)} (2; x), a^{(3)} (3; x), a^{(3)} (4; x)]}^{†} \in H_{3} (Ω_{x})

to obtain the following relation:

\begin{array}{l} {\{{〈A^{(3)} (4; x), V M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] V^{(3)} (4; j_{1}; x)〉}_{3}\}}_{α^{0}} \\ = {\{{〈A^{(3)} (4; x), Q_{V}^{(3)} [4; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α]〉}_{3}\}}_{α^{0}}, x \in Ω_{x} . \end{array}

(95)

The inner product on the left side of Equation (95) is further transformed by using the definition of the adjoint operator to obtain the following relation:

\begin{array}{l} {\{{〈A^{(3)} (4; x), V M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] V^{(3)} (4; j_{1}; x)〉}_{3}\}}_{α^{0}} \\ = {\{{〈V^{(3)} (4; j_{1}; x), A M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] A^{(3)} (4; x)〉}_{3}\}}_{α^{0}} \\ + {\{{[P^{(3)} (U^{(3)}; A^{(3)}; V^{(3)}; α)]}_{\partial Ω_{x}}\}}_{α^{0}}, \end{array}

(96)

where

\begin{array}{l} A M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] ≜ {[V M^{(3)} (4 \times 4; U^{(3)}; α)]}^{*} \\ = (\begin{matrix} {\{{[V M^{(2)} (2 \times 2)]}^{*}\}}^{†} & {\{{[V M_{21}^{(3)} (2 \times 2)]}^{*}\}}^{†} \\ 0 [2 \times 2] & {\{{[V M_{22}^{(3)} (2 \times 2)]}^{*}\}}^{†} \end{matrix}), \end{array}

(97)

and where

{\{{[P^{(3)} (U^{(3)}; A^{(3)}; V^{(3)}; α)]}_{\partial Ω_{x}}\}}_{α^{0}}

denotes the corresponding bilinear concomitant on the domain’s boundary, evaluated at the nominal values for the parameters and respective state functions. The adjoint matrix

A M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α]

comprises

{(4 \times T D)}^{2}

components/elements, while the adjoint function

A^{(3)} (4; x) \in H_{3} (Ω_{x})

comprises

(4 \times T D)

components/elements.

The domain of the adjoint matrix operator

A M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α]

is specified by requiring that the function

A^{(3)} (4; x) ≜ {[a^{(3)} (1; x), a^{(3)} (2; x), a^{(3)} (3; x), a^{(3)} (4; x)]}^{†} \in H_{3} (Ω_{x})

satisfies adjoint boundary/initial conditions denoted as follows:

{\{B_{A}^{(3)} [4; U^{(3)} (4; j_{1}; x); A^{(3)} (4; x); α]\}}_{α^{0}} = 0 [4], x \in \partial Ω_{x} (α^{0}) .

(98)

The third-level adjoint boundary/initial conditions represented by Equation (98) are determined by requiring that: (a) they must be independent of unknown values of

V^{(3)} (4; j_{1}; x)

; and (b) the substitution of the boundary and/or initial conditions represented by Equations (85) and (98) into the expression of

{\{{[P^{(3)} (U^{(3)}; A^{(3)}; V^{(3)}; α)]}_{\partial Ω_{x}}\}}_{α^{0}}

must cause all terms containing unknown values of

V^{(3)} (4; j_{1}; x)

to vanish.

Implementing the boundary/initial conditions represented by Equations (85) and (98) into Equation (96) will transform the latter into the following form:

\begin{array}{l} {\{{〈V^{(3)} (4; j_{1}; x), A M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] A^{(3)} (4; x)〉}_{3}\}}_{α^{0}} \\ = {\{{〈A^{(3)} (4; x), V M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] V^{(3)} (4; j_{1}; x)〉}_{3}\}}_{α^{0}} \\ - {\{{[{\hat{P}}^{(3)} (U^{(3)}; A^{(3)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}}, \end{array}

(99)

where

{\{{[{\hat{P}}^{(3)} (U^{(3)}; A^{(3)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}}

denotes residual boundary terms which may not vanish automatically. The right side of Equation (95) is now used in the first term on the right side of Equation (68) to obtain the following relation:

\begin{array}{l} {\{{〈V^{(3)} (4; j_{1}; x), A M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] A^{(3)} (4; x)〉}_{3}\}}_{α^{0}} \\ = {\{{〈A^{(3)} (4; x), Q_{V}^{(3)} [4; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α]〉}_{3}\}}_{α^{0}} - {\{{[{\hat{P}}^{(3)} (U^{(3)}; A^{(3)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}} . \end{array}

(100)

The definition of the third-level adjoint function

A^{(3)} (4; x) \in H_{3} (Ω_{x})

is now completed by requiring that the left side of Equation (100) and the right side of Equation (81) represent the “indirect-effect term”

{\{δ R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; V^{(2)} (2; x); δ A^{(2)} (2; j_{1}; x)]\}}_{i n d}

for each of the indices

j_{1} = 1, \dots, T P; j_{2} = 1, ...., j_{1}

. Hence, there will be

T P (T P + 1) / 2

distinct third-level adjoint functions

A^{(3)} (4; j_{2}; j_{1}; x) ≜ {[a^{(3)} (1; j_{2}; j_{1}; x), a^{(3)} (2; j_{2}; j_{1}; x), a^{(3)} (3; j_{2}; j_{1}; x), a^{(3)} (4; j_{2}; j_{1}; x)]}^{†}

, corresponding to the indices

j_{1} = 1, \dots, T P; j_{2} = 1, ...., j_{1}

. Each of these distinct third-level adjoint functions will correspond to a specific

(j_{1}, j_{2})

-dependent indirect-effect term.

The left side of Equation (100) will be identical to the right side of Equation (81) by requiring that the following relation be satisfied by the third-level adjoint functions

A^{(3)} (4; j_{2}; j_{1}; x) ≜ {[a^{(3)} (1; j_{2}; j_{1}; x), a^{(3)} (2; j_{2}; j_{1}; x), a^{(3)} (3; j_{2}; j_{1}; x), a^{(3)} (4; j_{2}; j_{1}; x)]}^{†}

:

\begin{array}{l} {\{A M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] A^{(3)} (4; j_{2}; j_{1}; x)\}}_{α^{0}} \\ = {\{Q_{A}^{(3)} [4; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); α]\}}_{α^{0}}, j_{1} = 1, \dots, T P; j_{2} = 1, ...., j_{1}, \end{array}

(101)

Q_{A}^{(3)} [4; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); α] ≜ [q_{A}^{(3)} (1; j_{2}; j_{1}; U^{(3)}; α), \dots, q_{A}^{(3)} (1; j_{2}; j_{1}; U^{(3)}; α)];

(102)

q_{A}^{(3)} (1; j_{2}; j_{1}; U^{(3)}; α) ≜ \partial R^{(2)} [j_{2}; j_{1}; u^{(2)}; a^{(2)}; α] / \partial u^{(1)};

(103)

q_{A}^{(3)} (2; j_{2}; j_{1}; U^{(3)}; α) ≜ \partial R^{(2)} [j_{2}; j_{1}; u^{(2)}; a^{(2)}; α] / \partial a^{(1)};

(104)

q_{A}^{(3)} (3; j_{2}; j_{1}; U^{(3)}; α) ≜ \partial R^{(2)} [j_{2}; j_{1}; u^{(2)}; a^{(2)}; α] / \partial a^{(2)} (1; j_{1}; x);

(105)

q_{A}^{(3)} (3; j_{2}; j_{1}; U^{(3)}; α) ≜ \partial R^{(2)} [j_{2}; j_{1}; u^{(2)}; a^{(2)}; α] / \partial a^{(2)} (2; j_{1}; x) .

(106)

The boundary conditions to be satisfied by each of the third-level adjoint functions

A^{(3)} (4; j_{2}; j_{1}; x) ≜ {[a^{(3)} (1; j_{2}; j_{1}; x), a^{(3)} (2; j_{2}; j_{1}; x), a^{(3)} (3; j_{2}; j_{1}; x), a^{(3)} (4; j_{2}; j_{1}; x)]}^{†}

are those represented by Equation (98), namely

\begin{array}{l} {\{B_{A}^{(3)} [4; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α]\}}_{α^{0}} = 0 [4]; \\ j_{1} = 1, \dots, T P; j_{2} = 1, \dots, j_{1}; x \in \partial Ω_{x} (α^{0}) . \end{array}

(107)

Since the source

Q_{A}^{(3)} [4; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); α]

may contain distributions, the equality in Equation (101) is considered to hold in the weak sense. The Riesz representation theorem ensures that the weak equality in Equation (101) holds uniquely.

The system of represented by Equations (101) and (107) will be called the third-level adjoint sensitivity system (3rd-LASS); its solution,

A^{(3)} (4; j_{2}; j_{1}; x)

, will be called the third-level adjoint function. Using the underlying the 3rd-LASS and Equation (100) in Equation (81) yields the following expression for the indirect-effect term:

\begin{array}{l} {\{δ R^{(2)} [j_{2}; j_{1}; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α; V^{(2)} (2; x); δ A^{(2)} (2; j_{1}; x)]\}}_{i n d} \\ = {\{A^{(3)} (4; j_{2}; j_{1}; x), Q_{V}^{(3)} {[4; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α]}_{3}\}}_{α^{0}} - {\{{[{\hat{P}}^{(3)} (U^{(3)}; A^{(3)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}} \\ \equiv {\{δ R^{(2)} [j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α]\}}_{i n d}; j_{1} = 1, \dots, T P; j_{2} = 1, \dots, j_{1} . \end{array}

(108)

As the identity in Equation (108) indicates, the dependence of the indirect-effect term on the function

V^{(3)} (4; j_{1}; x)

is replaced by the dependence on the adjoint function

A^{(3)} (4; j_{2}; j_{1}; x)

, for each

j_{1} = 1, \dots, T P; j_{2} = 1, \dots, j_{1}

.

Replacing the expression obtained in Equation (108) for the indirect-effect term together with the expression for the direct-effect term provided in Equation (78) yields the following expression for the total differential defined by Equation (77):

\begin{array}{l} {\{δ R^{(2)} [j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α; δ α]\}}_{α^{0}} \\ = {\{\frac{\partial R^{(2)} [j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); α]}{\partial α} δ α\}}_{α^{0}} - {\{{[{\hat{P}}^{(3)} (U^{(3)}; A^{(3)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}} \\ + {\{A^{(3)} (4; j_{2}; j_{1}; x), Q_{V}^{(3)} {[4; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α]}_{3}\}}_{α^{0}} . \end{array}

(109)

In component form, the total differential expressed by Equation (109) has the following expression:

\begin{array}{l} {\{δ R^{(2)} [j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α; δ α]\}}_{α^{0}} \\ = \sum_{j_{3} = 1}^{T P} {\{R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α]\}}_{α^{0}} δ α_{j_{3}}, j_{1}; j_{2} = 1, \dots, T P, \end{array}

(110)

where the quantity

R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α]

denotes the third-order sensitivity of the generic scalar-valued response

R [u (x); α]

with respect to any three model parameters

α_{j_{1}}

,

α_{j_{2}}

,

α_{j_{3}}

, and has the following expression:

\begin{array}{l} R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α] ≜ \frac{\partial R^{(2)} [j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); α]}{\partial α_{j_{3}}} \\ - \frac{{[\partial {\hat{P}}^{(3)} (U^{(3)}; A^{(3)}; δ α)]}_{\partial Ω_{x}}}{\partial α_{j_{3}}} + \sum_{i = 1}^{4} {〈a^{(3)} (i; j_{2}; j_{1}; x), s_{V}^{(3)} [i; j_{3}; j_{1}; U^{(3)} (4; j_{1}; x); α]〉}_{1} \\ ≜ \frac{\partial^{3} R [u (x); α]}{\partial α_{j_{3}} \partial α_{j_{2}} \partial α_{j_{1}}}, f o r j_{1}, j_{2}, j_{3} = 1, \dots, T P . \end{array}

(111)

As Equations (101)–(107) indicate, solving the 3rd-LASS once provides the third-level adjoint function

A^{(3)} (4; j_{2}; j_{1}; x)

, for each of the indices

j_{1} = 1, \dots, T P; j_{2} = 1, \dots, j_{1}

. In turn, the availability of

A^{(3)} (4; j_{2}; j_{1}; x)

enables the exact and efficient computation of the G differential

δ R^{(2)} [j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α]

. Thus, the exact computation of all of the partial third-order sensitivities

{\{\partial^{3} R (α, u) / \partial α_{j_{3}} \partial α_{j_{2}} \partial α_{j_{1}}\}}_{(α^{0})}

requires at most

T P (T P + 1) / 2

large-scale (adjoint) computations using the 3rd-LASS, rather than at least

O (T P^{3})

large-scale computations, which would be required by forward methods.

The matrix

A M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α]

is block diagonal; therefore, solving the 3rd-LASS is equivalent to solving three 1st-LASS, with different source terms. The 3rd-LASS was designated as “the third-level” rather than “third-order” adjoint sensitivity system since the 3rd-LASS does not involve any explicit second-order and/or third-order G derivatives of the operators underlying the original system, but involves only the inversion of the same operators that needed to be inverted for solving the 1st-LASS.

By solving the 3rd-LASS

T P (T P + 1) / 2

times, the third-order mixed sensitivities

{\{\partial^{3} R (α, u) / \partial α_{j_{3}} \partial α_{j_{2}} \partial α_{j_{1}}\}}_{(α^{0})}

will be computed three times, in three different ways. Consequently, the multiple symmetries intrinsic to the third-order sensitivities provide an intrinsic numerical verification that the components of the first-, second-, and third-level adjoint functions are computed accurately.

The structure of the 3rd-LASS enables full flexibility for prioritizing the computation of the third-order sensitivities. The computation of the third-order sensitivities would logically be prioritized based on the relative magnitudes of the second-order sensitivities, so that the unimportant third-order sensitivities can be deliberately neglected while knowing the error incurred by neglecting them.

3.4. The Fourth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (4th-CASAM-N)

The third-order sensitivities

R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α] \equiv \partial^{3} R [u (x); α] / \partial α_{j_{3}} \partial α_{j_{2}} \partial α_{j_{1}}

will be assumed to satisfy the conditions stated in Equations (11) and (12) for each

j_{1}, j_{2}, j_{3} = 1, \dots, T P

, so that the first-order total G differential of

R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α]

will exist and will be linear in the variations

V^{(3)} (4; j_{1}; x) ≜ δ U^{(3)} (4; j_{1}; x)

and

δ A^{(3)} (4; j_{2}; j_{1}; x)

in a neighborhood around the nominal values of the parameters and the respective state functions. By definition, the first-order total G differential of

R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α]

, which will be denoted as

{\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α; V^{(3)} (4; j_{1}; x); δ A^{(3)} (4; j_{2}; j_{1}; x); δ α]\}}_{α^{0}}

, is given by the following expression:

\begin{array}{l} {\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α; V^{(3)} (4; j_{1}; x); δ A^{(3)} (4; j_{2}; j_{1}; x); δ α]\}}_{α^{0}} \\ ≜ \{\frac{d}{d ε} R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x) + ε V^{(3)} (4; j_{1}; x); \\ {A^{(3)} (4; j_{2}; j_{1}; x) + ε δ A^{(3)} (4; j_{2}; j_{1}; x); α + ε δ α\}}_{ε = 0} \\ ≜ {\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α; δ α]\}}_{d i r} \\ + {\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α; V^{(3)} (4; j_{1}; x); δ A^{(3)} (4; j_{2}; j_{1}; x)]\}}_{i n d} . \end{array}

(112)

In Equation (112), the quantity

{\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α; δ α]\}}_{d i r}

denotes the “direct-effect term” which comprises all of the dependencies on the vector

δ α

of parameter variations and has the following expression:

\begin{array}{l} {\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α; δ α]\}}_{d i r} ≜ \\ {\{\frac{\partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α]}{\partial α} δ α\}}_{α^{0}} \\ = \sum_{j_{4} = 1}^{T P} {\{\frac{\partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α]}{\partial α_{j_{4}}}\}}_{α^{0}} δ α_{j_{4}} . \end{array}

(113)

In Equation (112), the quantity

{\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α; V^{(3)}; δ A^{(3)}]\}}_{i n d}

denotes the “indirect-effect term” which comprises all of the dependencies on the vectors

V^{(3)} (4; j_{1}; x)

and

δ A^{(3)} (4; j_{2}; j_{1}; x)

of variations in the state functions

U^{(3)} (4; j_{1}; x)

and

A^{(3)} (4; j_{2}; j_{1}; x)

; this indirect-effect term is defined as follows:

\begin{array}{l} {\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α; V^{(3)} (4; j_{1}; x); δ A^{(3)} (4; j_{2}; j_{1}; x)]\}}_{i n d} \\ ≜ {\{\frac{\partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α]}{\partial U^{(3)} (4; j_{1}; x)}\}}_{α^{0}} V^{(3)} (4; j_{1}; x) \\ + {\{\frac{\partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α]}{\partial A^{(3)} (4; j_{2}; j_{1}; x)}\}}_{α^{0}} δ A^{(3)} (4; j_{2}; j_{1}; x), \end{array}

(114)

where

\begin{array}{l} \frac{\partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α]}{\partial U^{(3)} (4; j_{1}; x)} V^{(3)} (4; j_{1}; x) \\ ≜ \frac{\partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α]}{\partial u (x)} v^{(1)} (x) + \frac{\partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α]}{\partial a^{(1)} (x)} δ a^{(1)} (x) \\ + \sum_{i = 1}^{2} \frac{\partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α]}{\partial a^{(2)} (i; j_{1}; x)} δ a^{(2)} (i; j_{1}; x), \end{array}

(115)

and

\begin{array}{l} \frac{\partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α]}{\partial A^{(3)} (4; j_{2}; j_{1}; x)} δ A^{(3)} (4; j_{2}; j_{1}; x) \\ ≜ \sum_{i = 1}^{4} \frac{\partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α]}{\partial a^{(3)} (i; j_{2}; j_{1}; x)} δ a^{(3)} (i; j_{2}; j_{1}; x) . \end{array}

(116)

The direct-effect term

{\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α; δ α]\}}_{d i r}

can be computed immediately; however, the indirect-effect term

{\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α; V^{(3)}; δ A^{(3)}]\}}_{i n d}

can be computed only after having determined the vectors

V^{(3)} (4; j_{1}; x)

and

δ A^{(3)} (4; j_{2}; j_{1}; x)

. The vector

V^{(3)} (4; j_{1}; x)

is the solution of the 3rd-LVSS defined by Equations (84) and (85). The vector

δ A^{(3)} (4; j_{2}; j_{1}; x)

is the solution of the G-differentiated 3rd-LASS. Taking the G differential of Equations (101) and (107) yields the following system of for

δ A^{(3)} (4; j_{2}; j_{1}; x)

and for

j_{1} = 1, \dots, T P; j_{2} = 1, \dots, j_{1}

:

\begin{array}{l} \frac{\partial \{A M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] A^{(3)} (4; j_{2}; j_{1}; x)\}}{\partial U^{(3)} (4; j_{1}; x)} V^{(3)} (4; j_{1}; x) \\ - \frac{\partial Q_{A}^{(3)} [4; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); α]}{\partial U^{(3)} (4; j_{1}; x)} V^{(3)} (4; j_{1}; x) \\ + A M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] δ A^{(3)} (4; j_{2}; j_{1}; x) \\ = \frac{\partial Q_{A}^{(3)} [4; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); α]}{\partial α} \partial α \\ - \frac{\partial \{A M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] A^{(3)} (4; j_{2}; j_{1}; x)\}}{\partial α} \partial α, x \in Ω_{x}, \end{array}

(117)

{\{δ B_{A}^{(3)} [4; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α]\}}_{α^{0}} = 0 [4]; x \in \partial Ω_{x} (α^{0}) .

(118)

The quantities which appear in Equation (117) are evaluated at the nominal values of the parameters and respective state functions, but the notation

{\{\}}_{α^{0}}

, which indicates this evaluation, is omitted in order to simplify the notation.

Concatenating Equations (117) and (118) with the 3rd-LVSS represented by Equations (84) and (85) yields the following system of, which will be called the “4th-order variational sensitivity system” (4th-LVSS), for determining the vectors

V^{(3)} (4; j_{1}; x)

and

δ A^{(3)} (4; j_{2}; j_{1}; x)

, for

j_{1} = 1, \dots, T P; j_{2} = 1, \dots, j_{1}

:

\begin{array}{l} {\{V M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α] V^{(4)} (8; j_{2}; j_{1}; x)\}}_{α^{0}} \\ = {\{Q_{V}^{(4)} [8; j_{1}; U^{(4)} (8; j_{2}; j_{1}; x); α; δ α]\}}_{α^{0}}, x \in Ω_{x}, \end{array}

(119)

{\{B_{V}^{(4)} [8; U^{(4)} (8; j_{1}; x); V^{(4)} (8; j_{1}; x); α; δ α]\}}_{α^{0}} = 0 [8]; x \in \partial Ω_{x} (α^{0});

(120)

where

0 [8] ≜ {[0, 0, 0, 0, 0, 0, 0, 0]}^{†}

, and

V M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α] ≜ (\begin{matrix} V M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] & 0 [4 \times 4] \\ V M_{21}^{(4)} (4 \times 4; x) & V M_{22}^{(4)} (4 \times 4; x) \end{matrix});

(121)

U^{(4)} (8; j_{2}; j_{1}; x) ≜ (\begin{matrix} U^{(3)} (4; j_{1}; x) \\ A^{(3)} (4; j_{2}; j_{1}; x) \end{matrix});

(122)

\begin{array}{l} V^{(4)} (8; j_{2}; j_{1}; x) ≜ δ U^{(4)} (8; j_{2}; j_{1}; x) = (\begin{matrix} V^{(3)} (4; j_{1}; x) \\ δ A^{(3)} (4; j_{2}; j_{1}; x) \end{matrix}) \\ = [v^{(1)} (x), δ a^{(1)} (x), δ a^{(2)} (1; j_{1}; x), δ a^{(2)} (2; j_{1}; x), \\ {δ a^{(3)} (1; j_{2}; j_{1}; x), \dots, δ a^{(3)} (4; j_{2}; j_{1}; x)]}^{†}; \end{array}

(123)

\begin{array}{l} V M_{21}^{(4)} (4 \times 4; x) ≜ - \frac{\partial Q_{A}^{(3)} [4; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); α]}{\partial U^{(3)} (4; j_{1}; x)} \\ + \frac{\partial \{A M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] A^{(3)} (4; j_{2}; j_{1}; x)\}}{\partial U^{(3)} (4; j_{1}; x)}; \end{array}

(124)

V M_{22}^{(4)} (4 \times 4; x) ≜ A M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α];

(125)

\begin{array}{l} Q_{V}^{(4)} [8; j_{1}; U^{(4)} (4; j_{2}; j_{1}; x); α; δ α] ≜ (\begin{matrix} Q_{V}^{(3)} [4; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α] \\ Q_{2}^{(4)} [4; j_{1}; U^{(4)} (4; j_{2}; j_{1}; x); α; δ α] \end{matrix}) \\ \equiv {\{q_{V}^{(4)} [1; j_{1}; U^{(4)} (4; j_{2}; j_{1}; x); α; δ α], \dots, q_{V}^{(4)} [8; j_{1}; U^{(4)} (4; j_{2}; j_{1}; x); α; δ α]\}}^{†}; \end{array}

(126)

\begin{array}{l} Q_{2}^{(4)} [4; j_{1}; U^{(4)} (4; j_{2}; j_{1}; x); α; δ α] ≜ \frac{\partial Q_{A}^{(3)} [4; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); α]}{\partial α} \partial α \\ - \frac{\partial \{A M^{(3)} [4 \times 4; U^{(3)} (4; j_{1}; x); α] A^{(3)} (4; j_{2}; j_{1}; x)\}}{\partial α} \partial α; \end{array}

(127)

\begin{array}{l} B_{V}^{(4)} [8; U^{(4)} (8; j_{2}; j_{1}; x); V^{(4)} (8; j_{2}; j_{1}; x); α; δ α] \\ ≜ (\begin{matrix} B_{V}^{(3)} [4; U^{(3)} (4; j_{1}; x); V^{(3)} (4; j_{1}; x); α; δ α] \\ δ B_{A}^{(2)} [2; U^{(2)} (2; x); A^{(2)} (2; j_{1}; x); α] \end{matrix}) . \end{array}

(128)

For subsequent reference, it is noted that the quantities

q_{V}^{(4)} [i; j_{2}; j_{1}; U^{(4)} (4; j_{2}; j_{1}; x); α; δ α]

are linear in the parameter variations

δ α_{i}

,

i = 1, \dots, T P

, and can therefore be written in the following form:

q_{V}^{(4)} [i; j_{2}; j_{1}; U^{(4)} (4; j_{2}; j_{1}; x); α; δ α] ≜ \sum_{j_{4} = 1}^{T P} s_{V}^{(3)} [i; j_{4}; j_{2}; j_{1}; U^{(4)} (4; x); α] δ α_{j_{4}}; i = 1, \dots, 8 .

(129)

The variational matrix

V M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α]

comprises

(8 \times 8)

block matrices, each comprising

T D^{2}

components/elements; thus, the matrix

V M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α]

comprises a total of

(8 \times 8) T D^{2}

components/elements. Each of the vectors

V^{(4)} (8; j_{2}; j_{1}; x)

,

Q_{V}^{(4)} [8; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α]

, and

B_{V}^{(4)} [8; U^{(4)} (8; j_{2}; j_{1}; x); V^{(4)} (8; j_{2}; j_{1}; x); α; δ α]

comprise eight

T D

-dimensional vectors, as shown in their respective definitions; thus, each of the vectors

B_{V}^{(4)} [8; U^{(4)} (8; j_{2}; j_{1}; x); V^{(4)} (8; j_{2}; j_{1}; x); α; δ α]

,

V^{(4)} (8; j_{2}; j_{1}; x)

,

Q_{V}^{(4)} [8; j_{1}; U^{(3)} (4; j_{1}; x); α; δ α]

comprise

(8 \times T D)

components/elements.

Solving the 4th-LVSS would require

T P^{4}

large-scale computations, which is unrealistic for large-scale systems comprising many parameters. The 4th-CASAM-N circumvents the need for solving the 3rd-LVSS by deriving an alternative expression for the indirect-effect term defined in Equation (114), in which the function

V^{(4)} (8; j_{2}; j_{1}; x)

is replaced by a fourth-level adjoint function which is independent of parameter variations. This fourth-level adjoint function will be the solution of a fourth-level adjoint sensitivity system (4th-LASS) which will be constructed by applying the same principles as those used for constructing the 1st-LASS, the 2nd-LASS, and the 3rd-LASS. The Hilbert space appropriate for constructing the 4th-LASS will be denoted as

H_{4} (Ω_{x})

and comprises block vectors elements of the same form as

V^{(4)} (8; j_{2}; j_{1}; x)

. Thus, a generic block vector in

H_{4} (Ω_{x})

will have the structure

Ψ^{(4)} (8; x) ≜ {[ψ^{(4)} (1; x), \dots, ψ^{(4)} (8; x)]}^{†} \in H_{4} (Ω_{x})

, comprising eight

T D

-dimensional vectors of the form

ψ^{(4)} (i; x) ≜ {[ψ_{1}^{(4)} (i; x), \dots, ψ_{T D}^{(4)} (i; x)]}^{†} \in H_{1} (Ω_{x})

,

i = 1, \dots, 8

. The inner product of two vectors

Ψ^{(4)} (8; x) \in H_{4} (Ω_{x})

and

Φ^{(4)} (8; x) \in H_{4} (Ω_{x})

in the Hilbert space

H_{4} (Ω_{x})

will be denoted as

{〈Ψ^{(4)} (8; x), Φ^{(4)} (8; x)〉}_{4}

and defined as follows:

{〈Ψ^{(4)} (8; x), Φ^{(4)} (8; x)〉}_{4} ≜ \sum_{i = 1}^{8} {〈ψ^{(3)} (i; x), φ^{(3)} (i; x)〉}_{1}

(130)

The inner product defined in Equation (130) is continuous in

α

, in a neighborhood around

α^{0}

. Using the definition of the inner product defined in Equation (130), construct the inner product of Equation (119) with a vector

A^{(4)} (8; x) ≜ {[a^{(4)} (1; x), \dots, a^{(4)} (8; x)]}^{†} \in H_{4} (Ω_{x})

to obtain the following relation:

\begin{array}{l} {\{{〈A^{(4)} (8; x), V M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α] V^{(4)} (8; j_{2}; j_{1}; x)〉}_{3}\}}_{α^{0}} \\ = {\{{〈A^{(4)} (8; x), Q_{V}^{(4)} [8; j_{1}; U^{(4)} (8; j_{2}; j_{1}; x); α; δ α]〉}_{3}\}}_{α^{0}}, x \in Ω_{x} . \end{array}

(131)

The inner product on the left side of Equation (131) is further transformed by using the definition of the adjoint operator to obtain the following relation:

\begin{array}{l} {\{{〈A^{(4)} (8; x), V M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α] V^{(4)} (8; j_{2}; j_{1}; x)〉}_{3}\}}_{α^{0}} = \\ {\{{〈V^{(4)} (8; j_{2}; j_{1}; x), A M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α] A^{(4)} (8; x)〉}_{3}\}}_{α^{0}} \\ + {\{{[P^{(4)} (U^{(4)}; A^{(4)}; V^{(4)}; α)]}_{\partial Ω_{x}}\}}_{α^{0}}, \end{array}

(132)

where

\begin{array}{l} A M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α] ≜ {[V M^{(4)} (8 \times 8; U^{(4)}; α)]}^{*} \\ = (\begin{matrix} {\{{[V M^{(3)} (4 \times 4)]}^{*}\}}^{†} & {\{{[V M_{21}^{(4)} (4 \times 4)]}^{*}\}}^{†} \\ 0 [4 \times 4] & {\{{[V M_{22}^{(3)} (4 \times 4)]}^{*}\}}^{†} \end{matrix}), \end{array}

(133)

and where

{\{{[P^{(4)} (U^{(4)}; A^{(4)}; V^{(4)}; α)]}_{\partial Ω_{x}}\}}_{α^{0}}

denotes the corresponding bilinear concomitant on the domain’s boundary, evaluated at the nominal values for the parameters and respective state functions. The adjoint matrix

A M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α]

comprises

(8 \times 8) T D^{2}

components/elements, while the adjoint function

A^{(4)} (8; x) \in H_{4} (Ω_{x})

comprises

(8 \times T D)

components/elements.

The domain of the adjoint matrix operator

A M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α]

is specified by requiring that the function

A^{(4)} (8; x) ≜ {[a^{(4)} (1; x), \dots, a^{(4)} (8; x)]}^{†} \in H_{4} (Ω_{x})

satisfies adjoint boundary/initial conditions denoted as follows:

{\{B_{A}^{(4)} [8; U^{(4)} (8; j_{2}; j_{1}; x); A^{(4)} (8; x); α]\}}_{α^{0}} = 0 [8], x \in \partial Ω_{x} (α^{0}) .

(134)

The fourth-level adjoint boundary/initial conditions represented by Equation (134) are determined by requiring that: (a) they must be independent of unknown values of

V^{(4)} (8; j_{2}; j_{1}; x)

; and (b) the substitution of the boundary and/or initial conditions represented by Equations (85) and (98) into the expression of

{\{{[P^{(4)} (U^{(4)}; A^{(4)}; V^{(4)}; α)]}_{\partial Ω_{x}}\}}_{α^{0}}

must cause all terms containing unknown values of

V^{(4)} (8; j_{2}; j_{1}; x)

to vanish.

Implementing the boundary/initial conditions represented by Equations (128) and (134) into Equation (132) will transform the later relation into the following form:

\begin{array}{l} {\{{〈V^{(4)} (8; j_{2}; j_{1}; x), A M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α] A^{(4)} (8; x)〉}_{3}\}}_{α^{0}} \\ = - {\{{[{\hat{P}}^{(4)} (U^{(4)}; A^{(4)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}} \\ + {\{{〈A^{(4)} (8; x), V M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α] V^{(4)} (8; j_{2}; j_{1}; x)〉}_{3}\}}_{α^{0}}, \end{array}

(135)

where

{\{{[{\hat{P}}^{(4)} (U^{(4)}; A^{(4)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}}

denotes residual boundary terms which may have not vanish automatically. The right side of Equation (131) is now used in the first term on the right side of Equation (135) to obtain the following relation:

\begin{array}{l} {\{{〈V^{(4)} (8; j_{2}; j_{1}; x), A M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α] A^{(4)} (8; x)〉}_{3}\}}_{α^{0}} = \\ {\{{〈A^{(4)} (8; x), Q_{V}^{(4)} [8; j_{1}; U^{(4)} (8; j_{2}; j_{1}; x); α; δ α]〉}_{3}\}}_{α^{0}} - {\{{[{\hat{P}}^{(4)} (U^{(4)}; A^{(4)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}} . \end{array}

(136)

The definition of the fourth-level adjoint function

A^{(4)} (8; x) ≜ {[a^{(1)} (1; x), \dots, a^{(4)} (8; x)]}^{†} \in H_{4} (Ω_{x})

is now completed by requiring that the left side of Equation (136) and the right side of Equation (114) represent the “indirect-effect term”

{\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α; V^{(3)}; δ A^{(3)}]\}}_{i n d}

, for each of the indices

j_{1} = 1, \dots, T P

;

j_{2} = 1, ...., j_{1}

;

j_{3} = 1, ...., j_{2}

. Hence, there will be

T P (T P + 1) (T P + 2) / 6

distinct fourth-level adjoint functions

A^{(4)} (8; x) \in H_{4} (Ω_{x})

, each corresponding to one combination of the indices

j_{1} = 1, \dots, T P

;

j_{2} = 1, ...., j_{1}

;

j_{3} = 1, ...., j_{2}

. Each of these distinct fourth-level adjoint functions will correspond to a specific

(j_{1}, j_{2}, j_{3})

-dependent indirect-effect term.

The left side of Equation (136) will be identical to the right side of Equation (114) by requiring that the following relation be satisfied by the fourth-level adjoint functions

A^{(4)} (8; j_{3}; j_{2}; j_{1}; x) ≜ {[a^{(4)} (1; j_{3}; j_{2}; j_{1}; x), \dots, a^{(4)} (8; j_{3}; j_{2}; j_{1}; x)]}^{†} \in H_{4} (Ω_{x})

, for each value of the indices

j_{1} = 1, \dots, T P

;

j_{2} = 1, ...., j_{1}

;

j_{3} = 1, ...., j_{2}

:

\begin{array}{l} A M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α] A^{(4)} (8; j_{3}; j_{2}; j_{1}; x) \\ = Q_{A}^{(4)} [8; j_{3}; j_{2}; j_{1}; U^{(4)} (4; j_{1}; x); α], \end{array}

(137)

where

\begin{array}{l} Q_{A}^{(4)} [8; j_{3}; j_{2}; j_{1}; U^{(4)} (4; j_{1}; x); α] ≜ \{q_{A}^{(4)} [1; j_{3}; j_{2}; j_{1}; U^{(4)} (4; j_{1}; x); α], \dots \\ {\dots, q_{A}^{(4)} [8; j_{3}; j_{2}; j_{1}; U^{(4)} (4; j_{1}; x); α]\}}^{†}, j_{1} = 1, \dots, T P; j_{2} = 1, \dots, j_{1}; j_{3} = 1, \dots, j_{2}; \end{array}

(138)

q_{A}^{(4)} [1; j_{3}; j_{2}; j_{1}; U^{(4)} (4; j_{1}; x); α] ≜ \partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α] / \partial u (x);

(139)

q_{A}^{(4)} [2; j_{3}; j_{2}; j_{1}; U^{(4)} (4; j_{1}; x); α] ≜ \partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α] / \partial a^{(1)} (x);

(140)

q_{A}^{(4)} [2 + i; j_{3}; j_{2}; j_{1}; U^{(4)} (4; j_{1}; x); α] ≜ \frac{\partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α]}{\partial a^{(2)} (i; j_{1}; x)}; i = 1, 2;

(141)

q_{A}^{(4)} [4 + i; j_{3}; j_{2}; j_{1}; U^{(4)} (4; j_{1}; x); α] ≜ \frac{\partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)}; A^{(3)}; α]}{\partial a^{(3)} (i; j_{2}; j_{1}; x)}; i = 1, 2, 3, 4;

(142)

The boundary conditions to be satisfied by each of the fourth-level adjoint functions

A^{(4)} (8; j_{3}; j_{2}; j_{1}; x) ≜ {[a^{(4)} (1; j_{3}; j_{2}; j_{1}; x), \dots, a^{(4)} (8; j_{3}; j_{2}; j_{1}; x)]}^{†} \in H_{4} (Ω_{x})

are those represented by Equation (134), namely

\begin{array}{l} {\{B_{A}^{(4)} [8; U^{(4)} (8; j_{2}; j_{1}; x); A^{(4)} (8; j_{3}; j_{2}; j_{1}; x); α]\}}_{α^{0}} = 0 [8], x \in \partial Ω_{x} (α^{0}); \\ j_{1} = 1, \dots, T P; j_{2} = 1, \dots, j_{1}; j_{3} = 1, \dots, j_{2} . \end{array}

(143)

Since the source

Q_{A}^{(4)} [8; j_{3}; j_{2}; j_{1}; U^{(4)} (4; j_{1}; x); α]

may contain distributions, the equality in Equation (137) is considered to hold in the weak sense, and the Riesz representation theorem ensures that this weak equality holds uniquely.

The system of represented by Equations (101) and (107) will be called the fourth-level adjoint sensitivity system (4th-LASS); its solution,

A^{(4)} (8; j_{3}; j_{2}; j_{1}; x)

, will be called the fourth-level adjoint function. Using the underlying the 4th-LASS and Equation (136) in Equation (114) yields the following expression for the indirect-effect term:

\begin{array}{l} {\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α; V^{(3)} (4; j_{1}; x); δ A^{(3)} (4; j_{2}; j_{1}; x)]\}}_{i n d} \\ ≜ {\{{〈A^{(4)} (8; x), Q_{V}^{(4)} [8; j_{1}; U^{(4)} (8; j_{2}; j_{1}; x); α; δ α]〉}_{3}\}}_{α^{0}} - {\{{[{\hat{P}}^{(4)} (U^{(4)}; A^{(4)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}} \\ \equiv {\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(4)} (8; j_{2}; j_{1}; x); A^{(4)} (8; j_{3}; j_{2}; j_{1}; x); α]\}}_{i n d} . \end{array}

(144)

As the identity in Equation (144) indicates, the dependence of the indirect-effect term on the function

V^{(4)} (8; j_{2}; j_{1}; x)

is replaced by the dependence on the adjoint function

A^{(4)} (8; j_{3}; j_{2}; j_{1}; x)

for each

j_{1} = 1, \dots, T P;

j_{2} = 1, \dots, j_{1}

;

j_{3} = 1, \dots, j_{2}

.

Replacing the expression obtained in Equation (144) for the indirect-effect term together with the expression for the direct-effect term provided in Equation (113) yields the following expression for the total differential defined by Equation (112):

\begin{array}{l} {\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(4)} (8; j_{2}; j_{1}; x); A^{(4)} (8; j_{3}; j_{2}; j_{1}; x); δ α]\}}_{α^{0}} \\ = {\{\frac{\partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α]}{\partial α} δ α\}}_{α^{0}} - {\{{[{\hat{P}}^{(4)} (U^{(4)}; A^{(4)}; δ α)]}_{\partial Ω_{x}}\}}_{α^{0}} \\ + {\{{〈A^{(4)} (8; j_{3}; j_{2}; j_{1}; x), Q_{V}^{(4)} [8; j_{1}; U^{(4)} (8; j_{2}; j_{1}; x); α; δ α]〉}_{3}\}}_{α^{0}} \end{array}

(145)

In component form, the total differential expressed by Equation (145) can be written in the following form, for each

j_{1} = 1, \dots, T P;

j_{2} = 1, \dots, j_{1}

;

j_{3} = 1, \dots, j_{2}

:

\begin{array}{l} {\{δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(4)} (8; j_{2}; j_{1}; x); A^{(4)} (8; j_{3}; j_{2}; j_{1}; x); δ α]\}}_{α^{0}} \\ = \sum_{j_{4} = 1}^{T P} {\{R^{(4)} [j_{4}; j_{3}; j_{2}; j_{1}; U^{(4)} (8; j_{2}; j_{1}; x); A^{(4)} (8; j_{3}; j_{2}; j_{1}; x); α]\}}_{α^{0}} δ α_{j_{4}}, \end{array}

(146)

where the quantity

R^{(4)} [j_{4}; j_{3}; j_{2}; j_{1}; U^{(4)} (8; j_{2}; j_{1}; x); A^{(4)} (8; j_{3}; j_{2}; j_{1}; x); α]

denotes the fourth-order sensitivity of the generic scalar-valued response

R [u (x); α]

with respect to any four model parameters

α_{j_{1}}

,

α_{j_{2}}

,

α_{j_{3}}

,

α_{j_{3}}

, and has the following expression:

\begin{array}{l} R^{(4)} [j_{4}; j_{3}; j_{2}; j_{1}; U^{(4)} (8; j_{2}; j_{1}; x); A^{(4)} (8; j_{3}; j_{2}; j_{1}; x); α] \\ ≜ \frac{\partial R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(3)} (4; j_{1}; x); A^{(3)} (4; j_{2}; j_{1}; x); α]}{\partial α_{j_{4}}} - \frac{{[\partial {\hat{P}}^{(4)} (U^{(4)}; A^{(4)}; δ α)]}_{\partial Ω_{x}}}{\partial α_{j_{4}}} \\ + \sum_{i = 1}^{8} {〈a^{(4)} (i; j_{3}; j_{2}; j_{1}; x), s_{V}^{(3)} [i; j_{4}; j_{2}; j_{1}; U^{(4)} (4; x); α]〉}_{1} \\ \equiv \partial^{4} R [u (x); α] / \partial α_{j_{1}} \partial α_{j_{3}} \partial α_{j_{3}} \partial α_{j_{4}}; j_{1}, j_{2}, j_{3}, j_{4} = 1, \dots, j_{3} = 1, \dots, T P . \end{array}

(147)

As Equations (137) and (143) indicate, solving the 4th-LASS once provides the fourth-level adjoint function

A^{(4)} (8; j_{3}; j_{2}; j_{1}; x)

for each of the indices

j_{1} = 1, \dots, T P;

j_{2} = 1, \dots, j_{1}

, and

j_{3} = 1, \dots, j_{2}

. In turn, the availability of

A^{(4)} (8; j_{3}; j_{2}; j_{1}; x)

enables the exact and efficient computation of the G differential

δ R^{(3)} [j_{3}; j_{2}; j_{1}; U^{(4)} (8; j_{2}; j_{1}; x); A^{(4)} (8; j_{3}; j_{2}; j_{1}; x); δ α]

. Thus, the exact computation of all of the partial fourth-order sensitivities,

\partial^{4} R [u (x); α] / \partial α_{j_{1}} \partial α_{j_{3}} \partial α_{j_{3}} \partial α_{j_{4}}

requires at most

T P (T P + 1) (T P + 2) / 6

large-scale (adjoint) computations using the 4th-LASS, rather than at least

O (T P^{4})

large-scale computations, which would be required by forward methods.

The adjoint matrix

A M^{(4)} [8 \times 8; U^{(4)} (8; j_{2}; j_{1}; x); α]

is block diagonal; therefore, solving the 4th-LASS is equivalent to solving four 1st-LASS, with different source terms. The 4th-LASS was designated as the “fourth-level” rather than “fourth-order” adjoint sensitivity system since the 3rd-LASS does not involve any explicit second-order, third-order, and/or fourth-order G derivatives of the operators underlying the original system but involves the inversion of the operators similar to those that needed to be inverted for solving the 1st-LASS.

By solving the 4th-LASS

T P (T P + 1) (T P + 2) / 6

times, the fourth-order mixed sensitivities

\partial^{4} R [u (x); α] / \partial α_{j_{1}} \partial α_{j_{3}} \partial α_{j_{3}} \partial α_{j_{4}}

will be computed four times, in four different ways using distinct adjoint functions. Consequently, the multiple symmetries intrinsic to the fourth-order sensitivities provide an intrinsic numerical verification that the components of the first-, second-, third-, and fourth-level adjoint functions are computed accurately.

The structure of the 4th-LASS enables full flexibility for prioritizing the computation of the third-order sensitivities. The computation of the fourth-order sensitivities would be prioritized based on the relative magnitudes of the third-order sensitivities, so that the unimportant fourth-order sensitivities can be deliberately neglected while knowing the error incurred by neglecting them.

4. Discussion

This work has presented the “fourth-order comprehensive sensitivity analysis methodology for nonlinear systems (abbreviated as “4th-CASAM-N”), which enables the hitherto very difficult, if not intractable, exact computation of all of the first-, second-, third-, and fourth-order response sensitivities for large-scale nonlinear systems involving many parameters. The qualifier “comprehensive” indicates that the fourth-CASAM-N methodology enables the exact and efficient computation not only of response sensitivities with respect to the customary model parameters (including computational input data, correlations, initial and/or boundary conditions) but also with respect to imprecisely known material boundaries, which would be caused by manufacturing tolerances.

It has been shown that the first-order sensitivities of the system response under consideration with respect to the model parameters (including boundary and initial conditions) can be computed inexpensively, using just quadrature formulas, after having obtained the first-level adjoint state function. The first-level adjoint state function is obtained by solving once the first-level adjoint sensitivity system (1st-LASS), which is the sole large-scale computation needed for obtaining all of the first-order sensitivities. This is because the 1st-LASS is independent of parameter variations, and therefore needs to be solved only once, regardless of the number of model parameters (denoted as “

T P

”) under consideration. Furthermore, solving the 1st-LASS requires less computational effort than solving the nonlinear underlying the original models, since the 1st-LASS is linear in the first-level adjoint function.

The second-order sensitivities which correspond to each first-order sensitivity are also computed by using inexpensive quadrature formulas, after having obtained the second-level adjoint state function. The computation of the second-level adjoint state function requires solving the second-level adjoint sensitivity system (2nd-LASS). Since the 2nd-LASS is block diagonal, solving it is equivalent to solving two 1st-LASS, with two different source terms. Thus, the “solvers” and the computer program used for solving the 1st-LASS can also be used for solving the 2nd-LASS. The 2nd-LASS was designated as the “second-level” rather than the “second-order” adjoint sensitivity system, since the 2nd-LASS does not involve any second-order G derivatives of the operators underlying the original system but involves the inversion of the same operators that need to be inverted to solve the 1st-LASS. The structure of the 2nd-LASS enables full flexibility for prioritizing the computation of the second-order sensitivities. The computation of the second-order sensitivities would logically be prioritized based on the relative magnitudes of the first-order sensitivities: the largest relative first-order response sensitivity should have the highest priority for computing the corresponding second-order mixed sensitivities; then, the second largest relative first-order response sensitivity should be considered next, and so on. The unimportant second-order sensitivities can be deliberately neglected while knowing the error incurred by neglecting them. Computing second-order sensitivities that correspond to vanishing first-order sensitivities may also be of interest, since vanishing first-order sensitivities may indicate critical points of the response in the phase space of model parameters. If the 2nd-LASS is solved

T P

times, the second-order mixed sensitivities will be computed twice, in two different ways, in terms of two distinct second-level adjoint functions. Consequently, the symmetry property enjoyed by the second-order sensitivities provides an intrinsic (numerical) verification that the components of the first- and second-level adjoint functions are computed accurately.

The exact computation of all of the partial third-order sensitivities that correspond to a second-order sensitivities is also accomplished by using quadrature formulas after having obtained the third-level adjoint function. Thus, for each second-order sensitivity, the third-level adjoint function is obtained by solving the third-level adjoint sensitivity system (3rd-LASS) once, which is equivalent to solving three 1st-LASS, with different source terms. The 3rd-LASS is designated as “the third-level” rather than the “third-order” adjoint sensitivity system since the 3rd-LASS does not involve any explicit second-order and/or third-order G derivatives of the operators underlying the original system. By solving the 3rd-LASS

T P (T P + 1) / 2

times, the third-order mixed sensitivities will be computed three times, in three different ways. Consequently, the multiple symmetries intrinsic to the third-order sensitivities provide an intrinsic numerical verification that the components of the first-, second- and third-level adjoint functions are computed accurately. The structure of the 3rd-LASS enables full flexibility for prioritizing the computation of the third-order sensitivities. The computation of the third-order sensitivities would logically be prioritized based on the relative magnitudes of the second-order sensitivities, so that the unimportant third-order sensitivities can be deliberately neglected while knowing the error incurred by neglecting them.

The exact computation of all of the partial fourth-order sensitivities that correspond to a third-order sensitivity also involves the use of quadrature formulas after having obtained the fourth-level adjoint function by solving the fourth-level adjoint sensitivity system (4th-LASS). The 4th-LASS is designated as the “fourth-level” rather than “fourth-order” adjoint sensitivity system since the 3rd-LASS does not involve any explicit second-order, third-order, and/or fourth-order G derivatives of the operators underlying the original system but involves the inversion of the operators similar to those that needed to be inverted for solving the 1st-LASS. The 4th-LASS is block diagonal; solving it is equivalent to solving four 1st-LASS, with different source terms. If the 4th-LASS is solved

T P (T P + 1) (T P + 2) / 6

times, the fourth-order mixed-response sensitivities will be computed four times, in four different ways using distinct adjoint functions. Consequently, the multiple symmetries intrinsic to the fourth-order sensitivities provide an intrinsic numerical verification that the components of the first-, second-, third-, and fourth-level adjoint functions are computed accurately. The structure of the 4th-LASS enables full flexibility for prioritizing the computation of the third-order sensitivities. The computation of the fourth-order sensitivities would be prioritized based on the relative magnitudes of the third-order sensitivities, so that the unimportant fourth-order sensitivities can be deliberately neglected while knowing the error incurred by neglecting them.

In summary, the implementation of the 4th-CASAM-N requires very little additional effort beyond the construction of the adjoint sensitivity system needed for computing the first-order sensitivities. An illustrative application of the 4th-CASAM-N to a paradigm nonlinear heat conduction is presented in the accompanying work [17].

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

The author declares no conflict of interest.

References

Cacuci, D.G. Sensitivity Theory for Nonlinear Systems: I. Nonlinear Functional Analysis Approach. J. Math. Phys. 1981, 22, 2794–2802. [Google Scholar] [CrossRef]
Cacuci, D.G. Sensitivity Theory for Nonlinear Systems: II. Extensions to Additional Classes of Responses. J. Math. Phys. 1981, 22, 2803–2812. [Google Scholar] [CrossRef]
Práger, T.; Kelemen, F.D. Adjoint Methods and their Application in Earth Sciences, Chapter 4A. In Advanced Numerical Methods for Complex Environmental Models: Needs and Availability; Faragó, I., Havasi, Á., Zlatev, Z., Eds.; Bentham Science Publishers: Sharjah, United Arab Emirates, 2014; pp. 203–275. [Google Scholar]
Daescu, D.N.; Navon, I.M. Sensitivity Analysis in Nonlinear Variational Data Assimilation: Theoretical Aspects and Applications, Chapter 4B. In Advanced Numerical Methods for Complex Environmental Models: Needs and Availability; Faragó, I., Havasi, Á., Zlatev, Z., Eds.; Bentham Science Publishers: Sharjah, United Arab Emirates, 2014; pp. 276–300. [Google Scholar]
Cacuci, D.G.; Navon, M.I.; Ionescu-Bujor, M. Computational Methods for Data Evaluation and Assimilation; Chapman & Hall/CRC: Boca Raton, FL, USA, 2014. [Google Scholar]
Cacuci, D.G. Sensitivity and Uncertainty Analysis: Theory; Chapman & Hall/CRC: Boca Raton, FL, USA, 2003; Volume 1. [Google Scholar]
Cacuci, D.G.; Ionescu-Bujor, M.; Navon, M.I. Sensitivity and Uncertainty Analysis: Applications to Large Scale Systems; Chapman & Hall/CRC: Boca Raton, FL, USA, 2005; Volume 2. [Google Scholar]
Cacuci, D.G. Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) for Computing Exactly and Efficiently First- and Second-Order Sensitivities in Large-Scale Linear Systems: I. Computational Methodology. J. Comp. Phys. 2015, 284, 687–699. [Google Scholar] [CrossRef] [Green Version]
Cacuci, D.G. The Second-Order Adjoint Sensitivity Analysis Methodology; Taylor & Francis/CRC Press: Boca Raton, FL, USA, 2018; 305p. [Google Scholar]
Cacuci, D.G. Towards Overcoming the Curse of Dimensionality: The Third-Order Adjoint Method for Sensitivity Analysis of Response-Coupled Linear Forward/Adjoint Systems, with Applications to Uncertainty Quantification and Predictive Modeling. Energies 2019, 12, 4216. [Google Scholar] [CrossRef] [Green Version]
Cacuci, D.G. Fourth-Order Comprehensive Adjoint Sensitivity Analysis (4th-CASAM) of Response-Coupled Linear Forward/Adjoint Systems. I. Theoretical Framework. Energies 2021, 14, 3335. [Google Scholar] [CrossRef]
Cacuci, D.G. The nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Response-Coupled Forward/Adjoint Linear Systems (nth-CASAM-L): I. Mathematical Framework. Energies 2021, 14, 8314. [Google Scholar] [CrossRef]
Fang, R.; Cacuci, D.G. Fourth-Order Adjoint Sensitivity and Uncertainty Analysis of an OECD/NEA Reactor Physics Benchmark: I. Computed Sensitivities. J. Nucl. Energy 2021, 2, 281–308. [Google Scholar] [CrossRef]
Cacuci, D.G. Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) for Large-Scale Nonlinear Systems: I. Theory. Nucl. Sci. Eng. 2016, 184, 16–30. [Google Scholar] [CrossRef]
Cacuci, D.G. On the Need to Determine Accurately the Impact of Higher-Order Sensitivities on Model Sensitivity Analysis, Uncertainty Quantification and Best-Estimate Predictions. Energies 2021, 14, 6318. [Google Scholar] [CrossRef]
Cacuci, D.G. First-Order Comprehensive Adjoint Sensitivity Analysis Methodology (1st-CASAM) for Computing Efficiently the Exact Response Sensitivities for Physical Systems with Imprecisely Known Boundaries and Parameters: General Theory and Illustrative Paradigm Applications. Ann. Nucl. Energy 2021, 151, 107913. [Google Scholar]
Cacuci, D.G. Fourth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (4th-CASAM-N): II. Illustrative Application to a Nonlinear Hear Conduction Model. Fluids, 2021; submitted. [Google Scholar]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Cacuci, D.G. Fourth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (4th-CASAM-N): I. Mathematical Framework. J. Nucl. Eng. 2022, 3, 37-71. https://doi.org/10.3390/jne3010004

AMA Style

Cacuci DG. Fourth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (4th-CASAM-N): I. Mathematical Framework. Journal of Nuclear Engineering. 2022; 3(1):37-71. https://doi.org/10.3390/jne3010004

Chicago/Turabian Style

Cacuci, Dan Gabriel. 2022. "Fourth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (4th-CASAM-N): I. Mathematical Framework" Journal of Nuclear Engineering 3, no. 1: 37-71. https://doi.org/10.3390/jne3010004

Article Menu

Fourth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (4th-CASAM-N): I. Mathematical Framework

Abstract

1. Introduction

2. Generic Mathematical Modeling of a Nonlinear Physical System

3. The Fourth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (4th-CASAM-N)

3.1. The First-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (1st-CASAM-N)

3.2. The Second-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (2nd-CASAM-N)

3.3. The Third-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (3rd-CASAM-N)

3.4. The Fourth-Order Comprehensive Adjoint Sensitivity Analysis Methodology for Nonlinear Systems (4th-CASAM-N)

4. Discussion

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI