A D-Optimal Sequential Calibration Design for Computer Models

Diao, Huaimin; Wang, Yan; Wang, Dianpeng

doi:10.3390/math10091375


by Huaimin Diao ¹, Yan Wang ² and Dianpeng Wang ¹,*

¹ School of Mathematics and Statistics, Beijing Institute of Technology, Beijing 100081, China

² School of Statistics and Data Science, Beijing University of Technology, Beijing 100124, China

* Author to whom correspondence should be addressed.

Mathematics 2022, 10(9), 1375; https://doi.org/10.3390/math10091375

Submission received: 20 March 2022 / Revised: 15 April 2022 / Accepted: 18 April 2022 / Published: 20 April 2022

(This article belongs to the Special Issue Optimal Experimental Design and Statistical Modeling)


Abstract:

Calibrating a computer model by tuning its parameters is an important problem in many engineering and scientific applications. Although several methods have been established to estimate the calibration parameters, research on experimental designs for estimating these parameters remains limited. This paper therefore proposes a sequential computer experiment design based on the D-optimal criterion, which efficiently tunes the calibration parameters while improving the prediction ability of the calibrated computer model. Numerical comparisons on simulated and real data demonstrate the efficiency of the proposed technique.

Keywords:

calibration; computer models; Fisher information; sequential D-optimal; surrogate model

MSC:

62K05; 62L05

1. Introduction

Experiments are conducted to explore or optimize physical phenomena. In some applications, such as national defense, medicine, and manufacturing, physical experiments may be difficult to conduct because of economic, technical, or ethical limitations. To reduce the experimental cost, mathematical models, also called computer models, are developed to mimic, understand, and predict the physical phenomena in many applications [1,2,3]. Computer models are useful and efficient only if they approximate the physical process well. Oftentimes, a computer model contains a set of calibration parameters, which are physically unobservable variables, and its fidelity to the physical process relies on the unknown values of these parameters. Physical data and computer model outputs are therefore combined to estimate the calibration parameters such that the computer model matches the physical process. This procedure is referred to as computer model calibration in the literature.

Numerous models have been proposed in the literature for the problem of computer model calibration, such as [2,4,5,6,7]. Among them, the Kennedy and O’Hagan model is the most commonly used. It integrates the physical data and computer model outputs through a Bayesian framework. Any posterior quantity can serve as the point estimate of the calibration parameter, depending on the loss function specified. In practice, the most commonly used predictor of the physical process is a calibrated computer model; see [8,9]. Because the nonlinear effects from the discrepancy function can be hard to interpret and may open up the possibility of overfitting with limited physical observations, Ref. [10] pointed out that an interpretable calibration parameter should allow the computer model to predict the real physical phenomena well even without the discrepancy function.

How to perform the experiments efficiently so that the calibration parameters are tuned accurately under a given metric is an important question. Although we are not the first to look into the problem of design for calibration, it has not received enough attention. Based on the Kennedy and O’Hagan model, some designs have been proposed in the literature. Ref. [11] employed the Kullback–Leibler (KL) divergence criterion as a function of the computer model inputs and obtained the estimate of calibration parameters by minimizing this criterion. Ref. [12] focused on the problem of functional calibration by generating sequential designs for the physical and computer experiments. Ref. [13] used results from nonlinear optimal design theory to design such experiments. Ref. [14], based on the Kennedy and O’Hagan model, proposed an optimal sequential design for both computer and physical experiments using the integrated mean squared prediction error. Based on the Bayesian model calibration framework of [6], a D-optimal design for the physical experiment was proposed by [15]. Ref. [16] proposed a follow-up optimal experiment design for computer model calibration. In some applications, however, no physical experiments can be conducted after the initial design because of practical limitations, so designs that require further physical experiments are impracticable. Ref. [17] presented an adaptive design for computer experiments to estimate the calibration parameters by using the expected improvement (EI) algorithm. It aims at reducing the calibration error induced by the uncertainty of the emulator of the computer model, but not at improving the estimation of the calibration parameters. Inspired by this, we focus on designs that consider only computer experiments to estimate the calibration parameters. The D-optimal criterion, which is well known and widely used in the literature, helps gather more information about the calibration parameters by minimizing the asymptotic variance of their estimate. This paper proposes a sequential computer experiment calibration design using the D-optimal criterion and presents a fast algorithm to generate the designs.

The article is organized as follows. Section 2 reviews the Kennedy and O’Hagan calibration method. Section 3 presents the proposed local D-optimal sequential design and suggests a fast algorithm for generating the corresponding designs. Section 4 reports simulation studies and a real data analysis that demonstrate the performance of the proposed design. Conclusions and remarks are given in Section 5. Appendix A shows the derivation of the Fisher information matrix (FIM) for the calibration parameters.

2. Calibration of Computer Models

An important reference for computer calibration is the work of [6]. In this section, we will review some related background about the Kennedy and O’Hagan model. Let

y^{p}

be the observation of physical process and

x = (x_{1}, \dots, x_{r}) \in X \subset R^{r}

be the control variables, which are also the set of inputs for physical process. According to [6], the physical observation

y^{p}

can be modeled as

\begin{matrix} y^{p} (x) = y^{c} (x, θ) + δ (x) + ϵ, \end{matrix}

(1)

where

y^{c} (\cdot, \cdot)

is a computer model,

θ \in T \subset R^{h}

is the set of calibration parameters,

δ (\cdot)

is the discrepancy function which is independent of the computer model

y^{c} (\cdot, \cdot)

;

ϵ \sim N (0, λ^{2})

is the observation error and

λ^{2}

is the corresponding variance. In the literature, the most popular methods to fit the computer model

y^{c} (\cdot, \cdot)

and discrepancy function

δ (\cdot)

are the Gaussian processes due to analytical tractability. Thus, the prior information about both

y^{c} (\cdot, \cdot)

and

δ (\cdot)

is considered as

\begin{matrix} y^{c} (\cdot, \cdot) \sim GP (m (\cdot, \cdot), c_{1} \{(\cdot, \cdot), (\cdot, \cdot)\}), \\ δ (\cdot) \sim GP (0, c_{2} (\cdot, \cdot)), \end{matrix}

(2)

where

m (\cdot, \cdot)

is the mean function of computer model. Assume

x

and

x^{'}

denote the values of control inputs, and

t

and

t^{'}

denote the values of calibration inputs. According to the literature [18,19],

c_{1} \{(\cdot, \cdot), (\cdot, \cdot)\}

and

c_{2} (\cdot, \cdot)

are usually the corresponding separable covariance functions such as

\begin{matrix} c_{1} \{(x, t), (x^{'}, t^{'})\} = σ_{1}^{2} \exp \{- {(x - x^{'})}^{T} Ω_{x}^{- 1} (x - x^{'})\} \exp \{- {(t - t^{'})}^{T} Ω_{θ}^{- 1} (t - t^{'})\}, \\ c_{2} (x, x^{'}) = σ_{2}^{2} \exp \{- {(x - x^{'})}^{T} Ω_{δ}^{- 1} (x - x^{'})\} . \end{matrix}

(3)

Here,

σ_{1}^{2}

and

σ_{2}^{2}

are the variance parameters,

\begin{matrix} Ω_{x} = \mathrm{diag} \{ω_{x}^{1}, ω_{x}^{2}, \dots, ω_{x}^{r}\}, \\ Ω_{θ} = \mathrm{diag} \{ω_{θ}^{1}, ω_{θ}^{2}, \dots, ω_{θ}^{h}\}, \end{matrix}

and

\begin{matrix} Ω_{δ} = \mathrm{diag} \{ω_{δ}^{1}, ω_{δ}^{2}, \dots, ω_{δ}^{r}\} . \end{matrix}
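The separable covariance functions in (3) can be evaluated with a few lines of code. The following is a minimal sketch, assuming that each diagonal matrix Ω is passed as a plain list of its diagonal entries; the function names and the numerical values are illustrative and do not come from the article.

```python
import math

def sq_dist(u, v, omega):
    """Scaled squared distance (u - v)^T diag(omega)^{-1} (u - v)."""
    return sum((a - b) ** 2 / w for a, b, w in zip(u, v, omega))

def c1(x, t, x2, t2, sigma1_sq, omega_x, omega_theta):
    """Separable covariance of the computer model, first line of (3)."""
    return (sigma1_sq
            * math.exp(-sq_dist(x, x2, omega_x))
            * math.exp(-sq_dist(t, t2, omega_theta)))

def c2(x, x2, sigma2_sq, omega_delta):
    """Covariance of the discrepancy function, second line of (3)."""
    return sigma2_sq * math.exp(-sq_dist(x, x2, omega_delta))

# Illustrative values only (assumed, not taken from the article).
same = c1([0.2], [1.0], [0.2], [1.0], 2.0, [0.5], [0.5])  # identical inputs
far = c1([0.2], [1.0], [0.9], [2.0], 2.0, [0.5], [0.5])   # distant inputs
```

At identical inputs the covariance equals the variance parameter, and it decays towards zero as the inputs move apart, at a rate governed by the diagonal entries of Ω.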

In terms of the mean function

m (\cdot, \cdot)

, the linear model structure is always considered, i.e.,

\begin{matrix} m (x, t) = h {(x, t)}^{T} β, \end{matrix}

(4)

where

h (x, t) = {(h_{1} (x, t), h_{2} (x, t), \dots, h_{p} (x, t))}^{T}

is a vector of p known functions over

X

and

β = {(β_{1}, β_{2}, \dots, β_{p})}^{T}

is the corresponding vector of unknown regression coefficients.

Let

D^{p} = \{x_{1}^{p}, x_{2}^{p}, \dots, x_{q}^{p}\}

be the design for the physical experiment with q points,

y^{p} = {(y_{1}^{p}, y_{2}^{p}, \dots, y_{q}^{p})}^{T}

be the corresponding physical outputs,

D^{c} = \{(x_{1}^{c}, t_{1}^{c}), (x_{2}^{c}, t_{2}^{c}), \dots, (x_{n}^{c}, t_{n}^{c})\}

be the design for the computer experiment with n points, and

y^{c} = {(y_{1}^{c}, y_{2}^{c}, \dots, y_{n}^{c})}^{T}

be the corresponding computer outputs. Thus, the full output

d = {(y^{c T}, y^{p T})}^{T}

is normally distributed given

(θ, β, σ_{1}^{2}, σ_{2}^{2}, λ^{2}, Ω_{x}, Ω_{θ}, Ω_{δ})

, and the corresponding likelihood function can be obtained. To express the mean and variance matrix of the full output clearly, we define the following notation. Let

ψ_{1} = (σ_{1}^{2}, Ω_{x}, Ω_{θ}), ψ_{2} = (σ_{2}^{2}, Ω_{δ})

,

φ = (λ^{2}, ψ_{1}, ψ_{2})

and

D_{θ}^{p} = \{(x_{1}^{p}, θ), (x_{2}^{p}, θ), \dots, (x_{q}^{p}, θ)\}

be the augmented design points by calibration parameters

θ

. Then the mean and variance matrix for full output vector

d

given

(θ, β, φ)

can be derived as

\begin{matrix} E (d | θ, β, φ) = H (θ) β, \end{matrix}

(5)

and

\begin{matrix} \mathrm{var} (d | θ, β, φ) = \begin{pmatrix} V_{1} (D^{c}) & C_{1} (D^{c}, D_{θ}^{p}) \\ {C_{1} (D^{c}, D_{θ}^{p})}^{T} & λ^{2} I_{q} + V_{1} (D_{θ}^{p}) + V_{2} (D^{p}) \end{pmatrix}, \end{matrix}

(6)

where

V_{1} (D^{c})

is the variance matrix of

y^{c}

with

(i, j)

element

c_{1} \{(x_{i}^{c}, t_{i}^{c}), (x_{j}^{c}, t_{j}^{c})\},

i, j \in \{1, 2, \dots, n\}

;

C_{1} (D^{c}, D_{θ}^{p})

is the matrix with

(i, j)

element

c_{1} \{(x_{i}^{c}, t_{i}^{c}), (x_{j}^{p}, θ)\},

i \in \{1, \dots, n\}, j \in \{1, \dots, q\}

;

V_{1} (D_{θ}^{p})

and

V_{2} (D^{p})

are defined similarly to

V_{1} (D^{c})

; and

I_{q}

is a

q \times q

identity matrix. Then the posterior for the parameters given the data can be written as

\begin{matrix} p (θ, β, φ | d, D^{c}, D_{θ}^{p}) \propto p (d | θ, β, φ) π (θ, β, φ), \end{matrix}

(7)

where

π (θ, β, φ)

is the prior for the unknown parameters. MCMC techniques are usually used to sample from the posterior distribution, but they are computationally demanding. To simplify the computations, we adopt the modularization approach of [20]: the emulator of the computer model is estimated first, and the discrepancy is estimated afterwards.
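The block structure of the variance matrix in (6) can be made concrete with a short sketch. It assumes the Gaussian covariance functions in (3), adds the observation-error variance λ² on the diagonal of the physical block, and stores matrices as nested lists; all names and numerical values below are hypothetical and chosen only for illustration.

```python
import math

def gauss_kernel(u, v, scale):
    """exp{-(u - v)^T diag(scale)^{-1} (u - v)} for two input vectors."""
    return math.exp(-sum((a - b) ** 2 / s for a, b, s in zip(u, v, scale)))

def joint_cov(Dc, Dp, theta, s1, s2, lam_sq, om_xt, om_d):
    """Assemble var(d | theta, beta, phi) from (6) as a nested list.

    Dc: list of (x, t) computer runs; Dp: list of physical inputs x.
    om_xt concatenates the diagonals of Omega_x and Omega_theta, so one
    kernel call reproduces the product of the two exponentials in c_1.
    """
    comp = [list(x) + list(t) for x, t in Dc]      # points of D^c
    phys = [list(x) + list(theta) for x in Dp]     # points of D^p_theta
    pts = comp + phys
    n = len(comp)
    V = [[s1 * gauss_kernel(a, b, om_xt) for b in pts] for a in pts]
    for i, xi in enumerate(Dp):
        for j, xj in enumerate(Dp):
            V[n + i][n + j] += s2 * gauss_kernel(xi, xj, om_d)  # V_2(D^p)
        V[n + i][n + i] += lam_sq                               # lambda^2 I_q
    return V

# Two computer runs and two physical runs (assumed toy values).
Dc = [([0.0], [1.0]), ([1.0], [2.0])]
Dp = [[0.0], [0.5]]
V = joint_cov(Dc, Dp, theta=[1.5], s1=1.0, s2=0.5, lam_sq=0.01,
              om_xt=[1.0, 1.0], om_d=[1.0])
```

The calibration parameter θ enters only through the rows and columns belonging to the physical runs, which is why the derivatives of this matrix with respect to θ carry information about θ.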

The modular approach of [20] proceeds as follows. The maximum likelihood estimates (MLEs)

\hat{β}

of

β

and

{\hat{ψ}}_{1}

of

ψ_{1}

can be obtained based on the computer experimental data

(D^{c}, y^{c})

. For the calibration parameter

θ

, which is a tuning parameter, there is no “true value”. The goal of calibration is to find out some type of best-fitting value of

θ

. It is easy to obtain the least-squares estimate of

θ

, i.e., the value

\tilde{θ}

that minimizes the differences between the physical outputs and the computer model outputs, with

β

and

ψ_{1}

fixed at their MLEs. The bias data

(D^{p}, y^{p} - {\hat{y}}^{c} (D_{\tilde{θ}}^{p}))

can be employed to compute the MLEs

{\hat{ψ}}_{2}

of

ψ_{2}

and

\hat{λ}

of

λ

, where

{\hat{y}}^{c} (D_{\tilde{θ}}^{p})

is the prediction from the surrogate model. Then, considering the MLEs

\hat{β}

and

\hat{φ}

as fixed values, the posterior mean

\hat{θ}

of the calibration parameters

θ

is deduced according to the prior information and observation data. As pointed out by [8], despite the maximum likelihood estimate plug-in being only approximately Bayesian, the resulting answers seem to be close to those from a full Bayesian analysis.

3. Sequential D-Optimal Design for Calibration Parameters

How to design the physical and computer experiments efficiently is critical to calibrating the computer models. Usually, the design for the physical and computer experiments is conducted separately in practice. A space-filling design is oftentimes used as the initial design for computer experiments, and some uniform designs or factorial designs are employed for physical experiments. Due to the limitation of physical experiments, after the initial designs, only computer experiments are considered to improve the estimation of calibration parameters sequentially. Here, we propose a sequential D-optimal design for computer experiments, which improves the estimation by maximizing the determinant of the FIM.

3.1. D-Optimal Criterion

Following the model presented in Section 2, full output vector

d

is normally distributed given

(θ, β, φ)

. Thus, the FIM can be derived as

\begin{matrix} I (θ) = \int - (\frac{\partial^{2}}{\partial θ^{2}} \ln p (d | θ, β, φ)) p (d | θ, β, φ) \, \mathrm{d} d, \end{matrix}

(8)

where

p (d | θ, β, φ)

is the conditional distribution density function, also known as the likelihood function. Similar to [15,21], the formula of FIM of the calibration parameter is presented in the following lemma. The process of the derivation is shown in Appendix A.

Lemma 1.

Let

d | θ, β, φ

be distributed as

GP (H (θ) β, \mathrm{var} (d | θ, β, φ))

. Then, the

(i, j)

th element of the FIM is

\begin{matrix} I_{i j} (θ) = \frac{\partial {(H (θ) β)}^{T}}{\partial θ_{i}} {(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial (H (θ) β)}{\partial θ_{j}} \\ + \frac{1}{2} \mathrm{tr} [{(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial \mathrm{var} (d | θ, β, φ)}{\partial θ_{j}} {(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial \mathrm{var} (d | θ, β, φ)}{\partial θ_{i}}], \end{matrix}

(9)

where

\mathrm{tr} (\cdot)

is the trace of the matrix, and

i, j \in \{1, \dots, h\}

.

The corresponding D-optimal design is generated by maximizing

\begin{matrix} D_{θ} = \underset{D \in X}{\arg \max} \log | I (θ) |, \end{matrix}

(10)

which is similar to [22,23], where

X

denotes the experimental space, and

| \cdot |

is the determinant of a matrix. The inverse of FIM is the asymptotic variance matrix of the estimate of calibration parameters. Thus, maximizing (10) is equivalent to minimizing the volume of the confidence ellipsoid of the estimate, which can help improve the estimation. Obviously, it is not an easy task to create a D-optimal design by maximizing (10) directly. Here, we utilize the one-point-at-a-time strategy, i.e., add the computer design points by using the D-optimal criterion sequentially, which is referred to as sequential D-optimal design hereafter in this paper.
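To illustrate how the criterion in (10) ranks designs, the sketch below computes log |I(θ)| for a symmetric positive definite matrix through a Cholesky factorization and compares two candidate information matrices. The matrices are made-up numbers rather than FIMs computed from a real model, and the helper name is hypothetical.

```python
import math

def log_det_spd(A):
    """log|A| for a symmetric positive definite matrix via Cholesky."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = A[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                if s <= 0.0:
                    raise ValueError("matrix is not positive definite")
                L[i][j] = math.sqrt(s)
            else:
                L[i][j] = s / L[j][j]
    # |A| = prod(L_ii)^2, hence log|A| = 2 * sum(log L_ii).
    return 2.0 * sum(math.log(L[i][i]) for i in range(n))

# Two hypothetical 2 x 2 information matrices for competing designs.
fim_a = [[4.0, 1.0], [1.0, 3.0]]   # determinant 11
fim_b = [[2.0, 0.0], [0.0, 2.0]]   # determinant 4
best = max([fim_a, fim_b], key=log_det_spd)
```

Because the volume of the asymptotic confidence ellipsoid is proportional to |I(θ)|^{-1/2}, the design with the larger log-determinant (here `fim_a`) corresponds to the smaller ellipsoid.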

3.2. Algorithm for Generating Sequential D-Optimal Design

Let

D^{p}

and

D^{c}

be the initial designs for physical and computer experiments, respectively. In the process of the

i

th iteration, let

D^{s} = \{(x_{1}^{s}, {\hat{θ}}_{0}), (x_{2}^{s}, {\hat{θ}}_{1}), \dots, (x_{i}^{s}, {\hat{θ}}_{i - 1})\}

be the computer experimental points generated sequentially, and

y^{s} = \{y_{1}^{s}, y_{2}^{s}, \dots, y_{i}^{s}\}

be the corresponding computer outputs. Then the estimate

{\hat{θ}}_{i}

can be obtained based on the current full data

\{(D^{c}, y^{c}), (D^{s}, y^{s}), (D^{p}, y^{p})\}

. For a fair comparison, the stopping rule is set as ‘

i \geq N

’, where N is the prefixed number of the sequential design points. Then, the next computer experiment point is selected by maximizing

\begin{matrix} x_{i + 1}^{s} = \underset{x \in X}{\arg \max} \log {| I (θ) |}_{θ = {\hat{θ}}_{i}, β = \hat{β}, φ = \hat{φ}} . \end{matrix}

(11)

Evaluate the computer model at

(x_{i + 1}^{s}, {\hat{θ}}_{i})

and denote the output as

y_{i + 1}^{s}

. The sequential design is augmented as

D^{s} = D^{s} \cup \{(x_{i + 1}^{s}, {\hat{θ}}_{i})\}

, and the corresponding computer outputs are augmented as

y^{s} = y^{s} \cup \{y_{i + 1}^{s}\}

. The value

{\hat{θ}}_{N}

is used as the final estimate of calibration parameter

θ

. Finding the next computer experiment point by maximizing (11) is an r-dimensional optimization problem, which is challenging. To overcome this difficulty, discrete optimization is commonly performed. For example, [16] used an algorithm (e.g., [24]) based on a fine grid of the r-dimensional input space, and [17] used a greedy scheme in which the calibration parameter estimate was taken as the value that maximized the EI criterion over a grid. However, a greedy search over a fine grid may be time-consuming. We therefore employ the following procedure to search for the next design point. Let

C

be a space-filling design with k points over computer experimental space

X

and

f_{{\hat{θ}}_{i}} (c_{l}) = \log {| I (θ) |}_{θ = {\hat{θ}}_{i}, x = c_{l}}

be the corresponding logarithm value of the determinant of FIM at

c_{l}

and

{\hat{θ}}_{i}

. It is generally reasonable that

k = 500

for

r < 5

and

k = 10^{r}

or

20^{r}

for

r \geq 5

, which is similar to [25]. In the numerical simulation studies and the real data analysis in Section 4, we have

r \leq 3

, so

k = 500

is a reasonable choice. Based on

C

and the corresponding evaluations, we can calculate

f_{{\hat{θ}}_{i}} (\cdot)

. Then, the next computer experiment point is selected by maximizing

\begin{matrix} x_{i + 1}^{s} = \underset{x \in C}{\arg \max} f_{{\hat{θ}}_{i}} (x) . \end{matrix}

(12)

The details about the algorithm to create the sequential D-optimal design are presented as Algorithm 1.

Algorithm 1: Algorithm for Generating Sequential D-optimal Design

Input: Physical observation data {

D^{p} = (x_{1}^{p}, \dots, x_{q}^{p}), y^{p}

}, initial computer experiment data {

D^{c} = ((x_{1}^{c}, t_{1}^{c}), \dots, (x_{n}^{c}, t_{n}^{c})), y^{c}

}.

Output: N-run sequential D-optimal design

[Algorithm 1 listing: provided as an image in the original article]
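The one-point-at-a-time procedure described in Section 3.2 can be sketched as a short loop over a fixed candidate set. This is an illustrative reading of the text, not the authors' listing: `criterion`, `estimate_theta`, and `run_model` are hypothetical callables supplied by the user, and the toy versions below (a distance-based criterion and a constant estimate) only stand in for the FIM-based criterion and the calibration step.

```python
def sequential_design(candidates, criterion, estimate_theta, run_model, N):
    """Add N computer runs one at a time over a candidate set C.

    criterion(c, theta, xs, ts) plays the role of f_theta(c) = log|I(theta)|;
    estimate_theta(xs, ts, ys) returns the current estimate of theta;
    run_model(x, theta) evaluates the computer model.
    """
    xs, ts, ys = [], [], []
    theta = estimate_theta(xs, ts, ys)        # estimate from the initial data
    for _ in range(N):                        # stopping rule: N added points
        x_next = max(candidates, key=lambda c: criterion(c, theta, xs, ts))
        xs.append(x_next)
        ts.append(theta)                      # new run uses the current estimate
        ys.append(run_model(x_next, theta))
        theta = estimate_theta(xs, ts, ys)    # re-estimate after each run
    return list(zip(xs, ts)), ys, theta

# Toy stand-ins (assumptions, for illustration only).
def toy_criterion(c, theta, xs, ts):
    return min((abs(c - x) for x in xs), default=1.0)

def toy_estimate(xs, ts, ys):
    return 1.5

def toy_model(x, theta):
    return theta * x

design, outputs, theta_final = sequential_design(
    [0.0, 0.25, 0.5, 0.75, 1.0], toy_criterion, toy_estimate, toy_model, N=3)
```

Evaluating the criterion only on the candidate set keeps the cost of each iteration at k criterion evaluations, which is the motivation given in the text for avoiding a fine-grid search.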

4. Simulation Studies

In this section, we investigate the performance of the newly proposed sequential D-optimal design using simulation studies. Two numerical simulation examples and one real data analysis are used to compare the proposed design with the EI [17] and IMSPE [16] designs. To make the comparison fair, the IMSPE design is implemented for the computer design only. To evaluate the performance of the designs, the following three statistical metrics are considered:

Mean square error (MSE);
Mean prediction discrepancy (MPD);
Mean square prediction error (MSPE).

The MSE is used to demonstrate the effectiveness of the estimate of the calibration parameters, which is defined as

\begin{matrix} M S E_{l} = \frac{1}{M} \sum_{m = 1}^{M} \| {\hat{θ}}_{l m} - θ^{*} \|^{2}, \text{ for } l = 1, \dots, N, \end{matrix}

where

\| \cdot \|

denotes the corresponding Euclidean distance. Ref. [26] proved that under certain conditions, the Kennedy–O’Hagan calibration estimator converges to the minimizer of the norm of the residual function

\hat{δ} (x)

in the reproducing kernel Hilbert space. As a result, we assume that

θ^{*} = \arg \min \| \hat{δ} (x) \|_{N}

is the best value of calibration parameter. For more details about the reproducing kernel Hilbert space

N

, please refer to [26,27].

{\hat{θ}}_{l m}

is the estimate of the calibration parameters at the

m

th replication after l sequential design points, and M is the number of simulation replications. The MPD is used to assess the predictive performance of the calibrated computer models and is defined as

\begin{matrix} M P D_{l} = \frac{1}{M} \sum_{m = 1}^{M} \| {\hat{y}}^{c} (D^{p}, {\hat{θ}}_{l m}) - y^{p} (D^{p}) \|^{2}, \text{ for } l = 1, \dots, N, \end{matrix}

where

{\hat{y}}^{c} (D^{p}, {\hat{θ}}_{l m})

is the prediction of the computer model with

{\hat{θ}}_{l m}

being the estimate of the calibration parameters at the

m

th replication with l sequential design points. The MSPE is used to assess the accuracy of the predictions by combining the calibrated computer model and discrepancy, which is defined as

\begin{matrix} M S P E_{l} = \frac{1}{M} \sum_{m = 1}^{M} \| {\hat{y}}^{p} (D^{p}, {\hat{θ}}_{l m}) - y^{p} (D^{p}) \|^{2}, \text{ for } l = 1, \dots, N, \end{matrix}

where

{\hat{y}}^{p} (D^{p}, {\hat{θ}}_{l m}) = {\hat{y}}^{c} (D^{p}, {\hat{θ}}_{l m}) + \hat{δ} (D^{p})

is the prediction associated with the physical experiments.
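The three metrics share the same structure, an average over replications of a squared Euclidean norm, so they can be computed with two small helpers. This is a minimal sketch that assumes a single vector of physical outputs shared by all replications; the function names and the numbers are illustrative only.

```python
def mse(theta_hats, theta_star):
    """Average squared Euclidean distance between the estimates and theta*."""
    return sum(sum((a - b) ** 2 for a, b in zip(th, theta_star))
               for th in theta_hats) / len(theta_hats)

def mean_sq_norm(preds, y_phys):
    """Average over replications of ||prediction - y^p(D^p)||^2.

    With calibrated computer-model predictions this gives the MPD; with
    computer-model plus estimated discrepancy predictions, the MSPE.
    """
    return sum(sum((p - y) ** 2 for p, y in zip(pred, y_phys))
               for pred in preds) / len(preds)

# Two replications (M = 2) with made-up numbers.
mse_value = mse([[1.4], [1.7]], [1.5])
mpd_value = mean_sq_norm([[1.0, 2.0], [0.0, 2.0]], [1.0, 1.0])
```

In the article these quantities are tracked for each l = 1, ..., N, that is, after each sequentially added design point.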

4.1. Case Study I

In this section, an example with one calibration parameter and one control variable is considered, which is formulated as

\begin{matrix} y^{p} (x) = \sin (θ x) + \exp (- 2 | x |) + δ (x) + ϵ, \end{matrix}

where

ϵ \sim N (0, 0.01)

,

δ (x) \equiv 0

,

x \in [- 5, 5]

and

θ \in [0, 3]

. The best value of the calibration parameter is

θ^{*} = 1.5

in this case. A constant mean

β

and a product-form Matérn correlation function with

ν = \frac{1}{2}

are selected as the prior for

y^{c} (x, θ)

. For the calibration parameter, we select the prior of

θ

to be

θ \sim N (2, 0.01)

. The size of the physical experiment design (uniform design) is set as

q = 10

. An MmLHD in

[- 5, 5] \times [0, 3]

with 10 points is generated for the initial computer experiment design, i.e.,

n = 10

, and the

N = 30

points are to be added sequentially according to Algorithm 1. A total of 100 simulations are performed to calculate the metrics of performance. The results of

M S E_{l}

are shown in Figure 1. As computer experimental points are added sequentially, the estimate of the calibration parameter approaches the best value, and the proposed method outperforms the other two designs. The results of

M P D_{l}

are shown in Figure 2, which shares a similar trend with

M S E_{l}

. The comparison between the original computer model and the calibrated computer model is shown in Figure 3. We can see that the calibrated computer models are closer to the physical observations. As the number of sequential computer experiments increases, the differences between the computer model and physical observations decrease, and the proposed method performs better than the other two designs.
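The data-generating setup of this case study can be reproduced in a few lines. The sketch below reads the first two terms of the physical process as the computer model, sets the discrepancy to zero, and draws noise with variance 0.01. The equally spaced physical inputs and the random seed are assumptions made for illustration, since the article only states that a uniform design with q = 10 points is used.

```python
import math
import random

def computer_model(x, theta):
    """Computer model of Case Study I: sin(theta * x) + exp(-2|x|)."""
    return math.sin(theta * x) + math.exp(-2.0 * abs(x))

def simulate_physical(xs, theta_star=1.5, noise_var=0.01, seed=0):
    """Physical observations with delta(x) = 0 and eps ~ N(0, noise_var)."""
    rng = random.Random(seed)
    sd = math.sqrt(noise_var)
    return [computer_model(x, theta_star) + rng.gauss(0.0, sd) for x in xs]

q = 10
# Equally spaced points on [-5, 5] as a stand-in for the uniform design.
x_phys = [-5.0 + 10.0 * i / (q - 1) for i in range(q)]
y_phys = simulate_physical(x_phys)
```

Repeating the simulation with different seeds gives the replications over which the metrics are averaged.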

The results of

M S P E_{l}

are summarized in Table 1, which also shares a similar trend with

M P D_{l}

and

M S E_{l}

. The performance of prediction by combining calibrated computer models and discrepancy function is shown in Figure 4. In Figure 4, the predictions by using our method approximate the physical observations best.

4.2. Case Study II

In this section, we consider another example, given in [28], which includes three calibration parameters and two control variables. In this case study, a discrepancy between the computer model outputs and the physical outputs is also considered. The physical process is described as

\begin{matrix} y^{p} (x) = 7 {[\sin (2 π θ_{1} - π)]}^{2} + 2 {(2 π θ_{2} - π)}^{2} \sin (2 π x_{1} - π) + 6 θ_{3} (x_{2} - 0.5) + δ (x) + ϵ, \end{matrix}

where

ϵ \sim N (0, 0.01)

,

δ (x) = \cos (2 π x_{1} - π) + 2 ({x_{2}}^{2} - x_{2} + 1 / 6)

,

x = (x_{1}, x_{2}) \in {[0, 1]}^{2}

,

θ = (θ_{1}, θ_{2}, θ_{3}) \in [0, 0.25] \times [0, 0.5] \times [0, 1]

. The best value of the calibration parameter is

θ^{*} = (0.2, 0.3, 0.8)

. Assume that the expectation of

y^{c} (x, θ)

is

β

, and the correlation function of

y^{c} (x, θ)

is Gaussian. The expectation of

δ (x)

is set to be zero, and the correlation function of

δ (x)

is also assumed to be Gaussian. Let the prior for

θ_{1}

,

θ_{2}

, and

θ_{3}

be

U (0, 0.25)

,

U (0, 0.5)

, and

U (0, 1)

, respectively. A total of 500 simulations are performed to assess the performance of the sequential designs. A design with 10 points is employed for the physical experiments, and an MmLHD with

n = 20

points over

{[- 1, 1]}^{3} \times [0, 0.25] \times [0, 0.5]

is utilized as the initial design for computer experiments. In this case, the number of sequential points is set to be

N = 40

. The results of

M S E_{l}

are drawn in Figure 5. As the size of sequential experiment points increases, the calibration parameter approaches the best value. The proposed method has smaller MSEs than the other two methods, which means the proposed method performs better. Figure 6 shows the results of

M P D_{l}

, which also demonstrate the effectiveness of the proposed design for calibration.
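For readers who wish to experiment with this example, the physical process and its discrepancy can be written directly from the formulas above. The sketch treats the θ-dependent part of the physical process as the computer model; the function names are illustrative.

```python
import math

def computer_model(x, theta):
    """Computer-model part of the Case Study II physical process."""
    x1, x2 = x
    t1, t2, t3 = theta
    return (7.0 * math.sin(2.0 * math.pi * t1 - math.pi) ** 2
            + 2.0 * (2.0 * math.pi * t2 - math.pi) ** 2
            * math.sin(2.0 * math.pi * x1 - math.pi)
            + 6.0 * t3 * (x2 - 0.5))

def discrepancy(x):
    """delta(x) = cos(2 pi x1 - pi) + 2 (x2^2 - x2 + 1/6)."""
    x1, x2 = x
    return (math.cos(2.0 * math.pi * x1 - math.pi)
            + 2.0 * (x2 ** 2 - x2 + 1.0 / 6.0))

def physical_mean(x, theta):
    """Noise-free physical response: computer model plus discrepancy."""
    return computer_model(x, theta) + discrepancy(x)

# Noise-free response at the centre of the input space, at the best value.
center_value = physical_mean((0.5, 0.5), (0.2, 0.3, 0.8))
```

Adding independent N(0, 0.01) noise to `physical_mean` gives simulated physical observations.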

The simulation results in Figure 1 and Figure 5 show that, whether or not a discrepancy function is present, the estimate of the calibration parameters gradually approaches its best value as sequential points are added, which suggests that the estimate converges. A rigorous mathematical proof of this convergence is a complicated problem that is not addressed in this paper and is left for future study.

Figure 6 shows that the discrepancy tends to stabilize once 20 points have been added sequentially. As shown in Figure 7, the physical prediction that combines the discrepancy and the calibrated computer model obtained with the proposed sequential D-optimal method is the closest to the physical observations, followed by the IMSPE method and the EI method. The results regarding the MSPE are presented in Table 2 and lead to conclusions similar to those of Case Study I.

4.3. Real Data Analysis

In this section, a real data example with three control inputs and one calibration input is considered, which is presented in [8]. Figure 8 shows a concise description of the resistance spot welding process. Two metal sheets of a particular thickness (thickness) are compressed through two electrodes under a specifically applied load (load). A direct current of a certain magnitude (current) passes through the sheets via the two electrodes, and the heat produced by the current flow causes the welding surfaces to melt. After cooling, a weld nugget with a specific dimension (diameter) is formed, which is of particular interest. In this manner, the two metal plates are welded.

The resistance at the contact surface is particularly critical in determining the magnitude of heat generated. Because the contact resistance at the contact surface is not well understood as a function of temperature, the calibration parameter is specified and adjusted based on the field data. The effect of this calibration parameter on the behavior of the model is a focus in this case. Ref. [8] comprehensively described the inputs for this example. According to the evaluation of the model developer, the verification experiment focuses on three control inputs (thickness, load, and current). Table 3 lists the control and calibration inputs and the corresponding intervals.

A constant mean

β

and the product-form exponential correlation function are considered as the prior for

y^{c} (x, θ)

. As (2) shows, a zero mean and the Gaussian correlation function are considered as the prior for

δ (x)

; more details can be found in [2,6,28,29]. We randomly choose 10 of the 12 non-replicated physical experiments. An initial computer design with 20 points is generated, and the corresponding outputs are simulated according to [8]. Then,

N = 40

points are added sequentially according to Algorithm 1. The MSPE results are summarized in Table 4; the smaller MSPE values illustrate the superiority of the newly proposed method.

5. Conclusions and Remark

To reduce the cost of physical experiments, mathematical models or computer models are used to approximate the real physical process. However, the fidelity of a computer model to the physical process depends on physically unobservable calibration parameters. This paper proposed a D-optimal design that augments computer experiments sequentially, based on the Kennedy and O’Hagan model. Using the D-optimal criterion, computer design points are selected sequentially to gather more information and reduce the uncertainty in the estimate of the calibration parameter. The computer model with the tuned calibration parameter can then mimic the real physical process well. Simulation studies were conducted to assess the performance of the newly proposed method in comparison with the EI and IMSPE methods. The results show that the newly proposed method outperforms the other two methods in terms of

M S E

,

M P D

, and

M S P E

. An analysis based on the real data introduced in [8] also demonstrates the superior performance of the new method.

Author Contributions

Conceptualization, D.W. and Y.W.; writing—original draft preparation, H.D.; writing—review and editing, D.W. and Y.W.; funding acquisition, D.W. and Y.W. All authors have read and agreed to the published version of the manuscript.

Funding

The research of H.D. and D.W. was supported by the National Natural Science Foundation of China (Grant no. NSFC 11801034 and Grant no. NSFC 12171033), and Y.W.’s research was supported by the National Natural Science Foundation of China (12101024) and the Natural Science Foundation of Beijing Municipality (1214019).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

KL	Kullback–Leibler
EI	Expected Improvement
GP	Gaussian Process
MLE	Maximum Likelihood Estimation
FIM	Fisher Information Matrix
MSE	Mean Square Error
MPD	Mean Prediction Discrepancy
MSPE	Mean Square Prediction Error
IMSPE	Integrated Mean Square Prediction Error
MmLHD	Maximin Latin Hypercube Design

Appendix A

Proof of Lemma 1.

Since

d | θ, β, φ

is distributed as

GP (H (θ) β, \mathrm{var} (d | θ, β, φ))

, the likelihood function is

p (d | θ, β, φ) = {(2 π)}^{- \frac{n + q}{2}} {| \mathrm{var} (d | θ, β, φ) |}^{- \frac{1}{2}} \exp [- \frac{1}{2} {(d - H (θ) β)}^{T} {(\mathrm{var} (d | θ, β, φ))}^{- 1} (d - H (θ) β)] .

Let

L = ln (p (d | θ, β, φ))

,

L^{(1)} = \frac{\partial L}{\partial θ}

, and

L^{(2)} = \frac{\partial^{2} L}{\partial θ^{2}}

, then the

i

th element of

L^{(1)}

is

\frac{\partial L}{\partial θ_{i}} = - \frac{1}{2} \mathrm{tr} [{(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial \mathrm{var} (d | θ, β, φ)}{\partial θ_{i}}] - \frac{\partial {(d - H (θ) β)}^{T}}{\partial θ_{i}} {(\mathrm{var} (d | θ, β, φ))}^{- 1} (d - H (θ) β) - \frac{1}{2} {(d - H (θ) β)}^{T} \frac{\partial {(\mathrm{var} (d | θ, β, φ))}^{- 1}}{\partial θ_{i}} (d - H (θ) β),

where

i \in \{1, \dots, h\}

, and the

(i, j)

th element of

L^{(2)}

is

\begin{aligned} \frac{\partial^{2} L}{\partial θ_{i} \partial θ_{j}} = {} & - \frac{1}{2} \mathrm{tr} [- {(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial \mathrm{var} (d | θ, β, φ)}{\partial θ_{j}} {(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial \mathrm{var} (d | θ, β, φ)}{\partial θ_{i}} + {(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial^{2} \mathrm{var} (d | θ, β, φ)}{\partial θ_{i} \partial θ_{j}}] \\ & - \frac{\partial^{2} {(d - H (θ) β)}^{T}}{\partial θ_{i} \partial θ_{j}} {(\mathrm{var} (d | θ, β, φ))}^{- 1} (d - H (θ) β) \\ & - \frac{\partial {(d - H (θ) β)}^{T}}{\partial θ_{i}} {(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial (d - H (θ) β)}{\partial θ_{j}} \\ & - \frac{\partial {(d - H (θ) β)}^{T}}{\partial θ_{i}} \frac{\partial {(\mathrm{var} (d | θ, β, φ))}^{- 1}}{\partial θ_{j}} (d - H (θ) β) \\ & - \frac{\partial {(d - H (θ) β)}^{T}}{\partial θ_{j}} \frac{\partial {(\mathrm{var} (d | θ, β, φ))}^{- 1}}{\partial θ_{i}} (d - H (θ) β) \\ & - \frac{1}{2} {(d - H (θ) β)}^{T} \frac{\partial^{2} {(\mathrm{var} (d | θ, β, φ))}^{- 1}}{\partial θ_{i} \partial θ_{j}} (d - H (θ) β), \end{aligned}

where

i, j \in \{1, \dots, h\}

.

Since the FIM is defined as

I (θ) = \int [- L^{(2)} p (d | θ, β, φ)] \, \mathrm{d} d,

the

(i, j)

th element of

I (θ)

is

\begin{aligned} I_{i j} (θ) = {} & \int [- \frac{\partial^{2} L}{\partial θ_{i} \partial θ_{j}} p (d | θ, β, φ)] \, \mathrm{d} d \\ = {} & \frac{1}{2} \mathrm{tr} [- {(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial \mathrm{var} (d | θ, β, φ)}{\partial θ_{j}} {(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial \mathrm{var} (d | θ, β, φ)}{\partial θ_{i}} + {(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial^{2} \mathrm{var} (d | θ, β, φ)}{\partial θ_{i} \partial θ_{j}}] \\ & + \frac{\partial {(d - H (θ) β)}^{T}}{\partial θ_{i}} {(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial (d - H (θ) β)}{\partial θ_{j}} \\ & + \frac{1}{2} \mathrm{tr} [\mathrm{var} (d | θ, β, φ) \frac{\partial^{2} {(\mathrm{var} (d | θ, β, φ))}^{- 1}}{\partial θ_{i} \partial θ_{j}}] \\ = {} & \frac{\partial {(d - H (θ) β)}^{T}}{\partial θ_{i}} {(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial (d - H (θ) β)}{\partial θ_{j}} \\ & + \frac{1}{2} \mathrm{tr} [{(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial \mathrm{var} (d | θ, β, φ)}{\partial θ_{j}} {(\mathrm{var} (d | θ, β, φ))}^{- 1} \frac{\partial \mathrm{var} (d | θ, β, φ)}{\partial θ_{i}}] . \end{aligned}

□
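As an illustration of the closed form just derived, the following Python sketch evaluates the FIM numerically for a generic Gaussian model whose mean and covariance both depend on $θ$. It is a minimal example under stated assumptions and not the authors' implementation: the names `fisher_information`, `toy_mean`, and `toy_cov` are hypothetical, the derivatives are approximated by central finite differences, and the toy model stands in for $H(θ) β$ and $\mathrm{var}(d | θ, β, φ)$. Because $\partial (d - H(θ) β) / \partial θ_{i} = - \partial (H(θ) β) / \partial θ_{i}$, the sign cancels in the quadratic form and the derivative of the mean can be used directly.

```python
import numpy as np


def fisher_information(mean_fn, cov_fn, theta, eps=1e-6):
    """FIM of d ~ N(mean_fn(theta), cov_fn(theta)) with respect to theta.

    Entry (i, j) is
        dmu_i^T V^{-1} dmu_j + 0.5 * tr(V^{-1} dV_j V^{-1} dV_i),
    where dmu_i and dV_i are derivatives with respect to theta_i,
    approximated here by central finite differences (an assumption of
    this sketch; analytic derivatives could be substituted).
    """
    theta = np.asarray(theta, dtype=float)
    h = theta.size
    cov_inv = np.linalg.inv(cov_fn(theta))

    d_mean, d_cov = [], []
    for i in range(h):
        step = np.zeros(h)
        step[i] = eps
        d_mean.append((mean_fn(theta + step) - mean_fn(theta - step)) / (2 * eps))
        d_cov.append((cov_fn(theta + step) - cov_fn(theta - step)) / (2 * eps))

    fim = np.empty((h, h))
    for i in range(h):
        for j in range(h):
            mean_term = d_mean[i] @ cov_inv @ d_mean[j]
            trace_term = 0.5 * np.trace(cov_inv @ d_cov[j] @ cov_inv @ d_cov[i])
            fim[i, j] = mean_term + trace_term
    return fim


# Hypothetical toy model: theta[0] scales the mean, theta[1] is a variance.
def toy_mean(theta):
    return np.array([theta[0], 2.0 * theta[0]])


def toy_cov(theta):
    return theta[1] * np.eye(2)


fim = fisher_information(toy_mean, toy_cov, [0.5, 2.0])
sign, logdet = np.linalg.slogdet(fim)  # log-determinant used by a D-optimal criterion
print(fim)
print(logdet)
```

For this toy model the FIM can be worked out by hand as $\mathrm{diag}(5 / θ_{2}, 1 / θ_{2}^{2})$, which gives a quick check on the numerical result. A D-optimal criterion would compare candidate design points by the determinant (or log-determinant) of such a matrix.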

References

Hoffman, R.M.; Sudjianto, A.; Du, X.; Stout, J. Robust Piston Design and Optimization Using Piston Secondary Motion Analysis. In SAE 2003 World Congress & Exhibition; SAE International: Warrendale, PA, USA, 2003.
Higdon, D.; Kennedy, M.; Cavendish, J.C.; Cafeo, J.A.; Ryne, R.D. Combining Field Data and Computer Simulations for Calibration and Prediction. SIAM J. Sci. Comput. 2004, 26, 448–466.
Malhotra, R.; Liang, X.; Belytschko, T.; Jian, C. Mechanics of fracture in single point incremental forming. J. Mater. Process. Technol. 2012, 212, 1573–1590.
Cox, D.D.; Park, J.S.; Singer, C.E. A statistical method for tuning a computer code to a data base. Comput. Stat. Data Anal. 2001, 37, 77–92.
Loeppky, J.L.; Bingham, D.; Welch, W.J. Computer Model Calibration or Tuning in Practice; Technical Report; University of British Columbia: Vancouver, BC, Canada, 2006.
Kennedy, M.C.; O’Hagan, A. Bayesian calibration of computer models. J. R. Stat. Soc. Ser. B 2001, 63, 425–464.
Higdon, D.; Gattiker, J.; Williams, B.; Rightley, M. Computer Model Calibration Using High-Dimensional Output. J. Am. Stat. Assoc. 2008, 103, 570–583.
Bayarri, M.J.; Berger, J.O.; Paulo, R.; Sacks, J.; Cafeo, J.A.; Cavendish, J.; Lin, C.H.; Tu, J. A Framework for Validation of Computer Models. Technometrics 2007, 49, 138–154.
Wang, Y.; Yue, X.; Tuo, R.; Hunt, J.H.; Shi, J. Effective model calibration via sensible variable identification and adjustment with application to composite fuselage simulation. Ann. Appl. Stat. 2020, 14, 1759–1776.
Gu, M.; Wang, L. Scaled Gaussian stochastic process for computer model calibration and prediction. SIAM/ASA J. Uncertain. Quantif. 2018, 6, 1555–1583.
Pratola, M.T.; Sain, S.R.; Bingham, D.; Wiltberger, M.; Rigler, E.J. Fast Sequential Computer Model Calibration of Large Nonstationary Spatial-Temporal Processes. Technometrics 2013, 55, 232–242.
Ezzat, A.A.; Pourhabib, A.; Ding, Y. Sequential Design for Functional Calibration of Computer Models. Technometrics 2018, 60, 286–296.
Silvey, S. Optimal Design: An Introduction to the Theory for Parameter Estimation; Chapman and Hall: London, UK, 1980; Volume 1.
Leatherman, E.R.; Dean, A.M.; Santner, T.J. Designing combined physical and computer experiments to maximize prediction accuracy. Comput. Stat. Data Anal. 2017, 113, 346–362.
Krishna, A.; Joseph, V.R.; Shan, B.; Brenneman, W.A.; Myers, W.R. Robust experimental designs for model calibration. J. Qual. Technol. 2021, 1–12.
Ranjan, P.; Lu, W.; Bingham, D.; Reese, S.; Holloway, J.P. Follow-Up Experimental Designs for Computer Models and Physical Processes. J. Stat. Theory Pract. 2011, 5, 119–136.
Damblin, G.; Barbillon, P.; Keller, M.; Pasanisi, A.; Parent, E. Adaptive numerical designs for the calibration of computer codes. SIAM/ASA J. Uncertain. Quantif. 2018, 6, 151–179.
Overstall, A.M.; Woods, D.C. Multivariate emulation of computer simulators: Model selection and diagnostics with application to a humanitarian relief model. J. R. Stat. Soc. 2016, 65, 483–505.
Conti, S.; O’Hagan, A. Bayesian emulation of complex multi-output and dynamic computer models. J. Stat. Plan. Inference 2010, 140, 640–651.
Liu, F.; Bayarri, M.J.; Berger, J.O. Modularization in Bayesian analysis, with emphasis on analysis of computer models. Bayesian Anal. 2009, 4, 119–150.
Boukouvalas, A.; Dan, C.; Stehlik, M. Approximately Optimal Experimental Design for Heteroscedastic Gaussian Process Models; Unassigned Technical Report; Aston University: Birmingham, UK, 2009.
Chaloner, K.; Larntz, K. Optimal Bayesian design applied to logistic regression experiments. J. Stat. Plan. Inference 1989, 21, 191–208.
Kiefer, J.; Wolfowitz, J. The equivalence of two extremum problems. Can. J. Math. 1960, 12, 363–366.
Nguyen, M.N.K. Algorithm: A Fedorov Exchange Algorithm for D-Optimal Design. J. R. Stat. Soc. 1994, 43, 669–677.
Chen, R.; Wang, Y.; Wu, C. Finding optimal points for expensive functions using adaptive RBF-based surrogate model via uncertainty quantification. J. Glob. Optim. 2020, 77, 919–948.
Tuo, R.; Wang, Y.; Wu, C.F.J. On the improved rates of convergence for Matérn-type kernel ridge regression with application to calibration of computer models. SIAM/ASA J. Uncertain. Quantif. 2020, 8, 1522–1547.
Berlinet, A.; Thomas, A. Reproducing Kernel Hilbert Spaces in Probability and Statistics; Springer Science and Business Media: Berlin/Heidelberg, Germany, 2011.
Wong, R.; Storlie, C.B.; Lee, T. A frequentist approach to computer model calibration. J. R. Stat. Soc. 2017, B79, 635–648.
Tuo, R.; Wu, C.F.J. Efficient calibration for imperfect computer models. Ann. Stat. 2015, 43, 2331–2352.

Figure 1. $MSE_{l}$ of $θ$ obtained using the proposed sequential D-optimal method, EI method, and IMSPE method by sequentially adding computer experiment points.

Figure 2. $MPD_{l}$ obtained using the proposed sequential D-optimal method, EI method, and IMSPE method.

Figure 3. Model outputs before and after calibration through the proposed sequential D-optimal method, EI method, and IMSPE method, compared with the physical observations. The left and right panels show the results obtained by sequentially adding 14 and 30 points, respectively.

Figure 4. Physical predictions before and after calibration with the proposed sequential D-optimal method, EI method, and IMSPE method with 30 sequential computer experimental points.

Figure 5. $MSE_{l}$ of $θ$ obtained using the proposed sequential D-optimal method, EI method, and IMSPE method by sequentially adding computer experiment points.

Figure 6. $MPD_{l}$ obtained using the proposed sequential D-optimal method, EI method, and IMSPE method.

Figure 7. Physical predictions before and after calibration using the D-optimal method, EI method, and IMSPE method. The left, middle, and right panels show the results of sequentially adding 20, 25, and 40 points, respectively.

Figure 8. Schematic of the spot-welding process.

Table 1. The results for $MSPE_{l}$.

Number of Sequential Points	Before Calibration	D-Optimal	EI	IMSPE
14	0.5892141	0.1429672	0.5869769	0.4524031
30	0.5892141	0.03625579	0.218899	0.2244712

Table 2. The results for $MSPE_{l}$.

Number of Sequential Points	Before Calibration	D-Optimal	EI	IMSPE
20 points	39.68042	1.187998	8.09301	1.808175
25 points	39.68042	0.7716595	8.97966	1.48396
40 points	39.68042	0.5952004	10.52075	1.336039

Table 3. Control inputs and calibration input.

Load (kN)	Thickness (mm)	Current (kA)	Resistance
$[4.0, 5.3]$	1 or 2	$[21, 26]$ for thickness 1	$[0.8, 8]$
		$[24, 29]$ for thickness 2

Table 4. MSPE before and after calibration.

Number of Sequential Points	Before Calibration	D-Optimal	EI	IMSPE
20 points	1.384987	0.432674	0.9045898	0.4400638
25 points	1.384987	0.3496204	0.8424071	0.3827046
40 points	1.384987	0.3328246	1.041911	0.3804081

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
