Exact Solution for the Production Planning Problem with Several Regimes Switching over an Infinite Horizon Time

Covei, Dragos-Patru

doi:10.3390/math11204307

Open AccessArticle

Exact Solution for the Production Planning Problem with Several Regimes Switching over an Infinite Horizon Time

by

Dragos-Patru Covei

Department of Applied Mathematics, The Bucharest University of Economic Studies, Piata Romana, 1st District, 010374 București, Romania

Mathematics 2023, 11(20), 4307; https://doi.org/10.3390/math11204307

Submission received: 13 September 2023 / Revised: 12 October 2023 / Accepted: 13 October 2023 / Published: 16 October 2023

(This article belongs to the Section Computational and Applied Mathematics)

Download Versions Notes

Abstract

:

We consider a stochastic production planning problem with regime switching. There are

k \geq 1

regimes corresponding to different economic cycles. The problem is to minimize the production costs and analyze the problem by the value function approach. Our main contribution is to show that the optimal production is characterized by an exact solution of an elliptic system of partial differential equations. A verification result is given for the determined solution.

Keywords:

production planning; regime switching; PDE system

MSC:

35B08; 35B09; 35J67; 49L12; 49K15; 60G46

1. Introduction and Proposal of the Paper

We consider a factory producing

N \geq 1

types of economic goods that stores them in an inventory-designated place. The model is described mathematically in the next.

Let

(Ω, F, F

,

P)

be a complete filtered probability space, where P is the historical probability and

F = {F_{t}| t \in [0, \infty)},

is generated by an

R^{N}

-valued Brownian motion denoted by

w = (w_{1}, \dots, w_{N})

with respect to the probability P.

In the production planning problem, the regime switching is captured by a continuous-time homogeneous Markov chain

ε (t)

adapted to

F

that can take k different values, modeling k regimes, which should be noted by

1, 2, \dots, k

. The Markov chain’s rate matrix that denotes the strongly ireductible generator of

ε

, is denoted by

G = {[ϑ_{i j}]}_{k \times k}

where

ϑ_{i i} = - a_{i i} < 0 for all i, ϑ_{i j} = a_{i j} \geq 0 for all i \neq j,

and the diagonal elements

ϑ_{i i}

may be expressed as

ϑ_{i i} = - \underset{j \neq i}{Σ} ϑ_{i j} .

(1)

In this case, if

P_{t} (t) = E [ε (t)] \in R

, then

\frac{d P_{t} (t)}{d t} = G ε (t) .

(2)

Moreover,

ε (t)

s explicitly described by the integral form

ε (t) = ε (0) + \int_{0}^{t} G ε (u) d u + M (t),

(3)

where

M (t)

is a martingale with respect to

F

. Here and hereafter, we use the notation from other papers to keep the applicative character of the problem,

p (t) = (p_{1} (t), \dots, p_{N} (t)),

which represents the production rate at time t (control variable) adjusted for the demand rate.

These adjusted-for-demand inventory levels are modeled by the following system of stochastic differential equations

d y_{i} (t) = p_{i} d t + σ_{ε (t)} d w_{i}, y_{i} (0) = y_{i}^{0} for i = 1, \dots, N,

(4)

where

y_{i} (t)

is an Itô process in

R

(i.e., the inventory level of good i, at times

t,

adjusted for demand),

p_{i}

is the deterministic part,

σ_{ε (t)}

is a random regime-dependent constant (non-zero) diffusion coefficient taking on the values

σ_{1}

,

σ_{2}

, …,

σ_{k}

, and

y_{i}^{0}

is the initial condition (i.e., initial inventory level of goods i).

The stochasticity here is due to demand adjustment, which is random and dependent on the regime. This is the most commonly used process when the demand is more volatile in some periods (e.g., some states of the Markov chain) and less volatile in other periods.

The performance over time of a demand-adjusted production

p (t) = (p_{1} (t), \dots, p_{N} (t)),

is measured by means of its cost. At this point, we introduce the cost functional, which yields the cost

J (p_{1}, \dots, p_{N}) : = E \int_{0}^{\infty} {(| p (t) |}^{2} + {| y (t) |}^{2}) e^{- α_{ε (t)} t} d t, y (t) = (y_{1} (t), \dots, y_{N} (t)),

(5)

which measures the quadratic loss.

We measure deviations from the demand, from what place the loss. Here,

α_{ε (t)}

is a regime-dependent, taking on the values

α_{1} > 0

,

α_{2} > 0

, …,

α_{k} > 0

, constant psychological rate of time discount from what place the exponential discounting.

At the moment, we are ready to frame our objective, which is to minimize the cost functional, i.e.,

inf_{p_{1}, \dots, p_{N}} J (p_{1}, \dots, p_{N}),

(6)

Subject to the Itô Equation (4), the cost functional involves adjusted-for-demand inventory levels y whose dynamic is given by (4), and it depends on the choice of the demand-adjusted production p. Minimizing the cost functional in (6) means selecting the demand-adjusted production p so that it minimizes J (of (5)). Notice that J involves both y and p.

This model problem was proposed by Bensoussan, Sethi, Vickson, and Derzko [1] in the context of no regime switching in the economy and for the case of a factory producing one type of economic goods. Later, many other authors were concerned with regime switching.

In production management, Cadenillas, Lakner, and Pinedo [2] adapted the model problem in [1] to study the optimal production stochastic control planning problem of a company within an economy characterized by two-state regime switching with limited/unlimited information. Later, Dong, Malikopoulos, Djouadi, and Kuruganti [3] applied in civil engineering the model described by [2] to the study of the optimal stochastic control problem for home energy systems with solar and energy storage devices when the demand is subject to Brownian motion; the two switching regimes are the peak and off peak energy demand.

A good deal of attention to this subject has been also devoted by Pirvu and Zhang [4], where the authors studied the effect of high versus low discount rates to a consumption-investment decision problem.

After that, there have been numerous applications of regime switching in many important problems in economics, operations research, actuarial science, finance, reinsurance, and other fields, for example, the portfolio optimization problem in a defaultable market with finitely-many economical regimes is considered by Capponi and Figueroa-López in [5]; the pricing of derivatives using a stochastic discount factor modeled as a regime-switching geometric Brownian motion is discussed by Elliott and Hamada in [6]; the production control in a manufacturing system with multiple machines, which are subject to breakdowns and repairs, is considered by Gharbi and Kenne in [7]; the problem of the pricing of European-style options with switches among a finite number of states is discussed by Yao, Zhang, and Zhou in [8]; and no later, Wang, Chang, and Fang [9] considered the optimal portfolio and consumption rule with a Cox–Ingersoll–Ross (CIR) model in a general utility framework.

There are of course other research studies that may also serve to better explain the importance of regime switching in the real world.

In a precursor to this article, Covei and Pirvu [10] formulate and analyze the production-planning problem in the continuous-time case, with no regime switching in the economy over an infinite time. In [11], the author improved the results of [10], in the sense that the value function in the production model is given in the closed form. Related works that deal with no regime switching in the economy are Sheng-Zhu-Wang [12] and Qin-Bai-Ralescu [13].

Recently, Canepa, Covei, and Pirvu [14] considered the production planning problem with regime switching in the economy over a finite horizon time. Here, the solution is obtained through numerical approaches. However, a closed-form expression for the corresponding case of regime switching on a particular state space consisting of two regimes over an infinite horizon time is available in the paper of [15]. So, at least one question suggested by the paper of [16] has some nice features: can we obtain a closed-form solution when the state space consists of several numbers of states? Our present paper fills the gap in the literature by proving a closed-form solution to the stochastic production planning problem with regime switching in the economy over an infinite horizon in a general state space.

To conclude this introduction, our paper is structured as follows. In Section 2, we give the relationship of our model with a system of partial differential equations (PDE) system. Section 3 presents a closed-form solution and the uniqueness of the solution for our production planning problem. A numerical approximation of the solution for the production planning problem is also given in Section 4. In Section 5, we present a verification result. We introduce in Section 6 the equilibrium production rates as the subgame perfect production rates. They are the output of an interpersonal game between the present self and future selves. The equilibrium production rates are time consistent, meaning there is no incentive to deviate from them. It turns out that in our setting the optimal production rates are among the equilibrium ones so they are time consistent. In Section 7, we give some applications. Finally, in Section 8, we discuss our strategy.

The technique presented in this paper makes a methodological contribution that is of independent interest in other considerable numbers of works on regime switching.

Having presented the model that we want to solve, now we provide our means to tackle it.

2. Reduction of the Model to a PDE System

Our approach is based on the value function and dynamic programming, which leads to the Hamilton–Jacobi–Bellman (HJB) system of equations.

To characterize the value function, we apply the probabilistic approach. We search for functions

V (x, 1)

, …,

V (x, k)

such that the stochastic process

S^{p} (t)

defined below

S^{p} (t) = e^{- α_{ε (t)} t} V (y (t), ε (t)) - \int_{0}^{t} {[| p (s) |}^{2} + {| y (s) |}^{2}] e^{- α_{ε (s)} s} d s,

(7)

is supermartingale for all

p (t) = (p_{1} (t), \dots, p_{N} (t)),

And martingale for the optimal control

p^{*} (t) = (p_{1}^{*} (t), \dots, p_{N}^{*} (t)) .

As shown by [10], if this is achieved, with the following transversality condition

lim_{t \to \infty} E [e^{- α_{ε (t)} t} V (y (t), ε (t))] = 0,

(8)

some estimates on the value function yield that

- V (x, i) = inf J (p_{1}, \dots, p_{N}),

(9)

where

x = (x_{1}, \dots, x_{N}) \in R^{N}

assumes values

(y_{1} (0), \dots, y_{N} (0)) .

Once such a function is found, it turns out that

(u_{1}, \dots, u_{k})

with

u_{1} (x) = - V (x, 1), \dots, u_{k} (x) = - V (x, k),

is the value function. We search for

u_{1}, \dots, u_{k}

, the functions in

C^{2} [0, \infty)

, and the supermartingale/martingale requirement yields by using Itô’s Lemma for Markov-modulated diffusion, the HJB system of equations, which characterizes the value function

- (\begin{matrix} \frac{σ_{1}^{2}}{2} Δ u_{1} \\ \dots \\ \frac{σ_{k}^{2}}{2} Δ u_{k} \end{matrix}) + G_{a, α} (\begin{matrix} u_{1} \\ \dots \\ u_{k} \end{matrix}) - (\begin{matrix} {|x|}^{2} \\ \dots \\ {|x|}^{2} \end{matrix}) = (\begin{matrix} inf_{p} {p \nabla u_{1} + {|p|}^{2}} \\ \dots \\ inf_{p} {p \nabla u_{k} + {|p|}^{2}} \end{matrix}),

(10)

where

G_{a, α} = (\begin{matrix} a_{11} + α_{1} & - a_{12} & \dots & - a_{1 k} \\ - a_{21} & a_{22} + α_{2} & \dots & - a_{2 k} \\ \dots & \dots & \dots & \dots \\ - a_{k 1} & - a_{k 2} & \dots & a_{k k} + α_{k} \end{matrix}) .

For the transformation of the HJB system, it is essential to observe that

inf_{p} {p \nabla u_{i} + {|p|}^{2}} = - \frac{1}{4} {|\nabla u_{i}|}^{2}, i = 1, 2, \dots, k .

(11)

Thus, the HJB system (10) can be written as a PDE system

\{\begin{matrix} - \frac{σ_{1}^{2}}{2} Δ u_{1} + (a_{11} + α_{1}) u_{1} - \sum_{i = 2}^{k} a_{1 i} u_{i} - {|x|}^{2} = - \frac{1}{4} {|\nabla u_{1}|}^{2}, \\ \dots \\ - \frac{σ_{k}^{2}}{2} Δ u_{k} + (a_{k k} + α_{k}) u_{k} - \sum_{i = 1}^{k - 1} a_{k i} u_{i} - {|x|}^{2} = - \frac{1}{4} {|\nabla u_{k}|}^{2} . \end{matrix}

(12)

To perform the verification, i.e., show that the HJB system gives the solution to the optimization problem, one should write (12) with the following boundary condition

u_{1} (x) \to \infty, \dots, u_{k} (x) \to \infty, as | x | \to \infty .

(13)

The value function will give us in turn the candidate optimal control. The first-order optimality conditions on the left-hand side of (11) are sufficient for optimality since we deal with a quadratic (convex) function, and they produce the candidate optimal control as follows:

p_{i}^{*} (t) = {\bar{p}}_{i} (y_{1} (t), \dots, y_{N} (t), ε (t)), i = 1, \dots, N,

and

{\bar{p}}_{i} (x_{1}, \dots, x_{N}, j) = - \frac{1}{2} \frac{\partial u_{j}}{\partial x_{i}} (x_{1}, \dots, x_{N}), for i \in {1, \dots, n}, j \in {1, \dots, k} .

(14)

The production rate

{\bar{p}}_{i}

is allowed to be negative. A negative production rate would correspond to a write-off or disposal of inventory (for example, due to obsolescence or perishability).

Our next goal of this paper is to determine the candidate optimal control in closed form.

3. Closed-Form Solution for the PDE System

In spite of their clear simplicity, the PDE system (12) with boundary conditions (13) presents a host of mathematical difficulties arising from the presence of nonlinear gradient terms

{|\nabla u_{1}|}^{2}

, …,

{|\nabla u_{k}|}^{2}

, see for details [17].

The following result will be proved and is the main original element of the article.

Theorem 1.

Assume that

G_{a, α}

is a positive definite matrix with all elements of

G_{a, α}^{- 1}

positive. Then, the PDE system (12) with boundary condition (13) has a unique radially symmetric convex positive classical solution with quadratic growth.

Proof of Theorem 1.

In the following, we construct the function

(u_{1}, \dots, u_{k}) \in C^{2} [0, \infty) \times \dots \times C^{2} [0, \infty),

which satisfies (12) with boundary condition (13). One way of solving this partial differential equation is to show that there exists

(u_{1} (x), \dots, u_{k} (x)) = (β_{1} {|x|}^{2} + η_{1}, \dots, β_{k} {|x|}^{2} + η_{k}), with β_{1}, \dots, β_{k}, η_{1}, \dots, η_{k} \in (0, \infty),

(15)

that solves (1).

The main task for the proof of existence of (15) is performed by proving that there exists

β_{1}, \dots, β_{k}, η_{1}, \dots, η_{k} \in (0, \infty),

such that

\{\begin{matrix} - \frac{2 β_{1} N σ_{1}^{2}}{2} + (a_{11} + α_{1}) (β_{1} {|x|}^{2} + η_{1}) - \sum_{i = 2}^{k} a_{1 i} (β_{i} {|x|}^{2} + η_{i}) - {|x|}^{2} = - \frac{1}{4} {(2 β_{1} |x|)}^{2}, \\ \dots \\ - \frac{2 β_{k} N σ_{k}^{2}}{2} + (a_{k k} + α_{k}) (β_{k} {|x|}^{2} + η_{k}) - \sum_{i = 1}^{k - 1} a_{k i} (β_{i} {|x|}^{2} + η_{i}) - {|x|}^{2} = - \frac{1}{4} {(2 β_{k} |x|)}^{2}, \end{matrix}

or equivalently, after grouping the terms

\{\begin{matrix} {|x|}^{2} [- \sum_{i = 2}^{k} a_{1 i} β_{i} + (a_{11} + α_{1}) β_{1} + β_{1}^{2} - 1] - β_{1} N σ_{1}^{2} - \sum_{i = 2}^{k} a_{1 i} η_{i} + (a_{11} + α_{1}) η_{1} = 0, \\ \dots \\ {|x|}^{2} [- \sum_{i = 1}^{k - 1} a_{k i} β_{i} + (a_{k k} + α_{k}) β_{k} + β_{k}^{2} - 1] - β_{k} N σ_{k}^{2} - \sum_{i = 1}^{k - 1} a_{k i} η_{i} + (a_{k k} + α_{k}) η_{k} = 0 . \end{matrix}

Now, we consider the system of equations

\{\begin{matrix} - \sum_{i = 2}^{k} a_{1 i} β_{i} + (a_{11} + α_{1}) β_{1} + β_{1}^{2} - 1 = 0 \\ \dots \\ - \sum_{i = 1}^{k - 1} a_{k i} β_{i} + (a_{k k} + α_{k}) β_{k} + β_{k}^{2} - 1 = 0 \\ - β_{1} N σ_{1}^{2} - \sum_{i = 2}^{k} a_{1 i} η_{i} + (a_{11} + α_{1}) η_{1} = 0 \\ \dots \\ - β_{k} N σ_{k}^{2} - \sum_{i = 1}^{k - 1} a_{k i} η_{i} + (a_{k k} + α_{k}) η_{k} = 0 . \end{matrix}

(16)

To solve (16), we can rearrange those equations 1, …, k such

(\begin{matrix} a_{11} + α_{1} & \dots & - a_{1 k} \\ \dots & \dots & \dots \\ - a_{k 1} & \dots & a_{k k} + α_{k} \end{matrix}) (\begin{matrix} β_{1} \\ \dots \\ β_{k} \end{matrix}) = (\begin{matrix} 1 - β_{1}^{2} \\ \dots \\ 1 - β_{k}^{2} \end{matrix}) .

(17)

The arguments in [18,19] say that System (17) has a unique positive solution. In fact, denoting by

\{\begin{matrix} h_{1} (β_{1}, \dots, β_{k}) = - \sum_{i = 2}^{k} a_{1 i} β_{i} + (a_{11} + α_{1}) β_{1} + β_{1}^{2} - 1, \\ \dots \\ h_{k} (β_{1}, \dots, β_{k}) = - \sum_{i = 1}^{k - 1} a_{k i} β_{i} + (a_{k k} + α_{k}) β_{k} + β_{k}^{2} - 1, \end{matrix}

(18)

It happens that

\{\begin{matrix} \frac{\partial h_{1}}{\partial β_{1}} (β_{1}, . ., β_{k}) = (a_{11} + α_{1}) + 2 β_{1} > 0, h_{1} (0, β_{2}, \dots, β_{k}) < 0, lim_{β_{1} \to \infty} h_{1} = \infty, \\ \dots \\ \frac{\partial h_{k}}{\partial β_{k}} (β_{1}, \dots, β_{k}) = (a_{k k} + α_{k}) + 2 β_{k} > 0, h_{k} (β_{1}, \dots, β_{k - 1}, 0) < 0, lim_{β_{k} \to \infty} h_{k} = \infty, \end{matrix}

concluding the arguments in [18,19]. Next, letting

(β_{1}, \dots, β_{k}) \in (0, \infty) \times \dots \times (0, \infty)

a unique solution of (17), we observe that the Equations

k + 1

, …,

2 k

of (16) can be written equivalently as

(\begin{matrix} β_{1} N σ_{1}^{2} \\ \dots \\ β_{k} N σ_{k}^{2} \end{matrix}) = (\begin{matrix} a_{11} + α_{1} & \dots & - a_{1 k} \\ \dots & \dots & \dots \\ - a_{k 1} & \dots & a_{k k} + α_{k} \end{matrix}) (\begin{matrix} η_{1} \\ \dots \\ η_{k} \end{matrix}),

(19)

from where using the fact that

G_{a, α}^{- 1}

has all elements positive, we can see that there exist and are unique

η_{1}

, …,

η_{k} \in (0, \infty)

that solve (16) and then

(u_{1} (x), \dots, u_{k} (x)),

solve (12). This finishes the proof of Theorem 1. □

Because our solution depends on solving a nonlinear algebraic system of equations, the exact solution of the PDE system cannot be determined using a computer software. In order to be implemented, the solution of the PDE system (12) in a software application in the next section, it is necessary to give the numerical approximation of solution to (16), and therefore, the arguments in [18,19] are used again.

4. Numerical Solution of an Algebraic Nonlinear System in Building the Solution for the PDE System

We intend to approximate

β_{1}, \dots, β_{k}, η_{1}, \dots, η_{k} \in (0, \infty)

in (15) by the Newton–Raphson method. To do this, we denote

h_{1} (β_{1}, \dots, β_{k})

, …,

h_{k} (β_{1}, \dots, β_{k})

as in (18) and

J_{(h_{1}, \dots, h_{k})} = (\begin{matrix} a_{11} + α_{1} + 2 β_{1} & \dots & - a_{1 k} \\ \dots & \dots & \dots \\ - a_{k 1} β_{1} & \dots & a_{k k} + α_{k} + 2 β_{k} \end{matrix}),

The Jacobian matrix of (18). For

n = 1, 2, \dots

we find the approximate of the unique parameters

(β_{1}, \dots, β_{k}) \in (0, \infty) \times \dots \times (0, \infty),

In the following way,

(\begin{matrix} β_{1}^{n + 1} \\ \dots \\ β_{k}^{n + 1} \end{matrix}) = (\begin{matrix} β_{1}^{n} \\ \dots \\ β_{k}^{n} \end{matrix}) - {(\begin{matrix} a_{11} + α_{1} + 2 β_{1}^{n} & \dots & - a_{1 k} \\ \dots & \dots & \dots \\ - a_{k 1} & \dots & a_{k k} + α_{k} + 2 β_{k}^{n} \end{matrix})}^{- 1} (\begin{matrix} h_{1} (β_{1}^{n}, \dots, β_{k}^{n}) \\ \dots \\ h_{k} (β_{1}^{n}, \dots, β_{k}^{n}) \end{matrix}),

with

β_{1}^{0}, \dots, β_{k}^{0} \in (0, \infty)

. Clearly

η_{1}

,…,

η_{k} \in (0, \infty)

are easily determined from (19). Some other interesting numerical iterations can be applied in obtaining an optimal numerical solution of (15), which might be efficiently computed with reduced number of iterations and quick CPU time. For example, quasi-Newton variants: the AGD method (see [20]), the SM method (see [21]), or the accelerated double-step-size method (see [22]).

Now, we will move on to the verification result, which is also inspired from [15].

5. Verification

Next, we show that the control of (14) obtained in our reduction strategy is indeed optimal. We apply the supermartingale and martingale approaches.

Repeating the same argument in [14], as the first step, we can show that the stochastic process

S^{p} (t)

defined below

S^{p} (t) = e^{- α_{ε (t)} t} V (y (t), ε (t)) - \int_{0}^{t} {[| p (s) |}^{2} + {| y (s) |}^{2}] e^{- α_{ε (s)} s} d s,

is supermartingale for all

p (t) = (p_{1} (t), \dots, p_{N} (t)),

And martingale for the optimal control

p^{*} (t) = (p_{1}^{*} (t), \dots, p_{N}^{*} (t)) .

Owing to the well-known Itô Lemma for Markov-modulated diffusion (see [8] for more on this), we have

\begin{matrix} d S^{p} (s) & = & e^{- α_{ε (s)} s} [\frac{σ_{ε (s)}^{2}}{2} Δ V (y (s), ε (s)) - {|y (s)|}^{2} + p (s) \nabla V (y (s), ε (s)) \\ - {|p (s)|}^{2} - (α_{ε (s)} + a_{ε (s) ε (s)}) V (y (s), ε (s)) \\ + \sum_{i = 1, i \neq ε (s)}^{k} a_{ε (s) i} V (y (s), i)] d s + d Z (s), \end{matrix}

for some martingale

Z (s)

, and

Z (0) = 0

. Therefore,

\begin{matrix} E S^{p} (t) & = & S^{p} (0) + E [\int_{0}^{t} e^{- α_{ε (s)} s} [\frac{σ_{ε (s)}^{2}}{2} Δ V (y (s), ε (s)) - {|y (s)|}^{2} + p (s) \nabla V (y (s), ε (s))] d s] \\ + E [\int_{0}^{t} e^{- α_{ε (s)} s} [- {|p (s)|}^{2} - (α_{ε (s)} + a_{ε (s) ε (s)}) V (y (s), ε (s))] d s] \\ + E [\int_{0}^{t} e^{- α_{ε (s)} s} [\sum_{i = 1, i \neq ε (s)}^{k} a_{ε (s) i} V (y (s), i)] d s] . \end{matrix}

Then, the claim yields considering HJB Equations (10) and (12), which says that

S^{p} (t)

is martingale for the optimal control and supermartingale otherwise. This last fact combined with the transversality condition yields the claim.

In the second step, let us establish the optimality of

(p_{1}^{*}, \dots, p_{N}^{*})

. Consider the quadratic estimate on the value function

V (x, 1) = - β_{1} {|x|}^{2} - η_{1}, \dots, V (x, k) = - β_{k} {|x|}^{2} - η_{k},

(20)

where

β_{i}

and

η_{i} \in (0, \infty)

are the solutions of (16).

Let us provide a lower-bound estimate for

α_{1}, \dots, α_{k}

so that the transversality condition (8) is met and

lim_{t \to \infty} E [e^{- α_{ϵ (t)} t} | y (t) |^{2}] = 0

holds true. The SDE system (4) in this case becomes

d y_{i} (t) = - β_{ϵ (t)} y_{i} (t) d t + σ_{ε (t)} d W^{i} (t), i = 1, \dots N .

Using Itô’s Lemma, one obtains

\begin{matrix} d {(y_{i} (t))}^{2} & = & 2 y_{i} (t) d y_{i} (t) + d y_{i} (t) d y_{i} (t) \\ = & [- 2 β_{ϵ (t)} {(y_{i} (t))}^{2} + σ_{ϵ (t)}^{2}] d t + 2 y_{i} (t) σ_{ϵ (t)} d W^{i} (t) . \end{matrix}

We introduce

F_{i} (t) = E [{(y_{i} (t))}^{2}] .

By taking expectations in the above equation, we obtain

\begin{matrix} F_{i} (t) & = & E [\int_{0}^{t} [- 2 β_{ϵ (s)} {(y_{i} (s))}^{2} + σ_{ϵ (s)}^{2}] d s + [{(y_{i} (0))}^{2}]] \\ = & E [\int_{0}^{t} [- 2 β_{ϵ (s)} {(y_{i} (s))}^{2} + σ_{ϵ (s)}^{2}] d s] + y_{i}^{2} (0)) . \end{matrix}

Let

D_{2} = max {σ_{1}^{2}, \dots, σ_{k}^{2}}, D_{3} = max ([{(y_{1} (0))}^{2}], \dots, [{(y_{k} (0))}^{2}]) .

Then, in the light of the above equation, we obtain

F_{i} (t) \leq \int_{0}^{t} D_{2} d s + D_{3} .

Hence, we have that

F_{i} (t) \leq D_{2} t + D_{3} .

Therefore, one must choose

α_{1}, \dots, α_{k} \in (0, \infty)

for the transversality condition to hold true, and the proof is completed. Finally, a simple system of nonlinear Equations (16) remains to be solved.

6. The Equilibrium Production

For a production rate

{p_{i} (t)}_{t \geq 0}

and its corresponding inventory level

{y_{i} (t)}_{t \geq 0}

given by (4), we introduce equilibrium production as the subgame perfect production in the definition below (for more on this economic concept see [23]).

Definition 1.

Let

F = (F_{i}, i = 1, \dots N) : R \times {1, 2, \dots k} \to R^{N}

be a vector map such that for any

x > 0

and

i \in {1, 2, \dots, k}

lim inf_{ϵ ↓ 0} \frac{J ({\bar{p}}_{i}) - J (p_{i}^{ϵ})}{ϵ} \leq 0,

(21)

where the subgame perfect production

{\bar{p}}_{i} (s) : = F_{i} ({\bar{y}}_{i} (s), ϵ (s)) .

Here, the process

{{\bar{y}}_{i} (s)}_{s \geq 0}

is the inventory level process corresponding to

{{\bar{p}}_{i} (s)}_{s \geq 0}

. The production rate

{{p^{ϵ}}_{i} (s)}_{s \geq 0}

is defined by

{p^{ϵ}}_{i} (s) = \{\begin{matrix} {\bar{p}}_{i} (s), s \in [0, \infty] ∖ E_{ϵ, 0} \\ p_{i} (s), s \in E_{ϵ, 0}, \end{matrix}

(22)

E_{ϵ, 0} = [0, ϵ];

{p_{i} (s)}_{s \in E_{ϵ, 0}}

is any production rate. If (21) holds true, then

{\bar{p}}_{i} (s),

i = 1 \dots N,

is a subgame perfect production rate.

The equilibrium production is by design time consistent, meaning that they will be implemented at a future date even if the optimization criterion is updated. In some situations, the optimal production may be time inconsistent meaning that they will fail to be implemented in the future because they are not optimal anymore if the optimization criterion is updated; they will be implementable only in the presence of a commitment mechanism, that is why sometimes they are referred to as pre commitment production. Let us remark that, in our setting, the optimal production rate

{\bar{p}}_{i}, i \in {1, \dots, N},

(23)

is a subgame perfect production with

F_{i} (x, j) : = - \frac{1}{2} \frac{\partial u_{j}}{\partial x_{i}} (x),

Since

({\bar{p}}_{i}, i = 1 \dots N) = arg min_{p_{1}, \dots, p_{N}} J (p_{1}, \dots, p_{N})

and thus (21) is automatically satisfied. Therefore, the equilibrium production is time consistent.

7. Applications

We offer some applications, which also are inspired by the paper of Ghosh, Arapostathis, and Marcus [16].

Application 1. Suppose there is one machine producing two products, and let

ε (t)

be the machine state that can take values in two regimes, 1 = good and 2 = bad, i.e., for every

t \in [0, \infty)

, we have

ε (t) \in {1, 2}

. We consider

ε (t)

a continuous-time Markov chain with generator

(\begin{matrix} - \frac{1}{2} & \frac{1}{2} \\ \frac{1}{2} & - \frac{1}{2} \end{matrix}),

And the inventory

y_{i} (t)

, which is governed by the Itô system of stochastic differential Equations (4) with the diffusion

σ_{1} = σ_{2} = \frac{1}{\sqrt{2}}

, and let

α_{1} = α_{2} = \frac{1}{2}

be the discount factor. Under these assumptions, the system (17) becomes

(\begin{matrix} a_{11} + α_{1} & - a_{11} \\ - a_{22} & a_{22} + α_{2} \end{matrix}) (\begin{matrix} β_{1} \\ β_{2} \end{matrix}) = (\begin{matrix} 1 - β_{1}^{2} \\ 1 - β_{2}^{2} \end{matrix}),

or, with our data

\{\begin{matrix} β_{1}^{2} + β_{1} - \frac{1}{2} β_{2} - 1 = 0 \\ β_{2}^{2} - \frac{1}{2} β_{1} + β_{2} - 1 = 0 \end{matrix}

which has a unique positive solution

β_{1} = \frac{1}{4} (\sqrt{17} - 1), β_{2} = \frac{1}{4} (\sqrt{17} - 1) .

On the other hand, System (19) becomes

(\begin{matrix} β_{1} N σ_{1}^{2} \\ β_{2} N σ_{2}^{2} \end{matrix}) = (\begin{matrix} a_{11} + α_{1} & - a_{11} \\ - a_{22} & a_{22} + α_{2} \end{matrix}) (\begin{matrix} η_{1} \\ η_{2} \end{matrix}),

or, with our data

(\begin{matrix} β_{1} \\ β_{2} \end{matrix}) = (\begin{matrix} 1 & - \frac{1}{2} \\ - \frac{1}{2} & 1 \end{matrix}) (\begin{matrix} η_{1} \\ η_{2} \end{matrix}),

which has a unique positive solution

η_{1} = \frac{4}{3} β_{1} + \frac{2}{3} β_{2} = \frac{1}{2} (\sqrt{17} - 1), η_{2} = \frac{2}{3} β_{1} + \frac{4}{3} β_{2} = \frac{1}{2} (\sqrt{17} - 1) .

Then,

V ((x_{1}, x_{2}), 1) = V ((x_{1}, x_{2}), 2) = - \frac{1}{4} (\sqrt{17} - 1) (x_{1}^{2} + x_{2}^{2}) - \frac{1}{2} (\sqrt{17} - 1)

and furthermore, the production rate is

{\bar{p}}_{i} (x_{1}, x_{2}, j) = - \frac{1}{2} (\sqrt{17} - 1) x_{i}, for i \in {1, 2}, j \in {1, 2} .

We also give the approximate of

β_{1}

and

β_{2}

, and

η_{1}

and

η_{2}

, by using the Newton–Raphson method. Denote

\begin{matrix} h_{1} (β_{1}, β_{2}) = - a_{12} β_{2} + (a_{11} + α_{1}) β_{1} + β_{1}^{2} - 1 \\ h_{2} (β_{1}, β_{2}) = - a_{21} β_{1} + (a_{22} + α_{2}) β_{2} + β_{2}^{2} - 1 \end{matrix}

and

J_{(h_{1}, \dots, h_{k})} = (\begin{matrix} 2 β_{1} + 1 & - \frac{1}{2} \\ - \frac{1}{2} & 2 β_{2} + 1 \end{matrix}) .

We construct

\{\begin{matrix} (\begin{matrix} β_{1}^{n + 1} \\ β_{2}^{n + 1} \end{matrix}) = (\begin{matrix} β_{1}^{n} \\ β_{2}^{n} \end{matrix}) - {(\begin{matrix} a_{11} + α_{1} + 2 β_{1}^{n} & - a_{1 k} \\ - a_{k 1} & a_{k k} + α_{k} + 2 β_{k}^{n} \end{matrix})}^{- 1} (\begin{matrix} h_{1} (β_{1}^{n}, β_{2}^{n}) \\ h_{2} (β_{1}^{n}, β_{2}^{n}) \end{matrix}) \\ β_{1}^{0} = β_{2}^{0} = 0.1 . \end{matrix}

Using the standard computation, approximations to four digits are

\begin{matrix} n = 1 ⟹ & β_{1}^{1} = 1.4429 & and & β_{2}^{1} = 1.4429 \\ n = 2 ⟹ & β_{1}^{2} = 0.9102 & and & β_{2}^{2} = 0.9102 \\ n = 3 ⟹ & β_{1}^{3} = 0.7808 & and & β_{2}^{3} = 0.7808 \\ n = 4 ⟹ & β_{1}^{4} = 0.7808 & and & β_{2}^{4} = 0.7808 \end{matrix}

On the other hand,

β_{1} = β_{2} = \frac{1}{4} (\sqrt{17} - 1) ≃ 0.780 7 .

Clearly, the approximations for

η_{1}

and

η_{2}

are

η_{1} = η_{2} ≃ 1 . 561 6 .

Application 2. Suppose there is one machine producing three products, and let

ε (t)

be the machine state that can take values in three regimes 1, 2, and 3, i.e., for every

t \in [0, \infty)

, we have

ε (t) \in {1, 2, 3}

. We consider

ε (t)

a continuous-time Markov chain with generator

(\begin{matrix} - 3 & 3 & 0 \\ 4 & - 7 & 3 \\ 0 & 4 & - 4 \end{matrix}),

and the inventory

y_{i} (t)

, which is governed by (4) with

σ_{1} = σ_{2} = σ_{3} = \frac{1}{\sqrt{3}}

, and let

α_{1} = α_{2} = α_{3} = 1

be the discount factor. Under these assumptions, System (17) becomes

(\begin{matrix} a_{11} + 1 & - a_{11} & 0 \\ - a_{22} & a_{22} + a_{11} + 1 & - a_{11} \\ 0 & - a_{22} & a_{22} + 1 \end{matrix}) (\begin{matrix} β_{1} \\ β_{2} \\ β_{3} \end{matrix}) = (\begin{matrix} 1 - β_{1}^{2} \\ 1 - β_{2}^{2} \\ 1 - β_{3}^{2} \end{matrix}),

or, with our data

\{\begin{matrix} β_{1}^{2} + 4 β_{1} - 3 β_{2} - 1 = 0 \\ β_{2}^{2} + 8 β_{2} - 4 β_{1} - 3 β_{3} - 1 = 0 \\ β_{3}^{2} + 5 β_{3} - 4 β_{2} - 1 = 0 \end{matrix}

which has a unique positive solution

β_{1} = β_{2} = β_{3} = \frac{1}{2} (\sqrt{5} - 1) .

On the other hand, System (19) becomes

(\begin{matrix} β_{1} N σ_{1}^{2} \\ β_{2} N σ_{2}^{2} \\ β_{3} N σ_{3}^{2} \end{matrix}) = (\begin{matrix} a_{11} + 1 & - a_{11} & 0 \\ - a_{22} & a_{22} + a_{11} + 1 & - a_{11} \\ 0 & - a_{22} & a_{22} + 1 \end{matrix}) (\begin{matrix} η_{1} \\ η_{2} \\ η_{3} \end{matrix}),

or, with our data

(\begin{matrix} β_{1} \\ β_{2} \\ β_{3} \end{matrix}) = (\begin{matrix} 3 + 1 & - 3 & 0 \\ - 4 & 4 + 3 + 1 & - 3 \\ 0 & - 4 & 4 + 1 \end{matrix}) (\begin{matrix} η_{1} \\ η_{2} \\ η_{3} \end{matrix}),

from where

(\begin{matrix} η_{1} \\ η_{2} \\ η_{3} \end{matrix}) = (\begin{matrix} \frac{7}{13} & \frac{15}{52} & \frac{9}{52} \\ \frac{5}{13} & \frac{5}{13} & \frac{3}{13} \\ \frac{4}{13} & \frac{4}{13} & \frac{5}{13} \end{matrix}) (\begin{matrix} β_{1} \\ β_{2} \\ β_{3} \end{matrix}),

has a unique positive solution

η_{1} = η_{2} = η_{3} = \frac{1}{2} \sqrt{5} - \frac{1}{2} .

Then,

\begin{matrix} V ((x_{1}, x_{2}, x_{3}), 1) & = & V ((x_{1}, x_{2}, x_{3}), 2) = V ((x_{1}, x_{2}, x_{3}), 3) \\ = & - \frac{1}{2} (\sqrt{5} - 1) (x_{1}^{2} + x_{2}^{2} + x_{3}^{2} + 1) \end{matrix}

and furthermore, the production rate is

{\bar{p}}_{i} (x_{1}, x_{2}, x_{3}, j) = - \frac{1}{2} (\sqrt{5} - 1) x_{i}, for i \in {1, 2, 3}, j \in {1, 2, 3} .

We also point out that the numerical approximations for

β_{1}

,

β_{2}

, and

β_{3}

, using the Newton–Raphson method described, are

\begin{matrix} n = 1 ⟹ & β_{1}^{1} = 0.8418 & β_{2}^{1} = 1.017 & β_{3}^{1} = 1.2789 \\ n = 2 ⟹ & β_{1}^{2} = 0.6575 & β_{2}^{2} = 0.6761 & β_{3}^{2} = 0.7066 \\ n = 3 ⟹ & β_{1}^{3} = 0.6192 & β_{2}^{3} = 0.6196 & β_{3}^{3} = 0.6202 \\ n = 4 ⟹ & β_{1}^{4} = 0.618 & β_{2}^{4} = 0.618 & β_{3}^{4} = 0.618 \end{matrix}

when

β_{1}^{0} = 1

,

β_{2}^{0} = 2

, and

β_{3}^{0} = 3

. Clearly,

\frac{1}{2} (\sqrt{5} - 1) ≃ 0.618

.

8. Final Remarks and Conclusions

When

w_{i}

is correlated with correlation

ρ,

the HJB system (10) becomes

- (\begin{matrix} \frac{σ_{1}^{2}}{2} Δ u_{1} \\ \dots \\ \frac{σ_{k}^{2}}{2} Δ u_{k} \end{matrix}) + G_{a, α} (\begin{matrix} u_{1} \\ \dots \\ u_{k} \end{matrix}) - \frac{ρ}{2} (\begin{matrix} σ_{1}^{2} \sum_{i \neq j} \frac{\partial^{2} u_{1}}{\partial x_{i} \partial x_{j}} \\ \dots \\ σ_{k}^{2} \sum_{i \neq j} \frac{\partial^{2} u_{k}}{\partial x_{i} \partial x_{j}} \end{matrix}) - (\begin{matrix} {|x|}^{2} \\ \dots \\ {|x|}^{2} \end{matrix}) = (\begin{matrix} inf_{p} {p \nabla u_{1} + {|p|}^{2}} \\ \dots \\ inf_{p} {p \nabla u_{k} + {|p|}^{2}} \end{matrix}),

which has the same solution as (10), due to the mixed derivative terms (see [17] for details).

In summary, we have reduced the stochastic production-planning problem with several regime switching in the economy to demonstrate that there is an exact solution for the PDE system that models the stochastic production problem.

Funding

This research received no external funding.

Data Availability Statement

No data were used.

Acknowledgments

The author would like to thank the referees for their valuable discussions.

Conflicts of Interest

The authors declare no conflict of interest.

References

Bensoussan, A.; Sethi, S.P.; Vickson, R.; Derzko, N. Stochastic production planning with production constraints. SIAM J. Control 1984, 22, 920–935. [Google Scholar] [CrossRef]
Cadenillas, A.; Lakner, P.; Pinedo, M. Optimal production management when demand depends on the business cycle. Oper. Res. 2013, 61, 1046–1062. [Google Scholar] [CrossRef]
Dong, J.; Malikopoulos, A.; Djouadi, S.M.; Kuruganti, T. Application of Optimal Production Control theory for Home Energy Management in a Micro Grid. In Proceedings of the 2016 American Control Conference (ACC), Boston, MA, USA, 6–8 July 2016; pp. 5014–5019. [Google Scholar]
Pirvu, T.A.; Zhang, H. Investment-consumption with regime-switching discount rates. Math. Soc. Sci. 2014, 71, 142–150. [Google Scholar] [CrossRef]
Capponi, A.; Figueroa-López, J.E. Dynamic Portfolio Optimization with a Defaultable Security and Regime-Switching. Math. Financ. 2012, 207–249. [Google Scholar] [CrossRef]
Elliott, R.; Hamada, A.S. Option Pricing Using A Regime Switching Stochastic Discount Factor. Int. J. Theor. Appl. Financ. 2014, 17, 1–26. [Google Scholar] [CrossRef]
Gharbi, A.; Kenne, J.P. Optimal production control problem in stochastic multiple-product multiple-machine manufacturing systems. IIE Trans. 2003, 35, 941–952. [Google Scholar] [CrossRef]
Yao, D.D.; Zhang, Q.; Zhou, X.Y. A regime-switching model for european options. In Stochastic Processes, Optimization, and Control Theory: Applications in Financial Engineering, Queueing Networks, and Manufacturing Systems; Yan, H., Yin, G., Zhang, Q., Eds.; International Series in Operations Research & Management Science; Springer: New York, NY, USA, 2006; Chapter 14; Volume 94, pp. 281–300. [Google Scholar]
Wang, C.F.; Chang, H.; Fang, Z.M. Optimal Portfolio and Consumption Rule with a CIR Model under HARA Utility. J. Oper. Res. Soc. China 2018, 6, 107–137. [Google Scholar] [CrossRef]
Covei, D.-P.; Pirvu, T.A. An elliptic partial differential equation and its application. Appl. Math. Lett. 2020, 101, 1–7. [Google Scholar] [CrossRef]
Covei, D.-P. An elliptic partial differential equation modeling the production planning problem. J. Appl. Anal. Comput. 2021, 11, 903–910. [Google Scholar]
Sheng, L.; Zhu, Y.; Wang, K. Uncertain dynamical system-based decision making with application to production-inventory problems. Appl. Math. Model. 2018, 56, 275–288. [Google Scholar] [CrossRef]
Qin, Z.; Bai, M.; Ralescu, D. A fuzzy control system with application to production planning problems. Inf. Sci. 2011, 181, 1018–1027. [Google Scholar] [CrossRef]
Canepa, E.C.; Covei, D.-P.; Pirvu, T.A. Stochastic production planning with regime switching. J. Ind. Manag. 2023, 19, 1697–1713. [Google Scholar]
Covei, D.-P.; Pirvu, T.A. An elliptic partial differential equations system and its applications. Carpathian J. Math. 2021, 37, 427–440. [Google Scholar] [CrossRef]
Ghosh, M.K.; Arapostathis, A.; Marcus, S.I. Optimal Control of Switching Diffusions with Application to Flexible Manufacturing Systems. SIAM J. Control Optim. 1992, 31, 1183–1204. [Google Scholar] [CrossRef]
Covei, D.-P. On a parabolic partial differential equation and system modeling a production planning problem. Electron. Arch. 2022, 30, 1340–1353. [Google Scholar] [CrossRef]
Győri, I.; Hartung, F.; Mohamady, N.A. Existence and uniqueness of positive solutions of a system of nonlinear algebraic equations. Period. Math. Hung. 2017, 75, 114–127. [Google Scholar] [CrossRef]
Győri, I.; Hartung, F.; Mohamady, N.A. Boundedness of positive solutions of a system of nonlinear delay differential equations. Discret. Contin. Dyn. Syst.—Ser. B 2018, 23, 809–836. [Google Scholar] [CrossRef]
Andrei, N. An acceleration of gradient descent algorithm with backtracking for unconstrained optimization. Numer. Algorithms 2006, 42, 63–73. [Google Scholar] [CrossRef]
Stanimirovic, P.S.; Miladinovic, M.B. Accelerated gradient descent methods with line search. Numer. Algorithms 2010, 54, 503–520. [Google Scholar] [CrossRef]
Petrovic, M.J. An Accelerated Double Step Size model in unconstrained optimization. Appl. Math. Comput. 2015, 250, 309–319. [Google Scholar] [CrossRef]
Ekeland, I.; Pirvu, T.A. Investment and consumption without commitment. Math. Financ. Econ. 2008, 2, 57–86. [Google Scholar] [CrossRef]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Covei, D.-P. Exact Solution for the Production Planning Problem with Several Regimes Switching over an Infinite Horizon Time. Mathematics 2023, 11, 4307. https://doi.org/10.3390/math11204307

AMA Style

Covei D-P. Exact Solution for the Production Planning Problem with Several Regimes Switching over an Infinite Horizon Time. Mathematics. 2023; 11(20):4307. https://doi.org/10.3390/math11204307

Chicago/Turabian Style

Covei, Dragos-Patru. 2023. "Exact Solution for the Production Planning Problem with Several Regimes Switching over an Infinite Horizon Time" Mathematics 11, no. 20: 4307. https://doi.org/10.3390/math11204307

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Exact Solution for the Production Planning Problem with Several Regimes Switching over an Infinite Horizon Time

Abstract

1. Introduction and Proposal of the Paper

2. Reduction of the Model to a PDE System

3. Closed-Form Solution for the PDE System

4. Numerical Solution of an Algebraic Nonlinear System in Building the Solution for the PDE System

5. Verification

6. The Equilibrium Production

7. Applications

8. Final Remarks and Conclusions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI