Next Article in Journal
Clique Search in Graphs of Special Class and Job Shop Scheduling
Next Article in Special Issue
Modified Remora Optimization Algorithm for Global Optimization and Multilevel Thresholding Image Segmentation
Previous Article in Journal
Stochastic Game Analysis of Cooperation and Selfishness in a Random Access Mechanism
Previous Article in Special Issue
Response Times Reconstructor Based on Mathematical Expectation Quotient for a High Priority Task over RT-Linux
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optimal Operation for Reduced Energy Consumption of an Air Conditioning System Using Neural Inverse Optimal Control

by
Flavio Muñoz
1,
Ramon Garcia-Hernandez
2,*,
Jose Ruelas
1,
Juan E. Palomares-Ruiz
1 and
Carlos Álvarez-Macías
2
1
Tecnologico Nacional de Mexico/ITS de Cajeme, Carretera Internacional a Nogales km 2, Cd. Obregon 85024, Sonora, Mexico
2
Tecnologico Nacional de Mexico/Instituto Tecnologico de La Laguna, Blvd. Revolucion y Av. Instituto Tecnologico de La Laguna S/N, Col. Centro, Torreon 27000, Coahuila, Mexico
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(5), 695; https://doi.org/10.3390/math10050695
Submission received: 6 January 2022 / Revised: 20 February 2022 / Accepted: 21 February 2022 / Published: 23 February 2022

Abstract

:
For a comfortable thermal environment, the main parameters are indoor air humidity and temperature. These parameters are strongly coupled, causing the need to search for multivariable control alternatives that allow efficient results. Therefore, in order to control both the indoor air humidity and temperature for direct expansion (DX) air conditioning (A/C) systems, different controllers have been designed. In this paper, a discrete-time neural inverse optimal control scheme for trajectories tracking and reduced energy consumption of a DX A/C system is presented. The dynamic model of the plant is approximated by a recurrent high-order neural network (RHONN) identifier. Using this model, a discrete-time neural inverse optimal controller is designed. Unscented Kalman filter (UKF) is used online for the neural network learning. Via simulation the scheme is tested. The proposed approach effectiveness is illustrated with the obtained results and the control proposal performance against disturbances is validated.

1. Introduction

Not just a comfortable level of indoor air temperature is the only objective of air conditioning (A/C) systems [1,2]. Additionally, maintaining an adequate level of indoor air humidity is essential for an A/C system, since the efficient operation of building A/C systems, indoor air quality (IAQ) and building thermal comfort for occupants is directly affected by indoor humidity [3,4].
Recently, the use of direct expansion (DX) air conditioning (A/C) systems has had an exponential increase in different types of buildings small to medium scale [5,6]. More flexibility for installation, reduced operating cost and more energy savings, are advantages of the DX A/C, compared to chilled water based central A/C systems. Single speed compressors and fans, which rely on an on–off cycle to control indoor temperature only, are typical characteristics of conventional A/C systems [7]. The result is an indoor humidity imbalance, causing a thermal comfort unwanted level for the occupants [8]. However, by varying the fan speed and the compressor speed it is possible to control the humidity and air temperature simultaneously, achieving this with the development of variable speed (VS) driver technology [9]. For humidity and temperature control in air VS DX A/C, different control strategies have been designed and used, from the traditional proportional integral derivative (PID) control to advanced and robust controllers [10]. These include direct digital control [11], multi-input multi-output control method [12], neural network based [13,14,15], fuzzy logic controller (FLC) [16], Genetic and Swarm Algorithms [17] and Adaptive Control [18]. The development of those type of controllers requires a strong knowledge of mathematical modeling and advanced control techniques. In [19,20,21], control schemes that could be applied to the A/C system using fractional modeling are presented.
On the other hand, a control scheme based on plant dynamic model is necessary for realistic situations. This is a motivation to develop models based on an Artificial Neural Network (ANN) to model the plant dynamics to be controlled. Specifically, both for nonlinear process control and identification, ANN have been widely used. Feedforward ANN is one of the most popular for dynamical systems modeling, but the difficulties in the training step have limited its application [22]. When there is management of a state space structure, recurrent neural networks (RNN) have a better performance than classical feedback networks. However, when classical gradient optimization algorithms are used in its learning, its evolution is slow and very poorly approaches satisfactory results in longer input sequences, leading to a complicated numerical problem [23].
In contrast, the extended Kalman filter (EKF) algorithm has been introduced to train neural networks [24,25], with improved learning convergence [26]. Unfortunately, the EKF’s main drawback is the derivation of the Jacobian matrices, which can be complex, causing implementation difficulties [23,27]. Therefore, an unscented Kalman Filter (UKF) is proposed to solve the EKF’s problems. UKF is a filtering algorithm which uses Unscented Transformation (UT) [28]. The essential difference between EKF and UKF systems is the representation of Gaussian Random Variables (GRV) for propagation through system dynamics [29].
Therefore, a RHONN is proposed to identify the dynamic model for VS DX A/C system, assuming all the states available to be measured. The algorithm implemented for the RHONN learning uses UKF.
Determining control signals which will allow a process to comply with physical constraints and minimize a cost functional simultaneously is the optimal control theory objective [30]. Unfortunately, the difficult and complex process of solving the Hamilton Jacobi Bellman (HJB) equation is required, an alternative to avoid solving the HJB equation is to use the inverse optimal control [31].
Before establishing that the control optimizes a cost functional, a stabilizing feedback control must be developed, according to the inverse approach. The determination a posteriori of the cost functional for the stabilizing feedback control law is an essential feature of the inverse approach [24,32].
Applications of this complete control scheme are illustrated in: [33], where an optimal inverse neural control for discrete-time impulsive systems is determined. Reference [34] presents a discrete-time inverse optimal control scheme for a doubly-fed induction generator using a neural network. One more example in [35] where a neural controller for an induction motor is synthesized.
All the characteristics and strengths mentioned motivate the realization of this research work, since it allows establishing a multivariable and robust control technique, capable of tracking thermal profiles established by a user and rejecting both external and internal disturbances. This creates conditions for adequate energy consumption and thermal comfort. Furthermore, this optimal operation is reflected in the control signals generated by the proposed scheme.
Section 2 presents the methodology to obtain a VS DX A/C system mathematical model. Then, a brief review of the discrete-time neural identification for nonlinear systems is described, followed by a section where the inverse optimal controller is established. Section 3 shows the results and discussions that illustrate the application of the proposed inverse optimal control and neural identifier employing simulations. The last section presents the conclusions of this work.

2. Methodology

2.1. The Experimental VS DX A/C System and Its Dynamic Modeling

An air distribution sub-system (air side) and a DX refrigeration plant (refrigerant side) are the main components of the experimental VS DX A/C system, as shown simplified schematic diagram in Figure 1. An electronic expansion valve (EEV), a variable-speed rotor compressor, an air-cooled tube-plate-finned condenser and a high-efficiency tube-louver-finned DX evaporator are major elements that make up the DX refrigeration plant.
The energy and mass conservation principles are used mainly to obtain the mathematical model of the VS DX A/C system. Further, for the development of the mathematical model, it should be considered: (a) the thermal losses in air ducts are negligible; (b) two regions; dry-cooling region and wet-cooling region on the airside of the DX evaporator; and (c) no fresh air intake to the system and the perfect air mixing inside all heat exchangers and the thermal space [12,36].
The mathematical model complete analysis is presented in [36], obtaining the following equations.
C p ρ V d T 2 d t = C p ρ ( T 1 T 2 ) f + Q s p f + Q l o a d ,
ρ V d W 2 d t = ρ ( W 1 W 2 ) f + M ,
C p ρ V h 1 d T 3 d t = C p ρ ( T 2 T 3 ) f + α 1 A 1 ( T W T 2 + T 3 2 ) ,
C p ρ V h 2 d T 1 d t + ρ V h 2 h f g d W 1 d t = C p ρ ( T 3 T 1 ) f + ρ h f g ( W 2 W 1 ) f + α 2 A 2 ( T W T 3 + T 1 2 ) ,
( C p ρ V ) W d T W d t = α 1 A 1 ( T 2 + T 3 2 T W ) + α 2 A 2 ( T 3 + T 1 2 T W ) M r e f 1 ( h r 2 h r 1 ) ,
d W 1 d t ( a T 1 + b ) d T 1 d t c = 0 .
All model parameters are defined in the nomenclature, before references section. The dynamic model for the experimental VS DX A/C system is formed by Equations (1)–(6) and its state space representation in a compact format is given by:
x ˙ = g ( x , u )  
where the state variables x = [x1 x2 x3 x4 x5 x6]T = [T1 T2 T3 Tw W1 W2]T, the output variables y = [T2 W2]T, input variables u = [u1 u2]T = [f s]T, and the function g(x,u), define the dynamic system, respectively.
g(x,u) is defined as follows:
g ( x , u ) = [ u 1 ( h f g ( x 6 x 5 ) + ( x 3 x 1 ) C p ) ρ + g 2 ( x ) A 2 α 2 ρ V h 2 ( g 1 ( x ) h f g + C p ) u 1 ( k s p l + C p ( x 1 x 2 ) ρ ) + Q l o a d ρ C p V g 3 ( x ) α 1 A 1 + u 1 C p ( x 2 x 3 ) ρ C p ρ V h 1 α 1 A 1 g 4 ( x ) + α 2 A 2 g 5 ( x ) V c o m v s λ ( h r 2 h r 1 ) u 2 ( ρ C p V ) W ( C p ( x 3 x 1 ) + u 1 h f g ( x 6 x 5 ) ) ρ + α 2 A 2 g 2 ( x ) ρ V h 2 ( C p + h f g g 1 ( x ) ) g 1 ( x ) ρ ( x 5 x 6 ) u 1 + M ρ V ] ,
with
g 1 ( x ) = a x 1 + b c   , g 2 ( x ) = x 4 x 3 + x 1 2 ,
g 3 ( x ) = x 4 x 2 + x 3 2 , g 4 ( x ) = x 2 + x 3 2 x 4 , g 5 ( x ) = x 3 + x 1 2 x 4
where
  a = 0.0396 , b = 0.085 , c = 1000 .
The relationship between input variables and state variables is nonlinear, therefore Equation (7) is nonlinear.
In [36], the detailed validation of the above mathematical model for the VS DX A/C experimental system is reported.

2.2. Discrete-Time High-Order Neural Network

For pattern recognition and static systems modeling, multilayer neural networks are commonly used, since input and output mapping is learned through neural network (NN) training. A neural network, even with a single hidden layer, has the ability to uniformly approximate any continuous function over a compact domain, considering that there are enough synaptic connections in the NN, this fact has been demonstrated in different theoretical works.
Recurrent High-Order Neural Networks (RHONN) are considered extensions of the first order Hopfield model. They are used in control tasks due to their high number of interactions between neurons, as proposed in [37,38]. In addition, the RHONN model is flexible and allows incorporating a priori information about the structure of the system in the neural model.

2.2.1. Nonlinear System Neural Identification

The MIMO nonlinear in Equation (7) is identified using a RHONN defined as
χ k + 1 = w i T φ i ( x k , u k ) ,
where χ is state vector of neurons that identifies the i-th component of state vector x in (7), xk = [x1,k x2,kxn,k]T, wi is the respective online adapted weight vector, I = 1,…n; and uk = [u1,k u2,kum,k]T is the input vector to the neural network; φ is an Lp dimensional vector defined as
φ i ( x k , u k ) = [ φ i 1 φ i 2 φ i L p ] = [ j I 1 Z i j d i j ( 1 ) j I 2 Z i j d i j ( 2 ) j I L p Z i j d i j ( L p ) ] ,
where Lp is the number p of high order connections, dij are nonnegative integers, and {I1, I2, … ILp} is a collection of unordered subsets of {1, 2, …, n + m}. Here Zi is a vector defined as
Z i = [ Z i 1 Z i n Z i n + 1 Z i n + m ] = [ S ( x 1 , k ) S ( x n , k ) u 1 , k u m , k ] .
From Equation (11), the input vector to the neural network is uk = [u1,k, u2,k, …, um,k]. S(•) is defined by
S ( ς ) = e ς e ς e ς + e ς ,
where ς is any real value.
The unscented Kalman filter (UKF) algorithm [28] is used to train the RHONN identifier.

2.2.2. The UKF Training Algorithm

For recurrent neural networks (RNN), the best well-known training approach is the backpropagation through time learning [39]. Unfortunately, a first order gradient descent method such as backpropagation can have a very slow learning speed [26]. However, the Kalman filter (KF) as an algorithm for state / parameter estimation has become popular in the last four decades. This filter is especially useful for real-time applications, due to its easy implementation and computationally efficient calculation [23,27]. Nevertheless, for state estimation, especially of nonlinear systems, the original KF is often not good enough, according to the research community. The extended Kalman filter (EKF) is an extension of the KF to deal the non-linear system through a linearization procedure [27].
The unscented Kalman filter (UKF) is the nonlinear generalization of KF, which has several successful applications, such as recurrent neural network training [40]. Both the UKF and the EKF are conceptually similar, having the same basic principle, but the implementation is significantly different; only function evaluations instead of Taylor approximation, and no derivatives are needed (i.e., Jacobian or Hessian calculation).
A relatively new method for calculating the random variable statistics, which undergone a nonlinear transformation is called the unscented transformation (UT) and is used by the UKF [23]. The UT is UKF central technique used to handle the nonlinearity in a nonlinear transformation y = f(x), where f is an L × 1 vector-valued function, x and y are L × 1 vectors. Here, x is a random variable assumed to be normally distributed (Gaussian) with covariance Px, and mean x ¯ . An approach that replaces analytical linearization is provided by the UT which offers a statistical alternative, where the EKF Jacobian matrices are used. In [41] a detailed analytical comparison about linearization techniques is presented. The sigma-points used by UT are small set of points selected based on the a priori conditions, i.e., from the assumed prior distribution, the points are selected. Based on the selected scaling parameters for the UT, the confidence level from prior distribution or the sigma-points spread is determined. For the scaling parameters, there are different representations and notations. These representations are equivalent and affect both the sigma-points spread and the weight vectors used to reconstruct the a posteriori statistics (after the transformation).
Using three scaling parameters, the UT scaling can be represented fully [28,42]. α is the primary scaling parameter and determines the sigma-points spread. The α parameter has a variation range from 10-4 to 1. A wider sigma-points spread is caused by larger α, while a tighter (closer) sigma-points spread is the result of smaller α. The β parameter is used to include information about the prior distribution and is the secondary scaling parameter (β = 2 is optimal, for Gaussian distributions). The κ parameter is usually set to 0 and is the third scaling parameter [28]. In addition to the three parameters mentioned, a scaling parameter, λ, and two weight vectors, ηc (covariance) and ηm (mean) are defined
λ = α 2 ( L + κ ) L   , η 0 m = λ L + λ   , η 0 c = λ L + λ + 1 α 2 + β   , η i m = η i c = 1 2 ( L + λ ) , i = 1 , , 2 L   ,
where L is the state vector length. Then to generate 2L + 1 sigma-points, the prior mean x ¯ , the covariance Px and the parameter λ of the random variable x are used, as in
χ = [ x ¯ x ¯ + L + λ P x x ¯ L + λ P x ] .
Here χ is an L × (2L +1) matrix of sigma-points, where each sigma-point is represented by a column of this matrix, note that in Equation (14), the sum of a vector to each of the matrix columns defines the sum of a vector and matrix. Alternatively, an L × L matrix can be used for standard matrix addition, as long as the L × 1 column vector x ¯ can be multiplied with a 1 × L row vector of ones. It is also possible to notice that (14) contains the square root of a matrix. While there are different methods of calculating a matrix square root, the Cholesky ones is the recommended both in terms of computational efficiency and performance [43]. Here, the Cholesky decomposition is used to calculate a lower triangular matrix, and can then be used as a matrix square root representation, i.e.,
P x = ( P x ) ( P x ) T ,
where P x is a lower triangular matrix. Note that a principal matrix square root is different from this representation, which takes the form of Equation (15) that generally is non-triangular and without the transpose. Other method, called the “square-root UKF (SR-UKF)”, of handling the matrix square root was proposed [44], which can obtain different performance results, but has improved computational complexity.
Each point is passed through the nonlinear function, once the sigma-points have been generated, i.e., each column of the sigma-point matrix, χ, is propagated through the nonlinearity, as in
ψ ( i ) = f ( χ ( i ) ) , i = 0 , 1 , , 2 L ,
where the superscript (i) corresponds to the ith column of the matrix, whereas ψ is a matrix of transformed sigma-points. Then, the mean and covariance, using weighted averages, are estimated of these transformed sigma-points using the weight vectors that were defined in Equation (13), as in
y ¯ i = 0 2 L η i m ψ ( i )   , P y i = 0 2 L η i c ( ψ ( i ) y ¯ ) ( ψ ( i ) y ¯ ) T   ,
where Py is the estimated covariance matrix of y, and y ¯ their estimated mean. These values correspond to the estimated statistical properties after the nonlinear transformation, or a posteriori statistics.

2.3. Inverse Optimal Control Introduction

Consider the following an affine discrete nonlinear system
θ k + 1 = f ( θ k ) + g ( θ k ) u k θ 0 = θ ( 0 ) ,
where θ k n is the state of system at time k , u m , f : n n , g : n n × m , are smooth and bounded mapping. f(0) = 0 is assumed, and denotes the nonnegative integers set. The trajectory tracking for the system is associated to the following meaningful cost functional in Equation (18)
( ϑ k ) = n = k ( l ( ϑ n ) + u n T R u n ) ,
where ϑ k = θ k θ δ , k with θ δ , k as the desired trajectory for θk; ϑ k n ;   ( ϑ k ) : n + ;   l ( ϑ k ) : n + is a positive semidefinite function and R : n m × m is a real symmetric positive definite weighting matrix. The cost functional of Equation (19) is a performance measure [30]. To vary the weighting on control efforts according to the state value, the R entries can be functions of the system state, although they can also be fixed [30]. We assume that the full state θk is available, considering the state feedback control design problem.
Utilizing the optimal value function * ( θ k ) for (18) as Lyapunov function V(θk), Equation (19) can be rewritten as
V ( ϑ k ) = l ( ϑ k ) + u k T R u k + n = k ( l ( ϑ n ) + u n T R u n ) = l ( ϑ k ) + u k T R u k + V ( ϑ k + 1 ) ,
that requires the boundary condition V(0) = 0, to V ( ϑ k ) becomes a Lyapunov function. From Bellman optimality principle [45,46], it is known that, for the infinite horizon optimization case, the value function V ( ϑ k ) satisfies the discrete-time Bellman equation and becomes time-invariant [45,47,48]
V * ( ϑ k ) = min u k { l ( ϑ k ) + u k T R u k + V * ( ϑ k + 1 ) } ,
where V * ( ϑ k + 1 ) depends on both ϑ k and uk through of ϑ k + 1 in (18). Note that the Bellman equation is solved backward in time [47]. It is defined the discrete-time Hamiltonian H( ϑ k , uk ), in order to establish the conditions that the optimal control law must satisfy, as
H ( ϑ k , u k ) = l ( ϑ k ) + u k T R u k + V ( ϑ k + 1 ) V ( ϑ k ) .  
The optimal control law should satisfy H ( ϑ k , u k ) u k = 0 , that it is a necessary condition, then
0 = 2 R u k + V ( ϑ k + 1 ) u k = 2 R u k + ϑ k + 1 u k V ( ϑ k + 1 ) ϑ k + 1 = 2 R ( ϑ k ) u k + g T ( θ k ) V ( ϑ k + 1 ) ϑ k + 1 .  
Therefore, to achieve trajectory tracking, the optimal control law is formulated as
u k * = 1 2 R 1 g T ( θ k ) V ( ϑ k + 1 ) ϑ k + 1 ,
with the boundary condition V(0) = 0. It is necessary to solve the following HJB equation, for solving the trajectory tracking inverse optimal control problem
0 = l ( ϑ k ) + V ( ϑ k + 1 ) V ( ϑ k ) + 1 4 V T ( ϑ k + 1 ) ϑ k + 1 g ( θ k ) R 1 g T ( θ k ) V ( ϑ k + 1 ) ϑ k + 1 ,
which is a challenging task. It is proposed to solve the inverse optimal control problem.
Definition 1. 
Consider the tracking error [30] as  ϑ k = θ k θ δ , k , θδ,k being the desired trajectory for θk. Let’s define the control law
u k * = 1 2 R 1 g T ( θ k ) V ( ϑ k + 1 ) ϑ k + 1 .  
It will be inverse optimal (globally) stabilizing along the desired trajectory θδ,k if:
i.
It achieves (global) asymptotic stability of θk = 0 for system (18) along reference θδ,k;
ii.
V ( ϑ k ) is (radially unbounded) positive definite function such that inequality
V ¯ : = V ( ϑ k + 1 ) V ( ϑ k ) + u k * T R ( ϑ k ) u k * 0 ,
is satisfied.
If chosen l ( ϑ k ) : = V ¯ , then V ( ϑ k ) is a solution for Equation (25), and cost functional Equation (19) is minimized.
Definition 1 establishes that, the knowledge of V ( ϑ k ) is basis to formulate the inverse optimal control law for trajectory tracking. Then, a control Lyapunov function (CLF) V ( ϑ k ) can be proposed, such that (i) and (ii) are guaranteed. Hence, instead of solving Equation (25) for V ( ϑ k ) a quadratic candidate CLF V ( ϑ k ) for Equation (24) is proposed with the form
V ( ϑ k ) = 1 2 ϑ k T P ϑ k , P = P T > 0 ,
to ensure stability of the tracking error ϑ k , where
ϑ k = θ k θ δ . k = [ θ 1 , k θ 1 δ . k θ n , k θ n δ . k ] .  
It is referred to as the inverse optimal control law, to the control law in Equation (24) with Equation (27) and it optimizes the meaningful cost functional of Equation (19). Consequently, by considering V ( ϑ k ) as in Equation (27), control law in Equation (24) takes the following form
u k * = 1 4 R 1 g T ( θ k ) ϑ k + 1 T P ϑ k + 1 ϑ k + 1 = 1 2 R 1 g T ( θ k ) P ϑ k + 1 = 1 2 ( R + 1 2 g T ( θ k ) P g ( θ k ) ) 1 g T ( θ k ) P ( f ( θ k ) θ δ , k + 1 ) .
The existence of the inverse in Equation (29) is ensured, since P and R are symmetric matrices and positive definite.

3. Results and Discussion

3.1. Identification and Control Scheme Application

The complete identification and control scheme is shown in Figure 2. This scheme main components are the VS DX A/C system model, the inverse optimal controller, and the neural identifier.
Via simulation, the identification and control scheme are applied to the dynamic model for the VS DX A / C system. The mentioned model is formed by Equations (1)–(6).
Applying the neural identifier developed in Section 2.2.1, it is possible to obtain for the VS DX A/C system a discrete-time neural reduced model which uses UKF for training, as follows
χ 1 , k + 1 = w 11 , k S ( x 2 , k ) + w 12 k x 6 , k + w 1 u 2 , k   , χ 2 , k + 1 = w 21 , k S ( x 2 , k ) + w 22 k S ( x 6 , k ) + w 2 u 1 , k   ,
where χ1 and χ2 identify the air temperature in the conditioned space x2 and moisture content of air-conditioned space x6, respectively. The NN training is performed online and all its states are initialized in random way. The parameter selection of the RHONN identifier is heuristic as
L = 2 α = 1 × 10 3 β = 0.5 κ = 1 R 1 = 0.1 R 2 = 0.1 Q 1 = 0.1 I Q 2 = 0.1 I w 1 = 0.1 w 2 = 0.1
where I is the 2 × 2 identity matrix. The neural network structure in Equation (30) is chosen as in [25] in order to minimize the state estimation error.
Using the control laws described in Section 2.3, the desired reference trajectories x1δ,k and x2δ,k are tracked by the states χ1,k and χ2,k, respectively. Applying Equation (29), it is possible to do this as follows
u k = 1 2 ( R + 1 2 g T ( χ k ) P g ( χ k ) ) 1 × g T ( χ k ) P ( f ( χ k ) x δ , k + 1 ) ,
where the P and R selection is completed heuristically as
P = [ 1275 750 750 1275   ] R = 2 × 10 2 I .  

3.2. Controllability Simulation Test

A test to examine the control scheme performance is conducted. The reference trajectories are stepped waves for temperature and moisture, using the experimental VS DX A/C system model, proposed neural identifier and developed controller. The result of the trajectory tracking performance for the moisture content of air-conditioned space and the air temperature in the conditioned space, in presence of a disturbance simulated by a uniform random signal, is presented in Figure 3. The disturbance is a random variation of a model parameter, thus simulating an internal variation in the system. By applying the inverse optimal control, the obtained control signals are portrayed in Figure 4. Additionally, Figure 4 shows fan and compressor efficient operation, which contributes to reduced energy consumption.
Trajectory tracking with disturbances test has operation conditions close to reality. As seen in Figure 3, humidity and indoor air temperature settings were initially set at 10.5 g kg−1 for W2 and 24 °C for T2. First, at 760 s, humidity and indoor air temperature settings were changed to 11 g kg−1 for W2 and 23 °C for T2. Next, at 1660 s, humidity and indoor air temperature settings were changed to 10.25 g kg−1 for W2 and 22 °C for T2. At 2300 s, humidity and indoor air temperature settings were changed to 10 g kg−1 for W2 and 21.5 °C for T2. At 3110 s, humidity and indoor air temperature settings were changed to 9.75 g kg−1 for W2 and 19 °C for T2. Finally, at 4000 s, humidity and indoor air temperature settings were changed to 10 g kg−1 for W2 and 20 °C for T2.
Finally, a variation of the sensible heat load is added as an external disturbance in a new test, as shown in Figure 5.
Figure 6 shows the trajectory tracking considering internal and external disturbances. The control signals are displayed in Figure 7.
Again, Figure 7 shows an efficient operation due to the performance of the control signals, achieving reduced energy consumption.
There is a continuous search for alternatives that reduce the energy consumption of air conditioning systems since they are equipment commonly used from home, to schools, to offices, to department stores and to large companies. In most countries, the existence of air conditioning systems that do not have a variable speed compressor is extensive, increasing electrical energy consumption. However, the updating of this equipment is taking place gradually. Hence the importance of developing this type of work. The results obtained are comparable with [12], where a multivariable controller was designed, getting a small range of operation for the A/C system. Reference [14] proposes an ANN/fuzzy logic controller to control both temperature and humidity of an A/C system.

4. Conclusions

In this paper, the A/C system model utilized was taken from a VS DX A/C system, commonly used in homes, offices, classrooms and even small to medium-sized businesses. The conditions under which the model was obtained have been mentioned in the methodology. However, when using NN any condition not considered in the analysis is absorbed by the neural model. On the other hand, the paper presents trajectory tracking using neural inverse optimal control for nonlinear systems and is inverse optimal in the sense that, a posteriori, minimizes a meaningful cost functional achieving reduced energy consumption. Using an unscented Kalman filter, the neural network training is performed online. The proposed identification and control scheme by means of the simulation results shows its effectiveness and robustness. Therefore, this research shows a novel scheme exposed to a wider temperature and humidity range and a more significant number of their variations in the desired references, presenting a more efficient performance. In addition, the control signals that result from the proposed control scheme show reduced energy consumption. This work may have future research that will further reduce this energy consumption by implementing other novel control algorithms, which establish new solution alternatives.

Author Contributions

Conceptualization, F.M. and R.G.-H.; methodology, F.M.; software, F.M.; validation, F.M. and R.G.-H.; formal analysis, F.M.; investigation, F.M. and R.G.-H.; resources, F.M. and J.R.; data curation, F.M. and J.E.P.-R.; writing—original draft preparation, F.M. and R.G.-H.; writing—review and editing, F.M., R.G.-H., J.R. and J.E.P.-R.; visualization, F.M., J.R., J.E.P.-R. and C.Á.-M.; supervision, F.M., R.G.-H., J.R., J.E.P.-R. and C.Á.-M.; project administration, F.M., R.G.-H., J.R., J.E.P.-R. and C.Á.-M.; funding acquisition, F.M. and R.G.-H. All authors have read and agreed to the published version of the manuscript.

Funding

This research work was funded by Tecnologico Nacional de Mexico (TecNM) grants.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

This work was completed with the support of Tecnologico Nacional de Mexico (TecNM) projects.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

a,b,cAdaptation constants (dimensionless)
A1Heat transfer area of the DX evaporator in dry-cooling region (m2)
A2Heat transfer area of the DX evaporator in wet-cooling region (m2)
CpSpecific heat of air (kJ kg−1 K−1)
fAir volumetric flow rate (m3 s−1)
hfgLatent heat of vaporization of water (kJ/kg)
MMoisture load in the conditioned space (kg s−1)
MrefMass flow rate of refrigerant (kg s−1)
QloadSensible heat load in the conditioned space (kW)
QspfHeat gain of supply fan (kW)
sCompressor speed (rpm)
T1Temperature of air leaving the DX evaporator (°C)
T2Air temperature in the conditioned space (°C)
T3Air temperature leaving the dry-cooling region of the DX evaporator (°C)
TwTemperature of the DX evaporator wall (°C)
VVolume of the conditioned space (m3)
Vh1Air side volume of the DX evaporator in dry-coolingregion on air side (m3)
Vh2Air side volume of the DX evaporator in wet-coolingregion on air side (m3)
W1Moisture content of air leaving the DX evaporator (kg kg−1 dry air)
W2Moisture content of air-conditioned space (kg kg−1 dry air)
α1Heat transfer coefficient between air and the DXevaporator wall in dry-cooling region (kWm−2 °C−1)
α2Heat transfer coefficient between air and the DXevaporator wall in wet-cooling region (kWm−2 °C−1)
ρDensity of moist air (kg m−3)

Abbreviations

The following abbreviations are used in this manuscript:
A/C Air Conditioning
ANNArtificial Neural Network
CLFControl Lyapunov Function
DXDirect eXpansion
EEVElectronic Expansion Valve
EKFExtended Kalman Filter
GRVGaussian Random Variables
HJBHamilton Jacobi Bellman
IAQIndoor Air Quality
MIMOMulti-Input Multi-Output
PIDProportional Integral Derivative
RHRelative Humidity
RHONNRecurrent High-Order Neural Network
UKFUnscented Kalman Filter
UTUnscented Transformation
VSDVariable-Speed Drive

References

  1. Chen, W.; Chan, M.; Deng, S.; Yan, H.; Weng, W. A direct expansion based enhanced dehumidification air conditioning system for improved year-round indoor humidity control in hot and humid climates. Build. Environ. 2018, 139, 95–109. [Google Scholar] [CrossRef]
  2. Chen, W.; Chan, M.; Weng, W.; Yan, H.; Deng, S. An experimental study on the operational characteristics of a direct expansion based enhanced dehumidification air conditioning system. Appl. Energy 2018, 225, 922–933. [Google Scholar] [CrossRef]
  3. Zhang, H.; Yoshino, H. Analysis of indoor humidity environment in Chinese residential buildings. Build. Environ. 2010, 45, 2132–2140. [Google Scholar] [CrossRef]
  4. Chen, W.; Chan, M.; Weng, W.; Yan, H.; Deng, S. Development of a steady-state physical-based mathematical model for a direct expansion based enhanced dehumidification air conditioning system. Int. J. Refrig. 2018, 91, 55–68. [Google Scholar] [CrossRef]
  5. Bordrick, J.; Gilbride, T.L. Focusing on Buyer’s Needs: DOE’s Engineering Technology Program. Energy Eng. 2002, 99, 18–37. [Google Scholar] [CrossRef]
  6. Zhang, G.Q. China HVACR annual volume II business. Chin. Constr. Ind. Press Beijing 2002, 2, 44–45. [Google Scholar]
  7. Kang, C.-S.; Hyun, C.-H.; Park, M. Fuzzy logic-based advanced on–off control for thermal comfort in residential buildings. Appl. Energy 2015, 155, 270–283. [Google Scholar] [CrossRef]
  8. Toftum, J.; Fanger, P.O. Air humidity requirements for human comfort. ASHRAE Trans. 1999, 105, 641–647. [Google Scholar]
  9. Krakow, K.I.; Lin, S.; Zeng, Z.-S. Temperature and humidity control during cooling and dehumidifying by compressor and evaporator fan speed variation. ASHRAE Trans. 1995, 101, 292–304. [Google Scholar]
  10. Diaz-Mendez, S.E.; Patiño-Carachure, C.; Herrera-Castillo, J.A. Reducing the energy consumption of an earth–air heat exchanger with a PID control system. Energy Convers. Manag. 2014, 77, 1–6. [Google Scholar] [CrossRef]
  11. Li, Z.; Deng, S. A DDC-based capacity controller of a direct expansion (DX) air conditioning (A/C) unit for simultaneous indoor air temperature and humidity control–Part I: Control algorithms and preliminary controllability tests. Int. J. Refrig. 2007, 30, 113–123. [Google Scholar] [CrossRef]
  12. Qi, Q.; Deng, S. Multivariable control of indoor air temperature and humidity in a direct expansion (DX) air conditioning (A/C) system. Build. Environ. 2009, 44, 1659–1667. [Google Scholar] [CrossRef]
  13. Li, N.; Xia, L.; Shiming, D.; Xu, X.; Chan, M.Y. Dynamic modeling and control of a direct expansion air conditioning system using artificial neural network. Appl. Energy 2012, 91, 290–300. [Google Scholar] [CrossRef]
  14. Li, Z.; Xu, X.; Deng, S.; Pan, D. A novel neural network aided fuzzy logic controller for a variable speed (VS) direct expansion (DX) air conditioning (A/C) system. Appl. Therm. Eng. 2015, 78, 9–23. [Google Scholar] [CrossRef]
  15. Xia, Y.; Yan, H.; Deng, S.; Chan, M.-Y. A new capacity controller for a direct expansion air conditioning system for operational safety and efficiency. Build. Serv. Eng. Res. Technol. 2018, 39, 21–37. [Google Scholar] [CrossRef] [Green Version]
  16. Diaz, S.E.; Sierra, J.M.T.; Herrera, J.A. The use of earth–air heat exchanger and fuzzy logic control can reduce energy consumption and environmental concerns even more. Energy Build. 2013, 65, 458–463. [Google Scholar] [CrossRef]
  17. Garces-Jimenez, A.; Gomez-Pulido, J.-M.; Gallego-Salvador, N.; Garcia-Tejedor, A.-J. Genetic and Swarm Algorithms for Optimizing the Control of Building HVAC Systems Using Real Data: A Comparative Study. Mathematics 2021, 9, 2181. [Google Scholar] [CrossRef]
  18. Adegbenro, A.; Short, M.; Angione, C. An Integrated Approach to Adaptive Control and Supervisory Optimisation of HVAC Control Systems for Demand Response Applications. Energies 2021, 14, 2078. [Google Scholar] [CrossRef]
  19. Jajarmi, A.; Pariz, N.; Effati, S.; Kamyad, A.V. Infinite horizon optimal control for nonlinear interconnected large-scale dynamical systems with an application to optimal attitude control. Asian J. Control 2012, 14, 1239–1250. [Google Scholar] [CrossRef]
  20. Baleanu, D.; Zibaei, S.; Namjoo, M.; Jajarmi, A. A nonstandard finite difference scheme for the modeling and nonidentical synchronization of a novel fractional chaotic system. Adv. Differ. Equ. 2021, 2021, 308. [Google Scholar] [CrossRef]
  21. Jajarmi, A.; Baleanu, D.; Zarghami Vahid, K.; Mobayen, S. A general fractional formulation and tracking control for immunogenic tumor dynamics. Math. Methods Appl. Sci. 2022, 45, 667–680. [Google Scholar] [CrossRef]
  22. Willis, M.J.; Montague, G.A.; Di Massimo, C.; Tham, M.T.; Morris, A.J. Artificial neural networks in process estimation and control. Automatica 1992, 28, 1181–1187. [Google Scholar] [CrossRef]
  23. Trebatický, P. Recurrent neural network training with the extended kalman filter. In Proceedings of the Student Research Conf. in Informatics and Information Technologies, Bratislava, Slovakia, 27 April 2005; pp. 57–64. [Google Scholar]
  24. Feldkamp, L.A.; Prokhorov, D.V.; Feldkamp, T.M. Simple and conditioned adaptive behavior from Kalman filter trained recurrent networks. Neural Netw. 2003, 16, 683–689. [Google Scholar] [CrossRef]
  25. Alanis, A.Y.; Sanchez, E.N.; Loukianov, A.G.; Perez-Cisneros, M.A. Real-time discrete neural block control using sliding modes for electric induction motors. IEEE Trans. Control Syst. Technol. 2010, 18, 11–21. [Google Scholar] [CrossRef]
  26. Leung, C.-S.; Chan, L.-W. Dual extended Kalman filtering in recurrent neural networks. Neural Netw. 2003, 16, 223–239. [Google Scholar] [CrossRef]
  27. Haykin, S. Kalman filters. In Kalman Filtering and Neural Networks; Haykin, S., Ed.; John Wiley & Sons, Inc.: New York, NY, USA, 2001; pp. 1–21. ISBN 978-0-471-36998-1. [Google Scholar]
  28. Julier, S.J.; Uhlmann, J.K. New extension of the Kalman filter to nonlinear systems. In Proceedings of the Signal Processing, Sensor Fusion, and Target Recognition VI, Orlando, FL, USA, 28 July 1997; pp. 182–193. [Google Scholar]
  29. Trebatický, P.; Pospíchal, J. Neural Network Training with Extended Kalman Filter Using Graphics Processing Unit. In Artificial Neural Networks-ICANN 2008; Springer: Berlin/Heidelberg, Germany, 2008; pp. 198–207. [Google Scholar]
  30. Kirk, D.E. Optimal Control Theory: An Introduction, 2nd ed.; Dover Publication, Inc.: Mineola, NY, USA, 2004. [Google Scholar]
  31. Krstic, M.; Kanellakopoulos, I.; Kokotovic, P.V. Nonlinear and Adaptive Control Design; John Wiley and Sons: New York, NY, USA, 1995. [Google Scholar]
  32. Do, K.D.; Jiang, Z.P.; Pan, J. Simultaneous Tracking and Stabilization of Mobile Robots: An Adaptive Approach. IEEE Trans. Automat. Contr. 2004, 49, 1147–1152. [Google Scholar] [CrossRef]
  33. Hernandez-Mejia, G.; Alanis, A.Y.; Hernandez-Vargas, E.A. Neural inverse optimal control for discrete-time impulsive systems. Neurocomputing 2018, 314, 101–108. [Google Scholar] [CrossRef]
  34. Ruiz-Cruz, R.; Sanchez, E.N.; Loukianov, A.G.; Ruz-Hernandez, J.A. Real-Time Neural Inverse Optimal Control for a Wind Generator. IEEE Trans. Sustain. Energy 2019, 10, 1172–1183. [Google Scholar] [CrossRef]
  35. Quintero-Manríquez, E.; Sanchez, E.N.; Antonio-Toledo, M.E.; Muñoz, F. Neural control of an induction motor with regenerative braking as electric vehicle architecture. Eng. Appl. Artif. Intell. 2021, 104, 104275. [Google Scholar] [CrossRef]
  36. Qi, Q.; Deng, S. Multivariable control-oriented modeling of a direct expansion (DX) air conditioning (A/C) system. Int. J. Refrig. 2008, 31, 841–849. [Google Scholar] [CrossRef]
  37. Narendra, K.S.; Parthasarathy, K. Identification and control of dynamical systems using neural networks. IEEE Trans. Neural Netw. 1990, 1, 4–27. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Rovithakis, G.A.; Christodoulou, M.A. Adaptive Control with Recurrent High-Order Neural Networks; Advances in Industrial Control; Springer: Berlin, Germany, 2000; ISBN 978-1-4471-1201-3. [Google Scholar]
  39. Haddad, W.M.; Chellaboina, V.-S.; Fausz, J.L.; Abdallah, C. Optimal discrete-time control for non-linear cascade systems. J. Frankl. Inst. 1998, 335, 827–839. [Google Scholar] [CrossRef] [Green Version]
  40. Prokhorov, D.V. Kalman Filter Training of Neural Networks: Methodology and Applications. In Proceedings of the International Joint Conference on Neural Networks, IJCNN2004 Tutorials, Budapest, Hungary, 25–29 July 2004. [Google Scholar]
  41. Rhudy, M.; Gu, Y.; Napolitano, M.R. An Analytical Approach for Comparing Linearization Methods in EKF and UKF. Int. J. Adv. Robot. Syst. 2013, 10, 208. [Google Scholar] [CrossRef] [Green Version]
  42. Wan, E.A.; van der Merwe, R. The Unscented Kalman Filter. In Kalman Filtering and Neural Networks; Haykin, S., Ed.; John Wiley & Sons, Inc.: New York, NY, USA, 2001; pp. 221–280. ISBN 978-0-471-36998-1. [Google Scholar]
  43. Rhudy, M.; Gu, Y.; Gross, J.; Napolitano, M.R. Evaluation of Matrix Square Root Operations for UKF within a UAV GPS/INS Sensor Fusion Application. Int. J. Navig. Obs. 2011, 2011, 1–11. [Google Scholar] [CrossRef] [Green Version]
  44. Van der Merwe, R.; Wan, E.A. The square-root unscented Kalman filter for state and parameter-estimation. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Salt Lake City, UT, USA, 7–11 May 2001; Volume 6, pp. 3461–3464. [Google Scholar]
  45. Başar, T.; Olsder, G.J. Dynamic Noncooperative Game Theory, 2nd ed.; Academic Press: New York, NY, USA, 1995; ISBN 978-0-89871-429-6. [Google Scholar]
  46. Lewis, F.L.; Syrmos, V.L. Optimal Control, 2nd ed.; John Wiley & Sons: New York, NY, USA, 1995. [Google Scholar]
  47. Al-Tamimi, A.; Lewis, F.L.; Abu-Khalaf, M. Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof. IEEE Trans. Syst. Man Cybern. Part B 2008, 38, 943–949. [Google Scholar] [CrossRef] [PubMed]
  48. Ohsawa, T.; Bloch, A.M.; Leok, M. Discrete Hamilton-Jacobi theory and discrete optimal control. In Proceedings of the 49th IEEE Conference on Decision and Control (CDC), Atlanta, GA, USA, 15–17 December 2010; pp. 5438–5443. [Google Scholar]
Figure 1. Schematic diagram of the experimental VS DX A/C system.
Figure 1. Schematic diagram of the experimental VS DX A/C system.
Mathematics 10 00695 g001
Figure 2. Proposed identification and control scheme.
Figure 2. Proposed identification and control scheme.
Mathematics 10 00695 g002
Figure 3. Tracking performance of VS DX A/C system with internal disturbances.
Figure 3. Tracking performance of VS DX A/C system with internal disturbances.
Mathematics 10 00695 g003
Figure 4. Control signals for tracking performance of VS DX A/C system with internal disturbances.
Figure 4. Control signals for tracking performance of VS DX A/C system with internal disturbances.
Mathematics 10 00695 g004
Figure 5. Variation of sensible heat load in the conditioned space.
Figure 5. Variation of sensible heat load in the conditioned space.
Mathematics 10 00695 g005
Figure 6. Tracking performance of VS DX A/C system with internal and external disturbances.
Figure 6. Tracking performance of VS DX A/C system with internal and external disturbances.
Mathematics 10 00695 g006
Figure 7. Control signals for tracking performance of VS DX A/C system with internal and external disturbances.
Figure 7. Control signals for tracking performance of VS DX A/C system with internal and external disturbances.
Mathematics 10 00695 g007
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Muñoz, F.; Garcia-Hernandez, R.; Ruelas, J.; Palomares-Ruiz, J.E.; Álvarez-Macías, C. Optimal Operation for Reduced Energy Consumption of an Air Conditioning System Using Neural Inverse Optimal Control. Mathematics 2022, 10, 695. https://doi.org/10.3390/math10050695

AMA Style

Muñoz F, Garcia-Hernandez R, Ruelas J, Palomares-Ruiz JE, Álvarez-Macías C. Optimal Operation for Reduced Energy Consumption of an Air Conditioning System Using Neural Inverse Optimal Control. Mathematics. 2022; 10(5):695. https://doi.org/10.3390/math10050695

Chicago/Turabian Style

Muñoz, Flavio, Ramon Garcia-Hernandez, Jose Ruelas, Juan E. Palomares-Ruiz, and Carlos Álvarez-Macías. 2022. "Optimal Operation for Reduced Energy Consumption of an Air Conditioning System Using Neural Inverse Optimal Control" Mathematics 10, no. 5: 695. https://doi.org/10.3390/math10050695

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop