Next Article in Journal
COVID-19 Detection Systems Using Deep-Learning Algorithms Based on Speech and Image Data
Next Article in Special Issue
Optimal Task Abort and Maintenance Policies Considering Time Redundancy
Previous Article in Journal
TAID-LCA: Segmentation Algorithm Based on Ternary Trees
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Data-Driven Maintenance Priority and Resilience Evaluation of Performance Loss in a Main Coolant System

1
School of Management Engineering, Zhengzhou University, Zhengzhou 450001, China
2
School of Electrical Engineering, Zhengzhou University, Zhengzhou 450001, China
3
Department of Electrical & Computer Engineering, University of Massachusetts, Dartmouth, MA 02747, USA
4
Department of Management Science, University of Strathclyde, Glasgow G1 1XQ, UK
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(4), 563; https://doi.org/10.3390/math10040563
Submission received: 7 January 2022 / Revised: 8 February 2022 / Accepted: 8 February 2022 / Published: 11 February 2022

Abstract

:
The main coolant system (MCS) plays a vital role in the stability and reliability of a nuclear power plant. However, human errors and natural disasters may cause some reactor coolant system components to fail, resulting in severe consequences such as nuclear leakage. Therefore, it is crucial to perform a resilience analysis of the MCS, to effectively reduce and prevent losses. In this paper, a resilience importance measure (RIM) for performance loss is proposed to evaluate the performance of the MCS. Specifically, a loss importance measure (LIM) is first proposed to indicate the component maintenance priority of the MCS under different failure conditions. Based on the LIM, RIMs for single component failure and multiple component failures were developed to measure the recovery efficiency of the system performance. Finally, a case study was conducted to demonstrate the proposed resilience measure for system reliability. Results provide a valuable reference for increasing the system security of the MCS and choosing the appropriate total maintenance cost.

1. Introduction

In the early days of nuclear reactors, nuclear power plants (NPPs) were mainly used for military purposes. As a result of the industry’s continuous development and progress, NPPs began to be used primarily for nuclear power generation. However, following the widespread establishment and use of NPPs, nuclear accidents have highlighted the dangers of nuclear reactors. After the Chernobyl nuclear accident, people began to pay attention to nuclear safety and study the safety issues associated with nuclear reactors. In addition, issues such as waste and the economy also affect social and environmental acceptability [1]. For nuclear energy, promoting strengths and avoiding weaknesses has become a problem to be considered.
At present, the pressurized water reactor (PWR) is the most competitive reactor used in NPPs, and accounts for the largest proportion. The core system of the PWR is the main coolant system (MCS), also known as the primary loop system. It is composed of two to four identical cooling loops according to its capacity. Its main functions include transferring energy and cooling the core. In addition, because the water in the primary circuit is closed and the heat exchange is carried out through the heat transfer of the tube wall, the MCS can also prevent the leakage of radioactive materials. On 28 March 1979, a serious nuclear leakage occurred at the Three Mile Island nuclear power station in the United States, namely the Three Mile Island nuclear accident. At the time, the MCS was not functioning due to human errors. This accident resulted in serious consequences, and the clean-up cost of reactor two alone reached USD 1 billion [2].
Due to the setting of redundant components, failure of some components may not cause the failure of the whole MCS, but some degradation in the system performance. In addition, failed components may have a negative effect on the components to which they are connected, possibly leading to cascading effects with more serious consequences. At this time, maintenance is essential for managing system reliability, preventing system failures, and improving the effectiveness of system operations [3]. Therefore, this paper considers the preventive maintenance of the remaining operational components while maintaining the failed components, to restore the system performance to a great extent. By combining resilience with importance measures, the recovery process of the MCS after an accident can be better guided. Specifically, in this paper, the resilience of the MCS is represented by the loss and recovery of system performance. The loss importance measure (LIM) is proposed to indicate the maintenance priority of the MCS components under different failure conditions. Considering the limited maintenance cost, we determined the preventive maintenance components set to maximize the system performance. Then, the resilience importance measures (RIMs) for single component failure and multiple component failures were used to evaluate the recovery efficiency of the MCS.
The rest of this paper is organized as follows. Section 2 presents some relevant works. Section 3 introduces the MCS of a nuclear reactor and presents the problem descriptions. Section 4 presents the component maintenance priority and proposes the resilience measure of the MCS. Section 5 uses a case study to illustrate the proposed method. Finally, Section 6 presents the conclusions.

2. Relevant Work

System resilience is the ability of a system to respond and recover quickly from an external disruptive event. In recent years, studies have been undertaken to analyze system resilience. Panteli and Mancarella [4] introduced a new sequential Monte-Carlo-based time-series simulation model to assess the resilience of power systems. Mao et al. [5] developed two metrics, including the resilience of cumulative performance loss and the resilience of restoration rapidity, to measure the resilience of the supply chain networks. Ali et al. [6] proposed an approach that considers the resilient behavior of collaborative Cyber-Physical systems to achieve the fail-operational goal in autonomous platooning systems. Zarei et al. [7] developed a fuzzy hybrid multi-criteria decision-making model for quantifying resilience. Hajializadeh and Imani [8] presented a new framework for building a resilience and vulnerability-informed decision support system. Kim et al. [9] established a resilience assessment model by quantifying the relationship between resilience and resilience components in the recovery from emergency accidents in NPPs. Combining the Markov model with dynamic Bayesian networks, Cai et al. [10] developed a novel evaluation methodology for resilience evaluation under the influence of various external disasters. Zeng et al. [11] developed a non-homogeneous semi-Markov reward process model for resilience analysis of multi-state systems.
The above research on system resilience shows that there is no unanimous measurement standard and method for the concept of system resilience. The study of the resilience of the MCS should focus on restoring the system to normal operation to the greatest extent as soon as possible, that is, determining how to quickly improve the efficiency recovery of the MCS, to avoid more serious consequences.
For the study of reactor reliability, Mullor et al. [12] described a general method for optimal reliability and imperfect maintenance activities of repairable equipment. The proposed procedure was illustrated using a real data example of an NPP. Rejc and Cepin [13] proposed an advanced method for common cause failure modeling and estimation, allowing more detailed reliability analyses. Ma et al. [14] investigated the reliability analysis and maintenance optimization approaches of two-unit warm standby cooling equipment. He et al. [15] used the methodology to assess the reliability of squib valves in pressurized water NPPs. Hu and Peng [16] developed a dynamic reliability model with random and dependent transition probabilities for a non-repairable discrete-time multi-state system. Mamdikar et al. [17] devised a reliability analysis framework validated with 32 safety-critical system instances of the NPP. Wakankar et al. [18] proposed an architecture model to quantitatively evaluate the reliability of the large reactor safety system. Tripathi et al. [19] used the dynamic flowgraph methodology to study the NPP and analyze its reliability. Most research regarding the reactor relates to its reliability evaluation and analysis, and the different costs of the MCS are not considered.
Importance measures can quantitatively describe the importance of each component of the system, which is valuable for design and failure analysis. In recent years, increasing research has been conducted on the importance theory [20,21,22]. For example, Xu et al. [23] proposed a new component importance measure for multi-state networks based on resilience from the perspective of the post-disaster recovery process. Chybowska et al. [24] developed methods for evaluating the importance of events during disasters. Fu et al. [25] proposed a new time-dependent importance measure for degrading components to address the component reassignment problem of degrading components. Fang et al. [26] used importance measure methods in the complex project risk management field and established priorities for further decision making. Kala [27] presented a new importance measure in a reliability-oriented global sensitivity analysis. Dui et al. [28] proposed a joint importance to identify components or component groups that can be used for preventive maintenance. Dui et al. [29] proposed a resilience importance measure to quantify the contribution of a component to the loss of system performance.
The main concerns of NPPs relate to their high cost, uncertainty, and severe aftermath of accidents. However, the above studies did not focus on these issues. In addition, when some components of the MCS fail, there is a lack of an appropriate importance measure to reflect the change in the maintenance priority of other components so as to guide the recovery process of the system. In order to prevent further escalation of the accident, determining how to quickly recover the system performance in the case of component failure has become a crucial issue to be considered. In the process of recovery, the quantitative expression of the resilience of the MCS is an open and challenging question.
In this paper, we address these questions by proposing a loss importance measure (LIM) that can indicate the maintenance priority of the MCS components under different failure conditions, in addition to resilience importance measures (RIMs) for single component failure and multiple component failures that can be used to evaluate the recovery efficiency of the MCS.

3. MCS and Problem Descriptions

This section analyzes the working principle and system structure of the reactor MCS, and describes the structure and function of the main components. The schematic diagram of the reactor coolant system is shown in Figure 1. The system consists of three similar loops. Each loop has a steam generator and a main pump connected with the reactor by the main pipe to form a closed cooling circuit. The nuclear fuel in the reactor releases much heat energy through the nuclear fission reaction. The coolant takes away the heat generated and cools the fuel assembly. Driven by the reactor coolant pump (PCR), the coolant flows into the steam generator through the pipe. The heat is transferred to the secondary circuit’s main feedwater through the U-tube’s wall heat transfer. The main feedwater is heated into steam and led into the main steam pipe, thus causing the steam turbine to generate power. A pressurizer is set in the first loop to compensate and adjust the volume and pressure of the system coolant. When the pressure is lower than a specific set value, the heater at the bottom of the pressurizer operates to heat the water in the pressurizer and generate steam, making the pressure rise. When the pressure is too high, the spray system is adjusted, and the coolant is introduced from the cold pipe of the system. The saturated steam in the pressurizer is condensed into water after being sprayed by the coolant to reduce the pressure value of the system. The pressure relief valve and safety valve are installed on the top of the pressurizer to protect the system from overpressure. When the system pressure exceeds a specific set value, the pressure relief valve and safety valve open, and part of the steam is sent to the pressure relief tank to reduce the system pressure. A safety injection pump and safety injection tank are set in each loop to form a safety injection system (RIS). When a loss of coolant accident (LOCA) occurs in the reactor coolant system (RCP) or the main steam system piping (VVP) breaks, the RIS can complete the core emergency cooling function so that the residual heat of the core can be discharged in time to ensure the integrity of the containment and limit the further development of the accident [30].
In this paper, the main components of MCS are analyzed and studied, and their influences on the system reliability are considered. It is assumed that the failure time of each component follows the Weibull distribution W ( t | θ , γ ) , and components of the same type have the same parameters. The scale and shape parameters of each component are shown in Table 1. Table 2 lists the names and codes of the main components. During the system’s working process, the steam generator and main coolant pump are in operation most of the time. According to engineering experience, they are prone to failures during the system’s service life, so they are most likely to be the vulnerable components.
In the MCS, there are three loops in the system. When a component in one of the loops fails, the other two loops are not affected by the faulty component. The whole system continues to run, but the system performance is degraded. When one component and its backups fail, the whole loop may fail. For example, when LOCA occurs and both X4 and X7 fail, the safety injection system cannot operate normally, and the whole first loop fails. The remainder of the components without backup are key components, and the failure of any key component can cause the whole system to crash. The specific problems to be solved in this paper are as follows.
  • As a multi-state system, different states of components will lead to different performance levels of the MCS; how can the impact of the status change in other components on the system performance, when some components of the MCS fail, be evaluated?
  • Under the constraints of maintenance resources, how can the combination of maintenance components to achieve the highest system performance, when one or more components of the MCS fail, be determined?
  • In the current competitive energy market, the nuclear industry is committed to reducing maintenance costs while maintaining safe and reliable operations. Therefore, how can the maintenance efficiency of optimal preventive maintenance policies corresponding to different total maintenance costs at a certain time be measured so as to control maintenance costs and improve recovery efficiency?

4. Component Maintenance Priority and Resilience Measure for the MCS

The failure process of the system is the superposition of the failure processes of all components. Moreover, due to the comprehensive effects of each component state, the system presents different utilities. Therefore, we consider using system performance to quantitatively represent the loss of system performance caused by component failure. We use a 0 a 1 a M to represent the performance levels corresponding to the state space { 0 , 1 , 2 , , M } of the system. It is assumed that all the components of the MCS have two states: perfect function and failure (1 and 0). The state of component i is indicated by X i ( t ) ; a system state represents a combination of all system component states, the state of the MCS is indicated by S ( X ( t ) ) , and X ( t ) is the state space of all MCS components. By default, a 0 = 0 when the MCS is at state 0 (complete failure). Thus, the performance of the system can be measured by the system utility expectation of different system states, which is expressed as:
U ( X ( t ) ) = j = 0 M a j Pr [ S ( X ( t ) ) = j ] = j = 1 M a j Pr [ S ( X ( t ) ) = j ] = j = 1 M a j Pr [ S ( X 1 ( t ) , X 2 ( t ) , , X n ( t ) ) = j ] ,
where a j is the performance level of the MCS when the system state is j. The states and corresponding performance level parameters of the MCS are shown in Appendix A.

4.1. LIM and Determining Component Maintenance Priority

We define the LIM in this section. LIM refers to the impact of other non-failed components on system performance when some components of MCS fail. The higher the LIM value of a component, the more the system performance will recover when the component is repaired, so the higher maintenance priority of the component. It can provide a theoretical basis for the determination of component maintenance priority.
1.
When component i fails, the state of component i becomes 0, and the system performance can be expressed as:
U ( 0 i , X ( t ) ) = j = 1 M a j Pr [ S ( X 1 ( t ) , , X i 1 ( t ) , 0 i , X i + 1 ( t ) , , X n ( t ) ) = j ] .
In this case, the LIM of component k is expressed in the form of a partial derivative. The LIM of component k is:
I k / i ( t ) = ( U ( 0 i , X ( t ) ) ) ρ k ( t ) = j = 1 M ( a j a j 1 ) [ Pr ( S ( 1 k , 0 i , X ( t ) ) j ) Pr ( S ( 0 k , 0 i , X ( t ) ) j ) ] , ρ k ( t ) = Pr [ X k ( t ) = 1 ] .
The proof of Equation (3) is shown in Appendix B.
2.
Similarly, when multiple components fail, the system performance is:
U ( 0 N , X ( t ) ) = j = 1 M a j Pr [ S ( 0 N , X ( t ) ) = j ] ,
where N is the set of failed components.
At this point, the LIM of component k is equal to:
I k / N ( t ) = ( U ( 0 N , X ( t ) ) ) ρ k ( t ) = j = 1 M ( a j a j 1 ) [ Pr ( S ( 1 k , 0 N , X ( t ) ) j ) Pr ( S ( 0 k , 0 N , X ( t ) ) j ) ]
When each component has the same maintenance cost, the preventive maintenance priority of the components can be determined by the ranking of their importance measures. However, in practice, the limited maintenance cost constraints should be considered, and each component’s maintenance cost is often different (a component with a high importance measure may need a higher maintenance cost). Therefore, to maximize the expected system performance under cost constraints, the integer programming method can be used to determine the set of preventive maintenance components.
When a single component i fails, the following integer programming problem needs to be solved:
Z = max k G , k i I k / i ( t ) x k , s . t .   c i + x k c k C , x k { 0 , 1 } ,
where G is the set of system components, c i is the maintenance cost of component i, c k is the preventive maintenance cost of component k, and x k is the decision variable indicating whether to repair component k. x k is a 0–1 variable. C represents the total maintenance cost constraint.
When multiple components fail, the following integer programming problem needs to be solved:
Z = max k G , k N I k / N ( t ) x k , s . t .   c N + x k c k C , x k { 0 , 1 } ,
where c N is the cost required to repair the failed components.
For the integer programming model above, the optimal solution is { x k * , k i } (for the single component failure case) and { x k * , k N } (for the multi-component failures case). Thus, the optimal component maintenance priority set is { k | x k * = 1 } .

4.2. RIM

After determining the preventive maintenance component set based on LIM, in order to measure the recovery efficiency of preventive maintenance when one or more components fail, this section proposes the RIM based on the system resilience.
Firstly, based on Equation (1), we can obtain:
d U ( X ( t ) ) d t = d ( j = 1 M a j Pr [ S ( X ( t ) ) = j ] ) d t = j = 1 M a j i = 1 n d R i ( t ) d t Pr [ S ( X ( t ) ) = j ] R i ( t ) .
where R i ( t ) is the reliability of component i, i [ 1 , n ] , n is the number of system components.
Because
Pr [ S ( X ( t ) ) = j ] = Pr [ X i ( t ) = 1 ] Pr [ S ( 1 i , X ( t ) ) = j ] + Pr [ X i ( t ) = 0 ] Pr [ S ( 0 i , X ( t ) ) = j ] = R i ( t ) Pr [ S ( 1 i , X ( t ) ) = j ] + ( 1 R i ( t ) ) Pr [ S ( 0 i , X ( t ) ) = j ]
and λ i ( t ) = d R i ( t ) / d t R i ( t ) , then we have:
d U ( X ( t ) ) d t = i = 1 n j = 1 M a j R i ( t ) λ i ( t ) { Pr [ S ( 1 i , X ( t ) ) = j ] Pr [ S ( 0 i , X ( t ) ) = j ] } = i = 1 n I i ( t )
where I i ( t ) is the Integrated importance measure of component i. The loss of system performance per unit time when components fail is expressed as a performance loss importance measure (PLIM). The improvement of system performance per unit time in the case of component maintenance is expressed as a performance recovery importance measure (PRIM). Then the RIM of the maintenance component set is discussed in two cases.
  • Single component failure
The loss of system performance per unit time is equal to the loss of system performance caused by the failure of component i. Based on Equation (9), PLIM of component u is obtained as follows:
P L I M u ( t ) = d ( U ( X ( t ) ) U ( 0 u , X ( t ) ) ) d t = d U ( X ( t ) ) d t d U ( 0 u , X ( t ) ) d t = i = 1 n I i ( t ) + i = 1 n I i ( t ) 0 u = I u ( t )
where I u ( t ) is the Integrated importance measure of component u.
In the process of repairing the failed component u and preventively maintaining component set { k | x k * = 1 , k u } , the improvement of system performance per unit time is equal to the sum of PRIM of all non-failed components in the system:
P R I M k / u ( t ) = I k / u ( t ) { k | x k * = 1 , k u } * I k / u ( t ) { k | x k * = 1 , k u } ,
where P R I M k / u ( t ) is the contribution of the set of preventively maintained components to system performance improvement while repairing component u. I k / u ( t ) * represents system performance after the preventive maintenance, I k / u ( t ) represents system performance before the preventive maintenance.
Based on Equations (10) and (11), we can define and evaluate the RIM of component k as:
R I M k / u ( t ) = P R I M k / u ( t ) P L I M u ( t )
That, is, the RIM of component k is equal to the ratio of the sum of PRIM values of all preventively maintained components to PLIM. The larger RIM of components, the higher the recovery efficiency of system performance when they are repaired, which means that higher maintenance priority should be provided for these components for the best return in improving the system performance.
2.
Multiple components failures
The failed components are represented as the set N = { i 1 , i 2 , i 3 i y } . The loss of system performance per unit time is equal to the loss of performance caused by the failure of components i 1 , i 2 , i 3 i y to state 0. The PLIM for the set is:
P L I M N ( t ) = d ( U ( X ( t ) ) U ( 0 N , X ( t ) ) ) d t = d U ( X ( t ) ) d t d U ( 0 N , X ( t ) ) d t = i = 1 n I i ( t ) + i = 1 n I i ( t ) 0 N = i = 1 y I i ( t ) .
Based on Equation (11), the PRIM for multi-component failure is:
P R I M k / N ( t ) = I k / N ( t ) { k | x k * = 1 , k N } * I k / N ( t ) { k | x k * = 1 , k N }
Based on Equations (13) and (14), the RIM for the failed component k is defined and evaluated as:
R I M k / N ( t ) = P R I M k / N ( t ) P L I M N ( t )
In addition, an illustration using the parallel-serial structure of the MCS is shown in Appendix C to indicate the specific calculation process of the above formula.

5. Case Study

This section takes the MCS in Figure 1 to illustrate the proposed method in Section 4. The component maintenance priority based on the LIM under different illustrative failure conditions is shown in Figure 2 and Figure 3.
The maintenance priority of each component under different conditions can be shown by the LIM values of the MCS components. It can be seen from Figure 2 and Figure 3 that LIM values decrease rapidly before 1000 h and then decrease slowly to 0. It can also be observed that the maintenance priority of the reactor pressure vessel, pressurizer, and safety valve are always high because they are the key components of the MCS, and the failure of these components may lead to the failure of the whole system. Therefore, we should focus on the maintenance of these components. Safety injection pumps and safety injection tanks always have the lowest maintenance priority; thus, they have a lower chance of receiving preventive maintenance. Therefore, the high reliability of these two components should be strictly maintained to ensure the normal operation of the reactor.
When the MCS starts to operate, the maintenance priority of the safety valve is higher than that of the reactor pressure vessel, but the slope of the safety valve curve is larger. The maintenance priorities of the two components change as time proceeds. Therefore, the maintenance focus needs to be adjusted over time. In addition, comparing the curves under different conditions, it can be seen that the impact of the backup components corresponding to the failed components on the system performance will become greater, so the preventive maintenance priority of the backup components will also become higher. Therefore, when a component fails, more attention should be paid to its redundant components in other loops to ensure their normal operation.
After determining the maintenance priority of each component under different conditions, considering the cost constraints, different total maintenance costs will have corresponding different preventive maintenance policies. The maintenance and preventive maintenance costs of each component are shown in Table 3.
Solving the optimization problems of Equations (6) and (7), the optimal preventive maintenance component set at a specific time under different cost constraints can be obtained. Four cases are selected to analyze the impact of different maintenance cost constraints on the selection of preventive maintenance components. The results are shown in Table 4, Table 5, Table 6 and Table 7, where 1 means the component is selected for preventive maintenance, and 0 means the component is not selected for maintenance. Considering the maintenance cost and maintenance priority, it can be seen that, due to the lower preventive maintenance cost and higher maintenance priority, the safety valve has priority among the preventive maintenance components in different situations. Considering the influence of time, at 600 h, the choice of the reactor pressure vessel or safety valve for preventive maintenance is preferred due to the large gap between the maintenance priority curves of key components and other components. At 1000 h, the gap between the repair priority curves of each component becomes smaller, and the low maintenance costs of the safety injection pump and safety injection tank make them the preferred choice for preventive maintenance. In addition, the failure of different components will also result in the selection of their preferred components for preventive maintenance. When X10 fails, the pressurizer is more likely to be selected for preventive maintenance due to the increase in total maintenance cost. When X11 fails, the reactor pressure vessel is more likely to be selected. In the case of X9 and X10 failures, preventive maintenance for safety injection pump is preferred. When X10, X11, and X13 fail, preventive maintenance for safety injection tank is preferred.
Finally, the RIM values corresponding to the optimal preventive maintenance policies with different total maintenance costs are calculated when some components fail at a specific time. The results are shown in Figure 4. With the increase in the total maintenance cost, the number of components selected for preventive maintenance increases, and then the RIM value increases continuously in the form of a ladder. When X10 fails at 600 h, the RIM value does not increase significantly in the early stage but increases rapidly when the total maintenance cost is 19,000, and the subsequent increase is no longer obvious. In view of this situation, the total maintenance cost of 19,000 is the best, and can result in high recovery efficiency and control the total maintenance cost. When X11 fails, and the total maintenance cost is 16,000 and 21,500, the RIM value increases rapidly at 600 h. The appropriate maintenance cost should be selected according to the actual cost and performance requirements. When multiple components fail, the set of components corresponding to different maintenance costs changes less, so the change in the RIM value is less than that of a single component. The total maintenance cost also needs to be controlled according to the actual situation.

6. Conclusions

This paper proposes a novel LIM measure to evaluate the effect of the failure of one or multiple components on the system performance in the case of different failure conditions. The evaluation of the LIM can facilitate the determination of the maintenance priority of the MCS components. Based on the LIM, the optimal preventive maintenance component set under a cost constraint is calculated. Then, we propose the RIM of preventive maintenance to quantitatively evaluate the loss of system performance caused by component failure and the recovery of system performance after preventive maintenance of components so as to calculate the recovery efficiency.
For the case study conducted on the MCS, based on the LIM evaluation, it is verified that the key components of the MCS are the reactor pressure vessel, pressurizer, and safety valve. The case study also demonstrates that a component failure can lead to higher maintenance priority of its redundant components in other loops. The simulation results for the RIM show that, with the increase in the total maintenance cost, the RIM value increases continuously in the form of a ladder, and the RIM value increases significantly at some specific maintenance cost values. The results can provide valuable information to guide the recovery process.
In future research, we will consider extending and applying the proposed method to other technological or engineering multi-state systems. By defining the system performance and analyzing the impact of components on the system performance, the maintenance priority of components can be determined based on this to formulate the corresponding maintenance strategy to improve system resilience. We are also interested in extending the proposed LIM and RIM for risk analysis of safety-critical systems, including MCSs.

Author Contributions

Conceptualization, H.D. and L.C.; methodology, H.D.; software, Z.X.; validation, L.C., L.X. and B.L.; formal analysis, L.C.; investigation, H.D. and L.C.; resources, L.X.; data curation, B.L.; writing—original draft preparation, H.D. and Z.X.; writing—review and editing, L.X. and B.L.; supervision, H.D. and L.C.; funding acquisition, H.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Ministry of Education’s Humanities and Social Sciences Planning Fund (No. 20YJA630012), the Key Science and Technology Program of Henan Province (No. 222102520019), the Program for Science & Technology Innovation Talents in Universities of Henan Province (No. 22HASTIT022), the Program for young backbone teachers in Universities of Henan Province (No. 2021GGJS007), and the National Natural Science Foundation of China (No. U1904211).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The authors confirm that the data supporting the findings of this study are available within the article.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

According to Table A1, the MCS has 48 states, namely intermediate states 1–46, complete failure state 47, and perfect state 48. Each state represents the failure of the corresponding components. For example, in system state 40, ‘X11 X12 X13’ means that components X11, X12, and X13 are in the failure state, whereas other components are the in working state. When all components in the system stop working, the system is in complete failure state, a j = 0 . When all components are working, a j = 1 .
Table A1. System states and corresponding performance level parameters.
Table A1. System states and corresponding performance level parameters.
j System States a j j System States a j
1X9 0.925X11X9 0.54
2X12 0.926X11X12 0.54
3X14 0.927X16X14 0.54
4X17 0.928X16X17 0.54
5X10 0.829X10X11X13 0.432
6X15 0.830X15X16X18 0.432
7X11 0.631X10X11X9 0.432
8X16 0.632X10X11X12 0.432
9X13 0.933X15X16X14 0.432
10X18 0.934X15X16X17 0.432
11X9X140.8135X10X9X13 0.648
12X9X170.8136X10X12X13 0.648
13X12X140.8137X15X14X18 0.648
14X12X170.8138X15X17X18 0.648
15X10X90.7239X11X9X13 0.486
16X10X120.7240X11X12X13 0.486
17X15X140.7241X16X14X18 0.486
18X15X170.7242X16X17X18 0.486
19X10X130.7243X10X11X9X130.3888
20X15X180.7244X10X11X12X130.3888
21X11X130.5445X15X16X14X180.3888
22X16X180.5446X15X16X17X180.3888
23X10X110.4847failure0
24X15X160.4848perfect function1

Appendix B

Proof of Equation (3):
We have:
U ( X ( t ) ) = j = 1 M ( a j a j 1 ) Pr [ S ( 0 i , X ( t ) ) j ] + I i G ( t ) ρ i 1 ( t ) , ρ i 1 ( t ) = Pr [ X i ( t ) = 1 ] ,
where I G ( i ) is the Griffith importance of component i. When component i fails, system performance equals:
U ( 0 i , X ( t ) ) = j = 1 M a j Pr [ S ( X 1 ( t ) , , X i 1 ( t ) , 0 i , X i + 1 ( t ) , , X n ( t ) ) = j ]
At this time, the influence of component k on system performance can be expressed as:
I k / i ( t ) = ( U ( 0 i , X ( t ) ) ) ρ k 1 ( t ) = I i G ( t ) ρ i 1 ( t ) ρ k 1 ( t ) = j = 1 M ( a j a j 1 ) [ Pr ( S ( 1 k , 0 i , X ( t ) ) j ) Pr ( S ( 0 k , 0 i , X ( t ) ) j ) ] ,
where ρ k 1 ( t ) = Pr [ X k ( t ) = 1 ] .

Appendix C

Illustration using the parallel-series structure of the MCS:
Figure A1 is the high-level abstraction of the MCS, which appears as a parallel-series structure. In order to facilitate the calculation, the components are recoded. We assume that all components X i j have two states: perfect function and failure. The reliability of component X i j is R i j ( t ) , the failed component is X a b . Based on Equation (3), the LIM of component X m n can be expressed as:
I m n / a b ( t ) = ( U ( 0 a b , X ( t ) ) ) ρ m n = z = 1 M s z [ Pr ( S ( 1 m n , 0 a b , X ( t ) ) = z ) Pr ( S ( 0 m n , 0 a b , X ( t ) ) = z ) ]
where z represents the state of the MCS, s z is the performance level corresponding to the system state.
Figure A1. High-level parallel-series structure of the MCS.
Figure A1. High-level parallel-series structure of the MCS.
Mathematics 10 00563 g0a1
1.
When the failed components at state Z 1 are in series with X a b and a = 1 , we have:
Pr ( S 1 2 n , 0 1 b , X t = z 1 ) ) = 1 [ ( 1 j n R 2 j t ) 1 R 31 t R 32 t R 33 t R 34 t R 35 t = 1 ( 1 j n R 2 j t ) ( 1 j = 1 5 R 3 j t ) ] , Pr ( S 0 2 n , 0 1 b , X t = z 1 ) ) m a = R 31 t R 32 t R 33 t R 34 t R 35 t = j = 1 5 R 3 j t , Pr ( S 1 3 n , 0 1 b , X t = z 1 ) ) = 1 [ ( 1 j n R 3 j t ) 1 R 21 t R 22 t R 23 t R 24 t R 25 t = 1 ( 1 j n R 3 j t ) ( 1 j = 1 5 R 2 j t ) ] , Pr ( S 0 3 n , 0 1 b , X t = z 1 ) ) = R 21 t R 22 t R 23 t R 24 t R 25 t = j = 1 5 R 2 j t ,
thus,
[ Pr ( S ( 1 2 n , 0 1 b , X ( t ) ) = z 1 ) Pr ( S ( 0 2 n , 0 1 b , X ( t ) ) = z 1 ) ] = 1 [ ( 1 j n R 2 j ( t ) ) ( 1 j = 1 5 R 3 j ( t ) ) j = 1 5 R 3 j ( t ) = ( j n R 2 j ( t ) ) ( 1 j = 1 5 R 3 j ( t ) ) ,
[ Pr ( S ( 1 3 n , 0 1 b , X ( t ) ) = z 1 ) Pr ( S ( 0 3 n , 0 1 b , X ( t ) ) = z 1 ) ] = 1 [ ( 1 j n R 3 j ( t ) ) ( 1 j = 1 5 R 2 j ( t ) ) j = 1 5 R 2 j ( t ) = ( j n R 3 j ( t ) ) ( 1 j = 1 5 R 2 j ( t ) ) .
When a = 2 ,
Pr ( S 1 1 n , 0 2 b , X t = z 1 ) ) = 1 [ ( 1 j n R 1 j ) 1 R 31 t R 32 t R 33 t R 34 t R 35 t = 1 ( 1 j n R 1 j t ) ( 1 j = 1 5 R 3 j t ) ] , Pr ( S 0 1 n , 0 2 b , X t = z 1 ) ) = R 31 t R 32 t R 33 t R 34 t R 35 t = j = 1 5 R 3 j t , Pr ( S 1 3 n , 0 2 b , X t = z 1 ) ) = 1 [ ( 1 j n R 3 j ) 1 R 11 t R 12 t R 13 t R 14 t R 15 t R 16 t = 1 ( 1 j n R 3 j t ) ( 1 j = 1 6 R 1 j t ) ] , Pr ( S 0 3 n , 0 2 b , X t = z 1 ) ) = R 11 t R 12 t R 13 t R 14 t R 15 t R 16 t = j = 1 6 R 1 j t ,
so,
[ Pr ( S ( 1 1 n , 0 2 b , X ( t ) ) = z 1 ) Pr ( S ( 0 1 n , 0 2 b , X ( t ) ) = z 1 ) ] = 1 [ ( 1 j n R 1 j ( t ) ) ( 1 j = 1 5 R 3 j ( t ) ) j = 1 5 R 3 j ( t ) = ( j n R 1 j ( t ) ) ( 1 j = 1 5 R 3 j ( t ) ) ,
[ Pr ( S ( 1 3 n , 0 2 b , X ( t ) ) = z 1 ) Pr ( S ( 0 3 n , 0 2 b , X ( t ) ) = z 1 ) ] = 1 [ ( 1 j n R 3 j ( t ) ) ( 1 j = 1 6 R 1 j ( t ) ) j = 1 6 R 1 j ( t ) = ( j n R 3 j ( t ) ) ( 1 j = 1 6 R 1 j ( t ) ) .
By combining Equations (A5)–(A8), we obtain:
[ Pr ( S ( 1 m n , 0 a b , X ( t ) ) = z 1 ) Pr ( S ( 0 m n , 0 a b , X ( t ) ) = z 1 ) ] = ( j n R m j ( t ) ) ( 1 R i j ( t ) ) , i m ,   i a
Similarly, it can be concluded that:
[ Pr ( S ( 1 a b , X ( t ) = z 1 ) ] [ Pr ( S ( 0 a b , X ( t ) = z 1 ) ] = 0
2.
When the failure components at state Z 2 are in parallel with X a b and a = 1 ,
Pr ( S ( 1 2 n , 0 1 b , X ( t ) ) = z 1 ) ) = j n R 2 j ( t ) ,   Pr ( S ( 0 2 n , 0 1 b , X ( t ) ) = z 1 ) ) = 0 , Pr ( S ( 1 3 n , 0 1 b , X ( t ) ) = z 1 ) ) = j n R 3 j ( t ) ,   Pr ( S ( 0 3 n , 0 1 b , X ( t ) ) = z 1 ) ) = 0 ,
so,
Pr ( S ( 1 2 n , 0 1 b , X ( t ) ) = z 1 ) )   Pr ( S ( 0 2 n , 0 1 b , X ( t ) ) = z 1 ) ) = j n R 2 j ( t ) , Pr ( S ( 1 3 n , 0 1 b , X ( t ) ) = z 1 ) )   Pr ( S ( 0 3 n , 0 1 b , X ( t ) ) = z 1 ) ) = j n R 3 j ( t ) .
When a = 2 or a = 3 , after calculation, we can obtain:
[ Pr ( S ( 1 m n , 0 a b , X ( t ) ) = z 2 ) Pr ( S ( 0 m n , 0 a b , X ( t ) ) = z 2 ) ] = j n R m j ( t )
Similarly, it can be concluded that:
[ Pr ( S ( 1 a b , X ( t ) = z 2 ) ] [ Pr ( S ( 0 a b , X ( t ) = z 2 ) ] = ( j b R a j ( t ) ) ( 1 R i j ( t ) ) ,   i a ,   X i j { f a i l u r e   c o m p o n e n t s   i n   s t a t e   Z 2 } .
Based on Equations (A9) and (A11), the LIM of X m n can be obtained as:
I m n / a b ( t ) = z 1 s z 1 [ ( j n R m j ( t ) ) ( 1 R i j ( t ) ) , i m ,   i a ] + z 2 s z 2 j n R m j ( t )
Then, based on Equations (A10) and (A12), the PLIM for component X a b can be expressed as:
P L I M a b ( t ) = I a b I I M ( t ) = z a z R a b ( t ) λ a b ( t ) { [ Pr ( S ( 1 a b , X ( t ) = z ) ] [ Pr ( S ( 0 a b , X ( t ) = z ) ] } = [ Pr ( S ( 1 a b , X ( t ) = z 2 ) ] [ Pr ( S ( 0 a b , X ( t ) = z 2 ) ] = ( j b R a j ( t ) ) ( 1 R i j ( t ) ) ,   i a ,   X i j { f a i l u r e   c o m p o n e n t s   i n   s t a t e   Z 2 } .
We use A to represent the set of failed components. The time before maintenance is represented by t 1 , and t 2 means the time after maintenance. Based on Equations (12) and (A14), the PRIM is:
P R I M a b ( t ) = R i j ( t 2 ) λ i j ( t 2 ) X i j A I i j / a b ( t 2 ) R i j ( t 1 ) λ i j ( t 1 ) X i j A I i j / a b ( t 1 )
Based on Equation (12), the RIM for X a b is obtained as:
R I M A / a b ( t ) = P R I M A / a b ( t ) P L I M a b ( t )

References

  1. Ashley, S.F.; Fenner, R.A.; Nuttall, W.J.; Parks, G.T. Life-cycle impacts from novel thorium-uranium-fuelled nuclear energy systems. Energy Convers. Manag. 2015, 101, 136–150. [Google Scholar] [CrossRef]
  2. Bot, P. Human reliability data, human error and accident models—Illustration through the Three Mile Island accident analysis. Reliab. Eng. Syst. Saf. 2004, 83, 153–167. [Google Scholar] [CrossRef]
  3. Li, Y.F.; Xu, J.; Liang, Z.L.; Wang, K.B. Generalized condition-based maintenance optimization for multi-component systems considering stochastic dependency and imperfect maintenance. Reliab. Eng. Syst. Saf. 2021, 211, 107592. [Google Scholar]
  4. Panteli, M.; Mancarella, P. Modeling and Evaluating the Resilience of Critical Electrical Power Infrastructure to Extreme Weather Events. IEEE Syst. J. 2017, 11, 1733–1742. [Google Scholar] [CrossRef]
  5. Mao, X.; Lou, X.; Yuan, C.; Zhou, J. Resilience-Based Restoration Model for Supply Chain Networks. Mathematics 2020, 8, 163. [Google Scholar] [CrossRef] [Green Version]
  6. Ali, N.; Hussain, M.; Hong, J.E. Fault-Tolerance by Resilient State Transition for Collaborative Cyber-Physical Systems. Mathematics 2021, 9, 2851. [Google Scholar] [CrossRef]
  7. Zarei, E.; Ramavandi, B.; Darabi, A.H.; Omidvar, M. A framework for resilience assessment in process systems using a fuzzy hybrid MCDM model. J. Loss Prev. Process Ind. 2021, 69, 104375. [Google Scholar] [CrossRef]
  8. Hajializadeh, D.; Imani, M. RV-DSS: Towards a resilience and vulnerability- informed decision support system framework for interdependent infrastructure systems. Comput. Ind. Eng. 2021, 156, 107276. [Google Scholar] [CrossRef]
  9. Kim, J.T.; Kim, J.; Seong, P.H.; Park, J. Quantitative resilience evaluation on recovery from emergency situations in nuclear power plants. Ann. Nucl. Energy 2021, 156, 108220. [Google Scholar] [CrossRef]
  10. Cai, B.; Zhang, Y.; Wang, H.; Liu, Y.; Ji, R.; Gao, C.; Kong, X.; Liu, J. Resilience evaluation methodology of engineering systems with dynamic-Bayesian-network-based degradation and maintenance. Reliab. Eng. Syst. Saf. 2021, 209, 107464. [Google Scholar] [CrossRef]
  11. Zeng, Z.G.; Du, S.J.; Ding, Y. Resilience analysis of multi-state systems with time-dependent behaviors. Appl. Math. Model. 2021, 90, 889–911. [Google Scholar] [CrossRef]
  12. Mullor, R.; Mulero, J.; Trottini, M. A modelling approach to optimal imperfect maintenance of repairable equipment with multiple failure modes. Comput. Ind. Eng. 2019, 128, 24–31. [Google Scholar] [CrossRef] [Green Version]
  13. Rejc, Z.B.; Cepin, M. An extension of Multiple Greek Letter method for common cause failures modelling. J. Loss Prev. Process Ind. 2014, 29, 144–154. [Google Scholar] [CrossRef]
  14. Ma, X.Y.; Liu, B.; Yang, L.; Peng, R.; Zhang, X.D. Reliability analysis and condition-based maintenance optimization for a warm standby cooling system. Reliab. Eng. Syst. Saf. 2020, 193, 106588. [Google Scholar] [CrossRef]
  15. He, J.; Bao, T.; Wu, J.; Shao, G.; Du, D.; Le, X.; Zhang, Q. Reliability assessment and data processing techniques of the squib valve in pressurized water NPPs. Nucl. Eng. Des. 2018, 332, 59–69. [Google Scholar] [CrossRef]
  16. Hu, L.M.; Peng, R. Reliability modeling for a discrete time multi-state system with random and dependent transition probabilities. Proc. Inst. Mech. Eng. Part O J. Risk Reliab. 2019, 233, 747–760. [Google Scholar] [CrossRef]
  17. Mamdikar, M.R.; Kumar, V.; Singh, P.; Singh, L. Reliability and performance analysis of safety-critical system using transformation of UML into state space models. Ann. Nucl. Energy 2020, 146, 107628. [Google Scholar] [CrossRef]
  18. Wakankar, A.; Kabra, A.; Bhattacharjee, A.K.; Karmakar, G. Architectural model driven dependability analysis of computer-based safety system in nuclear power plant. Nucl. Eng. Technol. 2019, 51, 463–478. [Google Scholar] [CrossRef]
  19. Tripathi, M.; Singh, L.K.; Singh, S. Dynamic reliability framework for a Nuclear Power Plant using dynamic flowgraph methodology. Ann. Nucl. Energy 2020, 143, 107467. [Google Scholar] [CrossRef]
  20. Levitin, G.; Ben-Haim, H. Importance of protections against intentional attacks. Reliab. Eng. Syst. Saf. 2008, 93, 639–646. [Google Scholar] [CrossRef]
  21. Dui, H.; Li, S.; Xing, L.; Liu, H. System performance-based joint importance analysis guided maintenance for repairable systems. Reliab. Eng. Syst. Saf. 2019, 186, 162–175. [Google Scholar] [CrossRef]
  22. Levitin, G.; Podofillini, L.; Zio, E. Generalised importance measures for multi-state elements based on performance level restrictions. Reliab. Eng. Syst. Saf. 2003, 82, 287–298. [Google Scholar] [CrossRef]
  23. Xu, Z.P.; Ramirez-Marquez, J.E.; Liu, Y.; Xiahou, T.F. A new resilience-based component importance measure for multi-state networks. Reliab. Eng. Syst. Saf. 2020, 193, 106591. [Google Scholar] [CrossRef]
  24. Chybowska, D.; Chybowski, L.; Guze, S.; Wilczynski, P. A method for determining critical events during large disasters of production platforms. J. Loss Prev. Process Ind. 2021, 72, 104528. [Google Scholar] [CrossRef]
  25. Fu, Y.Q.; Yuan, T.; Zhu, X.Y. Importance-measure based methods for component reassignment problem of degrading components. Reliab. Eng. Syst. Saf. 2019, 190, 106501. [Google Scholar] [CrossRef]
  26. Fang, C.; Marle, F.; Xie, M. Applying importance measures to risk analysis in engineering project using a risk network model. IEEE Syst. J. 2017, 11, 1548–1556. [Google Scholar] [CrossRef]
  27. Kala, Z. New Importance Measures Based on Failure Probability in Global Sensitivity Analysis of Reliability. Mathematics 2021, 9, 2425. [Google Scholar] [CrossRef]
  28. Dui, H.; Tian, T.; Zhao, J.; Wu, S. Comparing with the joint importance under consideration of consecutive-k-out-of-n system structure changes. Reliab. Eng. Syst. Saf. 2022, 219, 108255. [Google Scholar] [CrossRef]
  29. Dui, H.; Zheng, X.; Wu, S. Resilience analysis of maritime transportation systems based on importance measures. Reliab. Eng. Syst. Saf. 2021, 209, 107461. [Google Scholar] [CrossRef]
  30. Yang, Z.D.; Zhou, L.Y.; Zhou, J.; Luan, X.C.; Wang, J.L. Simulation research on passive safety injection system of marine nuclear power plant based on compressed gas. Ann. Nucl. Energy 2020, 145, 107552. [Google Scholar]
Figure 1. Schematic diagram of the MCS.
Figure 1. Schematic diagram of the MCS.
Mathematics 10 00563 g001
Figure 2. Component maintenance priority in the case of a single component failure.
Figure 2. Component maintenance priority in the case of a single component failure.
Mathematics 10 00563 g002
Figure 3. Component maintenance priority in the case of multiple component failures.
Figure 3. Component maintenance priority in the case of multiple component failures.
Mathematics 10 00563 g003
Figure 4. RIM when total maintenance cost changes.
Figure 4. RIM when total maintenance cost changes.
Mathematics 10 00563 g004
Table 1. Scale and shape parameters of each component.
Table 1. Scale and shape parameters of each component.
No.Component θ γ
1Reactor pressure vessel18602.43
2Pressurizer27303.92
3Safety valve35671.76
4Steam generator42352.14
5Main coolant pump61652.36
6Safety injection pump73043.46
7Safety injection tank30512.03
Table 2. Major components of the MCS.
Table 2. Major components of the MCS.
CodeNameCodeName
X1Reactor pressure vesselX10Steam generator No.2
X2PressurizerX11Main coolant pump No.2
X3Safety valveX12Safety injection pump No.4
X4Safety injection pump No.1X13Safety injection tank No.2
X5Steam generator No.1X14Safety injection pump No.5
X6Main coolant pump No.1X15Steam generator No.3
X7Safety injection pump No.2X16Main coolant pump No.3
X8Safety injection tank No.1X17Safety injection pump No.6
X9Safety injection pump No.3X18Safety injection tank No.3
Table 3. Maintenance cost and preventive maintenance cost of each component.
Table 3. Maintenance cost and preventive maintenance cost of each component.
ComponentsNo.Repair CostPM Cost
Reactor pressure vesselX112,0005700
PressurizerX270003400
Safety valveX350002300
Safety injection pumpX4, X7, X9, X12, X14, X1740001800
Steam generatorX5, X10, X1590004200
Main coolant pumpX6, X11, X1610,0004700
Safety injection tankX8, X13, X1830001500
Table 4. X10 failure, t = 600 h.
Table 4. X10 failure, t = 600 h.
Cost Constraint15,00016,00017,00018,00019,00020,00021,00022,000
Part
Reactor pressure vessel00110011
Pressurizer10001111
Safety valve11111111
Steam generator01001100
Main coolant pump00000000
Safety injection pump00000000
Safety injection tank00000001
Table 5. X11 failure, t = 600 h.
Table 5. X11 failure, t = 600 h.
Cost Constraint15,00016,00017,00018,00019,00020,00021,00022,000
Part
Reactor pressure vessel00011101
Pressurizer01000011
Safety valve11111111
Steam generator00000000
Main coolant pump00100010
Safety injection pump10000100
Safety injection tank00000000
Table 6. X10 and X9 failures, t = 1000 h.
Table 6. X10 and X9 failures, t = 1000 h.
Cost Constraint18,00019,00020,00021,00022,00023,00024,00025,000
Part
Reactor pressure vessel00000110
Pressurizer00010001
Safety valve11111111
Steam generator00101001
Main coolant pump00000000
Safety injection pump11011111
Safety injection tank01000000
Table 7. X10, X11, X13 failures, t = 1000 h.
Table 7. X10, X11, X13 failures, t = 1000 h.
Cost Constraint27,00028,00029,00030,00031,00032,00033,00034,000
Part
Reactor pressure vessel00000110
Pressurizer00000001
Safety valve11111111
Steam generator00011001
Main coolant pump00000000
Safety injection pump01100000
Safety injection tank11111111
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Dui, H.; Xu, Z.; Chen, L.; Xing, L.; Liu, B. Data-Driven Maintenance Priority and Resilience Evaluation of Performance Loss in a Main Coolant System. Mathematics 2022, 10, 563. https://doi.org/10.3390/math10040563

AMA Style

Dui H, Xu Z, Chen L, Xing L, Liu B. Data-Driven Maintenance Priority and Resilience Evaluation of Performance Loss in a Main Coolant System. Mathematics. 2022; 10(4):563. https://doi.org/10.3390/math10040563

Chicago/Turabian Style

Dui, Hongyan, Zhe Xu, Liwei Chen, Liudong Xing, and Bin Liu. 2022. "Data-Driven Maintenance Priority and Resilience Evaluation of Performance Loss in a Main Coolant System" Mathematics 10, no. 4: 563. https://doi.org/10.3390/math10040563

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop