Next Article in Journal
Effects of Pulsed Electric Fields and Ultrasound Processing on Proteins and Enzymes: A Review
Previous Article in Journal
Removal of Phenol from Aqueous Solution Using Internal Microelectrolysis with Fe-Cu: Optimization and Application on Real Coking Wastewater
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Objective Nonlinear Programming Model for Reducing Octane Number Loss in Gasoline Refining Process Based on Data Mining Technology

1
School of Statistics and Mathematics, Zhejiang Gongshang University, Hangzhou 310018, China
2
School of Finance, Zhejiang Gongshang University, Hangzhou 310018, China
3
Hangzhou College of Commerce, Zhejiang Gongshang University, Hangzhou 310018, China
*
Author to whom correspondence should be addressed.
Processes 2021, 9(4), 721; https://doi.org/10.3390/pr9040721
Submission received: 13 March 2021 / Revised: 14 April 2021 / Accepted: 15 April 2021 / Published: 19 April 2021

Abstract

:
To simultaneously reduce automobile exhaust pollution to the environment and satisfy the demand for high-quality gasoline, the treatment of fluid catalytic cracking (FCC) gasoline is urgently needed to minimize octane number (RON) loss. We presented a new systematic method for determining an optimal operation scheme for minimising RON loss and operational risks. Firstly, many data were collected and preprocessed. Then, grey correlative degree analysis and Pearson correlation analysis were used to reduce the dimensionality, and the major variables with representativeness and independence were selected from the 367 variables. Then, the RON and sulfur (S) content were predicted by multiple nonlinear regression. A multi-objective nonlinear optimization model was established with the maximum reduction in RON loss and minimum operational risk as the objective function. Finally, the optimal operation scheme of the operating variable corresponding to the sample with a RON loss reduction greater than 30% in 325 samples was solved in Python.

1. Introduction

Gasoline is the main fuel for small vehicles. The exhaust emitted from gasoline combustion has an important impact on the atmospheric environment [1,2]. Therefore, countries around the world have set increasingly stringent petrol quality standards (Table 1).
With the continuous improvement in the requirements for environmental protection, the quality of gasoline products is also constantly improving, in which octane number (RON) is an important index of the quality of gasoline products [3], which directly affects whether the gasoline is qualified. Therefore, in order to meet the requirements of environmental protection and reduce the emission of harmful substances in gasoline tail gas, the demand for high-octane gasoline is increasing each year.
Heavy oil usually accounts for 40–60% of crude oil, which is difficult to directly used given the high impurity content represented by sulfur (S). In order to effectively use heavy oil resources, it is necessary to develop heavy oil lighterization technology centering on FCC to convert heavy oil into gasoline, diesel and low carbon olefin [4].
As a secondary processing unit of petroleum, the fluid catalytic cracking (FCC) unit performs the task of blending slag oil and lightening heavy oil. FCC gasoline is the main source of motor gasoline, which accounts for a large proportion of fuel in many countries at present [5]. For example, more than 70% of gasoline in China is produced by FCC, so more than 95% of S and olefins in finished gasoline is from FCC. Desulfurization and dealkylation are the key to improving the quality of FCC gasoline, so FCC gasoline must be refined to meet the gasoline quality requirements. This has attracted the attention of many scholars [6,7,8,9]. In recent years, several new FCC gasoline desulfurization and olefins reduction processes have been developed abroad, such as ExxonMobil’s Octgain process [10] and Scanfining process [11], Universal Oil Products Company’s ISAL process [12], and Institute of Petrole France’s Primer-G process [6].
In order to protect the environment and reduce the emission of harmful substances in gasoline exhaust, gasoline cleanness will be necessary. The core aspect of the cleaning is to significantly reduce the contents of S, olefin, aromatic hydrocarbon, and benzene in gasoline, while maintaining a high RON [13].
With the increasing demand for high-octane gasoline at present, research on RON is also increasing [14,15,16,17,18]. The measures to improve the gasoline RON of heavy oil FCC unit are being analyzed and discussed in order to achieve the goal of increasing the gasoline RON.
RON is the most important index reflecting the gasoline combustion performance, and is used as a commodity brand of gasoline (such as #92, #95, and #89). In the process of desulfurization and olefins reduction of FCC gasoline, the octane number of gasoline is generally reduced using existing technology. Each reduction in octane rating by one unit is equivalent to a loss of about RMB 150 per ton. Take a 1 million tons/year catalytic cracking gasoline refining unit as an example, if RON loss can be reduced by 0.3 units, the economic benefit would be RMB 45 million.
The chemical process is usually modeled by data association or mechanism modeling, and some achievements have been made [19,20,21,22,23,24,25]. However, because of the complexity of the refining process and the diversity of the equipment, their operation variables (control variables) are highly nonlinear, with coupling between the relationships. Given the relatively few variables in traditional data correlation models, mechanism models for the analysis of the raw material demand do not responding to the process optimization in a timely manner, so the effect is not ideal. Recently, data mining has been widely used in various fields [26,27]. Typically, the task of data mining is to predict variables that are difficult to obtain from experiments. By fitting the function well, the predictive variable can be output quickly through the input of the independent variable. Therefore, data mining assisted by machine learning [28,29] is a good tool to solve chemical process.
Because of the complexity of the engineering process system, most engineering knowledge is based on a variety of empirical equations. Therefore, it is very difficult to excavate the internal relationship in the process of engineering. Kalogirou [30] presented a typical study, applying data mining methods to optimize and design engineering applications. Artificial neural network (ANN) was applied to the training of transient simulation data of typical solar energy system in industrial engineering. Then, genetic algorithm [31,32] was used to estimate the optimal size of the parameters according to the results of ANN. But given the “black box” nature of neural networks, this means that we don’t know how and why the neural network will produce a certain output. If we want to show the optimization process and explain the operation scheme better, applying the optimization model will be a good choice.
Sinopec Gaoqiao Petrochemical was established in 1957. It is one of the earliest petrochemical plants in China. With total assets of 19.3 billion yuan, it is one of Sinopec’s key 10 million-ton-level oil refining bases and clean oil production bases. It is the backbone enterprise of refined oil supply in Shanghai and the only fuel-lubricant oil refining enterprise in the Yangtze River Delta region, known as the cradle of China’s chemical industry.
At present, an FCC gasoline refining desulfurization unit of Sinopec Gaoqiao Petrochemical has been running for 4 years and has accumulated large amounts of historical data. The average RON loss of its gasoline products is 1.37 units (%), but the minimum RON loss of similar units is only 0.6 units. Therefore, there is still a large room for optimization.
The goal of this study was to optimize and adjust the operation plan of the FCC refinery desulfurization unit in the petrochemical plant.

2. Process and Methods

2.1. Sample

We used four years of data from April 2017 to May 2020 from Sinopec Gaoqiao Petrochemical, using the data of 325 samples, considering 354 operation sites of the FCC gasoline refining desulfurization unit, and 367 variables including 7 raw material properties, 2 properties of raw adsorbents, 2 properties of regenerative adsorbents, 2 product properties, and so on. Data acquisition information is provided in Table 2.

2.2. Modeling Purposes

Based on the data from these samples, a mathematical model was established to predict the RON loss of gasoline, and the optimum operating conditions of the samples were obtained. Under the condition of S content being no more than 5 μg/g, the RON loss of gasoline was reduced by at least 30%.

2.3. Modeling Procedures

The modeling procedure was performed according to the following steps:
  • Data preprocessing.
  • Based on the data of 325 samples, major variables were selected from 367 operational variables by dimension reduction to establish the RON loss reduction model.
  • According to the main variables selected, data mining technology was used to establish the RON loss prediction model, and the model was verified.
  • Based on the regression model, the optimization model was established. Using the computer algorithm to solve it, so as to obtain the optimal operation scheme.

2.4. Data Preprocessing

For the data comparison in this study, a common situation occurred: Some of the data only contained part of the time point, so we needed to eliminate outliers from the incomplete value free sample data.
Delete the data that is difficult to recover and correct (abbreviated as delete). Replace the blank value with the average of the data before and after the two hours (abbreviated as mean patch). For the data that can be made up, mean patch is carried out after it is removed.
When processing data, it is sometimes necessary to exclude certain outliers in order to improve the accuracy of the data. In this case, the Laida criterion, also known as the 3 σ criterion, can be considered.
Assume that a set of measurement variables contains only random error. Let these variables be x 1 , x 2 , . . . , x n . Take the average of them, and denote it as x. And then calculate the residual error v i = x i x , i = 1 , 2 , , n . The standard error σ is calculated by using Bessel’s formula. If the residual error v i of a measurement value x i satisfies | v i | = | x i x | > 3 σ , it is considered that the residual error belongs to the gross error and should be eliminated. Bessel’s formula is as follows:
σ = [ 1 n 1 i = 1 n V i 2 ] 1 / 2 = i = 1 n x i 2 ( i = 1 n x i ) 2 / n / ( n 1 ) 1 / 2
The data preprocessing method is shown in Figure 1.

2.5. Descending Dimension

In order to establish the RON loss prediction model and the RON loss reduction model for gasoline, it was necessary to screen out the main variables that affected the model. Given the 367 variables involved, variable filtering was the top priority in the modeling. Variable screening was needed to reduce the dimension of raw materials, adsorbent properties, and operating variables. The operating variables were highly nonlinear and strongly coupled, so the main variables needed to be selected having representativeness and with the the two elements being independent.
For a screening factor or feature selection problem, the amount of computation increases exponentially as the number of variables increases, and we wanted to identify the set of variables that reflected the characteristics of the problem while minimizing the amount of computation. Thus, in essence, this is a combinatorial optimization problem. For this optimization problem, each method can only provide an approximate solution because the subset search methods and evaluation criteria are different, and the rationality of the solutions of different methods is affected by their own limitations. In order to obtain a more systematic and reasonable subset of variables, we used integration learning as a reference and used a combination of various methods to form an integration method to minimize the problems caused by the single datum and methods, and thus improve the effect.
There are many types of dimension reduction methods [33,34]. According to the data’s characteristics in this article, principal component analysis (PCA) should not be used because the variables obtained through PCA are no longer independent and representative. There were only 325 data samples and 367 variables, so direct multiple linear regression dimension reduction was not possible. If we regressed directly, there would be serious overfitting. Similarly, random forest and other machine learning algorithms were not appropriate. As such, first, we used grey relational analysis [35,36,37] to screen out the variables with a large correlation degree with octane number to identify the representative variables. Then, the Pearson correlation coefficient [38,39,40] was used to investigate the correlation between variables to obtain both representative and independent variable sets, and only one of the strongly correlated variable sets was retained. The same method was used to screen out the main variables affecting the S content.
Grey correlation degree analysis [37] provides a reliable theoretical basis for system prediction, control, and decision making. This method is suitable for time series data, that is, it can accurately and rapidly reflect the situation change in data over time. When the sample data reflect the change trend in two factors are basically the same, the shapes of the time series curve are relatively close, reflecting the high correlation degree between the two factors, or, vice versa, reflecting the low correlation degree between the two factors. The specific steps are as follows:
(1)
Deterministic analysis sequence
An index is determined as a dependent variable, and its time data series, as a reference series, denoted as X 0 . The remaining indexes are taken as independent variables, and their data sequence is taken as a comparison sequence, denoted as X i , where i = 1 , 2 , , m and the number of observation samples is n. Then, a two-dimensional table of ( m + 1 ) × n for the sequence data is obtained, where the value is X i ( k ) and i = 0 , 1 , , m ; k = 1 , 2 , , n .
(2)
Dimensionless processing
In order to eliminate the influence of different index dimensions on the data analysis results, variables need to be standardized.
Forward pointer:
X i ( k ) min k X i ( k ) max k X i ( k ) min k X i ( k )
Negative pointer:
max k X i ( k ) X i ( k ) max k X i ( k ) min k X i ( k )
(3)
Calculate the correlation coefficient between each factor
ξ i ( k ) = min i min k x 0 ( k ) x i ( k ) + ρ max i max k x 0 ( k ) x i ( k ) x 0 ( k ) x i ( k ) + ρ max i max k x 0 ( k ) x i ( k )
where i = 1 , 2 , , m , ξ i ( k ) is the coefficient of the correlation between the comparison sequence and the reference sequence; ρ ( 0 , 1 ) is the resolution coefficient, usually taken as 0.5; and min i min k x 0 ( k ) x i ( k ) and max i max k x 0 ( k ) x i ( k ) are the two-stage minimum difference and the two-stage maximum difference, respectively.
(4)
Calculate the correlation degree between all the variables
ξ i = 1 n k = 1 n ξ i ( k ) , i = 1 , 2 , , m

2.6. Regression

Among the many statistical models, the multiple regression model is the most frequently used [41,42,43]. If the relationship between dependent variables and independent variables can be expressed in linear form, multiple linear regression models can be used for analysis. The multiple linear regression model uses the known data set to fit the dependent variables of interest and the main influencing factors, and then the effect of the influencing variables can be analyzed through the obtained linear expression to explain and predict the changing trend in the dependent variables with the changes in these influencing factors.
The ordinary linear regression model of dependent variable y and variable x 1 , x 2 , , x p is:
y = ω 0 + ω 1 x 1 + ω 2 x 2 + · · · + ω p x p + ε
where ω 0 is the regression constant, ω j , j = 1 , 2 , , p represents the regression coefficient; y is the predictive variable; x i , i = 1 , 2 , , p is the explanatory variable; and ε is the random error term. Moreover, it is assumed that E ( ε ) = 0 , V a r ( ε ) = 1 . When p = 1 , the above ordinary linear regression model is a unitary linear regression model; when p > 1 , it is a multiple linear regression model.
In practical problems, the relationship between dependent variables and independent variables of many regression models is not linear, and some regression models can be transformed into linear relationships by functional transformation of independent variables or dependent variables, and linear regression can be used to solve unknown parameters and perform regression diagnosis. Common transformations are shown in Table 3.

2.7. Optimization

The first determining objective functions were the largest RON loss decline and minimum operation risk. According to the problem, the corresponding constraint conditions were given to establish a multi-objective nonlinear optimization model [44]. Then, we transformed the multi-objective model into single-objective optimization model, and by introducing a new variable to the absolute value, nonlinear optimization becomes linear optimization. Finally, the optimal operation scheme of operation variables was obtained by Python.
In practice, the multi-objective problem is widely used, and there are many methods available to solve this problem. Generally, the basic approach involves transforming solving the multi-objective problem into solving a single-objective problem.
The common methods to transform multi-objective problem into single-objective problem include optimization method, linear weighted sum method, stratified sequencing method and so on. When one objective is clearly important among multiple objectives, optimization method is more appropriate.
Multi-objective programming is generally expressed as:
min f ( x ) = i = 1 n c i x i , i = 1 , 2 , n min g ( x ) = i = 1 n d i x i , i = 1 , 2 , n s . t . i = 1 n a i x i b i , b i , = b i , x i 0 , i = 1 , 2 n .
Optimization method can optimize the main objective while taking into account other objectives, then the problem becomes to find the optimal value of the main objective. Other objectives are constraining conditions that limit their variation within a certain range. Thus, multi-objective problem can be reduced to the following single-objective problem.
min g ( x ) = i = 1 n d i x i , i = 1 , 2 , n s . t . f ( x ) k ( k ) i = 1 n a i x i b i , b i , = b i , x i 0 , i = 1 , 2 n .

3. Results and Discussion

3.1. Filtering of Major Variables

The idea of reducing dimensionality first and then modeling is often used in engineering problems. In order to screen out the main variables affecting the model, the grey correlation degree analysis was first used to screen out the variables with a greater correlation degree with RON to obtain representative variables. It was solved in Python to produce Figure 2.
After obtaining the grey correlative degree results of the independent variables and RON, we found that most of the non-operated variables in the independent variables are ranked in the top 150 of the grey correlative degree results, and most of the variables after the 150 are operated variables. This is also logical, because it is the non-operated variables that have a decisive impact on the change of RON, and the operated variables can only play an auxiliary role.
We can know from Figure 2 that the grey correlative degree of variables after the number of 150 began to decline significantly. If we choose more than 150 variables, this will not significantly increase the number of final independent variables. The variables after 150 are basically operating variables that have a relatively general impact on RON and they will have a strong relationship with the operated variables in the top 150 variables. These added variables will be replaced by the top 150 variables in the correlation analysis. So we finally adopted 150 as the demarcation point.
Perform Pearson correlation analysis on these 150 variables. Only one of them was kept in the strongly correlated variable group, identifying 19 variables that were representative and independent. The variable names and properties are shown in Table 4.
The correlation coefficients of these 19 variables are shown in Table 5.
As shown in Table 5, we found independence between variables. Therefore, these 19 variables are both representative and independent major variables affecting the gasoline RON.
The 14 main variables affecting S content were obtained by the same method. The variable names and properties are shown in Table 6.
The correlation coefficients of these 14 variables are shown in Table 7.
As shown in Table 7, we found independence between variables. Therefore, these 14 variables were found to be both representative and independent main variables affecting S content.

3.2. Regression Analysis Model

First, the screened variables were taken as independent variables, the RON and S content of the product were taken as dependent variables to establish a multiple regression model, and the T-test and F-test were carried out to construct the product prediction model.
The logarithm model, inverse model, quadratic model, and combination model were transformed into linear regression separately in the form shown in Table 3.
Then, stepwise regression analysis was used for the linear model, logarithm model, inverse model, quadratic model, and combination model, separately. We used mean square error (MSE) as the evaluation criterion to determine the optimal case in each model. The optimal model of various models were obtained and their effects were recorded as shown in Table 8.
In this study, the MSE and MAE were selected as the measurement indexes of the prediction effect and prediction accuracy.
Mean square error:
M S E = 1 T t = 1 T e t 2
Mean absolute error:
M A E = 1 T i = 1 T e t
Then, we compared these models to obtain Figure 3.
As shown in Figure 3, both the MSE and MAE of the combination model were obviously the smallest, and R 2 was also the best, so we chose the combination model.
Based on the 325 sample data and using the main variables selected, the product RON was used as the predictive variable in the least squares method, and the variables with P > 1 were eliminated by the t-test. The optimal model of combination model was obtained as
y = 0.7855 + 0.9958 x 1 0.0103 x 2 + 0.0004 x 3 0.0032 x 4 + 0.0012 x 5 0.0046 x 6 + 0.2709 l n x 7 0.2904 l n x 8 ,
which can be abbreviated to
y = a 0 + i = 1 6 a i x i + i = 7 8 a i l n x i .
Similarly, the S content prediction model was obtained
z = 17.4300 + 0.0743 t 1 + 0.115 t 2 0.0368 t 3 + 0.4615 l n t 4 + 3.2104 l n t 5 1.6812 l n t 6 .
That is,
z = c 0 + i = 1 3 c i t i + i = 4 6 c i l n t i .
The symbolic descriptions of (12) and (14) were listed in Table 9.
It should be noted that x 3 and t 5 are the same variable, and x 6 and t 3 are also the same variable, so there is a certain correlation between the product RON and S content in the production process.

3.3. RON Loss Reduction Multi-Objective Optimization Model

Our goal was to optimize and adjust the operation plan of the FCC refinery desulfurization unit in the petrochemical plant to try to reduce the RON loss of gasoline by more than 30% under the premise of ensuring the desulfurization effect of gasoline products (S content being not more than 5 μg/g). In addition to the consideration of RON loss and operational risk, other factors such as the range of operational variables for the feasibility of S content operations were also considered. Only after the comprehensive measurement of all factors can the decision be more reasonable, which is a typical multi-objective optimization problem.
Combined with the actual situation, we ignored the minor factors. Therefore, we set up two objective functions: maximum RON loss reduction and minimum operational risk.

3.3.1. Objective Function 1

Taking the maximum RON loss reduction as the target, the corresponding function relation was obtained: RON loss = original RON loss – adjusted RON loss. If R 0 represents raw material RON, R 1 represents original product RON, and Y 1 represents adjusted product RON, then RON loss can be expressed as
R 0 R 1 ( R 0 y 1 ) = y 1 R 1 .
Substitute (12) into the above equation to obtain the objective function
m a x a 0 + i = 1 6 a i x i + i = 7 8 a i l n x i R 1 .

3.3.2. Objective Function 2

In order to minimize the operational risk, we wrote the corresponding function relations. In the actual engineering operation process, for either manual adjustment or computer or machine control, each operation may pose certain risks: the less, the better. The original value of x i is denoted by r i , and Δ i represents the maximum amount of change in the operation variable each time. The original value of t i is denoted by s i , and λ i represents the maximum amount of change in the operation variable each time. As x 1 , x 2 are not operation variables, i = 3 , 4 , , 8 in the function expression for the number of operations of RON. As t 1 , t 4 are not operation variables, i = 2 , 3 , 5 , 6 in the function expression for the number of operations of S content. In addition, the number of operations that need to change but do not reach a multiple of Δ i and λ i should be rounded up, even if it is less than Δ i and λ i . Then, the objective function with the smallest number of operations can be expressed as
min i = 3 8 x i r i Δ i + i = 2 3 t i s i λ i + i = 5 6 t i s i λ i .
However, the number of operations is related to the variable, and the relationship containing the variable needs to be rounded up. This was difficult to achieve in the solver process, so we needed to remove the integer symbol. After removing the integer symbol, this expression expresses the relative modification range. Similarly, the smaller the relative modification range, the fewer the operation times, and the lower the operation risk. Therefore, the function expression of objective function 2 can be expressed as
min i = 3 8 x i r i Δ i + i = 2 3 t i s i λ i + i = 5 6 t i s i λ i .

3.3.3. Constraint Condition 1

First, since optimization was carried out, the product RON after the optimized operation had to be larger than the previous product RON R 1 ; otherwise, the optimization would be meaningless. In addition, the product RON was transformed from the raw material RON, so the product RON after optimized operation could not be greater than the raw material RON R 0 . Therefore, the constraint condition1 was
R 1 a 0 + i = 1 6 a i x i + i = 7 8 a i l n x i R 0 .

3.3.4. Constraint Condition 2

Our goal was to reduce the RON loss of gasoline to more than 30% under the condition that the S content was no more than 5 μg/g, so we needed to restrict the S content to less than or equal to 5 μg/g. Therefore, the constraint condition 2 was
c 0 + i = 1 3 c i t i + i = 4 6 c i l n t i 5 .

3.3.5. Constraint Condition 3

Due to process requirements and operation experience, each operation variable has its operating range, with [ l i , u i ] denoting the operating range corresponding to x i , and [ L i , U i ] denoting the operating range corresponding to t i . In addition, x 1 , x 2 , t 1 , t 4 are not operation variables, so i = 3 , 4 , , 8 for x i and i = 2 , 3 , 5 , 6 for t i . Therefore, the corresponding constraint condition 3 is
l i x i u i , i = 3 , 4 , , 8 .
L i t i U i , i = 2 , 3 , 5 , 6 .
It ensures the robustness of the model. Because as long as the variable meets the constraints of the scope of operation, the operability of the solution can be guaranteed.

3.3.6. Constraint Condition 4

As x 3 and t 5 are the same variable, and x 6 and t 3 are also the same variable, the constraint condition 4 is
x 3 = t 5 ,
x 6 = t 3 .

3.3.7. Constraint Condition 5

As x 1 , x 2 , t 1 , t 4 are not operation variables, they are equal to their original value and do not change, so the constraint condition 5 is
x i = r i , i = 1 , 2 ,
t i = s i , i = 1 , 4 .
By sorting out the above objective function and constraint conditions, the multi-objective nonlinear optimization model of the RON loss reduction is expressed as
m a x a 0 + i = 1 6 a i x i + i = 7 8 a i l n x i R 1 , min i = 3 8 x i r i Δ i + i = 2 3 t i s i λ i + i = 5 6 t i s i λ i , s . t . R 1 a 0 + i = 1 6 a i x i + i = 7 8 a i l n x i R 0 , c 0 + i = 1 3 c i t i + i = 4 6 c i l n t i 5 , l i x i u i , i = 3 , 4 , , 8 , L i t i U i , i = 2 , 3 , 5 , 6 , x 3 = t 5 , x 6 = t 3 , x i = r i , i = 1 , 2 , t i = s i , i = 1 , 4 .
Although the RON loss reduction multi-objective optimization model was established, we found that there were two urgent problems to be addressed before solving: The dual objective needed to be changed into a single objective and the absolute values needed to be removed, and nonlinear programming needed to be changed into linear programming. Then, we used Python to solve the model.
Here, RON loss reduction was an obviously important main goal, so risk minimization was set as a constraint condition to transform the multi-objective problem into a single-objective problem. We set the maximum value of the constraint range corresponding to operational risk as T, then the RON loss reduction multi-objective nonlinear optimization model was transformed into the single-objective nonlinear model.
There were also absolute values in the model, so the nonlinear optimization was changed into linear optimization by introducing new variables to the absolute value. We introduced the non-negative variable x i 1 , x i 2 , i = 3 , 4 , , 8 , and t i 1 , t i 2 , i = 2 , 3 , 5 , 6 .
Let
x i r i = x i 1 + x i 2 , i = 3 , 4 , , 8 , x i r i = x i 1 x i 2 , i = 3 , 4 , , 8 , x i 1 , x i 2 0 , i = 3 , 4 , , 8 , t i s i = t i 1 + t i 2 , i = 2 , 3 , 5 , 6 , t i r i = t i 1 t i 2 , i = 2 , 3 , 5 , 6 , t i 1 , t i 2 0 , i = 2 , 3 , 5 , 6 .
Thus,
x i = x i 1 x i 2 + r i , i = 3 , 4 , , 8 ,
t i = t i 1 t i 2 + s i , i = 2 , 3 , 5 , 6 .
In conclusion, the multi-objective nonlinear optimization model is transformed into
m a x a 0 + a 1 x 1 + a 2 x 2 + i = 3 6 a i ( x i 1 x i 2 + r i ) + i = 7 8 a i l n ( x i 1 x i 2 + r i ) R 1 s . t . i = 3 8 x i 1 + x i 2 Δ i + i = 2 3 t i 1 + t i 2 λ i + i = 5 6 t i 1 + t i 2 λ i T , a 0 + a 1 x 1 + a 2 x 2 + i = 3 6 a i x i 1 x i 2 + γ i + i = 7 8 a i ln x i 1 x i 2 + γ i R 1 , a 0 + a 1 x 1 + a 2 x 2 + i = 3 6 a i x i 1 x i 2 + γ i + i = 7 8 a i ln x i 1 x i 2 + γ i R 0 , c 0 + c 1 t 1 + c 4 ln t 4 + i = 2 3 c i t i 1 t i 2 + s i + i = 5 6 c i ln t i 1 t i 2 + s i 5 , x i 1 x i 2 + r i l i , i = 3 , 4 , , 8 , x i 1 x i 2 + r i u i , i = 3 , 4 , , 8 , t i 1 t i 2 + s i L i , i = 2 , 3 , 5 , 6 , t i 1 t i 2 + s i U i , i = 2 , 3 , 5 , 6 , x 3 1 x 3 2 + r 3 = t 5 1 t 5 2 + s 5 , x 6 1 x 3 2 + r 6 = t 3 1 t 3 2 + s 3 , x 1 = r 1 , x 2 = r 2 , t 1 = s 1 , t 4 = s 4 , x i 1 , x i 2 0 , i = 3 , 4 , , 8 , t i 1 , t i 2 0 , i = 2 , 3 , 5 , 6 .
We used SLSQP in Python to solve the model. Some adjustments to the sample with a loss reduction of 30% were in Table 10.
Where, N represents total number of adjustments, N m a x denotes the number of adjustments of the variable with the largest number of adjustments.
Note that in the model, the larger the T, the more samples can reach the 30% loss reduction. The T value is usually set according to practical experience. In this study, it was set to 120. After calculation, we found that 245 of the 325 samples achieved 30% loss reduction, that is, 78.2% reached the target, indicating that a good effect was achieved.
We randomly selected 10 samples with a loss reduction greater than 30% and compared the product RON before and after optimization to obtain Figure 4.
The operation scheme of the main variables of some samples whose loss reduction was greater than 30% is provided in Table 11.
It can be seen from Table 11 that t 2 and t 6 need not to be adjusted, while x 4 , x 5 , x 6 , t 3 are optimal in the fixed state regardless of which sample.
Their feature descriptions are summarized into Table 12.
From the above results, we can briefly summarize the optimal operation scheme. For any sample, adjust S-ZORB.TE_2103.PV, S-ZORB.FT_9002.DACA, S-ZORB.TE_5004.DACA to 410, 650, 40 respectively. S-ZORB.FC_2801.PV, S-ZORB.TE_2501.DACA, S-ZORB. TXE_3201A.DACA need be adjusted differently depending on the initial state. And there is no need to adjust S-ZORB.TE_1203.PV, S-ZORB.PDC_2702.DACA.

4. Conclusions

This research was conducted on the basis of four years’ data from Sinopec Gaoqiao Petrochemical, including 325 samples, 354 operation sites of the FCC gasoline refining desulfurization unit, and 367 variables. Data preprocessing was first carried out on these data. Then, the main variables were selected from 367 operational variables through grey correlation analysis and Pearson correlation analysis. Then, according to the selected major variables, the RON loss prediction model was established by multiple regression analysis, and the model was verified. The same method was used to predict S content. Notably, x 3 and t 5 are the same variable and x 6 and t 3 are also the same variable, so there is a certain correlation between product RON and S content in the production process. In the end, under the condition that the S content would be no more than 5 μg/g, the reduction in RON loss was more than 30%, and the operating conditions after the optimization of the major variables were obtained.
We presented a new systematic method for determining an optimal operation scheme for minimising RON loss and operational risks. For the data with highly nonlinear and strongly coupled, it was preprocessed first, dimension reduction and regression prediction were carried out, finally an optimization model was established to solve the optimization scheme. With the system we’ve built, just input data that needs to be improved, operation scheme to reduce RON loss can be output.
This system can be applied to the dimensionality reduction, prediction, and operation scheme optimization of data with the same properties. The regression model in the paper can be used to fields related to forecasting, such as medical care, geography, finance, and industry. The optimization model is suitable for optimizing model parameters, seeking extreme values, and so on. The optimization model is based on the regression model, which can well solve most of the problems that require modeling or optimization due to data clutter, such as the establishment of a bank credit information system, the measurement of the effectiveness of biomedical reagents, etc.

Author Contributions

Conceptualization, X.L. and T.J.; methodology, X.L. and M.X.; software, Y.L. and X.H.; validation, X.L. and T.J.; data curation, Y.L. and X.H.; writing–review and editing, X.L. and M.X.; visualization, Y.L. and X.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (11971433) and First Class Discipline of Zhejiang-A (Zhejiang Gongshang University-Statistics 1020JYN4120004G-091), Zhejiang Provincial Department of Education (Y202045232).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

FCCfluid catalytic cracking
RONoctane number
Ssulfur
ANNartificial neural network
PHDDoctor of Philosophy
LIMSLaboratory Information Management System
PCAprincipal component analysis
MSEmean square error
MAEmean absolute error
SLSQPSequential Least Squares Programming

References

  1. Gaffney, J.S.; Marley, N.A.; Martin, R.S.; Dixon, R.W.; Reyes, L.G.; Popp, C.J. Potential air quality effects of using ethanol-gasoline fuel blends: A field study in Albuquerque, New Mexico. Environ. Sci. Technol. 1997, 31, 3053–3061. [Google Scholar] [CrossRef]
  2. Nelson, P.; Quigley, S. The hydrocarbon composition of exhaust emitted from gasoline fuelled vehicles. Atmos. Environ. 1984, 18, 79–87. [Google Scholar] [CrossRef]
  3. Barcia, P.S.; Silva, J.A.; Rodrigues, A.E. Adsorption dynamics of C5–C6 isomerate fractions in zeolite beta for the octane improvement of gasoline. Energy Fuels 2010, 24, 1931–1940. [Google Scholar] [CrossRef]
  4. Ahmadlou, M.; Rezakazemi, M. Computational fluid dynamics simulation of moving-bed nanocatalytic cracking process for the lightening of heavy crude oil. J. Porous Media 2018, 21. [Google Scholar] [CrossRef]
  5. Samolada, M.; Baldauf, W.; Vasalos, I. Production of a bio-gasoline by upgrading biomass flash pyrolysis liquids via hydrogen processing and catalytic cracking. Fuel 1998, 77, 1667–1675. [Google Scholar] [CrossRef]
  6. Shih, S.; Owens, P.; Palit, S.; Tryjankowski, D. Mobil’s OCTGAIN™ Process: FCC Gasoline Desulfurization Reaches a New Performance Level. In Proceedings of the NPRA 1999 Annual Meeting, San Antonio, TX, USA, 21–23 March 1999. [Google Scholar]
  7. Zainullin, R.; Koledina, K.; Akhmetov, A.; Gubaidullin, I. Kinetics of the catalytic reforming of gasoline. Kinet. Catal. 2017, 58, 279–289. [Google Scholar] [CrossRef]
  8. Ayanoğlu, A.; Yumrutaş, R. Production of gasoline and diesel like fuels from waste tire oil by using catalytic pyrolysis. Energy 2016, 103, 456–468. [Google Scholar] [CrossRef]
  9. Wang, W.; Wang, S.; Wang, Y.; Liu, H.; Wang, Z. A new approach to deep desulfurization of gasoline by electrochemically catalytic oxidation and extraction. Fuel Process. Technol. 2007, 88, 1002–1008. [Google Scholar] [CrossRef]
  10. Nocca, J.; Cosyns, J.; Debuisschert, Q.; Didillon, B. The domino interaction of refinery processes for gasoline quality attainment. In Proceedings of the NPRA Annual Meeting, San Antonio, TX, USA, 26–28 March 2000. [Google Scholar]
  11. Halbert, T.R.; Greeley, J.P. Technology options for meeting low sulfur targets. In Proceedings of the NPRA Annual Meeting, AM-00-11, San Antonio, TX, USA, 26–28 March 2000. [Google Scholar]
  12. Nelson, P.; George, J.; Edward, J. Meeting gasoline pool sulfur and octane targets with ISAL (r) process. In Proceedings of the NPRA Annual Meeting, AM-00-52, San Antonio, TX, USA, 23–25 March 2000. [Google Scholar]
  13. Jiang, H.; Huang, S. Eight-lump reaction kinetic model for the maximizing isoparaffin process for cleaning gasoline and enhancing propylene yield. Energy Fuels 2016, 30, 10770–10776. [Google Scholar] [CrossRef]
  14. Sadighi, S.; Zahedi, S.; Hayati, R.; Bayat, M. Studying Catalyst Activity in an Isomerization Plant to Upgrade the Octane Number of Gasoline by Using a Hybrid Artificial-Neural-Network Model. Energy Technol. 2013, 1, 743–750. [Google Scholar] [CrossRef]
  15. Hamadi, A.S. Selective additives for improvement of gasoline octane number. Tikrit J. Eng. Sci. 2010, 17, 22–35. [Google Scholar]
  16. Druzgalski, C.L.; Lapointe, S.; Whitesides, R.; McNenly, M.J. Predicting octane number from microscale flame dynamics. Combust. Flame 2019, 208, 5–14. [Google Scholar] [CrossRef]
  17. Daly, S.R.; Niemeyer, K.E.; Cannella, W.J.; Hagen, C.L. Predicting fuel research octane number using Fourier-transform infrared absorption spectra of neat hydrocarbons. Fuel 2016, 183, 359–365. [Google Scholar] [CrossRef] [Green Version]
  18. Santikunaporn, M.; Alvarez, W.E.; Resasco, D.E. Ring contraction and selective ring opening of naphthenic molecules for octane number improvement. Appl. Catal. A Gen. 2007, 325, 175–187. [Google Scholar] [CrossRef]
  19. Diwekar, U.M.; Rubin, E. Stochastic modeling of chemical processes. Comput. Chem. Eng. 1991, 15, 105–114. [Google Scholar] [CrossRef] [Green Version]
  20. Te Braake, H.; Van Can, H.; Verbruggen, H. Semi-mechanistic modeling of chemical processes with neural networks. Eng. Appl. Artif. Intell. 1998, 11, 507–515. [Google Scholar] [CrossRef]
  21. Rice, R.G.; Do, D.D. Applied Mathematics and Modeling for Chemical Engineers; John Wiley & Sons: Hoboken, NJ, USA, 2012. [Google Scholar]
  22. Tränkle, F.; Zeitz, M.; Ginkel, M.; Gilles, E. PROMOT: A modeling tool for chemical processes. Math. Comput. Model. Dyn. Syst. 2000, 6, 283–307. [Google Scholar] [CrossRef]
  23. McBride, K.; Sundmacher, K. Overview of surrogate modeling in chemical process engineering. Chem. Ing. Tech. 2019, 91, 228–239. [Google Scholar] [CrossRef] [Green Version]
  24. Kulov, N.; Gordeev, L. Mathematical modeling in chemical engineering and biotechnology. Theor. Found. Chem. Eng. 2014, 48, 225–229. [Google Scholar] [CrossRef]
  25. Ni, J.; Zhou, Y.; Li, S. Hamiltonian Monte Carlo-Based D-Vine Copula Regression Model for Soft Sensor Modeling of Complex Chemical Processes. Ind. Eng. Chem. Res. 2020, 59, 1607–1618. [Google Scholar] [CrossRef]
  26. Xuan, Z.; Wei, G.; Ni, Z.; Zhang, J. Decoupling Offloading Decision and Resource Allocation via Deep Reinforcement Learning and Sequential Least Squares Programming. In Proceedings of the International Conference on Communications and Networking, Shanghai, China, 20–21 November 2020; pp. 554–566. [Google Scholar]
  27. Sultana, N.; Hossain, S.; Taher, S.; Khan, A.; Razzak, S.; Haq, B. Modeling and optimization of non-edible papaya seed waste oil synthesis using data mining approaches. S. Afr. J. Chem. Eng. 2020, 33, 151–159. [Google Scholar] [CrossRef]
  28. Arendt, K.; Jradi, M.; Wetter, M.; Veje, C.T. Modestpy: An open-source python tool for parameter estimation in functional mock-up units. In Proceedings of the American Modelica Conference, Cambridge, MA, USA, 9–10 October 2018. [Google Scholar]
  29. Johansson, R.; Johansson, R.; John, S. Numerical Python; Springer: Berlin/Heidelberg, Germany, 2019; Volume 1. [Google Scholar]
  30. Kalogirou, S.A. Optimization of solar systems using artificial neural-networks and genetic algorithms. Appl. Energy 2004, 77, 383–405. [Google Scholar] [CrossRef]
  31. Kumar, M.; Husain, M.; Upreti, N.; Gupta, D. Genetic Algorithm: Review and Application; SSRN 3529843; SSRN: Rochester, NY, USA, 2010. [Google Scholar]
  32. Whitley, D. A genetic algorithm tutorial. Stat. Comput. 1994, 4, 65–85. [Google Scholar] [CrossRef]
  33. Benner, P.; Mehrmann, V.; Sorensen, D.C. Dimension Reduction of Large-Scale Systems; Springer: Berlin/Heidelberg, Germany, 2005; Volume 45. [Google Scholar]
  34. Fodor, I.K. A Survey of Dimension Reduction Techniques; Technical Report; Citeseer: University Park, PA, USA, 2002. [Google Scholar]
  35. Kuo, Y.; Yang, T.; Huang, G.W. The use of grey relational analysis in solving multiple attribute decision-making problems. Comput. Ind. Eng. 2008, 55, 80–93. [Google Scholar] [CrossRef]
  36. Yazdani, M.; Kahraman, C.; Zarate, P.; Onar, S.C. A fuzzy multi attribute decision framework with integration of QFD and grey relational analysis. Expert Syst. Appl. 2019, 115, 474–485. [Google Scholar] [CrossRef] [Green Version]
  37. Hu, Y.C. A multivariate grey prediction model with grey relational analysis for bankruptcy prediction problems. Soft Comput. 2020, 24, 4259–4268. [Google Scholar] [CrossRef]
  38. Ezekiel, M. Methods of Correlation Analysis; American Psychological Association: Washington, DC, USA, 1930. [Google Scholar]
  39. Zhuang, X.; Yang, Z.; Cordes, D. A technical review of canonical correlation analysis for neuroscience applications. Hum. Brain Mapp. 2020, 41, 3807–3833. [Google Scholar] [CrossRef] [PubMed]
  40. Zhao, J.; Deng, J.; Chen, L.; Wang, T.; Song, J.; Zhang, Y.; Shu, C.M.; Zeng, Q. Correlation analysis of the functional groups and exothermic characteristics of bituminous coal molecules during high-temperature oxidation. Energy 2019, 181, 136–147. [Google Scholar] [CrossRef]
  41. Aiken, L.S.; West, S.G.; Reno, R.R. Multiple Regression: Testing and Interpreting Interactions; Sage: Newcastle upon Tyne, UK, 1991. [Google Scholar]
  42. Allison, P.D. Multiple Regression: A Primer; Pine Forge Press: Newcastle upon Tyne, UK, 1999. [Google Scholar]
  43. Cohen, J.; Cohen, P.; West, S.G.; Aiken, L.S. Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences; Routledge: London, UK, 2013. [Google Scholar]
  44. Miettinen, K. Nonlinear Multiobjective Optimization; Springer: Berlin/Heidelberg, Germany, 2012; Volume 12. [Google Scholar]
Figure 1. Data preprocessing method.
Figure 1. Data preprocessing method.
Processes 09 00721 g001
Figure 2. RON grey correlative degree.
Figure 2. RON grey correlative degree.
Processes 09 00721 g002
Figure 3. Comparison of the optimal case effects of various models.
Figure 3. Comparison of the optimal case effects of various models.
Processes 09 00721 g003
Figure 4. Comparison diagram of product RON before and after optimization.
Figure 4. Comparison diagram of product RON before and after optimization.
Processes 09 00721 g004
Table 1. European Union and China’s main specifications for automotive gasoline.
Table 1. European Union and China’s main specifications for automotive gasoline.
Standard of Gasoline for VehiclesRONS Content (μg/g, ≤)Benzene Content (%, ≤)Aromatic Content (%, ≤)Olefin Content (%, ≤)
China 3 (2010)90–9715014030
China 4 (2014)90–975014028
China 5 (2017)89–951014024
China 6-A (2019)89–95100.83518
China 6-B (2023)89–95100.83515
European 5 (2009)951013518
European 6 (2013)951013518
World Fuel Code (5 class gasoline)951013510
Table 2. Raw data sampling.
Table 2. Raw data sampling.
Database SourceVariableDate (YYYY.MM)Frequency
Honeywell PHDoperation variable data2017.04–2019.093 min/time
2019.10–2020.056 min/time
LIMS experimental databasedata on raw materials, products, catalysts2017.04–2020.052 times/week
Table 3. Common transformations.
Table 3. Common transformations.
ModelOriginal ModelTransformNew Model
Logarithm y = a + b ln t x = ln t y = a + b x
Inverse y = a + b 1 t x = 1 t y = a + b x
Quadratic y = a + b t + c t 2 x 1 = t , x 2 = t 2 y = a + b x 1 + c x 2
Combination y = a + b t + c ln t x 1 = t , x 2 = ln t y = a + b x 1 + c x 2
Table 4. Main RON variables.
Table 4. Main RON variables.
CodeProperty
1Raw Material RONNonoperational Variable
2Saturated HydrocarbonNonoperational Variable
3Aromatic HydrocarbonNonoperational Variable
4Density(20 °C)Nonoperational Variable
5S-ZORB.FC_2801.PVOperational Variable
6S-ZORB.TE_2103.PVOperational Variable
7S-ZORB.FT_9201.PVOperational Variable
8S-ZORB.SIS_TE_6010.PVOperational Variable
9S-ZORB.TC_1606.PVOperational Variable
10S-ZORB.FC_1203.PVOperational Variable
11S-ZORB.FT_9002.DACAOperational Variable
12S-ZORB.LT_9001.DACAOperational Variable
13S-ZORB.LC_5102.DACAOperational Variable
14S-ZORB.TE_2501.DACAOperational Variable
15S-ZORB.TE_5004.DACAOperational Variable
16S-ZORB.SIS_PT_2602.PVOperational Variable
17S-ZORB.PDC_2702.DACAOperational Variable
18S-ZORB.TXE_3201A.DACAOperational Variable
19S-ZORB.CAL.LINE.PVOperational Variable
Table 5. The correlation coefficients of the major RON variables.
Table 5. The correlation coefficients of the major RON variables.
12345678910111213141516171819
11.000−0.4290.0810.2460.088−1.002−1.224−1.0180.0650.378−1.1360.0480.030−1.0590.185−1.0130.017−1.1740.075
2−1.4291.0000.006−1.138−1.4790.122−1.0590.123−1.1040.1270.223−1.192−1.268−1.112−1.3190.1410.0410.1340.157
30.0810.0061.0000.445−1.111−1.070−1.0010.1700.060−1.1000.033−1.094−1.1320.061−1.133−1.0750.1190.2930.120
40.246−1.1380.4451.000−1.158−1.128−1.1340.2250.102−1.0170.133−1.026−1.147−1.045−1.085−1.056−1.0070.2730.109
50.088−1.479−1.111−1.1581.0000.4420.3030.0900.122−1.0050.2010.2350.3350.2720.4530.1490.377−1.249−1.365
6−1.0020.122−1.070−1.1280.4421.0000.3000.594−1.2260.4230.3880.031−1.1860.4840.2950.4660.494−1.0920.014
7−1.224−1.059−1.001−1.1340.3030.3001.0000.3360.022−1.1260.3570.0160.0500.2660.0540.1410.2020.181−1.002
8−1.0180.1230.1700.2250.0900.5940.3361.000−1.0970.2650.491−1.082−1.2730.3610.1600.2760.2270.3700.355
90.065−1.1040.0600.1020.122−1.2260.022−1.0971.000−1.1040.0600.1820.264−1.303−1.014−1.124−1.082−1.005−1.182
100.3780.127−1.100−1.017−1.0050.423−1.1260.265−1.1041.0000.080−1.061−1.0980.1310.1500.1920.230−1.1440.278
11−1.1360.2230.0330.1330.2010.3880.3570.4910.0600.0801.000−1.002−1.1090.0550.0680.1880.2640.188−1.058
120.048−1.192−1.094−1.0260.2350.0310.016−1.0820.182−1.061−1.0021.0000.276−1.0490.1700.021−1.047−1.042−1.003
130.030−1.268−1.132−1.1470.335−1.1860.050−1.2730.264−1.098−1.1090.2761.000−1.2430.128−1.1210.013−1.225−1.282
14−1.059−1.1120.061−1.0450.2720.4840.2660.361−1.3030.1310.055−1.049−1.2431.0000.1210.2000.1240.2950.259
150.185−1.319−1.133−1.0850.4530.2950.0540.160−1.0140.1500.0680.1700.1280.1211.0000.1100.007−1.029−1.073
16−1.0130.141−1.075−1.0560.1490.4660.1410.276−1.1240.1920.1880.021−1.1210.2000.1101.0000.162−1.0620.073
170.0170.0410.119−1.0070.3770.4940.2020.227−1.0820.2300.264−1.0470.0130.1240.0070.1621.000−1.144−1.252
18−1.1740.1340.2930.273−1.249−1.0920.1810.370−1.005−1.1440.188−1.042−1.2250.295−1.029−1.062−1.1441.0000.383
190.0750.1570.1200.109−1.3650.014−1.0020.355−1.1820.278−1.058−1.003−1.2820.259−1.0730.073−1.2520.3831.000
Table 6. Major S content variables.
Table 6. Major S content variables.
CodeProperty
1Raw Material S contentNonoperational Variable
2Raw Material RONNonoperational Variable
3OlefinNonoperational Variable
4Density(20 °C)Nonoperational Variable
5S-ZORB.FC_2801.PVOperational Variable
6S-ZORB.TE_2103.PVOperational Variable
7S-ZORB.FT_9201.PVOperational Variable
8S-ZORB.AT_1001.PVOperational Variable
9S-ZORB.AC_6001.PVOperational Variable
10S-ZORB.TC_1606.PVOperational Variable
11S-ZORB.TE_1203.PVOperational Variable
12S-ZORB.LC_5102.DACAOperational Variable
13S-ZORB.TE_5004.DACAOperational Variable
14S-ZORB.PDC_2702.DACAOperational Variable
Table 7. The correlation coefficient of the main S content variables.
Table 7. The correlation coefficient of the main S content variables.
1234567891011121314
11.0000.4750.3930.1910.174−0.105−0.1180.4310.1380.1130.0520.2440.0910.088
20.4751.0000.3660.2460.088−1.002−1.2240.219−1.0320.0650.0390.0300.1850.017
30.3930.3661.000−1.0390.485−1.0860.0550.4970.0740.0750.1240.2970.344−1.083
40.1910.246−1.0391.000−1.158−1.128−1.134−1.102−1.0200.1020.025−1.147−1.085−1.007
50.1740.0880.485−1.1581.0000.4420.3030.4850.1350.1220.2600.3350.4530.377
6−1.105−1.002−1.086−1.1280.4421.0000.3000.1010.005−1.226−1.265−1.1860.2950.494
7−1.118−1.2240.055−1.1340.3030.3001.0000.0910.0290.022−1.0080.0500.0540.202
80.4310.2190.497−1.1020.4850.1010.0911.0000.1040.1100.1140.2590.1910.190
90.138−1.0320.074−1.0200.1350.0050.0290.1041.0000.0930.0340.091−1.0140.080
100.1130.0650.0750.1020.122−1.2260.0220.1100.0931.0000.3290.264−1.014−1.082
110.0520.0390.1240.0250.260−1.265−1.0080.1140.0340.3291.0000.2460.1350.042
120.2440.0300.297−1.1470.335−1.1860.0500.2590.0910.2640.2461.0000.1280.013
130.0910.1850.344−1.0850.4530.2950.0540.191−1.014−1.0140.1350.1281.0000.007
140.0880.017−1.083−1.0070.3770.4940.2020.1900.080−1.0820.0420.0130.0071.000
Table 8. Functional form. MSE, mean square error; MAE, mean absolute error; R 2 , coefficient of determination.
Table 8. Functional form. MSE, mean square error; MAE, mean absolute error; R 2 , coefficient of determination.
ModelLinearLogarithmInverseQuadraticCombination
MSE0.0870.0660.2070.1640.043
MAE0.1980.1880.4110.3850.154
R 2 0.9480.9500.8140.8830.956
Table 9. Nomenclature.
Table 9. Nomenclature.
SymbolMeaningUnit
yProduct RON%
x 1 Raw Material RON%
x 2 Saturated Hydrocarbon%
x 3 S-ZORB.FC_2801.PVNm3/h
x 4 S-ZORB.TE_2103.PV°C
x 5 S-ZORB.FT_9002.DACANm3/h
x 6 S-ZORB.TE_5004.DACA°C
x 7 S-ZORB.TE_2501.DACA°C
x 8 S-ZORB.TXE_3201A.DACA°C
zProduct S contentμg/g
t 1 Olefin%
t 2 S-ZORB.TE_1203.PV°C
t 3 S-ZORB.TE_5004.DACA°C
t 4 Raw Material S contentμg/g
t 5 S-ZORB.FC_2801.PVNm3/h
t 6 S-ZORB.PDC_2702.DACA
Table 10. Adjustments to the sample with a loss reduction of 30%.
Table 10. Adjustments to the sample with a loss reduction of 30%.
Sample R 0 R 1 Adjusted R 1 Loss of DeclineN N max
190.60089.22089.77840.4%12645
290.50089.32089.68430.9%12648
390.70089.32089.89741.8%12650
490.40089.02089.61343.0%12546
589.60088.32088.81939.0%12535
691.00089.59090.21844.5%12640
790.40089.20089.62835.6%12637
890.50089.20089.74441.9%12640
990.40089.30089.63930.8%12543
1090.20088.80089.44045.7%12634
1190.20088.90089.42140.1%12738
1290.20088.90089.42040.0%12629
1389.20087.70088.41547.6%12434
1486.80085.40086.01443.8%12536
1587.80086.10086.94749.8%12752
1688.40086.70087.53949.4%12549
1888.00086.49087.11241.2%12754
2088.80087.59087.95730.3%12748
2188.10086.59087.25043.7%12641
2389.60087.99088.74246.7%12548
2489.00087.59088.13939.0%12547
2589.30087.89088.45540.1%12545
2689.20087.69088.36444.6%12741
2788.70087.29087.85139.8%12546
2989.70088.29088.83538.7%12438
3089.20087.99088.35430.1%12638
Table 11. Operation scheme of some samples.
Table 11. Operation scheme of some samples.
VariableState53272101133173208235264313
x 3 Original649.341649.651652.010650.353648.496698.963748.972752.160735.709849.255
New688.459689.943673.658693.481680.361714.732780.464781.161776.337879.403
x 4 Original427.132425.923421.054427.430426.620426.620428.348424.131425.028424.969
New410.000410.000410.000410.000410.000410.000410.000410.000410.000410.000
x 5 Original401.752414.780514.261377.199452.605550.321448.482464.489411.784454.247
New650.000650.000650.000650.000650.000650.000650.000650.000650.000650.000
x 6 Original64.96061.00564.00167.41956.94764.55261.73367.67150.02066.559
New40.00040.00040.00040.00040.00040.00040.00040.00040.00040.000
x 7 Original215.227216.749167.173210.013225.360190.146174.339202.865206.900171.674
New250.000250.000222.727246.926234.211240.377224.549246.049250.000215.856
x 8 Original366.214413.725410.599426.298402.040431.804311.955377.150359.226396.256
New357.878394.361410.599426.298349.257431.711311.955377.150326.958396.256
t 2 Original30.11435.91332.24833.64729.62133.85332.25431.20334.21734.640
New30.11435.91332.24833.64729.62133.85332.25431.20334.21734.640
t 3 Original64.96061.00564.00167.41956.94764.55261.73367.67150.02066.559
New40.00040.00040.00040.00040.00040.00040.00040.00040.00040.000
t 5 Original649.341649.651652.010650.353648.496698.963748.972752.160735.709849.255
New688.459689.943673.658693.481680.361714.732780.464781.161776.337879.403
t 6 Original27.15126.58527.37526.09726.35231.07830.95724.44229.66827.450
New27.15126.58527.37526.09726.35231.07830.95724.44229.66827.450
Table 12. Features of variables in the optimal operation scheme.
Table 12. Features of variables in the optimal operation scheme.
FeatureVariableValue
Stable x 4 410
x 5 650
x 6 = t 3 40
Unstable x 3 = t 5 /
x 7 /
x 8 /
Unadjusted t 2 /
t 6 /
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Liu, X.; Liu, Y.; He, X.; Xiao, M.; Jiang, T. Multi-Objective Nonlinear Programming Model for Reducing Octane Number Loss in Gasoline Refining Process Based on Data Mining Technology. Processes 2021, 9, 721. https://doi.org/10.3390/pr9040721

AMA Style

Liu X, Liu Y, He X, Xiao M, Jiang T. Multi-Objective Nonlinear Programming Model for Reducing Octane Number Loss in Gasoline Refining Process Based on Data Mining Technology. Processes. 2021; 9(4):721. https://doi.org/10.3390/pr9040721

Chicago/Turabian Style

Liu, Xiao, Yilai Liu, Xuejun He, Min Xiao, and Tao Jiang. 2021. "Multi-Objective Nonlinear Programming Model for Reducing Octane Number Loss in Gasoline Refining Process Based on Data Mining Technology" Processes 9, no. 4: 721. https://doi.org/10.3390/pr9040721

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop