Next Article in Journal
Histone Deacetylase Inhibitor Panobinostat Benefits the Therapeutic Efficacy of Oncolytic Herpes Simplex Virus Combined with PD-1/PD-L1 Blocking in Glioma and Squamous Cell Carcinoma Models
Next Article in Special Issue
Editorial: Infectious Disease Epidemiology and Transmission Dynamics
Previous Article in Journal
Improved Expression of SARS-CoV-2 Spike RBD Using the Insect Cell-Baculovirus System
Previous Article in Special Issue
Spatio-Temporal Epidemiology of the Spread of African Swine Fever in Wild Boar and the Role of Environmental Factors in South Korea
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Structural Identifiability of a Humidity-Driven Epidemiological Model of Influenza Transmission

1
School of Mathematics, Changchun Normal University, Changchun 130032, China
2
WHO Collaborating Centre for Infectious Disease Epidemiology and Control, School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
3
Laboratory of Data Discovery for Health Limited, Hong Kong Science and Technology Park, Hong Kong, China
4
Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY 10032, USA
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Viruses 2022, 14(12), 2795; https://doi.org/10.3390/v14122795
Submission received: 28 October 2022 / Revised: 8 December 2022 / Accepted: 10 December 2022 / Published: 15 December 2022
(This article belongs to the Special Issue Infectious Disease Epidemiology and Transmission Dynamics)

Abstract

:
Influenza epidemics cause considerable morbidity and mortality every year worldwide. Climate-driven epidemiological models are mainstream tools to understand seasonal transmission dynamics and predict future trends of influenza activity, especially in temperate regions. Testing the structural identifiability of these models is a fundamental prerequisite for the model to be applied in practice, by assessing whether the unknown model parameters can be uniquely determined from epidemic data. In this study, we applied a scaling method to analyse the structural identifiability of four types of commonly used humidity-driven epidemiological models. Specifically, we investigated whether the key epidemiological parameters (i.e., infectious period, the average duration of immunity, the average latency period, and the maximum and minimum daily basic reproductive number) can be uniquely determined simultaneously when prevalence data is observable. We found that each model is identifiable when the prevalence of infection is observable. The structural identifiability of these models will lay the foundation for testing practical identifiability in the future using synthetic prevalence data when considering observation noise. In practice, epidemiological models should be examined with caution before using them to estimate model parameters from epidemic data.

1. Introduction

Influenza epidemics cause considerable morbidity and mortality every year worldwide [1,2,3,4,5,6]. According to The World Health Organization, influenza leads to about 3 to 5 million cases of severe illness and about 290,000 to 650,000 deaths annually [7]. This burden might be alleviated by understanding historical and current transmission dynamics and predicting further transmission trends of influenza to assist public health authorities in designing effective interventions and vaccination strategies, especially in some special situations (for example, the emergence of influenza A (H1N1) viruses and the potential rebound of influenza in the post-COVID-19 pandemic period) [8,9,10,11,12].
Climate-driven epidemiological models, such as models driven by humidity, are mainstream tools for understanding seasonal transmission dynamics and predicting future trends in influenza activity. Recently, climate-driven epidemiological models have been successfully applied to recreate historical activity time series of influenza and to forecast the week with the highest influenza activity in temperate, tropical and subtropical regions (for example, in the United States and Hong Kong) [13,14,15,16,17]. Confident predictions using these models depend on testing various numerical optimization algorithms, such as filter-based data-assimilation algorithms, which fit the model to epidemic data when parameterizing models to simulate the transmission dynamics of influenza [13]. However, the model structure identifiability needs to be tested to avoid the optimization algorithm falling into a set of the locally optimal solution.
Model identifiability includes structural and practical identifiability, involving investigation of whether unknown model parameters can be uniquely determined from noise-free epidemic data and accurately identified from noisy data, respectively [18,19,20,21,22,23,24]. The structural identifiability of the model is a fundamental prerequisite for practical identifiability and for the model to be used in practice [25]. It is necessary, but insufficient, to accurately identify model parameters from actual noisy data because a model that is structurally identifiable might be unidentifiable when noisy data are used. The scaling method, which is a structural identifiability analysis method that has been proposed in recent years, is based on the scale invariance of the equations [26]. Compared with existing structural identifiability methods (such as differential algebra), this method has the advantage of simple operation (no advanced computing skills are required) and low computational cost, particularly when analyzing high-dimensional non-linear models [27,28,29,30]. This method has been used to analyze the structural identifiability of mathematical modeling describing biological processes, such as the generalized mass-action model [31,32,33].
In this study, we apply the scaling method to analyse the structural identifiability of several types of commonly used humidity-driven epidemiological models. We investigate whether the key epidemiological parameters (infectious period, the average duration of immunity, the average latency period, the maximum and minimum daily basic reproductive number) can be determined simultaneously when the population prevalence of infected people is observable.

2. Methods

Here, we briefly introduce the process from building an epidemiological model to applying the model in real-world applications. After building an epidemiological model of influenza transmission, we test the structural identifiability of the model to investigate the properties of the model itself (the upper part of Figure 1). If the model is structurally identifiable, then we test the practical identifiability of the model, which determines whether the model is identifiable for noise data (e.g., reported noise), using synthetic data experiments. If the model is also practically identifiable, then this model can potentially be used in practice after evaluating its performance for specific functions (e.g., inference, forecasting, etc.). On the other hand, if a model is structurally unidentifiable, any parameter estimated by optimization algorithms might be unreliable; then, we need to consider modifying the model. For some complex models, for example, agent-based influenza transmission models [34], we can only test the practical identifiability directly after building the model (the lower part of Figure 1), as current mathematical methods may not be able to theoretically test the structural identifiability of these complex models. Here, we mainly focus on testing the structural identifiability of humidity-driven epidemiological models.

2.1. Humidity-Driven Epidemiological Model

We test the structural identifiability of several commonly used humidity-driven epidemiological models. The form of each model is as follows.
1.
SIS model:
d S d t = I D β ( t ) I S N α , d I d t = β ( t ) I S N I D + α .
2.
SIRS model:
d S d t = N S I L β ( t ) I S N α , d I d t = β ( t ) I S N I D + α .
3.
SEIR model:
d S d t = β ( t ) I S N α , d E d t = β ( t ) I S N E W + α , d I d t = E W I D .
4.
SEIRS model:
d S d t = N S E I L β ( t ) I S N α , d E d t = β ( t ) I S N E W + α , d I d t = E W I D .
where N, S, E, and I represent the total number of people, the number of susceptible people, the number of exposed people, and the number of infectious individuals, respectively. N = S + I in the SIS model, N = S + I + R ie all parameters and unobse N = S + E + I + R in the SEIR model and the SEIRS model. α represents the rate at which influenza viruses are imported into the model due to travel. t represents time, such as the day, week or year. β ( t ) represents the transmission rate at time t. D represents the mean infectious period in all four models. L represents the average duration of immunity in the SIRS model and the SEIRS model. W represents the average latency period in the SEIR model and the SEIRS model. The flow diagrams of these models are presented in Figure A1 in Appendix A.
The basic reproductive number, R 0 ( t ) , represents the average number of secondary infections generated by a primary case in a fully susceptible population at time t, which is proportional to the transmission rate. The expression is as follows:
R 0 ( t ) = β ( t ) D .
The influenza virus survival and transmission are relative to the absolute humidity (AH) shown from laboratory experiments [35]. The specific humidity (SH) is a measure of AH, in which q ( t ) represents SH at time t. In this model, the humidity factor modulates R 0 ( t ) through an exponential relationship:
R 0 ( t ) = e ( a · q ( t ) + b ) + R 0 min ,
where a = 180 is estimated by fitting laboratory influenza virus survival to the value of AH using a regression model. b = log ( R 0 max R 0 min ) , R 0 max and R 0 min are the maximum and minimum daily basic reproductive number, respectively. Parameter sets Θ 1 = D , R 0 max , R 0 min in the SIS model, Θ 2 = D , L , R 0 max , R 0 min in the SIRS model, Θ 3 = D , W , R 0 max , R 0 min in the SEIR model and Θ 4 = D , L , W , R 0 max , R 0 min in the SEIRS model may be estimated by fitting the model to the epidemic data. We analyse the structural identifiability of these models, which tests whether each parameter set can be uniquely determined simultaneously when the prevalence data is observable (using the observation record from the beginning to the end time, such as one influenza season in temperate regions).

2.2. The Frameworks for the Scaling Method

The scaling method is easy to use for identifying the structural identifiability of a non-linear model based on simple scaling transformations and the solution of simple sparse systems of equations [26]. The ordinary differential equations(ODE) model, which is applied to the frameworks of the scaling method, is as follows:
d x i d t = f i ( x 1 , , x s , x s + 1 , , x n ; θ 1 , , θ m ) , x i ( 0 ) = x i , 0 , i = 1 , , n ,
where x i ( 0 ) represents the initial conditions, d x i d t represents the change of x i over time, depending on m parameters θ j , and the number of state variables is n. f i is a function characterising the specific details of the change rate of x i . The simplicity of this method depends on the ability to decompose functions f i as a sum of P functional independent components, f i j ,
d x i d t = f i ( x 1 , , x s , x s + 1 , , x n ; θ 1 , , θ m ) = j = 1 P f i j ( x ¯ j , θ ¯ j ) .
A property of f i j is that f i j is functionally independent of f i k for every j k . Here, x j ¯ and θ j ¯ represent the subset of variables and parameters of function f i j , respectively. In simple terms, if f i x i , x 2 , , , f n x i , x 2 , are linearly independent functions, then the only solution of the equation
i = 1 n a i f i x 1 , x 2 , = 0
is a 1 = a 2 = , , = a n = 0 . The functional independence theorems used in this work are presented in Appendix A.
Based on decomposing functions, the steps of the scaling method are as follows:
Step 1.
Scale all parameters and unobserved variables using unknown scaling factors, μ :
θ i μ θ i θ i i = 1 , , m , x j μ x j x j j = s + 1 , , n ,
and substitute them into Equation (8). The experiment measures variables x 1 , , x s without modifying them (n is the total number of state variables, x 1 , , x s is the observable state variable and x s + 1 , , x n is the unobserved state variable).
Step 2.
Obtain the scaled version for each functionally independent function. Namely,
f i j ( x ¯ j , θ ¯ j ) = f i j ( x ¯ j , μ θ ¯ j θ ¯ j ) i = 1 , , s
and
f i j ( x ¯ j , θ ¯ j ) = 1 μ x i f i j ( μ x ¯ j x ¯ j , μ θ ¯ j θ ¯ j ) i = s + 1 , , n .
Step 3.
Find scaling factor combinations that maintain the system invariant. Only the parameters θ ¯ j with a solution μ θ ¯ j = 1 are identifiable. Only the variables, x ¯ j with μ x ¯ j = 1 are observable. Otherwise, parameters whose scaling factors are coupled form identifiable groups but cannot be identified independently.

3. Results

Here, we demonstrate how to use the simple scaling method to test the identifiability of the four humidity-driven epidemiological models introduced in the Methods section.
We consider a scenario where only I is observed, representing a kind of epidemic data that can be collected in practice. For example, in the UK, the COVID-19 Infection Survey identified those people testing positive for coronavirus (COVID-19) in private residential households (surveillance sensors) at a point in time to help the government make decisions on how to respond to the emerging epidemic and provide information to the public [36]. This infection survey can, in principle, be extended to survey influenza to identify new positive cases of influenza regularly around the influenza season. In this scenario, we can obtain I from the sentinel surveillance systems. We test whether the humidity-driven epidemiological models in Equations (1)–(4) are structurally identifiable, respectively.

3.1. SIS Model

For the SIS model, we test whether parameter set Θ 1 = D , R 0 max , R 0 min can be determined uniquely from the observable I. First, we investigate whether the differential equation in Equation (1) can be decomposed into a sum of linearly independent functions. This is a prerequisite for using the scaling method. For the differential equation associated with S, we have:
f S 1 = I D , f S 2 = β ( t ) I S N .
According to Theorem A1, the generalized Wronskian determinant is as follows:
W S = 0 f S 1 0 f S 2 1 f S 1 1 f S 2 = I D β ( t ) I S N 1 D β ( t ) ( S + I ) N = β ( t ) I 2 D N 0 .
So, f S 1 and f S 2 are linearly independent functions. Similarly, for the differential equation associated with I, we have:
f I 1 = β ( t ) I S N , f I 2 = I D .
The corresponding generalized Wronskian determinant is as follows:
W I = 0 f I 1 0 f I 2 1 f I 1 1 f I 2 = β ( t ) I S N I D β ( t ) ( S + I ) N 1 D = β ( t ) I 2 D N 0 .
So, f I 1 and f I 2 are linearly independent functions. Next, we explore whether parameter set Θ 1 = D , R 0 max , R 0 min can uniquely be determined from the observable I using the scaling method. The steps of the scaling method are as follows:
Step 1.
We scale the parameter set Θ 1 = D , R 0 max , R 0 min and the unobserved variable (S) by unknown scaling factors:
D μ D D , R 0 max μ R 0 max R 0 max , R 0 min μ R 0 min R 0 min , S μ S S .
Step 2.
We obtain the scaled version for each functional linear independent function in Equations (13) and (15).
Step 3.
We obtain the identifiability equations:
I μ S μ D D = I D ,
I μ S S μ S N μ D D e a q ( t ) μ R 0 max R 0 max μ R 0 min R 0 min + μ R 0 min R 0 min = I S N D e a q ( t ) R 0 max R 0 min + R 0 min ,
I μ S S N μ D D e a q ( t ) μ R 0 max R 0 max μ R 0 min R 0 min + μ R 0 min R 0 min = I S N D e a q ( t ) R 0 max R 0 min + R 0 min ,
I μ D D = I D .
Manipulating the above formulas, the identifiability equations are:
μ D = 1 , μ S = 1 , e a q ( t ) μ R 0 max R 0 max μ R 0 min R 0 min + μ R 0 min R 0 min = e a q ( t ) R 0 max R 0 min + R 0 min .
From the last formula in Equation (22), we further manipulate this formula and obtain the following equation:
e a q ( t ) ( μ R 0 max 1 ) R 0 max ( μ R 0 min 1 ) R 0 min = ( 1 μ R 0 min ) R 0 min .
e a q ( t ) is not equal to zero. When the left and right sides of the equation are equal, we have:
( μ R 0 max 1 ) R 0 max ( μ R 0 min 1 ) R 0 min = 0 , ( 1 μ R 0 min ) R 0 min = 0 , R 0 max > 0 , R 0 min > 0 .
By solving the above equation, we can obtain:
μ R 0 max = 1 , μ R 0 min = 1 .
Therefore, the SIS model is identifiable. Namely, parameter set Θ 1 = D , R 0 max , R 0 min can be determined uniquely from the observable I.

3.2. SIRS Model

For the SIRS model, we test whether the parameter set Θ 2 = D , L , R 0 max , R 0 min can be determined uniquely from the observable I. First, we investigate whether the differential equation in Equation (2) can be decomposed into a sum of linearly independent functions. For the differential equation associated with S, we have: For the differential equation associated with S, we have:
f S 1 = N S I L , f S 2 = β ( t ) I S N .
According to Theorem A1, the generalized Wronskian determinant is as follows:
W S = 0 f S 1 0 f S 2 1 f S 1 1 f S 2 = N S I L β ( t ) I S N 2 L β ( t ) S N β ( t ) I N = β ( t ) [ I ( I N ) + S ( S N ) ] L N 0 .
So, f S 1 and f S 2 are linearly independent functions. Similarly, for the differential equation associated with I, we have:
f I 1 = β ( t ) I S N , f I 2 = I D .
The corresponding generalized Wronskian determinant is as follows:
W I = 0 f I 1 0 f I 2 1 f I 1 1 f I 2 = β ( t ) I S N I D β ( t ) ( S + I ) N 1 D = β ( t ) I 2 D N 0 .
So, f I 1 and f I 2 are linearly independent functions. Next, we explore whether the parameter set Θ 2 = D , L , R 0 max , R 0 min can be determined uniquely from the observable I using the scaling method. The steps of the scaling method are as follows:
Step 1.
We scale the parameters and unobserved variables by unknown scaling factors:
D μ D D , L μ L L , R 0 max μ R 0 max R 0 max , R 0 min μ R 0 min R 0 min , S μ s S .
Step 2.
We obtain the scaled version for each functional linear independent function in Equations (26) and (28).
Step 3.
We obtain the identifiability equations:
N μ S S I μ S μ L L = N S I L ,
I μ S S μ S μ D D N [ e a q ( t ) ( μ R 0 max R 0 max μ R 0 min R 0 min ) + μ R 0 min R 0 min ] = I S D N [ e a q ( t ) ( R 0 max R 0 min ) + R 0 min ] ,
I μ S S μ D D N [ e a q ( t ) ( μ R 0 max R 0 max μ R 0 min R 0 min ) + μ R 0 min R 0 min ] = I S D N [ e a q ( t ) ( R 0 max R 0 min ) + R 0 min ] ,
I μ D D = I D .
Manipulating Equations (31)–(34), the identifiability equations are:
μ D = 1 , μ S = 1 , μ L = 1 , ( μ R 0 max 1 ) R 0 max = ( μ R 0 min 1 ) ( 1 1 e a q ( t ) ) R 0 min
From the last formula in Equation (35), the left side of Equation (35) is constant, the right side of this equation is the function of t, and e a q ( t ) is not equal to zero. For the equation to be satisfied, μ R 0 max 1 = 0 and μ R 0 min 1 = 0 . Hence, the system has a unique solution ( μ R 0 max = μ R 0 min = 1 ). It follows that the SIRS model is identifiable.

3.3. SEIR Model

For the SEIR model, we test whether the parameter set Θ 3 = D , W , R 0 max , R 0 min can be determined uniquely from the observable I. First, we investigate whether the differential equation in Equation (3) can be decomposed into a sum of linearly independent functions. For the differential equation associated with S, we have:
f S 1 = β ( t ) I S N .
According to Theorem A1, the generalized Wronskian determinant is as follows:
W S = 0 f S 1 = β ( t ) I S N 0 .
So, f S 1 is a linearly independent function. Similarly, for the differential equation associated with E, we have:
f E 1 = β ( t ) I S N , f E 2 = E W .
The corresponding generalized Wronskian determinant is as follows:
W E = 0 f E 1 0 f E 2 1 f E 1 1 f E 2 = β ( t ) I S N E W β ( t ) ( S + I ) N 1 W = β ( t ) N W E ( S + I ) I S 0 .
So, f E 1 and f E 2 are linearly independent functions. For the differential equation associated with I, we have:
f I 1 = E W , f I 2 = I D .
The generalized Wronskian determinant is as follows:
W I = 0 f I 1 0 f I 2 1 f I 1 1 f I 2 = E W I D 1 W 1 D = I E W D 0 .
So, f I 1 and f I 2 are linearly independent functions. Next, we explore whether parameter set Θ 3 = D , W , R 0 max , R 0 min can be determined uniquely from observable I using the scaling method. The steps of the scaling method are as follows:
Step 1.
We scale the parameter set Θ 3 = D , W , R 0 max , R 0 min and the unobserved variable (S) by unknown scaling factors:
W μ W W , D μ D D , R 0 max μ R 0 max R 0 max , R 0 min μ R 0 min R 0 min , S μ S S , E μ E E .
Step 2.
We obtain the scaled version for each functional linear independent function in Equations (36), (38) and (40).
Step 3.
We obtain the identifiability equations:
I μ S S μ S N μ D D e a q ( t ) μ R 0 max R 0 max μ R 0 min R 0 min + μ R 0 min R 0 min = I S N D e a q ( t ) R 0 max R 0 min + R 0 min ,
I μ S S μ E N μ D D e a q ( t ) μ R 0 max R 0 max μ R 0 min R 0 min + μ R 0 min R 0 min = I S N D e a q ( t ) R 0 max R 0 min + R 0 min ,
μ E E μ E μ W W = E W ,
μ E E μ W W = E W ,
I μ D D = I D .
According to formulas (45)–(47), it is easy to get:
μ W = 1 , μ D = 1 , μ E = 1 .
Pressing μ D = 1 into the formula (43), we can use Equation (23) in the SIS model. Then by Equation (24) in the SIS model, we can obtain:
μ R 0 max = 1 , μ R 0 min = 1 .
Next, pressing the above results into formula (44), we can get μ S = 1 . Therefore, the SEIR model is identifiable. Namely, the parameter set Θ 3 = D , W , R 0 max , R 0 min can be uniquely determined from the observable I.

3.4. SEIRS Model

For the SEIRS model, we test whether the parameter set Θ 4 = D , L , W , R 0 max , R 0 min can be determined uniquely from observable I. First, we investigate whether the differential equation in Equation (4) can be decomposed into a sum of linearly independent functions. For the differential equation associated with S, we have:
f S 1 = N S E I L , f S 2 = β ( t ) I S N .
According to Theorem A1, the generalized Wronskian determinant is as follows:
W S = 0 f S 1 0 f S 2 1 f S 1 1 f S 2 = N S E I L β ( t ) I S N 1 L β ( t ) ( S + I ) N
= β ( t ) N L I S ( N S E I ) ( S + I ) 0 .
So, f S 1 and f S 2 are linearly independent functions. Similarly, for the differential equation associated with E, we have:
f E 1 = β ( t ) I S N , f E 2 = E W .
The corresponding generalized Wronskian determinant is as follows:
W E = 0 f E 1 0 f E 2 1 f E 1 1 f E 2 = β ( t ) I S N E W β ( t ) ( S + I ) N 1 W = β ( t ) N W E ( S + I ) I S 0 .
So, f E 1 and f E 2 are linearly independent functions. For the differential equation associated with I, we have:
f I 1 = E W , f I 2 = I D .
The generalized Wronskian determinant is as follows:
W I = 0 f I 1 0 f I 2 1 f I 1 1 f I 2 = E W I D 1 W 1 D = I E W D 0 .
So, f I 1 and f I 2 are linearly independent functions. Next, we explore whether the parameter set Θ 4 = D , L , W , R 0 max , R 0 min can be determined uniquely from the observable I using the scaling method. The steps of the scaling method are as follows:
Step 1.
We scale the parameter set Θ 4 = D , L , W , R 0 max , R 0 min and the unobserved variable (S) by unknown scaling factors:
L μ L L , W μ W W , D μ D D , R 0 max μ R 0 max R 0 max , R 0 min μ R 0 min R 0 min , S μ S S , E μ E E .
Step 2.
We obtain the scaled version for each functional linear independent function in Equations (50), (53) and (55).
Step 3.
We obtain the identifiability equations:
N μ S S μ E E I μ S μ L L = N S E I L .
I μ S S μ S N μ D D e a q ( t ) μ R 0 max R 0 max μ R 0 min R 0 min + μ R 0 min R 0 min = I S N D e a q ( t ) R 0 max R 0 min + R 0 min ,
I μ S S μ E N μ D D e a q ( t ) μ R 0 max R 0 max μ R 0 min R 0 min + μ R 0 min R 0 min , = I S N D e a q ( t ) R 0 max R 0 min + R 0 min ,
μ E E μ E μ W W = E W ,
μ E E μ W W = E W ,
I μ D D = I D .
Similar to the previous derivation of the corresponding part of the SEIR model, it is easy to obtain:
μ L = 1 , μ W = 1 , μ D = 1 , μ R 0 max = 1 , μ R 0 min = 1 , μ S = 1 , μ E = 1 .
Therefore, the SEIRS model is identifiable. Namely, the parameter set Θ 4 = { D , L , W , R 0 max , R 0 min } can be determined uniquely from the observable I.

4. Discussion

In some existing studies, the authors explored the performance of different data assimilation algorithms when inferring parameters from observational epidemic data [13]. For example, ensemble filters are better at reproducing historical influenza incidence time series than particle Markov chain Monte Carlo. However, we still do not know whether the model is structurally unidentifiable, which affects the performance of the optimization algorithms. In this study, we applied the scaling method to analyse the structural identifiability of four types of commonly used humidity-driven epidemiological models when prevalence data is observable. Specifically, we investigated whether each parameter set Θ 1 = D , R 0 max , R 0 min in the SIS model, Θ 2 = D , L , R 0 max , R 0 min in the SIRS model, Θ 3 = D , W , R 0 max , R 0 min in the SEIR model and Θ 4 = D , L , W , R 0 max , R 0 min in the SEIRS model can be uniquely determined (D is the infectious period, L is the average duration of immunity, W is the latency period, and R 0 max , R 0 min are the combination of the maximum and minimum daily basic reproductive number and the minimum daily basic reproductive number). We find each model is identifiable when the prevalence is observable. Aside from considering prevalence data as observational data, this study also considered the number of new cases as observational data. For example, when we introduced the differential equation of the new case changes over time in the SIRS model, the differential equation decomposition components (obtained based on Equation (8) from the frameworks of the scaling method part) are not linear-independent. So the scaling method cannot be used to determine whether the model is structurally identifiable when the new cases are observable.
Much work needs to be done to further test, validate, and improve the ability of humidity-driven epidemiological models to predict influenza activity. In the future, we will test more expanded humidity-driven epidemiological models (such as a model that includes more than one exposed state) to provide theoretical support for these models to be used in practice. This investigation involved testing of the structure identifiability of different humidity-driven epidemiological models. In the future, we will assess the practical identifiability of these models to provide synthetic experimental support to enable these models to be used in practice. In addition, we will explore other model structure identifiability methods to test the structural identification of the humidity-driven epidemiological model when the number of new cases is observable. We will also consider more data types, such as the cumulative number of incidences.
In conclusion, our analysis suggests that the structural identifiability of these models can lay the foundation for testing practical identifiability in the future. In practice, epidemiological models should be examined with caution before using them to estimate model parameters from epidemic data.

Author Contributions

Conceptualization, Y.B., E.H.Y.L. and S.P.; methodology, C.Z., X.Z. and Y.B.; writing—original draft preparation, C.Z., X.Z., Y.B., E.H.Y.L. and S.P.; writing—review and editing, Y.B., E.H.Y.L. and S.P.; supervision, E.H.Y.L. and S.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the AIR@InnoHK programme of the Innovation and Technology Commission of the Hong Kong Special Administrative Region, the National Science Foundation of China (grant no. 72104208), the Health and Medical Research Fund, Food and Health Bureau, Government of the Hong Kong Special Administrative Region (grant no. 21200632), the National Institutes of Health (research funding to S.P. and grant no. R01AI163023), the Centers for Disease Control and Prevention (research funding to S.P. and grant no. U01CK000592 and 75D30122C14289) and the National Science Foundation (research funding to S.P. and grant no. DMS-2229605).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

We show the theorem to prove that the functions in the humidity-driven epidemiological model are linearly independent.
Theorem A1. 
((Generalized Wronskian theorem) [37]). Let Δ 0 , , Δ n 1 be differential operators denoting by
Δ s = ( x 1 ) j 1 ( x m ) j m , j 1 + + j m s .
Then if the generalized Wronskian, W, over the functions f i ( x 1 , x 2 , , x m ) , with i = 1 , , n ,
W = Δ 0 f 1 Δ 0 f n Δ n 1 f 1 Δ n 1 f n
is different from zero, then functions f i are linearly independent. The only solution of the equation
i = 1 n a i f i ( x 1 , x 2 , ) = 0
is a 1 = = a n = 0 .
Figure A1. Flow diagram of the four types of humidity-driven epidemiological models.
Figure A1. Flow diagram of the four types of humidity-driven epidemiological models.
Viruses 14 02795 g0a1

References

  1. Iuliano, A.D.; Roguski, K.M.; Chang, H.H.; Muscatello, D.J.; Palekar, R.; Tempia, S.; Cohen, C.; Gran, J.M.; Schanzer, D.; Cowling, B.J.; et al. Estimates of global seasonal influenza-associated respiratory mortality: A modelling study. Lancet 2018, 391, 1285–1300. [Google Scholar] [CrossRef] [PubMed]
  2. Simonsen, L.; Spreeuwenberg, P.; Lustig, R.; Taylor, R.J.; Fleming, D.M.; Kroneman, M.; Van Kerkhove, M.D.; Mounts, A.W.; Paget, W.J.; Teams, G.C. Global mortality estimates for the 2009 Influenza Pandemic from the GLaMOR project: A modeling study. PLoS Med. 2013, 10, e1001558. [Google Scholar] [CrossRef] [Green Version]
  3. Paget, J.; Spreeuwenberg, P.; Charu, V.; Taylor, R.J.; Iuliano, A.D.; Bresee, J.; Simonsen, L.; Viboud, C. Global mortality associated with seasonal influenza epidemics: New burden estimates and predictors from the GLaMOR Project. J. Glob. Health 2019, 9, 020421. [Google Scholar] [CrossRef]
  4. Cox, N.J.; Subbarao, K. Global epidemiology of influenza: Past and present. Annu. Rev. Med. 2000, 51, 407–421. [Google Scholar] [CrossRef] [PubMed]
  5. Du, Z.; Nugent, C.; Galvani, A.P.; Krug, R.M.; Meyers, L.A. Modeling mitigation of influenza epidemics by baloxavir. Nat. Commun. 2020, 11, 2750. [Google Scholar] [CrossRef]
  6. Hayden, F.G.; Asher, J.; Cowling, B.J.; Hurt, A.C.; Ikematsu, H.; Kuhlbusch, K.; Lemenuel-Diot, A.; Du, Z.; Meyers, L.A.; Piedra, P.A.; et al. Reducing influenza virus transmission: The potential value of antiviral treatment. Clin. Infect. Dis. 2022, 74, 532–540. [Google Scholar] [CrossRef] [PubMed]
  7. Influenza (Seasonal). Available online: https://www.who.int/health-topics/influenza-seasonal (accessed on 6 October 2022).
  8. Du, Z.; Bai, Y.; Wang, L.; Herrera-Diestra, J.L.; Yuan, Z.; Guo, R.; Cowling, B.J.; Meyers, L.A.; Holme, P. Optimizing COVID-19 surveillance using historical electronic health records of influenza infections. PNAS Nexus 2022, 1, pgac038. [Google Scholar] [CrossRef] [PubMed]
  9. Du, Z.; Javan, E.; Nugent, C.; Cowling, B.J.; Meyers, L.A. Using the COVID-19 to influenza ratio to estimate early pandemic spread in Wuhan, China and Seattle, US. EClinicalMedicine 2020, 26, 100479. [Google Scholar] [CrossRef]
  10. Fineberg, H.V. Pandemic preparedness and response—lessons from the H1N1 influenza of 2009. N. Engl. J. Med. 2014, 370, 1335–1342. [Google Scholar] [CrossRef] [Green Version]
  11. Lee, S.S.; Viboud, C.; Petersen, E. Understanding the rebound of influenza in the post COVID-19 pandemic period holds important clues for epidemiology and control. Int. J. Infect. Dis. 2022, 122, 1002–1004. [Google Scholar] [CrossRef]
  12. Baker, R.E.; Park, S.W.; Yang, W.; Vecchi, G.A.; Metcalf, C.J.E.; Grenfell, B.T. The impact of COVID-19 nonpharmaceutical interventions on the future dynamics of endemic infections. Proc. Natl. Acad. Sci. USA 2020, 117, 30547–30553. [Google Scholar] [CrossRef] [PubMed]
  13. Yang, W.; Karspeck, A.; Shaman, J. Comparison of filtering methods for the modeling and retrospective forecasting of influenza epidemics. PLoS Comput. Biol. 2014, 10, e1003583. [Google Scholar] [CrossRef] [PubMed]
  14. Shaman, J.; Karspeck, A.; Yang, W.; Tamerius, J.; Lipsitch, M. Real-time influenza forecasts during the 2012–2013 season. Nat. Commun. 2013, 4, 2837. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Yang, W.; Cowling, B.J.; Lau, E.H.; Shaman, J. Forecasting influenza epidemics in Hong Kong. PLoS Comput. Biol. 2015, 11, e1004383. [Google Scholar] [CrossRef] [Green Version]
  16. Pei, S.; Kandula, S.; Yang, W.; Shaman, J. Forecasting the spatial transmission of influenza in the United States. Proc. Natl. Acad. Sci. USA 2018, 115, 2752–2757. [Google Scholar] [CrossRef] [Green Version]
  17. Du, Z.; Fox, S.J.; Ingle, T.; Pignone, M.P.; Meyers, L.A. Projecting the Combined Health Care Burden of Seasonal Influenza and COVID-19 in the 2020–2021 Season. MDM Policy Pract. 2022, 7, 23814683221084631. [Google Scholar] [CrossRef]
  18. Bellman, R.; Åström, K.J. On structural identifiability. Math. Biosci. 1970, 7, 329–339. [Google Scholar] [CrossRef]
  19. Katafygiotis, L.S.; Beck, J.L. Updating models and their uncertainties. II: Model identifiability. J. Eng. Mech. 1998, 124, 463–467. [Google Scholar] [CrossRef]
  20. Massonis, G.; Banga, J.R.; Villaverde, A.F. Structural identifiability and observability of compartmental models of the COVID-19 pandemic. Annu. Rev. Control 2021, 51, 441–459. [Google Scholar] [CrossRef]
  21. Tuncer, N.; Le, T.T. Structural and practical identifiability analysis of outbreak models. Math. Biosci. 2018, 299, 1–18. [Google Scholar] [CrossRef]
  22. Gallo, L.; Frasca, M.; Latora, V.; Russo, G. Lack of practical identifiability may hamper reliable predictions in COVID-19 epidemic models. Sci. Adv. 2022, 8, eabg5234. [Google Scholar] [CrossRef] [PubMed]
  23. Wieland, F.G.; Hauber, A.L.; Rosenblatt, M.; Tönsing, C.; Timmer, J. On structural and practical identifiability. Curr. Opin. Syst. Biol. 2021, 25, 60–69. [Google Scholar] [CrossRef]
  24. Villaverde, A.F.; Pathirana, D.; Fröhlich, F.; Hasenauer, J.; Banga, J.R. A protocol for dynamic model calibration. Brief. Bioinform. 2022, 23, bbab387. [Google Scholar] [CrossRef] [PubMed]
  25. Sauer, T.; Berry, T.; Ebeigbe, D.; Norton, M.M.; Whalen, A.J.; Schiff, S.J. Identifiability of infection model parameters early in an epidemic. SIAM J. Control Optim. 2021, 60, S27–S48. [Google Scholar] [CrossRef]
  26. Castro, M.; de Boer, R.J. Testing structural identifiability by a simple scaling method. PLoS Comput. Biol. 2020, 16, e1008248. [Google Scholar] [CrossRef]
  27. Pohjanpalo, H. System identifiability based on the power series expansion of the solution. Math. Biosci. 1978, 41, 21–33. [Google Scholar] [CrossRef]
  28. Chappell, M.J.; Godfrey, K.R. Structural identifiability of the parameters of a nonlinear batch reactor model. Math. Biosci. 1992, 108, 241–251. [Google Scholar] [CrossRef]
  29. Denis-Vidal, L.; Joly-Blanchard, G. An easy to check criterion for (un) indentifiability of uncontrolled systems and its applications. IEEE Trans. Autom. Control 2000, 45, 768–771. [Google Scholar] [CrossRef]
  30. Xia, X.; Moog, C.H. Identifiability of nonlinear systems with application to HIV/AIDS models. IEEE Trans. Autom. Control 2003, 48, 330–336. [Google Scholar] [CrossRef]
  31. Koelle, K.; Farrell, A.P.; Brooke, C.B.; Ke, R. Within-host infectious disease models accommodating cellular coinfection, with an application to influenza. Virus Evol. 2019, 5, vez018. [Google Scholar] [CrossRef]
  32. Chis, O.T.; Banga, J.R.; Balsa-Canto, E. Structural identifiability of systems biology models: A critical comparison of methods. PLoS ONE 2011, 6, e27755. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Miao, H.; Xia, X.; Perelson, A.S.; Wu, H. On identifiability of nonlinear ODE models and applications in viral dynamics. SIAM Rev. 2011, 53, 3–39. [Google Scholar] [CrossRef] [PubMed]
  34. Laskowski, M.; Demianyk, B.C.P.; Witt, J.; Mukhi, S.N.; Friesen, M.R.; McLeod, R.D. Agent-based modeling of the spread of influenza-like illness in an emergency department: A simulation study. IEEE Trans. Inf. Technol. Biomed. 2011, 15, 877–889. [Google Scholar] [CrossRef] [PubMed]
  35. Shaman, J.; Pitzer, V.E.; Viboud, C.; Grenfell, B.T.; Lipsitch, M. Absolute humidity and the seasonal onset of influenza in the continental United States. PLoS Biol. 2010, 8, e1000316. [Google Scholar] [CrossRef]
  36. Coronavirus (COVID-19) Infection Survey, UK Statistical Bulletins. Available online: https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/conditionsanddiseases/bulletins/coronaviruscovid19infectionsurveypilot/previousReleases (accessed on 21 October 2022).
  37. Schmidt, W.M. Diophantine Approximation; Springer Science & Business Media: Berlin, Germany, 1996. [Google Scholar]
Figure 1. The process from building the model to putting it into use. The grey arrows show the feedback from testing the model identifiability results to modifying the model. The blue box illustrates the process of testing the model structural identifiability that our work focuses on.
Figure 1. The process from building the model to putting it into use. The grey arrows show the feedback from testing the model identifiability results to modifying the model. The blue box illustrates the process of testing the model structural identifiability that our work focuses on.
Viruses 14 02795 g001
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Zhang, C.; Zhang, X.; Bai, Y.; Lau, E.H.Y.; Pei, S. The Structural Identifiability of a Humidity-Driven Epidemiological Model of Influenza Transmission. Viruses 2022, 14, 2795. https://doi.org/10.3390/v14122795

AMA Style

Zhang C, Zhang X, Bai Y, Lau EHY, Pei S. The Structural Identifiability of a Humidity-Driven Epidemiological Model of Influenza Transmission. Viruses. 2022; 14(12):2795. https://doi.org/10.3390/v14122795

Chicago/Turabian Style

Zhang, Chunyang, Xiao Zhang, Yuan Bai, Eric H. Y. Lau, and Sen Pei. 2022. "The Structural Identifiability of a Humidity-Driven Epidemiological Model of Influenza Transmission" Viruses 14, no. 12: 2795. https://doi.org/10.3390/v14122795

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop