Artificial Neural Network for Fast and Versatile Model Parameter Adjustment Utilizing PAT Signals of Chromatography Processes for Process Control under Production Conditions

Mouellef, Mourad; Szabo, Glaenn; Vetter, Florian Lukas; Siemers, Christian; Strube, Jochen

doi:10.3390/pr10040709

Open AccessEditor’s ChoiceArticle

Artificial Neural Network for Fast and Versatile Model Parameter Adjustment Utilizing PAT Signals of Chromatography Processes for Process Control under Production Conditions

by

Mourad Mouellef

¹,

Glaenn Szabo

¹,

Florian Lukas Vetter

¹,

Christian Siemers

² and

Jochen Strube

^1,*

¹

Institute for Separation and Process Technology, Clausthal University of Technology, Leibnizstraße 15, D-38678 Clausthal-Zellerfeld, Germany

²

Institute for Electrical Information Technology, Clausthal University of Technology, Julius-Albert-Str. 4, D-38678 Clausthal-Zellerfeld, Germany

^*

Author to whom correspondence should be addressed.

Processes 2022, 10(4), 709; https://doi.org/10.3390/pr10040709

Submission received: 14 March 2022 / Revised: 1 April 2022 / Accepted: 3 April 2022 / Published: 5 April 2022

(This article belongs to the Special Issue Towards Autonomous Operation of Biologics and Botanicals)

Download

Browse Figures

Versions Notes

Abstract

:

Preparative chromatography is a well-established operation in chemical and biotechnology manufacturing. Chromatography achieves high separation performances, but often has to deal with the yield versus purity trade-off as the optimization criterium regarding through-put. The initial trade-off is often disturbed by the well-known phenomenon of chromatogram shifts over process lifetime, and has to be corrected by operators via adjustment of peak fraction cutting. Nevertheless, with regard to autonomous operation and batch to continuous processing modes, an advanced process control strategy is needed to identify and correct shifts from the optimal operation point automatically. Previous studies have already presented solutions for batch-to-batch variance and process control options with the aid of rigorous physico-chemical process modeling. These models can be implemented as distinct digital twins as well as statistical process operation data analyzers. In order to utilize such models for advanced process control (APC), the model parameters have to be updated with the aid of inline Process Analytical Technology (PAT) data to describe the actual operational status. This updating process also includes any operational change phenomena that occur, and its relation to their physico-chemical root cause. Typical phenomena are fluid dynamic changes due to packing breakage, channelling or compression as well as mass transfer and phase equilibrium-related separation performance decrease due to adsorbent aging or feed and buffer composition changes. In order to track these changes, an Artificial Neural Network (ANN) is trained in this work. The ANN training is in this first step, based on the simulation results of a distinct and previously experimentally validated process model. The model is implemented in the open source tool CasADi for Python. This allows the implementation of interfaces to process control systems, among others, with relatively low effort. Therefore, PAT signals can easily be incorporated for sufficient adjustment of the process model for appropriate process control. Further steps would be the implementation of optimization routines based on PAT and ANN predictions to derive optimal operation points with the model.

Keywords:

parameter estimation; machine learning; ion-exchange chromatography; chromatography modeling; artificial neural networks

1. Introduction

The utilization of machine-learning approaches in chromatography is a rising field of research, which ranges from extracting crucial process information from measurement data in real-time via partial least squares algorithms [1] to separation factor prediction for chromatography process optimization with the aid of artificial neural networks (ANNs) [2]. Special focus lies on the ANNs, which become more and more accessible to a wide field of researches via frameworks like Tensorflow [3] or The MathWorks Inc. MATLAB toolboxes for Artificial Intelligence [4]. Artificial neural networks (ANNs) enabled computer-aided solutions for problems that were nearly impossible or difficult to solve with conventional algorithms within acceptable time limits. Typical applications are computer visions applications or natural language processing [5,6]. More relevant modeling contributions were the reduction in computational effort and/or the description of the not yet (sufficiently) described physico-chemical relationships [7,8,9]. In general, artificial neural networks consist of interconnected neurons that send information in the form of activations signals over weighted connections to other neurons, which map inputs onto outputs [5]. The process of finding the weights of these connection is called training. Usually, this training is performed by supplying the neural network with input and output data from which it learns underlying relationships via so-called backpropagation algorithms. After the training, the ANN is capable of mapping previously unknown inputs onto outputs, be it classification or regression tasks [5,10]. Further information on ANNs can be found in Fausett [5] and Goodfellow [10].

Some previous works have already investigated the possibility of determining chromatography model parameters by utilizing ANNs. Their results show that a maximum of three different experiments are necessary to predict model parameters within milliseconds after the training process, thus shortening model development time. [11,12]. In addition, both groups suggested that automated real-time model parameter estimation during chromatographic experiments should be possible, especially due to the short computation time. In this work, the possibility of utilizing the suggested approach in a production environment is investigated. The previous approaches were performed in a lab environment where multiple experiments with high process parameter variation can be conducted. One example is the use of different salt gradients. Therefore, the available information in general and for ANN training is much higher, as in a controlled production environment. In contrast, production data are only known to be of some variance. The decisions of the operators mostly take this into account by adjusting fraction cut points. Purity is kept, but non-optimal yield losses are accepted [13,14]. Additionally, accessible chromatograms in a production environment are limited to the latest chromatograms from previous batch runs or cycles in case of continuous/cyclic processes like the multicolumn counter-current solvent gradient purification process (MCSGP) [15] or periodic counter-current chromatography [16]. Such operational data can be analyzed, and necessary actions predicted [17]. On the other hand, the acceptable variation of the process in a production environment is limited due to the regulatory defined design and control space [18]. For example, the input feed mixture, purity and yield requirements, the column itself or gradient length and steepness can be assumed constant over the process lifetime. Therefore, the authors mainly consider typical column packing fouling/aging phenomena as governing effects on the chromatograms, which still cause a considerable economic impact on the process [19,20]. The phenomena observed are related to either fluid dynamics or mass-transfer and phase equilibrium. Fluid-dynamic variations in fluid distribution, packing breakage, channelling, swelling or compression can be described by axial dispersion and/or voidage adjustments, whereas, any adsorbent aging phenomena are related to mass transfer and/or phase equilibrium behavior. To automatically mitigate these effects, models previously used for process development can be used to calculate process conditions such as new fraction cut points. However, the effects on fluid dynamics, column packing and absorption phase equilibria caused by aging should be taken into account in order to achieve optimal results. In order to perform this study, a chromatography model was developed in Python [3] with the CasADi framework [21] for a preparative chromatography separation of a three-component protein mixture based on previous works [22,23]. It is assumed that a valid PAT strategy for online concentration determination is implemented, and its feasibility has already been shown in several publications [24,25,26,27]. The concept for advanced process control is shown in totality in Figure 1.

The PAT signal of the chromatogram is evaluated by a partial least squares algorithm (PLS). These data are transferred to the ANN, which estimates appropriate model parameter adjustments for the digital twin. The digital twin is implemented as a rigorous physico-chemical process model. Such a process model with updated parameter adapts to the current operation state, and is utilized to predict new operation set-points. For example, to fit the necessary purity-yield requirements within the predefined control space.

As a first step, a sensitivity analysis is performed to identify the impact of relevant parameters on chromatograms, which may change over process lifetime. After that, ANNs are trained from simulative generated data to predict relevant model parameters from a single chromatogram.

2. Materials and Methods

2.1. Chromatography Modeling

The necessary data for the ANN were generated through simulations. All simulations were performed in a Python 3.8 environment [28] with the CasADi Framework [21] on a Dell Optiplex 7010 System. All programming was performed in the Spyder Integrated Development Environment (IDE) [29]. For ANN implementation, the Tensorflow 3 [3] backend of Keras v. 2.4.0 [30] was used. Commonly used chromatography models for mass transport by convection and dispersion are based on the general rate model Equation (1) or on the lumped pore diffusion model Equation (2) [31,32]. These equations describe the mass balance of the stationary phase [15]. This work utilized the lumped pore diffusion model.

ε_{p, i} \frac{\partial c_{p, i}}{\partial t} + (1 - ε_{p, i}) \frac{\partial q_{i}}{\partial t} = \frac{1}{r^{2}} \frac{\partial}{\partial r} [r^{2} (ε_{p, i} D_{p, i} \frac{\partial c_{p, i}}{\partial r} + (1 - ε_{p, i}) D_{S, i} \frac{\partial q_{i}^{*}}{\partial r})]

(1)

ε_{p, i} \frac{\partial c_{p, i}}{\partial t} + (1 - ε_{p, i}) \frac{\partial q_{i}}{\partial t} = \frac{6}{d_{p}} \frac{(1 - ε_{S})}{ε_{S}} k_{e f f, i} (c_{i} - c_{p, i})

(2)

where the parameters mean diameter of the resin particle

d_{p}

, the porosity

ε_{p, i}

and the voidage

ε_{S}

describe the column packing. The variable

c_{p, i}

represents the concentration of the component in the pores of the resin,

q_{i}

the loading,

c_{i}

the concentration in the continuous phase, and

k_{e f f, i}

the effective mass transport coefficient. The variable t represents the time. The boundary conditions are described by Equation (3) for the column inlet and Equation (4) for the column outlet.

D_{ax}

depicts the axial dispersion coefficient, L the length of the column, and x the length domain [31].

u c_{i n, i} (t) = u c_{i} (t, 0) - D_{ax} \frac{\partial c_{i}}{\partial x} (t, 0)

(3)

\frac{\partial c_{i}}{\partial x} (t, L) = 0

(4)

The mass transfer coefficient

k_{e f f, i}

is given by Equation (5). Here,

k_{f, i}

is the film mass transfer coefficient,

r_{p}

the particle radius, and

D_{p, i}

the pore diffusion coefficient. The pore diffusion coefficient

D_{p, i}

was calculated according to the correlation of Carta [33] and

k_{f, i}

according to Wilson and Geankoplis [34].

k_{e f f, i} = \frac{1}{\frac{1}{k_{f, i}} + \frac{r_{p}}{D_{p, i}}}

(5)

In this work, the competitive Langmuir-isotherm Equation (6), which has already demonstrated its performance in (bio-) chromatography, was used [35,36,37]. The adsorption and desorption behaviour of the components can also be described by various other approaches [31,35,36,38,39,40].

q_{i} = \frac{q_{m a x, i} K_{e q, i} c_{i}}{1 + \sum_{j} K_{e q, j} c_{j}}

(6)

with

q_{m a x, i}

as the maximum loading capacity of component

i

and

K_{e q, i}

as the Langmuir coefficient of component

i

. To include salt dependence into Equation (6), the Langmuir coefficient can be written as shown in Equation (7) [31].

q_{m a x, i} K_{e q, i} = H_{i}

(7)

The salt dependence of the maximum loading

q_{m a x, i}

and the Henry coefficient

H_{i}

can then be expressed by Equations (8) and (9) and the empiric coefficients

a_{1, i}

,

a_{2, i}

,

b_{1, i}

and

b_{2, i}

for each component

i

[41,42].

q_{m a x, i} = b_{1, i} c_{p, i} + b_{2, i}

(8)

H_{i} = a_{1, i} {c_{p, i}}^{a_{2, i}}

(9)

The spatial discretization of the partial differential equations system followed a finite differences scheme.

2.2. Model Parameter Choice and ANN Dataset Generation

As previously mentioned, the dataset has to reflect the chromatography process conditions in a production environment. In this environment, the components are fixed and the option of multiple gradient experiments or arbitrary alternation of injection volumes is not available. This case stands in clear contrast to the previous work [12], where isotherm parameters for arbitrary 3 components mixtures were estimated. The data for this study were generated by an chromatography model based on the previous piloting case study of a monoclonal antibody manufacturing process from Kornecki et al. [22] and the work of Zobel-Roos et al. [43]. The model comprises three proteins, immunoglobulin G (IgG), a weak binding host cell protein (HCP1) and a strong binding host cell protein (HCP2). To reflect a production environment, the column was up-scaled to a length of 15 cm and a diameter of 20 cm. The process parameters were increased to 1.6 L/min buffer flow, a 15-column volume (CV) gradient and an injection volume of 2.6 L with

5 g / L

IgG,

2 g / L

HCP1 and

2 g / L

HCP2.

As stated before, the ANN maps inputs on outputs. Therefore, the relevant outputs for the prior described use-case, deviation of model parameters due to column aging, must be identified. A subset of parameters can already be excluded or neglected based on expert knowledge. This comprises component properties like the molecular mass of the three proteins or the particle diameter of the packing and others, which should not alter during the process lifetime. Additionally, some parameters can be excluded beforehand because they can be substituted in others through correlations. This leads to an implicit consideration of these via other parameters. The molecular masses, tortuosity, steric factors, protein radius, particle diameter and molecular diffusion coefficient are considered constant [44]. The film diffusion coefficient can be substituted into the mass transfer coefficient as shown in Equation (5). The pore diffusion coefficient can be substituted in the particle porosity, as shown in [33]. The influence of the remaining model parameters on the chromatograms is investigated through one parameter at a time sensitivity analysis. The variation range of each parameter is intentionally greater as it is expected by expert knowledge for the investigated use-case. The flow was excluded from this rule because it is considered as a well controllable input parameter. An overview of the remaining parameters and their variation ranges for the sensitivity study is given in Table 1. The results of the sensitivity study are shown in Figure 2.

From Figure 2a,b,d, a clear impact of the salt-dependency describing parameters

a_{1}

,

a_{2}

and

b_{2}

of the Langmuir isotherm (Equations (6)–(9)) on the chromatogram can be seen. The impact of

b_{1}

(c) is minor but still noticeable. Hence, all of these parameters are considered in the following steps. The flow in (e) has nearly no impact on the resulting chromatogram within the given variation range. In addition,

D_{ax}

shows a high impact of values greater

0.1 {cm}^{2} / s

in subplot (f). Because of this and the direct correlation of the flow and

D_{ax}

, which is described by Chung and Wen [45], both parameters are considered relevant in further steps. Following the results of (g) and (j), the pore radius

r_{p}

and porosity

ε_{p}

are not considered. The high impact of the voidage

ε_{s}

seen in (i) is considered in the following steps. Also,

k_{e f f}

from subplot (h) is considered for now even though its values must decrease below

10^{- 4} {cm}^{2} / s

to show an impact on the chromatogram. This effect can be explained by the fact that for a predetermined flow, the mass transport can be assumed instantaneous above a certain threshold.

Because of the number of different parameters, the authors chose to split the set of these parameters into fluid dynamics, a column packing set, and a phase equilibrium parameter set in the first step. The mass transfer coefficient is also moved to the fluid dynamic and column packing set because the effects are similar. The reasoning is to facilitate the identification of parameters, which definitely need more than one batch experiment for an adequate estimation, and to reduce the total amount of needed simulations. The fluid dynamic and column packing set contains the parameter variation from flow, voidage, axial dispersion and the mass transfer coefficient. The other set contains the variations of

a_{1}

,

a_{2}

,

b_{1}

and

b_{2}

of all components. All parameter variations were uniformly distributed and varied in the boundaries of Table 1. As input for the ANN, the chromatogram of each component was reduced to 38 data points to reduce the complexity of the input data. Additionally, the concentration value at each peak maximum was included in each entry. The data reduction scheme, its drawbacks and its benefits are explained in detail in the previous publication [12]. Accordingly, a single dataset entry consists of 39 data points. Another approach to reducing chromatogram information into fewer points can be found in Wang et al. [11].

3. Results

The aforementioned fluid dynamic and column packing dataset, and the phase equilibrium parameter dataset from 2.2. were generated to examine the utilization of a single-batch chromatography experiment for parameter estimation via ANNs. Therefore, two ANNs were trained in this process. With the first ANN, the general possibility of estimating the packing and fluid dynamic parameters and the mass transfer coefficient of each component from a single chromatogram was investigated. That possibility for the phase equilibrium parameters was investigated with the second ANN. After the training process of all ANNs was completed, the corresponding parameters were predicted from training and validation set. The results were evaluated and used to generate a third dataset in which all remaining parameters were varied. The prediction of a single data entry costs 200 milliseconds of computation time, including the loading of input data and the ANN model itself.

3.1. Variation of Packing and Fluid Dynamic Parameters

This ANN was trained with 1000 chromatograms, split into 70% training data and 30% validation data. The FF-ANN consists of 118 input neurons, 120 neurons with tanh activation in hidden layer 1 and 20% dropout probability, 80 neurons with tanh activation in hidden layer 2 with 20% dropout probability, and 5 output neurons for

D_{ax}

,

ε_{s}

,

k_{e f f, 1}

,

k_{e f f, 2}

and

k_{e f f, 3}

. Adam was chosen as the optimizer. Training was performed over 10,000 epochs with a batch size of 16. The results after training can be seen in Figure 3.

As shown in Figure 3, the parameters

D_{ax}

and

ε_{s}

can be predicted well from a single experiment while the prediction performance of the

k_{e f f}

values of all components seems insufficient with rising values of

k_{e f f}

. As seen before in Figure 2h, the impact of values greater

10^{- 4} {cm}^{2} / s

is negligible. Therefore, the worse prediction accuracy can be explained by the low sensitivity of

k_{e f f}

at higher values. The prediction performance of

k_{e f f, H C P 2}

is notably worse compared to the others. The reason can be seen in Figure 2h,f. The peak tailing and fronting behaviour of IgG and HCP1 changes depending on whether

k_{e f f}

or

D_{ax}

is varied. The behaviour of the HCP2 peak is the same in both cases. Hence, the distinction between the effects of

D_{ax}

and

k_{e f f}

on the chromatogram is more difficult. To investigate the prediction accuracy further, the chromatograms were resimulated with the predicted parameters. An example chromatogram with the original parameters and ANN-predicted parameters is given in Figure 4.

The predicted parameters have an error of lower than 5%, except for

k_{e f f, H C P 2}

. With values of

~ 0.02 {cm}^{2} / s

(original) and

~ 0.007 {cm}^{2} / s

, the error is 172%. Despite the high error of

k_{e f f, H C P 2}

, the original chromatogram and the resimulated chromatogram perfectly match with the coefficient of determination (

R^{2})

of over 99%. The confidence interval at 95% level of the R² of the original and resimulated chromatograms of IgG is

[97.4, 99.9]

, of HCP1 is

[97.5, 99.9]

and of HCP2 is

[97.1, 99.4]

. Hence, these parameters can be predicted from a single chromatogram with

k_{e f f}

only needing to be a rough estimate whether it is smaller or above

10^{- 4} {cm}^{2} / s

.

3.2. Variation of Phase Equilibrium Parameters

To test the prediction ability of the phase equilibrium parameters, an FF-ANN with 118 input neurons, 117 neurons with 20% dropout and tanh activation in hidden layer 1, 100 neurons with tanh activation function and 20% dropout, and 12 output neurons with relu activation was implemented. As training algorithm, the Adam optimizer was utilized. The dataset of 1000 simulative experiments was split into 70% training and 30% validation data. Training was performed over 10,000 epochs with a batch size of 16. The R² values of the original values over predicted values of the test set are shown in Table 2.

Again, it could be assumed that the prediction quality of all parameters is insufficient. Therefore, this hypothesis is tested by resimulating all chromatograms with the ANN predicted parameters. Some example chromatograms are given in Figure 5.

In contrast to the assumptions based on Table 2, the resimulated chromatograms match the original chromatograms. This even applies to extreme conditions seen in Figure 5a,b. Based on this, the results of the previous work [12] and the scenario in the next step, the capability of predicting the phase equilibrium parameters is assumed to be sufficient.

3.3. Variation of Phase Equilibrium, Fluid Dynamic and Packing Parameters at Once

The previous steps have shown that the prediction of relevant parameters from a single chromatogram is possible under the previously stated conditions Therefore a final dataset was created. For this dataset, the following scenario was set. The previously presented chromatography step starts as an optimized process with the initial parameters of Table 1. It is assumed that the parameters are affected by column aging/fouling and therefore move in the direction of decreasing performance of the column (except the flow) within the range of Table 1. The column is deemed insufficient as soon as the target protein’s peak area (IgG) is overlapped by the side components’ (HCP1, HCP2) peak area by more than 10%. Furthermore, it is assumed that a validated but noisy PAT concept for online concentration measurement is implemented. Under these conditions, the covered parameter range of the dataset is reduced but extra complexity is added by the noise. The noisy initial chromatogram is shown in Figure 6.

To generate a dataset that suffices the overlapping area rule and does not add unnecessary complexity to the dataset, each chromatogram was evaluated after its simulation. If the 10% rule was violated, the chromatogram was discarded. This process was repeated until the dataset contained 1000 entries. Unexpectedly, no entries with

k_{e f f}

values below

10^{- 4} {cm}^{2} / s

were generated. Because greater values had no impact on the chromatograms, the

k_{e f f}

prediction was discarded after further investigation. Thus, the complexity of the prediction task can be reduced further. The resulting dataset was split into 70% training and 30% validation data. Afterwards, a feed-forward ANN with 118 input neurons, 118 neurons with 20% dropout and tanh activation function in hidden layer I, 100 neurons with 20% dropout and tanh activation layer and 14 output neurons with linear activation was trained. The neural network training utilized the Adam algorithm. It was trained over 30,000 epochs with a batch size of 16. Again, the validation data were used to test the ANN performance. The predicted parameters were used to resimulate the validation set chromatograms. The results of the R² are summarized in the box plot in Figure 7.

Except for a few outliers, a high prediction performance can be seen. The confidence intervals at 95% level for IgG, HCP1 and HCP2 are

[0.97, 0.98]

,

[0.96, 0.97]

and

[0.96, 0.97]

, respectively. The performance may be increased by further optimizing the ANN structure, adding additional chromatograms to the dataset, or applying smoothing algorithms to the input data instead of using the noisy raw data. Additionally, more chromatograms from previous batches could be used to supply the ANN with more information and, therefore, increase prediction performance.

4. Conclusions

In this study, an ANN for chromatography model parameter prediction under preparative production conditions was developed and evaluated. Instead of using a wide parameter range with multiple chromatograms like in previous works for a more screening-like approach, only a single and noisy chromatogram was used for parameter prediction. After excluding irrelevant parameters for the set scenario via expert knowledge and one-parameter-at-a-time sensitivity studies, the ANN was trained to predict the voidage, the axial dispersion coefficient, and phase equilibrium parameters. Although the prediction of the individual parameters indicated poor performance, simulating the chromatograms with predicted values showed that a high agreement occurs between the original and the chromatograms with predicted parameters. The authors explain this with the sensitivity of the parameters in certain areas. Therefore, the ANN can be used to track parameter deviations of the specified process. A schematic application was shown in Figure 1. The presented approach could be used as supportive tool in chromatography processes with model-based control and optimization in production environments. The influence of typical deviations caused by column aging or feed and buffer variations on the model parameters can be tracked by the ANN within 200 milliseconds. Therefore, a quick adjustment of the parameters to the current state of the process is possible by a chromatogram given by PAT. A drift of the model from reality can thus be counteracted, which leads to increased performance of the control and optimization model over the whole process lifetime.

Author Contributions

Conceptualization: C.S., J.S.; methodology, design and evaluation: G.S., M.M; chromatography modeling: F.L.V., writing, editing, and reviewing M.M., F.L.V., C.S. and J.S.; supervision: J.S. All authors have read and agreed to the published version of the manuscript.

Funding

The authors want to gratefully acknowledge the Bundesministerium für Wirtschaft und Energie (BMWi), especially Dr. Michael Gahr (Projektträger FZ Jülich), for funding the scientific work. We also kindly acknowledge the support by Open Access Publishing Fund of the Clausthal University of Technology.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data cannot be made publicly available.

Acknowledgments

The authors would like to thank Reinhard Ditz, formerly of Merck KGaA, Darmstadt, for paper revision and fruitful discussions; Andreas Potschka from the Institute of Mathematics at the Clausthal University of Technology for helpful input; as well as the ITVP lab-team, especially Frank Steinhäuser, Volker Strohmeyer and Thomas Knebel, for their efforts and support.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or the decision to publish the results.

References

Hattori, Y.; Tajiri, Y.; Otsuka, M. Tablet Characteristics Prediction by Powder Blending Process Analysis Based on near Infrared Spectroscopy. J. Near Infrared Spectrosc. 2013, 21, 1–9. [Google Scholar] [CrossRef]
Golubović, J.; Protić, A.; Zečević, M.; Otašević, B.; Mikić, M. Artificial neural networks modeling in ultra performance liquid chromatography method optimization of mycophenolate mofetil and its degradation products. J. Chemom. 2014, 28, 567–574. [Google Scholar] [CrossRef]
Abadi, M.; Agarwal, A.; Barham, P.; Brevdo, E.; Chen, Z.; Citro, C.; Corrado, G.S.; Davis, A.; Dean, J.; Devin, M.; et al. Tensorflow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. Available online: https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/45166.pdf (accessed on 19 December 2021).
The MathWorks Inc. MATLAB Statistics and Machine Learning Toolbox; The MathWorks Inc.: Natick, MA, USA, 2019. [Google Scholar]
Fausett, L.V. Fundamentals of Neural Networks: Architectures, Algorithms, and Applications; Prentice Hall: Englewood Cliffs, NJ, USA, 1994; ISBN 0-13-334186-0. [Google Scholar]
Gudivada, V.N.; Rao, C.R. (Eds.) Computational Analysis and Understanding of Natural Languages: Principles, Methods and Applications; Elsevier: Amsterdam, The Netherlands, 2018; ISBN 978-0-444-64042-0. [Google Scholar]
Asprion, N.; Böttcher, R.; Pack, R.; Stavrou, M.-E.; Höller, J.; Schwientek, J.; Bortz, M. Gray-Box Modeling for the Optimization of Chemical Processes. Chem. Ing. Tech. 2019, 91, 305–313. [Google Scholar] [CrossRef]
Hagge, T.; Stinis, P.; Yeung, E.; Tartakovsky, A.M. Solving differential equations with unknown constitutive relations as recurrent neural networks. arxiv 2017, arXiv:1710.02242. [Google Scholar]
Gao, W.; Engell, S. Neural Network-Based Identification of Nonlinear Adsorption Isotherms. IFAC Proc. Vol. 2004, 37, 721–726. [Google Scholar] [CrossRef]
Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA; London, UK, 2016; ISBN 978-0262035613. [Google Scholar]
Wang, G.; Briskot, T.; Hahn, T.; Baumann, P.; Hubbuch, J. Estimation of adsorption isotherm and mass transfer parameters in protein chromatography using artificial neural networks. J. Chromatogr. A 2017, 1487, 211–217. [Google Scholar] [CrossRef]
Mouellef, M.; Vetter, F.L.; Zobel-Roos, S.; Strube, J. Fast and Versatile Chromatography Process Design and Operation Optimization with the Aid of Artificial Intelligence. Processes 2021, 9, 2121. [Google Scholar] [CrossRef]
Ley, C.; Elvers, B.; Bellussi, G.; Bus, J.; Drauz, K.; Greim, H.; Hessel, V.; Kleemann, A.; Kutscher, B.; Meijer, G.; et al. (Eds.) Ullmann’s Encyclopedia of Industrial Chemistry; Wiley: Chichester, UK, 2010; ISBN 3527306730. [Google Scholar]
Golshan-Shirazi, S.; Guiochon, G. Optimization of experimental conditions in preparative liquid chromatography. J. Chromatogr. A 1991, 536, 57–73. [Google Scholar] [CrossRef]
Ströhlein, G.; Aumann, L.; Müller-Späth, T.; Tarafder, A.; Morbidelli, M. CONTINUOUS PROCESSING: The Multiclomn Countercurrent Solvent Gradient Purification Process: A continuous chromatographic process for monoclonal antibodies without using Protein A. BioPharm Int. 2007, 22, 42–48. [Google Scholar]
Godawat, R.; Brower, K.; Jain, S.; Konstantinov, K.; Riske, F.; Warikoo, V. Periodic counter-current chromatography—Design and operational considerations for integrated and continuous purification of proteins. Biotechnol. J. 2012, 7, 1496–1508. [Google Scholar] [CrossRef]
Helling, C.; Dams, T.; Gerwat, B.; Belousov, A.; Strube, J. Physical characterization of column chromatography: Stringend control over equipment performance in biopharmaceutical production. Trends Chromatogr. 2013, 8, 55–71. [Google Scholar]
International Council for Harmonisation of Technical Requirements for Pharmaceuticals for Human Use. ICH-Endorsed Guide for ICH Q8/Q9/Q10 Implementation. 6 December 2011. Available online: https://database.ich.org/sites/default/files/Q8_Q9_Q10_Q%26As_R4_Points_to_Consider_0.pdf (accessed on 28 February 2022).
Zhang, J.; Siva, S.; Caple, R.; Ghose, S.; Gronke, R. Maximizing the functional lifetime of Protein A resins. Biotechnol. Prog. 2017, 33, 708–715. [Google Scholar] [CrossRef]
Rathore, A.S.; Bracewell, D.G.; Phatak, M.; Ma, G. Re-use of Protein A Resin: Fouling and Economics. BioPharm Int. 2015, 28, 28–33. [Google Scholar]
Andersson, J.A.E.; Gillis, J.; Horn, G.; Rawlings, J.B.; Diehl, M. CasADi: A software framework for nonlinear optimization and optimal control. Math. Prog. Comp. 2019, 11, 1–36. [Google Scholar] [CrossRef]
Kornecki, M.; Schmidt, A.; Lohmann, L.; Huter, M.; Mestmäcker, F.; Klepzig, L.; Mouellef, M.; Zobel-Roos, S.; Strube, J. Accelerating Biomanufacturing by Modeling of Continuous Bioprocessing—Piloting Case Study of Monoclonal Antibody Manufacturing. Processes 2019, 7, 495. [Google Scholar] [CrossRef] [Green Version]
Zobel-Roos, S.; Schmidt, A.; Mestmäcker, F.; Mouellef, M.; Huter, M.; Uhlenbrock, L.; Kornecki, M.; Lohmann, L.; Ditz, R.; Strube, J. Accelerating Biologics Manufacturing by Modeling or: Is Approval under the QbD and PAT Approaches Demanded by Authorities Acceptable Without a Digital-Twin? Processes 2019, 7, 94. [Google Scholar] [CrossRef] [Green Version]
Bakeev, K.A. (Ed.) Process Analytical Technology: Spectroscopic Tools and Implementation Strategies for the Chemical and Pharmaceutical Industries; Blackwell: Oxford, UK, 2006; ISBN 1-4051-2103-3. [Google Scholar]
Kessler, W. Multivariate Datenanalyse für die Pharma-, Bio- und Prozessanalytik: Ein Lehrbuch, 1st ed.; Wiley: Weinheim, Germany, 2008; ISBN 9783527312627. (In German) [Google Scholar]
Rathore, A.S.; Kapoor, G. Application of process analytical technology for downstream purification of biotherapeutics. J. Chem. Technol. Biotechnol. 2015, 90, 228–236. [Google Scholar] [CrossRef]
Großhans, S.; Rüdt, M.; Sanden, A.; Brestrich, N.; Morgenstern, J.; Heissler, S.; Hubbuch, J. In-line Fourier-transform infrared spectroscopy as a versatile process analytical technology for preparative protein chromatography. J. Chromatogr. A 2018, 1547, 37–44. [Google Scholar] [CrossRef]
van Rossum, G. The Python Language Reference, Release 3.0.1 [Repr.]; Python Software Foundation: Hampton, NH, USA; SoHo Books: Redwood City, CA, USA, 2010; ISBN 1441412697. [Google Scholar]
Spyder IDE. Available online: https://www.spyder-ide.org/ (accessed on 19 December 2021).
Keras. Available online: https://keras.io (accessed on 19 December 2021).
Guiochon, G. Fundamentals of Preparative and Nonlinear Chromatography, 2nd ed.; Elsevier: Amsterdam, The Netherlands, 2006; ISBN 978-0123705372. [Google Scholar]
Shekhawat, L.K.; Rathore, A.S. An overview of mechanistic modeling of liquid chromatography. Prep. Biochem. Biotechnol. 2019, 49, 623–638. [Google Scholar] [CrossRef]
Carta, G.; Rodrigues, A.E. Diffusion and convection in chromatographic processes using permeable supports with a bidisperse pore structure. Chem. Eng. Sci. 1993, 48, 3927–3935. [Google Scholar] [CrossRef]
Wilson, E.J.; Geankoplis, C.J. Liquid Mass Transfer at Very Low Reynolds Numbers in Packed Beds. Ind. Eng. Chem. Fund. 1966, 5, 9–14. [Google Scholar] [CrossRef]
Carta, G.; Jungbauer, A. Protein Chromatography; Wiley: Weinheim, Germany, 2010; ISBN 9783527318193. [Google Scholar]
Seidel-Morgenstern, A.; Guiochon, G. Modelling of the competitive isotherms and the chromatographic separation of two enantiomers. Chem. Eng. Sci. 1993, 48, 2787–2797. [Google Scholar] [CrossRef]
Leśko, M.; Åsberg, D.; Enmark, M.; Samuelsson, J.; Fornstedt, T.; Kaczmarski, K. Choice of Model for Estimation of Adsorption Isotherm Parameters in Gradient Elution Preparative Liquid Chromatography. Chromatographia 2015, 78, 1293–1297. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Mollerup, J.M. A Review of the Thermodynamics of Protein Association to Ligands, Protein Adsorption, and Adsorption Isotherms. Chem. Eng. Technol. 2008, 31, 864–874. [Google Scholar] [CrossRef]
Brooks, C.A.; Cramer, S.M. Steric mass-action ion exchange: Displacement profiles and induced salt gradients. AIChE J. 1992, 38, 1969–1978. [Google Scholar] [CrossRef]
Langmuir, I. The adsorption of gases on plane surfaces of glass, mica and platinum. J. Am. Chem. Soc. 1918, 40, 1361–1403. [Google Scholar] [CrossRef] [Green Version]
Zobel-Roos, S. Entwicklung, Modellierung und Validierung von integrierten kontinuierlichen Gegenstrom-Chromatographie-Prozessen. Ph.D. Thesis, Shaker Verlag GmbH, Clausthal University of Technologies, Clausthal-Zellerfeld, Germany, 2018. [Google Scholar]
Seidel-Morgenstern, A. Experimental determination of single solute and competitive adsorption isotherms. J. Chromatogr. A 2004, 1037, 255–272. [Google Scholar] [CrossRef]
Zobel-Roos, S.; Mouellef, M.; Ditz, R.; Strube, J. Distinct and Quantitative Validation Method for Predictive Process Modelling in Preparative Chromatography of Synthetic and Bio-Based Feed Mixtures Following a Quality-by-Design (QbD) Approach. Processes 2019, 7, 580. [Google Scholar] [CrossRef] [Green Version]
Young, M.E.; Carroad, P.A.; Bell, R.L. Estimation of diffusion coefficients of proteins. Biotechnol. Bioeng. 1980, 22, 947–955. [Google Scholar] [CrossRef]
Chung, S.F.; Wen, C.Y. Longitudinal dispersion of liquid flowing through fixed and fluidized beds. AIChE J. 1968, 14, 857–866. [Google Scholar] [CrossRef]

Figure 1. Advanced process control concept for process chromatography—batch and continuous.

Figure 2. Shows the sensitivity analysis of the non a priori excluded parameters on each component. Green curves represent HCP1, red curves IgG, orange curves HCP2. Upper and lower variation ranges for the sensitivity analysis are given in Table 1. (a) Shows the impact on the chromatograms of parameters

a_{1, i}

, which describe the salt dependence of the henry coefficients. (b) Depicts the impact on the chromatograms of parameters

a_{2, i}

, which also describe the salt dependence of the Henry coefficients. (c) Illustrates the impact of the parameters

b_{1, i}

, which describe the salt dependence of the maximum loading capacities. (d) Depicts the impact of the parameters

b_{2, i}

, which also describe the salt dependence of the maximum loading capacities. (e) Shows the impact of the volume flow. (f) Shows the impact of

D_{ax}

. (g) Depicts the impact of

r_{p}

. (h) Shows the impact of

k_{eff, i}

. (i) Illustrates the impact of

ε_{s}

. (j) Shows the impact of

ε_{p}

.

Figure 2. Shows the sensitivity analysis of the non a priori excluded parameters on each component. Green curves represent HCP1, red curves IgG, orange curves HCP2. Upper and lower variation ranges for the sensitivity analysis are given in Table 1. (a) Shows the impact on the chromatograms of parameters

a_{1, i}

, which describe the salt dependence of the henry coefficients. (b) Depicts the impact on the chromatograms of parameters

a_{2, i}

, which also describe the salt dependence of the Henry coefficients. (c) Illustrates the impact of the parameters

b_{1, i}

, which describe the salt dependence of the maximum loading capacities. (d) Depicts the impact of the parameters

b_{2, i}

, which also describe the salt dependence of the maximum loading capacities. (e) Shows the impact of the volume flow. (f) Shows the impact of

D_{ax}

. (g) Depicts the impact of

r_{p}

. (h) Shows the impact of

k_{eff, i}

. (i) Illustrates the impact of

ε_{s}

. (j) Shows the impact of

ε_{p}

.

Figure 3. Predicted over original data of the test set. (a) depicts the results of the

D_{a x}

prediction with an

R^{2}

over 99%, (b) depicts the

ε_{s}

prediction with R² over 99%. The results of

k_{eff, I g G}

,

k_{eff, H C P 1}

and

k_{eff, H C P 2}

are illustrated in (c), (d) and (e) with R² of 91%, 88% and 59%, respectively.

Figure 3. Predicted over original data of the test set. (a) depicts the results of the

D_{a x}

prediction with an

R^{2}

over 99%, (b) depicts the

ε_{s}

prediction with R² over 99%. The results of

k_{eff, I g G}

,

k_{eff, H C P 1}

and

k_{eff, H C P 2}

are illustrated in (c), (d) and (e) with R² of 91%, 88% and 59%, respectively.

Figure 4. Chromatogram of the simulation with the original parameters (solid lines) and the resimulated chromatogram with the ANN-predicted parameters (dashed lines). The red lines represent the target component IgG, the green line the side component group HCP1, and the yellow line the side component group HCP2. The blue line represents the salt gradient.

Figure 5. Comparison of four chromatograms with original and ANN-predicted isotherm parameters from the validation set. In all subplots, the protein group HCP1 is green, the protein group HCP2 is yellow, and the IgG is red. Solid lines represent the chromatograms original isotherm parameters. Dashed lines represent the chromatograms with ANN-predicted isotherm parameters: (a) Depicts strongly overlapping peaks with nearly maximum loading of IgG and HCP2, and strongly non-ideal Gauss-peaks with clear competitive behaviour. (b) Illustrates similar behavior like (a) with peak switching. (c,d) show the typical “shark-fin” shape of highly loaded columns with baseline separation.

Figure 6. The initial noisy chromatogram before the separation performance decreases. The green curve shows HCP1, the red curve IgG, the yellow curve HCP2 and the blue curve shows the salt gradient.

Figure 7. Box plots of the R² of the validation chromatograms with predicted parameters over the chromatograms with the original parameters. The red box plot represents IgG, green represents HCP1 and yellow represents HCP2.

Table 1. Upper and lower limits of the varied parameters from equation in the sensitivity analysis. The parameters

a_{1}, a_{2}, b_{1} and b_{2}

originate from Equations (8) and (9), and describe the salt dependency of the Henry coefficient and maximum loading.

D_{a x}, r_{p}, k_{eff}, ε_{s} and ε_{p}

represent the axial dispersion coefficient, the particle radius, the effective mass transfer coefficient, the voidage and the particle porosity, respectively. The parameters originate from Equations (2), (3) and (5).

Table 1. Upper and lower limits of the varied parameters from equation in the sensitivity analysis. The parameters

a_{1}, a_{2}, b_{1} and b_{2}

originate from Equations (8) and (9), and describe the salt dependency of the Henry coefficient and maximum loading.

D_{a x}, r_{p}, k_{eff}, ε_{s} and ε_{p}

represent the axial dispersion coefficient, the particle radius, the effective mass transfer coefficient, the voidage and the particle porosity, respectively. The parameters originate from Equations (2), (3) and (5).

Parameter	Lower Bound	Initial	Upper Bound
$a_{1, IgG}$	$0.40$	$0.78$	$1.60$
$a_{1, HCP 1}$	$0.80$	$1.62$	$3.20$
$a_{1, HCP 2}$	$0.05$	$0.99$	$0.20$
$a_{2, IgG}$	$- 3.60$	$- 2.98$	$- 1.50$
$a_{2, HCP 1}$	$- 3.60$	$- 3.00$	$- 1.50$
$a_{2, HCP 2}$	$- 3.60$	$- 3.01$	$- 1.50$
$b_{1, IgG}$	$- 0.36$	$- 0.24$	$- 0.12$
$b_{1, HCP 1}$	$- 0.23$	$- 0.15$	$- 0.08$
$b_{1, HCP 2}$	$- 0.15$	$- 0.01$	$- 0.05$
$b_{2, IgG}$	$0.13 g / L$	$0.25 g / L$	$0.50 g / L$
$b_{2, HCP 1}$	$0.13 g / L$	$0.25 g / L$	$0.50 g / L$
$b_{2, HCP 2}$	$0.05 g / L$	$0.11 g / L$	$0.25 g / L$
$flow$	$1.568 L / \min$	$1600 L / \min$	$1.632 L / \min$
$D_{ax}$	$1.3 \times 10^{- 6} {cm}^{2} / s$	$1.3 \times 10^{- 3} {cm}^{2} / s$	$1.3 {cm}^{2} / s$
$r_{p}$	$1 \times 10^{- 7} cm$	$1 \times 10^{- 5} cm$	$1 \times 10^{- 3} cm$
$k_{eff, IgG}$	$1 \times 10^{- 5} {cm}^{2} / s$	$1.5 \times 10^{- 2} {cm}^{2} / s$	$1 \times 10^{2} {cm}^{2} / s$
$k_{eff, HCP 1}$	$1 \times 10^{- 5} {cm}^{2} / s$	$1.2 \times 10^{- 2} {cm}^{2} / s$	$1 \times 10^{2} {cm}^{2} / s$
$k_{eff, HCP 2}$	$1 \times 10^{- 5} {cm}^{2} / s$	$2.7 \times 10^{- 2} {cm}^{2} / s$	$1 \times 10^{2} {cm}^{2} / s$
$ε_{s}$	$0.24$	$0.36$	$0.48$
$ε_{p}$	$0.3$	$0.78$	$0.78$

Table 2. Coefficient of determination of the phase equilibrium parameters from the test set.

	$a_{1, IgG}$	$a_{1, HCP 1}$	$a_{1, HCP 2}$	$a_{2, IgG}$	$a_{2, HCP 1}$	$a_{2, HCP 2}$	$b_{1, IgG}$	$b_{1, HCP 1}$	$b_{1, HCP 2}$	$b_{2, IgG}$	$b_{2, HCP 1}$	$b_{2, HCP 2}$
$R^{2}$	$83 %$	$85 %$	$84 %$	$78 %$	$73 %$	$91 %$	$56 %$	$10 %$	$55 %$	$80 %$	$64 %$	$92 %$

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Mouellef, M.; Szabo, G.; Vetter, F.L.; Siemers, C.; Strube, J. Artificial Neural Network for Fast and Versatile Model Parameter Adjustment Utilizing PAT Signals of Chromatography Processes for Process Control under Production Conditions. Processes 2022, 10, 709. https://doi.org/10.3390/pr10040709

AMA Style

Mouellef M, Szabo G, Vetter FL, Siemers C, Strube J. Artificial Neural Network for Fast and Versatile Model Parameter Adjustment Utilizing PAT Signals of Chromatography Processes for Process Control under Production Conditions. Processes. 2022; 10(4):709. https://doi.org/10.3390/pr10040709

Chicago/Turabian Style

Mouellef, Mourad, Glaenn Szabo, Florian Lukas Vetter, Christian Siemers, and Jochen Strube. 2022. "Artificial Neural Network for Fast and Versatile Model Parameter Adjustment Utilizing PAT Signals of Chromatography Processes for Process Control under Production Conditions" Processes 10, no. 4: 709. https://doi.org/10.3390/pr10040709

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Artificial Neural Network for Fast and Versatile Model Parameter Adjustment Utilizing PAT Signals of Chromatography Processes for Process Control under Production Conditions

Abstract

1. Introduction

2. Materials and Methods

2.1. Chromatography Modeling

2.2. Model Parameter Choice and ANN Dataset Generation

3. Results

3.1. Variation of Packing and Fluid Dynamic Parameters

3.2. Variation of Phase Equilibrium Parameters

3.3. Variation of Phase Equilibrium, Fluid Dynamic and Packing Parameters at Once

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI