Mathematical Modelling by Help of Category Theory: Models and Relations between Them

Legatiuk, Dmitrii

doi:10.3390/math9161946

Open AccessArticle

Mathematical Modelling by Help of Category Theory: Models and Relations between Them

by

Dmitrii Legatiuk

Chair of Applied Mathematics, Bauhaus-Universität Weimar, 99423 Weimar, Germany

Mathematics 2021, 9(16), 1946; https://doi.org/10.3390/math9161946

Submission received: 24 July 2021 / Revised: 7 August 2021 / Accepted: 13 August 2021 / Published: 15 August 2021

(This article belongs to the Special Issue Modeling and Simulation in Engineering)

Download Versions Notes

Abstract

:

The growing complexity of modern practical problems puts high demand on mathematical modelling. Given that various models can be used for modelling one physical phenomenon, the role of model comparison and model choice is becoming particularly important. Methods for model comparison and model choice typically used in practical applications nowadays are computation-based, and thus time consuming and computationally costly. Therefore, it is necessary to develop other approaches to working abstractly, i.e., without computations, with mathematical models. An abstract description of mathematical models can be achieved by the help of abstract mathematics, implying formalisation of models and relations between them. In this paper, a category theory-based approach to mathematical modelling is proposed. In this way, mathematical models are formalised in the language of categories, relations between the models are formally defined and several practically relevant properties are introduced on the level of categories. Finally, an illustrative example is presented, underlying how the category-theory based approach can be used in practice. Further, all constructions presented in this paper are also discussed from a modelling point of view by making explicit the link to concrete modelling scenarios.

Keywords:

category theory; mathematical modelling; abstraction; formal approaches; functors

MSC:

00A71; 06A75; 18B99; 18C10

1. Introduction

The rapid development of modern technologies naturally leads to higher demands for the mathematical modelling process because practical problems nowadays require advanced coupled models. Moreover, typically several models can be used for modelling a given physical phenomenon, and thus a model selection process must be made. Evidently, the model selection influences the quality of a final coupled model. In this regard, one of the most important tasks of a modeller is understanding the role of individual models in a complete coupled model, as well as studying how different models are related along with the practical meaning of this relation.

In engineering applications, various factors leading to reduction of the quality of the final coupled model are typically referred to as uncertainties. According to [1], three types of uncertainties arising during the modelling process can be distinguished: (i) Model inputs, (ii) numerical approximation, and (iii) model form. While the first two types can be identified and treated by the help of computational and statistical methods, see for example [2,3] and references therein, the third type requires an extra treatment. The model form uncertainty implies that a conceptual modelling error has been made, i.e., basic physical assumptions of models have been violated. Considering that the impact of such conceptual modelling errors on the whole modelling process is much more profound, it is necessary to develop tools towards addressing conceptual modelling errors.

Consideration of mathematical models based only on their physical assumptions, i.e., without considering a specific engineering example or performing computations with a model, requires tools of abstract mathematics. Several approaches to using abstract mathematics in applied mathematical modelling, such as graph theory [3], abstract Hilbert spaces [4,5], abstract algebraic approach [6,7], predicate logic [8,9], type theory [10,11], and category theory [12,13], have been proposed in recent years. In this paper, we aim at revisiting and further developing the category theory-based modelling methodology introduced in [13]. The motivation for using category theory for abstract description of mathematical models is based on several aspects: (i) The abstract nature of category theory allows description of very different objects and structures on common basis; (ii) a practical interpretation of abstract constructions provided by category theory-based modelling methodology is straightforward, and thus the methodology can really be used in engineering practice; (iii) category theory naturally provides scaling possibilities implying that description of more sophisticated objects and structures can be done by using the same principles as descriptions of their individual parts; (iv) finally, various applications of category theory scattering from modelling of dynamical systems [14] to ontological representation of knowledge [15] presented in recent years indicate that advantages of category theory are seen and accepted now not only by mathematicians, but also by people interested in applications.

As we have already mentioned, the category theory-based modelling methodology discussed in this paper has been originally proposed in [13]. After publishing this work, several new ideas on categorical modelling methodology providing a deeper understanding of mathematical models and modelling process have appeared in recent years. Therefore, it is necessary to revise ideas presented in [13] with new results and more refined categorical constructions. Moreover, it is worth to mention, that the use of category theory-based modelling methodology for analysis of models appearing in real-world engineering problems from the field of aeroelastic analysis of bridges has been presented in [16]. This work indicated practical advantages of using category theory for modelling purposes. To this end, the category theory-based modelling methodology presented in this paper aims at a consistent description of mathematical models and relations between them in the language of category theory. For the sake of clarity, we focus in this paper only on individual mathematical models, while coupled models will be treated in future research using results from the current paper as a basis.

Abstract categorical descriptions of mathematical models requires at first defining universal properties of models, which are properties shared by models in general, i.e., independent on a particular problem of an engineering field. If a universal model property is defined, then all categorical constructions used in one specific modelling application can be directly transferred to another field. Thus, we will start our construction with defining such a universal model property which is common for all models. Moreover, the main goal is to keep track of real physical and engineering interpretations of the constructions introduced in the category theory-based modelling methodology. The paper is organised as follows: Section 2 presents a general structure of categories of mathematical models together with a detailed discussion on practical interpretation of the introduced definition; after that, relations between mathematical models are discussed in Section 3; Section 4 formalises the problem of having different formulations of the same mathematical model by introducing the notion of convertible mathematical models; Section 5 provides an illustrative example how categorical constructions introduced in the previous sections can be used for comparison and analysis of models. Finally, in Section 6 we discuss a universal arrow in the framework of category theory-based modelling methodology, as well as establish a connection to an abstract algebraic approach, after we draw conclusions and discuss shortly the scope of future work. For making the paper self-contained, some basic definitions from category theory are presented in the Appendix A.

2. Categories of Mathematical Models

Before starting with categorical constructions, it is important to underline, that models used in practice can be generally classified in two types:

Physics-based models—models which are based on mathematical formalisations of physical laws and assumptions;
Data-driven models—models which are based on representations of data, e.g., results of experiments or measurements obtained from a monitoring system.

This paper deals with physics-based models, which are referred to simply as mathematical models, because this type of models is typically implied by the term mathematical modelling. Moreover, because mathematical models are based on physical assumptions formalised by the help of mathematical expressions, they provide a richer basis for abstract considerations, compared to data-driven models, which are very often black-box models not relying on any physical assumptions.

We start our construction with the introduction of concrete categories

{Model}_{i}

,

i = 1, 2, \dots

, which are associated with mathematical models used to describe a certain physical phenomenon, such as, for example, models of elasticity theory or heat conduction. The term “associated” has been used, because, strictly speaking, the objects of categories

{Model}_{i}

,

i = 1, 2, \dots

are not mathematical models themself, but rather sets of basic physical assumptions on which the corresponding mathematical models are created. However, to keep notations short and transparent, we will refer to these categories simply as to categories of mathematical models. The following definition introduces basic structure of these categories:

Definition 1

(Category of mathematical models). Let

{Model}_{1}

be a category of mathematical models describing a given physical phenomenon. Then for all objects of

{Model}_{1}

the following assumptions hold:

(i): Each object is a finite non-empty set – set of assumptions of a mathematical model, denoted by ${Set}_{A}$ , where $A$ is the corresponding mathematical model;
(ii): Morphisms (arrows) are relations between these sets;
(iii): For each set of assumptions and its corresponding model exists a mapping

${Set}_{A} \overset{S}{\mapsto} A;$
(iv): All objects are related to mathematical models acting in the same physical dimension.

Let us now provide some motivation from the modelling perspective and comments for the assumptions used in this definition:

Assumption (i). This assumption comes naturally from the modelling process: A mathematical model is created to describe a certain physical phenomenon or process, and evidently, it is possible only if physical background of the phenomenon or process is clearly stated, i.e., assumptions to be satisfied by the model are formulated. Moreover, for a stronger distinction between different mathematical models, the set of assumptions is understood in a broader sense: Not only basic physical assumptions are listed, but all further modifications and simplifications of the model, such as for example a linearisation of original equations, are also elements of the set of assumptions. The requirements for the set of assumptions to be finite comes from the fact that no model possess an infinite set of physical assumptions. Therefore, consideration of more general sets is not necessary.
It is also important to remark that having finite sets as objects in the category is one possible way to approach mathematical models. Alternatively, one could think of working directly with mathematical expressions (equations) representing the models. However, in this case it will be more difficult to distinguish models, since the same set of assumptions can be formalised differently in terms of final equations, as we will see in Section 4.
Assumption (ii). This assumption, in fact, introduces the structure of categories of mathematical models. The main point here is that instead of working with discrete categories, it is beneficial to study more elaborated structure. Since the objects in categories of mathematical models are sets, it is natural to use relations between sets as morphism in the categories. We will make these relations more specific in Section 3.
Assumption (iii). This assumption formally describes the process of obtaining the final form of a model, e.g., differential or integral equation, from basic physical assumptions. In this case, mapping S is, in fact, a formalisation process consisting in writing basic physical assumptions in terms of mathematical expressions, which constitute a mathematical model in the end of the formalisation process. Naturally, the formalisation process can be done by different means and approaches, for example first ideas on using type theory to describe the formalisation process towards detecting conceptual modelling errors have been presented in [10,11].
We also would like to remark, that originally, mapping S has been called invertible in [13]. The invertibility in this case means, that set of assumptions can be uniquely reconstructed from the final form of a model. While that such a reconstruction is theoretically indeed possible, it is generally not unique. Even if we consider the following canonical parabolic equation

$u_{t} = a^{2} u_{x x},$

then without extra context it cannot be decided if this is a heat equation or a diffusion equation. Therefore, the invertibility of a mapping S has been dropped from Definition 1.
Assumption (iv). This assumption ensures that we do not treat equally models from different dimensions.

It is also important to mention that according to Definition 1, models with different parameters, e.g., material constants, will be corresponded to the same set of assumptions. For example, if we consider the set of assumptions leading to the Lamé equation (partial differential equation with constant coefficients), then it is clear that infinite number of constant coefficients exists, but all these specific models are originated from the same set of assumptions. In general, models originating from the same set of assumptions, but having different material parameters are just particular instance of a general set of assumptions. This fact is particularly important for engineering applications, where stochasticity of material parameters in deterministic models is often considered as stochastic modelling. However, as we discussed above, the stochasticity only in material parameters does not change basic modelling assumptions, because the fact that a constant is chosen according to a certain probability law does not principally affect the assumption of having constant coefficients. In contrast, modelling of physical process by the help of stochastic partial differential equations is based on completely different modelling assumptions, see for example [17], and therefore, should not be put together with “classical” mathematical models.

3. Relations between Mathematical Models

This section is devoted to defining relations between sets of assumptions, which are objects in categories of mathematical models, as introduced in Definition 1. The main requirement for such relations is that their must define a universal model property, which is independent on a specific problem, meaning that boundary or initial conditions (but not coupling/transmission conditions!) should not have influence on the model property. For satisfying this requirement, the comparison of mathematical models by the help of universal model property called model complexity is proposed [13]:

Definition 2

(Complexity of mathematical models). Let A and B be mathematical models in a category

{Model}_{1}

. We say that model A has higher complexity than model B if and only if

{Set}_{A} \subset {Set}_{B}

, but

{Set}_{B} ⊄ {Set}_{A}

. Consequently, two models are called equal, in the sense of complexity, iff

{Set}_{B} = {Set}_{A}

.

The model complexity in this definition is defined relatively, since we do not describe it explicitly. From the point of view of physics, model complexity reflects the fact that a model which has less assumptions provides a more accurate description of a physical phenomenon under consideration. Thus, the model complexity is a relative quality measure of how good a mathematical model represents a given physical phenomenon. The relativity in the measure comes from the fact, that any comparison needs at least two objects, and one model cannot be assessed with respect to its ability represent the corresponding physical process, otherwise that would imply that the exact representation of the physical process is known a priori.

It is important to underline, that the notion of model complexity proposed in Definition 2 is neither related to the notion of complexity of an algorithm, nor to the notion of complexity used for statistical models, where the number of parameters is typically served as complexity measure. The advantage of the notion of model complexity introduced in Definition 2 is the fact that it does not depend on specific boundary or initial conditions, since typically basic model assumptions are not influenced by them. Nonetheless, if boundary conditions are essential for basic model assumptions, e.g., singular boundary conditions, then they will be automatically listed in the corresponding set of assumptions, since such boundary conditions are critical for describing the physical process. Thus, the model complexity introduced in Definition 2 is a universal model property.

Additionally, Definition 2 might sound a bit counterintuitive, since it states that a model satisfying less modelling assumption is more complex, and not of higher simplicity, as it could be expected as well. In fact, both points of view on the complexity are possible, and differ only in the general understanding of modelling assumptions. Definition 2 is based on the idea that modelling assumptions act as restrictions for a model, and thus implying that a model with less modelling assumptions is more general. Nonetheless, another perspective on the notion of model complexity still can be considered, which would reflect the opposite point of view that model assumptions are not restrictions, but rather generalisations of models. This discussion is also directly related to the following important remark:

Remark 1.

Sets of assumptions introduced in Definition 2 are assumed to be written by the help of a natural language. While intuitively it is clear how to formulate these sets, as well as how to compare them in the sense of model complexity, from the formal perspective it is not so straightforward. In fact, a formal comparison of sets of assumptions written in a natural language can be done only by the help of a detailed semantic analysis of these sentences, and only after that, sentences, and hence sets of assumptions, can be rigorously compared. As a possible way around this problem, stricter rules on formulating sets of assumptions might be imposed. In that case, a kind of basic “alphabet” containing allowed expressions and symbols could be introduced. Moreover, perhaps a combination of a natural language and mathematical expressions complemented by strict rules could be a suitable option. Different possibilities to address the problem of a rigorous comparison of sets of assumptions will be studied in future work.

From the point of relational algebra, model complexity is a binary relation in a category of mathematical models. Hence, the objects in categories of mathematical models can be ordered by using model complexity. However, the ordering of objects defined by model complexity is only partial, and not total, since examples of mathematical models which should belong to the same category but cannot be ordered according to Definition 2 can be easily found, see for example aerodynamic models used in bridge engineering [16]. Naturally, in some cases mathematical models can constitute a category with totally ordered objects. To have a clear distinction between categories with partial and total ordering of objects, we introduce the following definition [16]:

Definition 3.

Let

{Model}_{1}

be a category of mathematical models in which n objects

{Set}_{A_{j}}

,

j = 1, \dots, n

can be ordered according to Definition 2 as follows

{Set}_{A_{i}} \subset {Set}_{A_{j}}, f o r i < j \leq n .

Moreover, let X be the set of all modelling assumptions used in this category. Then category

{Model}_{1}

contains totally ordered objects, and therefore is associated with totally ordered models, iff

X = {Set}_{A_{1}} \cup {Set}_{A_{2}} \cup \dots \cup {Set}_{A_{n}}, a n d {Set}_{A_{n}} = X,

otherwise, the category

{Model}_{1}

contains partially ordered objects corresponding to partially ordered models.

As a direct consequence of this definition we have the following corollary:

Corollary 1.

In a totally ordered category

{Model}_{1}

with n objects always exist two unique objects:

Object ${Set}_{A_{1}}$ satisfying ${Set}_{A_{1}} \subset {Set}_{A_{i}}$ $\forall i = 2, \dots, n$ , which is called the most complex object, and the associated model $A_{1}$ is called the most complex model;
Object ${Set}_{A_{n}}$ satisfying ${Set}_{A_{n}} = {Set}_{A_{1}} \cup {Set}_{A_{2}} \cup \dots \cup {Set}_{A_{n}}$ , which is called the the simplest object element, and the associated model $A_{n}$ is called the simplest model.

It is worth to mention, that in the framework of introduced modelling formalism, the most complex object and the simplest object are, in fact, initial object and terminal object in categories of mathematical models, respectively. Note that, although categories of mathematical models have finite sets as objects, the initial and terminal objects are different to the ones in the classical category

Sets

, where these are given by the empty set and one-element set, correspondingly. The difference comes precisely from the modelling background of our categories, since while formally it is still possible to consider the empty and one-element sets as sets of assumptions of some (fictitious) models, it does not make sense from the modelling perspective.

The proof of Corollary 1 is straightforward, and we only would like to mention, that uniqueness of objects

{Set}_{A_{1}}

and

{Set}_{A_{n}}

follows immediately from Definition 2 and from the fact that a totally ordered category is considered. The situation is trickier in the case of partially ordered categories:

Proposition 1.

For a partially ordered category

{Model}_{1}

with n objects one of the following statements holds:

(i): The most complex object ${Set}_{A_{1}}$ and the simplest object ${Set}_{A_{n}}$ do not exist;
(ii): The most complex object ${Set}_{A_{1}}$ exists, while the simplest object ${Set}_{A_{n}}$ does not exist;
(iii): The most complex object ${Set}_{A_{1}}$ does not exist, while the simplest object ${Set}_{A_{n}}$ exists;
(iv): The most complex object ${Set}_{A_{1}}$ and the simplest object ${Set}_{A_{n}}$ exist simultaneously.

Proof.

We prove this proposition by straightforwardly constructing corresponding structures of partially ordered categories. We start the proof by proving cases (ii) and (iii) at first, since the proof of case (i) will be based on cases (ii) and (iii), and finally we will prove case (iv). We consider a category with one object

{Set}_{A_{1}}

, and the rest objects we construct explicitly from

{Set}_{A_{1}}

. Without loss of generality we assume that

{Set}_{A_{1}}

contains at least one element, which will be denoted by

A_{1}^{(1)}

. The objects

{Set}_{A_{2}}

and

{Set}_{A_{3}}

are then constructed from

{Set}_{A_{1}}

by adding different elements

A_{1}^{(2)}

and

A_{1}^{(3)}

to

{Set}_{A_{1}}

, correspondingly, i.e., we obtain new sets of assumptions by adding two different assumptions. This construction is shown by the diagram

implying that

{Set}_{A_{1}} \subset {Set}_{A_{2}}

and

{Set}_{A_{1}} \subset {Set}_{A_{3}}

, but

{Set}_{A_{2}}

and

{Set}_{A_{2}}

are not related. Thus,

{Set}_{A_{1}}

is the most complex object in this category, but no the simplest object exists. Thus, the case (ii) is proved.

The proof of case (iii) is analogues to case (ii), where only instead of adding extra assumptions, we remove different assumptions from the initial set. Thus, for simplicity, we assume that

{Set}_{A_{1}}

has at least two different assumption. The rest of the proof follows immediately.

To prove case (i), we consider now two distinct objects

{Set}_{A_{1}}

and

{Set}_{A_{2}}

given by

{Set}_{A_{1}} = \{A_{1}^{(1)}, A_{1}^{(2)}, A_{1}^{(3)}\}

and

{Set}_{A_{2}} = \{A_{1}^{(1)}, A_{1}^{(2)}, A_{2}^{(1)}\}

, respectively. Similar to cases (ii) and (iii), we construct now two other objects in two different ways as follows:

\begin{matrix} {Set}_{A_{3}} & = & \{A_{1}^{(1)}, A_{1}^{(2)}, A_{1}^{(3)}\} \ \{A_{1}^{(2)}, A_{1}^{(3)}\} = \{A_{1}^{(1)}\}, \\ {Set}_{A_{4}} & = & \{A_{1}^{(1)}, A_{1}^{(2)}, A_{1}^{(3)}\} \ \{A_{1}^{(1)}, A_{1}^{(3)}\} = \{A_{1}^{(2)}\}, \end{matrix}

and

\begin{matrix} {Set}_{A_{3}} & = & \{A_{1}^{(1)}, A_{1}^{(2)}, A_{2}^{(1)}\} \ \{A_{1}^{(2)}, A_{2}^{(1)}\} = \{A_{1}^{(1)}\}, \\ {Set}_{A_{4}} & = & \{A_{1}^{(1)}, A_{1}^{(2)}, A_{2}^{(1)}\} \ \{A_{1}^{(1)}, A_{2}^{(1)}\} = \{A_{1}^{(2)}\} . \end{matrix}

This construction is illustrated by the following diagram:

Thus, the constructed category is partially ordered, and since objects

{Set}_{A_{1}}

and

{Set}_{A_{2}}

are not related, this category does not contain neither the most complex nor the simplest objects, since no object satisfies assumptions of Corollary 1.

For proving case (iv), let us consider the object

{Set}_{A_{1}} = \{A_{1}^{(1)}, A_{1}^{(2)}, A_{1}^{(3)}, A_{1}^{(4)}\}

, and let us construct several other objects according to the following commutative diagram

While the diagram is commutative, but the objects on the left side are not related to the objects of the right side in the sense of Definition 2. Thus, we have a partially ordered category, where both the most complex object

\{A_{1}^{(1)}\}

and the simplest object

\{A_{1}^{(1)}, A_{1}^{(2)}, A_{1}^{(3)}, A_{1}^{(4)}\}

exist simultaneously. Hence, the proposition is proved. □

Next, we have the following theorem:

Theorem 1.

Consider a category

{Model}_{1}

with n objects. If the most complex object

{Set}_{A_{1}}

and the simplest object

{Set}_{A_{n}}

exist simultaneously in the category

{Model}_{1}

, then

{Model}_{1}

is either a totally ordered category, or contains at least two totally ordered subcategories.

Proof.

The proof of the theorem follows immediately from Corollary 1, Proposition 1, and Definition 3. Looking at the proof of the case (iv) in Proposition 1, we see immediately that two totally ordered subcategories exist. The case of only one totally ordered subcategory is excluded by the assumption of simultaneous existence of the most complex and the simplest objects. Further, if the most complex and the simplest objects exist simultaneously and all objects in the category

{Model}_{1}

are related by the help of complexity, then it follows immediately that

{Model}_{1}

is a totally ordered category. □

Evidently, the last statement can be straightforwardly generalised as follows:

Theorem 2.

Every partially ordered category of mathematical models contains at least one totally ordered category of mathematical models as a subcategory.

4. Convertible Mathematical Models

In this section, we will discuss the mappings S between sets of assumptions and the corresponding models appearing in Definition 1, and as we will see from the upcoming discussion, the role of mappings S provides clear reasoning why objects of categories of mathematical models are sets of assumptions and not the models themselves. The mappings S are generally not invertible, because they represent a formalisation process of basic modelling assumptions in terms of mathematical expressions. Moreover, these mappings are also not unique, since the same set of assumptions can be formalised differently. However, if objects in a category have been ordered (partially or totally) according their complexity, then the mappings will preserve this structure. Thus, these mappings are structure preserving mappings, i.e., they are functors.

Because the mappings between sets of assumptions and the corresponding mathematical models are functorial, then, in fact, the mathematical models constitute also a category. However, since final form of a model depends on the formalisation process, it is more difficult to work directly with categories of models, rather than to describe categories of sets of assumptions, as we have done already. Nonetheless, we will point out now some results related to the models directly. First, we summarise the above discussion in the following definition:

Definition 4.

Let

{Set}_{A_{1}}

be an object in the category

{Model}_{1}

, and let

B_{1}

and

B_{2}

be two possible model formulations associated with the object

{Set}_{A_{1}}

via two functors F and G. Then the model formulations

B_{1}

and

B_{2}

are connected via a natural transformation of functors ϑ, and the model formulations

B_{1}

and

B_{2}

are called convertible. This construction corresponds to the commutative diagram

moreover, models which are instantiated by convertible model formulations will be called convertible models.

Obviously, because different model formulations are related to the same set of assumptions, the model complexity of these formulations remains the same. Thus, we have immediately the following corollary:

Corollary 2.

Convertible models have the same complexity.

The discussion about convertible mathematical models underlines once more why sets of assumptions are considered as objects in categories of mathematical models, and not model formulations directly. Assume for a moment, that the latter would be the case and consider the following diagram with three objects for simplicity:

Moreover, assume additionally that the model formulations

A_{1}

and

A_{2}

are convertible in the sense of Definition 4, while the model formulation

A_{3}

is not associated with the same set of assumptions. Thus, we would end up with two kinds of morphisms in the category: Morphism f plays the same role as the natural transformation

ϑ

in Definition 4, while morphisms g and h represent complexity-relation on the level of model formulations. Obviously, it is necessary to be able to distinguish between the two kinds of morphisms, which would imply much more complicated constructions for the structure of the category, as well as for relations between its objects.

As a simple immediate example indicating the necessity for considering convertible mathematical models, let us consider the classical model of linear elasticity describing deformations of an elastic body in a static case. The classical formulation of this model is given by the following system of equations

\{\begin{matrix} div \tilde{σ} + ρ K = 0, \\ \tilde{ε} = \frac{1}{2} [\nabla u + {(\nabla u)}^{T}], \\ \tilde{σ} = 2 μ (\frac{ν}{1 - 2 ν} ϑ \tilde{E} + \tilde{ε}), \end{matrix} ϑ = div u = \frac{\partial u_{1}}{\partial x_{1}} + \frac{\partial u_{2}}{\partial x_{2}} + \frac{\partial u_{3}}{\partial x_{3}},

(1)

where

\tilde{σ}

is a symmetric stress tensor,

\tilde{ε}

is a symmetric strain tensor,

u

is a displacement vector,

ρ

is a material density,

ν

is the Poisson’s ration, and

K

is the volume force. System of Equation (1) is the classical tensor version of elasticity equations, see for example [18]. However, the Lamé equation

μ Δ u + (λ + μ) grad div u + ρ K = 0,

(2)

is often used in practice as well. Furthermore, model of linear elasticity can be also written as follows

D M D u = 0, with D = \sum_{k = 1}^{3} e_{k} \partial_{k}, and u = u_{0} + u,

(3)

where the multiplication operator M is defined by

M u : = \frac{m - 2}{2 (m - 1)} u_{0} + u, m : = ν^{- 1} .

Equation (3) is a quaternionic form of elasticity model with D denoting the Dirac operator, see [19] for all details on quaternionic analysis and its applications.

For the sake of clarity of further considerations, let us denote the models (1)–(3) as follows:

\begin{matrix} B_{1} : = & \{\begin{matrix} div \tilde{σ} + ρ K = 0, \\ \tilde{ε} = \frac{1}{2} [\nabla u + {(\nabla u)}^{T}], \\ \tilde{σ} = 2 μ (\frac{ν}{1 - 2 ν} ϑ \tilde{E} + \tilde{ε}), \end{matrix} ϑ = div u = \frac{\partial u_{1}}{\partial x_{1}} + \frac{\partial u_{2}}{\partial x_{2}} + \frac{\partial u_{3}}{\partial x_{3}}, \\ B_{2} : = & μ Δ u + (λ + μ) grad div u + ρ K = 0, \\ B_{3} : = & D M D u = 0, with D = \sum_{k = 1}^{3} e_{k} \partial_{k}, and u = u_{0} + u . \end{matrix}

A possible representation of these models is provided by the diagram

Here, functor S is a formalisation process of basic set of assumptions of linear elasticity

{Set}_{A_{1}}

in the tensor form of model formulation

B_{1}

, after that, the tensor form can be further reformulated into the Lamé equation

B_{2}

, or into the quaternionic form

B_{3}

via functorial mappings F and G. In some sense, the above diagram reflects traditional way of developing different model formulations: At first, the original form is introduced, and after that, several more specific forms better suitable for selected methods are introduced. Moreover, looking in particular at the quaternionic formulation

B_{3}

, it becomes clear that this form is not obtained directly through the formalisation process of

{Set}_{A_{1}}

(at least no quaterninic-based modelling of linear elasticity has been reported till now), but through reformulation of either Lamé equation or the tensor form, see again [19].

5. Illustrative Examples

In this section, we illustrate the constructions of category theory-based modelling methodology presented in previous sections on two examples: First, we discuss classical models of beam theories, and after that, we discuss aerodynamic models used in bridge engineering. These examples have been already presented in works [13,16] at the time of first steps towards developing the category theory-based modelling methodology. Therefore, it is necessary to revisit these examples for underlying further development of the theory.

5.1. Categorical Modelling of Beam Theories

Transverse vibrations of one-dimensional beams are typically modelled by one of three common beam theories: Bernoulli–Euler theory, Rayleigh theory, and Timoshenko theory. Thus, let us consider a category of mathematical models, denoted by

Beam

, containing as objects sets of assumptions

{Set}_{B - E}

,

{Set}_{R}

,

{Set}_{T}

corresponding to the Bernoulli–Euler, Rayleigh, and Timoshenko beam theories, respectively. We start our discussion on the construction of category

Beam

by explicitly listing the sets of assumptions, which are given in Table 1.

Remark 2.

The assumptions, as listed in Table 1, are formulated by the help of natural language, however in some cases it is more convenient to formulate sets of assumptions directly in terms of mathematical expressions, or as a mixture of both. While from the set-theoretic point of view such a freedom in writing sets of assumptions is not completely justified, it is acceptable in our setting because each set of assumption written in natural language can be rigorously formalised in terms of mathematical expressions. Thus, writing mathematical expressions in sets of assumptions can be considered as a kind of syntactic sugar, similar to programming languages terminology. Of course, this analogy not perfect but reflects a general point of view on writing sets of assumptions.

Since derivation of beam models is well known, it will be omitted. Set of assumption

{Set}_{B - E}

of the Bernoulli–Euler theory leads to the following beam equation:

ρ F \frac{\partial^{2} u}{\partial t^{2}} + E I_{y} \frac{\partial^{4} u}{\partial x^{4}} = 0,

where E is the Young’s modulus of the material,

I_{y}

is the moment of inertia,

r h o

is the density of material, and F is the area of cross section. Next, set of assumption

{Set}_{R}

of the Rayleigh theory leads to the equation:

ρ F \frac{\partial^{2} u}{\partial t^{2}} + E I_{y} \frac{\partial^{4} u}{\partial x^{4}} - ρ I_{y} \frac{\partial^{4} u}{\partial x^{2} \partial t^{2}} = 0 .

Finally, if the effect of bending of cross sections is taken into account, then set of assumption

{Set}_{T}

of the Timoshenko theory is obtained, which leads to the system of differential equations:

\{\begin{matrix} ρ F \frac{\partial^{2} u}{\partial t^{2}} - ℵ μ F \frac{\partial^{2} u}{\partial x^{2}} + ℵ μ F \frac{\partial φ}{\partial x} & = & 0, \\ ρ I_{y} \frac{\partial^{2} φ}{\partial t^{2}} - E I_{y} \frac{\partial^{2} φ}{\partial x^{2}} + ℵ μ F (φ - \frac{\partial u}{\partial x}) & = & 0, \end{matrix}

where

φ

is the angle of rotation of the normal to the mid-surface of the beam, ℵ is the Timoshenko shear coefficient, which depends on the geometry of the beam, and

μ

is the shear modulus. After some calculations this system can be reformulated in terms of only one partial differential equation for u as follows:

ρ F \frac{\partial^{2} u}{\partial t^{2}} + E I_{y} \frac{\partial^{4} u}{\partial x^{4}} - ρ I_{y} (1 + \frac{E}{ℵ μ}) \frac{\partial^{4} u}{\partial x^{2} \partial t^{2}} + \frac{ρ^{2} I_{y}}{ℵ μ} \frac{\partial^{4} u}{\partial t^{4}} = 0 .

Looking at the above beam models from the categorical perspective, we can summarise these models and their sets of assumptions as follows:

\begin{matrix} {Set}_{B - E} & \overset{S}{\mapsto} & ρ F \frac{\partial^{2} u}{\partial t^{2}} + E I_{y} \frac{\partial^{4} u}{\partial x^{4}} = 0 & = : A, \\ {Set}_{R} & \overset{S}{\mapsto} & ρ F \frac{\partial^{2} u}{\partial t^{2}} + E I_{y} \frac{\partial^{4} u}{\partial x^{4}} - ρ I_{y} \frac{\partial^{4} u}{\partial x^{2} \partial t^{2}} = 0 & = : B, \\ {Set}_{T} & \overset{S}{\mapsto} & ρ F \frac{\partial^{2} u}{\partial t^{2}} + E I_{y} \frac{\partial^{4} u}{\partial x^{4}} - ρ I_{y} (1 + \frac{E}{ℵ μ}) \frac{\partial^{4} u}{\partial x^{2} \partial t^{2}} + \frac{ρ^{2} I_{y}}{ℵ μ} \frac{\partial^{4} u}{\partial t^{4}} = 0 & = : C_{1}, \\ {Set}_{T} & \overset{S}{\mapsto} & \{\begin{matrix} ρ F \frac{\partial^{2} u}{\partial t^{2}} - ℵ μ F \frac{\partial^{2} u}{\partial x^{2}} + ℵ μ F \frac{\partial φ}{\partial x} & = & 0, \\ ρ I_{y} \frac{\partial^{2} φ}{\partial t^{2}} - E I_{y} \frac{\partial^{2} φ}{\partial x^{2}} + ℵ μ F (φ - \frac{\partial u}{\partial x}) & = & 0 . \end{matrix} & = : C_{2}, \end{matrix}

where S are formalisation mappings, as discussed before. It is worth making the remark:

Remark 3.

Note that, in general, mappings S can be different for each set of assumptions, or, can be the same if all equations are derived based on the same principle, e.g., the Hamilton’s principle. If the fact that different formalisation processes have been used to obtain models from the sets of assumptions in one category is essential for the application, then it is necessary to indicate this fact by using sub-scripts, i.e.,

S_{1}

,

S_{2}

, …, otherwise the general notation for the formalisation mappings might be kept.

By using Definition 2, the category

Beam

can be straightforwardly equipped with the commutative diagram

The morphisms f, g, and h indicate the simple fact, that one beam theory can be obtained from another by weakening basic assumptions. Moreover, the above diagram clearly indicate that the object

{Set}_{T}

(Timoshenko theory) is the most complex, the object

{Set}_{R}

(Rayleigh theory) has higher complexity than the object

{Set}_{B - E}

(Bernoulli–Euler theory), which is the simplest object. The same ordering holds for the corresponding model instantiations. Next, let us list the following facts we know about the category

Beam

:

It is a totally ordered category;
The object ${Set}_{B - E}$ is the initial object of this category;
The object ${Set}_{T}$ is the terminal object of this category;
Models $C_{1}$ and $C_{2}$ are convertible, since they represent different formulations of the assumptions of Timoshenko theory.

Note that, first three facts, as well as the commutative diagram presented above, do not require, in fact, models themself, because these facts are solely obtained simply from the sets of assumptions, i.e., by looking at the objects in the category

Beam

. Thus, the categorical point of view introduced in the previous section reflects the following idea:

The principle difference between models lies not in their final form, but in the basic modelling assumptions these models are constructed from.

Finally, let us look at the level of models, where the following diagram is obtained

where

ϑ

denotes a natural transformation appearing in the definition of convertible models, recall Definition 4.

5.2. Category of Aerodynamic Models Revisited

Next, we briefly revisit the example of aerodynamic models used in bridge engineering presented in [16]. Since the idea is only briefly discuss categorical constructions introduced in previous sections, we will not present aerodynamic models in details, but we refer to works [20,21]. We consider the category

AeroModel

containing as objects the following sets of assumptions of mathematical models: (i)

ST

(steady model); (ii)

LST

(linear steady model); (iii)

QS

(quasi-steady model); (iv)

LQS

(linear quasi-steady model); (v)

LU

(linear unsteady model); (vi)

MQS

(modified quasi-steady model); (vii)

MBM

(mode-by-mode model); (viii)

CQS

(corrected quasi-steady model); (ix)

HNL

(hybrid nonlinear model); (x)

MNL

(modified nonlinear model); and, (xi)

NLU

(nonlinear unsteady model). The structure of category

AeroModel

is provided by the following diagram (adapted from [16]):

Let us now list some facts we know about the category AeroModel:

It is a partially ordered category;
The object LST is the initial object of this category;
The object NLU is the terminal object of this category;
According to Theorem 1 several totally ordered subcategories exists, which are

Additionally, we can say that no models associated to the objects of AeroModel are convertible, but for that it is necessary to take a look at the derivation of models, see again [16] and references therein.

6. Further Characterisations of Mathematical Models and Conclusions

In this section, we present some further ideas on characterisations of mathematical models. One of the most important aspect of applications of category theory is a definition of a universal mapping property (UMP), or simply, a universal arrow, which provides, in fact, a categorical characterisation of objects, see [22,23] for details. Hence, it is important to discuss the universal arrow definition also in the context of category theory-based modelling methodology.

Let us consider a formalisation functor

S : Model \to M

, where

M

denotes formally a category of instantiations of mathematical models corresponding to the objects in

Model

. Let m be an object of

M

, then a universal arrow from m to S is a pair

〈 r, u 〉

consisting of an object r of

Model

and an arrow

r : m \to S r

of

M

, such that to every pair

〈 d, f 〉

with d an object of

Model

and

f : c \to S d

an arrow of

M

, there is a unique arrow

f^{'} : r \to d

of

Model

with

S f^{'} \circ u = f

. Practical meaning of a universal arrow in the context of category theory-based modelling methodology is that to the same set of assumption can correspond only convertible model formulations.

Finally, we would like to provide another possible definition of a mathematical model in general, which would summarise our discussion in this paper:

Definition 5.

A mathematical model

M

is a triple

M = 〈 Set, M, S 〉

, where

$Set$ is the set of assumptions of the model;
$M$ is an instantiation of the model in terms of mathematical expressions and equations;
S is a formalisation mapping, which formalises the set of assumptions $Set$ into the model instantiation $M$ .

Relations between the models can be introduced again by the help of Definition 2. Definition 5 proposes an abstract description of a mathematical model similar to the abstract algebraic approach presented in [6]. Thus, a connection between the category theory-based modelling methodology and abstract algebraic approach is established. Hence, both approaches to the modelling process in engineering might complement each other, and therefore, the connection between both approaches will be studied in future research.

In this paper, we have revisited the category theory-based modelling methodology proposed in recent years. The main idea of this modelling methodology is representation of mathematical models by the help of categorical constructions. We have presented revised results from previous works, as well as new results and ideas supporting a deeper understanding of the modelling process in engineering. Moreover, two illustrative practical examples, namely categorical perspective of beam models and on aerodynamic models from bridge engineering, have been revisited. As it can be clearly seen from the examples, the category theory-based modelling methodology presented in this paper is indeed applicable in practice and provides various characterisations of mathematical models, relations between them, and final formulations of models. Finally, we have describe a universal arrow in the framework of the proposed modelling methodology.

Additionally, we would like to remark how the category theory-based modelling methodology presented in this paper can be used in a model selection process. After constructing a category of mathematical models, we can formulate criteria which must be satisfied by a model for a given practical problem, and thus a subcategory of models satisfying these criteria can be constructed. Because we are on the abstract level of models, it is difficult to introduce a quantifiable criterion for the optimal model choice. Nonetheless, on the abstract level, the simplest model satisfying the criteria can be regarded as “the optimal choice” in this case, because generally there is no need for overcomplicating the model. Furthermore, the difference in model assumptions, and thus in model complexity, can be quantified by the help of numerical calculations, as it has been illustrated in [16] for the case of aerodynamic models.

The scope of future research is related to a revision and deeper understanding of coupled mathematical models. A categorical description of a coupled mathematical model will use constructions and ideas introduced in this paper. However, due to the more complex nature of coupled models, it is expected that more refined and advanced constructions will be necessary for a proper description of such models. Moreover, further ideas on a formal model comparison and model selection procedure, as well as a more strict approach to the formulation of sets of assumptions, will be considered in future work.

Funding

This research is supported by the German Research Foundation (DFG) through grant LE 3955/4-1.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

I would like to thank the reviewers for very helpful comments, which help not only improving the paper, but also brought new ideas for future research.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Some Basic Definitions from Category Theory

Following the classical works in category theory [22,23], we list here few important definitions.

Definition A1.

A category consists of the following data:

Objects: $A, B, C, \dots$
Arrows (morphisms): $f, g, h, \dots$
For each arrow f, there are given objects $dom (f)$ and $cod (f)$ called the domain and codomain of f, respectively. We write

$f : A ⟶ B o r A \overset{f}{⟶} B$

to indicate that $A = dom (f)$ and $B = cod (f)$ .
Given arrows $f : A ⟶ B$ and $g : B ⟶ C$ , that is, with $cod (f) = dom (g)$ , there is given an arrow

$g \circ f : A ⟶ C$

called the composite of f and g.
For each object A, there is given an arrow

$1_{A} : A ⟶ A$

called the identity arrow of A.

These data are required to satisfy the following laws:

Associativity: $h \circ (g \circ f) = (h \circ g) \circ f$ for all $f : A ⟶ B$ , $g : B ⟶ C$ , $h : C ⟶ D$ .
Unit: $f \circ 1_{A} = f = 1_{B} \circ f$ for all $f : A ⟶ B$ .

Definition A2.

A functor

F : C ⟶ D

between categories

C

and

D

is a mapping of objects to objects and arrows to arrows, in such a way that

(a): $F (f : A ⟶ B) = F (f) : F (A) ⟶ F (B)$ ,
(b): $F (1_{A}) = 1_{F (A)}$ ,
(c): $F (g \circ f) = F (g) \circ F (f)$ .

That is, F respects domains and codomains, identity arrows, and composition.

Definition A3.

For categories

C

,

D

and functors

F, G : C ⟶ D

a natural transformation

ϑ : F ⟶ G

is a family of arrows in

D

{(ϑ_{C} : F C ⟶ G C)}_{C \in C},

such that, for any

f : C ⟶ C^{'}

in

C

, one has

ϑ_{C^{'}} \circ F (f) = G (f) \circ ϑ_{C}

, that is, the following diagram commutes:

Definition A4.

In any category

C

, and object

0 is initial if for any object C there is a unique morphism $0 ⟶ C$ ,
1 is terminal if for any object C there is a unique morphism $C ⟶ 1$ .

Definition A5.

A subcategory

S

of a category

C

is a collection of some of the objects and some of the arrows of

C

, which includes with each arrow f both the object

dom f

and the object

cod f

, with each object s its identity arrow

1_{S}

and with each pair of composable arrows

s ⟶ s^{'} ⟶ s^{″}

their composite.

References

Oberkampf, W.L.; Roy, C.J. Verification and Validation in Scientific Computing; Cambridge University Press: New York, NY, USA, 2010. [Google Scholar]
Babuska, I.; Oden, J. Verification and validation in computational engineering and science: Basis concepts. Comput. Methods Appl. Mech. Eng. 2004, 193, 4057–4066. [Google Scholar] [CrossRef]
Keitel, H.; Karaki, G.; Lahmer, T.; Nikulla, S.; Zabel, V. Evaluation of coupled partial models in structural engineering using graph theory and sensitivity analysis. Eng. Struct. 2011, 33, 3726–3736. [Google Scholar] [CrossRef]
Dutailly, J.C. Hilbert Spaces in Modelling of Systems; 2014; 47p, Available online: https://hal.archives-ouvertes.fr/hal-00974251 (accessed on 1 August 2021).
Dutailly, J.C. Common Structures in Scientific Theories; 2014; 34p, Available online: https://hal.archives-ouvertes.fr/hal-01003869 (accessed on 1 August 2021).
Legatiuk, D.; Smarsly, K. An abstract approach towards modeling intelligent structural systems. In Proceedings of the 9th European Workshop on Structural Health Monitoring, Manchester, UK, 10–13 July 2018. [Google Scholar]
Nefzi, B.; Schott, R.; Song, Y.Q.; Staples, G.S.; Tsiontsiou, E. An operator calculus approach for multi-constrained routing in wireless sensor networks. In Proceedings of the 16th ACM International Symposium on Mobile Ad Hoc Networking and Computing, New York, NY, USA, 22–25 June 2015. [Google Scholar]
Vassilyev, S.N. Method of reduction and qualitative analysis of dynamic systems: I. J. Comput. Syst. Int. 2006, 45, 17–25. [Google Scholar] [CrossRef]
Vassilyev, S.N.; Davydov, A.V.; Zherlov, A.K. Intelligent control via new efficient logics. In Proceedings of the 17th World Congress The International Federation of Automatic Control, Seoul, Korea, 6–11 July 2008. [Google Scholar]
Gürlebeck, K.; Nilsson, H.; Legatiuk, D.; Smarsly, K. Conceptual modelling: Towards detecting modelling errors in engineering applications. Math. Methods Appl. Sci. 2020, 43, 1243–1252. [Google Scholar] [CrossRef]
Legatiuk, D.; Nilsson, H. Abstract modelling: Towards a typed declarative language for the conceptual modelling phase. In Proceedings of the 8th International Workshop on Equation-Based Object-Oriented Modeling Languages and Tools, Weßling, Germany, 1 December 2017. [Google Scholar]
Foley, J.D.; Breiner, S.; Subrahmanian, E.; Dusel, J.M. Operands for complex system design specification, analysis and synthesis. Proc. R. Soc. 2021, 477. [Google Scholar] [CrossRef]
Gürlebeck, K.; Hofmann, D.; Legatiuk, D. Categorical approach to modelling and to coupling of models. Math. Methods Appl. Sci. 2017, 40, 523–534. [Google Scholar] [CrossRef]
Behrisch, M.; Kerkhoff, S.; Pöschel, R.; Schneider, F.M.; Siegmund, S. Dynamical systems in categories. Appl. Categ. Struct. 2015, 25, 29–57. [Google Scholar] [CrossRef]
Spivak, D.; Kent, R. Ologs: A categorical framework for knowledge representation. PLoS ONE 2012, 7, e24274. [Google Scholar] [CrossRef] [PubMed]
Kavrakov, I.; Legatiuk, D.; Gürlebeck, K.; Morgenthal, G. A categorical perspective towards aerodynamic models for aeroelastic analyses of bridges. R. Soc. Open Sci. 2019, 6, 181848. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Holden, H.; Ksendal, B.; Ubøe, J.; Zhang, T. Stochastic Partial Differential Equations. A Modeling, White Noise Functional Approach; Springer Science+Business Media: New York, NY, USA, 2010. [Google Scholar]
Lurie, A.I. Theory of Elasticity; Foundations of Engineering Mechanics; Springer: Berlin/Heidelberg, Germany, 2005. [Google Scholar]
Gürlebeck, K.; Habetha, K.; Sprößig, W. Application of Holomorphic Functions in Two and Higher Dimensions; Springer International Publishing: Berlin/Heidelberg, Germany, 2016. [Google Scholar]
Kavrakov, I.; Morgenthal, G. A comparative assessment of aerodynamic models for buffeting and flutter of long-span bridges. Engineering 2017, 3, 823–838. [Google Scholar] [CrossRef]
Kavrakov, I.K.; Morgenthal, G. A synergistic study of a CFD and semi-analytical models for aeroelastic analysis of bridges in turbulent wind conditions. J. Fluids Struct. 2018, 82, 59–85. [Google Scholar] [CrossRef] [Green Version]
Awodey, S. Category Theory; Oxford University Press Inc.: New York, NY, USA, 2010. [Google Scholar]
Mac Lane, S. Categories for the Working Mathematician; Springer: New York, NY, USA, 1978. [Google Scholar]

Table 1. Sets of assumptions of beam theories.

Assumptions	${Set}_{B - E}$	${Set}_{R}$	${Set}_{T}$
1. Cross sections of a beam that are planes remain planes after the deformation process	+	+	+
2. Normal stresses on planes parallel to the axis of a beam are infinitesimal	+	+	+
3. A beam has a constant cross section	+	+	+
4. A beam is made of a homogeneous isotropic material	+	+	+
5. Cross sections of a beam perpendicular to its axis remain perpendicular to the deformed axis	+	+
6. Rotation inertia of cross sections of a beam is omitted	+

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Legatiuk, D. Mathematical Modelling by Help of Category Theory: Models and Relations between Them. Mathematics 2021, 9, 1946. https://doi.org/10.3390/math9161946

AMA Style

Legatiuk D. Mathematical Modelling by Help of Category Theory: Models and Relations between Them. Mathematics. 2021; 9(16):1946. https://doi.org/10.3390/math9161946

Chicago/Turabian Style

Legatiuk, Dmitrii. 2021. "Mathematical Modelling by Help of Category Theory: Models and Relations between Them" Mathematics 9, no. 16: 1946. https://doi.org/10.3390/math9161946

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Mathematical Modelling by Help of Category Theory: Models and Relations between Them

Abstract

1. Introduction

2. Categories of Mathematical Models

3. Relations between Mathematical Models

4. Convertible Mathematical Models

5. Illustrative Examples

5.1. Categorical Modelling of Beam Theories

5.2. Category of Aerodynamic Models Revisited

6. Further Characterisations of Mathematical Models and Conclusions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Some Basic Definitions from Category Theory

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI