Next Article in Journal
An Enhanced Naked Mole Rat Algorithm for Optimal Cross-Layer Solution for Wireless Underground Sensor Networks
Next Article in Special Issue
Parallel Algorithms for Solving Inverse Gravimetry Problems: Application for Earth’s Crust Density Models Creation
Previous Article in Journal / Special Issue
An Approach to Building Decision Support Systems Based on an Ontology Service
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modeling the Context of the Problem Domain of Time Series with Type-2 Fuzzy Sets

by
Anton A. Romanov
*,
Aleksey A. Filippov
,
Valeria V. Voronina
,
Gleb Guskov
and
Nadezhda G. Yarushkina
Department of Information Systems, Ulyanovsk State Technical University, 32 Severny Venetz Street, 432027 Ulyanovsk, Russia
*
Author to whom correspondence should be addressed.
Mathematics 2021, 9(22), 2947; https://doi.org/10.3390/math9222947
Submission received: 4 October 2021 / Revised: 4 November 2021 / Accepted: 17 November 2021 / Published: 18 November 2021
(This article belongs to the Special Issue Numerical Analysis and Scientific Computing)

Abstract

:
Data analysis in the context of the features of the problem domain and the dynamics of processes are significant in various industries. Uncertainty modeling based on fuzzy logic allows building approximators for solving a large class of problems. In some cases, type-2 fuzzy sets in the model are used. The article describes constructing fuzzy time series models of the analyzed processes within the context of the problem domain. An algorithm for fuzzy modeling of the time series was developed. A new time series forecasting scheme is proposed. An illustrative example of the time series modeling is presented. The benefits of contextual modeling are demonstrated.

1. Intro

The activity of any modern organization is associated with the timely adoption of urgent management decisions [1]. This is required to evaluate the states of some complex systems. The decision-maker (DM) requires knowledge of the specifics of the problem domain and the ability to use various tools for data analysis and working with existing knowledge. This process forms the requirements for data analysis methods. Methods should work in some contexts of analyzed object behavior. Methods consider features and limitations of analyzed objects. The same dataset in different subject areas can have different interpretations and analysis results.
The analysis of a set of the time series in [2,3] for process detection is used. The intelligent process analysis described by the authors does not apply knowledge about the correlation between objects of a problem area. The process analyses with intelligent methods in these works do not consider the features of the problem domain. The process detection quality improvement needs more complex models than quantitative indicators by a time series model.
Currently, a large number of works describe various approaches to the application of context in data mining [4,5,6,7,8,9]. The use of context will improve the quality and efficiency of data analysis presented by time series.
In the works considered, the context is formed based on ontologies. Ontologies are used for:
  • Increasing the interoperability of data processing systems represented by THE time series due to data unification based on ontological representations [4];
  • Improving the accuracy of search engine requests in predictive analytics [5];
  • Associations of the time series with messages in social networks, considering the specifics of the problem domain: company structure, names of employees, etc. [6];
  • The transition from numerical data to semantic fragments of the neural network structure [7];
  • Detecting changes in a group of related time series [9].
Modeling various processes and systems under uncertainty has many practical applications. Traditionally, time series models are used to model dynamics. These models allow reflecting the history of changes in process indicators. A time-series basis is a specific approach for analyzing numerical data, unlike, for example, classical statistical methods. Time series models allow reflecting the dependence of a sequence of the time series point values on points at previous moments. The problem is that the values of the indicators represented by a time series in complex systems can contain a cumulative assessment of the influence of several factors. The problem lies in improving the accuracy of the approximation of the modeled system based on time series models with uncertainty, noise, and insufficient data. Fuzzy time series models allow us to extract easily understandable rules for a researcher to perceive from historical data for analysis and forecasting tasks. However, several problems are still relevant to the solution. Time series modeling is often based only on the history of the data itself. That condition does not allow identifying the correct patterns, limitations, and features of the modeled processes. The main objects for building an accurate model of the analyzed process involve the information about changes in the indicators and the conditions of the solved problem. In other words, the analysis of the number sequence of different real-world objects may have different results [10,11]. The conditions of this modeling are called contextual modeling of the time series. Time series models, built based on type-2 fuzzy sets, can have more modeling uncertainty benefits. Their advantages are:
  • The ability to model the uncertainty of the choice of antecedents and consequents in the rules;
  • The ability to select the parameters of type-1 membership functions;
  • A reduction in the number of type-1 fuzzy labels while maintaining the quality of the time series approximation.
The task of the modeling context is closely related to granular computing. The common factor is that the modeling result is problem-oriented. The theory of granular computations to the time series analysis in the works by [12,13,14] is considered. Information granules presented by sets of fuzzy tendencies of the time series can make time series context modeling:
G = { T , D , I } ,
where
T—the direction of tendency;
D—linguistic value of duration;
I—linguistic value of intensity.
The choice of fuzzy tendencies for modeling information granules is because they contain information about the dynamics of the time series. In this case, the novelty is the use of type-2 fuzzy sets for information granule construction and the creation of rules set about the patterns of behavior of the time series. The second feature is that the model includes the context of the problem domain, which defines the conditions for modeling the time series.
Improving the quality of the time series modeling and forecasting by using only the history of changes has a natural limitation: it is impossible to create an immutable and accurate time-series mathematical model without using an object’s knowledge. We propose to append the time series model with fuzzy rules for adapting the base mathematical model (or a set of models) to the changing external conditions of the object’s functioning, which affect the changes in indicators.
Thus, context analysis means the models use additional information about the various conditions of the functioning of a given model to reduce the dimension of the analyzed data and increase the accuracy of modeling.

2. Related Work

The possibility of an integral representation of knowledge about the object behavior and its uncertainty is the main advantage of modeling information granules. The process of obtaining and presenting information granules is hierarchical and multi-stage [15]. Information granule modeling is a multidisciplinary process. Such modeling opens up opportunities for creating intelligent systems with the interpretation of modeling results [13]. In decision making, the problem domain can have different degrees of uncertainty: uncertainty in the input data, uncertainty in the principles of control, insufficient input data, noise in data, etc. Including expert knowledge about the processes is not always objectively accurate. The papers [16,17,18,19,20] discuss various methods to overcome these problems: decision trees, clustering, deep learning methods, ontological engineering, fuzzy logic, time series models, etc.
In decision-making systems, it is often necessary to analyze data with time variability. This increases the complication of the analysis due to the growing amount of data and the complication of the applied models. In some research works on dynamic data analysis, time series models with information granules have been used. The works [14,21] show that time series forecasting and reducing the data dimensions can be made by information granule modeling. The authors consider the approach of [22], based on type-2 fuzzy sets for justified granularity. Granules are created by a balance between their experimental rationale and semantics. The work [23] discusses the entropy approach for interval discretization of information granules when predicting prices in the stock exchange. The conclusion is that granular computing is a method of generalization in knowledge representation and as a basis for various modeling tools. Non-linear dependencies can be more accurately represented in information granule models in comparison to interval models [24].
Work [25] discusses the context concept in information retrieval, considering context as an extension of local information about the object under study with some global information. In the article [26], the citation context helps to rank the scientific publication citations’ importance and gives additional information used by machine learning models. In [27], this context helps to improve the accuracy of local methods for detecting objects in images. In this work, the context promotes obtaining information about related objects with the analyzed one.
Large volumes of data in the databases of information systems are a great source for analysis behavior on complex organizational and technical systems. Using such information for process identification is important for management [28].
Time series models can represent uncertainty by value or by time. The following approaches in works [12,29,30] was described:
  • Fuzzy Transform (F-transform) [29,30] is a soft computing approximation technique. The advantages of the F-transform in various applications are demonstrated, such as time series analysis and approximation, image compression, anomaly detection.
  • Time series fuzzification by time in [12] is described.
The use of high-order fuzzy sets helps to model the uncertainty of various real-world objects. The work [31] shows that uncertainty can be divided into random (inaccuracies in the processing of statistical signals) and linguistic (inaccuracies in the expert statements).
The hybridization of the time series modeling approaches allows the creation of intellectual methods for data processing and analysis for decision-making systems.

3. Time Series Model

A model of discrete numerical time series is created with preliminary fuzzification by type-2 fuzzy sets. This approach helps to simplify the procedure for forming a rule base for fuzzy inference when analyzing time series. The problem of determining the boundaries of intervals of type-2 fuzzy sets lies in solving them. The time series model considers the context of the problem domain with the conditions and the nature of its main tendencies. A triangular form of fuzzy sets is used due to the small computational complexity.
A discrete numerical time series is given as:
t s = { t s t } , t [ 0 , l ] , t N
where t s t —time series value at a time point t; l—length of the time series. At each moment t > 0 , the value of the tendency of the time series can be determined:
T e n d t = t s t t s t 1 ,
where T e n d t —a numerical representation of the direction and intensity of the tendency of a time series at a point in time t; t s t , t s t 1 —time series values at moment t and t 1 , respectively.
For fuzzy modeling of the time series tendencies, a universe of type-2 fuzzy sets is defined as: U = A ˜ i , and i N is the number of fuzzy sets in the universe.
Type-2 fuzzy sets can be represented as:
A ˜ = ( ( x , u ) , μ A ˜ ( x , u ) ) | x U x , u J x [ 0 , 1 ]
where x U x and u J x [ 0 , 1 ] in which 0 μ A ˜ ( x , u ) 1 , J x is the range of values of the function μ ( x ) . x—is a crisp time series value, and U x —is a universe of the time series values.
Lower membership function A ˜ is called the function μ ̲ A : U x [ 0 , 1 ] and defined as μ ̲ A ( x ) = inf J x , x U x .
Upper membership function A ˜ is called the function μ ¯ A : U x [ 0 , 1 ] and defined as μ ¯ A ( x ) = sup J x , x U x .
Type-2 fuzzy set A ˜ will be interval if μ A ( x , u ) = 1 x U x , u J x .
Time series modeling needs to define interval fuzzy sets and their shape. Figure 1 shows the appearance of the sets.
Triangular fuzzy sets are defined as follows:
A ˜ i = ( A ˜ i U , A ˜ i L ) = ( ( a i 1 u , a i 2 u , a i 3 u , h ( A ˜ i U ) ) , ( a i 1 l , a i 2 l , a i 3 l , h ( A ˜ i l ) ) ) .
where A ˜ i U and A ˜ i L are triangular type-1 fuzzy sets, a i 1 u , a i 2 u , a i 3 u , a i 1 l , a i 2 l , a i 3 l are reference points of type-2 interval fuzzy set A ˜ i , and h is the maximum value of the membership function of the element a i (for the upper and lower membership functions, respectively), means that ( ˜ A ) i depends of height of triangle.
An operation of combining fuzzy sets of type 2 is required when working with a rule base based on the values of a time series. The combined operation is defined as follows:
A ˜ 1 A ˜ 2 = ( A ˜ 1 U , A ˜ 1 L ) ( A ˜ 2 U , A ˜ 2 L ) = ( ( a 11 u + a 21 u , a 12 u + a 22 u , a 13 u + a 23 u ; m i n ( h 1 ( A ˜ 1 U ) , h 1 ( A ˜ 2 U ) ) ) , m i n ( h 2 ( A ˜ 1 U ) , h 2 ( A ˜ 2 U ) ) ) ; ( a 11 l + a 21 l , a 12 l + a 22 l , a 13 l + a 23 l ; m i n ( h 1 ( A ˜ 1 L ) , h 1 ( A ˜ 2 L ) ) , m i n ( h 2 ( A ˜ 1 L ) , h 2 ( A ˜ 2 L ) ) ) ;
Proposition 1.
A fuzzy time series model, reflecting the context of the problem domain, will be described by two sets of type-2 fuzzy labels:
t s ˜ = ( A ˜ , A ˜ C ) ,
where A ˜ —a set of type-2 fuzzy sets describing the tendencies of the time series obtained from the analysis of the points of the time series, | A ˜ | = l 1 ; A ˜ C —a set of type-2 fuzzy sets describing the trends of the time series obtained from the context of the problem domain of the time series, | A ˜ C | < < l 1 .
The component A ˜ of model (6) is extracted from time series values by fuzzifying all numerical representations of the time series tendencies. By the representation of information granules in the form of fuzzy tendencies of the time series (1), the numerical values of the tendencies are fuzzified: A t ˜ = μ ( T e n d t ) = μ ( t s t t s t 1 ) , t > 0 .
The component A ˜ C of model (6) by expert or analytical methods is formed and describes the most general behavior of the time series. This component is necessary for solving problems:
  • Justification of the choice of the boundaries of the type-2 fuzzy set intervals when modeling a time series.
  • Analysis and forecasting of a time series with a lack of data or when they are noisy.
Thus, the time series context, represented by the component A ˜ C of model (6), is determined by the following parameters:
  • Rate of tendency change A t ˜ C .
  • Number of tendency changes | A ˜ C | .

4. Modeling Algorithm

The modeling procedure contains the following steps:
  • Check the constraints of the time series: discreteness; length being more than two values.
  • Calculate the tendencies T e n d t of the time series by (3) at each moment t > 0 .
  • Determine the universe for the fuzzy values of the time series tendencies: U = A ˜ i , i N is the number of fuzzy sets in the universe. Type-2 fuzzy sets A ˜ are given by membership functions of a triangular form, and at the second level, they are intervals; see Figure 1.
  • By an expert or analytical method, obtain the rules from the time series as a set of pairs of type-2 fuzzy sets: R u l e s C = { R r C } , r N , where R r C is a pair ( A ˜ i C , A ˜ k C ) , A ˜ i C is the antecedent of the rules, A ˜ k C is the consequent of the rules and i , k are the indices specifying the relation of the consecutive in time of the antecedent and the consequent, i < k < = l .
  • Fuzzify the time series tendencies A ˜ = { A t ˜ } = { μ ( t s t t s t 1 ) } t [ 0 , , l ] and create a rule base: R u l e s = { R r C } , r N , where R k is a pair ( A ˜ i , A ˜ k ) , A i ˜ is the antecedent of the rules, A k ˜ is the consequent of the rules and i , k are the indices specifying the relation of the consecutive in time of the antecedent and the consequent, i < k < = l .
  • Merge the rule base with rules derived from the context: R u l e s R u l e s C .
  • Correct the fuzzy sets extracted from the time series along the context boundaries of the intervals of type-2 fuzzy sets:
    i n f ( A t ˜ ) = m a x ( i n f ( A ˜ t ) , i n f ( A ˜ t C ) ) s u p ( A t ˜ ) = m i n ( s u p ( A ˜ t ) , s u p ( A ˜ t C ) )
  • Apply the resulting rule base for fuzzy inference when modeling and predicting time series values.

5. Time Series Forecasting with Using of Context

The analysis of dynamic data using the context can be presented by the following scheme, Figure 2.
According to the presented scheme:
  • The source data of the proposed approach is the set of modeled objects O. The first operation of the procedure is forming a request by the decision-maker (DM) for managing interest objects, setting goals (block #1 of Figure 2).
  • Functions that help to study different object’s behavior:
    n u m ( O ) O n u m function for obtaining the values of the numerical attributes of an object or number of child objects of the same type.
    Function r u l e ( O ) O r u l e for obtaining object behavior, for example as a set of “if-then” rules: o i r u l e = { A , C } , where A is a set of antecedents, C is a set of consequences.
  • Merging of previous defined results is the function (block #2 of Figure 2).
    a g g r ( O n u m , O r u l e ) { o i n u m , O ^ r u l e } ,
    where O ^ r u l e is a subset of the behavior rules for a numeric attribute o i n u m .
  • Using several models with different approaches (statistical, neural, fuzzy) can provide better forecasting results (block #3.1). Studies show that using only one model for time series forecasting is not the best way [32,33,34].
Proposition 2.
The proposed forecasting scheme defines choosing the model by context information. The selection on the following criteria is based: tendencies, periodic component, length of the time series, presence of outliers, and noise.
  • According to the block #4 of the Figure 2 Then selection function id:
    S e l e c t M o d e l ( o i n u m , O ^ r u l e , T S ) { o i n u m , O ^ r u l e , T S ^ } ,
    where T S is a set of available time series models; T S ^ is a set of selected models.
The main task is to create a time series model that considers the constraints and conventions of the time series behavior. The behavior from the context of the object’s functioning is extracted:
  • Time series baseline y b a s e .
  • The base time series tendency t e n d b a s e .
  • Maximum/minimum time series values bounds b o u n d = ( y m i n , y m a x ) .
  • The time series tendencies change rate t e n d Δ .
  • Range of the time series non-anomalous values a c c e p t = ( y m i n a c c e p t , y m a x a c c e p t ) .
The context information can be divided into two classes: defining time series behavior (time series baseline, the primary tendency) and describing the modeling result, and evaluating forecasted values.
The formal definition of both classes is:
D e f = { y b a s e , t e n d b a s e } , D e s c = { b o u n d , t e n d Δ , a c c e p t , y b a s e , t e n d b a s e }
The time series model o i n u m can define as a combination of the following components:
{ o i n u m , O ^ r u l e , T S ^ , D e f ^ , D e s c ^ } ,
where T S ^ is a set of the time series models. T S ^ is the dynamic model of numerical attribute representation O n u m of control object. The model of formed with using non-time series components D e f ^ , D e s c ^ , as E x t r a c t ( O ^ r u l e ) ( D e f ^ , D e s c ^ ) .
Components of the time series model D e f ^ , D e s c ^ defined by the following expression:
y D e f = α y b a s e + β y t e n d b a s e ,
where α , β are the weight coefficients.
The model uses only contextual information and does not use the numerical time series values. The proposed contextual model of the time series with weights:
y m o d e l = w D e f y D e f + ( 1 w D e f ) y ,
where w D e f is a contextual time series model weight. The result of another selected time series model is y. Forecast values are weighted in the same way.
The D e c s model component validates modeling and forecasting results:
E r r v a l i d = i = 1 n i s V a l i d ( d e s c i ) n | D e s c | ,
where the function i s V a l i d ( d e s c i ) has a range [ 0 , 1 ] and helps to check the constraints.

6. Forming a Context for Time Series Analysis and Forecasting

Time series context modeling can use ontology as a knowledge base about domain objects. The ontology can contain an object’s relations, restrictions, and a set of properties. The ontology helps to select the best time series forecast method through using logical rules [35]. The set of rules is based on the time series properties, see (10).
Here, an example of ontology for context representation, not only for type 2 time series models, is described.
M O m O n u m T S D e f D e s c I n t e r v a l I n t e r v a l h a s N a m e . S t r i n g h a s N a m e . S t r i n g h a s M i n V a l u e . I n t e g e r h a s M i n V a l u e . I n t e g e r h a s M a x V a l u e . I n t e g e r h a s M a x V a l u e . I n t e g e r M h a s N a m e . S t r i n g h a s N a m e . S t r i n g h a s T e n d e n c y . B o o l e a n h a s T e n d e n c y . B o o l e a n h a s P e r i o d . B o o l e a n h a s P e r i o d . B o o l e a n h a s S e a s o n . B o o l e a n h a s S e a s o n . B o o l e a n h a s S m o o t h . B o o l e a n h a s S m o o t h . B o o l e a n h a s F u z z y . B o o l e a n h a s F u z z y . B o o l e a n l e n g t h . I n t e r v a l l e n g t h . I n t e r v a l O m h a s N a m e . S t r i n g h a s N a m e . S t r i n g h a s P r o p e r t y . O n u m D e f h a s N a m e . S t r i n g h a s N a m e . S t r i n g h a s B a s e . I n t e g e r h a s B a s e . I n t e g e r h a s T e n d B a s e . I n t e g e r h a s T e n d B a s e . I n t e g e r
O n u m h a s N a m e . S t r i n g h a s N a m e . S t r i n g h a s D e f . D e f h a s D e f . D e f h a s T S . T S D e s c h a s N a m e . S t r i n g h a s N a m e . S t r i n g h a s B a s e . I n t e g e r h a s B a s e . I n t e g e r h a s T e n d B a s e . I n t e g e r h a s T e n d B a s e . I n t e g e r h a s T e n d D e l t a . I n t e g e r h a s T e n d D e l t a . I n t e g e r h a s B o u n d . I n t e r v a l h a s B o u n d . I n t e r v a l h a s A c c e p t . I n t e r v a l h a s A c c e p t . I n t e r v a l T S h a s N a m e . S t r i n g h a s N a m e . S t r i n g h a s D e f . D e f h a s D e f . D e f ,
where I n t e r v a l is a concept representing an integer interval; h a s N a m e is a functional role for “has a name” axiom; h a s M i n V a l u e and h a s M a x V a l u e are functional roles for “has a minimal value” and “has a maximal value” axioms; S t r i n g is a string data type; I n t e g e r is an integer data type; M is a concept representing some method for analyzing or forecasting a time series; h a s T e n d e n c y is a functional role for “has the ability to work with tendencies” axiom; h a s P e r i o d is a functional role for “has the ability to work with periodicity” axiom; h a s S e a s o n is a functional role for “has the ability to work with seasonality” axiom; h a s S m o o t h is a functional role for “has the ability to use smoothing” axiom; h a s F u z z y is a functional role for “has the ability to use fuzzy values” axiom; l e n g t h is a functional role for “has an acceptable interval of the time series length” axiom; B o o l e a n is a boolean data type; O m is a concept representing some control object; h a s D e f is a functional role for “has a time series behavior” axiom; h a s P r o p e r t y is a functional role for “has a property” axiom; D e f is a concept representing a time series behavior; h a s B a s e is a functional role for “has a time series baseline” axiom; h a s T e n d B a s e is a functional role for “has a main tendency” axiom; O n u m is a concept representing a control object property; h a s D e s c is a functional role for “has a modeling result description” axiom; h a s T S is a functional role for “has a time series” axiom; D e s c is a concept representing a modeling result description; h a s T e n d D e l t a is a functional role for “has a rate of change of trends” axiom; h a s B o u n d is a functional role for “has a range of acceptable values” axiom; T S is a concept representing a time series of values of control object property.

7. Results

The proposed approach is demonstrated based on the problem of planning the production of an aircraft enterprise. Machining centers (machines) are connected to the monitoring system and collect information about the operating modes of the equipment. The purpose of collecting information is to obtain information about the equipment load. Depending on the volume of production, it is possible to receive feedback on the technical condition of the equipment and the compliance of the actual operating time of the equipment with the planned time.
The time series of the indicator “Percentage of equipment operation time” will be analyzed. This indicator reflects the efficiency of the use of equipment. The machining center may, at other times, be in a state of downtime, scheduled maintenance, changeover, or emergency repair. An indicator of effective planning of equipment operation modes will be the maximization of the percentage of equipment operation time according to the production program. The states of downtime, changeovers, planned repairs of the time spent and frequency of occurrence should be minimized. Furthermore, the state of emergency repair must be eliminated.
For example, consider the time series of the machining center. This time series characterizes the indicator “Percentage of equipment operating time”. In the time interval from May to December, the values contain the average monthly estimate. The time series is shown in Figure 3. The numerical data of the time series in Table 1 are given.
The time series model by using the algorithm described in Section 4 is formed. Previously, the time series could be smoothed by using existing methods to improve their tendencies. The time series presented in the experiment will not be smoothed since it is short and already contains aggregated values. Furthermore, smoothing will result in the loss of information.
The first step of the time series modeling is the computing of numerical tendencies. The result is shown in Table 2.
The universe of fuzzy sets must cover a numeric interval in the range [ 6.21 , 12.96 ] ; in other words, the boundaries as the minimum and maximum value of the obtained time series tendencies are defined.
As an example, let us choose three type-2 fuzzy sets for the fuzzification of a numerical interval. Such a rough partition will help to demonstrate the efficiency of correcting type-2 fuzzy set intervals boundaries. This results in a set with an indication of intervals for the lower and upper membership functions:
A ˜ = { A ˜ 1 ( 6.21 , 0.15 ) ( 7.21 , 1.15 ) , A ˜ 2 ( 0.15 , 6.2 ) ( 0.85 , 7.2 ) , A ˜ 3 ( 5.2 , 11.96 ) ( 4.2 , 12.96 ) } .
A set of fuzzy labels are created and extracted from the context by an expert method:
{ A ˜ 2 C , A 2 C , A 2 C , A ˜ 1 C , A ˜ 1 C , A ˜ 3 C , A ˜ 1 C } , A ˜ i C U .
where for type-2 fuzzy sets, A ˜ i C , the following parameters are defined:
A ˜ C = { A ˜ 1 C ( 6.21 , 0 ) ( 6 , 1 ) , A ˜ 2 ( 0.05 , 6.2 ) ( 1 , 8 ) , A ˜ 3 ( 5.2 , 11.96 ) ( 7 , 12 ) } .
Fuzzification of the original numerical time series is performed:
{ A ˜ 2 , A ˜ 2 , A ˜ 1 , A ˜ 1 , A ˜ 3 , A ˜ 1 , A ˜ 1 , A ˜ 1 } , A ˜ i U
The fuzzy rules of the extracted time-series tendencies are represented as follows:
R u l e s = { A ˜ 2 A ˜ 2 , A ˜ 2 A ˜ 1 , A ˜ 1 A ˜ 1 , A ˜ 1 A ˜ 3 , A ˜ 3 A ˜ 1 , A ˜ 1 A ˜ 1 , A ˜ 1 A ˜ 1 }
The fuzzy rules of the tendencies taken out of context are presented as follows:
R u l e s C = { A ˜ 2 C A ˜ 2 C , A ˜ 2 C A ˜ 1 C , A ˜ 1 C A ˜ 1 C , A ˜ 1 C A ˜ 3 C , A ˜ 3 C A ˜ 1 C }
The rule sets are merged:
R u l e s C R u l e s = { A ˜ 2 A ˜ 2 , A ˜ 2 A ˜ 1 , A ˜ 1 A ˜ 1 , A ˜ 1 A ˜ 3 , A ˜ 3 A ˜ 1 , A ˜ 1 A ˜ 1 , A ˜ 1 A ˜ 1 , A ˜ 2 C A ˜ 2 C , A ˜ 2 C A ˜ 1 C , A ˜ 1 C A ˜ 1 C , A ˜ 1 C A ˜ 3 C , A ˜ 3 C A ˜ 1 C }
Then, correction of type-2 fuzzy sets based on the intervals set boundaries A i C is performed. Figure 4 shows a time series based on this rule set.
Now, consider the case of a context change. The rule base of the sets A i C and the boundaries of the type-2 fuzzy set intervals will change. This case will be considered when the requirements for the equipment operability increase in production. Analysis of the textual information of such a case makes it possible to distinguish an “increase” tendency for the “equipment operability” indicator. This information will serve as a summary of the modified context. The rule base will be presented as follows:
A ˜ 2 C A ˜ 2 C , A ˜ 2 C A ˜ 1 C , A ˜ 1 C A ˜ 1 C , A ˜ 1 C A ˜ 2 C , A ˜ 2 C A ˜ 1 C
For type-2 fuzzy sets A ˜ i C , the following parameters are defined:
A ˜ C = { A ˜ 1 C ( 6.21 , 0 ) ( 3 , 1 ) , A ˜ 2 ( 0.05 , 6.2 ) ( 1 , 5 ) , A ˜ 3 ( 5.2 , 11.96 ) ( 5 , 10 ) } .
Figure 5 shows a time series based on the changed context.
We can then conclude that the context information can influence the construction of the time series model. The benefits of this approach would be:
  • The ability to adjust the time series model using external information from the context;
  • The ability to build a time series model when the number of A i components is limited (not enough time series points);
  • The ability to search for anomalies in the received time series to generate recommendations.
The difference from other modeling methods, such as methods of exponential smoothing, methods of fuzzy modeling, and compression of the time series [11], is the possibility of including external information from the context of the problem domain of the time series into the model. Analysis of texts can provide such information about the modeled object. Indicators in linguistic form can be expressed. Information from external knowledge bases can also be retrieved.

8. An Experiment on Contextual Time Series Forecasting

According to the Figure 2, the experiment involves working with context and time series models.
For example, we use the following fragment of the ontology that represents the context:
b o u n d : I n t e r v a l a c c e p t : I n t e r v a l o t D e s c : D e s c o t D e f : D e f o t T S : T S o p e r a t i o n a l T i m e : O n u m d e p a r t m e n t : O m o t T S , o t D e s c : h a s D e s c o p e r a t i o n a l T i m e , o t T S : h a s T S o p e r a t i o n a l T i m e , o t D e f : h a s D e f d e p a r t m e n t , o p e r a t i o n a l T i m e : h a s P r o p e r t y
The ontology describes the same object and its properties, as in the previous section. The main management object is department, the analyzed object property is operationalTime. In ontology, the time series represents property values otTS.
The enterprise equipment gives a lot of monitoring indicators. We analyze the operational time of each machine. The time series is one of the output datasets of block 1.
Time series model construction is according to block 3.1 of the scheme. For example, the following time series forecasting methods are used: exponential Holt time series model, exponential Holt–Winters time series model with seasonal component. For comparison, we use F-Transform [29] with the “If–Then” rule base.
The part of ontology that describes the exponential Holt time series model:
t i m e S e r i e s L e n g t h : I n t e r v a l h o l t : M t i m e S e r i e s L e n g t h , 5 : h a s M i n V a l u e t i m e S e r i e s L e n g t h , M A X : h a s M a x V a l u e h o l t , t r u e : h a s T e n d e n c y h o l t , f a l s e : h a s P e r i o d h o l t , f a l s e : h a s S e a s o n h o l t , t r u e : h a s S m o o t h h o l t , f a l s e : h a s F u z z y h o l t , t i m e S e r i e s L e n g t h : l e n g t h
The length of the forecast is 3 points, because of the small length of the time series.
Without using the context, we can build a forecast on a test interval, see Figure 6, a Forecast estimation by the SMAPE criterion [36] is made.
As the next step, according to the expression (13), we can build forecasts for the time series, see Figure 7.
According to block 4, forecasts can use context information for the time series model (10):
b o u n d , 0 : h a s M i n V a l u e b o u n d , 100 : h a s M a x V a l u e a c c e p t , 50 : h a s M i n V a l u e a c c e p t , 60 : h a s M a x V a l u e o t D e s c , 50 : h a s B a s e o t D e s c , 0 : h a s T e n d B a s e o t D e s c , 5 : h a s T e n d D e l t a o t D e s c , b o u n d : h a s B o u n d o t D e s c , a c c e p t : h a s A c c e p t o t D e f , 50 : h a s B a s e o t D e f , 0 : h a s T e n d B a s e
The model parameter w D e f is set equality to 0.3. The weight of the numerical model forecast in contrast with the context forecast is greater.
The formed context model components allow the following actions:
  • Check and filter the model restrictions on seasonality, frequency. Holt, Holt–Winters, F-Transform time series models satisfy this requirement.
  • The following forecasting models satisfy the expression (14): Holt, Holt–Winters. The F-transform method with the obtained value of the last point at 47.81 is not suitable by the requirement: the forecast values are within acceptable boundaries (50, 60).
We can conclude that the forecast of the operational time of the equipment in the department will decrease. The forecasted levels of the time series match with the contextual restrictions.
The model components can take the following form in case of context changes (e.g., production volume increases):
b o u n d , 0 : h a s M i n V a l u e b o u n d , 100 : h a s M a x V a l u e a c c e p t , 50 : h a s M i n V a l u e a c c e p t , 60 : h a s M a x V a l u e o t D e s c , 50 : h a s B a s e o t D e s c , 1 : h a s T e n d B a s e o t D e s c , 5 : h a s T e n d D e l t a o t D e s c , b o u n d : h a s B o u n d o t D e s c , a c c e p t : h a s A c c e p t o t D e f , 50 : h a s B a s e o t D e f , 1 : h a s T e n d B a s e
The forecast for these conditions show on the Figure 8.
This forecast is given as an example demonstrating the change in forecast results due to the context change.
The Holt–Winters time series model does not satisfy restrictions (Equation (14)), although it shows the best estimate for SMAPE. It needs to select another model for time series forecasting.
The experiment shows that the context information in time series modeling allows increasing the forecasting quality due to choosing a better model or a model that matches the decision-maker expectations.

9. Discussion

The described results show that the time series model can widely use the concept of an information granule. Time series granulation can be based on model components, such as tendencies and others. However, an information granule can reflect many other attributes of objects in the problem area: expert opinions, knowledge and correlation with other object parameters. The opportunity lies in the hybridization of the time series analysis methods, considering the context of the problem area.
Another main advantage of the proposed approach is the explanatory component. Recommender systems based on the proposed principle can, at any time, provide confirmation of the issued conclusion. This is possible due to the connection between the analyzed time series and the objects of the real world. For example, during the analysis of the text materials of the context, it will be obvious which parameter of which object will be analyzed. The fuzzy models help develop easy-to-use numerical representations of indicators, or the most general, fuzzy ones.
The proposed approach at every stage of analytics of the dynamic data can be embedded: descriptive analytics, predictive analytics, and prescriptive analytics. The advantage is that we can analyze a separate indicator and form conclusions about the forecast, and vice versa, the use of predicting and prescription for the behavior of some objects helps to describe others.
The disadvantages of the proposed approach include high computational complexity, the use of an expert to form the ontology of the problem area, or the use of methods for extracting and studying object knowledge.
Therefore, the goal of further research will be an extended analysis of the dynamic data of related objects in context. The task of advanced fuzzy model studies is to consider the restrictions on the choice of intervals of type-2 fuzzy sets when approximating time series.

Author Contributions

Conceptualization, A.A.R.; Data curation, A.A.F.; Formal analysis, V.V.V.; Methodology, N.G.Y.; Visualization, G.G. All authors contributed equally. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Ministry of Science and Higher Education of the Russian Federation in the framework of state task no. 075-00233-20-05 “Research of intelligent predictive multimodal analysis of big data, and the extraction of knowledge from different sources”. This research was funded by RFBR and the government of Ulyanovsk, according to research projects: 18-47-732016, 18-47-730022, 19-47-730005, and 18-47-730019.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hu, X.; Yang, S.; Zhu, Y.-R. Multiple attribute decision-making based on three-parameter generalized weighted heronian mean. Mathematics 2021, 9, 1363. [Google Scholar] [CrossRef]
  2. Dunkl, R.; Rinderle-Ma, S.; Grossmann, W.; Froschl, K.A. A method for analyzing time series data in process mining: Application and extension of decision point analysis. In Proceedings of the International Conference on Advanced Information Systems Engineering, CAiSE Forum 2014, Thessaloniki, Greece, 16–20 June 2014; pp. 68–84. [Google Scholar]
  3. Dunkl, R.; Rinderle-Ma, S.; Grossmann, W.; Froschl, K.A. Decision point analysis of the time series data in process-aware information systems. In Proceedings of the Pre-proceedings of CAISE’14 Forum, Thessaloniki, Greece, 16–20 June 2014; pp. 33–40. [Google Scholar]
  4. Henson, C.A.; Neuhaus, H.; Sheth, A.P.; Thirunarayan, K.; Buyya, R. An ontological representation of the time series observations on the semantic sensor web. In Proceedings of the 1st International Workshop on the Semantic Sensor Web, Heraklion, Greece, 1 June 2009; pp. 79–95. [Google Scholar]
  5. Li, Z.; Xu, W.; Zhang, L.; Lau, R.Y. An ontology-based Web mining method for unemployment rate prediction. Decis. Supp. Syst. 2014, 66, 114–122. [Google Scholar] [CrossRef]
  6. Qu, H.; Sardelich Nascimento, M.; Qomariyah, N.N.; Kazakov, D.L. Integrating time series with social media data in an ontology for the modelling of extreme financial events. In Proceedings of the LREC 2016 Proceedings, European Language Resources Association (ELRA), Portorož, Slovenia, 23–28 May 2016; pp. 57–63. [Google Scholar]
  7. Deng, S.; Pan, J.Z.; Chen, J.; Chen, H. Deep Learning for Knowledge-Driven Ontology Stream Prediction. In Proceedings of the China Conference on Knowledge Graph and Semantic Computing, Hangzhou, China, 24–27 August 2019; Springer: Singapore, 2019; Volume 957, pp. 52–54. [Google Scholar]
  8. Chou, J.S.; Tran, D.S. Forecasting energy consumption time series using machine learning techniques based on usage patterns of residential householders. Energy 2018, 165, 709–726. [Google Scholar] [CrossRef]
  9. Chen, X.C.; Steinhaeuser, K.; Boriah, S.; Chatterjee, S.; Kumar, V. Contextual time series change detection. In Proceedings of the 2013 SIAM International Conference on Data Mining, Austin, TX, USA, 2–4 May 2013; pp. 503–511. [Google Scholar]
  10. Romanov, A.; Yarushkina, N.; Filippov, A. Application of Time Series Analysis and Forecasting Methods for Enterprise Decision-Management. Lect. Notes Comput. Sci. 2020, 12415, 326–337. [Google Scholar]
  11. Romanov, A.; Filippov, A.; Yarushkina, N. Extraction and forecasting time series of production processes. In Proceedings of the International Conference on Information Technologies, Saratov, Russia, 7–8 February 2019; pp. 173–184. [Google Scholar]
  12. Romanov, A.; Voronina, V.; Guskov, G.; Moshkina, I.; Yarushkina, N. Discrete and fuzzy models of the time series in the tasks of forecasting and diagnostics. Axioms 2020, 9, 49. [Google Scholar] [CrossRef]
  13. Pedrycz, W. Interpretable Artificial Intelligence: A Perspective of Granular Computing; Springer Nature: Cham, Switzerland, 2021; 937p. [Google Scholar]
  14. Dong, R.; Pedrycz, W. A granular time series approach to long-term forecasting and trend forecasting. Phys. A Stat. Mech. Appl. 2008, 387, 3253–3270. [Google Scholar] [CrossRef]
  15. Zadeh, L.A. Toward a theory of fuzzy information granulation and its centralityin human reasoning and fuzzy logic. Fuzzy Sets Syst. 1997, 90, 111–127. [Google Scholar] [CrossRef]
  16. Minaev, Y.N.; Filimonova, O.Y.; Minaeva, Y.I. Granular, Fuzzy Set and Tesor-Trace Characteristics of Multidimensional Time Series. Elect. Simul. 2017, 39, 51–74. (In Russian) [Google Scholar]
  17. Yao, Y. Three-way decision and granular computing. Int. J. Approx. Reason. 2018, 103, 107–123. [Google Scholar] [CrossRef]
  18. Singh, P.; Dhiman, G. A hybrid fuzzy time series forecasting model based on granular computing and bio-inspired optimization approaches. J. Comput. Sci. 2018, 27, 370–385. [Google Scholar] [CrossRef]
  19. Leng, J.; Chen, Q.; Mao, N.; Jiang, P. Combining granular computing technique with deep learning for service planning under social manufacturing contexts. Know.-Based Syst. 2018, 143, 295–306. [Google Scholar] [CrossRef]
  20. Lisi, F.A.; Mencar, C. A granular computing method for OWL ontologies. Fund. Inf. 2018, 159, 147–174. [Google Scholar] [CrossRef]
  21. Bose, M.; Mali, K. Designing fuzzy time series forecasting models: A survey. Int. J. Approx. Reason. 2019, 111, 78–99. [Google Scholar] [CrossRef]
  22. Liu, S.; Pedrycz, W.; Gacek, A.; Dai, Y. Development of information granules of higher type and their applications to granular models of the time series. Eng. Appl. Artif. Intell. 2018, 71, 60–72. [Google Scholar] [CrossRef]
  23. Chen, M.Y.; Chen, B.T. A hybrid fuzzy time series model based on granular computing for stock price forecasting. Inform. Sci. 2015, 294, 227–241. [Google Scholar] [CrossRef]
  24. Maciel, L.; Ballini, R.; Gomide, F. Evolving granular analytics for interval time series forecasting. Granul. Comput. 2016, 1, 213–224. [Google Scholar] [CrossRef] [Green Version]
  25. Xu, J.; Croft, W.B. Improving the effectiveness of information retrieval with local context analysis. ACM Trans. Inf. Syst. (TOIS) 2000, 18, 79–112. [Google Scholar] [CrossRef]
  26. Hassan, S.U.; Safder, I.; Akram, A.; Kamiran, F. A novel machine-learning approach to measuring scientific knowledge flows using citation context analysis. Scientometrics 2018, 116, 973–996. [Google Scholar] [CrossRef]
  27. Liang, X.; Zhang, J.; Zhuo, L.; Li, Y.; Tian, Q. Small object detection in unmanned aerial vehicle images using feature fusion and scaling-based single shot detector with spatial context analysis. IEEE Trans. Circuits Syst. Video Technol. 2019, 30, 1758–1770. [Google Scholar] [CrossRef]
  28. Alalwan, J.A.; Thomas, M.A.; Weistroffer, H.R. Decision support capabilities of enterprise content management systems: An empirical investigation. Decis. Support Syst. 2014, 68, 39–48. [Google Scholar] [CrossRef] [Green Version]
  29. Martino, F.D.; Perfilieva, I.; Sessa, S. A Summary of F-Transform Techniques in Data Analysis. Electronics 2021, 10, 1771. [Google Scholar] [CrossRef]
  30. Novák, V.; Mirshahi, S. On the Similarity and Dependence of Time Series. Mathematics 2021, 9, 550. [Google Scholar] [CrossRef]
  31. Mendel, J.M.; John, R.I.B. Type-2 Fuzzy Sets Made Simple. IEEE Trans. Fuzzy Syst. 2002, 10, 117–127. [Google Scholar] [CrossRef]
  32. Fildes, R.; Hibon, M.; Makridakis, S.; Meade, N. Generalising about univariate forecasting methods: Further empirical evidence. Int. J. Forecast. 1998, 14, 339–358. [Google Scholar] [CrossRef]
  33. Makridakis, S.; Hibon, M. The M3-Competition: Results, conclusions and implications. Int. J. Forecast. 2000, 16, 451–476. [Google Scholar] [CrossRef]
  34. Small, G.; Wong, R. The validity of forecasting. In Proceedings of the Pacific Rim Real Estate Society International Conference, Christchurch, New Zealand, 21–23 January 2002; pp. 1–14. [Google Scholar]
  35. Romanov, A.; Yarushkina, N.; Filippov, A. Application of Time Series Analysis and Forecasting Methods for Enterprise Decision-Management. In Proceedings of the International Conference on Artificial Intelligence and Soft Computing, Zakopane, Poland, 12–14 October 2020; Lecture Notes in Computer Science. Rutkowski, L., Scherer, R., Korytkowski, M., Pedrycz, W., Tadeusiewicz, R., Zurada, J.M., Eds.; Springer: Cham, Switzerland, 2020; Volume 12415. [Google Scholar]
  36. SMAPE Criterion by Computational Intelligence in Forecasting (CIF). Available online: http://irafm.osu.cz/cif/main.php (accessed on 4 November 2021).
Figure 1. The shape of the upper and lower membership functions.
Figure 1. The shape of the upper and lower membership functions.
Mathematics 09 02947 g001
Figure 2. Analyzing dynamic data with the context considering.
Figure 2. Analyzing dynamic data with the context considering.
Mathematics 09 02947 g002
Figure 3. Time series of equipment operating time percentage.
Figure 3. Time series of equipment operating time percentage.
Mathematics 09 02947 g003
Figure 4. Time series representation based on the rule set.
Figure 4. Time series representation based on the rule set.
Mathematics 09 02947 g004
Figure 5. Time series representation based on modified context.
Figure 5. Time series representation based on modified context.
Mathematics 09 02947 g005
Figure 6. Forecast of % of operational time of department.
Figure 6. Forecast of % of operational time of department.
Mathematics 09 02947 g006
Figure 7. Forecast of % of operational time of department with using of contextual information.
Figure 7. Forecast of % of operational time of department with using of contextual information.
Mathematics 09 02947 g007
Figure 8. Forecast of % of the operational time of the department with use of contextual information.
Figure 8. Forecast of % of the operational time of the department with use of contextual information.
Mathematics 09 02947 g008
Table 1. Time series of equipment operating time percentage.
Table 1. Time series of equipment operating time percentage.
Monthly Time PointWorking Hours According to the Program, %
30 April 202037.13
31 May 202039.89
30 June 202041.6
30 July 202035.39
30 August 202033.46
30 September 202045.42
30 October 202041.99
30 November 202040.93
30 December 202036.07
Table 2. Time series tendencies.
Table 2. Time series tendencies.
Monthly Time PointValue of the Tendency
31 May 20202.76
30 June 20201.71
30 July 2020−6.21
30 August 2020−1.93
30 September 202011.96
30 October 2020−3.43
30 November 2020−1.06
30 December 2020−4.86
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Romanov, A.A.; Filippov, A.A.; Voronina, V.V.; Guskov, G.; Yarushkina, N.G. Modeling the Context of the Problem Domain of Time Series with Type-2 Fuzzy Sets. Mathematics 2021, 9, 2947. https://doi.org/10.3390/math9222947

AMA Style

Romanov AA, Filippov AA, Voronina VV, Guskov G, Yarushkina NG. Modeling the Context of the Problem Domain of Time Series with Type-2 Fuzzy Sets. Mathematics. 2021; 9(22):2947. https://doi.org/10.3390/math9222947

Chicago/Turabian Style

Romanov, Anton A., Aleksey A. Filippov, Valeria V. Voronina, Gleb Guskov, and Nadezhda G. Yarushkina. 2021. "Modeling the Context of the Problem Domain of Time Series with Type-2 Fuzzy Sets" Mathematics 9, no. 22: 2947. https://doi.org/10.3390/math9222947

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop