Next Article in Journal
Statistical Tests for Extreme Precipitation Volumes
Next Article in Special Issue
Differential Evolution for Neural Networks Optimization
Previous Article in Journal
Parameter Estimation of Lindley Distribution Based on Progressive Type-II Censored Competing Risks Data with Binomial Removals
Previous Article in Special Issue
On the Efficacy of Ensemble of Constraint Handling Techniques in Self-Adaptive Differential Evolution
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Dynamic Parallel Mining Algorithm of Association Rules Based on Interval Concept Lattice

1
College of Science, North China University of Science and Technology, 21 Bohai Road, Tangshan 063210, China
2
Hebei Key Laboratory of Data Science and Application, 21 Bohai Road, Tangshan 063210, China
3
Tangshan Key Laboratory of Engineering Computing, 21 Bohai Road, Tangshan 063210, China
4
Department of mathematics and information sciences, Tangshan Normal University, No. 156 Jianshe North Road, Tangshan 063009, China
*
Author to whom correspondence should be addressed.
Mathematics 2019, 7(7), 647; https://doi.org/10.3390/math7070647
Submission received: 19 May 2019 / Revised: 11 July 2019 / Accepted: 17 July 2019 / Published: 19 July 2019
(This article belongs to the Special Issue Evolutionary Algorithms in Intelligent Systems)

Abstract

:
An interval concept lattice is an expansion form of a classical concept lattice and a rough concept lattice. It is a conceptual hierarchy consisting of a set of objects with a certain number or proportion of intent attributes. Interval concept lattices refine the proportion of intent containing extent to get a certain degree of object set, and then mine association rules, so as to achieve minimal cost and maximal return. Faced with massive data, the structure of an interval concept lattice is more complex. Even if the lattice structures have been united first, the time complexity of mining interval association rules is higher. In this paper, the principle of mining association rules with parameters is studied, and the principle of a vertical union algorithm of interval association rules is proposed. On this basis, a dynamic mining algorithm of interval association rules is designed to achieve rule aggregation and maintain the diversity of interval association rules. Finally, the rationality and efficiency of the algorithm are verified by a case study.

1. Introduction

A concept lattice [1] is a conceptual hierarchy constructed according to the binary relationship between objects and attributes in data sets. As an effective tool for knowledge representation, a concept lattice is widely used in knowledge discovery, rule mining, information retrieval, and other fields because of its accuracy and completeness [2,3,4].
Concept lattice theory mainly focuses on the following aspects: A concept lattice extent model [5,6,7], concept lattice construction and rule extraction [8,9,10], concept lattice merging [11,12,13,14,15], concept lattice reduction [16,17], concept lattice modification [18], etc.
In a classic concept lattice, the concept extents have all the attributes or only one attribute, sometimes. Hence, the support and confidence degree of the extracted association rules would be reduced greatly. To solve this problem, the authors have put forward a new concept lattice structure: An interval concept lattice, and the construction methods, compression, and maintenance of lattice structure were studied [19,20,21,22].
From the perspective of a concept lattice, the relationship between intents is association rules, while the relationship between extents is its embodiment. A concept lattice is the unity of intent and extent, and the relationship between its nodes also reflects the relationship between the generalization and specialization of concepts. Therefore, a concept lattice is suitable for the application of a basic data structure in association rules mining. Many scholars have conducted in-depth research on rule mining based on a concept lattice [23,24,25,26,27,28].
Previously, we studied the structure characteristics of an interval concept lattice and gave two measurement standards of the uncertainty rule—precision and uncertainty. Then, a mining model of interval association rule with parameters was constructed [29]. The algorithm can mine and optimize rules according to the adjustment of parameters, which is of great significance to the mining of rules with uncertainties. Next, the complex relationship between interval parameters and association rules and the optimization algorithm of the rule base were given [30]. By adjusting the parameters, the purpose of controlling and optimizing rules was achieved.
In the era of data explosion, people’s demand for data processing is getting higher and higher. The real-time updating of data requires the efficient processing of dynamic data. For example, in the supermarket shopping system, the massive transaction information generated every day can only mine local association rules, but cannot provide a timely and accurate decision-making plan for decision-makers as a whole. However, the time and space complexity of the process will increase rapidly with the increase of the amount of data, and the mining association rules will be missing. Therefore, it is necessary to study the dynamic mining of interval association rules in order to grasp uncertain rules in real time.
The authors have studied the consistency of interval concept lattices, discussed the decision theorem of the concept of consistent intent, and designed a vertical union algorithm of interval concept lattices based on the breadth-first principle [31]. Furthermore, the sequential traversal method was used to scan the lattice structure, and a union algorithm of interval concept lattices was proposed from a transverse point of view [32]. Based on the research results of the union algorithm of interval concept lattice, this paper carries out the dynamic mining of interval association rules.

2. Concepts and Methods

2.1. Interval Concept Lattice

Definition 1
([33]).For the formal context ( U , A , R ) , where U = { x 1 , x 2 , , x 3 } is the object sets and each x i ( i n ) denotes an object; A = { a 1 , a 2 , , a m } is the attribute set, and each a j ( j m ) denotes an attribute; R is the binary relationship between U and A . R U × a . If ( x , a ) R , then we record that x has the attribute a , and write as x R a .
Definition 2
([33]).For the formal context ( U , A , R ) , operators f , g are defined as follows:
  • x U , f ( x ) = { y | y A , x R y } , i.e., f is the mapping between x and its attributes;
  • y a , g ( y ) = { x | x U , x R y } , i.e., g is the mapping between y and its objects.
Definition 3
([33]).For the formal context ( U , A , R ) , if f ( X ) = Y , g ( Y ) = X for X U , Y A , then the sequence < X , Y > is called a formal concept, or concept for short. X is the extent and Y is the intent.
Rough concept lattice R L ( U , A , R ) based on rough set theory was studied in Reference [7], where the upper approximation extent and lower approximation extent refer to the maximal concept set and the minimal concept set respectively which have all the attributes in Y A .
Definition 4
([33]).For the formal context ( U , A , R ) and its rough concept lattice R L ( U , A , R ) , ( M , N , Y ) is the rough concept. Set an interval [ α , β ] ( 0 α β 1 ) , then α upper bound extent M α and β lower bound extent M β are:
M α = { x | x M , | f ( x ) Y | / | Y | α , 0 α 1 }
M β = { x | x M , | f ( x ) Y | / | Y | β , 0 α β 1 }
Y is the concept intent and | Y | is the number of elements in Y , that is base number. M α refers to the objects which may be covered by α × | Y | attributes or more in Y . M β refers to the objects which may be covered by β × | Y | attributes or more in Y .
Definition 5
([33]).Let ( U , A , R ) be a formal context and ( M α , M β , Y ) be an interval concept. Then, Y is the intent; M α is the α upper bound extent and M β is the β lower bound extent.
Definition 6.
Suppose that ( U , A , R ) has two interval concepts, ( M 1 α , M 1 β , Y 1 ) and ( M 2 α , M 2 β , Y 2 ) . If the two meet Y 1 Y 2 , | Y 2 | | Y 1 | = 1 , M 1 α = M 2 α and M 1 β = M 2 β , then ( M 1 α , M 1 β , Y 1 ) is called the redundant concept.
Definition 7.
Suppose that ( U , A , R ) has an interval concept, ( M α , M β , Y ) . If it meets M α = M β = , then ( M α , M β , Y ) is called the empty concept.
Definition 8.
Suppose ( U , A , R ) has an interval concept, C = ( M α , M β , Y ) . If C is neither the redundant concept nor the empty concept, then C is called the existence concept. L α β ( U , A , R ) is a collection of all the existence concepts.
Definition 9.
L α β ( U , A , R ) ¯ refers to all the [ α , β ] interval concepts, which include: Existence concepts, redundant concepts, and empty concepts, that is:
( M 1 α , M 1 β , Y 1 ) ( M 2 α , M 2 β , Y 2 ) Y 1 Y 2 ,
Then “≤”is called the partial order relationship of L α β ( U , A , R ) ¯ .
Definition 10.
If all the concepts in L α β ( U , A , R ) ¯ meet “≤”, then L α β ( U , A , R ) is called interval concept lattice on the formal context ( U , A , R ) .
Definition 11.
In the interval concept lattice L α β ( U , A , R ) , if C = ( M α , M β , Y ) L α β ( U , A , R ) , then the layer of the Lattice Structure is | A | + 1 and node C is at Layer | Y | . In particular, when Y = , C was recorded on the zeroth layer.

2.2. Interval Association Rules

Formal context ( U , A , R ) can describe a database, where U represents an object set; A represents an attribute set. For x U , a A , x R a represents the item-set where a belongs to x .
Definition 12.
Given the minimal support threshold θ , for any interval concept node C , if the number of objects in the upper bound extent is not less than | U | × θ , then C is called the α -upper bound frequent node, and the corresponding Y is called the α -upper bound frequent item-set; if the number of objects in the lower bound extent is not less than | U | × θ , then C is called the β -lower bound frequent node, the corresponding Y is called the β -lower bound frequent item-set.
The father and son concept in the interval concept lattice does not have a specific relationship in frequency, which is different from the classical concept lattice.
If the association rule A B corresponds to the interval concept node ( C 1 , C 2 ) ( C 1 = ( M 1 α , M 1 β , Y 1 ) , C 2 = ( M 2 α , M 2 β , Y 2 ) ) and C 1 C 2 , then rule A B is generated by node binary ( C 1 , C 2 ) .
The α -upper bound association rules and the β -lower bound association rules can be extracted by two lower bound extents of the interval concept. The calculation methods of confidence and support are as follows:
The α -upper bound rule A B :
C o n f ( A B ) = | M 2 α | / | M 1 α |
S u p p o r t ( A B ) = | M 2 α | / | U |
The β -lower bound A B :
C o n f ( A B ) = | M 2 β | / | M 1 β |
S u p p o r t ( A B ) = | M 2 β | / | U |
Definition 13.
Given the minimal support threshold θ and the minimal confidence threshold c . Node binary ( C 1 , C 2 ) is called α -upper bound candidate binary which consists of two frequent concept nodes C 1 = ( M 1 α , M 1 β , Y 1 ) and C 2 = ( M 2 α , M 2 β , Y 2 ) , where M 2 α M 1 α and | M 2 α | / | M 1 α | c . When ( C 1 , C 2 ) meets M 2 β M 1 β and | M 2 β | / | M 1 β | c , it is called β -lower bound candidate binary.
Definition 14.
For interval association rule A B , if A B is frequent item-sets, S u p p o r t ( A B ) θ and C o n f ( A B ) c , i.e., | P ( A B ) | / | P ( A ) | c , then it is called the strong association rule.
Definition 15.
If A B is the strong association rule, then C D must be the strong association rule, then we say “ A B can derive C D ”.
Theorem 1.
If C D , then A B can derive A C .
Proof. 
C B | C | < | B | | A C | < | A B | S u p p o r t ( A C ) = | C | < | U | < S u p p o r t ( A B ) = | B | / | U | and C o n f ( A C ) = | A C | / | A | < C o n f ( A B ) = | A B | / | A | .□
Theorem 2.
In the interval concept lattice, if ( C 1 , C 2 ) and ( C 1 , C 3 ) are candidate binaries and C 3 > C 2 , then all the rules in R u l e s ( C 1 , C 3 ) can be derived from R u l e s ( C 1 , C 2 ) .
Definition 16.
Suppose A B is α -upper bound association rule derived from ( C 1 , C 2 ) . The upper bound extent of C 1 is M 1 α = { x 1 , x 2 , , x m , } . The intent of C 1 is Y 1 .The upper bound extent of C 2 is M 2 α = { o 1 , o 2 , , o m , } . The intent of C 2 is Y 2 . The precision of A B is
P D A B = min { min i = 1 m | x i . Y Y 1 | | Y 1 | , min i = 1 n | o i . Y Y 2 | | Y 2 | }
The uncertainty of A B is U D A B = 1 P D A B .
Definition 17.
Let Ω = { R u l e 1 , R u l e 2 , , R u l e k } denotes to α -rules set, the uncertainty of R u l e i is U D α R i , then the uncertainty of α -rules sets is
U D α R l u e s s e t = max i = 1 k ( U D α R i )
Let Ω = { R u l e 1 , R u l e 2 , , R u l e m } denote to β -rules set, the uncertainty of R u l e j is U D β R i , then the uncertainty of β -rules sets is
U D β R l u e s s e t = max j = 1 m ( U D β R i )
and the uncertainty of interval association rules is U D = max ( U D α R l u e s s e t , U D β R l u e s s e t ) .
Theorem 3.
Suppose the minimal support threshold θ and the minimal confidence threshold c . In the same context, when one of the interval parameters α and β is unchanged and the other is larger, the process of extracting association rules will change as follows:
(1) 
The number of frequent nodes generated does not increase;
(2) 
The node with the largest intent in frequent nodes does not increase in intent cardinality.
(3) 
The number of candidate binary arrays generated does not increase;
(4) 
The number of generated association rules does not increase.

3. Algorithm and Results

In the mining method of interval association rules, the lattice structure is united with the vertical union algorithm first, and then the association rules are extracted by using the parametric association rules mining algorithm. Because the structure of the interval concept lattice is complex, the memory space required is large, and the time complexity of mining interval association rules after vertical merging is high, this method does not meet the requirements of the current era of large data. In this section, the principle of a vertical union algorithm of interval association rules is proposed, and on this basis, a dynamic mining algorithm of interval association rules is designed.

3.1. Vertical Union Principle of Interval Association Rules

If the interval concept lattices L α β ( U 1 , A 1 , R 1 ) ¯ and L α β ( U 2 , A 2 , R 2 ) ¯ are consistent and A 1 = A 2 = A . U 1 U 2 = φ , then L α β ( U , A , R ) ¯ can be gotten through the union of the two. Suppose the minimal support threshold θ and the minimal confidence threshold c , we can extract the association rules of L α β ( U 1 , A , R 1 ) ¯ and L α β ( U 2 , A , R 2 ) ¯ . Here, α R l u e s s e t * is the upper bound association rules set derived from L α β ( U 1 , A , R 1 ) ¯ . α R l u e s s e t * * is the upper bound association rules set derived from L α β ( U 2 , A , R 2 ) ¯ ; α R l u e s s e t is the upper bound association rules set derived from L α β ( U , A , R ) ¯ .
C 1 * = ( M 1 α * , M 1 β * , Y 1 ) ,   C 2 * = ( M 2 α * , M 2 β * , Y 2 )   and   C 1 * , C 2 * L α β ( U 1 , A , R 1 ) ¯ ;
C 1 * * = ( M 1 α * * , M 1 β * * , Y 1 ) ,   C 2 * * = ( M 2 α * * , M 2 β * * , Y 2 )   and   C 1 * * , C 2 * * L α β ( U 2 , A , R 2 ) ¯ ;
C 1 = ( M 1 α , M 1 β , Y 1 ) ,   C 2 = ( M 2 α , M 2 β , Y 2 )   and   C 1 , C 2 L α β ( U , A , R ) ¯ .
Now, taking the upper bound extent as an example (the union principle of lower bound extent and upper bound extent), the vertical union of interval association rules α R l u e s s e t * and α R l u e s s e t * * is carried out as follows:
Theorem 4.
If C 1 * , C 2 * constitutes the association rule, and R u l e s ( C 1 , C 2 ) α R l u e s s e t ; C 1 * * , C 2 * * the constitutes association rule, and R u l e s ( C 1 , C 2 ) α R l u e s s e t , then R u l e s ( C 1 , C 2 ) α R l u e s s e t .
Proof. 
R u l e s ( C 1 , C 2 ) α R l u e s s e t and R u l e s ( C 1 , C 2 ) α R l u e s s e t , so:
Relation 1: M 2 α * M 1 α * and M 2 α * * M 1 α * * , then M 2 α * M 2 α * * M 1 α * M 1 α * * , i.e., M 2 α M 1 α .
Relation 2: | M 2 α * | | M 1 α * | c and | M 2 α * * | | M 1 α * * | c , then
| M 2 α * | + | M 2 α * * | c ( | M 1 α * | + | M 1 α * * | ) , | M 2 α * | + | M 2 α * * | | M 1 α * | + | M 1 α * * | c , i.e. , | M 2 α | | M 1 α | c .
Then R u l e s ( C 1 , C 2 ) α R l u e s s e t can be derived from two relations. □
Theorem 5.
If at least one of the two association rules R u l e s ( C 1 , C 2 ) and R u l e s ( C 1 , C 2 ) in L α β ( U 1 , A , R 1 ) ¯ and L α β ( U 2 , A , R 2 ) ¯ does not exist, then there must be no R u l e s ( C 1 , C 2 ) in α R l u e s s e t .
Because the vertical union of interval association rules is based on the objects set with certain proportion attributes, it is necessary to introduce support, confidence, accuracy, and uncertainty to measure accurately.
The upper bound frequencies of C 1 * , C 2 * are θ 1 * and θ 2 * , respectively.
The upper bound frequencies of C 1 * * , C 2 * * are θ 1 * * and θ 2 * * , respectively.
The upper bound frequencies of C 1 , C 2 are θ 1 and θ 2 , respectively.
The support degree, confidence degree, accuracy, and uncertainty of R u l e s ( C 1 , C 2 ) obtained by vertical union of association rules R u l e s ( C 1 , C 2 ) and R u l e s ( C 1 , C 2 ) can be deduced from frequency to its solution formula, as follows.
Theorem 6.
The support degree of R u l e s ( C 1 , C 2 ) is:
S u p p o r t ( C 1 C 2 ) = | U 1 | θ 2 * + | U 2 | θ 2 * * | U 1 | + | U 2 |
Theorem 7.
The confidence degree of R u l e s ( C 1 , C 2 ) is
C o n f ( C 1 C 2 ) = | U 1 | θ 2 * + | U 2 | θ 2 * * | U 1 | θ 1 * + | U 2 | θ 1 * *
Theorem 8.
R u l e s ( C 1 , C 2 ) = M i n { R u l e s ( C 1 * , C 2 * ) , R u l e s ( C 1 * * , C 2 * * ) } .

3.2. Dynamic Mining Algorithms for Interval Association Rules

3.2.1. Algorithm Design

According to Theorems 4 and 5, the result of the vertical union of interval association rules only occurs in the data with the same interval association rules and adjacent data. In order to ensure that the mined interval association rules are not lost, the algorithm retains the non-united rules on the basis of a vertical union of the same rules, which facilitates the mining of interval association rules later, and measures the occurrence frequency and frequency of the same rules with frequency. The basic idea of the algorithm is that whenever an interval association rule is generated, it is transformed into an array representation, and it is mined with the set of interval rules that have been united and retain the non-united rules, so that the interval rules can be aggregated again and again.
In order to distinguish different rules, association rules mined from interval concept lattices are stored in the form of arrays R S [ i ] = { R u l e , F N 1 , F N 2 , U , S u p p o r t , C o n f , P D , U D , F l a g , N u m } .
R S [ i ] represents the i th association rule in the interval association rules set R S ;
R u l e represents the interval association rules R u l e ( C x , C y ) ;
F N 1 , F N 2 represents the frequency degree of frequency nodes C x and C y in R u l e ( C x , C y ) ;
U represents the number of objects in the context;
S u p p o r t , C o n f , P D and U D represents the support, confidence, accuracy, and uncertainty degree of R u l e ( C x , C y ) ;
F l a g marked according to whether rules have been merged or not. F l a g = 0 denotes that rules are not united vertically; F l a g = 1 indicates that the rules have been vertically united; N u m denotes the number of occurrences of rule R u l e ( C x , C y ) in previously united rule sets.
Based on the above algorithm principle and analysis, we design a dynamic extraction and union algorithm of interval association rules, DMA (Dynamic Mining Algorithm). See Algorithm 1.
Algorithm 1. DMA (Dynamic Mining Algorithm)
Input: Association rule sets R S 1 , R S 2 , R S k
Output: Association rule set R S
Step1 R S = R S 1 ;
Step2 The interval association rules in the rule set R S k is stored in the form of arrays. Set R S * and initialize. Comparing the R u l e of rule set R S and R S k , we unit interval association rules vertically according to Theorems 4 and 5. For the rules that have been united in R S and R S k , let F l a g = 1 and put the united rule number in R S * . According to Theorems 6–8, calculate the frequency, S u p p o r t , C o n f , P D , and U D of C x and C y in R S * . F l a g =1, N u m = N u m + 1. Delete the rules of F l a g = 1 in R S and R S k , and renumbering. Let rules number in R S * add the renumbering number in R S . Putting the remain rules of R S into R S * ; then renumbering in R S k , and putting the remain rules of R S k into R S * .
 Rule Vertical Union ( R S , R S k )
1
{
2
g = 0 ;
3
R S [ g ] * = { R u l e , F N 1 , F N 2 , U , S u p p o r t , C o n f ,
P D , U D , F l a g , N u m }
4
{ R u l e = ;
5
F N 1 = F N 2 = 0 ;
6
U = R S [ 1 ] . U + R S k [ 1 ] . U ;
7
S u p p o r t = C o n f = P D = U D = F l a g = 0 ;
8
N u m = 1 ;
9
}
10
For ( i=1; | R S | ; i++)
11
 {For (j=1; | R S k | ; j++)
12
  If( R S [ i ] . F l a g = R S k [ j ] . F l a g = 0 ||
   R S [ i ] . R u l e = R S k [ j ] . R u l e )
13
  { g + 1 ;
14
    N u m + 1 ;
15
    R S [ i ] . F l a g = 1 ;
16
    R S k [ j ] . F l a g = 1 ;
17
    R S [ g ] * . F N 1 = R S [ i ] . U * R S [ i ] . F N 1 + R S k [ j ] . U * R S k [ j ] . F N 1 R S [ i ] . U + R S k [ j ] . U
18
    R S [ g ] * . F N 2 = R S [ i ] . U * R S [ i ] . F N 2 + R S k [ j ] . U * R S k [ j ] . F N 2 R S [ i ] . U + R S k [ j ] . U
19
   R S [ g ] * . S u p p o r t = R S [ g ] * . F N 2
20
   R S [ g ] * . C o n f = R S [ i ] . U * R S [ i ] . F N 2 + R S k [ j ] . U * R S k [ j ] . F N 2 R S [ i ] . U * R S [ i ] . F N 1 + R S k [ j ] . U * R S k [ j ] . F N 1
21
   R S [ g ] * . P D = min { R S [ i ] . P D , R S k [ j ] . P D } ;
22
    R S [ g ] * . U D = 1 R S [ g ] * . P D ;
23
    }
24
  }
25
  For each R S [ i ] in R S ;
26
  {If R S [ i ] . F l a g = 0 ;
27
    g + 1 ;
28
    R S * [ g ] = R S [ i ] ;}
29
  For each R S k [ j ] in R S k ;
30
  {If R S k [ j ] . F l a g = 0 ;
31
    g + 1 ;
32
   R S * [ g ] = R S k [ j ] ;}
33
 }
Step3 R S = R S *

3.2.2. Algorithm Analysis

The main content of DMA is embodied in the function Rule Vertical Union ( R S , R S k ). Lines 1 to 9 implement the initialization of parallel rule set R S * . Lines 10–12 nested for loop implements searching for interval association rules in R S and R S k that correspond to the same rules and are not united. Lines 13 to 16 number the interval association rules found in R S * . Num+1 is used to count the rule, and the corresponding rule F l a g in united R S , R S k is recorded as 1. Lines 17 to 25 implement the assignment of interval association rule R S * [ g ] . In lines 26–34, the unconsolidated rules in R S , R S k are put into R S * to maintain the diversity of the united rule set.
Compared with Step 2, the algorithm achieves the step requirements, and the assignment of Step 1 and Step 3 make the DMA algorithm dynamic, so that the algorithm has integrity and correctness. Compared with the general method, the algorithm realizes the merging of rules to rules, eliminating the process of vertical merging of interval concept lattices, thus greatly reducing the time complexity and space complexity of the implementation process.
If the number of association rules in two sub lattices is n and m respectively, the time complexity of the algorithm is less than O ( n × m ) + O ( n ) + O ( m ) . Compared with the interval association rule mining algorithm with parameters, the algorithm is more efficient.

3.3. Example Study

Set the formal contexts as shown in Table 1 and Table 2:
Set α = 0.6 , β = 0.7 , θ = 50 % , c = 60 % .
The DMA algorithm is used to mine interval association rules in parallel. Take the upper bound interval association rules as an example (the mining of lower bound association rules is similar).
Among them, the upper bound frequent nodes from F C 1 and F C 2 are shown in Table 3 and Table 4 respectively, and the corresponding upper bound association rules are shown in Table 5 and Table 6 respectively.
The DMA algorithm is used to mine the previous association rules of F C 1 and F C 2 in parallel. The parallel results of the different association rules are the same part of the combined association rules deleted in Table 5 and Table 6, respectively.
The parallel results of the same association rules are shown in Table 7.

4. Discussion

Considering the characteristics of interval concept lattices, the vertical union method of interval concept lattices and the principle of mining association rules with parameters were studied in this paper. Considering the intrinsic relationship between interval association rules, this paper presents some metrics, such as the confidence and credibility of united rules, and proposes a dynamic mining algorithm for interval association rules, which realizes rule aggregation and keeps the diversity of interval association rules. The rationality and efficiency of the algorithm are proved by algorithm analysis and case study, which provides timely, effective and abundant decision information for decision makers in data analysis. The next step is to optimize the algorithm for incomplete information systems in the context of large data. In the mining process, the stability and timing choice of the rule union is another problem to be studied in depth.

Author Contributions

Y.Y. contributed to the main body of the paper. The main models and algorithms were constructed; R.Z. was mainly responsible for the writing of the paper, colleagues performed data processing; B.L. guided the overall process of the paper.

Funding

This research was funded by the National Natural Science Foundation of China (61370168, 61472340), Natural Science Foundation of Hebei Province (F2016209344).

Acknowledgments

The authors thank the support of Hebei Key Laboratory for Data Science and Application and the technical assistance of Chunying Zhang and Lihong Li from North China University of Science and Technology.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Wille, R. Restructing lattice theory: An approach based on hierarchies of concepts. In Ordered Sets; Rival, I., Ed.; Reidel: Dordrecht, The Netherlands, 1982; pp. 445–470. [Google Scholar]
  2. Jiang, C. Applications of concept lattice in the digital library. Inf. Sci. 2010, 28, 1908–1911. [Google Scholar]
  3. Cole, R.; Stumme, G. CEM—A conceptual email manager. In Proceedings of the 7th International Conference on Conceptual Structures (ICCS’2000), Darmstadt, Germany, 14–18 August 2000; Springer: Berlin/Heidelberg, Germany, 2000. [Google Scholar]
  4. Ganter, B.; Wille, R. Formal Concept Analysis: Mathematical Foundations; Springer: Berlin, Germany, 1999; pp. 53–85. [Google Scholar]
  5. Zhao, W. Research of Data Mining Based on Concept Lattice and Its Extended Models; HeFei University of Technology: Hefei, China, 2002. [Google Scholar]
  6. Zhang, J.; Zhang, S.; Zheng, L. Weighted concept lattice and incremental construction. Pattern Recognit. Artif. Intell. 2005, 18, 171–176. [Google Scholar]
  7. Yang, H.; Zhang, J. Rough concept lattice and construction arithmetic. Comput. Eng. Appl. 2007, 43, 172–175. [Google Scholar]
  8. Ho, T.B. An approach to concept formation based on formal concept analysis. IEICE Trans. Inf. Syst. 1995, 78, 553–559. [Google Scholar]
  9. Liu, B.; Li, Y. Construction principles and algorithms of concept lattice generated by random decision formal context. Comput. Sci. 2013, 40, 90–92. [Google Scholar]
  10. Godin, R. Incremental concept formation algorithm based on Galois (concept) lattices. Comput. Intell. 1995, 11, 246–267. [Google Scholar] [CrossRef]
  11. Li, Y.; Liu, Z.; Chen, L.; Xu, X.; Cheng, W. Horizontal union algorithm of multiple concept lattices. Acta Electron. Sin. 2004, 32, 1849–1854. [Google Scholar]
  12. Zhi, H.; Zhi, D.; Liu, Z. Theory and algorithm of concept lattice union. Acta Electron. Sin. 2010, 38, 455–459. [Google Scholar]
  13. Zhang, L.; Shen, X.; Han, D.; An, G. Vertical union algorithm of concept lattice based on synonymous concept. Comput. Eng. Appl. 2007, 43, 95–98. [Google Scholar]
  14. Xu, M.; He, Z.; Liu, Y.; Li, J.; Zhu, Q. Bidirectional incremental formation algorithm of concept lattice. J. Chin. Comput. Syst. 2014, 1, 172–176. [Google Scholar]
  15. Yao, J.; Yang, S.; Lin, X.; Peng, Y. Incremental double sequence algorithm of concept lattice union. Appl. Res. Comput. 2013, 30, 1038–1040. [Google Scholar]
  16. Zhang, W.; Wei, L.; Qi, J. Attribute Reduction Theory and Method of Concept Lattice. Sci. China Ser. E Inf. Sci. 2005, 35, 628–639. [Google Scholar]
  17. Wang, X.; Zhang, W. Attribute reduction in concept lattice and attribute characteristics. Comput. Eng. Appl. 2008, 44, 1–4. [Google Scholar]
  18. Hu, L.; Zhang, J.; Zhang, S. Pruning based incremental construction of concept lattice. J. Comput. Appl. 2006, 26, 1659–1661. [Google Scholar]
  19. Liu, B.; Zhang, C. New concept lattice structure-interval concept lattice. Comput. Sci. 2012, 39, 273–277. [Google Scholar]
  20. Wang, L. Research of Effective Construction Algorithm for Interval Concept Lattice and Its Application; North China University of Science and Technology: Tangshan, China, 2015. [Google Scholar]
  21. Zhang, C.; Wang, L.; Liu, B. Dynamic reduction theory for interval concept lattice based on covering and its realization. J. Shandong Univ. (Nat. Sci.) 2014, 49, 15–21. [Google Scholar]
  22. Zhang, C.; Wang, L.; Liu, B. Longitudinal maintenance theory and algorithm for interval concept lattice. Comput. Eng. Sci. 2015, 37, 1221–1226. [Google Scholar]
  23. Wang, Z.; Hu, K.; Hu, X.; Liu, Z.; Zhang, D. General and incremental algorithms of rule extraction based on concept lattice. Chin. J. Comput. 1999, 22, 167–171. [Google Scholar]
  24. Sahami, M. Learning classification rules using lattices. In European Conference on Machine Learning; Springer: Berlin/Heidelberg, Germany, 1995; pp. 343–346. [Google Scholar]
  25. Liang, J.; Wang, J. An algorithm for extracting rule-generating sets based on concept lattice. J. Comput. Res. Dev. 2004, 41, 1339–1344. [Google Scholar]
  26. Xie, Z.; Liu, Z. A fast incremental algorithm for building concept lattice. Chin. J. Comput. 2002, 25, 490–496. [Google Scholar]
  27. Hu, K.; Lu, Y.; Shi, C. An Integrated Mining Approach for Classification and Association Rule Based on Concept Lattice. J. Softw. 2000, 11, 1478–1484. [Google Scholar]
  28. Wang, W. Research on Association Rule Mining and Change Model Based on Concept Lattice; Shandong University: Jinan, China, 2012. [Google Scholar]
  29. Wang, L.; Zhang, C.; Liu, B. Mining Algorithm of Interval Association Rule with Parameters and Its Application. J. Front. Comput. Sci. Technol. 2016, 10, 1546–1554. [Google Scholar]
  30. Xu, J.; Li, M.; Liu, B. Rule optimization method and application based on interval concept lattice. Comput. Eng. Appl. 2017, 53, 167–173. [Google Scholar]
  31. Zhang, R.; Zhang, C.; Wang, L.; Liu, B. Vertical union algorithm of interval concept lattices. J. Comput. Appl. 2015, 35, 1217–1221. [Google Scholar]
  32. Yang, Y.; Zhang, R.; Liu, B. Dynamic Horizontal Union Algorithm for Multiple Interval Concept Lattices. Mathematics 2019, 7, 159. [Google Scholar] [CrossRef]
  33. Liu, B.; Zhang, C.; Yang, Y. Research on Interval Concept Lattice and its Construction Algorithm. Prz. Elektrotechniczny 2013, 2013, 1. [Google Scholar]
Table 1. Formal context F C 1 .
Table 1. Formal context F C 1 .
abcde
111100
200010
311010
410101
Table 2. Formal context F C 2 .
Table 2. Formal context F C 2 .
abcde
(1)10110
(2)11010
(3)01011
Table 3. The frequent nodes of F C 1 .
Table 3. The frequent nodes of F C 1 .
Frequent NodeFrequent DegreeFrequent NodeFrequent Degree
a75%acd100%
b50%ace50%
c75%ade50%
d50%bcd75%
ab50%bce50%
ac50%cde50%
abc75%abcd50%
abd50%abce50%
abe75%abcde75%
Table 4. The frequent nodes of F C 2 .
Table 4. The frequent nodes of F C 2 .
Frequent NodeFrequent DegreeFrequent NodeFrequent Degree
a67%acd67%
b67%ade100%
d100%bcd100%
ad67%bde67%
bd67%cde67%
abc67%abcd67%
abd100%abde67%
abe67%abcde100%
Table 5. The previous rules and its measurement results on F C 1 .
Table 5. The previous rules and its measurement results on F C 1 .
RuleSupportConfidenceAccuracyUncertaintyFrequent Number
a b c d e 75%100%60%40%1
a b c d e 75%100%60%40%1
a b e c d 75%100%60%40%1
a c d b e 75%75%60%40%1
a b c e 50%67%75%25%1
c a b e 50%67%75%25%1
a c b e 50%100%75%25%1
a b c e 50%67%67%33%1
a b e c 50%67%67%33%1
a c e b 50%100%67%33%1
a b c d 50%67%75%25%1
b a c d 50%100%75%25%1
a b c d 50%100%75%25%1
a b c d 50%67%67%33%1
a b d c 50%100%67%33%1
b c d a 50%67%67%33%1
c d e 50%67%67%33%1
c b e 50%67%67%33%1
a d e 50%67%67%33%1
a c e 50%67%67%33%1
c a e 50%67%67%33%1
a c e 50%100%67%33%1
a b e 75%100%67%33%1
a b d 50%67%67%33%1
b a e 50%100%67%33%1
a b e 50%100%67%33%1
a b c 75%100%67%33%1
a c 50%67%100%0%1
c a 50%67%100%0%1
a b 50%67%100%0%1
b a 50%100%100%0%1
Table 6. The previous rules and its measurement results on F C 2 .
Table 6. The previous rules and its measurement results on F C 2 .
Rule SupportConfidenceAccuracyUncertaintyFrequent Number
d a b c e 100%100%60%40%1
a b d c e 100%100%60%40%1
a d e b c 100%100%60%40%1
b c d a e 100%100%60%40%1
b a d e 67%100%75%25%1
d a b e 67%67%75%25%1
b d a e 67%100%75%25%1
a b d e 67%67%67%33%1
a b e d 67%100%67%33%1
a d e b 67%67%67%33%1
b d e a 67%100%67%33%1
a b c d 67%100%75%25%1
d a b c 67%67%75%25%1
a d b c 67%100%75%25%1
a b c d 67%100%67%33%1
a b d c 67%67%67%33%1
a c d b 67%100%67%33%1
b c d a 67%67%67%33%1
d c e 50%67%67%33%1
b d e 67%100%67%33%1
d b e 67%67%67%33%1
b d e 67%100%67%33%1
d b c 100%100%67%33%1
d a e 100%100%67%33%1
a c d 67%100%67%33%1
d a c 67%67%67%33%1
a d c 67%100%67%33%1
b a e 67%100%67%33%1
d a b 100%100%67%33%1
a b c 67%100%67%33%1
Table 7. The previous rules set and its measurement results after vertical union.
Table 7. The previous rules set and its measurement results after vertical union.
RuleSupportConfidenceAccuracyUncertaintyFrequent Number
a b c d 57%80%75%25%2
a b c d 57%80%67%33%2
a b d c 57%80%67%33%2
b c d a 57%67%67%33%2
a b c 71%100%67%33%2

Share and Cite

MDPI and ACS Style

Yang, Y.; Zhang, R.; Liu, B. Dynamic Parallel Mining Algorithm of Association Rules Based on Interval Concept Lattice. Mathematics 2019, 7, 647. https://doi.org/10.3390/math7070647

AMA Style

Yang Y, Zhang R, Liu B. Dynamic Parallel Mining Algorithm of Association Rules Based on Interval Concept Lattice. Mathematics. 2019; 7(7):647. https://doi.org/10.3390/math7070647

Chicago/Turabian Style

Yang, Yafeng, Ru Zhang, and Baoxiang Liu. 2019. "Dynamic Parallel Mining Algorithm of Association Rules Based on Interval Concept Lattice" Mathematics 7, no. 7: 647. https://doi.org/10.3390/math7070647

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop