Article

Efficient Breadth-First Reduct Search

by Veera Boonjing 1 and Pisit Chanvarasuth 2,*
1 Department of Computer Engineering, Faculty of Engineering, King Mongkut’s Institute of Technology Ladkrabang, Ladkrabang, Bangkok 10520, Thailand
2 School of Management Technology, Sirindhorn International Institute of Technology, Thammasat University, Pathum Thani, Bangkok 10200, Thailand
* Author to whom correspondence should be addressed.
Mathematics 2020, 8(5), 833; https://doi.org/10.3390/math8050833
Submission received: 14 April 2020 / Revised: 12 May 2020 / Accepted: 18 May 2020 / Published: 21 May 2020
(This article belongs to the Section Mathematics and Computer Science)

Abstract:
This paper formulates the problem of determining all reducts of an information system as a graph search problem. The search space is represented in the form of a rooted graph. The proposed algorithm uses a breadth-first search strategy to search for all reducts starting from the graph root. It expands nodes in breadth-first order and uses a pruning rule to decrease the search space. It is mathematically shown that the proposed algorithm is both time and space efficient.

1. Introduction

In machine learning, feature selection is the process of selecting relevant features to be used as input to learning models. Its aim is to obtain an optimal feature set so that the model can accurately predict the output. Such optimal feature sets are known as reducts in rough set theory. A reduct of an information system with conditional attributes and a decision attribute is a minimal subset of the conditional attributes whose degree of dependency with the decision attribute equals that of the full set of conditional attributes (see [1] for a formal definition). A reduct can be any nonempty subset of the conditional attributes; hence, the number of candidate reducts is exponential in the number of conditional attributes. For computational efficiency, many algorithms find a single reduct or multiple reducts without exhaustively investigating all of these candidates: numerous heuristic algorithms [2,3,4,5,6] and metaheuristic algorithms [7,8,9,10,11,12,13,14] have been proposed, as discussed in [15]. This class of algorithms is known as approximate algorithms. They yield a single reduct or multiple reducts, but not the exhaustive list of reducts that exact algorithms produce. Because approximate algorithms require parameter settings, they may produce different reducts in different runs; moreover, the reducts they produce may not be optimal. Exact algorithms are necessary if we want the best reduct, under a given criterion, among all reducts. However, finding all reducts involves generating and examining all possible reducts. In the literature, this can be done with a discernibility matrix [16] or a power set tree [17,18]; consequently, these exact algorithms have time complexity exponential in the number of conditional attributes.
In this work, we propose an exact algorithm that finds all reducts without generating and examining all possible candidates. We propose a simple representation of the candidates called a solution rooted graph, formed by the possible subsets of conditional attributes and their connections. Its root node is the n-subset of attributes; the root is connected to its (n − 1)-subset nodes, each (n − 1)-subset node is connected to its (n − 2)-subset nodes, and so on down to the 0-subset node. Hence, every node of the rooted graph except the 0-subset node is a possible reduct, and there are n node types in the search space, classified by cardinality: n-subset, (n − 1)-subset, (n − 2)-subset, …, and 1-subset. The proposed algorithm searches the solution rooted graph breadth-first. We adopt breadth-first search because it is complete, i.e., it guarantees finding all reducts. On its own, this search would still generate and examine all possible candidates, type by type, from the n-subset type down to the 1-subset type. However, any node with a degree of dependency less than that of the graph root is not a reduct, and, by "the monotonic property of dependency", neither is any of its subsets. All of these subsets can therefore be eliminated from consideration without losing any reducts. The proposed algorithm uses this elimination as its pruning rule, which is the basis of its efficiency.
This paper is organized as follows. Section 2 briefly gives a sufficient background on reducts in terms of the degree of dependency. Section 3 describes the new efficient breadth-first reduct search algorithm. Analysis of the algorithm is given in Section 4. We conclude the paper in Section 5. An illustrative example is shown in Appendix A.

2. Basic Concepts

This section gives the background on reducts in terms of the degree of dependency. More details on reducts and rough sets can be found in [1].
Definition 1.
Any 4-tuple IS = <U, A = C ∪ D, V, f> with C ∩ D = ∅ is called an information system, where U is a finite set of objects; A is a finite set of attributes; C is a finite set of conditional attributes; D is a finite set of decision attributes; V = ∪p ∈ A Vp, where Vp is the domain of attribute p; and f: U × A → V is the information function, assigning a value f(xi, q) ∈ Vq to every q ∈ A and xi ∈ U. An information system is also denoted by IS = (U, A).
Example 1.
Let us consider the simple information system shown in Table 1. We adopt this table to illustrate the basic concepts in the following examples.
From Table 1, let P be C1. We have
U = {x1, x2, x3, x4, x5, x6, x7, x8, x9, x10},
A = {C1, C2, C3, C4, C5, D},
VP = {1, 2, 3},
f(x4, C1) = 2,
and f(x10, D) = 1.
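As a running aid for the examples in this section, Table 1 can be encoded directly. Below is a minimal Python sketch; the representation (a dict of value tuples) and the names `U`, `A`, `table`, and `f` are illustrative choices, not from the paper:

```python
# Table 1 as an information system IS = (U, A): each object maps to its
# value tuple over A = C ∪ D (attribute order C1..C5, D).
U = ["x1", "x2", "x3", "x4", "x5", "x6", "x7", "x8", "x9", "x10"]
A = ["C1", "C2", "C3", "C4", "C5", "D"]
table = {
    "x1": (1, 2, 1, 3, 1, 2), "x2": (3, 3, 2, 3, 1, 2),
    "x3": (3, 3, 2, 3, 1, 1), "x4": (2, 1, 1, 2, 1, 1),
    "x5": (2, 2, 1, 1, 1, 2), "x6": (3, 2, 3, 1, 1, 3),
    "x7": (2, 2, 3, 1, 1, 3), "x8": (1, 1, 2, 2, 1, 1),
    "x9": (3, 3, 3, 1, 1, 3), "x10": (1, 1, 1, 1, 1, 1),
}

def f(x, q):
    """Information function f: U × A → V."""
    return table[x][A.index(q)]

print(f("x4", "C1"))                    # 2, as in Example 1
print(f("x10", "D"))                    # 1
print(sorted({f(x, "C1") for x in U}))  # [1, 2, 3] = V_C1
```

With this encoding, every quantity in the following examples can be recomputed mechanically.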
Definition 2.
Let R be any nonempty subset of A. An R-indiscernibility relation, denoted by IND(R), is defined as IND(R) = {(xi, xj) ∈ U × U: f(xi, a) = f(xj, a) for every a ∈ R}.
Note that if (xi, xj) ∈ IND(R), then the objects xi and xj are called indiscernible with respect to R. The relation IND(R) is an equivalence relation; therefore, it induces the partition U/IND(R), the set of equivalence classes of the R-indiscernibility relation.
Example 2.
Let R be {C1, C2, C3, C4, C5}. Listing only the pairs (xi, xj) with xi ≠ xj, and omitting symmetric duplicates, we have
IND(R) = {(x2, x3)},
IND({C1}) = {(x1, x8), (x1, x10), (x8, x10), (x4, x5), (x4, x7), (x5, x7), (x2, x3), (x2, x6), (x2, x9), (x3, x6), (x3, x9), (x6, x9)},
U/IND(R) = {{x2, x3}, {x1}, {x4}, {x5}, {x6}, {x7}, {x8}, {x9}, {x10}},
and U/IND({C1}) = {{x1, x8, x10}, {x4, x5, x7}, {x2, x3, x6, x9}}.
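The partition U/IND(R) of Definition 2 can be computed by grouping objects on their value vectors over R. A sketch under the same kind of encoding of Table 1 (all names illustrative):

```python
from collections import defaultdict

# Table 1 re-encoded (attribute order C1..C5, D); names are illustrative.
A = ["C1", "C2", "C3", "C4", "C5", "D"]
table = {
    "x1": (1, 2, 1, 3, 1, 2), "x2": (3, 3, 2, 3, 1, 2),
    "x3": (3, 3, 2, 3, 1, 1), "x4": (2, 1, 1, 2, 1, 1),
    "x5": (2, 2, 1, 1, 1, 2), "x6": (3, 2, 3, 1, 1, 3),
    "x7": (2, 2, 3, 1, 1, 3), "x8": (1, 1, 2, 2, 1, 1),
    "x9": (3, 3, 3, 1, 1, 3), "x10": (1, 1, 1, 1, 1, 1),
}

def partition(R):
    """U/IND(R): group objects by their value vector on the attributes in R."""
    blocks = defaultdict(set)
    for x, row in table.items():
        blocks[tuple(row[A.index(q)] for q in R)].add(x)
    return list(blocks.values())

print(len(partition(["C1"])))  # 3 blocks, as for U/IND({C1}) in Example 2
print(len(partition(A[:5])))   # 9 blocks: only x2 and x3 share one block
```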
Definition 3.
Let R be any nonempty subset of A and X be any nonempty subset of U. The R-lower approximation of X, denoted by RX, is defined as ∪ {Y ∈ U/IND(R): Y ⊆ X}. The R-lower approximation of X contains all objects that, with the knowledge of the attributes in R, can be classified with certainty as belonging to the concept X.
Example 3.
Let R be {C1, C2, C3, C4, C5}. We have
U/IND (D) = {{x1, x2, x5}, {x3, x4, x8, x10}, {x6, x7, x9}},
X1 = {x1, x2, x5}, X2 = {x3, x4, x8, x10}, X3 = {x6, x7, x9},
U/IND(R) = {{x2, x3}, {x1}, {x4}, {x5}, {x6}, {x7}, {x8}, {x9}, {x10}},
RX1= {x1} ∪ {x5} = {x1, x5},
RX2 = {x4} ∪ {x8} ∪ {x10} = {x4, x8, x10},
and RX3 = {x6} ∪ {x7} ∪ {x9} = {x6, x7, x9}.
Definition 4.
Let R be any nonempty subset of A. The positive region of the partition U/IND(D) with respect to R, denoted by POSR(D), is defined as ∪ {RX: X ∈ U/IND(D)}.
Example 4.
From Example 3, we have
POSR(D) = RX1 ∪ RX2 ∪ RX3 = {x1, x4, x5, x6, x7, x8, x9, x10}.
Theorem 1
([19]). Let IS = (U, A = C ∪ D). If B ⊆ C, then we have POSB(D) ⊆ POSC(D).
Proof. 
For each x ∈ POSB(D), there is y ∈ U/IND(D) such that [x]IND(B) ⊆ y, where [x]IND(B) is the equivalence class of x with regard to the equivalence relation IND(B). Since B ⊆ C, we have [x]IND(C) ⊆ [x]IND(B). Thus, [x]IND(C) ⊆ y, and then x ∈ POSC(D). Therefore, POSB(D) ⊆ POSC(D). □
Definition 5.
Let R be any nonempty subset of C. D depends on R in a degree k (0 ≤ k ≤ 1) where
k = γR (D) = |POSR (D)|/|U|.
Example 5.
From Example 4, we have k = γR (D) = |POSR(D)|/|U| = |{x1, x4, x5, x6, x7, x8, x9, x10}|/|U| = 8/10 = 0.8. Then, D depends on R with degree 0.8.
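Definitions 3–5 chain together: the U/IND(R) blocks that fit inside a decision class form the lower approximations, their union is the positive region, and its relative size is the degree of dependency. A Python sketch of this chain for Table 1 (the encoding and all names are illustrative):

```python
from collections import defaultdict

# Table 1 re-encoded (attribute order C1..C5, D); names are illustrative.
A = ["C1", "C2", "C3", "C4", "C5", "D"]
table = {
    "x1": (1, 2, 1, 3, 1, 2), "x2": (3, 3, 2, 3, 1, 2),
    "x3": (3, 3, 2, 3, 1, 1), "x4": (2, 1, 1, 2, 1, 1),
    "x5": (2, 2, 1, 1, 1, 2), "x6": (3, 2, 3, 1, 1, 3),
    "x7": (2, 2, 3, 1, 1, 3), "x8": (1, 1, 2, 2, 1, 1),
    "x9": (3, 3, 3, 1, 1, 3), "x10": (1, 1, 1, 1, 1, 1),
}

def partition(R):
    """U/IND(R): blocks of objects indiscernible on the attributes in R."""
    blocks = defaultdict(set)
    for x, row in table.items():
        blocks[tuple(row[A.index(q)] for q in R)].add(x)
    return list(blocks.values())

def lower(R, X):
    """R-lower approximation RX: union of the U/IND(R) blocks inside X."""
    return set().union(*(b for b in partition(R) if b <= X))

def gamma(R):
    """Degree of dependency k = |POS_R(D)| / |U| (Definition 5)."""
    pos = set().union(*(lower(R, X) for X in partition(["D"])))
    return len(pos) / len(table)

C = A[:5]
print(sorted(lower(C, {"x1", "x2", "x5"})))  # ['x1', 'x5'], RX1 of Example 3
print(gamma(C))                              # 0.8, as in Example 5
```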
Example 6.
Let R be {C1, C2}. We have
U/IND(D) = {{x1, x2, x5}, {x3, x4, x8, x10}, { x6, x7, x9}},
X1 = {x1, x2, x5}, X2 = {x3, x4, x8, x10}, X3 = {x6, x7, x9},
U/IND(R) = {{x2, x3, x9}, {x1}, {x4}, {x5, x7}, {x6}, {x8, x10}},
RX1 = {x1},
RX2 = {x4} ∪ {x8, x10} = {x4, x8, x10},
RX3 = {x6},
POSR(D) = RX1 ∪ RX2 ∪ RX3 = {x1, x4, x6, x8, x10},
and k = γR(D) = |{x1, x4, x6, x8, x10}|/|U| = 5/10 = 0.5.
Definition 6.
Let C’ be any nonempty subset of C. C’ is a D-reduct (reduct with respect to D) of C, if C’ is a minimal subset of C such that γC(D) = γC’(D).
Example 7.
Let R be {C2, C3}. We have
U/IND(D) = {{x1, x2, x5}, {x3, x4, x8, x10}, {x6, x7, x9}},
X1 = {x1, x2, x5}, X2 = {x3, x4, x8, x10}, X3 = {x6, x7, x9},
U/IND(R) = {{x1, x5}, {x2, x3}, {x4, x10}, {x6, x7}, {x8}, {x9}},
RX1 = {x1, x5},
RX2 = {x4, x10} ∪ {x8} = {x4, x8, x10},
RX3 = {x6, x7} ∪ {x9} = {x6, x7, x9},
POSR(D) = RX1 ∪ RX2 ∪ RX3 = {x1, x4, x5, x6, x7, x8, x9, x10},
and k = γR(D) = 8/10 = 0.8 = γC(D).
Since each nonempty proper subset R’ ∈ {{C2}, {C3}} of R gives γR’(D) = 0.3 < γC(D), R = {C2, C3} is a minimal subset with γR(D) = γC(D); that is, R is a D-reduct of C. (Note that {C1, C2, C3}, although it also satisfies γ{C1, C2, C3}(D) = γC(D) = 0.8, is not a D-reduct, because its proper subset {C2, C3} already attains this degree of dependency.)
Therefore, we find a reduct by obtaining a minimal subset of the conditional attributes C on which the decision attributes D depend to the same degree as on C. A brute-force approach is to check every subset of C satisfying the dependency of D on C (γC(D)) for minimality. Verifying the minimality of each such subset requires checking all of its 2^n − 2 subsets (excluding itself and the empty set), where n is its cardinality.
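To make the brute-force cost concrete, the following sketch evaluates γ on every nonempty subset of C for Table 1 and keeps the minimal subsets attaining γC(D); all 2^5 − 1 = 31 subsets are examined. The encoding and names are illustrative:

```python
from collections import defaultdict
from itertools import combinations

# Table 1 re-encoded (attribute order C1..C5, D); names are illustrative.
A = ["C1", "C2", "C3", "C4", "C5", "D"]
table = {
    "x1": (1, 2, 1, 3, 1, 2), "x2": (3, 3, 2, 3, 1, 2),
    "x3": (3, 3, 2, 3, 1, 1), "x4": (2, 1, 1, 2, 1, 1),
    "x5": (2, 2, 1, 1, 1, 2), "x6": (3, 2, 3, 1, 1, 3),
    "x7": (2, 2, 3, 1, 1, 3), "x8": (1, 1, 2, 2, 1, 1),
    "x9": (3, 3, 3, 1, 1, 3), "x10": (1, 1, 1, 1, 1, 1),
}

def gamma(R):
    """Degree of dependency of D on R (Definition 5)."""
    blocks, dec = defaultdict(set), defaultdict(set)
    for x, row in table.items():
        blocks[tuple(row[A.index(q)] for q in R)].add(x)
        dec[row[-1]].add(x)
    pos = set().union(*(b for b in blocks.values()
                        if any(b <= X for X in dec.values())))
    return len(pos) / len(table)

def brute_force_reducts():
    C = A[:5]
    k = gamma(C)  # 0.8 for Table 1
    # All nonempty subsets of C whose dependency degree equals k ...
    full = [set(s) for r in range(1, len(C) + 1)
            for s in combinations(C, r) if gamma(list(s)) == k]
    # ... of which we keep only the minimal ones.
    return [s for s in full if not any(t < s for t in full)]

print([sorted(s) for s in brute_force_reducts()])
# [['C2', 'C3'], ['C1', 'C3', 'C4']]
```

The result agrees with the reduct set computed by the proposed algorithm in Appendix A, but only after examining every one of the 31 candidates.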

3. Efficient Breadth-First Reduct Search Algorithm

Let C = {C1, C2, C3, …, Cn} be the set of conditional attributes of an information system IS(C, D). There are n + 1 types of subsets of C: n-subset, (n − 1)-subset, (n − 2)-subset, …, 1-subset, and 0-subset. A reduct can be of any of these types except the 0-subset. The breadth-first reduct search algorithm investigates the 2^n − 1 nonempty subsets in order, type by type: n-subset, (n − 1)-subset, (n − 2)-subset, …, and 1-subset. These subsets are referred to as reduct candidates. For each candidate C’, if γC’(D) = γC(D), then C’ becomes a new element of the reduct set, and any of its supersets already in the reduct set are eliminated. If γC’(D) < γC(D), then no subset of C’ is a reduct candidate; according to Theorem 1, each such nonreduct C’ removes 2^|C’| − 2 elements from the reduct candidates. The reduct set is obtained once all reduct candidates have been investigated.
The algorithm implements this idea using three data structures: Candidate_Queue (a first-come, first-served structure), Reduct_Set, and NonReduct_Set, which maintain the reduct candidates, the reducts, and the nonreducts, respectively. Initially, it calculates k = γC(D), adds C to Candidate_Queue, and sets both Reduct_Set and NonReduct_Set to the empty set. While Candidate_Queue is not empty, it loops to update these three data structures. In each iteration, it removes an element C’ from Candidate_Queue and calculates γC’(D). If γC’(D) = k, it adds C’ to Reduct_Set using the updateReduct_Set(C’) procedure, generates all (|C’| − 1)-subsets of C’, and inserts them into Candidate_Queue using the updateCandidate_Queue(C’) procedure. If γC’(D) < k, it inserts C’ into NonReduct_Set and removes all subsets of C’ from Candidate_Queue using the updateNonReduct_Set(C’) procedure. The algorithm is shown in detail in Figure 1.
There are three major procedures in our algorithm: updateReduct_Set(C’), updateCandidate_Queue(C’), and updateNonReduct_Set(C’).
updateReduct_Set(C’)
An element of Reduct_Set is not a reduct if one of its proper subsets is also a reduct. Therefore, whenever a new reduct is found, we test whether each element of Reduct_Set is a superset of it; every such superset is removed from Reduct_Set before the new reduct is inserted, preserving the minimality condition. For example, let Reduct_Set be {{C1, C2, C3, C4, C5}}, let C’ be {C1, C2, C3}, and let γC’(D) = γC(D); then C’ is a new reduct. However, Reduct_Set contains {C1, C2, C3, C4, C5}, a superset of C’. We therefore remove {C1, C2, C3, C4, C5} from Reduct_Set and insert C’, giving the new Reduct_Set = {{C1, C2, C3}}. The procedure is shown in detail in Figure 2.
updateCandidate_Queue(C’)
The procedure generates all (|C’| − 1)-subsets of C’ and tests whether each is a reduct candidate: a reduct candidate must not be a subset of any element of NonReduct_Set. Each subset that passes the test is appended to Candidate_Queue. For example, let C’ be {C1, C2, C3, C4}; then the set of 3-subsets of C’ is {{C1, C2, C3}, {C1, C2, C4}, {C2, C3, C4}, {C1, C3, C4}}. If NonReduct_Set contains the element {C2, C3, C4, C5}, then {C2, C3, C4} is not a reduct candidate, since it is a subset of {C2, C3, C4, C5}. Therefore, {C1, C2, C3}, {C1, C2, C4}, and {C1, C3, C4} are appended to Candidate_Queue. The details of the procedure are shown in Figure 3.
updateNonReduct_Set(C’)
The property of the positive region in Theorem 1 allows us to reduce the search space, i.e., the number of reduct candidates. If C’ ⊆ C, then POSC’(D) ⊆ POSC(D), which implies γC’(D) ≤ γC(D). Consequently, if γC’(D) < k (the degree of dependency of D on C in the original data), then neither C’ nor any of its subsets is a reduct, and all of these subsets can be eliminated from the candidates. For example, let C be the set of conditional attributes {C1, C2, C3, C4, C5} with γC(D) = 0.8 and let C’ be {C1, C2, C4} with γC’(D) = 0.6. Then, by Theorem 1, C’ and its subsets cannot be reducts, so we do not need to explore them; we remove all subsets of C’ from Candidate_Queue. The proposed algorithm records C’ in NonReduct_Set using the procedure updateNonReduct_Set(C’). The procedure is shown in detail in Figure 4.
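Since Figures 1–4 hold the pseudocode, the following Python sketch gives one possible reading of the whole algorithm: a FIFO queue stands in for Candidate_Queue, and the three update procedures are inlined as commented steps. It is an illustrative reconstruction under the Table 1 encoding, not the authors' code:

```python
from collections import defaultdict, deque
from itertools import combinations

# Table 1 re-encoded (attribute order C1..C5, D); names are illustrative.
A = ["C1", "C2", "C3", "C4", "C5", "D"]
table = {
    "x1": (1, 2, 1, 3, 1, 2), "x2": (3, 3, 2, 3, 1, 2),
    "x3": (3, 3, 2, 3, 1, 1), "x4": (2, 1, 1, 2, 1, 1),
    "x5": (2, 2, 1, 1, 1, 2), "x6": (3, 2, 3, 1, 1, 3),
    "x7": (2, 2, 3, 1, 1, 3), "x8": (1, 1, 2, 2, 1, 1),
    "x9": (3, 3, 3, 1, 1, 3), "x10": (1, 1, 1, 1, 1, 1),
}

def gamma(R):
    """Degree of dependency of D on R (Definition 5)."""
    blocks, dec = defaultdict(set), defaultdict(set)
    for x, row in table.items():
        blocks[tuple(row[A.index(q)] for q in R)].add(x)
        dec[row[-1]].add(x)
    pos = set().union(*(b for b in blocks.values()
                        if any(b <= X for X in dec.values())))
    return len(pos) / len(table)

def bfs_reducts(C):
    k = gamma(C)                       # Step 1: k = gamma_C(D)
    queue = deque([frozenset(C)])      # Candidate_Queue, seeded with C
    reducts, nonreducts = [], []       # Reduct_Set, NonReduct_Set
    tested = 0
    while queue:                       # Step 3: breadth-first loop
        c = queue.popleft()
        tested += 1
        if gamma(sorted(c)) == k:
            # updateReduct_Set: drop supersets of the new reduct c.
            reducts = [r for r in reducts if not r > c] + [c]
            # updateCandidate_Queue: enqueue (|c|-1)-subsets that are not
            # subsets of a known nonreduct and not already queued.
            if len(c) > 1:
                for s in map(frozenset, combinations(c, len(c) - 1)):
                    if s not in queue and not any(s <= nr for nr in nonreducts):
                        queue.append(s)
        else:
            # updateNonReduct_Set: record c and purge its queued subsets.
            nonreducts.append(c)
            queue = deque(q for q in queue if not q <= c)
    return reducts, tested

reducts, tested = bfs_reducts(A[:5])
print([sorted(r) for r in reducts])  # [['C1', 'C3', 'C4'], ['C2', 'C3']]
print(tested)                        # 13 candidates evaluated, of 31 possible
```

Running it on Table 1 reproduces the outcome of the Appendix A trace: 13 candidates evaluated out of 31, returning the reducts {C1, C3, C4} and {C2, C3}.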

4. Analysis of Algorithm

Let C = {C1, C2, C3, …, Cn} be the set of conditional attributes of an IS(C, D), and let Lk be the set of k-subsets of C. We know that |Lk| = C(n, k), the binomial coefficient, and that C(n, 0) + C(n, 1) + … + C(n, n) = 2^n. The algorithm searches for reducts from each Lk level by level, starting with k = n, then k = n − 1, and so on, until k = 1. Therefore, the size of its search space is C(n, 1) + C(n, 2) + … + C(n, n) = 2^n − 1. In the best case, C is the only element of Reduct_Set and no element of Ln−1 satisfies the reduct property, so the algorithm tests only C and all elements of Ln−1; the number of tests is 1 + C(n, n − 1) = 1 + n. The best-case time complexity is therefore O(n).
In the worst case, the algorithm reaches a 1-subset reduct as an element of Reduct_Set. If every generated subset satisfies the reduct property, the number of tests is C(n, 1) + C(n, 2) + … + C(n, n − 1) + C(n, n) = 2^n − 1. The worst-case time complexity is therefore O(C(n, m)), the largest term of this sum, where m = ⌊med⌋ and med is the median of n, n − 1, n − 2, …, 1, 0. Since C(n, m) = n(n − 1)(n − 2)…(n − m + 1)/m!, the worst-case time complexity is O(n^m). However, the algorithm applies the property of the positive region to reduce the search space as soon as it finds a nonreduct subset: each nonreduct l-subset can eliminate up to 2^l − 2 candidate elements from the levels Lk with k = l − 1, l − 2, …, 1. These nonreduct subsets are stored in NonReduct_Set, and any subset of an element of NonReduct_Set is never added to Candidate_Queue.
In general, the space complexity of a breadth-first search algorithm is determined by the candidates that remain in Candidate_Queue, i.e., by the size of the largest Candidate_Queue. In the best case, the space complexity is O(n). Among all subset levels, |Lm| is the largest, where m = ⌊med⌋ and med is the median of n, n − 1, n − 2, …, 1, 0. Since the algorithm generates and tests candidates level by level, its worst-case space complexity is O(C(n, m)) = O(n^m).
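The counts above are easy to check numerically. For the running example with n = 5, the floor of the median of 5, 4, …, 1, 0 is m = 2; a small sketch using Python's math.comb:

```python
import math

n = 5       # conditional attributes in the running example
m = n // 2  # floor of the median of n, n-1, ..., 1, 0

# Search-space size: all nonempty subsets of C.
space = sum(math.comb(n, k) for k in range(1, n + 1))
print(space)                    # 31 = 2^5 - 1

# Best case: test C plus all (n-1)-subsets, i.e., 1 + n tests.
print(1 + math.comb(n, n - 1))  # 6

# The largest level L_m dominates the worst case: C(5, 2) = 10.
print(math.comb(n, m))          # 10
```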

5. Conclusions

This paper presents a simple and efficient solution to the problem of finding all reducts of an information system. The problem is formulated as a search problem whose search space is a rooted graph: a connected graph of possible reducts and their connections. Its root is the set of all conditional attributes, and each k-subset node is connected to its (k − 1)-subset nodes, where k is a positive integer not larger than the cardinality of the graph root. The proposed algorithm searches this graph with a breadth-first strategy, starting from the root and expanding nodes in breadth-first order. With the monotonic property of the positive region (Theorem 1) as its pruning rule, it eliminates nonreduct nodes from the search space early. An illustrative example demonstrates the algorithm, and the analysis confirms its efficiency: letting n be the cardinality of the set of conditional attributes and m the floor of the median of n, n − 1, n − 2, …, 1, 0, both the time and space complexity of the algorithm are O(n) in the best case and O(n^m) in the worst case.

Author Contributions

Conceptualization, V.B. and P.C.; methodology, V.B.; software, V.B.; validation, V.B. and P.C.; formal analysis, V.B.; investigation, V.B.; resources, P.C.; data curation, V.B.; writing—original draft preparation, V.B.; writing—review and editing, P.C.; visualization, V.B.; supervision, V.B.; project administration, P.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Illustrative Example

Let us consider as input to the algorithm the information system IS(C, D) from Table 1, where C = {C1, C2, C3, C4, C5}.
Input: An information system IS(C, D)
Step 1:
    Calculate k = γC(D). We get k = 0.8.
Step 2:
Candidate_Queue = <C>
Reduct_Set= {}
NonReduct_Set = {}
Step 3: 
    Iteration 1: Candidate_Queue is not empty.
	Loop
	   Get an element of Candidate_Queue and assign it to C’.
		Then we have Candidate_Queue = <> and C’ = C.
	   Calculate γC’(D) = 0.8.
	   We have γC’(D) = k. Then
	   Reduct_Set= {{C1, C2, C3, C4, C5}},
	   Candidate_Queue = <{C1, C2, C3, C4}, {C1, C2, C3, C5}, {C1, C2, C4, C5}, {C1, C3, C4, C5},
	   {C2, C3, C4, C5}>, and NonReduct_Set = {}.
	End Loop
    Iteration 2: Candidate_Queue is not empty.
	Loop
	    Get an element of Candidate_Queue and assign it to C’.
	    Then we have
		  Candidate_Queue = <{C1, C2, C3, C5}, {C1, C2, C4, C5}, {C1, C3, C4, C5},
		    {C2, C3, C4, C5}>
		  and C’ = {C1, C2, C3, C4}.
	    Calculate γC’(D) = 0.8.
	    We have γC’(D) = k. Then Reduct_Set= {{C1, C2, C3, C4}}, Candidate_Queue
	    = <{C1, C2, C3, C5}, {C1, C2, C4, C5}, {C1, C3, C4, C5}, {C2, C3, C4, C5}, {C1, C2, C3},
		  {C1, C2, C4}, {C2, C3, C4}, {C1, C3, C4}>,
	    and NonReduct_Set = {}.
	End Loop
    Iteration 3: Candidate_Queue is not empty.
	Loop
	    Get an element of Candidate_Queue and assign it to C’.
	    Then we have
	    Candidate_Queue = <{C1, C2, C4, C5}, {C1, C3, C4, C5}, {C2, C3, C4, C5}, {C1, C2, C3},
		  {C1, C2, C4}, {C2, C3, C4}, {C1, C3, C4}> 
	    and C’ = {C1, C2, C3, C5}.
	    Calculate γC’(D) = 0.8.
	    We have γC’(D) = k. Then Reduct_Set = {{C1, C2, C3, C4}, {C1, C2, C3, C5}},
	    Candidate_Queue = <{C1, C2, C4, C5}, {C1, C3, C4, C5}, {C2, C3, C4, C5}, {C1, C2, C3},
	    {C1, C2, C4}, {C2, C3, C4}, {C1, C3, C4}, {C1, C2, C5}, {C2, C3, C5}, {C1, C3, C5} >,
	   and NonReduct_Set = {}.
	End Loop
    Iteration 4: Candidate_Queue is not empty.
	Loop
	    Get an element of Candidate_Queue and assign it to C’.
	    Then we have
	    Candidate_Queue = <{C1, C3, C4, C5}, {C2, C3, C4, C5}, {C1, C2, C3}, {C1, C2, C4},
		  {C2, C3, C4}, {C1, C3, C4}, {C1, C2, C5}, {C2, C3, C5}, {C1, C3, C5} >
	    and C’ = {C1, C2, C4, C5}.
	    Calculate γC’(D) = 0.6.
	    We have γC’(D) < k. Then Reduct_Set= {{C1, C2, C3, C4}, {C1, C2, C3, C5}},
	    Candidate_Queue = <{C1, C3, C4, C5}, {C2, C3, C4, C5}, {C1, C2, C3}, {C2, C3, C4},
		{C1, C3, C4}, {C2, C3, C5}, {C1, C3, C5} >,
	    and NonReduct_Set = {{C1, C2, C4, C5}}.
	End Loop
    Iteration 5: Candidate_Queue is not empty.
	Loop
	    Get an element of Candidate_Queue and assign it to C’.
		Then we have
	    Candidate_Queue = <{C2, C3, C4, C5}, {C1, C2, C3}, {C2, C3, C4}, {C1, C3, C4},
		  {C2, C3, C5}, {C1, C3, C5} >,
	    and C’ = {C1, C3, C4, C5}. Calculate γC’(D) = 0.8.
	    We have γC’(D) = k. Then Reduct_Set= {{C1, C2, C3, C4}, {C1, C2, C3, C5},
	    {C1, C3, C4, C5}}, Candidate_Queue = < {C2, C3, C4, C5}, {C1, C2, C3}, {C2, C3, C4},
	    {C1, C3, C4}, {C2, C3, C5}, {C1, C3, C5}, {C3, C4, C5}>,
	    and NonReduct_Set = {{C1, C2, C4, C5}}.
	End Loop
    Iteration 6: Candidate_Queue is not empty.
	Loop
	    Get an element of Candidate_Queue and assign it to C’.
	    Then we have
	    Candidate_Queue = <{C1, C2, C3}, {C2, C3, C4}, {C1, C3, C4}, {C2, C3, C5}, {C1, C3, C5},
		  {C3, C4, C5}>
	    and C’ = {C2, C3, C4, C5}.
	    Calculate γC’(D) = 0.8.
	    We have γC’(D) = k. Then
	    Reduct_Set= {{C1, C2, C3, C4}, {C1, C2, C3, C5}, {C1, C3, C4, C5}, {C2, C3, C4, C5}},
	    Candidate_Queue = < {C1, C2, C3}, {C2, C3, C4}, {C1, C3, C4}, {C2, C3, C5}, {C1, C3, C5},
	    {C3, C4, C5}>,
	    and NonReduct_Set = {{C1, C2, C4, C5}}.
	End Loop
    Iteration 7: Candidate_Queue is not empty.
	Loop
	    Get an element of Candidate_Queue and assign it to C’.
	    Then we have Candidate_Queue = < {C2, C3, C4}, {C1, C3, C4}, {C2, C3, C5},
		  {C1, C3, C5}, {C3, C4, C5}>
	    and C’ = {C1, C2, C3 }.
	    Calculate γC’(D) = 0.8.
	    We have γC’(D) = k. Then Reduct_Set= {{C1, C3, C4, C5},{C2, C3, C4, C5},
		{C1, C2, C3 }},
	    Candidate_Queue = < {C2, C3, C4}, {C1, C3, C4}, {C2, C3, C5}, {C1, C3, C5}, { C3, C4, C5},
	    {C1, C3 }, { C2, C3 }>,
	    and NonReduct_Set = {{C1, C2, C4, C5}}.
	End Loop
    Iteration 8: Candidate_Queue is not empty.
	Loop
	    Get an element of Candidate_Queue and assign it to C’.
	    Then we have Candidate_Queue = <{C1, C3, C4}, {C2, C3, C5}, {C1, C3, C5},
		  {C3, C4, C5},{C1, C3}, {C2, C3 }>,
	    and C’ = {C2, C3, C4}.
	    Calculate γC’(D) = 0.8.
	    We have γC’(D) = k. Then Reduct_Set= {{C1, C3, C4, C5}, {C1, C2, C3}, {C2, C3, C4}},
	    Candidate_Queue = < {C1, C3, C4}, {C2, C3, C5}, {C1, C3, C5}, {C3, C4, C5}, {C1, C3},
		  {C2, C3}, {C3, C4 }>,
	    and NonReduct_Set = {{C1, C2, C4, C5}}.
	End Loop
    Iteration 9: Candidate_Queue is not empty.
	Loop
	    Get an element of Candidate_Queue and assign it to C’.
	    Then we have Candidate_Queue = <{C2, C3, C5}, {C1, C3, C5}, {C3, C4, C5}, {C1, C3 },
		  {C2, C3}, {C3, C4}>,
	    and C’ = {C1, C3, C4}.
	    Calculate γC’(D) = 0.8.
	    We have γC’(D) = k. Then Reduct_Set= {{C1, C2, C3},{C2, C3, C4}, {C1, C3, C4}},
	    Candidate_Queue
	    = <{C2, C3, C5}, {C1, C3, C5},{C3, C4, C5}, {C1, C3}, {C2, C3},{C3, C4}>,
	    and NonReduct_Set = {{C1, C2, C4, C5}}.
	End Loop
    Iteration 10: Candidate_Queue is not empty.
	Loop
	    Get an element of Candidate_Queue and assign it to C’.
	    Then we have Candidate_Queue = <{C1, C3, C5}, {C3, C4, C5}, {C1, C3}, {C2, C3},
		  {C3, C4}>,
	    and C’ = {C2, C3, C5}.
	    Calculate γC’(D) = 0.8.
	    We have γC’(D) = k. Then Reduct_Set= {{C1, C2, C3}, {C2, C3, C4}, {C1, C3, C4},
	    {C2, C3, C5}},
	    Candidate_Queue = <{C1, C3, C5}, {C3, C4, C5}, {C1, C3 }, {C2, C3}, {C3, C4}, {C3, C5}>,
	    and NonReduct_Set = {{C1, C2, C4, C5}}.
	End Loop
    Iteration 11: Candidate_Queue is not empty
	Loop
	    Get an element of Candidate_Queue and assign it to C’.
	    Then we have Candidate_Queue = <{C3, C4, C5}, {C1, C3}, {C2, C3},{C3, C4}, {C3, C5}>,
	    and C’ = {C1, C3, C5}.
	    Calculate γC’(D) = 0.4.
	    We have γC’(D) < k. Then
	    Reduct_Set= {{C1, C2, C3}, {C2, C3, C4}, {C1, C3, C4}, {C2, C3, C5}},
	    Candidate_Queue = <{C3, C4, C5}, {C2, C3},{C3, C4}>,
	    and NonReduct_Set = {{C1, C2, C4, C5}, {C1, C3, C5}}.
	End Loop
    Iteration 12: Candidate_Queue is not empty.
	Loop
	    Get an element of Candidate_Queue and assign it to C’.
	    Then we have Candidate_Queue = <{C2, C3},{C3, C4}>
	    and C’ = {C3, C4, C5}.
	    Calculate γC’(D) = 0.6.
	    We have γC’(D) < k. Then
	    Reduct_Set= {{C1, C2, C3},{C2, C3, C4}, {C1, C3, C4}, {C2, C3, C5}},
	    Candidate_Queue = < {C2, C3}>,
	    and NonReduct_Set = {{C1, C2, C4, C5}, {C1, C3, C5}, { C3, C4, C5}}.
	End Loop
    Iteration 13: Candidate_Queue is not empty.
	Loop
	    Get an element of Candidate_Queue and assign it to C’.
	    Then we have Candidate_Queue = < >
	    and C’ = {C2, C3}.
	    Calculate γC’(D) = 0.8.
	    We have γC’(D) = k. Then, Reduct_Set = {{C1, C3, C4}, {C2, C3}},
	    Candidate_Queue = < >,
	    and NonReduct_Set = {{C1, C2, C4, C5}, {C1, C3, C5}, { C3, C4, C5}}.
	End Loop
Step 4:
    Return Reduct_Set = {{C1, C3, C4}, {C2, C3}}.
		
The algorithm obtains the result in 13 iterations. That is, only 13 of the 2^5 − 1 = 31 nonempty subsets are explored to obtain all reducts. Therefore, it is a feasible solution to the problem of finding all reducts of an information system.

References

  1. Pawlak, Z. Rough Sets: Theoretical Aspects of Reasoning about Data; Kluwer: Dordrecht, The Netherlands, 1991.
  2. Hu, X.H.; Cercone, N. Learning in relational databases: A rough set approach. Int. J. Comput. Intell. 1995, 11, 323–338.
  3. Miao, D.Q.; Hu, G.R. A heuristic algorithm for reduction of knowledge. J. Comput. Res. Dev. 1999, 36, 681–684.
  4. Qian, Y.H.; Liang, J.Y.; Pedrycz, W.; Dang, C.Y. Positive approximation: An accelerator for attribute reduction in rough set theory. Artif. Intell. 2010, 174, 595–618.
  5. Dai, J.H.; Hu, H.; Zheng, G.J.; Hu, Q.H.; Han, H.F.; Shi, H. Attribute reduction in interval-valued information systems based on information entropies. Front. Inf. Technol. Electron. Eng. 2016, 17, 919–928.
  6. Benouini, R.; Batioua, I.; Ezghari, S.; Zenkouar, K.; Zahi, A. Fast feature selection algorithm for neighborhood rough set model based on Bucket and Trie structures. Granul. Comput. 2019, 1–9.
  7. Chebrolu, S.; Sanjeevi, S.G. Attribute reduction on real-valued data in rough set theory using hybrid artificial bee colony: Extended FTSBPSD algorithm. Soft Comput. 2017, 21, 7543–7569.
  8. Chebrolu, S.; Sanjeevi, S.G. Attribute reduction in decision-theoretic rough set models using genetic algorithm. In Proceedings of the International Conference on Swarm, Evolutionary, and Memetic Computing (LNCS 7076), Visakhapatnam, India, 19–21 December 2011; pp. 307–314.
  9. Chebrolu, S.; Sanjeevi, S.G. Attribute reduction on continuous data in rough set theory using ant colony optimization metaheuristic. In Proceedings of the Third International Symposium on Women in Computing and Informatics, Kochi, India, 10–13 August 2015; pp. 17–24.
  10. Chebrolu, S.; Sanjeevi, S.G. Attribute reduction in decision-theoretic rough set model using particle swarm optimization with the threshold parameters determined using LMS training rule. Procedia Comput. Sci. 2015, 57, 527–536.
  11. Chen, Y.M.; Miao, D.Q.; Wang, R.Z. A rough set approach to feature selection based on ant colony optimization. Pattern Recognit. Lett. 2010, 31, 226–233.
  12. Min, F.; Zhang, Z.H.; Dong, J. Ant colony optimization with partial-complete searching for attribute reduction. J. Comput. Sci. 2018, 25, 170–182.
  13. Jia, X.Y.; Liao, W.H.; Tang, Z.M.; Shang, L. Minimum cost attribute reduction in decision-theoretic rough set models. Inf. Sci. 2013, 219, 151–167.
  14. Cheng, Y.; Zheng, Z.R.; Wang, J.; Yang, L.; Wan, S.H. Attribute reduction based on genetic algorithm for the coevolution of meteorological data in the industrial internet of things. Wirel. Commun. Mob. Comput. 2019, 2019, 3525347.
  15. Zhang, N.; Gao, X.Y.; Yu, T.Y. Heuristic approaches to attribute reduction for generalized decision preservation. Appl. Sci. 2019, 9, 2841.
  16. Starzyk, J.; Nelson, D.E.; Sturtz, K. Reduct generation in information systems. Bull. Int. Rough Set Soc. 1998, 3, 19–22.
  17. Chen, Y.; Miao, D.; Wang, R.; Wu, K. A rough set approach to feature selection based on power set tree. Knowl. Based Syst. 2011, 24, 275–281.
  18. Rezvan, M.T.; Hamadani, A.Z.; Hejazi, S.R. An exact feature selection algorithm based on rough set theory. Complexity 2015, 20, 50–62.
  19. Li, H.; Zhou, X.; Zhao, J.; Liu, D. Attribute reduction in decision-theoretic rough set model: A further investigation. In Proceedings of the Rough Sets and Knowledge Technology—6th International Conference, RSKT 2011, Banff, AB, Canada, 9–12 October 2011; pp. 466–475.
Figure 1. Efficient breadth-first reduct search algorithm.
Figure 2. updateReduct_Set(C’) procedure.
Figure 3. updateCandidate_Queue(C’) procedure.
Figure 4. updateNonReduct_Set(C’) procedure.
Table 1. Example dataset.

        C1   C2   C3   C4   C5   D
x1       1    2    1    3    1   2
x2       3    3    2    3    1   2
x3       3    3    2    3    1   1
x4       2    1    1    2    1   1
x5       2    2    1    1    1   2
x6       3    2    3    1    1   3
x7       2    2    3    1    1   3
x8       1    1    2    2    1   1
x9       3    3    3    1    1   3
x10      1    1    1    1    1   1
