Article

Least Squares in a Data Fusion Scenario via Aggregation Operators

by Gildson Queiroz de Jesus * and Eduardo Silva Palmeira
Postgraduate Program in Science and Technology Computational Modeling (PPGMC), Department of Exact Science and Technology (DCET), State University of Santa Cruz (UESC), Ilhéus 45662-900, BA, Brazil
* Author to whom correspondence should be addressed.
Axioms 2022, 11(12), 678; https://doi.org/10.3390/axioms11120678
Submission received: 11 October 2022 / Revised: 24 November 2022 / Accepted: 25 November 2022 / Published: 28 November 2022

Abstract: In this paper, appropriate least-squares methods were developed to operate in data fusion scenarios. These methods generate optimal estimates by combining measurements from a finite collection of samples. The aggregation operators of the average type, namely, ordered weighted averaging (OWA), Choquet integral, and mixture operators, were applied to formulate the optimization problem. Numerical examples about fitting curves to a given set of points are provided to show the effectiveness of the proposed algorithms.
MSC:
93E24; 03E72; 47S40; 94A16

1. Introduction

Several studies have been carried out on data science. Datasets play an important role in several areas of knowledge, since information can be extracted from them. This information can be used, for example, in decision making, product improvement, process automation, and trend forecasting [1,2,3].
A number of methods and algorithms have been developed in the literature to extract different information from datasets through mathematical and computational methods. In general, these algorithms were developed to model datasets collected from a single source. In this regard, few algorithms have been formulated to solve the problem in a data fusion scenario, that is, in a scenario where data comes from different sources [4].
The least-squares method (LSM) is a widely used technique for data modeling based on the minimization of a quadratic function [4,5,6,7,8,9]. LSM was initially conceived for modeling data from a single source. In [4], an LSM was developed considering a data fusion situation (LSM-DF), that is, a method considering data from different sources. LSM-DF was designed for weighted data fusion.
From a mathematical point of view, the LSM-DF is based on a weighted average of the lengths of the residual vectors of the equations $b_k = A_k x + v_k$, $k = 1, 2, \ldots, L$, expressed by
$$\sum_{k=1}^{L} \|v_k\|_{W_k}^{2} = \sum_{k=1}^{L} v_k^T W_k v_k,$$
where the $W_k$ are the weights; that is, the expression is an aggregation of $L$ values with their corresponding weightings. Here, a very interesting question arises: is weighted averaging the best method for aggregating the data in all scenarios? Within this context, the study of different aggregation methods has recently gained prominence.
Aggregation operators constitute a subarea of fuzzy theory that has the characteristic of combining finite datasets of the same nature into a single dataset [1,2,6,7,10,11,12,13,14,15,16,17,18,19]. These operators are basically classified into three categories: mean, conjunctive, and disjunctive. Applications of these operators can be found in medical problems, image processing, decision making, and engineering problems.
The weights $W_k$ are directly related to the lengths $\|v_k\|^2$ of the residual vectors. However, in some situations, it would be interesting to allocate the weights $W_k$ dynamically, putting more weight on the more important $\|v_k\|^2$ values. Thus, considering the above, aggregation operators can be considered to be a viable alternative for changing the behavior of the LSM-DF.
This study seeks to optimally combine the least-squares method and the aggregation operators of the average type, more specifically, the ordered weighted averaging (OWA) [3,20,21,22], Choquet integral [23,24], and mixture [25,26] operators. Furthermore, the aim of this study is to formulate and solve appropriate least-squares methods to model finite collections of datasets of the same nature. An important goal of these algorithms is to generate optimal estimates that aggregate data from different sources. This is necessary for situations involving systems that can operate under different failure conditions. A numerical example is presented to show the effectiveness of the proposed algorithms.
This paper is organized as follows: in Section 2, preliminary results on an admissible order for matrices, aggregation operators, and the LSM are presented. In Section 3, the LSM-DF via aggregation operators is deduced. In Section 4, a numerical example is shown.

2. Preliminaries

This section addresses topics that form the theoretical basis for the development of LSM-DF via aggregation operators. Initially, the admissible order for matrices is discussed, followed by the aggregation operators of the average type and the classical least-squares method.

2.1. Admissible Order for Matrices

In this section, we present the concept of an admissible order for matrices based on [2,16,27]. This is a special way to consider total orders on the set of all matrices of order $m \times n$ with entries in $\mathbb{R}$ (the set of real numbers), denoted by $\mathbb{R}^{m \times n}$.
Let $A, B \in \mathbb{R}^{m \times n}$. It is clear that $\leq_M$, given by
$$A \leq_M B \text{ if and only if } a_{ij} \leq b_{ij}, \ \forall i, j,$$
is a partial order on $\mathbb{R}^{m \times n}$.
Considering a matrix $A \in \mathbb{R}^{m \times n}$ as a vector of columns, i.e., $A = [A_1, A_2, \ldots, A_n]$, where the $A_i$ are the columns of $A$ ($i \in \{1, 2, \ldots, n\}$), then $\leq_M$ can equivalently be defined as
$$A \leq_M B \text{ if and only if } A_i \leq_M B_i, \ \forall i \in \{1, 2, \ldots, n\}.$$
One can extend that partial order for a total order by considering the concept of admissible order as follows.
Definition 1.
A total order $\preceq$ on $\mathbb{R}^{m \times n}$ is admissible if, for each $A, B \in \mathbb{R}^{m \times n}$, we have that $A \preceq B$ whenever $A \leq_M B$.
Example 1.
Let $A$ and $B$ be column matrices on $\mathbb{R}^{m \times 1}$ and $\pi_i(A) = a_{i1}$ the projection on the $i$-th line of $A$. Then,
$$A \prec_c B \iff \exists\, k \in \{1, 2, \ldots, m\} \text{ s.t. } \pi_k(A) < \pi_k(B) \text{ and } \forall i,\ 1 \leq i < k,\ \pi_i(A) = \pi_i(B)$$
is an admissible order.
Therefore, one can generalize an admissible order on $\mathbb{R}^{m \times n}$ by considering the following definition: let $A, B \in \mathbb{R}^{m \times n}$ be such that $A = [A_1, A_2, \ldots, A_n]$ and $B = [B_1, B_2, \ldots, B_n]$. Then
$$A \prec_M B \iff \exists\, k \in \{1, 2, \ldots, n\} \text{ s.t. } A_k \prec_c B_k \text{ and } \forall i,\ 1 \leq i < k,\ A_i = B_i$$
is an admissible order on $\mathbb{R}^{m \times n}$.
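For concreteness, the following is a minimal Python sketch (our own illustration, not part of the paper; the function names are ours) of the order $\prec_c$ of Example 1 and its column-by-column extension to matrices:
```python
import numpy as np

def precedes_c(a: np.ndarray, b: np.ndarray) -> bool:
    """Example 1: a strictly precedes b iff, at the first row where the
    column vectors differ, the entry of a is smaller."""
    for ai, bi in zip(a.ravel(), b.ravel()):
        if ai != bi:
            return ai < bi
    return False  # equal vectors are not strictly related

def precedes_M(A: np.ndarray, B: np.ndarray) -> bool:
    """Extension to R^{m x n}: compare the first pair of differing
    columns with precedes_c, as in the definition above."""
    for k in range(A.shape[1]):
        if not np.array_equal(A[:, k], B[:, k]):
            return precedes_c(A[:, k], B[:, k])
    return False  # equal matrices

A = np.array([[1.0, 2.0], [3.0, 4.0]])
B = np.array([[1.0, 2.0], [3.0, 5.0]])
print(precedes_M(A, B))  # True: decided by the second column
```
Any two distinct matrices are comparable under this relation, which is what makes it usable for the ordering step of the OWA and Choquet constructions below.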

2.2. Aggregation Operators

Aggregation operators are numeric operators that combine multiple input values into a single output value. In this data fusion process, operators aggregate data from different sources to obtain a single unit of data from the conducted analysis. Next, the operators used in this study are presented: OWA, Choquet integral, and mixture operators.
Definition 2
([12]). (OWA operator) Given an $n$-dimensional weight vector, that is, a vector $W = (w_1, w_2, \ldots, w_n)$ with $\sum_{k=1}^{n} w_k = 1$, the function $OWA_W : [0,1]^n \to [0,1]$ defined by
$$OWA_W(x_1, x_2, \ldots, x_n) = \sum_{k=1}^{n} w_k x_{(k)},$$
where $(x_{(1)}, x_{(2)}, \ldots, x_{(n)})$ is the descending ordering of the vector $(x_1, x_2, \ldots, x_n)$, is named an ordered weighted average function.
Example 2.
Define the weight vector $w = (w_1, w_2, \ldots, w_n)$, where $w_k = 1$ for some fixed $k \in \{1, 2, \ldots, n\}$ and $w_i = 0$ for all $i \neq k$. Then $OWA_w(x_1, \ldots, x_n) = x_{(k)}$, the $k$-th largest input, is the so-called static OWA operator.
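As a small illustration (our code, under the notation of Definition 2), the OWA function sorts its inputs in descending order before applying the weights:
```python
import numpy as np

def owa(weights, values):
    """OWA of Definition 2: weights applied to the values sorted
    in descending order."""
    w = np.asarray(weights, dtype=float)
    assert np.isclose(w.sum(), 1.0), "weights must sum to 1"
    x = np.sort(np.asarray(values, dtype=float))[::-1]  # descending
    return float(w @ x)

# Static OWA of Example 2: w = (0, 1, 0) picks the 2nd largest input.
print(owa([0.0, 1.0, 0.0], [0.3, 0.9, 0.5]))  # 0.5
print(owa([0.5, 0.3, 0.2], [0.3, 0.9, 0.5]))  # 0.66
```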
Remark 1.
As one can see in Definition 2, the sum of all the weights in the OWA aggregation is 1 ($\sum_{k=1}^{n} w_k = 1$). If the weights are matrices, the corresponding condition is $\sum_{k=1}^{L} \|W_k\|_1 = 1$, where $\|\cdot\|_1$ is the matrix norm given by
$$\|A\|_1 = \max_{1 \leq j \leq s} \sum_{i=1}^{r} |a_{ij}|, \quad \text{where } A \in \mathbb{R}^{r \times s}.$$
Remark 2.
The entries in the OWA aggregation must be sorted; if the entries are matrices, an ordering relation must be used over the set $\mathbb{R}^{m \times n}$. Thus, we can consider an admissible order on $\mathbb{R}^{m \times n}$ as in Definition 1.
The next definition is that of a discrete fuzzy measure, a concept needed for the definition of the Choquet integral operator.
Definition 3
([15]). A discrete fuzzy measure is a function $\mu : 2^N \to [0,1]$, where $N = \{1, 2, \ldots, n\}$ and $2^N$ is the power set of $N$, such that:
  • M1: $\mu(X) \leq \mu(Y)$ whenever $X \subseteq Y$;
  • M2: $\mu(\emptyset) = 0$ and $\mu(N) = 1$.
Definition 4
([10]). (Choquet integral operator) Let $\mu : 2^N \to [0,1]$ be a discrete fuzzy measure. The discrete Choquet integral related to the measure $\mu$ is the function $C_\mu : [0,1]^n \to [0,1]$ defined by
$$C_\mu(x_1, x_2, \ldots, x_n) = \sum_{k=1}^{n} x_{[k]} \left[ \mu(\{ j \in N : x_j \geq x_{[k]} \}) - \mu(\{ j \in N : x_j \geq x_{[k+1]} \}) \right],$$
where $(x_{[1]}, x_{[2]}, \ldots, x_{[n]}) = \mathrm{Sort}(x_1, x_2, \ldots, x_n)$ is an ascending ordering of the vector $(x_1, x_2, \ldots, x_n)$ and $x_{[n+1]} = 2$ by convention.
The Choquet integral operator can also be calculated with the following simplified expression:
$$C_\mu(x_1, x_2, \ldots, x_n) = \sum_{k=1}^{n} \left( x_{[k]} - x_{[k-1]} \right) \mu(G_k),$$
where $x_{[0]} = 0$ and $G_k = \{ [k], [k+1], \ldots, [n] \}$.
Example 3.
Consider the discrete fuzzy measure
$$\mu(X) = \begin{cases} 1, & \text{if } X = N \\ 0, & \text{otherwise.} \end{cases}$$
Thus, the corresponding Choquet integral is given by
$$C_\mu(x_1, x_2, \ldots, x_n) = [x_{[1]} - x_{[0]}]\,\mu(G_1) + \cdots + [x_{[n]} - x_{[n-1]}]\,\mu(G_n).$$
Since $\mu(G_1) = 1$ and $\mu(G_i) = 0$ for the other values of $i$, the result is $C_\mu(x_1, x_2, \ldots, x_n) = x_{[1]} = \min(x_1, x_2, \ldots, x_n)$.
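A short sketch (ours) of the simplified expression above, with the fuzzy measure passed as a function on index sets; run with the measure of Example 3 it returns the minimum:
```python
import numpy as np

def choquet(values, mu):
    """Discrete Choquet integral via the telescoping form:
    sum over k of (x_[k] - x_[k-1]) * mu(G_k), with x_[0] = 0."""
    x = np.asarray(values, dtype=float)
    order = np.argsort(x)              # ascending ordering [1],...,[n]
    total, prev = 0.0, 0.0
    for k in range(len(order)):
        g_k = frozenset(order[k:])     # G_k = {[k], [k+1], ..., [n]}
        total += (x[order[k]] - prev) * mu(g_k)
        prev = x[order[k]]
    return total

# Measure of Example 3: mu(X) = 1 iff X = N, so the integral is min(x).
n = 3
mu = lambda s: 1.0 if s == frozenset(range(n)) else 0.0
print(choquet([0.7, 0.2, 0.5], mu))  # 0.2
```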
Definition 5
([15]). (Mixture operator) Let $w_1, w_2, \ldots, w_n : [0,1] \to [0, +\infty)$ be functions called weight functions. The function $MIX_{w_1, w_2, \ldots, w_n} : [0,1]^n \to [0,1]$ defined by
$$MIX_{w_1, w_2, \ldots, w_n}(x_1, x_2, \ldots, x_n) = \frac{\sum_{k=1}^{n} w_k(x_k)\, x_k}{\sum_{k=1}^{n} w_k(x_k)}$$
is called the mixture function associated with the weight functions $w_1, w_2, \ldots, w_n$.
Example 4.
Define
$$w_i(x_i) = \begin{cases} \frac{1}{n}, & \text{if } x_i = 0 \\ x_i, & \text{otherwise.} \end{cases}$$
For simplicity, consider $n = 3$. In this case,
$$MIX_{w_1, w_2, w_3}(x_1, x_2, x_3) = \begin{cases} 0, & \text{if } x_1 = x_2 = x_3 = 0 \\ \dfrac{x_1^2 + x_2^2 + x_3^2}{x_1 + x_2 + x_3}, & \text{otherwise} \end{cases}$$
is the mixture function determined by the $w_i$ weight functions defined above.
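A minimal sketch (ours) of Definition 5 with the weight functions of Example 4; unlike the OWA, the weights here depend on the inputs themselves, so the aggregation adapts to the data:
```python
import numpy as np

def mixture(weight_fns, values):
    """Mixture operator of Definition 5: an input-dependent
    weighted mean, normalized by the sum of the weights."""
    x = np.asarray(values, dtype=float)
    w = np.array([f(xi) for f, xi in zip(weight_fns, x)])
    return float((w @ x) / w.sum())

# Weight functions of Example 4 with n = 3: w_i(x_i) = 1/n if x_i = 0,
# and w_i(x_i) = x_i otherwise.
n = 3
w_i = lambda xi: 1.0 / n if xi == 0 else xi
print(mixture([w_i] * n, [0.2, 0.4, 0.6]))  # (0.04+0.16+0.36)/1.2 ≈ 0.4667
```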

2.3. Least-Squares Method

LSM is a widely known and applied mathematical optimization method used to solve several problems, including parameter estimation. This method consists of finding an optimal solution to the problem by minimizing the square of a residual vector.
Consider the equation
$$b = Ax + v, \qquad (6)$$
where $x \in \mathbb{R}^{n \times 1}$ is an unknown vector, $A \in \mathbb{R}^{N \times n}$ is a known parameter matrix, $b \in \mathbb{R}^{N \times 1}$ is a known vector, and $v \in \mathbb{R}^{N \times 1}$ is a vector named the residual.
The least-squares problem is to find a solution $\hat{x}$ that minimizes the length of the residual vector, that is, one satisfying the following property:
$$\|b - A\hat{x}\|^2 \leq \|b - Ax\|^2 \qquad (7)$$
for all $x \in \mathbb{R}^{n \times 1}$, where $\|\cdot\|^2$ denotes the square of the Euclidean norm
$$\|v\|^2 = v^T v. \qquad (8)$$
Therefore, the solution to the least-squares problem consists of solving the optimization problem
$$\min_x J(x), \qquad (9)$$
where the cost function $J(x)$ is given by
$$J(x) = \|b - Ax\|^2 = (b - Ax)^T (b - Ax). \qquad (10)$$
Theorem 1
([4]). (Least-Squares Method) If matrix $A$ has full rank, then there is a unique optimal solution $\hat{x}$ for least-squares Problem (9), given by
$$\hat{x} = (A^T A)^{-1} A^T b. \qquad (11)$$
Moreover, the resulting minimal value of the cost function can be written as
$$J(\hat{x}) = b^T b - b^T A (A^T A)^{-1} A^T b. \qquad (12)$$
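Numerically, the closed form of Theorem 1 can be checked against a library solver; the following sketch (our code, with synthetic data) verifies both (11) and the minimal cost (12):
```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.normal(size=(20, 3))                  # full-rank N x n matrix
x_true = np.array([1.0, -2.0, 0.5])
b = A @ x_true + 0.01 * rng.normal(size=20)   # b = A x + v

# Theorem 1: x_hat = (A^T A)^{-1} A^T b (solve the normal equations
# rather than forming the inverse explicitly)
x_hat = np.linalg.solve(A.T @ A, A.T @ b)
print(np.allclose(x_hat, np.linalg.lstsq(A, b, rcond=None)[0]))  # True

# Minimal cost (12): J = b^T b - b^T A (A^T A)^{-1} A^T b
J_min = b @ b - b @ A @ x_hat
print(np.isclose(J_min, np.sum((b - A @ x_hat) ** 2)))           # True
```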

3. LSM-DF via Aggregation Operators

In this section, the LSM-DF is developed via aggregation operators: the LSM-DF via an OWA operator, the LSM-DF via a Choquet integral operator, and the LSM-DF via a mixture operator are presented. These LSM-DFs are an alternative for estimation problems in the case of several data sources.
The next result is necessary for the proofs of the LSM-DF via aggregation operators.
Lemma 1.
If the matrices $A_k$ have full rank and the matrices $W_k$ are symmetric positive-definite for $k = 1, 2, \ldots, L$, then $\bar{A}^T \bar{W} \bar{A}$, where
$$\bar{A} = \begin{bmatrix} A_1 \\ A_2 \\ \vdots \\ A_L \end{bmatrix}, \quad \bar{W} = \begin{bmatrix} W_1 & 0 & \cdots & 0 \\ 0 & W_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & W_L \end{bmatrix}, \qquad (13)$$
is nonsingular.
Proof.
Suppose that $\bar{A}^T \bar{W} \bar{A}$ is singular; then, there must exist a nonzero vector $\lambda$ such that $\bar{A}^T \bar{W} \bar{A} \lambda = 0$, which implies that $\lambda^T \bar{A}^T \bar{W} \bar{A} \lambda = 0$, i.e.,
$$\lambda^T \begin{bmatrix} A_1 \\ A_2 \\ \vdots \\ A_L \end{bmatrix}^T \begin{bmatrix} W_1 & 0 & \cdots & 0 \\ 0 & W_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & W_L \end{bmatrix} \begin{bmatrix} A_1 \\ A_2 \\ \vdots \\ A_L \end{bmatrix} \lambda = 0 \qquad (14)$$
$$\lambda^T A_1^T W_1 A_1 \lambda + \lambda^T A_2^T W_2 A_2 \lambda + \cdots + \lambda^T A_L^T W_L A_L \lambda = 0. \qquad (15)$$
(15) can be rewritten as
$$\|A_1 \lambda\|_{W_1}^2 + \|A_2 \lambda\|_{W_2}^2 + \cdots + \|A_L \lambda\|_{W_L}^2 = 0, \qquad (16)$$
where $\|\cdot\|_W^2$ denotes the square of the weighted Euclidean norm
$$\|v\|_W^2 = v^T W v. \qquad (17)$$
As the matrices $W_k$ are symmetric positive-definite, it follows from (16) that $\|A_k \lambda\|_{W_k}^2 = 0$, so that $A_k \lambda = 0$ for $k = 1, 2, \ldots, L$. This, in turn, means that the columns of $A_k$ are linearly dependent; hence, $A_k$ is not full-rank, which contradicts the hypothesis. □

3.1. LSM-DF via OWA Operator

For the deduction of the LSM-DF via the OWA operator, the following equations should be considered:
$$b^{(k)} = A^{(k)} x + v^{(k)}, \quad k = 1, 2, \ldots, L, \qquad (18)$$
where $x \in \mathbb{R}^{n \times 1}$ is an unknown vector, the $A^{(k)} \in \mathbb{R}^{N \times n}$ are known parameter matrices, the $b^{(k)} \in \mathbb{R}^{N \times 1}$ are known vectors, and the $v^{(k)} \in \mathbb{R}^{N \times 1}$ are vectors named residuals.
A solution $\hat{x}$ to the least-squares problem via the OWA operator must minimize the length of the residual vectors, that is, it must satisfy the following property:
$$\sum_{k=1}^{L} \|b^{(k)} - A^{(k)} \hat{x}\|_{W_k}^2 \leq \sum_{k=1}^{L} \|b^{(k)} - A^{(k)} x\|_{W_k}^2 \qquad (19)$$
for all $x \in \mathbb{R}^{n \times 1}$, where the $W_k$ are symmetric positive-definite matrices.
The optimal solution $\hat{x}$ is found by solving the following minimization problem:
$$\min_x J_{OWA}(x). \qquad (20)$$
The functional $J_{OWA}(x)$ can be defined as
$$J_{OWA}(x) := OWA_W(J_1(x), J_2(x), \ldots, J_L(x)), \qquad (21)$$
where $W = (W_1, W_2, \ldots, W_L)$ are weight matrices and
$$J_k(x) := \|v^{(k)}\|^2 = \|b^{(k)} - A^{(k)} x\|^2, \quad k = 1, 2, \ldots, L. \qquad (22)$$
Therefore, by the definition of the OWA operator, Function (21) can be rewritten as
$$J_{OWA}(x) = \sum_{k=1}^{L} \|b^{(k)} - A^{(k)} x\|_{W_k}^2 = \sum_{k=1}^{L} (b^{(k)} - A^{(k)} x)^T W_k (b^{(k)} - A^{(k)} x). \qquad (23)$$
The next theorem gives the solution to the least-squares problem via the OWA operator in (20).
Theorem 2.
(LSM-DF via OWA Operator) If the matrices $A^{(k)}$, $k = 1, 2, \ldots, L$, have full rank and the $W_k$ are symmetric positive-definite matrices, then there is a unique optimal solution $\hat{x}$ to the least-squares problem via the OWA operator (LSM-DF via OWA operator), given by:
$$\hat{x} = \left( \sum_{k=1}^{L} A^{(k)T} W_k A^{(k)} \right)^{-1} \sum_{k=1}^{L} A^{(k)T} W_k b^{(k)}. \qquad (24)$$
The corresponding minimal value of $J_{OWA}(x)$ is
$$J_{OWA}(\hat{x}) = \sum_{k=1}^{L} b^{(k)T} W_k b^{(k)} - \sum_{k=1}^{L} b^{(k)T} W_k A^{(k)} \left( \sum_{k=1}^{L} A^{(k)T} W_k A^{(k)} \right)^{-1} \sum_{k=1}^{L} A^{(k)T} W_k b^{(k)}. \qquad (25)$$
Proof.
Consider the cost function
$$J_{OWA}(x) = \sum_{k=1}^{L} (b^{(k)} - A^{(k)} x)^T W_k (b^{(k)} - A^{(k)} x) \qquad (26)$$
$$J_{OWA}(x) = (b^{(1)} - A^{(1)} x)^T W_1 (b^{(1)} - A^{(1)} x) + (b^{(2)} - A^{(2)} x)^T W_2 (b^{(2)} - A^{(2)} x) + \cdots + (b^{(L)} - A^{(L)} x)^T W_L (b^{(L)} - A^{(L)} x) \qquad (27)$$
$$J_{OWA}(x) = \begin{bmatrix} b^{(1)} - A^{(1)} x \\ b^{(2)} - A^{(2)} x \\ \vdots \\ b^{(L)} - A^{(L)} x \end{bmatrix}^T \begin{bmatrix} W_1 & 0 & \cdots & 0 \\ 0 & W_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & W_L \end{bmatrix} \begin{bmatrix} b^{(1)} - A^{(1)} x \\ b^{(2)} - A^{(2)} x \\ \vdots \\ b^{(L)} - A^{(L)} x \end{bmatrix} \qquad (28)$$
$$J_{OWA}(x) = \left( \begin{bmatrix} b^{(1)} \\ b^{(2)} \\ \vdots \\ b^{(L)} \end{bmatrix} - \begin{bmatrix} A^{(1)} \\ A^{(2)} \\ \vdots \\ A^{(L)} \end{bmatrix} x \right)^T \begin{bmatrix} W_1 & 0 & \cdots & 0 \\ 0 & W_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & W_L \end{bmatrix} \left( \begin{bmatrix} b^{(1)} \\ b^{(2)} \\ \vdots \\ b^{(L)} \end{bmatrix} - \begin{bmatrix} A^{(1)} \\ A^{(2)} \\ \vdots \\ A^{(L)} \end{bmatrix} x \right), \qquad (29)$$
which can be rewritten in matrix form as
$$J_{OWA}(x) = (\bar{b} - \bar{A} x)^T \bar{W} (\bar{b} - \bar{A} x), \qquad (30)$$
where
$$\bar{A} = \begin{bmatrix} A^{(1)} \\ A^{(2)} \\ \vdots \\ A^{(L)} \end{bmatrix}, \quad \bar{b} = \begin{bmatrix} b^{(1)} \\ b^{(2)} \\ \vdots \\ b^{(L)} \end{bmatrix}, \quad \bar{W} = \begin{bmatrix} W_1 & 0 & \cdots & 0 \\ 0 & W_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & W_L \end{bmatrix}. \qquad (31)$$
The entries $(A^{(1)}, A^{(2)}, \ldots, A^{(L)})$ and $(b^{(1)}, b^{(2)}, \ldots, b^{(L)})$ are the descending orderings of $(A_1, A_2, \ldots, A_L)$ and $(b_1, b_2, \ldots, b_L)$, respectively, and $\bar{W}$ is a block-diagonal symmetric positive-definite matrix with blocks $W_k$.
To find the critical point in $x$, $J_{OWA}(x)$ must be differentiated and set equal to zero:
$$\frac{\partial}{\partial x} \left[ x^T \bar{A}^T \bar{W} \bar{A} x - x^T \bar{A}^T \bar{W} \bar{b} - \bar{b}^T \bar{W} \bar{A} x + \bar{b}^T \bar{W} \bar{b} \right] = 0 \implies x^T \bar{A}^T \bar{W} \bar{A} - \bar{b}^T \bar{W} \bar{A} = 0. \qquad (32)$$
By Lemma 1, the matrix $\bar{A}^T \bar{W} \bar{A}$ is invertible. Therefore,
$$\hat{x} = (\bar{A}^T \bar{W} \bar{A})^{-1} \bar{A}^T \bar{W} \bar{b}. \qquad (33)$$
Replacing (31) into (33), the solution can be rewritten as
$$\hat{x} = \left( \sum_{k=1}^{L} A^{(k)T} W_k A^{(k)} \right)^{-1} \sum_{k=1}^{L} A^{(k)T} W_k b^{(k)}. \qquad (34)$$
Moreover, since the Hessian matrix is positive-definite,
$$\frac{\partial^2 J_{OWA}(x)}{\partial x^T \partial x} = \bar{A}^T \bar{W} \bar{A} = \sum_{k=1}^{L} A^{(k)T} W_k A^{(k)} > 0, \qquad (35)$$
$J_{OWA}(x)$ in (30) is a strictly convex function; therefore, $\hat{x}$ is the unique global minimum.
The minimal cost $J_{OWA}(\hat{x})$ can be expressed as
$$J_{OWA}(\hat{x}) = \sum_{k=1}^{L} \|b^{(k)} - A^{(k)} \hat{x}\|_{W_k}^2 = (\bar{b} - \bar{A} \hat{x})^T \bar{W} (\bar{b} - \bar{A} \hat{x}) = \bar{b}^T \bar{W} \bar{b} - \bar{b}^T \bar{W} \bar{A} \hat{x} - \hat{x}^T \bar{A}^T \bar{W} \bar{b} + \hat{x}^T \bar{A}^T \bar{W} \bar{A} \hat{x}. \qquad (36)$$
Replacing (33) into (36) results in
$$J_{OWA}(\hat{x}) = \bar{b}^T \bar{W} \bar{b} - \bar{b}^T \bar{W} \bar{A} (\bar{A}^T \bar{W} \bar{A})^{-1} \bar{A}^T \bar{W} \bar{b}. \qquad (37)$$
Replacing (31) into (37), the optimal cost can be rewritten as
$$J_{OWA}(\hat{x}) = \sum_{k=1}^{L} b^{(k)T} W_k b^{(k)} - \sum_{k=1}^{L} b^{(k)T} W_k A^{(k)} \left( \sum_{k=1}^{L} A^{(k)T} W_k A^{(k)} \right)^{-1} \sum_{k=1}^{L} A^{(k)T} W_k b^{(k)}. \qquad (38)$$
□
Remark 3.
Taking $L = 1$ in Theorem 2, the LSM-DF via OWA operator reduces to the classical LSM in Theorem 1.
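As a numerical sketch of Theorem 2 (our illustration, with made-up data), the fused solution (24) is assembled from per-source normal equations; the descending ordering of the pairs $(A^{(k)}, b^{(k)})$ is taken as given here, in practice obtained from an admissible order as in Section 2.1:
```python
import numpy as np

def lsmdf_owa(A_list, b_list, W_list):
    """LSM-DF via OWA operator, Equation (24). The sources are assumed
    to be supplied already in descending order; the W_k are symmetric
    positive-definite weight matrices."""
    M = sum(A.T @ W @ A for A, W in zip(A_list, W_list))
    r = sum(A.T @ W @ b for A, b, W in zip(A_list, b_list, W_list))
    return np.linalg.solve(M, r)

# Two hypothetical sources observing the same line y = 2 + 0.5 t
t = np.array([1.0, 2.0, 3.0, 4.0])
A1 = np.column_stack([np.ones_like(t), t])
A2 = A1.copy()
b1 = 2.0 + 0.5 * t          # unbiased source
b2 = 2.2 + 0.5 * t          # source with a small offset
W1, W2 = 0.7 * np.eye(4), 0.3 * np.eye(4)  # more weight on source 1
print(lsmdf_owa([A1, A2], [b1, b2], [W1, W2]))  # [2.06, 0.5]
```
With identical design matrices and identity-shaped weights, the fused intercept is the 0.7/0.3 weighted average of the two sources' intercepts, as expected.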

3.2. LSM-DF via Choquet Integral Operator

The deduction of the LSM-DF via the Choquet integral operator follows from the equations
$$b^{[k]} = A^{[k]} x + v^{[k]}, \quad k = 1, 2, \ldots, L, \qquad (39)$$
where $x \in \mathbb{R}^{n \times 1}$ is an unknown vector, the $A^{[k]} \in \mathbb{R}^{N \times n}$ are known parameter matrices, the $b^{[k]} \in \mathbb{R}^{N \times 1}$ are known vectors, and the $v^{[k]} \in \mathbb{R}^{N \times 1}$ are vectors named residuals.
A solution $\hat{x}$ to the least-squares problem via the Choquet integral operator must minimize the length of the residual vectors, that is, it must satisfy the following property:
$$\sum_{k=1}^{L} \|b^{[k]} - A^{[k]} \hat{x}\|_{I\mu(G_k)}^2 \leq \sum_{k=1}^{L} \|b^{[k]} - A^{[k]} x\|_{I\mu(G_k)}^2 \qquad (40)$$
for all $x \in \mathbb{R}^{n \times 1}$, where $I\mu(G_k)$ is the identity matrix multiplied by the discrete fuzzy measure $\mu(G_k)$.
The optimal solution $\hat{x}$ is found by solving the following minimization problem:
$$\min_x J_{C_\mu}(x). \qquad (41)$$
The functional $J_{C_\mu}(x)$ can be defined as
$$J_{C_\mu}(x) := C_\mu(\bar{J}_1(x), \bar{J}_2(x), \ldots, \bar{J}_L(x)), \qquad (42)$$
where
$$\bar{J}_k(x) := \|v^{[k]} - v^{[k-1]}\|^2 = \|(b^{[k]} - A^{[k]} x) - (b^{[k-1]} - A^{[k-1]} x)\|^2 = \|b^{[k]} - b^{[k-1]} - (A^{[k]} - A^{[k-1]}) x\|^2, \quad k = 1, 2, \ldots, L. \qquad (43)$$
Therefore, by the definition of the Choquet integral operator, Function (42) can be rewritten as
$$J_{C_\mu}(x) = \sum_{k=1}^{L} \|b^{[k]} - b^{[k-1]} - (A^{[k]} - A^{[k-1]}) x\|_{I\mu(G_k)}^2 = \sum_{k=1}^{L} \left( b^{[k]} - b^{[k-1]} - (A^{[k]} - A^{[k-1]}) x \right)^T I\mu(G_k) \left( b^{[k]} - b^{[k-1]} - (A^{[k]} - A^{[k-1]}) x \right), \qquad (44)$$
where $I\mu(G_k)$ is a symmetric positive-definite matrix.
The next theorem gives the solution to the least-squares problem via the Choquet integral operator in (41).
Theorem 3.
(LSM-DF via Choquet Integral Operator) If the matrices $A^{[k]} - A^{[k-1]}$, $k = 1, 2, \ldots, L$, have full rank and the $I\mu(G_k)$ are symmetric positive-definite matrices, then there is a unique optimal solution $\hat{x}$ for the least-squares problem via the Choquet integral operator (LSM-DF via Choquet integral operator), given by:
$$\hat{x} = \left( \sum_{k=1}^{L} (A^{[k]} - A^{[k-1]})^T I\mu(G_k) (A^{[k]} - A^{[k-1]}) \right)^{-1} \sum_{k=1}^{L} (A^{[k]} - A^{[k-1]})^T I\mu(G_k) (b^{[k]} - b^{[k-1]}). \qquad (45)$$
The corresponding minimal value of $J_{C_\mu}(x)$ is
$$J_{C_\mu}(\hat{x}) = \sum_{k=1}^{L} (b^{[k]} - b^{[k-1]})^T I\mu(G_k) (b^{[k]} - b^{[k-1]}) - \sum_{k=1}^{L} (b^{[k]} - b^{[k-1]})^T I\mu(G_k) (A^{[k]} - A^{[k-1]}) \left( \sum_{k=1}^{L} (A^{[k]} - A^{[k-1]})^T I\mu(G_k) (A^{[k]} - A^{[k-1]}) \right)^{-1} \sum_{k=1}^{L} (A^{[k]} - A^{[k-1]})^T I\mu(G_k) (b^{[k]} - b^{[k-1]}). \qquad (46)$$
Proof.
Consider the cost function
$$J_{C_\mu}(x) = \sum_{k=1}^{L} \left( b^{[k]} - b^{[k-1]} - (A^{[k]} - A^{[k-1]}) x \right)^T I\mu(G_k) \left( b^{[k]} - b^{[k-1]} - (A^{[k]} - A^{[k-1]}) x \right). \qquad (47)$$
Using stacked matrices, this can be rewritten as
$$J_{C_\mu}(x) = (b - Ax)^T W (b - Ax), \qquad (48)$$
where
$$A = \begin{bmatrix} A^{[1]} - A^{[0]} \\ A^{[2]} - A^{[1]} \\ \vdots \\ A^{[L]} - A^{[L-1]} \end{bmatrix}, \quad b = \begin{bmatrix} b^{[1]} - b^{[0]} \\ b^{[2]} - b^{[1]} \\ \vdots \\ b^{[L]} - b^{[L-1]} \end{bmatrix}, \quad W = \begin{bmatrix} I\mu(G_1) & 0 & \cdots & 0 \\ 0 & I\mu(G_2) & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & I\mu(G_L) \end{bmatrix}. \qquad (49)$$
The entries $(A^{[1]}, A^{[2]}, \ldots, A^{[L]})$ and $(b^{[1]}, b^{[2]}, \ldots, b^{[L]})$ are the ascending orderings of $(A_1, A_2, \ldots, A_L)$ and $(b_1, b_2, \ldots, b_L)$, respectively, and $W$ is a block-diagonal symmetric positive-definite matrix with blocks $I\mu(G_k)$.
On the basis of Function (48) and the solution of the LSM-DF via the OWA operator presented in Theorem 2, the solution to Optimization Problem (41) is given by
$$\hat{x} = (A^T W A)^{-1} A^T W b, \qquad (50)$$
which, through Matrices (49), can be rewritten as
$$\hat{x} = \left( \sum_{k=1}^{L} (A^{[k]} - A^{[k-1]})^T I\mu(G_k) (A^{[k]} - A^{[k-1]}) \right)^{-1} \sum_{k=1}^{L} (A^{[k]} - A^{[k-1]})^T I\mu(G_k) (b^{[k]} - b^{[k-1]}). \qquad (51)$$
Similarly to the procedure performed in Theorem 2, the minimal cost $J_{C_\mu}(\hat{x})$ can be expressed as
$$J_{C_\mu}(\hat{x}) = b^T W b - b^T W A (A^T W A)^{-1} A^T W b. \qquad (52)$$
Replacing (49) into (52), the optimal cost can be rewritten as
$$J_{C_\mu}(\hat{x}) = \sum_{k=1}^{L} (b^{[k]} - b^{[k-1]})^T I\mu(G_k) (b^{[k]} - b^{[k-1]}) - \sum_{k=1}^{L} (b^{[k]} - b^{[k-1]})^T I\mu(G_k) (A^{[k]} - A^{[k-1]}) \left( \sum_{k=1}^{L} (A^{[k]} - A^{[k-1]})^T I\mu(G_k) (A^{[k]} - A^{[k-1]}) \right)^{-1} \sum_{k=1}^{L} (A^{[k]} - A^{[k-1]})^T I\mu(G_k) (b^{[k]} - b^{[k-1]}). \qquad (53)$$
□
Remark 4.
$A^{[0]}$ is the null matrix and $b^{[0]}$ is the null vector by convention.
Remark 5.
Taking $L = 1$ in Theorem 3, the LSM-DF via Choquet integral operator reduces to the classical LSM in Theorem 1.
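Theorem 3 can be sketched analogously (our illustration): consecutive differences of the ordered sources are weighted by $I\mu(G_k)$. With the measure of Example 3, only the $k = 1$ term survives, so the estimate is driven by the first ordered source (note that this extreme measure makes some $I\mu(G_k)$ only positive semi-definite, so it is used purely for illustration):
```python
import numpy as np

def lsmdf_choquet(A_list, b_list, mu_values):
    """LSM-DF via Choquet integral operator, Equation (45). Sources are
    assumed already in ascending order; mu_values[k-1] = mu(G_k).
    A^[0] and b^[0] are null by convention (Remark 4)."""
    n = A_list[0].shape[1]
    M, r = np.zeros((n, n)), np.zeros(n)
    A_prev, b_prev = np.zeros_like(A_list[0]), np.zeros_like(b_list[0])
    for A, b, m in zip(A_list, b_list, mu_values):
        dA, db = A - A_prev, b - b_prev
        M += m * (dA.T @ dA)     # I mu(G_k) = mu(G_k) times the identity
        r += m * (dA.T @ db)
        A_prev, b_prev = A, b
    return np.linalg.solve(M, r)

t = np.array([1.0, 2.0, 3.0, 4.0])
A1 = np.column_stack([np.ones_like(t), t]); b1 = 2.0 + 0.5 * t
A2, b2 = A1.copy(), 2.2 + 0.5 * t
print(lsmdf_choquet([A1, A2], [b1, b2], [1.0, 0.0]))  # ~[2.0, 0.5]
```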

3.3. LSM-DF via Mixture Operator

For the deduction of the LSM-DF via the mixture operator, it is necessary to adapt the mixture operator presented in Definition 5.
The weight functions, which are dynamic in the mixture operator, are here calculated beforehand and treated as constant (static) weight functions. Thus, the adapted mixture operator is calculated in two steps: in the first step, the weights are calculated and fixed; in the next step, the aggregation is carried out. The next definition presents the adapted mixture operator.
Definition 6.
(Adapted Mixture Operator) The adapted MIX function can be calculated using the following steps:
  • Step 1: the weight functions $w_k(x_k)$, $k = 1, 2, \ldots, n$, are calculated and fixed as follows:
$$w_1(x_1) = w_1, \ w_2(x_2) = w_2, \ \ldots, \ w_n(x_n) = w_n. \qquad (54)$$
  • Step 2: with the fixed weight functions, the MIX function is calculated as follows:
$$MIX_{w_1, w_2, \ldots, w_n}(x_1, x_2, \ldots, x_n) = \frac{\sum_{k=1}^{n} w_k x_k}{\sum_{k=1}^{n} w_k}. \qquad (55)$$
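A tiny sketch (ours) of the two-step scheme: the weights are evaluated once on the data and then frozen, so the operator reduces to a normalized weighted mean. Because the normalizer is then a positive constant, dividing the cost by it cannot move the minimizer, which is what Theorem 4 below exploits.
```python
import numpy as np

def adapted_mixture(weights, values):
    """Adapted mixture operator (Definition 6): Step 1 has already fixed
    the weights; Step 2 aggregates with the now-static weights."""
    w = np.asarray(weights, dtype=float)
    x = np.asarray(values, dtype=float)
    return float((w @ x) / w.sum())

print(adapted_mixture([0.7, 0.3], [2.0, 4.0]))  # (1.4 + 1.2) / 1.0 = 2.6
```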
Now, the LSM-DF via the mixture operator can be deduced. The following equations must be considered:
$$b_k = A_k x + v_k, \quad k = 1, 2, \ldots, L, \qquad (56)$$
where $x \in \mathbb{R}^{n \times 1}$ is an unknown vector, the $A_k \in \mathbb{R}^{N \times n}$ are known parameter matrices, the $b_k \in \mathbb{R}^{N \times 1}$ are known vectors, and the $v_k \in \mathbb{R}^{N \times 1}$ are vectors named residuals.
A solution to the least-squares problem via the mixture operator must minimize the length of the residual vectors, that is, it must satisfy the following property:
$$\frac{\sum_{k=1}^{L} \|b_k - A_k \hat{x}\|_{W_k}^2}{\sum_{k=1}^{L} \|W_k\|^2} \leq \frac{\sum_{k=1}^{L} \|b_k - A_k x\|_{W_k}^2}{\sum_{k=1}^{L} \|W_k\|^2} \qquad (57)$$
for all $x \in \mathbb{R}^{n \times 1}$, where the $W_k$ are symmetric positive-definite matrices.
The optimal solution $\hat{x}$ is found by solving the following minimization problem:
$$\min_x J_{MIX}(x). \qquad (58)$$
The functional $J_{MIX}(x)$ can be defined as
$$J_{MIX}(x) := MIX_{W_1, W_2, \ldots, W_L}(\underline{J}_1(x), \underline{J}_2(x), \ldots, \underline{J}_L(x)), \qquad (59)$$
where
$$\underline{J}_k(x) := \|v_k\|^2 = \|b_k - A_k x\|^2, \quad k = 1, 2, \ldots, L. \qquad (60)$$
By the definition of Mixture Operator (59), the function can be rewritten as
$$J_{MIX}(x) = \frac{\sum_{k=1}^{L} \|b_k - A_k x\|_{W_k}^2}{\sum_{k=1}^{L} \|W_k\|^2} = \frac{\sum_{k=1}^{L} (b_k - A_k x)^T W_k (b_k - A_k x)}{\sum_{k=1}^{L} \|W_k\|^2}. \qquad (61)$$
The next theorem gives the solution to the least-squares problem via the mixture operator in (58).
Theorem 4.
(LSM-DF via Mixture Operator) If the matrices $A_k$, $k = 1, 2, \ldots, L$, have full rank and the $W_k$ are symmetric positive-definite matrices, then there is a unique optimal solution $\hat{x}$ to the least-squares problem via the mixture operator (LSM-DF via mixture operator) (58), given by:
$$\hat{x} = \left( \sum_{k=1}^{L} A_k^T W_k A_k \right)^{-1} \sum_{k=1}^{L} A_k^T W_k b_k. \qquad (62)$$
The corresponding minimal value of $J_{MIX}(x)$ is
$$J_{MIX}(\hat{x}) = \sum_{k=1}^{L} b_k^T W_k b_k - \sum_{k=1}^{L} b_k^T W_k A_k \left( \sum_{k=1}^{L} A_k^T W_k A_k \right)^{-1} \sum_{k=1}^{L} A_k^T W_k b_k. \qquad (63)$$
Proof.
Consider the function
$$J_{MIX}(x) = \frac{\sum_{k=1}^{L} (b_k - A_k x)^T W_k (b_k - A_k x)}{\sum_{k=1}^{L} \|W_k\|^2}, \qquad (64)$$
which can be rewritten as
$$J_{MIX}(x) = \alpha (\beta - A x)^T W (\beta - A x), \qquad (65)$$
where
$$A = \begin{bmatrix} A_1 \\ A_2 \\ \vdots \\ A_L \end{bmatrix}, \quad \beta = \begin{bmatrix} b_1 \\ b_2 \\ \vdots \\ b_L \end{bmatrix}, \quad W = \begin{bmatrix} W_1 & 0 & \cdots & 0 \\ 0 & W_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & W_L \end{bmatrix}, \quad \alpha = \frac{1}{\sum_{k=1}^{L} \|W_k\|^2}, \qquad (66)$$
and $W$ is a block-diagonal symmetric positive-definite matrix with blocks $W_k$.
To find the optimal solution $\hat{x}$, the function $J_{MIX}(x)$ in (65) must be differentiated and set equal to zero:
$$\frac{\partial}{\partial x} \left[ \alpha (\beta - A x)^T W (\beta - A x) \right] = 0 \implies \alpha \frac{\partial}{\partial x} \left[ (\beta - A x)^T W (\beta - A x) \right] = 0. \qquad (67)$$
On the basis of Theorem 2, the solution of the derivative is given by
$$\alpha \left( x^T A^T W A - \beta^T W A \right) = 0. \qquad (68)$$
Therefore,
$$\hat{x} = (A^T W A)^{-1} A^T W \beta. \qquad (69)$$
Through Matrices (66), the solution can be rewritten as
$$\hat{x} = \left( \sum_{k=1}^{L} A_k^T W_k A_k \right)^{-1} \sum_{k=1}^{L} A_k^T W_k b_k. \qquad (70)$$
The minimal cost $J_{MIX}(\hat{x})$ can be expressed as
$$J_{MIX}(\hat{x}) = \beta^T W \beta - \beta^T W A \hat{x} - \hat{x}^T A^T W \beta + \hat{x}^T A^T W A \hat{x}. \qquad (71)$$
Replacing (69) into (71), the result is
$$J_{MIX}(\hat{x}) = \beta^T W \beta - \beta^T W A (A^T W A)^{-1} A^T W \beta. \qquad (72)$$
Replacing (66) into (72), the optimal cost can be rewritten as
$$J_{MIX}(\hat{x}) = \sum_{k=1}^{L} b_k^T W_k b_k - \sum_{k=1}^{L} b_k^T W_k A_k \left( \sum_{k=1}^{L} A_k^T W_k A_k \right)^{-1} \sum_{k=1}^{L} A_k^T W_k b_k. \qquad (73)$$
□
Remark 6. 
The optimal solution of the LSM-DF via a mixture operator reduces to the LSM-DF in [4].

4. Illustrative Example

In this section, we present datasets artificially created by the authors in order to illustrate, from a mathematical point of view, the behavior and effectiveness of the proposed methods for finding the best-fitting curve to a given set of points, as well as the relationships between them. Table 1 shows two simulated datasets about income and consumption.
First, the LSM was separately applied to the datasets, and the following results were found:
$$\hat{y}_1 = 0.49 x_1 + 52.69, \qquad (74)$$
$$\hat{y}_2 = 0.49 x_2 + 53.65. \qquad (75)$$
The MSEs between $\hat{y}_1$ and $y_1$ and between $\hat{y}_2$ and $y_2$ were 211.52 and 221.67, respectively; Model (74) was more accurate than Model (75).
Second, the LSM-DF via OWA, Choquet integral, and mixture operators was applied to the two datasets. The following weighting matrices were used in the simulation: $W_1 = 0.7\,\mathrm{diag}(10)$ and $W_2 = 0.3\,\mathrm{diag}(10)$; that is, more weight was given to $W_1$ than to $W_2$. The following results were found:
$$\hat{y}_O = 0.49 x + 53.34, \qquad (76)$$
$$\hat{y}_C = 0.49 x + 52.65, \qquad (77)$$
$$\hat{y}_M = 0.49 x + 52.95. \qquad (78)$$
The MSEs between $\hat{y}_O$, $\hat{y}_C$, and $\hat{y}_M$ and $y_1$ were 211.18, 211.57, and 211.28, respectively. The MSEs between $\hat{y}_O$, $\hat{y}_C$, and $\hat{y}_M$ and $y_2$ were 222.14, 223.87, and 223.00, respectively. Table 2 and Table 3 compare the samples with regard to $x_1$ and $x_2$, respectively, for Equations (76)–(78). Table 4 compares the samples of $y_1$ to the samples generated by Equations (74) and (76)–(78). Table 5 compares the samples of $y_2$ with the samples generated by Equations (75)–(78).
The MSE values show that Models (76)–(78) were more accurate than Model (74); that is, the LSM-DF via OWA, Choquet integral, and mixture operators outperformed the LSM.
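The fits above can be reproduced along the following lines (our sketch; we read $\mathrm{diag}(10)$ as the $10 \times 10$ identity, which is an assumption on our part). For example, the mixture fit (78) solves the fused normal equations with $W_1 = 0.7 I$ and $W_2 = 0.3 I$:
```python
import numpy as np

# Table 1 data: income x and consumption y for the two sources
x1 = np.array([139, 126, 90, 144, 163, 136, 61, 62, 41, 120.0])
y1 = np.array([122, 114, 86, 134, 146, 107, 68, 117, 71, 98.0])
x2 = np.array([140, 129, 92, 145, 163, 138, 64, 63, 43, 122.0])
y2 = np.array([123, 117, 89, 136, 147, 109, 68, 119, 73, 100.0])

A1 = np.column_stack([x1, np.ones_like(x1)])
A2 = np.column_stack([x2, np.ones_like(x2)])
W1, W2 = 0.7 * np.eye(10), 0.3 * np.eye(10)   # assumed reading of diag(10)

# Fused normal equations (Theorem 4 / Equation (62))
M = A1.T @ W1 @ A1 + A2.T @ W2 @ A2
r = A1.T @ W1 @ y1 + A2.T @ W2 @ y2
slope, intercept = np.linalg.solve(M, r)
print(round(slope, 2), round(intercept, 2))   # approximately 0.49 52.95
```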

5. Conclusions

In this paper, the LSM-DF was studied through aggregation operators in order to explore different ways to aggregate data. More specifically, the LSM-DF via an OWA operator, the LSM-DF via a Choquet integral operator, and the LSM-DF via a mixture operator were defined. These operators were chosen due to their efficiency when applied to other methods in different areas of knowledge [12,13,22,24,26]. These new methods provide a theoretical framework with variations of the classical least squares, which may be more suitable in certain applications. For instance, the LSM-DF via OWA operator could be chosen for situations where one wants to place greater weight on the first data entries.
The main objective of developing these methods is to estimate an optimal parameter in situations involving more than one dataset, and to show how the estimate changes for different types of data. The methods were mathematically derived by applying aggregation operators of the average type to the optimization problem. The illustrative example was set up to demonstrate the mathematical behavior of these procedures through fitting curves, in comparison with an approach that does not incorporate aggregation operators in its formulation.
In future studies, we want to explore some applications that can show the advantages and disadvantages of each method, and set up LSM for other aggregation operators such as a weighted OWA (WOWA) operator and a Sugeno integral operator. Furthermore, these methods will be extended to models subject to parametric uncertainties.

Author Contributions

Conceptualization, G.Q.d.J. and E.S.P.; Methodology, G.Q.d.J. and E.S.P.; Formal analysis, G.Q.d.J. and E.S.P.; Investigation, G.Q.d.J. and E.S.P.; Writing—original draft, G.Q.d.J.; Writing—review & editing, E.S.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cheng, C.H.; Wang, J.W.; Wu, M.C. OWA-weighted based clustering method for classification problem. Expert Syst. Appl. 2009, 36, 4988–4995. [Google Scholar] [CrossRef]
  2. Milfont, T.; Mezzomo, I.; Bedregal, B.; Mansilla, E.; Bustince, H. Aggregation functions on n-dimensional ordered vectors equipped with an admissible order and an application in multi-criteria group decision-making. Int. J. Approx. Reason. 2021, 137, 34–50. [Google Scholar] [CrossRef]
  3. Flores-Sosa, M.; León-Castro, E.; Merigó, J.M.; Yager, R.R. Forecasting the exchange rate with multiple linear regression and heavy ordered weighted average operators. Eur. J. Oper. Res. 2022, 248, 108863. [Google Scholar]
  4. Sayed, A.H.; Al-Naffouri, T.Y.; Kailath, T. Robust Estimation for Uncertain Models in a Data Fusion Scenario. IFAC Proc. Vol. 2000, 33, 899–904. [Google Scholar] [CrossRef]
  5. Kailath, T.; Sayed, A.S.; Hassibi, B. Linear Estimation, 3rd ed.; Prentice Hall: Upper Saddle River, NJ, USA, 2000; 854p. [Google Scholar]
  6. Sayed, A.H.; Chandrasekaran, S. Parameter estimation with multiple sources and levels of uncertainties. IEEE Trans. Signal Process. 2000, 48, 680–692. [Google Scholar] [CrossRef]
  7. Lopes, C.G.; Sayed, A.H. Diffusion least-mean squares over adaptive networks: Formulation and performance analysis. IEEE Trans. Signal Process. 2008, 56, 3122–3136. [Google Scholar] [CrossRef]
  8. Cattivelli, F.; Lopes, C.G.; Sayed, A.H. Diffusion recursive least-squares for distributed estimation over adaptive networks. IEEE Trans. Signal Process. 2008, 56, 1865–1877. [Google Scholar] [CrossRef]
  9. Takahashi, N.; Yamada, I.; Sayed, A.H. Diffusion least-mean squares with adaptive combiners: Formulation and performance analysis. IEEE Trans. Signal Process. 2010, 58, 4795–4810. [Google Scholar] [CrossRef]
  10. Choquet, G. Theory of capacities. Ann. de l'Institut Fourier 1953, 5, 131–295. [Google Scholar] [CrossRef]
  11. Give’on, Y. Lattice matrices. Inf. Control 1964, 7, 477–484. [Google Scholar] [CrossRef]
  12. Yager, R.R. Ordered weighted averaging aggregation operators in multicriteria decision making. IEEE Trans. Syst. Man Cybern. 1988, 18, 183–190. [Google Scholar] [CrossRef]
  13. Zhou, S.M.; Chiclana, F.; John, R.I.; Garibald, J.M. Type-1 OWA operators for aggregating uncertain information with uncertain weights induced by type-2 linguistic quantifiers. Fuzzy Sets Syst. 2008, 159, 3281–3296. [Google Scholar] [CrossRef]
  14. Paternain, D.; Fernandez, J.; Bustince, H.; Mesiar, R.; Beliakov, G. Construction of image reduction operators using averaging aggregation functions. Fuzzy Sets Syst. 2015, 261, 87–111. [Google Scholar] [CrossRef]
  15. Beliakov, G.; Bustince, H.; Calvo, T. A Practical Guide to Averaging Functions (Studies in Fuzziness and Soft Computing); Springer: Berlin/Heidelberg, Germany, 2016; Volume 329. [Google Scholar]
  16. Bedregal, B.; Bustince, H.; Palmeira, E.; Dimuro, G.; Fernandez, J. Generalized Interval-valued OWA operators with interval weights derived from interval-valued overlap functions. Int. J. Approx. Reason. 2017, 90, 1–16. [Google Scholar] [CrossRef]
  17. Joy, G. The Determinant and Rank of a Lattice Matrix. Glob. J. Pure Appl. Math. 2017, 13, 1745–1761. [Google Scholar]
  18. Dimuro, G.P.; Fernandez, J.; Bedregal, B.; Mesiar, R.; Sanz, J.A.; Lucca, G.; Bustince, H. The state-of-art of the generalization of the Choquet integral: From aggregation and pre-aggregation to ordered directionally monotone functions. Inf. Fusion 2020, 57, 27–43. [Google Scholar] [CrossRef]
  19. Asmus, T.; Dimuro, G.; Bedregal, B.; Sanz, J.A.; Fernandez, J.; Rodriguez-Martinez, J.; Mesiar, R.; Bustince, H. A constructive framework to define fusion functions with floating domains in arbitrary closed real intervals. Inf. Sci. 2022, 601, 800–829. [Google Scholar] [CrossRef]
  20. Flores-Sosa, M.; Avilés-Ochoa, E.; Merigó, J.M.; Yager, R.R. Volatility GARCH models with the ordered weighted average (OWA) operators. Inf. Sci. 2021, 565, 46–61. [Google Scholar] [CrossRef]
  21. Medina, J.; Yager, R.R. OWA operators with functional weights. Fuzzy Sets Syst. 2021, 414, 38–56. [Google Scholar] [CrossRef]
  22. Flores-Sosa, M.; Avilés-Ochoa, E.; Merigó, J.M.; Kacprzyk, J. The OWA operator in multiple linear regression. Appl. Soft Comput. 2022, 124, 108985. [Google Scholar] [CrossRef]
  23. Llamazares, B. Constructing Choquet integral-based operators that generalize weighted means and OWA operators. Inf. Fusion 2022, 23, 131–138. [Google Scholar] [CrossRef]
  24. Jia, X.; Wang, Y. Choquet integral-based intuitionistic fuzzy arithmetic aggregation operators in multi-criteria decision-making. Expert Syst. Appl. 2022, 191, 116242. [Google Scholar] [CrossRef]
  25. Pereira, R.A.M.; Ribeiro, R.A. Aggregation with generalized mixture operators using weighting functions. Fuzzy Sets Syst. 2003, 137, 43–58. [Google Scholar] [CrossRef]
  26. Ribeiro, R.A.; Pereira, R.A.M. Generalized mixture operators using weighting functions: A comparative study with WA and OWA. Eur. J. Oper. Res. 2003, 145, 329–342. [Google Scholar] [CrossRef]
  27. Santana, F.; Bedregal, B.; Viana, P.; Bustince, H. On admissible orders over closed subintervals of [0, 1]. Fuzzy Sets Syst. 2021, 399, 44–54. [Google Scholar] [CrossRef]
Table 1. Simulated datasets about income and consumption.

Income ($x_1$) | Consumption ($y_1$) | Income ($x_2$) | Consumption ($y_2$)
139 | 122 | 140 | 123
126 | 114 | 129 | 117
90 | 86 | 92 | 89
144 | 134 | 145 | 136
163 | 146 | 163 | 147
136 | 107 | 138 | 109
61 | 68 | 64 | 68
62 | 117 | 63 | 119
41 | 71 | 43 | 73
120 | 98 | 122 | 100
Table 2. Sample with regard to $x_1$ of Equations (76)–(78).

Income ($x_1$) | Consumption ($\hat{y}_O$) | Consumption ($\hat{y}_C$) | Consumption ($\hat{y}_M$)
139 | 121.45 | 120.76 | 121.06
126 | 115.08 | 114.39 | 114.69
90 | 97.44 | 96.75 | 97.05
144 | 123.90 | 123.21 | 123.51
163 | 133.21 | 132.52 | 132.82
136 | 119.98 | 119.29 | 119.59
61 | 83.23 | 82.54 | 82.84
62 | 83.72 | 83.03 | 83.33
41 | 73.43 | 72.74 | 73.04
120 | 112.14 | 111.45 | 111.75
Table 3. Sample with regard to $x_2$ of Equations (76)–(78).

Income ($x_2$) | Consumption ($\hat{y}_O$) | Consumption ($\hat{y}_C$) | Consumption ($\hat{y}_M$)
140 | 121.94 | 121.25 | 121.55
129 | 116.55 | 115.86 | 116.16
92 | 98.42 | 97.73 | 98.03
145 | 124.39 | 123.70 | 124.00
163 | 133.21 | 132.52 | 132.82
138 | 120.96 | 120.27 | 120.57
64 | 84.70 | 84.01 | 84.31
63 | 84.21 | 83.52 | 83.82
43 | 74.41 | 73.72 | 74.02
122 | 113.12 | 112.43 | 112.73
Table 4. Sample of $y_1$ and samples generated with Equations (74) and (76)–(78).

$y_1$ | $\hat{y}_1$ | $\hat{y}_O$ | $\hat{y}_C$ | $\hat{y}_M$
122 | 120.80 | 121.45 | 120.76 | 121.06
114 | 114.43 | 115.08 | 114.39 | 114.69
86 | 96.79 | 97.44 | 96.75 | 97.05
134 | 123.25 | 123.90 | 123.21 | 123.51
146 | 132.56 | 133.21 | 132.52 | 132.82
107 | 119.33 | 119.98 | 119.29 | 119.59
68 | 82.58 | 83.23 | 82.54 | 82.84
117 | 83.07 | 83.72 | 83.03 | 83.33
71 | 72.78 | 73.43 | 72.74 | 73.04
98 | 111.49 | 112.14 | 111.45 | 111.75
Table 5. Sample of $y_2$ and the samples generated by Equations (75)–(78).

$y_2$ | $\hat{y}_2$ | $\hat{y}_O$ | $\hat{y}_C$ | $\hat{y}_M$
123 | 122.25 | 121.94 | 121.25 | 121.55
117 | 116.86 | 116.55 | 115.86 | 116.16
89 | 98.73 | 98.42 | 97.73 | 98.03
136 | 124.70 | 124.39 | 123.70 | 124.00
147 | 133.52 | 133.21 | 132.52 | 132.82
109 | 121.27 | 120.96 | 120.27 | 120.57
68 | 85.01 | 84.70 | 84.01 | 84.31
119 | 84.52 | 84.21 | 83.52 | 83.82
73 | 74.72 | 74.41 | 73.72 | 74.02
100 | 113.43 | 113.12 | 112.43 | 112.73
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
