# Measure of Similarity between GMMs Based on Autoencoder-Generated Gaussian Component Representations

## Abstract

## 1. Introduction

## 2. Materials and Methods

#### 2.1. Baseline GMM Similarity Measures

#### 2.2. GMM Similarity Measures Based on Autoencoder-Generated Representations

#### 2.2.1. Autoencoder Architectures

#### 2.2.2. Ground Distances Regularization

#### 2.2.3. Forming the Feature Map Regularizer ${\mathcal{R}}_{FMR}(F,G)$

#### 2.2.4. Forming the GMM Similarity Measure Based on Autoencoder-Generated Representations

#### 2.3. Computational Complexity

## 3. Experimental Results

#### 3.1. Network Architectures

#### 3.2. Performances

## 4. Conclusions

## Author Contributions

## Funding

## Data Availability Statement

## Acknowledgments

## Conflicts of Interest

## Abbreviations

| Abbreviation | Definition |
|---|---|
| AI | Artificial Intelligence |
| CNN | Convolutional Neural Network |
| DNN | Deep Neural Network |
| DPLM | Distance Preservation to the Local Mean |
| EM | Expectation Maximization |
| EMD | Earth Mover’s Distance |
| FMR | Feature Map Regularized |
| GMM | Gaussian Mixture Model |
| KL | Kullback–Leibler |
| KNN | K-Nearest Neighbors |
| KTH | Royal Institute of Technology in Stockholm |
| LP | Linear Programming |
| MB | Matching-Based |
| MC | Monte Carlo |
| ML | Machine Learning |
| MLE | Maximum Likelihood Estimate |
| MSE | Mean Squared Error |
| PDF | Probability Density Function |
| SPD | Symmetric Positive Definite |
| TIPS | Textures Under Varying Illumination, Pose and Scale |
| UC | Unscented Transform |
| UIUC | University of Illinois Urbana-Champaign |
| UMD | University of Maryland |
| VB | Variational Bound |
| VAE | Variational AutoEncoder |
| WA | Weighted Average |

**Figure 1.** Illustration of the proposed fully connected autoencoder architecture for the low-dimensional embedding of Gaussian components, represented by $vect\left({P}_{i}\right)$, into $h\in {\mathbb{R}}^{l}$. Colors indicate the symmetric architecture of the network and the goal of learning a unique embedding of the original Gaussian components (the middle vector $h$ is shown in red).

**Figure 2.** Illustration of the proposed FMR regularized CNN autoencoder for the low-dimensional embedding of Gaussian components represented by SPD matrices ${P}_{i}\in {\mathrm{Sym}}_{++}(d+1)$ into $h\in {\mathbb{R}}^{l}$.
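Both captions refer to a Gaussian component being represented by an SPD matrix ${P}_{i}\in {\mathrm{Sym}}_{++}(d+1)$ and by its vectorization $vect\left({P}_{i}\right)$, which is what the autoencoders take as input. The following is a minimal sketch of how such an input could be formed. It rests on assumptions that this excerpt does not confirm: the block form $\left[\begin{smallmatrix}\Sigma +\mu {\mu}^{T} & \mu \\ {\mu}^{T} & 1\end{smallmatrix}\right]$ is one commonly used embedding of a Gaussian $\mathcal{N}(\mu ,\Sigma)$ into ${\mathrm{Sym}}_{++}(d+1)$ (some formulations also apply a determinant-based scaling, omitted here), and $vect$ is taken to mean stacking the upper-triangular entries. The function names `embed_gaussian` and `vect_upper` are hypothetical.

```python
from typing import List

Matrix = List[List[float]]


def embed_gaussian(mu: List[float], sigma: Matrix) -> Matrix:
    """Assumed embedding: the (d+1) x (d+1) block matrix
    [[sigma + mu mu^T, mu], [mu^T, 1]].

    It is SPD whenever sigma is SPD, because the Schur complement of the
    bottom-right entry 1 is exactly sigma.
    """
    d = len(mu)
    if len(sigma) != d or any(len(row) != d for row in sigma):
        raise ValueError("sigma must be a d x d matrix matching len(mu)")
    # Top d rows: (sigma + mu mu^T) followed by the column mu.
    p = [[sigma[i][j] + mu[i] * mu[j] for j in range(d)] + [mu[i]]
         for i in range(d)]
    # Bottom row: mu^T followed by 1.
    p.append(list(mu) + [1.0])
    return p


def vect_upper(p: Matrix) -> List[float]:
    """Assumed vectorization: upper-triangular entries, row by row.

    A symmetric n x n matrix has n (n + 1) / 2 distinct entries.
    """
    n = len(p)
    return [p[i][j] for i in range(n) for j in range(i, n)]


# Toy component with d = 2 (illustrative values, not from the paper).
mu = [1.0, 2.0]
sigma = [[1.0, 0.0], [0.0, 1.0]]
P = embed_gaussian(mu, sigma)
v = vect_upper(P)
print(P)
print(v)
```

Under these assumptions a $d$-dimensional component yields an input vector of length $(d+1)(d+2)/2$, which the encoder would then compress to $h\in {\mathbb{R}}^{l}$.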

**Table 1.** Recognition accuracies for the proposed GMM-Autoenc-based measures compared with KL-based and DPLM-based GMM similarity measures on the UIUC database.

| GMM Sim. Meas. | Accuracy, $\mathit{m}$ = 1 | | Accuracy, $\mathit{m}$ = 5 | | Accuracy, $\mathit{m}$ = 10 | |
|---|---|---|---|---|---|---|
| $K{L}_{MB}$ | 0.82 | | 0.80 | | 0.80 | |
| $K{L}_{WA}$ | 0.82 | | 0.82 | | 0.82 | |
| $K{L}_{VB}$ | 0.82 | | 0.82 | | 0.82 | |
| | $\tilde{\mathit{l}}$ = 5 | $\tilde{\mathit{l}}$ = 7 | $\tilde{\mathit{l}}$ = 5 | $\tilde{\mathit{l}}$ = 7 | $\tilde{\mathit{l}}$ = 5 | $\tilde{\mathit{l}}$ = 7 |
| $uDPL{M}_{MB}$ | 0.72 | 0.81 | 0.73 | 0.74 | 0.79 | 0.79 |
| $uDPL{M}_{WA}$ | 0.72 | 0.81 | 0.73 | 0.74 | 0.80 | 0.80 |
| $uDPL{M}_{VB}$ | 0.72 | 0.81 | 0.73 | 0.74 | 0.80 | 0.80 |
| | $\mathit{l}$ = 20 | $\mathit{l}$ = 30 | $\mathit{l}$ = 20 | $\mathit{l}$ = 30 | $\mathit{l}$ = 20 | $\mathit{l}$ = 30 |
| GMM-Autoenc${}_{1,AE}$ | 0.75 | 0.81 | 0.75 | 0.76 | 0.80 | 0.80 |
| GMM-Autoenc${}_{2,AE}$ | 0.75 | 0.81 | 0.76 | 0.76 | 0.80 | 0.80 |
| GMM-Autoenc${}_{3,AE}$ | 0.76 | 0.80 | 0.76 | 0.77 | 0.81 | 0.81 |
| GMM-Autoenc${}_{1,AECNN}$ | 0.75 | 0.80 | 0.73 | 0.77 | 0.81 | 0.80 |
| GMM-Autoenc${}_{2,AECNN}$ | 0.76 | 0.81 | 0.71 | 0.78 | 0.81 | 0.81 |
| GMM-Autoenc${}_{3,AECNN}$ | 0.77 | 0.81 | 0.71 | 0.79 | 0.81 | 0.80 |

**Table 2.** Recognition accuracies for the proposed GMM-Autoenc-based measures compared with KL-based and DPLM-based GMM similarity measures on the KTH-TIPS database.

| GMM Sim. Meas. | Accuracy, $\mathit{m}$ = 1 | | Accuracy, $\mathit{m}$ = 5 | | Accuracy, $\mathit{m}$ = 10 | |
|---|---|---|---|---|---|---|
| $K{L}_{MB}$ | 0.78 | | 0.74 | | 0.75 | |
| $K{L}_{WA}$ | 0.78 | | 0.78 | | 0.78 | |
| $K{L}_{VB}$ | 0.78 | | 0.78 | | 0.78 | |
| | $\tilde{\mathit{l}}$ = 5 | $\tilde{\mathit{l}}$ = 7 | $\tilde{\mathit{l}}$ = 5 | $\tilde{\mathit{l}}$ = 7 | $\tilde{\mathit{l}}$ = 5 | $\tilde{\mathit{l}}$ = 7 |
| $uDPL{M}_{MB}$ | 0.57 | 0.73 | 0.69 | 0.71 | 0.63 | 0.72 |
| $uDPL{M}_{WA}$ | 0.57 | 0.73 | 0.72 | 0.75 | 0.64 | 0.75 |
| $uDPL{M}_{VB}$ | 0.57 | 0.73 | 0.72 | 0.75 | 0.63 | 0.75 |
| | $\mathit{l}$ = 20 | $\mathit{l}$ = 30 | $\mathit{l}$ = 20 | $\mathit{l}$ = 30 | $\mathit{l}$ = 20 | $\mathit{l}$ = 30 |
| GMM-Autoenc${}_{1,AE}$ | 0.71 | 0.75 | 0.73 | 0.75 | 0.72 | 0.74 |
| GMM-Autoenc${}_{2,AE}$ | 0.71 | 0.74 | 0.72 | 0.74 | 0.72 | 0.75 |
| GMM-Autoenc${}_{3,AE}$ | 0.72 | 0.75 | 0.73 | 0.75 | 0.73 | 0.76 |
| GMM-Autoenc${}_{1,AECNN}$ | 0.72 | 0.75 | 0.73 | 0.74 | 0.73 | 0.73 |
| GMM-Autoenc${}_{2,AECNN}$ | 0.73 | 0.75 | 0.71 | 0.75 | 0.74 | 0.76 |
| GMM-Autoenc${}_{3,AECNN}$ | 0.73 | 0.76 | 0.72 | 0.77 | 0.74 | 0.77 |

**Table 3.** Recognition accuracies for the proposed GMM-Autoenc-based measures compared with KL-based and DPLM-based GMM similarity measures on the UMD database.

| GMM Sim. Meas. | Accuracy, $\mathit{m}$ = 1 | | Accuracy, $\mathit{m}$ = 5 | | Accuracy, $\mathit{m}$ = 10 | |
|---|---|---|---|---|---|---|
| $K{L}_{MB}$ | 0.75 | | 0.73 | | 0.72 | |
| $K{L}_{WA}$ | 0.75 | | 0.75 | | 0.75 | |
| $K{L}_{VB}$ | 0.75 | | 0.75 | | 0.75 | |
| | $\tilde{\mathit{l}}$ = 5 | $\tilde{\mathit{l}}$ = 7 | $\tilde{\mathit{l}}$ = 5 | $\tilde{\mathit{l}}$ = 7 | $\tilde{\mathit{l}}$ = 5 | $\tilde{\mathit{l}}$ = 7 |
| $uDPL{M}_{MB}$ | 0.73 | 0.74 | 0.72 | 0.72 | 0.70 | 0.72 |
| $uDPL{M}_{WA}$ | 0.73 | 0.74 | 0.73 | 0.74 | 0.71 | 0.75 |
| $uDPL{M}_{VB}$ | 0.73 | 0.74 | 0.73 | 0.74 | 0.71 | 0.75 |
| | $\mathit{l}$ = 20 | $\mathit{l}$ = 30 | $\mathit{l}$ = 20 | $\mathit{l}$ = 30 | $\mathit{l}$ = 20 | $\mathit{l}$ = 30 |
| GMM-Autoenc${}_{1,AE}$ | 0.74 | 0.74 | 0.73 | 0.73 | 0.71 | 0.72 |
| GMM-Autoenc${}_{2,AE}$ | 0.73 | 0.74 | 0.73 | 0.74 | 0.73 | 0.74 |
| GMM-Autoenc${}_{3,AE}$ | 0.74 | 0.75 | 0.74 | 0.75 | 0.73 | 0.75 |
| GMM-Autoenc${}_{1,AECNN}$ | 0.74 | 0.75 | 0.73 | 0.73 | 0.72 | 0.72 |
| GMM-Autoenc${}_{2,AECNN}$ | 0.75 | 0.75 | 0.74 | 0.75 | 0.74 | 0.74 |
| GMM-Autoenc${}_{3,AECNN}$ | 0.74 | 0.75 | 0.74 | 0.75 | 0.74 | 0.75 |

