Article

An Information Theoretic Approach to Privacy-Preserving Interpretable and Transferable Learning

1 Faculty of Computer Science and Electrical Engineering, University of Rostock, 18051 Rostock, Germany
2 Software Competence Center Hagenberg GmbH, A-4232 Hagenberg, Austria
3 Institute of Signal Processing, Johannes Kepler University Linz, 4040 Linz, Austria
* Author to whom correspondence should be addressed.
Algorithms 2023, 16(9), 450; https://doi.org/10.3390/a16090450
Submission received: 7 June 2023 / Revised: 30 August 2023 / Accepted: 8 September 2023 / Published: 20 September 2023
(This article belongs to the Special Issue Deep Learning Techniques for Computer Security Problems)

Abstract: In order to develop machine learning and deep learning models that take into account the guidelines and principles of trustworthy AI, a novel information theoretic approach is introduced in this article. A unified approach to privacy-preserving interpretable and transferable learning is considered for studying and optimizing the trade-offs between the privacy, interpretability, and transferability aspects of trustworthy AI. A variational membership-mapping Bayesian model is used for the analytical approximation of the defined information theoretic measures for privacy leakage, interpretability, and transferability. The approach consists of approximating the information theoretic measures by maximizing a lower bound using variational optimization. The approach is demonstrated through numerous experiments on benchmark datasets and a real-world biomedical application concerned with the detection of mental stress in individuals using heart rate variability analysis.

1. Introduction

Trust in the development, deployment, and use of AI is essential in order to fully utilize the potential of AI to contribute to human well-being and society. Recent advances in machine and deep learning have rejuvenated the field of AI, raising expectations that AI could become an integral part of human life. However, a rapid proliferation of AI also gives rise to several ethical, legal, and social issues.

1.1. Trustworthy AI

In response to the ethical, legal, and social challenges that accompany AI, guidelines and ethical principles have been established [1,2,3,4] in order to evaluate the responsible development of AI systems that are good for humanity and the environment. These guidelines have introduced the concept of trustworthy AI (TAI), and the term TAI has quickly gained attention in research and practice. TAI is based on the idea that trust in AI will allow AI to realize its full potential in contributing to societies, economies, and sustainable development. As "trust" is a complex phenomenon studied in diverse disciplines (e.g., psychology, sociology, economics, management, computer science, and information systems), the definition and realization of TAI remain challenging. When forming trust in a technology, users form expectations about the technology's functionality, helpfulness, and reliability [5]. The authors in [6] state that "AI is perceived as trustworthy by its users (e.g., consumers, organizations, and society) when it is developed, deployed, and used in ways that not only ensure its compliance with all relevant laws and its robustness but especially its adherence to general ethical principles".
In recent years, academics, industry, and policymakers have developed several frameworks and guidelines for TAI, including the "Asilomar AI Principles" [7], "Montreal Declaration of Responsible AI" [8], "UK AI Code" [9], "AI4People" [4], "Ethics Guidelines for Trustworthy AI" [1], "OECD Principles on AI" [10], "Governance Principles for the New Generation Artificial Intelligence" [11], and "Guidance for Regulation of Artificial Intelligence Applications" [12]. However, it was argued in [13] that AI ethics lack a reinforcement mechanism, and economic incentives could easily override commitment to ethical principles and values.
The five principles of ethical AI [4] (i.e., beneficence, non-maleficence, autonomy, justice, and explicability) have been adopted for TAI [6]. Beneficence refers to promoting the well-being of humans, preserving dignity, and sustaining the planet. Non-maleficence refers to avoiding bringing harm to people and is especially concerned with the protection of people's privacy and security. Autonomy refers to the promotion of human autonomy, agency, and oversight, including the restriction of AI systems' autonomy where necessary. Justice refers to using AI for correcting past wrongs, ensuring shared benefits through AI, and preventing the creation of new harms and inequities by AI. Explicability has both an epistemological and an ethical sense. In the epistemological sense, explicability refers to explainable AI, i.e., the creation of interpretable AI models with high levels of performance and accuracy. In the ethical sense, explicability refers to accountable AI.

1.2. Motivation and Novelty

The core issues related to machine and deep learning that need to be addressed in order to fulfill the five principles of trustworthy AI are listed in Table 1.
Solution approaches to the issues concerning TAI are identified in Table 1; however, a unified solution approach addressing all major issues does not exist. Despite the importance of the outlined TAI principles, their major limitation, as identified in [6], is that they are highly general and provide little to no guidance on how they can be transferred into practice. To address this limitation, a data-driven research framework for TAI was outlined in [6]. However, to the best of the authors' knowledge, no previous study has presented a unified information theoretic approach for studying the privacy, interpretability, and transferability aspects of trustworthy AI in a rigorous analytical manner. This motivated us to develop such an approach. This study introduces a unified information theoretic approach to "privacy-preserving interpretable and transferable learning", as represented in Figure 1, for addressing trustworthy AI issues, which constitutes the novelty of this study.

1.3. Goal and Aims

Our goal is to develop a novel approach to trustworthy AI based on the hypothesis that information theory enables the privacy, interpretability, and transferability aspects of trustworthy AI principles to be taken into account during the development of machine learning and deep learning models, by providing a way to study and optimize the inherent trade-offs. The development of our approach focuses on the following aims:
Aim 1:
To develop an information theoretic approach to privacy that enables the quantification of privacy leakage in terms of the mutual information between sensitive private data and the data released to the public without the availability of prior knowledge about data statistics (such as joint distributions of public and private variables).
Aim 2:
To develop an information theoretic criterion for evaluating the interpretability of a machine learning model in terms of the mutual information between non-interpretable model outputs/activations and corresponding interpretable parameters.
Aim 3:
To develop an information theoretic criterion for evaluating the transferability (of a machine learning model from source to target domain) in terms of the mutual information between source domain model outputs/activations and target domain model outputs/activations.
Aim 4:
To develop analytical approaches to machine and deep learning allowing for the quantification of model uncertainties.
Aim 5:
To develop a unified approach to “privacy-preserving interpretable and transferable learning” for an analytical optimization of privacy–interpretability–transferability trade-offs.

1.4. Methodology

Figure 2 outlines the methodological workflow. For an information theoretic evaluation of the privacy leakage, interpretability, and transferability, we provide a novel method that consists of the following three steps:

1.4.1. Defining Measures in Terms of the Information Leakages

The privacy, interpretability, and transferability measures are defined in terms of the information leakages:
  • Privacy leakage is measured as the amount of information about private/sensitive variables leaked by the shared variables;
  • Interpretability is measured as the amount of information about interpretable parameters leaked by the model;
  • Transferability is measured as the amount of information about the source domain model output leaked by the target domain model output.

1.4.2. Variational Membership Mapping Bayesian Models

In order to derive analytical expressions for the defined privacy leakage, interpretability, and transferability measures, stochastic inverse models (governing the relationships among the variables) are required. In this study, variational membership mappings are leveraged to build the required stochastic inverse models. Membership mappings [14,15] have been introduced as an alternative to deep neural networks in order to address issues such as determining the optimal model structure, learning from smaller training datasets, and the iterative, time-consuming nature of numerical learning algorithms [16,17,18,19,20,21,22]. A membership mapping represents data through a fuzzy set (characterized by a membership function whose dimension increases with increasing data size). A remarkable feature of membership mappings is that they admit an analytical approach to the variational learning of a membership-mappings-based data representation model. Our idea is to employ membership mappings for defining a stochastic inverse model, which is then inferred using the variational Bayesian methodology.

1.4.3. Variational Approximation of Information Theoretic Measures

The variational membership-mapping Bayesian models are used to determine the lower bounds on the defined information theoretic measures for privacy leakage, interpretability, and transferability. The lower bounds are then maximized using variational optimization methodology to derive analytically the expressions that approximate the privacy leakage, interpretability, and transferability measures. The analytically derived expressions form the basis of an algorithm that practically computes the measures using available data samples, where expectations over unknown distributions are approximated by sample averages.
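As a simple illustration of the final computational step (replacing expectations over unknown distributions by sample averages), the following Python sketch, which is not part of the authors' MATLAB implementation, approximates the differential entropy of a Gaussian variable by averaging over simulated samples and compares it with the closed form; the Gaussian setting and variable names are illustrative assumptions.

```python
# Illustrative sketch (not the paper's implementation): approximating an
# expectation over a distribution by a sample average, here the
# differential entropy H(t) = E[-log p(t)] of a univariate Gaussian.
import numpy as np

rng = np.random.default_rng(0)
sigma2 = 4.0                       # assumed variance of t (illustrative)
t = rng.normal(0.0, np.sqrt(sigma2), size=100000)

# Monte Carlo estimate: average of -log p(t_i) over the drawn samples.
log_pdf = -0.5 * np.log(2 * np.pi * sigma2) - t**2 / (2 * sigma2)
H_sample_avg = -np.mean(log_pdf)

# Closed-form differential entropy of a Gaussian for comparison.
H_exact = 0.5 * np.log(2 * np.pi * np.e * sigma2)

print(f"sample-average estimate: {H_sample_avg:.4f}")
print(f"closed-form entropy:     {H_exact:.4f}")
```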

1.5. Contributions

The main contributions of this study are the following:

1.5.1. A Unified Approach to Study the Privacy, Interpretability, and Transferability Aspects of Trustworthy AI

The study introduces a novel information theoretic unified approach (as represented in Figure 1) to address the
  • Issues I1 and I2 of beneficence principle by means of transfer and federated learning;
  • Issues I3 and I4 of non-maleficence principle by means of privacy-preserving data release mechanisms;
  • Issue I5 of autonomy principle by means of analytical machine and deep learning algorithms that enable the user to quantify model uncertainties and hence to decide the level of autonomy given to AI systems;
  • Issue I6 of justice principle by means of federated learning;
  • Issue I7 of explicability principle by means of interpretable machine and deep learning models.

1.5.2. Information Theoretic Quantification of Privacy, Interpretability, and Transferability

The most important feature of our approach is that the notions of privacy, interpretability, and transferability are quantified by information theoretic measures, allowing the study and optimization of trade-offs (such as the trade-off between privacy and transferability, or between privacy and interpretability) in a practical manner.

1.5.3. Computation of Information Theoretic Measures without Requiring the Knowledge of Data Distributions

It is possible to derive analytical expressions for the defined measures provided that knowledge of the data distributions is available. However, in practice, the data distributions are unknown, and thus a way to approximate the defined measures is required. Therefore, a novel method that employs the recently introduced membership mappings [14,15,16,17,18,19,20,21,22] is presented for approximating the defined privacy leakage, interpretability, and transferability measures. The method relies on inferring a variational Bayesian model that facilitates an analytical approximation of the information theoretic measures through the variational optimization methodology. A computational algorithm is provided for practically calculating the privacy leakage, interpretability, and transferability measures. Finally, an algorithm is presented that provides
  • Information theoretic evaluation of privacy leakage, interpretability, and transferability in a semi-supervised transfer and multi-task learning scenario;
  • An adversary model for estimating private data and for simulating privacy attacks; and
  • An interpretability model for estimating interpretable parameters and for providing an interpretation to the non-interpretable data vectors.

1.6. Organization

The remainder of this article is organized as follows. The proposed methodology relies on membership mappings for data representation learning; therefore, Section 2 is dedicated to a review of membership mappings. An application of membership mappings to solve an inverse modeling problem by developing a variational membership-mapping Bayesian model is considered in Section 3. Section 4 presents the most important result of this study on the variational approximation of information leakage and the development of a computational algorithm for calculating information leakage. The measures for privacy leakage, interpretability, and transferability are formally introduced in Section 5, which further provides an algorithm to study the privacy, interpretability, and transferability aspects in a unified manner. The application of the proposed measures to study the trade-offs is demonstrated through experiments on the widely used MNIST and "Office+Caltech256" datasets in Section 6. Section 6 further considers a biomedical application concerned with the detection of mental stress in individuals using heart rate variability analysis. Finally, concluding remarks are provided in Section 7.

2. Mathematical Background

This section reviews the membership mappings and transferable deep learning from [14,15,22]. For a detailed mathematical study of the concepts used in this section, the readers are referred to previous works [14,15,22].

2.1. Notations

  • Let $n, N, p, M \in \mathbb{N}$.
  • Let $\mathcal{B}(\mathbb{R}^N)$ denote the Borel $\sigma$-algebra on $\mathbb{R}^N$, and let $\lambda^N$ denote the Lebesgue measure on $\mathcal{B}(\mathbb{R}^N)$.
  • Let $(\mathcal{X}, \mathcal{A}, \rho)$ be a probability space with unknown probability measure $\rho$.
  • Let us denote by $\mathcal{S}$ the set of finite samples of data points drawn i.i.d. from $\rho$, i.e.,
    $\mathcal{S} := \left\{ (x^i \sim \rho)_{i=1}^{N} \mid N \in \mathbb{N} \right\}$.
  • For a sequence $\mathbf{x} = (x^1, \ldots, x^N) \in \mathcal{S}$, let $|\mathbf{x}|$ denote the cardinality, i.e., $|\mathbf{x}| = N$.
  • If $\mathbf{x} = (x^1, \ldots, x^N), \mathbf{a} = (a^1, \ldots, a^M) \in \mathcal{S}$, then $\mathbf{x}\,\mathbf{a}$ denotes the concatenation of the sequences $\mathbf{x}$ and $\mathbf{a}$, i.e., $\mathbf{x}\,\mathbf{a} = (x^1, \ldots, x^N, a^1, \ldots, a^M)$.
  • Let us denote by $\mathcal{F}(\mathcal{X})$ the set of $\mathcal{A}$-$\mathcal{B}(\mathbb{R})$ measurable functions $f: \mathcal{X} \to \mathbb{R}$, i.e.,
    $\mathcal{F}(\mathcal{X}) := \left\{ f: \mathcal{X} \to \mathbb{R} \mid f \ \mathrm{is} \ \mathcal{A}\text{-}\mathcal{B}(\mathbb{R}) \ \mathrm{measurable} \right\}$.
  • For convenience, the values of a function $f \in \mathcal{F}(\mathcal{X})$ at the points in the collection $\mathbf{x} = (x^1, \ldots, x^N)$ are represented as $f(\mathbf{x}) = (f(x^1), \ldots, f(x^N))$.
  • Let $\zeta_{\mathbf{x}}: \mathbb{R}^{|\mathbf{x}|} \to [0, 1]$ be a membership function satisfying the following properties:
    Nowhere Vanishing:
    $\zeta_{\mathbf{x}}(\mathbf{y}) > 0$ for all $\mathbf{y} \in \mathbb{R}^{|\mathbf{x}|}$, i.e.,
    $\mathrm{supp}[\zeta_{\mathbf{x}}] = \mathbb{R}^{|\mathbf{x}|}$.
    Positive and Bounded Integrals:
    The functions $\zeta_{\mathbf{x}}$ are absolutely continuous and Lebesgue integrable over the whole domain such that, for all $\mathbf{x} \in \mathcal{S}$, we have
    $0 < \int_{\mathbb{R}^{|\mathbf{x}|}} \zeta_{\mathbf{x}}\, \mathrm{d}\lambda^{|\mathbf{x}|} < \infty$.
    Consistency of Induced Probability Measure:
    The membership-function-induced probability measures $\mathcal{P}_{\zeta_{\mathbf{x}}}$, defined on any $A \in \mathcal{B}(\mathbb{R}^{|\mathbf{x}|})$ as
    $\mathcal{P}_{\zeta_{\mathbf{x}}}(A) := \frac{1}{\int_{\mathbb{R}^{|\mathbf{x}|}} \zeta_{\mathbf{x}}\, \mathrm{d}\lambda^{|\mathbf{x}|}} \int_{A} \zeta_{\mathbf{x}}\, \mathrm{d}\lambda^{|\mathbf{x}|}$,
    are consistent in the sense that, for all $\mathbf{x}, \mathbf{a} \in \mathcal{S}$,
    $\mathcal{P}_{\zeta_{\mathbf{x}\,\mathbf{a}}}(A \times \mathbb{R}^{|\mathbf{a}|}) = \mathcal{P}_{\zeta_{\mathbf{x}}}(A)$.
    The collection of membership functions satisfying the aforementioned assumptions is denoted by
    $\Theta := \left\{ \zeta_{\mathbf{x}}: \mathbb{R}^{|\mathbf{x}|} \to [0, 1] \mid (3), (4), (6),\ \mathbf{x} \in \mathcal{S} \right\}$.

2.2. Review of Variational Membership Mappings

Definition 1
(Student-t Membership Mapping [14]). A Student-t membership mapping, $\mathcal{F} \in \mathcal{F}(\mathcal{X})$, is a mapping with input space $\mathcal{X} = \mathbb{R}^n$ and a membership function $\zeta_{\mathbf{x}} \in \Theta$ that is Student-t like:
$\zeta_{\mathbf{x}}(\mathbf{y}) = \left( 1 + \frac{1}{\nu - 2} (\mathbf{y} - \mathbf{m}_{\mathbf{y}})^T K_{\mathbf{x}\mathbf{x}}^{-1} (\mathbf{y} - \mathbf{m}_{\mathbf{y}}) \right)^{-\frac{\nu + |\mathbf{x}|}{2}}$
where $\mathbf{x} \in \mathcal{S}$, $\mathbf{y} \in \mathbb{R}^{|\mathbf{x}|}$, $\nu \in \mathbb{R}_{+} \setminus [0, 2]$ is the degrees of freedom, $\mathbf{m}_{\mathbf{y}} \in \mathbb{R}^{|\mathbf{x}|}$ is the mean vector, and $K_{\mathbf{x}\mathbf{x}} \in \mathbb{R}^{|\mathbf{x}| \times |\mathbf{x}|}$ is the covariance matrix with its $(i, j)$-th element given as
$(K_{\mathbf{x}\mathbf{x}})_{i,j} = kr(x^i, x^j)$
where $kr: \mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R}$ is a positive definite kernel function defined as
$kr(x^i, x^j) = \sigma^2 \exp\left( -0.5 \sum_{k=1}^{n} w_k \left| x_k^i - x_k^j \right|^2 \right)$
where $x_k^i$ is the $k$-th element of $x^i$, $\sigma^2$ is the variance parameter, and $w_k \geq 0$ (for $k \in \{1, \ldots, n\}$).
Given a dataset $\{(x^i, y^i) \mid x^i \in \mathbb{R}^n,\ y^i \in \mathbb{R}^p,\ i \in \{1, \ldots, N\}\}$, it is assumed that there exist zero-mean Student-t membership mappings $\mathcal{F}_1, \ldots, \mathcal{F}_p \in \mathcal{F}(\mathbb{R}^n)$ such that
$y^i \approx \left[ \mathcal{F}_1(x^i) \ \cdots \ \mathcal{F}_p(x^i) \right]^T$.
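To make Definition 1 concrete, the following Python sketch evaluates the kernel (10) and the resulting Student-t membership function for a small data sequence. It is an illustrative re-implementation of the stated formulas (with a zero mean vector assumed for simplicity), not the authors' code.

```python
# Illustrative sketch of Definition 1: the kernel (10) and the Student-t
# membership function. A zero mean vector m_y is assumed for simplicity.
import numpy as np

def kr(xi, xj, sigma2=1.0, w=None):
    """Kernel kr(x^i, x^j) = sigma^2 exp(-0.5 sum_k w_k |x_k^i - x_k^j|^2)."""
    w = np.ones_like(xi) if w is None else w
    return sigma2 * np.exp(-0.5 * np.sum(w * (xi - xj) ** 2))

def student_t_membership(y, X, nu=2.1, sigma2=1.0, w=None):
    """Membership value zeta_x(y) for y in R^{|x|}, given the data sequence X."""
    N = X.shape[0]
    K = np.array([[kr(X[i], X[j], sigma2, w) for j in range(N)] for i in range(N)])
    K += 1e-8 * np.eye(N)                      # numerical jitter
    quad = y @ np.linalg.solve(K, y)           # y^T K^{-1} y  (zero-mean case)
    return (1.0 + quad / (nu - 2.0)) ** (-(nu + N) / 2.0)

# Example usage on a toy sequence of N = 5 points in R^2.
rng = np.random.default_rng(1)
X = rng.normal(size=(5, 2))
y = rng.normal(size=5)
print("zeta_x(y) =", student_t_membership(y, X))
```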
Under modeling scenario (11), [22] presents an algorithm (stated as Algorithm 1) for the variational learning of membership mappings.
Algorithm 1 Variational learning of the membership mappings [22]
Require: Dataset $\{(x^i, y^i) \mid x^i \in \mathbb{R}^n,\ y^i \in \mathbb{R}^p,\ i \in \{1, \ldots, N\}\}$ and maximum possible number of auxiliary points $M_{max} \in \mathbb{Z}_+$ with $M_{max} \leq N$.
1: Choose $\nu$ and $w = (w_1, \ldots, w_n)$ as in (12) and (14), respectively.
2: Choose a small positive value $\kappa = 10^{-1}$.
3: Set iteration count $it = 0$ and $M|_0 = M_{max}$.
4: while $\tau(M|_{it}, 1) < \kappa$ do
5:     $M|_{it+1} = 0.9\, M|_{it}$
6:     $it \leftarrow it + 1$
7: end while
8: Set $M = M|_{it}$.
9: if $\tau(M, 1) \geq \frac{1}{p} \sum_{j=1}^{p} \mathrm{var}\left( y_j^1, \ldots, y_j^N \right)$ then
10:     $\sigma^2 = 1$
11: else
12:     $\sigma^2 = \frac{1}{\tau(M, 1)}\, \frac{1}{p} \sum_{j=1}^{p} \mathrm{var}\left( y_j^1, \ldots, y_j^N \right)$
13: end if
14: Compute $\mathbf{a} = \{a^m\}_{m=1}^{M}$ using (13), $K_{\mathbf{x}\mathbf{x}}$ using (9), $K_{\mathbf{a}\mathbf{a}}$ using (15), and $K_{\mathbf{x}\mathbf{a}}$ using (16).
15: Set $\beta = 1$.
16: repeat
17:     Compute $\alpha$ using (18).
18:     Update the value of $\beta$ using (19).
19: until ($\beta$ nearly converges)
20: Compute $\alpha$ using (18).
21: return the parameters set $\mathbb{M} = \{\alpha, \mathbf{a}, M, \sigma, w\}$.
With reference to Algorithm 1, we have the following:
  • The degrees of freedom associated to the Student-t membership mapping, $\nu \in \mathbb{R}_{+} \setminus [0, 2]$, is chosen as
    $\nu = 2.1$.
  • The auxiliary inducing points are suggested to be chosen as the cluster centroids:
    $\mathbf{a} = \{a^m\}_{m=1}^{M} = \mathrm{cluster\_centroid}\left( \{x^i\}_{i=1}^{N}, M \right)$
    where $\mathrm{cluster\_centroid}(\{x^i\}_{i=1}^{N}, M)$ represents k-means clustering on $\{x^i\}_{i=1}^{N}$ with $M$ clusters.
  • The parameters $(w_1, \ldots, w_n)$ for the kernel function (10) are chosen such that $w_k$ (for $k \in \{1, 2, \ldots, n\}$) is given as
    $w_k = \left( \max_{1 \leq i \leq N} x_k^i - \min_{1 \leq i \leq N} x_k^i \right)^{-2}$
    where $x_k^i$ is the $k$-th element of the vector $x^i \in \mathbb{R}^n$.
  • $K_{\mathbf{a}\mathbf{a}} \in \mathbb{R}^{M \times M}$ and $K_{\mathbf{x}\mathbf{a}} \in \mathbb{R}^{N \times M}$ are matrices with their $(i, j)$-th elements given as
    $(K_{\mathbf{a}\mathbf{a}})_{i,j} = kr(a^i, a^j)$
    $(K_{\mathbf{x}\mathbf{a}})_{i,j} = kr(x^i, a^j)$
    where $kr: \mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R}$ is the positive definite kernel function defined as in (10).
  • The scalar-valued function $\tau(M, \sigma^2)$ is defined as
    $\tau(M, \sigma^2) := \frac{\mathrm{Tr}(K_{\mathbf{x}\mathbf{x}}) - \mathrm{Tr}\left( (K_{\mathbf{a}\mathbf{a}})^{-1} K_{\mathbf{x}\mathbf{a}}^T K_{\mathbf{x}\mathbf{a}} \right)}{\nu + M - 2}$
    where $\mathbf{a}$ is given by (13), $\nu$ is given by (12), and the parameters $(w_1, \ldots, w_n)$ (which are required to evaluate the kernel function for computing the matrices $K_{\mathbf{x}\mathbf{x}}$, $K_{\mathbf{a}\mathbf{a}}$, and $K_{\mathbf{x}\mathbf{a}}$) are given by (14).
  • $\alpha = \left[ \alpha_1 \ \cdots \ \alpha_p \right] \in \mathbb{R}^{M \times p}$ is a matrix with its $j$-th column defined as
    $\alpha_j := \left( K_{\mathbf{x}\mathbf{a}}^T K_{\mathbf{x}\mathbf{a}} + \frac{\mathrm{Tr}(K_{\mathbf{x}\mathbf{x}}) - \mathrm{Tr}\left( (K_{\mathbf{a}\mathbf{a}})^{-1} K_{\mathbf{x}\mathbf{a}}^T K_{\mathbf{x}\mathbf{a}} \right)}{\nu + M - 2} K_{\mathbf{a}\mathbf{a}} + \beta^{-1} K_{\mathbf{a}\mathbf{a}} \right)^{-1} (K_{\mathbf{x}\mathbf{a}})^T y_j$.
  • The disturbance precision value $\beta$ is iteratively estimated as
    $\frac{1}{\beta} = \frac{1}{pN} \sum_{j=1}^{p} \sum_{i=1}^{N} \left| y_j^i - \widehat{\mathcal{F}_j(x^i)} \right|^2$
    where $\widehat{\mathcal{F}_j(x^i)}$ is the estimated membership-mapping output given as
    $\widehat{\mathcal{F}_j(x^i)} = G(x^i)\, \alpha_j$.
    Here, $G(x) \in \mathbb{R}^{1 \times M}$ is a vector-valued function defined as
    $G(x) := \left[ kr(x, a^1) \ \cdots \ kr(x, a^M) \right]$
    where $kr: \mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R}$ is defined as in (10).
Definition 2
(Membership-Mappings Prediction [22]). Given the parameters set $\mathbb{M} = \{\alpha, \mathbf{a}, M, \sigma, w\}$ returned by Algorithm 1, the learned membership mappings can be used to predict the output corresponding to any arbitrary input data point $x \in \mathbb{R}^n$ as
$\hat{y}(x; \mathbb{M}) = \alpha^T (G(x))^T$
where $G(\cdot) \in \mathbb{R}^{1 \times M}$ is the vector-valued function (21).
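A minimal sketch of the prediction step in Definition 2, assuming the parameters $\alpha$, $\mathbf{a}$, $\sigma^2$, and $w$ are already available (random placeholders are used here instead of the output of Algorithm 1):

```python
# Minimal sketch of membership-mappings prediction (Definition 2):
# y_hat(x) = alpha^T G(x)^T with G(x) = [kr(x, a^1), ..., kr(x, a^M)].
import numpy as np

def kr(x, a, sigma2, w):
    # Kernel (10).
    return sigma2 * np.exp(-0.5 * np.sum(w * (x - a) ** 2))

def G(x, A, sigma2, w):
    # Row vector G(x) in R^{1 x M} of kernel evaluations at the auxiliary points.
    return np.array([kr(x, a_m, sigma2, w) for a_m in A])

def predict(x, alpha, A, sigma2, w):
    # y_hat(x; M) = alpha^T G(x)^T, an output vector in R^p.
    return alpha.T @ G(x, A, sigma2, w)

# Placeholder parameters (in practice these come from Algorithm 1).
rng = np.random.default_rng(2)
n, p, M = 3, 2, 10
A = rng.normal(size=(M, n))        # auxiliary inducing points a^1, ..., a^M
alpha = rng.normal(size=(M, p))    # learned weight matrix
sigma2, w = 1.0, np.ones(n)

x_new = rng.normal(size=n)
print("predicted output:", predict(x_new, alpha, A, sigma2, w))
```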

2.3. Review of Membership-Mappings-Based Conditionally Deep Autoencoders

Definition 3
(Membership-Mapping Autoencoder [15]). A membership-mapping autoencoder, $\mathcal{G}: \mathbb{R}^p \to \mathbb{R}^p$, maps an input vector $y \in \mathbb{R}^p$ to $\mathcal{G}(y) \in \mathbb{R}^p$ such that
$\mathcal{G}(y) \stackrel{\mathrm{def}}{=} \left[ \mathcal{F}_1(P y) \ \cdots \ \mathcal{F}_p(P y) \right]^T$,
where $\mathcal{F}_j$ ($j \in \{1, 2, \ldots, p\}$) is a Student-t membership mapping and $P \in \mathbb{R}^{n \times p}$ ($n \leq p$) is a matrix such that the product $P y$ is a lower-dimensional encoding of $y$.
Definition 4
(Conditionally Deep Membership-Mapping Autoencoder (CDMMA) [15,22]). A conditionally deep membership-mapping autoencoder, $\mathcal{D}: \mathbb{R}^p \to \mathbb{R}^p$, maps a vector $y \in \mathbb{R}^p$ to $\mathcal{D}(y) \in \mathbb{R}^p$ through a nested composition of a finite number of membership-mapping autoencoders such that
$y^l = (\mathcal{G}^l \circ \cdots \circ \mathcal{G}^2 \circ \mathcal{G}^1)(y), \quad l \in \{1, 2, \ldots, L\}$
$l^* = \arg\min_{l \in \{1, 2, \ldots, L\}} \| y - y^l \|^2$
$\mathcal{D}(y) = y^{l^*}$,
where $\mathcal{G}^l(\cdot)$ is a membership-mapping autoencoder (Definition 3).
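The defining feature of a CDMMA is that the output is taken from whichever layer of the nested composition best reconstructs the input. The sketch below illustrates only this selection logic, with stand-in layer maps in place of trained membership-mapping autoencoders (an illustrative assumption, not the authors' implementation):

```python
# Sketch of the CDMMA output selection (Definition 4): apply the nested
# composition of autoencoders and return the layer output closest to y.
import numpy as np

def cdmma_output(y, layers):
    """layers: list of callables G_1, ..., G_L (each R^p -> R^p).
    Returns y^{l*} where l* minimizes ||y - y^l||^2 over the nested outputs."""
    outputs, z = [], y
    for G_l in layers:
        z = G_l(z)                 # y^l = (G_l o ... o G_1)(y)
        outputs.append(z)
    errors = [np.sum((y - yl) ** 2) for yl in outputs]
    return outputs[int(np.argmin(errors))]

# Stand-in "autoencoders" (in practice: trained membership-mapping autoencoders).
rng = np.random.default_rng(3)
p = 4
layers = [lambda v, s=s: 0.9 * v + 0.01 * s for s in rng.normal(size=(3, p))]
y = rng.normal(size=p)
print("D(y) =", cdmma_output(y, layers))
```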
CDMMA discovers layers of increasingly abstract data representation with the lowest-level data features being modeled by the first layer and the highest-level data features being modeled by the end layer [15,22]. An algorithm (stated as Algorithm 2) has been provided in [15,22] for the variational learning of CDMMA.
Algorithm 2 Variational learning of CDMMA [15,22]
Require: Dataset $\mathbf{Y} = \{y^i \in \mathbb{R}^p \mid i \in \{1, \ldots, N\}\}$; the subspace dimension $n \in \{1, 2, \ldots, p\}$; maximum number of auxiliary points $M_{max} \in \mathbb{Z}_+$ with $M_{max} \leq N$; the number of layers $L \in \mathbb{Z}_+$.
1: for $l = 1$ to $L$ do
2:     Set the subspace dimension associated to the $l$-th layer as $n^l = \max(n - l + 1, 1)$.
3:     Define $P^l \in \mathbb{R}^{n^l \times p}$ such that the $i$-th row of $P^l$ is equal to the transpose of the eigenvector corresponding to the $i$-th largest eigenvalue of the sample covariance matrix of dataset $\mathbf{Y}$.
4:     Define a latent variable $x^{l,i} \in \mathbb{R}^{n^l}$, for $i \in \{1, \ldots, N\}$, as
       $x^{l,i} := \begin{cases} P^l y^i & \text{if } l = 1, \\ P^l\, \hat{y}^{l-1}(x^{l-1,i}; \mathbb{M}^{l-1}) & \text{if } l > 1, \end{cases}$
       where $\hat{y}^{l-1}$ is the estimated output of the $(l-1)$-th layer computed using (22) for the parameters set $\mathbb{M}^{l-1} = \{\alpha^{l-1}, \mathbf{a}^{l-1}, M^{l-1}, \sigma^{l-1}, w^{l-1}\}$.
5:     Define $M_{max}^l$ as
       $M_{max}^l := \begin{cases} M_{max} & \text{if } l = 1, \\ M^{l-1} & \text{if } l > 1. \end{cases}$
6:     Compute the parameters set $\mathbb{M}^l = \{\alpha^l, \mathbf{a}^l, M^l, \sigma^l, w^l\}$, characterizing the membership mappings associated to the $l$-th layer, using Algorithm 1 on the dataset $\{(x^{l,i}, y^i) \mid i \in \{1, \ldots, N\}\}$ with the maximum possible number of auxiliary points $M_{max}^l$.
7: end for
8: return the parameters set $\mathbb{M} = \{\{\mathbb{M}^1, \ldots, \mathbb{M}^L\}, \{P^1, \ldots, P^L\}\}$.
Definition 5
(CDMMA Filtering [15,22]). Given a CDMMA with its parameters represented by a set $\mathbb{M} = \{\{\mathbb{M}^1, \ldots, \mathbb{M}^L\}, \{P^1, \ldots, P^L\}\}$, the autoencoder can be applied for filtering a given input vector $y \in \mathbb{R}^p$ as follows:
$x^l(y; \mathbb{M}) = \begin{cases} P^l y, & l = 1 \\ P^l\, \hat{y}^{l-1}(x^{l-1}; \mathbb{M}^{l-1}), & l \geq 2 \end{cases}$
Here, $\hat{y}^{l-1}$ is the output of the $(l-1)$-th layer estimated using (22). Finally, the CDMMA's output, $\mathcal{D}(y; \mathbb{M})$, is given as
$\hat{\mathcal{D}}(y; \mathbb{M}) = \hat{y}^{l^*}(x^{l^*}; \mathbb{M}^{l^*})$
$l^* = \arg\min_{l \in \{1, \ldots, L\}} \| y - \hat{y}^{l}(x^{l}; \mathbb{M}^{l}) \|^2$.
For a big dataset, the computational time required by Algorithm 2 for learning will be high. To circumvent the problem of large computation time for processing big data, it is suggested in [15,22] that the data be partitioned into subsets and that, corresponding to each data subset, a separate CDMMA be learned. This motivates the definition of a wide CDMMA as in Definition 6. For the variational learning of a wide CDMMA, Algorithm 3 follows from [15,22], where the choice of the number of subsets as $S = \lceil N/1000 \rceil$ is driven by the consideration that each subset contains around 1000 data points, since processing up to 1000 data points with a CDMMA is not computationally challenging.
Definition 6
(A Wide CDMMA [15,22]). A wide CDMMA, $\mathcal{WD}: \mathbb{R}^p \to \mathbb{R}^p$, maps a vector $y \in \mathbb{R}^p$ to $\mathcal{WD}(y) \in \mathbb{R}^p$ through a parallel composition of $S$ ($S \in \mathbb{Z}_+$) CDMMAs such that
$\mathcal{WD}(y) = \mathcal{D}^{s^*}(y)$
$s^* = \arg\min_{s \in \{1, 2, \ldots, S\}} \| y - \mathcal{D}^{s}(y) \|^2$,
where $\mathcal{D}^{s}(y)$ is the output of the $s$-th CDMMA.
Algorithm 3 Variational learning of wide CDMMA [15,22]
Require: Dataset $\mathbf{Y} = \{y^i \in \mathbb{R}^p \mid i \in \{1, \ldots, N\}\}$; the subspace dimension $n \in \{1, 2, \ldots, p\}$; ratio $r_{max} \in (0, 1]$; the number of layers $L \in \mathbb{Z}_+$.
1: Apply k-means clustering to partition $\mathbf{Y}$ into $S$ subsets, $\{\mathbf{Y}^1, \ldots, \mathbf{Y}^S\}$, where $S = \lceil N/1000 \rceil$.
2: for $s = 1$ to $S$ do
3:     Build a CDMMA, $\mathbb{M}^s$, by applying Algorithm 2 on $\mathbf{Y}^s$, taking $n$ as the subspace dimension, the maximum number of auxiliary points as equal to $r_{max} \times \#\mathbf{Y}^s$ (where $\#\mathbf{Y}^s$ is the number of data points in $\mathbf{Y}^s$), and $L$ as the number of layers.
4: end for
5: return the parameters set $\mathbb{P} = \{\mathbb{M}^s\}_{s=1}^{S}$.
Definition 7
(Wide CDMMA Filtering [15,22]). Given a wide CDMMA with its parameters represented by a set $\mathbb{P} = \{\mathbb{M}^s\}_{s=1}^{S}$, the autoencoder can be applied for filtering a given input vector $y \in \mathbb{R}^p$ as follows:
$\widehat{\mathcal{WD}}(y; \mathbb{P}) = \hat{\mathcal{D}}(y; \mathbb{M}^{s^*})$
$s^* = \arg\min_{s \in \{1, 2, \ldots, S\}} \| y - \hat{\mathcal{D}}(y; \mathbb{M}^{s}) \|^2$,
where $\hat{\mathcal{D}}(y; \mathbb{M}^{s})$ is the output of the $s$-th CDMMA estimated using (30).

2.4. Membership Mappings for Classification

A classifier (i.e., Definition 8) and an algorithm for its variational learning (stated as Algorithm 4) follow from [15,22].
Definition 8
(A Classifier [15,22]). A classifier, $\mathcal{C}: \mathbb{R}^p \to \{1, 2, \ldots, C\}$, maps a vector $y \in \mathbb{R}^p$ to $\mathcal{C}(y) \in \{1, 2, \ldots, C\}$ such that
$\mathcal{C}(y; \{\mathbb{P}_c\}_{c=1}^{C}) = \arg\min_{c \in \{1, 2, \ldots, C\}} \| y - \widehat{\mathcal{WD}}(y; \mathbb{P}_c) \|^2$
where $\widehat{\mathcal{WD}}(y; \mathbb{P}_c)$, computed using (34), is the output of the $c$-th wide CDMMA. The classifier assigns to an input vector the label of the class whose associated autoencoder best reconstructs the input vector.
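Definition 8 classifies by reconstruction error: each class has its own (wide CDMMA) autoencoder, and the input is assigned to the class whose autoencoder reconstructs it best. A generic sketch of this decision rule with placeholder per-class reconstruction functions (not the learned wide CDMMAs) is given below:

```python
# Sketch of the reconstruction-based classifier in Definition 8:
# C(y) = argmin_c || y - WD_hat(y; P_c) ||^2.
import numpy as np

def classify(y, class_autoencoders):
    """class_autoencoders: list of callables, one per class, each mapping
    y in R^p to its reconstruction in R^p."""
    errors = [np.sum((y - ae(y)) ** 2) for ae in class_autoencoders]
    return int(np.argmin(errors)) + 1      # class labels 1, ..., C

# Placeholder per-class autoencoders (in practice: learned wide CDMMAs).
rng = np.random.default_rng(4)
p, C = 5, 3
centers = rng.normal(size=(C, p))
class_autoencoders = [lambda y, m=m: 0.5 * y + 0.5 * m for m in centers]

y = centers[1] + 0.1 * rng.normal(size=p)  # sample near class 2's center
print("predicted class:", classify(y, class_autoencoders))
```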
Algorithm 4 Variational learning of the classifier [15,22]
Require: Labeled dataset $\mathbf{Y} = \{\mathbf{Y}_c \mid \mathbf{Y}_c = \{y^{i,c} \in \mathbb{R}^p \mid i \in \{1, \ldots, N_c\}\},\ c \in \{1, \ldots, C\}\}$; the subspace dimension $n \in \{1, \ldots, p\}$; ratio $r_{max} \in (0, 1]$; the number of layers $L \in \mathbb{Z}_+$.
1: for $c = 1$ to $C$ do
2:     Build a wide CDMMA, $\mathbb{P}_c = \{\mathbb{M}_c^s\}_{s=1}^{S_c}$, by applying Algorithm 3 on $\mathbf{Y}_c$ for the given $n$, $r_{max}$, and $L$.
3: end for
4: return the parameters set $\{\mathbb{P}_c\}_{c=1}^{C}$.

2.5. Review of Membership-Mappings-Based Privacy-Preserving Transferable Learning

A privacy-preserving semi-supervised transfer and multi-task learning problem has been recently addressed in [22] by means of variational membership mappings. The method, as suggested in [22], involves the following steps:

2.5.1. Optimal Noise Adding Mechanism for Differentially Private Classifiers

The approach suggested in [22] relies on a tailored noise-adding mechanism to achieve a given differential privacy loss bound with the minimum perturbation of the data. In particular, Algorithm 5 is suggested for a differentially private approximation of data samples, and Algorithm 6 is suggested for building a differentially private classifier.
Algorithm 5 Differentially private approximation of data samples [22]
Require: Dataset $\mathbf{Y} = \{y^i \in \mathbb{R}^p \mid i \in \{1, \ldots, N\}\}$; differential privacy parameters: $d \in \mathbb{R}_+$, $\epsilon \in \mathbb{R}_+$, $\delta \in (0, 1)$.
1: A differentially private approximation of the data samples is provided as
   $y_j^{+i} = y_j^i + F_{v_j^i}^{-1}(r_j^i; \epsilon, \delta, d)$, with $r_j^i$ drawn uniformly at random from $(0, 1)$, and
   $F_{v_j^i}^{-1}(r_j^i; \epsilon, \delta, d) = \begin{cases} \frac{d}{\epsilon} \log\left( \frac{2 r_j^i}{1 - \delta} \right), & r_j^i < \frac{1 - \delta}{2} \\ 0, & r_j^i \in \left[ \frac{1 - \delta}{2}, \frac{1 + \delta}{2} \right] \\ -\frac{d}{\epsilon} \log\left( \frac{2 (1 - r_j^i)}{1 - \delta} \right), & r_j^i > \frac{1 + \delta}{2} \end{cases}$
   where $y_j^{+i}$ is the $j$-th element of $y^{+i} \in \mathbb{R}^p$.
2: return $\mathbf{Y}^+ = \{y^{+i} \in \mathbb{R}^p \mid i \in \{1, \ldots, N\}\}$.
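A sketch of the noise-adding step in Algorithm 5, transcribed from the inverse-CDF formula above (the sign conventions follow the reconstruction given here and should be checked against [22]):

```python
# Sketch of the differentially private perturbation in Algorithm 5:
# each element gets additive noise drawn via the inverse CDF F^{-1}(r; eps, delta, d).
import numpy as np

def dp_noise(r, eps, delta, d):
    """Inverse-CDF noise: zero with probability delta, otherwise a
    (truncated-Laplace-like) value scaled by d/eps."""
    lo, hi = (1.0 - delta) / 2.0, (1.0 + delta) / 2.0
    if r < lo:
        return (d / eps) * np.log(2.0 * r / (1.0 - delta))
    if r > hi:
        return -(d / eps) * np.log(2.0 * (1.0 - r) / (1.0 - delta))
    return 0.0

def dp_approximate(Y, eps, delta, d, rng):
    """Element-wise differentially private approximation Y+ of the dataset Y."""
    R = rng.uniform(0.0, 1.0, size=Y.shape)
    noise = np.vectorize(lambda r: dp_noise(r, eps, delta, d))(R)
    return Y + noise

rng = np.random.default_rng(5)
Y = rng.normal(size=(4, 3))
print(dp_approximate(Y, eps=1.0, delta=1e-5, d=1.0, rng=rng))
```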
Algorithm 6 Variational learning of a differentially private classifier [22]
Require: Differentially private approximated dataset: $\mathbf{Y}^+ = \{\mathbf{Y}_c^+ \mid c \in \{1, \ldots, C\}\}$; the subspace dimension $n \in \{1, \ldots, p\}$; ratio $r_{max} \in (0, 1]$; the number of layers $L \in \mathbb{Z}_+$.
1: Build a classifier, $\{\mathbb{P}_c^+\}_{c=1}^{C}$, by applying Algorithm 4 on $\mathbf{Y}^+$ for the given $n$, $r_{max}$, and $L$.
2: return $\{\mathbb{P}_c^+\}_{c=1}^{C}$.

2.5.2. Semi-Supervised Transfer Learning Scenario

The aim is to transfer the knowledge extracted by a classifier trained using a source dataset to the classifier of the target domain such that the privacy of the source dataset is preserved. Let $\{\mathbf{Y}_c^{sr}\}_{c=1}^{C}$ be the labeled source dataset, where $\mathbf{Y}_c^{sr} = \{y_{sr}^{i,c} \in \mathbb{R}^{p^{sr}} \mid i \in \{1, \ldots, N_c^{sr}\}\}$ represents the $c$-th class labeled samples. The target dataset consists of a few labeled samples $\{\mathbf{Y}_c^{tg}\}_{c=1}^{C}$ (with $\mathbf{Y}_c^{tg} = \{y_{tg}^{i,c} \in \mathbb{R}^{p^{tg}} \mid i \in \{1, \ldots, N_c^{tg}\}\}$) and another set of unlabeled samples $\mathbf{Y}_*^{tg} = \{y_{tg}^{i,*} \in \mathbb{R}^{p^{tg}} \mid i \in \{1, \ldots, N_*^{tg}\}\}$.

2.5.3. Differentially Private Source Domain Classifier

For given differential privacy parameters $d$, $\epsilon$, and $\delta$, Algorithm 5 is applied on $\mathbf{Y}_c^{sr}$ to obtain the differentially private approximated data samples, $\mathbf{Y}_c^{+sr} = \{y_{sr}^{+i,c} \in \mathbb{R}^{p^{sr}} \mid i \in \{1, \ldots, N_c^{sr}\}\}$, for all $c \in \{1, \ldots, C\}$. Algorithm 6 is applied on $\{\mathbf{Y}_c^{+sr}\}_{c=1}^{C}$ to build a differentially private source domain classifier characterized by the parameters sets $\{\mathbb{P}_c^{+sr}\}_{c=1}^{C}$.

2.5.4. Latent Subspace Transformation Matrices

For a given subspace dimension $n_{st} \in \{1, 2, \ldots, \min(p^{sr}, p^{tg})\}$, the source domain transformation matrix $V^{+sr} \in \mathbb{R}^{n_{st} \times p^{sr}}$ is defined with its $i$-th row equal to the transpose of the eigenvector corresponding to the $i$-th largest eigenvalue of the sample covariance matrix computed on the differentially private approximated source samples. The target domain transformation matrix $V^{tg} \in \mathbb{R}^{n_{st} \times p^{tg}}$ is defined with its $i$-th row equal to the transpose of the eigenvector corresponding to the $i$-th largest eigenvalue of the sample covariance matrix computed on the target samples.

2.5.5. Subspace Alignment

A target sample is mapped to the source data space via the following transformation:
$y_{tg \to sr}(y_{tg}) = \begin{cases} y_{tg}, & p^{sr} = p^{tg} \\ (V^{+sr})^T V^{tg} y_{tg}, & p^{sr} \neq p^{tg} \end{cases}$
Both labeled and unlabeled target datasets are transformed to define the following sets:
$\mathbf{Y}_c^{tg \to sr} := \{ y_{tg \to sr}(y_{tg}) \mid y_{tg} \in \mathbf{Y}_c^{tg} \}$
$\mathbf{Y}_*^{tg \to sr} := \{ y_{tg \to sr}(y_{tg}) \mid y_{tg} \in \mathbf{Y}_*^{tg} \}$.
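The following sketch illustrates the PCA-based subspace alignment used above: each domain's transformation matrix is built from the top eigenvectors of its sample covariance, and a target sample is mapped into the source data space via $(V^{+sr})^T V^{tg} y_{tg}$ when the dimensions differ. The variable names and random data are illustrative assumptions.

```python
# Sketch of the subspace alignment step: map a target sample into the
# source data space via y_{tg->sr} = (V_sr)^T V_tg y_tg when p_sr != p_tg.
import numpy as np

def pca_rows(X, n_st):
    """Rows are the transposed top-n_st eigenvectors of the sample covariance of X."""
    cov = np.cov(X, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)          # ascending order
    top = eigvecs[:, np.argsort(eigvals)[::-1][:n_st]]
    return top.T                                    # shape (n_st, p)

rng = np.random.default_rng(6)
p_sr, p_tg, n_st = 8, 6, 4
Y_src_plus = rng.normal(size=(200, p_sr))           # DP-approximated source samples
Y_tg = rng.normal(size=(150, p_tg))                 # target samples

V_sr = pca_rows(Y_src_plus, n_st)                   # (n_st, p_sr)
V_tg = pca_rows(Y_tg, n_st)                         # (n_st, p_tg)

y_tg = Y_tg[0]
y_tg_to_sr = V_sr.T @ (V_tg @ y_tg)                 # mapped sample in R^{p_sr}
print(y_tg_to_sr.shape)
```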

2.5.6. Target Domain Classifier

The $k$-th iteration for building the target domain classifier, where $k \in \{1, \ldots, it\_max\}$, consists of the following updates:
$\{\mathbb{P}_c^{tg}|_k\}_{c=1}^{C} = \text{Algorithm 4}\left( \left\{ \mathbf{Y}_c^{tg \to sr} \cup \mathbf{Y}_{*,c}^{tg \to sr}|_{k-1} \right\}_{c=1}^{C},\ n|_k,\ r_{max},\ L \right)$
$\mathbf{Y}_{*,c}^{tg \to sr}|_k = \left\{ y_{tg \to sr}^{i,*} \in \mathbf{Y}_*^{tg \to sr} \mid \mathcal{C}\left( y_{tg \to sr}^{i,*}; \{\mathbb{P}_c^{tg}|_k\}_{c=1}^{C} \right) = c,\ i \in \{1, \ldots, N_*^{tg}\} \right\}$
where $n|_1, n|_2, \ldots$ is a monotonically non-decreasing sequence.

2.5.7. source2target Model

The mapping from source to target domain is learned by means of a variational membership-mappings-based model as follows:
$\mathbb{M}^{sr \to tg} = \text{Algorithm 1}\left( \mathcal{D},\ M_{max} \right)$
$\mathcal{D} := \left\{ \left( \widehat{\mathcal{WD}}(y; \mathbb{P}_c^{+sr}),\ y \right) \mid y \in \mathbf{Y}_c^{tg \to sr} \cup \mathbf{Y}_{*,c}^{tg \to sr}|_{it\_max},\ c \in \{1, \ldots, C\} \right\}$
$M_{max} = \min\left( N^{tg}/2,\ 1000 \right)$
where $N^{tg} = |\mathcal{D}|$ is the total number of target samples, $\widehat{\mathcal{WD}}(\cdot; \cdot)$ is defined as in (34), $\mathbf{Y}_c^{tg \to sr}$ is defined as in (40), and $\mathbf{Y}_{*,c}^{tg \to sr}$ is defined as in (43).

2.5.8. Transfer and Multi-Task Learning

Both source and target domain classifiers are combined with the source2target model to predict the label associated to a target sample $y_{tg \to sr}$ as
$\hat{c}\left( y_{tg \to sr}; \{\mathbb{P}_c^{tg}\}_{c=1}^{C}, \{\mathbb{P}_c^{+sr}\}_{c=1}^{C}, \mathbb{M}^{sr \to tg} \right) = \arg\min_{c \in \{1, 2, \ldots, C\}} \min\Big\{ \left\| y_{tg \to sr} - \widehat{\mathcal{WD}}(y_{tg \to sr}; \mathbb{P}_c^{tg}) \right\|^2,\ \left\| y_{tg \to sr} - \hat{y}\left( \widehat{\mathcal{WD}}(y_{tg \to sr}; \mathbb{P}_c^{+sr}); \mathbb{M}^{sr \to tg} \right) \right\|^2,\ \left\| y_{tg \to sr} - \widehat{\mathcal{WD}}(y_{tg \to sr}; \mathbb{P}_c^{+sr}) \right\|^2 \Big\}$
where $\hat{y}\left( \cdot; \mathbb{M}^{sr \to tg} \right)$ is the output of the source2target model computed using (22).

3. Variational Membership-Mapping Bayesian Models

We consider the application of membership mappings to solve the inverse modeling problem related to $x = f_{t \to x}(t)$, where $f_{t \to x}: \mathbb{R}^q \to \mathbb{R}^n$ is a forward map. Specifically, a membership-mappings model is used to approximate the inverse mapping $f_{t \to x}^{-1}$.

3.1. A Prior Model

Given a dataset $\{(x^i, t^i) \mid i \in \{1, \ldots, N\}\}$, Algorithm 1 can be used to build a membership-mappings model characterized by a set of parameters, say $\mathbb{M}^{x \to t} = \{\alpha^{x \to t}, \mathbf{a}, M, \sigma, w\}$ (where $x \to t$ indicates that the mapping from $x$ to $t$ has been approximated by the membership mappings). It follows from (22) that the membership-mappings model's predicted output corresponding to an input $x$ is given as
$\hat{t}(x; \mathbb{M}^{x \to t}) = (\alpha^{x \to t})^T (G(x))^T$
where $G(\cdot) \in \mathbb{R}^{1 \times M}$ is a vector-valued function defined as in (21). The $k$-th element of $\hat{t}$ is given as
$\hat{t}_k(x; \mathbb{M}^{x \to t}) = G(x)\, \alpha_k^{x \to t}$
where $\alpha_k^{x \to t}$ is the $k$-th column of the matrix $\alpha^{x \to t}$.
Expression (49) allows estimating, for any arbitrary $x$, the corresponding $t$ using a membership-mappings model. This motivates introducing the following prior model:
$t_k = G(x)\, \theta_k + e_k$
$\theta_k \sim \mathcal{N}(\alpha_k^{x \to t}, \Lambda_k^{-1})$
$e_k \sim \mathcal{N}(0, \gamma^{-1})$
$\gamma \sim \mathrm{Gamma}(a_\gamma, b_\gamma)$
where $k \in \{1, \ldots, q\}$; $\mathcal{N}(\alpha_k^{x \to t}, \Lambda_k^{-1})$ is the multivariate normal distribution with mean $\alpha_k^{x \to t}$ and covariance $\Lambda_k^{-1}$; and $\mathrm{Gamma}(a_\gamma, b_\gamma)$ is the Gamma distribution with shape parameter $a_\gamma$ and rate parameter $b_\gamma$. The estimation provided by the membership-mappings model $\mathbb{M}^{x \to t}$ (i.e., (49)) is incorporated in the prior model (50)-(53), since
$\mathrm{E}[t_k] = \hat{t}_k(x; \mathbb{M}^{x \to t})$.

3.2. Variational Bayesian Inference

Given the dataset $\{(x^i \in \mathbb{R}^n, t^i \in \mathbb{R}^q) \mid i \in \{1, 2, \ldots, N\}\}$, the variational Bayesian method is considered for the inference of the stochastic model (50), with priors (51)-(53). For all $i \in \{1, \ldots, N\}$ and $k \in \{1, \ldots, q\}$, we have
$t_k^i = G(x^i)\, \theta_k + e_k^i$,
where $\theta_k \sim \mathcal{N}(\alpha_k^{x \to t}, \Lambda_k^{-1})$ and $e_k^i \sim \mathcal{N}(0, \gamma^{-1})$. Define $\mathbf{t}_k \in \mathbb{R}^N$, $\mathbf{e}_k \in \mathbb{R}^N$, and $R_{\mathbf{x}} \in \mathbb{R}^{N \times M}$ as
$\mathbf{t}_k = \left[ t_k^1 \ \cdots \ t_k^N \right]^T$
$\mathbf{e}_k = \left[ e_k^1 \ \cdots \ e_k^N \right]^T$
$R_{\mathbf{x}} = \left[ (G(x^1))^T \ \cdots \ (G(x^N))^T \right]^T$.
For all $k \in \{1, \ldots, q\}$, we have
$\mathbf{t}_k = R_{\mathbf{x}} \theta_k + \mathbf{e}_k$
$p(\theta_k; \alpha_k^{x \to t}, \Lambda_k) = \frac{1}{\sqrt{(2\pi)^M |(\Lambda_k)^{-1}|}} \exp\left( -0.5\, (\theta_k - \alpha_k^{x \to t})^T \Lambda_k (\theta_k - \alpha_k^{x \to t}) \right)$
$p(\mathbf{e}_k; \gamma) = \left( \frac{\gamma}{2\pi} \right)^{N/2} \exp\left( -0.5\, \gamma \| \mathbf{e}_k \|^2 \right)$
$p(\gamma; a_\gamma, b_\gamma) = \frac{(b_\gamma)^{a_\gamma}}{\Gamma(a_\gamma)} (\gamma)^{a_\gamma - 1} \exp(-b_\gamma \gamma)$.
Define the following sets:
$\mathbf{t} = \{\mathbf{t}_1, \ldots, \mathbf{t}_q\}$
$\theta = \{\theta_1, \ldots, \theta_q\}$
and consider the marginal probability of the data $\mathbf{t}$, which is given as
$p(\mathbf{t}) = \int \mathrm{d}\theta\, \mathrm{d}\gamma\, p(\mathbf{t}, \theta, \gamma)$.
Let $q(\theta, \gamma)$ be an arbitrary distribution. The log marginal probability of $\mathbf{t}$ can be expressed as
$\log(p(\mathbf{t})) = \int \mathrm{d}\theta\, \mathrm{d}\gamma\, q(\theta, \gamma) \log(p(\mathbf{t}))$
$= \int \mathrm{d}\theta\, \mathrm{d}\gamma\, q(\theta, \gamma) \log\left( \frac{p(\mathbf{t}, \theta, \gamma)}{q(\theta, \gamma)} \right) + \int \mathrm{d}\theta\, \mathrm{d}\gamma\, q(\theta, \gamma) \log\left( \frac{q(\theta, \gamma)}{p(\theta, \gamma \mid \mathbf{t})} \right)$.
Define
$\mathcal{L}(q(\theta, \gamma), \mathbf{t}) := \int \mathrm{d}\theta\, \mathrm{d}\gamma\, q(\theta, \gamma) \log\left( \frac{p(\mathbf{t}, \theta, \gamma)}{q(\theta, \gamma)} \right)$
to express (66) as
$\log(p(\mathbf{t})) = \mathcal{L}(q(\theta, \gamma), \mathbf{t}) + \mathrm{KL}\left( q(\theta, \gamma) \,\|\, p(\theta, \gamma \mid \mathbf{t}) \right)$
where $\mathrm{KL}$ is the Kullback-Leibler divergence of $p(\theta, \gamma \mid \mathbf{t})$ from $q(\theta, \gamma)$, and $\mathcal{L}$, referred to as the negative free energy, provides a lower bound on the logarithmic evidence for the data.
The variational Bayesian approach minimizes the difference (in terms of KL divergence) between the variational and true posteriors by analytically maximizing the negative free energy $\mathcal{L}$ over the variational distributions. However, the analytical derivation requires the following widely used mean-field approximation:
$q(\theta, \gamma) = q(\theta)\, q(\gamma)$
$= q(\theta_1) \cdots q(\theta_q)\, q(\gamma)$.
Applying the standard variational optimization technique (as in [23,24,25,26,27,28,29]), it can be verified that the optimal variational distributions maximizing $\mathcal{L}$ are as follows:
$q^*(\theta_k) = \frac{1}{\sqrt{(2\pi)^M |(\hat{\Lambda}_k)^{-1}|}} \exp\left( -0.5\, (\theta_k - \hat{m}_k)^T \hat{\Lambda}_k (\theta_k - \hat{m}_k) \right)$
$q^*(\gamma) = \frac{(\hat{b}_\gamma)^{\hat{a}_\gamma}}{\Gamma(\hat{a}_\gamma)} (\gamma)^{\hat{a}_\gamma - 1} \exp(-\hat{b}_\gamma \gamma)$
where the parameters $(\hat{\Lambda}_k, \hat{m}_k, \hat{a}_\gamma, \hat{b}_\gamma)$ satisfy the following:
$\hat{\Lambda}_k = \Lambda_k + \left( \hat{a}_\gamma / \hat{b}_\gamma \right) (R_{\mathbf{x}})^T R_{\mathbf{x}}$
$\hat{m}_k = (\hat{\Lambda}_k)^{-1} \left( \Lambda_k \alpha_k^{x \to t} + \left( \hat{a}_\gamma / \hat{b}_\gamma \right) (R_{\mathbf{x}})^T \mathbf{t}_k \right)$
$\hat{a}_\gamma = a_\gamma + 0.5\, q N$
$\hat{b}_\gamma = b_\gamma + 0.5 \sum_{k=1}^{q} \left( \| \mathbf{t}_k - R_{\mathbf{x}} \hat{m}_k \|^2 + \mathrm{Tr}\left( (\hat{\Lambda}_k)^{-1} (R_{\mathbf{x}})^T R_{\mathbf{x}} \right) \right)$.
Algorithm 7 is suggested for variational Bayesian inference of the model.
Algorithm 7 Variational membership-mapping Bayesian model inference
Require: Dataset $\{(x^i \in \mathbb{R}^n, t^i \in \mathbb{R}^q) \mid i \in \{1, \ldots, N\}\}$ and maximum possible number of auxiliary points $M_{max} \in \mathbb{Z}_+$ with $M_{max} \leq N$.
1: Apply Algorithm 1 on the dataset to build a variational membership-mappings model $\mathbb{M}^{x \to t} = \{\alpha^{x \to t}, \mathbf{a}, M, \sigma, w\}$.
2: Choose non-informative priors for the covariance matrices, i.e., $\Lambda_k = 10^{-3} I_M$, $k \in \{1, \ldots, q\}$.
3: Choose non-informative priors for the noise variance, i.e., $a_\gamma = 10^{-3}$, $b_\gamma = 10^{-3}$.
4: Initialize $\hat{a}_\gamma / \hat{b}_\gamma = 1$.
5: repeat
6:     Update $\{\hat{\Lambda}_k, \hat{m}_k \mid k \in \{1, \ldots, q\}\}$, $\hat{a}_\gamma$, $\hat{b}_\gamma$ using (74), (75), (76), and (77).
7: until convergence.
8: return the parameters set $\mathcal{BM}^{x \to t} = \{\{\hat{m}_k, \hat{\Lambda}_k \mid k \in \{1, \ldots, q\}\}, \hat{a}_\gamma, \hat{b}_\gamma\}$.
The functionality of Algorithm 7 is as follows:
  • Step 1 builds a variational membership-mappings model using Algorithm 1 from previous work [22].
  • Steps 2 and 3 choose relatively non-informative priors.
  • The loop between step 5 and step 7 applies variational Bayesian inference to iteratively estimate the parameters of the optimal distributions until convergence.
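To illustrate the fixed-point structure of steps 5-7, the sketch below iterates the updates (74)-(77) on a toy regression problem, but with a generic feature matrix standing in for the membership-mapping features $R_{\mathbf{x}}$ and with a zero prior mean $\alpha^{x \to t}$; it is therefore only a schematic stand-in for Algorithm 7, not the authors' implementation.

```python
# Schematic stand-in for Algorithm 7: iterate the variational updates
# (74)-(77) for the linear-Gaussian model t_k = R theta_k + e_k.
# R plays the role of the membership-mapping feature matrix R_x, and the
# prior mean alpha_k^{x->t} is assumed to be zero for simplicity.
import numpy as np

rng = np.random.default_rng(7)
N, M, q = 200, 10, 3                       # samples, features, output dimension
R = rng.normal(size=(N, M))                # stand-in for R_x (rows are G(x^i))
theta_true = rng.normal(size=(M, q))
T = R @ theta_true + 0.1 * rng.normal(size=(N, q))   # columns are t_1, ..., t_q

Lambda0 = 1e-3 * np.eye(M)                 # non-informative prior precision (step 2)
alpha0 = np.zeros((M, q))                  # prior mean (assumed zero here)
a0 = b0 = 1e-3                             # non-informative Gamma prior (step 3)

a_hat, b_hat = 1.0, 1.0                    # so that a_hat / b_hat = 1 (step 4)
for _ in range(50):                        # repeat ... until convergence (steps 5-7)
    gamma_mean = a_hat / b_hat
    Lambda_hat = Lambda0 + gamma_mean * R.T @ R                        # (74)
    # (75), all k at once since Lambda_hat is the same for every k here
    m_hat = np.linalg.solve(Lambda_hat, Lambda0 @ alpha0 + gamma_mean * R.T @ T)
    a_hat = a0 + 0.5 * q * N                                           # (76)
    resid = T - R @ m_hat
    trace_term = q * np.trace(np.linalg.solve(Lambda_hat, R.T @ R))
    b_hat = b0 + 0.5 * (np.sum(resid ** 2) + trace_term)               # (77)

print("estimated noise std:", np.sqrt(b_hat / a_hat))   # should be close to 0.1
print("max |m_hat - theta_true|:", np.abs(m_hat - theta_true).max())
```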
Remark 1
(Computational Complexity). The computational complexity of Algorithm 7 is asymptotically dominated by the computation of the inverse of the $M \times M$ matrix $\hat{\Lambda}_k$ in (75) to calculate $\hat{m}_k$. Thus, the computational complexity of Algorithm 7 is $O(M^3)$, where $M$ is the number of auxiliary points.
The optimal distributions determined using Algorithm 7 define the so-called variational membership-mapping Bayesian model (VMMBM), as stated in Remark 2.
Remark 2
(Variational Membership-Mapping Bayesian Model (VMMBM)). The inverse mapping, $f_{t \to x}^{-1}$, is approximated as
$t_k = G(x)\, \theta_k + e_k$,
$\theta_k \sim \mathcal{N}(\hat{m}_k, \hat{\Lambda}_k^{-1})$
$e_k \sim \mathcal{N}(0, \gamma^{-1})$
$\gamma \sim \mathrm{Gamma}(\hat{a}_\gamma, \hat{b}_\gamma)$
where $k \in \{1, \ldots, q\}$ and $(\hat{m}_k, \hat{\Lambda}_k, \hat{a}_\gamma, \hat{b}_\gamma)$ are returned by Algorithm 7.
Remark 3
(Estimation by VMMBM). Given any $x^*$, the variational membership-mapping Bayesian model $\mathcal{BM}^{x \to t}$ (returned by Algorithm 7) can be used to estimate the corresponding $t^*$ (such that $x^* = f_{t \to x}(t^*)$) as
$\tilde{t}(x^*; \mathcal{BM}^{x \to t}) = \left[ G(x^*)\, \hat{m}_1 \ \cdots \ G(x^*)\, \hat{m}_q \right]^T$.

4. Evaluation of the Information-Leakage

Consider a scenario where a variable $t$ is related to another variable $x$ through a mapping $f_{t \to x}$ such that $x = f_{t \to x}(t)$. The mutual information $I(t; x)$ measures the amount of information obtained about the variable $t$ through observing the variable $x$. Since $x = f_{t \to x}(t)$, the entropy $H(t)$ remains fixed, independent of the mapping $f_{t \to x}$, and thus the quantity $I(t; x) - H(t)$ is a measure of the amount of information about $t$ leaked by the mapping $f_{t \to x}$.
Definition 9
(Information Leakage). Under the scenario that $x = f_{t \to x}(t)$, a measure of the amount of information about $t$ leaked by the mapping $f_{t \to x}$ is defined as
$IL_{f_{t \to x}} := I(t; f_{t \to x}(t)) - H(t)$
$= I(t; x) - H(t)$.
The quantity $IL_{f_{t \to x}}$ is referred to as the information leakage.
This section is dedicated to answering the question: how can the information leakage be calculated without knowing the data distributions?

4.1. Variational Approximation of the Information Leakage

The mutual information between $t$ and $x$ is given as
$I(t; x) = H(t) - H(t \mid x)$
$= H(t) + \int p(t, x) \log(p(t \mid x))\, \mathrm{d}t\, \mathrm{d}x$
$= H(t) + \left\langle \log(p(t \mid x)) \right\rangle_{p(t, x)}$
where $\langle g(x) \rangle_{p(x)}$ denotes the expectation of a function $g(x)$ of a random variable with respect to the probability density function $p(x)$, and $H(t)$ and $H(t \mid x)$ are the marginal and conditional entropies, respectively. Consider the conditional probability of $t$, which is given as
$p(t \mid x) = \int \mathrm{d}\theta\, \mathrm{d}\gamma\, p(\theta, \gamma, t \mid x)$
where $\theta$ is the set defined in (63). Let $q(\theta, \gamma)$ be an arbitrary distribution. The log conditional probability of $t$ can be expressed as
$\log(p(t \mid x)) = \int \mathrm{d}\theta\, \mathrm{d}\gamma\, q(\theta, \gamma) \log(p(t \mid x))$
$= \int \mathrm{d}\theta\, \mathrm{d}\gamma\, q(\theta, \gamma) \log\left( \frac{p(\theta, \gamma, t \mid x)}{p(\theta, \gamma \mid t, x)} \right)$
$= \int \mathrm{d}\theta\, \mathrm{d}\gamma\, q(\theta, \gamma) \log\left( \frac{p(\theta, \gamma, t \mid x)}{q(\theta, \gamma)} \right) + \int \mathrm{d}\theta\, \mathrm{d}\gamma\, q(\theta, \gamma) \log\left( \frac{q(\theta, \gamma)}{p(\theta, \gamma \mid t, x)} \right)$.
Define
$\mathcal{L}(q(\theta, \gamma), t, x) := \int \mathrm{d}\theta\, \mathrm{d}\gamma\, q(\theta, \gamma) \log\left( \frac{p(\theta, \gamma, t \mid x)}{q(\theta, \gamma)} \right)$
to express (91) as
$\log(p(t \mid x)) = \mathcal{L}(q(\theta, \gamma), t, x) + \mathrm{KL}\left( q(\theta, \gamma) \,\|\, p(\theta, \gamma \mid t, x) \right)$
where $\mathrm{KL}$ is the Kullback-Leibler divergence of $p(\theta, \gamma \mid t, x)$ from $q(\theta, \gamma)$. Using (87),
$I(t; x) = H(t) + \left\langle \mathcal{L}(q(\theta, \gamma), t, x) \right\rangle_{p(t, x)} + \left\langle \mathrm{KL}\left( q(\theta, \gamma) \,\|\, p(\theta, \gamma \mid t, x) \right) \right\rangle_{p(t, x)}$.
That is,
$IL_{f_{t \to x}} = \left\langle \mathcal{L}(q(\theta, \gamma), t, x) \right\rangle_{p(t, x)} + \left\langle \mathrm{KL}\left( q(\theta, \gamma) \,\|\, p(\theta, \gamma \mid t, x) \right) \right\rangle_{p(t, x)}$.
Since the Kullback-Leibler divergence is always non-negative, it follows from (95) that $\langle \mathcal{L} \rangle_{p(t, x)}$ provides a lower bound on $IL_{f_{t \to x}}$, i.e.,
$IL_{f_{t \to x}} \geq \left\langle \mathcal{L}(q(\theta, \gamma), t, x) \right\rangle_{p(t, x)}$.
Our approach to approximating $IL_{f_{t \to x}}$ is to maximize its lower bound with respect to the variational distribution $q(\theta, \gamma)$. That is, we seek to solve
$\widehat{IL}_{f_{t \to x}} = \max_{q(\theta, \gamma)} \left\langle \mathcal{L}(q(\theta, \gamma), t, x) \right\rangle_{p(t, x)}$.
Result 1
(Analytical Expression for the Information Leakage). Given the model (78)-(81), $\widehat{IL}_{f_{t \to x}}$ is given as
$\widehat{IL}_{f_{t \to x}} = -\frac{q}{2} \log(2\pi) + \frac{q}{2}\left( \psi(\bar{a}_\gamma) - \log(\bar{b}_\gamma) \right) - \frac{\bar{a}_\gamma}{2 \bar{b}_\gamma} \sum_{k=1}^{q} \left\langle | t_k - G(x) \bar{m}_k |^2 \right\rangle_{p(t, x)} - \frac{\bar{a}_\gamma}{2 \bar{b}_\gamma} \sum_{k=1}^{q} \mathrm{Tr}\left( (\bar{\Lambda}_k)^{-1} \left\langle (G(x))^T G(x) \right\rangle_{p(x)} \right) - \frac{1}{2} \sum_{k=1}^{q} \left( (\hat{m}_k - \bar{m}_k)^T \hat{\Lambda}_k (\hat{m}_k - \bar{m}_k) + \mathrm{Tr}\left( \hat{\Lambda}_k (\bar{\Lambda}_k)^{-1} \right) - \log\frac{|(\bar{\Lambda}_k)^{-1}|}{|(\hat{\Lambda}_k)^{-1}|} \right) + \frac{q M}{2} - \hat{a}_\gamma \log\left( \bar{b}_\gamma / \hat{b}_\gamma \right) + \log\left( \Gamma(\bar{a}_\gamma) / \Gamma(\hat{a}_\gamma) \right) - (\bar{a}_\gamma - \hat{a}_\gamma)\, \psi(\bar{a}_\gamma) + (\bar{b}_\gamma - \hat{b}_\gamma)\, \bar{a}_\gamma / \bar{b}_\gamma$.
Here, $\psi(\cdot)$ is the digamma function and the parameters $(\bar{\Lambda}_k, \bar{m}_k, \bar{a}_\gamma, \bar{b}_\gamma)$ satisfy the following:
$\bar{\Lambda}_k = \hat{\Lambda}_k + \left( \bar{a}_\gamma / \bar{b}_\gamma \right) \left\langle (G(x))^T G(x) \right\rangle_{p(x)}$
$\bar{m}_k = (\bar{\Lambda}_k)^{-1} \left( \hat{\Lambda}_k \hat{m}_k + \frac{\bar{a}_\gamma}{\bar{b}_\gamma} \left\langle (G(x))^T t_k \right\rangle_{p(t, x)} \right)$
$\bar{a}_\gamma = \hat{a}_\gamma + 0.5\, q$
$\bar{b}_\gamma = \hat{b}_\gamma + \frac{1}{2} \sum_{k=1}^{q} \left\langle | t_k - G(x) \bar{m}_k |^2 \right\rangle_{p(t, x)} + \frac{1}{2} \sum_{k=1}^{q} \mathrm{Tr}\left( (\bar{\Lambda}_k)^{-1} \left\langle (G(x))^T G(x) \right\rangle_{p(x)} \right)$.
Proof of Result 1.
Consider
$\mathcal{L}(q(\theta, \gamma), t, x) = \left\langle \log(p(t \mid \theta, \gamma, x)) \right\rangle_{q(\theta, \gamma)} + \left\langle \log\left( p(\theta, \gamma) / q(\theta, \gamma) \right) \right\rangle_{q(\theta, \gamma)}$.
It follows from (78) and (80) that
$\log(p(t_k \mid \theta_k, \gamma, x)) = -0.5 \log(2\pi) + 0.5 \log(\gamma) - 0.5\, \gamma\, | t_k - G(x) \theta_k |^2$.
Since $t = \left[ t_1 \ \cdots \ t_q \right]^T$, we have
$\log(p(t \mid \theta, \gamma, x)) = -0.5\, q \log(2\pi) + 0.5\, q \log(\gamma) - 0.5\, \gamma \sum_{k=1}^{q} | t_k - G(x) \theta_k |^2$.
Using (105) and (70)-(71) in (103), we have
$\mathcal{L}(q(\theta, \gamma), t, x) = -\frac{q}{2} \log(2\pi) + \frac{q}{2} \left\langle \log(\gamma) \right\rangle_{q(\gamma)} - \frac{\left\langle \gamma \right\rangle_{q(\gamma)}}{2} \sum_{k=1}^{q} \left\langle | t_k - G(x) \theta_k |^2 \right\rangle_{q(\theta_k)} + \sum_{k=1}^{q} \left\langle \log\frac{p(\theta_k; \hat{m}_k, \hat{\Lambda}_k)}{q(\theta_k)} \right\rangle_{q(\theta_k)} + \left\langle \log\frac{p(\gamma; a_\gamma, b_\gamma)}{q(\gamma)} \right\rangle_{q(\gamma)}$.
Thus,
$\left\langle \mathcal{L}(q(\theta, \gamma), t, x) \right\rangle_{p(t, x)} = -\frac{q}{2} \log(2\pi) + \frac{q}{2} \left\langle \log(\gamma) \right\rangle_{q(\gamma)} - \frac{\left\langle \gamma \right\rangle_{q(\gamma)}}{2} \sum_{k=1}^{q} \left\langle | t_k |^2 \right\rangle_{p(t)} - \frac{\left\langle \gamma \right\rangle_{q(\gamma)}}{2} \sum_{k=1}^{q} \left\langle (\theta_k)^T \left\langle (G(x))^T G(x) \right\rangle_{p(x)} \theta_k \right\rangle_{q(\theta_k)} + \left\langle \gamma \right\rangle_{q(\gamma)} \sum_{k=1}^{q} \left\langle (\theta_k)^T \right\rangle_{q(\theta_k)} \left\langle (G(x))^T t_k \right\rangle_{p(t, x)} + \sum_{k=1}^{q} \left\langle \log\frac{p(\theta_k; \hat{m}_k, \hat{\Lambda}_k)}{q(\theta_k)} \right\rangle_{q(\theta_k)} + \left\langle \log\frac{p(\gamma; a_\gamma, b_\gamma)}{q(\gamma)} \right\rangle_{q(\gamma)}$.
Now, $\left\langle \mathcal{L}(q(\theta, \gamma), t, x) \right\rangle_{p(t, x)}$ can be maximized with respect to $q(\theta_k)$ and $q(\gamma)$ using variational optimization. It can be seen that the optimal distributions maximizing $\left\langle \mathcal{L}(q(\theta, \gamma), t, x) \right\rangle_{p(t, x)}$ are given as
$q^*(\theta_k) = \frac{1}{\sqrt{(2\pi)^M |(\bar{\Lambda}_k)^{-1}|}} \exp\left( -0.5\, (\theta_k - \bar{m}_k)^T \bar{\Lambda}_k (\theta_k - \bar{m}_k) \right)$
$q^*(\gamma) = \frac{(\bar{b}_\gamma)^{\bar{a}_\gamma}}{\Gamma(\bar{a}_\gamma)} (\gamma)^{\bar{a}_\gamma - 1} \exp(-\bar{b}_\gamma \gamma)$
where the parameters $(\bar{\Lambda}_k, \bar{m}_k, \bar{a}_\gamma, \bar{b}_\gamma)$ satisfy (99)-(102). The maximum attained value of $\left\langle \mathcal{L}(q(\theta, \gamma), t, x) \right\rangle_{p(t, x)}$ is given as
$\max_{q(\theta, \gamma)} \left\langle \mathcal{L}(q(\theta, \gamma), t, x) \right\rangle_{p(t, x)} = -\frac{q}{2} \log(2\pi) + \frac{q}{2}\left( \psi(\bar{a}_\gamma) - \log(\bar{b}_\gamma) \right) - \frac{\bar{a}_\gamma}{2 \bar{b}_\gamma} \sum_{k=1}^{q} \left\langle | t_k - G(x) \bar{m}_k |^2 \right\rangle_{p(t, x)} - \frac{\bar{a}_\gamma}{2 \bar{b}_\gamma} \sum_{k=1}^{q} \mathrm{Tr}\left( (\bar{\Lambda}_k)^{-1} \left\langle (G(x))^T G(x) \right\rangle_{p(x)} \right) - \sum_{k=1}^{q} \mathrm{KL}\left( q^*(\theta_k) \,\|\, p(\theta_k; \hat{m}_k, \hat{\Lambda}_k) \right) - \mathrm{KL}\left( q^*(\gamma) \,\|\, p(\gamma; \hat{a}_\gamma, \hat{b}_\gamma) \right)$
where $\psi(\cdot)$ is the digamma function. After substituting this maximum value into (97) and calculating the Kullback-Leibler divergences, we obtain (98).    □

4.2. An Algorithm for the Computing of Information Leakage

Result 1 forms the basis of Algorithm 8 that computes the information leakage using available data samples.
Algorithm 8 Estimation of the information leakage, $IL_{f_{t \to x}} = I(t; x) - H(t)$, using variational approximation
Require: Dataset $\{(x^i \in \mathbb{R}^n, t^i \in \mathbb{R}^q) \mid x^i = f_{t \to x}(t^i),\ i \in \{1, \ldots, N\}\}$.
1: Apply Algorithm 7 on $\{(x^i, t^i) \mid i \in \{1, \ldots, N\}\}$ with $M_{max} = \min(N/2, 1000)$ (i.e., constraining the maximum possible number of auxiliary points $M_{max}$ below 1000 for computational efficiency) to obtain the variational membership-mapping Bayesian model $\mathcal{BM}^{x \to t} = \{\{\hat{m}_k, \hat{\Lambda}_k \mid k \in \{1, \ldots, q\}\}, \hat{a}_\gamma, \hat{b}_\gamma\}$.
2: Initialize $\bar{a}_\gamma / \bar{b}_\gamma$, e.g., as $\bar{a}_\gamma / \bar{b}_\gamma = \hat{a}_\gamma / \hat{b}_\gamma$.
3: repeat
4:     Update $\{\bar{\Lambda}_k, \bar{m}_k \mid k \in \{1, \ldots, q\}\}$, $\bar{a}_\gamma$, $\bar{b}_\gamma$ using (99)-(102), where the expectations $\langle \cdot \rangle_{p(x)}$ and $\langle \cdot \rangle_{p(t, x)}$ are approximated via sample averages.
5: until convergence.
6: Compute $\widehat{IL}_{f_{t \to x}}$ using (98), where the expectations $\langle \cdot \rangle_{p(x)}$ and $\langle \cdot \rangle_{p(t, x)}$ are approximated via sample averages.
7: return $\widehat{IL}_{f_{t \to x}}$ and the model $\mathcal{BM}^{x \to t}$.
The functionality of Algorithm 8 is as follows:
  • Step 1 applies Algorithm 7 for the inference of a variational membership-mappings Bayesian model.
  • The loop between step 3 and step 5 recursively estimates the parameters ( { Λ ¯ k , m ¯ k | k { 1 , , q } } , a ¯ , b ¯ ) using update rules (99)–(102).
  • Step 6 computes the information leakage using (98).
Remark 4
(Computational Complexity). The computational complexity of Algorithm 8 is asymptotically dominated by the computation of the inverse of the $M \times M$ matrix $\bar{\Lambda}_k$ in (100) to calculate $\bar{m}_k$. Thus, the computational complexity of Algorithm 8 is $O(M^3)$, where $M$ is the number of auxiliary points.
Example 1
(Verification of the Information Leakage Estimation Algorithm). To demonstrate the effectiveness of Algorithm 8 in estimating the information leakage, a scenario is generated where $t \in \mathbb{R}^{10}$ and $x \in \mathbb{R}^{10}$ are Gaussian distributed such that $x = t + \omega$; $t \sim \mathcal{N}(0, 5 I_{10})$; $\omega \sim \mathcal{N}(0, \sigma I_{10})$ with $\sigma \in [1, 15]$. Since the data distributions in this scenario are known, the information leakage can be calculated theoretically and is given as
$IL_{f_{t \to x}} = 5 \log\left( 1 + 5/\sigma \right) - 0.5 \log\left| 2\pi e\, (5 I_{10}) \right|$.
For a given value of $\sigma$, 1000 samples of $t$ and $x$ were simulated, and Algorithm 8 was applied to estimate the information leakage. The experiments were carried out for different values of $\sigma$ ranging from 1 to 15.
Figure 3 compares the plots of the estimated and theoretically calculated values of the information leakage against $\sigma$. The close agreement between the two plots in Figure 3 verifies the effectiveness of Algorithm 8 in estimating the information leakage without knowing the data distributions.
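A small sketch reproducing the theoretical curve of Example 1 is given below (reproducing the estimated curve would additionally require running Algorithm 8 on simulated samples, which is not included here):

```python
# Theoretical information leakage for Example 1: x = t + omega with
# t ~ N(0, 5 I_10) and omega ~ N(0, sigma I_10), so that
# I(t; x) = 5 log(1 + 5/sigma) and H(t) = 0.5 log|2 pi e (5 I_10)|.
import numpy as np

def theoretical_IL(sigma, dim=10, var_t=5.0):
    mutual_info = 0.5 * dim * np.log(1.0 + var_t / sigma)       # I(t; x)
    entropy_t = 0.5 * dim * np.log(2.0 * np.pi * np.e * var_t)  # H(t)
    return mutual_info - entropy_t

for sigma in [1.0, 5.0, 10.0, 15.0]:
    print(f"sigma = {sigma:5.1f}  ->  IL = {theoretical_IL(sigma):8.4f}")
```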

5. Information Theoretic Measures for Privacy Leakage, Interpretability, and Transferability

5.1. Definitions

To formally define the information theoretic measures for privacy leakage, interpretability, and transferability, a few variables and mappings are introduced in Table 2. Definitions 10-12 provide the mathematical definitions of the information theoretic measures.
Definition 10
(Privacy Leakage). Privacy leakage (by the mapping from private variables to the noise-added data vector) is a measure of the amount of information about the private/sensitive variable $x^{sr}$ leaked by the mapping $f_{x^{sr} \to y^{sr+}}$ and is defined as
$IL_{f_{x^{sr} \to y^{sr+}}} := I\left( x^{sr}; f_{x^{sr} \to y^{sr+}}(x^{sr}) \right) - H(x^{sr})$
$= I\left( x^{sr}; y^{sr+} \right) - H(x^{sr})$.
Definition 11
(Interpretability Measure). Interpretability (of the noise-added data vector) is measured as the amount of information about the interpretable parameters $t^{sr}$ leaked by the mapping $f_{t^{sr} \to y^{sr+}}$ and is defined as
$IL_{f_{t^{sr} \to y^{sr+}}} := I\left( t^{sr}; f_{t^{sr} \to y^{sr+}}(t^{sr}) \right) - H(t^{sr})$
$= I\left( t^{sr}; y^{sr+} \right) - H(t^{sr})$.
Definition 12
(Transferability Measure). Transferability (from the source domain data representation learning models (i.e., $\mathbb{P}_1^{+sr}, \ldots, \mathbb{P}_C^{+sr}$) to the target domain data representation learning models (i.e., $\mathbb{P}_1^{tg}, \ldots, \mathbb{P}_C^{tg}$)) is measured as the amount of information about the source domain feature vector $\hat{y}_{tg}^{sr}$ leaked by the mapping $f_{\hat{y}_{tg}^{sr} \to \hat{y}_{tg}^{tg}}$ and is defined as
$IL_{f_{\hat{y}_{tg}^{sr} \to \hat{y}_{tg}^{tg}}} := I\left( \hat{y}_{tg}^{sr}; f_{\hat{y}_{tg}^{sr} \to \hat{y}_{tg}^{tg}}(\hat{y}_{tg}^{sr}) \right) - H(\hat{y}_{tg}^{sr})$
$= I\left( \hat{y}_{tg}^{sr}; \hat{y}_{tg}^{tg} \right) - H(\hat{y}_{tg}^{sr})$.
Here, $\hat{y}_{tg}^{tg}$ represents the target domain feature vector, and $f_{\hat{y}_{tg}^{sr} \to \hat{y}_{tg}^{tg}}: \mathbb{R}^{p^{sr}} \to \mathbb{R}^{p^{sr}}$ is the mapping from the source domain feature vector $\hat{y}_{tg}^{sr}$ to the target domain feature vector $\hat{y}_{tg}^{tg}$.
Since the defined measures are in the form of information leakages, Algorithm 8 can be directly applied to compute them in practice, provided data samples are available.
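Because each measure is an information leakage between a specific pair of variables, all three reduce to the same computation. The fragment below is a hypothetical usage sketch in which `estimate_information_leakage` is a placeholder name standing for an implementation of Algorithm 8; it is not a real library function.

```python
# Hypothetical usage sketch: all three measures are information leakages,
# so each is one call to an Algorithm 8-style estimator on the right pair
# of sample matrices. `estimate_information_leakage` is a placeholder name.

def estimate_information_leakage(X, T):
    """Placeholder for Algorithm 8: returns an estimate of IL = I(t; x) - H(t)
    from paired samples (rows of X are x^i, rows of T are the corresponding t^i)."""
    raise NotImplementedError("stand-in for Algorithm 8")

# privacy leakage: private variables x_sr vs. released noise-added data y_sr_plus
# privacy_leakage = estimate_information_leakage(Y_sr_plus, X_sr)

# interpretability: interpretable parameters t_sr vs. noise-added data y_sr_plus
# interpretability = estimate_information_leakage(Y_sr_plus, T_sr)

# transferability: target-domain features vs. source-domain features
# transferability = estimate_information_leakage(Y_hat_tg_tg, Y_hat_tg_sr)
```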

5.2. A Unified Approach to Privacy-Preserving Interpretable and Transferable Learning

The presented theory allows us to develop an algorithm that implements privacy-preserving interpretable and transferable learning methodology in a unified manner.
Algorithm 9 is presented for a systematic implementation of the proposed privacy-preserving interpretable and transferable deep learning methodology. The functionality of Algorithm 9 is as follows:
Algorithm 9 Algorithm for privacy-preserving interpretable and transferable learning
Require: The labeled source dataset: $\mathbf{Y}^{sr} = \{\mathbf{Y}_c^{sr}\}_{c=1}^{C}$ (where $\mathbf{Y}_c^{sr} = \{y_{sr}^{i,c} \in \mathbb{R}^{p^{sr}} \mid i \in \{1, \ldots, N_c^{sr}\}\}$ represents the $c$-th class labeled samples); the set of private data: $\mathbf{X}^{sr} = \{\mathbf{X}_c^{sr}\}_{c=1}^{C}$ (where $\mathbf{X}_c^{sr} = \{x^{sr} \in \mathbb{R}^{n^{sr}} \mid x^{sr} = f_{x^{sr} \to y^{sr}}^{-1}(y^{sr}),\ y^{sr} \in \mathbf{Y}_c^{sr}\}$); the set of interpretable parameters: $\mathbf{T}^{sr} = \{\mathbf{T}_c^{sr}\}_{c=1}^{C}$ (where $\mathbf{T}_c^{sr} = \{t^{sr} \in \mathbb{R}^{q} \mid t^{sr} = f_{t^{sr} \to y^{sr}}^{-1}(y^{sr}),\ y^{sr} \in \mathbf{Y}_c^{sr}\}$); the set of a few labeled target samples: $\{\mathbf{Y}_c^{tg}\}_{c=1}^{C}$ (where $\mathbf{Y}_c^{tg} = \{y_{tg}^{i,c} \in \mathbb{R}^{p^{tg}} \mid i \in \{1, \ldots, N_c^{tg}\}\}$ is the set of $c$-th class labeled target samples); the set of unlabeled target samples: $\mathbf{Y}_*^{tg} = \{y_{tg}^{i,*} \in \mathbb{R}^{p^{tg}} \mid i \in \{1, \ldots, N_*^{tg}\}\}$; and the differential privacy parameters: $d \in \mathbb{R}_+$, $\epsilon \in \mathbb{R}_+$, $\delta \in (0, 1)$.
1: A differentially private approximation of the source dataset, $\mathbf{Y}^{+sr} = \{\mathbf{Y}_c^{+sr}\}_{c=1}^{C}$, is obtained using Algorithm 5 on $\mathbf{Y}^{sr}$.
2: The differentially private source domain classifier, $\{\mathbb{P}_c^{+sr}\}_{c=1}^{C}$, is built using Algorithm 6 on $\mathbf{Y}^{+sr}$, taking the subspace dimension as equal to $\min(20, p^{sr})$ (where $p^{sr}$ is the dimension of the source data samples), the ratio $r_{max}$ as equal to 0.5, and the number of layers as equal to 5.
3: Taking the subspace dimension $n_{st} = \min(p^{sr}/2, p^{tg})$, the source domain transformation matrix $V^{+sr} \in \mathbb{R}^{n_{st} \times p^{sr}}$ is defined with its $i$-th row equal to the transpose of the eigenvector corresponding to the $i$-th largest eigenvalue of the sample covariance matrix computed on the differentially private approximated source samples. The target domain transformation matrix $V^{tg} \in \mathbb{R}^{n_{st} \times p^{tg}}$ is defined with its $i$-th row equal to the transpose of the eigenvector corresponding to the $i$-th largest eigenvalue of the sample covariance matrix computed on the target samples.
4: For the case of heterogeneous source and target domains, the subspace alignment approach is used to transform the target samples via (39) for defining the sets $\{\mathbf{Y}_c^{tg \to sr}\}_{c=1}^{C}$ and $\mathbf{Y}_*^{tg \to sr}$ as in (40) and (41).
5: The initial target domain classifier, $\{\mathbb{P}_c^{tg}|_0\}_{c=1}^{C}$, is built using Algorithm 4 on the labeled target samples, $\{\mathbf{Y}_c^{tg \to sr}\}_{c=1}^{C}$, taking the subspace dimension as equal to $\min(20, \min_{1 \leq c \leq C}\{N_c^{tg}\} - 1)$ (where $N_c^{tg}$ is the number of $c$-th class labeled target samples), the ratio $r_{max}$ as equal to 1, and the number of layers as equal to 1.
6: The target domain classifier is updated using (42) and (43) for 4 iterations, taking the monotonically non-decreasing subspace dimension sequence $n$ as $\{\min(5, p^{sr}), \min(10, p^{sr}), \min(15, p^{sr}), \min(20, p^{sr})\}$ and $r_{max} = 0.5$.
7: The mapping from source to target domain is learned by means of a model, $\mathbb{M}^{sr \to tg}$, defined as in (44).
8: Compute the privacy leakage, $IL_{f_{x^{sr} \to y^{sr+}}}$, and the adversary model, $\mathcal{BM}^{y^{sr+} \to x^{sr}}$, by applying Algorithm 8 on $\{(y^{sr+}, x^{sr}) \mid y^{sr+} = f_{x^{sr} \to y^{sr+}}(x^{sr}),\ x^{sr} \in \mathbf{X}^{sr},\ y^{sr+} \in \mathbf{Y}^{+sr}\}$.
9: Compute the interpretability measure, $IL_{f_{t^{sr} \to y^{sr+}}}$, and the interpretability model, $\mathcal{BM}^{y^{sr+} \to t^{sr}}$, by applying Algorithm 8 on $\{(y^{sr+}, t^{sr}) \mid y^{sr+} = f_{t^{sr} \to y^{sr+}}(t^{sr}),\ t^{sr} \in \mathbf{T}^{sr},\ y^{sr+} \in \mathbf{Y}^{+sr}\}$.
10: Compute the transferability measure, $IL_{f_{\hat{y}_{tg}^{sr} \to \hat{y}_{tg}^{tg}}}$, by applying Algorithm 8 on $\left\{ \left( \hat{y}_{tg}^{tg}(y_{tg}), \hat{y}_{tg}^{sr}(y_{tg}) \right) \mid y_{tg} \in \{\mathbf{Y}_c^{tg}\}_{c=1}^{C} \cup \mathbf{Y}_*^{tg} \right\}$, where
$\hat{y}_{tg}^{sr}(y_{tg}) = \widehat{\mathcal{WD}}\left( y_{tg \to sr}(y_{tg}); \mathbb{P}_{f_{y_{tg} \to c}(y_{tg})}^{+sr} \right)$
$\hat{y}_{tg}^{tg}(y_{tg}) = \widehat{\mathcal{WD}}\left( y_{tg \to sr}(y_{tg}); \mathbb{P}_{f_{y_{tg} \to c}(y_{tg})}^{tg} \right)$
$f_{y_{tg} \to c}(y_{tg}) = \hat{c}\left( y_{tg \to sr}(y_{tg}); \{\mathbb{P}_c^{tg}\}_{c=1}^{C}, \{\mathbb{P}_c^{+sr}\}_{c=1}^{C}, \mathbb{M}^{sr \to tg} \right)$,
$y_{tg \to sr}(y_{tg})$ is defined as in (39), and $\hat{c}(\cdot)$ is defined by (47).
11: return in the source domain: the classifier $\{\mathbb{P}_c^{+sr}\}_{c=1}^{C}$; the privacy leakage $IL_{f_{x^{sr} \to y^{sr+}}}$ and the adversary model $\mathcal{BM}^{y^{sr+} \to x^{sr}}$; the interpretability measure $IL_{f_{t^{sr} \to y^{sr+}}}$ and the interpretability model $\mathcal{BM}^{y^{sr+} \to t^{sr}}$.
12: return in the target domain: the classifier $\{\mathbb{P}_c^{tg}\}_{c=1}^{C}$.
13: return for the transfer and multi-task learning scenario: the classifiers $\{\mathbb{P}_c^{+sr}\}_{c=1}^{C}$ and $\{\mathbb{P}_c^{tg}\}_{c=1}^{C}$; the source2target model $\mathbb{M}^{sr \to tg}$; the latent subspace transformation matrices $V^{+sr}$ and $V^{tg}$; the transferability measure $IL_{f_{\hat{y}_{tg}^{sr} \to \hat{y}_{tg}^{tg}}}$.
  • Step 2 builds the differentially private source domain classifier following Algorithm 6 from previous work [22].
  • Step 6 results in the building of the target domain classifier using the method of [22].
  • An information theoretic evaluation of privacy leakage, interpretability, and transferability is undertaken at steps 8, 9, and 10, respectively.
  • Step 8 also provides the adversary model $\mathcal{BM}^{y^{sr+} \to x^{sr}}$, which can be used to estimate private data and thus to simulate privacy attacks.
  • Step 9 also provides the interpretability model $\mathcal{BM}^{y^{sr+} \to t^{sr}}$, which can be used to estimate interpretable parameters and thus to provide an interpretation to the non-interpretable data vectors.

6. Experiments

Experiments have been carried out to demonstrate the application of the proposed measures (for privacy leakage, interpretability, and transferability) to privacy-preserving interpretable and transferable learning. The methodology was implemented using MATLAB R2017b, and the experiments were run on an iMac (M1, 2021) machine with 8 GB RAM.

6.1. MNIST Dataset

The MNIST dataset contains 28 × 28 sized images divided into a training set of 60,000 images and a test set of 10,000 images. The images' pixel values were divided by 255 to normalize them to the range from 0 to 1. The 28 × 28 normalized pixel values of each image were flattened to an equivalent 784-dimensional data vector.

6.1.1. Interpretable Parameters

For the MNIST digits dataset, there exist no additional interpretable parameters other than the pixel values. Thus, corresponding to a pixel-value vector $y \in [0, 1]^{784}$, we defined an interpretable parameter vector $t \in \{0, 1\}^{10}$ such that the $j$-th element $t_j = 1$ if the $j$-th class label is associated with $y$, and $t_j = 0$ otherwise. That is, the interpretable vector $t$, in our experimental setting, represents the class label assigned to the data vector $y$.
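A small sketch of this construction, assuming the MNIST images and their integer labels are already available as arrays (the dataset loading step is omitted and toy stand-in data is used instead):

```python
# Sketch of the experimental setup for MNIST interpretable parameters:
# normalize and flatten the images, and encode each label as a one-hot
# interpretable vector t in {0, 1}^10.
import numpy as np

def flatten_normalize(images):
    """images: uint8 array of shape (N, 28, 28) -> float array of shape (N, 784) in [0, 1]."""
    return images.reshape(len(images), -1).astype(np.float64) / 255.0

def one_hot_labels(labels, num_classes=10):
    """labels: integer array of shape (N,) -> interpretable vectors of shape (N, 10)."""
    t = np.zeros((len(labels), num_classes))
    t[np.arange(len(labels)), labels] = 1.0
    return t

# Toy stand-in data (in the experiments these come from the MNIST dataset).
rng = np.random.default_rng(8)
images = rng.integers(0, 256, size=(5, 28, 28), dtype=np.uint8)
labels = np.array([3, 1, 4, 1, 5])

Y = flatten_normalize(images)     # data vectors y in [0, 1]^784
T = one_hot_labels(labels)        # interpretable vectors t in {0, 1}^10
print(Y.shape, T.shape)
```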

6.1.2. Private Data

Here, we assume that the pixel values are private, i.e., $x^{sr} = y^{sr}$.

6.1.3. Semi-Supervised Transfer Learning Scenario

A transfer learning scenario was considered in the same setting as in [22,30], where 60,000 training samples constituted the source dataset, a set of 9000 test samples constituted the target dataset, and the classification performance was evaluated on the remaining 1000 test samples. Out of the 9000 target samples, only 10 samples per class were labeled, and the remaining 8900 target samples remained unlabeled.

6.1.4. Experimental Design

Algorithm 9 is applied with differential privacy parameters $d = 1$ and $\delta = 1 \times 10^{-5}$. The experiment involves six different privacy-preserving semi-supervised transfer learning scenarios with privacy-loss bound values $\epsilon = 0.1$, $\epsilon = 0.25$, $\epsilon = 0.5$, $\epsilon = 1$, $\epsilon = 2$, and $\epsilon = 10$. For the computation of the privacy leakage, interpretability measure, and transferability measure in Algorithm 9, a subset consisting of 5000 randomly selected samples was considered.
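The sweep over privacy-loss bounds can be sketched as below; `run_algorithm_9` is a hypothetical placeholder for the paper's Algorithm 9, not an actual API:

```python
# Sketch of the experimental sweep over the six privacy-loss bounds described above.
def run_algorithm_9(d, epsilon, delta, n_measure_samples):
    """Placeholder: would return the measures and accuracies produced by Algorithm 9."""
    return {"privacy_leakage": None, "interpretability": None,
            "transferability": None, "accuracy": None}

results = {}
for eps in [0.1, 0.25, 0.5, 1, 2, 10]:                 # the six privacy-loss bounds
    results[eps] = run_algorithm_9(d=1, epsilon=eps, delta=1e-5,
                                   n_measure_samples=5000)  # 5000-sample subset for measures
```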

6.1.5. Results

The experimental results are plotted in Figure 4. Figure 4a–c display the privacy–accuracy trade-off curve, the privacy–interpretability trade-off curve, and the privacy–transferability trade-off curve, respectively. The following observations are made:
  • As expected and observed in Figure 4f, the transferability measure is positively correlated with the accuracy of the source domain classifier on target test samples.
  • Since the interpretable vector associated with a feature vector is defined to represent the class label, positive correlations of the interpretability measure with the source domain classifier’s accuracy and with the transferability measure are observed in Figure 4e and Figure 4f, respectively.
  • The results also verify the robust performance of Algorithm 9 under the transfer and multi-task learning scenario: unlike the source domain classifier, the classification performance in the transfer and multi-task learning scenario is not adversely affected by a reduction in privacy leakage, interpretability measure, or transferability measure, as observed in Figure 4a,e,f.
Table 3 reports the results obtained by the models that correspond to the minimum privacy leakage, the maximum interpretability measure, and the maximum transferability measure. The robustness of the transfer and multi-task learning scenario is further highlighted in Table 3: to achieve the minimum value of privacy leakage, the accuracy of the source domain classifier drops to 0.1760, whereas the transfer and multi-task learning scenario attains the minimum privacy leakage with an accuracy of 0.9510. As observed in Table 3, the maximum transferability-measure models also coincide with the maximum interpretability-measure models.
As a visualization example, Figure 5 displays noise-added data samples for different values of information theoretic measures.

6.2. Office and Caltech256 Datasets

The “Office+Caltech256” dataset comprises the 10 categories common to the Office and Caltech256 datasets and has four domains: amazon, webcam, dslr, and caltech256. It has been widely used [31,32,33,34] for evaluating multi-class accuracy performance in a standard domain adaptation setting with a small number of labeled target samples. Following [32], 4096-dimensional deep-net VGG-FC6 features are extracted from the images. For the learning of classifiers, however, the 4096-dimensional feature vectors are reduced to 100-dimensional feature vectors using principal components computed from the data of the amazon domain. Thus, a 100-dimensional data vector is constructed for each image.
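A sketch of this reduction under the stated assumptions (random arrays of illustrative sizes stand in for the real VGG-FC6 features):

```python
# Sketch of the feature reduction: principal components are fitted on the amazon-domain
# VGG-FC6 features only and then applied to all domains.
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
amazon_feats = rng.normal(size=(500, 4096))     # stand-in for amazon VGG-FC6 features
webcam_feats = rng.normal(size=(300, 4096))     # stand-in for another domain's features

pca = PCA(n_components=100).fit(amazon_feats)   # components computed from amazon data only
amazon_100 = pca.transform(amazon_feats)        # (500, 100)
webcam_100 = pca.transform(webcam_feats)        # (300, 100)
print(amazon_100.shape, webcam_100.shape)
```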

6.2.1. Interpretable Parameters

Corresponding to a data vector $y \in \mathbb{R}^{100}$, an interpretable parameter vector $t \in \{0,1\}^{10}$ is defined such that the $j$-th element $t_j = 1$ if the $j$-th class label is associated with $y$, and $t_j = 0$ otherwise. That is, in our experimental setting, the interpretable vector $t$ represents the class label assigned to the data vector $y$.

6.2.2. Private Data

Here, we assume that the extracted image feature vectors are private, i.e., $x^{sr} = y^{sr}$.

6.2.3. Semi-Supervised Transfer Learning Scenario

Similarly to [31,32,33,34], the experimental setup is as follows:
  • The number of training samples per class in the source domain is 20 for amazon and is 8 for the other three domains;
  • The number of labeled samples per class in the target domain is 3 for all the four domains.

6.2.4. Experimental Design

Taking one domain as the source and another as the target, 12 different transfer learning experiments are performed on the four domains of the “Office+Caltech256” dataset. Each of the 12 experiments is repeated 20 times by creating 20 random train/test splits. In each of the 240 (= 12 × 20) experiments, Algorithm 9 is applied three times with varying values of the privacy-loss bound: first with differential privacy parameters $(d = 1, \epsilon = 0.01, \delta = 1 \times 10^{-5})$, second with $(d = 1, \epsilon = 0.1, \delta = 1 \times 10^{-5})$, and third with $(d = 1, \epsilon = 1, \delta = 1 \times 10^{-5})$. As Algorithm 9 results in different models for different values of the privacy-loss bound $\epsilon$, the transfer and multi-task learning models that correspond to the maximum interpretability measure and the maximum transferability measure are considered for evaluation.
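The resulting experiment grid can be sketched as follows; `run_algorithm_9` is again a hypothetical placeholder, not an actual API:

```python
# Sketch of the experiment grid: 12 ordered domain pairs x 20 random splits x 3
# privacy-loss bounds.
from itertools import permutations

def run_algorithm_9(source, target, split_seed, epsilon, d=1, delta=1e-5):
    """Placeholder: would return the measures and accuracies produced by Algorithm 9."""
    return {"interpretability": None, "transferability": None, "accuracy": None}

domains = ["amazon", "webcam", "dslr", "caltech256"]
runs = [
    run_algorithm_9(src, tgt, seed, eps)
    for src, tgt in permutations(domains, 2)    # 12 source -> target tasks
    for seed in range(20)                       # 20 random train/test splits per task
    for eps in (0.01, 0.1, 1.0)                 # three privacy-loss bounds per split
]
print(len(runs))                                # 12 * 20 * 3 = 720 applications of Algorithm 9
```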

6.2.5. Reference Methods

This dataset has been studied previously [31,32,33,34,35,36] and thus, as a reference, the performances of the following existing methods were considered:
  • ILS (1-NN) [32]: This method learns an Invariant Latent Space (ILS) to reduce the discrepancy between domains and uses Riemannian optimization techniques to match statistical properties between samples projected into the latent space from different domains.
  • CDLS [35]: The Cross-Domain Landmark Selection (CDLS) method derives a domain-invariant feature subspace for heterogeneous domain adaptation.
  • MMDT [34]: The Maximum Margin Domain Transform (MMDT) method adapts max-margin classifiers in a multi-class manner by learning a shared component of the domain shift as captured by the feature transformation.
  • HFA [36]: The Heterogeneous Feature Augmentation (HFA) method learns a common latent subspace and a classifier under the max-margin framework.
  • OBTL [33]: The Optimal Bayesian Transfer Learning (OBTL) method employs a Bayesian framework to transfer learning through the modeling of a joint prior probability density function for feature-label distributions of the source and target domains.

6.2.6. Results

Table 4, Table 5, Table 6, Table 7, Table 8, Table 9, Table 10, Table 11, Table 12, Table 13, Table 14 and Table 15 report the results, with the first and second best performances marked.
Finally, Table 16 summarizes the overall performance of the top four methods. As observed in Table 16, the maximum transferability-measure model performs best in the largest number of experiments. Most remarkably, the proposed methodology, despite being privacy-preserving, ensuring a differential privacy-loss bound of at most 1, and not requiring access to source data samples, performs better than even the non-private methods.

6.3. An Application Example: Mental Stress Detection

The mental stress detection problem is considered as an application example of the proposed privacy-preserving interpretable and transferable learning approach. The dataset from [17], consisting of heart rate interval measurements of different subjects, is considered for the study of an individual stress detection problem. In [17], a membership-mappings-based interpretable deep model was applied for the estimation of a stress score; the current study, in contrast, applies the proposed privacy-preserving interpretable and transferable deep learning method to solve the stress classification problem. The problem is concerned with the detection of stress in an individual based on the analysis of a recorded sequence of R-R intervals, $\{RR_i\}_i$. The R-R data vector at the $i$-th time index, $y_i$, is defined as
$$ y_i = \begin{bmatrix} RR_i & RR_{i-1} & \cdots & RR_{i-d} \end{bmatrix}^T . $$
That is, the current interval and the history of the previous $d$ intervals constitute the data vector. Assuming an average heart rate of 72 beats per minute, $d$ is chosen equal to $72 \times 3 = 216$ so that an R-R data vector covers, on average, a 3-minute-long sequence of R-R intervals. A dataset, say $\{y_i\}_i$, is built by (1) preprocessing the R-R interval sequence $\{RR_i\}_i$ with an impulse rejection filter [37] for artifact detection and (2) excluding from the dataset the R-R data vectors containing artifacts. The dataset contains stress scores on a scale from 0 to 100, and a label of either “no-stress” or “under-stress” is assigned to each $y_i$ based on the stress score. Thus, we have a binary classification problem.
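A minimal sketch of this construction is given below; note that the artifact-exclusion rule shown is a simple median-based stand-in, not the impulse rejection filter [37] used in the paper:

```python
# Sketch of building the R-R data vectors defined above: each vector stacks the current
# interval and the previous d = 216 intervals (217 values in total).
import numpy as np

def build_rr_vectors(rr, d=216):
    """Row i is [RR_i, RR_{i-1}, ..., RR_{i-d}] for every index with a full history."""
    rr = np.asarray(rr, dtype=float)
    return np.array([rr[i - d:i + 1][::-1] for i in range(d, len(rr))])

def drop_artifact_vectors(vectors, tol=0.3):
    """Keep only vectors whose intervals stay within tol (fraction) of the row median."""
    med = np.median(vectors, axis=1, keepdims=True)
    keep = np.all(np.abs(vectors - med) <= tol * med, axis=1)
    return vectors[keep]

rr_series = 0.8 + 0.05 * np.random.default_rng(0).standard_normal(500)  # toy ~75 bpm series
vectors = drop_artifact_vectors(build_rr_vectors(rr_series))
print(vectors.shape)   # (number of retained vectors, 217)
```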

6.3.1. Interpretable Parameters

Corresponding to an R-R data vector, there exists a set of interpretable parameters: mental demand, physical demand, temporal demand, own performance, effort, and frustration. These are the six components of stress acquired using the NASA Task Load Index [38], which provides a subjective assessment of stress where an individual rates each of the six components on a scale from 0 to 100. Thus, corresponding to each 217-dimensional R-R data vector, there exists a six-dimensional interpretable parameter vector acquired using the NASA Task Load Index.

6.3.2. Private Data

Here, we assume that heart rate values are private. Since the instantaneous heart rate is given by $HR_i = 60/RR_i$, information about the private data is directly contained in the R-R data vectors.

6.3.3. Semi-Supervised Transfer Learning Scenario

Out of the total subjects, a randomly chosen subject’s data serve as the source domain data. Considering every other subject’s data as target domain data, the transfer learning experiment is performed independently for each target subject, where 50% of the target subject’s samples are labeled and the remaining unlabeled target samples also serve as test data for evaluating the classification performance. Only target subjects whose data contain both classes and at least 60 samples were considered for experimentation; there are in total 48 such target subjects.

6.3.4. Experimental Design

Algorithm 9 is applied with $d = 1$, $\epsilon \in \{0.1, 0.5, 1, 2, 5, 8, 20, 50, 100, \infty\}$, and $\delta = 1 \times 10^{-5}$. Each of the 48 experiments thus involves 10 different privacy-preserving semi-supervised transfer learning scenarios with privacy-loss bound values $\epsilon = 0.1$, $\epsilon = 0.5$, $\epsilon = 1$, $\epsilon = 2$, $\epsilon = 5$, $\epsilon = 8$, $\epsilon = 20$, $\epsilon = 50$, $\epsilon = 100$, and $\epsilon = \infty$. The following two requirements are associated with this application example:
  • The private source domain data must be protected while transferring knowledge from source to target domain; and
  • The interpretability of the source domain model should be high.
In view of the aforementioned requirements, the models corresponding to the minimum privacy leakage and to the maximum interpretability measure among all models obtained for the 10 different choices of the differential privacy-loss bound $\epsilon$ are considered for detecting stress.
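This selection rule can be sketched as follows, with illustrative placeholder entries:

```python
# Sketch of the model selection rule: among the models obtained for the ten privacy-loss
# bounds, keep the one with minimum privacy leakage and the one with maximum
# interpretability measure.
candidates = [
    {"epsilon": 0.1, "privacy_leakage": -3.0, "interpretability": 3.0, "model": "model_a"},
    {"epsilon": 100, "privacy_leakage":  0.5, "interpretability": 4.0, "model": "model_b"},
]  # in the real experiment: one entry per choice of the privacy-loss bound epsilon

min_leakage_model = min(candidates, key=lambda r: r["privacy_leakage"])
max_interp_model = max(candidates, key=lambda r: r["interpretability"])
print(min_leakage_model["model"], max_interp_model["model"])
```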

6.3.5. Results

Figure 6 summarizes the experimental results, where the accuracies obtained by both the minimum privacy-leakage models and the maximum interpretability-measure models are displayed as box plots.
It is observed in Figure 6 that transfer and multi-task learning considerably improves the performance of the source domain classifier. Table 17 reports the median values of privacy leakage, interpretability measure, transferability measure, and classification accuracy obtained in the experiments on the 48 different subjects. The robust performance of the transfer and multi-task learning scenario is further observed in Table 17.
As a visualization example, Figure 7 displays the noise-added source domain heart rate interval data for different values of information theoretic measures.

7. Concluding Remarks

This paper has introduced information theoretic measures for privacy leakage, interpretability, and transferability and studied their trade-offs. It is the first study to develop an information theory-based unified approach to privacy-preserving interpretable and transferable learning. The experiments have verified that the proposed measures can be used to study the trade-off curves between privacy leakage, interpretability measure, and transferability measure, and thus to optimize the models for given application requirements such as minimum privacy leakage, maximum interpretability measure, or maximum transferability measure. The experimental results on the MNIST dataset showed that the transfer and multi-task learning scenario remarkably improved the accuracy from 0.1760 to 0.9510 while ensuring the minimum privacy leakage. The experiments on the Office and Caltech256 datasets indicated that the proposed methodology, despite ensuring a differential privacy-loss bound of at most 1 and not requiring access to source data samples, performed better than even existing non-private methods in six out of 12 transfer learning experiments. The stress detection experiments on real-world biomedical data showed that the transfer and multi-task learning scenario improved the accuracy from 0.3411 to 0.9647 (while ensuring the minimum privacy leakage) and from 0.3602 to 0.9619 (while ensuring the maximum interpretability measure). The considered unified approach to privacy-preserving interpretable and transferable learning involves membership-mappings-based conditionally deep autoencoders, although other data representation learning models could be explored. Future work includes the following:
  • Although this work has not focused on federated learning, the transfer learning approach could easily be extended to a multi-party setting, and the transferability measure could be calculated for any pair of parties.
  • The explainability of the conditionally deep autoencoders follows, as in [17], from estimating interpretable parameters from non-interpretable data feature vectors using the variational membership-mapping Bayesian model.
  • Furthermore, the variational membership-mapping Bayesian model quantifies the uncertainties in the estimation of the parameters of interest, which is also important for a user’s trust in the model.

Author Contributions

Conceptualization, M.K.; methodology, M.K.; writing—original draft preparation, M.K.; writing—review and editing, L.F.; project administration, B.F.; funding acquisition, B.A.M. and L.F. All authors have read and agreed to the published version of the manuscript.

Funding

The research reported in this paper has been supported by the Austrian Research Promotion Agency (FFG) COMET-Modul S3AI (Security and Safety for Shared Artificial Intelligence); FFG Sub-Project PETAI (Privacy Secured Explainable and Transferable AI for Healthcare Systems); FFG Grant SMiLe (Secure Machine Learning Applications with Homomorphically Encrypted Data); FFG Grant PRIMAL (Privacy-Preserving Machine Learning for Industrial Applications); and the Austrian Ministry for Transport, Innovation and Technology, the Federal Ministry for Digital and Economic Affairs, and the State of Upper Austria in the frame of the SCCH competence center INTEGRATE (FFG grant no. 892418), part of the FFG COMET Competence Centers for Excellent Technologies Programme.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
TAI: Trustworthy Artificial Intelligence

References

  1. High-Level Expert Group on AI. Ethics Guidelines for Trustworthy AI; Report; European Commission: Brussels, Belgium, 2019. [Google Scholar]
  2. Floridi, L. Establishing the rules for building trustworthy AI. Nat. Mach. Intell. 2019, 1, 261–262. [Google Scholar] [CrossRef]
  3. Floridi, L.; Cowls, J. A Unified Framework of Five Principles for AI in Society. Harv. Data Sci. Rev. 2019, 1. [Google Scholar] [CrossRef]
  4. Floridi, L.; Cowls, J.; Beltrametti, M.; Chatila, R.; Chazerand, P.; Dignum, V.; Luetge, C.; Madelin, R.; Pagallo, U.; Rossi, F.; et al. AI4People—An Ethical Framework for a Good AI Society: Opportunities, Risks, Principles, and Recommendations. Minds Mach. 2018, 28, 689–707. [Google Scholar] [CrossRef]
  5. Mcknight, D.H.; Carter, M.; Thatcher, J.B.; Clay, P.F. Trust in a Specific Technology: An Investigation of Its Components and Measures. ACM Trans. Manag. Inf. Syst. 2011, 2, 1–25. [Google Scholar] [CrossRef]
  6. Thiebes, S.; Lins, S.; Sunyaev, A. Trustworthy artificial intelligence. Electron. Mark. 2020, 31, 447–464. [Google Scholar] [CrossRef]
  7. Future of Life Institute. Asilomar AI Principles. 2017. Available online: https://futureoflife.org/ai-principles/ (accessed on 19 September 2023).
  8. Université de Montréal. Montreal Declaration for a Responsible Development of AI. 2017. Available online: https://www.montrealdeclaration-responsibleai.com/the-declaration/ (accessed on 19 September 2023).
  9. UK House of Lords. AI in the UK: Ready, Willing and Able? 2017. Available online: https://publications.parliament.uk/pa/ld201719/ldselect/ldai/100/10002.htm (accessed on 19 September 2023).
  10. OECD. OECD Principles on AI. 2019. Available online: https://www.oecd.org/going-digital/ai/principles/ (accessed on 19 September 2023).
  11. Chinese National Governance Committee for the New Generation Artificial Intelligence. Governance Principles for the New Generation Artificial Intelligence–Developing Responsible Artificial Intelligence. 2019. Available online: https://www.chinadaily.com.cn/a/201906/17/WS5d07486ba3103dbf14328ab7.html (accessed on 19 September 2023).
  12. Vought, R.T. Guidance for Regulation of Artificial Intelligence Applications. 2020. Available online: https://www.whitehouse.gov/wp-content/uploads/2020/01/Draft-OMB-Memo-on-Regulation-of-AI-1-7-19.pdf (accessed on 19 September 2023).
  13. Hagendorff, T. The Ethics of AI Ethics: An Evaluation of Guidelines. Minds Mach. 2020, 30, 99–120. [Google Scholar] [CrossRef]
  14. Kumar, M.; Moser, B.; Fischer, L.; Freudenthaler, B. Membership-Mappings for Data Representation Learning: Measure Theoretic Conceptualization. In Proceedings of the Database and Expert Systems Applications—DEXA 2021 Workshops; Kotsis, G., Tjoa, A.M., Khalil, I., Moser, B., Mashkoor, A., Sametinger, J., Fensel, A., Martinez-Gil, J., Fischer, L., Czech, G., et al., Eds.; Springer International Publishing: Cham, Switzerland, 2021; pp. 127–137. [Google Scholar]
  15. Kumar, M.; Moser, B.; Fischer, L.; Freudenthaler, B. Membership-Mappings for Data Representation Learning: A Bregman Divergence Based Conditionally Deep Autoencoder. In Proceedings of the Database and Expert Systems Applications—DEXA 2021 Workshops; Kotsis, G., Tjoa, A.M., Khalil, I., Moser, B., Mashkoor, A., Sametinger, J., Fensel, A., Martinez-Gil, J., Fischer, L., Czech, G., et al., Eds.; Springer International Publishing: Cham, Switzerland, 2021; pp. 138–147. [Google Scholar]
  16. Kumar, M.; Freudenthaler, B. Fuzzy Membership Functional Analysis for Nonparametric Deep Models of Image Features. IEEE Trans. Fuzzy Syst. 2020, 28, 3345–3359. [Google Scholar] [CrossRef]
  17. Kumar, M.; Zhang, W.; Weippert, M.; Freudenthaler, B. An Explainable Fuzzy Theoretic Nonparametric Deep Model for Stress Assessment Using Heartbeat Intervals Analysis. IEEE Trans. Fuzzy Syst. 2021, 29, 3873–3886. [Google Scholar] [CrossRef]
  18. Kumar, M.; Singh, S.; Freudenthaler, B. Gaussian fuzzy theoretic analysis for variational learning of nested compositions. Int. J. Approx. Reason. 2021, 131, 1–29. [Google Scholar] [CrossRef]
  19. Zhang, W.; Kumar, M.; Ding, W.; Li, X.; Yu, J. Variational learning of deep fuzzy theoretic nonparametric model. Neurocomputing 2022, 506, 128–145. [Google Scholar] [CrossRef]
  20. Kumar, M.; Zhang, W.; Fischer, L.; Freudenthaler, B. Membership-Mappings for Practical Secure Distributed Deep Learning. IEEE Trans. Fuzzy Syst. 2023, 31, 2617–2631. [Google Scholar] [CrossRef]
  21. Zhang, Q.; Yang, J.; Zhang, W.; Kumar, M.; Liu, J.; Liu, J.; Li, X. Deep fuzzy mapping nonparametric model for real-time demand estimation in water distribution systems: A new perspective. Water Res. 2023, 241, 120145. [Google Scholar] [CrossRef] [PubMed]
  22. Kumar, M. Differentially private transferrable deep learning with membership-mappings. Adv. Comput. Intell. 2023, 3, 1. [Google Scholar] [CrossRef]
  23. Kumar, M.; Stoll, N.; Stoll, R. Variational Bayes for a Mixed Stochastic/Deterministic Fuzzy Filter. IEEE Trans. Fuzzy Syst. 2010, 18, 787–801. [Google Scholar] [CrossRef]
  24. Kumar, M.; Stoll, N.; Stoll, R.; Thurow, K. A Stochastic Framework for Robust Fuzzy Filtering and Analysis of Signals-Part I. IEEE Trans. Cybern. 2016, 46, 1118–1131. [Google Scholar] [CrossRef]
  25. Kumar, M.; Stoll, N.; Stoll, R. Stationary Fuzzy Fokker-Planck Learning and Stochastic Fuzzy Filtering. IEEE Trans. Fuzzy Syst. 2011, 19, 873–889. [Google Scholar] [CrossRef]
  26. Kumar, M.; Neubert, S.; Behrendt, S.; Rieger, A.; Weippert, M.; Stoll, N.; Thurow, K.; Stoll, R. Stress Monitoring Based on Stochastic Fuzzy Analysis of Heartbeat Intervals. IEEE Trans. Fuzzy Syst. 2012, 20, 746–759. [Google Scholar] [CrossRef]
  27. Kumar, M.; Insan, A.; Stoll, N.; Thurow, K.; Stoll, R. Stochastic Fuzzy Modeling for Ear Imaging Based Child Identification. IEEE Trans. Syst. Man Cybern. Syst. 2016, 46, 1265–1278. [Google Scholar] [CrossRef]
  28. Kumar, M.; Rossbory, M.; Moser, B.A.; Freudenthaler, B. An optimal (ϵ, δ)—Differentially private learning of distributed deep fuzzy models. Inf. Sci. 2021, 546, 87–120. [Google Scholar] [CrossRef]
  29. Kumar, M.; Brunner, D.; Moser, B.A.; Freudenthaler, B. Variational Optimization of Informational Privacy. In Proceedings of the Database and Expert Systems Applications; Kotsis, G., Tjoa, A.M., Khalil, I., Fischer, L., Moser, B., Mashkoor, A., Sametinger, J., Fensel, A., Martinez-Gil, J., Eds.; Springer International Publishing: Cham, Switzerland, 2020; pp. 32–47. [Google Scholar]
  30. Papernot, N.; Abadi, M.; Erlingsson, U.; Goodfellow, I.J.; Talwar, K. Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data. In Proceedings of the ICLR, Toulon, France, 24–26 April 2017. [Google Scholar]
  31. Hoffman, J.; Rodner, E.; Donahue, J.; Saenko, K.; Darrell, T. Efficient Learning of Domain-invariant Image Representations. arXiv 2013, arXiv:1301.3224. [Google Scholar]
  32. Herath, S.; Harandi, M.; Porikli, F. Learning an Invariant Hilbert Space for Domain Adaptation. In Proceedings of the The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017. [Google Scholar]
  33. Karbalayghareh, A.; Qian, X.; Dougherty, E.R. Optimal Bayesian Transfer Learning. IEEE Trans. Signal Process. 2018, 66, 3724–3739. [Google Scholar] [CrossRef]
  34. Hoffman, J.; Rodner, E.; Donahue, J.; Kulis, B.; Saenko, K. Asymmetric and Category Invariant Feature Transformations for Domain Adaptation. Int. J. Comput. Vis. 2014, 109, 28–41. [Google Scholar] [CrossRef]
  35. Tsai, Y.H.; Yeh, Y.; Wang, Y.F. Learning Cross-Domain Landmarks for Heterogeneous Domain Adaptation. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 5081–5090. [Google Scholar]
  36. Li, W.; Duan, L.; Xu, D.; Tsang, I.W. Learning with Augmented Features for Supervised and Semi-Supervised Heterogeneous Domain Adaptation. IEEE Trans. Pattern Anal. Mach. Intell. 2014, 36, 1134–1148. [Google Scholar] [CrossRef] [PubMed]
  37. McNames, J.; Thong, T.; Aboy, M. Impulse rejection filter for artifact removal in spectral analysis of biomedical signals. In Proceedings of the The 26th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, San Francisco, CA, USA, 1–5 September 2004; Volume 1, pp. 145–148. [Google Scholar] [CrossRef]
  38. Hart, S.G.; Staveland, L.E. Development of NASA-TLX (Task Load Index): Results of empirical and theoretical research. Hum. Ment. Workload. 1988, 1, 139–183. [Google Scholar]
Figure 1. An information theoretic unified approach to “privacy-preserving interpretable and transferable learning” for studying the privacy–interpretability–transferability trade-offs while addressing beneficence, non-maleficence, autonomy, justice, and explicability principles of TAI.
Figure 2. The proposed methodology to evaluate privacy leakage, interpretability, and transferability in terms of the information leakages.
Figure 3. A comparison of the estimated information leakage values with the theoretically calculated values.
Figure 4. The plots between privacy leakage, interpretability measure, transferability measure, and accuracy for MNIST dataset.
Figure 5. An example of a source domain sample corresponding to different levels of privacy leakage, interpretability measure, and transferability measure.
Figure 6. The box plots of accuracies obtained in detecting mental stress on 48 different subjects.
Figure 7. A display of source domain R-R interval data corresponding to different levels of privacy leakage, interpretability measure, and transferability measure.
Table 1. Core issues with TAI principles and solution approaches.
TAI Principle | Issue | Solution Approach
Beneficence | I1: non-availability of large high-quality training data | transfer learning
Beneficence | I2: models (intellectual properties) are not widely available | federated learning
Non-maleficence | I3: leakage of private information embedded in training data | privacy-preserving data release mechanism
Non-maleficence | I4: leakage of private information embedded in model parameters and model outputs | privacy-preserving machine and deep learning
Autonomy | I5: user’s inability to quantify model uncertainties leads to indecisiveness regarding the level of autonomy given to AI system | analytical quantification of model uncertainties
Justice | I6: bias of training data toward certain groups of people leads to discrimination | federated learning
Explicability | I7: user’s inability to understand model functionality leads to mistrust and obstruction in establishing accountability | interpretable machine and deep learning models
Table 2. Introduced variables and mappings.
Symbol/Mapping | Definition/Meaning
$x^{sr} \in \mathbb{R}^{n^{sr}}$ | vector representing private/sensitive variables associated to source domain
$y^{sr} \in \mathbb{R}^{p^{sr}}$ | source domain data vector
$t^{sr} \in \mathbb{R}^{q}$ | vector representing the set of interpretable parameters associated to non-interpretable data vector $y^{sr}$
$y^{sr+} \in \mathbb{R}^{p^{sr}}$ | noise-added data vector (that is either publicly released or used for the training of source model) obtained from $y^{sr}$ via Algorithm 5
$f_{x^{sr} \to y^{sr+}}: \mathbb{R}^{n^{sr}} \to \mathbb{R}^{p^{sr}}$ | mapping from private variables to noise-added data vector, i.e., $y^{sr+} = f_{x^{sr} \to y^{sr+}}(x^{sr})$
$f_{t^{sr} \to y^{sr+}}: \mathbb{R}^{q} \to \mathbb{R}^{p^{sr}}$ | mapping from interpretable parameters to noise-added data vector, i.e., $y^{sr+} = f_{t^{sr} \to y^{sr+}}(t^{sr})$
$\{\mathcal{P}^{sr+}_c\}_{c=1}^{C}$ | differentially private source domain autoencoders, representing data features of each of $C$ classes, obtained via Algorithm 6
$y^{tg} \in \mathbb{R}^{p^{tg}}$ | target domain data vector
$y^{tg \to sr} \in \mathbb{R}^{p^{sr}}$ | representation of target domain data vector $y^{tg}$ in source domain via transformation (39)
$\{\mathcal{P}^{tg}_c\}_{c=1}^{C}$ | target domain autoencoders, representing data features of each of $C$ classes, obtained via Algorithm 6
$f_{y^{tg} \to c}: \mathbb{R}^{p^{tg}} \to \{1, \ldots, C\}$ | mapping assigning class label to target domain data vector $y^{tg}$ via (47), i.e., $f_{y^{tg} \to c}(y^{tg}) = \hat{c}\big( y^{tg \to sr}(y^{tg});\, \{\mathcal{P}^{tg}_c\}_{c=1}^{C},\, \{\mathcal{P}^{sr+}_c\}_{c=1}^{C},\, \mathcal{M}^{sr \to tg} \big)$
$\hat{y}^{tg \to sr} \in \mathbb{R}^{p^{sr}}$ | transformation of $y^{tg}$ to source domain and filtering through the autoencoder that represents the source domain feature vectors of the same class as that of $y^{tg}$, i.e., $\hat{y}^{tg \to sr} = \widehat{\mathcal{WD}}\big( y^{tg \to sr}(y^{tg});\, \mathcal{P}^{sr+}_{f_{y^{tg} \to c}(y^{tg})} \big)$
$\hat{y}^{tg \to tg} \in \mathbb{R}^{p^{sr}}$ | transformation of $y^{tg}$ to source domain and filtering through the autoencoder that represents the target domain feature vectors of the same class as that of $y^{tg}$, i.e., $\hat{y}^{tg \to tg} = \widehat{\mathcal{WD}}\big( y^{tg \to sr}(y^{tg});\, \mathcal{P}^{tg}_{f_{y^{tg} \to c}(y^{tg})} \big)$
$f_{\hat{y}^{tg \to sr} \to \hat{y}^{tg \to tg}}: \mathbb{R}^{p^{sr}} \to \mathbb{R}^{p^{sr}}$ | mapping from source domain feature vector $\hat{y}^{tg \to sr}$ to target domain feature vector $\hat{y}^{tg \to tg}$, i.e., $\hat{y}^{tg \to tg} = f_{\hat{y}^{tg \to sr} \to \hat{y}^{tg \to tg}}\big( \hat{y}^{tg \to sr} \big)$
Table 3. Results of experiments on MNIST dataset for evaluating privacy leakage, interpretability, and transferability.
Method | Privacy Leakage | Interpretability Measure | Transferability Measure | Classification Accuracy
minimum privacy leakage (transfer and multi-task learning) | −50.72 | −2.14 | −664.52 | 0.9510
minimum privacy leakage (source domain classifier) | −50.72 | −2.14 | −664.52 | 0.1760
maximum interpretability measure (transfer and multi-task learning) | 362.83 | 5.44 | 451.93 | 0.9920
maximum interpretability measure (source domain classifier) | 362.83 | 5.44 | 451.93 | 0.9950
maximum transferability measure (transfer and multi-task learning) | 362.83 | 5.44 | 451.93 | 0.9920
maximum transferability measure (source domain classifier) | 362.83 | 5.44 | 451.93 | 0.9950
Table 4. Accuracy (in %, averaged over 20 experiments) obtained in amazon → caltech256 semi-supervised transfer learning experiments. The first and second best performances have been marked.
Method | Feature Type | Accuracy (%)
privacy-preserving maximum interpretability-measure model | VGG-FC6 | 82.6
privacy-preserving maximum transferability-measure model | VGG-FC6 | 82.6
non-private ILS (1-NN) | VGG-FC6 | 83.3
non-private CDLS | VGG-FC6 | 78.1
non-private MMDT | VGG-FC6 | 78.7
non-private HFA | VGG-FC6 | 75.5
non-private OBTL | SURF | 41.5
non-private ILS (1-NN) | SURF | 43.6
non-private CDLS | SURF | 35.3
non-private MMDT | SURF | 36.4
non-private HFA | SURF | 31.0
Table 5. Accuracy (in %, averaged over 20 experiments) obtained in amazon → dslr semi-supervised transfer learning experiments. The first and second best performances have been marked.
Method | Feature Type | Accuracy (%)
privacy-preserving maximum interpretability-measure model | VGG-FC6 | 88.5
privacy-preserving maximum transferability-measure model | VGG-FC6 | 88.7
non-private ILS (1-NN) | VGG-FC6 | 87.7
non-private CDLS | VGG-FC6 | 86.9
non-private MMDT | VGG-FC6 | 77.1
non-private HFA | VGG-FC6 | 87.1
non-private OBTL | SURF | 60.2
non-private ILS (1-NN) | SURF | 49.8
non-private CDLS | SURF | 60.4
non-private MMDT | SURF | 56.7
non-private HFA | SURF | 55.1
Table 6. Accuracy (in %, averaged over 20 experiments) obtained in amazon → webcam semi-supervised transfer learning experiments. The first and second best performances have been marked.
Method | Feature Type | Accuracy (%)
privacy-preserving maximum interpretability-measure model | VGG-FC6 | 89.3
privacy-preserving maximum transferability-measure model | VGG-FC6 | 89.3
non-private ILS (1-NN) | VGG-FC6 | 90.7
non-private CDLS | VGG-FC6 | 91.2
non-private MMDT | VGG-FC6 | 82.5
non-private HFA | VGG-FC6 | 87.9
non-private OBTL | SURF | 72.4
non-private ILS (1-NN) | SURF | 59.7
non-private CDLS | SURF | 68.7
non-private MMDT | SURF | 64.6
non-private HFA | SURF | 57.4
Table 7. Accuracy (in %, averaged over 20 experiments) obtained in caltech256 → amazon semi-supervised transfer learning experiments. The first and second best performances have been marked.
Method | Feature Type | Accuracy (%)
privacy-preserving maximum interpretability-measure model | VGG-FC6 | 92.6
privacy-preserving maximum transferability-measure model | VGG-FC6 | 92.6
non-private ILS (1-NN) | VGG-FC6 | 89.7
non-private CDLS | VGG-FC6 | 88.0
non-private MMDT | VGG-FC6 | 85.9
non-private HFA | VGG-FC6 | 86.2
non-private OBTL | SURF | 54.8
non-private ILS (1-NN) | SURF | 55.1
non-private CDLS | SURF | 50.9
non-private MMDT | SURF | 49.4
non-private HFA | SURF | 43.8
Table 8. Accuracy (in %, averaged over 20 experiments) obtained in caltech256 → dslr semi-supervised transfer learning experiments. The first and second best performances have been marked.
Method | Feature Type | Accuracy (%)
privacy-preserving maximum interpretability-measure model | VGG-FC6 | 89.1
privacy-preserving maximum transferability-measure model | VGG-FC6 | 89.1
non-private ILS (1-NN) | VGG-FC6 | 86.9
non-private CDLS | VGG-FC6 | 86.3
non-private MMDT | VGG-FC6 | 77.9
non-private HFA | VGG-FC6 | 87.0
non-private OBTL | SURF | 61.5
non-private ILS (1-NN) | SURF | 56.2
non-private CDLS | SURF | 59.8
non-private MMDT | SURF | 56.5
non-private HFA | SURF | 55.6
Table 9. Accuracy (in %, averaged over 20 experiments) obtained in caltech256 → webcam semi-supervised transfer learning experiments. The first and second best performances have been marked.
Method | Feature Type | Accuracy (%)
privacy-preserving maximum interpretability-measure model | VGG-FC6 | 87.8
privacy-preserving maximum transferability-measure model | VGG-FC6 | 87.7
non-private ILS (1-NN) | VGG-FC6 | 91.4
non-private CDLS | VGG-FC6 | 89.7
non-private MMDT | VGG-FC6 | 82.8
non-private HFA | VGG-FC6 | 86.0
non-private OBTL | SURF | 71.1
non-private ILS (1-NN) | SURF | 62.9
non-private CDLS | SURF | 66.3
non-private MMDT | SURF | 63.8
non-private HFA | SURF | 58.1
Table 10. Accuracy (in %, averaged over 20 experiments) obtained in dslr → amazon semi-supervised transfer learning experiments. The first and second best performances have been marked.
Method | Feature Type | Accuracy (%)
privacy-preserving maximum interpretability-measure model | VGG-FC6 | 91.9
privacy-preserving maximum transferability-measure model | VGG-FC6 | 91.9
non-private ILS (1-NN) | VGG-FC6 | 88.7
non-private CDLS | VGG-FC6 | 88.1
non-private MMDT | VGG-FC6 | 83.6
non-private HFA | VGG-FC6 | 85.9
non-private OBTL | SURF | 54.4
non-private ILS (1-NN) | SURF | 55.0
non-private CDLS | SURF | 50.7
non-private MMDT | SURF | 46.9
non-private HFA | SURF | 42.9
Table 11. Accuracy (in %, averaged over 20 experiments) obtained in dslr → caltech256 semi-supervised transfer learning experiments. The first and second best performances have been marked.
Method | Feature Type | Accuracy (%)
privacy-preserving maximum interpretability-measure model | VGG-FC6 | 82.9
privacy-preserving maximum transferability-measure model | VGG-FC6 | 82.9
non-private ILS (1-NN) | VGG-FC6 | 81.4
non-private CDLS | VGG-FC6 | 77.9
non-private MMDT | VGG-FC6 | 71.8
non-private HFA | VGG-FC6 | 74.8
non-private OBTL | SURF | 40.3
non-private ILS (1-NN) | SURF | 41.0
non-private CDLS | SURF | 34.9
non-private MMDT | SURF | 34.1
non-private HFA | SURF | 30.9
Table 12. Accuracy (in %, averaged over 20 experiments) obtained in dslr → webcam semi-supervised transfer learning experiments. The first and second best performances have been marked.
Method | Feature Type | Accuracy (%)
privacy-preserving maximum interpretability-measure model | VGG-FC6 | 88.9
privacy-preserving maximum transferability-measure model | VGG-FC6 | 89.0
non-private ILS (1-NN) | VGG-FC6 | 95.5
non-private CDLS | VGG-FC6 | 90.7
non-private MMDT | VGG-FC6 | 86.1
non-private HFA | VGG-FC6 | 86.9
non-private OBTL | SURF | 83.2
non-private ILS (1-NN) | SURF | 80.1
non-private CDLS | SURF | 68.5
non-private MMDT | SURF | 74.1
non-private HFA | SURF | 60.5
Table 13. Accuracy (in %, averaged over 20 experiments) obtained in webcam → amazon semi-supervised transfer learning experiments. The first and second best performances have been marked.
Method | Feature Type | Accuracy (%)
privacy-preserving maximum interpretability-measure model | VGG-FC6 | 92.3
privacy-preserving maximum transferability-measure model | VGG-FC6 | 92.3
non-private ILS (1-NN) | VGG-FC6 | 88.8
non-private CDLS | VGG-FC6 | 87.4
non-private MMDT | VGG-FC6 | 84.7
non-private HFA | VGG-FC6 | 85.1
non-private OBTL | SURF | 55.0
non-private ILS (1-NN) | SURF | 54.3
non-private CDLS | SURF | 51.8
non-private MMDT | SURF | 47.7
non-private HFA | SURF | 56.5
Table 14. Accuracy (in %, averaged over 20 experiments) obtained in webcam → caltech256 semi-supervised transfer learning experiments. The first and second best performances have been marked.
Method | Feature Type | Accuracy (%)
privacy-preserving maximum interpretability-measure model | VGG-FC6 | 81.4
privacy-preserving maximum transferability-measure model | VGG-FC6 | 81.4
non-private ILS (1-NN) | VGG-FC6 | 82.8
non-private CDLS | VGG-FC6 | 78.2
non-private MMDT | VGG-FC6 | 73.6
non-private HFA | VGG-FC6 | 74.4
non-private OBTL | SURF | 37.4
non-private ILS (1-NN) | SURF | 38.6
non-private CDLS | SURF | 33.5
non-private MMDT | SURF | 32.2
non-private HFA | SURF | 29.0
Table 15. Accuracy (in %, averaged over 20 experiments) obtained in webcam → dslr semi-supervised transfer learning experiments. The first and second best performances have been marked.
Method | Feature Type | Accuracy (%)
privacy-preserving maximum interpretability-measure model | VGG-FC6 | 90.8
privacy-preserving maximum transferability-measure model | VGG-FC6 | 90.2
non-private ILS (1-NN) | VGG-FC6 | 94.5
non-private CDLS | VGG-FC6 | 88.5
non-private MMDT | VGG-FC6 | 85.1
non-private HFA | VGG-FC6 | 87.3
non-private OBTL | SURF | 75.0
non-private ILS (1-NN) | SURF | 70.8
non-private CDLS | SURF | 60.7
non-private MMDT | SURF | 67.0
non-private HFA | SURF | 56.5
Table 16. Comparison of the methods on “Office+Caltech256” dataset.
Method | Number of Experiments in Which Method Performed Best
privacy-preserving maximum transferability-measure model | 6
privacy-preserving maximum interpretability-measure model | 5
non-private ILS (1-NN) | 5
non-private CDLS | 1
Table 17. Results (median values) obtained in stress detection experiments on a dataset consisting of heart rate interval measurements.
Method | Privacy Leakage | Interpretability Measure | Transferability Measure | Classification Accuracy
minimum privacy leakage (transfer and multi-task learning) | −3.74 | 3.47 | 291.84 | 0.9647
minimum privacy leakage (source domain classifier) | −3.74 | 3.47 | 291.84 | 0.3411
maximum interpretability measure (transfer and multi-task learning) | 0.43 | 23.92 | 773.36 | 0.9619
maximum interpretability measure (source domain classifier) | 0.43 | 23.92 | 773.36 | 0.3602
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
