Article

Privacy-Preserving Outsourced Artificial Neural Network Training for Secure Image Classification

1 School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, China
2 School of Mathematics and Computing Science, Guangxi Colleges and Universities Key Laboratory of Data Analysis and Computation, Guilin University of Electronic Technology, Guilin 541004, China
3 Center for Applied Mathematics of Guangxi (GUET), Guilin 541002, China
* Author to whom correspondence should be addressed.
Appl. Sci. 2022, 12(24), 12873; https://doi.org/10.3390/app122412873
Submission received: 11 November 2022 / Revised: 7 December 2022 / Accepted: 12 December 2022 / Published: 14 December 2022
(This article belongs to the Section Computing and Artificial Intelligence)

Abstract

Artificial neural networks (ANNs) are powerful tools in the artificial intelligence field and have been successfully applied to interpret complex image data in the real world. Since the majority of images, such as handwritten characters and faces, are private, with their information intended to be used only by the owner, privacy constraints form a major obstacle in developing high-precision image classifiers, which require access to a large amount of image data belonging to multiple users. State-of-the-art privacy-preserving ANN schemes often use fully homomorphic encryption, which results in a substantial overhead of computation and data traffic for the data owners, and are restricted to approximation models based on low-degree polynomials, which leads to a large accuracy loss of the trained model compared to the original ANN model in the plain domain. Consequently, it is still a huge challenge to train an ANN model in the encrypted domain. To mitigate this problem, we propose a privacy-preserving ANN system for securely constructing image classifiers, named IPPNN, in which the server is able to train an ANN-based classifier on the combined image data of all data owners without being able to observe any images, using primitives such as randomization and functional encryption. Our system achieves faster training time and supports lossless training. Moreover, IPPNN removes the need for multiple communications among data owners and servers. We analyze the security of the protocol and perform experiments on a large-scale image recognition task. The results show that IPPNN is feasible to use in practice while achieving high accuracy.

1. Introduction

Artificial Neural Network (ANN) learning is well suited to tackle problems in which the training samples correspond to complex image data, such as inputs from video cameras and mobile phones [1]. In recent work, ANNs have proven successful in text and image classification and are observed to achieve high accuracy in these tasks. Consequently, image classifiers based on ANNs have been widely used in many real-life fields [2,3,4]. Conceptually, ANN-based solutions rely on elaborate algorithms, but even more on large-scale training data. As the volume of data owned by a single user is insufficient to train reliable classifiers, aggregation of data is a solution to address the local limitations.
However, images are private mediums with the message intended to be used only by the owner. Due to the sensitive nature of the information content, there might be confidentiality and security constraints against sharing and using image data. These constraints form formidable obstacles in many image applications, including image classification [5], image synthesis [6], image retrieval [7,8], image sharing [9], and protection of location privacy in camera data of auto-driving vehicles [10]. As a consequence, the construction of privacy-preserving image classifiers is worth researching.
Existing approaches to train machine learning models in privacy-preserving settings mainly rely on secure multi-party computation [11], fully or partially homomorphic encryption [12,13,14,15], and secret sharing [16,17]. Most of these approaches have several limitations. (i) First, in traditional privacy-preserving machine learning schemes, non-linear functions are hard to support using common cryptographic techniques such as homomorphic encryption and Boolean circuits. Hence, prior work proposes to approximate the non-linear function using a polynomial or piecewise linear function [12,15,16,17,18]. It can be shown that approximation by a high-degree polynomial provides higher accuracy. However, to the best of our knowledge, for efficiency reasons the degree of the approximation polynomial is limited to 5 or less, which results in a large accuracy loss of the trained model. (ii) Secondly, the majority of prior work requires multi-round communication between data users and servers [19,20,21,22,23,24,25,26], incurring a substantial overhead of computation and transmission cost. Furthermore, in practice, most users are unlikely to participate in the whole training process online. Consequently, the classifier needs to be learned on encrypted image data only, without the help of users. Moreover, a non-interactive setting is easy to extend to dynamic participation or dropout during the training phase.
The first known privacy-preserving scheme for neural networks with simultaneously non-interactive and lossless features was presented in [11] and named NPMML. This method uses hybrid cryptographic techniques, including the Paillier cryptosystem, RSA, and random masks, to protect data. Since the implementation of NPMML utilizes multiple cryptosystems, the two servers need to interact for several rounds when updating the parameters to accommodate the transformations between the different masking techniques.
To this end, we propose a solution that enables multiple users to share their private data to achieve a high-precision image classifier without violating privacy requirements or communicating with the servers frequently. Specifically, we present a non-interactive and privacy-preserving image classifier based on an ANN model, named IPPNN. This scheme (i) substantially reduces the amount of communication required to train the ANN model, requiring only one transmission from each user; and (ii) does not require any approximation of non-linear functions and can support lossless training. To achieve these benefits, IPPNN orchestrates a mask matrix and functional encryption for inner products (FEIP) [27].
To summarize, our contributions are:
  • Non-Interactive. We propose IPPNN, a non-interactive and privacy-preserving image classifier based on ANN over distributed image data, which only requires communication between the users and the servers as a one-way interaction. Compared to interactive privacy-preserving ANN systems, our approach introduces less computation and communication overheads for the users.
  • High-Precision. IPPNN enables the building of high-precision image classifiers as it does not require the use of polynomial or piecewise linear function approximations to handle the non-linear functions. IPPNN supports lossless classifier construction, which does not alter any of the computations of the original ANN training algorithm.
  • Efficiency. We have implemented and evaluated the performance of IPPNN. For two popular MNIST datasets with a large amount of image data and high dimensionality, the experiments validate that IPPNN is several orders of magnitude faster than state-of-the-art privacy-preserving ANN schemes without compromising the accuracy of the classifiers.
The rest of the paper is organized as follows. We review the related work in Section 2. In Section 3, we introduce the ANN model and the cryptographic primitive (functional encryption for inner products). We give an overview of the proposed privacy-preserving framework of IPPNN in the first part of Section 4, and the core idea and construction details of IPPNN are described in the latter part of Section 4. The security analysis and evaluation are presented in Section 5. The conclusion of our paper is presented in Section 6.

2. Related Work

Image processing is a well-established field and has been researched extensively [4,28]. As artificial intelligence (AI) and big data develop by leaps and bounds, image-processing technologies, such as image recognition [29], image classification [30], face recognition systems [31], smart agriculture [3], lung cancer classification and prediction [2], and surface defect detection systems [32], keep advancing, allowing people to lead more convenient, higher-quality lives.
However, the application of AI techniques can cause privacy leakage and security risks since solutions based on AI are reliant on a large number of training samples commonly belonging to different users. To make effective and safe use of data distributed in different locations while satisfying privacy requirements, some approaches have incorporated privacy-preserving techniques into image processing [5,6,7,8,9,14,33]. For example, Fagbohungbe et al. [5] proposed a secure intelligent computing framework for image classification based on deep learning and edge computing. Yang et al. [6] presented a lightweight secure GAN framework for image synthesis based on secret sharing. Shen et al. [7] proposed a privacy-preserving medical image retrieval system based on blockchain.
Multiple privacy-preserving machine learning approaches have been presented for classification tasks. Prior work [19,20,21,22,23,24,25,26] proposed schemes of an interactive fashion, which require users to communicate with servers for multiple rounds. The resulting constraints on online computation resources form an obstacle in developing AI-based classification systems. Although non-interactive approaches have been proposed in the last few years, most of them rely on fully homomorphic encryption (FHE) [12,15] or secret sharing (SS) [16,17] algorithms. FHE-based protocols are hard to apply in the real world due to very considerable computation costs. SS-based methods still place high requirements on the network, since the two servers need to communicate frequently. Our proposed privacy-preserving method for constructing image classifiers realizes non-interactivity between users and servers. Moreover, the protocol is feasible and efficient to use in practice thanks to the functional encryption scheme it employs.
Due to the limitations of the FHE and SS cryptographic primitives, some privacy computation schemes are forced to use polynomials or piecewise linear functions to approximate the original non-linear functions [15,16,17,18,26,34]. For support vector machines and logistic regression, the obtained model precision is acceptable. However, for neural networks or deep learning, such approximation methods cause a loss of precision; the result is not comparable to the original model and cannot be applied in production and daily life. Our method removes the need for approximation functions. That is, the private training protocol does not alter the original ANN model and therefore incurs no loss of accuracy.
The closest approach to our IPPNN is the NPMML method [11], which is a non-interactive privacy-preserving multi-party neural network framework with a dual-server architecture. The protocol is complicated because NPMML employs multiple cryptographic technologies to protect data. The benefit of non-interactive and lossless features (quite similar to ours) comes with a substantial overhead of computation and communication costs. A detailed comparison of IPPNN and NPMML is covered in Section 5.6 and Section 5.7.

3. Preliminaries

3.1. Artificial Neural Network

Artificial neural networks [35] provide a general method for learning non-linear functions and constructing prediction models from training samples. The most common ANN architecture is composed of an input layer, multiple hidden layers, and an output layer. ANN models have proven successful in many practical problems, such as learning to recognize handwritten characters. Figure 1 shows an ANN model for image recognition.
Notation. Here, we suppose that the input data $x = (x_1; \ldots; x_a) \in \mathbb{R}^{a \times 1}$ has $a$ different features and each sample belongs to one of the $c$ classes, denoted by a label $y = (y_1, \ldots, y_c)$; the hidden layer has $b$ neurons.
We represent the hidden layer weight matrix as $W^{(h)} \in \mathbb{R}^{b \times a}$. Each row of $W^{(h)}$ is a list of real numbers, which identifies a vector $w_j^{(h)} = (w_{j1}^{(h)}, \ldots, w_{ja}^{(h)}) \in \mathbb{R}^{1 \times a}$, $j = 1, \ldots, b$. Similarly, we denote the matrix $W^{(o)} \in \mathbb{R}^{c \times b}$ as the output layer weight matrix. Each row of $W^{(o)}$ identifies a vector $w_i^{(o)} = (w_{i1}^{(o)}, \ldots, w_{ib}^{(o)}) \in \mathbb{R}^{1 \times b}$, $i = 1, \ldots, c$.
We call the last layer the output layer
$$o = (o_1; \ldots; o_c) = g\big(W^{(o)} h\big) \in \mathbb{R}^{c \times 1}, \quad (1)$$
where $h = (h_1; \ldots; h_b) = f(W^{(h)} x) \in \mathbb{R}^{b \times 1}$ consists of all outputs of the hidden layer. Specifically, $f(w^{(h)} \cdot x) \in \mathbb{R}$ is a sigmoid function $f(z) = \frac{1}{1 + e^{-z}}$, where $w^{(h)}$ is a hidden layer weight vector. The function $g$ in the ANN can be a softmax function: $g(z_k) = e^{z_k} / \sum_i e^{z_i}$. Note that in our privacy-preserving image classifier construction scheme based on ANN, any activation function can substitute for the sigmoid function $f$ or the softmax function $g$.
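The forward pass above can be sketched in a few lines of numpy. This is an illustrative sketch with toy dimensions; the function and variable names are ours, not from the paper:

```python
import numpy as np

def sigmoid(z):
    """Element-wise sigmoid f(z) = 1 / (1 + e^{-z})."""
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    """Softmax g(z_k) = e^{z_k} / sum_i e^{z_i}, shifted for numerical stability."""
    e = np.exp(z - z.max())
    return e / e.sum()

def forward(W_h, W_o, x):
    """3-layer forward pass: hidden activations h and class scores o."""
    h = sigmoid(W_h @ x)    # h in R^{b x 1}
    o = softmax(W_o @ h)    # o in R^{c x 1}
    return h, o

# toy dimensions: a = 4 features, b = 3 hidden neurons, c = 2 classes
rng = np.random.default_rng(0)
W_h = rng.standard_normal((3, 4))
W_o = rng.standard_normal((2, 3))
x = rng.standard_normal((4, 1))
h, o = forward(W_h, W_o, x)
assert np.isclose(o.sum(), 1.0)   # softmax outputs a probability distribution
```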
Gradient descent algorithm. Considering cross entropy as the loss function,
$$E(w) = -\sum_{i=1}^{c} y_i \ln(o_i), \quad (2)$$
where y i is the class label, o i is the output of prediction function, w denotes the model parameter.
Gradient descent search aims to determine a weight $w^*$ that minimizes $E(w)$ by starting with an arbitrary initial weight. Then, the weight is repeatedly altered in the direction that produces the steepest descent. The training rule for gradient descent is
$$w_{t+1} \leftarrow w_t - \eta \nabla E(w_t),$$
where $\eta$ is the learning rate and $\nabla E(w_t)$ is the gradient computed at the $t$-th iteration. This process continues until a (possibly local) minimum is reached.
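A minimal illustration of this update rule, using a toy quadratic loss of our own choosing rather than the paper's cross-entropy objective:

```python
import numpy as np

def gradient_step(w, grad, eta):
    """One gradient-descent update: w_{t+1} = w_t - eta * grad_E(w_t)."""
    return w - eta * grad

# toy loss E(w) = ||w||^2 has gradient 2w; repeated steps shrink w toward 0
w = np.array([4.0, -2.0])
for _ in range(100):
    w = gradient_step(w, 2.0 * w, eta=0.1)
```

Each step multiplies `w` by 0.8, so the iterate converges geometrically to the minimizer at the origin.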
Derivation of the gradient descent rule. By selecting a mini-batch sample set $S$ randomly from the whole database $\mathrm{DB} = \{(x^{(1)}, y^{(1)}), \ldots, (x^{(m)}, y^{(m)})\}$, the gradient with respect to the output layer weight $w_{i,j}^{(o)}$ in a 3-layer perceptron becomes
$$\frac{\partial E_S}{\partial w_{i,j}^{(o)}} = \frac{1}{|S|} \sum_{l \in S} \big(o_i^{(l)} - y_i^{(l)}\big)\, h_j^{(l)}, \quad (3)$$
where $i = 1, \ldots, c$, $j = 1, \ldots, b$.
Similarly, we update the hidden layer weight $w_{j,k}^{(h)}$ via the following formula
$$\frac{\partial E_S}{\partial w_{j,k}^{(h)}} = \frac{1}{|S|} \sum_{l \in S} h_j^{(l)} \big(1 - h_j^{(l)}\big) \sum_{i=1}^{c} \Big[\big(o_i^{(l)} - y_i^{(l)}\big)\, w_{i,j}^{(o)}\Big]\, x_k^{(l)}, \quad (4)$$
where $j = 1, \ldots, b$, $k = 1, \ldots, a$.
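The mini-batch gradients (3) and (4) can be computed in vectorized form. The following numpy sketch stacks the samples as columns and assumes the standard softmax/cross-entropy sign convention; the helper names are ours:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    e = np.exp(z - z.max(axis=0, keepdims=True))   # column-wise, numerically stable
    return e / e.sum(axis=0, keepdims=True)

def minibatch_gradients(W_h, W_o, X, Y):
    """Vectorized gradients (3) and (4); samples are the columns of X and Y."""
    l = X.shape[1]
    H = sigmoid(W_h @ X)                          # b x l hidden outputs
    O = softmax(W_o @ H)                          # c x l predicted distributions
    dE_dWo = (O - Y) @ H.T / l                    # eq. (3): (o_i - y_i) h_j, averaged
    delta_h = H * (1 - H) * (W_o.T @ (O - Y))     # b x l back-propagated error
    dE_dWh = delta_h @ X.T / l                    # eq. (4): coefficients times x_k
    return dE_dWo, dE_dWh

rng = np.random.default_rng(2)
a, b, c, l = 4, 3, 2, 8
W_h = rng.standard_normal((b, a))
W_o = rng.standard_normal((c, b))
X = rng.standard_normal((a, l))
Y = np.eye(c)[:, rng.integers(0, c, l)]           # one-hot labels, one per column
gWo, gWh = minibatch_gradients(W_h, W_o, X, Y)
# class-wise gradient rows sum to zero: softmax columns and one-hot labels both sum to 1
assert np.allclose(gWo.sum(axis=0), 0.0)
```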

3.2. Functional Encryption for Inner-Product (FEIP)

Functional encryption (FE) is a new public-key paradigm that allows one party $P_1$ to learn a function of a ciphertext encrypted by another party $P_2$. In this paper, we employ a functional encryption for inner products (FEIP) scheme [27], which allows $P_1$ to compute the inner product between vectors $x$ and $y$, where $x$ is encrypted by $P_2$ and $y$ is a plaintext vector given by $P_1$. A FEIP scheme consists of five algorithms; more specific details of the FEIP scheme can be found in [27].
  • Setup: The setup algorithm takes a security parameter as input and generates a mathematical group.
  • Master key generation: The key generation algorithm creates a public key $pk_{FEIP}$ together with a master secret key $msk_{FEIP}$.
  • Functional key derivation: The functional key derivation algorithm takes the master secret key $msk_{FEIP}$ and the vector $y$ as input to generate a functionally derived key $dk_y$.
  • Encryption: The encryption algorithm applies to a message vector $x$ and produces a ciphertext $ct_x$ using the public key $pk_{FEIP}$.
  • Decryption: Given the ciphertext $ct_x$ of a message $x$, the holder of the key $dk_y$ is able to compute the value of $\langle x, y \rangle$ using the decryption algorithm.
Consider the ANN training scenario. The computation of the inner product of a weight vector $w$ and a training sample $x$ is a formidable step during ANN training in the encrypted domain. A user wants to keep his sample $x$ private but still let a server compute the inner product $\langle w, x \rangle$. During the Setup phase, a trusted third-party authority (TPA) provides the public key to the user. Then, the user encrypts $x$ and transfers the ciphertext to the server. For the server to acquire $\langle w, x \rangle$, the TPA generates a functionally derived key that depends on the weight $w$. The server decrypts the ciphertext using the functionally derived key and obtains the result $\langle w, x \rangle$. In this way, the server is able to perform the inner product operation while maintaining the privacy of the user.

4. The Framework and Construction for Privacy-Preserving Image Classifier

We first introduce the framework of our proposed scheme, IPPNN, which is shown in Figure 2. The goal is to build a high-precision image classifier using neural network model without revealing training samples belonging to the users. Additionally, IPPNN enables cooperative learning without a need for any peer-to-peer communication among users and servers.

4.1. System Model

IPPNN has four types of entities: a group of users, a neural network training server ( S NN ), an auxiliary server ( S AU ) which does not collude with S NN , and a trusted third-party authority ( TPA ) to enable functional encryption.
  • Users: Each user owns a dataset which contains multiple training samples and is willing to contribute his samples in a privacy-preserving way. In our system, the responsibility of a user is only to provide his data, in a well-designed encrypted form, to the servers S AU and S NN .
  • S NN : A neural network training server to collaboratively build a global neural network model for image classification without model parameters information leakage. The server S NN orchestrates the private training process with the help of the server S AU . Finally, S NN is the owner of the image classifier.
  • S AU : An auxiliary server that helps the training server S NN to complete a privacy-preserving neural network training. It does not possess any model update parameters or optimized model weights.
  • TPA : A trusted third-party authority TPA is responsible for building the underlying cryptosystem, delivering the public key to each user and providing private key service to the server S AU .

4.2. Security Requirement

The main goal of IPPNN is to build an image classifier by training a neural network model: (i) without being able to observe any information belonging to the users; and (ii) without leaking the model parameters trained by the server S NN .
  • Honest-but-curious S NN : We assume that the server S NN correctly follows the IPPNN protocol, but may try to learn private information belonging to users from the processed information provided by S AU .
  • Honest-but-curious S AU : We assume that the server S AU correctly follows the IPPNN protocol, but may try to learn private training samples from the encrypted information provided by data owners, and try to learn the trained model parameters.
  • Trusted TPA : We assume that the TPA is an independent entity trusted by the users and the servers S NN and S AU .
We now present our privacy-preserving scheme for achieving an image classifier. We first introduce the idea of IPPNN model. Then, the data encryption algorithm and a detailed secure training process are presented.

4.3. Core Idea of the IPPNN

The main challenge in privacy-preserving machine learning is to compute the non-linear functions without compromising user privacy, which is hard to do using existing cryptographic technologies, including homomorphic encryption, Boolean circuits, and secret sharing. Hence, prior work proposes to replace the non-linear functions with approximation polynomials or piecewise linear functions. These substitutions can be computed efficiently using secure computation techniques.
Our IPPNN removes the need for polynomial approximation by using the FEIP scheme, and removes the multi-round communication between the users and servers by using a mask matrix. Consequently, the private training protocol does not alter any of the computations of the original ANN training algorithm and, therefore, produces the same output, i.e., there is no accuracy loss in the secure version. In addition, our scheme ensures that any user can go offline after uploading his encrypted local data to the servers.
In the IPPNN scheme, a user needs to encrypt his training data in two ways: (i) with a random matrix; and (ii) with the public key generated by the TPA.
Specifically, in the first encryption way, the encrypted data are deliberately defined as a tuple $(Ax, A^{-1})$, where $A$ is a random matrix chosen by the user. The pair $(Ax, A^{-1})$ is split into two parts and sent to the $S_{AU}$ and the $S_{NN}$, respectively. By exploiting the mask matrix $A$, the server $S_{AU}$ obtains the inner product $w \cdot x$. Combining the linear transformation $w \cdot x$ and the non-linear activation function $f(\cdot)$, the output of the neuron $f(w \cdot x)$ is sent to the $S_{NN}$ by the $S_{AU}$. The operations for obtaining the gradient $\partial E_S / \partial w_{i,j}^{(o)}$ can then be executed by the server $S_{NN}$ independently, because the remaining computations only depend on the model weights $w$ and the provided labels.
In the second encryption way, we observe that obtaining the gradient $\partial E_S / \partial w_{j,k}^{(h)}$ requires secure aggregation over the feature dimension. From the computation process of the gradient $\partial E_S / \partial w_{i,j}^{(o)}$, we can obtain the coefficient $u_k^{(l)}$ of the attribute $x_k^{(l)}$ in the update rule (4), where $u_k^{(l)}$ is defined by (7). In order to compute the gradient $\partial E_S / \partial w_{j,k}^{(h)}$, the user encrypts his samples using the FEIP cryptosystem with the public key. With the functionally derived key depending on the vector consisting of the coefficients $u_k^{(l)}$, the server $S_{NN}$ is able to decrypt the ciphertext and acquire the gradient $\partial E_S / \partial w_{j,k}^{(h)}$.

4.4. Description of Proposed Scheme

We construct a scheme for the 3-layer perceptron which includes three phases: system initialization, user data encryption, and privacy-preserving training. In the system initialization phase, the TPA generates a master secret key and a public key based on a given security parameter; in the data encryption phase, the user sends his samples blinded by a random mask matrix $A$ to the $S_{AU}$ and the inverse matrix $A^{-1}$ to the $S_{NN}$, respectively, and transfers his sample attributes encrypted under the public key to the server $S_{NN}$; in the privacy-preserving training phase, the servers $S_{AU}$ and $S_{NN}$ coordinate the neural network training. More specific details of each stage are described as follows.

4.4.1. System Initialization

The TPA takes a security parameter $\lambda$ as input and generates a group $(\mathbb{G}, p, g)$ and a master secret key $msk_{FEIP} = s = (s_1, \ldots, s_l) \in \mathbb{Z}_p^l$, where $p$ is the order of the group $\mathbb{G}$ generated by $g \in \mathbb{G}$; $p$ is a $\lambda$-bit prime number and must be chosen such that $p - 1$ has at least one large prime factor [36]; and $l = |S|$ is the size of a mini-batch. The master key generation algorithm creates a public key $pk_{FEIP} = (h_i = g^{s_i})_{i \in [l]}$ corresponding to $s$. Then, the TPA distributes the public key $pk_{FEIP}$ to the users and provides private key service to the server $S_{NN}$ during the training phase.

4.4.2. Data Encryption

To preserve data privacy, the user needs to encrypt each sample of his local dataset $D$. In the following, according to the main idea outlined in Section 4.3, we present a customized method for protecting user data, adapted to creating a privacy-preserving ANN model by the $S_{NN}$ with the cooperation of the $S_{AU}$.
Assume that dataset $D$ includes $m$ samples. We first choose a mask matrix $A$ to generate a two-part ciphertext, $A^{-1}$ and $Ax$, where the two parts are utilized by the $S_{NN}$ and $S_{AU}$, respectively. Specifically, the user selects a random non-singular matrix $A \in \mathbb{Z}^{a \times a}$, computes its inverse matrix $A^{-1} \in \mathbb{Z}^{a \times a}$ and the blinded data $\tilde{x} = Ax$ using $A$, and finally sends the encrypted version $\tilde{x}$ to the $S_{AU}$ and the matrix $A^{-1}$ with the labels $\{y^{(1)}, \ldots, y^{(m)}\}$ to the $S_{NN}$. Then, we divide the dataset $D$ into $\lceil m/l \rceil$ groups. For each group, we use the public key $pk_{FEIP}$ to encrypt the vector formed by the $i$th ($i = 1, 2, \ldots, a$) feature of all of the $l$ samples. The obtained ciphertexts are sent to the server $S_{NN}$.
The user data encryption approach is specified in Algorithm 1. The user sends $Ax^{(i)}$ to the server $S_{AU}$. Meanwhile, the user transfers $A^{-1}$ with the labels $y$ to the $S_{NN}$. For the IPPNN model, we require the user to share the labels with the $S_{NN}$ in plaintext. Note that sharing a single label vector $y^{(i)}$ does not violate the privacy requirements, since $y^{(i)}$ alone does not contain any substantial information.
Algorithm 1 User data encryption and uploading.
Input: user dataset $D = \{(x^{(1)}, y^{(1)}), \ldots, (x^{(m)}, y^{(m)})\}$, batch size $l$, public key $pk_{FEIP} = (h_i = g^{s_i})_{i \in [l]}$.
Output: the blinded samples $\tilde{x}^{(i)}$, $i = 1, 2, \ldots, m$; the encrypted vectors of sample features.
1: Choose a random non-singular matrix $A \in \mathbb{Z}^{a \times a}$ and compute its inverse matrix $A^{-1} \in \mathbb{Z}^{a \times a}$;
2: for $i = 1, 2, \ldots, m$ do
3:   Compute $\tilde{x}^{(i)} = A x^{(i)}$ using the mask matrix $A$;
4: end for
5: Divide $D$ into $\lceil m/l \rceil$ groups, each containing $l$ samples;
6: If $m$ is not divisible by $l$, add $l - (m \bmod l)$ random samples uniformly picked from $D$ to the last group;
7: for $i = 1, 2, \ldots, \lceil m/l \rceil$ do
8:   for $j = 1, 2, \ldots, a$ do
9:     $r \leftarrow \mathbb{Z}_p$;
10:    Encrypt the data sample features as $Ct_j^{(i)} = Enc_{pk_{FEIP}}\big(x_j^{((i-1)l+1)}, \ldots, x_j^{(il)}\big) = \big(g^r,\; h_1^r g^{x_j^{((i-1)l+1)}}, \ldots, h_l^r g^{x_j^{(il)}}\big)$;
11:   end for
12: end for
13: Send the blinded data $\tilde{x}^{(i)}$ to the $S_{AU}$;
14: Send the matrix $A^{-1}$, the ciphertexts $Ct_j^{(i)}$, and the labels $\{y^{(1)}, \ldots, y^{(m)}\}$ to the $S_{NN}$.
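The grouping and padding in steps 5-6 can be sketched as follows. The helper below is hypothetical; in the real protocol each resulting group is subsequently encrypted feature-by-feature under the FEIP public key:

```python
import random

def make_groups(samples, l):
    """Split m samples into groups of exactly l (steps 5-6 of Algorithm 1):
    when m is not divisible by l, the last group is padded with
    l - (m mod l) samples drawn uniformly from the dataset."""
    groups = [samples[i:i + l] for i in range(0, len(samples), l)]
    short = l - len(groups[-1])
    if short:
        groups[-1] = groups[-1] + [random.choice(samples) for _ in range(short)]
    return groups

groups = make_groups(list(range(10)), l=4)   # m = 10, l = 4 -> 3 groups of 4
assert len(groups) == 3 and all(len(g) == 4 for g in groups)
```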

4.4.3. Privacy-Preserving Training

We now show the procedure of lossless ANN training for image classification, supported by the FEIP scheme and the mask matrix, without interacting with the users. The server $S_{NN}$ initiates the protocol with uniformly initialized $W^{(h)}$ and $W^{(o)}$ and sets the learning rate $\eta$. Through executing the IPPNN protocol, the server $S_{NN}$ can update its weight matrices iteratively on the encrypted data with the help of the server $S_{AU}$, and finally obtain a well-trained model. We describe the per-iteration training process of the IPPNN protocol below.
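For reference, the plaintext mini-batch training loop that IPPNN reproduces losslessly might look as follows. This is a sketch with our own naming; in the secure protocol the forward and gradient computations are replaced by their masked and encrypted counterparts:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    e = np.exp(z - z.max(axis=0, keepdims=True))
    return e / e.sum(axis=0, keepdims=True)

def train(X, Y, b, eta=0.5, epochs=200, seed=0):
    """Plaintext mini-batch ANN training; samples are columns of X and Y."""
    rng = np.random.default_rng(seed)
    a, l = X.shape
    c = Y.shape[0]
    W_h = rng.uniform(-0.5, 0.5, (b, a))      # uniform initialization
    W_o = rng.uniform(-0.5, 0.5, (c, b))
    for _ in range(epochs):
        H = sigmoid(W_h @ X)                  # hidden-layer outputs
        O = softmax(W_o @ H)                  # output-layer predictions
        g_o = (O - Y) @ H.T / l               # output-layer gradient, eq. (3)
        g_h = (H * (1 - H) * (W_o.T @ (O - Y))) @ X.T / l   # hidden gradient, eq. (4)
        W_o -= eta * g_o                      # gradient-descent updates
        W_h -= eta * g_h
    return W_h, W_o

# toy run: 4 samples, 2 features, 2 classes
X = np.array([[2.0, 1.5, -2.0, -1.5],
              [0.3, -0.4, 0.2, -0.1]])
Y = np.eye(2)[:, [0, 0, 1, 1]]
W_h, W_o = train(X, Y, b=4)
```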
The Secure Computation of $\partial E_S / \partial w_{i,j}^{(o)}$. The update rule (3) with a random sample $x$ can be described as
$$\frac{\partial E_{\{x\}}}{\partial w_{i,j}^{(o)}} = \left( g\Big(\sum_{k=1}^{b} w_{ik}^{(o)}\, f\big(w_k^{(h)} \cdot x\big)\Big) - y_i \right) f\big(w_j^{(h)} \cdot x\big), \quad (5)$$
where $i = 1, \ldots, c$, $j = 1, \ldots, b$.
From (5), we find that the update rule requires the $S_{NN}$ to compute gradients of the loss function (2) that involve the inner product $w_k^{(h)} \cdot x$, the sigmoid function $f(\cdot)$, and the softmax function $g(\cdot)$, and it cannot be completed using only homomorphic additions and multiplications. We first deal with the formidable obstacle $f(w_k^{(h)} \cdot x)$. Recall that the server $S_{NN}$ owns the model parameters $w_{i,k}^{(o)}$; with the hidden activations $f(w_k^{(h)} \cdot x)$ and the provided labels $y_i$, the $S_{NN}$ can obtain the outputs of the hidden layer neurons. Then the computation of $(o_i - y_i) h_j$ takes place in plaintext. We now provide details of how $f(w_k^{(h)} \cdot x)$ is computed via the mask matrix structure.
1. The $S_{NN}$ first computes
$$\tilde{W}^{(h)} = \big(\tilde{w}_1^{(h)}, \ldots, \tilde{w}_b^{(h)}\big) = W^{(h)} A^{-1} = \big(w_1^{(h)} A^{-1}, \ldots, w_b^{(h)} A^{-1}\big).$$
After that, the $S_{NN}$ sends $\tilde{W}^{(h)}$ to the $S_{AU}$.
2. Then the $S_{AU}$ takes $\tilde{W}^{(h)}$ and $\tilde{x}$ as input and returns the inner products $\tilde{w}_1^{(h)} \cdot \tilde{x}, \ldots, \tilde{w}_b^{(h)} \cdot \tilde{x}$, that is, $w_1^{(h)} \cdot x, \ldots, w_b^{(h)} \cdot x$. This is because
$$\tilde{w}_j^{(h)} \cdot \tilde{x} = w_j^{(h)} A^{-1} A x = w_j^{(h)} \cdot x.$$
After that, the activation function is applied. This results in the $S_{AU}$ holding all outputs of the hidden layer neurons
$$h = (h_1, \ldots, h_b) = \big(f(w_1^{(h)} \cdot x), \ldots, f(w_b^{(h)} \cdot x)\big).$$
Then, the $S_{AU}$ sends $h$ to the $S_{NN}$.
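Steps 1-2 can be sketched numerically: the masks cancel, so the auxiliary server obtains the true pre-activations and applies the sigmoid without ever seeing the sample or the hidden weights. This numpy sketch works over the reals for simplicity, whereas the protocol uses an integer mask matrix:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(3)
a, b = 5, 3

# User side: blind the sample with a random non-singular matrix A.
x = rng.standard_normal((a, 1))       # private sample
A = rng.standard_normal((a, a))       # invertible with probability 1
x_masked = A @ x                      # uploaded to S_AU
A_inv = np.linalg.inv(A)              # uploaded to S_NN

# Step 1 (S_NN): blind the hidden-layer weights and send them to S_AU.
W_h = rng.standard_normal((b, a))
W_blinded = W_h @ A_inv

# Step 2 (S_AU): the masks cancel, W_blinded @ x_masked = W_h @ x, so S_AU
# computes the hidden outputs without seeing x or W_h.
h = sigmoid(W_blinded @ x_masked)
assert np.allclose(h, sigmoid(W_h @ x))
```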
Once the server $S_{NN}$ receives the outputs of the hidden layer neurons, it prepares $w_{ik}^{(o)}$ and $y_i$ to perform the computation of $\big(g(\sum_{k=1}^{b} w_{ik}^{(o)} h_k) - y_i\big) h_j$ and obtains the gradient $\partial E_{\{x\}} / \partial w_{i,j}^{(o)}$. The gradient $\partial E_S / \partial w_{i,j}^{(o)}$ in a mini-batch setting can be computed easily from the group of values $\partial E_{\{x^{(l)}\}} / \partial w_{i,j}^{(o)}$, $l = 1, \ldots, |S|$.
The secure computation of $\partial E_S / \partial w_{j,k}^{(h)}$. The server $S_{AU}$ follows the steps above in the process of obtaining $\partial E_S / \partial w_{i,j}^{(o)}$; as a result, the $S_{AU}$ holds the outputs of the hidden layer neurons $h = (h_1, h_2, \ldots, h_b)$. The server $S_{NN}$ can exploit the results of $\partial E_S / \partial w_{i,j}^{(o)}$ and the encrypted feature $x_k$ of each data sample to obtain the gradient $\partial E_S / \partial w_{j,k}^{(h)}$.
Take the $k$th feature vector $(x_k^{(1)}, \ldots, x_k^{(l)})$ of the first group of samples as an example. With the help of FEIP, the server $S_{NN}$ is able to acquire the gradient
$$\frac{\partial E_S}{\partial w_{j,k}^{(h)}} = \frac{1}{|S|} \sum_{l \in S} u_k^{(l)} x_k^{(l)}, \quad (6)$$
where
$$u_k^{(l)} = \sum_{i=1}^{c} \Big[\Big(y_i^{(l)} - g\big(\textstyle\sum_{n=1}^{b} w_{in}^{(o)} h_n^{(l)}\big)\Big) w_{i,j}^{(o)}\Big]\, h_j^{(l)} \big(h_j^{(l)} - 1\big), \quad (7)$$
$j = 1, \ldots, b$, $k = 1, \ldots, a$.
Note that the terms $h_n^{(l)}$ ($n = 1, 2, \ldots, b$) have already been obtained from the computation of $\partial E_S / \partial w_{i,j}^{(o)}$. With the provided weights $w_{in}^{(o)}$ and labels $y_i^{(l)}$, the $S_{NN}$ is able to compute the value $u_k^{(l)}$. The $S_{NN}$ takes the ciphertexts of the vector $(x_k^{(1)}, \ldots, x_k^{(l)})$, the public key $pk_{FEIP}$, and the functional key for the vector $u_k = (u_k^{(1)}, \ldots, u_k^{(l)})$ as input, and returns the inner product $\sum_{l \in S} u_k^{(l)} x_k^{(l)}$. We provide the following steps towards this purpose.
1. The TPA takes the master secret key $msk_{FEIP} = s$ and the plaintext vector $u_k = (u_k^{(1)}, \ldots, u_k^{(l)})$ sent by the $S_{NN}$ as input, and generates a functionally derived key $dk_{u_k} = \langle u_k, s \rangle$ as output.
2. The $S_{NN}$ takes the ciphertexts
$$Ct_k^{(1)} = Enc_{pk_{FEIP}}\big(x_k^{(1)}, \ldots, x_k^{(l)}\big) = \big(g^r,\; h_1^r g^{x_k^{(1)}}, \ldots, h_l^r g^{x_k^{(l)}}\big)$$
received from the user, the public key $pk_{FEIP}$, and the functionally derived key $dk_{u_k}$ as input, and returns the inner product $\sum_{l \in S} u_k^{(l)} x_k^{(l)}$.
Specifically,
$$g^{\sum_{l \in S} u_k^{(l)} x_k^{(l)}} = \prod_{i \in [l]} \big(h_i^r g^{x_k^{(i)}}\big)^{u_k^{(i)}} \Big/ \big(g^r\big)^{dk_{u_k}}.$$
In order to recover the final inner product value $\sum_{l \in S} u_k^{(l)} x_k^{(l)}$, a discrete logarithm computation is performed.
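The paper does not specify how this discrete logarithm is computed; for bounded inner-product values, a baby-step giant-step search is one standard option. The following is a sketch with parameters of our own choosing:

```python
import math

def bsgs_dlog(g, h, p, bound):
    """Find v in [0, bound) with g^v = h (mod p) via baby-step giant-step."""
    m = math.isqrt(bound) + 1
    baby = {pow(g, j, p): j for j in range(m)}    # store g^j for all baby steps j < m
    giant = pow(pow(g, p - 2, p), m, p)           # g^{-m} (p prime, Fermat inverse)
    cur = h % p
    for i in range(m):                            # peel off m from the exponent each step
        if cur in baby:
            return i * m + baby[cur]
        cur = cur * giant % p
    raise ValueError("no discrete log within bound")

p, g = 2_147_483_647, 7                           # 7 is a primitive root mod 2^31 - 1
v = 123_456
assert bsgs_dlog(g, pow(g, v, p), p, bound=1_000_000) == v
```

Compared to a linear scan, this needs roughly the square root of the search bound in time and memory, which is what makes decryption practical when the inner products are known to be small.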
Correctness. Given the public key $pk_{FEIP} = (h_i = g^{s_i})_{i \in [l]}$, we have
$$\prod_{i \in [l]} \big(h_i^r g^{x_k^{(i)}}\big)^{u_k^{(i)}} \Big/ \big(g^r\big)^{dk_{u_k}} = \prod_{i \in [l]} \big(g^{s_i r + x_k^{(i)}}\big)^{u_k^{(i)}} \Big/ g^{r \sum_{i \in [l]} u_k^{(i)} s_i} = g^{\,r \sum_{i \in [l]} u_k^{(i)} s_i + \sum_{i \in [l]} u_k^{(i)} x_k^{(i)} - r \sum_{i \in [l]} u_k^{(i)} s_i} = g^{\sum_{i \in [l]} u_k^{(i)} x_k^{(i)}}.$$

5. Analysis and Evaluation

In this section, we first analyze the security of IPPNN under the threat model described in Section 4.2. Recall that the TPA is trusted to generate the public key and master secret key, making the FEIP feasible; the servers $S_{AU}$ and $S_{NN}$ are non-colluding and may be honest-but-curious; and the external adversary is malicious. We aim to verify whether IPPNN can train an ANN model without revealing any user information while hiding the parameters trained by the server $S_{NN}$ from other entities.
Our IPPNN scheme should protect data privacy against Class-I (honest-but-curious $S_{AU}$) and Class-II (honest-but-curious $S_{NN}$) adversaries, and model privacy against Class-I.

5.1. Data Privacy

The primary privacy constraint is that the servers $S_{AU}$ and $S_{NN}$ should not be able to observe the user data; similarly, the external adversary should not be able to observe data belonging to the users.

5.1.1. Class-I

In Algorithm 1, the user utilizes the random matrix $A$ to mask his samples $x^{(i)}$, and then sends $Ax^{(i)}$ to the $S_{AU}$. In this way, the $S_{AU}$ never learns any sample included in the training set, since $A$ is randomly selected by the user. In the privacy-preserving training process, although the $S_{AU}$ can obtain the inner products $w_1^{(h)} \cdot x, \ldots, w_b^{(h)} \cdot x$, it cannot infer the user data without knowing the weight vectors $w_1^{(h)}, \ldots, w_b^{(h)}$.

5.1.2. Class-II

In the computation of the gradient ∂E_S/∂w_{i,j}^(o) of the output layer, the S NN receives all outputs h of the hidden-layer neurons. Due to the non-linearity of the sigmoid function f, the S NN cannot solve the system { f(w_1^(h)·x), …, f(w_b^(h)·x) }. In the computation of the gradient ∂E_S/∂w_{j,k}^(h) of the hidden layer, the S NN receives the ciphertexts Ct_j^(i), i = 1, 2, …, m/l, j = 1, 2, …, a (produced by step 10 of Algorithm 1). Under the DDH assumption, the encryption scheme is secure against chosen-plaintext attacks.

5.1.3. External Adversary

Our threat model does not assume an external adversary who can monitor the network and try to infer users' private samples and model parameters. If such an adversary exists, however, the countermeasure is straightforward. The S NN can generate a public/secret key pair with any public-key cryptosystem; the user encrypts the matrix A^{-1} under the public key, and the S NN uses its secret key to decrypt the mask matrix. Even if the external adversary obtains A x, he cannot deduce the data x.

5.2. Model Privacy

5.2.1. Class-I

The secondary privacy constraint is to protect the privacy of the model trained by the S NN . We must ensure that the privacy of W^(h) and W^(o) is preserved against the S AU . Similar to the analysis of data privacy in Section 5.1.1, finding the solutions of the system { w_1^(h)·x, …, w_b^(h)·x } is impossible, since the S AU can only establish a linear system with b equations in (b+1)a unknowns.

5.2.2. External Adversary

The external adversary can eavesdrop on the network channel to obtain W̃^(h), but he cannot deduce the model parameters W^(h) since he has no information about A^{-1}.
From the above analysis, our scheme satisfies the security requirements defined in Section 4.2.

5.3. The Goal of Experiments

To evaluate the performance of our proposed scheme, we compare IPPNN with the Baseline and NPMML [11] algorithms.
(i) Precision measurement: To measure the model precision of our method and the influence of using Taylor approximation in the neural network model, an experiment is designed to compare the classification accuracy of IPPNN, Baseline (a centralized, non-privacy-preserving version), and Baseline_app (Baseline with Taylor polynomial approximation).
(ii) Efficiency measurement: To evaluate the efficiency of our method, we compare the execution time of IPPNN and NPMML when training on different image datasets. We use the NPMML proposed in [11] as the comparison object because it is the state-of-the-art approach closest to ours. In NPMML, the neural network model is built using a combination of the Paillier cryptosystem and a CCA-2-secure public-key encryption scheme.

5.4. Experimental Setup

We conduct all experiments on an Elastic Compute Service (ECS) platform with ecs.n4.xlarge machines (4-core vCPU, 8 GB RAM) running Linux. We implement neural network training on two image datasets, digits-MNIST [37] and Fashion-MNIST [38]. Both contain 60,000 training samples (28 × 28) and 10,000 test samples. Each image is flattened into a vector of dimension 784 as the input of the neural network.
For all experiments, we construct a fully connected multi-layer perceptron with one hidden layer (with the sigmoid function) and an output layer (with the softmax function). For IPPNN, we use the functional encryption scheme proposed in [27] as the basic cryptographic technology. For NPMML, we use Python-RSA [39] and Python-Paillier [40] to implement the underlying encryption primitives.
The parameter settings are: learning rate η = 0.01, scaling factor = 10^6 (used to extend the original encryption function to real numbers), and epochs = 20.
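The scaling factor maps real-valued inputs and weights to integers before encryption. The exact encoding used by IPPNN is not spelled out here, so the following is a common fixed-point sketch consistent with the stated factor of 10^6; `encode`/`decode` are our own names.

```python
SCALE = 10**6   # scaling factor from the experimental setup

def encode(v):
    # Map a real value to an integer so it fits the encryption scheme.
    return round(v * SCALE)

def decode(n, factors=1):
    # A product of two encoded values accumulates SCALE twice, so the
    # caller must track how many scale factors to divide out.
    return n / (SCALE ** factors)

x, w = 0.123456, -0.654321
prod = encode(x) * encode(w)            # integer arithmetic only
print(decode(prod, factors=2))          # ≈ x * w, up to rounding error
```

The rounding introduced by this encoding is the source of the sub-0.5% accuracy gap between IPPNN and the Baseline reported in Section 5.5.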

5.5. Evaluation of Accuracy

Our IPPNN removes the need to approximate the neural network model with a Taylor series. As shown in Figure 3, the classification accuracy of IPPNN is comparable to that of the Baseline in the same environment. The difference in classification accuracy between the two methods is less than 0.5%, which is caused by extending the inner-product encryption function to real numbers via the scaling factor. These results suggest that the IPPNN approach supports lossless construction of image classifiers.
For the same network architecture and the same dataset, the accuracy obtained by the approximation model is much lower than that of IPPNN. We observe that the classification errors of the approximation model Baseline_app are not negligible. We also note that using a higher-degree polynomial in the ANN model does not improve precision, so training protocols based on polynomial approximation cannot be applied directly without an accuracy loss.
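The accuracy gap of polynomial approximation can be seen directly in the activation function: a low-degree Taylor expansion of the sigmoid is accurate only near zero and diverges for larger pre-activations. A small illustration, with our own choice of degree (3) and sample points:

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def sigmoid_taylor(z):
    # Degree-3 Taylor expansion of the sigmoid around 0:
    # sigma(z) ~ 1/2 + z/4 - z^3/48
    return 0.5 + z / 4 - z**3 / 48

for z in (0.5, 2.0, 4.0):
    exact, approx = sigmoid(z), sigmoid_taylor(z)
    print(f"z={z}: sigmoid={exact:.4f}  taylor={approx:.4f}  "
          f"err={abs(exact - approx):.4f}")
```

At z = 0.5 the error is on the order of 10^-4, but at z = 4 the approximation has drifted far from the true sigmoid, which is why approximation-based training protocols accumulate non-negligible classification errors.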

5.6. Comparison of Execution Time

For the purpose of evaluating the efficiency of privacy-preserving neural-network-based image classifiers, we compare the execution time of IPPNN and NPMML when training on different datasets and network architectures under different sizes of security parameters. The training times of the two algorithms are shown in Table 1.
These experiments illustrate that the running time of NPMML is roughly 25 times longer than that of IPPNN. To summarize the results: (i) the training time more than doubles if the number of neurons in the hidden layer doubles while the other settings remain fixed; (ii) the training time slightly less than doubles if the bit length of the security parameter doubles while the other settings remain fixed. These results show that IPPNN is better suited to training large-volume image datasets with longer security parameters and more neurons.

5.7. Comparison of Feature

For the purpose of assessing the properties of state-of-the-art privacy-preserving neural network algorithms, we compare the schemes in the literature in terms of lossless training, non-interactivity between users and cloud servers, and the techniques they employ. From Table 2, we find that most schemes do not offer the non-interactive and lossless features simultaneously.
One of the main novelties of the IPPNN method is that it removes the need for multiple rounds of communication between the users and the servers. In other words, IPPNN is a non-interactive scheme: the only communication required from the users to the servers is a one-way transmission. The key point is that the uploaded data are protected in two ways: (i) by the public key of the functional encryption scheme and (ii) by the random mask matrix. The server S AU obtains the inner products w·x by exploiting the blinded data x and the mask matrix A. After computing the outputs of all neurons and sending them to the server S NN , the tasks of the S AU are completed. The rest of the work can be completed by the server S NN alone, using the data encrypted under the public key. To summarize, the two servers cooperate to train the image classifier on the encrypted data without any help from the users; once the users upload their encrypted data to the servers, they can go offline.
Although the NPMML method is also both non-interactive and lossless, its efficiency is lower than ours. A few factors contribute to IPPNN outperforming NPMML. (i) Cryptographic primitive employed, reducing computation overhead: the running-time gap between NPMML and IPPNN lies in the cryptographic techniques they employ. In NPMML, the neural network model is built using a combination of the Paillier cryptosystem and a CCA-2-secure public-key encryption scheme such as RSA, while IPPNN employs only one cryptographic technique, namely functional encryption for inner products. Consequently, NPMML has a more complex implementation than ours. (ii) Well-designed encryption method, reducing communication overhead: at each iteration, NPMML needs a two-way transmission between the servers, while IPPNN transfers the information only once.

6. Conclusions

We developed a non-interactive protocol for losslessly training an ANN-based image classifier over images belonging to multiple users while preserving the privacy constraints. We also presented a theoretical analysis of the security of the protocol and implemented it on real machines with two large-scale image datasets, demonstrating that it achieves high-precision performance in a feasible amount of execution time. Future directions of this work include applying our method to ANN-based image classifiers over vertically partitioned data and extending our protocol with appropriate dimensionality-reduction techniques to further increase its speed.

Author Contributions

G.D. contributed to conceptualization, methodology and writing—original draft preparation. M.T. contributed to formal analysis, writing—review and editing. Y.Z. contributed to software. Y.H. contributed to visualization. X.D. contributed to project administration, supervision and writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported in part by the National Natural Science Foundation of China under Grant 12201149 (X.D.), in part by the Guangxi Science and Technology Project under Grant Guike AD18281024 (M.T.).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors wish to thank the referee for his or her very helpful comments and useful suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mitchell, T. Machine Learning; McGraw-Hill Education: New York, NY, USA, 1997. [Google Scholar]
  2. Nageswaran, S.; Arunkumar, G.; Bisht, A.K.; Mewada, S.; Kumar, J.N.V.R.; Jawarneh, M.; Asenso, E. Lung cancer classification and prediction using machine learning and image processing. Biomed. Res. Int. 2022, 2022, 1755460. [Google Scholar] [CrossRef] [PubMed]
  3. Sharma, A.; Georgi, M.; Tregubenko, M. Enabling smart agriculture by implementing artificial intelligence and embedded sensing. Comput. Ind. Eng. 2022, 165, 107936. [Google Scholar] [CrossRef]
  4. Joshi, K.D.; Chauhan, V.; Surgenor, B. A flexible machine vision system for small part inspection based on a hybrid SVM/ANN approach. J. Intell. Manuf. 2020, 31, 103–125. [Google Scholar] [CrossRef]
  5. Fagbohungbe, O.; Reza, S.R.; Dong, X.; Qian, L. Efficient privacy preserving edge intelligent computing framework for image classification in IoT. IEEE Trans. Emerg. Top. Comput. Intell. 2022, 6, 941–956. [Google Scholar] [CrossRef]
  6. Yang, Y.; Mu, K.; Deng, R.H. Lightweight privacy-preserving GAN framework for model training and image synthesis. IEEE T. Inf. Foren. Sec. 2022, 17, 1083–1098. [Google Scholar] [CrossRef]
  7. Shen, M.; Deng, Y. Privacy-preserving image retrieval for medical IoT systems: A blockchain-based approach. IEEE Netw. 2019, 33, 27–33. [Google Scholar] [CrossRef]
  8. Xia, Z.; Xiong, N.; Vasilakos, A.V. EPCBIR: An efficient and privacy-preserving content-based image retrieval scheme in cloud computing. Inf. Sci. 2017, 387, 195–204. [Google Scholar] [CrossRef]
  9. Yu, J.; Zhang, B.; Kuang, Z. iPrivacy: Image privacy protection by identifying sensitive objects via deep multi-task learning. IEEE T. Inf. Foren. Sec. 2017, 12, 1005–1016. [Google Scholar] [CrossRef]
  10. Xiong, Z.; Cai, Z.; Han, Q. ADGAN: Protect your location privacy in camera data of auto-driving vehicles. IEEE Trans. Ind. Inform. 2020, 17, 6200–6210. [Google Scholar] [CrossRef]
  11. Li, T.; Li, J.; Chen, X.; Liu, Z.; Lou, W.; Hou, Y.T. NPMML: A Framework for non-Interactive privacy-preserving multi-party machine learning. IEEE Trans. Dependable Secur. Comput. 2021, 18, 2969–2982. [Google Scholar] [CrossRef]
  12. Li, P.; Li, J.; Huang, Z. Multi-key privacy-preserving deep learning in cloud computing. Future Gener. Comput. Syst. 2017, 74, 76–85. [Google Scholar] [CrossRef]
  13. Ma, X.; Zhang, F.; Chen, X. Privacy preserving multi-party computation delegation for deep learning in cloud computing. Inf. Sci. 2018, 459, 103–116. [Google Scholar] [CrossRef]
  14. Popescu, A.B.; Taca, I.A.; Nita, C.I. Privacy preserving classification of EEG data using machine learning and homomorphic encryption. Appl. Sci. 2021, 11, 7360. [Google Scholar] [CrossRef]
  15. Fan, Y.; Bai, J.; Lei, X.; Zhang, Y.; Zhang, B.; Li, K.C.; Tan, G. Privacy preserving based logistic regression on big data. J. Netw. Comput. Appl. 2020, 171, 102769. [Google Scholar] [CrossRef]
  16. Mohassel, P.; Zhang, Y. SecureML: A system for scalable privacy-preserving machine learning. Proc. IEEE Symp. Secur. Privacy (SP) 2017, 19–38. [Google Scholar]
  17. De Cock, M.; Dowsley, R.; Nascimento, A.C.; Railsback, D.; Shen, J.; Todoki, A. High performance logistic regression for privacy-preserving genome analysis. BMC Med. Genom. 2021, 14, 1–18. [Google Scholar] [CrossRef]
  18. Deng, G.; Tang, M.; Xi, Y.; Zhang, M. Privacy-Preserving Online Medical Prediagnosis Training Model Based on Soft-Margin SVM. IEEE Trans. Serv. Comput. 2022, 1–14. [Google Scholar] [CrossRef]
  19. Xu, R.; Baracaldo, N.; Zhou, Y.; Anwar, A.; Joshi, J.; Ludwig, H. FedV: Privacy-preserving federated learning over vertically partitioned data. Proc. ACM Workshop Artif. Intell. Secur. 2021, 18, 181–192. [Google Scholar]
  20. Li, Y.; Zhou, Y.; Jolfaei, A.; Yu, D.; Xu, G.; Zheng, X. Privacy preserving federated learning framework based on chained secure multiparty computing. IEEE Internet Things J. 2021, 8, 6178–6186. [Google Scholar] [CrossRef]
  21. Xie, B.; Xiang, T.; Liao, X.; Wu, J. Achieving privacy-preserving online diagnosis with outsourced SVM in internet of medical things environment. IEEE Trans. Dependable Secure Comput. 2021, 19, 4113–4126. [Google Scholar] [CrossRef]
  22. Mandal, K.; Gong, G. PrivFL: Practical privacy-preserving federated regressions on high-dimensional data over mobile networks. Proc. CCSW 2019, 57–68. [Google Scholar]
  23. Du, W.; Li, A.; Li, Q. Privacy-preserving multiparty learning for logistic regression. Proc. Secure Comm. 2018, 549–568. [Google Scholar]
  24. Shokri, R.; Shmatikov, V. Privacy-preserving deep learning. In Proceedings of the 22nd ACM SIGSAC Conference on Computer and Communications Security, Denver, CO, USA, 12–16 October 2015; pp. 1310–1321. [Google Scholar]
  25. Abadi, M. Deep learning with differential privacy. In Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, Vienna, Austria, 24–28 October 2016; pp. 308–318. [Google Scholar]
  26. Jiang, Y.; Hamer, J.; Wang, C.; Jiang, X.; Kim, M.; Song, Y.; Xia, Y.; Mohammed, N.; Sadat, M.N.; Wang, S. SecureLR: Secure logistic regression model via a hybrid cryptographic protocol. IEEE/ACM Trans. Comput. Biol. Bioinf. 2019, 16, 113–123. [Google Scholar] [CrossRef] [PubMed]
  27. Abdalla, M.; Bourse, F.; Caro, A.D.; Pointcheval, D. Simple functional encryption schemes for inner products. IACR Cryptol. ePrint Arch. 2015, 17, 733–751. [Google Scholar]
  28. Mennel, L.; Symonowicz, J.; Wachter, S. Ultrafast machine vision with 2D material neural network image sensors. Nature 2020, 579, 62–66. [Google Scholar] [CrossRef]
  29. Lu, W.; Du, R.; Niu, P. Soybean yield preharvest prediction based on bean pods and leaves image recognition using deep learning neural network combined with GRNN. Front. Plant. Sci. 2022, 12, 791256. [Google Scholar] [CrossRef]
  30. Sultana, F.; Sufian, A.; Dutta, P. Advancements in image classification using convolutional neural network. In Proceedings of the 2018 Fourth International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN), Kolkata, India, 22–23 November 2018; pp. 122–129. [Google Scholar]
  31. Zeng, J.; Qiu, X.; Shi, S. Image processing effects on the deep face recognition system. Math. Biosci. Eng. 2021, 18, 1187–1200. [Google Scholar] [CrossRef]
  32. Pham, Q.T.; Liou, N.S. The development of on-line surface defect detection system for jujubes based on hyperspectral images. Comput. Electron. Agr. 2022, 194, 106743. [Google Scholar] [CrossRef]
  33. Sirichotedumrong, W.; Maekawa, T.; Kinoshita, Y. Privacy-preserving deep neural networks with pixel-based image encryption considering data augmentation in the encrypted domain. In Proceedings of the 2019 IEEE International Conference on Image Processing (ICIP), Taipei, Taiwan, 22–25 September 2019; pp. 674–678. [Google Scholar]
  34. Wang, F.; Zhu, H.; Lu, R.; Zheng, Y.; Li, H. A privacy-preserving and non-interactive federated learning scheme for regression training with gradient descent. Inf. Sci. 2021, 552, 183–200. [Google Scholar] [CrossRef]
  35. Boehmke, B.; Greenwell, B. Hands-on Machine Learning With R; Chapman and Hall: London, UK; CRC: Portsmouth, UK, 2019. [Google Scholar]
  36. ElGamal, T. A public key cryptosystem and a signature scheme based on discrete logarithms. IEEE. T. Inform. Theory 1985, 31, 469–472. [Google Scholar] [CrossRef]
  37. LeCun, Y.; Cortes, C.; Christopher, J.C.B. MNIST Handwritten Digit Database. Available online: http://yann.lecun.com/exdb/mnist/ (accessed on 7 November 2022).
  38. Xiao, H.; Rasul, K.; Vollgraf, R. Fashion-MNIST: A Novel Image Dataset for Benchmarking Machine Learning Algorithms. arXiv 2017, arXiv:1708.07747. [Google Scholar]
  39. Stuvel, S.A. Python-RSA, GitHub Repository. Available online: https://github.com/sybrenstuvel/python-rsa (accessed on 7 November 2022).
  40. CSIRO’s Data61. Python Paillier Library, GitHub Repositorys. Available online: https://github.com/data61/python-paillier (accessed on 7 November 2022).
Figure 1. Image recognition by ANN model.
Figure 2. System model.
Figure 3. Comparison between IPPNN and Baseline with Taylor approximation in model accuracy (batchsize = 60). (a) Accuracy on digits-MNIST (b = 256). (b) Accuracy on digits-MNIST (b = 128). (c) Accuracy on fashion-MNIST (b = 256). (d) Accuracy on fashion-MNIST (b = 128).
Table 1. Comparison of time cost for training one mini-batch (60 samples).
| Network Architecture | Dataset | Security Parameter (bits) | IPPNN Training Time (s) | NPMML Training Time (s) |
|---|---|---|---|---|
| 784 ↦ 256 ↦ 10 | digits-MNIST | 1024 | 1315 | 31,054 |
| 784 ↦ 128 ↦ 10 | digits-MNIST | 1024 | 532 | 13,583 |
| 784 ↦ 256 ↦ 10 | Fashion-MNIST | 1024 | 1355 | 31,154 |
| 784 ↦ 128 ↦ 10 | Fashion-MNIST | 1024 | 552 | 13,267 |
| 784 ↦ 256 ↦ 10 | digits-MNIST | 2048 | 2102 | 52,590 |
| 784 ↦ 128 ↦ 10 | digits-MNIST | 2048 | 903 | 24,587 |
| 784 ↦ 256 ↦ 10 | Fashion-MNIST | 2048 | 2238 | 52,990 |
| 784 ↦ 128 ↦ 10 | Fashion-MNIST | 2048 | 931 | 23,183 |
Training time: execution time for training one mini-batch (60 samples). Network architecture: No. neurons in input layer ↦ No. neurons in hidden layer ↦ No. neurons in output layer.
Table 2. Feature comparison of several superior schemes.
| Feature | Ours | NPMML [11] | FedV [19] | MK-FHE [12] | Chain-PPFL [20] | SecureML [16] | Ma et al. [13] |
|---|---|---|---|---|---|---|---|
| Non-interactive | ✔ | ✔ | | | | | |
| Lossless | ✔ | ✔ | | | | | |
| Technique | FE | Paillier & RSA | FE | FHE | FL | SS | FL |

FE: functional encryption. Paillier: Paillier cryptosystem. RSA: RSA cryptosystem. FHE: fully homomorphic encryption. SS: secret sharing. FL: federated learning. The symbol ✔ denotes that the scheme has the feature of the corresponding row, while ✘ denotes that it does not.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Deng, G.; Tang, M.; Zhang, Y.; Huang, Y.; Duan, X. Privacy-Preserving Outsourced Artificial Neural Network Training for Secure Image Classification. Appl. Sci. 2022, 12, 12873. https://doi.org/10.3390/app122412873
