Securely Computing Protocol of Set Intersection under the Malicious Model

Liu, Xin; Chen, Weitong; Xiong, Neal; Luo, Dan; Xu, Gang; Chen, Xiubo

doi:10.3390/electronics12112410

Open AccessArticle

Securely Computing Protocol of Set Intersection under the Malicious Model

by

Xin Liu

^1,2

,

Weitong Chen

¹,

Neal Xiong

³

,

Dan Luo

^4,*,

Gang Xu

⁵ and

Xiubo Chen

⁶

¹

School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou 014010, China

²

School of Computer Science, Shaanxi Normal University, Xi’an 710062, China

³

Department of Computer Science and Mathematics, Sul Ross State University, Alpine, TX 79830, USA

⁴

Department of Computer, Tianjin Ren’ai College, Tianjin 301636, China

⁵

College of Information, North China University of Technology, Beijing 100144, China

⁶

Information Security Center, State Key Laboratory of Network and Switching Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China

^*

Author to whom correspondence should be addressed.

Electronics 2023, 12(11), 2410; https://doi.org/10.3390/electronics12112410

Submission received: 30 April 2023 / Revised: 19 May 2023 / Accepted: 23 May 2023 / Published: 26 May 2023

(This article belongs to the Section Artificial Intelligence)

Download

Browse Figures

Versions Notes

Abstract

:

Private set intersection (PSI) is a valuable technique with various practical applications, including secure matching of communication packets in the Internet of Things. However, most of the currently available two-party PSI protocols are based on the oblivious transfer (OT) protocol, which is computationally expensive and results in significant communication overhead. In this paper, we propose a new coding method to design a two-party PSI protocol under the semi-honest model. We analyze possible malicious attacks and then develop a PSI protocol under the malicious model using the Paillier cryptosystem, cut-and-choose, zero-knowledge proof, and other cryptographic tools. By adopting the real/ideal model paradigm, we prove the protocol’s security under the malicious model, which is more efficient compared to the existing related schemes.

Keywords:

secure multi-party computation; set intersection; malicious attacks; real/ideal model paradigm

1. Introduction

With more and more attention paid to data value, secure data circulation will give full play to data value, which is conducive to accelerating social development. With the rapid development of blockchain, big data, and artificial intelligence, which are all major technological trends, joint computing from different data sources has great practical significance and has become the norm of computing. However, without effective prevention, data privacy and confidentiality are easily revealed in joint computing [1]. Therefore, protecting the confidentiality and privacy of data in joint computing is a serious challenge for network joint computing.

Secure multi-party computation (MPC), as the core technology of privacy computing, can protect data privacy and realize multi-party joint computing [2,3]. The Millionaires’ problem proposed by Professor Yao Qizhi is the earliest MPC problem [4]. Goldreich, Cramer, and other researchers have further studied it [5,6], expanding the research field of MPC, including privacy preserving data mining, geometric computing, set problems, and confidential scientific computing [7,8]. Research has solved many practical problems in many fields and continuously promoted the development of MPC.

The secure computation of private set intersection (PSI) is a crucial research field in the realm of secure multiparty computation [9]. PSI has broad applications in various domains such as artificial intelligence, data mining, and the Internet of Things, including safeguarding privacy during data mining, discovering private address books, tracking COVID-19 close contacts, and more [10,11]. In the context of the Internet of Things, the security module examines communication data packets between IoT devices and the network layer, and matches and filters them using the communication white list to enhance the security of network communication for IoT devices [12,13]. In the digital economy era, there is a high demand for both-sided PSI scenarios [14]. For example, in confidential social contact searches, finding common friends between two parties is also an application scenario for both-sided PSI.

An existing study [15] proposed a multi-party secure computation intersection scheme that can defend against malicious adversaries [16]; the authors in [17] proposed a privacy-preserving set intersection scheme based on Oblivious Linear Evaluation (OLE) primitives. Currently, most of the PSI schemes are built on Oblivious Transfer (OT) protocols [18], but OT-based protocols often require significant computational and storage costs and are only suitable for scenarios with large sets, and their advantages are not significant enough in small set scenarios [19].

The aim of this study is to introduce a highly efficient two-party secure multiparty computation (MPC) protocol designed for calculating set intersections under the malicious model. The key contributions of this paper are outlined below:

(1): This paper proposes an MPC protocol for set intersection under the semi-honest model, based on the Paillier encryption algorithm. It then analyzes potential malicious behaviors that could arise.
(2): Building on the semi-honest model protocol and considering potential malicious behaviors, this paper proposes an MPC protocol for computing set intersection under the malicious model, utilizing cryptographic tools such as cut-and-choose and zero-knowledge proof.
(3): Using the real/ideal model paradigm, the paper proves the security of the proposed protocol under the malicious model and analyzes and compares its efficiency.
(4): The efficacy of the proposed protocols is validated through a range of performance analyses, comparisons, and simulation experiments when compared to existing methodologies.

2. Related Work

As an important research field of MPC, the earliest PSI scheme [20] used the naive hash method to hash the set elements first, and then obtained the intersection by comparing the hash values. Although this scheme is very efficient, it is vulnerable to collision attack. In order to solve this problem, some PSI protocols use technologies such as efficient OT scheme, inadvertent pseudorandom function, and cuckoo hash [21,22], so as to achieve linear computational complexity and communication complexity.

Efraim et al. presented a PSI MPC protocol under the malicious model in study [15]. The authors in [15] introduce PSImple (Practical Multiparty Maliciously-Secure Private Set Intersection), which is the first maliciously secure multiparty PSI protocol that demonstrates concrete efficiency. The construction of PSImple is built upon the concepts of oblivious transfer and garbled Bloom filters. Furthermore, study [15] provides evidence that PSImple remains competitive with the most advanced concrete and efficient semi-honest multiparty PSI protocols available. However, the communication complexity of the protocol is considerable due to the use of garbled Bloom filter (GBF) [23], which requires extensive communication for its transmission.

In study [17], a novel method is introduced for calculating the intersection between sets using Oblivious Linear Function Evaluation (OLE) as a primitive. At an abstract level, the approach leverages OLE to effectively add two polynomials in a randomized manner while maintaining the roots of the resulting polynomials. By assigning the roots of the input polynomials to be the elements of the input sets, this directly leads to an intersection protocol with an optimal asymptotic communication complexity of O(mκ). The protocol presented in study [17] achieves information-theoretic security against a malicious adversary under the assumption of OLE’s availability. Nevertheless, this method exhibits high computational complexity.

Given the limitations of the aforementioned protocols, this paper proposes efficient MPC protocols for set intersection under both the semi-honest and malicious models, utilizing the Paillier encryption system, zero-knowledge proof, and cut-and-choose method. The protocol’s efficiency is then compared with that of existing protocols to provide insight into its effectiveness.

3. Preliminary Knowledge

3.1. Paillier Cryptosystem

Paillier proposed an additive homomorphic cryptosystem in 1999 [24], which consists of three phases:

Key generation: Two large prime numbers

p

and

q

are randomly selected, and

N = p q

, with

λ = l c m (p - 1, q - 1)

and

\gcd (p q, (p - 1) (q - 1)) = 1

. A random number

g \in Z_{N^{2}}^{*}

is chosen as the public key, while

λ

is the private key.

Encryption: Let

m

be the message to be encrypted, where

0 \leq m < n

. Choose a random integer

r

where

0 < r < n

. Compute the ciphertext as:

c = g^{m} r^{n} \mod n^{2} .

Decryption:

m = L (c^{λ} \mod n^{2}) {(L (g^{λ} \mod n^{2}))}^{- 1} \mod n .

Public key encryption, randomization, completeness, and resistance to attacks are the primary advantages of the Paillier cryptosystem. These advantages make the Paillier cryptosystem highly valuable for secure transmission and processing of encrypted data. The following provides a further explanation of these advantages:

(1): Public key encryption: The Paillier cryptosystem employs public and private keys for encryption and decryption operations. The public key can be publicly shared, allowing senders to encrypt data using the recipient’s public key and transmitting it through insecure channels. Only the holder of the private key can decrypt the data, ensuring that even if the public key is compromised, attackers cannot access the plaintext. This public key encryption scheme ensures secure transmission.
(2): Randomization: In the Paillier cryptosystem, the encryption process involves the use of random numbers to encrypt the plaintext. Even if the same plaintext is encrypted multiple times, the resulting ciphertext will differ due to the utilization of different random numbers. This randomization feature enhances the security of the cryptosystem by preventing attackers from extracting information about the plaintext by analyzing multiple ciphertexts.
(3): Completeness: The Paillier cryptosystem is complete, enabling encryption and decryption of arbitrary integers without limitations on specific data ranges. This flexibility allows the cryptosystem to handle various types of data, regardless of their size, through encryption and decryption operations.
(4): Resistance to attacks: Extensive cryptographic research and analysis have demonstrated the security and reliability of the Paillier cryptosystem. It withstands many common cryptographic attacks, including chosen plaintext attacks, chosen ciphertext attacks, and active attacks. These attacks attempt to deduce information about the keys or plaintext by obtaining plaintext–ciphertext pairs or manipulating the ciphertext. However, the Paillier cryptosystem effectively thwarts these attacks.

In conclusion, the advantages of the Paillier cryptosystem can be made as an effective tool for secure transmission and processing of encrypted data.

3.2. Zero-Knowledge Proof

The concept of zero-knowledge proof, as introduced in [25,26], involves a prover convincing a verifier that they possess a particular knowledge or answer without divulging said knowledge or answer. A zero-knowledge proof scheme is deemed secure if it satisfies three properties:

(1): Completeness: If the statement being proven is true, the verifier will always accept it.
(2): Soundness: If the statement being proven is false, the verifier will always reject it, and the probability of the prover successfully persuading the verifier is negligible.
(3): Zero-knowledge: During the proof, the verifier does not learn any information about the knowledge or answer being proven by the prover.

Zero-knowledge proofs offer the following benefits:

(1): Privacy safeguarding: Zero-knowledge proofs enable a prover to validate a statement’s veracity to a verifier without disclosing any specific details about the statement. By solely presenting the essential proof while keeping sensitive data concealed, individual privacy is upheld.
(2): Preservation of confidentiality: Zero-knowledge proofs ensure that the verifier cannot gain access to additional information regarding the statement, apart from verifying its validity. Although the prover can establish the truthfulness of a fact, the verifier cannot extract the specific information underpinning the proof, thereby maintaining confidentiality.
(3): Reliability and verifiability: Zero-knowledge proofs assure that the prover can correctly construct the proof in accordance with predefined rules and protocols, allowing the verifier to verify its validity. This enhances the proof’s dependability and the overall system’s verifiability.
(4): Efficiency: Zero-knowledge proofs can be executed within relatively short timeframes without excessive computational demands. This renders them practical and efficient for real-world applications.

To summarize, the advantages of zero-knowledge proofs encompass privacy protection, confidentiality preservation, reliability and verifiability, and efficiency. These advantages contribute to the high value of zero-knowledge proofs in domains such as secure protocols, identity authentication, and privacy protection, which explains their adoption in this manuscript.

3.3. Cut-and-Choose Method

The cut-and-choose technique is a commonly used cryptographic tool in the field of cryptography [27] and is crucial in the development of secure multiparty computation protocols. The fundamental concept is that one party creates multiple duplicates of garbled circuits in the protocol, while the other party requests that a subset of the circuits be opened for scrutiny. If the inspection is successful, the remaining circuits are computed and the ultimate output of the circuits is determined.

In the field of cryptography, the cut-and-choose technique is a widely used interactive zero-knowledge proof protocol employed for validating a participant’s accurate computation while preserving the confidentiality of private information. This approach offers several notable benefits:

(1): Security: The cut-and-choose method ensures robust security measures. Participants engage in interactions where one participant (usually the prover) executes the computation and submits evidence, while the other participant (the verifier) randomly selects and verifies a subset of the evidence to establish the correctness of the computation. This mechanism effectively thwarts attacks such as cheating, tampering, and forgery.
(2): Zero-knowledge property: The cut-and-choose method exhibits the zero-knowledge property, enabling the prover to demonstrate the correctness of their computation to the verifier without disclosing any sensitive inputs or computational methods. The verifier solely verifies that the computation’s outcome is correct, without requiring knowledge of the computation’s specifics.
(3): Resilience against attacks: This technique demonstrates robust resistance against various types of attacks. Even if the prover conducts erroneous computations or attempts to deceive by manipulating certain evidence elements, the verifier can effectively identify errors or cheating behaviors through random selection and verification of a portion of the evidence.
(4): Scalability: The cut-and-choose method can be readily scaled to accommodate different application scenarios and computational complexities. Enhancing the precision and reliability of verification can be achieved by increasing the number of evidence elements or selecting a larger number of random samples.
(5): Generality: The cut-and-choose method serves as a versatile zero-knowledge proof protocol applicable across diverse domains and computational tasks. It finds utility in verifying cryptographic protocols, password cracking results, data privacy, and numerous other applications.

Given these aforementioned advantages, the utilization of the cut-and-choose method in this manuscript enhances the protocol’s security.

3.4. Security under the Malicious Model

The real versus ideal model paradigm [28] is a well-established technique for proving security in the context of malicious models. This approach guarantees that the actual protocol provides a comparable level of security as the ideal model.

During the computation process, both parties involved rely on a trusted third party (TTP) to carry out the computation. In the ideal scenario, both parties are only able to obtain their respective results and no other information. To prove the security of the protocol under the malicious model, the results obtained through computation in this model must be identical to those obtained in the ideal model. However, it should be noted that in the malicious model, the protocol must ensure that at least one party is honest during execution; otherwise, the protocol cannot be deemed secure and dependable. The security definition under the malicious model can be found in [29].

3.5. Ranking Method

The fundamental principle and detailed protocol for the ranking method, where identical numbers are assigned the same rank, are presented in [30]. Specifically, each identical data in the array are assigned the same ordinal bit, and the ordinal bit of the next larger data are increased by only one bit. This approach of sorting treats the same elements in multiple arrays as a single element, and the outcome is equivalent to sorting only the distinct data in the array.

Problem Description: There are

n

participants

P_{1}, \dots, P_{n}

. For each

i \in [1, n]

, each

P_{i}

has a private and orderly array

T_{i} = (t_{i 1}, \dots, t_{i e_{i}})

;

T_{i}

is a standard array (no duplicate elements appear in the array).

N

participants need to conduct confidential calculation through cooperation. After the calculation, participants

P_{i}

can only get the sorting position

r_{i s}

in the joint array

T = (T_{1}, \dots, T_{n})

of each element

t_{i s}

;

s \in [1, e_{i}]

in their own array.

Calculation principle:

(1): Participants jointly agree on a complete set $J = [1, N]$ , satisfying $T_{i} \subseteq J$ . Under the complete set $J$ , each participant $P_{i}$ constructs an n-dimension vector according to their own array $T_{i}$ :

$Y_{i} = (y_{i 1}, \dots, y_{i N})$

(1)

where for each $j \in J$ , define:

$y_{i j} = \{\begin{matrix} 1, & j \in T_{i} \\ 0, & j \notin T_{i} \end{matrix}$

(2)
(2): Let $Y_{1}^{*} = Y_{1}$ , $P_{i} (i = 1, \dots, n - 1)$ take turns sending the vector $Y_{i}^{*}$ to $P_{i + 1}$ ; $P_{i + 1}$ construct a new vector according to $Y_{i}^{*}$ and $Y_{i + 1}$ :

$Y_{i + 1}^{*} = (y_{(i + 1) 1}^{*}, \dots, y_{(i + 1) N}^{*})$

where for each $j \in J$ , define:

$y_{(i + 1) j}^{*} = \{\begin{matrix} y_{(i + 1) j}, & y_{(i + 1) j} = 1 \\ y_{i j}^{*}, & y_{(i + 1) j} = 0 \end{matrix}$

(3)
(3): $P_{i}$ calculates the sorting position in the following way:

$r_{i s} = \sum_{j = 1}^{t_{i s}} y_{n j}^{*}$

(4)

3.6. Coding Method

Coding method 1. Participants

P_{i}

(i = 1, 2)

construct the coding vector of their own sets through the ranking method in which the same numbers have the same order [30]:

P_{i}

convert their private sets

S_{i}

into an ordered array

T_{i} = (t_{i 1}, \dots, t_{i e_{i}})

, and participants

P_{i}

agree on a complete set

J = [1, N]

satisfying

T_{i} \subseteq J

. The two participants use the ranking method for cooperative confidential calculation. After the calculation, the participants

P_{i}

could only know the sorting position

r_{i s}

of each element

t_{i s}

;

s \in [1, e_{i}]

in their combined array

T = (T_{1}, T_{2})

. At the same time,

P_{i}

can know the order

{r^{'}}_{i s}

(s = 1, \dots, e_{i})

of each element in their set. Make the set

U = \{u_{1}, \dots, u_{n}\}

which contains

N

elements. The process in which the participants

P_{i}

encode the private set

S_{i}

as a vector

V_{i}

is as follows:

P_{i}

make the elements in which the order are

r_{i s}

in the set

U

(i.e.,

k = r_{i s}

, where

k

is the subscript of element

u

in the set

U

) which be random even numbers, and the elements in other positions be random odd numbers, then

P_{i}

gets

V_{i} = (v_{i, 1}, \dots, v_{i, n})

, an n-dimension vector, respectively.

If the random odd or even numbers selected in the coding process are different, the coding vectors are different, that is, a set can be coded into many different vectors.

Example 1.

The following uses plaintext to demonstrate the coding process: Assume that participants

P_{1}

,

P_{2}

have sets

S_{1} = {1, 3}

and

S_{2} = {3, 2, 6}

, respectively. They convert

S_{1}

,

S_{2}

to an ordered array

T_{1} = (t_{11}, t_{12}) = (1, 3)

and

T_{2} = (t_{21}, t_{22}, t_{23}) = (2, 3, 6)

. Select

N = 7

, the complete set

U = \{u_{1}, \dots, u_{7}\} = \{0, \dots, 0\}

, and two participants construct 7-dimensional vectors

Y_{1} = (1, 0, 1, 0, 0, 0, 0)

,

Y_{2} = (0, 1, 1, 0, 0, 1, 0)

, respectively, according to Formula (1) in Section 3.5.

P_{1}

sends the vector

Y_{1}^{*} = Y_{1}

to

P_{2}

,

P_{2}

gets

Y_{2}^{*} = (1, 1, 1, 0, 0, 1, 0)

according to the Formula (3). According to

Y_{2}^{*}

, each participant determines the sorting position of the elements of their respective array in the joint array according to Formula (4), and the sorting result is shown in Table 1.

According to Table 1,

P_{i}

makes the elements in which order are

r_{i s}

in the set

U

(i.e.,

k = r_{i s}

, where

k

is the subscript of element

u

in the set

U

) and be random even numbers, and the elements at other positions be random odd numbers, then

P_{1}

gets an n-dimension vector

V_{1} = (2, 1, 6, 5, 3, 7, 9)

and

P_{2}

gets an n-dimension vector

V_{2} = (3, 2, 8, 4, 3, 1, 9)

.

3.7. Transformation Problem of Set Intersection

Problem description

Suppose there are two participants, Alice and Bob. Alice has a private set

A = \{e_{a, 1}, \dots, e_{a, l_{a}}\}

and Bob has a private set

B = \{e_{b, 1}, \dots, e_{b, l_{b}}\}

. They want to compute a set

T = A \cap B

without disclosing any additional information. After the implementation of the protocol, the participants cannot obtain any additional information except the intersection

T

and inferable information from

T

.

Problem transformation

Alice encodes

A

to

V_{a} = (x_{1}, \dots, x_{n})

, Bob encodes

B

to

V_{b} = (y_{1}, \dots, y_{n})

, and makes the set

T = \emptyset

. For

x_{1}

and

y_{1}

, …,

x_{n}

, and

y_{n}

, Alice or Bob gets the corresponding

a (x_{k} - y_{k})

(k = 1, \dots, n)

through calculation by using the Paillier cryptosystem, where

a

is a random odd number, and then each further determines whether

y_{k}

or

x_{k}

is an odd number or an even number according to what they own

x_{k}

or

y_{k}

, then makes

T \leftarrow T \cup \{u_{k}\}

or

T \leftarrow T \cup \emptyset

, and finally obtains the intersection

T

of

A

and

B

. Taking

x_{1}

and

y_{1}

as an example, the specific execution process is as follows:

Alice obtains

a (x_{1} - y_{1})

by calculations. Alice owns

x_{1}

, and makes the set

T = \emptyset

. ① In the case that

x_{1}

is an even number, if

a (x_{1} - y_{1})

is an even number (where

a

is a random odd number), according to the properties of odd and even numbers, Alice can conclude that

x_{1} - y_{1}

is an even number, since

x_{1}

is even, then

y_{1}

is even. Then Alice makes

T \leftarrow T \cup \{u_{1}\}

. If

a (x_{1} - y_{1})

is an odd number, Alice can conclude that

x_{1} - y_{1}

is an odd number, then

y_{1}

is odd. Then Alice makes

T \leftarrow T \cup \emptyset

. ② In the case that

x_{1}

is an odd number, Alice makes

T \leftarrow T \cup \emptyset

.

For

x_{2}

and

y_{2}

,

\dots

,

x_{n}

, and

y_{n}

, Alice performs similar calculations and judgments as the above execution. Eventually, Alice gets the intersection

T

of

A

and

B

, and tells Bob

T

.

According to the elements

u_{k}

(k = 1, \dots, n)

in the intersection

T

, the elements of the corresponding order position

{r^{'}}_{i s} = k

(where

i = 1, 2

and

s = 1, \dots, e_{i}

) in Alice and Bob’s respective private sets are the plain elements of the intersection set

T

.

Example 2.

Based on Example 1, it demonstrates the judgment process in plaintext: Let the private sets of Alice and Bob be

A = S_{1} = {1, 3}

and

B = S_{2} = {3, 2, 6}

, respectively, and the corresponding encoding vector be:

V_{a} = (x_{1}, \dots, x_{n}) = V_{1} = (2, 1, 6, 5, 3, 7, 9),

V_{b} = (y_{1}, \dots, y_{n}) = V_{2} = (3, 2, 8, 4, 3, 1, 9) .

For

x_{1} = 2

and

y_{1} = 3

, let Bob take random odd numbers

a = 3

. Alice obtained

a (x_{1} - y_{1}) = (3 \times (2 - 3) = - 3)

by calculations. Let the set

T = \emptyset

, because Alice knows

a

is odd, and

a (x_{1} - y_{1}) = - 3

is odd; according to the operational properties of odd and even numbers, Alice can conclude that

x_{1} - y_{1}

is odd (value is

- 1

), and has

x_{1} = 2

which is an even number, then concludes that

y_{1}

(value is 3) is odd, which makes

T \leftarrow T \cup \emptyset

.

For

x_{2}

and

y_{2}

,

\dots

,

x_{n}

, and

y_{n}

, Alice performs similar calculations and judgments as the above execution. Eventually, Alice gets the intersection

T = \{u_{3}\}

of

A

and

B

, and tells Bob

T

.

According to Table 1, the element

{3}

of the corresponding order position

{r^{'}}_{i s} = k = 3

(where

i = 1, 2

and

s = 1, \dots, e_{i}

) in the sets

A

and

B

is the plain element of the intersection set

T

, so the intersection

T = {3}

.

4. The MPC Protocol of Set Intersection under the Semi-Honest Model

The outline of the protocol is presented in Algorithm 1 and the detailed procedures of the Private Set Intersection (PSI) under the semi-honest model are elucidated in Protocol 1. The development of the MPC protocol in the presence of a malicious model is anchored on the semi-honest protocol and takes into account potential malicious conduct [29]. Hence, it is crucial to examine both the protocol and malicious behaviors in Protocol 1.

Algorithm 1. Computing the set intersection under the semi-honest model.

Input:

A

: Alice’s input;

B

: Bob’s input;

p k

(g, N)

: public key;

s k

(λ)

: Alice’s private key;

E n c o d e

: encode with the Coding method 1;

E

: encrypt by the Paillier system;

c

: ciphertexts;

D

: decrypt the ciphertexts.
(1)

E n c o d e (A) = V_{a} = (x_{1}, \dots, x_{n})

(2)

E n c o d e (B) = V_{b} = (y_{1}, \dots, y_{n})

(3)

E_{p k} (x) = c_{1} = g^{x_{1}} r_{1}^{N} \mod N^{2}

(4) Select

a

and

r_{2}

(5) Compute

c_{2} = c_{1}^{a} {(g^{{a y}_{1}})}^{- 1} r_{2}^{N} \mod N^{2} = g^{a (x_{1} - y_{1})} {(r_{1}^{a} r_{2})}^{N} \mod N^{2}

(6)

D (c_{2}) = a (x_{1} - y_{1})

(7) Make

T = \emptyset

(8) If

x_{1}

is even and

a (x_{1} - y_{1})

is even then

x_{1} - y_{1}

is even;

y_{1}

is even;
make

T \leftarrow T \cup {u_{1}}

;
else if

a (x_{1} - y_{1})

is odd then

x_{1} - y_{1}

is odd;

y_{1}

is odd;
make

T \leftarrow T \cup \emptyset

;
else if

x_{1}

is odd then
make

T \leftarrow T \cup \emptyset

(9) Perform the steps (3)–(8) for

x_{2}

and

y_{2}

,

\dots

,

x_{n}

and

y_{n}

(10) Obtain the intersection

T

(11) Obtain plain elements of

T

according to the elements

u_{k}

(k = 1, \dots, n)

in the

T

Output: Intersection of

A

and

B

.

Protocol 1. The MPC protocol of set intersection under the semi-honest model.

Input: Private set

A

of Alice, private set

B

of Bob.
Output: The intersection

T

of

A

and

B

.
Preparation: The Paillier cryptosystem’s public and private keys,

(g, N)

and

λ

respectively, are created by Alice. Following this, Alice transmits the public key to Bob.
(1) Alice encodes

A

to

V_{a} = (x_{1}, \dots, x_{n})

and Bob encodes

B

to

V_{b} = (y_{1}, \dots, y_{n})

. Since the following calculations are the same for

x_{1}

and

y_{1}

, …,

x_{n}

, and

y_{n}

, the calculation of

x_{1}

and

y_{1}

are described as an example. In each of the following steps,

x_{2}

and

y_{2}

, …,

x_{n}

, and

y_{n}

are calculated at the same time as

x_{1}

and

y_{1}

.
(2) Alice encrypts

x_{1}

with the public key to

c_{1} = g^{x_{1}} r_{1}^{N} \mod N^{2}

and sends

c_{1}

to Bob.
(3) Then Bob selects new random numbers

a

(

a

takes a random odd number) and

r_{2}

to calculate

c_{2} = c_{1}^{a} {(g^{{a y}_{1}})}^{- 1} r_{2}^{N} \mod N^{2} = g^{a (x_{1} - y_{1})} {(r_{1}^{a} r_{2})}^{N} \mod N^{2}

,
and sends

c_{2}

to Alice.
(4) Alice gets

a (x_{1} - y_{1})

by decrypting

c_{2}

. Alice owns

x_{1}

, and makes the set

T = \emptyset

. ① In the case that

x_{1}

is an even number, if

a (x_{1} - y_{1})

is an even number (where

a

is a random odd number), according to the properties of odd and even numbers, Alice can conclude that

x_{1} - y_{1}

is an even number, since

x_{1}

is even, then

y_{1}

is even. Then Alice makes

T \leftarrow T \cup {u_{1}}

. If

a (x_{1} - y_{1})

is an odd number, Alice can conclude that

x_{1} - y_{1}

is an odd number, then

y_{1}

is odd. Then Alice makes

T \leftarrow T \cup \emptyset

. ② In the case that

x_{1}

is an odd number, Alice makes

T \leftarrow T \cup \emptyset

.
(5) For

x_{2}

and

y_{2}

,

\dots

,

x_{n}

, and

y_{n}

, Alice and Bob perform the calculation and judgment in steps (2)–(4) above at the same time. Eventually, Alice obtains the intersection

T

of

A

and

B

.
(6) Alice sends

T

to Bob. According to the elements

u_{k}

(k = 1, \dots, n)

in the intersection

T

, the elements of the corresponding order position

{r^{'}}_{i s} = k

(where

i = 1, 2

and

s = 1, \dots, e_{i}

) in Alice and Bob’s respective private sets are the plain elements of the intersection set

T

.
The protocol ends.

For Protocol 1 to be secure, it is imperative that both Alice and Bob adhere to the semi-honest model. Failure to do so would compromise the protocol’s security, and as such, measures need to be taken to address any malicious behavior in the semi-honest model protocol. This ensures the protocol can withstand potential attacks by malicious participants.

5. The MPC Protocol of Set Intersection under the Malicious Model

Solution: A widely used approach to designing MPC protocols for the malicious model is to examine potential attacks on protocols in the semi-honest model and devise corresponding countermeasures [29]. This ensures that malicious parties cannot engage in harmful activities, or if they do, they can be detected by the other party.

It is worth noting that certain malicious behaviors that cannot be prevented under the ideal model are similarly unavoidable under the malicious model. Nonetheless, protocols under the malicious model must still maintain the same level of security as those under the ideal model. Malicious behaviors such as refusal to participate, false input, and premature termination of protocols are not taken into consideration in actual protocols.

In Protocol 1, possible malicious acts include:

(1): If Bob is characterized as being semi-honest, Alice may engage in deceitful behavior, such as deliberately conveying an incorrect set $T$ to Bob at the conclusion of the protocol, resulting in an inaccurate outcome for Bob. It is inequitable for either party to disclose the computation results to the other.
(2): When Alice is semi-honest, Bob can perform malicious acts such as: $①$ Bob does not choose a real random number $r_{2}$ when calculating $c_{2} = g^{a (x_{i} - y_{i})} {(r_{1}^{a} r_{2})}^{N} \mod N^{2}$ . However, as the decryption has eliminated the impact of $r_{2}$ , Bob cannot get any private information from $r_{2}$ . $②$ If $a$ selected by Bob is not a random odd number, but a random even number, then when Alice publishes the value of $a (x_{k} - y_{k})$ , Bob will get the correct result, but Alice will not get the correct result.

In view of the above malicious acts, Protocol 1 is improved by using such cryptographic methods such as cut-and-choose method and zero-knowledge proof, and Alice and Bob jointly generate random odd numbers

a

in Protocol 1. Neither Alice nor Bob knows the true value of

a

, but they can use

a

to complete the above protocol.

5.1. Specific Protocol

It is important to acknowledge that while the protocol cannot eliminate the possibility of participants engaging in malicious behavior, it is capable of detecting such behavior. During the execution of the protocol, if Alice is truthful while Bob is being deceptive, Alice can identify Bob’s malicious actions. Conversely, Bob can also detect any deceptive behavior on Alice’s part. However, if both Alice and Bob are acting maliciously, it has been demonstrated in theory that designing an MPC protocol that can address this scenario is impossible, and thus will not be further discussed. The outline of the protocol is presented in Algorithm 2, and the step-by-step procedure for the protocol is presented in Protocol 2.

Algorithm 2. Computing the set intersection under the malicious model.

Input:

A

: Alice’s input;

B

: Bob’s input;

(g_{a}, N_{a})

: public key generated by Alice;

(g_{b}, N_{b})

: Public key generated by Bob;

λ_{a}

: Alice’s private key;

λ_{b}

: Bob’s private key;

E n c o d e

: encode with the Coding method 1;

c

: ciphertexts.
(1) Compute

u = g_{a}^{λ_{a}} \mod N_{a}^{2}

(2) Compute

v = g_{b}^{λ_{b}} \mod N_{b}^{2}

(3) Exchange

(g_{a}, N_{a}, u)

and

(g_{b}, N_{b}, v)

(4)

E n c o d e (A) = V_{a} = (x_{1}, \dots, x_{n})

(5)

E n c o d e (B) = V_{b} = (y_{1}, \dots, y_{n})

(6) Select

m

odd numbers

a_{i}

(i = 1, \dots, m)

(7) Select

m

odd numbers

b_{i}

(i = 1, \dots, m)

(8)

(c_{1 a}^{i}, c_{2 a}^{i}) = (g_{a}^{a_{i} x_{1}} \mod N_{a}^{2}, g_{a}^{a_{i}} \mod N_{a}^{2})

(9)

(c_{1 b}^{i}, c_{2 b}^{i}) = (g_{b}^{b_{i} y_{1}} \mod N_{b}^{2}, g_{b}^{b_{i}} \mod N_{b}^{2})

(10) Select

m / 2

sets of

(c_{1 b}^{i}, c_{2 b}^{i})

from

m

sets of

(c_{1 b}^{i}, c_{2 b}^{i})

(11) Verify
If

b_{i} \mod 2 \neq 0

and

g_{b}^{b_{i} y_{1}} \mod N_{b}^{2} = c_{1 b}^{i}

then continue
else terminate
(12) Select

m / 2

sets of

(c_{1 a}^{i}, c_{2 a}^{i})

from

m

sets of

(c_{1 a}^{i}, c_{2 a}^{i})

(13) Verify
If

a_{i} \mod 2 \neq 0

and

g_{a}^{a_{i} x_{1}} \mod N_{a}^{2} = c_{1 a}^{i}

then continue
else terminate
(14) Select a group of

(c_{1 b}^{j}, c_{2 b}^{j})

and

(c_{1 a}^{i}, c_{2 a}^{i})

from the remaining

(c_{1 b}^{i}, c_{2 b}^{i})

and

(c_{1 a}^{i}, c_{2 a}^{i})

(15) Select random numbers

a \in Z_{b}^{*}

and

b \in Z_{a}^{*}

(16)

{c_{b} = E}_{b} (a b_{j} (x_{1} - y_{1})) = {(c_{2 b}^{j})}^{{a x}_{1}} {(c_{1 b}^{j})}^{- a} r_{1}^{N_{b}} \mod N_{b}^{2} = g_{b}^{{a b}_{j} (x_{1} - y_{1})} r_{1}^{N_{b}} \mod N_{b}^{2}

(17)

{c_{a} = E}_{a} (a_{i} b (x_{1} - y_{1})) = {(c_{1 a}^{i})}^{b} {(c_{2 a}^{i})}^{- b y_{1}} r_{2}^{N_{a}} \mod N_{a}^{2} = g_{a}^{a_{i} b (x_{1} - y_{1})} r_{2}^{N_{a}} \mod N_{a}^{2}

(18)

m_{a} = c_{a}^{λ_{a}} \mod N_{a}^{2}

(19)

m_{b} = c_{b}^{λ_{b}} \mod N_{b}^{2}

(20) Exchange

m_{a}

and

m_{b}

(21) Verify
If

\log_{c_{a}} m_{a} = \log_{g_{a}} u

and

\log_{c_{b}} m_{b} = \log_{g_{b}} v

then continue
else terminate
(22) Compute

L (m_{a}) / L (u)

to obtain

a_{i} (x_{1} - y_{1})

(23) Make

T = \emptyset

(24) If

y_{1}

is even and

a_{i} (x_{1} - y_{1})

is even then

x_{1} - y_{1}

is even;

x_{1}

is even;
make

T_{b} \leftarrow T_{b} \cup {u_{1}}

;
else if

a_{i} (x_{1} - y_{1})

is odd then

x_{1} - y_{1}

is odd;

x_{1}

is odd;
make

T_{b} \leftarrow T_{b} \cup \emptyset

;
else if

y_{1}

is odd then
make

T_{b} \leftarrow T_{b} \cup \emptyset

(25) Compute

L (m_{b}) / L (v)

to obtain

b_{j} (x_{1} - y_{1})

(26) Perform steps similar to step (24) to make

T_{a} \leftarrow T_{a} \cup {u_{1}}

or

T_{a} \leftarrow T_{a} \cup \emptyset

(27) Perform the steps (6)–(26) above for

x_{2}

and

y_{2}

,

\dots

,

x_{n}

, and

y_{n}

(28) Obtain

T_{a}

and

T_{b}

(29) Obtain plain elements of

T_{a}

and

T_{b}

according to the elements

u_{k}

(k = 1, \dots, n)

in the

T_{a}

and

T_{b}

Output: Intersection of

A

and

B

.

Protocol 2. The MPC protocol of the set intersection under the malicious model.

Input: Private set

A

of Alice; private set

B

of Bob.
Output: The intersection

T

of

A

and

B

.
Preparation: Alice and Bob each create a public key,

(g_{a}, N_{a})

and

(g_{b}, N_{b})

, respectively, for the Paillier cryptosystem, and compute values

u = g_{a}^{λ_{a}} \mod N_{a}^{2}

and

v = g_{b}^{λ_{b}} \mod N_{b}^{2}

. They then exchange values

(g_{a}, N_{a}, u)

and

(g_{b}, N_{b}, v)

.
(1) Alice encodes

A

to

V_{a} = (x_{1}, \dots, x_{n})

and Bob encodes

B

to

V_{b} = (y_{1}, \dots, y_{n})

. Since the following calculations are the same for

x_{1}

and

y_{1}

,

\dots

,

x_{n}

, and

y_{n}

, the calculation of

x_{1}

and

y_{1}

are described as an example. In each of the following steps,

x_{2}

and

y_{2}

,

\dots

,

x_{n}

, and

y_{n}

are calculated at the same time as

x_{1}

and

y_{1}

.
(2) For

x_{1}

and

y_{1}

, Alice and Bob each randomly select

m

odd numbers, denoted as

a_{i}

and

b_{i}

, respectively, where

i = 1, \dots, m

, and then compute the value of:

(c_{1 a}^{i}, c_{2 a}^{i}) = (g_{a}^{a_{i} x_{1}} \mod N_{a}^{2}, g_{a}^{a_{i}} \mod N_{a}^{2})

,

(c_{1 b}^{i}, c_{2 b}^{i}) = (g_{b}^{b_{i} y_{1}} \mod N_{b}^{2}, g_{b}^{b_{i}} \mod N_{b}^{2})

,
then publish

(c_{1 a}^{i}, c_{2 a}^{i})

,

(c_{1 b}^{i}, c_{2 b}^{i})

, respectively.
(3) Alice employs the cut-and-choose technique to randomly select

m / 2

sets of

(c_{1 b}^{i}, c_{2 b}^{i})

from a total of

m

sets of

(c_{1 b}^{i}, c_{2 b}^{i})

. She then requests Bob to make public the corresponding values of

b_{i}

and

g_{b}^{b_{i} y_{1}}

, which she subsequently validates using

b_{i} \mod 2 \neq 0

and

g_{b}^{b_{i} y_{1}} \mod N_{b}^{2} = c_{1 b}^{i}

. The protocol proceeds if the verification is successful, otherwise it halts.
(4) Bob randomly chooses

m / 2

groups of

(c_{1 a}^{i}, c_{2 a}^{i})

from a set of

m

groups of

(c_{1 a}^{i}, c_{2 a}^{i})

, and requests Alice to disclose the corresponding

a_{i}

and

g_{a}^{a_{i} x_{1}}

. Bob then verifies

a_{i} \mod 2 \neq 0

and

g_{a}^{a_{i} x_{1}} \mod N_{a}^{2} = c_{1 a}^{i}

. If the verification is successful, Bob proceeds with the protocol; otherwise, the protocol terminates.
(5) Alice and Bob respectively and randomly select a group of

(c_{1 b}^{j}, c_{2 b}^{j})

and

(c_{1 a}^{i}, c_{2 a}^{i})

from the remaining

(c_{1 b}^{i}, c_{2 b}^{i})

and

(c_{1 a}^{i}, c_{2 a}^{i})

, and respectively select random numbers

a \in Z_{b}^{*}

and

b \in Z_{a}^{*}

.
Alice calculates:

{c_{b} = E}_{b} (a b_{j} (x_{1} - y_{1})) = {(c_{2 b}^{j})}^{{a x}_{1}} {(c_{1 b}^{j})}^{- a} r_{1}^{N_{b}} \mod N_{b}^{2} = g_{b}^{{a b}_{j} (x_{1} - y_{1})} r_{1}^{N_{b}} \mod N_{b}^{2}

,
Bob calculates:

{c_{a} = E}_{a} (a_{i} b (x_{1} - y_{1})) = {(c_{1 a}^{i})}^{b} {(c_{2 a}^{i})}^{- b y_{1}} r_{2}^{N_{a}} \mod N_{a}^{2} = g_{a}^{a_{i} b (x_{1} - y_{1})} r_{2}^{N_{a}} \mod N_{a}^{2}

,
and they send the results to each other.
(6) Alice calculates

m_{a} = c_{a}^{λ_{a}} \mod N_{a}^{2}

, Bob calculates

m_{b} = c_{b}^{λ_{b}} \mod N_{b}^{2}

. Next,

m_{a}

and

m_{b}

exchange one another.
(7) In Section 3.2, both parties employ the zero-knowledge proof technique to demonstrate the accuracy of their computation results, thereby verifying

\log_{c_{a}} m_{a} = \log_{g_{a}} u

and

\log_{c_{b}} m_{b} = \log_{g_{b}} v

. If one party fails to pass, it is proved to be malicious.
(8) If all are proved, Bob has the ability to compute

L (m_{a}) / L (u)

, which enables him to derive

a_{i} b (x_{1} - y_{1})

, and subsequently acquire

a_{i} (x_{1} - y_{1})

. Bob owns

y_{1}

, and makes the set

T_{b} = \emptyset

. ① In the case that

y_{1}

is an even number, if

a_{i} (x_{1} - y_{1})

is an even number, according to the properties of odd and even numbers, Bob can conclude that

x_{1} - y_{1}

is an even number, so

x_{1}

is even. Then Bob makes

T_{b} \leftarrow T_{b} \cup {u_{1}}

. If

a_{i} (x_{1} - y_{1})

is an odd number, Bob can conclude that

x_{1} - y_{1}

is an odd number, then

x_{1}

is odd. Then Bob makes

T_{b} \leftarrow T_{b} \cup \emptyset

. ② In the case that

y_{1}

is an odd number, Bob makes

T_{b} \leftarrow T_{b} \cup \emptyset

. Alice can calculate

L (m_{b}) / L (v)

to obtain

a b_{j} (x_{1} - y_{1})

, and then obtain

b_{j} (x_{1} - y_{1})

. Alice uses similar methods above to make the set

T_{a} \leftarrow T_{a} \cup {u_{1}}

or

T_{a} \leftarrow T_{a} \cup \emptyset

.
(9) Alice obtains the set

T_{a}

, Bob obtains the set

T_{b}

; the intersection of

A

and

B

,

T = T_{a} = T_{b}

. According to the elements

u_{k}

(k = 1, \dots, n)

in the intersection

T

, the elements of the corresponding order position

{r^{'}}_{i s} = k

(where

i = 1, 2

and

s = 1, \dots, e_{i}

) in Alice and Bob’s respective private sets are the plain elements of the intersection set

T

.
The protocol ends.

5.2. Correctness Analysis

Protocol 2 treats Alice and Bob as equal, therefore only Alice’s implementation is subject to analysis.

(1): Step (1) in Protocol 2 is for the participants to obtain the coding vectors of their own sets. In this process, Alice converts the elements in her set into random even numbers in the vector $V_{a}$ , and the elements that do not exist into random odd numbers. In this way, it is avoided to directly use the original data for calculation.
(2): In step (2), Alice publishes $(c_{1 a}^{i}, c_{2 a}^{i})$ $(i = 1, \dots, m)$ , but the published information is encrypted, and Bob cannot obtain any valuable information.
(3): Steps (3)–(5) employ the cut-and-choose technique to ascertain the presence of any malicious behavior among the participants.
(4): In step (7), Alice is required to prove the correctness of the decryption outcome $m_{a}$ via a zero-knowledge proof. If the $a_{i}$ in the remaining $m / 2$ groups of $(c_{1 a}^{i}, c_{2 a}^{i})$ are also random odd numbers, Bob can calculate $a_{i} (x_{k} - y_{k})$ $(k = 1, \dots, n)$ and judge the parity of $x_{k}$ after publishing $m_{a}$ .
(5): During the implementation of the protocol, the malicious act Alice may successfully perform is that Bob chooses a certain $a_{i}$ which does not meet the requirements, and does not find it during the verification in step (3). In step (5), Bob mistakenly selects it, leading to an incorrect conclusion. Nevertheless, the information of $y_{k}$ remains inaccessible to Alice, because $a b_{j} (x_{k} - y_{k})$ is unsolvable for her (there are two unknowns in an equation), so Alice cannot judge the parity of $y_{k}$ from it.

Alice employs the aforementioned techniques for dishonest purposes. The circumstances that offer the highest likelihood of success are as follows: among

m

groups of

(c_{1 a}^{i}, c_{2 a}^{i})

controlled by Alice,

m - 1

groups satisfy the stipulated criteria while only one group fails to do so, thereby yielding a maximum probability of success denoted as

1 / m

. Assuming that

m = 20

, the most probable chance of successful deception amounts to

\frac{C_{19}^{10}}{C_{20}^{10}} \times \frac{1}{10} = \frac{1}{20}

. Conversely, if 10 groups do not meet the prescribed criteria, the probability of successful deception becomes

\frac{C_{10}^{10}}{C_{20}^{10}} = \frac{1}{184756}

. When

m = 40

, the probabilities are reduced to

2.5 \times 10^{- 2}

and

7.3 \times 10^{- 12}

, respectively. If more than

1 / 2

groups fail to meet the requirements, the probability of successful deception dwindles to zero since it will be discovered during the verification phase. Consequently, the protocol is deemed secure.

(6): In steps (7)–(9), the two parties exchanged ciphertext and decrypted it by themselves, avoiding the situation that one party informed the other of the result, which is fair.

5.3. Security Proof

The security of Protocol 2 against malicious attacks is demonstrated through the application of the real/ideal model paradigm. This proof methodology follows the subsequent steps.

Theorem 1.

Protocol 2, expressed as

Π

, is established to be secure in the presence of malicious adversaries.

Proof.

For Protocol 2 to securely compute function

F

, the participants must identify the approved policy

\bar{A} = (A_{1}, A_{2})

in the actual protocol. This policy should be indistinguishable from the policy

\bar{B} = (B_{1}, B_{2})

used in the protocol under the ideal model. The security of the protocol can be demonstrated by establishing the indistinguishability between

\bar{A} = (A_{1}, A_{2})

and

\bar{B} = (B_{1}, B_{2})

. □

In the protocol, at least one of

A_{1}

and

A_{2}

is honest, so there are two cases.

Case 1:

A_{1}

is honest,

A_{2}

is dishonest.

If

A_{1}

is honest, executing the protocol

Π

will result in:

R E A L_{Π, \overset{—}{A}} (x_{k}, y_{k}) = {F (x_{k}, A_{2} (y_{k}), A_{2} (c_{1 a}^{i}, c_{2 a}^{i}), m_{a}, S} (k = 1, \dots, n)

The sequence of messages received by

A_{2}

for zero-knowledge proof can be represented by the variable

S

.

Assuming

A_{1}

is an honest participant who follows the protocol honestly, the behavior of

B_{1}

can be determined. The objective is to demonstrate that

A_{2}

in the actual protocol is indistinguishable from

B_{2}

in the ideal model. To achieve this, it is necessary to identify an output strategy

\bar{B} = (B_{1}, B_{2})

under the ideal model that is indistinguishable from

R E A L_{_{Π, \overset{—}{A} (x_{k}, y_{k})}}

in the actual model. When the protocol is executed,

A_{2}

is the actual executor, and the correctness of the protocol must be confirmed based on the behavior

A_{2} (y_{k})

of

A_{2}

.

(1)

During the execution of the protocol, since

A_{1}

is an honest participant, it can be inferred that

B_{1}

is also honest, and will replicate the actions of

A_{1}

in transmitting the authentic information

x_{k}

to the trusted third party (TTP).

(2)

During the execution of the protocol, as

A_{2}

is acting in a dishonest manner,

B_{2}

is also acting dishonestly. The information that they send to TTP is dependent on the policy of

B_{2}

, and the policy of

B_{2}

aligns with the policy of

A_{2}

. Consequently, the input message that

B_{2}

transmits to TTP is

A_{2} (y_{2})

.

(3)

The input information obtained by TTP is

(x_{k}, A_{2} (y_{k}))

, and

F (x_{k}, A_{2} (y_{k}))

is calculated.

(4)

B_{2}

gets

F (x_{k}, A_{2} (y_{k}))

from TTP, uses

F (x_{k}, A_{2} (y_{k}))

to get an

v i e w_{B_{2}}^{F} (x_{k}, A_{2} (y_{k}))

which is computational, indistinguishable from the

v i e w_{A_{2}}^{\prod} (x_{k}, A_{2} (y_{k}))

obtained by

A_{2}

when the protocol is actually implemented, and hands

v i e w_{B_{2}}^{F} (x_{k}, A_{2} (y_{k}))

to

A_{2}

to get the output of

A_{2}

. Subsequently, based on its own input and the protocol outcome, the simulator

B_{2}

presumes that the input value of the other party satisfies the outcome, and conducts the protocol execution. In this case,

B_{2}

selects

x_{k}^{'}

to simulate the protocol and generates

F (x_{k}^{'}, A_{2} (y_{k})) = F (x_{k}, A_{2} (y_{k}))

. The particular steps involved in the implementation process of

B_{2}

are as follows:

①: $B_{2}$ sends the information required in step (2) to $A_{2}$ ;
②: After $A_{2}$ publishes the information in step (3), $B_{2}$ will verify it;
③: In step (4), $B_{2}$ will publish the information that $A_{2}$ requires $A_{1}$ to publish;
④: In step (5), $B_{2}$ chooses the necessary information from the remaining sets, computes the information, and then publicly discloses it;
⑤: Calculate $m_{a}^{'}$ in step (6) and publish it;
⑥: During step (7), zero-knowledge proof is utilized to authenticate the information, and as a result, $B_{2}$ acquires the message sequence $S^{'}$ .

B_{2}

obtains the following information during the execution of the protocol:

I D E A L_{F, \overset{—}{B}} (x_{k}, y_{k}) = {F (x_{k}, A_{2} (y_{k})), A_{2} (C_{1 a}^{i^{'}}, C_{2 a}^{i^{'}}), m_{a}^{'}, S^{'}} .

In steps (2)–(7) of the protocol, since the probability encryption algorithm adopted by the protocol is the same, then

(C_{1 a}^{i}, C_{2 a}^{i}) \overset{c}{\equiv} (C_{1 a}^{i^{'}}, C_{2 a}^{i^{'}})

,

m_{a}^{'} \overset{c}{\equiv} m_{a}

. Zero-knowledge proof guarantees

S^{'} \overset{c}{\equiv} S

. Then:

{I D E A L_{F, \overset{—}{B}} (x_{k}, y_{k})} \overset{c}{\equiv} {R E A L_{Π, \overset{—}{A}} (x_{k}, y_{k})} .

Case 2:

If

A_{1}

is dishonest and

A_{2}

is honest, there are two cases:

(1): If Alice ignores TTP after getting the information, TTP will send $⊥$ to Bob. Then:

$R E A L_{Π, \overset{—}{A}} (x_{k}, y_{k}) = {A_{1} (c_{1 b}^{i}, c_{2 b}^{i}), m_{b}, S, ⊥} .$
(2): Otherwise, TTP will send $F (A_{1} (x_{k}), y_{k})$ to Bob, then:

$R E A L_{Π, \overset{—}{A}} (x_{k}, y_{k}) = {A_{1} (c_{1 b}^{i}, c_{2 b}^{i}), m_{b}, S, F (A_{1} (x_{k}), y_{k})} (k = 1, \dots, n) .$

The sequence of messages received by

A_{1}

for zero-knowledge proof can be represented by the variable

S

.

As

A_{2}

is an honest participant and executes the protocol as prescribed, the behavior of

B_{2}

can be determined. The objective is to prove that

A_{1}

in the actual protocol is indistinguishable from

B_{1}

in the ideal model. To accomplish this, it is necessary to identify a policy

\bar{B} = (B_{1}, B_{2})

under the ideal model, whose output is indistinguishable from

R E A L_{Π, \overset{—}{A} (x_{k}, y_{k})}

in the actual model. During the execution of the protocol, the actual executor is

A_{1}

. Hence, while proving the protocol’s correctness, it is crucial to verify it based on the behavior

A_{1} (x_{k})

of

A_{1}

.

(1)

During the execution of the actual protocol,

A_{1}

is acting dishonestly; as a result,

B_{1}

is also acting dishonestly. The information transmitted by

B_{1}

to TTP is dependent on the policy of

B_{1}

, which is the same as the policy of

A_{1}

. Then

B_{1}

will send

A_{1} (x_{k})

to TTP.

(2)

During the execution of the actual protocol, as

A_{2}

is an honest participant, it can be inferred that

B_{2}

is also honest, and it sends TTP the real input information

y_{k}

.

(3)

The input information obtained by TTP is

(A_{1} (x_{k}), y_{k})

, and TTP calculates

F (A_{1} (x_{k}), y_{k})

.

(4)

B_{1}

uses

F (A_{1} (x_{k}), y_{k})

obtained from TTP to obtain

v i e w_{B_{1}}^{F} (A_{1} (x_{k}), y_{k})

, and

v i e w_{B_{1}}^{F} (A_{1} (x_{k}), y_{k})

should be computationally indistinguishable from

v i e w_{A_{1}}^{\prod} (A_{1} (x_{k}), y_{k})

obtained from the actual protocol implemented by

A_{1}

. The output of

A_{1}

is obtained by handing over the execution of the protocol to

A_{1}

after

v i e w_{B_{1}}^{F} (A_{1} (x_{k}), y_{k})

. Subsequently,

B_{1}

conducts the protocol execution by presuming that the other party’s input satisfies the outcome based on its own input and calculation results, that is,

B_{1}

selects

y_{k}^{'}

to simulate the protocol and makes

F (A_{1} (x_{k}), y_{k}^{'}) = F (A_{1} (x_{k}), y_{k}^{'})

. The specific implementation process of

B_{1}

is as follows:

①: $B_{1}$ sends the information required in step (2) to $A_{1}$ ;
②: In step (3), $B_{1}$ will publish the information that $A_{1}$ requires $A_{2}$ to publish;
③: After $A_{1}$ publishes the information in step (4) of the protocol, $B_{1}$ will verify it;
④: In step (5), $B_{1}$ chooses the necessary information from the remaining sets, computes the information, and then publicly discloses it;
⑤: Calculate $m_{b}^{'}$ in step (6) and publish it;
⑥: In step (7), ZKP is used to verify the information and $B_{1}$ obtains the message sequence $S^{'}$ .

When

B_{1}

executes the protocol, there are two situations:

(1): If $A_{1}$ ignores TTP after getting the information, then:

$I D E A L_{F, \overset{—}{B}} (x_{k}, y_{k}) = {A_{1} (c_{1 b}^{i^{'}}, c_{2 b}^{i^{'}}), m_{b}^{'}, S^{'}, ⊥} .$
(2): Otherwise,

I D E A L_{F, \overset{—}{B}} (x_{k}, y_{k}) = {A_{1} (c_{1 b}^{i^{'}}, c_{2 b}^{i^{'}}), m_{b}^{'}, S^{'}, F (A_{1} (x_{k}), y_{k}^{'})} .

In steps (2)–(7) of the protocol, since the probability encryption algorithm adopted by the protocol is the same, then

(C_{1 b}^{i}, C_{2 b}^{i}) \overset{c}{\equiv} (C_{1 b}^{i^{'}}, C_{2 b}^{i^{'}})

,

m_{b}^{'} \overset{c}{\equiv} m_{b}

. Zero-knowledge proof guarantees

S^{'} \overset{c}{\equiv} S

. Then:

{R E A L_{Π, \overset{—}{A}} (x_{k}, y_{k})} \overset{c}{\equiv} {I D E A L_{F, \overset{—}{B}} (x_{k}, y_{k})} .

In conclusion, for any probabilistic polynomial-time policy

\bar{A} = (A_{1}, A_{2})

that is acceptable during the actual protocol execution, there exists a probabilistic polynomial-time policy

\bar{B} = (B_{1}, B_{2})

that is acceptable under the ideal model. This makes

I D E A L_{F, \overset{—}{B}} (x_{k}, y_{k})

and

R E A L_{Π, \overset{—}{A}} (x_{k}, y_{k})

indistinguishable during computation. Consequently, Protocol 2 is secure under the malicious model.

6. Performance and Comparison of Protocols

The authors of [15,17] have implemented the calculation of PSI under the malicious model. In this study, the performance of Protocol 2 is evaluated by comparing its computational complexity, communication complexity, and experimental simulation time.

6.1. Computation Complexity

The protocol in study [15] is designed based on OT extension, and the computational complexity is

O (n^{2})

. The protocol in study [17] is based on the objective linear function evaluation (OLE), so it requires a lot of public key encryption operations, and the computation complexity is

O (n l b^{2} n)

.

In this paper’s Protocol 2, both Alice and Bob are required to encode their respective private sets into vectors using the secure ranking method where identical elements have the same order. The computational complexity of encoding vectors generated by both parties is denoted by

O (n)

. Additionally, Alice and Bob need to generate

m n

groups of modular index, where

m

is the number of randomly selected odd numbers

a_{i}

or

b_{i}

, and

n

is the number of elements in the set coding vector. Each party needs

2 m n

modular index operations, which require

4 m n

modular index operations in total. Both parties need to verify

n (m / 2)

modulus index and perform

n (m / 2)

modulus index operations, requiring a total of

m n

modulus index calculations. For each zero-knowledge proof of discrete logarithms, six modular exponential operations are performed, and each participant performs the zero-knowledge proof of

n

discrete logarithms, requiring a total of

12 n

modular exponential operations. Overall,

n (5 m + 12)

modular exponential operations are required, and

m = 20

is usually sufficient. Therefore, Protocol 2 requires

112 n

modular exponential operations, and the computation complexity is

O (n)

.

6.2. Communication Complexity

The communication complexity of a protocol is typically measured by the number of communication rounds required to complete the protocol. For example, the PSI MPC protocol described in study [15] under the malicious model requires eight rounds of communication. On the other hand, the PSI calculation under the malicious model in study [17] requires 12 rounds of communication.

In Protocol 2 proposed in this paper, the private sets of both Alice and Bob are required to be encoded into vectors, and the two parties need to conduct one round of communication to generate coding vectors using the secure ranking method in which the same numbers have the same order. In addition, Alice and Bob need to conduct four rounds of communication to implement other parts of Protocol 2. Therefore, the implementation of Protocol 2 requires five rounds of communication in total.

According to the results presented in Table 2, n represents the number of elements in the sets, it can be observed that for solving the PSI problem under the malicious model, the computational complexity of Protocol 2 is lower than that of studies [15,17], which implies that Protocol 2 is computationally more efficient. Moreover, the number of communication rounds required by Protocol 2 is lower than that of studies [15,17], indicating that the communication efficiency of Protocol 2 is higher. It can be concluded that Protocol 2 provides a more efficient and practical solution to the PSI problem under the malicious model.

It is worth noting that MPC protocols under the malicious model often require the use of cryptographic tools such as zero-knowledge proof and cut-and-choose, which may increase computation complexity and reduce efficiency. However, preprocessing or computing outsourcing can be used to improve efficiency [29].

6.3. Simulation Experiment

To assess the efficiency of Protocol 2 in this paper, a simulation was conducted using Python language on the PyCharm platform. The goal of the simulation is to compare the performance of Protocol 2 with existing protocols.

Experimental environment: Windows 10 64-bit system, Intel(R) Core(TM) i5-8400 CPU @ 2.80 GHz, 16 GB RAM.

Experimental parameter setting: The Paillier encryption scheme used in the experiment is based on 512-bit large primes

p

and

q

, resulting in a modulus

N = p q

with a length of 1024 bits. The discrete logarithm operation uses a module

p

with a length of 1024 bits, and the length of the random number is set to 64 bits. The length of PSI elements used in all calculations is set to 128 bits.

When the number of set elements of each participant is

2^{4}

,

2^{5}

,

2^{6}

,

2^{7}

,

2^{8}

,

2^{9}

, the simulation of the private set intersection protocol under the malicious model proposed in this paper was carried out. Table 3 shows the communication traffic and runtime of it.

As shown in Table 3, it can be concluded that Protocol 2 proposed in this paper has higher computing efficiency and lower communication traffic under the sets of different numbers of elements.

Next, the communication traffic and runtime of Protocol 2 in this paper are compared with those of related schemes under the sets with different numbers of elements. Figure 1 shows the communication traffic comparison between Protocol 2 and other existing schemes.

Based on the results shown in Figure 1, it can be concluded that Protocol 2 requires less communication traffic compared to study [15], and the communication efficiency is higher, especially when the number of elements in each set is the same.

When the number of set elements is different, the runtime comparison between Protocol 2 and other existing schemes is shown in Figure 2.

Figure 2 demonstrates that the runtime of Protocol 2 increases as the number of elements in the set increases. However, when the number of elements in the set is the same, Protocol 2 has a lower runtime compared to studies [15,17], indicating higher efficiency.

Additional performance assessment was carried out through communication experiments. For this purpose, Python programs were developed using the PyCharm platform to simulate experiments with a bandwidth of 100 Mbps. The objective was to ascertain the potential delay duration during the execution of Protocol 2. It should be emphasized that, in practical settings, the delay time between different networks can vary, which might affect the performance of protocol execution. Nevertheless, this aspect was not considered in the performance evaluation. The outcomes of the communication experiments for Protocol 2 are presented in Figure 3.

The experiment demonstrates that the proposed Protocol 2 shows consistently low delay durations, which exhibit a linear increase in correspondence with the number of elements in the set. This gradual growth rate of the delay time signifies the protocol’s remarkable communication efficiency.

To sum up, Protocol 2 is compared with the existing schemes through experiments, which shows that the set intersection protocol based under the malicious model in this paper is more efficient in terms of communication traffic and runtime. It is important to emphasize that while the protocol presented in this study demonstrates superior efficiency compared to existing privacy-preserving set intersection protocols, it is essential to consider practical limitations such as restricted computational resources and communication bandwidth, as these factors could potentially impact its overall performance.

7. Conclusions

Private set intersection is considered as a significant aspect of MPC, which has diverse application scenarios. In this study, the Paillier encryption system, cut-and-choose, and zero-knowledge proof have been utilized to devise a privacy set intersection protocol under the semi-honest model. Furthermore, an MPC protocol under the malicious model has also been developed, which ensures fairness among participants and can effectively combat malicious attacks. The security of the protocol has been verified using the real/ideal model paradigm. Additionally, this protocol has demonstrated superior efficiency and practicality when compared to existing schemes.

Author Contributions

Conceptualization, X.L. and W.C.; methodology, W.C.; investigation, X.L.; software, D.L. and G.X.; experimental simulation, D.L.; security proof, D.L.; modification of English grammar, D.L.; funding acquisition, D.L.; validation, N.X. and X.C.; writing—original draft, X.L.; writing—review and editing, X.L., N.X. and X.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China: Big Data Analysis based on Software Defined Networking Architecture, grant numbers 62177019 and F0701; NSFC, grant numbers 62271070, 72293583, and 61962009; Inner Mongolia Natural Science Foundation, grant number 2021MS06006; 2023 Inner Mongolia Young Science and Technology Talents Support Project, grant number NJYT23106; 2022 Fund Project of Central Government Guiding Local Science and Technology Development, grant number 2022ZY0024; 2022 Basic Scientific Research Project of Direct Universities of Inner Mongolia, grant number 20220101; 2022 “Western Light” Talent Training Program “Western Young Scholars” Project, grant number 22040601; the 14th Five-Year Plan of Education and Science of Inner Mongolia, grant number NGJGH2021167; 2023 Open Project of the State Key Laboratory of Network and Exchange Technology, grant number 230201; 2022 Inner Mongolia Postgraduate Education and Teaching Reform Project: JGSZ2022037; the 2022 Ministry of Education Central and Western China Young Backbone Teachers and Domestic Visiting Scholars Program, grant number 2022015; Inner Mongolia Discipline Inspection and Supervision Big Data Laboratory Open Project Fund, grant number IMDBD202020; Baotou Kundulun District Science and Technology Plan Project, grant number YF2020013; Inner Mongolia Science and Technology Major Project, grant number 2019ZD025; Project JCKY2021208B036, and the Fundamental Research Funds for Beijing Municipal Commission of Education, grant number 220201.

Data Availability Statement

The authors approve that data used to support the findings of this study are included in the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

Weihong, X.I.E.; Qian, Z. The online website privacy disclosure behavior of users based on concerns-outcomes model. Soft Comput. 2022, 26, 11733–11747. [Google Scholar] [CrossRef]
Knott, B.; Venkataraman, S.; Hannun, A.; Sengupta, S.; Ibrahim, M.; van der Maaten, L. Crypten: Secure multi-party computation meets machine learning. Adv. Neural Inf. Process. Syst. 2021, 34, 4961–4973. [Google Scholar]
Zhou, J.; Feng, Y.; Wang, Z.; Guo, D. Using secure multi-party computation to protect privacy on a permissioned blockchain. Sensors 2021, 21, 1540. [Google Scholar] [CrossRef] [PubMed]
Yao, A.C. Protocols for secure computation. In Proceedings of the 23rd Annual Symposium on Foundation of Computer Science, Chicago, IL, USA, 3–5 November 1982; pp. 160–164. [Google Scholar]
Goldreich, O. The Fundamental of Crytography: Basic Application; Cambridge University Press: London, UK, 2004. [Google Scholar]
Cramer, R.; Damgard, I.B.; Nielsen, J.B. Secure Multiparty Compution; Cambridge University Press: London, UK, 2015. [Google Scholar]
Liu, J.; Tian, Y.; Zhou, Y.; Xiao, Y.; Ansari, N. Privacy preserving distributed data mining based on secure multi-party computation. Comput. Commun. 2020, 153, 208–216. [Google Scholar] [CrossRef]
Yao, Y.; Xiong, N.; Park, J.H.; Ma, L.; Liu, J. Privacy-preserving max/min query in two-tiered wireless sensor networks. Comput. Math. Appl. 2013, 65, 1318–1325. [Google Scholar] [CrossRef]
Nevo, O.; Trieu, N.; Yanai, A. Simple, fast malicious multiparty private set intersection. In Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications Security, Virtual Event, Republic of Korea, 15–19 November 2021; pp. 1151–1165. [Google Scholar]
Fu, A.; Zhang, X.; Xiong, N.; Gao, Y.; Wang, H.; Zhang, J. VFL: A verifiable federated learning with privacy-preserving for big data in industrial IoT. IEEE Trans. Ind. Inform. 2020, 18, 2513–2520. [Google Scholar] [CrossRef]
Kumar, P.; Kumar, R.; Srivastava, G.; Gupta, G.P.; Tripathi, R.; Gadekallu, T.R.; Xiong, N.N. PPSF: A privacy-preserving and secure framework using blockchain-based machine-learning for IoT-driven smart cities. IEEE Trans. Netw. Sci. Eng. 2021, 8, 2326–2341. [Google Scholar] [CrossRef]
Subramaniyaswamy, V.; Jagadeeswari, V.; Indragandhi, V.; Jhaveri, R.H.; Vijayakumar, V.; Kotecha, K.; Ravi, L. Somewhat homomorphic encryption: Ring learning with error algorithm for faster encryption of iot sensor signal-based edge devices. Secur. Commun. Netw. 2022, 2022, 2793998. [Google Scholar] [CrossRef]
Sengan, S.; Subramaniyaswamy, V.; Indragandhi, V.; Velayutham, P.; Ravi, L. Detection of false data cyber-attacks for the assessment of security in smart grid using deep learning. Comput. Electr. Eng. 2021, 93, 107211. [Google Scholar] [CrossRef]
Rosulek, M.; Trieu, N. Compact and malicious private set intersection for small sets. In Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications Security, Virtual Event, Republic of Korea, 15–19 November 2021; pp. 1166–1181. [Google Scholar]
Efraim, A.B.; Nissenbaum, O.; Omri, E.; Paskin-Cherniavsky, A. Psimple: Practical multiparty maliciously-secure private set intersection. Cryptol. Eprint Arch. 2021, 122. Available online: https://eprint.iacr.org/2021/122 (accessed on 22 May 2023).
Liu, X.; Zhang, R.L.; Xu, G.; Chen, X.B. Securely determine the inclusion relation of a point and a convex polygon in malicious model. J. Cryptologic Res. 2022, 9, 524–534. [Google Scholar] [CrossRef]
Ghosh, S.; Nilges, T. An algebraic approach to maliciously secure private set intersection. In Proceedings of the Annual International Conference on the Theory and Applications of Cryptographic Techniques, Darmstadt, Germany, 19–23 May 2019; pp. 154–185. [Google Scholar]
Rabin, M.O. How to exchange secrets with oblivious transfer. Cryptol. Eprint Arch. 2005, 2005, 187. [Google Scholar]
Chase, M.; Miao, P. Private set intersection in the internet setting from lightweight oblivious PRF. In Proceedings of the Annual International Cryptology Conference, Santa Barbara, CA, USA, 17–21 August 2020; pp. 34–63. [Google Scholar]
Chauhan, A.K.; Kumar, A.; Sanadhya, S.K. Quantum free-start collision attacks on double block length hashing with round-reduced AES-256. IACR Trans. Symmetric Cryptol. 2021, 2021, 316–336. [Google Scholar] [CrossRef]
Pinkas, B.; Rosulek, M.; Trieu, N.; Yanai, A. PSI from PaXoS: Fast, malicious private set intersection. In Proceedings of the Annual International Conference on the Theory and Applications of Cryptographic Techniques, Zagreb, Croatia, 10–14 May 2020; pp. 739–767. [Google Scholar]
Zhang, E.; Liu, F.H.; Lai, Q.; Jin, G.; Li, Y. Efficient multi-party private set intersection against malicious adversaries. In Proceedings of the 2019 ACM SIGSAC conference on cloud computing security workshop, London, UK, 11 November 2019; pp. 93–104. [Google Scholar]
Yousefipoor, V.; Eghlidos, T. An efficient, secure and verifiable conjunctive keyword search scheme based on rank metric codes over encrypted outsourced cloud data. Comput. Electr. Eng. 2023, 105, 108523. [Google Scholar] [CrossRef]
Paillier, P. Public-key cryptosystems based on composite degree residuosity classes. In Proceedings of the International Conference on the Theory and Applications of Cryptographic Techniques, Prague, Czech Republic, 2–6 May 1999; pp. 223–238. [Google Scholar]
Goldreich, O.; Oren, Y. Definitions and properties of zero-knowledge proof systems. J. Cryptol. 1994, 7, 1–32. [Google Scholar]
Chaum, D.; Pedersen, T.P. Transferred cash grows in size. In Proceedings of the Workshop on the Theory and Application of Cryptographic Techniques, Gold Coast, Australia, 13–16 December 1992; pp. 390–407. [Google Scholar]
Lindell, Y. Fast cut-and-choose-based protocols for malicious and covert adversaries. J. Cryptol. 2016, 29, 456–490. [Google Scholar] [CrossRef]
Goldreich, O. Foundations of Cryptography: Volume 2, Basic Applications; Cambridge University Press: Cambridge, UK, 2009. [Google Scholar]
Li, S.D.; Wang, W.L.; Du, R.M. Protocol for millionaires’ problem in malicious models. Sci. Sin. Inf. 2021, 51, 75–88. [Google Scholar] [CrossRef]
Li, S.D.; Du, R.M.; Yang, Y.J.; Wei, Q. Secure Multiparty Multi-Data Ranking. Chin. J. Comput. 2020, 43, 1448–1462. [Google Scholar]

Figure 1. Communication traffic comparison with related schemes (Reference [15]: Efraim, A.B. 2021, Reference [17]: Ghosh, S. 2019).

Figure 2. Runtime comparison with relevant schemes (Reference [15]: Efraim, A.B. 2021, Reference [17]: Ghosh, S. 2019).

Figure 3. The relationship between the delay time of Protocol 2 and the number of elements in the set.

Table 1. Results of the ranking method in which the same numbers have the same order.

Element	Algorithm	Sorting Position
1	1	1
2	1 + 1	2
3	1 + 1 + 1	3
6	1 + 1 + 1 + 0 + 0 + 1	4

Table 2. Comparison of relevant PSI protocols.

Protocol	Study [15]	Study [17]	Protocol 2
Number of participants	3	3	2
Security model	Malicious	Malicious	Malicious
Communication rounds	8	12	5
Communication complexity	$O (n l b n)$	$O (n)$	$O (n)$
Computation complexity	$O (n^{2})$	$O (n l b^{2} n)$	$O (n)$
Key methods	Garbled Bloom filters	Oblivious Linear Function Evaluation	Paillier cryptosystem; zero-knowledge proof; cut-and-choose

Table 3. Communication traffic and runtime of protocols with different numbers of set elements.

	$2^{4}$	$2^{5}$	$2^{6}$	$2^{7}$	$2^{8}$	$2^{9}$
Traffic (kb)/Runtime (s)	$2^{4}$	$2^{5}$	$2^{6}$	$2^{7}$	$2^{8}$	$2^{9}$
Protocol 2	4.72 $/$ 0.061	9.51 $/$ 0.118	19.12 $/$ 0.210	38.19 $/$ 0.401	76.45 $/$ 0.799	152.85 $/$ 1.589
Study [15]	5.03 $/$ 0.077	12.12 $/$ 0.123	36.06 $/$ 0.219	99.95 $/$ 0.430	340.83 $/$ 0.910	1001.51 $/$ 2.110
Study [17]	3.98 $/$ 0.073	7.92 $/$ 0.112	15.83 $/$ 0.205	31.65 $/$ 0.411	63.19 $/$ 0.841	126.37 $/$ 1.799

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liu, X.; Chen, W.; Xiong, N.; Luo, D.; Xu, G.; Chen, X. Securely Computing Protocol of Set Intersection under the Malicious Model. Electronics 2023, 12, 2410. https://doi.org/10.3390/electronics12112410

AMA Style

Liu X, Chen W, Xiong N, Luo D, Xu G, Chen X. Securely Computing Protocol of Set Intersection under the Malicious Model. Electronics. 2023; 12(11):2410. https://doi.org/10.3390/electronics12112410

Chicago/Turabian Style

Liu, Xin, Weitong Chen, Neal Xiong, Dan Luo, Gang Xu, and Xiubo Chen. 2023. "Securely Computing Protocol of Set Intersection under the Malicious Model" Electronics 12, no. 11: 2410. https://doi.org/10.3390/electronics12112410

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Securely Computing Protocol of Set Intersection under the Malicious Model

Abstract

1. Introduction

2. Related Work

3. Preliminary Knowledge

3.1. Paillier Cryptosystem

3.2. Zero-Knowledge Proof

3.3. Cut-and-Choose Method

3.4. Security under the Malicious Model

3.5. Ranking Method

3.6. Coding Method

3.7. Transformation Problem of Set Intersection

4. The MPC Protocol of Set Intersection under the Semi-Honest Model

5. The MPC Protocol of Set Intersection under the Malicious Model

5.1. Specific Protocol

5.2. Correctness Analysis

5.3. Security Proof

6. Performance and Comparison of Protocols

6.1. Computation Complexity

6.2. Communication Complexity

6.3. Simulation Experiment

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI