# A Shannon-Theoretic Approach to the Storage–Retrieval Trade-Off in PIR Systems

## Abstract

## 1. Introduction

## 2. Preliminaries

#### 2.1. Problem Definition

#### 2.2. Some Relevant Known Results

#### 2.3. Multiple Description Source Coding

## 3. A Special Case: Slepian–Wolf Coding for Minimum Retrieval Rate

- At database-1, compress and store $({X}_{1}^{L},{X}_{2}^{L})$ losslessly;
- At database-2, encode ${Y}_{1}^{L}$ using a Slepian–Wolf code (more precisely, Sgarro’s code with uncertain side information [29]) so that it can be decoded with either ${X}_{1}^{L}$ or ${X}_{2}^{L}$ as side information at the decoder; denote the resulting code index by ${C}_{{Y}_{1}}$. Encode ${Y}_{2}^{L}$ in the same manner, independently of ${Y}_{1}^{L}$, and denote its code index by ${C}_{{Y}_{2}}$.
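
As a rough illustration of the rate needed at database-2, the sketch below evaluates $\max\{H({Y}_{1}|{X}_{1}),H({Y}_{1}|{X}_{2})\}$, the rate usually associated with coding under uncertain side information, for a toy source. The binary symmetric model, the crossover probabilities, and the names `conditional_entropy`, `bsc_joint`, and `sgarro_rate` are assumptions made for this example only and are not part of the scheme described above.

```python
import math


def conditional_entropy(joint):
    """H(Y|X) in bits for a joint pmf given as {(y, x): probability}."""
    # Marginal of the side information X.
    p_x = {}
    for (y, x), p in joint.items():
        p_x[x] = p_x.get(x, 0.0) + p
    # H(Y|X) = -sum_{y,x} p(y,x) * log2( p(y,x) / p(x) ).
    return -sum(
        p * math.log2(p / p_x[x]) for (y, x), p in joint.items() if p > 0
    )


def bsc_joint(crossover):
    """Hypothetical toy model: Y is a uniform bit, X equals Y flipped
    with probability `crossover`."""
    return {
        (y, x): 0.5 * (crossover if x != y else 1.0 - crossover)
        for y in (0, 1)
        for x in (0, 1)
    }


def sgarro_rate(joint_with_x1, joint_with_x2):
    """Rate sufficient to decode Y with either X1 or X2 at the decoder:
    the larger of the two conditional entropies."""
    return max(
        conditional_entropy(joint_with_x1),
        conditional_entropy(joint_with_x2),
    )


# Assumed toy parameters: X1 is a cleaner observation of Y1 than X2.
rate_y1 = sgarro_rate(bsc_joint(0.1), bsc_joint(0.2))
print(f"rate for C_Y1: {rate_y1:.4f} bits per symbol")
```

In this toy case the weaker side information (crossover 0.2) determines the rate, since the code index must suffice for whichever of the two sequences the decoder holds.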

## 4. Main Result

#### 4.1. A General Inner Bound

- There exist deterministic functions ${f}_{1,1}$, ${f}_{1,2}$, ${f}_{2,1}$, and ${f}_{2,2}$ such that$$\begin{array}{c}{V}_{1}={f}_{1,1}({X}_{0},{X}_{1},{Y}_{1})={f}_{2,2}({X}_{0},{X}_{2},{Y}_{2}),\phantom{\rule{1.em}{0ex}}{V}_{2}={f}_{1,2}({X}_{0},{X}_{1},{Y}_{2})={f}_{2,1}({X}_{0},{X}_{2},{Y}_{1});\end{array}$$
- There exist non-negative coding rates$$\begin{array}{c}\phantom{\rule{2.em}{0ex}}({\beta}_{1}^{\left(0\right)},{\beta}_{1}^{\left(1\right)},{\beta}_{1}^{\left(2\right)},{\beta}_{2}^{\left(1\right)},{\beta}_{2}^{\left(2\right)},{\gamma}_{1}^{\left(0\right)},{\gamma}_{1}^{\left(1\right)},{\gamma}_{1}^{\left(2\right)},{\gamma}_{2}^{\left(1\right)},{\gamma}_{2}^{\left(2\right)})\hfill \\ \phantom{\rule{2.em}{0ex}}\phantom{\rule{2.em}{0ex}}\in {\mathcal{R}}_{MD}^{\ast}\left((({V}_{1},{V}_{2}),{X}_{0},{X}_{1},{X}_{2},{Y}_{1},{Y}_{2}),\right.\hfill \\ \phantom{\rule{2.em}{0ex}}\phantom{\rule{2.em}{0ex}}\phantom{\rule{2.em}{0ex}}\phantom{\rule{2.em}{0ex}}\left.\left(\{{X}_{0},{X}_{1},{Y}_{1}\},\{{X}_{0},{X}_{1},{Y}_{2}\},\{{X}_{0},{X}_{2},{Y}_{1}\},\{{X}_{0},{X}_{2},{Y}_{2}\}\right)\right);\hfill \end{array}$$
- There exist non-negative storage rates $({\alpha}_{1}^{\left(0\right)},{\alpha}_{1}^{\left(1\right)},{\alpha}_{1}^{\left(2\right)},{\alpha}_{2}^{\left(1\right)},{\alpha}_{2}^{\left(2\right)})$ such that$$\begin{array}{c}{\alpha}_{1}^{\left(0\right)}\le {\beta}_{1}^{\left(0\right)},\phantom{\rule{0.5em}{0ex}}{\alpha}_{1}^{\left(1\right)}\le {\beta}_{1}^{\left(1\right)},\phantom{\rule{0.5em}{0ex}}{\alpha}_{1}^{\left(2\right)}\le {\beta}_{1}^{\left(2\right)},\phantom{\rule{0.5em}{0ex}}{\alpha}_{2}^{\left(1\right)}\le {\beta}_{2}^{\left(1\right)},\phantom{\rule{0.5em}{0ex}}{\alpha}_{2}^{\left(2\right)}\le {\beta}_{2}^{\left(2\right)},\end{array}$$$$\begin{array}{c}\hfill {\gamma}_{1}^{\left(0\right)}-{\beta}_{1}^{\left(0\right)}+{\gamma}_{1}^{\left(1\right)}-{\beta}_{1}^{\left(1\right)}+{\gamma}_{1}^{\left(2\right)}-{\beta}_{1}^{\left(2\right)}<H\left({X}_{0}\right)+H\left({X}_{1}\right)+H\left({X}_{2}\right)-H({X}_{0},{X}_{1},{X}_{2}),\end{array}$$$$\begin{array}{c}\hfill ({\alpha}_{1}^{\left(0\right)},{\alpha}_{1}^{\left(1\right)},{\alpha}_{1}^{\left(2\right)},{\gamma}_{1}^{\left(0\right)},{\gamma}_{1}^{\left(1\right)},{\gamma}_{1}^{\left(2\right)})\in {\mathcal{R}}_{MD}^{\ast}\left((({V}_{1},{V}_{2}),{X}_{0},{X}_{1},{X}_{2}),\left(\{{X}_{0},{X}_{1},{X}_{2}\}\right)\right),\end{array}$$$$\begin{array}{c}\hfill {\gamma}_{2}^{\left(1\right)}-{\beta}_{2}^{\left(1\right)}+{\gamma}_{2}^{\left(2\right)}-{\beta}_{2}^{\left(2\right)}<I({Y}_{1};{Y}_{2}),\end{array}$$$$\begin{array}{c}\hfill ({\alpha}_{2}^{\left(1\right)},{\alpha}_{2}^{\left(2\right)},{\gamma}_{2}^{\left(1\right)},{\gamma}_{2}^{\left(2\right)})\in {\mathcal{R}}_{MD}^{\ast}\left((({V}_{1},{V}_{2}),{Y}_{1},{Y}_{2}),\left(\{{Y}_{1},{Y}_{2}\}\right)\right);\end{array}$$
- The normalized average storage rate $\overline{\alpha}$ and retrieval rate $\overline{\beta}$ satisfy$$\begin{array}{cc}\hfill \phantom{\rule{1.em}{0ex}}& 2t\overline{\alpha}\ge {\alpha}_{1}^{\left(0\right)}+{\alpha}_{1}^{\left(1\right)}+{\alpha}_{1}^{\left(2\right)}+{\alpha}_{2}^{\left(1\right)}+{\alpha}_{2}^{\left(2\right)},\hfill \end{array}$$$$\begin{array}{cc}\hfill \phantom{\rule{1.em}{0ex}}& 4t\overline{\beta}\ge 2{\beta}_{1}^{\left(0\right)}+{\beta}_{1}^{\left(1\right)}+{\beta}_{1}^{\left(2\right)}+{\beta}_{2}^{\left(1\right)}+{\beta}_{2}^{\left(2\right)}.\hfill \end{array}$$
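
The last two conditions are simple arithmetic on the component rates, which the following sketch makes explicit. It checks the component-wise constraint that each storage rate does not exceed the corresponding $\beta$ rate and then evaluates the smallest $\overline{\alpha}$ and $\overline{\beta}$ permitted by the two inequalities. The `Rates` container, the function names, and the numerical values are hypothetical; `t` is simply the normalization constant appearing in the inequalities.

```python
from dataclasses import dataclass


@dataclass
class Rates:
    """Five component rates, indexed as in the text:
    database-1 carries (0), (1), (2); database-2 carries (1), (2)."""
    d1_0: float
    d1_1: float
    d1_2: float
    d2_1: float
    d2_2: float

    def as_tuple(self):
        return (self.d1_0, self.d1_1, self.d1_2, self.d2_1, self.d2_2)


def storage_within_retrieval(alpha, beta):
    """Component-wise check alpha <= beta."""
    return all(a <= b for a, b in zip(alpha.as_tuple(), beta.as_tuple()))


def normalized_lower_bounds(alpha, beta, t):
    """Smallest (alpha_bar, beta_bar) allowed by
    2*t*alpha_bar >= sum of alpha components and
    4*t*beta_bar  >= 2*beta_1^(0) + remaining beta components."""
    alpha_bar = sum(alpha.as_tuple()) / (2 * t)
    beta_bar = (
        2 * beta.d1_0 + beta.d1_1 + beta.d1_2 + beta.d2_1 + beta.d2_2
    ) / (4 * t)
    return alpha_bar, beta_bar


# Hypothetical rate values, chosen only to exercise the formulas.
alpha = Rates(0.5, 0.25, 0.25, 0.5, 0.5)
beta = Rates(0.5, 0.5, 0.5, 0.75, 0.75)

feasible = storage_within_retrieval(alpha, beta)
alpha_bar, beta_bar = normalized_lower_bounds(alpha, beta, t=1)
print(feasible, alpha_bar, beta_bar)
```

Note that the rate ${\beta}_{1}^{\left(0\right)}$ enters the retrieval bound with weight 2, exactly as in the second inequality, so it is handled separately from the plain sum.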

#### 4.2. Outer Bounds

#### 4.3. Specialization of the Inner Bound

- The distribution factorizes as follows$${P}_{{V}_{1},{V}_{2},{X}_{0},{X}_{1},{X}_{2},{Y}_{1},{Y}_{2}}={P}_{{V}_{1},{V}_{2}}{P}_{{X}_{0}|{V}_{1},{V}_{2}}{P}_{{X}_{1}|{V}_{1},{V}_{2}}{P}_{{X}_{2}|{V}_{1},{V}_{2}}{P}_{{Y}_{1}|{V}_{1},{V}_{2}}{P}_{{Y}_{2}|{V}_{1},{V}_{2}};$$
- There exist deterministic functions ${f}_{1,1}$, ${f}_{1,2}$, ${f}_{2,1}$, and ${f}_{2,2}$ such that$$\begin{array}{c}{V}_{1}={f}_{1,1}({X}_{0},{X}_{1},{Y}_{1})={f}_{2,2}({X}_{0},{X}_{2},{Y}_{2}),\end{array}$$$$\begin{array}{c}{V}_{2}={f}_{1,2}({X}_{0},{X}_{1},{Y}_{2})={f}_{2,1}({X}_{0},{X}_{2},{Y}_{1});\end{array}$$
- The rates are chosen as$$\begin{array}{c}{\gamma}_{1}^{\left(0\right)}=I({V}_{1},{V}_{2};{X}_{0}),\phantom{\rule{0.166667em}{0ex}}{\gamma}_{1}^{\left(1\right)}=I({V}_{1},{V}_{2};{X}_{1}),\phantom{\rule{0.166667em}{0ex}}{\gamma}_{1}^{\left(2\right)}=I({V}_{1},{V}_{2};{X}_{2}),\end{array}$$$$\begin{array}{c}{\gamma}_{2}^{\left(1\right)}=I({V}_{1},{V}_{2};{Y}_{1}),\phantom{\rule{0.166667em}{0ex}}{\gamma}_{2}^{\left(2\right)}=I({V}_{1},{V}_{2};{Y}_{2}),\end{array}$$$$\begin{array}{c}{\beta}_{1}^{\left(0\right)}={\gamma}_{1}^{\left(0\right)},\phantom{\rule{0.166667em}{0ex}}{\beta}_{1}^{\left(1\right)}=I({V}_{1},{V}_{2};{X}_{1}|{X}_{0}),\phantom{\rule{0.166667em}{0ex}}{\beta}_{1}^{\left(2\right)}=I({V}_{1},{V}_{2};{X}_{2}|{X}_{0}),\end{array}$$$$\begin{array}{c}{\beta}_{2}^{\left(1\right)}=\max \left(I({V}_{1},{V}_{2};{Y}_{1}|{X}_{0},{X}_{1}),I({V}_{1},{V}_{2};{Y}_{1}|{X}_{0},{X}_{2})\right),\end{array}$$$$\begin{array}{c}{\beta}_{2}^{\left(2\right)}=\max \left(I({V}_{1},{V}_{2};{Y}_{2}|{X}_{0},{X}_{1}),I({V}_{1},{V}_{2};{Y}_{2}|{X}_{0},{X}_{2})\right);\end{array}$$
- The normalized average storage rate $\overline{\alpha}$ and retrieval rate $\overline{\beta}$ satisfy$$\begin{array}{cc}\hfill \phantom{\rule{1.em}{0ex}}& 2t\overline{\alpha}\ge {\alpha}_{1}^{\left(0\right)}+{\alpha}_{1}^{\left(1\right)}+{\alpha}_{1}^{\left(2\right)}+{\alpha}_{2}^{\left(1\right)}+{\alpha}_{2}^{\left(2\right)},\hfill \end{array}$$$$\begin{array}{cc}\hfill \phantom{\rule{1.em}{0ex}}& 4t\overline{\beta}\ge 2{\beta}_{1}^{\left(0\right)}+{\beta}_{1}^{\left(1\right)}+{\beta}_{1}^{\left(2\right)}+{\beta}_{2}^{\left(1\right)}+{\beta}_{2}^{\left(2\right)}.\hfill \end{array}$$
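
The rate expressions in this specialization are ordinary (conditional) mutual informations, so they can be evaluated numerically once a joint distribution is fixed. The sketch below does this for one pair, ${\gamma}_{1}^{\left(1\right)}$ and ${\beta}_{1}^{\left(1\right)}$, on a toy distribution that respects the conditional-independence factorization. Everything about the toy model is an assumption for illustration: a single uniform bit `V` stands in for the pair $({V}_{1},{V}_{2})$, and ${X}_{0}$, ${X}_{1}$ are taken to be independent noisy copies of it. The helper names are likewise hypothetical.

```python
import math
from itertools import product


def entropy(joint, idx):
    """Entropy in bits of the coordinates `idx` of a joint pmf
    given as {outcome_tuple: probability}."""
    marginal = {}
    for outcome, p in joint.items():
        key = tuple(outcome[i] for i in idx)
        marginal[key] = marginal.get(key, 0.0) + p
    return -sum(p * math.log2(p) for p in marginal.values() if p > 0)


def cond_mutual_information(joint, a, b, c=()):
    """I(A;B|C) = H(A,C) + H(B,C) - H(A,B,C) - H(C),
    where a, b, c are tuples of coordinate indices."""
    a, b, c = tuple(a), tuple(b), tuple(c)
    return (
        entropy(joint, a + c)
        + entropy(joint, b + c)
        - entropy(joint, a + b + c)
        - entropy(joint, c)
    )


def flip(crossover, same):
    """Probability that a noisy copy agrees (same=True) or disagrees."""
    return 1.0 - crossover if same else crossover


# Assumed toy joint pmf over (V, X0, X1): X0 and X1 are conditionally
# independent given V, mirroring the factorization in the first item.
joint = {
    (v, x0, x1): 0.5 * flip(0.1, x0 == v) * flip(0.2, x1 == v)
    for v, x0, x1 in product((0, 1), repeat=3)
}
V, X0, X1 = (0,), (1,), (2,)

gamma_1_1 = cond_mutual_information(joint, V, X1)      # I(V; X1)
beta_1_1 = cond_mutual_information(joint, V, X1, X0)   # I(V; X1 | X0)
gap = cond_mutual_information(joint, X0, X1)           # I(X0; X1)
print(gamma_1_1, beta_1_1, gap)
```

Under the assumed factorization, ${X}_{0}$, $V$, ${X}_{1}$ form a Markov chain, so the gap ${\gamma}_{1}^{\left(1\right)}-{\beta}_{1}^{\left(1\right)}$ equals $I({X}_{0};{X}_{1})$ and is non-negative; the code can be used to confirm this numerically for other toy parameters.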

## 5. Conclusions

## Author Contributions

## Funding

## Conflicts of Interest

$({\mathit{w}}_{1},{\mathit{w}}_{2})$ | ${\mathit{x}}_{0}=\left(00\right)$ | ${\mathit{x}}_{0}=\left(01\right)$ | ${\mathit{x}}_{0}=\left(10\right)$ | ${\mathit{x}}_{0}=\left(11\right)$ |
---|---|---|---|---|
$(00)$ | $1/2$ | $1/2$ | | |
$(10)$ | $(1-p)/2$ | $p$ | $(1-p)/2$ | |
$(01)$ | $(1-p)/2$ | $p$ | $(1-p)/2$ | |
$(11)$ | $1/2$ | $1/2$ | | |

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

