Next Article in Journal
Significance of Double Stratification in Stagnation Point Flow of Third-Grade Fluid towards a Radiative Stretching Cylinder
Next Article in Special Issue
Combination of Ensembles of Regularized Regression Models with Resampling-Based Lasso Feature Selection in High Dimensional Data
Previous Article in Journal
Scattered Data Interpolation and Approximation with Truncated Exponential Radial Basis Function
Previous Article in Special Issue
Two-Stage Classification with SIS Using a New Filter Ranking Method in High Throughput Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Estimation of Sensitive Attribute Applying Geometric Distribution under Probability Proportional to Size Sampling

1
Department of Children Welfare, Woosuk University, Wanju Jeonbuk 55338, Korea
2
Department of Computer Science, Dongshin University, Naju Jeonnam 58245, Korea
3
Department of Applied Statistics, Dongguk University, Gyeongju Gyeongbuk 38066, Korea
*
Author to whom correspondence should be addressed.
Mathematics 2019, 7(11), 1102; https://doi.org/10.3390/math7111102
Submission received: 29 August 2019 / Revised: 25 October 2019 / Accepted: 31 October 2019 / Published: 14 November 2019
(This article belongs to the Special Issue Uncertainty Quantification Techniques in Statistics)

Abstract

:
In this paper, we extended Yennum et al.’s model, in which geometric distribution is used as a randomization device for a population that consists of different-sized clusters, and clusters are obtained by probability proportional to size (PPS) sampling. Estimators of a sensitive parameter, their variances, and their variance estimators are derived under PPS sampling and equal probability two-stage sampling, respectively. We also applied these sampling schemes to Yennum et al.’s generalized model. Numerical studies were carried out to compare the efficiencies of the proposed sampling methods for each case of Yennum et al.’s model and Yennum et al.’s generalized model.

1. Introduction

The randomized response model (RRM) was suggested by [1] to estimate the true population proportion of sensitive characteristics, such as illegal gambling, drug-abuse, tax evasion, the extent of illegal income, and the experience of abortion, among others [2,3,4].
Since Warner’s work, many scholars have developed the RRM in various ways. In [5,6], they arranged, summarized, and systemized various RRMs and emphasized their importance. In [7], sampling survey of sensitive attributes applied two-stage cluster sampling to RRM for a population consisting of equal-sized clusters, and [8] considered the cluster RRM for a population consisting of different-sized clusters, where the clusters are selected by probability proportional to size (PPS) sampling.
Recently, Yennum et al. [9] suggested a new randomization device to gather sensitive data in two-stages under the assumption of geometric distribution and made a generalization of their model encompassing generalized geometric distribution using [10] model.
Based on Yennum et al.’s work, it is assumed that the respondents are selected by simple random sampling with replacements, but a real survey selects respondents from various sampling schemes.
Now, we can consider a large sample of clusters. For example, to estimate the true population proportion of drug-abuse among high school students, it is possible to use a randomization device like Yennum et al.’s model via proportional sampling by considering the primary sampling unit as the school and the secondary sampling unit as the students.
From this point of view, we extend Yennum et al.’s model, in which geometric distribution is used as a randomization device based on a population that consists of different-sized clusters, and the clusters are selected by PPS sampling. Estimators of a sensitive parameter, their variances, and their variance estimators are derived by PPS sampling and equal probability two-stage sampling, respectively.
We also apply these methods to the case of Yennum et al.’s generalized model. Numerical studies are carried out to compare the efficiencies of the suggested methods in each case of Yennum et al.’s model and Yennum et al.’s generalized model.

2. An Estimation of Sensitive Attributes with Probability Proportional to Size Sampling under Yennum et al.’s Model

In Section 2, we consider a new sampling scheme to estimate sensitive attributes using Yennum et al.’s model, in which geometric distribution is used as a randomization device when n clusters are selected with proportional to size (PPS) sampling or equal probability sampling from a population that consists of N clusters of size, M i ( i = 1 , 2 , , N ) and m i ( i = 1 , 2 , , n ) units are selected by simple random sampling from each sampled cluster.
In Section 2.1, we consider the sampling method for the clusters via PPS sampling with replacements. Clusters by PPS sampling without replacement are considered in Section 2.2, and clusters by equal probability sampling are examined in Section 2.3.

2.1. PPS Sampling with Replacement

Let the population be composed of N clusters. In the first stage, the size of the n sample of the first sampling units (FSU) is selected with replacement by the selection probability p i for the i th cluster. In the second stage, m i second sampling units (SSU) are drawn by simple random sampling with replacement (SRSWR) from each FSU and are guided to carry out Yennum et al.’s randomization device.
First of all, the randomization device consists of two elements. The first randomization device for the i th cluster consists of two kinds of urns with white and black balls, where the selection probability of a white ball is W i , and the selection probability of a black ball is 1 W i .
On the other hand, the second randomization device is composed of two kinds of urns with balls. The first device with balls contains a slip of paper including two statements, such as “I have a sensitive attribute” with selection probability P i , and the other balls includes a statement such as “I do not have a sensitive attribute” with selection probability 1 P i . The second device with balls contains a slip of paper with the statement “I do not have a sensitive attribute” with selection probability T i and balls with the statement “I have a sensitive attribute” with selection probability 1 T i .
In the first stage, for the i th cluster, each interviewee draws a ball from the first randomization device, such as the urn with the white and black balls. When he or she selects a white ball, he or she is guided to pick balls from the first urn of the second randomization device, one after another, with replacement, until the first ball containing a statement matching his or her own status appears.
We assume that X i 1 is the total number of balls drawn before he or she obtains the first ball including his or her own status in the i th cluster, and X i 2 is the total number of balls drawn before he or she obtains the first ball with his or her own status of not having a sensitive attribute in the i th cluster. Similarly, when he or she draws a black ball, he or she is guided to pick balls from the second urn of the second randomization device, one after another, with replacement, until the first ball containing a statement matching his or her own status appears.
For the i th cluster, using the randomization device in Figure 1, the total number of balls taken by interviewees X i 1 , X i 2 , Y i 1 , Y i 2 are distributed via generalized geometric distribution. Let π i and 1 π i be the true population proportion of persons who have a sensitive attribute A i and A i c for the i th cluster. Assume that each interviewee in the i th cluster is drawn by SRSWR.
For the i th cluster, the total number for each ball selected by interviewees through the proposed two-stage device distributes one of the following random variables: X i 1 G e ( P i ) , X i 2 G e ( 1 P i ) , Y i 1 G e ( T i ) and Y i 2 G e ( 1 T i ) , where G e ( · ) represents the geometric distribution with a success probability. Let π i and 1 π i be the true population proportions of persons who have a sensitive attribute ( A i and A i c , respectively) for the i th cluster. Assume that each interviewee in the i th cluster is drawn by SRSWR.
Let Z i j be the j th observed answer in the i th cluster; Z i j can be expressed as
Z i j = { X i 1 , w i t h   p r o b a b i l i t y   W i π i     Y i 2 , w i t h   p r o b a b i l i t y   ( 1 W i ) π i        X i 2 , w i t h   p r o b a b i l i t y   W i ( 1 π i )           Y i 1 , w i t h   p r o b a b i l i t y   ( 1 W i ) ( 1 π i )
Then, we can find the expected value of Z i j as follows:
E ( Z i j ) = π i [ W i P i + ( 1 W i ) ( 1 T i ) ] + ( 1 π i ) [ W i ( 1 P i ) + ( 1 W i ) T i ] = π i [ W i P i + ( 1 W i ) ( 1 T i ) W i ( 1 P i ) ( 1 W i ) T i ] + W i ( 1 P i ) + ( 1 W i ) T i .
The expected value (2) can be expressed as follows:
( 1 T i ) P i { E ( Z i j ) ( 1 P i ) T i W i T i ( 1 W i ) ( 1 P i ) } P i T i ( 1 P i ) ( 1 T i ) = π i ψ i P i T i ( 1 P i ) ( 1 T i ) ,
where ψ i = W i ( 1 2 P i ) T i ( 1 T i ) + ( 1 W i ) ( 2 T i 1 ) P i ( 1 P i ) .
Now the estimator π ^ i for the true population proportion π i in the i th cluster is given by:
π ^ i = 1 ψ i [ P i T i ( 1 P i ) ( 1 T i ) 1 m i i = 1 m i Z i j W i T i P i ( 1 T i ) P i ( 1 W i ) ( 1 P i ) ( 1 T i ) ] .
When the interviewees are drawn by SRSWR from the i th cluster selected with a replacement by the sampling probability p i , the estimator π ^ p p s w r of the true population proportion π for a sensitive character is given by:
π ^ p p s w r = 1 n M 0 i = 1 n M i π ^ i p i = 1 n M 0 i = 1 n M i p i ψ i [ P i T i ( 1 P i ) ( 1 T i ) 1 m i j = 1 m i Z i j W i T i P i ( 1 T i ) P i ( 1 W i ) ( 1 P i ) ( 1 T i ) ] ,
where M 0 = i = 1 N M i .
Theorem 1:
The estimator π ^ p p s w r of the true population proportion of a sensitive attribute π under PPS with a replacement sampling scheme is an unbiased estimator.
Proof: 
E 1 E 2 ( π ^ p p s w r ) = E 1 E 2 [ 1 n M 0 i = 1 n M i π ^ i p i ] = E 1 [ 1 n M 0 i = 1 n M i E 2 ( π ^ i ) p i ] ,
and since:
E 2 ( π ^ i ) = 1 ψ i [ P i T i ( 1 P i ) ( 1 T i ) 1 m i i = 1 m i E 2 ( Z i j ) W i T i P i ( 1 T i ) P i ( 1 W i ) ( 1 P i ) ( 1 T i ) ] = π i .
we can obtain:
E 1 E 2 ( π ^ p p s w r ) = E 1 [ 1 n M 0 i = 1 n M i π i p i ] = 1 M 0 i = 1 N p i M i π i p i = π .
 □
Theorem 2:
The variance of π ^ p p s w r is obtained from a two-stage procedure, such that a sample of size n FSU is selected by replacement with sampling probability p i for the unit i from the population of N clusters with size M i elements in the i th cluster, and the SSUs with size m i are drawn by SRSWR from each FSU, as given by:
V ( π ^ p p s w r ) = 1 n M 0 2 i = 1 N p i [ M i π i p i M 0 π ] 2 + 1 n M 0 2 i = 1 N M i 2 m i p i [ π i ( 1 π i ) π i ψ i 2 A i + 1 ψ i 2 B i ] ,
where:
A i = W i ( 1 2 P i ) ( 2 P i + P i 2 ) T i 2 ( 1 T i ) 2 + ( 1 W i ) ( 2 T i 1 ) ( 2 T i + T i 2 ) P i 2 ( 1 P i ) 2 + W i 2 T i 2 ( 1 T i ) 2 ( 2 P i 1 ) + ( 1 W i ) 2 P i 2 ( 1 P i ) 2 ( 1 2 T i ) + 2 W i ( 1 W i ) P i T i ( 1 P i ) ( 1 T i ) ( P i T i ) ,
B i = ( 1 T i ) 2 P i 2 { W i ( 1 W i ) ( P i + T i 1 ) 2 + W i P i T i 2 + ( 1 W i ) ( 1 T i ) ( 1 P i ) 2 } .
Proof: 
Given X i 1 G ( P i ) , X i 2 G ( 1 P i ) , Y i 1 G ( T i ) , Y i 2 G ( 1 T i ) , where G represents the geometric distribution with a success probability. Since the expected values of Z i j and Z i j 2 are
E ( Z i j ) = π i [ W i P i + ( 1 W i ) ( 1 T i ) ] + ( 1 π i ) [ W i ( 1 P i ) + ( 1 W i ) T i ] ,
then:
E ( Z i j 2 ) = π i [ W i ( 2 P i ) P i 2 ] + ( 1 π i ) [ W i ( 1 + P i ) ( 1 P i ) 2 ] + ( 1 π i ) ( 1 V i ) [ ( 2 T i ) T i 2 ] + π i ( 1 W i ) [ ( 1 + T i ) ( 1 T i ) 2 ] .
Based on (7) and (8), the variance of Z i j is:
σ i Z 2 = E ( Z i j 2 ) [ E ( Z i j ) ] 2 ,
and, since Z i j is independent, the variance of π ^ i can be expressed by:
V ( π ^ i ) = V [ 1 ψ i ( P i T i ( 1 P i ) ( 1 T i ) m i j = 1 m i Z i j W i T i P i ( 1 T i ) P i ( 1 W i ) ( 1 P i ) ( 1 T i ) ) ] = P i 2 T i 2 ( 1 P i ) 2 ( 1 T i ) 2 n h 2 ψ h 2 j = 1 m i σ i Z 2 = π i ( 1 π i ) m i + π i m i ψ i 2 A i + 1 m i ψ i 2 B i .
Since V ( π ^ p p s w r ) = V 1 E 2 ( π ^ p p s w r ) + E 1 V 2 ( π ^ p p s w r ) , then the first and second terms are given, respectively, as:
V 1 E 2 ( π ^ p p s w r ) = V 1 E 2 [ 1 n M 0 i = 1 n M i π ^ i p i ] = V 1 [ 1 n M 0 i = 1 n M i π i p i ] = 1 n M 0 2 i = 1 N p i [ M i π i p i M 0 π ] 2 ,
and
E 1 V 2 ( π ^ p p s w r ) = E 1 V 2 [ 1 n M 0 i = 1 n M i π ^ i p i ] = E 1 [ 1 ( n M 0 ) 2 i = 1 n M i 2 p i 2 V 2 ( π ^ i ) ] = E 1 [ 1 ( n M 0 ) 2 i = 1 n M i 2 p i 2 V 2 { 1 ψ i P i T i ( 1 P i ) ( 1 T i ) 1 m i i = 1 m i Z i j W i T i P i ( 1 T i ) P i ( 1 W i ) ( 1 P i ) ( 1 T i ) } ] = E 1 [ 1 ( n M 0 ) 2 M i 2 m i p i 2 { π i ( 1 π i ) + π i ψ i 2 A i + 1 ψ i 2 B i } ] .
Then, we can obtain the variance (10).
Moreover, an unbiased estimator of V ( π ^ p p s w r ) is given by
V ^ ( π ^ p p s w r ) = 1 n M 0 2 i = 1 n p i [ M i π ^ i p i M 0 π ^ p p s w r ] 2 + 1 n M 0 2 i = 1 n M i 2 p i ( m i 1 ) [ π ^ i ( 1 π ^ i ) π ^ i ψ i 2 A i + 1 ψ i 2 B i ] .
 □
If the FSUs are selected proportional to size with M i , then p i = M i / M 0 . For this reason, we call this method “probability proportional to size” (PPS) sampling. When a sample of the FSU is selected by PPS sampling with replacement via sampling probability, p i = M i / M 0 for the i th cluster, and m i SSU are selected by SRSWR from each FSU. The estimator π ^ p p s w r of π is given by:
π ^ p p s w r = 1 n i = 1 n π ^ i = 1 n i = 1 n 1 m i [ π i ( 1 π i ) + π i ψ i 2 A i + 1 ψ i 2 B i ] ,
and the variance of π ^ p p s w r and its estimator are as follows:
V ( π ^ p p s w r ) = 1 n M 0 i = 1 N M i ( π i π ) 2 + 1 n M 0 i = 1 N M i m i [ π i ( 1 π i ) + π i ψ i 2 A i + 1 ψ i 2 B i ] ,
V ^ ( π ^ p p s w r ) = 1 n M 0 i = 1 n M i ( π ^ i π ^ p p s w r ) 2 + 1 n M 0 i = 1 n M i m i 1 [ π ^ i ( 1 π ^ i ) + π ^ i ψ i 2 A i + 1 ψ i 2 B i ] .

2.2. The PPS without Replacement

In this subsection, we consider PPS sampling without replacement to estimate the true population proportion of a sensitive character by applying Yennum et al.’s model, in which n FSUs are drawn by PPS sampling without replacement from the population of N clusters with M i elementary units for the i th cluster, and m i SSUs are drawn by SRSWR from each FSU.
From this two-stage sampling, the estimator π ^ p p s w o r of π is:
π ^ p p s w o r = 1 M 0 i = 1 n M i π ^ i θ i ,
where θ i is the first inclusion probability for the i th cluster.
The variance of π ^ p p s w o r is given by:
V ( π ^ p p s w o r ) = 1 M 0 2 i = 1 N j > i N ( θ i θ j θ i j ) [ M i π i θ i M j π j θ j ] 2 + 1 M 0 2 i = 1 N M i 2 m i θ i [ π i ( 1 π i ) + π i ψ i 2 A i + 1 ψ i 2 B i ] ,
where θ i j is the second inclusion probability of the i th and j th clusters.
Furthermore, the variance estimator of π ^ p p s w o r is as follows:
V ^ ( π ^ p p s w o r ) = 1 M 0 2 i = 1 n j > i n ( θ i θ j θ i j ) θ i j [ M i π ^ i θ i M j π ^ j θ j ] 2 + 1 M 0 2 i = 1 n M i 2 θ i ( m i 1 ) [ π ^ i ( 1 π ^ i ) + π ^ i ψ i 2 A i + 1 ψ i 2 B i ] .

2.3. Two-Stage Equal Probability Sampling

In this subsection, we consider a two-stage equal probability sampling design to estimate the true population proportion of a sensitive characteristic by applying Yennum et al.’s model, in which n FSUs are drawn by simple random sampling without replacement (SRSWOR) from a population of N clusters with M i elementary units for the i th cluster, and m i SSUs are drawn by SRSWR from each FSU.
From this two-stage sampling, the estimator π ^ w r of π is given by:
π ^ w r = N n M 0 i = 1 n M i π ^ i ,
where π ^ i is an estimator of the true population proportion for a sensitive characteristic for the i th cluster, which is the same as (4).
The variance of π ^ w r and its estimator are given as:
V ( π ^ w r ) = N 2 n M 0 2 1 ( N 1 ) i = 1 N ( M i π i M ¯ π ) 2 + N n M 0 2 i = 1 N M i 2 m i [ π i ( 1 π i ) + π i ψ i 2 A i + 1 ψ i 2 B i ] ,
V ^ ( π ^ w r ) = N 2 n M 0 2 1 ( n 1 ) i = 1 n ( M i π ^ i M ¯ π ^ w r ) 2 + N n M 0 2 i = 1 n M i 2 m i 1 [ π ^ i ( 1 π ^ i ) + π ^ i ψ i 2 A i + 1 ψ i 2 B i ] ,
where M ¯ = M 0 / N .

3. An Estimation of Sensitive Attributes with Probability Proportional to Size Sampling Under Yennum et al.’s Generalized Model

We consider Yennum et al.’s generalized model, in which generalized geometric distribution is used as a randomization device when n clusters are sampled by PPS sampling or equal probability sampling from the population, which consists of N clusters with size M i ( i = 1 , 2 , , N ) , and m i ( i = 1 , 2 , , n ) units are drawn by simple random sampling from each sampled cluster.
We develop the sampling schemes for PPS sampling with replacement in Section 3.1 and those for PPS sampling without replacement in Section 3.2. Finally, equal probability sampling is presented in Section 3.3.

3.1. PPS Sampling with Replacement

Let the population be composed of N clusters. In the first stage, a sample of n FSUs is drawn by replacement with the sampling probability p i for the i th cluster. In the second stage, m i SSUs are selected by SRSWR from each FSU and guided to apply Yennum et al.’s generalized randomization device.
If the interviewees in the i th cluster choose a white ball during the first stage, and if they have a sensitive attribute A (or A c ), then they are guided to pick replacement balls from the first urn of the second stage device until they take k i 2 (or k i 1 ) successive balls with their actual status for the first time and are then asked to determine the total number of balls as X i 1 (or X i 2 ).
If the interviewee in the i th cluster draws a black ball in the first stage, and if they have a sensitive attribute A c (or A ), then they are guided to take replacement balls from the second urn of the second stage device until they take k i 2 (or k i 1 ) successive balls with their actual status for the first time and are then asked to determine the total number of balls as Y i 1 (or Y i 2 ).
For the i th cluster, using the randomization device in Figure 1, the total number of balls taken by interviewees X i 1 , X i 2 , Y i 1 , and Y i 2 are distributed via generalized geometric distribution. Let π i and 1 π i be the true population proportion of persons who have a sensitive attribute A and A c for the i th cluster. Assume that each interviewee in the i th cluster is drawn by SRSWR.
For the j th surveyed answer in the i th cluster, Z i j can be expressed as:
Z i j = { X i 1   with   probability W i π i , Y i 2   with   probability ( 1 W i ) π i , X i 2   with   probability W i ( 1 π i ) , Y i 1   with   probability ( 1 W i ) ( 1 π i ) , .
The expected value of Z i j is given by:
E ( Z i j ) = W i π i E ( X i 1 ) + π i ( 1 W i ) E ( Y i 2 ) + ( 1 π i ) W i E ( X i 2 ) + ( 1 W i ) ( 1 π i ) E ( Y i 1 ) = π i [ W i { 1 P i k i 1 ( 1 P i ) P i k i 1 1 ( 1 P i ) k i 2 P i ( 1 P i ) k i 2 } + ( 1 W i ) { 1 ( 1 T i ) k i 1 T i ( 1 T i ) k i 1 1 T i k i 2 ( 1 T i ) T i k i 2 } ]     + W i { 1 ( 1 P i ) k i 2 P i ( 1 P i ) k i 2 } + ( 1 W i ) { 1 T i k i 2 ( 1 T i ) T i k i 2 } .
Then, the formula (22) can be expressed as:
E ( Z i j ) W i { 1 ( 1 P i ) k i 2 P i ( 1 P i ) k i 2 } ( 1 W i ) { 1 T i k i 2 ( 1 T i ) T i k i 2 } = π i [ W i { 1 P i k i 1 ( 1 P i ) P i k i 1 1 ( 1 P i ) k i 2 P i ( 1 P i ) k i 2 } + ( 1 W i ) { 1 ( 1 T i ) k i 2 T i ( 1 T i ) k i 2 1 T i k i 1 ( 1 T i ) T i k i 1 } ] .
The estimator π ^ i G of the population proportion π i for the i th cluster is given by:
π ^ i G = ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P i ) k i 2 + 1 P i k i 1 + 1 m i φ i 2 ( j = 1 m i Z i j φ i 1 ) ,
where:
φ i 1 = W i { 1 ( 1 P i ) k i 2 } ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P i ) P i k i 1 + ( 1 W i ) ( 1 T i k i 2 ) P i k i 1 + 1 ( 1 P i ) k i 2 + 1 T i ( 1 T i ) k i 1 + 1 ,
and:
φ i 2 = W i [ ( 1 P i ) k i 1 P i T i k i 2 + 1 ( 1 T i ) k i 1 + 1 ( 1 P i ) k i 2 { 1 ( 1 P i ) k i 2 } P i k i 1 ( 1 P i ) T i k i 2 + 1 ( 1 T i ) k i 1 + 1 ] + ( 1 W i ) [ { 1 ( 1 T i ) k i 1 } P i k i 1 + 1 ( 1 T i ) ( 1 P i ) k i 2 + 1 T i k i 2 ( 1 T i k i 2 ) P i k i 1 + 1 T i ( 1 P i ) k i 2 + 1 ( 1 T i ) k i 1 ] .
When the interviewees are sampled by SRSWR for the i th cluster selected with a replacement by sampling probability p i , the estimator π ^ G p p s w r of the true population proportion π of a sensitive attribute is:
π ^ G p p s w r = 1 n M 0 i = 1 n M i π ^ i G p i = 1 n M 0 i = 1 n M i p i [ ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P i ) k i 2 + 1 P i k i 1 + 1 m i φ i 2 ( j = 1 m i Z i j φ i 1 ) ] ,
where M 0 = i = 1 N M i .
Theorem 3:
The estimator π ^ G p p s w r of the true population proportion π of a sensitive character is an unbiased estimator.
Proof: 
E 1 E 2 ( π ^ G p p s w r ) = E 1 E 2 [ 1 n M 0 i = 1 n M i π ^ i G p i ] = E 1 [ 1 n M 0 i = 1 n M i E 2 ( π ^ i G ) p i ] ,
and, since:
E 2 ( π ^ i G ) = E 2 [ ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P i ) k i 2 + 1 P i k i 1 + 1 m i φ i 2 ( j = 1 m i Z i j φ i 1 ) ] = ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P i ) k i 2 + 1 P i k i 1 + 1 m i φ i 2 ( j = 1 m i E 2 ( Z i j ) φ i 1 ) = π i ,
we can obtain:
E 1 E 2 ( π ^ G p p s w r ) = E 1 [ 1 n M 0 i = 1 n M i π i p i ] = 1 M 0 i = 1 N p i M i π i p i = π .
 □
Theorem 4:
The variance of π ^ G p p s w r is obtained by a two-stage sampling scheme, such that a sample of n FSU is selected with replacement by sampling probability p i for the i th cluster from the population of N clusters consisting of M i elements for the i th cluster, and m i SSUs are drawn by SRSWR from each FSU, as given by:
V ( π ^ G p p s w r ) = 1 n M 0 2 i = 1 N p i [ M i π i p i M 0 π ] 2 + 1 n M 0 2 i = 1 N M i 2 m i p i [ { ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P ) k i 2 + 1 P i k i 1 + 1 } 2 φ i 2 2 σ i Z 2 ] ,
where:
σ i Z 2 = E ( Z i j 2 ) ( E ( Z i j ) ) 2 = π i [ W i ( 1 ( 2 k i 1 + 1 ) ( 1 P i ) P i k i 1 P i 2 k i 1 + 1 + ( 1 P i k i 1 ) 2 ( 1 P i ) 2 P i 2 k i 1 ) + ( 1 W i ) ( 1 ( 2 k i 1 + 1 ) T i ( 1 T i ) k i 1 ( 1 T i ) 2 k i 1 + 1 + ( 1 ( 1 T i ) k i 1 ) 2 T i 2 ( 1 T i ) 2 k i 1 ) ] + ( 1 π i ) [ W i ( 1 ( 2 k i 2 + 1 ) P i ( 1 P i ) k i 2 ( 1 P i ) 2 k i 2 + 1 + ( 1 ( 1 P i k i 1 ) 2 ) P i 2 ( 1 P i ) 2 k i 2 ) + ( 1 W i ) ( 1 ( 2 k i 2 + 1 ) ( 1 T i ) T i k i 2 T i 2 k i 2 + 1 + ( 1 T i k i 2 ) 2 ( 1 T i ) 2 T i 2 k i 2 ) ] [ π i { W i ( 1 P i k i 1 ( 1 P i ) P i 2 k i 1 1 ( 1 P i ) k i 2 P i ( 1 P i ) k i 2 ) + ( 1 W i ) ( 1 ( 1 T i ) k i 1 T i ( 1 T i ) k i 1 1 T i k i 2 ( 1 T i ) T i k i 2 ) } + ( 1 π i ) { W i ( 1 ( 1 P i ) k i 2 P i ( 1 P i ) k i 2 ) + ( 1 W i ) ( 1 T i k i 2 ( 1 T i ) T i k i 2 ) } ] 2 .
Proof: 
The total number of balls taken by interviewees for the i th cluster, X i 1 , X i 2 , Y i 1 and Y i 2 , are random variables with variances:
V ( X i 1 ) = 1 ( 2 k i 1 + 1 ) ( 1 P i ) P i k i 1 P i 2 k i 1 + 1 ( 1 P i ) 2 P i 2 k i 1 ,
V ( X i 2 ) = 1 ( 2 k i 2 + 1 ) P i ( 1 P i ) 2 k i 2 ( 1 P i ) 2 k i 2 + 1 P i 2 ( 1 P i ) 2 k i 2 ,
V ( Y i 1 ) = 1 ( 2 k i 2 + 1 ) ( 1 T i ) T i k i 2 T i 2 k i 2 + 1 ( 1 T i ) 2 T i 2 k i 2 ,
V ( Y i 2 ) = 1 ( 2 k i 1 + 1 ) T i ( 1 T i ) k i 1 ( 1 T i ) 2 k i 1 + 1 T i 2 ( 1 T i ) 2 k i 1 .
From (21), to drive the variance of π ^ G p p s w r we can obtain the expected values of Z i j and Z i j 2 as follows:
E ( Z i j ) = π i [ W i ( 1 P i k i 1 ( 1 P i ) P i k i 1 ) + ( 1 W i ) ( 1 ( 1 T i ) k i 1 T i ( 1 T i ) k i 1 ) ] + ( 1 π i ) [ W i ( 1 ( 1 P i ) k i 2 P i ( 1 P i ) k i 2 ) + ( 1 W i ) ( 1 T i k i 2 ( 1 T i ) T i k i 2 ) ] ,
E ( Z i j 2 ) = π i [ W i E ( X i 1 2 ) + ( 1 W i ) E ( Y i 2 2 ) ] + ( 1 π i ) [ W i E ( X i 2 2 ) + ( 1 W i ) E ( Y i 2 2 ) ] = π i [ W i ( 1 ( 2 k i 1 + 1 ) ( 1 P i ) P i k i 1 P i 2 k i 1 + 1 + ( 1 P i k i 1 ) 2 ( 1 P i ) 2 P i 2 k i 1 ) + ( 1 W i ) ( 1 ( 2 k i 1 + 1 ) T i ( 1 T i ) k i 1 ( 1 T i ) 2 k i 1 + 1 + { 1 ( 1 T i ) k i 1 } 2 T i 2 ( 1 T i ) 2 k i 1 ) ] + ( 1 π i ) [ W i ( 1 ( 2 k i 2 + 1 ) P i ( 1 P i ) k i 2 ( 1 P i ) 2 k i 2 + 1 + { 1 ( 1 P i k i 2 ) 2 } P i 2 ( 1 P i ) 2 k i 2 ) + ( 1 W i ) ( 1 ( 2 k i 2 + 1 ) ( 1 T i ) T i k i 2 T i 2 k i 2 + 1 + ( 1 T i k i 2 ) 2 ( 1 T i ) 2 T i 2 k i 2 ) ] .
Since V ( π ^ G p p s w r ) = V 1 E 2 ( π ^ G p p s w r ) + E 1 V 2 ( π ^ G p p s w r ) ,
V 1 E 2 ( π ^ G p p s w r ) = V 1 E 2 [ 1 n M 0 i = 1 n M i π ^ i G p i ] = V 1 [ 1 n M 0 i = 1 n M i π i p i ] = 1 n M 0 2 i = 1 N p i [ M i π i G p i M 0 π ] 2 ,
and:
E 1 V 2 ( π ^ G p p s w r ) = E 1 V 2 [ 1 n M 0 i = 1 n M i π ^ i G p i ] = E 1 [ 1 ( n M 0 ) 2 i = 1 n M i 2 p i 2 V 2 ( π ^ i G ) ] = E 1 [ 1 ( n M 0 ) 2 i = 1 n M i 2 p i 2 V 2 { ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P i ) k i 2 + 1 P i k i 1 + 1 m i φ i 2 ( j = 1 m i Z i j φ i 1 ) } ] = E 1 [ 1 ( n M 0 ) 2 i = 1 n M i 2 p i 2 1 m i { { ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P i ) k i 2 + 1 P i k i 1 + 1 } 2 φ i 2 2 σ i Z 2 } ] = 1 n M 0 2 i = 1 N M i 2 m i p i [ { ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P i ) k i 2 + 1 P i k i 1 + 1 } 2 φ i 2 2 σ i Z 2 ] .
We can then obtain the variance (28). Also, an unbiased estimator of V ( π ^ G p p s w r ) is given by:
V ^ ( π ^ G p p s w r ) = 1 n M 0 2 i = 1 n p i [ M i π ^ i G p i M 0 π ^ G p p s w r ] 2 + 1 n M 0 2 i = 1 n M i 2 p i ( m i 1 ) [ { ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P ) k i 2 + 1 P i k i 1 + 1 } 2 φ i 2 2 σ ^ i Z 2 ] .
 □

3.2. PPS Sampling Without Replacement

In this subsection, we consider PPS sampling without replacement to estimate the true population proportion of a sensitive characteristic by applying Yennum et al.’s generalized model, in which n FSUs are drawn by PPS sampling without replacement from a population of N clusters with M i elementary units for the i th cluster, and m i SSUs are drawn by SRSWR from each FSU.
From this procedure, the estimator π ^ G p p s w o r of π is given by:
π ^ G p p s w o r = 1 M 0 i = 1 n M i π ^ i G θ i ,
where θ i is the first inclusion probability for the i th cluster.
The variance of π ^ G p p s w o r is given by:
V ( π ^ G p p s w o r ) = 1 M 0 2 i = 1 N j > i N ( θ i θ j θ i j ) [ M i π i θ i M j π j θ j ] 2 + 1 M 0 2 i = 1 N M i 2 m i θ i [ { ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P ) k i 2 + 1 P i k i 1 + 1 } 2 φ i 2 2 σ i Z 2 ] ,
where θ i j is the second inclusion probability for i th and j th clusters.
Also, the variance estimator of π ^ G p p s w o r is:
V ^ ( π ^ G p p s w o r ) = 1 M 0 2 i = 1 n j > i n ( θ i θ j θ i j ) θ i j [ M i π ^ i G θ i M j π ^ j G θ j ] 2 + 1 M 0 2 i = 1 n M i 2 θ i ( m i 1 ) [ { ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P ) k i 2 + 1 P i k i 1 + 1 } 2 φ i 2 2 σ ^ i Z 2 ] .

3.3. Two-Stage Equal Probability Sampling

In this subsection, we consider a two-stage equal probability sampling scheme to estimate the true population proportion of a sensitive attribute by applying Yennum et al.’s generalized model, in which n FSUs are drawn by SRSWOR from a population of N clusters consisting of M i elementary units for the i th cluster, and m i SSUs are drawn by SRSWR from each FSU.
From this procedure, the estimator π ^ G w r of the true population proportion π for a sensitive attribute is given by:
π ^ G w r = N n M 0 i = 1 n M i π ^ i G ,
where the estimator π ^ i G is the estimator of a sensitive characteristic of the i th cluster, which is the same as (24).
The variance and variance estimator of π ^ G w r are:
V ( π ^ G w r ) = N 2 n M 0 2 i = 1 N 1 N 1 [ M i π i M ¯ π ] 2 + N n M 0 2 i = 1 N M i 2 m i [ { ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P ) k i 2 + 1 P i k i 1 + 1 } 2 φ i 2 2 σ i Z 2 ] ,
and:
V ^ ( π ^ G w r ) = N 2 n M 0 2 i = 1 n 1 n 1 ( M i π ^ i G M ¯ π ^ G w r ) 2 + N n M 0 2 i = 1 N M i 2 m i 1 [ { ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P ) k i 2 + 1 P i k i 1 + 1 } 2 φ i 2 2 σ ^ i Z 2 ] ,
respectively, where M ¯ = M 0 / N .

4. Efficiency Comparisons

4.1. PPSWR Sampling versus Equal Probability Two-Stage Sampling in Yennum et al.’s Model

If we assume N 1 N , then the difference between the variance of equal probability two-stage sampling, (19), and the variance of PPS with replacement sampling, (6), is given by:
V ( π ^ w r ) V ( π ^ p p s w r ) = 1 n M 0 M ¯ [ i = 1 N ( M i M ¯ ) 2 π i 2 + M ¯ { i = 1 N ( M i M ¯ ) ( π i 2 π 2 ) } + i = 1 N ( M i M ¯ ) 2 1 m i ( π i ( 1 π i ) + π i ψ i 2 A i + 1 ψ i 2 B ) + M ¯ { i = 1 N ( M i M ¯ ) 1 m i ( π i ( 1 π i ) + π i ψ i 2 A i + 1 ψ i 2 B ) } ] .
In (43), we can see that V ( π ^ w r ) = V ( π ^ p p s w r ) under the condition M i = M ¯ = M 0 / N ; i.e., if the cluster sizes are equal, the selection probabilities of the PPS with replacement sampling are all N 1 and equal to those of equal probability two-stage replacement sampling.
If the size of a cluster, M i is significantly different, then i = 1 N ( M i M ¯ ) 2 π i 2 , the first term on the right side of (43), has large values, and the second term, i = 1 N ( M i M ¯ ) 2 ( π i 2 π 2 ) , has relatively small values. Hence, the estimation by PPS with replacement sampling is more efficient than that by equal probability two-stage replacement sampling.
We used the relative efficiency (RE) to compare the efficiency of the two sampling methods—PPS with replacement sampling and equal probability two-stage replacement sampling:
R E 1 = V ( π ^ w r ) V ( π ^ p p s w r ) × 100 ( % ) .
Values of R E 1 over 100% indicate that the estimator obtained by the PPS with the replacement sampling method was more efficient than the estimator obtained by the equal probability two-stage replacement sampling.
In calculating REs, we set the parameters as follows:
M 0 = 10 , 000 , M 1 = 1 , 000 , M 2 = 2 , 000 , M 3 = 3 , 000 , M 4 = 4 , 000 m 0 = 1 , 000 , m 1 = 100 , m 2 = 200 , m 3 = 300 , m 4 = 400 , p 1 = 0.235 , p 2 = 0.441 , p 3 = 0.609 , p 4 = 0.715 .
From Table 1, when the selection probability W for the first-stage randomization device increased from 0.1 to 0.9 by 0.2 and the second stage randomization devices T increased from 0.6 to 0.8 by 0.1 and P from 0.65 to 0.90 by 0.05, REs increase under the fixed proportion of a sensitive attribute (particularly when the selection probability of the second randomization device T increased), and the RE increased according to the conditions of P and π i .
On the other hand, RE increased when the first-stage selection probability W was less than 0.5, and the values of T , P , and π i (from 0.1 to 0.4) decreased, but the RE decreased when the value of W was greater than 0.5 under a fixed value for T , P , and π i .
Furthermore, the greater the true population proportion of a sensitive attribute π i , the higher the overall efficiency of Yennum et al.’s model, as shown by the values of the bottom cells in Table 1. This result agrees with the typical sampling survey methodology as the true population proportion of a sensitive attribute π i increases.

4.2. PPSWR Sampling versus Equal Probability Two-Stage Sampling in Yennum et al.’s Generalized Model

If we assume N 1 N , then the difference between the variance of equal probability two-stage sampling scheme (41) and the variance of the PPS with replacement sampling scheme (28) is given by:
V ( π ^ G w r ) V ( π ^ G p p s w r ) = 1 n M 0 M ¯ [ i = 1 N ( M i M ¯ ) 2 π i 2 + M ¯ { i = 1 N ( M i M ¯ ) ( π i 2 π 2 ) } + i = 1 N ( M i M ¯ ) 2 1 m i [ { ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P ) k i 2 + 1 P i k i 1 + 1 } 2 φ i 2 2 σ i Z 2 ] + M ¯ { i = 1 N ( M i M ¯ ) 1 m i ( { ( 1 T i ) k i 1 + 1 T i k i 2 + 1 ( 1 P ) k i 2 + 1 P i k i 1 + 1 } 2 φ i 2 2 σ i Z 2 ) } ] .
In (44), we can see that V ( π ^ G w r ) = V ( π ^ G p p s w r ) under the condition M i = M ¯ = M 0 / N , i.e., if the cluster sizes are equal, the selection probabilities of the PPS with replacement sampling are all N 1 and equal to those of the equal probability two-stage replacement sampling.
If cluster sizes, M i , were significantly different, then i = 1 N ( M i M ¯ ) 2 π i 2 , the first term of the right-hand side in (44), had large values, and the second term, i = 1 N ( M i M ¯ ) 2 ( π i 2 π 2 ) , had relatively small values. Hence, the estimation by PPS with replacement sampling is more efficient than that by equal probability two-stage replacement sampling.
We used the relative efficiency (RE) to compare the efficiency of the two sampling designs (PPS with replacement sampling and equal probability two-stage replacement sampling):
R E 2 = V ( π ^ G w r ) V ( π ^ G p p s w r ) × 100 ( % )
Values of R E 2 over 100% indicate that the estimator obtained by PPS with the replacement sampling method was more efficient than the estimator obtained by equal probability two-stage replacement sampling.
Table 2 shows the results of the REs obtained by increasing the true population proportion π i from 0.1 to 0.4 by 0.1. The selection probabilities of the randomized response model ( W , T and P ) are shown in Section 4.1.
In calculating the REs, we set the parameters as follows:
M 0 = 10 , 000 , M 1 = 1 , 000 , M 2 = 2 , 000 , M 3 = 3 , 000 , M 4 = 4 , 000 m 0 = 1 , 000 , m 1 = 100 , m 2 = 200 , m 3 = 300 , m 4 = 400 , p 1 = 0.235 , p 2 = 0.441 , p 3 = 0.609 , p 4 = 0.715 , k 1 = 2 , k 2 = 1 .
From the results of Table 2, the efficiencies vary according to changes in the probabilities of selection during the first stage W and the second stage T and P in the randomization device, but when the first-stage selection probability W is fixed, and the second-stage selection probabilities T and P increase, then the relative efficiency of the PPS sampling is better than that of the equal probability two-stage sampling in Yennum et al.’s model.

5. Conclusions

We extended Yennum et al.’s model, in which geometric distribution is used as a randomization device for a population consisting of different-sized clusters, and clusters are selected by PPS sampling. Estimators for the true population proportion of a sensitive attribute, their variances, and their variance estimators are derived under PPS sampling and equal probability two-stage sampling.
We also applied these sampling designs to the case of Yennum et al.’s generalized model. Numerical studies were carried out to compare the efficiencies of the proposed methods in each case of Yennum et al.’s model and Yennum et al.’s generalized model in cases with a replacement.
Although the experiments were assumed to use a replacement, we expected similar results for a case without replacement, as per typical sampling theory.
From the numerical study, we found that the efficiency of the two-stage sampling for probability proportional to size depends on the given parameter values, but the efficiency of Yennum et al.’s generalized model is preferred for most combinations of parameters over around 80%.

Author Contributions

Conceptualization, G.-S.L.; methodology, C.-K.S.; writing—original draft preparation, K.-H.H.; writing—review and editing, C.-K.S.; project administration and funding acquisition, G.-S.L.

Funding

This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2018R1D1A3B07044007).

Acknowledgments

We would like to thank the anonymous reviewers for their very careful reading and valuable comments/suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Warner, S.L. Randomized response: A survey technique for eliminating evasive answer bias. J. Am. Stat. Assoc. 1965, 60, 63–69. [Google Scholar] [CrossRef] [PubMed]
  2. Cochran, W.G. Sampling Techniques, 3rd ed.; John Wiley and Sons: New York, NY, USA, 1977. [Google Scholar]
  3. Fox, J.A.; Tracy, P.E. Randomized Response: A Method for Sensitive Survey; Sage Publications: Newbury Park, CA, USA, 1986. [Google Scholar]
  4. Kuk, A.Y.C. Asking sensitive questions indirectly. Biometrika 1990, 77, 436–438. [Google Scholar] [CrossRef]
  5. Chaudhuri, A.; Mukerjee, R. Randomized Response: Theory and Techniques; Marcel Dekker Inc.: New York, NY, USA, 1988. [Google Scholar]
  6. Ryu, J.B.; Hong, K.H.; Lee, G.S. Randomized Response Model; Freedom Academy: Seoul, Korean, 1993. [Google Scholar]
  7. Lee, G.S.; Hong, K.H. Randomized response model by two-stage cluster sampling. Korean Commun. Stat. 1998, 5, 99–105. [Google Scholar]
  8. Lee, G.S. A Study on the Randomized Response Technique by PPS Sampling. Korean J. Appl. Stat. 2006, 19, 69–80. [Google Scholar]
  9. Yennum, N.Y.; Sedory, S.A.; Singh, S. Improved strategy to collect sensitive data by using geometric distribution as a randomization device. Commun. Stat. Theory Methods 2019, 48, 5777–5795. [Google Scholar] [CrossRef]
  10. Hussain, Z.; Shabbir, J.; Pervez, Z.; Shah, S.F.; Khan, M. Generalized geometric distribution of order k: A flexible choice to randomize the response. Commun. Stat. Simul. Comput. 2017, 46, 4708–4721. [Google Scholar] [CrossRef]
Figure 1. Randomization device for the i th cluster.
Figure 1. Randomization device for the i th cluster.
Mathematics 07 01102 g001
Table 1. The relative efficiencies (REs) of a sensitive estimator between the probability proportional to size (PPS) sampling with replacement and the equal probability two-stage sampling with replacement in Yennum et al.’s model to change π i and W .
Table 1. The relative efficiencies (REs) of a sensitive estimator between the probability proportional to size (PPS) sampling with replacement and the equal probability two-stage sampling with replacement in Yennum et al.’s model to change π i and W .
π i W
T0.10.30.50.70.9
P0.60.70.80.60.70.80.60.70.80.60.70.80.60.70.8
0.10.6556.5995.07123.548.0861.0691.7152.7346.1855.9763.1852.5939.6375.5171.0861.88
0.754.4289.171205054.6181.6958.6548.2848.1771.0160.8545.0883.8880.272.61
0.7552.9381.67114.853.6151.2670.4864.7653.5645.3477.6769.0854.1390.3387.4681.68
0.852.6172.72106.458.1551.6359.970.3260.3748.2883.0176.3864.2995.293.0888.97
0.8553.8463.6793.1762.8455.353.4875.0267.3755.7987.1782.4873.9198.8797.4194.69
0.956.5757.6574.2767.1561.0554.4778.7973.865.5290.3287.4182.23101.6100.799.15
0.20.6582.74134.4153.150.2292.87130.860.2948.4486.0590.6461.6934.18117.4108.989.02
0.775.82129.7151.451.7876.68121.877.7348.2865.5108.784.9743.35130.7125.1111.8
0.7568.56122.7148.760.9761.93108.494.1562.3348.62121.2104.667.95139135.4127
0.862.59111.6144.174.3456.0588.85107.282.1348.28129.7118.994144.4142.1137
0.8561.0494.5713587.9364.3266.55116.7100.468.61135.4128.9114.4148146.5143.6
0.966.674.23114.399.482.0261.52123.5114.495.72139.3135.8128.5150.4149.6148.1
0.30.65106.8152.4164.954.34117.7149.270.4153.88109.911974.1331.19148.1139.5115.9
0.798.24149.4163.953.6499.37142.7100.648.2885.26139.9111.142.92159154.4141.5
0.7587.63144.7162.470.2776.57132.312473.9255.9151.7135.485.58164.8162.2155.2
0.876.17136.6159.994.7961.53114139.2107.448.28158.6149.6122.7168.2166.6162.8
0.8569.9121.6154.7116.575.4783.46148.7131.985.14162.8158145.2170.2169.4167.4
0.978.8394.88140.8131.9107.769.61154.7146.9126.4165.4163.1157.6171.6171.1170.2
0.40.65124.6162.1171.259.97134.4159.382.2961.29126.3141.788.1430.06166.3159.3136.7
0.7116.5160170.555.58117.6154.6122.648.28102.2160.3133.343.6173.7170.4159.7
0.75105.4156.9169.681.2591.92146.7147.487.3565.18169.2156.3103.3177.3175.5170.3
0.891.07151.5168116.667.99131.8160.8130.848.28173.8167.6144.2179.1178.2175.5
0.8580.35140.6164.9142.388.79100.6168.2155.2103.2176.4173.5164180.2179.7178.4
0.993.87116156.4157133.778.99172.3167.2150.6178176.6173.2180.9180.6180.1
Table 2. The REs for a sensitive estimator between the PPS with replacement sampling and equal probability two-stage sampling with replacement in Yennum et al.’s generalized model for changing π i and W .
Table 2. The REs for a sensitive estimator between the PPS with replacement sampling and equal probability two-stage sampling with replacement in Yennum et al.’s generalized model for changing π i and W .
π i W
T0.10.30.50.70.9
P0.60.70.80.60.70.80.60.70.80.60.70.80.60.70.8
0.10.65171.5172.8172.8163166.9167.9145.4155.4159.1104.1126.5138.848.1752.1568.27
0.7167.5169.7170.1152.7160.2163.1119.1139.1148.957.8583.0911178.9876.6358.68
0.75162.2165.9167135.6150.2156.779111133.154.1848.2269.55116.3124.4117.6
0.8154.2160.9163.3106.2133.1147.248.7468.09105.688.176.950.14136.5148.1152.8
0.85139.8153.1158.564.6399.62130.164.1550.6560.85116.4120.1102.3147159.4168.3
0.9108.4136150.349.9451.589.0196.8495.3869.51133144.7150.1152.7165.1175.3
0.20.65180.9181.1181177178.1178.3168.4172173.1140.7154.216048.3959.2587.17
0.7178.9179.5179.5172.1174.6175.5152.4162.4166.875.03112.8137.8120.511171.17
0.75176.4177.5177.8163.1169.2171.8113.3142.1156.167.2848.4788.43159.6161.5149
0.8172.6174.9175.8143.4159.1166.150.391.74133.4132.9110.752.83171.6175.4173.9
0.85165.8171173.191.83133.9154.794.4555.7474.87160159135.4176.4180.4182.1
0.9147.3162.3168.555.0958.5118.4145.2139.794.09170.3174.6173.7178.6182.6185.2
0.30.65184184183.9181.6182182176.2178178.5157.3165.5168.948.6866.76101.4
0.7182.8183182.9178.5179.7180.2165.6171.417491.45130.8151.3145.6132.981.96
0.75181.2181.7181.8172.9176.1177.6134.5156.5166.181.8948.76102.6174.4173.9161.9
0.8178.9180180.4159.7169.3173.652.38109.5147.9155.9132.155.54181.2182.4180.1
0.85174.8177.5178.6114.3150.9165.5120.461.686.84174.9172.4151.2183.6185.3185.4
0.9163.6172.1175.662.6666.66135.6166.4160.1112.1180.7182.3180.3184.7186.4187.3
0.40.65185.5185.5185.4183.9184.1184180181181.3166.1171.6173.849.0474.03112.4
0.7184.7184.8184.7181.7182.4182.6172.3176.1177.9105.3142.5159.4159.6146.491.15
0.75183.5183.8183.9177.7179.7180.7147.7164.5171.795.4849.07113.7180.5179.2168.8
0.8181.9182.6182.9168.2174.7177.754.79122.6156.8167.6145.458.24184.8185.2182.8
0.85179.2180.8181.6130.6160.5171.4138.767.6197.01181178.3160.1186.3187186.7
0.9171.7177.1179.371.7675146.8176170.1125184.7185.3183.2186.9187.8188.1

Share and Cite

MDPI and ACS Style

Lee, G.-S.; Hong, K.-H.; Son, C.-K. An Estimation of Sensitive Attribute Applying Geometric Distribution under Probability Proportional to Size Sampling. Mathematics 2019, 7, 1102. https://doi.org/10.3390/math7111102

AMA Style

Lee G-S, Hong K-H, Son C-K. An Estimation of Sensitive Attribute Applying Geometric Distribution under Probability Proportional to Size Sampling. Mathematics. 2019; 7(11):1102. https://doi.org/10.3390/math7111102

Chicago/Turabian Style

Lee, Gi-Sung, Ki-Hak Hong, and Chang-Kyoon Son. 2019. "An Estimation of Sensitive Attribute Applying Geometric Distribution under Probability Proportional to Size Sampling" Mathematics 7, no. 11: 1102. https://doi.org/10.3390/math7111102

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop