DOMAIN ESTIMATION

Let $D$ be a domain of interest within a population $U=(1, \ldots, i, \ldots, N)$, and let $N_D$ be the unknown size of $D$. Let a sample $s$ of size $n$ be drawn from $U$ with probability $p(s)$ according to a design $p$ admitting positive inclusion probabilities $\pi_i, \pi_{ij}$. For $i=1,2, \ldots, N$, let
$$\begin{aligned} I_{Di} &= 1\ (0) \quad \text{if} \quad i \in D\ (i \notin D), \\ Y_{Di} &= Y_i\ (0) \quad \text{if} \quad i \in D\ (i \notin D). \end{aligned}$$
Then the unknown domain size, total, and mean are, respectively,
$$N_D=\sum_1^N I_{Di}, \quad T_D=\sum_1^N Y_{Di} \quad \text{and} \quad \bar{T}_D=\frac{T_D}{N_D}.$$
In analogy to $\underline{Y}=\left(Y_1, \ldots, Y_i, \ldots, Y_N\right)^{\prime}$ we write $\underline{I}_D=\left(I_{D1}, \ldots, I_{Di}, \ldots, I_{DN}\right)^{\prime}$ and $\underline{Y}_D=\left(Y_{D1}, \ldots, Y_{Di}, \ldots, Y_{DN}\right)^{\prime}$. Then, corresponding to any estimator $t=t(s, \underline{Y})=\hat{Y}$ for $Y=\sum_1^N Y_i$, we may immediately choose estimators for $N_D$ and $T_D$, respectively,
$$\widehat{N}_D=t\left(s, \underline{I}_D\right) \quad \text { and } \quad \widehat{T}_D=t\left(s, \underline{Y}_D\right) .$$
It is then natural to take, as the estimator $\widehat{\bar{T}}_D$ for the domain mean $\bar{T}_D$, the ratio
$$\widehat{\bar{T}}_D=\frac{\widehat{T}_D}{\widehat{N}_D}.$$
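The construction above can be illustrated with a minimal sketch. It assumes, purely for illustration, that the generic estimator $t$ is the Horvitz-Thompson estimator $\sum_{i \in s} Y_i / \pi_i$; the text leaves $t$ unspecified. The function names `ht_total` and `domain_estimates` and the sample values are hypothetical.

```python
from typing import List, Sequence, Tuple


def ht_total(values: Sequence[float], incl_probs: Sequence[float]) -> float:
    """Horvitz-Thompson estimate of a total: sum of y_i / pi_i over the sample.

    Assumption: t(s, Y) is taken to be the Horvitz-Thompson estimator.
    """
    return sum(y / p for y, p in zip(values, incl_probs))


def domain_estimates(
    y: Sequence[float],
    in_domain: Sequence[bool],
    incl_probs: Sequence[float],
) -> Tuple[float, float, float]:
    """Return estimates of (N_D, T_D, domain mean) from the sampled units."""
    # I_Di = 1 if unit i is in D, else 0
    i_d: List[float] = [1.0 if d else 0.0 for d in in_domain]
    # Y_Di = Y_i if unit i is in D, else 0
    y_d: List[float] = [yi if d else 0.0 for yi, d in zip(y, in_domain)]

    n_hat = ht_total(i_d, incl_probs)  # t(s, I_D)
    t_hat = ht_total(y_d, incl_probs)  # t(s, Y_D)
    # The ratio is undefined when no sampled unit falls in the domain.
    mean_hat = t_hat / n_hat if n_hat > 0 else float("nan")
    return n_hat, t_hat, mean_hat


# Hypothetical sample of n = 4 units from N = 20 under simple random
# sampling without replacement, so that pi_i = n / N = 0.2 for every unit.
y = [10.0, 12.0, 7.0, 9.0]
in_domain = [True, False, True, False]
pi = [0.2, 0.2, 0.2, 0.2]
n_hat, t_hat, mean_hat = domain_estimates(y, in_domain, pi)
```

With these hypothetical numbers the two sampled domain units each represent five population units, so the estimated domain size is 10, the estimated domain total is 85, and the estimated domain mean is 8.5, up to floating-point rounding.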

POSTSTRATIFICATION

Suppose a finite population $U=(1, \ldots, i, \ldots, N)$ of $N$ units consists of $L$ post-strata of known sizes $N_h, h=1, \ldots, L$, but unknown compositions, with respective post-strata totals $Y_h=\sum_{i=1}^{N_h} Y_{hi}$ and means $\bar{Y}_h=Y_h / N_h, h=1, \ldots, L$. Let a simple random sample $s$ of size $n$ be drawn from $U$, yielding the sample configuration $\underline{n}=\left(n_1, \ldots, n_h, \ldots, n_L\right)$, where $n_h(\geq 0)$ is the number of units of $s$ coming from the $h$th post-stratum, $h=1, \ldots, L$, and $\sum_{h=1}^L n_h=n$. To estimate $\bar{Y}=\sum_h W_h \bar{Y}_h$, where $W_h=\frac{N_h}{N}, h=1, \ldots, L$, we proceed as follows. Let $I_h=1\ (0)$ if $n_h>0\ \left(n_h=0\right)$. Then,
$$E\left(I_h\right)=\operatorname{Prob}\left(I_h=1\right)=1-\binom{N-N_h}{n} \Big/ \binom{N}{n}, \quad h=1, \ldots, L .$$
For $\bar{Y}$ a reasonable estimator may be taken as
$$t_{pst}=t_{pst}(\underline{Y})=\frac{\sum_h W_h \bar{y}_h I_h / E\left(I_h\right)}{\sum_h W_h I_h / E\left(I_h\right)},$$
where $\bar{y}_h$ is the mean of the $n_h$ sampled units belonging to the $h$th post-stratum if $n_h>0$; if $n_h=0$, then $\bar{y}_h$ is taken as $\bar{Y}_h$. It follows that $a=\sum_h W_h \bar{y}_h I_h / E\left(I_h\right)$ is an unbiased estimator for $\bar{Y}$ and $b=\sum_h W_h I_h / E\left(I_h\right)$ is an unbiased estimator for 1. Yet, instead of taking just $a$ as an unbiased estimator for $\bar{Y}$, the biased estimator of the ratio form $\frac{a}{b}$ is proposed by Doss, Hartley and Somayajulu (1979) because it has the following linear invariance property, which is not shared by $a$ itself:

Assume $Y_i=\alpha+\beta Z_i$; then $\bar{y}_h=\alpha+\beta \bar{z}_h$ and $t_{pst}(\underline{Y})=\alpha+\beta t_{pst}(\underline{Z})$, with obvious notations. Further properties of $t_{pst}$ have been investigated by Doss et al. (1979) but are too complicated to merit further discussion here.
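As an illustration of the estimator $t_{pst}$ and its linear invariance, the following is a minimal sketch under simple random sampling without replacement. The function names `prob_nonempty` and `t_pst`, the zero-based stratum labels, and the data are hypothetical and chosen only for this example.

```python
from math import comb
from typing import Sequence


def prob_nonempty(N: int, N_h: int, n: int) -> float:
    """E(I_h) = 1 - C(N - N_h, n) / C(N, n) under simple random sampling.

    math.comb returns 0 when n > N - N_h, so the probability is then 1.
    """
    return 1.0 - comb(N - N_h, n) / comb(N, n)


def t_pst(
    sample_y: Sequence[float],
    sample_stratum: Sequence[int],
    stratum_sizes: Sequence[int],
) -> float:
    """Ratio-form post-stratified estimate a / b of the population mean.

    sample_stratum[k] is the (zero-based, hypothetical) post-stratum label
    of the k-th sampled unit; stratum_sizes[h] is the known size N_h.
    """
    N = sum(stratum_sizes)
    n = len(sample_y)
    a = 0.0  # sum of W_h * ybar_h * I_h / E(I_h)
    b = 0.0  # sum of W_h * I_h / E(I_h)
    for h, N_h in enumerate(stratum_sizes):
        ys = [y for y, s in zip(sample_y, sample_stratum) if s == h]
        if not ys:
            continue  # I_h = 0: the post-stratum contributes nothing
        weight = (N_h / N) / prob_nonempty(N, N_h, n)
        a += weight * (sum(ys) / len(ys))
        b += weight
    return a / b


# Hypothetical data: two post-strata of sizes 6 and 4, sample of n = 3.
sizes = [6, 4]
strata = [0, 0, 1]
z = [1.0, 2.0, 5.0]
alpha, beta = 3.0, 2.0
y = [alpha + beta * zi for zi in z]

lhs = t_pst(y, strata, sizes)
rhs = alpha + beta * t_pst(z, strata, sizes)
```

Here `lhs` and `rhs` agree up to floating-point rounding, which is the invariance $t_{pst}(\underline{Y})=\alpha+\beta t_{pst}(\underline{Z})$. The numerator `a` alone does not have this property whenever `b` differs from 1, since the constant $\alpha$ is then scaled by `b`.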

