抽样调查sampling theory - 统计代写答疑辅导

分类：抽样调查sampling theory

统计代写|抽样调查作业代写sampling theory of survey代考|STAT532

Posted on 2023年8月7日2023年8月28日 by statistics-lab

如果你也在怎样代写抽样调查Survey sampling 这个学科遇到相关的难题，请随时右上角联系我们的24/7代写客服。抽样调查Survey sampling是数学工程这一广泛新兴领域中的一个自然组成部分。例如，我们可以断言，数学工程之于今天的数学系，就像数学物理之于一个世纪以前的数学系一样;毫不夸张地说，数学在诸如语音和图像处理、信息理论和生物医学工程等工程学科中的基本影响。

抽样调查Survey sampling是主流统计的边缘。这里的特殊之处在于，我们有一个具有某些特征的有形物体集合，我们打算通过抓住其中一些物体并试图对那些未被触及的物体进行推断来窥探它们。这种推论传统上是基于一种概率论，这种概率论被用来探索观察到的事物与未观察到的事物之间的可能联系。这种概率不被认为是在统计学中，涵盖其他领域，以表征我们感兴趣的变量的单个值之间的相互关系。但这是由调查抽样调查人员通过任意指定的一种技术从具有预先分配概率的对象群体中选择样本而创建的。

statistics-lab™ 为您的留学生涯保驾护航在代写抽样调查sampling theory of survey方面已经树立了自己的口碑, 保证靠谱, 高质且原创的统计Statistics代写服务。我们的专家在代写抽样调查sampling theory of survey方面经验极为丰富，各种代写抽样调查sampling theory of survey相关的作业也就用不着说。

统计代写|抽样调查作业代写sampling theory of survey代考|STAT532

统计代写|抽样调查作业代写sampling theory of survey代考|NONRESPONSE

To cite a simple example, suppose that unit $i$, provided it is included in a sample $s$, responds with probability $q_i, q_i$ not depending on $s$ or $Y=\left(Y_i, \ldots, Y_N\right)$. Suppose $n$ units are drawn by SRSWOR and define
$$
M_i= \begin{cases}1 & \text { if unit } i \text { is sampled and responds } \ 0 & \text { otherwise }\end{cases}
$$
Consider the arithmetic mean
$$
\bar{y}=\frac{\sum_1^N M_i Y_i}{\sum_1^N M_i}
$$
of all observations as an estimator of $\bar{Y}$. Then
$$
E M_i=\frac{n}{N} q_i
$$
and $E \bar{y}$ is asymptotically equal to
$$
\frac{\sum q_i Y_i}{\sum q_i}
$$
The bias
$$
\sum\left(\frac{q_i}{\sum q_i}-\frac{1}{N}\right) Y_i
$$
is negligible only if approximately
$$
q_i=\frac{1}{N} \sum q_i .
$$

统计代写|抽样调查作业代写sampling theory of survey代考|CALLBACKS

HANSEN and HURWITZ (1946) gave an elegant procedure for callbacks to tackle nonresponse problems later modified by SRINATH (1971) and J. N. K. RAO (1973), briefly described below. The population is conceptually dichotomized with $W_1\left(W_2=1-W_1\right)$ and $\widehat{Y}_1\left(\widehat{Y}_2=\left[\widehat{Y}-W_1 \widehat{Y}_1\right] / W_2\right)$ as the proportion of respondents (nonrespondents) and mean of respondents (nonrespondents) and an SRSWOR of size $n$ yields proportions $w_1=n_1 / n$ and $w_2=1-w_1=1-n_1 / n=n_2 / n$ of respondents and nonrespondents, respectively. Choosing a suitable number $K>1$ an SRSWOR of size $m_2=n_2 / K$, assumed to be an integer, is then drawn from the initial $n_2$ sample nonrespondents. Supposing that more expensive and persuasive procedures are followed in this second phase so that each of the $m_2$ units called back now responds, let $\bar{y}1$ and $\bar{y}{22}$ denote the first-phase and second-phase sample means based respectively on $n_1$ and $m_2$ respondents. Then, $\bar{Y}$ may be estimated by $\bar{y}d=w_1 \bar{y}_1+w_2 \bar{y}{22}$, and the variance
$$
V\left(\bar{y}d\right)=(1-f) \frac{S^2}{n}+W_2 \frac{(K-1)}{n} S_2^2 $$ by $$ \begin{aligned} v_d= & (1-f)\left(\frac{n_1-1}{n-1}\right) w_1 \frac{s_1^2}{n_1} \ & +\frac{(N-1)\left(n_2-1\right)-(n-1)\left(m_2-1\right)}{N(n-1)} w_2 \frac{s{22}^2}{m_2} \
& +\frac{N-n}{N(n-1)}\left[w_1\left(\bar{y}1-\bar{y}_d\right)^2+w_2\left(\bar{y}{22}-\bar{y}_d\right)^2\right] .
\end{aligned}
$$

抽样调查代考

统计代写|抽样调查作业代写sampling theory of survey代考|NONRESPONSE

举一个简单的例子，假设单元$i$(如果它包含在示例$s$中)的响应概率为$q_i, q_i$，而不取决于$s$或$Y=\left(Y_i, \ldots, Y_N\right)$。假设$n$单位由SRSWOR绘制并定义
$$
M_i= \begin{cases}1 & \text { if unit } i \text { is sampled and responds } \ 0 & \text { otherwise }\end{cases}
$$
考虑算术平均值
$$
\bar{y}=\frac{\sum_1^N M_i Y_i}{\sum_1^N M_i}
$$
作为$\bar{Y}$的估计值。然后
$$
E M_i=\frac{n}{N} q_i
$$
$E \bar{y}$渐近等于
$$
\frac{\sum q_i Y_i}{\sum q_i}
$$
偏见
$$
\sum\left(\frac{q_i}{\sum q_i}-\frac{1}{N}\right) Y_i
$$
只有近似时才可忽略
$$
q_i=\frac{1}{N} \sum q_i .
$$

统计代写|抽样调查作业代写sampling theory of survey代考|CALLBACKS

HANSEN和HURWITZ(1946)给出了一个优雅的回调程序来处理无响应问题，后来由SRINATH(1971)和j.n. K. RAO(1973)修改，下面简要介绍。人口在概念上分为$W_1\left(W_2=1-W_1\right)$和$\widehat{Y}_1\left(\widehat{Y}_2=\left[\widehat{Y}-W_1 \widehat{Y}_1\right] / W_2\right)$作为受访者的比例(非受访者)和受访者的平均值(非受访者)，大小为$n$的SRSWOR分别产生受访者和非受访者的比例$w_1=n_1 / n$和$w_2=1-w_1=1-n_1 / n=n_2 / n$。选择一个合适的数字$K>1$，然后从初始的$n_2$样本非受访者中提取大小为$m_2=n_2 / K$的SRSWOR(假设为整数)。假设在第二阶段中遵循更昂贵和更有说服力的程序，以便每个回调的$m_2$单位现在都做出响应，让$\bar{y}1$和$\bar{y}{22}$分别表示基于$n_1$和$m_2$受访者的第一阶段和第二阶段样本平均值。然后，$\bar{Y}$可以用$\bar{y}d=w_1 \bar{y}_1+w_2 \bar{y}{22}$来估计，方差
$$
V\left(\bar{y}d\right)=(1-f) \frac{S^2}{n}+W_2 \frac{(K-1)}{n} S_2^2 $$ by $$ \begin{aligned} v_d= & (1-f)\left(\frac{n_1-1}{n-1}\right) w_1 \frac{s_1^2}{n_1} \ & +\frac{(N-1)\left(n_2-1\right)-(n-1)\left(m_2-1\right)}{N(n-1)} w_2 \frac{s{22}^2}{m_2} \
& +\frac{N-n}{N(n-1)}\left[w_1\left(\bar{y}1-\bar{y}_d\right)^2+w_2\left(\bar{y}{22}-\bar{y}_d\right)^2\right] .
\end{aligned}
$$

统计代写|抽样调查作业代写sampling theory of survey代考请认准statistics-lab™

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。统计代写|python代写代考

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|抽样调查作业代写sampling theory of survey代考|MATH4060

Posted on 2023年8月7日2023年8月28日 by statistics-lab

统计代写|抽样调查作业代写sampling theory of survey代考|Linear Unbiased Estimators

Let a sensitive variable $y$ be defined on a finite population $U=(1, \ldots, N)$ with values $Y_i, i=1, \ldots, N$, which are supposed to be unavailable through a DR survey. Suppose a sample $s$ of size $n$ is chosen according to a design $p$ with a selection probability $p(s)$. In order to estimate $Y=\sum_1^N Y_i$, let an RR as a value $Z_i$ be available on request from each sampled person labeled $i$ included in a sample. Before describing how a $Z_i$ may be generated, let us note the properties required of it. We will denote by $E_R\left(V_R, C_R\right)$ the operator for expectation (variance, covariance) with respect to the randomized procedure of generating RR. The basic RRs $Z_i$ should allow derivation by a simple transformation reduced RRs as $R_i$ ‘s satisfying the conditions
(a) $E_R\left(R_i\right)=Y_i$
(b) $V_R\left(R_i\right)=\alpha_i Y_i^2+\beta_i Y_i+\theta_i$ with $\alpha_i(>0), \beta_i, \theta_i$ ‘s as known constants
(c) $C_R\left(R_i, R_j\right)=0$ for $i \neq j$
(d) estimators $v_i=a_i R_i^2+b_i R_i+C_i$ exist, $a_i, b_i, c_i$ known constants, such that $E_R\left(v_i\right)=V_R\left(R_i\right)=V_i$, say, for all $i$.

We will illustrate only two possible ways of obtaining $Z_i$ ‘s from a sampled individual $i$ on request. First, let two vectors $A=$ $\left(A_1, \ldots, A_T\right)^{\prime}$ and $B=\left(B_1, \ldots, B_L\right)^{\prime}$ of suitable real numbers be chosen with means $\bar{A} \neq 0, \bar{B}$ and variances $\sigma_A^2, \sigma_B^2$. A sample person $i$ is requested to independently choose at random $a_i$ out of $A$ and $b_i$ out of $B$, and report the value $Z_i=a_i Y_i+b_i$. Then, it follows that $E_R\left(Z_i\right)=\bar{A} Y_i+\bar{B}$, giving
$$
R_i=\left(Z_i-\bar{B}\right) / \bar{A}
$$
such that
$$
\begin{aligned}
E_R\left(R_i\right) & =Y_i, \
V_R\left(R_i\right) & =\left(Y_i^2 \sigma_A^2+\sigma_B^2\right) /(\bar{A})^2=V_i, \
C_R\left(R_i, R_J\right) & =0, \quad i \neq j
\end{aligned}
$$

and
$$
v_i=\left(\sigma_A^2 R_i^2+\sigma_B^2\right) /\left(\sigma_A^2+\bar{A}^2\right)
$$
has
$$
E_R\left(v_i\right)=V_i .
$$

统计代写|抽样调查作业代写sampling theory of survey代考|A Few Specific Strategies

Let us illustrate a few familiar specific cases. Corresponding to the HTE $\bar{t}=\bar{t}(s, Y)=\sum_i \frac{Y_i}{\pi_i} I_{s i}$, we have the derived estimator $e=(s, R)=\sum_i \frac{R_i}{\pi_i} I_{s i}$ for which
$$
M=-\sum_{i<j} \sum_j\left(\pi_i \pi_j-\pi_{i j}\right)\left(Y_i / \pi_i-Y_j / \pi_j\right)^2+\sum_i \frac{V_i}{\pi_i}
$$
and
To LAHIRI’s (1951) ratio estimator $t_L=Y_i / \sum_s P_i$ based on LAHIRI-MIDZUNO-SEN (LMS, 1951, 1952, 1953) scheme corresponds the estimator
$$
e_L=\sum_s R_i / \sum_s P_i
$$
$\left(0<P_i<1, \Sigma_1^N P_i=1\right)$ for which
$$
M=\sum_{i<j} \sum_{i j}\left(1-\frac{1}{C_1} \sum_s \frac{I_{s i j}}{P_s}\right)+\sum V_i E_p\left(I_{s i} / P_s^2\right),
$$
where
$$
\begin{aligned}
C_r & =\left(\begin{array}{c}
N-r \
n-r
\end{array}\right), r=0,1,2, \ldots, P_s=\sum_s P_i, a_{i j} \
& =P_i P_j\left(Y_i / P_i-Y_j / P_j\right)^2
\end{aligned}
$$

$$
\begin{aligned}
m= & \sum \sum P_i P_j I_{s i} I_{s i j}\left(\frac{N-1}{n-1}-\frac{1}{P_s}\right) / \
& P_s\left[\left(\frac{R_i}{P_i}-\frac{R_j}{P_j}\right)^2-\left(\frac{v_i}{p_i^2}+\frac{v_j}{p_j^2}\right)\right]+\sum v_i I_{s i} / P_s^2
\end{aligned}
$$
is unbiased for $M$. If $t_L$ and $e_L$ above are based on SRSWOR in $n$ draws, then, $M$ equals
$$
\begin{aligned}
M^{\prime}= & -\frac{1}{C_0}\left[\sum_{i<j} \sum_{i j} \sum_s\left(\frac{I_{s i j}}{p_s^2}-\frac{I_{s j}}{P_s}-\frac{I_{s j}}{P_s}+1\right)\right. \
& \left.-\sum_i V_i\left(\sum_s I_{s i} / P_s^2\right)\right]
\end{aligned}
$$
and
$$
\begin{aligned}
m^{\prime}= & -\frac{N(N-1)}{n(n-1) C_0} \sum_{i<j} \sum_{\hat{a}{i j}} I{s i j} \sum_s\left(\frac{I_{s i j}}{p_s^2}-\frac{I_{s i}}{P_s}-\frac{I_{s j}}{P_s}+1\right) \
& +\frac{1}{C_0} \frac{N}{n} \sum v_i I_{s i}\left(\sum_s I_{s i} / P_s^2\right)
\end{aligned}
$$
writing
$$
\left.\widehat{a}_{i j}=\left{\left(\frac{R_i}{P_i}-\frac{R_j}{P_j}\right)^2-\frac{v_j}{P_i^2}+\frac{v_j}{P_j^2}\right)\right} P_i P_j
$$

抽样调查代考

统计代写|抽样调查作业代写sampling theory of survey代考|Linear Unbiased Estimators

假设在有限总体$U=(1, \ldots, N)$上定义一个敏感变量$y$，其值为$Y_i, i=1, \ldots, N$，该变量应该无法通过DR调查获得。假设一个大小为$n$的样本$s$是根据一个选择概率为$p(s)$的设计$p$选择的。为了估计$Y=\sum_1^N Y_i$，让RR作为一个值$Z_i$在每个样本中被标记为$i$的人的请求下可用。在描述如何生成$Z_i$之前，让我们注意一下它所需的属性。我们将用$E_R\left(V_R, C_R\right)$表示相对于生成RR的随机过程的期望(方差，协方差)的算子。基本RRs $Z_i$应允许通过简化RRs的简单变换推导出满足$R_i$条件的RRs
(a) $E_R\left(R_i\right)=Y_i$
(b) $V_R\left(R_i\right)=\alpha_i Y_i^2+\beta_i Y_i+\theta_i$，其中$\alpha_i(>0), \beta_i, \theta_i$为已知常数
(c) $i \neq j$为$C_R\left(R_i, R_j\right)=0$
(d)估计量$v_i=a_i R_i^2+b_i R_i+C_i$存在，$a_i, b_i, c_i$已知常数，使得$E_R\left(v_i\right)=V_R\left(R_i\right)=V_i$对所有$i$都成立。

我们将只说明两种可能的方法，根据要求从抽样的个人$i$获取$Z_i$。首先，取两个合适实数的向量$A=$$\left(A_1, \ldots, A_T\right)^{\prime}$和$B=\left(B_1, \ldots, B_L\right)^{\prime}$，均值$\bar{A} \neq 0, \bar{B}$，方差$\sigma_A^2, \sigma_B^2$。要求样本人员$i$从$A$和$B$中独立地随机选择$a_i$和$b_i$，并报告值$Z_i=a_i Y_i+b_i$。接下来是$E_R\left(Z_i\right)=\bar{A} Y_i+\bar{B}$，给予
$$
R_i=\left(Z_i-\bar{B}\right) / \bar{A}
$$
这样
$$
\begin{aligned}
E_R\left(R_i\right) & =Y_i, \
V_R\left(R_i\right) & =\left(Y_i^2 \sigma_A^2+\sigma_B^2\right) /(\bar{A})^2=V_i, \
C_R\left(R_i, R_J\right) & =0, \quad i \neq j
\end{aligned}
$$

和
$$
v_i=\left(\sigma_A^2 R_i^2+\sigma_B^2\right) /\left(\sigma_A^2+\bar{A}^2\right)
$$
有
$$
E_R\left(v_i\right)=V_i .
$$

统计代写|抽样调查作业代写sampling theory of survey代考|A Few Specific Strategies

让我们举例说明几个熟悉的具体案例。对应于HTE $\bar{t}=\bar{t}(s, Y)=\sum_i \frac{Y_i}{\pi_i} I_{s i}$，我们有推导出的估计量$e=(s, R)=\sum_i \frac{R_i}{\pi_i} I_{s i}$
$$
M=-\sum_{i<j} \sum_j\left(\pi_i \pi_j-\pi_{i j}\right)\left(Y_i / \pi_i-Y_j / \pi_j\right)^2+\sum_i \frac{V_i}{\pi_i}
$$
和
对LAHIRI(1951)的比率估计量$t_L=Y_i / \sum_s P_i$基于LAHIRI- midzno – sen (LMS, 1951, 1952, 1953)方案对应的估计量
$$
e_L=\sum_s R_i / \sum_s P_i
$$
$\left(0<P_i<1, \Sigma_1^N P_i=1\right)$ for which
$$
M=\sum_{i<j} \sum_{i j}\left(1-\frac{1}{C_1} \sum_s \frac{I_{s i j}}{P_s}\right)+\sum V_i E_p\left(I_{s i} / P_s^2\right),
$$
在哪里
$$
\begin{aligned}
C_r & =\left(\begin{array}{c}
N-r \
n-r
\end{array}\right), r=0,1,2, \ldots, P_s=\sum_s P_i, a_{i j} \
& =P_i P_j\left(Y_i / P_i-Y_j / P_j\right)^2
\end{aligned}
$$

$$
\begin{aligned}
m= & \sum \sum P_i P_j I_{s i} I_{s i j}\left(\frac{N-1}{n-1}-\frac{1}{P_s}\right) / \
& P_s\left[\left(\frac{R_i}{P_i}-\frac{R_j}{P_j}\right)^2-\left(\frac{v_i}{p_i^2}+\frac{v_j}{p_j^2}\right)\right]+\sum v_i I_{s i} / P_s^2
\end{aligned}
$$
对于$M$是公正的。如果上述$t_L$和$e_L$基于$n$抽签中的SRSWOR，则$M$等于
$$
\begin{aligned}
M^{\prime}= & -\frac{1}{C_0}\left[\sum_{i<j} \sum_{i j} \sum_s\left(\frac{I_{s i j}}{p_s^2}-\frac{I_{s j}}{P_s}-\frac{I_{s j}}{P_s}+1\right)\right. \
& \left.-\sum_i V_i\left(\sum_s I_{s i} / P_s^2\right)\right]
\end{aligned}
$$
和
$$
\begin{aligned}
m^{\prime}= & -\frac{N(N-1)}{n(n-1) C_0} \sum_{i<j} \sum_{\hat{a}{i j}} I{s i j} \sum_s\left(\frac{I_{s i j}}{p_s^2}-\frac{I_{s i}}{P_s}-\frac{I_{s j}}{P_s}+1\right) \
& +\frac{1}{C_0} \frac{N}{n} \sum v_i I_{s i}\left(\sum_s I_{s i} / P_s^2\right)
\end{aligned}
$$
写作
$$
\left.\widehat{a}_{i j}=\left{\left(\frac{R_i}{P_i}-\frac{R_j}{P_j}\right)^2-\frac{v_j}{P_i^2}+\frac{v_j}{P_j^2}\right)\right} P_i P_j
$$

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。统计代写|python代写代考

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|抽样调查作业代写sampling theory of survey代考|STAT506

Posted on 2023年8月7日2023年8月28日 by statistics-lab

统计代写|抽样调查作业代写sampling theory of survey代考|Model-Based Regression Analysis

In the above, we really considered a two-step randomization: the finite population is supposedly a realization from an infinite hypothetical superpopulation with reference to which a regression relationship is postulated connecting a dependent variable and a set of independent regressor variables. Then, from the given or realized finite population a sample is randomly drawn because the population is too large to be completely investigated. The sample is then utilized to make inference with reference to the two-step randomization. But now let us consider a purely model-based approach that takes account of the structure of the finite population at hand by postulating an appropriate model.

Suppose for a sample of $c$ clusters from a given finite population, observations are taken on a dependent variable $y$ and a set of independent regressor variables $x_1, \ldots, x_r$ for independently drawn samples of second stage units (SSUs) of sizes $m_i$ from the respective sampled clusters labeled $i=1, \ldots, c$ so that $\sum_1^c m_i=n$, the total sample size. Let $Y_n$ be an $n \times 1$ vector of observations on $y$, successive rows in it giving values on the $m_i$ observations in the order $i=1, \ldots, c$ and the observations $X_j$ ‘s, $j=1, \ldots, r$ be also similarly arranged in succession. Now it is only to be surmised that the observations within the same cluster should be substantially well and positively correlated compared to those across the clusters. So, after postulating a regression relation of $Y_n$ on $X_n$, which is an $n \times r$ matrix, the successive rows in it arranging the values for the clusters taken in order $i=1, \ldots, c$, which states that
$$
E_m\left(Y_n\right)=X_n \beta
$$
where $\beta$ is an $r \times 1$ vector of unknown regression parameters, one should carefully postulate about the distribution of the error vector
$$
\epsilon_n=Y_n-X_n \beta .
$$

统计代写|抽样调查作业代写sampling theory of survey代考|Design Variables

Next we consider an important situation where, besides the regressor variables, there exist another set of variables that are utilized in determining the selection probabilities, called the design variables. For example, one may plan to examine how expenses on certain items of consumption, the dependent variable $y$, vary with the annual income, the single regressor variable $x$. Then, if accounts of the taxes paid by the relevant individuals in the last financial year, values of a variable $z$, are available, this information can be utilized in stratifying the population accordingly. Then $z$ is a design variable obviously well-correlated with $x$ and $y$.

Following the works of NATHAN and Holt (1980), HoLT, Smith and Winter (1980), and PfeffermanN and Holmes (1985) let us consider the simple case of a single dependent (endogeneous) variable $y$, a single regressor (exogeneous, independent) variable $x$, and a single design variable $z$. Assume the regression model
$$
y=\alpha+\beta x+\epsilon
$$
with $E_m(\epsilon \mid x)=0, V_m(\epsilon \mid x)=\sigma^2(\sigma>0)$. Suppose a random sample $s$ of size $n$ is taken following a design $p$ using the values $Z_1, Z_2, \ldots, Z_N$ of $z$ and define
$$
v_z=\frac{1}{N} \sum_1^N Z_i, \sigma_z^2=\frac{1}{N-1} \sum_1^N\left(Z_i-v_z\right)^2 .
$$
Also, let $\bar{y}, \bar{x}, \bar{z}$ denote sample means of $y, x, z, s_y^2, s_x^2, s_z^2$ the sample variances and $s_{y x}, s_{y z}, s_{x z}$ the sample covariances. The problem is to infer about $\beta$, the regression coefficient of $y$ on $x$ under the model-based approach.
Consider the ordinary least squares estimator (OLSE),
$$
b=s_{y x} / s_x^2 .
$$
Its performance depends essentially on the relation between the design variable $z$ and the variables $x, y$ in the regression model. In the simplest case $x, y, z$ might follow a trivariate normal distribution. DEMETS and HALPERIN (1977) have shown that, under this assumption, $b$ is biased. Following ANDERSON’s (1957) missing value approach, they derive an alternative estimator, which is the maximum likelihood estimator (MLE) for $\beta$, namely,
$$
\widehat{\beta}=\left[s_{y x}+\frac{s_{y z} s_{x z}}{s_z^2}\left(\frac{\sigma_z^2}{s_z^2}-1\right)\right] /\left[s_x^2+\frac{s_{x z}^2}{s_z^2}\left(\frac{\sigma_z^2}{s_z^2}-1\right)\right] .
$$

抽样调查代考

统计代写|抽样调查作业代写sampling theory of survey代考|Model-Based Regression Analysis

在上面，我们真正考虑的是两步随机化:有限总体是无限假设的超总体的实现，参考这一假设，回归关系连接了因变量和一组独立回归变量。然后，从给定或实现的有限总体中随机抽取样本，因为总体太大而无法完全调查。然后利用样本参照两步随机化进行推理。但是现在让我们考虑一种纯粹基于模型的方法，它通过假设一个适当的模型来考虑手头有限人口的结构。

假设对于来自给定有限总体的$c$簇的样本，对独立绘制的第二阶段单元(ssu)的样本(大小为$m_i$)的因变量$y$和一组独立回归变量$x_1, \ldots, x_r$进行观察，这些样本分别标记为$i=1, \ldots, c$，因此$\sum_1^c m_i=n$为总样本量。设$Y_n$为$y$上的观测值的$n \times 1$向量，其中的连续行以$i=1, \ldots, c$的顺序给出$m_i$观测值的值，而观测值$X_j$ s, $j=1, \ldots, r$也以类似的顺序排列。现在只能推测，同一集群内的观测结果与跨集群的观测结果相比，应该具有良好的正相关关系。因此，假设$X_n$上的回归关系为$Y_n$，这是一个$n \times r$矩阵，其中的连续行将集群的值按$i=1, \ldots, c$的顺序排列，这表明
$$
E_m\left(Y_n\right)=X_n \beta
$$
其中$\beta$是未知回归参数的$r \times 1$向量，应该仔细假设误差向量的分布
$$
\epsilon_n=Y_n-X_n \beta .
$$

统计代写|抽样调查作业代写sampling theory of survey代考|Design Variables

接下来，我们考虑一个重要的情况，除了回归变量之外，还有另一组变量用于确定选择概率，称为设计变量。例如，一个人可能计划检查某些消费项目的费用(因变量$y$)如何随年收入(单回归变量$x$)而变化。然后，如果有有关个人在上一个财政年度所付税款的帐户，即变量$z$的值，则可以利用这一资料对人口进行相应的分层。那么$z$是一个与$x$和$y$明显相关的设计变量。

继NATHAN和Holt(1980)、Holt、Smith和Winter(1980)以及PfeffermanN和Holmes(1985)的研究之后，让我们考虑一个简单的情况，即一个因变量(内源性)$y$、一个回归变量(外源性、自变量)$x$和一个设计变量$z$。假设回归模型
$$
y=\alpha+\beta x+\epsilon
$$
通过$E_m(\epsilon \mid x)=0, V_m(\epsilon \mid x)=\sigma^2(\sigma>0)$。假设取一个大小为$n$的随机样本$s$，遵循使用$z$和define的值$Z_1, Z_2, \ldots, Z_N$的设计$p$
$$
v_z=\frac{1}{N} \sum_1^N Z_i, \sigma_z^2=\frac{1}{N-1} \sum_1^N\left(Z_i-v_z\right)^2 .
$$
同样，设$\bar{y}, \bar{x}, \bar{z}$表示$y, x, z, s_y^2, s_x^2, s_z^2$的样本方差和$s_{y x}, s_{y z}, s_{x z}$的样本协方差的样本均值。问题是在基于模型的方法下，对$y$在$x$上的回归系数$\beta$进行推断。
考虑普通最小二乘估计(OLSE)，
$$
b=s_{y x} / s_x^2 .
$$
其性能主要取决于回归模型中设计变量$z$与变量$x, y$之间的关系。在最简单的情况下$x, y, z$可能遵循三元正态分布。DEMETS和HALPERIN(1977)已经表明，在这个假设下，$b$是有偏见的。根据ANDERSON(1957)的缺失值方法，他们推导出一个替代估计量，即$\beta$的最大似然估计量(MLE)，即:
$$
\widehat{\beta}=\left[s_{y x}+\frac{s_{y z} s_{x z}}{s_z^2}\left(\frac{\sigma_z^2}{s_z^2}-1\right)\right] /\left[s_x^2+\frac{s_{x z}^2}{s_z^2}\left(\frac{\sigma_z^2}{s_z^2}-1\right)\right] .
$$

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。统计代写|python代写代考

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|抽样调查作业代写sampling theory of survey代考|ESTIMATION FROM MULTIPLE FRAMES

Posted on 2023年7月14日2023年7月14日 by statistics-lab

统计代写|抽样调查作业代写sampling theory of survey代考|ESTIMATION FROM MULTIPLE FRAMES

Suppose a finite population $U$ of size $N$ is covered exactly by the union of two overlapping frames $A$ and $B$ of sizes $N_A$ and $N_B$. Let $E_A$ denote the set of units of $A$ that are not in $B$, $E_{A B}$ denote those that are in both $A$ and $B$, and $E_B$ denote the units of $B$ that are not in $A ; N_{E A}, N_{A B}, N_{E B}$ respectively denote the sizes of these three mutually exclusive sets. Let two samples of sizes $n_A, n_B$ be drawn by SRSWOR from the two lists $A$ and $B$ respectively in independent manners. Let $n_a, n_{a b}, n_{b a}, n_b$ denote respectively the sampled units of $A$ that are in $E_A, E_{A B}$ and of $B$ that are in $E_{A B}, E_B$. Let us denote the corresponding sample means by $\bar{y}a, \bar{y}{a b}, \bar{y}{b a}$, and $\bar{y}_b$. Then for the population total $Y=\sum_1^N Y_i$ one may employ the following estimators $$ \widehat{Y}_1=\left(N{E A} \bar{y}a+N{E B} \bar{y}b\right)+N{A B}\left(p \bar{y}{a b}+q \bar{y}{b a}\right)
$$
if $N_{E A}, N_{E B}$, and $N_{A B}$ are known, or, without this assumption,
$$
\widehat{Y}2=\frac{N_A}{n_A}\left(\bar{y}_a+p \bar{y}{a b}\right)+\frac{N_B}{n_B}\left(\bar{y}b+q \bar{y}{b a}\right)
$$
In $\widehat{Y}1$ and $\widehat{Y}_2, p$ is a suitable number, $0{A B}, E_B$ are known quantities $\sigma_A^2, \sigma_{A B}^2, \sigma_B^2$ and choosing a simple cost function, he gave rules for optimal choices of $n_A, n_B$ subject to a given value of $n=n_A+n_B$ and of $p$.

统计代写|抽样调查作业代写sampling theory of survey代考|Small Domains and Poststratification

Suppose a finite population $U$ of $N$ units labeled $1, \ldots, i, \ldots, N$ consists of a very large, say several thousand, domains of interest, like the households of people of various racial groups of different predominant occupational groups of their principal earning members located in various counties across various states like those in U.S.A. For certain overall general purposes a sample $s$ of a size $n$, which may also be quite large, say a few thousand, may be supposed to have been chosen according to a design $p$ admitting $\pi_i>0$. Then the total $T_d=\sum_{U_d} Y_i$ for a variable of interest $y$ relating to the members of a particular domain $U_d$ of size $N_d$ of interest may be estimated using the direct estimators
$$
t_d=\left(\sum_{s_d} Y_i / \pi_i\right)
$$
or
$$
t_d^{\prime}=N_d\left(\sum_{s_d} Y_i / \pi_i\right) /\left(\sum_{s_d} 1 / \pi_i\right) .
$$
We write $s_d$ for the part of the sample $s$ that coincides with $U_d$, and $n_d$ for the size of $s_d, d=1, \ldots, D$, writing $D$ for the total number of domains such that $U_d$ ‘s are disjoint, coincident with $U$ when amalgamated over all the $U_d$ ‘s $d=1, \ldots, D$. We suppose $D$ is very large and so even for large $n=\sum_{d=1}^D n_d$, the values of $n_d$ for numerous values of $d$ turn out to be quite small, and even nil for many of them. Thus the sample base of $t_d$ or $t_d^{\prime}$ happens in practice to be so small that they may not serve any useful purpose, having inordinately large magnitudes and unstable estimators for their variances, leading to inconsequential confidence intervals, which in most cases fail to cover the true domain totals. Similar and more acute happens to be the problem of estimating the domain means $\bar{T}_d=T_d / N_d$, writing domain size as $N_d$, which often is unknown. Hence the problem of small domain statistics, and a special method of estimation is needed for the parameters relating to small domains, which are often geographical areas and hence are called small areas or local areas. In this section, we will briefly discuss a few issues involved in small area or local area estimation.

抽样调查代考

统计代写|抽样调查作业代写sampling theory of survey代考|ESTIMATION FROM MULTIPLE FRAMES

假设大小为$N$的有限种群$U$被大小为$N_A$和$N_B$的两个重叠帧$A$和$B$的并集完全覆盖。设$E_A$表示$A$中不属于$B$的单位集合，$E_{A B}$表示同时属于$A$和$B$的单位集合，$E_B$表示$B$中不属于$A ; N_{E A}, N_{A B}, N_{E B}$的单位集合，分别表示这三个互斥集合的大小。用SRSWOR分别从$A$和$B$两个列表中独立抽取两个大小为$n_A, n_B$的样本。设$n_a, n_{a b}, n_{b a}, n_b$分别表示位于$E_A, E_{A B}$中的$A$的采样单位和位于$E_{A B}, E_B$中的$B$的采样单位。我们用$\bar{y}a, \bar{y}{a b}, \bar{y}{b a}$和$\bar{y}b$表示相应的样本均值。那么对于人口总数$Y=\sum_1^N Y_i$，可以使用以下估计器$$ \widehat{Y}_1=\left(N{E A} \bar{y}a+N{E B} \bar{y}b\right)+N{A B}\left(p \bar{y}{a b}+q \bar{y}{b a}\right) $$ 如果$N{E A}, N_{E B}$和$N_{A B}$是已知的，或者，没有这个假设，
$$
\widehat{Y}2=\frac{N_A}{n_A}\left(\bar{y}a+p \bar{y}{a b}\right)+\frac{N_B}{n_B}\left(\bar{y}b+q \bar{y}{b a}\right) $$ 在$\widehat{Y}1$和$\widehat{Y}_2, p$是一个合适的数字，$0{A B}, E_B$是已知的数量$\sigma_A^2, \sigma{A B}^2, \sigma_B^2$和选择一个简单的成本函数，他给出了规则的最优选择$n_A, n_B$在给定的值$n=n_A+n_B$和$p$。

统计代写|抽样调查作业代写sampling theory of survey代考|Small Domains and Poststratification

假设一个有限总体 $U$ 的 $N$ 标示单位 $1, \ldots, i, \ldots, N$ 包含了一个很大的，比如几千个兴趣领域，比如不同种族不同主要职业群体的人的家庭他们的主要收入成员分布在不同的县，不同的州，比如美国为了某些总体的一般目的，一个样本 $s$ 大小相同 $n$，也可能相当大，比如几千，可能是根据设计选择的 $p$ 承认 $\pi_i>0$．然后是总数 $T_d=\sum_{U_d} Y_i$ 对于感兴趣的变量 $y$ 与某一特定领域的成员有关的 $U_d$ 大小 $N_d$ 感兴趣的可以用直接估计量来估计
$$
t_d=\left(\sum_{s_d} Y_i / \pi_i\right)
$$
或
$$
t_d^{\prime}=N_d\left(\sum_{s_d} Y_i / \pi_i\right) /\left(\sum_{s_d} 1 / \pi_i\right) .
$$
我们写 $s_d$ 对于样本的一部分 $s$ 这与 $U_d$，和 $n_d$ 的大小 $s_d, d=1, \ldots, D$，写作 $D$ 对于这样的域的总数 $U_d$ 是不相交的，与 $U$ 当所有的 $U_d$ ’’ $d=1, \ldots, D$．我们认为 $D$ 是非常大的，所以即使是大的 $n=\sum_{d=1}^D n_d$的值 $n_d$ 的许多值 $d$ 结果是相当小的，很多甚至为零。的样本基 $t_d$ 或 $t_d^{\prime}$ 在实践中发生的是如此之小，以至于它们可能没有任何有用的目的，它们的大小和不稳定的方差估计，导致无关紧要的置信区间，在大多数情况下无法覆盖真正的域总数。类似的更尖锐的问题是域均值的估计 $\bar{T}_d=T_d / N_d$，将域大小写为 $N_d$，这通常是未知的。小域统计通常是指地理区域，因此被称为小区域或局部区域，因此小域统计需要一种特殊的参数估计方法。在本节中，我们将简要讨论涉及小区域或局部区域估计的几个问题。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。统计代写|python代写代考

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|抽样调查作业代写sampling theory of survey代考|Subsampling of Second-Stage Units to Simplify Variance Estimation

Posted on 2023年7月14日2023年7月14日 by statistics-lab

统计代写|抽样调查作业代写sampling theory of survey代考|Subsampling of Second-Stage Units to Simplify Variance Estimation

SRINATH and HIDIROGLOU (1980) contrived the following device to bypass the requirement of estimating $V_L\left(T_i\right)$. They consider choosing the fsus by SRSWOR scheme, choosing from each sampled fsu $i$ in the sample $s$ again an SRSWOR $s_i$, in independent manners cluster-wise of size $m_i$ from $M_i$ ssus in it, and using
$$
e=\frac{N}{n} \sum_{i \in s} M_i \bar{y}_i
$$
as an estimator for $Y$. Here $\bar{y}_i$ is the mean of the $y$ values of the ssus in $s_i$ for $i \in s$. Then they recommend taking a subsample $s_i^{\prime}$ of size $m_i^{\prime}$ out of $s_i$ again by SRSWOR method, getting $\bar{y}_i^{\prime}$ as the mean of $y$ based on the ssus in $s_i^{\prime}$. They show that an unbiased estimator for $V(e)$ is available exclusively in terms of $\bar{y}_i^{\prime}$ for $i \in s$ although not in terms of $\bar{y}_i$ as, ideally, one would like to have.

ARNAB (1988) argues that restriction to SRSWOR is neither necessary nor desirable and discarding the ssus in $s_i$ or $s_i^{\prime}$ is neither desirable nor necessary, and gives further generalizations of this basic idea of SRINATH and HIDIROGLOU (1980). Following DES RAJ’s (1968) general strategy, he suggests starting with the estimator
$$
e_D=\sum_s b_{s i} I_{s i} T_i
$$
with
$$
\begin{aligned}
V\left(e_D\right) & =\sum Y_i^2\left(\alpha_i-1\right)+\sum_{i \neq j} Y_i Y_j\left(\alpha_{i j}-1\right)+\sum \alpha_i \sigma_i^2 \
V_L\left(T_i\right) & =\sigma_i^2
\end{aligned}
$$

统计代写|抽样调查作业代写sampling theory of survey代考|Estimation of Y

We have so far restricted ourselves to only unbiased estimators of $Y$. But suppose we want to estimate
$$
\bar{Y}=\sum_1^N Y_i / \sum_1^N M_i
$$
where $\sum_1^N M_i$ may also be unknown like $Y=\sum_1^N Y_i$ and we may know or ascertain only the values of $M_i$ for the clusters actually selected. In that case, an unbiased estimator is unlikely to be available for $\bar{Y}$. Rather, a biased ratio estimator $t_R=\Sigma_s Y_i / \Sigma_s M_i$ may be based on an SRSWOR $s$ of selected clusters if $Y_i$ ‘s are ascertainable. If not, one may employ
$$
e_R=\frac{\sum_s T_i}{\sum_s M_i},
$$
a biased estimator for $\bar{Y}$, using $T_i$ ‘s as unbiased estimators for $Y_i$ based on samples taken at later stages of sampling from the fsu $i$ such that $E_L\left(T_i\right)=Y_i$ with $V_L\left(T_i\right)$ equal to $V_{s i}$ or $\sigma_i^2$ admitting respectively unbiased estimators $\hat{V}{s i}$ or $\hat{\sigma}_i^2$ such that $E_L\left(\hat{V}{s i}\right)=V_{s i}$ or $E_L\left(\hat{\sigma}_i^2\right)=\sigma_i^2$.

In general, following RAO and VIJAYAN (1977) and RAO (1979), let us start with
$$
t=\sum_s b_{s i} I_{s i} Y_i
$$
not necessarily unbiased for $Y$ such that
$$
M=E_p(t-Y)^2=\sum \sum Y_i Y_j d_{i j}
$$
with
$$
E_p\left(b_{s i} I_{s i}-1\right)\left(b_{s j} I_{s j}-1\right)=d_{i j} .
$$
Let us assume that there exist $W_i \neq 0$ such that if $Z_i=$ $Y_i / W_i=c$ (a non-zero constant) for all $i$, then $M$ equals zero. In that case, from chapter 2 we know that we may write
$$
\begin{aligned}
M & =-\sum_{i<j} d_{i j} W_i W_j\left(Z_i-Z_j\right)^2 \
& =-\sum_{i<j} d_{i j} W_i W_j\left(\frac{Y_i}{W_i}-\frac{Y_j}{W_j}\right)^2 .
\end{aligned}
$$

抽样调查代考

统计代写|抽样调查作业代写sampling theory of survey代考|Subsampling of Second-Stage Units to Simplify Variance Estimation

SRINATH和HIDIROGLOU(1980)设计了以下装置来绕过估算$V_L\left(T_i\right)$的要求。他们考虑通过SRSWOR方案选择fsu，从样本中每个抽样的fsu $i$$s$中再次选择SRSWOR $s_i$，以独立的方式从$M_i$中的ssu中选择大小为$m_i$的集群，并使用
$$
e=\frac{N}{n} \sum_{i \in s} M_i \bar{y}_i
$$
作为$Y$的估计值。这里$\bar{y}_i$是$i \in s$中$s_i$中ssus的$y$值的平均值。然后，他们建议再次用SRSWOR方法从$s_i$中取出大小为$m_i^{\prime}$的子样本$s_i^{\prime}$，根据$s_i^{\prime}$中的ssus得到$\bar{y}_i^{\prime}$作为$y$的平均值。它们表明，$V(e)$的无偏估计量只能在$i \in s$的$\bar{y}_i^{\prime}$项下可用，尽管在理想情况下不能在$\bar{y}_i$项下可用。

ARNAB(1988)认为限制SRSWOR既不必要也不可取，丢弃$s_i$或$s_i^{\prime}$中的ssus既不可取也不必要，并进一步概括了SRINATH和HIDIROGLOU(1980)的基本思想。根据DES RAJ(1968)的一般策略，他建议从估计器开始
$$
e_D=\sum_s b_{s i} I_{s i} T_i
$$
有
$$
\begin{aligned}
V\left(e_D\right) & =\sum Y_i^2\left(\alpha_i-1\right)+\sum_{i \neq j} Y_i Y_j\left(\alpha_{i j}-1\right)+\sum \alpha_i \sigma_i^2 \
V_L\left(T_i\right) & =\sigma_i^2
\end{aligned}
$$

统计代写|抽样调查作业代写sampling theory of survey代考|Estimation of Y

到目前为止，我们仅限于$Y$的无偏估计量。但是假设我们想要估计
$$
\bar{Y}=\sum_1^N Y_i / \sum_1^N M_i
$$
其中$\sum_1^N M_i$也可能像$Y=\sum_1^N Y_i$一样是未知的，我们可能只知道或确定实际选择的集群的$M_i$的值。在这种情况下，无偏估计量不太可能用于$\bar{Y}$。相反，如果$Y_i$是可确定的，则偏比值估计器$t_R=\Sigma_s Y_i / \Sigma_s M_i$可以基于所选簇的SRSWOR $s$。如果没有，可以雇佣
$$
e_R=\frac{\sum_s T_i}{\sum_s M_i},
$$
$\bar{Y}$的有偏估计，使用$T_i$作为$Y_i$的无偏估计，基于从fsu $i$采样的后期阶段的样本，使得$E_L\left(T_i\right)=Y_i$与$V_L\left(T_i\right)$等于$V_{s i}$或$\sigma_i^2$分别承认无偏估计$\hat{V}{s i}$或$\hat{\sigma}i^2$，使得$E_L\left(\hat{V}{s i}\right)=V{s i}$或$E_L\left(\hat{\sigma}_i^2\right)=\sigma_i^2$。

总的来说，继RAO和VIJAYAN(1977)和RAO(1979)之后，让我们从
$$
t=\sum_s b_{s i} I_{s i} Y_i
$$
对于$Y$来说不一定是公正的
$$
M=E_p(t-Y)^2=\sum \sum Y_i Y_j d_{i j}
$$
有
$$
E_p\left(b_{s i} I_{s i}-1\right)\left(b_{s j} I_{s j}-1\right)=d_{i j} .
$$
让我们假设存在$W_i \neq 0$，如果$Z_i=$$Y_i / W_i=c$(一个非零常数)对于所有$i$，那么$M$等于零。在这种情况下，从第二章我们知道我们可以写
$$
\begin{aligned}
M & =-\sum_{i<j} d_{i j} W_i W_j\left(Z_i-Z_j\right)^2 \
& =-\sum_{i<j} d_{i j} W_i W_j\left(\frac{Y_i}{W_i}-\frac{Y_j}{W_j}\right)^2 .
\end{aligned}
$$

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。统计代写|python代写代考

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|抽样调查作业代写sampling theory of survey代考|Design-Based Variance Estimation

Posted on 2023年7月14日2023年7月14日 by statistics-lab

统计代写|抽样调查作业代写sampling theory of survey代考|Design-Based Variance Estimation

When $\left(X_i, Y_i\right)$ values are available for SRSWOR of size $n$ an alternative to the ratio estimator for $\bar{Y}$ is the regression estimator
$$
t_r=\bar{y}+b(\bar{X}-\bar{x}) .
$$
Here $b$ is the sample regression coefficient of $y$ on $x$. Its variance $V_p\left(t_r\right)$ and mean square error $M_p\left(t_r\right)$ are both approximated by
$$
V=\frac{1-f}{n} \frac{1}{N-1} \sum_1^N D_i^2
$$
where
$$
\begin{aligned}
D_i & =\left(Y_i-\bar{Y}\right)-B\left(X_i-\bar{X}\right) \
B & =\sum_1^N\left(Y_i-\bar{Y}\right)\left(X_i-\bar{X}\right) / \sum_1^N\left(X_i-\bar{X}\right)^2 .
\end{aligned}
$$
The errors in these approximations are neglected for large $n$ and $N$ although for $n, N$, and $X$ at hand it is difficult to guess the magnitudes of these errors. However, there exists evidence that $t_r$ may be more efficient than the ratio estimator $\bar{t}R$ in many situations in terms of mean square error (cf. DENG and Wu, 1987). Writing $$ \begin{aligned} d_i & =\left(Y_i-\bar{y}\right)-b\left(X_i-\bar{x}\right), \ v{l r} & =\frac{1-f}{n(n-2)} \sum_s d_i^2
\end{aligned}
$$
is traditionally taken as an estimator for $V$. DENG and WU (1987) consider a class of generalized estimators
$$
v_g=\left[\frac{\bar{X}}{\bar{x}}\right]^g v_{l r}
$$

统计代写|抽样调查作业代写sampling theory of survey代考|Model-Based Variance Estimation

Besides these ad hoc variance estimators, hardly any others are known to have been proposed as estimators for $V$ with a design-based approach. However, some rivals have emerged from the least squares linear predictive approach.

Suppose $Y, X$ are conformable to the model $\mathcal{M}_{10}^{\prime}$ (cf. section 4.1.2) for which the following is tenable:
$$
\begin{aligned}
E_m\left(Y_i\right) & =\alpha+\beta X_i, \alpha \neq 0, V_m\left(Y_i\right)=\sigma^2, \
C_m\left(Y_i, Y_j\right) & =0, i \neq j .
\end{aligned}
$$
Then the BLUP for $\bar{Y}$ is $\mathrm{t}_r$ and
$$
\begin{aligned}
B_m\left(t_r\right) & =E_m\left(t_r-\bar{Y}\right)=0 \
V_m\left(t_r-\bar{Y}\right) & =\frac{1-f}{n}\left[1+\frac{(\bar{X}-\bar{x})^2}{(1-f) g(s)}\right] \sigma^2=\phi(s) \sigma^2, \text { say, }
\end{aligned}
$$
writing
$$
g(s)=\frac{1}{n} \sum_s\left(X_i-\bar{x}\right)^2 .
$$
Then, for
$$
\hat{\sigma}^2=\frac{1}{(n-2)} \sum_s d_i^2
$$
we have
$$
E_m\left(\hat{\sigma}^2\right)=\sigma^2
$$
Consequently,
$$
v_L=\phi(s) \hat{\sigma}^2=\frac{1-f}{n(n-2)}\left[1+\frac{(\bar{X}-\bar{x})^2}{(1-f) g(s)}\right] \sum_s d_i^2
$$
is an $m$-unbiased estimator for $V_m\left(t_r-\bar{Y}\right)$ under $\mathcal{M}{10}^{\prime}$. The term $$ h(s)=\frac{(X-\bar{x})^2}{(1-f) g(s)} $$ in $v_L$ vanishes if the sample is balanced, that is, $\bar{x}=\bar{X}$, and for a balanced sample $V_m\left(t_r-\bar{Y}\right)$ is the minimal under $\mathcal{M}{10}^{\prime}$.
In general,
$$
v_L=(1+h(s)) v_{l r} \geq v_{l r}
$$
with equality only for a balanced sample. If a balanced sample is drawn, then the classical design-based estimator $v_{l r}$ based on it becomes $m$-unbiased for $V_m\left(t_r-\bar{Y}\right)$.

抽样调查代考

统计代写|抽样调查作业代写sampling theory of survey代考|Design-Based Variance Estimation

当大小为$n$的SRSWOR的$\left(X_i, Y_i\right)$值可用时，$\bar{Y}$的比率估计器的替代方法是回归估计器
$$
t_r=\bar{y}+b(\bar{X}-\bar{x}) .
$$
其中$b$为$y$在$x$上的样本回归系数。其方差$V_p\left(t_r\right)$和均方误差$M_p\left(t_r\right)$都近似为
$$
V=\frac{1-f}{n} \frac{1}{N-1} \sum_1^N D_i^2
$$
在哪里
$$
\begin{aligned}
D_i & =\left(Y_i-\bar{Y}\right)-B\left(X_i-\bar{X}\right) \
B & =\sum_1^N\left(Y_i-\bar{Y}\right)\left(X_i-\bar{X}\right) / \sum_1^N\left(X_i-\bar{X}\right)^2 .
\end{aligned}
$$
对于较大的$n$和$N$，这些近似中的误差可以忽略不计，尽管对于$n, N$和$X$，很难猜测这些误差的大小。然而，有证据表明，在均方误差方面，$t_r$在许多情况下可能比比率估计器$\bar{t}R$更有效(cf. DENG and Wu, 1987)。写作$$ \begin{aligned} d_i & =\left(Y_i-\bar{y}\right)-b\left(X_i-\bar{x}\right), \ v{l r} & =\frac{1-f}{n(n-2)} \sum_s d_i^2
\end{aligned}
$$
通常作为$V$的估计量。DENG和WU(1987)考虑了一类广义估计量
$$
v_g=\left[\frac{\bar{X}}{\bar{x}}\right]^g v_{l r}
$$

统计代写|抽样调查作业代写sampling theory of survey代考|Model-Based Variance Estimation

除了这些特别的方差估计器之外，几乎没有其他已知的方法被提出作为$V$基于设计的方法的估计器。然而，从最小二乘线性预测方法中出现了一些竞争对手。

假设$Y, X$符合模型$\mathcal{M}{10}^{\prime}$(参见第4.1.2节)，其中以下是成立的: $$ \begin{aligned} E_m\left(Y_i\right) & =\alpha+\beta X_i, \alpha \neq 0, V_m\left(Y_i\right)=\sigma^2, \ C_m\left(Y_i, Y_j\right) & =0, i \neq j . \end{aligned} $$ 那么$\bar{Y}$的BLUP就是$\mathrm{t}_r$和 $$ \begin{aligned} B_m\left(t_r\right) & =E_m\left(t_r-\bar{Y}\right)=0 \ V_m\left(t_r-\bar{Y}\right) & =\frac{1-f}{n}\left[1+\frac{(\bar{X}-\bar{x})^2}{(1-f) g(s)}\right] \sigma^2=\phi(s) \sigma^2, \text { say, } \end{aligned} $$ 写作 $$ g(s)=\frac{1}{n} \sum_s\left(X_i-\bar{x}\right)^2 . $$ 然后，对于 $$ \hat{\sigma}^2=\frac{1}{(n-2)} \sum_s d_i^2 $$ 我们有 $$ E_m\left(\hat{\sigma}^2\right)=\sigma^2 $$ 因此， $$ v_L=\phi(s) \hat{\sigma}^2=\frac{1-f}{n(n-2)}\left[1+\frac{(\bar{X}-\bar{x})^2}{(1-f) g(s)}\right] \sum_s d_i^2 $$ 是$\mathcal{M}{10}^{\prime}$下$V_m\left(t_r-\bar{Y}\right)$的一个$m$ -无偏估计量。如果样本是平衡的，则$v_L$中的$$ h(s)=\frac{(X-\bar{x})^2}{(1-f) g(s)} $$项消失，即$\bar{x}=\bar{X}$，并且对于平衡的样本，$V_m\left(t_r-\bar{Y}\right)$是$\mathcal{M}{10}^{\prime}$下的最小值。一般来说， $$ v_L=(1+h(s)) v{l r} \geq v_{l r}
$$
只有平衡的样本才相等。如果绘制了一个平衡样本，那么基于它的经典基于设计的估计器$v_{l r}$对于$V_m\left(t_r-\bar{Y}\right)$就变为$m$ -无偏。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。统计代写|python代写代考

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|抽样调查作业代写sampling theory of survey代考|Bestness under a Model

Posted on 2023年6月26日2023年6月26日 by statistics-lab

如果你也在怎样代写抽样调查Survey sampling 这个学科遇到相关的难题，请随时右上角联系我们的24/7代写客服。抽样调查Survey sampling可大致分为两种类型：概率样本和超级样本。基于概率的样本执行一个具有指定概率的抽样计划（也许是由一个适应性程序指定的适应性概率）。基于概率的抽样允许对目标人群进行基于设计的推断。推论是基于研究方案中指定的已知客观概率分布。基于概率的调查的推论仍然可能受到许多类型的偏见的影响。

抽样调查Survey sampling在统计学中，描述了从目标人群中选择一个元素样本进行调查的过程。术语 “调查 “可以指许多不同类型或技术的观察。在调查取样中，它最常涉及的是用于测量人们的特征和/或态度的调查问卷。一旦样本成员被选中，与他们联系的不同方式就是调查数据收集的主题。抽样调查的目的是为了减少调查整个目标人群所需的成本和/或工作量。衡量整个目标人口的调查被称为普查。样本指的是要从中获取信息的一个群体或部分。

统计代写|抽样调查作业代写sampling theory of survey代考|Bestness under a Model

To choose among different $Q_i$ ‘s satisfying the $\mathrm{ADC}$ and equivalently ADU requirement in case $R=1$, BREWER (1979) recommended as a criterion
$$
L=\lim {T \rightarrow \infty} E_m E_p\left{\left[t{Q 1 T}\left(s_T, Y_T\right)-Y_T\right]^2 / T\right}
$$
where $Y_i=x_i^{\prime} \beta+\varepsilon_i$ is assumed with
$$
\begin{aligned}
E_m\left(\varepsilon_i\right) & =0 & & \
C_m\left(\varepsilon_i, \varepsilon_j\right) & =\sigma_i^2, & & \text { if } j=i \
& =0, & & \text { if } j \neq i
\end{aligned}
$$
$(i, j=1,2, \ldots, T N)$. He has shown that
$$
L \geq \sum \sigma_i^2\left[\frac{1}{\pi_i}-1\right]
$$
holds with equality for the LPRE defined by $Q_i^*$ (see RESULT 6.4).

Now, every QR predictor with the consistency and ADU property is a GREG predictor, $t_{Q 1 / \pi}$, and
$$
\begin{aligned}
t_{Q 1 / \pi}-Y= & {\left[\sum_1^N xi^{\prime}-\sum I{s i} \frac{1}{\pi_i} xi^{\prime}\right]\left[\sum_1^N I{s i} Q_i xi x_i^{\prime}\right]^{-1}\left[\sum_1^N I{s i} Q_i xi Y_i\right] } \ & +\sum_1^N I{s i} \frac{1}{\pi_i} Y_i-\sum_1^N I_{s i} Y_i-\sum_1^N\left(1-I_{s i}\right) Y_i \
= & \sum_1^N I_{s j}\left{\left[\sum_1^N xi^{\prime}-\sum I{s i} \frac{1}{\pi_i} xi^{\prime}\right]\left[\sum_1^N I{s i} Q_i xi x_i^{\prime}\right]^{-1} Q_j x_j\right. \ & \left.+\left[\frac{1}{\pi_j}-1\right]\right} Y_j-\sum_1^N\left(1-I{s j}\right) Y_j .
\end{aligned}
$$
With $s$ replaced by $s_T$ and $N$ by $N T$ we obtain
$$
t_{Q 1 / \pi T}-Y_T
$$
It is easily checked that $E_m\left(t_{Q 1 / \pi T}-Y_T\right)=0$ and under Eq. (6.1)
$$
\begin{aligned}
E_m & {\left[t_{Q 1 / \pi T}-Y_T\right]^2=V_m\left[t_{Q 1 / \pi T}-Y_T\right] } \
= & \sum_1^{N T} I_{s_T j}\left{\left[\sum_1^{N T} xi^{\prime}-\sum I{s_T i} \frac{1}{\pi_i} xi^{\prime}\right]\left[\sum_1^{N T} I{s_T i} Q_i xi x_i^{\prime}\right]^{-1} Q_j x_j\right. \ & \left.+\left[\frac{1}{\pi_i}-1\right]\right}^2 \sigma_j^2+\sum_1^{N T}\left(1-I{s_T j}\right) \sigma_j^2 .
\end{aligned}
$$

统计代写|抽样调查作业代写sampling theory of survey代考|Concluding Remarks

For a fuller treatment and alternative approaches by asymptotic analyses in survey sampling along with their interpretations, one may refer to BREWER (1979), SÄRNDAL (1980), FULLER and ISAKI (1981), ISAKI and FULLER (1982), ROBINSON and SÄRNDAL (1983), HANSEN, MADOW and TEPPING (1983), and CHAUDHURI and VOS (1988). We omit the details to avoid a too technical discussion.

Robustness has been on the focus relating to LPREs. GREG predictors by virtue of their forms acquire robustness from design considerations in the sense of asymptotic design unbiasedness, as we noticed in the previous section. At this stage let us turn again to them to examine their robustness.
An LPRE is of the form $t_L=\Sigma_s Y_i+\Sigma_r \hat{\mu}i$ where $E_m\left(Y_i\right)=$ $\mu_i$. If $\mu_i$ is a polynominal in an auxiliary variable $x$, for samples balanced up to a certain order every $t{B L U}$ is bias robust, that is, $E_m\left(t_{B L U}-Y\right)=0$, and asymptotically so for large samples selected by SRSWOR, preferably with appropriate stratifications. But $t_{B L U}$ is not usually MSE robust, by which we mean the following: Let us write $t_{m^{\prime}}$ for the predictor, which is BLU under a model $m^{\prime}$; its bias, MSE, and variance under a true model, $m$, are, respectively, $B_m\left(t_{m^{\prime}}\right), M_m\left(t_{m^{\prime}}\right)$, and $V_m\left(t_{m^{\prime}}-Y\right)$. Then, $M_m\left(t_{m^{\prime}}\right)=V_m\left(t_{m^{\prime}}-Y\right)+B_m^2\left(t_{m^{\prime}}\right)$ and $M_m\left(t_m\right)=V_m\left(t_m-Y\right)$ because $B_m\left(t_m\right)=0$. Even if $\left|B_m\left(t_{m^{\prime}}\right)\right|$ is negligible, $V_m\left(t_{m^{\prime}}-Y\right)$ may be too far away from $V_m\left(t_m-Y\right)$ and so may be $M_m\left(t_{m^{\prime}}\right)$ from $M_m\left(t_m\right)$. So $t_{m^{\prime}}$, even if bias robust, may be quite fragile in respect to MSE.

Very little with practical utility is known about MSE robustness of LPREs. More importantly, nobody knows what the true model is; even with a polynomial assumption it is hard to know its degree, and in large-scale surveys diagnostic analysis to fix a correct model is a far cry. So, it is being recognized that even for model-based LPREs robustness should be examined with respect to design, that is, one should examine the magnitude of
$$
M_p\left(t_L\right)=E_p\left(t_L-Y\right)^2=V_p\left(t_L\right)+B_p^2\left(t_L\right) .
$$
Since the sample size is usually large, we may presume $V_p\left(t_L\right)$ to be suitably under control and we should concentrate on $\left|B_p\left(t_L\right)\right|$. In section 4.1.2 we saw how a restriction $B_p(t)=0$ may lead to loss of efficiency, especially if a model is accurately postulated. An accepted criterion for robustness studies is therefore to demand that $t_L$ be $\mathrm{ADC}$. Similar are the desirable requirements for any other estimator or predictor.

抽样调查代考

统计代写|抽样调查作业代写sampling theory of survey代考|Bestness under a Model

BREWER(1979)建议在满足$\mathrm{ADC}$和等价ADU要求的情况$R=1$的不同$Q_i$中进行选择作为标准
$$
L=\lim {T \rightarrow \infty} E_m E_p\left{\left[t{Q 1 T}\left(s_T, Y_T\right)-Y_T\right]^2 / T\right}
$$
$Y_i=x_i^{\prime} \beta+\varepsilon_i$在哪里
$$
\begin{aligned}
E_m\left(\varepsilon_i\right) & =0 & & \
C_m\left(\varepsilon_i, \varepsilon_j\right) & =\sigma_i^2, & & \text { if } j=i \
& =0, & & \text { if } j \neq i
\end{aligned}
$$
$(i, j=1,2, \ldots, T N)$。他已经证明了
$$
L \geq \sum \sigma_i^2\left[\frac{1}{\pi_i}-1\right]
$$
对于由$Q_i^*$定义的LPRE(参见RESULT 6.4)，持有与相等。

现在，每个具有一致性和ADU属性的QR预测器都是GREG预测器，$t_{Q 1 / \pi}$和
$$
\begin{aligned}
t_{Q 1 / \pi}-Y= & {\left[\sum_1^N xi^{\prime}-\sum I{s i} \frac{1}{\pi_i} xi^{\prime}\right]\left[\sum_1^N I{s i} Q_i xi xi^{\prime}\right]^{-1}\left[\sum_1^N I{s i} Q_i xi Y_i\right] } \ & +\sum_1^N I{s i} \frac{1}{\pi_i} Y_i-\sum_1^N I{s i} Y_i-\sum_1^N\left(1-I_{s i}\right) Y_i \
= & \sum_1^N I_{s j}\left{\left[\sum_1^N xi^{\prime}-\sum I{s i} \frac{1}{\pi_i} xi^{\prime}\right]\left[\sum_1^N I{s i} Q_i xi xi^{\prime}\right]^{-1} Q_j x_j\right. \ & \left.+\left[\frac{1}{\pi_j}-1\right]\right} Y_j-\sum_1^N\left(1-I{s j}\right) Y_j . \end{aligned} $$ 将$s$替换为$s_T$，将$N$替换为$N T$，我们得到 $$ t{Q 1 / \pi T}-Y_T
$$
很容易检查$E_m\left(t_{Q 1 / \pi T}-Y_T\right)=0$和Eq. (6.1)
$$
\begin{aligned}
E_m & {\left[t_{Q 1 / \pi T}-Y_T\right]^2=V_m\left[t_{Q 1 / \pi T}-Y_T\right] } \
= & \sum_1^{N T} I_{s_T j}\left{\left[\sum_1^{N T} xi^{\prime}-\sum I{s_T i} \frac{1}{\pi_i} xi^{\prime}\right]\left[\sum_1^{N T} I{s_T i} Q_i xi x_i^{\prime}\right]^{-1} Q_j x_j\right. \ & \left.+\left[\frac{1}{\pi_i}-1\right]\right}^2 \sigma_j^2+\sum_1^{N T}\left(1-I{s_T j}\right) \sigma_j^2 .
\end{aligned}
$$

统计代写|抽样调查作业代写sampling theory of survey代考|Concluding Remarks

对于更全面的处理和通过调查抽样渐进分析的替代方法及其解释，可以参考BREWER(1979)、SÄRNDAL(1980)、fuller和ISAKI(1981)、ISAKI和fuller(1982)、ROBINSON和SÄRNDAL(1983)、HANSEN、MADOW和TEPPING(1983)以及CHAUDHURI和VOS(1988)。我们省略了细节以避免过于技术性的讨论。

鲁棒性一直是与LPREs相关的焦点。正如我们在前一节中注意到的那样，GREG预测器凭借其形式从设计考虑中获得了渐近设计无偏性意义上的鲁棒性。在这个阶段，让我们再来看看它们的稳健性。
LPRE的形式为$t_L=\Sigma_s Y_i+\Sigma_r \hat{\mu}i$，其中$E_m\left(Y_i\right)=$$\mu_i$。如果$\mu_i$是辅助变量$x$中的多项式，那么对于平衡到一定阶数的样本，每个$t{B L U}$都是偏倚稳健的，即$E_m\left(t_{B L U}-Y\right)=0$，对于SRSWOR选择的大样本，则渐近地如此，最好有适当的分层。但是$t_{B L U}$通常不是MSE鲁棒性的，我们的意思是:让我们为预测器写$t_{m^{\prime}}$，它是模型$m^{\prime}$下的BLU;在真实模型$m$下，其偏差、MSE和方差分别为$B_m\left(t_{m^{\prime}}\right), M_m\left(t_{m^{\prime}}\right)$和$V_m\left(t_{m^{\prime}}-Y\right)$。然后，$M_m\left(t_{m^{\prime}}\right)=V_m\left(t_{m^{\prime}}-Y\right)+B_m^2\left(t_{m^{\prime}}\right)$和$M_m\left(t_m\right)=V_m\left(t_m-Y\right)$因为$B_m\left(t_m\right)=0$。即使$\left|B_m\left(t_{m^{\prime}}\right)\right|$可以忽略不计，$V_m\left(t_{m^{\prime}}-Y\right)$也可能离$V_m\left(t_m-Y\right)$太远，因此$M_m\left(t_{m^{\prime}}\right)$可能离$M_m\left(t_m\right)$太远。所以$t_{m^{\prime}}$，即使偏倚稳健，在MSE方面也可能很脆弱。

对于LPREs的MSE鲁棒性，我们所知甚少。更重要的是，没有人知道真正的模式是什么;即使有一个多项式假设，也很难知道它的程度，在大规模调查中，诊断分析要确定一个正确的模型是很遥远的。因此，人们认识到，即使是基于模型的LPREs，也应该根据设计来检查鲁棒性，也就是说，应该检查的幅度
$$
M_p\left(t_L\right)=E_p\left(t_L-Y\right)^2=V_p\left(t_L\right)+B_p^2\left(t_L\right) .
$$
由于样本量通常很大，我们可以假设$V_p\left(t_L\right)$在适当的控制之下，我们应该集中于$\left|B_p\left(t_L\right)\right|$。在4.1.2节中，我们看到了限制$B_p(t)=0$如何导致效率的损失，特别是在模型被精确假设的情况下。因此，稳健性研究的公认标准是要求$t_L$为$\mathrm{ADC}$。类似的是任何其他估计器或预测器的理想需求。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。统计代写|python代写代考

随机过程代考

在概率论概念中，随机过程是随机变量的集合。若一随机系统的样本点是随机函数，则称此函数为样本函数，这一随机系统全部样本函数的集合是一个随机过程。实际应用中，样本函数的一般定义在时间域或者空间域。 随机过程的实例如股票和汇率的波动、语音信号、视频信号、体温的变化，随机运动如布朗运动、随机徘徊等等。

贝叶斯方法代考

贝叶斯统计概念及数据分析表示使用概率陈述回答有关未知参数的研究问题以及统计范式。后验分布包括关于参数的先验分布，和基于观测数据提供关于参数的信息似然模型。根据选择的先验分布和似然模型，后验分布可以解析或近似，例如，马尔科夫链蒙特卡罗 (MCMC) 方法之一。贝叶斯统计概念及数据分析使用后验分布来形成模型参数的各种摘要，包括点估计，如后验平均值、中位数、百分位数和称为可信区间的区间估计。此外，所有关于模型参数的统计检验都可以表示为基于估计后验分布的概率报表。

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

statistics-lab作为专业的留学生服务机构，多年来已为美国、英国、加拿大、澳洲等留学热门地的学生提供专业的学术服务，包括但不限于Essay代写，Assignment代写，Dissertation代写，Report代写，小组作业代写，Proposal代写，Paper代写，Presentation代写，计算机作业代写，论文修改和润色，网课代做，exam代考等等。写作范围涵盖高中，本科，研究生等海外留学全阶段，辐射金融，经济学，会计学，审计学，管理学等全球99%专业科目。写作团队既有专业英语母语作者，也有海外名校硕博留学生，每位写作老师都拥有过硬的语言能力，专业的学科背景和学术写作经验。我们承诺100%原创，100%专业，100%准时，100%满意。

机器学习代写

随着AI的大潮到来，Machine Learning逐渐成为一个新的学习热点。同时与传统CS相比，Machine Learning在其他领域也有着广泛的应用，因此这门学科成为不仅折磨CS专业同学的“小恶魔”，也是折磨生物、化学、统计等其他学科留学生的“大魔王”。学习Machine learning的一大绊脚石在于使用语言众多，跨学科范围广，所以学习起来尤其困难。但是不管你在学习Machine Learning时遇到任何难题，StudyGate专业导师团队都能为你轻松解决。

多元统计分析代考

基础数据: $N$ 个样本， $P$ 个变量数的单样本，组成的横列的数据表
变量定性: 分类和顺序；变量定量：数值
数学公式的角度分为: 因变量与自变量

时间序列分析代写

随机过程，是依赖于参数的一组随机变量的全体，参数通常是时间。随机变量是随机现象的数量表现，其时间序列是一组按照时间发生先后顺序进行排列的数据点序列。通常一组时间序列的时间间隔为一恒定值（如1秒，5分钟，12小时，7天，1年），因此时间序列可以作为离散时间数据进行分析处理。研究时间序列数据的意义在于现实中，往往需要研究某个事物其随时间发展变化的规律。这就需要通过研究该事物过去发展的历史记录，以得到其自身发展的规律。

回归分析代写

多元回归分析渐进（Multiple Regression Analysis Asymptotics）属于计量经济学领域，主要是一种数学上的统计分析方法，可以分析复杂情况下各影响因素的数学关系，在自然科学、社会和经济学等多个领域内应用广泛。

MATLAB代写

MATLAB 是一种用于技术计算的高性能语言。它将计算、可视化和编程集成在一个易于使用的环境中，其中问题和解决方案以熟悉的数学符号表示。典型用途包括：数学和计算算法开发建模、仿真和原型制作数据分析、探索和可视化科学和工程图形应用程序开发，包括图形用户界面构建MATLAB 是一个交互式系统，其基本数据元素是一个不需要维度的数组。这使您可以解决许多技术计算问题，尤其是那些具有矩阵和向量公式的问题，而只需用 C 或 Fortran 等标量非交互式语言编写程序所需的时间的一小部分。MATLAB 名称代表矩阵实验室。MATLAB 最初的编写目的是提供对由 LINPACK 和 EISPACK 项目开发的矩阵软件的轻松访问，这两个项目共同代表了矩阵计算软件的最新技术。MATLAB 经过多年的发展，得到了许多用户的投入。在大学环境中，它是数学、工程和科学入门和高级课程的标准教学工具。在工业领域，MATLAB 是高效研究、开发和分析的首选工具。MATLAB 具有一系列称为工具箱的特定于应用程序的解决方案。对于大多数 MATLAB 用户来说非常重要，工具箱允许您学习和应用专业技术。工具箱是 MATLAB 函数（M 文件）的综合集合，可扩展 MATLAB 环境以解决特定类别的问题。可用工具箱的领域包括信号处理、控制系统、神经网络、模糊逻辑、小波、仿真等。

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|抽样调查作业代写sampling theory of survey代考|BREWER’S ASYMPTOTIC APPROACH

Posted on 2023年6月26日2023年6月26日 by statistics-lab

统计代写|抽样调查作业代写sampling theory of survey代考|BREWER’S ASYMPTOTIC APPROACH

Looking for properties of a strategy as population and sample sizes increase presumes some relation between $p_1, p_2, \ldots$ on one hand and between $t_1, t_2, \ldots$ on the other hand.

In this and the next section relations on the design and estimator sequence, respectively, are introduced.

Consistency of an estimator $t_T$ is easy to decide on if Assumption $\mathbf{A}$ is true and $p_T$ satisfies a special condition considered by BREWER (1979):

Assumption B: Using Assumption $\boldsymbol{A}$ and starting with an arbitrary design $p_1$ of fixed size $n_1$ for $\mathcal{U}(1)$, then $p_T$ is as follows: Apply $p_1$ not only to $\mathcal{U}(1)$ but also, independently, to $\mathcal{U}(2)$, $\ldots, \mathcal{U}(T)$ and amalgamate the corresponding samples
$$
s(1), s(2), \ldots, s(T)
$$

to form
$$
s_T=s(1) \cup s(2) \cup \cdots \cup s(T) .
$$
A design satisfying Assumption $\mathbf{B}$ to give the selection probability for $s_T$ is appreciably limited in scope and application.

Some authors have considered such restrictive designs, notably HANSEN, MADOW and TEPPING (1983). However, interesting results have been derived under less restrictive assumptions as well as by alternative approaches.

We mention ISAKI and FULLER (1982) proving the consistency of the HT estimator under rather general conditions on $p_T$. In fact, they even drop Assumption $\mathbf{A}$, a condition that seems quite rational to us.

BREWER’s approach should be adequate where it is advisable to partition a large population $\mathcal{U}_T$ into subsets of similar size and structure and to use these subsets as strata in the selection procedure. This is acceptable only if there is no loss in efficiency. But it is doubtful that this may always be the case.
We plan to enlarge BREWER’s class of designs and obtain a class containing the designs in common use and with the same technical amenities as BREWER’s class.

统计代写|抽样调查作业代写sampling theory of survey代考|MOMENT-TYPE ESTIMATORS

To establish meaningful results of asymptotic unbiasedness and consistency, the estimators $t_1, t_2, \ldots$ of a sequence to be considered must be somehow related to each other. Subsequently, a relation is assumed that is based on the concept of a moment estimator we define as follows: Let $A_i, B_i, C_i, \ldots$ be values associated with $i \in U$. Then, for $s \subset U$ with $n(s)=n$
$$
\frac{1}{n} \sum_s A_i, \quad \frac{1}{n} \sum_s A_i B_i, \quad \frac{1}{n} \sum_s A_i B_i C_i
$$
are sample moments. Examples are
$$
\frac{1}{n} \sum_s \frac{Y_i}{\pi_i}, \quad \frac{1}{n} \sum_s X_{i 1} Y_i, \quad \frac{1}{n} \sum_s \frac{X_{i 1} X_{i 2}}{\pi_i}
$$
where $Y_i, X_{i 1}, X_{i 2}$ are values of variables $y, x_1, x_2$, respectively, and $\pi_i$ inclusion probabilities defined by a design for $i \in U$.
$$
\frac{1}{N} \sum_1^N A_i, \quad \frac{1}{N} \sum_1^N A_i B_i, \quad \frac{1}{N} \sum_1^N A_i B_i C_i
$$
are population moments corresponding to the sampling moments Eq. (5.7).

A moment estimator $t$ is an estimator that may be written as a function of sample moments $m^{(1)}, m^{(2)}, \ldots, m^{(v)}$ :
$$
t=f\left(m^{(1)}, m^{(2)}, \ldots, m^{(\nu)}\right) .
$$

Obvious examples of moment estimators are the sample mean, the HT-estimator, the $\mathrm{HH}$-estimator, and the ratio estimator.
Now, let $t_1$ be a moment estimator, that is,
$$
t_1=f\left(m_1^{(1)}, \ldots, m_1^{(\nu)}\right)
$$
where $m_1^{(1)}, \ldots, m_1^{(v)}$ are sample moments for $s_1$.
Then, $t_T$ may be defined in a natural way:
$$
t_T=f\left(m_T^{(1)}, m_T^{(2)}, \ldots, m_T^{(\nu)}\right)
$$
where $m_T^{(j)}$ is the sample moment for $s_T$ corresponding to $m_1^{(j)}$, $j=1,2, \ldots, v$. As an example, we mention the ratio estimator
$$
t_1=\frac{\sum_{s_1} Y_i}{\sum_{s_1} X_i} \bar{X}
$$
for which
$$
t_T=\frac{\sum_{s_T} Y_i}{\sum_{s_T} X_i} \bar{X}
$$

抽样调查代考

统计代写|抽样调查作业代写sampling theory of survey代考|BREWER’S ASYMPTOTIC APPROACH

当总体和样本量增加时，寻找策略的属性时，假设$p_1, p_2, \ldots$和$t_1, t_2, \ldots$之间存在某种关系。

在这一节和下一节中，分别介绍了设计和估计序列的关系。

假设$\mathbf{A}$为真且$p_T$满足BREWER(1979)考虑的一个特殊条件时，估计量$t_T$的一致性很容易确定:

假设B:使用假设 $\boldsymbol{A}$ 从一个任意的设计开始 $p_1$ 大小固定 $n_1$ 为了 $\mathcal{U}(1)$那么， $p_T$ 如下:Apply $p_1$ 不仅是 $\mathcal{U}(1)$ 但是，独立地说， $\mathcal{U}(2)$， $\ldots, \mathcal{U}(T)$ 并合并相应的样品
$$
s(1), s(2), \ldots, s(T)
$$

形成
$$
s_T=s(1) \cup s(2) \cup \cdots \cup s(T) .
$$
满足假设$\mathbf{B}$给出$s_T$的选择概率的设计在范围和应用上明显受到限制。

一些作者考虑过这种限制性设计，特别是HANSEN, MADOW和TEPPING(1983)。然而，在限制性较低的假设和其他方法下也得出了有趣的结果。

我们提到ISAKI和FULLER(1982)在$p_T$上证明了在相当一般的条件下HT估计量的一致性。事实上，他们甚至放弃了假设$\mathbf{A}$，这在我们看来是相当合理的条件。

布鲁尔的方法应该是适当的，当它是明智的划分一个大的人口$\mathcal{U}_T$为类似的大小和结构的子集，并使用这些子集作为层在选择过程中。只有在没有效率损失的情况下，这才是可以接受的。但令人怀疑的是，情况可能总是如此。
我们计划扩大布鲁尔的设计类别，并获得一个包含常用设计的类别，并具有与布鲁尔类别相同的技术设施。

统计代写|抽样调查作业代写sampling theory of survey代考|MOMENT-TYPE ESTIMATORS

为了建立有意义的渐近无偏性和一致性结果，要考虑的序列的估计量$t_1, t_2, \ldots$必须以某种方式彼此相关。随后，假设一个基于矩估计器概念的关系，我们定义如下:设$A_i, B_i, C_i, \ldots$为与$i \in U$相关的值。然后，用$n(s)=n$表示$s \subset U$
$$
\frac{1}{n} \sum_s A_i, \quad \frac{1}{n} \sum_s A_i B_i, \quad \frac{1}{n} \sum_s A_i B_i C_i
$$
都是一些例子。例子如下
$$
\frac{1}{n} \sum_s \frac{Y_i}{\pi_i}, \quad \frac{1}{n} \sum_s X_{i 1} Y_i, \quad \frac{1}{n} \sum_s \frac{X_{i 1} X_{i 2}}{\pi_i}
$$
其中$Y_i, X_{i 1}, X_{i 2}$分别为变量$y, x_1, x_2$的值，$\pi_i$为$i \in U$的设计定义的包含概率。
$$
\frac{1}{N} \sum_1^N A_i, \quad \frac{1}{N} \sum_1^N A_i B_i, \quad \frac{1}{N} \sum_1^N A_i B_i C_i
$$
为总体矩，对应于抽样矩Eq.(5.7)。

一个矩估计器$t$是一个估计器，可以写成一个样本矩的函数$m^{(1)}, m^{(2)}, \ldots, m^{(v)}$:
$$
t=f\left(m^{(1)}, m^{(2)}, \ldots, m^{(\nu)}\right) .
$$

矩估计器的明显例子是样本均值，ht估计器，$\mathrm{HH}$估计器和比率估计器。
现在，设$t_1$是一个矩估计量，
$$
t_1=f\left(m_1^{(1)}, \ldots, m_1^{(\nu)}\right)
$$
其中$m_1^{(1)}, \ldots, m_1^{(v)}$是$s_1$的样例力矩。
那么，$t_T$可以用一种自然的方式定义:
$$
t_T=f\left(m_T^{(1)}, m_T^{(2)}, \ldots, m_T^{(\nu)}\right)
$$
其中$m_T^{(j)}$为$s_T$对应于$m_1^{(j)}$、$j=1,2, \ldots, v$的样本矩。作为一个例子，我们提到比率估计器
$$
t_1=\frac{\sum_{s_1} Y_i}{\sum_{s_1} X_i} \bar{X}
$$
为了什么?
$$
t_T=\frac{\sum_{s_T} Y_i}{\sum_{s_T} X_i} \bar{X}
$$

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。统计代写|python代写代考

随机过程代考

贝叶斯方法代考

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

机器学习代写

多元统计分析代考

基础数据: $N$ 个样本， $P$ 个变量数的单样本，组成的横列的数据表
变量定性: 分类和顺序；变量定量：数值
数学公式的角度分为: 因变量与自变量

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|抽样调查作业代写sampling theory of survey代考|Balancing for Polynomial Models

Posted on 2023年6月26日2023年6月26日 by statistics-lab

统计代写|抽样调查作业代写sampling theory of survey代考|Balancing for Polynomial Models

We return to the model $\mathcal{M}{10}^{\prime}$ of 4.1.2 and consider an extension $\mathcal{M}_k$ defined as follows: $$ \begin{aligned} Y_i & =\sum{j=0}^k \beta_j X_i^j+\varepsilon_i \
E_m\left(\varepsilon_i\right) & =0, V_m\left(\varepsilon_i\right)=\sigma^2, C_m\left(\varepsilon_i, \varepsilon_j\right)=0, \text { for } i \neq j
\end{aligned}
$$
where $i, j=1,2, \ldots, N$. By generalizing the developments of section 4.1.2, we derive.

RESULT 4.2 Let $\mathcal{M}_k$ be given. Then, the MSE of the BLU predictor $t_o$ for $Y$ is minimum for a sample $s$ of size $n$ if
$$
\frac{1}{n} \sum_s X_i^j=\frac{1}{N} \sum_1^N X_i^j \text { for } j=0,1, \ldots, k .
$$
If these equalities hold we have
$$
t_o(s, Y)=N \bar{y}
$$
A sample satisfying the equalities in Result 4.2 is said to be balanced up to order $k$.

Now, assume the true model $\mathcal{M}{k^{\prime}}$ agrees with a statistician’s working model $\mathcal{M}_k$ in all respects except that $$ E_m\left(Y_i\right)=\sum_0^{k^{\prime}} \beta_j X_i^j $$ with $k^{\prime}>k$. The statistician will use $t_o$ instead of $t_o^{\prime}$, the BLU predictor for $Y$ on the base of $\mathcal{M}{k^{\prime}}$. However, if he selects a sample that is balanced up to order $k^{\prime}$
$$
t_o^{\prime}(s, Y)=t_o(s, Y)=N \bar{y}
$$
and his error does not cause losses.
It is, of course, too ambitious to realize exactly the balancing conditions even if $k^{\prime}$ is of moderate size, for example, $k^{\prime}=4$ or 5 . But if $n$ is large the considerations outlined in Result 4.1 apply again for SRSWOR or SRSWOR independently from within strata after internally homogeneous strata are priorly constructed.

统计代写|抽样调查作业代写sampling theory of survey代考|Linear Models in Matrix Notation

Suppose $x_1, x_2, \ldots, x_k$ are real variables, called auxiliary or explanatory variables, each closely related to the variable of interest $y$. Let
$$
xi=\left(X{i 1}, X_{i 2}, \ldots, X_{i k}\right)^{\prime}
$$
be the vector of explanatory variables for unit $i$ and assume the linear model
$$
Y_i=xi^{\prime} \beta+\varepsilon_i $$ for $i=1,2, \ldots, N$. Here $$ \beta=\left(\beta_1, \beta_2, \ldots, \beta_k\right)^{\prime} $$ is the vector of (unknown) regression parameters; $\varepsilon_1, \varepsilon_2, \ldots$, $\varepsilon_N$ are random variables satisfying $$ \begin{aligned} E_m \varepsilon_i & =0 \ V_m \varepsilon_i & =v{i i} \
C_m\left(\varepsilon_i, \varepsilon_j\right) & =v_{i j}, i \neq j
\end{aligned}
$$
where $E_m, V_m, C_m$ are operators for expectation, variance, and covariance with respect to the model distribution; and the ma$\operatorname{trix} V=\left(v_{i j}\right)$ is assumed to be known up to a constant $\sigma^2$.
To have a more compact notation define
$$
\begin{aligned}
Y & =\left(Y_1, Y_2, \ldots, Y_N\right)^{\prime} \
X & =\left(x1, x_2, \ldots, x_N\right)^{\prime}=\left(X{i j}\right) \
\varepsilon & =\left(\varepsilon_1, \varepsilon_2, \ldots, \varepsilon_N\right)^{\prime}
\end{aligned}
$$
and write the linear model as
$$
Y=X \beta+\varepsilon
$$
where
$$
\begin{aligned}
E_{m \varepsilon} & =0 \
V_m(\varepsilon) & =V
\end{aligned}
$$

抽样调查代考

统计代写|抽样调查作业代写sampling theory of survey代考|Balancing for Polynomial Models

我们回到4.1.2的模型$\mathcal{M}{10}^{\prime}$，并考虑如下定义的扩展$\mathcal{M}_k$: $$ \begin{aligned} Y_i & =\sum{j=0}^k \beta_j X_i^j+\varepsilon_i \
E_m\left(\varepsilon_i\right) & =0, V_m\left(\varepsilon_i\right)=\sigma^2, C_m\left(\varepsilon_i, \varepsilon_j\right)=0, \text { for } i \neq j
\end{aligned}
$$
在哪里$i, j=1,2, \ldots, N$。通过推广第4.1.2节的发展，我们得到。

4.2设$\mathcal{M}_k$。然后，对于大小为$n$ if的样本$s$，对于$Y$的BLU预测器$t_o$的MSE是最小的
$$
\frac{1}{n} \sum_s X_i^j=\frac{1}{N} \sum_1^N X_i^j \text { for } j=0,1, \ldots, k .
$$
如果这些等式成立，我们有
$$
t_o(s, Y)=N \bar{y}
$$
满足结果4.2中的等式的样本被称为平衡到$k$阶。

现在，假设真实模型$\mathcal{M}{k^{\prime}}$与统计学家的工作模型$\mathcal{M}_k$在除$$ E_m\left(Y_i\right)=\sum_0^{k^{\prime}} \beta_j X_i^j $$与$k^{\prime}>k$之外的所有方面都一致。统计学家将使用$t_o$代替$t_o^{\prime}$，这是基于$\mathcal{M}{k^{\prime}}$的$Y$的BLU预测器。但是，如果他选择的样品是平衡到订单$k^{\prime}$
$$
t_o^{\prime}(s, Y)=t_o(s, Y)=N \bar{y}
$$
他的错误不会造成损失。
当然，即使$k^{\prime}$是中等大小的，例如$k^{\prime}=4$或5，也很难精确地实现平衡条件。但是，如果$n$很大，则结果4.1中概述的考虑因素再次适用于SRSWOR或在内部均匀地层预先构建后独立于地层内部的SRSWOR。

统计代写|抽样调查作业代写sampling theory of survey代考|Linear Models in Matrix Notation

假设$x_1, x_2, \ldots, x_k$是实变量，称为辅助变量或解释变量，每个变量与感兴趣的变量密切相关$y$。让
$$
xi=\left(X{i 1}, X_{i 2}, \ldots, X_{i k}\right)^{\prime}
$$
为单位$i$的解释变量向量，假设为线性模型
$$
Y_i=xi^{\prime} \beta+\varepsilon_i $$代表$i=1,2, \ldots, N$。其中$$ \beta=\left(\beta_1, \beta_2, \ldots, \beta_k\right)^{\prime} $$为(未知)回归参数的向量;$\varepsilon_1, \varepsilon_2, \ldots$, $\varepsilon_N$是满足$$ \begin{aligned} E_m \varepsilon_i & =0 \ V_m \varepsilon_i & =v{i i} \
C_m\left(\varepsilon_i, \varepsilon_j\right) & =v_{i j}, i \neq j
\end{aligned}
$$的随机变量
其中$E_m, V_m, C_m$为相对于模型分布的期望、方差和协方差的算子;假设ma $\operatorname{trix} V=\left(v_{i j}\right)$是已知的，直到一个常数$\sigma^2$。
有一个更紧凑的符号定义
$$
\begin{aligned}
Y & =\left(Y_1, Y_2, \ldots, Y_N\right)^{\prime} \
X & =\left(x1, x2, \ldots, x_N\right)^{\prime}=\left(X{i j}\right) \ \varepsilon & =\left(\varepsilon_1, \varepsilon_2, \ldots, \varepsilon_N\right)^{\prime} \end{aligned} $$ 把线性模型写成 $$ Y=X \beta+\varepsilon $$ 在哪里 $$ \begin{aligned} E{m \varepsilon} & =0 \
V_m(\varepsilon) & =V
\end{aligned}
$$

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。统计代写|python代写代考

随机过程代考

贝叶斯方法代考

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

机器学习代写

多元统计分析代考

基础数据: $N$ 个样本， $P$ 个变量数的单样本，组成的横列的数据表
变量定性: 分类和顺序；变量定量：数值
数学公式的角度分为: 因变量与自变量

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|抽样调查作业代写sampling theory of survey代考|Equicorrelation Model

Posted on 2023年5月25日2023年5月25日 by statistics-lab

如果你也在怎样代写抽样调查sampling theory of survey这个学科遇到相关的难题，请随时右上角联系我们的24/7代写客服。

抽样调查是一种非全面调查，根据随机的原则从总体中抽取部分实际数据进行调查，并运用概率估计方法，根据样本数据推算总体相应的数量指标的一种统计分析方法。

我们提供的抽样调查sampling theory of survey及其相关学科的代写，服务范围广, 其中包括但不限于:

统计代写|抽样调查作业代写sampling theory of survey代考|Equicorrelation Model

Following CSW (1976, 1977), consider the model of equicorrelated $Y_i$ ‘s for which
$$
E_m\left(Y_i\right)=\alpha_i+\beta X_i
$$
$\alpha_i$ known with mean $\bar{\alpha}, \beta$ unknown, $0<X_i$ known with $\Sigma X_i=$ $N$
$$
\begin{aligned}
V_m\left(Y_i\right) & =\sigma^2 X_i^2 \
C_m\left(Y_i, Y_j\right) & =\rho \sigma^2 X_i X_j,-\frac{1}{N-1}<\rho<1 .
\end{aligned}
$$
Linear unbiased estimators (LUE) for $\bar{Y}$ are of the form
$$
t=t(s, Y)=a_s+\sum_{i \in s} b_{s i} Y_i
$$
with $a_s, b_{s i}$ free of $Y$ such that for a fixed design $p$
$$
E_p\left(a_s\right)=0, \sum_{s \ni i} b_{s i} p(s)=\frac{1}{N} \text { for } i=1, \ldots, N .
$$
To find an optimal strategy ( $p, t)$ let us proceed as follows. First note that writing $c_{s i}=b_{s i} X_i$,
$$
1=\frac{X}{N}=\frac{1}{N} \sum_1^N X_i=\sum_1^N \sum_{s \ni i} c_{s i} p(s)=\sum_s p(s)\left[\sum_{i \in s} c_{s i}\right] .
$$

统计代写|抽样调查作业代写sampling theory of survey代考|Applications to Survey Sampling

A further line of approach is now required because $\theta_0$ itself needs to be estimated from survey data
$$
d=\left(i, Y_i \mid i \in s\right)
$$
available only for the $Y_i$ ‘s with $i \in s, s$ a sample supposed to be selected with probability $p(s)$ according to a design $p$ for which we assume
$$
\pi_i=\sum_{s \ni i} p(s)>0 \text { for all } i=1,2, \ldots, N .
$$
With the setup of the preceding section, let the $Y_i$ ‘s be independent and consider unbiased estimating functions $\phi_i\left(Y_i, \theta\right) ; i=$ $1,2, \ldots, N$. Let
$$
\theta_0=\theta_0(Y)
$$
be the solution of $g(Y, \theta)=0$ where
$$
g(Y, \theta)=\sum_1^N \phi_i\left(Y_i, \theta\right)
$$
and consider estimating this $\theta_0$ using survey data $d=\left(i, Y_i \mid i \in\right.$ s). For this it seems natural to start with an unbiased sampling function
$$
h=h(s, Y, \theta)
$$
which is free of $Y_j$ for $j \notin s$ and satisfies
(a) $\frac{\partial h}{\partial \theta}(s, Y, \theta)$ exists for all $Y$
(b) $E_m \frac{\partial h}{\partial \theta}(s, Y, \theta) \neq 0$
(c) $E_p h(s, Y, \theta)=g(Y, \theta)$ for all $Y$, the unbiasedness condition.

抽样调查代考

统计代写|抽样调查作业代写sampling theory of survey代考|Equicorrelation Model

继CSW(1976,1977)之后，考虑equicorrelated $Y_i$的模型
$$
E_m\left(Y_i\right)=\alpha_i+\beta X_i
$$
$\alpha_i$已知，平均$\bar{\alpha}, \beta$未知，$0<X_i$已知，$\Sigma X_i=$$N$
$$
\begin{aligned}
V_m\left(Y_i\right) & =\sigma^2 X_i^2 \
C_m\left(Y_i, Y_j\right) & =\rho \sigma^2 X_i X_j,-\frac{1}{N-1}<\rho<1 .
\end{aligned}
$$
$\bar{Y}$的线性无偏估计量(LUE)是这样的形式
$$
t=t(s, Y)=a_s+\sum_{i \in s} b_{s i} Y_i
$$
与$a_s, b_{s i}$免费$Y$这样，为一个固定的设计$p$
$$
E_p\left(a_s\right)=0, \sum_{s \ni i} b_{s i} p(s)=\frac{1}{N} \text { for } i=1, \ldots, N .
$$
为了找到一个最佳策略($p, t)$)，让我们按照以下步骤进行。首先要注意写$c_{s i}=b_{s i} X_i$，
$$
1=\frac{X}{N}=\frac{1}{N} \sum_1^N X_i=\sum_1^N \sum_{s \ni i} c_{s i} p(s)=\sum_s p(s)\left[\sum_{i \in s} c_{s i}\right] .
$$

统计代写|抽样调查作业代写sampling theory of survey代考|Applications to Survey Sampling

现在需要进一步的方法，因为$\theta_0$本身需要根据调查数据进行估计
$$
d=\left(i, Y_i \mid i \in s\right)
$$
只适用于$Y_i$与$i \in s, s$的一个样本应该是选择的概率$p(s)$根据我们假设的设计$p$
$$
\pi_i=\sum_{s \ni i} p(s)>0 \text { for all } i=1,2, \ldots, N .
$$
通过上一节的设置，让$Y_i$是独立的，并考虑无偏估计函数$\phi_i\left(Y_i, \theta\right) ; i=$$1,2, \ldots, N$。让
$$
\theta_0=\theta_0(Y)
$$
是$g(Y, \theta)=0$的解
$$
g(Y, \theta)=\sum_1^N \phi_i\left(Y_i, \theta\right)
$$
并考虑使用调查数据$d=\left(i, Y_i \mid i \in\right.$来估计这个$\theta_0$。因此，从无偏抽样函数开始似乎很自然
$$
h=h(s, Y, \theta)
$$
哪个是免费的$Y_j$为$j \notin s$和满足
(a) $\frac{\partial h}{\partial \theta}(s, Y, \theta)$适用于所有$Y$
(b) $E_m \frac{\partial h}{\partial \theta}(s, Y, \theta) \neq 0$
(c) $E_p h(s, Y, \theta)=g(Y, \theta)$对于所有$Y$，为无偏性条件。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。统计代写|python代写代考

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写