标签： EM6613

统计代写|线性回归分析代写linear regression analysis代考|EM6613

Posted on 2023年7月21日2023年7月21日 by statistics-lab

如果你也在怎样代写线性回归Linear Regression 这个学科遇到相关的难题，请随时右上角联系我们的24/7代写客服。线性回归Linear Regression在统计学中，是对标量响应和一个或多个解释变量（也称为因变量和自变量）之间的关系进行建模的一种线性方法。一个解释变量的情况被称为简单线性回归；对于一个以上的解释变量，这一过程被称为多元线性回归。这一术语不同于多元线性回归，在多元线性回归中，预测的是多个相关的因变量，而不是一个标量变量。

线性回归Linear Regression在线性回归中，关系是用线性预测函数建模的，其未知的模型参数是根据数据估计的。最常见的是，假设给定解释变量（或预测因子）值的响应的条件平均值是这些值的仿生函数；不太常见的是，使用条件中位数或其他一些量化指标。像所有形式的回归分析一样，线性回归关注的是给定预测因子值的反应的条件概率分布，而不是所有这些变量的联合概率分布，这是多元分析的领域。

statistics-lab™ 为您的留学生涯保驾护航在代写线性回归分析linear regression analysis方面已经树立了自己的口碑, 保证靠谱, 高质且原创的统计Statistics代写服务。我们的专家在代写线性回归分析linear regression analysis代写方面经验极为丰富，各种代写线性回归分析linear regression analysis相关的作业也就用不着说。

统计代写|线性回归分析代写linear regression analysis代考|EM6613

统计代写|线性回归分析代写linear regression analysis代考|Data and Matrix Notation

In this and the next few sections we use matrix notation as a compact way to describe data and perform manipulations of data. Appendix A.6 contains a brief introduction to matrices and linear algebra that some readers may find helpful.

Suppose we have observed data for $n$ cases or units, meaning we have a value of $Y$ and all of the regressors for each of the $n$ cases. We define
$$
\mathbf{Y}=\left(\begin{array}{c}
y_1 \
y_2 \
\vdots \
y_n
\end{array}\right) \quad \mathbf{X}=\left(\begin{array}{cccc}
1 & x_{11} & \cdots & x_{1 p} \
1 & x_{21} & \cdots & x_{2 p} \
\vdots & \vdots & \vdots & \vdots \
1 & x_{n 1} & \cdots & x_{n p}
\end{array}\right)
$$
so $\mathbf{Y}$ is an $n \times 1$ vector and $\mathbf{X}$ is an $n \times(p+1)$ matrix. The $i$ th row of $\mathbf{X}$ will be defined by the symbol $\mathbf{x}_i^{\prime}$, which is a $(p+1) \times 1$ vector for mean functions that include an intercept. Even though $\mathbf{x}_i$ is a row of $\mathbf{X}$, we use the convention that all vectors are column vectors and therefore need to include the transpose on $\mathbf{x}_i^{\prime}$ to represent a row. The first few and the last few rows of the matrix $\mathbf{X}$ and the vector $\mathbf{Y}$ for the fuel data are
$$
\mathbf{X}=\left(\begin{array}{ccccc}
1 & 18.00 & 1031.38 & 23.471 & 16.5271 \
1 & 8.00 & 1031.64 & 30.064 & 13.7343 \
1 & 18.00 & 908.597 & 25.578 & 15.7536 \
\vdots & \vdots & \vdots & \vdots & \vdots \
1 & 25.65 & 904.894 & 21.915 & 15.1751 \
1 & 27.30 & 882.329 & 28.232 & 16.7817 \
1 & 14.00 & 970.753 & 27.230 & 14.7362
\end{array}\right) \quad \mathbf{Y}=\left(\begin{array}{c}
690.264 \
514.279 \
621.475 \
\vdots \
562.411 \
581.794 \
842.792
\end{array}\right)
$$

统计代写|线性回归分析代写linear regression analysis代考|The Errors e

Define the unobservable random vector of errors e elementwise by $e_i=y_i-\mathrm{E}\left(Y \mid X=\mathbf{x}_i\right)=y_i-\mathbf{x}_i^{\prime} \boldsymbol{\beta}$, and $\mathbf{e}=\left(e_1, \ldots, e_n\right)^{\prime}$. The assumptions concerning the $e_i$ s given in Chapter 2 are summarized in matrix form as
$$
\mathrm{E}(\mathbf{e} \mid X)=\mathbf{0} \quad \operatorname{Var}(\mathbf{e} \mid X)=\sigma^2 \mathbf{I}_n
$$
where $\operatorname{Var}(\mathbf{e} \mid X)$ means the covariance matrix of $\mathbf{e}$ for a fixed value of $X, \mathbf{I}_n$ is the $n \times n$ matrix with ones on the diagonal and zeroes everywhere else, and $\mathbf{0}$ is a matrix or vector of zeroes of appropriate size. If we add the assumption of normality, we can write
$$
(\mathbf{e} \mid X) \sim \mathrm{N}\left(\mathbf{0}, \sigma^2 \mathbf{I}_n\right)
$$

线性回归代写

统计代写|线性回归分析代写linear regression analysis代考|Data and Matrix Notation

在本节和接下来的几节中，我们将使用矩阵表示法作为描述数据和执行数据操作的简洁方式。附录a .6包含对矩阵和线性代数的简要介绍，对一些读者可能会有帮助。

假设我们已经观察到$n$案例或单位的数据，这意味着我们有一个值$Y$和每个$n$案例的所有回归量。我们定义
$$
\mathbf{Y}=\left(\begin{array}{c}
y_1 \
y_2 \
\vdots \
y_n
\end{array}\right) \quad \mathbf{X}=\left(\begin{array}{cccc}
1 & x_{11} & \cdots & x_{1 p} \
1 & x_{21} & \cdots & x_{2 p} \
\vdots & \vdots & \vdots & \vdots \
1 & x_{n 1} & \cdots & x_{n p}
\end{array}\right)
$$
所以$\mathbf{Y}$是一个$n \times 1$向量$\mathbf{X}$是一个$n \times(p+1)$矩阵。$\mathbf{X}$的$i$第一行将由符号$\mathbf{x}_i^{\prime}$定义，它是包含截距的均值函数的$(p+1) \times 1$向量。尽管$\mathbf{x}_i$是$\mathbf{X}$的一行，但我们使用所有向量都是列向量的约定，因此需要包含$\mathbf{x}_i^{\prime}$的转置来表示一行。燃料数据的矩阵$\mathbf{X}$和向量$\mathbf{Y}$的前几行和最后几行是
$$
\mathbf{X}=\left(\begin{array}{ccccc}
1 & 18.00 & 1031.38 & 23.471 & 16.5271 \
1 & 8.00 & 1031.64 & 30.064 & 13.7343 \
1 & 18.00 & 908.597 & 25.578 & 15.7536 \
\vdots & \vdots & \vdots & \vdots & \vdots \
1 & 25.65 & 904.894 & 21.915 & 15.1751 \
1 & 27.30 & 882.329 & 28.232 & 16.7817 \
1 & 14.00 & 970.753 & 27.230 & 14.7362
\end{array}\right) \quad \mathbf{Y}=\left(\begin{array}{c}
690.264 \
514.279 \
621.475 \
\vdots \
562.411 \
581.794 \
842.792
\end{array}\right)
$$

统计代写|线性回归分析代写linear regression analysis代考|The Errors e

通过$e_i=y_i-\mathrm{E}\left(Y \mid X=\mathbf{x}_i\right)=y_i-\mathbf{x}_i^{\prime} \boldsymbol{\beta}$和$\mathbf{e}=\left(e_1, \ldots, e_n\right)^{\prime}$定义不可观察的随机误差向量e。第2章给出的关于$e_i$ s的假设以矩阵形式总结为
$$
\mathrm{E}(\mathbf{e} \mid X)=\mathbf{0} \quad \operatorname{Var}(\mathbf{e} \mid X)=\sigma^2 \mathbf{I}_n
$$
其中$\operatorname{Var}(\mathbf{e} \mid X)$表示对于固定值$X, \mathbf{I}_n$, $\mathbf{e}$的协方差矩阵是$n \times n$矩阵，对角线上为1，其他地方为零，$\mathbf{0}$是一个大小适当的零矩阵或矢量。如果我们加上正态性的假设，我们可以写
$$
(\mathbf{e} \mid X) \sim \mathrm{N}\left(\mathbf{0}, \sigma^2 \mathbf{I}_n\right)
$$

统计代写|线性回归分析代写linear regression analysis代考请认准statistics-lab™

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

随机过程代考

在概率论概念中，随机过程是随机变量的集合。若一随机系统的样本点是随机函数，则称此函数为样本函数，这一随机系统全部样本函数的集合是一个随机过程。实际应用中，样本函数的一般定义在时间域或者空间域。 随机过程的实例如股票和汇率的波动、语音信号、视频信号、体温的变化，随机运动如布朗运动、随机徘徊等等。

贝叶斯方法代考

贝叶斯统计概念及数据分析表示使用概率陈述回答有关未知参数的研究问题以及统计范式。后验分布包括关于参数的先验分布，和基于观测数据提供关于参数的信息似然模型。根据选择的先验分布和似然模型，后验分布可以解析或近似，例如，马尔科夫链蒙特卡罗 (MCMC) 方法之一。贝叶斯统计概念及数据分析使用后验分布来形成模型参数的各种摘要，包括点估计，如后验平均值、中位数、百分位数和称为可信区间的区间估计。此外，所有关于模型参数的统计检验都可以表示为基于估计后验分布的概率报表。

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

statistics-lab作为专业的留学生服务机构，多年来已为美国、英国、加拿大、澳洲等留学热门地的学生提供专业的学术服务，包括但不限于Essay代写，Assignment代写，Dissertation代写，Report代写，小组作业代写，Proposal代写，Paper代写，Presentation代写，计算机作业代写，论文修改和润色，网课代做，exam代考等等。写作范围涵盖高中，本科，研究生等海外留学全阶段，辐射金融，经济学，会计学，审计学，管理学等全球99%专业科目。写作团队既有专业英语母语作者，也有海外名校硕博留学生，每位写作老师都拥有过硬的语言能力，专业的学科背景和学术写作经验。我们承诺100%原创，100%专业，100%准时，100%满意。

机器学习代写

随着AI的大潮到来，Machine Learning逐渐成为一个新的学习热点。同时与传统CS相比，Machine Learning在其他领域也有着广泛的应用，因此这门学科成为不仅折磨CS专业同学的“小恶魔”，也是折磨生物、化学、统计等其他学科留学生的“大魔王”。学习Machine learning的一大绊脚石在于使用语言众多，跨学科范围广，所以学习起来尤其困难。但是不管你在学习Machine Learning时遇到任何难题，StudyGate专业导师团队都能为你轻松解决。

多元统计分析代考

基础数据: $N$ 个样本， $P$ 个变量数的单样本，组成的横列的数据表
变量定性: 分类和顺序；变量定量：数值
数学公式的角度分为: 因变量与自变量

时间序列分析代写

随机过程，是依赖于参数的一组随机变量的全体，参数通常是时间。随机变量是随机现象的数量表现，其时间序列是一组按照时间发生先后顺序进行排列的数据点序列。通常一组时间序列的时间间隔为一恒定值（如1秒，5分钟，12小时，7天，1年），因此时间序列可以作为离散时间数据进行分析处理。研究时间序列数据的意义在于现实中，往往需要研究某个事物其随时间发展变化的规律。这就需要通过研究该事物过去发展的历史记录，以得到其自身发展的规律。

回归分析代写

多元回归分析渐进（Multiple Regression Analysis Asymptotics）属于计量经济学领域，主要是一种数学上的统计分析方法，可以分析复杂情况下各影响因素的数学关系，在自然科学、社会和经济学等多个领域内应用广泛。

MATLAB代写

MATLAB 是一种用于技术计算的高性能语言。它将计算、可视化和编程集成在一个易于使用的环境中，其中问题和解决方案以熟悉的数学符号表示。典型用途包括：数学和计算算法开发建模、仿真和原型制作数据分析、探索和可视化科学和工程图形应用程序开发，包括图形用户界面构建MATLAB 是一个交互式系统，其基本数据元素是一个不需要维度的数组。这使您可以解决许多技术计算问题，尤其是那些具有矩阵和向量公式的问题，而只需用 C 或 Fortran 等标量非交互式语言编写程序所需的时间的一小部分。MATLAB 名称代表矩阵实验室。MATLAB 最初的编写目的是提供对由 LINPACK 和 EISPACK 项目开发的矩阵软件的轻松访问，这两个项目共同代表了矩阵计算软件的最新技术。MATLAB 经过多年的发展，得到了许多用户的投入。在大学环境中，它是数学、工程和科学入门和高级课程的标准教学工具。在工业领域，MATLAB 是高效研究、开发和分析的首选工具。MATLAB 具有一系列称为工具箱的特定于应用程序的解决方案。对于大多数 MATLAB 用户来说非常重要，工具箱允许您学习和应用专业技术。工具箱是 MATLAB 函数（M 文件）的综合集合，可扩展 MATLAB 环境以解决特定类别的问题。可用工具箱的领域包括信号处理、控制系统、神经网络、模糊逻辑、小波、仿真等。

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|线性回归分析代写linear regression analysis代考|STAT108

Posted on 2023年7月21日2023年8月24日 by statistics-lab

统计代写|线性回归分析代写linear regression analysis代考|ESTIMATED VARIANCES

Estimates of $\operatorname{Var}\left(\hat{\beta}_0 \mid X\right)$ and $\operatorname{Var}\left(\hat{\beta}_1 \mid X\right)$ are obtained by substituting $\hat{\sigma}^2$ for $\sigma^2$ in (2.11). We use the symbol $\widehat{\operatorname{Var}}(\mathrm{)}$ for an estimated variance. Thus
$$
\begin{aligned}
& \widehat{\operatorname{Var}}\left(\hat{\beta}_1 \mid X\right)=\hat{\sigma}^2 \frac{1}{\mathrm{SXX}} \
& \widehat{\operatorname{Var}}\left(\hat{\beta}_0 \mid X\right)=\hat{\sigma}^2\left(\frac{1}{n}+\frac{\bar{x}^2}{\mathrm{SXX}}\right)
\end{aligned}
$$
The square root of an estimated variance is called a standard error, for which we use the symbol se( ). The use of this notation is illustrated by
$$
\operatorname{se}\left(\hat{\beta}_1 \mid X\right)=\sqrt{\widehat{\operatorname{Var}}\left(\hat{\beta}_1 \mid X\right)}
$$
The terms standard error and standard deviation are sometimes used interchangeably. In this book, an estimated standard deviation always refers to the variability between values of an observable random variable like the response $y_i$ or an unobservable random variance like the errors $e_i$. The term standard error will always refer to the square root of the estimated variance of a statistic like a mean $\bar{y}$, or a regression coefficient $\hat{\beta}_1$.

统计代写|线性回归分析代写linear regression analysis代考|The Intercept

The intercept is used to illustrate the general form of confidence intervals for normally distributed estimates. The standard error of the intercept is $\operatorname{se}\left(\beta_0 \mid X\right)=\hat{\sigma}\left(1 / n+\bar{x}^2 / \mathrm{SXX}\right)^{1 / 2}$. Hence, a $(1-\alpha) \times 100 \%$ confidence interval for the intercept is the set of points $\beta_0$ in the interval
$$
\hat{\beta}_0-t(\alpha / 2, n-2) \operatorname{se}\left(\hat{\beta}_0 \mid X\right) \leq \beta_0 \leq \hat{\beta}_0+t(\alpha / 2, n-2) \operatorname{se}\left(\hat{\beta}_0 \mid X\right)
$$
For Forbes’s data, $\operatorname{se}\left(\hat{\beta}_0 \mid X\right)=0.379\left(1 / 17+(202.953)^2 / 530.724\right)^{1 / 2}=3.340$. For a $90 \%$ confidence interval, $t(0.05,15)=1.753$, and the interval is
$$
\begin{aligned}
-42.138-1.753(3.340) & \leq \beta_0 \leq-42.138+1.753(3.340) \
-47.99 & \leq \beta_0 \leq-36.28
\end{aligned}
$$
Ninety percent of such intervals will include the true value.
A hypothesis test of
$\mathrm{NH}: \quad \beta_0=\beta_0^, \quad \beta_1$ arbitrary $\mathrm{AH}: \quad \beta_0 \neq \beta_0^, \quad \beta_1$ arbitrary is obtained by computing the $t$-statistic
$$
t=\frac{\hat{\beta}_0-\beta_0^*}{\operatorname{se}\left(\hat{\beta}_0 \mid X\right)}
$$
and referring this ratio to the $t$-distribution with $d f=n-2$, the number of $d f$ in the estimate of $\sigma^2$. For example, in Forbes’s data, consider testing the $\mathrm{NH}$ $\beta_0=-35$ against the alternative that $\beta_0 \neq-35$. The statistic is
$$
t=\frac{-42.138-(-35)}{3.34}=-2.137
$$
Since AH is two-sided, the $p$-value corresponds to the probability that a $t(15)$ variable is less than -2.137 or greater than +2.137 , which gives a $p$-value that rounds to 0.05 , providing some evidence against $\mathrm{NH}$. This hypothesis test for these data is not one that would occur to most investigators and is used only as an illustration.

线性回归代写

统计代写|线性回归分析代写linear regression analysis代考|ESTIMATED VARIANCES

通过将式(2.11)中的$\sigma^2$代入$\hat{\sigma}^2$得到$\operatorname{Var}\left(\hat{\beta}_0 \mid X\right)$和$\operatorname{Var}\left(\hat{\beta}_1 \mid X\right)$的估计值。我们使用符号$\widehat{\operatorname{Var}}(\mathrm{)}$表示估计的方差。因此
$$
\begin{aligned}
& \widehat{\operatorname{Var}}\left(\hat{\beta}_1 \mid X\right)=\hat{\sigma}^2 \frac{1}{\mathrm{SXX}} \
& \widehat{\operatorname{Var}}\left(\hat{\beta}_0 \mid X\right)=\hat{\sigma}^2\left(\frac{1}{n}+\frac{\bar{x}^2}{\mathrm{SXX}}\right)
\end{aligned}
$$
估计方差的平方根称为标准误差，我们用符号se()表示。这个符号的用法由
$$
\operatorname{se}\left(\hat{\beta}_1 \mid X\right)=\sqrt{\widehat{\operatorname{Var}}\left(\hat{\beta}_1 \mid X\right)}
$$
标准误差和标准偏差这两个术语有时可以互换使用。在这本书中，估计的标准偏差总是指可观察的随机变量(如响应$y_i$)或不可观察的随机方差(如误差$e_i$)值之间的可变性。术语标准误差总是指统计量估计方差的平方根，如平均值$\bar{y}$，或回归系数$\hat{\beta}_1$。

统计代写|线性回归分析代写linear regression analysis代考|The Intercept

截距用于说明正态分布估计的置信区间的一般形式。截距的标准误差为$\operatorname{se}\left(\beta_0 \mid X\right)=\hat{\sigma}\left(1 / n+\bar{x}^2 / \mathrm{SXX}\right)^{1 / 2}$。因此，截距的$(1-\alpha) \times 100 \%$置信区间是区间中$\beta_0$点的集合
$$
\hat{\beta}_0-t(\alpha / 2, n-2) \operatorname{se}\left(\hat{\beta}_0 \mid X\right) \leq \beta_0 \leq \hat{\beta}_0+t(\alpha / 2, n-2) \operatorname{se}\left(\hat{\beta}_0 \mid X\right)
$$
有关福布斯的数据，请访问$\operatorname{se}\left(\hat{\beta}_0 \mid X\right)=0.379\left(1 / 17+(202.953)^2 / 530.724\right)^{1 / 2}=3.340$。对于$90 \%$置信区间为$t(0.05,15)=1.753$，区间为
$$
\begin{aligned}
-42.138-1.753(3.340) & \leq \beta_0 \leq-42.138+1.753(3.340) \
-47.99 & \leq \beta_0 \leq-36.28
\end{aligned}
$$
90％的这样的间隔将包含真实值。
的假设检验
$\mathrm{NH}: \quad \beta_0=\beta_0^, \quad \beta_1$任意$\mathrm{AH}: \quad \beta_0 \neq \beta_0^, \quad \beta_1$任意通过计算$t$ -统计量得到
$$
t=\frac{\hat{\beta}_0-\beta_0^*}{\operatorname{se}\left(\hat{\beta}_0 \mid X\right)}
$$
并将此比率与$d f=n-2$的$t$ -分布进行比较，在$\sigma^2$的估计值中，$d f$的数量。例如，在福布斯的数据中，考虑将$\mathrm{NH}$$\beta_0=-35$与$\beta_0 \neq-35$的替代选项进行测试。统计数据是
$$
t=\frac{-42.138-(-35)}{3.34}=-2.137
$$
由于AH是双面的，$p$ -值对应于$t(15)$变量小于-2.137或大于+2.137的概率，这使得$p$ -值四舍五入为0.05，从而提供了一些反对$\mathrm{NH}$的证据。这些数据的假设检验并不是大多数调查人员会想到的，只是用作说明。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

随机过程代考

贝叶斯方法代考

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

机器学习代写

多元统计分析代考

基础数据: $N$ 个样本， $P$ 个变量数的单样本，组成的横列的数据表
变量定性: 分类和顺序；变量定量：数值
数学公式的角度分为: 因变量与自变量

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|线性回归分析代写linear regression analysis代考|STAT501

Posted on 2023年7月21日2023年8月24日 by statistics-lab

统计代写|线性回归分析代写linear regression analysis代考|MEAN FUNCTIONS

Imagine a generic summary plot of $Y$ versus $X$. Our interest centers on how the distribution of $Y$ changes as $X$ is varied. One important aspect of this distribution is the mean function, which we define by
$$
\mathrm{E}(Y \mid X=x)=\text { a function that depends on the value of } x
$$
We read the left side of this equation as “the expected value of the response when the predictor is fixed at the value $X=x$ “; if the notation ” $\mathrm{E}(\mathrm{)}$ ” for expectations and “Var( )” for variances is unfamiliar, refer to Appendix A.2. The right side of (1.1) depends on the problem. For example, in the heights data in Example 1.1, we might believe that
$$
\mathrm{E}(\text { dheightlmheight }=x)=\beta_0+\beta_1 x
$$
that is, the mean function is a straight line. This particular mean function has two parameters, an intercept $\beta_0$ and a slope $\beta_1$. If we knew the values of the $\beta \mathrm{s}$, then the mean function would be completely specified, but usually the $\beta \mathrm{s}$ need to be estimated from data. These parameters are discussed more fully in the next chapter.

Figure 1.8 shows two possibilities for the $\beta \mathrm{s}$ in the straight-line mean function (1.2) for the heights data. For the dashed line, $\beta_0=0$ and $\beta_1=1$. This mean function would suggest that daughters have the same height as their mothers on the average for mothers of any height. The second line is estimated using ordinary least squares, or ols, the estimation method that will be described in the next chapter. The ols line has slope less than 1 , meaning that tall mothers tend to have daughters who are taller than average because the slope is positive, but shorter than themselves because the slope is less than 1. Similarly, short mothers tend to have short daughters but taller than themselves. This is perhaps a surprising result and is the origin of the term regression, since extreme values in one generation tend to revert or regress toward the population mean in the next generation (Galton, 1886).

统计代写|线性回归分析代写linear regression analysis代考|VARIANCE FUNCTIONS

Another characteristic of the distribution of the response given the predictor is the variance function, defined by the symbol $\operatorname{Var}(Y \mid X=x)$ and in words as the variance of the response given that the predictor is fixed at $X=x$. For example, in Figure 1.2 we can see that the variance function for dheightlmheight is approximately the same for each of the three values of mheight shown in the graph. In the smallmouth bass data in Figure 1.5, an assumption that the variance is constant across the plot is plausible, even if it is not certain (see Problem 1.2). In the turkey data, we cannot say much about the variance function from the summary plot because we have plotted treatment means rather than the actual pen values, so the graph does not display the information about the variability between pens that have a fixed value of Dose.

A frequent assumption in fitting linear regression models is that the variance function is the same for every value of $x$. This is usually written as
$$
\operatorname{Var}(Y \mid X=x)=\sigma^2
$$
where $\sigma^2$ (read “sigma squared”) is a generally unknown positive constant.

线性回归代写

统计代写|线性回归分析代写linear regression analysis代考|MEAN FUNCTIONS

想象一下$Y$和$X$的一般汇总图。我们的兴趣集中在$Y$的分布如何随着$X$的变化而变化。这个分布的一个重要方面是均值函数，我们用
$$
\mathrm{E}(Y \mid X=x)=\text { a function that depends on the value of } x
$$
我们将这个方程的左边解读为“当预测器固定在$X=x$值时响应的期望值”;如果不熟悉表示期望的“$\mathrm{E}(\mathrm{)}$”和表示方差的“Var()”，请参阅附录A.2。(1.1)的右边取决于问题。例如，在例1.1中的高度数据中，我们可能认为
$$
\mathrm{E}(\text { dheightlmheight }=x)=\beta_0+\beta_1 x
$$
也就是说，均值函数是一条直线。这个特殊的平均函数有两个参数，一个截距$\beta_0$和一个斜率$\beta_1$。如果我们知道$\beta \mathrm{s}$的值，那么均值函数将被完全指定，但通常需要从数据中估计$\beta \mathrm{s}$。这些参数将在下一章中进行更详细的讨论。

图1.8显示了高度数据的直线平均函数(1.2)中$\beta \mathrm{s}$的两种可能性。虚线为$\beta_0=0$和$\beta_1=1$。这个平均函数表明，对于任何身高的母亲来说，女儿的平均身高都与母亲相同。第二行是使用普通最小二乘(ols)估计的，这种估计方法将在下一章中描述。ols线的斜率小于1，这意味着高个子母亲的女儿往往比平均身高高，因为斜率为正，但比自己矮，因为斜率小于1。同样，个子矮的母亲往往生出个子矮但比自己高的女儿。这可能是一个令人惊讶的结果，也是回归一词的起源，因为一代中的极端值往往会在下一代中恢复或回归到种群均值(Galton, 1886)。

统计代写|线性回归分析代写linear regression analysis代考|VARIANCE FUNCTIONS

给定预测器的响应分布的另一个特征是方差函数，由符号$\operatorname{Var}(Y \mid X=x)$定义，用文字表示给定预测器固定为$X=x$的响应的方差。例如，在图1.2中我们可以看到，对于图中所示的三个mheight值，highightlmheight的方差函数大致相同。在图1.5中的小嘴鲈鱼数据中，假设整个图的方差是恒定的是合理的，即使它不确定(参见问题1.2)。在火鸡数据中，我们不能对汇总图的方差函数说太多，因为我们绘制的是治疗手段，而不是实际的笔值，因此该图不显示具有固定剂量值的笔之间的可变性信息。

在拟合线性回归模型时，一个常见的假设是，对于$x$的每个值，方差函数是相同的。这通常写成
$$
\operatorname{Var}(Y \mid X=x)=\sigma^2
$$
其中$\sigma^2$(读作“sigma平方”)是一个通常未知的正常数。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

随机过程代考

贝叶斯方法代考

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

机器学习代写

多元统计分析代考

基础数据: $N$ 个样本， $P$ 个变量数的单样本，组成的横列的数据表
变量定性: 分类和顺序；变量定量：数值
数学公式的角度分为: 因变量与自变量

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|线性回归分析代写linear regression analysis代考|SAMPLING FROM A NORMAL POPULATION

Posted on 2023年7月1日2023年7月1日 by statistics-lab

统计代写|线性回归分析代写linear regression analysis代考|SAMPLING FROM A NORMAL POPULATION

Much of the intuition for the use of least squares estimation is based on the assumption that the observed data are a sample from a multivariate normal population. While the assumption of multivariate normality is almost never tenable in practical regression problems, it is worthwhile to explore the relevant results for normal data, first assuming random sampling and then removing that assumption.

Suppose that all of the observed variables are normal random variables, and the observations on each case are independent of the observations on each other case. In a two-variable problem, for the $i$ th case observe $\left(x_i, y_i\right)$, and suppose that
$$
\left(\begin{array}{c}
x_i \
y_i
\end{array}\right) \sim \mathrm{N}\left(\left(\begin{array}{c}
\mu_x \
\mu_y
\end{array}\right),\left(\begin{array}{cc}
\sigma_x^2 & \rho_{x y} \sigma_x \sigma_y \
\rho_{x y} \sigma_x \sigma_y & \sigma_y^2
\end{array}\right)\right)
$$
Equation (4.9) says that $x_i$ and $y_i$ are each realizations of normal random variables with means $\mu_x$ and $\mu_y$, variances $\sigma_x^2$ and $\sigma_y^2$ and correlation $\rho_{x y}$. Now, suppose we consider the conditional distribution of $y_i$ given that we have already observed the value of $x_i$. It can be shown (see e.g., Lindgren, 1993; Casella and Berger, 1990 ) that the conditional distribution of $y_i$ given $x_i$, is normal and,
$$
y_i \mid x_i \sim \mathrm{N}\left(\mu_y+\rho_{x y} \frac{\sigma_y}{\sigma_x}\left(x_i-\mu_x\right), \sigma_y^2\left(1-\rho_{x y}^2\right)\right)
$$
If we define
$$
\beta_0=\mu_y-\beta_1 \mu_x \quad \beta_1=\rho_{x y} \frac{\sigma_y}{\sigma_x} \quad \sigma^2=\sigma_y^2\left(1-\rho_{x y}^2\right)
$$
then the conditional distribution of $y_i$ given $x_i$ is simply
$$
y_i \mid x_i \sim \mathrm{N}\left(\beta_0+\beta_1 x_i, \sigma^2\right)
$$
which is essentially the same as the simple regression model with the added assumption of normality.

统计代写|线性回归分析代写linear regression analysis代考|Simple Linear Regression and R2

In simple regression linear problems, we can always determine the appropriateness of $R^2$ as a summary by examining the summary graph of the response versus the predictor. If the plot looks like a sample from a bivariate normal population, as in Figure $4.2 \mathrm{a}$, then $R^2$ is a useful measure. The less the graph looks like this figure, the less useful is $R^2$ as a summary measure.

Figure 4.3 shows six summary graphs. Only for the first three of them is $R^2$ a useful summary of the regression problem. In Figure 4.3e, the mean function appears curved rather than straight so correlation is a poor measure of dependence. In Figure $4.3 \mathrm{~d}$ the value of $R^2$ is virtually determined by one point, making $R^2$ necessarily unreliable. The regular appearance of the remaining plot suggests a different type of problem. We may have several identifiable groups of points caused by a lurking variable not included in the mean function, such that the mean function for each group has a negative slope, but when groups are combined the slope becomes positive. Once again $R^2$ is not a useful summary of this graph.

In multiple linear regression, $R^2$ can also be interpreted as the square of the correlation in a summary graph, this time of $Y$ versus fitted values $\hat{Y}$. This plot can be interpreted exactly the same way as the plot of the response versus the single term in simple linear regression to decide on the usefulness of $R^2$ as a summary measure.

For other regression methods such as nonlinear regression, we can define $R^2$ to be the square of the correlation between the response and the fitted values, and use this summary graph to decide if $R^2$ is a useful summary.

线性回归代写

统计代写|线性回归分析代写linear regression analysis代考|SAMPLING FROM A NORMAL POPULATION

统计代写|线性回归分析代写linear regression analysis代考|Simple Linear Regression and R2

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

随机过程代考

贝叶斯方法代考

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

机器学习代写

多元统计分析代考

基础数据: $N$ 个样本， $P$ 个变量数的单样本，组成的横列的数据表
变量定性: 分类和顺序；变量定量：数值
数学公式的角度分为: 因变量与自变量

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|线性回归分析代写linear regression analysis代考|UNDERSTANDING PARAMETER ESTIMATES

Posted on 2023年7月1日2023年7月1日 by statistics-lab

统计代写|线性回归分析代写linear regression analysis代考|UNDERSTANDING PARAMETER ESTIMATES

Parameters in mean functions have units attached to them. For example, the fitted mean function for the fuel consumption data is
$$
\mathrm{E}(\text { Fuel } \mid X)=154.19-4.23 \text { Tax }+0.47 \text { Dic }-6.14 \text { Income }+18.54 \log (\text { Miles })
$$
Fuel is measured in gallons, and so all the quantities on the right of this equation must also be in gallons. The intercept is 154.19 gallons. Since Income is measured in thousands of dollars, the coefficient for Income must be in gallons per thousand dollars of income. Similarly, the units for the coefficient for Tax is gallons per cent of $\operatorname{tax}$.
Rate of Change
The usual interpretation of an estimated coefficient is as a rate of change: increasing Tax rate by one cent should decrease consumption, all other factors being held constant, by about 4.23 gallons per person. This assumes that a predictor can in fact be changed without affecting the other terms in the mean function and that the available data will apply when the predictor is so changed. The fuel data are observational since the assignment of values for the predictors was not under the control of the analyst, so whether increasing taxes would cause a decrease in fuel consumption cannot be assessed from these data. From these data, we can observe association but not cause: states with higher tax rates are observed to have lower fuel consumption. To draw conclusions concerning the effects of changing tax rates, the rates must in fact be changed and the results observed.

The coefficient estimate of $\log ($ Miles) is 18.55 , meaning that a change of one unit in $\log ($ Miles $)$ is associated with an 18.55 gallon per person increase in consumption. States with more roads have higher per capita fuel consumption. Since we used base-two logarithms in this problem, increasing $\log$ (Miles) by one unit means that the value of Miles doubles. If we double the amount of road in a state, we expect to increase fuel consumption by about 18.55 gallons per person. If we had used base-ten logarithms, then the fitted mean function would be
$$
\mathrm{E}(\text { Fuel } \mid X)=154.19-4.23 \text { Tax }+0.47 \text { Dlic }-6.14 \text { Income }+61.61 \log {10}(\text { Miles }) $$ The only change in the fitted model is for the coefficient for the log of Miles, which is now interpreted as a change in expected Fuel consumption when $\log {10}$ (Miles) increases by one unit, or when Miles is multiplied by 10 .

统计代写|线性回归分析代写linear regression analysis代考|Signs of Estimates

The sign of a parameter estimate indicates the direction of the relationship between the term and the response. In multiple regression, if the terms are correlated, the sign of a coefficient may change depending on the other terms in the model. While this is mathematically possible and, occasionally, scientifically reasonable, it certainly makes interpretation more difficult. Sometimes this problem can be removed by redefining the terms into new linear combinations that are easier to interpret.
Interpretation Depends on Other Terms in the Mean Function
The value of a parameter estimate not only depends on the other terms in a mean function but it can also change if the other terms are replaced by linear combinations of the terms.
Berkeley Guidance Study
Data from the Berkeley Guidance Study on the growth of boys and girls are given in Problem 3.1. As in Problem 3.1, we will view Soma as the response, but consider the three predictors $W T 2, W T 9, W T 18$ for the $n=70$ girls in the study. The scatterplot matrix for these four variables is given in Figure 4.1. First look at the last row of this figure, giving the marginal response plots of Soma versus each of the three potential predictors. For each of these plots, we see that Soma is increasing with the potential predictor on the average, although the relationship is strongest at the oldest age and weakest at the youngest age. The two-dimensional plots of each pair of predictors suggest that the predictors are correlated among themselves. Taken together, we have evidence that the regression on all three predictors cannot be viewed as just the sum of the three separate simple regressions because we must account for the correlations between the terms.

We will proceed with this example using the three original predictors as terms and Soma as the response. We are encouraged to do this because of the appearance of the scatterplot matrix. Since each of the two-dimensional plots appear to be well summarized by a straight-line mean function, we will see later that this suggests that the regression of the response on the original predictors without transformation is likely to be appropriate.

The parameter estimates for the regression of Soma on WT2, WT9, and WT18 given in the column marked “Model 1” in Table 4.1 leads to the unexpected conclusion that heavier girls at age two may tend to be thinner, have lower expected somatotype, at age 18 . We reach this conclusion because the $t$-statistic for testing the coefficient equal to zero, which is not shown in the table, has a significance level of about 0.06 . The sign, and the weak significance, may be due to the correlations between the terms. In place of the preceding variables, consider the following:
$$
\begin{aligned}
W T 2 & =\text { Weight at age } 2 \
D W 9=W T 9-W T 2 & =\text { Weight gain from age } 2 \text { to } 9 \
D W 18=W T 18-W T 9 & =\text { Weight gain from age } 9 \text { to } 18
\end{aligned}
$$

线性回归代写

统计代写|线性回归分析代写linear regression analysis代考|UNDERSTANDING PARAMETER ESTIMATES

平均函数中的参数有附加的单位。例如，油耗数据的拟合均值函数为
$$
\mathrm{E}(\text { Fuel } \mid X)=154.19-4.23 \text { Tax }+0.47 \text { Dic }-6.14 \text { Income }+18.54 \log (\text { Miles })
$$
燃料的单位是加仑，所以等式右边的所有量也必须是加仑。截距是154.19加仑。由于收入是以千美元为单位计算的，因此收入系数必须以每千美元收入的加仑为单位。同样，税收系数的单位是$\operatorname{tax}$的加仑％。
变化率
对估计系数的通常解释是变化率:在所有其他因素保持不变的情况下，税率每增加一美分，人均消费量应减少约4.23加仑。这假设预测器实际上可以改变而不影响平均函数中的其他项，并且当预测器发生这种改变时，可用数据将适用。燃料数据是观察性的，因为预测值的分配不在分析人员的控制之下，因此增加税收是否会导致燃料消耗的减少无法从这些数据中评估。从这些数据中，我们可以观察到关联，但不能观察到原因:观察到税率较高的州燃料消耗较低。为了得出关于税率变化的影响的结论，税率实际上必须改变，结果必须观察到。

$\log ($英里数)的系数估计值为18.55，这意味着$\log ($英里数$)$每变化一个单位，人均消耗就会增加18.55加仑。拥有更多道路的州人均燃料消耗量更高。由于我们在这个问题中使用了以2为底的对数，因此将$\log$(英里)增加一个单位意味着英里的值加倍。如果我们将一个州的道路数量增加一倍，我们预计每人将增加约18.55加仑的燃料消耗。如果我们使用以10为底的对数，那么拟合的均值函数将是
$$
\mathrm{E}(\text { Fuel } \mid X)=154.19-4.23 \text { Tax }+0.47 \text { Dlic }-6.14 \text { Income }+61.61 \log {10}(\text { Miles }) $$拟合模型中唯一的变化是里程对数的系数，当$\log {10}$(里程)增加一个单位或里程乘以10时，它现在被解释为预期燃料消耗的变化。

统计代写|线性回归分析代写linear regression analysis代考|Signs of Estimates

参数估计的符号表示项和响应之间关系的方向。在多元回归中，如果项是相关的，则系数的符号可能会根据模型中的其他项而变化。虽然这在数学上是可能的，有时在科学上也是合理的，但它确实使解释变得更加困难。有时，这个问题可以通过将术语重新定义为更容易解释的新线性组合来解决。
解释取决于平均函数中的其他项
参数估计的值不仅取决于平均函数中的其他项，而且如果其他项被这些项的线性组合所取代，它也会改变。
伯克利导引研究
关于男孩和女孩成长的伯克利指导研究数据在问题3.1中给出。与问题3.1一样，我们将把Soma视为响应，但考虑研究中$n=70$女孩的三个预测因子$W T 2, W T 9, W T 18$。这四个变量的散点图矩阵如图4.1所示。首先看一下这张图的最后一行，给出了Soma相对于三种潜在预测因子的边际响应图。对于每一个图，我们看到Soma随着潜在预测因子的平均增长而增加，尽管这种关系在年龄最大的时候最强，在年龄最小的时候最弱。每对预测因子的二维图表明，预测因子之间是相关的。综上所述，我们有证据表明，所有三个预测因子的回归不能仅仅被视为三个单独的简单回归的总和，因为我们必须考虑到这些项之间的相关性。

我们将使用三个原始预测器作为项并使用Soma作为响应来继续这个示例。我们被鼓励这样做，因为散点图矩阵的出现。由于每个二维图似乎都被一个直线平均函数很好地概括了，我们稍后将看到，这表明，对原始预测因子的响应进行回归而不进行转换可能是合适的。

表4.1“模型1”一栏给出的Soma对WT2、WT9和WT18的回归参数估计，得出了意想不到的结论，即2岁时较重的女孩在18岁时可能更瘦，期望的体型更低。我们得出这个结论是因为表中没有显示的用于检验系数等于零的$t$ -统计量的显著性水平约为0.06。符号和弱显著性可能是由于术语之间的相关性。代替前面的变量，考虑以下变量:
$$
\begin{aligned}
W T 2 & =\text { Weight at age } 2 \
D W 9=W T 9-W T 2 & =\text { Weight gain from age } 2 \text { to } 9 \
D W 18=W T 18-W T 9 & =\text { Weight gain from age } 9 \text { to } 18
\end{aligned}
$$

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

随机过程代考

贝叶斯方法代考

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

机器学习代写

多元统计分析代考

基础数据: $N$ 个样本， $P$ 个变量数的单样本，组成的横列的数据表
变量定性: 分类和顺序；变量定量：数值
数学公式的角度分为: 因变量与自变量

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|线性回归分析代写linear regression analysis代考|ORDINARY LEAST SQUARES

Posted on 2023年7月1日2023年7月1日 by statistics-lab

统计代写|线性回归分析代写linear regression analysis代考|ORDINARY LEAST SQUARES

From the initial collection of potential predictors, we have computed a set of $p+1$ terms, including an intercept, $X=\left(X_0, X_1, \ldots, X_p\right)$. The mean function and variance function for multiple linear regression are
$$
\begin{aligned}
\mathrm{E}(Y \mid X) & =\beta_0+\beta_1 X_1+\cdots+\beta_p X_p \
\operatorname{Var}(Y \mid X) & =\sigma^2
\end{aligned}
$$
Both the $\beta$ s and $\sigma^2$ are unknown parameters that need to be estimated.
Data and Matrix Notation
Suppose we have observed data for $n$ cases or units, meaning we have a value of $Y$ and all of the terms for each of the $n$ cases. We have symbols for the response and the terms using matrices and vectors; see Appendix A.6 for a brief introduction. We define
$$
\mathbf{Y}=\left(\begin{array}{c}
y_1 \
y_2 \
\vdots \
y_n
\end{array}\right) \quad \mathbf{X}=\left(\begin{array}{cccc}
1 & x_{11} & \cdots & x_{1 p} \
1 & x_{21} & \cdots & x_{2 p} \
\vdots & \vdots & \vdots & \vdots \
1 & x_{n 1} & \cdots & x_{n p}
\end{array}\right)
$$
so $\mathbf{Y}$ is an $n \times 1$ vector and $\mathbf{X}$ is an $n \times(p+1)$ matrix. We also define $\boldsymbol{\beta}$ to be a $(p+1) \times 1$ vector of regression coefficients and $\mathbf{e}$ to be the $n \times 1$ vector of statistical errors,
$$
\boldsymbol{\beta}=\left(\begin{array}{c}
\beta_0 \
\beta_1 \
\vdots \
\beta_p
\end{array}\right) \quad \text { and } \quad \mathbf{e}=\left(\begin{array}{c}
e_1 \
e_2 \
\vdots \
e_n
\end{array}\right)
$$

统计代写|线性回归分析代写linear regression analysis代考|Variance-Covariance Matrix of e

The $51 \times 1$ error vector is an unobservable random vector, as in Appendix A.6. The assumptions concerning the $e_i$ s given in Chapter 2 are summarized in matrix form as
$$
\mathrm{E}(\mathbf{e})=\mathbf{0} \quad \operatorname{Var}(\mathbf{e})=\sigma^2 \mathbf{I}_n
$$
where $\operatorname{Var}(\mathbf{e})$ means the covariance matrix of $\mathbf{e}, \mathbf{I}_n$ is the $n \times n$ matrix with ones on the diagonal and zeroes everywhere else, and $\mathbf{0}$ is a matrix or vector of zeroes of appropriate size. If we add the assumption of normality, we can write
$$
\mathbf{e} \sim \mathrm{N}\left(\mathbf{0}, \sigma^2 \mathbf{I}_n\right)
$$
Ordinary Least Squares Estimators
The least squares estimate $\hat{\boldsymbol{\beta}}$ of $\boldsymbol{\beta}$ is chosen to minimize the residual sum of squares function
$$
R S S(\boldsymbol{\beta})=\sum\left(y_i-\mathbf{x}_i^{\prime} \boldsymbol{\beta}\right)^2=(\mathbf{Y}-\mathbf{X} \boldsymbol{\beta})^{\prime}(\mathbf{Y}-\mathbf{X} \boldsymbol{\beta})
$$
The oLs estimates can be found from (3.8) by differentiation in a matrix analog to the development of Appendix A.3. The ols estimate is given by the formula
$$
\hat{\boldsymbol{\beta}}=\left(\mathbf{X}^{\prime} \mathbf{X}\right)^{-1} \mathbf{X}^{\prime} \mathbf{Y}
$$
provided that the inverse $\left(\mathbf{X}^{\prime} \mathbf{X}\right)^{-1}$ exists. The estimator $\hat{\boldsymbol{\beta}}$ depends only on the sufficient statistics $\mathbf{X}^{\prime} \mathbf{X}$ and $\mathbf{X}^{\prime} \mathbf{Y}$, which are matrices of uncorrected sums of squares and cross-products.

线性回归代写

统计代写|线性回归分析代写linear regression analysis代考|ORDINARY LEAST SQUARES

从潜在预测因子的初始集合中，我们计算了一组$p+1$项，包括截距$X=\left(X_0, X_1, \ldots, X_p\right)$。多元线性回归的均值函数和方差函数为
$$
\begin{aligned}
\mathrm{E}(Y \mid X) & =\beta_0+\beta_1 X_1+\cdots+\beta_p X_p \
\operatorname{Var}(Y \mid X) & =\sigma^2
\end{aligned}
$$
$\beta$ s和$\sigma^2$都是未知参数，需要进行估计。
数据和矩阵表示法
假设我们已经观察到$n$案例或单位的数据，这意味着我们有一个值$Y$和每个$n$案例的所有项。我们用符号表示响应用矩阵和向量表示项;参见附录a .6的简要介绍。我们定义
$$
\mathbf{Y}=\left(\begin{array}{c}
y_1 \
y_2 \
\vdots \
y_n
\end{array}\right) \quad \mathbf{X}=\left(\begin{array}{cccc}
1 & x_{11} & \cdots & x_{1 p} \
1 & x_{21} & \cdots & x_{2 p} \
\vdots & \vdots & \vdots & \vdots \
1 & x_{n 1} & \cdots & x_{n p}
\end{array}\right)
$$
所以$\mathbf{Y}$是一个$n \times 1$向量$\mathbf{X}$是一个$n \times(p+1)$矩阵。我们还定义$\boldsymbol{\beta}$为回归系数的$(p+1) \times 1$向量，$\mathbf{e}$为统计误差的$n \times 1$向量，
$$
\boldsymbol{\beta}=\left(\begin{array}{c}
\beta_0 \
\beta_1 \
\vdots \
\beta_p
\end{array}\right) \quad \text { and } \quad \mathbf{e}=\left(\begin{array}{c}
e_1 \
e_2 \
\vdots \
e_n
\end{array}\right)
$$

统计代写|线性回归分析代写linear regression analysis代考|Variance-Covariance Matrix of e

$51 \times 1$错误向量是一个不可观察的随机向量，如附录A.6所示。第2章给出的关于$e_i$ s的假设以矩阵形式总结为
$$
\mathrm{E}(\mathbf{e})=\mathbf{0} \quad \operatorname{Var}(\mathbf{e})=\sigma^2 \mathbf{I}_n
$$
其中$\operatorname{Var}(\mathbf{e})$表示$\mathbf{e}, \mathbf{I}_n$的协方差矩阵是$n \times n$矩阵，对角线上为1，其他地方为零，$\mathbf{0}$是一个大小适当的零矩阵或向量。如果我们加上正态性的假设，我们可以写
$$
\mathbf{e} \sim \mathrm{N}\left(\mathbf{0}, \sigma^2 \mathbf{I}_n\right)
$$
普通最小二乘估计
选择$\boldsymbol{\beta}$的最小二乘估计$\hat{\boldsymbol{\beta}}$来最小化残差平方和函数
$$
R S S(\boldsymbol{\beta})=\sum\left(y_i-\mathbf{x}_i^{\prime} \boldsymbol{\beta}\right)^2=(\mathbf{Y}-\mathbf{X} \boldsymbol{\beta})^{\prime}(\mathbf{Y}-\mathbf{X} \boldsymbol{\beta})
$$
oLs估计值可以从(3.8)中通过矩阵模拟的微分得到，直至附录a .3的发展。ols估计由公式给出
$$
\hat{\boldsymbol{\beta}}=\left(\mathbf{X}^{\prime} \mathbf{X}\right)^{-1} \mathbf{X}^{\prime} \mathbf{Y}
$$
前提是逆$\left(\mathbf{X}^{\prime} \mathbf{X}\right)^{-1}$存在。估计量$\hat{\boldsymbol{\beta}}$只依赖于充分统计量$\mathbf{X}^{\prime} \mathbf{X}$和$\mathbf{X}^{\prime} \mathbf{Y}$，它们是未校正的平方和和叉积的矩阵。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

随机过程代考

贝叶斯方法代考

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

机器学习代写

多元统计分析代考

基础数据: $N$ 个样本， $P$ 个变量数的单样本，组成的横列的数据表
变量定性: 分类和顺序；变量定量：数值
数学公式的角度分为: 因变量与自变量

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|线性回归分析代写linear regression analysis代考|The Box-Cox Transformation

Posted on 2023年6月25日2023年6月25日 by statistics-lab

统计代写|线性回归分析代写linear regression analysis代考|The Box-Cox Transformation

Recall that an objective way to compare different models is to compare their log likelihoods. Recall also that the appropriate likelihood is always for the untransformed $Y$, which means you need to figure out how the classical model for the transformed data $\left(f(Y)=\beta_0+\beta_1 X+\varepsilon\right)$ translates into a model $Y \mid X=x \sim p(y \mid x, \theta)$ for the original data. The likelihood you need is then the product of the $p\left(y_i \mid x_i, \theta\right)$ terms ( $n$ terms).
The Box-Cox method considers the family of transformed models
$$
Y^\lambda=\beta_0+\beta_1 X+\varepsilon, \text { for } \lambda \neq 0,
$$
with $\lambda=0$ given by the logarithmic model
$$
\ln (Y)=\beta_0+\beta_1 X+\varepsilon
$$
Special cases of interest are $\lambda=1$ (implying no transformation), $\lambda=0$ (log transform), $\lambda=-1$ (inverse transform), and $\lambda=0.5$ (square root transform).

Each $\lambda$ induces a model for the original data of the form $Y \mid X=x \sim p_\lambda(y \mid x, \theta)$, and all of these $p_\lambda(y \mid x, \theta)$ distributions are non-normal except in the untransformed case where $\lambda=1$. For example, as shown above, the case $\lambda=0$ implies $p_0(y \mid x, \theta)$ is a lognormal distribution. The maximized log likelihoods for the original $Y$ data, which involve these (non-normal) distributions $p_\lambda(y \mid x, \theta)$, one for each possible $\lambda$, can then be compared to select “good” values of $\lambda$. You can use the “boxcox” function on a fitted Im object in $\mathrm{R}$ to estimate the “best” $\lambda$, but you first need to install the MASS library.

统计代写|线性回归分析代写linear regression analysis代考|Transforming Both $Y$ and $X$

It should be clear from the examples above that you can transform either $Y$ alone or $X$ alone. But you might have had an idea in your head that if you transform $Y$, then you should also transform $X$. In some cases, this is reasonable. After all, if $Y$ and $X$ have a relationship that is close to linear to begin with, then transforming one without the other will automatically introduce severe nonlinear curvature, badly violating the linearity assumption.

For example, consider total crimes $(Y)$ in a U.S. state as it relates to the population covered $(X)$, as reported by the FBI. The following code shows four scatterplots: $(X, Y),(X, \ln (Y))$, $(\ln (X), Y)$, and $(\ln (X), \ln (Y))$. It is clear, in this example, that if you transform one variable, then you must also transform the other.
Cr=read.table (“https://raw.githubusercontent. com/andrea2719/
URA-DataSets/master/Crimes_2012.txt”)
attach(Cr); par(mfrow=C $(2,2))$
plot (Tot_Crimes $~$ Pop_Cov)
lcrimes = log(Tot_Crimes); lpop = log(Pop_Cov)
plot (Tot_Crimes lpop); plot(lcrimes Pop_Cov); plot(lcrimes lpop)
Cr=read.table (“https://raw.githubusercontent. com/andrea2719/
URA-Datasets/master/crimes_2012.txt”)
$\operatorname{attach}(\mathrm{Cr}) ;$ par $(\operatorname{mfrow}=\mathrm{c}(2,2))$
plot (Tot_Crimes Pop_Cov)
lcrimes $=\log ($ Tot_Crimes $) ; \operatorname{lpop}=\log ($ Pop_Cov $)$
plot (Tot_Crimes lpop); plot (lcrimes Pop_Cov); plot (lcrimes lpop)
But it is not always needed to transform $X$ when you transform $Y$. In the computer example there was no need to transform the $X$ variable, RAM, even though we transformed $Y$, time, to $1 / Y$.

Among all of the possible combinations of $X$ – and $Y$-transforms, you can compare likelihood-based statistics to see, objectively, which model is best supported by the data. You can also evaluate the models according to how badly violated are assumptions, as discussed in Chapter 4, because higher log likelihood does not necessarily validate the assumptions of the model. Rather, the highest log likelihood among a collection of models that you are considering simply identifies the model that fits the data best among those you have considered. There may be other, better models that you have not considered. Further, even if the correct model is in your set of models, it will not necessarily have the best log likelihood because of randomness: Every random sample gives you a different $\log$ likelihood.

If you are considering transformations of the form $(f(X), g(Y))$, simply create the two new variables $U=f(X)$ and $W=g(Y)$, and evaluate the assumptions in terms of the transformed $(U, W)$ data as indicated in Chapter 4. If all looks reasonable, then the classical regression model
$$
W=\beta_0+\beta_1 U+\varepsilon
$$
is a reasonable model.

线性回归代写

统计代写|线性回归分析代写linear regression analysis代考|The Box-Cox Transformation

回想一下，比较不同模型的客观方法是比较它们的对数似然。还记得，适当的可能性总是针对未转换的$Y$，这意味着您需要弄清楚如何将转换后数据的经典模型$\left(f(Y)=\beta_0+\beta_1 X+\varepsilon\right)$转换为原始数据的模型$Y \mid X=x \sim p(y \mid x, \theta)$。然后，您需要的可能性是$p\left(y_i \mid x_i, \theta\right)$项的乘积($n$项)。
Box-Cox方法考虑转换后的模型族
$$
Y^\lambda=\beta_0+\beta_1 X+\varepsilon, \text { for } \lambda \neq 0,
$$
用$\lambda=0$给出的对数模型
$$
\ln (Y)=\beta_0+\beta_1 X+\varepsilon
$$
我们感兴趣的特殊情况有$\lambda=1$(意味着没有变换)、$\lambda=0$(对数变换)、$\lambda=-1$(逆变换)和$\lambda=0.5$(平方根变换)。

每个$\lambda$都为表单$Y \mid X=x \sim p_\lambda(y \mid x, \theta)$的原始数据引入了一个模型，除了$\lambda=1$未转换的情况外，所有这些$p_\lambda(y \mid x, \theta)$分布都是非正态分布。例如，如上所示，情况$\lambda=0$意味着$p_0(y \mid x, \theta)$是对数正态分布。原始$Y$数据的最大日志似然，包括这些(非正态)分布$p_\lambda(y \mid x, \theta)$，每个可能的$\lambda$一个，然后可以与$\lambda$的选择“好”值进行比较。您可以在$\mathrm{R}$中对一个拟合的Im对象使用“boxcox”函数来估计“最佳”$\lambda$，但是您首先需要安装MASS库。

统计代写|线性回归分析代写linear regression analysis代考|Transforming Both $Y$ and $X$

从上面的示例可以清楚地看出，您可以单独转换$Y$或单独转换$X$。但是你可能在脑子里有一个想法如果你变换$Y$，那么你也应该变换$X$。在某些情况下，这是合理的。毕竟，如果$Y$和$X$的关系一开始就接近线性，那么只变换其中一个会自动引入严重的非线性曲率，严重违反线性假设。

例如，考虑美国一个州的总犯罪$(Y)$，因为它与所覆盖的人口$(X)$有关，这是FBI报告的。下面的代码显示了四个散点图:$(X, Y),(X, \ln (Y))$、$(\ln (X), Y)$和$(\ln (X), \ln (Y))$。很明显，在这个例子中，如果你变换一个变量，那么你也必须变换另一个变量。
Cr=read。表(“https://raw.githubusercontent。com/andrea2719/
URA-DataSets/master/Crimes＿2012.txt”)
附件(Cr);par(mfrow=C $(2,2))$
剧情(Tot＿Crimes $~$ Pop＿Cov)
lcrimes = log(Tot＿Crimes);lpop = log(Pop＿Cov)
plot (Tot＿Crimes);情节(lcrimes Pop＿Cov);情节(犯罪)
Cr=read。表(“https://raw.githubusercontent。com/andrea2719/
URA-Datasets/master/crimes＿2012.txt”)
$\operatorname{attach}(\mathrm{Cr}) ;$ par $(\operatorname{mfrow}=\mathrm{c}(2,2))$
剧情(Tot＿Crimes Pop＿Cov)
lcrimes $=\log ($ Tot＿Crimes $) ; \operatorname{lpop}=\log ($ Pop＿Cov $)$
plot (Tot＿Crimes);情节(lcrimes Pop＿Cov);情节(犯罪)
但是在转换$Y$时并不总是需要转换$X$。在计算机示例中，不需要转换$X$变量RAM，尽管我们将$Y$时间转换为$1 / Y$。

在$X$ -和$Y$ -转换的所有可能组合中，您可以比较基于似然的统计数据，客观地查看哪个模型最受数据支持。您也可以根据假设被违背的程度来评估模型，正如第4章所讨论的那样，因为更高的对数似然不一定能验证模型的假设。相反，在您正在考虑的模型集合中，最高的对数似然只是标识出您所考虑的模型中最适合数据的模型。也许还有其他更好的模型是你没有考虑到的。此外，即使正确的模型在您的模型集中，由于随机性，它也不一定具有最佳对数似然:每个随机样本都会给您一个不同的$\log$似然。

如果您正在考虑对表单$(f(X), g(Y))$进行转换，只需创建两个新变量$U=f(X)$和$W=g(Y)$，并根据转换后的$(U, W)$数据评估假设，如第4章所示。如果一切看起来合理，那么经典回归模型
$$
W=\beta_0+\beta_1 U+\varepsilon
$$
是一个合理的模型。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

随机过程代考

贝叶斯方法代考

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

机器学习代写

多元统计分析代考

基础数据: $N$ 个样本， $P$ 个变量数的单样本，组成的横列的数据表
变量定性: 分类和顺序；变量定量：数值
数学公式的角度分为: 因变量与自变量

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|线性回归分析代写linear regression analysis代考|Logarithmic Transformation of the $Y$ data

Posted on 2023年6月25日2023年6月25日 by statistics-lab

统计代写|线性回归分析代写linear regression analysis代考|Logarithmic Transformation of the $Y$ data

If you transform the $Y$ variable to $f(Y)$ but not the $X$ variable, then you think the model
$$
f(Y)=\beta_0+\beta_1 X+\varepsilon
$$
is better than the model $Y=\beta_0+\beta_1 X+\varepsilon$. As with transformation of $X$, in order to use this model successfully, you must understand what this model states in the original (untransformed) $(X, Y)$ data. Here,
$$
Y=f^{-1}\left{\beta_0+\beta_1 X+\varepsilon\right}
$$
where $f^{-1}$ is the inverse function (not the inverse of the function). You find the inverse function simply by solving the model equation $\left(f(Y)=\beta_0+\beta_1 X+\varepsilon\right)$ for $Y$.

For example, if $f(Y)=\ln (Y)$, then $Y=f^{-1}{f(Y)}=\exp {f(Y)}$, and the model in terms of the original units is then
$$
Y=\exp \left(\beta_0+\beta_1 X+\varepsilon\right),
$$
or equivalently,
$$
Y=\exp \left(\beta_0\right) \times \exp \left(\beta_1 X\right) \times \exp (\varepsilon)
$$
Notice now that the error term is multiplicative, rather than additive. Along with Jensen’s inequality, the multiplicative error implies that the function $\exp \left(\beta_0\right) \times \exp \left(\beta_1 X\right)$ is not the conditional mean. To see why not, note that
$$
\begin{aligned}
\mathrm{E}(Y \mid X=x) & =\exp \left(\beta_0\right) \times \exp \left(\beta_1 x\right) \times \mathrm{E}{\exp (\varepsilon \mid X=x)} \
& =\exp \left(\beta_0\right) \times \exp \left(\beta_1 x\right) \times \mathrm{E}{\exp (\varepsilon)}
\end{aligned}
$$
But, since $\exp (\cdot)$ is a convex function, $\mathrm{E}{\exp (\varepsilon)}>\exp {\mathrm{E}(\varepsilon)}=\exp (0)=1$, so that $\mathrm{E}(Y \mid X=x)>\exp \left(\beta_0\right) \times \exp \left(\beta_1 x\right)$. Thus, the back-transformed function, $\exp \left(\beta_0\right) \times \exp \left(\beta_1 x\right)$, is no longer the mean function of the untransformed data.

统计代写|线性回归分析代写linear regression analysis代考|Log Transforming Income

The “Charity” data set consists of tax return information and contains variables related to charitable contributions. One variable is adjusted gross income (Income. AGI) and another is number of dependents (DEPS). A question of interest is, what is the nature of the relationship between the income distributions and number of dependents? Do the distributions shift toward higher incomes, or toward lower incomes, as the number of dependents increases?

Fitting the model using logarithmically transformed adjusted gross income is performed using the following $\mathrm{R}$ code.
charity $=$ read.csv (“https://raw.githubusercontent.com/andrea $2719 /$
URA-Datasets/master/charitytax.csv”)
attach (charity)
$\ln$. AGI $=\log ($ Income. AGI)
fit $=\operatorname{lm}(\ln$. AGI $\sim$ DEPS)
summary(fit)
charity $=$ read.csv (“https://raw.githubusercontent.com/andrea2719/
URA-Datasets/master/charitytax.csv”)
attach (charity)
$\ln \cdot \mathrm{AGI}=\log ($ Income. AGI $)$
$\mathrm{fit}=\operatorname{lm}(\ln \cdot \mathrm{AGI} \sim$ DEPS $)$
summary (fit)
Results are as follows:
Coefficients :
From the output, you can construct two estimated models, one in terms of the original variables, and the other in terms of the transformed $Y$ variable. In terms of the transformed $Y$ variable, $W=\ln$ (Adjusted Gross Income), the estimated conditional mean function is
$$
\hat{W}=10.51200+0.01671 \times \text { DEPS }
$$

In terms of the untransformed $Y$ variable, the model provides an estimate of the median of the distribution of the adjusted gross income $(Y)$ variable:
$$
\hat{Y}=\exp (10.51200) \times \exp (0.01671 \times \text { DEPS })
$$
With this estimated model, you estimate that the median income, $Y$, of people with, say 4 dependents, is $\exp (0.01671)=1.01685$ times higher than the median income, $Y$, of people with 3 dependents. Thus the model estimates that, for each additional dependent, median adjusted gross income is $100 \times(1.01685-1.0) \%=1.685 \%$ higher.

线性回归代写

统计代写|线性回归分析代写linear regression analysis代考|Logarithmic Transformation of the $Y$ data

如果您将$Y$变量转换为$f(Y)$，而不是$X$变量，那么您认为模型
$$
f(Y)=\beta_0+\beta_1 X+\varepsilon
$$
比模型要好$Y=\beta_0+\beta_1 X+\varepsilon$。与$X$的转换一样，为了成功地使用该模型，您必须了解该模型在原始(未转换的)$(X, Y)$数据中的状态。这里，
$$
Y=f^{-1}\left{\beta_0+\beta_1 X+\varepsilon\right}
$$
其中$f^{-1}$是逆函数(不是函数的逆)。只需通过求解$Y$的模型方程$\left(f(Y)=\beta_0+\beta_1 X+\varepsilon\right)$就可以找到反函数。

例如，如果$f(Y)=\ln (Y)$，则$Y=f^{-1}{f(Y)}=\exp {f(Y)}$，而模型以原始单位表示则为
$$
Y=\exp \left(\beta_0+\beta_1 X+\varepsilon\right),
$$
或者等价地，
$$
Y=\exp \left(\beta_0\right) \times \exp \left(\beta_1 X\right) \times \exp (\varepsilon)
$$
现在请注意，误差项是乘法项，而不是加法项。与詹森不等式一起，乘法误差意味着函数$\exp \left(\beta_0\right) \times \exp \left(\beta_1 X\right)$不是条件均值。要知道为什么不可以，请注意
$$
\begin{aligned}
\mathrm{E}(Y \mid X=x) & =\exp \left(\beta_0\right) \times \exp \left(\beta_1 x\right) \times \mathrm{E}{\exp (\varepsilon \mid X=x)} \
& =\exp \left(\beta_0\right) \times \exp \left(\beta_1 x\right) \times \mathrm{E}{\exp (\varepsilon)}
\end{aligned}
$$
但是，因为$\exp (\cdot)$是一个凸函数$\mathrm{E}{\exp (\varepsilon)}>\exp {\mathrm{E}(\varepsilon)}=\exp (0)=1$，所以$\mathrm{E}(Y \mid X=x)>\exp \left(\beta_0\right) \times \exp \left(\beta_1 x\right)$。因此，反向转换的函数$\exp \left(\beta_0\right) \times \exp \left(\beta_1 x\right)$不再是未转换数据的均值函数。

统计代写|线性回归分析代写linear regression analysis代考|Log Transforming Income

“慈善”数据集由纳税申报表信息组成，包含与慈善捐款相关的变量。一个变量是调整后的总收入(income)。AGI)和另一个是家属数量(DEPS)。一个令人感兴趣的问题是，收入分配和受抚养人数量之间的关系的本质是什么?随着受抚养人数量的增加，分配是向高收入倾斜，还是向低收入倾斜?

使用以下$\mathrm{R}$代码使用对数转换调整后的总收入来拟合模型。
慈善机构$=$ read.csv (“https://raw.githubusercontent.com/andrea $2719 /$
市建局-数据集/master/charitytax.csv”)
附属(慈善)
$\ln$。AGI $=\log ($收入。agi)
适合$=\operatorname{lm}(\ln$。AGI $\sim$ DEPS)
总结(拟合)
慈善机构$=$ read.csv (“https://raw.githubusercontent.com/andrea2719/
市建局-数据集/master/charitytax.csv”)
附属(慈善)
$\ln \cdot \mathrm{AGI}=\log ($收入。AGI $)$
$\mathrm{fit}=\operatorname{lm}(\ln \cdot \mathrm{AGI} \sim$ DEPS $)$
总结(拟合)
结果如下:
系数:
从输出中，您可以构建两个估计模型，一个根据原始变量，另一个根据转换后的$Y$变量。对于变换后的$Y$变量$W=\ln$ (Adjusted Gross Income)，估计的条件平均函数为
$$
\hat{W}=10.51200+0.01671 \times \text { DEPS }
$$

对于未转换的$Y$变量，该模型提供了调整后总收入$(Y)$变量分布的中位数估计值:
$$
\hat{Y}=\exp (10.51200) \times \exp (0.01671 \times \text { DEPS })
$$
有了这个估计模型，你估计有四个家属的人的收入中位数$Y$，是有三个家属的人的收入中位数$Y$的$\exp (0.01671)=1.01685$倍。因此，该模型估计，对于每一个额外的依赖，调整后的总收入中位数$100 \times(1.01685-1.0) \%=1.685 \%$更高。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

随机过程代考

贝叶斯方法代考

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

机器学习代写

多元统计分析代考

基础数据: $N$ 个样本， $P$ 个变量数的单样本，组成的横列的数据表
变量定性: 分类和顺序；变量定量：数值
数学公式的角度分为: 因变量与自变量

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|线性回归分析代写linear regression analysis代考|Evaluating the Uncorrelated Errors Assumption Using Testing Methods

Posted on 2023年6月25日2023年6月25日 by statistics-lab

统计代写|线性回归分析代写linear regression analysis代考|Evaluating the Uncorrelated Errors Assumption Using Testing Methods

Historically, researchers commonly used a test known as the “Durbin-Watson test” to test for the first-order autocorrelation in pure time-series data. Currently, many tests other than the Durbin-Watson test, such as likelihood ratio tests, are supplied routinely by software that can analyze time-series data. An even simpler (but still useful) test to discover whether the trend in the $\left(e_{t-1}, e_t\right)$ plot (the right panel of Figures 4.8) is explainable by chance alone is given by the cor.test function in R. Simply specify cor. test(resid, lag.resid) to get the result. For the Car Sales data you get the following output:
$$
\text { Pearson’s product-moment correlation }
$$
data: resid and laq.resid
$t=9.4162$, df $=117$, p-value $=5.175 \mathrm{e}-16$
alternative hypothesis: true correlation is not equal to 0
95 percent confidence interval:
0.5404718 0.7481668
sample estimates:
cor
0.6565924
Pearson’s product-moment correlation
data: resid and laq.resid
$t=9.4162, \mathrm{df}=117, \mathrm{p}$-value $=5.175 \mathrm{e}-16$
alternative hypothesis: true correlation is not equal to 0
95 percent confidence interval:
0.54047180 .7481668
sample estimates:
$\operatorname{cor}$
0.6565924
Notice that the estimated correlation is 0.6565924 , a positive number, corroborating the positive linear trend shown in the right-hand panel of Figure 4.8. The $p$-value is $5.175 \times 10^{-16}$, indicating that the difference between 0.6565924 and 0.0 is nearly impossible to explain by chance alone. In other words, you will never (for all intents and purposes) see a trend as extreme as in Figure 4.8 in similarly-sized data sets $(n=120)$ that are produced by a model where the errors are in fact uncorrelated.

As always, you should identify a subject matter rationale for any claimed violation of an assumption. In this case, persistent macroeconomic conditions explain that the Sales residual (deviation from Sales prediction based on the interest rate) in a given month is quite similar to that of a previous month, but not so similar to five years ago. In other words, if conditions in the U.S. economy cause higher sales than expected (given interest rates) this month, such conditions are likely to persist through next month, also causing higher sales.

统计代写|线性回归分析代写linear regression analysis代考|Evaluating the Normality Assumption Using Graphical Methods

The normality assumption states that each conditional distribution $p(y \mid x)$ (one for each $x$ ) is a normal distribution. The normality assumption does not state that the marginal distribution $p(y)$ is normal. It is possible that the distributions $p(y \mid x)$ are all normal yet the distribution $p(y)$ is non-normal; this happens when the distribution of $X$ is non-normal. Thus, you do not assess the normality assumption using the $y_i$ data alone. You have to consider the data $Y$ within specific $X=x$ values.

It is a common error of statistical practice to check normality using the $y_i$ data, so we wish to clarify this issue first, before showing proper methods. Consider a regression process where $Y \mid X=x$ is normal for all $X=x$, with mean $10+2 x$ and variance 1.0 , but where $X$ has the exponential distribution with mean 1.0. The following code generates 200 observations from such a process and shows the $q-q$ plot of the $y_i$ data values. Despite all of the assumptions being satisfied, including normality, the $q-q$ plot in Figure 4.9 incorrectly suggests that the normality assumption is violated.
R code for Figure 4.9
# Normal conditional distributions,
$#$ but non-normal marginal distribution
$X=$ rexp $(200,1)$
$# \mathrm{Y}$ is conditionally normal with mean $=10+2 \mathrm{x}$ and variance $=1$
$Y=10+2 * X+\operatorname{rnorm}(200)$
qqnorm(Y); qqline( $Y)$
# Normal conditional distributions,
# but non-normal marginal distribution
$\mathrm{X}=\operatorname{rexp}(200,1)$
# $Y$ is conditionally normal with mean $=10+2 x$ and variance=1
$Y=10+2 \star X+\operatorname{rnorm}(200)$
qquorm $(Y) ; q q l i n e(Y)$

线性回归代写

统计代写|线性回归分析代写linear regression analysis代考|Evaluating the Uncorrelated Errors Assumption Using Testing Methods

要评估不相关误差假设，首先必须考虑数据集的类型，是纯时间序列、横断面时间序列、空间、重复测量还是多层(分组)数据。对于纯时间序列数据，通常使用$t$而不是$i$表示观测指标，通常使用$T$而不是$n$表示数据集中的时间点数量，因此观测集是通过$t=1,2, \ldots, T$而不是$i=1,2, \ldots, n$进行索引的。

在纯时间序列过程中，不相关误差假设经常被严重违反，因为，例如，今天与昨天相似，但与五年前不太相似。因此，今天的误差项$\varepsilon_t$的潜在可观察值通常与昨天的误差项$\varepsilon_{t-1}$的潜在可观察值高度相关，这意味着违反了不相关误差假设。

要诊断纯时间序列数据的相关错误，您应该首先检查时间序列残差图，或$\left(t, e_t\right)$。寻找系统的、非随机的模式，如趋势或正弦波类型的功能模式，以表明这种假设的失败。这张图的完全随机外观与不相关的误差是一致的。

最常见的残差相关性类型是当前误差$\varepsilon_t$与先前误差$\varepsilon_{t-1}$的相关性，这被称为“滞后”误差项。这种相关性被称为自相关，因为它指的是变量与自身的相关性。因此，您可以查看的第二个图是滞后散点图，或$\left(e_{t-1}, e_t\right)$，您可以在其上叠加OLS或黄土拟合来查看趋势。该图中的趋势表明当前残差与前一个残差之间存在依赖关系，这违反了不相关误差假设。没有趋势的随机散点与不相关误差是一致的。

第三种图是残差的自相关函数，它显示滞后1、$\operatorname{lag} 2$、滞后3和更多的自相关性，因此您可以使用这种图来检查滞后大于1的自相关性。

对于非纯时间序列数据，需要不同的方法。对于空间数据(“空间”中的点，例如，具有地理坐标的数据)，您可以使用变异图来检查误差相关性，在这种情况下称为“空间自相关性”。对于多水平(分组)数据，您可以检查散点图，其中数据按组标记以诊断相关性结构;第十章涉及这个问题。现在，我们只讨论纯时间序列数据。

统计代写|线性回归分析代写linear regression analysis代考|Evaluating the Normality Assumption Using Graphical Methods

Car Sales数据是纯时间序列，因为数据是连续120个月收集的。下面的代码显示了检查不相关(特别是非自相关)错误的相关图。
CarS = read.table(“https://raw.githubusercontent.com/andrea2719/ .table “)
“URA-DataSets/master/Cars.txt”)
attach(CarS);$\mathrm{n}=$ nrow(CarS)
fit $=$ lm(NSOLD $~$ INTRATE)
Resid $=$ fit＄残差
par(mfrow=c(1,2))
图($1: n$, resid, xlab=”month”， ylab=”residual”)
点数($1: n$, resid, type=”l”);abline(h=0)
滞后。resid = c(NA, resid[1:n-1])
情节(滞后)Resid, Resid, xlab=”滞后残差”，ylab= “残差”)
abline(lsfit);残留，残留))
汽车$=$阅读。表$($”https://raw.githubusercontent。com andrea $2719 /$
“URA-DataSets/master/Cars.txt”)
附件(汽车);$n=\operatorname{nrow}(\operatorname{Cars})$
$\mathrm{fit}=\operatorname{lm}(\mathrm{NSOLD} \sim$ INTRATE $)$
Resid $=$ fit＄残差
Par (mfrow=c $(1,2))$
图($1: n$，残差，$x l a b=$“月”，ylab=“残差”)
points $(1: \mathrm{n}$, resid, type=”I”);在线$(\mathrm{h}=0)$
滞后。渣油$=c(N A, r e s i d[1: n-1])$
情节滞后。Resid, Resid, $x l a b=$ “滞后残差”，ylab = “残差”)
Abline (lsfit (lag))残留，残留))
结果如图4.8所示。这两幅图都显示了大量的自相关证据。

这种极端违反假设的后果是什么?根据第3章中总结的数学定理，如果数据生成过程确实由回归模型给出，那么置信区间和$p$ -值的行为精确地与广告一样，精确地具有95％的置信度，精确地具有5％的显著性水平。如图所示，当独立性假设严重违反时，真实的置信水平可能远低于95％，真实的显著性水平可能远低于5％。有多远?你猜对了:你可以通过模拟来找到答案。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

随机过程代考

贝叶斯方法代考

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

机器学习代写

多元统计分析代考

基础数据: $N$ 个样本， $P$ 个变量数的单样本，组成的横列的数据表
变量定性: 分类和顺序；变量定量：数值
数学公式的角度分为: 因变量与自变量

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|线性回归分析代写linear regression analysis代考|Slope

Posted on 2023年6月13日2023年6月13日 by statistics-lab

统计代写|线性回归分析代写linear regression analysis代考|Slope

The standard error of $\hat{\beta}_1$ is $\operatorname{se}\left(\hat{\beta}_1\right)=\hat{\sigma} / \sqrt{S X X}=0.0164$. A $95 \%$ confidence interval for the slope is the set of $\beta_1$ such that
$$
\begin{aligned}
0.8955-2.131(0.0164) & \leq \beta_1 \leq 0.8955+2.131(0.0164) \
0.867 & \leq \beta_1 \leq 0.930
\end{aligned}
$$
As an example of a test for slope equal to zero, consider the Ft. Collins snowfall data presented on page 7. One can show, Problem 2.11, that the estimated slope is $\hat{\beta}_1=0.2035, \operatorname{se}\left(\hat{\beta}_1\right)=0.1310$. The test of interest is of
$$
\begin{array}{ll}
\mathrm{NH}: & \beta_1=0 \
\mathrm{AH}: & \beta_1 \neq 0
\end{array}
$$
For the Ft. Collins data, $t=(0.20335-0) / 0.1310=1.553$. To get a significance level for this test, compare $t$ with the $t(91)$ distribution; the two-sided $p$-value is 0.124 , suggesting no evidence against the $\mathrm{NH}$ that Early and Late season snowfalls are independent.

Compare the hypothesis (2.24) with (2.20). Both appear to be identical. In fact,
$$
t^2=\left(\frac{\hat{\beta}_1}{\operatorname{se}\left(\hat{\beta}_1\right)}\right)^2=\frac{\hat{\beta}_1^2}{\hat{\sigma}^2 / S X X}=\frac{\hat{\beta}_1^2 S X X}{\hat{\sigma}^2}=F
$$
so the square of a $t$ statistic with $d$ df is equivalent to an $F$-statistic with $(1, d) \mathrm{df}$. In nonlinear and logistic regression models discussed later in the book, the analog of the $t$ test will not be identical to the analog of the $F$ test, and they can give conflicting conclusions. For linear regression models, no conflict occurs and the two tests are equivalent.

统计代写|线性回归分析代写linear regression analysis代考|Prediction

The estimated mean function can be used to obtain values of the response for given values of the predictor. The two important variants of this problem are prediction and estimation of fitted values. Since prediction is more important, we discuss it first.

In prediction we have a new case, possibly a future value, not one used to estimate parameters, with observed value of the predictor $x_$. We would like to know the value $y_$, the corresponding response, but it has not yet been observed. We can use the estimated mean function to predict it. We assume that the data used to estimate the mean function are relevant to the new case, so the fitted model applies to it. In the heights example, we would probably be willing to apply the fitted mean function to mother-daughter pairs alive in England at the end of the nineteenth century. Whether the prediction would be reasonable for motherdaughter pairs in other countries or in other time periods is much less clear. In Forbes’ problem, we would probably be willing to apply the results for altitudes in the range he studied. Given this additional assumption, a point prediction of $y_$, say $\tilde{y}$, is just
$$
\tilde{y}=\hat{\beta}0+\hat{\beta}_1 x
$$
$\tilde{y}$ predicts the as yet unobserved $y$. The variability of this predictor has two sources: the variation in the estimates $\hat{\beta}0$ and $\hat{\beta}_1$. and the variation due to the fact that $y$ will not equal its expectation, since even if we knew the parameters exactly, the future value of the response will not generally equal its expectation. Using Appendix A.4, $$ \operatorname{Var}\left(\tilde{y} \mid x\right)=\sigma^2+\sigma^2\left(\frac{1}{n}+\frac{\left(x_-\bar{x}\right)^2}{S X X}\right)
$$
Taking square roots and estimating $\sigma^2$ by $\hat{\sigma}^2$, we get the standard error of prediction (sepred) at $x_$, $$ \operatorname{sepred}\left(\tilde{y} \mid x\right)=\hat{\sigma}\left(1+\frac{1}{n}+\frac{\left(x_-\bar{x}\right)^2}{S X X}\right)^{1 / 2}
$$

线性回归代写

统计代写|线性回归分析代写linear regression analysis代考|Slope

$\hat{\beta}_1$的标准误差为$\operatorname{se}\left(\hat{\beta}_1\right)=\hat{\sigma} / \sqrt{S X X}=0.0164$。斜率的$95 \%$置信区间是$\beta_1$的集合，使得
$$
\begin{aligned}
0.8955-2.131(0.0164) & \leq \beta_1 \leq 0.8955+2.131(0.0164) \
0.867 & \leq \beta_1 \leq 0.930
\end{aligned}
$$
作为斜率等于零的测试示例，请考虑第7页中提供的Ft. Collins降雪数据。可以看出，习题2.11，估计斜率是$\hat{\beta}_1=0.2035, \operatorname{se}\left(\hat{\beta}_1\right)=0.1310$。对兴趣的测试是
$$
\begin{array}{ll}
\mathrm{NH}: & \beta_1=0 \
\mathrm{AH}: & \beta_1 \neq 0
\end{array}
$$
有关柯林斯堡的数据，请访问$t=(0.20335-0) / 0.1310=1.553$。为了得到这个检验的显著性水平，比较$t$和$t(91)$分布;双向$p$值为0.124，表明没有证据反对$\mathrm{NH}$早、晚季降雪是独立的。

比较假设(2.24)与(2.20)。两者看起来是一样的。事实上，
$$
t^2=\left(\frac{\hat{\beta}_1}{\operatorname{se}\left(\hat{\beta}_1\right)}\right)^2=\frac{\hat{\beta}_1^2}{\hat{\sigma}^2 / S X X}=\frac{\hat{\beta}_1^2 S X X}{\hat{\sigma}^2}=F
$$
所以$t$统计量与$d$ df的平方等于$F$ -统计量与$(1, d) \mathrm{df}$的平方。在本书后面讨论的非线性和逻辑回归模型中，$t$检验的类比与$F$检验的类比并不相同，它们可以给出相互矛盾的结论。对于线性回归模型，不发生冲突，两个检验是等效的。

统计代写|线性回归分析代写linear regression analysis代考|Prediction

估计的平均函数可用于获得给定预测器值的响应值。该问题的两个重要变体是拟合值的预测和估计。由于预测更重要，我们先讨论它。

在预测中，我们有一个新的情况，可能是未来的值，而不是用来估计参数的值，预测器的观测值$x_$。我们想知道$y_$的值，对应的响应，但它还没有被观察到。我们可以用估计的均值函数来预测它。我们假设用于估计均值函数的数据与新情况相关，因此拟合模型适用于新情况。在身高的例子中，我们可能愿意将拟合均值函数应用于19世纪末生活在英国的母女对。对于其他国家或其他时期的母女对，这种预测是否合理就不太清楚了。在福布斯的问题中，我们可能愿意将结果应用于他研究范围内的海拔高度。考虑到这个额外的假设，对$y_$(比如$\tilde{y}$)的点预测是合理的
$$
\tilde{y}=\hat{\beta}0+\hat{\beta}1 x $$ $\tilde{y}$预测了尚未观察到的$y$。该预测器的可变性有两个来源:估算值的变化$\hat{\beta}0$和$\hat{\beta}_1$。由于$y$不等于它的期望而引起的变化，因为即使我们确切地知道参数，响应的未来值通常也不会等于它的期望。使用附录A.4, $$ \operatorname{Var}\left(\tilde{y} \mid x\right)=\sigma^2+\sigma^2\left(\frac{1}{n}+\frac{\left(x-\bar{x}\right)^2}{S X X}\right)
$$
取平方根，用$\hat{\sigma}^2$估计$\sigma^2$，我们得到预测的标准误差(sepd)在$x_$， $$ \operatorname{sepred}\left(\tilde{y} \mid x\right)=\hat{\sigma}\left(1+\frac{1}{n}+\frac{\left(x_-\bar{x}\right)^2}{S X X}\right)^{1 / 2}
$$

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写