分类：主成分分析代写

统计代写|主成分分析代写Principal Component Analysis代考|Incomplete PCA by Alternating Minimization

Posted on 2023年4月19日2023年4月19日 by statistics-lab

如果你也在怎样代写主成分分析Principal Component Analysis这个学科遇到相关的难题，请随时右上角联系我们的24/7代写客服。

主成分分析（PCA）是计算主成分并使用它们对数据进行基础改变的过程，有时只使用前几个主成分，而忽略其余部分。

statistics-lab™ 为您的留学生涯保驾护航在代写主成分分析Principal Component Analysis方面已经树立了自己的口碑, 保证靠谱, 高质且原创的统计Statistics代写服务。我们的专家在代写主成分分析Principal Component Analysis代写方面经验极为丰富，各种代写主成分分析Principal Component Analysis相关的作业也就用不着说。

我们提供的主成分分析Principal Component Analysis及其相关学科的代写，服务范围广, 其中包括但不限于:

Statistical Inference 统计推断
Statistical Computing 统计计算
Advanced Probability Theory 高等概率论
Advanced Mathematical Statistics 高等数理统计学
(Generalized) Linear Models 广义线性模型
Statistical Machine Learning 统计机器学习
Longitudinal Data Analysis 纵向数据分析
Foundations of Data Science 数据科学基础

统计代写|主成分分析代写Principal Component Analysis代考|Incomplete PCA by Alternating Minimization

Although the convex-optimization-based approach can ensure correctness of the low-rank solution for the matrix completion problem, it requires solving a convex program of the same size as the matrix. When the data matrix $X$ is very large, parameterizing the low-rank solution $A$ and Lagrange multipliers $Z$ with two matrices of the same size as $X$ seems rather demanding, actually redundant. At least the low-rank solution $A$ could be parameterized more economically with its lowrank factors. Hence, if scalability of the algorithm is a serious concern, it makes sense to look for the low-rank factors of the solution matrix directly.

To this end, we introduce in this section an alternating minimization algorithm for solving the geometric PCA problem with missing data. The main idea behind this approach, which was probably first proposed in (Wiberg 1976), is to find $\mu, U$, and $Y$ that minimize the error $\left|X-\boldsymbol{\mu} \mathbf{1}^{\top}-U Y\right|_F^2$ considering only the known entries of $X$ in the set $\Omega=\left{(i, j): w_{i j}=1\right}$, i.e.,
$$
\begin{aligned}
\left|\mathcal{P}{\Omega}\left(X-\boldsymbol{\mu} \mathbf{1}^{\top}-U Y\right)\right|_F^2 & =\left|W \odot\left(X-\boldsymbol{\mu} \mathbf{1}^{\top}-U Y\right)\right|_F^2 \ & =\sum{i=1}^D \sum_{j=1}^N w_{i j}\left(x_{i j}-\mu_i-\boldsymbol{u}i^{\top} \boldsymbol{y}_j\right)^2, \end{aligned} $$ where $x{i j}$ is the $(i, j)$ th entry of $X, \mu_i$ is the $i$ th entry of $\mu, \boldsymbol{u}i^{\top}$ is the $i$ th row of $U$, and $\boldsymbol{y}_j$ is the $j$ th column of $Y$. Notice that this cost function is the same as that in (3.10), except that the errors $\varepsilon{i j}=x_{i j}-\boldsymbol{u}i^{\top} \boldsymbol{y}_j$ associated with the missing entries $\left(w{i j}=0\right.$ ) are removed.In what follows, we will derive an alternating minimization algorithm for minimizing the cost function in (3.34). For the sake of simplicity, we will first derive the algorithm in the case of zero-mean and complete data. In this case, the problem in (3.34) reduces to a low-rank matrix approximation problem, which can be solved using the SVD, as described in Theorem 2.3. The alternating minimization algorithm to be derived provides an alternative to the SVD solution, which, however, can be more easily extended to the case of incomplete data, as we will see. Moreover, the algorithm can also be extended to the more challenging PCA problem with missing entries, as we will see.

统计代写|主成分分析代写Principal Component Analysis代考|PCA with Robustness to Corrupted Entries

In the previous section, we considered the PCA problem in the case that some entries of the data points are missing. In this section, we consider the PCA problem in the case that some of the entries of the data points have been corrupted by gross errors, known as intrasample outliers. The additional challenge is that we do not know which entries have been corrupted. Thus, the problem is to simultaneously detect which entries have been corrupted and replace them by their uncorrupted values. In some literature, this problem is referred to as the robust PCA problem (De la Torre and Black 2004; Candès et al. 2011).

Let us first recall the PCA problem (see Section 2.1.2) in which we are given $N$ data points $\mathcal{X}=\left{x_j \in \mathbb{R}^D\right}_{j=1}^N$ drawn (approximately) from a $d$-dimensional affine subspace $S={\boldsymbol{x}=\boldsymbol{\mu}+U \boldsymbol{y}}$, where $\boldsymbol{\mu} \in \mathbb{R}^D$ is an arbitrary point in $S, U \in \mathbb{R}^{D \times d}$ is a basis for $S$, and $\left{y_j \in \mathbb{R}^d\right}_{j=1}^N$ are the principal components. In the robust PCA problem, we assume that the $i$ th entry $x_{i j}$ of a data point $\boldsymbol{x}j$ is obtained by corrupting the $i$ th entry $\ell{i j}$ of a point $\ell_j$ lying perfectly on the subspace $S$ by an error $e_{i j}$, i.e.,
$$
x_{i j}=\ell_{i j}+e_{i j}, \quad \text { or } \quad \boldsymbol{x}_j=\boldsymbol{\ell}_j+\boldsymbol{e}_j, \quad \text { or } \quad X=L+E \text {, }
$$ where $X, L, E \in \mathbb{R}^{D \times N}$ are matrices with entries $x_{i j}, \ell_{i j}$, and $e_{i j}$, respectively. Such errors can have a huge impact on the estimation of the subspace. Thus it is very important to be able to detect the locations of those errors,
$$
\Omega=\left{(i, j): e_{i j} \neq 0\right}
$$
as well as correct the erroneous entries before applying PCA to the given data. As discussed before, a key difference between the robust PCA problem and the incomplete PCA problem is that we do not know the location of the corrupted entries. This makes the robust PCA problem harder, since we need to simultaneously detect and correct the errors. Nonetheless, when the number of corrupted entries is a small enough fraction of the total number of entries, i.e., when $|\Omega|<\rho \cdot D N$ for some $\rho<1$, we may still hope to be able to detect and correct such errors. In the remainder of this section, we describe methods from robust statistics and convex optimization for addressing this problem.

主成分分析代考

统计代写|主成分分析代写Principal Component Analysis代考|Incomplete PCA by Alternating Minimization

虽然基于凸优化的方法可以保证矩阵补全问题低秩解的正确性，但它需要求解一个与矩阵大小相同的凸规划。当数据矩阵 $X$ 非常大，参数化低秩解 $A$ 和拉格朗日乘数 $Z$ 具有两个相同大小的矩阵 $X$ 看似苛刻，实则多余。至少是低阶解 $A$ 可以通过其低秩因子更经济地参数化。因此，如果算法的可扩展性是一个严重的问题，那么直接寻找解矩阵的低秩因子是有意义的。
为此，我们在本节中介绍了一种用于解决具有缺失数据的几何 PCA 问题的交替最小化算法。这种方法背后的主要思想可能首先在 (Wiberg 1976) 中提出，是为了找到 $\mu, U ，$ 和 $Y$ 最小化错误 $\left|X-\boldsymbol{\mu} 1^{\top}-U Y\right|F^2$ 只考虑已知条目 $X$ 在集合中 $\backslash$ Omega=\eft ${(i, j):$ w{i j $\left.}=1 \backslash r i g h t\right}$ ，那是，
$$
\left|\mathcal{P} \Omega\left(X-\boldsymbol{\mu} \mathbf{1}^{\top}-U Y\right)\right|F^2=\left|W \odot\left(X-\boldsymbol{\mu} \mathbf{1}^{\top}-U Y\right)\right|_F^2 \quad=\sum i=1^D \sum{j=1}^N w_{i j}\left(x_{i j}-\mu_i\right.
$$
在哪里 $x i j$ 是个 $(i, j)$ 的第一个条目 $X, \mu_i$ 是个 $i$ 的第一个条目 $\mu, \boldsymbol{u} i^{\top}$ 是个 $i$ 第排 $U$ ，和 $\boldsymbol{y}j$ 是个 $j$ 第列 $Y$. 请注意，此成本函数与 (3.10) 中的相同，只是误差 $\varepsilon i j=x{i j}-\boldsymbol{u} i^{\top} \boldsymbol{y}_j$ 与丟失的条目相关联 $(w i j=0)$ 被去除。接下来，我们将推导出一种交替最小化算法，用于最小化 (3.34) 中的成本函数。为了简单起见，我们将首先在零均值和完整数据的情况下推导算法。在这种情况下，(3.34) 中的问题简化为低阶矩阵逼近问题，可以使用 SVD 求解，如定理 2.3 中所述。要导出的交替最小化算法提供了 SVD 解决方案的替代方案，但是，正如我们将看到的，它可以更容易地扩展到不完整数据的情况。此外，正如我们将看到的，该算法还可以扩展到更具挑战性的条目缺失的 PCA 问题。

统计代写|主成分分析代写Principal Component Analysis代考|PCA with Robustness to Corrupted Entries

在上一节中，我们考虑了在数据点的某些条目丟失的情况下的 PCA 问题。在本节中，我们考虑在数据点的某些条目已被严重错误 (称为样本内异常值) 损坏的情况下的 PCA 问题。额外的挑战是我们不知道哪些条目已损坏。因此，问题是同时检测哪些条目已损坏并用末损坏的值替换它们。在一些文献中，这个问题被称为稳健的 PCA 问题（De la Torre 和 Black 2004；Candès 等人 2011)。
让我们首先回顾 PCA 问题（见第 2.1.2 节)，其中给出了 $N$ 数据点
Imathcal ${X}=\backslash l e f t\left{x _j \backslash\right.$ in $\left.\backslash m a t h b b{R}^{\wedge} D \backslash r i g h t\right} _{j=1}^{\wedge} N$ (大约) 从 $d$-维仿射子空间 $S=\boldsymbol{x}=\boldsymbol{\mu}+U \boldsymbol{y}$ ，在分。在稳健的 PCA 问题中，我们假设 $i$ 第一个条目 $x_{i j}$ 一个数据点 $x j$ 是通过破坏获得的 $i$ 第一个条目 $\ell i j$ 的一点 $\ell_j$ 完美地躺在子空间上 $S$ 由于错误 $e_{i j}$ ，那是，
$$
x_{i j}=\ell_{i j}+e_{i j}, \quad \text { or } \quad \boldsymbol{x}j=\boldsymbol{\ell}_j+\boldsymbol{e}_j, \quad \text { or } \quad X=L+E, $$ 在哪里 $X, L, E \in \mathbb{R}^{D \times N}$ 是有条目的矩阵 $x{i j}, \ell_{i j}$ ，和 $e_{i j}$ ，分别。这种错误会对子空间的估计产生巨大影响。因此，能够检测到这些错误的位置非常重要，
\Omega $=\backslash$ left ${(i, j):$ e_ ${i j} \backslash$ lneq o\right } }
以及在将 PCA 应用于给定数据之前更正错误条目。如前所述，稳健的 PCA 问题和不完整的 PCA 问题之间的一个关键区别是我们不知道损坏条目的位置。这使得稳健的 PCA 问题变得更加困难，因为我们需要同时检测和纠正错误。尽管如此，当损坏的条目数量占条目总数的比例足够小时，即 $|\Omega|<\rho \cdot D N$ 对于些 $\rho<1$ ，我们可能仍然㳍望能够检测并纠正此类错误。在本节的其余部分，我们描述了解决此问题的稳健统计和凸优化方法。

统计代写|主成分分析代写Principal Component Analysis代考请认准statistics-lab™

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

金融工程代写

金融工程是使用数学技术来解决金融问题。金融工程使用计算机科学、统计学、经济学和应用数学领域的工具和知识来解决当前的金融问题，以及设计新的和创新的金融产品。

非参数统计代写

非参数统计指的是一种统计方法，其中不假设数据来自于由少数参数决定的规定模型；这种模型的例子包括正态分布模型和线性回归模型。

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

术语广义线性模型（GLM）通常是指给定连续和/或分类预测因素的连续响应变量的常规线性回归模型。它包括多元线性回归，以及方差分析和方差分析（仅含固定效应）。

有限元方法代写

有限元方法（FEM）是一种流行的方法，用于数值解决工程和数学建模中出现的微分方程。典型的问题领域包括结构分析、传热、流体流动、质量运输和电磁势等传统领域。

有限元是一种通用的数值方法，用于解决两个或三个空间变量的偏微分方程（即一些边界值问题）。为了解决一个问题，有限元将一个大系统细分为更小、更简单的部分，称为有限元。这是通过在空间维度上的特定空间离散化来实现的，它是通过构建对象的网格来实现的：用于求解的数值域，它有有限数量的点。边界值问题的有限元方法表述最终导致一个代数方程组。该方法在域上对未知函数进行逼近。[1] 然后将模拟这些有限元的简单方程组合成一个更大的方程系统，以模拟整个问题。然后，有限元通过变化微积分使相关的误差函数最小化来逼近一个解决方案。

tatistics-lab作为专业的留学生服务机构，多年来已为美国、英国、加拿大、澳洲等留学热门地的学生提供专业的学术服务，包括但不限于Essay代写，Assignment代写，Dissertation代写，Report代写，小组作业代写，Proposal代写，Paper代写，Presentation代写，计算机作业代写，论文修改和润色，网课代做，exam代考等等。写作范围涵盖高中，本科，研究生等海外留学全阶段，辐射金融，经济学，会计学，审计学，管理学等全球99%专业科目。写作团队既有专业英语母语作者，也有海外名校硕博留学生，每位写作老师都拥有过硬的语言能力，专业的学科背景和学术写作经验。我们承诺100%原创，100%专业，100%准时，100%满意。

随机分析代写

随机微积分是数学的一个分支，对随机过程进行操作。它允许为随机过程的积分定义一个关于随机过程的一致的积分理论。这个领域是由日本数学家伊藤清在第二次世界大战期间创建并开始的。

时间序列分析代写

随机过程，是依赖于参数的一组随机变量的全体，参数通常是时间。随机变量是随机现象的数量表现，其时间序列是一组按照时间发生先后顺序进行排列的数据点序列。通常一组时间序列的时间间隔为一恒定值（如1秒，5分钟，12小时，7天，1年），因此时间序列可以作为离散时间数据进行分析处理。研究时间序列数据的意义在于现实中，往往需要研究某个事物其随时间发展变化的规律。这就需要通过研究该事物过去发展的历史记录，以得到其自身发展的规律。

回归分析代写

多元回归分析渐进（Multiple Regression Analysis Asymptotics）属于计量经济学领域，主要是一种数学上的统计分析方法，可以分析复杂情况下各影响因素的数学关系，在自然科学、社会和经济学等多个领域内应用广泛。

MATLAB代写

MATLAB 是一种用于技术计算的高性能语言。它将计算、可视化和编程集成在一个易于使用的环境中，其中问题和解决方案以熟悉的数学符号表示。典型用途包括：数学和计算算法开发建模、仿真和原型制作数据分析、探索和可视化科学和工程图形应用程序开发，包括图形用户界面构建MATLAB 是一个交互式系统，其基本数据元素是一个不需要维度的数组。这使您可以解决许多技术计算问题，尤其是那些具有矩阵和向量公式的问题，而只需用 C 或 Fortran 等标量非交互式语言编写程序所需的时间的一小部分。MATLAB 名称代表矩阵实验室。MATLAB 最初的编写目的是提供对由 LINPACK 和 EISPACK 项目开发的矩阵软件的轻松访问，这两个项目共同代表了矩阵计算软件的最新技术。MATLAB 经过多年的发展，得到了许多用户的投入。在大学环境中，它是数学、工程和科学入门和高级课程的标准教学工具。在工业领域，MATLAB 是高效研究、开发和分析的首选工具。MATLAB 具有一系列称为工具箱的特定于应用程序的解决方案。对于大多数 MATLAB 用户来说非常重要，工具箱允许您学习和应用专业技术。工具箱是 MATLAB 函数（M 文件）的综合集合，可扩展 MATLAB 环境以解决特定类别的问题。可用工具箱的领域包括信号处理、控制系统、神经网络、模糊逻辑、小波、仿真等。

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|主成分分析代写Principal Component Analysis代考|Incomplete PPCA by Expectation Maximization

Posted on 2023年4月19日2023年4月19日 by statistics-lab

如果你也在怎样代写主成分分析Principal Component Analysis这个学科遇到相关的难题，请随时右上角联系我们的24/7代写客服。

主成分分析（PCA）是计算主成分并使用它们对数据进行基础改变的过程，有时只使用前几个主成分，而忽略其余部分。

我们提供的主成分分析Principal Component Analysis及其相关学科的代写，服务范围广, 其中包括但不限于:

Statistical Inference 统计推断
Statistical Computing 统计计算
Advanced Probability Theory 高等概率论
Advanced Mathematical Statistics 高等数理统计学
(Generalized) Linear Models 广义线性模型
Statistical Machine Learning 统计机器学习
Longitudinal Data Analysis 纵向数据分析
Foundations of Data Science 数据科学基础

统计代写|主成分分析代写Principal Component Analysis代考|Incomplete PPCA by Expectation Maximization

In this section, we derive an EM algorithm (see Appendix B.2.1) for solving the PPCA problem with missing data. Recall from Section 2.2 that in the PPCA model, each data point is drawn as $x \sim \mathcal{N}\left(\mu_x, \Sigma_x\right)$, where $\mu_x=\mu$ and $\Sigma_x=B B^{\top}+\sigma^2 I_D$, where $\mu \in \mathbb{R}^D, B \in \mathbb{R}^{D \times d}$, and $\sigma>0$. Recall also from (2.56) that the log-likelihood of the PPCA model is given by
$$
\mathscr{L}=-\frac{N D}{2} \log (2 \pi)-\frac{N}{2} \log \operatorname{det}\left(\Sigma_x\right)-\frac{1}{2} \sum_{j=1}^N \operatorname{trace}\left(\Sigma_x^{-1}\left(x_j-\mu\right)\left(x_j-\mu\right)^{\top}\right)
$$
where $\left{x_j\right}_{j=1}^N$ are $N$ i.i.d. samples of $\boldsymbol{x}$. Since the samples are incomplete, we can partition each point $x$ and the parameters $\mu_x$ and $\Sigma_x$ as
$$
\left[\begin{array}{l}
x_U \
x_O
\end{array}\right]=P x, \quad\left[\begin{array}{l}
\mu_U \
\mu_O
\end{array}\right]=P \mu, \text { and }\left[\begin{array}{cc}
\Sigma_{U U} & \Sigma_{U O} \
\Sigma_{O U} & \Sigma_{O O}
\end{array}\right]=P \Sigma_x P^{\top} .
$$
Here $\boldsymbol{x}O$ is the observed part of $\boldsymbol{x}, \boldsymbol{x}_U$ is the unobserved part of $\boldsymbol{x}$, and $P$ is any permutation matrix that reorders the entries of $x$ so that the unobserved entries appear first. Notice that $P$ is not unique, but we can use any such $P$. Notice also that the above partition of $x, \mu_x$, and $\Sigma_x$ could be different for each data point, because the missing entries could be different for different data points. When strictly necessary, we will use $\boldsymbol{x}{j U}$ and $\boldsymbol{x}_{j O}$ to denote the unobserved and observed parts of point $\boldsymbol{x}_j$, respectively, and $P_j$ to denote the permutation matrix. Otherwise, we will avoid using the index $j$ in referring to a generic point.

统计代写|主成分分析代写Principal Component Analysis代考|Matrix Completion by Convex Optimization

The EM-based approaches to incomplete PPCA discussed in the previous section rely on (a) explicit parameterizations of the low-rank factors and (b) minimization of a nonconvex cost function in an alternating minimization fashion. Specifically, such approaches alternate between completing the missing entries given the parameters of a PPCA model for the data and estimating the parameters of the model from complete data. While simple and intuitive, such approaches suffer from two important disadvantages. First, the desired rank of the matrix needs to be known in advance. Second, due to the greedy nature of the EM algorithm, it is difficult to ensure convergence to the globally optimal solution. Therefore, a good initialization of the EM-based algorithm is critical for converging to a good solution.

In this section, we introduce an alternative approach that solves the low-rank matrix completion problem via a convex relaxation. As we will see, this approach allows us to complete a low-rank matrix by minimizing a convex objective function, which is guaranteed to have a globally optimal minimizer. Moreover, under rather benign conditions on the missing entries, the global minimizer is guaranteed to be the correct low-rank matrix, even without knowing the rank of the matrix in advance.
A rigorous justification for the correctness of the convex relaxation approach requires a deep knowledge of high-dimensional statistics and geometry that is beyond the scope of this book. However, this does not prevent us from introducing and summarizing here the main ideas and results, as well as the basic algorithms offered by this approach. Practitioners can apply the useful algorithm to their data and problems, whereas researchers who are more interested in the advanced theory behind the algorithm may find further details in (Cai et al. 2008; Candès and Recht 2009; Candès and Tao 2010; Gross 2011; Keshavan et al. 2010a; Zhou et al. 2010a).
Compressive Sensing of Low-Rank Matrices
The matrix completion problem can be considered a special case of the more general class of problems of recovering a high-dimensional low-rank matrix $X$ from highly compressive linear measurements $B=\mathcal{P}(X)$, where $\mathcal{P}$ is a linear operator that returns a set of linear measurements $B$ of the matrix $X$. It is known from highdimensional statistics that if the linear operator $\mathcal{P}$ satisfies certain conditions, then the rank minimization problem
$$
\min _A \operatorname{rank}(A) \quad \text { s.t. } \quad \mathcal{P}(A)=B
$$
is well defined, and its solution is unique (Candès and Recht 2009). However, it is also known that under general conditions, the task of finding such a minimal-rank solution is in general an NP-hard problem.

主成分分析代考

统计代写|主成分分析代写Principal Component Analysis代考|Incomplete PPCA by Expectation Maximization

在本节中，我们推导了一种 EM 算法（参见附录 B.2.1），用于解决具有缺失数据的 PPCA 问题。回想一下 2.2 节，在 PPCA 模型中，每个数据点被绘制为 $x \sim \mathcal{N}\left(\mu_x, \Sigma_x\right)$ ，在哪里 $\mu_x=\mu$ 和 $\Sigma_x=B B^{\top}+\sigma^2 I_D$ ，在哪里 $\mu \in \mathbb{R}^D, B \in \mathbb{R}^{D \times d}$ ，和 $\sigma>0$. 还记得 (2.56) 中的 PPCA 模型的对数似然由下式给出
$$
\mathscr{L}=-\frac{N D}{2} \log (2 \pi)-\frac{N}{2} \log \operatorname{det}\left(\Sigma_x\right)-\frac{1}{2} \sum_{j=1}^N \operatorname{trace}\left(\Sigma_x^{-1}\left(x_j-\mu\right)\left(x_j-\mu\right)^{\top}\right)
$$
在哪里 \left{x_j\right} ${j=1} \wedge N$ 是 $N$ iid样本 $x$. 由于样本不完整，我们可以划分每个点 $x$ 和参数 $\mu_x$ 和 $\Sigma_x$ 作为 $\left[\begin{array}{ll}x_U x_O\end{array}\right]=P x, \quad\left[\mu_U \mu_O\right]=P \mu$, and $\left[\begin{array}{lll}\Sigma_{U U} & \Sigma_{U O} & \Sigma_{O U} \quad \Sigma_{O O}\end{array}\right]=P \Sigma_x P^{\top}$.
这里 $\boldsymbol{x} O$ 是观察到的部分 $\boldsymbol{x}, \boldsymbol{x}U$ 是末被观察到的部分 $\boldsymbol{x} ，$ 和 $P$ 是重新排序条目的任何置换矩阵 $x$ 这样末观察到的条目首先出现。请注意 $P$ 不是唯一的，但我们可以使用任何这样的 $P$. 还要注意上面的分区 $x, \mu_x$ ，和 $\Sigma_x$ 每个数据点可能不同，因为缺失的条目对于不同的数据点可能不同。绝对必要时，我们将使用 $\boldsymbol{x} j U$ 和 $\boldsymbol{x}{j O}$ 表示点的末观察到和观察到的部分 $\boldsymbol{x}_j$ ，分别和 $P_j$ 表示置换矩阵。否则，我们将避免使用索引j在提到一个通用点时。

统计代写|主成分分析代写Principal Component Analysis代考|Matrix Completion by Convex Optimization

上一节中讨论的基于 EM 的不完全 PPCA 方法依赖于 (a) 低秩因子的显式参数化和 (b) 以交替最小化方式最小化非凸成本函数。具体来说，这种方法在给定数据的 PPCA 模型参数的情况下完成缺失条目和从完整数据估计模型参数之间交替进行。虽然简单直观，但这种方法有两个重要的缺点。首先，需要事先知道所需的矩阵秩。其次，由于EM算法的贪心性，很难保证收敛到全局最优解。因此，基于 EM 的算法的良好初始化对于收敛到良好的解决方案至关重要。
在本节中，我们介绍了一种通过凸松驰解决低秩矩阵补全问题的替代方法。正如我们将看到的，这种方法允许我们通过最小化凸目标函数来完成低秩矩阵，保证具有全局最优最小值。此外，在缺失条目相当良性的条件下，全局最小化器保证是正确的低秩矩阵，即使事先不知道矩阵的秩。
严格证明凸松驰方法的正确性需要对高维统计和几何有深入的了解，这超出了本书的范围。然而，这并不妨碍我们在这里介绍和总结主要思想和结果，以及这种方法提供的基本算法。从业者可以将有用的算法应用于他们的数据和问题，而对算法背后的高级理论更感兴趣的研究人员可以在 (Cai et al. 2008; Candès and Recht 2009; Candès and Tao 2010; Gross 2011; Keshavan 等人 2010a; Zhou 等人 2010a)。低阶矩阵的压缩感知
矩阵补全问题可以被认为是恢复高维低秩矩阵的更一般类问题的特例 $X$ 来自高度压缩的线性测量 $B=\mathcal{P}(X)$ ，在哪里 $\mathcal{P}$ 是返回一组线性测量值的线性运算符 $B$ 矩阵的 $X$. 由高维统计可知，如果线性算子 $\mathcal{P}$ 满足一定条件，则秩最小化问题
$$
\min _A \operatorname{rank}(A) \quad \text { s.t. } \quad \mathcal{P}(A)=B
$$
定义明确，其解决方案是独一无二的 (Candès 和 Recht 2009) 。然而，众所周知，在一般情况下，寻找这种最小秩解的任务通常是一个NP-hard 问题。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

金融工程代写

非参数统计代写

非参数统计指的是一种统计方法，其中不假设数据来自于由少数参数决定的规定模型；这种模型的例子包括正态分布模型和线性回归模型。

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

有限元方法代写

随机分析代写

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

数学代写|MTH9893 Principal component analysis

Posted on 2023年4月19日2023年4月18日 by statistics-lab

Statistics-lab™可以为您提供cuny.edu MTH9893 Principal component analysis主成分分析课程的代写代考和辅导服务！

MTH9893 Principal component analysis课程简介

This course covers univariate and multivariate time series analysis, conditional heteroscedastic models, principal component analysis, and factor models. Students will learn about implementing univariate and multivariate volatility models. Note: Students cannot receive credit for both MTH 9867 and MTH 9893.

PREREQUISITES

MTH9893 Principal component analysis HELP（EXAM HELP， ONLINE TUTOR）

问题 1.

Exercise 5.1 (Clustering Points in a Plane). Describe how Algorithm 5.1 can also be applied to a set of points in the plane $\left{x_j \in \mathbb{R}^2\right}_{j=1}^N$ that are distributed around a collection of cluster centers $\left{\boldsymbol{\mu}i \in \mathbb{R}^2\right}{i=1}^n$ by interpreting the data points as complex numbers: ${z \doteq x+y \sqrt{-1} \in \mathbb{C}}$. In particular, discuss what happens to the coefficients and roots of the fitting polynomial $p_n(z)$.

问题 2.

Exercise 5.3 (Level Sets and Normal Vectors). Let $f(x): \mathbb{R}^D \rightarrow \mathbb{R}$ be a smooth function. For a constant $c \in \mathbb{R}$, the set $S_c \doteq\left{x \in \mathbb{R}^D \mid f(x)=c\right}$ is called a level set of the function $f ; S_c$ is in general a $(D-1)$-dimensional submanifold. Show that if $|\nabla f(x)|$ is nonzero at a point $x_0 \in S_c$, then the gradient $\nabla f\left(x_0\right) \in \mathbb{R}^D$ at $x_0$ is orthogonal to all tangent vectors of the level set $S_c$.

问题 3.

Exercise 5.7 (Two Subspaces in General Position). Consider two linear subspaces of dimension $d_1$ and $d_2$ respectively in $\mathbb{R}^D$. We say that they are in general position if an arbitrarily small perturbation of the position of the subspaces does not change the dimension of their intersection. Show that two subspaces are in general position if and only if
$$
\operatorname{dim}\left(S_1 \cap S_2\right)=\min \left{d_1+d_2-D ; 0\right} .
$$

问题 4.

Exercise 5.8. Implement the basic algebraic subspace clustering algorithm, Algorithm 5.4 , and test the algorithm for different subspace arrangements with different levels of noise.

问题 5.

Exercise 5.12 (Robust Estimation of Fitting Polynomials). We know that samples from an arrangement of $n$ subspaces, their Veronese lifting, all lie on a single subspace $\operatorname{span}\left(V_n(D)\right)$. The coefficients of the fitting polynomials are simply the null space of $\boldsymbol{V}_n(D)$. If there is noise, the lifted samples approximately span a subspace, and the coefficients of the fitting polynomials are eigenvectors associated with the small eigenvalues of $\boldsymbol{V}_n(D)^{\top} \boldsymbol{V}_n(D)$. However, if there are outliers, the lifted samples together no longer span a subspace. Notice that this is the same situation that robust statistical techniques such as multivariate trimming (MVT) are designed to deal with. See Appendix B.5 for more details. In this exercise, show how to combine MVT with ASC so that the resulting algorithm will be robust to outliers. Implement your scheme and find out the highest percentage of outliers that the algorithm can handle (for various subspace arrangements).

Textbooks

• An Introduction to Stochastic Modeling, Fourth Edition by Pinsky and Karlin (freely
available through the university library here)
• Essentials of Stochastic Processes, Third Edition by Durrett (freely available through
the university library here)
To reiterate, the textbooks are freely available through the university library. Note that
you must be connected to the university Wi-Fi or VPN to access the ebooks from the library
links. Furthermore, the library links take some time to populate, so do not be alarmed if
the webpage looks bare for a few seconds.

此图像的alt属性为空；文件名为%E7%B2%89%E7%AC%94%E5%AD%97%E6%B5%B7%E6%8A%A5-1024x575-10.png — 数学代写|MTH9893 Principal component analysis

Statistics-lab™可以为您提供cuny.edu MTH9893 Principal component analysis主成分分析课程的代写代考和辅导服务！请认准Statistics-lab™. Statistics-lab™为您的留学生涯保驾护航。

统计代写|主成分分析代写Principal Component Analysis代考|STAT6020

Posted on 2022年8月10日2022年8月10日 by statistics-lab

如果你也在怎样代写主成分分析Principal Component Analysis这个学科遇到相关的难题，请随时右上角联系我们的24/7代写客服。

主成分分析（PCA）是计算主成分并使用它们对数据进行基础改变的过程，有时只使用前几个主成分，而忽略其余部分。

我们提供的主成分分析Principal Component Analysis及其相关学科的代写，服务范围广, 其中包括但不限于:

Statistical Inference 统计推断
Statistical Computing 统计计算
Advanced Probability Theory 高等概率论
Advanced Mathematical Statistics 高等数理统计学
(Generalized) Linear Models 广义线性模型
Statistical Machine Learning 统计机器学习
Longitudinal Data Analysis 纵向数据分析
Foundations of Data Science 数据科学基础

统计代写|主成分分析代写Principal Component Analysis代考|STAT6020

统计代写|主成分分析代写Principal Component Analysis代考|Lymphoma data sets

The lymphoma data set comes from a study of gene expression of three prevalent lymphoid malignancies: B-cell chronic lymphocytic leukemia (B-CLL), follicular lymphoma (FL) and diffuse large B-cell lymphoma (DLCL). Among 96 samples we took 62 samples 4026 genes in three classes: 11 cases of B-CLL, 9 cases of FL and 42 cases of DLCL. Gene expression levels were measured using 2-channel cDNA microarrays.

After preprocessing, all gene expression profiles were base 10 log-transformed and, in order to prevent single arrays from dominating the analysis, standardized to zero mean and unit variance. Finally, we complete the preprocessing of the gene expression data with gene centring.

In this example we perform the KPCA, as detailed in the previous section, we compute the kernel matrix with using the radial basis kernel with $c=0.01$, this value is set heuristically. The resulting plot is given in Figure 6. It shows the projection onto the two leading kernel principal components of microarrays. In this figure we can see that KPCA detect the group structure in reduced dimension. DLCL, FL and B-CLL are fully separated by KPCA.

To validate our procedure we select a list of genes differentially expressed proposed by (Reverter et al. (2010)) and a list of genes that are not differentially expressed. In particular, in Figures 7, 8,9 and 10 we show the results in the case of genes: $139009,1319066,1352822$ and 1338456 , respectively. The three first genes belong to the list of genes differentially expressed and the last gene is not differentially expressed.

Figure 7 (top) shows the tangent vectors associated with 139009 gene attached at each sample point. This vector field reveals upper expression towards DLCL cluster as is expected from references above mentioned. This gene is mainly represented by the first principal component. The length of the arrows indicate the influence strength of the gene on the sample position despite the dimension reduction. Figure 7 (bottom) shows the expression profile of 139009 gene. We can observe that 139009 gene is up regulated in DLCL cluster. This profile is agree with our procedure because the direction in which the expression of the 139009 gene increases points to the DLCL cluster.

统计代写|主成分分析代写Principal Component Analysis代考|Definitions of major «omics» in molecular biology and their goals

The nomicsm era, also called classically the post-genomic era, is described as the period of time which extends the first publication of the human genome sequence draft in 2001 (International Human Genome Sequencing Consortium, 2001; Venter et al., 2001). Ten years after that milestone, extensive use of high-throughput analytical technologies, high performance computing power and large advances in bioinformatics have been applied to solve fundamental molecular biology questions as well as to find clues concerning human diseases (cancers) and aging. Principal nomicsw, such as Gen-omics, Transcript-omics, Proteomics and Metabol-omics, are biology disciplines whose main and extremely ambitious objective is to describe as extensively as possible the complete class-specific molecular components of the cell. In the a omics sciences, the catalog of major cell molecular components, respectively, genes, messenger RNAs and small interfering and regulatory RNAs, proteins, and metabolites of living organisms, is recorded qualitatively as well as quantitatively in response to environmental changes or pathological situations. Various research communities, organized in institutions both at the academic and private levels and working in the nomicsm fields, have spent large amounts of effort and money to reach. standardization in the different experimental and data processing steps. Some of these “omics” specific steps basically include the following: the optimal experimental workflow design, the technology-dependent data acquisition and storage, the pre-processing methods and the post-processing strategies in order to extract some level of relevant biological knowledge from usually large data sets. Just like Perl (Practical Extraction and Report Language) has been recognized to have saved the Human Genome project initiative (Stein, 1996), by using accurate rules to parse genomic sequence data, other web-driven. programming languages and file formats such as XML have also facilitated nomics” data dissemination among scientists and helped rationalize and integrate molecular biology data.
Data resulting from different womicsw have several characteristics in common, which are summarized in Figure 1: (a) the number of measured variables $\mathrm{n}$ ( $\mathrm{SNP}$, gene expression, proteins, peptides, metabolites) is quite large in size (from 100 to 10000), (b) the number of samples or experiments $\mathrm{p}$ where these variables are measured associated with factors such as the pathological status, environmental conditions, drug exposure or kinetic points (temporal experiments) is rather large $(10$ to 1000$)$ and (c) the measured variables are organized in a matrix of $\mathrm{n} \times \mathrm{p}$ dimensions. The cell contents of such a matrix usually record a metric (or numerical code) related to the abundance of the measured variables. The observed data are acquired keeping the lowest amount of possible technical and analytical variability. Exploring these womicsw data requires fast computers and state-of-the-art data visualization and statistical multivariate tools to extract relevant knowledge, and among these tools PCA is a tool of choice in order to perform initial exploratory data analysis (EDA).

主成分分析代考

统计代写|主成分分析代写Principal Component Analysis代考|Lymphoma data sets

淋巴瘤数据集来自对三种流行淋巴恶性肿瘤基因表达的研究：B 细胞慢性淋巴细胞白血病 (B-CLL)、滤泡性淋巴瘤 (FL) 和弥漫性大 B 细胞淋巴瘤 (DLCL)。在 96 个样本中，我们抽取了 62 个样本 4026 个基因，分为三类：11 例 B-CLL、9 例 FL 和 42 例 DLCL。使用 2 通道 cDNA 微阵列测量基因表达水平。

预处理后，所有基因表达谱都进行了 10 对数转换，并且为了防止单个阵列主导分析，标准化为零均值和单位方差。最后，我们完成了基因中心对基因表达数据的预处理。

在本例中，我们执行 KPCA，如上一节所述，我们使用径向基核计算核矩阵C=0.01，这个值是启发式设置的。结果图在图 6 中给出。它显示了在微阵列的两个主要内核主成分上的投影。在该图中，我们可以看到 KPCA 检测降维的组结构。DLCL、FL 和 B-CLL 由 KPCA 完全分离。

为了验证我们的程序，我们选择了 (Reverter et al. (2010)) 提出的差异表达基因列表和未差异表达的基因列表。特别是，在图 7、8、9 和 10 中，我们展示了基因情况下的结果：139009,1319066,1352822和 1338456 分别。前三个基因属于差异表达基因列表，最后一个基因没有差异表达。

图 7（顶部）显示了与每个样本点连接的 139009 基因相关的切线向量。正如上述参考文献所预期的那样，该向量场揭示了对 DLCL 簇的上层表达式。该基因主要由第一主成分代表。尽管尺寸减小，箭头的长度表示基因对样本位置的影响强度。图 7（下）显示了 139009 基因的表达谱。我们可以观察到 139009 基因在 DLCL 簇中上调。该图谱与我们的程序一致，因为 139009 基因表达增加的方向指向 DLCL 簇。

统计代写|主成分分析代写Principal Component Analysis代考|Definitions of major «omics» in molecular biology and their goals

组学时代，也称为经典的后基因组时代，被描述为延长 2001 年人类基因组序列草案首次出版的时期（国际人类基因组测序联盟，2001；Venter 等人，2001）。在那个里程碑十年之后，广泛使用高通量分析技术、高性能计算能力和生物信息学的巨大进步已被应用于解决基本分子生物学问题以及寻找有关人类疾病（癌症）和衰老的线索。主要的组学，如基因组学、转录组学、蛋白质组学和代谢组学，是生物学学科，其主要和极其雄心勃勃的目标是尽可能广泛地描述细胞的完整类特异性分子成分。在组学科学中，主要细胞分子成分的目录，分别是基因、信使 RNA 和小干扰和调节 RNA、蛋白质和生物体的代谢物，根据环境变化或病理情况进行定性和定量记录。各种研究团体，在学术和私人层面的机构中组织并在经济学领域工作，花费了大量的精力和金钱来接触。在不同的实验和数据处理步骤中进行标准化。其中一些“组学”具体步骤基本上包括以下内容：优化实验工作流程设计、依赖技术的数据采集和存储、预处理方法和后处理策略，以便从通常较大的数据集中提取一定程度的相关生物学知识。就像 Perl（实用提取和报告语言）被公认为拯救了人类基因组计划（Stein，1996）一样，通过使用准确的规则来解析基因组序列数据，其他网络驱动。XML 等编程语言和文件格式也促进了“经济学”数据在科学家之间的传播，并有助于合理化和整合分子生物学数据。
来自不同 womicsw 的数据有几个共同的特征，总结在图 1 中： (a) 测量变量的数量n ( 小号ñ磷, 基因表达, 蛋白质, 肽, 代谢物) 规模相当大 (从 100 到 10000), (b) 样本或实验的数量p这些变量的测量与病理状态、环境条件、药物暴露或动力学点（时间实验）等因素有关(10到 1000)(c) 测量变量被组织成一个矩阵n×p方面。这种矩阵的单元格内容通常记录与测量变量的丰度相关的度量（或数字代码）。采集的观测数据保持尽可能低的技术和分析可变性。探索这些 womicsw 数据需要快速计算机和最先进的数据可视化和统计多元工具来提取相关知识，在这些工具中，PCA 是执行初始探索性数据分析 (EDA) 的首选工具。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

金融工程代写

非参数统计代写

非参数统计指的是一种统计方法，其中不假设数据来自于由少数参数决定的规定模型；这种模型的例子包括正态分布模型和线性回归模型。

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

有限元方法代写

随机分析代写

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|主成分分析代写Principal Component Analysis代考|ENVX2001

Posted on 2022年8月10日2022年8月10日 by statistics-lab

如果你也在怎样代写主成分分析Principal Component Analysis这个学科遇到相关的难题，请随时右上角联系我们的24/7代写客服。

主成分分析（PCA）是计算主成分并使用它们对数据进行基础改变的过程，有时只使用前几个主成分，而忽略其余部分。

我们提供的主成分分析Principal Component Analysis及其相关学科的代写，服务范围广, 其中包括但不限于:

Statistical Inference 统计推断
Statistical Computing 统计计算
Advanced Probability Theory 高等概率论
Advanced Mathematical Statistics 高等数理统计学
(Generalized) Linear Models 广义线性模型
Statistical Machine Learning 统计机器学习
Longitudinal Data Analysis 纵向数据分析
Foundations of Data Science 数据科学基础

统计代写|主成分分析代写Principal Component Analysis代考|ENVX2001

统计代写|主成分分析代写Principal Component Analysis代考|Validation

In this section we illustrate our procedure with data from the leukemia data set of Golub et al. (1999) and the lymphoma data set Alizadeh et al. (2000).

In these examples our aim is to validate our procedure for adding input variables information into KPCA representation. We follow the following steps. First, in each data set, we build a list of genes that are differentially expressed. This selection is based in accordance with previous studies such as (Golub et al. (1999), Pittelkow \& Wilson (2003), Reverter et al. (2010)). In addition we compute the expression profile of each gene selected, this profile confirm the evidence of differential expression.

Second, we compute the curves through each sample point associated with each gene in the list. These curves are given by the $\phi$-image of points of the form:
$$
\mathbf{y}(s)=\mathbf{x}{i}+s \mathbf{e}{k}
$$
where $x_{i}$ is the $1 \times n$ expression vector of the $i$-th sample, $i=1, \ldots, m, k$ denotes the index in the expression matrix of the gene selected to be represented, $\mathbf{e}{k}=(0, \ldots, 1, \ldots, 0)$ is a $1 \times n$ vector with zeros except in the $k$-th. These curves describe locally the change of the sample $x{i}$ induced by the change of the gene expression.

Third, we project the tangent vector of each curve at $s=0$, that is, at the sample points $\mathbf{x}_{i}$, $i=1, \ldots, m$, onto the KPCA subspace spanned by the eigenvectors (9). This representation capture the direction of maximum variation induced in the samples when the expression of gene increases.

By simultaneously displaying both the samples and the gene information on the same plot it is possible both to visually detect genes which have similar profiles and to interpret this pattern by reference to the sample groups.

统计代写|主成分分析代写Principal Component Analysis代考|Leukemia data sets

The leukemia data set is composed of 3051 gene expressions in three classes of leukemia: 19 cases of B-cell acute lymphoblastic leukemia (ALL), 8 cases of T-cell ALL and 11 cases of acute myeloid leukemia (AML). Gene expression levels were measured using Affymetrix high-density oligonucleotide arrays.

The data were preprocessed according to the protocol described in Dudoit et al. (2002). In addition, we complete the preprocessing of the gene expression data with a microarray standardization and gene centring.

In this example we perform the KPCA, as detailed in the previous section, we compute the kernel matrix with using the radial basis kernel with $c=0.01$, this value is set heuristically. The resulting plot is given in Figure 1. It shows the projection onto the two leading kernel principal components of microarrays. In this figure we can see that KPCA detect the group structure in reduced dimension. AML, T-cell ALL and B-cell ALL are fully separated by KPCA.

To validate our procedure we select a list of genes differentially expressed proposed by (Golub et al. (1999), Pittelkow \& Wilson (2003), Reverter et al. (2010)) and a list of genes that are not differentially expressed. In particular, in Figures 2, 3,4 and 5 we show the results in the case of genes: X76223_s_at, X82240_rna1_at, Y00787_s_at and D50857_at, respectively. The three first genes belong to the list of genes differentially expressed and the last gene is not differentially expressed.

Figure 2 (top) shows the tangent vectors associated with $\mathrm{X} 76223_{\text {_s_at gene, attached at }}$ each sample point. This vector field reveals upper expression towards T-cell cluster as is expected from references above mentioned. This gene is well represented by the second principal component. The length of the arrows indicate the strength of the gene on the sample position despite the dimension reduction. Figure 2 (bottom) shows the expression profile of X76223_s_at gene. We can observe that X76223_s_at gene is up regulated in T-cell class. This profile is agree with our procedure because the direction in which the expression of the $\mathrm{x} 76223$ _s_at gene increases points to the T-cell cluster.

主成分分析代考

统计代写|主成分分析代写Principal Component Analysis代考|Validation

在本节中，我们使用来自 Golub 等人的白血病数据集的数据来说明我们的程序。(1999) 和淋巴瘤数据集 Alizadeh 等人。 $(2000$ 年 $)$ 。
在这些示例中，我们的目标是验证我们将输入变量信息添加到 KPCA 表示中的过程。我们遵循以下步骤。首先，在每个数据集中，我们构建了一个差异表达的基因列表。该选择基于先前的研究，例如 (Golub et al. (1999), Pittelkow \& Wilson (2003), Reverter et al. (2010))。此外，我们计算每个所选基因的表达谱，该谱证实了差异表达的证据。
其次，我们通过与列表中每个基因相关的每个样本点计算曲线。这些曲线由 $\phi$ – 形式点的图像:
$$
\mathbf{y}(s)=\mathbf{x} i+s \mathbf{e} k
$$
在哪里 $x_{i}$ 是个 $1 \times n$ 的表达载体 $i$-第一个样本， $i=1, \ldots, m, k$ 表示选择要表示的基因在表达矩阵中的索引， $\mathrm{e} k=(0, \ldots, 1, \ldots, 0)$ 是一个 $1 \times n$ 除了在 $k$-th。这些曲线局部地描述了样本的变化 $x i$ 基因表达的改变引起的。
第三，我们将每条曲线的切向量投影在 $s=0$ ，即在样本点 $\mathbf{x}_{i}, i=1, \ldots, m$, 到由特征向量 (9) 跨越的 KPCA 子空间上。当基因表达增加时，这种表示捕获了样本中诱导的最大变异的方向。
通过在同一个图上同时显示样本和基因信息，既可以直观地检测具有相似特征的基因，也可以通过参考样本组来解释这种模式。

统计代写|主成分分析代写Principal Component Analysis代考|Leukemia data sets

白血病数据集由三类白血病中的 3051 个基因表达组成：19 例 B 细胞急性淋巴细胞白血病 (ALL)、8 例 T 细胞 ALL 和 11 例急性髓性白血病 (AML)。使用 Affymetrix 高密度寡核苷酸阵列测量基因表达水平。

根据 Dudoit 等人描述的协议对数据进行了预处理。（2002 年）。此外，我们通过微阵列标准化和基因中心化完成了基因表达数据的预处理。

在本例中，我们执行 KPCA，如上一节所述，我们使用径向基核计算核矩阵C=0.01，这个值是启发式设置的。得到的图在图 1 中给出。它显示了对微阵列的两个主要内核主成分的投影。在该图中，我们可以看到 KPCA 检测降维的组结构。AML、T 细胞 ALL 和 B 细胞 ALL 由 KPCA 完全分离。

为了验证我们的程序，我们选择了由 (Golub 等人 (1999)、Pittelkow \& Wilson (2003)、Reverter 等人 (2010)) 提出的差异表达基因列表和未差异表达的基因列表。特别是，在图 2、3、4 和 5 中，我们分别展示了基因情况下的结果：X76223_s_at、X82240_rna1_at、Y00787_s_at 和 D50857_at。前三个基因属于差异表达基因列表，最后一个基因没有差异表达。

图 2（顶部）显示了与相关的切向量X76223_s_at 基因，附着在每个采样点。正如上述参考文献所预期的那样，该向量场揭示了对 T 细胞簇的上层表达。该基因由第二个主要成分很好地代表。尽管尺寸减小，箭头的长度表示基因在样本位置上的强度。图 2（下）显示了 X76223_s_at 基因的表达谱。我们可以观察到 X76223_s_at 基因在 T 细胞类中上调。该配置文件与我们的程序一致，因为X76223_s_at 基因增加指向 T 细胞簇。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

金融工程代写

非参数统计代写

非参数统计指的是一种统计方法，其中不假设数据来自于由少数参数决定的规定模型；这种模型的例子包括正态分布模型和线性回归模型。

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

有限元方法代写

随机分析代写

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|主成分分析代写Principal Component Analysis代考|STAT3888

Posted on 2022年8月10日2022年8月10日 by statistics-lab

如果你也在怎样代写主成分分析Principal Component Analysis这个学科遇到相关的难题，请随时右上角联系我们的24/7代写客服。

主成分分析（PCA）是计算主成分并使用它们对数据进行基础改变的过程，有时只使用前几个主成分，而忽略其余部分。

我们提供的主成分分析Principal Component Analysis及其相关学科的代写，服务范围广, 其中包括但不限于:

Statistical Inference 统计推断
Statistical Computing 统计计算
Advanced Probability Theory 高等概率论
Advanced Mathematical Statistics 高等数理统计学
(Generalized) Linear Models 广义线性模型
Statistical Machine Learning 统计机器学习
Longitudinal Data Analysis 纵向数据分析
Foundations of Data Science 数据科学基础

统计代写|主成分分析代写Principal Component Analysis代考|STAT3888

统计代写|主成分分析代写Principal Component Analysis代考|Kernel PCA methodology

KPCA is a nonlinear equivalent of classical PCA that uses methods inspired by statistical learning theory. We describe shortly the KPCA method from Scholkopf et al. (1998).

Given a set of observations $\mathbf{x}{i} \in \mathbb{R}^{n}, i=1, \ldots, m$. Let us consider a dot product space $F$ related to the input space by a map $\phi: \mathbb{R}^{n} \rightarrow F$ which is possibly nonlinear. The feature space $F$ could have an arbitrarily large, and possibly infinite, dimension. Hereafter upper case characters are used for elements of $F$, while lower case characters denote elements of $\mathbb{R}^{n}$. We assume that we are dealing with centered data $\sum{i=1}^{m} \phi\left(\mathbf{x}{i}\right)=0$. In $F$ the covariance matrix takes the form $$ \mathrm{C}=\frac{1}{m} \sum{j=1}^{m} \phi\left(\mathbf{x}{j}\right) \phi\left(\mathbf{x}{j}\right)^{\top} .
$$
We have to find eigenvalues $\lambda \geq 0$ and nonzero eigenvectors $V \in F \backslash{0}$ satisfying
$$
\mathbf{C V}=\lambda \mathrm{V} \text {. }
$$

As is well known all solutions $\mathbf{V}$ with $\lambda \neq 0$ lie in the span of $\left{\phi\left(\mathbf{x}{i}\right)\right}{i=1}^{m}$. This has two consequences: first we may instead consider the set of equations
$$
\left\langle\phi\left(\mathbf{x}{k}\right), \mathbf{C V}\right\rangle=\lambda\left\langle\phi\left(\mathbf{x}{k}\right), \mathbf{V}\right\rangle,
$$
for all $k=1, \ldots, m$, and second there exist coefficients $\alpha_{i}, i=1, \ldots, m$ such that
$$
\mathbf{V}=\sum_{i=1}^{m} \alpha_{i} \phi\left(\mathbf{x}{i}\right) $$ Combining (1) and (2) we get the dual representation of the eigenvalue problem $$ \frac{1}{m} \sum{i=1}^{m} \alpha_{i}\left\langle\phi\left(\mathbf{x}{k}\right), \sum{j=1}^{m} \phi\left(\mathbf{x}{j}\right)\left\langle\phi\left(\mathbf{x}{j}\right), \phi\left(\mathbf{x}{i}\right)\right\rangle\right\rangle=\lambda \sum{i=1}^{m} \alpha_{i}\left\langle\phi\left(\mathbf{x}{k}\right), \phi\left(\mathbf{x}{i}\right)\right\rangle,
$$
for all $k=1, \ldots m$. Defining a $m \times m$ matrix $K$ by $K_{i j}:=\left\langle\phi\left(\mathbf{x}{i}\right), \phi\left(\mathbf{x}{j}\right)\right\rangle$, this reads
$$
K^{2} \alpha=m \lambda K \alpha,
$$

统计代写|主成分分析代写Principal Component Analysis代考|Adding input variable information into Kernel PCA

In order to get interpretability we add supplementary information into KPCA representation. We have developed a procedure to project any given input variable onto the subspace spanned by the eigenvectors (9).

We can consider that our observations are realizations of the random vector $X=\left(X_{1}, \ldots, X_{n}\right)$. Then to represent the prominence of the input variable $X_{k}$ in the $\mathrm{KPCA}$. We take a set of points of the form $\mathbf{y}=\mathbf{a}+s \mathbf{e}{k} \in \mathbb{R}^{n}$ where $\mathbf{e}{k}=(0, \ldots, 1, \ldots, 0) \in \mathbb{R}^{n}, s \in \mathbb{R}$, where $k$-th component is equal 1 and otherwise are 0 . Then, we can compute the projections of the image of these points $\phi(\mathbf{y})$ onto the subspace spanned by the eigenvectors (9).

Taking into account equation (11) the induced curve in the eigenspace expressed in matrix form is given by the row vector:
$$
\sigma(s){1 \times r}=\left(\mathbf{Z}{s}^{\top}-\frac{1}{m} 1_{m}^{\top} K\right)\left(\mathbf{I}{m}-\frac{1}{m} 1{m} \mathbf{1}{m f}^{\top}\right) \tilde{\mathbf{V}}{r}
$$
where $\mathbf{Z}{\mathrm{s}}$ is of the form (10). In addition we can represent directions of maximum variation of $\sigma(\mathrm{s})$ associated with the variable $X{k}$ by projecting the tangent vector at $s=0$. In matrix form, we have
$$
\left.\frac{d \sigma}{d s}\right|{s=0}=\left.\frac{d \mathbf{Z}{s}^{\top}}{d s}\right|{s=0}\left(\mathbf{I}{m}-\frac{1}{m} \mathbf{1}{m} \mathbf{1}{m}^{\top}\right) \tilde{\mathbf{V}}
$$
with
$$
\left.\frac{d \mathbf{Z}{s}^{\top}}{d s}\right|{s=0}=\left(\left.\frac{d \mathbf{Z}{s}^{1}}{d s}\right|{s=0} \ldots,\left.\frac{d \mathbf{Z}{s}^{m}}{d s}\right|{s=0}\right)^{\top}
$$ and, with
$$
\begin{aligned}
\left.\frac{d \mathbf{Z}{s}^{t}}{d s}\right|{s=0} &=\left.\frac{d K\left(\mathbf{y}{,} \mathbf{x}{i}\right)}{d s}\right|{s=0} \ &=\left.\left(\sum{t=1}^{m} \frac{\partial K\left(\mathbf{y}, \mathbf{x}{i}\right)}{\partial y{t}} \frac{d y_{t}}{d s}\right)\right|{s=0} \ &=\left.\sum{t=1}^{m} \frac{\partial K\left(\mathbf{y}, \mathbf{x}{i}\right)}{\partial y{t}}\right|{\mathbf{y}=\mathbf{a}} \delta{t}^{k}=\left.\frac{\partial K\left(\mathbf{y}{,} \mathbf{x}{i}\right)}{\partial y_{k}}\right|_{\mathbf{y}=\mathbf{a}}
\end{aligned}
$$

主成分分析代考

统计代写|主成分分析代写Principal Component Analysis代考|Kernel PCA methodology

KPCA 是经典 PCA 的非线性等价物，它使用受统计学习理论启发的方法。我们简要描述了 Scholkopf 等人的 KPCA 方法。（1998 年）。
给定一组观察结果 $\mathbf{x} i \in \mathbb{R}^{n}, i=1, \ldots, m$. 让我们考虑一个点积空间 $F$ 通过映射与输入空间相关 $\phi: \mathbb{R}^{n} \rightarrow F$ 这可能是非线性的。特征空间 $F$ 可以有一个任意大的，并且可能是无限的维度。此后，大写字符用于元素 $F$, 而小写字符表示元素 $\mathbb{R}^{n}$. 我们假设我们正在处理中心数据 $\sum i=1^{m} \phi(\mathbf{x} i)=0$. 在 $F$ 协方差矩阵采用以下形式
$$
\mathrm{C}=\frac{1}{m} \sum j=1^{m} \phi(\mathbf{x} j) \phi(\mathbf{x} j)^{\top} .
$$
我们必须找到特征值 $\lambda \geq 0$ 和非零特征向量 $V \in F \backslash 0$ 令人满意的
$$
\mathbf{C V}=\lambda \mathrm{V} \text {. }
$$ 我们可以考虑方程组
$$
\langle\phi(\mathbf{x} k), \mathbf{C V}\rangle=\lambda\langle\phi(\mathbf{x} k), \mathbf{V}\rangle,
$$
对所有人 $k=1, \ldots, m$, 其次存在系数 $\alpha_{i}, i=1, \ldots, m$ 这样
$$
\mathbf{V}=\sum_{i=1}^{m} \alpha_{i} \phi(\mathbf{x} i)
$$
结合 (1) 和 (2) 我们得到特征值问题的对偶表示
$$
\frac{1}{m} \sum i=1^{m} \alpha_{i}\left\langle\phi(\mathbf{x} k), \sum j=1^{m} \phi(\mathbf{x} j)\langle\phi(\mathbf{x} j), \phi(\mathbf{x} i)\rangle\right\rangle=\lambda \sum i=1^{m} \alpha_{i}\langle\phi(\mathbf{x} k), \phi(\mathbf{x} i)\rangle
$$
对所有人 $k=1, \ldots m$. 定义一个 $m \times m$ 矩阵 $K$ 经过 $K_{i j}:=\langle\phi(\mathbf{x} i), \phi(\mathbf{x} j)\rangle$, 这读
$$
K^{2} \alpha=m \lambda K \alpha,
$$

统计代写|主成分分析代写Principal Component Analysis代考|Adding input variable information into Kernel PCA

为了获得可解释性，我们将补充信息添加到 KPCA 表示中。我们开发了一个程序，可以将任何给定的输入变量投影到由特征向量 (9) 跨越的子空间上。
我们可以认为我们的观察是随机向量的实现 $X=\left(X_{1}, \ldots, X_{n}\right)$. 然后来表示输入变量的显着性 $X_{k}$ 在里面 $\mathrm{KPCA}$. 我们取一组点的形式 $\mathbf{y}=\mathbf{a}+s \mathbf{e} k \in \mathbb{R}^{n}$ 在哪里 $\mathrm{e} k=(0, \ldots, 1, \ldots, 0) \in \mathbb{R}^{n}, s \in \mathbb{R}$ ，在哪里 $k$ -th 分量等于 1，否则为 0 。然后，我们可以计算这些点的图像的投影 $\phi(\mathbf{y})$ 到由特征向量 (9) 跨越的子空间。
考虑到方程 (11)，以矩阵形式表示的特征空间中的诱导曲线由行向量给出:
$$
\sigma(s) 1 \times r=\left(\mathbf{Z} s^{\top}-\frac{1}{m} 1_{m}^{\top} K\right)\left(\mathbf{I} m-\frac{1}{m} 1 m \mathbf{1} m f^{\top}\right) \tilde{\mathbf{V}} r
$$
在哪里 $\mathrm{Z} \mathrm{s}$ 是 (10) 的形式。此外，我们可以表示最大变化的方向 $\sigma(\mathrm{s})$ 与变量相关联 $X k$ 通过将切向量投影在 $s=0$. 在矩阵形式中，我们有
$$
\frac{d \sigma}{d s}\left|s=0=\frac{d \mathbf{Z} s^{\top}}{d s}\right| s=0\left(\mathbf{I} m-\frac{1}{m} \mathbf{1} m \mathbf{1} m^{\top}\right) \tilde{\mathbf{V}}
$$
和
$$
\frac{d \mathbf{Z} s^{\top}}{d s} \mid s=0=\left(\frac{d \mathbf{Z} s^{1}}{d s}\left|s=0 \ldots, \frac{d \mathbf{Z} s^{m}}{d s}\right| s=0\right)^{\top}
$$
与
$$
\frac{d \mathbf{Z} s^{t}}{d s}\left|s=0=\frac{d K(\mathbf{y}, \mathbf{x} i)}{d s}\right| s=0 \quad=\left(\sum t=1^{m} \frac{\partial K(\mathbf{y}, \mathbf{x} i)}{\partial y t} \frac{d y_{t}}{d s}\right) \mid s=0=\sum t=1^{m} \partial K
$$

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

金融工程代写

非参数统计代写

非参数统计指的是一种统计方法，其中不假设数据来自于由少数参数决定的规定模型；这种模型的例子包括正态分布模型和线性回归模型。

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

有限元方法代写

随机分析代写

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|主成分分析代写Principal Component Analysis代考|ICE 2022

Posted on 2022年7月20日2022年7月20日 by statistics-lab

如果你也在怎样代写主成分分析Principal Component Analysis这个学科遇到相关的难题，请随时右上角联系我们的24/7代写客服。

主成分分析（PCA）是计算主成分并使用它们对数据进行基础改变的过程，有时只使用前几个主成分，而忽略其余部分。

我们提供的主成分分析Principal Component Analysis及其相关学科的代写，服务范围广, 其中包括但不限于:

Statistical Inference 统计推断
Statistical Computing 统计计算
Advanced Probability Theory 高等概率论
Advanced Mathematical Statistics 高等数理统计学
(Generalized) Linear Models 广义线性模型
Statistical Machine Learning 统计机器学习
Longitudinal Data Analysis 纵向数据分析
Foundations of Data Science 数据科学基础

统计代写|主成分分析代写Principal Component Analysis代考|ICE 2022

统计代写|主成分分析代写Principal Component Analysis代考|Examples of Mixed Data Modeling

The problem of modeling mixed data is quite representative of many data sets that one often encounters in practical applications. To further motivate the importance of modeling mixed data, we give below a few real-world problems that arise in image processing and computer vision. Most of these problems will be revisited later in this book, and more detailed and principled solutions will be given.
Face Clustering under Varying Illumination
The first example arises in the context of image-based face clustering. Given a collection of unlabeled images $\left{I_{j}\right}_{j=1}^{N}$ of several different faces taken under varying illumination, we would like to cluster the images corresponding to the face of the same person. For a Lambertian object, ${ }^{10}$ it has been shown that the set of all images taken under all lighting conditions forms a cone in the image space, which can be well approximated by a low-dimensional subspace called the “illumination subspace” (Belhumeur and Kriegman 1998; Basri and Jacobs 2003).” For example, if $I_{j}$ is the $j$ th image of a face and $d$ is the dimension of the illumination subspace associated with that face, then there exists a mean face $\mu$ and $d$ eigenfaces $\boldsymbol{u}{1}, \boldsymbol{u}{2}, \ldots, \boldsymbol{u}{d}$ such that $I{j} \approx \boldsymbol{\mu}+\boldsymbol{u}{1} y{1 j}+\boldsymbol{u}{2} y{2 j}+\cdots+\boldsymbol{u}{d j} y{d j}$. Now, since the images of different faces will live in different “illumination subspaces,” we can cluster the collection of images by estimating a basis for each one of those subspaces. As we will see later, this is a special case of the subspace clustering problem addressed in Part II of this book. In the example shown in Figure 1.3, we use a subset of the Yale Face Database B consisting of $n=64 \times 3$ frontal views of three faces (subjects 5,8 and 10 ) under 64 varying lighting conditions. For computational efficiency, we first down-sample each image to a size of $30 \times 40$ pixels. We then project the data onto their first three principal components using PCA, as shown in Figure $1.3$ (a). $.^{12}$ By modeling the projected data with a mixture model of linear subspaces in $\mathbb{R}^{3}$, we obtain three affine subspaces of dimension 2, 1, and 1, respectively. Despite the series of down-sampling and projection, the subspaces lead to a perfect clustering of the face images, as shown in Figure 1.3(b).

统计代写|主成分分析代写Principal Component Analysis代考|Mathematical Representations of Mixture Models

The examples presented in the previous subsection argue forcefully for the development of modeling and estimation techniques for mixture models. Obviously, whether the model associated with a given data set is mixed depends on the class of primitive models considered. In this book, the primitives are normally chosen to be simple classes of geometric models or probabilistic distributions.

For instance, one may choose the primitive models to be linear subspaces. Then one can use an arrangement of linear subspaces $\left{S_{i}^{}_{i=1}^{n}} \subset \mathbb{R}^{D}\right.$,
$$
Z \doteq S_{1} \cup S_{2} \cup \cdots \cup S_{n},
$$
also called a piecewise linear model, to approximate many nonlinear manifolds or piecewise smooth topological spaces. This is the standard model considered in geometric approaches to generalized principal component analysis (GPCA), which will be studied in Part II of this book.

The statistical counterpart to the geometric model in (1.7) is to assume instead that the sample points are drawn independently from a mixture of (near singular) Gaussian distributions $\left{p_{\theta_{i}}(\boldsymbol{x}){i=1}^{n}\right.$, where $\boldsymbol{x} \in \mathbb{R}^{D}$ but each distribution has mass concentrated near a subspace. The overall probability density function can be expressed as a sum: $$ q{\theta}(x) \doteq \pi_{1} p_{\theta_{1}}(x)+\pi_{2} p_{\theta_{2}}(x)+\cdots+\pi_{n} p_{\theta_{n}}(x),
$$
where $\theta=\left(\theta_{1}, \ldots, \theta_{n}, \pi_{1}, \ldots, \pi_{n}\right)$ are the model parameters and $\pi_{i}>0$ are mixing weights with $\pi_{1}+\pi_{2}+\cdots+\pi_{n}=1$. This is the typical model studied in mixtures of probabilistic principal component analysis (PPCA) (Tipping and Bishop $1999 \mathrm{a}$ ), where each component distribution $p_{\theta_{i}}(\boldsymbol{x})$ is a nearly degenerate Gaussian distribution. A classical way of estimating such a mixture model is the expectation maximization (EM) algorithm, where the membership of each sample is represented as a hidden random variable. Appendix B reviews the general EM method, and Chapter 6 shows how to apply it to the case of multiple subspaces.

In the special case that there is only one subspace or one component distribution (i.e., $n=1$ ), the model reduces to the classical (probabilistic) PCA, and we will see that the geometric and statistical formulations are equivalent in the sense that they both give very much the same solution (see Chapter 2). However, in the case of incomplete or corrupted data, or in the general case of a mixture of multiple components, the two formulations can be very different, and their optimal solutions need to be found by very different techniques. In this book, we will study and clarify the similarities and differences between these geometric models and statistical models in Chapters 5 and $6 .

主成分分析代考

统计代写|主成分分析代写Principal Component Analysis代考|Examples of Mixed Data Modeling

混合数据建模问题非常具有代表性，在实际应用中经常遇到的许多数据集。为了进一步激发混合数据建模的重要性，我们在下面给出了图像处理和计算机视觉中出现的一些实际问题。本书稍后将重新讨论这些问题中的大部分，并给出更详细和更有原则的解决方案。
变化光照下的人脸聚类拍摄的几张不同面孔中，我们希望将与同一个人的面孔相对应的图像聚类。对于朗伯物体， ${ }^{10}$ 已经表明，在所有照明条件下拍摄的所有图像的集合在图像空间中形成一个圆雉体，可以很好地近似于称为“照明子空间”的低维子空间 (Belhumeur 和 Kriegman 1998；Basri 和Jacobs 2003 )。”例如，如果 $I_{j}$ 是个 $j$ 一张脸的图像和 $d$ 是与该人脸相关的照明子空间的维度，则存在平均人脸 $\mu$ 和 $d$ 特征脸 $\boldsymbol{u} 1, \boldsymbol{u} 2, \ldots, \boldsymbol{u} d$ 这样
$I j \approx \boldsymbol{\mu}+\boldsymbol{u} 1 y 1 j+\boldsymbol{u} 2 y 2 j+\cdots+\boldsymbol{u} d j y d j$. 现在，由于不同面孔的图像将存在于不同的“照明子空间”中，我们可以通过估计每个子空间的基础来对图像集合进行聚类。正如我们稍后将看到的，这是本书第二部分中讨论的子空间聚类问题的一个特例。在图 $1.3$ 所示的示例中，我们使用了耶鲁人脸数据库 $B$ 的子集，包括 $n=64 \times 3$ 在 64 种不同的照明条件下，三张脸（对象 5,8 和 10）的正面视图。为了计算效率，我们首先将每个图像下采样到 $30 \times 40$ 像素。然后我们使用 PCA 将数据投影到它们的前三个主成分上，如图 $1.3$ (一个)。 12 通过使用线性子空间的混合模型对投影数据进行建模 $\mathbb{R}^{3}$ ，我们分别获得了三个维数为 2、1 和 1 的仿射子空间。尽管进行了一系列的下采样和投影，子空间仍然可以完美地聚类人脸图像，如图 1.3(b) 所示。

统计代写|主成分分析代写Principal Component Analysis代考|Mathematical Representations of Mixture Models

上一小节中的例子有力地证明了混合模型的建模和估计技术的发展。显然，与给定数据集相关的模型是否混合取决于所考虑的原始模型的类别。在本书中，基元通常被选择为简单的几何模型类或概率分布。
例如，可以选择原始模型作为线性子空间。然后可以使用线性子空间的排列
$$
Z \doteq S_{1} \cup S_{2} \cup \cdots \cup S_{n},
$$
也称为分段线性模型，用于逼近许多非线性流形或分段平滑拓扑空间。这是广义主成分分析 (GPCA) 几何方法中考虑的标准模型，本书的第二部分将对此进行研究。 ${\mathrm{i}=1} \wedge{\mathrm{n}} \backslash \mathrm{又}$ 。, where $\backslash$ boldsymbol ${\mathrm{x}} \backslash$ in $\backslash$ mathbb ${\mathrm{R}} \wedge{\mathrm{D}}$
buteachdistributionhasmassconcentratednearasubspace. Theoverallprobabilitydensity function $q \theta(x) \doteq \pi_{1} p_{\theta_{1}}(x)+\pi_{2} p_{\theta_{2}}(x)+\cdots+\pi_{n} p_{\theta_{n}}(x)$, where Itheta=《left(Itheta_{1}, Vldots, Itheta_{n},
\pi_{1}, Vdots, \pi_{n}\right)arethemodelparametersand $\backslash$ pi_{i} $>0$ aremixingweightswith
. Thisisthetypicalmodelstudiedinmixturesofprobabilisticprincipalcomponentanalysis $(P P C A)$ 1999 Imathrm{a}), whereeachcomponentdistribution $\mathrm{p}{-}{$(theta{i}}((1boldsymbol{x})\$ 是一个几乎退化
的高斯分布。估计这种混合模型的经典方法是期望最大化 (EM) 算法，其中每个样本的成员资格表示为隐藏的随机变量。附录 $\mathrm{B}$ 回顾了一般的 EM 方法，第 6 章展示了如何将其应用于多个子空间的情况。
在只有一个子空间或一个分量分布的特殊情况下 (即， $n=1$ )，模型简化为经典（概率) PCA，我们将看到几何和统计公式在某种意义上是等价的，因为它们都给出了非常相同的解 (见第 2 章) 。然而，在数据不完整或损坏的情况下，或者在混合多个成分的一般情况下，两种公式可能非常不同，需要通过非常不同的技术来找到它们的最佳解决方案。在本书中，我们将在第 5 章和第 6 章中研究和阐明这些几何模型和统计模型之间的异同。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

金融工程代写

非参数统计代写

非参数统计指的是一种统计方法，其中不假设数据来自于由少数参数决定的规定模型；这种模型的例子包括正态分布模型和线性回归模型。

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

有限元方法代写

随机分析代写

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|主成分分析代写Principal Component Analysis代考|ENVX2001

Posted on 2022年7月20日2022年7月20日 by statistics-lab

如果你也在怎样代写主成分分析Principal Component Analysis这个学科遇到相关的难题，请随时右上角联系我们的24/7代写客服。

主成分分析（PCA）是计算主成分并使用它们对数据进行基础改变的过程，有时只使用前几个主成分，而忽略其余部分。

我们提供的主成分分析Principal Component Analysis及其相关学科的代写，服务范围广, 其中包括但不限于:

Statistical Inference 统计推断
Statistical Computing 统计计算
Advanced Probability Theory 高等概率论
Advanced Mathematical Statistics 高等数理统计学
(Generalized) Linear Models 广义线性模型
Statistical Machine Learning 统计机器学习
Longitudinal Data Analysis 纵向数据分析
Foundations of Data Science 数据科学基础

统计代写|主成分分析代写Principal Component Analysis代考|Statistical Models versus Geometric Models

There are essentially two main categories of models and approaches for modeling a data set. Methods of the first category model the data as random samples from a probability distribution and try to learn this distribution from the data. We call such models statistical models. Models of the second category model the overall geometric shape of the data set with deterministic models such as subspaces, smooth manifolds, or topological spaces. ${ }^{6}$ We call such models geometric models.
Statistical Learning
In the statistical paradigm, one typically assumes that each data point $\boldsymbol{x}_{j}$ in the data set $\mathcal{X}$ is drawn independently from a common probability distribution $p(\boldsymbol{x})$. Such a probability distribution gives a generative description of the samples and can be used to generate new samples or predict the outcome of new observations. Within this context, the task of learning a model from the data becomes one of inferring the most likely probability distribution within a family of distributions of interest (for example, the Gaussian distributions). Normally, the family of distributions is parameterized and denoted by $\mathcal{M} \doteq{p(x \mid \theta): \theta \in \Theta}$, where $p(x \mid \theta)$ is a probability density function parameterized by $\theta \in \Theta$, and $\Theta$ is the space of parameters. Consequently, one popular criterion for choosing a statistical model $p\left(x \mid \theta^{*}\right)$ is the maximum likelihood (ML) estimate given by ${ }^{7}$ $$
\theta_{M L}^{} \doteq \underset{\theta \in \Theta}{\arg \max } \prod_{j=1}^{N} p\left(x_{j} \mid \theta\right) $$ If a prior distribution (density) $p(\theta)$ of the parameter $\theta$ is also given, then, following the Bayesian rule, the maximum a posteriori (MAP) estimate is given by $$ \theta_{M A P}^{} \doteq \underset{\theta \in \Theta}{\arg \max } \prod_{j=1}^{N} p\left(\boldsymbol{x}_{j} \mid \theta\right) p(\theta)
$$

统计代写|主成分分析代写Principal Component Analysis代考|Modeling Mixed Data with a Mixture Model

As we alluded to earlier, many data sets $\mathcal{X}$ cannot be modeled well by a single primitive model $M$ in a pre-chosen or preferred model class $\mathcal{M}$. Nevertheless, it is often the case that if we group such a data set $\mathcal{X}$ into multiple disjoint subsets,
$$
\mathcal{X}=\mathcal{X}{1} \cup \mathcal{X}{2} \cup \cdots \cup \mathcal{X}{n}, \quad \text { with } \mathcal{X}{l} \cap \mathcal{X}{m}=\emptyset, \text { for } l \neq m $$ then each subset $\mathcal{X}{i}$ can be modeled sufficiently well by a model in the chosen model class:
$$
M_{i}^{}=\underset{M \in \mathcal{M}}{\arg \min } \operatorname{Error}\left(\mathcal{X}{i}, M\right), \quad i=1,2, \ldots, n, $$ where $\operatorname{Error}\left(\mathcal{X}{i}, M\right)$ represents some measure of the error incurred by using the model $M$ to fit the data set $\mathcal{X}{i}$. Each model $M{i}^{}$ is called a primitive or a component model. Precisely in this sense, we call the data set $\mathcal{X}$ mixed (with respect to the chosen model class $\mathcal{M}$ ) and call the collection of primitive models $\left{M_{i}^{*}\right}_{i=1}^{n}$ a mixture model for $\mathcal{X}$. For instance, suppose we are given a set of sample points as shown in Figure 1.2. These points obviously cannot be fit well by any single line, plane, or smooth surface in $\mathbb{R}^{3}$; however, once they are grouped into three subsets, each subset can be fit well by a line or a plane. Note that in this example, the topology of the data is “hybrid”: two of the subspaces are of dimension one, and the other is of dimension two.

主成分分析代考

统计代写|主成分分析代写Principal Component Analysis代考|Statistical Models versus Geometric Models

对数据集进行建模的模型和方法基本上有两大类。第一类方法将数据建模为来自概率分布的随机样本，并尝试从数据中学习此分布。我们称这样的模型为统计模型。第二类模型使用确定性模型（例如子空间、平滑流形或拓扑空间) 对数据集的整体几何形状进行建模。 ${ }^{6}$ 我们称这种模型为几何模型。
统计学习
在统计范式中，通常假设每个数据点 $\boldsymbol{x}{j}$ 在数据集中 $\mathcal{X}$ 独立于一个共同的概率分布 $p(\boldsymbol{x})$. 这种概率分布给出了样本的生成描述，可用于生成新样本或预测新观察的结果。在这种情况下，从数据中学习模型的任务变成了在感兴趣的分布族 (例如，高斯分布) 中推断最可能的概率分布之一。通常，分布族被参数化并表示为 $\mathcal{M} \doteq p(x \mid \theta): \theta \in \Theta$ ，在哪里 $p(x \mid \theta)$ 是由以下参数化的概率密度函数 $\theta \in \Theta$ ，和 $\Theta$ 是参数空间。因此，选择统计模型的一个流行标准 $p\left(x \mid \theta^{*}\right)$ 是由下式给出的最大似然 $(\mathrm{ML})$ 估计 ${ }^{7}$ $$ \theta{M L} \doteq \underset{\theta \in \Theta}{\arg \max } \prod_{j=1}^{N} p\left(x_{j} \mid \theta\right)
$$
如果先验分布 (密度) $p(\theta)$ 参数的 $\theta$ 也给出，然后，根据贝叶斯规则，最大后验 (MAP) 估计由下式给出
$$
\theta_{M A P} \doteq \underset{\theta \in \Theta}{\arg \max } \prod_{j=1}^{N} p\left(\boldsymbol{x}_{j} \mid \theta\right) p(\theta)
$$

统计代写|主成分分析代写Principal Component Analysis代考|Modeling Mixed Data with a Mixture Model

正如我们之前提到的，许多数据集 $\mathcal{X}$ 不能用单一的原始模型很好地建模 $M$ 在预先选择或首选的模型类中 $\mathcal{M}$. 然而，如果我们对这样的数据集进行分组，通常情况下 $\mathcal{X}$ 成多个不相交的子集，
$$
\mathcal{X}=\mathcal{X} 1 \cup \mathcal{X} 2 \cup \cdots \cup \mathcal{X} n, \quad \text { with } \mathcal{X} l \cap \mathcal{X} m=\emptyset, \text { for } l \neq m
$$
然后每个子集 $\mathcal{X} i$ 可以通过所选模型类中的模型充分建模:
$$
M_{i}=\underset{M \in \mathcal{M}}{\arg \min } \operatorname{Error}\left(\mathcal{X}_{i}, M\right), \quad i=1,2, \ldots, n,
$$
在哪里Error $(\mathcal{X} i, M)$ 表示使用模型引起的误差的某种度量 $M$ 拟合数据集 $\mathcal{X} i$. 每个型号 $M i$ 称为原始模型或组件模型。正是在这个意义上，我们称数据集为 $\mathcal{X}$ 混合 (关于所选模型类 $\mathcal{M}$ ) 并调用原始模型的集合

Uleft{M_{i}^{*}}right}_{i=1}^{n}}混合模型 $\mathcal{X}$. 例如，假设给定一组样本点，如图 $1.2$ 所示。这些点显然不能被任何单一的线、平面或光滑表面很好地拟合 $\mathbb{R}^{3}$; 但是，一旦将它们分成三个子集，每个子集就可以很好地被一条线或一个平面拟合。注意，在这个例子中，数据的拓扑是“混合的”：两个子空间是一维的，另一个是二维的。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

金融工程代写

非参数统计代写

非参数统计指的是一种统计方法，其中不假设数据来自于由少数参数决定的规定模型；这种模型的例子包括正态分布模型和线性回归模型。

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

有限元方法代写

随机分析代写

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

统计代写|主成分分析代写Principal Component Analysis代考|STAT3888

Posted on 2022年7月20日2022年7月20日 by statistics-lab

如果你也在怎样代写主成分分析Principal Component Analysis这个学科遇到相关的难题，请随时右上角联系我们的24/7代写客服。

主成分分析（PCA）是计算主成分并使用它们对数据进行基础改变的过程，有时只使用前几个主成分，而忽略其余部分。

我们提供的主成分分析Principal Component Analysis及其相关学科的代写，服务范围广, 其中包括但不限于:

Statistical Inference 统计推断
Statistical Computing 统计计算
Advanced Probability Theory 高等概率论
Advanced Mathematical Statistics 高等数理统计学
(Generalized) Linear Models 广义线性模型
Statistical Machine Learning 统计机器学习
Longitudinal Data Analysis 纵向数据分析
Foundations of Data Science 数据科学基础

统计代写|主成分分析代写Principal Component Analysis代考|Modeling Data with a Parametric Model

The primary goal of this book is to study theory and methods for modeling highdimensional data with one or more low-dimensional subspaces or manifolds. To a large extent, the methods presented in this book aim to generalize the classical principal component analysis (PCA) method (Jolliffe 1986, 2002) to address two major challenges presented by current applications.

One challenge is to generalize the classical PCA method to data with significant amounts of missing entries, errors, outliers, or even a certain level of nonlinearity. Since the very beginning of PCA nearly a century ago (Pearson 1901; Hotelling 1933), researchers have been aware of PCA’s vulnerability to missing data and corruption. Strictly speaking, estimating a subspace from incomplete or corrupted data is an inherently difficult problem, which is generally NP-hard. Nevertheless, due to the practical importance of this problem, many extensions to PCA have been proposed throughout the years in different practical domains to handle imperfect data, even though many of these methods have been largely heuristic, greedy, or even ad hoc. Recent advances in high-dimensional statistics and convex optimization have begun to provide provably correct ${ }^{1}$ and efficient methods for finding the optimal subspace from highly incomplete or corrupted data.

In science and engineering, one is frequently called upon to infer (or learn) a quantitative model $M$ for a given set of sample points $\mathcal{X}=\left{x_{1}, x_{2}, \ldots, x_{N}\right} \subset \mathbb{R}^{D}$. For instance, Figure $1.1$ shows a simple example in which one is given a set of four sample points in a two-dimensional plane. Obviously, these points can be fit perfectly by a (one-dimensional) straight line $L$. The line can then be called a “model” for the given points. The reason for inferring such a model is that it serves many useful purposes. On the one hand, the model can reveal information encoded in the data or underlying mechanisms from which the data were generated. In addition, it can simplify the representation of the given data set and help predict future samples. In the case of the four points shown in Figure $1.1$, the line model gives a more compact one-dimensional representation than the original twodimensional plane $P$. It also suggests that any new point (if generated with a similar mechanism as the existing points) will likely fall on the same line.

统计代写|主成分分析代写Principal Component Analysis代考|The Choice of a Model Class

A first important consideration to keep in mind is that inferring the “correct” model for a given data set is an elusive, if not impossible, task. The fundamental difficulty is that if we are not specific about what we mean by a “correct” model, there could easily be many different models that fit the given data set “equally well.” For instance, in the example shown in Figure 1.1, any smooth curve that passes through the sample points would seem to be as valid a model as the straight line. Furthermore, if there were noise in the given sample points, then any curve, including the line, passing through the points exactly would unlikely be the “true model.”

The question now is this: in what sense can we say that a model is correct or optimal for a given data set? To make the model inference problem well posed, i.e., to guarantee that there is a unique optimal model for the given data, we need to impose additional assumptions or restrictions on the class of models considered. To this end, we should not be looking for just any model that can describe the data. Instead, we should look for a model $M^{*}$ that is the best among a restricted class of models $\mathcal{M} .^{4}$ In addition, to make the model inference problem computationally tractable, we need to specify how restricted the class of models needs to be. A common strategy, known as the principle of Occam’s razor, ${ }^{5}$ is to try to get away with the simplest possible class of models that is just necessary to describe the data or solve the problem at hand. More precisely, the model class should be rich enough to contain at least one model that can fit the data to a desired accuracy and yet be restricted enough that it is relatively simple to find the best model for the given data.
Thus, in engineering practice, the most popular strategy is to start from the simplest class of models and increase the complexity of the models only when the simpler models become inadequate. For instance, to fit a set of sample points, one may first try the simplest class of models, namely linear models, followed by the class of hybrid (piecewise) linear models (subspaces), and then followed by the class of (piecewise) nonlinear models (submanifolds). One of the goals of this book is to demonstrate that among them, piecewise linear models can already achieve an excellent balance between expressiveness and simplicity for many important practical data sets and problems.

主成分分析代考

统计代写|主成分分析代写Principal Component Analysis代考|Modeling Data with a Parametric Model

本书的主要目标是研究用一个或多个低维子空间或流形对高维数据建模的理论和方法。在很大程度上，本书中介绍的方法旨在推广经典的主成分分析 (PCA) 方法 (Jolliffe 1986, 2002)，以解决当前应用程序提出的两个主要挑战。

一个挑战是将经典的 PCA 方法推广到具有大量缺失条目、错误、异常值甚至某种程度的非线性的数据。自从近一个世纪前 PCA 诞生以来（Pearson 1901；Hotelling 1933），研究人员就已经意识到 PCA 对丢失数据和损坏的脆弱性。严格来说，从不完整或损坏的数据中估计子空间是一个固有的难题，通常是 NP-hard。然而，由于这个问题的实际重要性，多年来在不同的实际领域中提出了许多 PCA 扩展来处理不完美的数据，尽管其中许多方法在很大程度上是启发式的、贪婪的，甚至是临时的。高维统计和凸优化的最新进展已经开始提供可证明的正确性1从高度不完整或损坏的数据中找到最佳子空间的有效方法。

在科学和工程领域，人们经常被要求推断（或学习）一个定量模型米对于给定的一组样本点\mathcal{X}=\left{x_{1}, x_{2}, \ldots, x_{N}\right} \subset \mathbb{R}^{D}\mathcal{X}=\left{x_{1}, x_{2}, \ldots, x_{N}\right} \subset \mathbb{R}^{D}. 例如，图1.1显示了一个简单的示例，其中给定一个二维平面中的一组四个样本点。显然，这些点可以通过（一维）直线完美拟合大号. 然后可以将这条线称为给定点的“模型”。推断出这样一个模型的原因是它有许多有用的用途。一方面，该模型可以揭示编码在数据中的信息或生成数据的基础机制。此外，它可以简化给定数据集的表示，并有助于预测未来的样本。在如图所示的四个点的情况下1.1，线模型给出了比原始二维平面更紧凑的一维表示磷. 它还表明任何新点（如果使用与现有点类似的机制生成）都可能落在同一条线上。

统计代写|主成分分析代写Principal Component Analysis代考|The Choice of a Model Class

要记住的第一个重要考虑因素是，为给定数据集推断“正确”模型是一项难以捉摸的任务，如果不是不可能的话。根本的困难在于，如果我们不具体说明“正确”模型的含义，那么很容易有许多不同的模型“同样好”地拟合给定的数据集。例如，在图 1.1 所示的示例中，任何通过样本点的平滑曲线似乎都与直线一样有效。此外，如果给定的样本点中有噪声，那么任何通过这些点的曲线（包括直线）都不太可能是“真实模型”。

现在的问题是：在什么意义上我们可以说模型对于给定的数据集是正确的或最优的？为了使模型推理问题很好地提出，即保证给定数据存在唯一的最优模型，我们需要对所考虑的模型类别施加额外的假设或限制。为此，我们不应该只寻找可以描述数据的任何模型。相反，我们应该寻找一个模型米∗这是受限模型中最好的米.4此外，为了使模型推理问题在计算上易于处理，我们需要指定模型类需要的限制程度。一种常见的策略，称为奥卡姆剃刀原理，5是试图摆脱对描述数据或解决手头问题所必需的最简单的模型类别。更准确地说，模型类应该足够丰富以包含至少一个模型，该模型可以将数据拟合到所需的精度，但又应受到足够的限制，以便相对简单地为给定数据找到最佳模型。
因此，在工程实践中，最流行的策略是从最简单的模型类开始，只有在更简单的模型变得不合适时才增加模型的复杂性。例如，为了拟合一组样本点，可以先尝试最简单的一类模型，即线性模型，然后是混合（分段）线性模型（子空间）类，然后是（分段）类非线性模型（子流形）。本书的目标之一是证明其中的分段线性模型对于许多重要的实际数据集和问题已经可以在表达性和简单性之间实现出色的平衡。

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

金融工程代写

非参数统计代写

非参数统计指的是一种统计方法，其中不假设数据来自于由少数参数决定的规定模型；这种模型的例子包括正态分布模型和线性回归模型。

广义线性模型代考

广义线性模型（GLM）归属统计学领域，是一种应用灵活的线性回归模型。该模型允许因变量的偏差分布有除了正态分布之外的其它分布。

有限元方法代写

随机分析代写

时间序列分析代写

回归分析代写

MATLAB代写

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写

机器学习代写|主成分分析作业代写PCA代考| Matrix Analysis

Posted on 2022年5月13日2022年5月13日 by statistics-lab

如果你也在怎样代写主成分分析PCA这个学科遇到相关的难题，请随时右上角联系我们的24/7代写客服。

主成分分析（PCA）是计算主成分并使用它们对数据进行基础改变的过程，有时只使用前几个主成分，而忽略其余部分。

statistics-lab™ 为您的留学生涯保驾护航在代写主成分分析PCA方面已经树立了自己的口碑, 保证靠谱, 高质且原创的统计Statistics代写服务。我们的专家在代写主成分分析PCA代写方面经验极为丰富，各种代写主成分分析PCA相关的作业也就用不着说。

我们提供的主成分分析PCA及其相关学科的代写，服务范围广, 其中包括但不限于:

Statistical Inference 统计推断
Statistical Computing 统计计算
Advanced Probability Theory 高等概率论
Advanced Mathematical Statistics 高等数理统计学
(Generalized) Linear Models 广义线性模型
Statistical Machine Learning 统计机器学习
Longitudinal Data Analysis 纵向数据分析
Foundations of Data Science 数据科学基础

Automatic Differentiation with torch.autograd — PyTorch Tutorials 1.11.0+cu102 documentation — 机器学习代写|主成分分析作业代写PCA代考| Matrix Analysis

机器学习代写|主成分分析作业代写PCA代考|Gradient of Real Function with Respect to Real Vector

Define gradient operator $\nabla_{x}$ of an $n \times 1$ vector $x$ as
$$
\nabla_{x}=\left[\frac{\partial}{\partial x_{1}}, \quad \frac{\partial}{\partial x_{2}}, \quad \cdots, \quad \frac{\partial}{\partial x_{n}}\right]^{\mathrm{T}}=\frac{\partial}{\partial x},
$$
Then the gradient of a real scalar quantity function $f(\boldsymbol{x})$ with respect to $\boldsymbol{x}$ is a $n \times 1$ column vector, which is defined as
$$
\nabla_{x} f(\boldsymbol{x})=\left[\frac{\partial f(\boldsymbol{x})}{\partial x_{1}}, \quad \frac{\partial f(\boldsymbol{x})}{\partial x_{2}}, \quad \cdots, \quad \frac{\partial f(\boldsymbol{x})}{\partial x_{n}}\right]^{\mathrm{T}}=\frac{\partial f(\boldsymbol{x})}{\partial \boldsymbol{x}}
$$
The negative direction of the gradient direction is called as the gradient flow of variable $\boldsymbol{x}$, written as
$$
\dot{x}=-\nabla_{x} f(x)
$$
The gradient of $m$-dimensional row vector function $f(\boldsymbol{x})=$ $\left[f_{1}(\boldsymbol{x}), f_{2}(\boldsymbol{x}), \ldots, f_{m}(\boldsymbol{x})\right]$ with respect to the $n \times 1$ real vector $x$ is an $n \times m$ matrix, defined as
$$
\frac{\partial f(\boldsymbol{x})}{\partial \boldsymbol{x}}=\left[\begin{array}{lll}
\frac{\partial f_{1}(\boldsymbol{x})}{\partial x_{1}} & \frac{\partial y_{2}(\boldsymbol{x})}{\partial x_{1}} & \frac{\partial f_{w}(\boldsymbol{x})}{\partial x_{1}} \
\frac{\partial f_{1}(\boldsymbol{x})}{\partial x_{2}} & \frac{\partial f_{2}(\boldsymbol{x})}{\partial x_{2}} & \frac{\partial f_{w}(\boldsymbol{x})}{\partial x_{2}} \
\frac{\partial f_{1}(\boldsymbol{x})}{\partial x_{n}} & \frac{\partial f_{2}(\boldsymbol{x})}{\partial x_{n}} & \frac{\partial f_{w}(\boldsymbol{x})}{\partial x_{n}}
\end{array}\right]=\nabla_{x} \boldsymbol{f}(\boldsymbol{x}) .
$$
Some properties of gradient operations can be summarized as follows:
(1) If $f(\boldsymbol{x})=c$ is a constant, then gradient $\frac{\partial c}{\partial \boldsymbol{x}}=\boldsymbol{O}$.
(2) Linear principle: If $f(\boldsymbol{x})$ and $g(\boldsymbol{x})$ are real functions of vector $\boldsymbol{x}$, and $c_{1}$ and $c_{2}$ are real constants, then
$$
\frac{\partial\left[c_{1} f(\boldsymbol{x})+c_{2} g(\boldsymbol{x})\right]}{\partial \boldsymbol{x}}=c_{1} \frac{\partial f(\boldsymbol{x})}{\partial \boldsymbol{x}}+c_{2} \frac{\partial g(\boldsymbol{x})}{\partial \boldsymbol{x}}
$$
(3) Product principle: If $f(\boldsymbol{x})$ and $g(\boldsymbol{x})$ are real functions of vector $\boldsymbol{x}$, then
$$
\frac{\partial f(\boldsymbol{x}) g(\boldsymbol{x})}{\partial \boldsymbol{x}}=g(\boldsymbol{x}) \frac{\partial f(\boldsymbol{x})}{\partial \boldsymbol{x}}+f(\boldsymbol{x}) \frac{\partial g(\boldsymbol{x})}{\partial \boldsymbol{x}}
$$
(4) Quotient principle: If $g(x) \neq 0$, then
$$
\frac{\partial f(\boldsymbol{x}) / g(\boldsymbol{x})}{\partial \boldsymbol{x}}=\frac{1}{g^{2}(\boldsymbol{x})}\left[g(\boldsymbol{x}) \frac{\partial f(\boldsymbol{x})}{\partial \boldsymbol{x}}-f(\boldsymbol{x}) \frac{\partial g(\boldsymbol{x})}{\partial \boldsymbol{x}}\right] .
$$
(5) Chain principle: If $\boldsymbol{y}(\boldsymbol{x})$ is a vector-valued function of $\boldsymbol{x}$, then
$$
\frac{\partial f(\boldsymbol{y}(\boldsymbol{x}))}{\partial \boldsymbol{x}}=\frac{\partial \boldsymbol{y}^{T}(\boldsymbol{x})}{\partial \boldsymbol{x}} \frac{\partial f(\boldsymbol{y})}{\partial \boldsymbol{y}}
$$
where $\frac{\partial y^{\mathrm{T}}(x)}{\partial x}$ is an $n \times n$ matrix.

机器学习代写|主成分分析作业代写PCA代考|Gradient Matrix of Real Function

The gradient of a real function $f(\boldsymbol{A})$ with respect to an $m \times n$ real matrix $\boldsymbol{A}$ is an $m \times n$ matrix, called as gradient matrix, defined as
$$
\frac{\partial f(\boldsymbol{A})}{\partial \boldsymbol{A}}=\left[\begin{array}{cccc}
\frac{\partial f(\boldsymbol{A})}{\partial A_{11}} & \frac{\partial f(\boldsymbol{A})}{\partial A_{12}} & \cdots & \frac{\partial f(\boldsymbol{A})}{\partial A_{L E}} \
\frac{\partial f(\boldsymbol{A})}{\partial A_{21}} & \frac{\partial f(\boldsymbol{A})}{\partial A_{22}} & \cdots & \frac{\partial f(\boldsymbol{A})}{\partial A_{2 c}} \
\vdots & \vdots & & \vdots \
\frac{\partial f(\boldsymbol{A})}{\partial A_{m 1}} & \frac{\partial f(\boldsymbol{A})}{\partial A_{\mathrm{m} 2}} & \cdots & \frac{\partial f(\boldsymbol{A})}{\partial A_{m n}}
\end{array}\right]=\nabla_{\boldsymbol{A}} f(\boldsymbol{A})
$$
where $A_{i j}$ is the element of matrix $A$ on its $i$ th row and $j$ th column.
Some properties of the gradient of a real function with respect to a matrix can be summarized as follows:
(1) If $f(\boldsymbol{A})=c$ is a constant, where $\boldsymbol{A}$ is an $m \times n$ matrix, then $\frac{\partial c}{\partial A}=\boldsymbol{O}{m \times n}$. (2) Linear principle: If $f(\boldsymbol{A})$ and $g(\boldsymbol{A})$ are real functions of matrix $\boldsymbol{A}$, and $c{1}$ and $c_{2}$ are real constants, then
$$
\frac{\partial\left[c_{1} f(\boldsymbol{A})+c_{2} g(\boldsymbol{A})\right]}{\partial \boldsymbol{A}}=c_{1} \frac{\partial f(\boldsymbol{A})}{\partial \boldsymbol{A}}+c_{2} \frac{\partial g(\boldsymbol{A})}{\partial \boldsymbol{A}} .
$$
(3) Product principle: If $f(\boldsymbol{A})$ and $g(\boldsymbol{A})$ are real functions of matrix $\boldsymbol{A}$, then
$$
\frac{\partial f(\boldsymbol{A}) g(\boldsymbol{A})}{\partial \boldsymbol{A}}=g(\boldsymbol{A}) \frac{\partial f(\boldsymbol{A})}{\partial \boldsymbol{A}}+f(\boldsymbol{A}) \frac{\partial g(\boldsymbol{A})}{\partial \boldsymbol{A}}
$$
(4) Quotient principle: If $g(\boldsymbol{A}) \neq 0$, then
$$
\frac{\partial f(\boldsymbol{A}) / g(\boldsymbol{A})}{\partial(\boldsymbol{A})}=\frac{1}{g^{2}(\boldsymbol{A})}\left[g(\boldsymbol{A}) \frac{\partial f(\boldsymbol{A})}{\partial \boldsymbol{A}}-f(\boldsymbol{A}) \frac{\partial g(\boldsymbol{A})}{\partial \boldsymbol{A}}\right]
$$
(5) Chain principle: Let $\boldsymbol{A}$ be an $m \times n$ matrix, and $y=f(\boldsymbol{A})$ and $g(y)$ are real functions of matrix $A$ and scalar $y$, respectively. Then
$$
\frac{\partial g(f(A))}{\partial A}=\frac{d g(y)}{d y} \frac{\partial f(A)}{\partial A}
$$
(6) If $\boldsymbol{A} \in \Re^{m \times n}, \boldsymbol{x} \in \Re^{m \times 1}, \boldsymbol{y} \in \Re^{n \times 1}$, then
$$
\frac{\partial x^{\mathrm{T}} A y}{\partial A}=A y^{\mathrm{T}}
$$
(7) If $A \in \Re^{n \times n}$ is nonsingular $\boldsymbol{x} \in \Re^{n \times 1}, \boldsymbol{y} \in \mathcal{K}^{n \times 1}$, then
$$
\frac{\partial \boldsymbol{x}^{\mathrm{T}} \boldsymbol{A}^{-1} \boldsymbol{y}}{\partial \boldsymbol{A}}=-\boldsymbol{A}^{-\mathrm{T}} \boldsymbol{A} \boldsymbol{y}^{\mathrm{T}} \boldsymbol{A}^{-\mathrm{T}}
$$

机器学习代写|主成分分析作业代写PCA代考|Gradient Matrix of Trace Function

Here, we summarize some properties of gradient matrix of trace functions.
(1)-(3) are gradient matrices of the trace of a single matrix.
(1) If $W$ is an $m \times m$ matrix, then
$$
\frac{\partial \operatorname{tr}(\boldsymbol{W})}{\partial \boldsymbol{W}}=\boldsymbol{I}_{m}
$$
(2) If an $m \times m$ matrix $\boldsymbol{W}$ is invertible, then
$$
\frac{\partial \mathrm{tr}\left(\boldsymbol{W}^{-1}\right)}{\partial \boldsymbol{W}}=-\left(\boldsymbol{W}^{-2}\right)^{\mathrm{T}}
$$
(3) For the outer product of two vectors, it holds that
$$
\frac{\partial \operatorname{tr}\left(x y^{\mathrm{T}}\right)}{\partial \boldsymbol{x}}=\frac{\partial t r\left(\boldsymbol{y} \boldsymbol{x}^{\mathrm{T}}\right)}{\partial \boldsymbol{x}}=\boldsymbol{y}
$$
(4)-(7) are gradient matrices of the trace of the product of two matrices.
(4) If $W \in \Re^{m \times n}, A \in \Re^{n \times m}$, then
$$
\frac{\partial \operatorname{tr}(\boldsymbol{W} \boldsymbol{A})}{\partial \boldsymbol{W}}=\frac{\partial \mathrm{tr}(\boldsymbol{A} \boldsymbol{W})}{\partial \boldsymbol{W}}=\boldsymbol{A}^{\mathrm{T}} .
$$
(5) If $\boldsymbol{W} \in \Re^{m \times n}, \boldsymbol{A} \in \Re^{m \times n}$, then
$$
\frac{\partial \mathrm{tr}\left(\boldsymbol{W}^{\mathrm{T}} \boldsymbol{A}\right)}{\partial \boldsymbol{W}}=\frac{\partial \mathrm{tr}\left(\boldsymbol{A} \boldsymbol{W}^{\mathrm{T}}\right)}{\partial \boldsymbol{W}}=\boldsymbol{A} .
$$
(6) If $W \in \Re^{m \times n}$, then
$$
\frac{\partial \mathrm{tr}\left(\boldsymbol{W} \boldsymbol{W}^{\mathrm{T}}\right)}{\partial \boldsymbol{W}}=\frac{\partial \mathrm{tr}\left(\boldsymbol{W}^{\mathrm{T}} \boldsymbol{W}\right)}{\partial \boldsymbol{W}}=2 \boldsymbol{W}
$$
(7) If $W \in \Re^{m \times n}$, then
$$
\frac{\partial \mathrm{tr}\left(\boldsymbol{W}^{2}\right)}{\partial \boldsymbol{W}}=\frac{\partial \mathrm{tr}(\boldsymbol{W} \boldsymbol{W})}{\partial \boldsymbol{W}}=2 \boldsymbol{W}^{\mathrm{T}} .
$$
(8) If $\boldsymbol{W}, \boldsymbol{A} \in \Re^{m \times m}$ and $\boldsymbol{W}$ is nonsingular, then
$$
\frac{\partial \mathrm{tr}\left(\boldsymbol{A} \boldsymbol{W}^{-1}\right)}{\partial \boldsymbol{W}}=-\left(\boldsymbol{W}^{-1} \boldsymbol{A} \boldsymbol{W}^{-1}\right)^{\mathrm{T}} .
$$

主成分分析代写

机器学习代写|主成分分析作业代写PCA代考|Gradient of Real Function with Respect to Real Vector

定义梯度算子∇X一个n×1向量X作为
∇X=[∂∂X1,∂∂X2,⋯,∂∂Xn]吨=∂∂X,
那么一个实标量函数的梯度F(X)关于X是一个n×1列向量，定义为
∇XF(X)=[∂F(X)∂X1,∂F(X)∂X2,⋯,∂F(X)∂Xn]吨=∂F(X)∂X
梯度方向的负方向称为变量的梯度流X，写为
X˙=−∇XF(X)
的梯度米维行向量函数F(X)= [F1(X),F2(X),…,F米(X)]相对于该n×1实向量X是一个n×米矩阵，定义为
∂F(X)∂X=[∂F1(X)∂X1∂是2(X)∂X1∂F在(X)∂X1 ∂F1(X)∂X2∂F2(X)∂X2∂F在(X)∂X2 ∂F1(X)∂Xn∂F2(X)∂Xn∂F在(X)∂Xn]=∇XF(X).
梯度运算的一些性质可以总结如下：
(1) 如果F(X)=C是一个常数，然后是梯度∂C∂X=这.
(2) 线性原理：如果F(X)和G(X)是向量的实函数X，和C1和C2是实常数，那么
∂[C1F(X)+C2G(X)]∂X=C1∂F(X)∂X+C2∂G(X)∂X
（3）产品原理：如果F(X)和G(X)是向量的实函数X，然后
∂F(X)G(X)∂X=G(X)∂F(X)∂X+F(X)∂G(X)∂X
(4) 商数原则：如果G(X)≠0，然后
∂F(X)/G(X)∂X=1G2(X)[G(X)∂F(X)∂X−F(X)∂G(X)∂X].
(5)链式原理：如果是(X)是一个向量值函数X，然后
∂F(是(X))∂X=∂是吨(X)∂X∂F(是)∂是
在哪里∂是吨(X)∂X是一个n×n矩阵。

机器学习代写|主成分分析作业代写PCA代考|Gradient Matrix of Real Function

实函数的梯度F(一种)关于一个米×n实矩阵一种是一个米×n矩阵，称为梯度矩阵，定义为
∂F(一种)∂一种=[∂F(一种)∂一种11∂F(一种)∂一种12⋯∂F(一种)∂一种大号和 ∂F(一种)∂一种21∂F(一种)∂一种22⋯∂F(一种)∂一种2C ⋮⋮⋮ ∂F(一种)∂一种米1∂F(一种)∂一种米2⋯∂F(一种)∂一种米n]=∇一种F(一种)
在哪里一种一世j是矩阵的元素一种在它的一世第行和j第列。
实函数相对于矩阵的梯度的一些性质可以总结如下：
(1) 如果F(一种)=C是一个常数，其中一种是一个米×n矩阵，那么∂C∂一种=这米×n. (2) 线性原理：如果F(一种)和G(一种)是矩阵的实函数一种，和C1和C2是实常数，那么
∂[C1F(一种)+C2G(一种)]∂一种=C1∂F(一种)∂一种+C2∂G(一种)∂一种.
（3）产品原理：如果F(一种)和G(一种)是矩阵的实函数一种，然后
∂F(一种)G(一种)∂一种=G(一种)∂F(一种)∂一种+F(一种)∂G(一种)∂一种
(4) 商数原则：如果G(一种)≠0，然后
∂F(一种)/G(一种)∂(一种)=1G2(一种)[G(一种)∂F(一种)∂一种−F(一种)∂G(一种)∂一种]
(5) 链式原理：让一种豆米×n矩阵，和是=F(一种)和G(是)是矩阵的实函数一种和标量是，分别。然后
∂G(F(一种))∂一种=dG(是)d是∂F(一种)∂一种
(6) 如果一种∈ℜ米×n,X∈ℜ米×1,是∈ℜn×1，然后
∂X吨一种是∂一种=一种是吨
(7) 如果一种∈ℜn×n是非奇异的X∈ℜn×1,是∈ķn×1，然后
∂X吨一种−1是∂一种=−一种−吨一种是吨一种−吨

机器学习代写|主成分分析作业代写PCA代考|Gradient Matrix of Trace Function

在这里，我们总结了迹函数梯度矩阵的一些性质。
(1)-(3) 是单个矩阵的迹的梯度矩阵。
(1) 如果在是一个米×米矩阵，那么
∂tr⁡(在)∂在=一世米
(2) 如果一个米×米矩阵在是可逆的，那么
∂吨r(在−1)∂在=−(在−2)吨
(3) 对于两个向量的外积，有
∂tr⁡(X是吨)∂X=∂吨r(是X吨)∂X=是
(4)-(7) 是两个矩阵乘积的迹的梯度矩阵。
(4) 如果在∈ℜ米×n,一种∈ℜn×米，然后
∂tr⁡(在一种)∂在=∂吨r(一种在)∂在=一种吨.
(5) 如果在∈ℜ米×n,一种∈ℜ米×n，然后
∂吨r(在吨一种)∂在=∂吨r(一种在吨)∂在=一种.
(6) 如果在∈ℜ米×n，然后
∂吨r(在在吨)∂在=∂吨r(在吨在)∂在=2在
(7) 如果在∈ℜ米×n，然后
∂吨r(在2)∂在=∂吨r(在在)∂在=2在吨.
(8) 如果在,一种∈ℜ米×米和在是非奇异的，那么
∂吨r(一种在−1)∂在=−(在−1一种在−1)吨.

统计代写请认准statistics-lab™. statistics-lab™为您的留学生涯保驾护航。

R语言代写	问卷设计与分析代写
PYTHON代写	回归分析与线性模型代写
MATLAB代写	方差分析与试验设计代写
STATA代写	机器学习/统计学习代写
SPSS代写	计量经济学代写
EVIEWS代写	时间序列分析代写
EXCEL代写	深度学习代写
SQL代写	各种数据建模与可视化代写