We now start with a formal definition of a probability space and related terms from measure theory [2].
Definition 1.9 (Probability Space) A probability space is a triple $(\Omega, \mathcal{F}, \mu)$ consisting of the sample space $\Omega$, an event space $\mathcal{F}$ composed of subsets of $\Omega$ (often called a $\sigma$-algebra), and the probability measure (or distribution) $\mu: \mathcal{F} \mapsto[0,1]$, a function such that:
$\mu$ must satisfy the countable additivity property that for all countable collections $\left\{E_i\right\}$ of pairwise disjoint sets: $$ \mu\left(\cup_i E_i\right)=\sum_i \mu\left(E_i\right) ; $$
the measure of the entire sample space is equal to one: $\mu(\Omega)=1$.
In fact, the probability measure is a special case of the general “measure” in measure theory [2]. Specifically, the general term “measure” is defined similarly to the probability measure above, except that only non-negativity and the countable additivity property are required. Another important special case of a measure is the counting measure $\nu(A)$, which assigns to a set $A$ the number of elements in $A$.
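As a minimal numerical illustration (added here, not from the original text, and assuming a finite sample space so that countable additivity reduces to finite additivity), the following Python sketch encodes the probability measure of a fair die together with the counting measure and checks the two defining properties.

```python
from fractions import Fraction

# A finite sample space (a fair six-sided die) with its probability measure.
omega = {1, 2, 3, 4, 5, 6}
mu = {w: Fraction(1, 6) for w in omega}          # probability of each outcome

def measure(event):
    """Probability measure of an event (a subset of omega)."""
    return sum(mu[w] for w in event)

def counting_measure(event):
    """Counting measure: the number of elements in the event."""
    return len(event)

# Total mass is one.
assert measure(omega) == 1

# (Finite) additivity for pairwise disjoint events.
E1, E2 = {1, 2}, {5}
assert measure(E1 | E2) == measure(E1) + measure(E2)

print(measure({2, 4, 6}), counting_measure({2, 4, 6}))   # 1/2 and 3
```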
To understand the concept of a probability space, we give two examples: one for the discrete case, the other for the continuous one.
Some Matrix Algebra
In the following, we introduce some matrix algebra that is useful in understanding the materials in this book.
A matrix is a rectangular array of numbers, denoted by an upper case letter, say $\boldsymbol{A}$. A matrix with $m$ rows and $n$ columns is called an $m \times n$ matrix, given by $$ \boldsymbol{A}=\left[\begin{array}{cccc} a_{11} & a_{12} & \cdots & a_{1 n} \\ a_{21} & a_{22} & \cdots & a_{2 n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{m 1} & a_{m 2} & \cdots & a_{m n} \end{array}\right] . $$ The $k$-th column of matrix $\boldsymbol{A}$ is often denoted by $\boldsymbol{a}_k$. The maximal number of linearly independent columns of $\boldsymbol{A}$ is called the rank of the matrix $\boldsymbol{A}$. It is easy to show that $$ \operatorname{Rank}(\boldsymbol{A})=\operatorname{dim} \operatorname{span}\left(\left[\boldsymbol{a}_1, \cdots, \boldsymbol{a}_n\right]\right) . $$ The trace of a square matrix $\boldsymbol{A} \in \mathbb{R}^{n \times n}$, denoted $\operatorname{Tr}(\boldsymbol{A})$, is defined to be the sum of the elements on the main diagonal (from the upper left to the lower right) of $\boldsymbol{A}$: $$ \operatorname{Tr}(\boldsymbol{A})=\sum_{i=1}^n a_{i i} . $$ Definition 1.11 (Range Space) The range space of a matrix $\boldsymbol{A} \in \mathbb{R}^{m \times n}$, denoted by $\mathcal{R}(\boldsymbol{A})$, is defined by $\mathcal{R}(\boldsymbol{A}):=\left\{\boldsymbol{A} \boldsymbol{x} \mid \boldsymbol{x} \in \mathbb{R}^n\right\}$.
Definition 1.12 (Null Space) The null space of a matrix $\boldsymbol{A} \in \mathbb{R}^{m \times n}$, denoted by $\mathcal{N}(\boldsymbol{A})$, is defined by $\mathcal{N}(\boldsymbol{A}):=\left\{\boldsymbol{x} \in \mathbb{R}^n \mid \boldsymbol{A} \boldsymbol{x}=\mathbf{0}\right\}$.
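The following Python sketch (an illustration added here, assuming NumPy and SciPy are available) computes the rank, the trace, and an orthonormal basis of the null space of a small rank-deficient matrix.

```python
import numpy as np
from scipy.linalg import null_space  # assumes SciPy is installed

# A rank-deficient 3x3 matrix: the third column is the sum of the first two.
A = np.array([[1., 2., 3.],
              [4., 5., 9.],
              [7., 8., 15.]])

print(np.linalg.matrix_rank(A))   # rank = dim span of the columns -> 2
print(np.trace(A))                # sum of diagonal entries -> 21.0

# An orthonormal basis of the null space N(A): vectors x with A @ x = 0.
N = null_space(A)
print(np.allclose(A @ N, 0))      # True

# The range space R(A) is spanned by any maximal set of independent columns.
```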
Banach and Hilbert Space
An inner product space is defined as a vector space that is equipped with an inner product. A normed space is a vector space on which a norm is defined. An inner product space is always a normed space, since we can define a norm as $\|\boldsymbol{f}\|=\sqrt{\langle\boldsymbol{f}, \boldsymbol{f}\rangle}$, which is often called the induced norm. Among the various forms of normed spaces, one of the most useful is the Banach space.

Definition 1.7 The Banach space is a complete normed space.

Here, the “completeness” is especially important from the optimization perspective, since most optimization algorithms are implemented in an iterative manner, so that the final solution of the iterative method should belong to the underlying space $\mathcal{H}$. Recall that convergence is a property of a metric space. Therefore, the Banach space can be regarded as a vector space equipped with desirable properties of a metric space. Similarly, we can define the Hilbert space.

Definition 1.8 The Hilbert space is a complete inner product space.

We can easily see that the Hilbert space is also a Banach space thanks to the induced norm. The inclusion relationship between vector spaces, normed spaces, inner product spaces, Banach spaces and Hilbert spaces is illustrated in Fig. 1.1. As shown in Fig. 1.1, the Hilbert space has many nice mathematical structures such as inner product, norm, completeness, etc., so it is widely used in the machine learning literature. The following are well-known examples of Hilbert spaces:
$l^2(\mathbb{Z})$ : a function space composed of square summable discrete-time signals, i.e. $$ l^2(\mathbb{Z})=\left\{x=\left\{x_l\right\}_{l=-\infty}^{\infty} \;\Big|\; \sum_{l=-\infty}^{\infty}\left|x_l\right|^2<\infty\right\} . $$
Basis and Frames
The set of vectors $\left\{\boldsymbol{x}_1, \cdots, \boldsymbol{x}_k\right\}$ is said to be linearly independent if a linear combination $$ \alpha_1 \boldsymbol{x}_1+\alpha_2 \boldsymbol{x}_2+\cdots+\alpha_k \boldsymbol{x}_k=\mathbf{0} $$ implies that $$ \alpha_i=0, \quad i=1, \cdots, k . $$ The set of all vectors reachable by taking linear combinations of vectors in a set $\mathcal{S}$ is called the span of $\mathcal{S}$. For example, if $\mathcal{S}=\left\{\boldsymbol{x}_i\right\}_{i=1}^k$, then we have $$ \operatorname{span}(\mathcal{S})=\left\{\sum_{i=1}^k \alpha_i \boldsymbol{x}_i, \ \forall \alpha_i \in \mathbb{R}\right\} . $$ A set $\mathcal{B}=\left\{\boldsymbol{b}_i\right\}_{i=1}^m$ of elements (vectors) in a vector space $\mathcal{V}$ is called a basis if every element of $\mathcal{V}$ may be written in a unique way as a linear combination of elements of $\mathcal{B}$, that is, for all $\boldsymbol{f} \in \mathcal{V}$, there exist unique coefficients $\left\{c_i\right\}$ such that $$ \boldsymbol{f}=\sum_{i=1}^m c_i \boldsymbol{b}_i . $$ A set $\mathcal{B}$ is a basis of $\mathcal{V}$ if and only if the elements of $\mathcal{B}$ are linearly independent and $\operatorname{span}(\mathcal{B})=\mathcal{V}$. The coefficients of this linear combination are referred to as expansion coefficients, or coordinates on $\mathcal{B}$, of the vector. The elements of a basis are called basis vectors. In general, for an $m$-dimensional space, the number of basis vectors is $m$. For example, when $\mathcal{V}=\mathbb{R}^2$, the following two sets are examples of a basis: $$ \left\{\left[\begin{array}{l} 1 \\ 0 \end{array}\right],\left[\begin{array}{l} 0 \\ 1 \end{array}\right]\right\}, \quad\left\{\left[\begin{array}{c} 1 \\ 1 \end{array}\right],\left[\begin{array}{c} 1 \\ -1 \end{array}\right]\right\} . $$
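As a small illustration (added here, not part of the original text), the following Python sketch computes the expansion coefficients of a vector of $\mathbb{R}^2$ on the two example bases above by solving the linear system $\boldsymbol{B}\boldsymbol{c}=\boldsymbol{f}$, where the columns of $\boldsymbol{B}$ are the basis vectors.

```python
import numpy as np

# Two bases of R^2 from the text: the standard basis and {(1,1), (1,-1)}.
B1 = np.array([[1., 0.],
               [0., 1.]])          # columns are the basis vectors
B2 = np.array([[1., 1.],
               [1., -1.]])

f = np.array([3., 1.])             # an arbitrary vector of R^2

# Expansion coefficients c solve B c = f; they are unique since B has full rank.
c1 = np.linalg.solve(B1, f)        # -> [3., 1.]
c2 = np.linalg.solve(B2, f)        # -> [2., 1.]   since f = 2*(1,1) + 1*(1,-1)

print(c1, c2)
print(np.allclose(B2 @ c2, f))     # True: f is recovered from its coordinates
```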
A metric space $(\mathcal{X}, d)$ is a set $\mathcal{X}$ together with a metric $d$ on the set. Here, a metric is a function that defines a concept of distance between any two members of the set, which is formally defined as follows.
Definition 1.1 (Metric) A metric on a set $\mathcal{X}$ is a function called the distance $d: \mathcal{X} \times \mathcal{X} \mapsto \mathbb{R}_{+}$, where $\mathbb{R}_{+}$ is the set of non-negative real numbers. For all $x, y, z \in \mathcal{X}$, this function is required to satisfy the following conditions:
$d(x, y) \geq 0$ (non-negativity).
$d(x, y)=0$ if and only if $x=y$.
$d(x, y)=d(y, x)$ (symmetry).
$d(x, z) \leq d(x, y)+d(y, z)$ (triangle inequality). A metric on a space induces topological properties like open and closed sets, which lead to the study of more abstract topological spaces. Specifically, about any point $x$ in a metric space $\mathcal{X}$, we define the open ball of radius $r>0$ about $x$ as the set $$ B_r(x)=\{y \in \mathcal{X}: d(x, y)<r\} . $$ A subset $U$ of $\mathcal{X}$ is called open if for every point $x \in U$ there exists $r>0$ such that $B_r(x)$ is contained in $U$. The complement of an open set is called closed.
A sequence $\left(x_n\right)$ in a metric space $\mathcal{X}$ is said to converge to the limit $x \in \mathcal{X}$ if and only if for every $\varepsilon>0$, there exists a natural number $N$ such that $d\left(x_n, x\right)<\varepsilon$ for all $n>N$. A subset $\mathcal{S}$ of the metric space $\mathcal{X}$ is closed if and only if every sequence in $\mathcal{S}$ that converges to a limit in $\mathcal{X}$ has its limit in $\mathcal{S}$. In addition, a sequence of elements $\left(x_n\right)$ is a Cauchy sequence if and only if for every $\varepsilon>0$, there is some $N \geq 1$ such that $$ d\left(x_n, x_m\right)<\varepsilon, \quad \forall m, n \geq N . $$ We are now ready to define the important concepts in metric spaces.
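The following Python sketch (an added illustration, using the Euclidean metric on $\mathbb{R}^5$ as a concrete example) numerically checks the metric axioms for a random triple of points and the $\varepsilon$-$N$ convergence criterion for a simple sequence.

```python
import numpy as np

rng = np.random.default_rng(0)
x, y, z = rng.standard_normal((3, 5))

def d(u, v):
    """Euclidean metric on R^5."""
    return np.linalg.norm(u - v)

# Metric axioms, checked numerically for this particular triple of points.
assert d(x, y) >= 0 and np.isclose(d(x, x), 0.0)
assert np.isclose(d(x, y), d(y, x))                  # symmetry
assert d(x, z) <= d(x, y) + d(y, z) + 1e-12          # triangle inequality

# The sequence x_n = x + z / n converges to x: for every eps > 0 there is an N
# with d(x_n, x) < eps for all n > N (here N is exhibited explicitly).
eps = 1e-2
N = int(np.ceil(np.linalg.norm(z) / eps))
assert all(d(x + z / n, x) < eps for n in range(N + 1, N + 100))
print("d(x_n, x) <", eps, "for all n >", N)
```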
Vector Space
A vector space $\mathcal{V}$ is a set that is closed under finite vector addition and scalar multiplication. In machine learning applications, the scalars are usually real or complex numbers, in which case $\mathcal{V}$ is called a vector space over the real numbers or over the complex numbers, respectively.
For example, the Euclidean $n$-space $\mathbb{R}^n$ is called a real vector space, and $\mathbb{C}^n$ is called a complex vector space. In the $n$-dimensional Euclidean space $\mathbb{R}^n$, every element is represented by a list of $n$ real numbers, addition is component-wise, and scalar multiplication is multiplication on each term separately. More specifically, we define a column $n$-real-valued vector $\boldsymbol{x}$ to be an array of $n$ real numbers, denoted by $$ \boldsymbol{x}=\left[\begin{array}{c} x_1 \\ x_2 \\ \vdots \\ x_n \end{array}\right]=\left[\begin{array}{llll} x_1 & x_2 & \cdots & x_n \end{array}\right]^{\top} \in \mathbb{R}^n, $$
where the superscript ${ }^{\top}$ denotes the adjoint. Note that for a real vector, the adjoint is just a transpose. Then, the sum of the two vectors $\boldsymbol{x}$ and $\boldsymbol{y}$, denoted by $\boldsymbol{x}+\boldsymbol{y}$, is defined by $$ \boldsymbol{x}+\boldsymbol{y}=\left[\begin{array}{llll} x_1+y_1 & x_2+y_2 & \cdots & x_n+y_n \end{array}\right]^{\top} . $$ Similarly, the scalar multiplication with a scalar $\alpha \in \mathbb{R}$ is defined by $$ \alpha \boldsymbol{x}=\left[\begin{array}{llll} \alpha x_1 & \alpha x_2 & \cdots & \alpha x_n \end{array}\right]^{\top} . $$ In addition, we formally define the inner product and the norm in a vector space as follows.
Definition 1.5 (Inner Product) Let $\mathcal{V}$ be a vector space over $\mathbb{R}$. A function $\langle\cdot, \cdot\rangle_{\mathcal{V}}: \mathcal{V} \times \mathcal{V} \mapsto \mathbb{R}$ is an inner product on $\mathcal{V}$ if:
Symmetric: $\langle\boldsymbol{f}, \boldsymbol{g}\rangle_{\mathcal{V}}=\langle\boldsymbol{g}, \boldsymbol{f}\rangle_{\mathcal{V}}$ for all $\boldsymbol{f}, \boldsymbol{g} \in \mathcal{V}$.
Linear: $\left\langle\alpha_1 \boldsymbol{f}_1+\alpha_2 \boldsymbol{f}_2, \boldsymbol{g}\right\rangle_{\mathcal{V}}=\alpha_1\left\langle\boldsymbol{f}_1, \boldsymbol{g}\right\rangle_{\mathcal{V}}+\alpha_2\left\langle\boldsymbol{f}_2, \boldsymbol{g}\right\rangle_{\mathcal{V}}$ for all $\alpha_1, \alpha_2 \in \mathbb{R}$ and $\boldsymbol{f}_1, \boldsymbol{f}_2, \boldsymbol{g} \in \mathcal{V}$.
$\langle\boldsymbol{f}, \boldsymbol{f}\rangle_{\mathcal{V}} \geq 0$ and $\langle\boldsymbol{f}, \boldsymbol{f}\rangle_{\mathcal{V}}=0$ if and only if $\boldsymbol{f}=\mathbf{0}$. If the underlying vector space $\mathcal{V}$ is obvious, we usually represent the inner product without the subscript $\mathcal{V}$, i.e. $\langle\boldsymbol{f}, \boldsymbol{g}\rangle$. For example, the inner product of the two vectors $\boldsymbol{f}, \boldsymbol{g} \in \mathbb{R}^n$ is defined as $$ \langle\boldsymbol{f}, \boldsymbol{g}\rangle=\sum_{i=1}^n f_i g_i=\boldsymbol{f}^{\top} \boldsymbol{g} . $$ Two nonzero vectors $\boldsymbol{x}, \boldsymbol{y}$ are called orthogonal when $$ \langle\boldsymbol{x}, \boldsymbol{y}\rangle=0 . $$
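As a quick numerical illustration (added here), the sketch below evaluates the inner product $\boldsymbol{f}^{\top}\boldsymbol{g}$ of two vectors of $\mathbb{R}^3$, checks their orthogonality, and verifies that the induced norm $\sqrt{\langle\boldsymbol{f},\boldsymbol{f}\rangle}$ coincides with the usual Euclidean norm.

```python
import numpy as np

f = np.array([1., 2., 2.])
g = np.array([2., -1., 0.])

# Inner product on R^n and the induced norm ||f|| = sqrt(<f, f>).
inner = f @ g                      # equals f^T g = sum_i f_i g_i
norm_f = np.sqrt(f @ f)            # 3.0, same as np.linalg.norm(f)

print(inner)                       # 0.0 -> f and g are orthogonal
print(np.isclose(norm_f, np.linalg.norm(f)))   # True
```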
Suppose the prediction target is whether a ship is detained in an inspection, a binary variable with “1” indicating ship detention and “0” otherwise. Neither the simple nor the multiple linear regression model can be directly applied to this classification problem, as their output is continuous and unbounded, while we expect the output to be categorical and bounded. An intuitive method is to predict the probability of $y=1$ given the input features $\mathbf{x}$, i.e., $P(y=1 \mid \mathbf{x})$, and then apply a threshold. The unit-step function is a popular method to map a continuous output (denoted by $z$) to a probability (denoted by $\widetilde{y}$), which takes the form shown in Figure 5.1.
However, Figure 5.1 shows that the final output given by the unit-step function is discontinuous, which makes it hard to optimize. Therefore, a continuous, monotonic, and differentiable surrogate of the unit-step function called the logistic function is used, which takes the following form: $$ \widetilde{y}=\frac{1}{1+e^{-z}}, $$ where $z=\tilde{\boldsymbol{x}} \tilde{\boldsymbol{w}}$ is the continuous output given by a multiple linear regression model. An illustration of the logistic function is shown in Figure 5.2. Equation (5.12) can also be transformed as follows:
$$ \begin{aligned} \widetilde{y} & =\frac{1}{1+e^{-z}} \\ \Rightarrow z & =\tilde{\boldsymbol{x}} \tilde{\boldsymbol{w}}=\ln \frac{\widetilde{y}}{1-\widetilde{y}} . \end{aligned} $$ In Equation (5.13), $\widetilde{y}$ is the probability that a sample with features $\mathbf{x}$ is of class “1” and $1-\widetilde{y}$ is the probability that it is of class “0”. Therefore, $\frac{\widetilde{y}}{1-\widetilde{y}}$ is the relative probability of sample $\mathbf{x}$ being of class “1”, which is called the odds. $\ln \frac{\widetilde{y}}{1-\widetilde{y}}$ is the natural log of the odds, called the log odds, or logit. Therefore, Equation (5.13) can be interpreted as using the output of a multiple linear regression model to approximate the log odds, so as to map a continuous target to a probability.
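The logistic–logit relation in Equations (5.12)–(5.13) can be verified numerically; the following short sketch (added here for illustration, with arbitrary values of $z$ standing in for the linear-model output) maps linear outputs to probabilities and recovers them through the log odds.

```python
import numpy as np

def logistic(z):
    """Map a continuous output z to a probability in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def logit(p):
    """Log odds: the inverse of the logistic function."""
    return np.log(p / (1.0 - p))

z = np.array([-2.0, 0.0, 3.0])     # outputs of a linear model x_tilde @ w_tilde
p = logistic(z)                    # predicted P(y = 1 | x)

print(p)                           # approx [0.119, 0.5, 0.953]
print(np.allclose(logit(p), z))    # True: the logit recovers the linear output
```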
Ridge regression
Ridge regression imposes a penalty on the size of the regression coefficients using L2 regularization, where the loss function takes the following form: $$ l=\sum_{i=1}^n\left(y_i-b-\sum_{j=1}^m x_{i j} w_j\right)^2+\lambda \sum_{j=1}^m w_j^2, \text { where } \lambda>0 . $$ $\lambda$ is a complexity parameter that controls the degree of shrinkage: a larger $\lambda$ means a greater amount of shrinkage. The objective of ridge regression is to find the optimal $\mathbf{w}^*$ such that $$ \mathbf{w}^*=\arg \min _{\mathbf{w}}\left\{\sum_{i=1}^n\left(y_i-b-\sum_{j=1}^m x_{i j} w_j\right)^2+\lambda \sum_{j=1}^m w_j^2\right\} . $$ This is equivalent to solving the following optimization problem: $$ \begin{array}{r} \mathbf{w}^*=\arg \min _{\mathbf{w}}\left\{\sum_{i=1}^n\left(y_i-b-\sum_{j=1}^m x_{i j} w_j\right)^2\right\}, \\ \text { s.t. } \sum_{j=1}^m w_j^2 \leq t, \end{array} $$ where there is a one-to-one relationship between $\lambda$ and $t$, and the size constraint on the parameters (i.e., the constraint on parameter values) is imposed explicitly in Equation (5.18). Ridge regression effectively alleviates the high variance brought about by correlated variables in multiple linear regression by shrinking coefficients close to (but not exactly to) zero. Note that the bias $b$, which is not directly related to the feature coefficients, is excluded from the penalty term, as the penalty aims to regularize only the coefficients; its value is still determined in Equation (5.18). An example of using ridge regression to predict the ship deficiency number using the features of Example 5.2 based on the scikit-learn API is as follows. Example 5.5: Min-max scaling is first applied to the numerical features age, GT, last inspection time, and last deficiency number. Ridge regression with hyperparameter tuning for $\lambda$ based on 5-fold cross-validation can easily be implemented by the RidgeCV method provided by the scikit-learn API.
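A possible realization of Example 5.5 is sketched below. It is an illustrative reconstruction rather than the book's original code: the inspection dataset is replaced by synthetic data whose four columns stand in for the features age, GT, last inspection time, and last deficiency number, and the grid of candidate $\lambda$ values is an arbitrary choice.

```python
import numpy as np
from sklearn.linear_model import RidgeCV
from sklearn.preprocessing import MinMaxScaler
from sklearn.pipeline import make_pipeline

# Synthetic stand-in for the inspection data: columns play the role of the
# numerical features [age, GT, last inspection time, last deficiency number].
rng = np.random.default_rng(42)
X = rng.uniform(size=(200, 4)) * [30.0, 150000.0, 36.0, 20.0]
y = 0.3 * X[:, 0] + 1e-5 * X[:, 1] + 0.1 * X[:, 3] + rng.normal(scale=1.0, size=200)

# Min-max scaling followed by ridge regression; RidgeCV selects the penalty
# (called alpha in scikit-learn, lambda in the text) by 5-fold cross-validation.
model = make_pipeline(
    MinMaxScaler(),
    RidgeCV(alphas=np.logspace(-3, 3, 13), cv=5),
)
model.fit(X, y)

print("selected lambda:", model[-1].alpha_)
print("coefficients:", model[-1].coef_, "bias:", model[-1].intercept_)
```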
The background and development of PSC
PSC was then developed, aiming to inspect foreign visiting ships in national ports to verify that “the condition of the ship and its equipment comply with the requirements of international regulations and that the ship is manned and operated in compliance with these rules,” as mentioned by the IMO [7]. During an inspection, a condition onboard that does not comply with the requirements of the relevant convention is called a deficiency. The number and nature of the deficiencies found onboard determine the corresponding action taken by the PSC officer(s) (PSCO[s]). Common actions include rectifying a deficiency at the next port, within 14 days, or before departure, as well as ship detention. In particular, ship detention is an intervention action taken by the port state that prevents a severely substandard ship from proceeding to sea until it no longer presents a danger to the ship or persons onboard or to the marine environment.
PSC inspection is carried out on a regional level. The Memorandum of Understanding (MoU) on PSC was first signed in 1982 by 14 European countries; it is called the Paris MoU and marks the establishment of PSC. Since then, the number of member states of the Paris MoU has constantly increased, and it contains 27 participating maritime administrations covering the waters of the European coastal states and the North Atlantic basin as of January 2022. Another large regional MoU is in the Far East, responsible for the Asia Pacific region; it is called the Tokyo MoU and was signed in 1993. It now contains 22 member states. In addition, there are another seven MoUs on PSC, namely Acuerdo de Viña del Mar (Latin America), the Caribbean MoU (the Caribbean), the Abuja MoU (West and Central Africa), the Black Sea MoU (the Black Sea region), the Mediterranean MoU (the Mediterranean), the Indian Ocean MoU (the Indian Ocean), and the Riyadh MoU. The main objectives of the MoUs are to construct an improved and harmonized PSC system, to strengthen cooperation and information exchange among member states, and to avoid multiple inspections within a short period. Apart from the nine regional MoUs, the United States Coast Guard maintains the tenth PSC regime.
Simple linear regression and the least squares
Simple linear regression uses only one feature to predict the target. For example, we use ship age to predict the number of deficiencies found in a PSC inspection. Denote the training set with $n$ samples by $D=\left\{\left(x_1, y_1\right),\left(x_2, y_2\right), \ldots,\left(x_n, y_n\right)\right\}$ and the feature vector by $x$. Simple linear regression aims to develop a model taking the following form: $$ \hat{y}_i=w x_i+b, $$ where $\hat{y}_i$ is the predicted target for sample $i$, $w$ is the weight parameter, and $b$ is the bias. $w$ and $b$ need to be learned from $D$. Then, a natural question is: what are good $w$ and $b$? Or in other words, how can we find the values of $w$ and $b$ such that the predicted target is as accurate as possible? The key point of developing a simple linear regression model is to evaluate the difference between $\hat{y}_i$ and $y_i$, $i=1, \ldots, n$, using a loss function, and to adopt the values of $w$ and $b$ that minimize the loss function. In a regression problem, the most commonly used loss function is the mean squared error (MSE), where $MSE=\frac{1}{n} \sum_{i=1}^n\left(y_i-\hat{y}_i\right)^2$. Therefore, the learning objective of simple linear regression is to find the optimal $\left(w^*, b^*\right)$ such that the MSE is minimized. The above idea can be expressed by the following mathematical functions:
$$ \begin{aligned} \left(w^*, b^*\right) & =\underset{(w, b)}{\arg \min } \sum_{i=1}^n\left(y_i-\hat{y}_i\right)^2 \\ & =\underset{(w, b)}{\arg \min } \sum_{i=1}^n\left(y_i-w x_i-b\right)^2 . \end{aligned} $$ This idea is called the least squares method. The intuition behind it is to minimize the sum of squared lengths of the vertical lines between all the samples and the regression line determined by $w$ and $b$. It can easily be shown that the MSE is convex in $w$ and $b$, and thus $\left(w^*, b^*\right)$ can be found by $$ \begin{aligned} \frac{\partial M S E}{\partial w} & =2\left(\sum_{i=1}^n x_i\left[w x_i-\left(y_i-b\right)\right]\right)=0 \\ \Rightarrow w^* & =\frac{\sum_{i=1}^n y_i\left(x_i-\frac{1}{n} \sum_{i=1}^n x_i\right)}{\sum_{i=1}^n x_i^2-\frac{1}{n}\left(\sum_{i=1}^n x_i\right)^2} . \end{aligned} $$ The optimal $w^*$ is first found by Equation (5.2), and then it can be used to calculate the optimal value of $b$, denoted by $b^*$, as follows: $$ \begin{aligned} & \frac{\partial M S E}{\partial b}=2 \sum_{i=1}^n\left(w^* x_i+b-y_i\right)=0 \\ & \Rightarrow b^*=\frac{1}{n} \sum_{i=1}^n\left(y_i-w^* x_i\right) . \end{aligned} $$ Simple linear regression can easily be realized by the scikit-learn API [1] in Python. Here is an example of using ship age to predict the ship deficiency number using simple linear regression.
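A possible form of this example is sketched below. It is an illustrative reconstruction, not the book's original code: ship ages and deficiency numbers are generated synthetically, the closed-form solution of Equations (5.2)–(5.3) is computed directly, and scikit-learn's LinearRegression is used as a cross-check.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Synthetic stand-in: ship age (years) vs. number of deficiencies.
rng = np.random.default_rng(0)
x = rng.uniform(0, 30, size=100)                      # ship age
y = 0.4 * x + 2.0 + rng.normal(scale=1.5, size=100)   # deficiency number

# Closed-form least-squares solution of Equations (5.2)-(5.3).
w = (np.sum(y * (x - x.mean()))
     / (np.sum(x ** 2) - (np.sum(x) ** 2) / len(x)))
b = np.mean(y - w * x)

# scikit-learn gives the same estimates.
reg = LinearRegression().fit(x.reshape(-1, 1), y)
print(w, b)
print(reg.coef_[0], reg.intercept_)                   # matches (w, b)
```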
A majority of the cargoes in supermarkets, such as fruits and vegetables, kitchen appliances, furniture, garments, meats, fish, dairy products, and toys, are transported in containers by ship. Container volumes are usually expressed in terms of TEUs (twenty-foot equivalent units), a TEU being a box that is 20 feet $(6.1 \mathrm{~m})$ long. Throughout this book, unless otherwise specified, we use “TEU” to express “the number of containers” or “the volume of containers.”
Containers are transported by ship on liner services, which are similar to bus services. Figure 1.1 shows the Central China 2 (CC2) service operated by Orient Overseas Container Line (OOCL), a Hong Kong-based shipping company. We call it a service, a route, or a service route. A route is a loop, and the port rotation of a route is the sequence of ports of call on the route. Any port of call can be defined as the first port of call. For example, if we define Ningbo as the first port of call, then Shanghai is the second port of call, and Los Angeles is the third port of call. We can therefore represent the port rotation of the route as follows: Ningbo (1) $\rightarrow$ Shanghai (2) $\rightarrow$ Los Angeles (3) $\rightarrow$ Ningbo (1).
Note that on a route, different ports of call may be the same physical port. For example, the Central China 1 (CC1) service of OOCL has the port rotation shown in Figure 1.2.
Both the second and the seventh ports of call are Kwangyang, and both the third and the sixth ports of call are Pusan.
A leg is the voyage from one port of call to the next. Leg $i$ is the voyage from the $i$th port of call to port of call $i+1$. The last leg is the voyage from the last port of call to the first port of call. On CC1, the second leg is the voyage from Kwangyang (the second) to Pusan (the third), and the seventh leg is the voyage from Kwangyang (the seventh) to Shanghai (the first).
The rotation time of a route is the time required for a ship to start from the first port of call, visit all ports of call on the route, and return to the first port of call. As can be read from Figures 1.1 and 1.2, the rotation time of CC2 is 35 days, and the rotation time of CC1 is 42 days. Each route provides a weekly frequency, which means that each port of call is visited on the same day every week. Therefore, a string of five ships is deployed on CC2, and the headway between two adjacent ships is 7 days. These five ships usually have the same TEU capacity and other characteristics. Unless otherwise specified, we assume weekly frequencies for all routes.
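The fleet sizes quoted above follow directly from the weekly frequency: with a 7-day headway, the number of ships required on a route equals its rotation time divided by 7, so that
$$ \text{ships on CC2}=\frac{35 \text{ days}}{7 \text{ days}}=5, \qquad \text{ships on CC1}=\frac{42 \text{ days}}{7 \text{ days}}=6 . $$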
Key issues in maritime transport
Maritime transport is a highly globalized industry in terms of operation and management. For ship operation, ocean-going vessels sail on the high seas from the origin port in one country/region to the destination port in another country/region. For ship management, the parties responsible for ship ownership, crewing, and operating may be located in different countries and regions. Even the country of registration, i.e., the ship's flag state, may not have a direct link to a ship's activities, as the ship may not frequently visit the ports belonging to its flag state. For landlocked countries such as Mongolia, the ships registered under them never visit their ports. Such a complex and disintegrated nature of the shipping industry makes it hard to control and regulate international shipping activities, and thus poses a danger to maritime safety, the marine environment, and the crew and cargoes carried by ocean-going vessels. Shipping is one of the world's most dangerous industries due to the complex and ever-changing environment at sea, the dangerous goods carried, and the difficulties in search and rescue. Safety at sea is always given the highest priority in ship operation and management. It is widely believed that the most effective and efficient way of improving safety at sea is to develop international regulations that should be followed by all shipping nations [1]. A unified and permanent international body for regulation and supervision had been hoped for by several nations from the mid-19th century onward, and these hopes came true when the International Maritime Organization (IMO, whose original name was the Inter-Governmental Maritime Consultative Organization) was established at an international conference held in Geneva in 1948. Through the hard efforts of all parties, the members of the IMO met for the first time in 1959, one year after the IMO convention came into force. The IMO's task was to adopt a new version of the most important convention on maritime safety, i.e., the International Convention for the Safety of Life at Sea, which specifies minimum safety standards for ship construction, equipment, and operation. It covers comprehensive aspects of shipping safety, including vessel construction, fire safety, life-saving arrangements, radio communications, navigation safety, cargo carriage, the transport of dangerous goods, the mandatory application of the International Safety Management (ISM) Code, verification of compliance, and measures for specific ships, and it is constantly amended [2]. The Maritime Safety Committee is responsible for every aspect of maritime safety and security, and it is the highest technical body of the IMO.
Intuition and Main Results
Consider first the training error $E_{\text {train }}$ defined in (5.3). Since $$ \operatorname{tr} \mathbf{Y} \mathbf{Q}^2(\gamma) \mathbf{Y}^{\top}=-\frac{\partial}{\partial \gamma} \operatorname{tr} \mathbf{Y} \mathbf{Q}(\gamma) \mathbf{Y}^{\top}, $$ a deterministic equivalent for the resolvent $\mathbf{Q}(\gamma)$ is sufficient to access the asymptotic behavior of $E_{\text {train }}$. With a linear activation $\sigma(t)=t$, the resolvent of interest $$ \mathbf{Q}(\gamma)=\left(\frac{1}{n} \sigma(\mathbf{W X})^{\top} \sigma(\mathbf{W} \mathbf{X})+\gamma \mathbf{I}_n\right)^{-1} $$ is the same as in Theorem 2.6. In a sense, the evaluation of $\mathbf{Q}(\gamma)$ (and subsequently $E_{\text {train }}$) calls for an extension of Theorem 2.6 to handle the case of nonlinear activations. Recall now that the main ingredients used to derive a deterministic equivalent for (the linear case) $\mathbf{Q}=\left(\mathbf{X}^{\top} \mathbf{W}^{\top} \mathbf{W} \mathbf{X} / n+\gamma \mathbf{I}_n\right)^{-1}$ are that (i) $\mathbf{X}^{\top} \mathbf{W}^{\top}$ has i.i.d. columns and (ii) its $i$th column $\mathbf{X}^{\top}\left[\mathbf{W}^{\top}\right]_{\cdot i}$ has i.i.d. (or linearly dependent) entries, so that the key Lemma 2.11 applies. These hold, in the linear case, due to the i.i.d. property of the entries of $\mathbf{W}$. However, while for Item (i) the nonlinear $\Sigma^{\top}=\sigma(\mathbf{W X})^{\top}$ still has i.i.d. columns, for Item (ii) its $i$th column $\sigma\left(\left[\mathbf{X}^{\top} \mathbf{W}^{\top}\right]_{\cdot i}\right)$ no longer has i.i.d. or linearly dependent entries. Therefore, the main technical difficulty here is to obtain a nonlinear version of the trace lemma, Lemma 2.11. That is, we expect that the concentration of quadratic forms around their expectation remains valid despite the application of the entry-wise nonlinearity $\sigma$. This naturally falls within the concentration of measure theory discussed in Section 2.7 and is given by the following lemma.
Lemma 5.1 (Concentration of nonlinear quadratic form, Louart et al. [2018, Lemma 1]). For $\mathbf{w} \sim \mathcal{N}\left(\mathbf{0}, \mathbf{I}_p\right)$, 1-Lipschitz $\sigma(\cdot)$, and $\mathbf{A} \in \mathbb{R}^{n \times n}, \mathbf{X} \in \mathbb{R}^{p \times n}$ such that $\|\mathbf{A}\| \leq 1$ and $\|\mathbf{X}\|$ bounded with respect to $p, n$, then, $$ \mathbb{P}\left(\left|\frac{1}{n} \sigma\left(\mathbf{w}^{\top} \mathbf{X}\right) \mathbf{A} \sigma\left(\mathbf{X}^{\top} \mathbf{w}\right)-\frac{1}{n} \operatorname{tr} \mathbf{A} \mathbf{K}\right|>t\right) \leq C e^{-c n \min \left(t, t^2\right)} $$ for some $C, c>0$, $p / n \in(0, \infty)$, with $$ \mathbf{K} \equiv \mathbf{K}_{\mathbf{X X}} \equiv \mathbb{E}_{\mathbf{w} \sim \mathcal{N}\left(\mathbf{0}, \mathbf{I}_p\right)}\left[\sigma\left(\mathbf{X}^{\top} \mathbf{w}\right) \sigma\left(\mathbf{w}^{\top} \mathbf{X}\right)\right] \in \mathbb{R}^{n \times n} . $$
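Lemma 5.1 can be illustrated numerically. The following Monte Carlo sketch (added here, with ReLU as the 1-Lipschitz activation and $\mathbf{A}=\mathbf{I}_n$; the matrix $\mathbf{K}$ is itself estimated by averaging over many independent draws of $\mathbf{w}$, so all quantities are approximate) shows that the quadratic form $\frac{1}{n}\sigma(\mathbf{w}^{\top}\mathbf{X})\mathbf{A}\sigma(\mathbf{X}^{\top}\mathbf{w})$ fluctuates little around $\frac{1}{n}\operatorname{tr}\mathbf{A}\mathbf{K}$.

```python
import numpy as np

rng = np.random.default_rng(0)
p, n = 256, 512
relu = lambda t: np.maximum(t, 0.0)            # a 1-Lipschitz activation

X = rng.standard_normal((p, n)) / np.sqrt(p)   # data matrix with bounded norm
A = np.eye(n)                                  # ||A|| <= 1

# Estimate K = E_w[ sigma(X^T w) sigma(w^T X) ] by Monte Carlo over w ~ N(0, I_p).
W = rng.standard_normal((10000, p))
S = relu(W @ X)                                # each row is sigma(w^T X)
K = (S.T @ S) / W.shape[0]
target = np.trace(A @ K) / n

# The quadratic form (1/n) sigma(w^T X) A sigma(X^T w), over fresh rows of S,
# concentrates around the deterministic target.
S_sub = S[:2000]
quad = np.sum((S_sub @ A) * S_sub, axis=1) / n
print("target:", target)
print("mean:", quad.mean(), "std:", quad.std())   # std is small relative to the mean
```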
Consequences for Learning with Large Neural Networks
To validate the asymptotic analysis in Theorem 5.1 and Corollary 5.1 on real-world data, Figures 5.2 and 5.3 compare the empirical MSEs with their limiting behavior predicted in Corollary 5.1, for a random network of $N=512$ neurons and various types of Lipschitz and non-Lipschitz activations $\sigma(\cdot)$, respectively. The regressor $\boldsymbol{\beta} \in \mathbb{R}^p$ maps the vectorized images from the Fashion-MNIST dataset (classes 1 and 2) [Xiao et al., 2017] to their corresponding uni-dimensional ($d=1$) output labels $\mathbf{Y}_{1 i}, \hat{\mathbf{Y}}_{1 j} \in \{\pm 1\}$. For $n, p, N$ of order a few hundred (so not very large when compared to typical modern neural network dimensions), a close match between theory and practice is observed for the Lipschitz activations in Figure 5.2. The agreement is less accurate but still quite good for the non-Lipschitz activations in Figure 5.3, which, we recall, are formally not supported by the theorem statement – here for $\sigma(t)=1-t^2 / 2$, $\sigma(t)=1_{t>0}$, and $\sigma(t)=\operatorname{sign}(t)$. For all activations, the deviation from theory is more acute for small values of the regularization $\gamma$.
Figures 5.2 and 5.3 confirm that while the training error is a monotonically increasing function of the regularization parameter $\gamma$, there always exists an optimal value of $\gamma$ which minimizes the test error. In particular, the theoretical formulas derived in Corollary 5.1 allow for a (data-dependent) fast offline tuning of the hyperparameter $\gamma$ of the network, in the setting where $n, p, N$ are not too small and comparable. In terms of activation functions (those listed here), we observe that, on the Fashion-MNIST dataset, the ReLU nonlinearity $\sigma(t)=\max (t, 0)$ is optimal and achieves the minimum test error, while the quadratic activation $\sigma(t)=1-t^2 / 2$ is the worst and produces much higher training and test errors compared to the others. This observation will be theoretically explained through a deeper analysis of the corresponding kernel matrix $\mathbf{K}$, as performed in Section 5.1.2. Lastly, although not immediate at first sight, the training and test error curves of $\sigma(t)=1_{t>0}$ and $\sigma(t)=\operatorname{sign}(t)$ are indeed the same, up to a shift in $\gamma$, as a consequence of the fact that $\operatorname{sign}(t)=2 \cdot 1_{t>0}-1$.
Although much less popular than modern deep neural networks, neural networks with random fixed weights are simpler to analyze. Such networks have frequently arisen in the past decades as an appropriate solution to handle a possibly restricted number of training data and to reduce the computational and memory complexity and, from another viewpoint, can be seen as efficient random feature extractors. These neural networks in fact find their roots in Rosenblatt's perceptron [Rosenblatt, 1958] and have since been revisited, rediscovered, and analyzed many times in a number of works, both in their feedforward [Schmidt et al., 1992] and recurrent [Gelenbe, 1993] versions. The simplest modern versions of these random networks are the so-called extreme learning machine [Huang et al., 2012] for the feedforward case, which one may see as a mere linear regression method on nonlinear random features, and the echo state network [Jaeger, 2001] for the recurrent case. See also Scardapane and Wang [2017] for a more exhaustive overview of randomness in neural networks.
It is also to be noted that deep neural networks are initialized at random and that random operations (such as random node deletions or voluntarily not learning a large proportion of randomly initialized neural network weights, that is, random dropout) are common and efficient in neural network learning [Srivastava et al., 2014, Frankle and Carbin, 2019]. We may also point to the recent endeavor toward neural network “learning without backpropagation,” which, inspired by biological neural networks (which naturally do not operate backpropagation learning), proposes learning mechanisms with fixed random backward weights and asymmetric forward learning procedures [Lillicrap et al., 2016, Nøkland, 2016, Baldi et al., 2018, Frenkel et al., 2019, Han et al., 2019]. As such, the study of random neural network structures may be instrumental to future improved understanding and designs of advanced neural network structures.
As shall be seen subsequently, these simple models of random neural networks are to a large extent connected to kernel matrices. More specifically, the classification or regression performance at the output of these random neural networks is a functional of random matrices that fall into the wide class of kernel random matrices, yet of a slightly different form than those studied in Chapter 4. Perhaps more surprisingly, this connection still exists for deep neural networks which are (i) randomly initialized and (ii) then trained with gradient descent, via the so-called neural tangent kernel [Jacot et al., 2018], by considering the “infinitely many neurons” limit, that is, the limit where the network widths of all layers go to infinity simultaneously. This close connection between neural networks and kernels has triggered a renewed interest in the theoretical investigation of deep neural networks from various perspectives, including optimization [Du et al., 2019, Chizat et al., 2019], generalization [Allen-Zhu et al., 2019, Arora et al., 2019a, Bietti and Mairal, 2019], and learning dynamics [Lee et al., 2020, Advani et al., 2020, Liao and Couillet, 2018a]. These works shed new light on our theoretical understanding of deep neural network models and specifically demonstrate the significance of studying simple networks with random weights and their associated kernels to assess the intrinsic mechanisms of more elaborate and practical deep networks.
Regression with Random Neural Networks
Throughout this section, we consider a feedforward single-hidden-layer neural network, as illustrated in Figure 5.1 (displayed, for notational convenience, from right to left). A similar class of single-hidden-layer neural network models, however with a recurrent structure, will be discussed later in Section 5.3.
Given input data $\mathbf{X}=\left[\mathbf{x}_1, \ldots, \mathbf{x}_n\right] \in \mathbb{R}^{p \times n}$, we denote by $\Sigma \equiv \sigma(\mathbf{W} \mathbf{X}) \in \mathbb{R}^{N \times n}$ the output of the first layer comprising $N$ neurons. This output arises from the premultiplication of $\mathbf{X}$ by some random weight matrix $\mathbf{W} \in \mathbb{R}^{N \times p}$ with i.i.d. (say standard Gaussian) entries and the entry-wise application of the nonlinear activation function $\sigma: \mathbb{R} \rightarrow \mathbb{R}$. As such, the columns $\sigma\left(\mathbf{W} \mathbf{x}_i\right)$ of $\Sigma$ can be seen as random nonlinear features of $\mathbf{x}_i$. The second layer weight $\boldsymbol{\beta} \in \mathbb{R}^{N \times d}$ is then learned to adapt the feature matrix $\Sigma$ to some associated target $\mathbf{Y}=\left[\mathbf{y}_1, \ldots, \mathbf{y}_n\right] \in \mathbb{R}^{d \times n}$, for instance, by minimizing the Frobenius norm $\left\|\mathbf{Y}-\boldsymbol{\beta}^{\top} \Sigma\right\|_F^2$.
Remark 5.1 (Random neural networks, random feature maps and random kernels). The columns of $\Sigma$ may be seen as the output of the $\mathbb{R}^p \rightarrow \mathbb{R}^N$ random feature map $\phi: \mathbf{x}_i \mapsto \sigma\left(\mathbf{W} \mathbf{x}_i\right)$ for some given $\mathbf{W} \in \mathbb{R}^{N \times p}$. In Rahimi and Recht [2008], it is shown that, for every nonnegative definite “shift-invariant” kernel of the form $(\mathbf{x}, \mathbf{y}) \mapsto f\left(\|\mathbf{x}-\mathbf{y}\|^2\right)$, there exist appropriate choices for $\sigma$ and the law of the entries of $\mathbf{W}$ so that, as the number of neurons or random features $N \rightarrow \infty$, $$ \sigma\left(\mathbf{W} \mathbf{x}_i\right)^{\top} \sigma\left(\mathbf{W} \mathbf{x}_j\right) \stackrel{\text { a.s. }}{\longrightarrow} f\left(\left\|\mathbf{x}_i-\mathbf{x}_j\right\|^2\right) . $$ As such, for large enough $N$ (that in general must scale with $n, p$), the bivariate function $(\mathbf{x}, \mathbf{y}) \mapsto \sigma(\mathbf{W} \mathbf{x})^{\top} \sigma(\mathbf{W} \mathbf{y})$ approximates a kernel function of the type $f\left(\|\mathbf{x}-\mathbf{y}\|^2\right)$ studied in Chapter 4. This result is then generalized, in subsequent works, to a larger family of kernels including inner-product kernels [Kar and Karnick, 2012], additive homogeneous kernels [Vedaldi and Zisserman, 2012], etc. Another, possibly more marginal, connection with the previous sections is that $\sigma\left(\mathbf{w}^{\top} \mathbf{x}\right)$ can be interpreted as a “properly scaling” inner-product kernel function applied to the “data” pair $\mathbf{w}, \mathbf{x} \in \mathbb{R}^p$. This technically induces another strong relation between the study of kernels and that of neural networks. Again, similar to the concentration of (Euclidean) distances extensively explored in this chapter, the entry-wise convergence in (5.1) does not imply convergence in the operator norm sense, which, as we shall see, leads directly to the so-called “double descent” test curve in random feature/neural network models. If the network output weight matrix $\boldsymbol{\beta}$ is designed to minimize the regularized MSE $L(\boldsymbol{\beta})=\frac{1}{n} \sum_{i=1}^n\left\|\mathbf{y}_i-\boldsymbol{\beta}^{\top} \sigma\left(\mathbf{W} \mathbf{x}_i\right)\right\|^2+\gamma\|\boldsymbol{\beta}\|_F^2$, for some regularization parameter $\gamma>0$, then $\boldsymbol{\beta}$ takes the explicit form of a ridge regressor $$ \boldsymbol{\beta} \equiv \frac{1}{n} \Sigma\left(\frac{1}{n} \Sigma^{\top} \Sigma+\gamma \mathbf{I}_n\right)^{-1} \mathbf{Y}^{\top}, $$ which follows from differentiating $L(\boldsymbol{\beta})$ with respect to $\boldsymbol{\beta}$ to obtain $0=\gamma \boldsymbol{\beta}+\frac{1}{n} \Sigma\left(\Sigma^{\top} \boldsymbol{\beta}-\mathbf{Y}^{\top}\right)$, so that $\left(\frac{1}{n} \Sigma \Sigma^{\top}+\gamma \mathbf{I}_N\right) \boldsymbol{\beta}=\frac{1}{n} \Sigma \mathbf{Y}^{\top}$, which, along with $\left(\frac{1}{n} \Sigma \Sigma^{\top}+\gamma \mathbf{I}_N\right)^{-1} \Sigma=\Sigma\left(\frac{1}{n} \Sigma^{\top} \Sigma+\gamma \mathbf{I}_n\right)^{-1}$ for $\gamma>0$, gives the result.
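The ridge-regressor formula and the matrix identity used in its derivation can be checked numerically; the sketch below (an added illustration with arbitrary synthetic data, a ReLU activation, and hand-picked dimensions) computes $\boldsymbol{\beta}$ in both its $n\times n$ and $N\times N$ resolvent forms and verifies that they coincide.

```python
import numpy as np

rng = np.random.default_rng(0)
p, n, N, d, gamma = 64, 400, 256, 1, 1e-2

X = rng.standard_normal((p, n))                     # input data (as columns)
Y = np.sign(X[:1, :])                               # some d x n target (d = 1)

W = rng.standard_normal((N, p))                     # fixed random first layer
Sigma = np.maximum(W @ X, 0.0)                      # sigma = ReLU, N x n features

# Ridge regressor: beta = (1/n) Sigma (Sigma^T Sigma / n + gamma I_n)^{-1} Y^T.
beta = Sigma @ np.linalg.solve(Sigma.T @ Sigma / n + gamma * np.eye(n), Y.T) / n

# Equivalent "N x N" form obtained from the normal equations.
beta2 = np.linalg.solve(Sigma @ Sigma.T / n + gamma * np.eye(N), Sigma @ Y.T / n)
print(np.allclose(beta, beta2))                     # True

train_mse = np.mean((Y - beta.T @ Sigma) ** 2)
print("training MSE:", train_mse)
```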
Before the present chapter, the first part of the book was mostly concerned with the sample covariance matrix model $\mathbf{X} \mathbf{X}^{\top} / n$ (and more marginally with the Wigner model $\mathbf{X} / \sqrt{n}$ for symmetric $\mathbf{X}$), where the columns of $\mathbf{X}$ are independent and the entries of each column are independent or linearly dependent. Historically, this model and its numerous variations (with a variance profile, with right-side correlation, summed with other independent matrices of the same form, etc.) have covered most of the mathematical and applied interest of the first two decades (since the early nineties) of intense random matrix advances. The main drivers for these early developments were statistics, signal processing, and wireless communications. The present chapter leaped much further by considering random matrix models with possibly highly correlated entries, with a specific focus on kernel matrices. When (moderately) large-dimensional data are considered, the intuition and theoretical understanding of kernel matrices in the small-dimensional setting are no longer accurate, and random matrix theory provides accurate (and asymptotically exact) performance assessments along with the possibility to largely improve the performance of kernel-based machine learning methods. This, in effect, creates a small revolution in our understanding of machine learning on realistic large datasets.
A first important finding of the analysis of large-dimensional kernel statistics reported here is the ubiquitous character of the Marčenko-Pastur and semicircular laws. As a matter of fact, all random matrix models studied in this chapter, and in particular the kernel regimes $f\left(\mathbf{x}_i^{\top} \mathbf{x}_j / p\right)$ (which concentrate around $f(0)$) and $f\left(\mathbf{x}_i^{\top} \mathbf{x}_j / \sqrt{p}\right)$ (which tends to $f(\mathcal{N}(0,1))$), have a limiting eigenvalue distribution akin to a combination of the two laws. This combination may vary from case to case (compare for instance the results of Practical Lecture 3 to Theorem 4.4), but is often parametrized in such a way that the Marčenko-Pastur and semicircle laws appear as limiting cases (in the context of Practical Lecture 3, they correspond to the limiting cases of dense versus sparse kernels, and in Theorem 4.4 to the limiting cases of linear versus “purely” nonlinear kernels).
Practical Course Material
In this section, Practical Lecture 3 (which evaluates the spectral behavior of uniformly sparsified kernels), related to the present Chapter 4, is discussed, where we shall see, as for $\alpha$-$\beta$ and properly scaling kernels in Sections 4.2.4 and 4.3, that, depending on the “level of sparsity,” a combination of Marčenko-Pastur and semicircle laws is observed. Practical Lecture Material 3 (Complexity-performance trade-off in spectral clustering with sparse kernel, Zarrouk et al. [2020]). In this exercise, we study the spectrum of a “punctured” version $\mathbf{K}=\mathbf{B} \odot\left(\mathbf{X}^{\top} \mathbf{X} / p\right)$ (with the Hadamard product $[\mathbf{A} \odot \mathbf{B}]_{i j}=[\mathbf{A}]_{i j}[\mathbf{B}]_{i j}$) of the linear kernel $\mathbf{X}^{\top} \mathbf{X} / p$, with data matrix $\mathbf{X} \in \mathbb{R}^{p \times n}$ and a symmetric random mask-matrix $\mathbf{B} \in\{0,1\}^{n \times n}$ having independent entries $[\mathbf{B}]_{i j} \sim \operatorname{Bern}(\epsilon)$ for $i \neq j$ (up to symmetry) and $[\mathbf{B}]_{i i}=b \in\{0,1\}$ fixed, in the limit $p, n \rightarrow \infty$ with $p / n \rightarrow c \in(0, \infty)$. This matrix mimics the computation of only a proportion $\epsilon \in(0,1)$ of the entries of $\mathbf{X}^{\top} \mathbf{X} / p$, and its impact on spectral clustering. Letting $\mathbf{X}=\left[\mathbf{x}_1, \ldots, \mathbf{x}_n\right]$ with $\mathbf{x}_i$ independently and uniformly drawn from the following symmetric two-class Gaussian mixture $$ \mathcal{C}_1: \mathbf{x}_i \sim \mathcal{N}\left(-\boldsymbol{\mu}, \mathbf{I}_p\right), \quad \mathcal{C}_2: \mathbf{x}_i \sim \mathcal{N}\left(+\boldsymbol{\mu}, \mathbf{I}_p\right), $$ for $\boldsymbol{\mu} \in \mathbb{R}^p$ such that $\|\boldsymbol{\mu}\|=O(1)$ with respect to $n, p$, we wish to study the effect of a uniform “zeroing out” of the entries of $\mathbf{X}^{\top} \mathbf{X}$ on the presence of an isolated spike in the spectrum of $\mathbf{K}$, and thus on the spectral clustering performance.
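Before turning to the resolvent analysis, the setting can be simulated directly; the following sketch (an added illustration with arbitrary values of $p$, $n$, $\epsilon$, and $\|\boldsymbol{\mu}\|$, and diagonal $b=1$) builds the punctured kernel $\mathbf{K}=\mathbf{B}\odot(\mathbf{X}^{\top}\mathbf{X}/p)$ for the two-class mixture and inspects its largest eigenvalues and the alignment of the top eigenvector with the class labels.

```python
import numpy as np

rng = np.random.default_rng(0)
p, n, eps = 400, 800, 0.3                       # p/n -> c = 0.5, keep 30% of entries

mu = np.zeros(p); mu[0] = 2.0                   # ||mu|| = O(1)
j = np.concatenate([-np.ones(n // 2), np.ones(n // 2)])
X = mu[:, None] * j[None, :] + rng.standard_normal((p, n))   # X = M + Z

# Symmetric Bernoulli(eps) mask B with fixed diagonal b = 1.
U = rng.random((n, n)) < eps
B = np.triu(U, 1)
B = (B + B.T).astype(float)
np.fill_diagonal(B, 1.0)

K = B * (X.T @ X / p)                           # punctured linear kernel
eigvals, eigvecs = np.linalg.eigh(K)

print("largest eigenvalues:", eigvals[-3:])     # an isolated spike typically detaches here
# The top eigenvector correlates with the class labels j when the spike exists.
v = eigvecs[:, -1]
print("alignment with classes:", abs(v @ j) / np.linalg.norm(j))
```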
We will study the spectrum of $\mathbf{K}$ using Stein's lemma and the Gaussian method discussed in Section 2.2.2. Let $\mathbf{Z}=\left[\mathbf{z}_1, \ldots, \mathbf{z}_n\right] \in \mathbb{R}^{p \times n}$ for $\mathbf{z}_i=\mathbf{x}_i-(-1)^a \boldsymbol{\mu} \sim \mathcal{N}\left(\mathbf{0}, \mathbf{I}_p\right)$ with $\mathbf{x}_i \in \mathcal{C}_a$, and $\mathbf{M}=\boldsymbol{\mu} \mathbf{j}^{\top}$ with $\mathbf{j}=\left[-\mathbf{1}_{n / 2}, \mathbf{1}_{n / 2}\right]^{\top} \in \mathbb{R}^n$, so that $\mathbf{X}=\mathbf{M}+\mathbf{Z}$. First show that, for $\mathbf{Q} \equiv \mathbf{Q}(z)=\left(\mathbf{K}-z \mathbf{I}_n\right)^{-1}$, $$ \begin{aligned} \mathbf{Q}= & -\frac{1}{z} \mathbf{I}_n+\frac{1}{z}\left(\frac{\mathbf{Z}^{\top} \mathbf{Z}}{p} \odot \mathbf{B}\right) \mathbf{Q}+\frac{1}{z}\left(\frac{\mathbf{Z}^{\top} \mathbf{M}}{p} \odot \mathbf{B}\right) \mathbf{Q} \\ & +\frac{1}{z}\left(\frac{\mathbf{M}^{\top} \mathbf{Z}}{p} \odot \mathbf{B}\right) \mathbf{Q}+\frac{1}{z}\left(\frac{\mathbf{M}^{\top} \mathbf{M}}{p} \odot \mathbf{B}\right) \mathbf{Q} . \end{aligned} $$ To proceed, we need to go slightly beyond the study of these four terms.
Distance and Inner-Product Random Kernel Matrices
The most widely used kernel model in machine learning applications is the heat kernel $\mathbf{K}=\left\{\exp \left(-\left\|\mathbf{x}_i-\mathbf{x}_j\right\|^2 / 2 \sigma^2\right)\right\}_{i, j=1}^n$, for some $\sigma>0$. It is thus natural to start the large-dimensional analysis of kernel random matrices by focusing on this model. As mentioned in the previous sections, for the Gaussian mixture model above, as the dimension $p$ increases, $\sigma^2$ needs to scale as $O(p)$, say $\sigma^2=\tilde{\sigma}^2 p$ for some $\tilde{\sigma}^2=O(1)$, to avoid evaluating the exponential at increasingly large values for $p$ large. As such, the prototypical kernel of present interest is $$ \mathbf{K}=\left\{f\left(\frac{1}{p}\left\|\mathbf{x}_i-\mathbf{x}_j\right\|^2\right)\right\}_{i, j=1}^n, $$ for $f$ a sufficiently smooth function (specifically, $f(t)=\exp \left(-t / 2 \tilde{\sigma}^2\right)$ for the heat kernel). As we will see, though, it is desirable not to restrict ourselves to $f(t)=\exp \left(-t / 2 \tilde{\sigma}^2\right)$, so as to better appreciate the impact of the nonlinear kernel function $f$ on the (asymptotic) structural behavior of the kernel matrix $\mathbf{K}$.
Euclidean Random Matrices with Equal Covariances
In order to get a first picture of the large-dimensional behavior of $\mathbf{K}$, let us first expand the distance $\left\|\mathbf{x}_i-\mathbf{x}_j\right\|^2 / p$ for $\mathbf{x}_i \in \mathcal{C}_a$ and $\mathbf{x}_j \in \mathcal{C}_b$, with $i \neq j$.
For simplicity, let us assume for the moment that $\mathbf{C}_1=\cdots=\mathbf{C}_k=\mathbf{I}_p$, and recall the notation $\mathbf{x}_i=\boldsymbol{\mu}_a+\mathbf{z}_i$. We have, for $i \neq j$, that “entry-wise,” $$ \begin{aligned} \frac{1}{p}\left\|\mathbf{x}_i-\mathbf{x}_j\right\|^2= & \frac{1}{p}\left\|\boldsymbol{\mu}_a-\boldsymbol{\mu}_b\right\|^2+\frac{2}{p}\left(\boldsymbol{\mu}_a-\boldsymbol{\mu}_b\right)^{\top}\left(\mathbf{z}_i-\mathbf{z}_j\right) \\ & +\frac{1}{p}\left\|\mathbf{z}_i\right\|^2+\frac{1}{p}\left\|\mathbf{z}_j\right\|^2-\frac{2}{p} \mathbf{z}_i^{\top} \mathbf{z}_j . \end{aligned} $$ For $\left\|\mathbf{x}_i\right\|$ of order $O(\sqrt{p})$, if $\left\|\boldsymbol{\mu}_a\right\|=O(\sqrt{p})$ for all $a \in\{1, \ldots, k\}$ (which would be natural), then $\left\|\boldsymbol{\mu}_a-\boldsymbol{\mu}_b\right\|^2 / p$ is a priori of order $O(1)$ while, by the central limit theorem, $\left\|\mathbf{z}_i\right\|^2 / p=1+O\left(p^{-1 / 2}\right)$. Also, again by the central limit theorem, $\mathbf{z}_i^{\top} \mathbf{z}_j / p=O\left(p^{-1 / 2}\right)$ and $\left(\boldsymbol{\mu}_a-\boldsymbol{\mu}_b\right)^{\top}\left(\mathbf{z}_i-\mathbf{z}_j\right) / p=O\left(p^{-1 / 2}\right)$.
As a consequence, for $p$ large, the distance $\left\|\mathbf{x}_i-\mathbf{x}_j\right\|^2 / p$ is dominated by $\left\|\boldsymbol{\mu}_a-\boldsymbol{\mu}_b\right\|^2 / p+2$ and easily discriminates the classes from the pairwise observations of $\mathbf{x}_i, \mathbf{x}_j$, making the classification asymptotically trivial (without having to resort to any kernel method). It is thus of interest to consider the situations where the class distances are less significant, to understand how the choice of kernel comes into play in such a more practical scenario. To this end, we now demand that $$ \left\|\boldsymbol{\mu}_a-\boldsymbol{\mu}_b\right\|=O(1), $$ which is also the minimal distance rate that can be discriminated by a mere Bayesian inference analysis, as thoroughly discussed in Section 1.1.3. Since the kernel function $f(\cdot)$ operates only on the distances $\left\|\mathbf{x}_i-\mathbf{x}_j\right\|$, we may even request (up to centering all data by, say, the constant vector $\frac{1}{n} \sum_{a=1}^k n_a \boldsymbol{\mu}_a$) for simplicity that $\left\|\boldsymbol{\mu}_a\right\|=O(1)$ for each $a$.
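This concentration of pairwise distances is easy to observe numerically; the sketch below (an added illustration with $p=1000$, $\|\boldsymbol{\mu}_1-\boldsymbol{\mu}_2\|=2$, and only a few samples per class) shows that all normalized squared distances, within or across classes, gather around the value $2$.

```python
import numpy as np

rng = np.random.default_rng(0)
p, n = 1000, 6
mu = np.zeros(p); mu[0] = 1.0                    # ||mu_1 - mu_2|| = 2 = O(1)

# Three points per class from N(-mu, I_p) and N(+mu, I_p).
X1 = -mu[:, None] + rng.standard_normal((p, n // 2))
X2 = +mu[:, None] + rng.standard_normal((p, n // 2))
X = np.concatenate([X1, X2], axis=1)

# Pairwise squared distances divided by p.
D = np.sum((X[:, :, None] - X[:, None, :]) ** 2, axis=0) / p
print(np.round(D, 3))
# All off-diagonal entries are close to 2 (within O(1/sqrt(p))), whether the two
# points come from the same class or not: the class information is no longer
# visible "entry-wise" and must be extracted from the spectrum of the kernel.
```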