## Simulation and Monte Carlo Methods | MATH577

## Poisson Processes

The Poisson process is used to model certain kinds of arrivals or patterns. Imagine, for example, a telescope that can detect individual photons from a faraway galaxy. The photons arrive at random times $T_1, T_2, \ldots$. Let $N_t$ denote the number of arrivals in the time interval $[0, t]$, that is, $N_t=\sup \left\{k: T_k \leqslant t\right\}$. Note that the number of arrivals in an interval $I=(a, b]$ is given by $N_b-N_a$. We will also denote it by $N(a, b]$. A sample path of the arrival counting process $\left\{N_t, t \geqslant 0\right\}$ is given in Figure 1.3.

For this particular arrival process, one would assume that the number of arrivals in an interval $(a, b]$ is independent of the number of arrivals in an interval $(c, d]$ when the two intervals do not intersect. Such considerations lead to the following definition:

Definition 1.12.1 (Poisson Process) An arrival counting process $N=\left\{N_t\right\}$ is called a Poisson process with rate $\lambda>0$ if
(a) The numbers of points in nonoverlapping intervals are independent.
(b) The number of points in an interval $I$ has a Poisson distribution with mean $\lambda \times \operatorname{length}(I)$.
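The definition can be explored numerically. The following is a minimal sketch, not a definitive implementation. It assumes the standard fact, not stated in the text above, that the gaps between successive arrivals of a Poisson process with rate $\lambda$ are independent exponential random variables with rate $\lambda$. The function names `poisson_arrivals` and `count_in` and the parameter values are illustrative choices.

```python
import random


def poisson_arrivals(rate, horizon, rng):
    """Return the sorted arrival times T_1 < T_2 < ... that fall in (0, horizon].

    Assumption: interarrival times are i.i.d. exponential with the given rate.
    """
    if rate <= 0:
        raise ValueError("rate must be positive")
    times = []
    t = rng.expovariate(rate)  # first arrival time T_1
    while t <= horizon:
        times.append(t)
        t += rng.expovariate(rate)  # add the next exponential gap
    return times


def count_in(times, a, b):
    """N(a, b]: the number of arrivals in the half-open interval (a, b]."""
    return sum(1 for t in times if a < t <= b)


rng = random.Random(0)
arrivals = poisson_arrivals(rate=2.0, horizon=10.0, rng=rng)
n_total = count_in(arrivals, 0.0, 10.0)   # N_10
n_first = count_in(arrivals, 0.0, 4.0)    # N(0, 4]
n_second = count_in(arrivals, 4.0, 10.0)  # N(4, 10]
print(n_total, n_first, n_second)
```

Under these assumptions the count over $(0, 10]$ should have mean $\lambda \times 10 = 20$, and the counts over the disjoint intervals $(0, 4]$ and $(4, 10]$ add up to the total.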

## Markov Chains

Consider a Markov chain $X=\left\{X_t, t \in \mathbb{N}\right\}$ with a discrete (i.e., countable) state space $\mathscr{E}$. In this case the Markov property $(1.30)$ is
$$\mathbb{P}\left(X_{t+1}=x_{t+1} \mid X_0=x_0, \ldots, X_t=x_t\right)=\mathbb{P}\left(X_{t+1}=x_{t+1} \mid X_t=x_t\right)$$
for all $x_0, \ldots, x_{t+1} \in \mathscr{E}$ and $t \in \mathbb{N}$. We restrict ourselves to Markov chains for which the conditional probabilities
$$\mathbb{P}\left(X_{t+1}=j \mid X_t=i\right), \quad i, j \in \mathscr{E} \tag{1.32}$$
are independent of the time $t$. Such chains are called time-homogeneous. The probabilities in (1.32) are called the (one-step) transition probabilities of $X$. The distribution of $X_0$ is called the initial distribution of the Markov chain. The one-step transition probabilities and the initial distribution completely specify the distribution of $X$. Namely, we have by the product rule (1.4) and the Markov property $(1.30)$
$$\begin{aligned} &\mathbb{P}\left(X_0=x_0, \ldots, X_t=x_t\right) \\ &\quad=\mathbb{P}\left(X_0=x_0\right) \mathbb{P}\left(X_1=x_1 \mid X_0=x_0\right) \cdots \mathbb{P}\left(X_t=x_t \mid X_0=x_0, \ldots, X_{t-1}=x_{t-1}\right) \\ &\quad=\mathbb{P}\left(X_0=x_0\right) \mathbb{P}\left(X_1=x_1 \mid X_0=x_0\right) \cdots \mathbb{P}\left(X_t=x_t \mid X_{t-1}=x_{t-1}\right) . \end{aligned}$$
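The factorization above translates directly into a computation. The sketch below assumes a hypothetical two-state chain. The nested list `P` holds the one-step transition probabilities, with `P[i][j]` the probability of moving from state `i` to state `j`. The list `initial` holds the distribution of $X_0$. All numbers are made up for illustration.

```python
def path_probability(initial, P, path):
    """P(X_0 = path[0], ..., X_t = path[t]) for a time-homogeneous chain."""
    prob = initial[path[0]]           # P(X_0 = x_0)
    for i, j in zip(path, path[1:]):  # consecutive pairs (x_{s-1}, x_s)
        prob *= P[i][j]               # P(X_s = j | X_{s-1} = i)
    return prob


# Hypothetical two-state example (states 0 and 1).
initial = [0.5, 0.5]
P = [[0.9, 0.1],
     [0.4, 0.6]]

# Probability of the path 0 -> 0 -> 1 -> 1, i.e. 0.5 * 0.9 * 0.1 * 0.6.
p = path_probability(initial, P, [0, 0, 1, 1])
print(p)
```

Summing `path_probability` over all paths of a fixed length gives 1, which is a quick sanity check that `initial` and the rows of `P` are valid distributions.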
Since $\mathscr{E}$ is countable, we can arrange the one-step transition probabilities in an array. This array is called the (one-step) transition matrix of $X$. We usually denote it by $P$. For example, when $\mathscr{E}=\{0,1,2, \ldots\}$, the transition matrix $P$ has the form
$$P=\left(\begin{array}{cccc} p_{00} & p_{01} & p_{02} & \ldots \\ p_{10} & p_{11} & p_{12} & \ldots \\ p_{20} & p_{21} & p_{22} & \ldots \\ \vdots & \vdots & \vdots & \ddots \end{array}\right) .$$
Note that the elements in every row are nonnegative and sum up to unity.
Another convenient way to describe a Markov chain $X$ is through its transition graph. States are indicated by the nodes of the graph, and a strictly positive $(>0)$ transition probability $p_{i j}$ from state $i$ to $j$ is indicated by an arrow from $i$ to $j$ with weight $p_{i j}$.
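To make the two descriptions concrete, here is a minimal sketch that samples a path from a transition matrix and lists the arrows of the corresponding transition graph. The three-state matrix, the initial distribution, and the function names `simulate_chain` and `transition_graph` are hypothetical and chosen only for illustration.

```python
import random


def simulate_chain(initial, P, steps, rng):
    """Sample X_0, ..., X_steps given an initial distribution and matrix P."""
    states = list(range(len(P)))
    x = rng.choices(states, weights=initial)[0]   # draw X_0
    path = [x]
    for _ in range(steps):
        x = rng.choices(states, weights=P[x])[0]  # draw the next state from row x
        path.append(x)
    return path


def transition_graph(P):
    """Arrows (i, j, p_ij) for every strictly positive transition probability."""
    return [(i, j, p) for i, row in enumerate(P) for j, p in enumerate(row) if p > 0]


# Hypothetical three-state chain; each row sums to 1.
P = [[0.5, 0.5, 0.0],
     [0.2, 0.0, 0.8],
     [0.0, 1.0, 0.0]]
initial = [1.0, 0.0, 0.0]

rng = random.Random(1)
path = simulate_chain(initial, P, steps=20, rng=rng)
edges = transition_graph(P)
print(path)
print(edges)
```

Every consecutive pair of states in a sampled path must correspond to an arrow of the graph, since transitions with probability zero are never drawn.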

## Random Variables and Probability Distributions

Specifying a model for a random experiment via a complete description of $\Omega$ and $\mathbb{P}$ may not always be convenient or necessary.

In practice, we are only interested in certain observations (i.e., numerical measurements) in the experiment. We incorporate these into our modeling process via the introduction of random variables, usually denoted by capital letters from the last part of the alphabet (e.g., $X$, $X_1, X_2, \ldots, Y, Z$).

We toss a biased coin $n$ times, with $p$ the probability of heads. Suppose that we are interested only in the number of heads, say $X$. Note that $X$ can take any of the values in $\{0,1, \ldots, n\}$. The probability distribution of $X$ is given by the binomial formula
$$ \mathbb{P}(X=k)=\left(\begin{array}{l} n \\ k \end{array}\right) p^k(1-p)^{n-k}, \quad k=0,1, \ldots, n . $$
Namely, by Example 1.1, each elementary event $\{H T H \cdots T\}$ with exactly $k$ heads and $n-k$ tails has probability $p^k(1-p)^{n-k}$, and there are $\left(\begin{array}{l}n \\ k\end{array}\right)$ such events.

The probability distribution of a general random variable $X$, identifying such probabilities as $\mathbb{P}(X=x)$, $\mathbb{P}(a \leqslant X \leqslant b)$, and so on, is completely specified by the cumulative distribution function (cdf), defined by
$$F(x)=\mathbb{P}(X \leqslant x), \quad x \in \mathbb{R} .$$
A random variable $X$ is said to have a discrete distribution if, for some finite or countable set of values $x_1, x_2, \ldots$, $\mathbb{P}\left(X=x_i\right)>0$, $i=1,2, \ldots$, and $\sum_i \mathbb{P}\left(X=x_i\right)=1$. The function $f(x)=\mathbb{P}(X=x)$ is called the probability mass function (pmf) of $X$ (but see Remark 1.4.1).

## Joint Distributions

Often a random experiment is described by more than one random variable. The theory for multiple random variables is similar to that for a single random variable.
Let $X_1, \ldots, X_n$ be random variables describing some random experiment. We can accumulate these into a random vector $\mathbf{X}=\left(X_1, \ldots, X_n\right)$. More generally, a collection $\left\{X_t, t \in \mathscr{T}\right\}$ of random variables is called a stochastic process. The set $\mathscr{T}$ is called the parameter set or index set of the process. It may be discrete (e.g., $\mathbb{N}$ or $\{1, \ldots, 10\}$) or continuous (e.g., $\mathbb{R}_{+}=[0, \infty)$ or $[1,10]$). The set of possible values for the stochastic process is called the state space.

The joint distribution of $X_1, \ldots, X_n$ is specified by the joint cdf
$$F\left(x_1, \ldots, x_n\right)=\mathbb{P}\left(X_1 \leqslant x_1, \ldots, X_n \leqslant x_n\right) .$$
The joint pdf $f$ is given, in the discrete case, by $f\left(x_1, \ldots, x_n\right)=\mathbb{P}\left(X_1=x_1, \ldots, X_n=x_n\right)$, and in the continuous case $f$ is such that
$$\mathbb{P}(\mathbf{X} \in \mathscr{B})=\int_{\mathscr{B}} f\left(x_1, \ldots, x_n\right) \mathrm{d} x_1 \ldots \mathrm{d} x_n$$
for any (measurable) region $\mathscr{B}$ in $\mathbb{R}^n$. The marginal pdfs can be recovered from the joint pdf by integration or summation. For example, in the case of a continuous random vector $(X, Y)$ with joint pdf $f$, the pdf $f_X$ of $X$ is found as
$$f_X(x)=\int f(x, y) \mathrm{d} y .$$

Suppose that $X$ and $Y$ are both discrete or both continuous, with joint pdf $f$, and suppose that $f_X(x)>0$. Then the conditional pdf of $Y$ given $X=x$ is given by
$$f_{Y \mid X}(y \mid x)=\frac{f(x, y)}{f_X(x)} \quad \text { for all } y .$$
The corresponding conditional expectation is (in the continuous case)
$$\mathbb{E}[Y \mid X=x]=\int y f_{Y \mid X}(y \mid x) \mathrm{d} y .$$
Note that $\mathbb{E}[Y \mid X=x]$ is a function of $x$, say $h(x)$. The corresponding random variable $h(X)$ is written as $\mathbb{E}[Y \mid X]$.
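In the discrete case the integrals above become sums, which makes the definitions easy to check by hand or by machine. The following sketch uses a made-up joint pmf on $\{0,1\} \times\{0,1\}$. The table `joint` and the helper names are assumptions introduced for illustration, and exact fractions are used to avoid rounding.

```python
from fractions import Fraction

# Hypothetical joint pmf f(x, y) of a discrete random vector (X, Y).
joint = {
    (0, 0): Fraction(1, 8), (0, 1): Fraction(3, 8),
    (1, 0): Fraction(1, 4), (1, 1): Fraction(1, 4),
}


def marginal_x(joint):
    """f_X(x): sum the joint pmf over all values of y."""
    fx = {}
    for (x, _y), p in joint.items():
        fx[x] = fx.get(x, 0) + p
    return fx


def conditional_y_given_x(joint, x):
    """f_{Y|X}(y | x) = f(x, y) / f_X(x), defined when f_X(x) > 0."""
    fx = marginal_x(joint)[x]
    return {y: p / fx for (xx, y), p in joint.items() if xx == x}


def cond_expectation(joint, x):
    """E[Y | X = x]: sum of y * f_{Y|X}(y | x) over y."""
    return sum(y * p for y, p in conditional_y_given_x(joint, x).items())


f_X = marginal_x(joint)
h = {x: cond_expectation(joint, x) for x in f_X}  # h(x) = E[Y | X = x]
print(f_X)
print(h)
```

For this table, `f_X` assigns probability 1/2 to each of the two values of $X$, and `h` takes the values 3/4 at $x=0$ and 1/2 at $x=1$.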
It can be shown (see, for example, [3]) that the expectation of $\mathbb{E}[Y \mid X]$ is simply the expectation of $Y$, that is,
$$\mathbb{E}[\mathbb{E}[Y \mid X]]=\mathbb{E}[Y] .$$

## Random Experiments

The basic notion in probability theory is that of a random experiment: an experiment whose outcome cannot be determined in advance. The most fundamental example is the experiment where a fair coin is tossed a number of times. For simplicity suppose that the coin is tossed three times. The sample space, denoted $\Omega$, is the set of all possible outcomes of the experiment. In this case $\Omega$ has eight possible outcomes:
$$\Omega=\{H H H, H H T, H T H, H T T, T H H, T H T, T T H, T T T\},$$
where, for example, $HTH$ means that the first toss is heads, the second tails, and the third heads.

Subsets of the sample space are called events. For example, the event $A$ that the third toss is heads is
$$A=\{H H H, H T H, T H H, T T H\} .$$
We say that event $A$ occurs if the outcome of the experiment is one of the elements in $A$. Since events are sets, we can apply the usual set operations to them. For example, the event $A \cup B$, called the union of $A$ and $B$, is the event that $A$ or $B$ or both occur, and the event $A \cap B$, called the intersection of $A$ and $B$, is the event that $A$ and $B$ both occur. Similar notation holds for unions and intersections of more than two events. The event $A^c$, called the complement of $A$, is the event that $A$ does not occur. Two events $A$ and $B$ that have no outcomes in common, that is, their intersection is empty, are called disjoint events. The main step is to specify the probability of each event.

Definition 1.2.1 (Probability) A probability $\mathbb{P}$ is a rule that assigns a number $0 \leqslant \mathbb{P}(A) \leqslant 1$ to each event $A$, such that $\mathbb{P}(\Omega)=1$, and such that for any sequence $A_1, A_2, \ldots$ of disjoint events
$$\mathbb{P}\left(\bigcup_i A_i\right)=\sum_i \mathbb{P}\left(A_i\right) . \tag{1.1}$$

Equation (1.1) is referred to as the sum rule of probability. It states that if an event can happen in a number of different ways, but not simultaneously, the probability of that event is simply the sum of the probabilities of the comprising events.
For the fair coin toss experiment the probability of any event is easily given. Namely, because the coin is fair, each of the eight possible outcomes is equally likely, so that $\mathbb{P}(\{H H H\})=\cdots=\mathbb{P}(\{T T T\})=1 / 8$. Since any event $A$ is the union of the "elementary" events $\{H H H\}, \ldots,\{T T T\}$, the sum rule implies that
$$\mathbb{P}(A)=\frac{|A|}{|\Omega|}, \tag{1.2}$$
where $|A|$ denotes the number of outcomes in $A$ and $|\Omega|=8$. More generally, if a random experiment has finitely many and equally likely outcomes, the probability is always of the form (1.2). In that case the calculation of probabilities reduces to counting.

## Conditional Probability and Independence

How do probabilities change when we know that some event $B \subset \Omega$ has occurred? Given that the outcome lies in $B$, the event $A$ will occur if and only if $A \cap B$ occurs, and the relative chance of $A$ occurring is therefore $\mathbb{P}(A \cap B) / \mathbb{P}(B)$. This leads to the definition of the conditional probability of $A$ given $B$:
$$\mathbb{P}(A \mid B)=\frac{\mathbb{P}(A \cap B)}{\mathbb{P}(B)} . \tag{1.3}$$
For example, suppose that we toss a fair coin three times. Let $B$ be the event that the total number of heads is two. The conditional probability of the event $A$ that the first toss is heads, given that $B$ occurs, is $(2 / 8) /(3 / 8)=2 / 3$.

Rewriting (1.3) and interchanging the role of $A$ and $B$ gives the relation $\mathbb{P}(A \cap B)=\mathbb{P}(A) \mathbb{P}(B \mid A)$. This can be generalized easily to the product rule of probability, which states that for any sequence of events $A_1, A_2, \ldots, A_n$,
$$\mathbb{P}\left(A_1 \cdots A_n\right)=\mathbb{P}\left(A_1\right) \mathbb{P}\left(A_2 \mid A_1\right) \mathbb{P}\left(A_3 \mid A_1 A_2\right) \cdots \mathbb{P}\left(A_n \mid A_1 \cdots A_{n-1}\right), \tag{1.4}$$
using the abbreviation $A_1 A_2 \cdots A_k \equiv A_1 \cap A_2 \cap \cdots \cap A_k$.
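Because the three-toss experiment has only eight equally likely outcomes, the counting formula and the conditional probability example can be reproduced by enumeration. This is a small sketch. The helper names `prob` and `cond_prob` are illustrative, and events are represented as Python sets of outcome strings.

```python
from fractions import Fraction
from itertools import product

# Sample space of three tosses: 'HHH', 'HHT', ..., 'TTT'.
omega = ["".join(tosses) for tosses in product("HT", repeat=3)]


def prob(event):
    """P(A) = |A| / |Omega| for equally likely outcomes."""
    return Fraction(len(event), len(omega))


def cond_prob(a, b):
    """P(A | B) = P(A and B) / P(B); requires P(B) > 0."""
    return prob(a & b) / prob(b)


A = {w for w in omega if w[0] == "H"}        # first toss is heads
B = {w for w in omega if w.count("H") == 2}  # exactly two heads in total

print(prob(A & B), prob(B), cond_prob(A, B))  # 1/4 3/8 2/3
```

The same helpers confirm the two-event product rule, since `prob(A) * cond_prob(B, A)` equals `prob(A & B)`.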
Suppose that $B_1, B_2, \ldots, B_n$ is a partition of $\Omega$. That is, $B_1, B_2, \ldots, B_n$ are disjoint and their union is $\Omega$. Then, by the sum rule, $\mathbb{P}(A)=\sum_{i=1}^n \mathbb{P}\left(A \cap B_i\right)$ and hence, by the definition of conditional probability, we have the law of total probability:
$$\mathbb{P}(A)=\sum_{i=1}^n \mathbb{P}\left(A \mid B_i\right) \mathbb{P}\left(B_i\right) .$$
Combining this with the definition of conditional probability gives Bayes' rule:
$$\mathbb{P}\left(B_j \mid A\right)=\frac{\mathbb{P}\left(A \mid B_j\right) \mathbb{P}\left(B_j\right)}{\sum_{i=1}^n \mathbb{P}\left(A \mid B_i\right) \mathbb{P}\left(B_i\right)} .$$

Independence is of crucial importance in probability and statistics. Loosely speaking, it models the lack of information between events. Two events $A$ and $B$ are said to be independent if the knowledge that $B$ has occurred does not change the probability that $A$ occurs. That is,
$$A, B \text { independent } \Leftrightarrow \mathbb{P}(A \mid B)=\mathbb{P}(A) .$$
Since $\mathbb{P}(A \mid B)=\mathbb{P}(A \cap B) / \mathbb{P}(B)$, an alternative definition of independence is
$$A, B \text { independent } \Leftrightarrow \mathbb{P}(A \cap B)=\mathbb{P}(A) \mathbb{P}(B) .$$
This definition covers the case where $B=\emptyset$ (empty set). We can extend this definition to arbitrarily many events.
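These identities can be checked on the three-toss experiment. In the sketch below, the partition by number of heads and the events `A` and `C` are illustrative choices, and the helper names are hypothetical. Independence is tested with the product form, which stays valid when an event is empty.

```python
from fractions import Fraction
from itertools import product

omega = ["".join(tosses) for tosses in product("HT", repeat=3)]


def prob(event):
    """P(A) = |A| / |Omega| for equally likely outcomes."""
    return Fraction(len(event), len(omega))


def cond_prob(a, b):
    """P(A | B) = P(A and B) / P(B); requires P(B) > 0."""
    return prob(a & b) / prob(b)


def independent(a, b):
    """Product-form test: P(A and B) == P(A) P(B)."""
    return prob(a & b) == prob(a) * prob(b)


A = {w for w in omega if w[0] == "H"}  # first toss is heads
C = {w for w in omega if w[2] == "H"}  # third toss is heads

# Partition of omega: B_i = "exactly i heads", i = 0, 1, 2, 3.
partition = [{w for w in omega if w.count("H") == i} for i in range(4)]

# Law of total probability: P(A) = sum_i P(A | B_i) P(B_i).
total = sum(cond_prob(A, B_i) * prob(B_i) for B_i in partition)


def bayes(j):
    """Bayes' rule: P(B_j | A) from the P(A | B_i) and P(B_i)."""
    return cond_prob(A, partition[j]) * prob(partition[j]) / total


print(total, bayes(2), independent(A, C), independent(A, partition[2]))
```

Here the first and third tosses are independent, whereas knowing that exactly two heads occurred does change the probability that the first toss is heads.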

