### The tools of the trade

Several tools are key to the use of statistical inference. We’ll only be able to cover a few in this class, but you should recognize them anyway.

Randomization: concerned with balancing unobserved variables that may confound inferences of interest.
Random sampling: concerned with obtaining data that are representative of the population of interest.
Sampling models: concerned with creating a model for the sampling process; the most common is the so-called “iid” (independent and identically distributed) model.
Hypothesis testing: concerned with decision making in the presence of uncertainty.
Confidence intervals: concerned with quantifying uncertainty in estimation.
Probability models: a formal connection between the data and a population of interest. Probability models are often assumed or approximated.
Study design: the process of designing an experiment to minimize biases and variability.
Nonparametric bootstrapping: the process of using the data to create inferences with minimal probability model assumptions.
Permutation, randomization and exchangeability testing: the process of using data permutations to perform inferences.
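
The two data-driven tools at the end of this list can be sketched in a few lines. The following is a minimal illustration, not a definitive implementation: the function names `bootstrap_ci` and `permutation_pvalue`, the percentile method for the interval, the difference in means as the test statistic, and the sample values are all assumptions chosen for the example.

```python
import random
import statistics


def bootstrap_ci(data, stat=statistics.mean, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval: resample the data with replacement."""
    rng = random.Random(seed)
    n = len(data)
    # Recompute the statistic on each resample and sort the results.
    estimates = sorted(stat(rng.choices(data, k=n)) for _ in range(n_boot))
    lower = estimates[int((alpha / 2) * n_boot)]
    upper = estimates[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


def permutation_pvalue(x, y, n_perm=2000, seed=0):
    """Two-sided permutation p-value for a difference in group means."""
    rng = random.Random(seed)
    observed = abs(statistics.mean(x) - statistics.mean(y))
    pooled = list(x) + list(y)
    extreme = 0
    for _ in range(n_perm):
        # Shuffling the pooled data breaks any link between value and group.
        rng.shuffle(pooled)
        diff = abs(
            statistics.mean(pooled[: len(x)]) - statistics.mean(pooled[len(x):])
        )
        if diff >= observed:
            extreme += 1
    # Add one to numerator and denominator so the estimate is never exactly zero.
    return (extreme + 1) / (n_perm + 1)


# Made-up measurements for two hypothetical groups.
group_a = [5.1, 4.9, 6.2, 5.8, 6.0, 5.5, 5.3, 6.1]
group_b = [4.2, 4.8, 4.5, 5.0, 4.1, 4.7, 4.4, 4.9]

print("bootstrap interval for mean of group_a:", bootstrap_ci(group_a))
print("permutation p-value:", permutation_pvalue(group_a, group_b))
```

Neither function assumes a particular distribution for the data. The bootstrap treats the observed sample as a stand-in for the population, and the permutation test relies only on the group labels being exchangeable under the null hypothesis.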

We won’t spend too much time talking about this, but there are several different styles of inference. Two broad categories that get discussed a lot are the frequency style and the Bayesian style, each with its own interpretation of probability:
Frequency probability: the long-run proportion of times an event occurs in independent, identically distributed repetitions.
Frequency style inference: uses frequency interpretations of probabilities to control error rates. Answers questions like “What should I decide given my data, while controlling the long-run proportion of mistakes I make at a tolerable level?”
Bayesian probability: the probability calculus of beliefs, given that beliefs follow certain rules.
Bayesian style inference: the use of the Bayesian probability representation of beliefs to perform inference. Answers questions like “Given my subjective beliefs and the objective information from the data, what should I believe now?”
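
To make the contrast concrete, here is a small sketch that applies both styles to the same made-up coin-flip data. The specific choices are assumptions for illustration only: a Wald (normal approximation) interval stands in for frequency style inference, and a uniform Beta(1, 1) prior stands in for the analyst's beliefs in the Bayesian calculation. The function names are hypothetical.

```python
import math


def wald_interval(successes, n, z=1.96):
    """Frequency style: approximate 95% confidence interval for a proportion."""
    p_hat = successes / n
    standard_error = math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - z * standard_error, p_hat + z * standard_error


def beta_posterior(successes, n, prior_a=1.0, prior_b=1.0):
    """Bayesian style: update a Beta(prior_a, prior_b) prior with binomial data."""
    post_a = prior_a + successes
    post_b = prior_b + (n - successes)
    posterior_mean = post_a / (post_a + post_b)
    return post_a, post_b, posterior_mean


# Made-up data: 14 heads in 20 flips.
heads, flips = 14, 20

print("frequency style interval:", wald_interval(heads, flips))
print("Bayesian posterior (a, b, mean):", beta_posterior(heads, flips))
```

The interval is justified by its long-run coverage over repeated experiments, whereas the posterior describes updated beliefs about the coin given the prior and this one data set.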

Data scientists tend to fall within shades of gray of these and various other schools of inference. Furthermore, there are so many shades of gray between the styles of inference that it is hard to pin down most modern statisticians as either Bayesian or frequentist. In this class, we will primarily focus on basic sampling models, basic probability models and frequency style analyses to create standard inferences. This is the most popular style of inference by far.

Being data scientists, we will also consider some inferential strategies that rely heavily on the observed data, such as permutation testing and bootstrapping. Since probability modeling will be our starting point, our first task is to build up basic probability.

## Probability

Probability forms the foundation for almost all treatments of statistical inference. In our treatment, probability is a law that assigns numbers to the long-run occurrence of random phenomena after repeated unrelated realizations.
Before we begin discussing probability, let’s dispense with some deep philosophical questions, such as “What is randomness?” and “What is the fundamental interpretation of probability?” One could spend a lifetime studying these questions (and some have). For our purposes, randomness is any process occurring without apparent deterministic patterns. Thus we will treat many things as if they were random when, in fact, they are completely deterministic. In my field, biostatistics, we often model disease outcomes as if they were random when they are the result of many mechanistic components whose aggregate behavior appears random. Probability for us will be the long-run proportion of times some event occurs in repeated unrelated realizations. So, think of the proportion of times that you get a head when flipping a coin.
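
The coin-flipping picture can be simulated directly. The sketch below assumes a fair coin (`p_heads=0.5`) and a fixed seed for reproducibility. The function name `running_proportion_of_heads` is hypothetical, and a pseudorandom generator is itself a deterministic process that we treat as if it were random, in the spirit of the definition above.

```python
import random


def running_proportion_of_heads(n_flips, p_heads=0.5, seed=42):
    """Return the proportion of heads observed after each of n_flips flips."""
    rng = random.Random(seed)
    heads = 0
    proportions = []
    for flip_number in range(1, n_flips + 1):
        if rng.random() < p_heads:
            heads += 1
        proportions.append(heads / flip_number)
    return proportions


proportions = running_proportion_of_heads(100_000)
for n in (10, 100, 1_000, 10_000, 100_000):
    print(f"after {n:>7} flips: proportion of heads = {proportions[n - 1]:.4f}")
```

Early proportions can wander noticeably, but as the number of flips grows the running proportion settles near 0.5, which is the sense in which probability is a long-run proportion.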

For the interested student, I would recommend the books and other work of Ian Hacking to learn more about these deep philosophical issues. For us data scientists, the above definitions will work fine.

