Statistics

First recall the two definitions

Definition Suppose we have a random variable with p.d.f where is an unknown vector, we then call $\theta$ a parameter and the set of $\theta$'s possible values, denoted , is called the parameter space

Definition For a random sample , any function independent of parameter is called a statistics.

Let be a random variable, if we know p.d.f , then we know everything we want with that random variable. The problem in statistics is that we have a random point from where function $f$ is known but parameter is unknown. Statistics wants to predict . The method for predicting $\theta$ is called statistical inference. In the later text of this book, I shall refer "a random point from " simply by "a random point from " for brevity.

Two kinds of statistical inference are called estimation and hypothesis testing. Estimation means we try to guess the value of , hypothesis testing might not care about , but test the the result of some experiment against the hypothesis. The key in hypothesis testing how sure can we know for certain, that the conclusion draw from the experiment is correct.

More formally, we categorize estimation into

(1) Point estimate: given , what is the function for estimating based solely on .

(2) Interval estimate: Given , find statistics and with

Point estimation is meaningless if we don't have interval estimation, because we need to be able to argue how good is the estimation. Put in other words, after point estimation gives us , we need interval estimation to tell us how close is this compared to real . That is what interval estimation does. Interval estimation introduces the notion of confidence. Confidence is the value in (2). So interval estimation is, no matter how large is your confidence, we can tell you an interval such that the probability that lies within that interval is your confidence. This interval is called confidence interval. And of course, a good estimation has a smaller confidence interval for a given confidence.

And more formally, hypothesis testing is that given and hypothesis , find a rule to decide the acceptance of rejection of . We will have more to say about interval estimation later.

results matching ""

    No results matching ""