Sufficiency (statistics) Guide, Meaning , Facts, Information and Description
In statistics, one often considers a family of probability distributions for a random variable X (and X is often a vector whose components are scalar-valued random variables, frequently independent) parameterized by a scalar- or vector-valued parameter, which let us call θ. A quantity T(X) that depends on the (observable) random variable X but not on the (unobservable) parameter θ is called a statistic. Sir Ronald Fisher tried to make precise the intuitive idea that a statistic may capture all of the information in X that is relevant to the estimation of θ. A statistic that does that is called a sufficient statistic.
| Table of contents |
|
2 Examples 3 The Rao-Blackwell theorem |
The precise definition is this:
Mathematical definition
An equivalent test, known as the Fisher's factorization criterion, is often used instead.
If the probability density function (in the discrete case, the probability mass function) of X is f(x;θ), then T satisfies the factorization criterion if and only if functions g and h'' can be found such that
Examples
This is seen by considering the joint probability distribution:
Because the observations are independent, this can be written as
and, collecting powers of p and 1 − p gives
which satisfies the factorization criterion, with h(x) being just the identity function. Note the crucial feature: the unknown parameter (here p) interacts with the observation x only via the statistic T(x) (here the sum Σ xi).
To see this, consider the joint probability distribution:
Because the observations are independent, this can be written as
where H(x) is the Heaviside step function. This may be written as
Since the conditional distribution of X given T(X) does not depend on θ, neither does the conditional expected value of g(X) given T(X), where g is any (sufficiently well-behaved) function. Consequently that conditional expected value is actually a statistic, and so is available for use in estimation. If g(X) is any kind of estimator of θ, then typically the conditional expectation of g(X) given T(X) is a better estimator of θ ; one way of making that statement precise is called the Rao-Blackwell theorem. Sometimes one can very easily construct a very crude estimator g(X), and then evaluate that conditional expected value to get an estimator that is in various senses optimal. This is an Article on Sufficiency (statistics). Page Contains Information, Facts Details or Explanation Guide About Sufficiency (statistics) The Rao-Blackwell theorem
