Size of a test

In hypothesis testing, the size of a test is the (maximum) probability of committing a Type I error, that is, of incorrectly rejecting the null hypothesis when it is true.

Table of contents

Null hypothesis
Power function
Definition
Simple vs composite hypothesis
Size vs level of significance
Example - Size of a test for the mean
How to adjust the size of a test
More details
Keep reading the glossary

Null hypothesis

Suppose that we are conducting a test about a parameter that can take any value in a parameter space .

The null hypothesis $H_{0}$ is that belongs to a given set :

Power function

Denote by the power function of the test.

For any , the function gives us the probability of rejecting the null hypothesis when the true parameter is :

Note that depends on:

the true parameter ;
the null hypothesis $H_{0}$ that we are testing;
the statistical procedure (test statistic and critical region) used to decide whether to reject the null.

Definition

The size of the test can be defined as follows.

Definition The size of the test, denoted by , is [eq7]

Thus, we consider all the cases in which the null is true ().

For each case, we compute the probability of (incorrect) rejection.

The size is equal to the largest value we find (worst-case scenario).

Simple vs composite hypothesis

When the null hypothesis is simple, that is, the set $Theta _{R}$ contains only one parameter (denote it by $heta _{0}$ ), the above definition becomes

Thus, we have two cases:

if the null hypothesis does not specify an exact value for the parameter, but a whole set of parameters, then we need to take the maximum of the power function over that set in order to compute the size;
otherwise, if the null hypothesis specifies only one parameter, then it suffices to compute the value of the power function corresponding to that parameter.

Size vs level of significance

Many authors use the term level of significance as a synonym for size.

For many others, the level of significance is an upper bound to the size, that is, a constant that satisfies

Example - Size of a test for the mean

For example, suppose that we are testing the null hypothesis that the mean of a normal distribution is equal to $mu _{0}$ .

The variance of the distribution, denoted by , is supposed to be known.

How the test is conducted

We observe a sample of independent draws from the distribution and we compute the z-statistic [eq12] where is the sample mean: [eq13]

We select a critical value and reject the null hypothesis if

How the size is derived

It can be proved (see Hypothesis testing about the mean) that the power function of the test is [eq15] where is the cumulative distribution function of a standard normal random variable.

The size of the test is

By the symmetry of the standard normal distribution around 0, we have that

As a consequence, we can write the size of the test as

How to visualize the size

In order to better understand this result, consider that under the null hypothesis the z-statistic has a standard normal distribution, that is, a normal distribution with mean equal to and variance equal to .

If you set, for example, , then you will reject the null in two cases:

if the value of the z-statistic is less than , that is, if $Z_{n}<-1.96$ ;
if the value of the z-statistic is more than , that is, if $Z_{n}>1.96$ .

But we know that if $Z_{n}$ has a standard normal distribution, then [eq19]

Thus the size of the test is 5%:

The following plot shows the probability density function of the z-statistic.

Probability density function of the z-statistic. The size of the test is the area in the two tails of the distribution.

The black vertical segments indicate the two critical values.

When the value of the z-statistic falls in one of the two tails of the distribution (which are separated from the center of the distribution by the two critical values), the null hypothesis is incorrectly rejected.

The area under the probability density function in the two tails, colored with turquoise, is the probability of rejection, that is, the size of the test.

The area under the probability density function in the center of the distribution, colored with lavender, is the probability of acceptance.

How to adjust the size of a test

In the previous example the size of the test was 5%. What if we want to decrease the size to 1%? How can this be achieved?

In general, the size of a test can be modified by changing the critical value(s) of the test, that is, by reducing or increasing the size (and hence the probability) of the critical region (remember that the critical region is the set of values of the test statistic that lead to rejection of the null hypothesis).

In the previous example, we can decrease the size of the test by increasing the critical value . In particular, note that implies

So, if our desired size is , then we need to search for what value the cumulative distribution function of the normal distribution is equal to .

By using normal distribution tables or a computer, we find that the desired value is .

As a consequence, the new critical value is

More details

More details about the concept of size of a test can be found in the lecture entitled Hypothesis testing.

Keep reading the glossary

Previous entry: Score vector

Next entry: Standard deviation

How to cite

Please cite as:

Taboga, Marco (2021). "Size of a test", Lectures on probability theory and mathematical statistics. Kindle Direct Publishing. Online appendix. https://www.statlect.com/glossary/size-of-a-test.