# Residual Error

## Contents

# Recommended means of analysis of residual errors

## A. Testing the significance of a random error rate on one control zone

For an individual error rate (i.e. the error rate of a single control zone or the error rate of the whole sample), the statistical test is rather simple. It is actually similar to the test that is performed for the LPIS Quality Element 1b within the LPIS QA exercise.

First, the standard deviation of the error rate R must be estimated. This estimation depends on the type of sample, i.e., random or from the risk analysis and also on the group of beneficiaries that this sample represents (i.e. one control zone or the whole population of the MS).

If the error rate R is computed from a random sample, one must use the following procedure:

- - For each individual dossier in the sample, compute the difference = - R* where is the claimed area and is the area not found;
- - Compute as the sample standard deviation of the vector of ;
- - is then computed as:

where ๐ is the number of dossiers in the sample, ๐๐๐ก๐ถ is the total claimed area over the considered zone (i.e. either the control zone or the full MS territory) and ๐ is the total number of dossiers over the considered zone (again, either the control zone or the full MS territory).

If the error rate R is computed from a risk-based sample, the procedure for estimating the standard deviation of the error rate is more complex.

Once, the standard deviation is estimated, one must evaluate the upper limit of error (ULE) with:

Where is the quantile at 95% of the Student distribution with n-1 degree of freedom.

Finally, the conclusion of the test โIs the error rate R smaller than the 2% limit ?โ depends on which of the following cases is verified:

In case no chapter supplied.

## B. Comparing random error rates from different control zones

For the comparison of two error rates from independent samples (e.g. error rates from two different control zones, risk versus random samples), the preparation is the same. Only the test differs has it tries to address questions such as โIs the random error rate A larger than the random error rate B?โ or โAre both random error rates equal?โ.

The random error rates and and their respective standard deviations and must be evaluated individually as described previously and accordingly to the type of samples.

The statistic of the test will be computed as:

Then, according to the test that is considered, the conclusion of the test depends on which of the following cases is verified:

where the value m is computed as:

and are respectively the quantile at 95% and 97.5% of the Student distribution with m degree of freedom and and are the sizes of the samples A and B respectively.

## C. Comparing two error rates: paired case

For the comparison of two error rates from dependent sample (e.g. in case of reprocessed dossiers during a QC or for testing the equivalence of CwRS and classical checks), the procedure is slightly different. Instead of working on the error rates separately, one must keep the original structure of the sample, i.e. the sample is in fact constituted as pairs of observed area-not-found (e.g. a CAPI measurement and a field measurement) on the same dossier.

First, the standard deviation of the difference of the error rates = - must be estimated. This estimation depends on the type of sample, i.e., random or from the risk analysis.

If the error rate is computed from a random sample, one must use the following procedure:

- - For each individual dossier, compute the difference
- = (-) - ( - ) *
- where is the claimed area and is the area not found using the first method and is the area not found using the second method;
- - Compute as the sample standard deviation of the vector of ;
- - is then computed as:

where ๐ is the number of dossiers in the sample, ๐๐๐ก๐ถ is the total claimed area over the considered zone (i.e. either the control zone or the full MS territory) and ๐ is the total number of dossiers over the considered zone (again, either the control zone or the full MS territory).

Once, the standard deviation is estimated, one must evaluate the lower and upper limit of error (LLE and ULE) with:

where is the quantile at 97.5% of the Student distribution with n-2 degree of freedom.

Note that there are n-2 degrees of freedom and not n-1 as in the test on one isolated error rate in previous section. Also, it is a two-sided test so there are two limits: LLE and ULE.

Finally, the conclusion of the test โAre the error rates and equal ?โ depends on which of the following cases is verified:

## End of pages on analysis of residual errors

Go back to the Navigation page OTSC