Proportions Tests

x, n	numeric vectors, a table, or a matrix that specifies the counts of successes and trials, respectively. vectors: length(x) must equal length(n), the elements of n must be positive, the elements of x must be non-negative, and no element of x may exceed the corresponding element of n. Because the proportions tests are based on counts, the elements of x and n should be whole numbers; however, the storage mode of x and n is coerced to double. table: If x is a table, n (if specified) is ignored. matrix: If x is a matrix, n (if specified) is ignored. Missing values (NAs) and infinite values (Infs) are allowed, but any observation pair (x[i], n[i]) that contains at least one NA or Inf value is removed.
p	vector of probabilities of success specified by the null hypothesis. length(p) must equal length(x) and length(n), and all elements must be greater than zero and less than one. If p=NULL (the default) and there is only one group (length(x)==1), the null hypothesis tested is that the true probability of success is 0.5; however, if there is more than one group, the null hypothesis tested is that the true probability of success is the same in all groups. If p is not NULL, the null hypothesis tested is that the vector of true probabilities of success is equal to p, regardless of the number of groups. Missing values (NAs) and infinite values (Infs) are not allowed.
alternative	a character string that specifies the alternative hypothesis. Possible values are: "two.sided", "greater", "less". Note: You need to enter only enough of the character string to create a unique match for the value. In most cases, alternative is automatically set to two.sided. The values greater and less are meaningful only in two special cases. If there is one group, alternative pertains to the true probability of success in relation to its value specified under the null hypothesis (see argument p). If there are two groups and p=NULL, so that the null hypothesis tested is that the true probability of success is the same in both groups, then alternative pertains to the true probability of success in the first group in relation to that in the second.
conf.level	a number between 0 and 1 that specifies the confidence level for the returned confidence interval. conf.level is meaningful only when there is one group, or when p=NULL and there are two groups. (See the description of the alternative argument for more information.) In all other cases, conf.level is ignored.
correct	a logical value. If TRUE (the default), Yates' continuity correction is applied, but only under certain conditions: When there is only one group, the continuity correction may not exceed in magnitude the difference between the sample proportion x/n and the hypothesized true probability of success. When there are two groups, and p=NULL, then the continuity correction may not exceed in magnitude the difference between the sample proportions. The continuity correction is never applied when there are more than two groups. See the Details section for an algebraic definition of the continuity correction.
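
The two input forms described above can be sketched as follows. This is only an illustration: it assumes the function documented here is callable as prop.test with the argument names listed (as in base R's stats package), and that a matrix input has two columns holding successes and failures. The counts are made up.

```r
# Illustrative counts (not taken from the documentation).
x <- c(15, 25)   # successes in each group
n <- c(50, 50)   # trials in each group

# Vector form: successes and trials are passed separately.
res_vec <- prop.test(x, n, alternative = "two.sided", conf.level = 0.95)

# Matrix form (assumed layout: column 1 = successes, column 2 = failures).
# Any n supplied alongside a matrix would be ignored.
tab <- cbind(successes = x, failures = n - x)
res_mat <- prop.test(tab)

# Both calls describe the same data, so the test statistics should agree.
same_stat <- isTRUE(all.equal(unname(res_vec$statistic),
                              unname(res_mat$statistic)))
same_stat
```

The matrix form is convenient when the data already exist as a cross-tabulation of group by outcome.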

Details

Testing if Probabilities of Success Equal Those Specified in p

To test the null hypothesis that the true probabilities of success equal those specified in input argument p (or 0.5 if p=NULL in the case of only one group), Pearson's X-squared statistic is computed for the above table, with expected counts of successes given by n*p and expected counts of failures by n*(1-p). Under the null hypothesis, the X-squared statistic has an asymptotic chi-square distribution with length(x) degrees of freedom.
When there is only one group, X-squared coincides with the square of the Z statistic used to compare a proportion with a specified value.
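
As a rough check of the statements above, the statistic can be recomputed by hand. The sketch assumes the documented function behaves like prop.test in base R's stats package; the counts and hypothesized probabilities are invented for illustration, and the continuity correction is switched off in the one-group case so that the comparison with the squared Z statistic is exact.

```r
# One group: X-squared versus the squared Z statistic (no continuity correction).
x <- 43; n <- 100; p0 <- 0.35            # illustrative values
res1 <- prop.test(x, n, p = p0, correct = FALSE)
z <- (x / n - p0) / sqrt(p0 * (1 - p0) / n)
one_group_ok <- isTRUE(all.equal(unname(res1$statistic), z^2))

# Three groups with hypothesized probabilities: Pearson's statistic by hand.
xs <- c(20, 30, 45); ns <- c(60, 70, 80); ps <- c(0.3, 0.4, 0.5)
res3 <- prop.test(xs, ns, p = ps)
observed <- cbind(xs, ns - xs)               # successes, failures
expected <- cbind(ns * ps, ns * (1 - ps))    # n*p, n*(1-p)
x2_by_hand <- sum((observed - expected)^2 / expected)
multi_ok <- isTRUE(all.equal(unname(res3$statistic), x2_by_hand))

# Degrees of freedom equal the number of groups when p is specified.
df_ok <- unname(res3$parameter) == length(xs)

c(one_group_ok, multi_ok, df_ok)
```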

Testing if All Probabilities of Success Are the Same

To test the hypothesis that the true probability of success is the same in each of the length(x) > 1 groups (the default when p=NULL), Pearson's X-squared statistic is again used with the above table, this time with expected counts of successes estimated by n*(sum(x)/sum(n)) and expected counts of failures by n*(1-sum(x)/sum(n)). This estimates the (common) probability of success as the total number of observed successes divided by the total number of trials. Under the null hypothesis, X-squared has an asymptotic chi-square distribution with length(x)-1 degrees of freedom. It can be shown that X-squared computed this way is algebraically equivalent to X-squared for the hypothesis of independence between the row and column attributes of the table. Furthermore, when there are just two groups, the statistic coincides with the square of the Z statistic used to compare two proportions.
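
The equivalences described above can be checked numerically. The sketch below assumes the documented function matches prop.test in base R's stats package and uses chisq.test from the same package for the test of independence; the counts are illustrative only.

```r
# Three groups, p = NULL: compare with the chi-square test of independence.
x <- c(15, 25, 30); n <- c(50, 50, 60)      # illustrative counts
res <- prop.test(x, n)
chi <- chisq.test(cbind(x, n - x), correct = FALSE)
stat_ok <- isTRUE(all.equal(unname(res$statistic), unname(chi$statistic)))
df_ok <- unname(res$parameter) == length(x) - 1

# Two groups, no continuity correction: compare with the squared Z statistic
# for the difference between two proportions (pooled standard error).
xa <- c(15, 25); na <- c(50, 50)
res2 <- prop.test(xa, na, correct = FALSE)
pooled <- sum(xa) / sum(na)
z <- (xa[1] / na[1] - xa[2] / na[2]) /
  sqrt(pooled * (1 - pooled) * sum(1 / na))
z_ok <- isTRUE(all.equal(unname(res2$statistic), z^2))

c(stat_ok, df_ok, z_ok)
```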

Returns a list of class htest containing the following components:

statistic	the X-squared statistic.
parameters	the degrees of freedom of the asymptotic chi-square distribution associated with the X-squared statistic.
p.value	the asymptotic p-value for the test.
conf.int	a confidence interval, returned only in the following two cases. If there is one group, conf.int contains a confidence interval for the true probability of success. If there are two groups and input argument p=NULL, conf.int contains a confidence interval for the difference in probabilities of success between the first and second groups. In both cases, the confidence level is recorded in the attribute conf.level. In all other cases, conf.int is not returned.
estimate	a numeric vector of the sample proportions, calculated as x / n, which estimate the true probabilities of success in the corresponding groups. When there is only one group, the names attribute is p; when there are two or more groups, the names attribute is prop 1, prop 2, ....
null.value	when the null hypothesis is that the true probabilities of success equal specified values (usually input argument p), the component null.value records these specified values, and returns them along with a names attribute as described under component estimate. In all other cases, null.value is not returned.
alternative	a character string that returns the alternative hypothesis (two.sided, greater, or less) as specified in the alternative argument. If there is only one group, or when there are two groups and the argument p=NULL, alternative returns the actual value specified for the alternative argument. In all other cases, alternative returns two.sided.
method	a character string that returns the name of the method used, including whether Yates' continuity correction was applied.
data.name	a character string that contains the actual names of the input vectors x, n, and of p, if given.
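
The components listed above can be inspected directly on the returned object. This is a small sketch that assumes the documented function corresponds to prop.test in base R's stats package; the data are invented.

```r
# Two groups with p = NULL, so conf.int is returned and null.value is not.
x <- c(15, 25); n <- c(50, 50)               # illustrative counts
res <- prop.test(x, n, conf.level = 0.90)

res$statistic                      # X-squared
res$parameter                      # degrees of freedom
res$p.value                        # asymptotic p-value
res$conf.int                       # interval for prop 1 minus prop 2
attr(res$conf.int, "conf.level")   # confidence level stored as an attribute
res$estimate                       # sample proportions x / n
is.null(res$null.value)            # no specified null values in this case
res$alternative
res$method
res$data.name
```

Because the result has class htest, printing res gives the usual formatted test summary.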

Two types of null hypothesis can be tested:

If the argument p is not NULL, the null hypothesis states that the true probability of success in group i is p[i], for each value of i. The alternative hypothesis, when there are at least two groups, is that there is some group for which this relation does not hold; thus alternative is two.sided.
In the special case of one group, the null hypothesis is that the true probability of success is the specified value of p or 0.5 if p is not specified. The alternative hypothesis is that the probability of success is greater than, less than, or simply not equal to p (or 0.5), depending on the input argument alternative.
If the argument p=NULL, and there are at least two groups, the null hypothesis states that the true probability of success is the same in every group. When there are two groups, the alternative hypothesis asserts that the probability of success in the first group is greater than, less than, or simply not equal to that in the second group, depending on the value of the argument alternative. When there are more than two groups, the alternative hypothesis is that there is at least one group whose probability of success is different from the others; thus alternative is two.sided.
The number of groups, insofar as it influences the nature of the test, is determined by length(x) before removal of NAs and other special values. However, the returned component method reflects the actual number of groups containing valid data used in computations.
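
The role of the alternative argument in the cases above can be illustrated as follows. The sketch assumes the documented function behaves like prop.test in base R's stats package, and the counts are made up.

```r
# One group: is the true probability of success greater than 0.5?
res_one <- prop.test(60, 100, p = 0.5, alternative = "greater")

# Two groups, p = NULL: is the probability in group 1 less than in group 2?
res_two <- prop.test(c(15, 25), c(50, 50), alternative = "less")

# More than two groups: a one-sided request has no meaning, so the
# returned alternative is two.sided.
res_many <- prop.test(c(15, 25, 30), c(50, 50, 60), alternative = "less")

c(res_one$alternative, res_two$alternative, res_many$alternative)
```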
