Test for Correlation Between Paired Samples

x, y

numeric vectors of the same length that contain the sample values. The length of the vectors must be greater than two for the "pearson" method and greater than one for the "kendall" and "spearman" methods. Missing (NAs) and infinite values (Inf, -Infs) are ignored.

alternative

a character string that specifies the alternative hypothesis for the test of correlation between x and y. Acceptable values are:

"two.sided"	non-zero
"greater"	greater than zero
"less"	less than zero

Only enough of the string to be unique is required.

method

a character string that specifies the correlation coefficient that is used in the test statistic. Acceptable values are:

"pearson"
"kendall"
"spearman"

Only enough of the string to be unique is required.

exact

a logical flag to indicate if an exact p-value should be computed. It is used only with Kendall's tau-statistic or Spearman's rank correlation, and only when there are no tie cases.

If exact=NULL (the default), an exact test is used if the observation size is less than 50 for Kendall's tau-statistic or if less than 1291 observations for Spearman's rank correlation. Otherwise an approximation method is used.

conf.level

a numeric value in the range [0, 1] that specifies the confidence level for the returned confidence interval.

conf.level is used only in Pearson's product moment correlation coefficient and when the number of observations is greater than 3.

continuity

a logical scalar. If TRUE, a continuity correction is used for Kendall's tau-statistic or Spearman's rho-statistic when an exact p-value is not computed. Default value is FALSE.

formula

a formula to specify which variables to use in the computations. The formula follows the pattern ~ a + b to compute and test the correlation of a and b.

data

a data frame or a matrix that contains the objects named in formula. By default, this argument reads in values from the environment.

subset

a vector that specifies a subset from the data frame (data) to use in formula.

na.action

a character string that specifies how missing values (NAs) are handled. By default, an error is returned.

...

any other arguments that are appropriate for the particular call.

Details

When method="pearson" the (usual) Pearson's product moment correlation coefficient (r <- cor(x,y)) is computed, and divided by its standard error to produce a t-statistic with n-2 degrees of freedom, where n = length(x) = length(y). This statistic is given by t <- (sqrt(n-2)*r) / sqrt(1-r^2).
When method="kendall" an exact test is computed if exact=TRUE and there are no ties. Otherwise, a normal approximation based on the value of tau is computed.
When method="spearman" an exact test is computed if exact=TRUE and there are no ties. Otherwise a t-distribution approximation based on the value of the Spearman rank correlation is computed.

returns a list of class "htest", containing the following components:

statistic	the value of the test statistic, a t-statistic, or a normalized z-statistic with a names attribute.
parameter	the degrees of freedom of the null distribution that is associated with statistic when this is a t-distribution.
p.value	the p-value under the null hypothesis that the correlation between x and y is zero.
estimate	the correlation coefficient with one of the following names attributes. "tau" for Kendall's statistic "cor" for Pearson's statistic "rho" for Spearman's statistic
null.value	the hypothesized value for the correlation between x and y, always 0 with anames attribute which is one of the following. "tau" for Kendall's statistic "correlation" for Pearson's statistic "rho" for Spearman's statistic
alternative	character string that returns the alternative hypothesis ("two.sided", "greater", or "less") as specified in the alternative argument.
method	a string containing the name of the estimator used for the correlation coefficient. One of the following. "Pearson's product-moment correlation" "Kendall's rank correlation tau" "Spearman's rank correlation"
data.name	a character string containing the actual names of the x and y vectors.
conf.int	confidence intervals for the correlation between x and y. Currently, it is only computed for the Pearson's correlation coefficient and only when the number of observations is greater than 3. The confidence level is recorded in the attribute conf.level.

Description

Usage

Arguments

Details