Help for package nparMD

Type:

Package

Title:

Nonparametric Analysis of Multivariate Data in Factorial Designs

Version:

0.2.1

Depends:

R (≥ 3.1.0)

Imports:

matrixStats, matrixcalc, MASS, gtools, Formula, methods, stats

Author:

Maximilian Kiefel and Arne C. Bathke

Maintainer:

Maximilian Kiefel <physik210@hotmail.com>

Description:

Analysis of multivariate data with two-way completely randomized factorial design. The analysis is based on fully nonparametric, rank-based methods and uses test statistics based on the Dempster's ANOVA, Wilk's Lambda, Lawley-Hotelling and Bartlett-Nanda-Pillai criteria. The multivariate response is allowed to be ordinal, quantitative, binary or a mixture of the different variable types. The package offers two functions performing the analysis, one for small and the other for large sample sizes. The underlying methodology is largely described in Bathke and Harrar (2016) <doi:10.1007/978-3-319-39065-9_7> and in Munzel and Brunner (2000) <doi:10.1016/S0378-3758(99)00212-8>.

License:

GPL-2 | GPL-3

Encoding:

UTF-8

LazyData:

true

RoxygenNote:

7.1.2

NeedsCompilation:

Packaged:

2022-03-09 07:44:58 UTC; mk

Repository:

CRAN

Date/Publication:

2022-03-09 10:40:02 UTC

Nonparametric Test For Multivariate Data With Two-Way Layout Factorial Design - Large Samples

Description

Analysis of multivariate data with two-way completely randomized factorial design - version for large samples. The analysis is based on fully nonparametric, rank-based methods and uses an F-approximation for 'Dempster's ANOVA' and a chisquare-approximation for the criteria called 'Wilks Lambda', 'Lawley-Hotelling' and 'Bartlett-Nanda-Pillai'. These approximations are given by the asymtotic distribution of these statistics under true null-hypothesis. In contrast to the normal-approximated test (as used in the small sample version) it is designed for data with large samples (see details) while the number of factorial levels is allowed to be small. The multivariate response is allowed to be ordinal, quantitative, binary or a mixture of the different variable types. The test statistics are constructed using nonparametric relative effect estimators.

Usage

nparml(formula, data)

Arguments

formula

an object of class "formula" with two explanatory variables (factors), see examples.

data

an object of class "data.frame" containing the variables in the formula

Details

The data is analysed for main effects and interaction effect of the explanatory factors. In each case the null hypothesis "no effect" is testet. In order to obtain reliable results the considered data should include at least 7 observations per factor level combination. This method is only implemented for complete data sets without missing values.

Value

Returns a list of data frames providing the values of the test statistics, p-values, degrees of freedom, factor levels, and groupsize per factor level combination.

References

Kiefel M., Bathke A.C. (2020) Rank-Based Analysis of Multivariate Data in Factorial Designs and Its Implementation in R In: Nonparametric Statistics (285-294) Springer Proceedings in Mathematics & Statistics Springer International Publishing, Cham

Bathke A.C., Harrar S.W. (2016) Rank-Based Inference for Multivariate Data in Factorial Designs. In: Liu R., McKean J. (eds) Robust Rank-Based and Nonparametric Methods. Springer Proceedings in Mathematics & Statistics, vol 168. Springer, Cham

Harrar S.W., Bathke A.C. (2012) A modified two-factor multivariate analysis of variance: asymptotics and small sample approximations (and erratum). In: Annals of the Institute of Statistical Mathematics, 64(1&5):135-165&1087, 2012.

Brunner E., Dette H., Munk A. (1997) Box-Type Approximations in Nonparametric Factorial Designs In: Journal of the American Statistical Association, 92(440):1494-1502

Examples

data(pseudostudy1)
nparml(resp1|resp2|resp3~treatment*age, pseudostudy1)

Nonparametric Test For Multivariate Data With Two-Way Layout Factorial Design - Small Samples

Description

Analysis of multivariate data with two-way completely randomized factorial design - version for small samples. The analysis is based on fully nonparametric, rank-based methods and uses a N(0,1)-approximation for test statistics based on 'Dempster's ANOVA', 'Wilk's Lambda', 'Lawley-Hotelling' and 'Bartlett-Nanda-Pillai' criteria. This approximation is established by the asymptotic distribution of these four statistics under true null-hypothesis if one of the explanatory factors has a large number of levels. The multivariate response is allowed to be ordinal, quantitative, binary or a mixture of the different variable types. The test statistics are constructed using nonparametric relative effect estimators.

Usage

nparms(formula,data)

Arguments

formula

an object of class "formula" with two explanatory variables (factors), see examples.

data

an object of class "data.frame" containing the variables in the formula

Details

This method is only implemented for complete data sets without missing values. The data is analysed for main effects and interaction effect of the explanatory factors. In each case the null hypothesis "no effect" is testet. The explanatory factor that has the higher number of levels is automatically designated as factor "A". The covariance matrix estimation requires at least 4 observations (observation vectors) per factor level combination. As the estimation is very time-consuming for large groups it is performed wih a random selection of observations when a group exceeds a size of 6 observation vectors.

Value

Returns a list of data frames providing the values of the test statistics, p-values, degrees of freedom, factor levels, and groupsize per factor level combination.

References

Brunner E., Dette H., Munk A. (1997) Box-Type Approximations in Nonparametric Factorial Designs In: Journal of the American Statistical Association, 92(440):1494-1502

Examples

data(pseudostudy2)
nparms(resp1|resp2|resp3~treatment*age, pseudostudy2)

pseudostudy dataset 1

Description

Example data set in a two-way layout without effects (in terms of disribution) by factors "treatment" and "age". In constrast to the 'pseudostudy2' data set the number of factorial levels is low while the number of observations per factor level combination is large.

Usage

data(pseudostudy1)

Format

pseudostudy consists of 56 cases (rows) and 5 variables where 'treatment'('age') is considered to be the explanatory factor 'A'('B') and the variables resp1, resp2. resp3 form the response vector.

Examples

data(pseudostudy1)

pseudostudy dataset 2

Description

Example data set in a two-way layout without effects (in terms of disribution) by factors "treatment" and "age".

Usage

data(pseudostudy2)

Format

pseudostudy consists of 107 cases (rows) and 5 variables where 'treatment'('age') is considered to be the explanatory factor 'A'('B') and the variables resp1, resp2. resp3 form the response vector.

Examples

data(pseudostudy2)