Help for package DiceKriging

Title:

Kriging Methods for Computer Experiments

Version:

1.6.0

Date:

2021-02-23

Author:

Olivier Roustant, David Ginsbourger, Yves Deville. Contributors: Clement Chevalier, Yann Richet.

Maintainer:

Olivier Roustant <roustant@insa-toulouse.fr>

Description:

Estimation, validation and prediction of kriging models. Important functions : km, print.km, plot.km, predict.km.

Depends:

methods

Suggests:

rgenoud (≥ 5.8-2.0), foreach, doParallel, testthat, numDeriv

License:

GPL-2 | GPL-3

URL:

https://dicekrigingclub.github.io/www/

NeedsCompilation:

yes

Packaged:

2021-02-23 17:02:18 UTC; yves

Repository:

CRAN

Date/Publication:

2021-02-23 17:30:03 UTC

Kriging Methods for Computer Experiments

Description

Estimation, validation and prediction of kriging models.

Details

Package:	DiceKriging
Type:	Package
Version:	1.6.0
Date:	2021-02-23
License:	GPL-2 \| GPL-3

Note

A previous version of this package was conducted within the frame of the DICE (Deep Inside Computer Experiments) Consortium between ARMINES, Renault, EDF, IRSN, ONERA and TOTAL S.A. (http://dice.emse.fr/).

The authors wish to thank Laurent Carraro, Delphine Dupuy and Celine Helbert for fruitful discussions about the structure of the code, and Francois Bachoc for his participation in validation and estimation by leave-one-out. They also thank Gregory Six and Gilles Pujol for their advices on practical implementation issues, as well as the DICE members for useful feedbacks.

Package rgenoud >= 5.8-2.0 is recommended.

Important functions or methods:

`km`	Estimation (or definition) of a kriging model with unknown (known) parameters
`predict`	Prediction of the objective function at new points using a kriging model (Simple and
	Universal Kriging)
`plot`	Plot diagnostic for a kriging model (leave-one-out)
`simulate`	Simulation of kriging models

Author(s)

Olivier Roustant, David Ginsbourger, Yves Deville. Contributors: C. Chevalier, Y. Richet.

(maintainer: Olivier Roustant roustant@insa-toulouse.fr)

References

F. Bachoc (2013), Cross Validation and Maximum Likelihood estimations of hyper-parameters of Gaussian processes with model misspecification. Computational Statistics and Data Analysis, 66, 55-69. http://www.lpma.math.upmc.fr/pageperso/bachoc/publications.html

N.A.C. Cressie (1993), Statistics for spatial data, Wiley series in probability and mathematical statistics.

O. Dubrule (1983), Cross validation of Kriging in a unique neighborhood. Mathematical Geology, 15, 687-699.

D. Ginsbourger (2009), Multiples metamodeles pour l'approximation et l'optimisation de fonctions numeriques multivariables, Ph.D. thesis, Ecole Nationale Superieure des Mines de Saint-Etienne, 2009.

D. Ginsbourger, D. Dupuy, A. Badea, O. Roustant, and L. Carraro (2009), A note on the choice and the estimation of kriging models for the analysis of deterministic computer experiments, Applied Stochastic Models for Business and Industry, 25 no. 2, 115-131.

A.G. Journel and C.J. Huijbregts (1978), Mining Geostatistics, Academic Press, London.

A.G. Journel and M.E. Rossi (1989), When do we need a trend model in kriging ?, Mathematical Geology, 21 no. 7, 715-739.

D.G. Krige (1951), A statistical approach to some basic mine valuation problems on the witwatersrand, J. of the Chem., Metal. and Mining Soc. of South Africa, 52 no. 6, 119-139.

R. Li and A. Sudjianto (2005), Analysis of Computer Experiments Using Penalized Likelihood in Gaussian Kriging Models, Technometrics, 47 no. 2, 111-120.

K.V. Mardia and R.J. Marshall (1984), Maximum likelihood estimation of models for residual covariance in spatial regression, Biometrika, 71, 135-146.

J.D. Martin and T.W. Simpson (2005), Use of kriging models to approximate deterministic computer models, AIAA Journal, 43 no. 4, 853-863.

G. Matheron (1963), Principles of geostatistics, Economic Geology, 58, 1246-1266.

G. Matheron (1969), Le krigeage universel, Les Cahiers du Centre de Morphologie Mathematique de Fontainebleau, 1.

W.R. Mebane, Jr., J.S. Sekhon (2011). Genetic Optimization Using Derivatives: The rgenoud Package for R. Journal of Statistical Software, 42(11), 1-26. https://www.jstatsoft.org/v42/i11/

J.-S. Park and J. Baek (2001), Efficient computation of maximum likelihood estimators in a spatial linear model with power exponential covariogram, Computer Geosciences, 27 no. 1, 1-7.

C.E. Rasmussen and C.K.I. Williams (2006), Gaussian Processes for Machine Learning, the MIT Press, http://www.gaussianprocess.org/gpml/

B.D. Ripley (1987), Stochastic Simulation, Wiley.

O. Roustant, D. Ginsbourger and Yves Deville (2012), DiceKriging, DiceOptim: Two R Packages for the Analysis of Computer Experiments by Kriging-Based Metamodeling and Optimization, Journal of Statistical Software, 51(1), 1-55, https://www.jstatsoft.org/v51/i01/.

J. Sacks, W.J. Welch, T.J. Mitchell, and H.P. Wynn (1989), Design and analysis of computer experiments, Statistical Science, 4, 409-435.

M. Schonlau (1997), Computer experiments and global optimization, Ph.D. thesis, University of Waterloo.

M.L. Stein (1999), Interpolation of spatial data, some theory for kriging, Springer.

Y. Xiong, W. Chen, D. Apley, and X. Ding (2007), Int. J. Numer. Meth. Engng, A non-stationary covariance-based Kriging method for metamodelling in engineering design.

Penalty function

Description

Smoothly Clipped Absolute Deviation function.

Usage

SCAD(x, lambda)

Arguments

x

a vector where the function is to be evaluated.

lambda

a number representing a tuning parameter.

Details

SCAD is an even continuous function equal to 0 at x=0, and defined piecewise with derivative lambda in [0, lambda], (a*lambda - x)/(a-1) in [lambda, a*lambda], and 0 for x larger than a*lambda. As suggested by (Li, Sudjianto, 2005), we set a=3.7.

Value

A vector containing the SCAD values at x.

Note

In MLE problems, the penalty value lambda should tend to 0 when the sample size tends to infinity to insure that the asymptotic properties of Penalized-MLE and MLE are the same (see Li, Sudjianto, 2005).

Author(s)

O. Roustant, D. Ginsbourger, Ecole des Mines de St-Etienne.

References

R. Li and A. Sudjianto (2005), Analysis of Computer Experiments Using Penalized Likelihood in Gaussian Kriging Models, Technometrics, 47 no. 2, 111-120.

Examples

 
x <- seq(-8,8, length=200)
a <- 3.7

lambda <- 1.5
y <- SCAD(x, lambda)
plot(x, y, type="l", ylim=c(0,6))
x.knots <- c(-a*lambda, -lambda, 0, lambda, a*lambda)
points(x.knots, SCAD(x.knots, lambda), pch=19, cex=0.5)
text(6, SCAD(6, lambda)+0.3, paste("lambda =", lambda))

for (i in 1:2) {
   lambda <- lambda - 0.5
   y <- SCAD(x, lambda)
   lines(x, y, type="l")
   x.knots <- c(-a*lambda, -lambda, 0, lambda, a*lambda)
   points(x.knots, SCAD(x.knots, lambda), pch=19, cex=0.5)
   text(6, SCAD(6, lambda)+0.3, paste("lambda =", lambda))
}

abline(v=0, h=0, lty="dotted")
title("SCAD function")

Penalty function derivative

Description

Derivative of SCAD function.

Usage

SCAD.derivative(x, lambda)

Arguments

x

a vector where the function is to be evaluated.

lambda

a number representing a tuning parameter.

Value

A vector containing the SCAD derivative values at x.

Author(s)

O. Roustant, D. Ginsbourger, Ecole des Mines de St-Etienne.

References

R. Li and A. Sudjianto (2005), Analysis of Computer Experiments Using Penalized Likelihood in Gaussian Kriging Models, Technometrics, 47 no. 2, 111-120.

2D test function

Description

Branin-Hoo 2-dimensional test function.

Usage

branin(x)

Arguments

x

a 2-dimensional vector specifying the location where the function is to be evaluated.

Details

The Branin-Hoo function is defined here over [0,1] x [0,1], instead of [-5,0] x [10,15] as usual. It has 3 global minima : x1 = c(0.9616520, 0.15); x2 = c(0.1238946, 0.8166644); x3 = c(0.5427730, 0.15)

Value

A real number equal to the Branin-Hoo function values at x

Author(s)

D. Ginsbourger, Ecole des Mines de St-Etienne.

Examples

 
n.grid <- 20
x.grid <- y.grid <- seq(0,1,length=n.grid)
design.grid <- expand.grid(x.grid, y.grid)
response.grid <- apply(design.grid, 1, branin)
z.grid <- matrix(response.grid, n.grid, n.grid)
contour(x.grid,y.grid,z.grid,40)
x1 = c(0.9616520, 0.15); x2 = c(0.1238946, 0.8166644); x3 = c(0.5427730, 0.15)
points(rbind(t(x1), t(x2), t(x3)), pch=19, col="red")
title("Fonction de Branin")

2D test function

Description

Camelback 2-dimensional test function.

Usage

camelback(x)

Arguments

x

a 2-dimensional vector specifying the location where the function is to be evaluated.

Details

The Camelback function is usually defined over the domain [-3,-2] x [3, 2]. Here, the function is adapted to the domain [0,1] x [0,1]. It has 2 global minima : x1 = c(0.5149730,0.3218374); x2 = c(0.4850263,0.6781641)

Value

A real number equal to the Camelback function values at x

Author(s)

D. Ginsbourger, Ecole des Mines de St-Etienne.

Examples

 
n.grid <- 20
x.grid <- y.grid <- seq(0,1,length=n.grid)
design.grid <- expand.grid(x.grid, y.grid)
response.grid <- apply(design.grid, 1, camelback)
z.grid <- matrix(response.grid, n.grid, n.grid)
contour(x.grid,y.grid,z.grid,20)
x1 = c(0.5149730,0.3218374); x2 = c(0.4850263,0.6781641)
points(rbind(t(x1), t(x2)), pch=19, col="red")
title("Fonction Camelback")

Consistency test between the column names of two matrices

Description

Tests if the names of a second matrix are equal to a given matrix up to a permutation, and permute its columns accordingly. When the second one has no column names, the names of the first one are used in the same order.

Usage

checkNames(X1, X2, X1.name = "X1", X2.name = "X2")
checkNamesList(X1, l2, X1.name = "X1", l2.name = "l2")

Arguments

X1

a matrix containing column names.

X2

a matrix containing the same number of columns.

l2

a list with length ncol(X1).

X1.name

X2.name

optional names for the matrix X1 and X2 theirselves (useful for error messages).

l2.name

optional names for l2.

Details

If X2 does not contain variable names, then the names of X1 are used in the same order, and X2 is returned with these names. Otherwise, if the column names of X1 and X2 are equal up to a permutation, the column of X2 are permuted according to the order of X1' names.

Value

The matrix X2, with columns possibly permuted. See details.

Author(s)

O. Roustant

Examples

X1 <- matrix(1, 2, 3)
X2 <- matrix(1:6, 2, 3)

colnames(X1) <- c("x1", "x2", "x3")
checkNames(X1, X2)
# attributes the same names for X2, and returns X2

colnames(X2) <- c("x1", "x2", "x5")
## Not run: checkNames(X1, X2)
# returns an error since the names of X1 and X2 are different

colnames(X2) <- c("x2", "x1", "x3")
checkNames(X1, X2)
# returns the matrix X2, but with permuted columns

l2 <- list(x3 = 1, x2 = c(2, 3), x1 = -6)
checkNamesList(X1, l2)

Get coefficients values

Description

Get or set coefficients values.

Usage

  coef(object, ...)

Arguments

object

an object specifying a covariance structure or a km object.

...

other arguments (undocumented at this stage).

Note

The replacement method coef<- is not available.

Author(s)

Y. Deville, O. Roustant

Auxiliary variables for kriging

Description

Computes or updates some auxiliary variables used for kriging (see below). This is useful in several situations : when all parameters are known (as for one basic step in Bayesian analysis), or when some new data is added but one does not want to re-estimate the model coefficients. On the other hand, computeAuxVariables is not used during the estimation of covariance parameters, since this function requires to compute the trend coefficients at each optimization step; the alternative given by (Park, Baek, 2001) is preferred.

Usage

computeAuxVariables(model)

Arguments

model

an object of class km with missing (or non updated) items.

Value

An updated km objet, where the changes concern the following items:

T

a matrix equal to the upper triangular factor of the Choleski decomposition of C, such that t(T)*T = C (where C is the covariance matrix).

z

a vector equal to inv(t(T))*(y - F*beta), with y, F, beta are respectively the response, the experimental matrix and the trend coefficients specified in model@trend.coef. If model@trend.coef is empty, z is not computed.

M

a matrix equal to inv(t(T))*F.

Note

T is computed with the base function chol. z and M are computed by solving triangular linear systems with backsolve. z is not computed if model@trend.coef is empty.

Author(s)

O. Roustant, D. Ginsbourger, Ecole des Mines de St-Etienne

References

J.-S. Park and J. Baek (2001), Efficient computation of maximum likelihood estimators in a spatial linear model with power exponential covariogram, Computer Geosciences, 27 no. 1, 1-7.

Class of tensor-product spatial covariances with isotropic range

Description

S4 class of isotropic spatial covariance kerlnes based upon the covTensorProduct class

Objects from the Class

In 1-dimension, the covariance kernels are parameterized as in (Rasmussen, Williams, 2006). Denote by theta the range parameter, p the exponent parameter (for power-exponential covariance), s the standard deviation, and h=||x-y||. Then we have C(x,y) = s^2 * k(x,y), with:

Gauss	`k(x,y) = exp(-1/2*(h/theta)^2)`
Exponential	`k(x,y) = exp(-h/theta)`
Matern(3/2)	`k(x,y) = (1+sqrt(3)h/theta)exp(-sqrt(3)*h/theta)`
Matern(5/2)	`k(x,y) = (1+sqrt(5)h/theta+(1/3)5*(h/theta)^2)`
	`exp(-sqrt(5)h/theta)`
Power-exponential	`k(x,y) = exp(-(h/theta)^p)`

Slots

d:: Object of class "integer". The spatial dimension.
name:: Object of class "character". The covariance function name. To be chosen between "gauss", "matern5_2", "matern3_2", "exp", and "powexp"
paramset.n:: Object of class "integer". 1 for covariance depending only on the ranges parameters, 2 for "powexp" which also depends on exponent parameters.
var.names:: Object of class "character". The variable names.
sd2:: Object of class "numeric". The variance of the stationary part of the process.
known.covparam:: Object of class "character". Internal use. One of: "None", "All".
nugget.flag:: Object of class "logical". Is there a nugget effect?
nugget.estim:: Object of class "logical". Is the nugget effect estimated or known?
nugget:: Object of class "numeric". If there is a nugget effect, its value (homogeneous to a variance).
param.n:: Object of class "integer". The total number of parameters.
range.names:: Object of class "character". Names of range parameters, for printing purpose. Default is "theta".
range.val:: Object of class "numeric". Values of range parameters.

Extends

Class "covKernel", directly.

Methods

coef: signature(object = "covIso"): ...
covMat1Mat2: signature(object = "covIso"): ...
covMatrix: signature(object = "covIso"): ...
covMatrixDerivative: signature(object = "covIso"): ...
covParametersBounds: signature(object = "covIso"): ...
covparam2vect: signature(object = "covIso"): ...
vect2covparam: signature(object = "covIso"): ...
covVector.dx: signature(object = "covIso"): ...
inputnames: signature(x = "covIso"): ...
kernelname: signature(x = "covIso"): ...
ninput: signature(x = "covIso"): ...
nuggetflag: signature(x = "covIso"): ...
nuggetvalue: signature(x = "covIso"): ...
show: signature(object = "covIso"): ...
summary: signature(object = "covIso"): ...

Author(s)

O. Roustant, D. Ginsbourger

References

N.A.C. Cressie (1993), Statistics for spatial data, Wiley series in probability and mathematical statistics.

C.E. Rasmussen and C.K.I. Williams (2006), Gaussian Processes for Machine Learning, the MIT Press, http://www.gaussianprocess.org/gpml/

M.L. Stein (1999), Interpolation of spatial data, some theory for kriging, Springer.

Examples

showClass("covIso")

Class "covKernel"

Description

Union of classes including "covTensorProduct", "covIso", "covScaling" and "covUser"

Objects from the Class

A virtual Class: No objects may be created from it.

Methods

No methods defined with class "covKernel" in the signature.

Author(s)

Olivier Roustant, David Ginsbourger, Yves Deville

Examples

showClass("covKernel")

Cross covariance matrix

Description

Computes the cross covariance matrix between two sets of locations for a spatial random process with a given covariance structure. Typically the two sets are a learning set and a test set.

Usage

covMat1Mat2(object, X1, X2, nugget.flag=FALSE)

Arguments

object

an object specifying the covariance structure.

X1

a matrix whose rows represent the locations of a first set (for instance a set of learning points).

X2

a matrix whose rows represent the locations of a second set (for instance a set of test points).

nugget.flag

an optional boolean. If TRUE, the covariance between 2 equal locations takes into account the nugget effect (if any). Locations are considered equal if their euclidian distance is inferior to 1e-15. Default is FALSE.

Value

a matrix of size (nb of rows of X1 * nb of rows of X2) whose element (i1,i2) is equal to the covariance between the locations specified by row i1 of X1 and row i2 of X2.

Author(s)

Olivier Roustant, David Ginsbourger, Ecole des Mines de St-Etienne.

Covariance matrix

Description

Computes the covariance matrix at a set of locations for a spatial random process with a given covariance structure.

Usage

covMatrix(object, X, noise.var = NULL)

Arguments

object

an object specifying the covariance structure.

X

a matrix whose columns represent locations.

noise.var

for noisy observations : an optional vector containing the noise variance at each observation

Value

a list with the following items :

C

a matrix representing the covariance matrix for the locations specified in the X argument, including a possible nugget effect or observation noise.

vn

a vector of length n (X size) containing a replication of the nugget effet or the observation noise (so that C-diag(vn) contains the covariance matrix when there is no nugget effect nor observation noise)

Author(s)

Olivier Roustant, David Ginsbourger, Ecole des Mines de St-Etienne.

Covariance matrix derivatives

Description

Computes a partial derivative of the covariance matrix C in function covMatrix.

Usage

covMatrixDerivative(object, X, C0, k, ...)

Arguments

object

an object specifying the covariance structure.

X

a matrix whose columns represent locations.

C0

a matrix corresponding to the covariance matrix for the locations specified in the X argument, when there is no nugget effet nor observation noise.

k

an integer representing the partial derivative index.

...

additional parameters, typically an environment used for storage

Value

A matrix representing the partial derivative of C

Author(s)

Olivier Roustant, David Ginsbourger, Ecole des Mines de St-Etienne.

Boundaries for covariance parameters

Description

Default boundaries for covariance parameters.

Usage

covParametersBounds(object, X)

Arguments

object

an object specifying the covariance structure.

X

a matrix representing the design of experiments.

Details

The default values are chosen as follows :

Range parameters (all covariances)	`lower=1e-10`, `upper`=2 times the difference between
	the max. and min. values of `X` for each coordinate
Shape parameters (`powexp` covariance)	`lower=1e-10`, `upper=2` for each coordinate

Value

a list with items lower, upper containing default boundaries for the covariance parameters.

Author(s)

Olivier Roustant, David Ginsbourger, Ecole des Mines de St-Etienne.

Class "covScaling"

Description

Composition of isotropic kernels with coordinatewise non-linear scaling obtained by integrating piecewise affine functions

Objects from the Class

Gauss	`k(x,y) = exp(-1/2*(h/theta)^2)`
Exponential	`k(x,y) = exp(-h/theta)`
Matern(3/2)	`k(x,y) = (1+sqrt(3)h/theta)exp(-sqrt(3)*h/theta)`
Matern(5/2)	`k(x,y) = (1+sqrt(5)h/theta+(1/3)5*(h/theta)^2)`
	`exp(-sqrt(5)h/theta)`
Power-exponential	`k(x,y) = exp(-(h/theta)^p)`

Here, in every dimension, the corresponding one-dimensional stationary kernel k(x,y) is replaced by k(f(x),f(y)), where f is a continuous monotonic function indexed by a finite number of parameters (see the references for more detail).

Slots

d:: Object of class "integer". The spatial dimension.
knots:: Object of class "list". The j-th element is a vector containing the knots for dimension j.
eta:: Object of class "list". In correspondance with knots, the j-th element is a vector containing the scaling coefficients (i.e. the derivatives of the scaling function at the knots) for dimension j.
name:: Object of class "character". The covariance function name. To be chosen between "gauss", "matern5_2", "matern3_2", "exp", and "powexp"
paramset.n:: Object of class "integer". 1 for covariance depending only on the ranges parameters, 2 for "powexp" which also depends on exponent parameters.
var.names:: Object of class "character". The variable names.
sd2:: Object of class "numeric". The variance of the stationary part of the process.
known.covparam:: Object of class "character". Internal use. One of: "None", "All".
nugget.flag:: Object of class "logical". Is there a nugget effect?
nugget.estim:: Object of class "logical". Is the nugget effect estimated or known?
nugget:: Object of class "numeric". If there is a nugget effect, its value (homogeneous to a variance).
param.n:: Object of class "integer". The total number of parameters.

Extends

Class "covKernel", directly.

Methods

coef: signature(object = "covScaling"): ...
covMat1Mat2: signature(object = "covScaling"): ...
covMatrix: signature(object = "covScaling"): ...
covMatrixDerivative: signature(object = "covScaling"): ...
covParametersBounds: signature(object = "covScaling"): ...
covparam2vect: signature(object = "covScaling"): ...
vect2covparam: signature(object = "covScaling"): ...
show: signature(object = "covScaling"): ...

Author(s)

Olivier Roustant, David Ginsbourger, Yves Deville

References

Y. Xiong, W. Chen, D. Apley, and X. Ding (2007), Int. J. Numer. Meth. Engng, A non-stationary covariance-based Kriging method for metamodelling in engineering design.

Examples

showClass("covScaling")

Spatial covariance - Class constructor

Description

Creates a covariance structure.

Usage

covStruct.create(covtype, d, known.covparam, var.names, coef.cov = NULL, coef.var = NULL, 
     nugget = NULL, nugget.estim = FALSE, nugget.flag = FALSE,
     iso = FALSE, scaling = FALSE, knots=NULL, kernel=NULL)

Arguments

covtype

a character string specifying the covariance structure.

d

an integer containing the spatial dimension.

known.covparam

a character ("None" or "All") indicating whether covariance parameters are known or must be estimated.

var.names

a vector of character strings containing the variable names.

coef.cov

an optional vector containing the values for covariance parameters.

coef.var

an optional number containing the variance value.

nugget

an optional variance value standing for the homogenous nugget effect. Default is NULL.

nugget.estim

is the nugget effect estimated or known?

nugget.flag

is there a nugget effect?

iso

an optional boolean that can be used to force a tensor-product covariance structure to have a range parameter common to all dimensions.

scaling

an optional boolean indicating whether a scaling on the covariance structure should be used.

knots

an optional list of knots (used if scaling = TRUE)

kernel

an optional function containing a new covariance structure

Value

A formal S4 class of type covTensorProduct-class, covIso-class (if iso is TRUE) (if scaling is TRUE), or covUser-class (if kernel is TRUE).

Author(s)

O. Roustant, D. Ginsbourger

Class of tensor-product spatial covariances

Description

S4 class of tensor-product (or separable) covariances.

Value

covTensorProduct

separable covariances depending on 1 set of parameters, such as Gaussian, exponential, Matern with fixed nu... or on 2 sets of parameters, such as power-exponential.

Objects from the Class

A d-dimensional tensor product (or separable) covariance kernel C(x,y) is the tensor product of 1-dimensional covariance kernels : C(x,y) = C(x1,y1)C(x2,y2)...C(xd,yd).

Gauss	`k(x,y) = exp(-1/2*(h/theta)^2)`
Exponential	`k(x,y) = exp(-h/theta)`
Matern(3/2)	`k(x,y) = (1+sqrt(3)h/theta)exp(-sqrt(3)*h/theta)`
Matern(5/2)	`k(x,y) = (1+sqrt(5)h/theta+(1/3)5*(h/theta)^2)`
	`exp(-sqrt(5)h/theta)`
Power-exponential	`k(x,y) = exp(-(h/theta)^p)`

Slots

d:: Object of class "integer". The spatial dimension.
name:: Object of class "character". The covariance function name. To be chosen between "gauss", "matern5_2", "matern3_2", "exp", and "powexp"
paramset.n:: Object of class "integer". 1 for covariance depending only on the ranges parameters, 2 for "powexp" which also depends on exponent parameters.
var.names:: Object of class "character". The variable names.
sd2:: Object of class "numeric". The variance of the stationary part of the process.
known.covparam:: Object of class "character". Internal use. One of: "None", "All".
nugget.flag:: Object of class "logical". Is there a nugget effect?
nugget.estim:: Object of class "logical". Is the nugget effect estimated or known?
nugget:: Object of class "numeric". If there is a nugget effect, its value (homogeneous to a variance).
param.n:: Object of class "integer". The total number of parameters.
range.n:: Object of class "integer". The number of range parameters.
range.names:: Object of class "character". Names of range parameters, for printing purpose. Default is "theta".
range.val:: Object of class "numeric". Values of range parameters.
shape.n:: Object of class "integer". The number of shape parameters (exponent parameters in "powexp").
shape.names:: Object of class "character". Names of shape parameters, for printing purpose. Default is "p".
shape.val:: Object of class "numeric". Values of shape parameters.

Methods

show: signature(x = "covTensorProduct") Print covariance function. See show,km-method.
coef: signature(x = "covTensorProduct") Get the coefficients of the covariance function.

Author(s)

O. Roustant, D. Ginsbourger

References

N.A.C. Cressie (1993), Statistics for spatial data, Wiley series in probability and mathematical statistics.

C.E. Rasmussen and C.K.I. Williams (2006), Gaussian Processes for Machine Learning, the MIT Press, http://www.gaussianprocess.org/gpml/

M.L. Stein (1999), Interpolation of spatial data, some theory for kriging, Springer.

Class "covUser"

Description

An arbitrary covariance kernel provided by the user

Objects from the Class

Any valid covariance kernel, provided as a 2-dimensional function (x,y) -> k(x,y). At this stage, no test is done to check that k is positive definite.

Slots

kernel:: Object of class "function". The new covariance kernel.
nugget.flag:: Object of class "logical". Is there a nugget effect?
nugget:: Object of class "numeric". If there is a nugget effect, its value (homogeneous to a variance).

Extends

Class "covKernel", directly.

Methods

coef: signature(object = "covUser"): ...
covMat1Mat2: signature(object = "covScaling"): ...
covMatrix: signature(object = "covScaling"): ...
show: signature(object = "covScaling"): ...

Author(s)

Olivier Roustant, David Ginsbourger, Yves Deville

Examples

showClass("covUser")

Spatial covariance - Derivatives

Description

Computes the gradient of the covariance vector c(x) computed by covMat1Mat2 with respect to x, for a given covariance structure.

Usage

covVector.dx(object, x, X, c)

Arguments

object

an object specifying the covariance structure.

x

a vector representing the specific location.

X

a matrix whose columns represent locations.

c

a vector containing the covariances between the location x and the set of locations specified in X (see covMat1Mat2).

Value

A vector of real numbers equal to the gradient of c as a function of x.

Author(s)

Olivier Roustant, David Ginsbourger, Ecole des Mines de St-Etienne.

Auxiliary function

Description

Gather the covariance parameters in a single vector. This is useful in the estimation step. Not for direct use.

Usage

covparam2vect(object)

Arguments

object

an object specifying the covariance structure.

Value

A vector containing the covariance parameters.

Author(s)

O. Roustant, D. Ginsbourger

Multiple fold cross validation for a km object

Description

Multiple fold cross validation for a km object without noisy observations.

Usage

cv(model, folds, type="UK", trend.reestim=TRUE, fast=TRUE, light=FALSE)

Arguments

model

an object of class "km" without noisy observations.

folds

a list of index subsets without index redundancy within each fold.

type

a character string corresponding to the kriging family, to be chosen between simple kriging ("SK"), or universal kriging ("UK").

trend.reestim

should the trend be reestimated when removing an observation? Default to FALSE.

fast

binary option to use analytical multiple fold cross validation formulae when applicable.

light

binary option to force not calculating cross validation residual covariances between different folds (relevant, e.g., when performing speed comparisons across baseline versus fast settings).

Value

A list composed of

mean

a list of cross validation mean predictions with same number of elements and respective dimensions than in folds. The ith element is equal to the kriging mean (including the trend) at the ith fold number when it is left out of the design,

y

a vector of actual responses,

cvcov.list

a list of cross validation conditional covariance matrices with same number of elements than in folds and dimensions set accordingly. The ith element is equal to the kriging covariance matrix corresponding to the ith fold number when it is left out of the design,

cvcov.mat

a ntot*ntot matrix containing all covariances between cross-validation errors (stacked with respect to orders between and within folds),

where ntot is the total number of points in the folds list (with possible point redundancies as some points may belong to several folds).

Warning

Kriging parameters are not re-estimated when removing observations. With few points in the learning set, the re-estimated values can be far from those obtained with the entire learning set. One option is to reestimate the trend coefficients, by setting trend.reestim=TRUE.

Author(s)

D. Ginsbourger, University of Bern.

References

F. Bachoc (2013), Cross Validation and Maximum Likelihood estimations of hyper-parameters of Gaussian processes with model misspecification. Computational Statistics and Data Analysis, 66, 55-69.

N.A.C. Cressie (1993), Statistics for spatial data, Wiley series in probability and mathematical statistics.

O. Dubrule (1983), Cross validation of Kriging in a unique neighborhood. Mathematical Geology, 15, 687-699.

J. Gallier. The schur complement and symmetric positive semidefinite (and definite) matrices. Retrieved at https://www.cis.upenn.edu/~jean/schur-comp.pdf.

D. Ginsbourger and C. Schaerer (2021). Fast calculation of Gaussian Process multiple-fold cross-validation residuals and their covariances. arXiv:2101.03108 [stat.ME].

J.D. Martin and T.W. Simpson (2005), Use of kriging models to approximate deterministic computer models, AIAA Journal, 43 no. 4, 853-863.

M. Schonlau (1997), Computer experiments and global optimization, Ph.D. thesis, University of Waterloo.

Examples

# -------------------------------------------------
# A 1D example illustrating leave-one-out residuals 
# and their correlation 
# -------------------------------------------------

# Test function (From Xiong et al. 2007; See scalingFun's doc) 
myfun <- function(x){ 
  sin(30 * (x - 0.9)^4) * cos(2 * (x - 0.9)) + (x - 0.9) / 2
}
t <- seq(from = 0, to = 1, by = 0.005)
allresp <- myfun(t)
par(mfrow = c(1, 1), mar = c(4, 4, 2, 2))
plot(t, allresp, type = "l")

# Design points and associated responses 
nn <- 10
design <- seq(0, 1, length.out = nn)
y <- myfun(design)
points(design, y, pch = 19)

# Model definition and GP prediction (Kriging)
set.seed(1)
model1 <- km(design = data.frame(design = design), 
             response = data.frame(y = y), nugget = 1e-5,
             multistart = 10, control = list(trace = FALSE))
pred1 <- predict(model1, newdata = data.frame(design = t), type = "UK") 
lines(t, pred1$mean, type = "l", col = "blue", lty = 2, lwd = 2)

# Plotting the prediction error versus the GP standard deviation 
par(mfrow = c(2,1))
pred_abserrors <- abs(allresp - pred1$mean)
plot(t, pred_abserrors, type = "l", ylab = "abs pred error")
plot(t, pred1$sd, type = "l", ylab = "GP prediction sd")

# Leave-one-out cross-validation with the cv function 
loofolds <- as.list(seq(1, length(design)))
loo1 <- cv(model = model1, folds = loofolds, type = "UK", 
              trend.reestim = TRUE, fast = TRUE, light = FALSE) 

# y axis limits need to be taken care of 
plotCVmean <- function(cvObj){
  cvObjMean <- unlist(cvObj$mean)
  plot(t, allresp, type = "l", ylim = range(cvObjMean, allresp))
  points(design, y, pch = 19)
  lines(t, pred1$mean, type = "l", col = "blue", lty = 2, lwd = 2)
  points(design, cvObjMean, col = "red", pch = 22, lwd = 2)
}
plotCVsd <- function(cvObj, ylim){
  cv_abserrors <- abs(y - unlist(cvObj$mean))
  plot(t, pred_abserrors, type = "l", ylab = "abs pred error", 
       ylim = ylim)
  points(design, cv_abserrors, col = "red", pch = 22, lwd = 2)
  lines(t, pred1$sd, ylab = "GP prediction sd", col = "blue", 
      lty = 2, lwd = 2)
}

loo1Mean <- unlist(loo1$mean)
loo_abserrors <- abs(y - loo1Mean)
ylim <- c(0, max(loo_abserrors, pred_abserrors))

plotCVmean(loo1)
plotCVsd(loo1, ylim = ylim)

# Calculation of uncorrelated CV residuals and corresponding qqplot 
T <- model1@T
B <- diag(as.numeric(diag(loo1$cvcov.mat))^(-1))
res <- y - loo1Mean
stand <- T %*% B %*% res
opar <- par(mfrow = c(1, 2))
qqnorm(stand, 
       main = "Normal Q-Q Plot of uncorrelated LOO Residuals")
abline(a = 0, b = 1)

# Comparison to "usual" standardized LOO residuals
usual_stand <- diag(as.numeric(diag(loo1$cvcov.mat))^(-1/2)) %*% res
qqnorm(usual_stand, 
       main = "Normal Q-Q Plot of Standardized LOO Residuals") 
abline(a = 0, b = 1)
par(opar)

# Calculation and plot of correlations between most left 
# and other cross-validation residuals 
cvcov.mat <- loo1$cvcov.mat
coco <- cov2cor(cvcov.mat)
par(mfrow = c(1, 1))
plot(coco[1, ], type = "h", ylim = c(-1, 1), lwd = 2,
     main = "Correlation between first and other LOO residuals", 
     ylab = "Correlation")
points(coco[1, ])
abline(h = 0, lty = "dotted")
     
par(mfrow = c(1, 1), mar = c(5.1, 4.1, 4.1, 2.1))

# ------------------------------------------------
# Same example with multiple-fold cross validation 
# under various settings
# ------------------------------------------------

# First with successive two-element folds 
myfolds <- list(c(1, 2), c(3, 4), c(5, 6), c(7, 8), c(9, 10))
cv_2fold <- cv(model = model1, folds = myfolds, type = "SK", 
               trend.reestim = FALSE, fast = TRUE, light = FALSE)  
cv_2fold

opar <- par(mfrow = c(2,1))
plotCVmean(cv_2fold)
plotCVsd(cv_2fold, ylim = ylim)


# With overlapping two-element folds 
myfolds <- list(c(1, 3), c(2, 4), c(3, 5), c(4, 6), 
                c(5, 7), c(6, 8), c(7, 9), c(8, 10))
cv_2fold_overlap <- cv(model = model1, folds = myfolds, type = "UK", 
                       trend.reestim = TRUE, fast = TRUE, light = FALSE)
cv_2fold_overlap


# With a three-fold partition 
myfolds <- list(c(1, 2, 3), c(4, 5, 6, 7), c(8, 9, 10))
cv_3fold <- cv(model = model1, folds = myfolds, type = "UK", 
           trend.reestim = TRUE, fast = TRUE, light = FALSE)
cv_3fold

plotCVmean(cv_3fold)
plotCVsd(cv_3fold, ylim = ylim)
par(opar)

Trend model formula operation

Description

Drop the response in the formula specifying the linear trend.

Usage

drop.response(formula, data)

Arguments

formula

an object of class formula.

data

a data frame corresponding to formula.

Value

An object of class formula.

Author(s)

O. Roustant, D. Ginsbourger

2D test function

Description

Goldstein price 2-dimensional test function.

Usage

goldsteinPrice(x)

Arguments

x

a 2-dimensional vector specifying the location where the function is to be evaluated.

Details

The Goldstein price function is usually defined over the domain [-2,-2] x [2, 2]. Here, the function is adapted to the domain [0,1] x [0,1]. It has 1 global minimum : x1 = c(0.5, 0.25)

Value

A real number equal to the Goldstein price function values at x

Author(s)

D. Ginsbourger, Ecole des Mines de St-Etienne.

Examples

 
n.grid <- 20
x.grid <- y.grid <- seq(0,1,length=n.grid)
design.grid <- expand.grid(x.grid, y.grid)
response.grid <- apply(design.grid, 1, goldsteinPrice)
z.grid <- matrix(response.grid, n.grid, n.grid)
contour(x.grid, y.grid, z.grid, 40)
x1 = c(0.5, 0.25)
points(t(x1), pch=19, col="red")
title("Fonction de Goldstein price")

3D test function

Description

Hartman 3-dimensional test function.

Usage

hartman3(x)

Arguments

x

a 3-dimensional vector specifying the location where the function is to be evaluated.

Details

The hartman3 function is defined over the domain [0,1]^3. It has 1 global minimum : x1 = c(0.1, 0.55592003, 0.85218259)

Value

A real number equal to the hartman3 function values at x

Author(s)

D. Ginsbourger, Ecole des Mines de St-Etienne.

Examples

 
design <- matrix(runif(300), 100, 3)
response <- apply(design, 1, hartman3)

6D test function

Description

Hartman 6-dimensional test function.

Usage

hartman6(x)

Arguments

x

a 6-dimensional vector specifying the location where the function is to be evaluated.

Details

The hartman6 function is defined over the domain [0,1]^6. It has 1 global minimum : x1 = c(0.20168952, 0.15001069, 0.47687398, 0.27533243, 0.31165162, 0.65730054)

Value

A real number equal to the hartman6 function values at x

Author(s)

D. Ginsbourger, Ecole des Mines de St-Etienne.

Examples

 
design <- matrix(runif(600), 100, 6)
response <- apply(design, 1, hartman6)

Get the input variables names

Description

Get the names of the input variables.

Usage

  inputnames(x)

Arguments

x

an object containing the covariance structure.

Value

A vector of character strings containing the names of the input variables.

Get the kernel name

Description

Get the name of the underlying tensor-product covariance structure.

Usage

  kernelname(x)

Arguments

x

an object containing the covariance structure.

Value

A character string.

Fit and/or create kriging models

Description

km is used to fit kriging models when parameters are unknown, or to create km objects otherwise. In both cases, the result is a km object. If parameters are unknown, they are estimated by Maximum Likelihood. As a beta version, Penalized Maximum Likelihood Estimation is also possible if some penalty is given, or Leave-One-Out for noise-free observations.

Usage

km(formula=~1, design, response, covtype="matern5_2",
   coef.trend = NULL, coef.cov = NULL, coef.var = NULL,
   nugget = NULL, nugget.estim=FALSE, noise.var=NULL, estim.method="MLE",
   penalty = NULL, optim.method = "BFGS", lower = NULL, upper = NULL, 
   parinit = NULL, multistart = 1, control = NULL, gr = TRUE, 
   iso=FALSE, scaling=FALSE, knots=NULL, kernel=NULL)

Arguments

formula

an optional object of class "formula" specifying the linear trend of the kriging model (see lm). This formula should concern only the input variables, and not the output (response). If there is any, it is automatically dropped. In particular, no response transformation is available yet. The default is ~1, which defines a constant trend.

design

a data frame representing the design of experiments. The ith row contains the values of the d input variables corresponding to the ith evaluation

response

a vector (or 1-column matrix or data frame) containing the values of the 1-dimensional output given by the objective function at the design points.

covtype

an optional character string specifying the covariance structure to be used, to be chosen between "gauss", "matern5_2", "matern3_2", "exp" or "powexp". See a full description of available covariance kernels in covTensorProduct-class. Default is "matern5_2". See also the argument kernel that allows the user to build its own covariance structure.

coef.trend

(see below)

coef.cov

(see below)

coef.var

optional vectors containing the values for the trend, covariance and variance parameters. For estimation, 4 cases are implemented: 1. (All unknown) If all are missing, all are estimated. 2. (All known) If all are provided, no estimation is performed; 3. (Known trend) If coef.trend is provided but at least one of coef.cov or coef.var is missing, then BOTH coef.cov and coef.var are estimated; 4. (Unknown trend) If coef.cov and coef.var are provided but coef.trend is missing, then coef.trend is estimated (GLS formula).

nugget

an optional variance value standing for the homogeneous nugget effect.

nugget.estim

an optional boolean indicating whether the nugget effect should be estimated. Note that this option does not concern the case of heterogeneous noisy observations (see noise.var below). If nugget is given, it is used as an initial value. Default is FALSE.

noise.var

for noisy observations : an optional vector containing the noise variance at each observation. This is useful for stochastic simulators. Default is NULL.

estim.method

a character string specifying the method by which unknown parameters are estimated. Default is "MLE" (Maximum Likelihood). At this stage, a beta version of leave-One-Out estimation (estim.method="LOO") is also implemented for noise-free observations.

penalty

(beta version) an optional list suitable for Penalized Maximum Likelihood Estimation. The list must contain the item fun indicating the penalty function, and the item value equal to the value of the penalty parameter. At this stage the only available fun is "SCAD", and covtype must be "gauss". Default is NULL, corresponding to (un-penalized) Maximum Likelihood Estimation.

optim.method

an optional character string indicating which optimization method is chosen for the likelihood maximization. "BFGS" is the optim quasi-Newton procedure of package stats, with the method "L-BFGS-B". "gen" is the genoud genetic algorithm (using derivatives) from package rgenoud (>= 5.3.3).

lower

(see below)

upper

optional vectors containing the bounds of the correlation parameters for optimization. The default values are given by covParametersBounds.

parinit

an optional vector containing the initial values for the variables to be optimized over. If no vector is given, an initial point is generated as follows. For method "gen", the initial point is generated uniformly inside the hyper-rectangle domain defined by lower and upper. For method "BFGS", some points (see control below) are generated uniformly in the domain. Then the best point with respect to the likelihood (or penalized likelihood, see penalty) criterion is chosen.

multistart

an optional integer indicating the number of initial points from which running the BFGS optimizer. These points will be selected as the best multistart one(s) among those evaluated (see above parinit). The multiple optimizations will be performed in parallel provided that a parallel backend is registered (see package foreach).

control

an optional list of control parameters for optimization. See details below.

gr

an optional boolean indicating whether the analytical gradient should be used. Default is TRUE.

iso

an optional boolean that can be used to force a tensor-product covariance structure (see covTensorProduct-class) to have a range parameter common to all dimensions. Default is FALSE. Not used (at this stage) for the power-exponential type.

scaling

an optional boolean indicating whether a scaling on the covariance structure should be used.

knots

an optional list of knots for scaling. The j-th element is a vector containing the knots for dimension j. If scaling=TRUE and knots are not specified, than knots are fixed to 0 and 1 in each dimension (which corresponds to affine scaling for the domain [0,1]^d).

kernel

an optional function containing a new covariance structure. At this stage, the parameters must be provided as well, and are not estimated. See an example below.

Details

The optimisers are tunable by the user by the argument control. Most of the control parameters proposed by BFGS and genoud can be passed to control except the ones that must be forced [for the purpose of optimization setting], as indicated in the table below. See optim and genoud to get more details about them.

BFGS	`trace`, `parscale`, `ndeps`, `maxit`, `abstol`, `reltol`, `REPORT`, `lnm`, `factr`, `pgtol`
genoud	all parameters EXCEPT: `fn, nvars, max, starting.values, Domains, gr, gradient.check, boundary.enforcement, hessian` and `optim.method`.

Notice that the right places to specify the optional starting values and boundaries are in parinit and lower, upper, as explained above. Some additional possibilities and initial values are indicated in the table below:

`trace`	Turn it to `FALSE` to avoid printing during optimization progress.
`pop.size`	For method `"BFGS"`, it is the number of candidate initial points generated before optimization starts (see `parinit` above). Default is 20. For method `"gen"`, `"pop.size"` is the population size, set by default at min(20, 4+3*log(nb of variables)
`max.generations`	Default is 5
`wait.generations`	Default is 2
`BFGSburnin`	Default is 0

Value

An object of class km (see km-class).

Author(s)

O. Roustant, D. Ginsbourger, Ecole des Mines de St-Etienne.

References

N.A.C. Cressie (1993), Statistics for spatial data, Wiley series in probability and mathematical statistics.

A.G. Journel and M.E. Rossi (1989), When do we need a trend model in kriging ?, Mathematical Geology, 21 no. 7, 715-739.

D.G. Krige (1951), A statistical approach to some basic mine valuation problems on the witwatersrand, J. of the Chem., Metal. and Mining Soc. of South Africa, 52 no. 6, 119-139.

R. Li and A. Sudjianto (2005), Analysis of Computer Experiments Using Penalized Likelihood in Gaussian Kriging Models, Technometrics, 47 no. 2, 111-120.

K.V. Mardia and R.J. Marshall (1984), Maximum likelihood estimation of models for residual covariance in spatial regression, Biometrika, 71, 135-146.

J.D. Martin and T.W. Simpson (2005), Use of kriging models to approximate deterministic computer models, AIAA Journal, 43 no. 4, 853-863.

G. Matheron (1969), Le krigeage universel, Les Cahiers du Centre de Morphologie Mathematique de Fontainebleau, 1.

W.R. Jr. Mebane and J.S. Sekhon, in press (2009), Genetic optimization using derivatives: The rgenoud package for R, Journal of Statistical Software.

J.-S. Park and J. Baek (2001), Efficient computation of maximum likelihood estimators in a spatial linear model with power exponential covariogram, Computer Geosciences, 27 no. 1, 1-7.

C.E. Rasmussen and C.K.I. Williams (2006), Gaussian Processes for Machine Learning, the MIT Press, http://www.gaussianprocess.org/gpml/

Examples


# ----------------------------------
# A 2D example - Branin-Hoo function
# ----------------------------------

# a 16-points factorial design, and the corresponding response
d <- 2; n <- 16
design.fact <- expand.grid(x1=seq(0,1,length=4), x2=seq(0,1,length=4))
y <- apply(design.fact, 1, branin) 

# kriging model 1 : matern5_2 covariance structure, no trend, no nugget effect
m1 <- km(design=design.fact, response=y)

# kriging model 2 : matern5_2 covariance structure, 
#                   linear trend + interactions, no nugget effect
m2 <- km(~.^2, design=design.fact, response=y)

# graphics 
n.grid <- 50
x.grid <- y.grid <- seq(0,1,length=n.grid)
design.grid <- expand.grid(x1=x.grid, x2=y.grid)
response.grid <- apply(design.grid, 1, branin)
predicted.values.model1 <- predict(m1, design.grid, "UK")$mean
predicted.values.model2 <- predict(m2, design.grid, "UK")$mean
par(mfrow=c(3,1))
contour(x.grid, y.grid, matrix(response.grid, n.grid, n.grid), 50, main="Branin")
points(design.fact[,1], design.fact[,2], pch=17, cex=1.5, col="blue")
contour(x.grid, y.grid, matrix(predicted.values.model1, n.grid, n.grid), 50, 
        main="Ordinary Kriging")
points(design.fact[,1], design.fact[,2], pch=17, cex=1.5, col="blue")
contour(x.grid, y.grid, matrix(predicted.values.model2, n.grid, n.grid), 50, 
        main="Universal Kriging")
points(design.fact[,1], design.fact[,2], pch=17, cex=1.5, col="blue")
par(mfrow=c(1,1))


# (same example) how to use the multistart argument
# -------------------------------------------------
require(foreach)

# below an example for a computer with 2 cores, but also work with 1 core

nCores <- 2
require(doParallel)
cl <-  makeCluster(nCores) 
registerDoParallel(cl)

# kriging model 1, with 4 starting points
m1_4 <- km(design=design.fact, response=y, multistart=4)

stopCluster(cl)

# -------------------------------
# A 1D example with penalized MLE
# -------------------------------

# from Fang K.-T., Li R. and Sudjianto A. (2006), "Design and Modeling for 
# Computer Experiments", Chapman & Hall, pages 145-152

n <- 6; d <- 1
x <- seq(from=0, to=10, length=n)
y <- sin(x)
t <- seq(0,10, length=100)

# one should add a small nugget effect, to avoid numerical problems
epsilon <- 1e-3
model <- km(formula<- ~1, design=data.frame(x=x), response=data.frame(y=y), 
            covtype="gauss", penalty=list(fun="SCAD", value=3), nugget=epsilon)

p <- predict(model, data.frame(x=t), "UK")

plot(t, p$mean, type="l", xlab="x", ylab="y", 
                     main="Prediction via Penalized Kriging")
points(x, y, col="red", pch=19)
lines(t, sin(t), lty=2, col="blue")
legend(0, -0.5, legend=c("Sine Curve", "Sample", "Fitted Curve"), 
       pch=c(-1,19,-1), lty=c(2,-1,1), col=c("blue","red","black"))


# ------------------------------------------------------------------------
# A 1D example with known trend and known or unknown covariance parameters
# ------------------------------------------------------------------------

x <- c(0, 0.4, 0.6, 0.8, 1);
y <- c(-0.3, 0, -0.8, 0.5, 0.9)

theta <- 0.01; sigma <- 3; trend <- c(-1,2)

model <- km(~x, design=data.frame(x=x), response=data.frame(y=y), 
            covtype="matern5_2", coef.trend=trend, coef.cov=theta, 
            coef.var=sigma^2)

# below: if you want to specify trend only, and estimate both theta and sigma:
# model <- km(~x, design=data.frame(x=x), response=data.frame(y=y), 
#             covtype="matern5_2", coef.trend=trend, lower=0.2)
# Remark: a lower bound or penalty function is useful here,
#         due to the very small number of design points...

# kriging with gaussian covariance C(x,y)=sigma^2 * exp(-[(x-y)/theta]^2), 
#         and linear trend t(x) = -1 + 2x

t <- seq(from=0, to=1, by=0.005)
p <- predict(model, newdata=data.frame(x=t), type="SK")
# beware that type = "SK" for known parameters (default is "UK")

plot(t, p$mean, type="l", ylim=c(-7,7), xlab="x", ylab="y")
lines(t, p$lower95, col="black", lty=2)
lines(t, p$upper95, col="black", lty=2)
points(x, y, col="red", pch=19)
abline(h=0)


# --------------------------------------------------------------
# Kriging with noisy observations (heterogeneous noise variance)
# --------------------------------------------------------------

fundet <- function(x){
return((sin(10*x)/(1+x)+2*cos(5*x)*x^3+0.841)/1.6)
}

level <- 0.5; epsilon <- 0.1
theta <- 1/sqrt(30); p <- 2; n <- 10
x <- seq(0,1, length=n)

# Heteregeneous noise variances: number of Monte Carlo evaluation among 
#                                a total budget of 1000 stochastic simulations
MC_numbers <- c(10,50,50,290,25,75,300,10,40,150)
noise.var <- 3/MC_numbers

# Making noisy observations from 'fundet' function (defined above)
y <- fundet(x) + noise.var*rnorm(length(x))

# kriging model definition (no estimation here)
model <- km(y~1, design=data.frame(x=x), response=data.frame(y=y), 
            covtype="gauss", coef.trend=0, coef.cov=theta, coef.var=1, 
            noise.var=noise.var)

# prediction
t <- seq(0, 1, by=0.01)
p <- predict.km(model, newdata=data.frame(x=t), type="SK")
lower <- p$lower95; upper <- p$upper95

# graphics
par(mfrow=c(1,1))
plot(t, p$mean, type="l", ylim=c(1.1*min(c(lower,y)) , 1.1*max(c(upper,y))), 
                xlab="x", ylab="y",col="blue", lwd=1.5)
polygon(c(t,rev(t)), c(lower, rev(upper)), col=gray(0.9), border = gray(0.9))
lines(t, p$mean, type="l", ylim=c(min(lower) ,max(upper)), xlab="x", ylab="y",
                 col="blue", lwd=1)
lines(t, lower, col="blue", lty=4, lwd=1.7)
lines(t, upper, col="blue", lty=4, lwd=1.7)
lines(t, fundet(t), col="black", lwd=2)
points(x, y, pch=8,col="blue")
text(x, y, labels=MC_numbers, pos=3)


# -----------------------------
# Checking parameter estimation 
# -----------------------------

d <- 3       	# problem dimension
n <- 40			# size of the experimental design
design <- matrix(runif(n*d), n, d)

covtype <- "matern5_2"		
theta <- c(0.3, 0.5, 1)		# the parameters to be found by estimation
sigma <- 2
nugget <- NULL  # choose a numeric value if you want to estimate nugget 
nugget.estim <- FALSE # choose TRUE if you want to estimate it

n.simu <- 30		# number of simulations
sigma2.estimate <- nugget.estimate <- mu.estimate <- matrix(0, n.simu, 1)
coef.estimate <- matrix(0, n.simu, length(theta))

model <- km(~1, design=data.frame(design), response=rep(0,n), covtype=covtype, 
            coef.trend=0, coef.cov=theta, coef.var=sigma^2, nugget=nugget)
y <- simulate(model, nsim=n.simu)

for (i in 1:n.simu) {
	# parameter estimation: tune the optimizer by changing optim.method, control
	model.estimate <- km(~1, design=data.frame(design), response=data.frame(y=y[i,]), 
	covtype=covtype, optim.method="BFGS", control=list(pop.size=50, trace=FALSE), 
        nugget.estim=nugget.estim) 
	
	# store results
	coef.estimate[i,] <- covparam2vect(model.estimate@covariance)
	sigma2.estimate[i] <- model.estimate@covariance@sd2
	mu.estimate[i] <- model.estimate@trend.coef
	if (nugget.estim) nugget.estimate[i] <- model.estimate@covariance@nugget
}

# comparison true values / estimation
cat("\nResults with ", n, "design points, 
    obtained with ", n.simu, "simulations\n\n",
    "Median of covar. coef. estimates: ", apply(coef.estimate, 2, median), "\n",
    "Median of trend  coef. estimates: ", median(mu.estimate), "\n", 
    "Mean of the var. coef. estimates: ", mean(sigma2.estimate))
if (nugget.estim) cat("\nMean of the nugget effect estimates: ", 
                      mean(nugget.estimate))

# one figure for this specific example - to be adapted
split.screen(c(2,1))        # split display into two screens
split.screen(c(1,2), screen = 2) # now split the bottom half into 3

screen(1)
boxplot(coef.estimate[,1], coef.estimate[,2], coef.estimate[,3], 
        names=c("theta1", "theta2", "theta3"))
abline(h=theta, col="red")
fig.title <- paste("Empirical law of the parameter estimates 
                    (n=", n , ", n.simu=", n.simu, ")", sep="")
title(fig.title)

screen(3)
boxplot(mu.estimate, xlab="mu")
abline(h=0, col="red")

screen(4)
boxplot(sigma2.estimate, xlab="sigma2")
abline(h=sigma^2, col="red")

close.screen(all = TRUE)  

# ----------------------------------------------------------
# Kriging with non-linear scaling on Xiong et al.'s function
# ----------------------------------------------------------

f11_xiong <- function(x){ 
return( sin(30 * (x - 0.9)^4) * cos(2 * (x - 0.9)) + (x - 0.9) / 2)
}

t <- seq(0, 1, , 300)
f <- f11_xiong(t)

plot(t, f, type = "l", ylim = c(-1,0.6), lwd = 2)

doe <- data.frame(x = seq(0, 1, , 20))
resp <- f11_xiong(doe)

knots <- list(x = c(0, 0.5, 1)) 
eta <- list(c(15, 2, 0.5))
m <- km(design = doe, response = resp, scaling = TRUE, gr = TRUE, 
knots = knots, covtype = "matern5_2",  coef.var = 1, coef.trend = 0)

p <- predict(m, data.frame(x = t), "UK")

plot(t, f, type = "l", ylim = c(-1, 0.6), lwd = 2)

lines(t, p$mean, col = "blue", lty = 2, lwd = 2)
lines(t, p$mean + 2 * p$sd, col = "blue")
lines(t, p$mean - 2 * p$sd, col = "blue")

abline(v = knots[[1]], lty = 2, col = "green")


# -----------------------------------------------------
# Kriging with a symmetric kernel: example with covUser
# -----------------------------------------------------

x <- c(0, 0.15, 0.3, 0.4, 0.5)
y <- c(0.3, -0.2, 0, 0.5, 0.2)

k <- function(x,y) {
  theta <- 0.15
  0.5*exp(-((x-y)/theta)^2) + 0.5*exp(-((1-x-y)/theta)^2)    
}

muser <- km(design=data.frame(x=x), response=data.frame(y=y), 
            coef.trend=0, kernel=k)

u <- seq(from=0, to=1, by=0.01)
puser <- predict(muser, newdata=data.frame(x=u), type="SK")

set.seed(0)
nsim <- 5
zuser <- simulate(muser, nsim=nsim, newdata=data.frame(x=u), cond=TRUE, nugget.sim=1e-8)
par(mfrow=c(1,1))
matplot(u, t(zuser), type="l", lty=rep("solid", nsim), col=1:5, lwd=1)
polygon(c(u, rev(u)), c(puser$upper, rev(puser$lower)), col="lightgrey", border=NA)
lines(u, puser$mean, lwd=5, col="blue", lty="dotted")
matlines(u, t(zuser), type="l", lty=rep("solid", nsim), col=1:5, lwd=1)
points(x, y, pch=19, cex=1.5)

Kriging models class

Description

S4 class for kriging models.

Objects from the Class

To create a km object, use km. See also this function for more details.

Slots

d:: Object of class "integer". The spatial dimension.
n:: Object of class "integer". The number of observations.
X:: Object of class "matrix". The design of experiments.
y:: Object of class "matrix". The vector of response values at design points.
p:: Object of class "integer". The number of basis functions of the linear trend.
F:: Object of class "matrix". The experimental matrix corresponding to the evaluation of the linear trend basis functions at the design of experiments.
trend.formula:: Object of class "formula". A formula specifying the trend as a linear model (no response needed).
trend.coef:: Object of class "numeric". Trend coefficients.
covariance:: Object of class "covTensorProduct". See covTensorProduct-class.
noise.flag:: Object of class "logical". Are the observations noisy?
noise.var:: Object of class "numeric". If the observations are noisy, the vector of noise variances.
known.param:: Object of class "character". Internal use. One of: "None", "All", "CovAndVar" or "Trend".
case:: Object of class "character". Indicates the likelihood to use in estimation (Internal use). One of: "LLconcentration_beta", "LLconcentration_beta_sigma2", "LLconcentration_beta_v_alpha".
param.estim:: Object of class "logical". TRUE if at least one parameter is estimated, FALSE otherwise.
method:: Object of class "character". "MLE" or "PMLE" depending on penalty.
penalty:: Object of class "list". For penalized ML estimation.
optim.method:: Object of class "character". To be chosen between "BFGS" and "gen".
lower:: Object of class "numeric". Lower bounds for covariance parameters estimation.
upper:: Object of class "numeric". Upper bounds for covariance parameters estimation.
control:: Object of class "list". Additional control parameters for covariance parameters estimation.
gr:: Object of class "logical". Do you want analytical gradient to be used ?
call:: Object of class "language". User call reminder.
parinit:: Object of class "numeric". Initial values for covariance parameters estimation.
logLik:: Object of class "numeric". Value of the concentrated log-Likelihood at its optimum.
T:: Object of class "matrix". Triangular matrix delivered by the Choleski decomposition of the covariance matrix.
z:: Object of class "numeric". Auxiliary variable: see computeAuxVariables.
M:: Object of class "matrix". Auxiliary variable: see computeAuxVariables.

Methods

coef: signature(x = "km") Get the coefficients of the km object.
plot: signature(x = "km"): see plot,km-method.
predict: signature(object = "km"): see predict,km-method.
show: signature(object = "km"): see show,km-method.
simulate: signature(object = "km"): see simulate,km-method.

Author(s)

O. Roustant, D. Ginsbourger

Fitting Kriging Models

Description

km1Nugget.init is used to give good initial values to fit kriging models when there is an unknown nugget effect to be estimated.

Usage

km1Nugget.init(model)

Arguments

model

an object of class km.

Details

The procedure can be summarized in 4 stages :

1)	Compute the variogram and deduce a first estimation of the total variance. If an initial value is provided for `nugget`, check its compatibility with the estimated variance. If not, use again the variogram to give a first estimation of the nugget effect.
2)	Simulate several values for the nugget effect and the process variance, around the estimations obtained at stage 1). The number of simulations is the one given in `model@control$pop.size`.
3)	If no initial value is provided for the other covariance parameters, simulate them uniformly inside the domain delimited by `model@lower` and `model@upper`. The number of simulations is the same as in stage 2).
4)	Compute the likelihood at each simulated "point" (variance + nugget effect + other covariance parameters), and take the best(s) one(s). This(these) point(s) gives the first initial value(s). The number of values considered can be set by the argument `multistart` in `km`.

Value

par

a matrix whose rows contain initial vectors of parameters.

value

a vector containing the function values corresponding to par.

cov

a list containing the covariance objects corresponding to par.

lower

upper

vectors containing lower and upper bounds for parameters.

Author(s)

O. Roustant, David Ginsbourger, Ecole des Mines de St-Etienne.

Fit and/or create kriging models

Description

kmData is equivalent to km, except for the interface with the data. In kmData, the user must supply both the design and the response within a single data.frame data. To supply them separately, use km.

Usage

kmData(formula, data, inputnames = NULL, ...)

Arguments

formula

an object of class "formula" specifying the linear trend of the kriging model (see lm). At this stage, transformations of the response are not taken into account.

data

a data.frame containing both the design (input variables) and the response (1-dimensional output given by the objective function at the design points).

inputnames

an optional vector of character containing the names of variables in data to be considered as input variables. By default, all variables but the response are input variables.

...

other arguments for creating or fitting Kriging models, to be taken among the arguments of km function apart from design and response.

Value

An object of class km (see km-class).

Author(s)

O. Roustant

Examples

# a 16-points factorial design, and the corresponding response
d <- 2; n <- 16
design.fact <- expand.grid(x1=seq(0,1,length=4), x2=seq(0,1,length=4))
y <- apply(design.fact, 1, branin)
data <- cbind(design.fact, y=y)

# kriging model 1 : matern5_2 covariance structure, no trend, no nugget effect
m1 <- kmData(y~1, data=data)
# this is equivalent to: m1 <- km(design=design.fact, response=y)

# now, add a second response to data:
data2 <- cbind(data, y2=-y)
# the previous model is now obtained with:
m1_2 <- kmData(y~1, data=data2, inputnames=c("x1", "x2"))

Fitting Kriging Models

Description

kmEstimate is used to fit kriging models. This function should not be called directly, due to the environments defined in km to avoid computing twice nxn matrices. Call km instead.

Usage

kmEstimate(model, envir)

Arguments

model

an object of class km.

envir

an environment specifying where to assign intermediate values for future gradient calculations.

Value

An object of class km.

Author(s)

O. Roustant, D. Ginsbourger, Ecole des Mines de St-Etienne.

References

Park J-S, Baek J. (2001), Efficient computation of maximum likelihood estimators in a spatial linear model with power exponential covariogram, Computer Geosciences, 27, 1-7.

Fitting Kriging Models

Description

kmNoNugget.init is used to give initial values to fit kriging models when there is no nugget effect nor noisy observations.

Usage

kmNoNugget.init(model, fn, fnscale)

Arguments

model

an object of class km.

fn

the function considered: logLikFun or leaveOneOutFun.

fnscale

a real number which sign determines the direction for optimization: <0 for logLikFun, >0 for leaveOneOutFun.

Details

The procedure can be summarized in 2 stages:

1)	If no initial value is provided by the user for the covariance parameters, simulate them uniformly inside the domain delimited by `model@lower` and `model@upper`. The number of simulations is the one given in `model@control$pop.size`.
2)	Compute the likelihood for each parameters set, and select the one(s) that gives the highest value(s). The number of values considered can be set by the argument `multistart` in `km`.

Value

par

a matrix whose rows contain initial vectors of parameters.

value

a vector containing the function values corresponding to par.

cov

a list containing the covariance objects corresponding to par.

lower

upper

vectors containing lower and upper bounds for parameters.

Author(s)

O. Roustant, David Ginsbourger, Ecole des Mines de St-Etienne.

Fitting Kriging Models

Description

kmNuggets.init is used to give initial values to fit kriging models, in presence of noisy observations.

Usage

kmNuggets.init(model)

Arguments

model

an object of class km.

Details

The procedure can be summarized in 4 stages:

1)	Compute the variogram and give a first estimation of the process variance, as well as lower and upper bounds.
2)	Simulate several values for the process variance, around the estimation obtained at stage 1). The number of simulations is the one given in `model@control$pop.size`.
3)	If no initial value is provided for the other covariance parameters, simulate them uniformly inside the domain delimited by `model@lower` and `model@upper`. The number of simulations is the same as in stage 2).
4)	Compute the likelihood at each simulated "point" (variance + other covariance parameters), and take the best one(s). This(these) point(s) gives the first initial value(s). The number of values considered can be set by the argument `multistart` in `km`.

Value

par

a matrix whose rows contain initial vectors of parameters.

value

a vector containing the function values corresponding to par.

cov

a list containing the covariance objects corresponding to par.

lower

upper

vectors containing lower and upper bounds for parameters.

Author(s)

O. Roustant, David Ginsbourger, Ecole des Mines de St-Etienne.

Leave-one-out for a km object

Description

Cross validation by leave-one-out for a km object without noisy observations.

Usage

leaveOneOut.km(model, type, trend.reestim=FALSE)

Arguments

model

an object of class "km" without noisy observations.

type

a character string corresponding to the kriging family, to be chosen between simple kriging ("SK"), or universal kriging ("UK").

trend.reestim

should the trend be reestimated when removing an observation? Default to FALSE.

Details

Leave-one-out (LOO) consists of computing the prediction at a design point when the corresponding observation is removed from the learning set (and this, for all design points). A quick version of LOO based on Dubrule formula is also implemented; It is limited to 2 cases: type=="SK" & (!trend.reestim) and type=="UK" & trend.reestim. Leave-one-out is not implemented yet for noisy observations.

Value

A list composed of

mean

a vector of length n. The ith coordinate is equal to the kriging mean (including the trend) at the ith observation number when removing it from the learning set,

sd

a vector of length n. The ith coordinate is equal to the kriging standard deviation at the ith observation number when removing it from the learning set,

where n is the total number of observations.

Warning

Kriging parameters are not re-estimated when removing one observation. With few points, the re-estimated values can be far from those obtained with the entire learning set. One option is to reestimate the trend coefficients, by setting trend.reestim=TRUE.

Author(s)

O. Roustant, D. Ginsbourger, Ecole des Mines de St-Etienne.

References

N.A.C. Cressie (1993), Statistics for spatial data, Wiley series in probability and mathematical statistics.

O. Dubrule (1983), Cross validation of Kriging in a unique neighborhood. Mathematical Geology, 15, 687-699.

J.D. Martin and T.W. Simpson (2005), Use of kriging models to approximate deterministic computer models, AIAA Journal, 43 no. 4, 853-863.

M. Schonlau (1997), Computer experiments and global optimization, Ph.D. thesis, University of Waterloo.

Leave-one-out least square criterion of a km object

Description

Returns the mean of the squared leave-one-out errors, computed with Dubrule's formula.

Usage

leaveOneOutFun(param, model, envir = NULL)

Arguments

param

a vector containing the optimization variables.

model

an object of class km.

envir

an optional environment specifying where to assign intermediate values for future gradient calculations. Default is NULL.

Value

The mean of the squared leave-one-out errors.

Note

At this stage, only the standard case has been implemented: no nugget effect, no observation noise.

Author(s)

O. Roustant, Ecole des Mines de St-Etienne

References

O. Dubrule (1983), Cross validation of Kriging in a unique neighborhood. Mathematical Geology, 15, 687-699.

Leave-one-out least square criterion - Analytical gradient

Description

Returns the analytical gradient of leaveOneOutFun.

Usage

leaveOneOutGrad(param, model, envir)

Arguments

param

a vector containing the optimization variables.

model

an object of class km.

envir

an environment specifying where to get intermediate values calculated in leaveOneOutFun.

Value

the gradient of leaveOneOutFun at param.

Author(s)

O. Roustant, Ecole des Mines de St-Etienne

References

O. Dubrule (1983), Cross validation of Kriging in a unique neighborhood. Mathematical Geology, 15, 687-699.

log-likelihood of a km object

Description

Returns the log-likelihood value of a km object.

Usage

## S4 method for signature 'km'
logLik(object, ...)

Arguments

object

an object of class km containing the trend and covariance structures.

...

no other argument for this method.

Value

The log likelihood value.