Version: | 3.5-2 |
Date: | 2025-07-22 |
Title: | Exploratory Data Analysis for the 'spatstat' Family |
Maintainer: | Adrian Baddeley <Adrian.Baddeley@curtin.edu.au> |
Depends: | R (≥ 3.5.0), spatstat.data (≥ 3.1-2), spatstat.univar (≥ 3.1-4), spatstat.geom (≥ 3.5-0), spatstat.random (≥ 3.4), stats, graphics, grDevices, utils, methods, nlme |
Imports: | spatstat.utils (≥ 3.1-5), spatstat.sparse (≥ 3.1-0), goftest (≥ 1.2-2), Matrix, abind |
Suggests: | sm, gsl, locfit, spatial, fftwtools (≥ 0.9-8), spatstat.linnet (≥ 3.2-1), spatstat.model (≥ 3.3-1), spatstat (≥ 3.3) |
Description: | Functionality for exploratory data analysis and nonparametric analysis of spatial data, mainly spatial point patterns, in the 'spatstat' family of packages. (Excludes analysis of spatial data on a linear network, which is covered by the separate package 'spatstat.linnet'.) Methods include quadrat counts, K-functions and their simulation envelopes, nearest neighbour distance and empty space statistics, Fry plots, pair correlation function, kernel smoothed intensity, relative risk estimation with cross-validated bandwidth selection, mark correlation functions, segregation indices, mark dependence diagnostics, and kernel estimates of covariate effects. Formal hypothesis tests of random pattern (chi-squared, Kolmogorov-Smirnov, Monte Carlo, Diggle-Cressie-Loosmore-Ford, Dao-Genton, two-stage Monte Carlo) and tests for covariate effects (Cox-Berman-Waller-Lawson, Kolmogorov-Smirnov, ANOVA) are also supported. |
License: | GPL-2 | GPL-3 [expanded from: GPL (≥ 2)] |
URL: | http://spatstat.org/ |
NeedsCompilation: | yes |
ByteCompile: | true |
BugReports: | https://github.com/spatstat/spatstat.explore/issues |
Packaged: | 2025-07-22 02:39:14 UTC; adrian |
Author: | Adrian Baddeley |
Repository: | CRAN |
Date/Publication: | 2025-07-22 04:40:33 UTC |
The spatstat.explore Package
Description
The spatstat.explore package belongs to the spatstat family of packages. It contains the core functionality for exploratory data analysis and nonparametric analysis of spatial data.
Details
spatstat is a family of R packages for the statistical analysis of spatial data. Its main focus is the analysis of spatial patterns of points in two-dimensional space.
The original spatstat package has now been split into several sub-packages.
This sub-package spatstat.explore contains the user-level functions that perform exploratory data analysis and nonparametric data analysis of spatial data.
(The main exception is that functions for linear networks are in the separate sub-package spatstat.linnet.)
Structure of the spatstat family
The original spatstat package grew to be very large. It has now been divided into several sub-packages:
- spatstat.utils, containing basic utilities
- spatstat.sparse, containing linear algebra utilities
- spatstat.data, containing datasets
- spatstat.univar, containing functions for estimating probability distributions of random variables
- spatstat.geom, containing geometrical objects and geometrical operations
- spatstat.explore, containing the functionality for exploratory data analysis and nonparametric analysis of spatial data
- spatstat.model, containing the functionality for statistical modelling, model-fitting, formal statistical inference and informal model diagnostics
- spatstat.linnet, containing functions for spatial data on a linear network
- spatstat, which simply loads the other sub-packages listed above, and provides documentation.
When you install spatstat, these sub-packages are also installed. Then if you load the spatstat package by typing library(spatstat), the other sub-packages listed above will automatically be loaded or imported.
For an overview of all the functions available in the sub-packages of spatstat, see the help file for "spatstat-package" in the spatstat package.
Additionally there are several extension packages:
- spatstat.gui, for interactive graphics
- spatstat.local, for local likelihood (including geographically weighted regression)
- spatstat.Knet, for additional, computationally efficient code for linear networks
- spatstat.sphere (under development), for spatial data on a sphere, including spatial data on the earth's surface
The extension packages must be installed separately and loaded explicitly if needed. They also have separate documentation.
Overview of Functionality in spatstat.explore
The spatstat family of packages is designed to support a complete statistical analysis of spatial data. It supports
creation, manipulation and plotting of point patterns;
exploratory data analysis;
spatial random sampling;
simulation of point process models;
parametric model-fitting;
non-parametric smoothing and regression;
formal inference (hypothesis tests, confidence intervals);
model diagnostics.
For an overview, see the help file for "spatstat-package"
in the spatstat package.
Following is a list of the functionality provided in the spatstat.explore package only.
To simulate a random point pattern:
Functions for generating random point patterns are now contained in the spatstat.random package.
To interrogate a point pattern:
density.ppp | kernel estimation of point pattern intensity |
densityHeat.ppp | diffusion kernel estimation of point pattern intensity |
Smooth.ppp | kernel smoothing of marks of point pattern |
sharpen.ppp | data sharpening |
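For example, a minimal sketch using the cells and longleaf datasets from spatstat.data (the bandwidth values sigma are illustrative choices, not recommendations):
D <- density(cells, sigma=0.1)    # kernel estimate of intensity
plot(D)
M <- Smooth(longleaf, sigma=10)   # kernel smoothing of the numeric marks
plot(M)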
Manipulation of pixel images:
An object of class "im"
represents a pixel image.
blur | apply Gaussian blur to image |
Smooth.im | apply Gaussian blur to image |
transect.im | line transect of image |
pixelcentres | extract centres of pixels |
rnoise | random pixel noise |
Line segment patterns
An object of class "psp"
represents a pattern of straight line
segments.
density.psp | kernel smoothing of line segments |
rpoisline | generate a realisation of the Poisson line process inside a window |
Tessellations
An object of class "tess"
represents a tessellation.
rpoislinetess | generate tessellation using Poisson line process |
Three-dimensional point patterns
An object of class "pp3"
represents a three-dimensional
point pattern in a rectangular box. The box is represented by
an object of class "box3"
.
runifpoint3 | generate uniform random points in 3-D |
rpoispp3 | generate Poisson random points in 3-D |
envelope.pp3 | generate simulation envelopes for 3-D pattern |
Multi-dimensional space-time point patterns
An object of class "ppx"
represents a
point pattern in multi-dimensional space and/or time.
runifpointx | generate uniform random points |
rpoisppx | generate Poisson random points |
Classical exploratory tools:
clarkevans | Clark and Evans aggregation index |
fryplot | Fry plot |
miplot | Morisita Index plot |
Smoothing:
density.ppp | kernel smoothed density/intensity |
relrisk | kernel estimate of relative risk |
Smooth.ppp | spatial interpolation of marks |
bw.diggle | cross-validated bandwidth selection for density.ppp |
bw.ppl | likelihood cross-validated bandwidth selection for density.ppp |
bw.CvL | Cronie-Van Lieshout bandwidth selection for density estimation |
bw.scott | Scott's rule of thumb for density estimation |
bw.abram.ppp | Abramson's rule for adaptive bandwidths |
bw.relrisk | cross-validated bandwidth selection for relrisk |
bw.smoothppp | cross-validated bandwidth selection for Smooth.ppp |
bw.frac | bandwidth selection using window geometry |
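For instance, a minimal sketch of cross-validated bandwidth selection for kernel intensity estimation, using the cells dataset:
b <- bw.diggle(cells)          # Diggle's cross-validated bandwidth
plot(b)                        # inspect the cross-validation criterion
plot(density(cells, sigma=b))  # intensity estimate at the selected bandwidth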
Modern exploratory tools:
clusterset | Allard-Fraley feature detection |
nnclean | Byers-Raftery feature detection |
sharpen.ppp | Choi-Hall data sharpening |
rhohat | Kernel estimate of covariate effect |
rho2hat | Kernel estimate of effect of two covariates |
spatialcdf | Spatial cumulative distribution function |
roc | Receiver operating characteristic curve |
sdr | Sufficient Data Reduction |
thresholdSelect | optimal thresholding of a predictor |
Summary statistics for a point pattern:
Fest | empty space function F |
Gest | nearest neighbour distribution function G |
Jest | J-function J = (1-G)/(1-F) |
Kest | Ripley's K-function |
Lest | Besag L-function |
Tstat | Third order T-function |
allstats | all four functions F, G, J, K |
pcf | pair correlation function |
Kinhom | K for inhomogeneous point patterns |
Linhom | L for inhomogeneous point patterns |
pcfinhom | pair correlation for inhomogeneous patterns |
Finhom | F for inhomogeneous point patterns |
Ginhom | G for inhomogeneous point patterns |
Jinhom | J for inhomogeneous point patterns |
localL | Getis-Franklin neighbourhood density function |
localK | neighbourhood K-function |
localpcf | local pair correlation function |
localKinhom | local K for inhomogeneous point patterns |
localLinhom | local L for inhomogeneous point patterns |
localpcfinhom | local pair correlation for inhomogeneous patterns |
Ksector | Directional K-function |
Kscaled | locally scaled K-function |
Kest.fft | fast K-function using FFT for large datasets |
Kmeasure | reduced second moment measure |
envelope | simulation envelopes for a summary function |
varblock | variances and confidence intervals for a summary function |
lohboot | bootstrap for a summary function |
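For example, a minimal sketch estimating the K-function and pointwise simulation envelopes under complete spatial randomness, using the cells dataset (nsim=39 gives a two-sided pointwise test at the 5% level):
K <- Kest(cells)                     # Ripley's K-function with edge corrections
plot(K)
E <- envelope(cells, Kest, nsim=39)  # envelopes from 39 CSR simulations
plot(E)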
Selecting the bandwidth for kernel estimation of the summary function:
bw.stoyan | Stoyan's rule of thumb for bandwidth for pcf |
bw.pcf | cross-validated bandwidth selection for pcf |
bw.pcfinhom | cross-validated bandwidth selection for pcfinhom |
bw.bdh | Adjusted Stoyan rule of thumb for bandwidth for pcfinhom |
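For instance, a minimal sketch using Stoyan's rule of thumb to choose the kernel bandwidth for the pair correlation function of the cells dataset:
bw <- bw.stoyan(cells)   # Stoyan's rule-of-thumb bandwidth
g <- pcf(cells, bw=bw)   # pair correlation function estimate
plot(g)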
Related facilities:
plot.fv | plot a summary function |
eval.fv | evaluate any expression involving summary functions |
harmonise.fv | make functions compatible |
eval.fasp | evaluate any expression involving an array of functions |
with.fv | evaluate an expression for a summary function |
Smooth.fv | apply smoothing to a summary function |
deriv.fv | calculate derivative of a summary function |
pool.fv | pool several estimates of a summary function |
density.ppp | kernel smoothed density |
densityHeat.ppp | diffusion kernel smoothed density |
Smooth.ppp | spatial interpolation of marks |
relrisk | kernel estimate of relative risk |
sharpen.ppp | data sharpening |
rknn | theoretical distribution of nearest neighbour distance |
Summary statistics for a multitype point pattern:
A multitype point pattern is represented by an object X
of class "ppp"
such that marks(X)
is a factor.
relrisk | kernel estimation of relative risk |
scan.test | spatial scan test of elevated risk |
Gcross, Gdot, Gmulti | multitype nearest neighbour distributions G_{ij}, G_{i\bullet} |
Kcross, Kdot, Kmulti | multitype K-functions K_{ij}, K_{i\bullet} |
Lcross, Ldot | multitype L-functions L_{ij}, L_{i\bullet} |
Jcross, Jdot, Jmulti | multitype J-functions J_{ij}, J_{i\bullet} |
pcfcross | multitype pair correlation function g_{ij} |
pcfdot | multitype pair correlation function g_{i\bullet} |
pcfmulti | general pair correlation function |
markconnect | marked connection function p_{ij} |
alltypes | estimates of the above for all i,j pairs |
Iest | multitype I-function |
Kcross.inhom, Kdot.inhom | inhomogeneous counterparts of Kcross, Kdot |
Lcross.inhom, Ldot.inhom | inhomogeneous counterparts of Lcross, Ldot |
pcfcross.inhom, pcfdot.inhom | inhomogeneous counterparts of pcfcross, pcfdot |
localKcross, localKdot | local counterparts of Kcross, Kdot |
localLcross, localLdot | local counterparts of Lcross, Ldot |
localKcross.inhom, localLcross.inhom | local counterparts of Kcross.inhom, Lcross.inhom |
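For example, a minimal sketch for the amacrine cells dataset from spatstat.data, which has two types of points ("on" and "off"):
Kc <- Kcross(amacrine, "on", "off")   # K-function from 'on' to 'off' cells
plot(Kc)
a <- alltypes(amacrine, "G")          # array of G_{ij} for all pairs of types
plot(a)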
Summary statistics for a marked point pattern:
A marked point pattern is represented by an object X
of class "ppp"
with a component X$marks
.
The entries in the vector X$marks
may be numeric, complex,
string or any other atomic type. For numeric marks, there are the
following functions:
markmean | smoothed local average of marks |
markvar | smoothed local variance of marks |
markcorr | mark correlation function |
markcrosscorr | mark cross-correlation function |
markvario | mark variogram |
markmarkscatter | mark-mark scatterplot |
Kmark | mark-weighted K function |
Emark | mark independence diagnostic E(r) |
Vmark | mark independence diagnostic V(r) |
nnmean | nearest neighbour mean index |
nnvario | nearest neighbour mark variance index |
For marks of any type, there are the following:
Gmulti | multitype nearest neighbour distribution |
Kmulti | multitype K-function |
Jmulti | multitype J-function |
Alternatively use cut.ppp
to convert a marked point pattern
to a multitype point pattern.
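For example, a minimal sketch using the longleaf dataset from spatstat.data, whose marks are tree diameters (the choice of 3 size classes is illustrative):
mc <- markcorr(longleaf)     # mark correlation function for numeric marks
plot(mc)
Y <- cut(longleaf, breaks=3) # convert to a multitype pattern with 3 size classes
table(marks(Y))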
Programming tools:
marktable | tabulate the marks of neighbours in a point pattern |
Summary statistics for a three-dimensional point pattern:
These are for 3-dimensional point pattern objects (class pp3
).
F3est | empty space function F |
G3est | nearest neighbour function G |
K3est | K -function |
pcf3est | pair correlation function |
Related facilities:
envelope.pp3 | simulation envelopes |
Summary statistics for random sets:
These work for point patterns (class "ppp"), line segment patterns (class "psp") or windows (class "owin").
Hest | spherical contact distribution H |
Gfox | Foxall G-function |
Jfox | Foxall J-function |
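For example, a minimal sketch estimating the spherical contact distribution of a random set, using the heather dataset from spatstat.data (a binary mask window):
H <- Hest(heather$coarse)   # spherical contact distribution of the heather mosaic
plot(H)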
Model fitting
Functions for fitting point process models are now contained in the spatstat.model package.
Simulation
There are many ways to generate a random point pattern, line segment pattern, pixel image or tessellation in spatstat.
Random point patterns: Functions for random generation are now contained in the spatstat.random package.
See also varblock
for estimating the variance
of a summary statistic by block resampling, and
lohboot
for another bootstrap technique.
Fitted point process models:
If you have fitted a point process model to a point pattern dataset, the fitted model can be simulated.
Methods for simulating a fitted model are now contained in the spatstat.model package.
Other random patterns: Functions for random generation are now contained in the spatstat.random package.
Simulation-based inference
envelope | critical envelope for Monte Carlo test of goodness-of-fit |
bits.envelope | critical envelope for balanced two-stage Monte Carlo test |
qqplot.ppm | diagnostic plot for interpoint interaction |
scan.test | spatial scan statistic/test |
studpermu.test | studentised permutation test |
segregation.test | test of segregation of types |
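For example, a minimal sketch of a Monte Carlo goodness-of-fit test for the cells dataset (savefuns=TRUE retains the simulated functions so that dclf.test can be applied to the envelope object):
E <- envelope(cells, Kest, nsim=39, savefuns=TRUE)  # CSR simulation envelopes
plot(E)
dclf.test(E)   # Diggle-Cressie-Loosmore-Ford test using the saved simulations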
Manipulation of envelope objects:
as.data.frame.envelope | convert to data frame |
with.fv | calculations with column(s) of data |
eval.fv | calculations with all columns of data |
plot.envelope | plot envelope |
summary.envelope | print summary information |
pool.envelope | pool data from several envelopes |
ptwise.envelope | compute pointwise statistics |
bias.envelope | pointwise bias |
RMSE.envelope | pointwise root mean square error |
MISE.envelope | mean integrated squared error |
ISB.envelope | integrated squared bias |
IV.envelope | integrated variance |
ISE.envelope | integrated squared error |
Hypothesis tests:
quadrat.test | \chi^2 goodness-of-fit test on quadrat counts |
clarkevans.test | Clark and Evans test |
cdf.test | Spatial distribution goodness-of-fit test |
berman.test | Berman's goodness-of-fit tests |
envelope | critical envelope for Monte Carlo test of goodness-of-fit |
scan.test | spatial scan statistic/test |
dclf.test | Diggle-Cressie-Loosmore-Ford test |
mad.test | Mean Absolute Deviation test |
anova.ppm | Analysis of Deviance for point process models |
More recently-developed tests:
dg.test | Dao-Genton test |
bits.test | Balanced independent two-stage test |
dclf.progress | Progress plot for DCLF test |
mad.progress | Progress plot for MAD test |
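For instance, a minimal sketch applying two classical tests of complete spatial randomness to the cells dataset (the 3 x 3 quadrat grid is an illustrative choice):
quadrat.test(cells, nx=3)   # chi-squared test on 3 x 3 quadrat counts
clarkevans.test(cells)      # Clark-Evans test based on nearest neighbour distances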
Model diagnostics:
Classical measures of model sensitivity such as leverage and influence, and classical model diagnostic tools such as residuals, partial residuals, and effect estimates, have been adapted to point process models. These capabilities are now provided in the spatstat.model package.
Resampling and randomisation procedures
You can build your own tests based on randomisation and resampling using the following capabilities:
quadratresample | block resampling |
rshift | random shifting of (subsets of) points |
rthin | random thinning |
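For example, a minimal sketch of randomisation operations on the cells dataset (the retention probability and shift radius are illustrative):
X1 <- rthin(cells, P=0.5)        # keep each point independently with probability 0.5
X2 <- rshift(cells, radius=0.05) # random shift of the points, with toroidal edge correction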
Licence
This library and its documentation are usable under the terms of the "GNU General Public License", a copy of which is distributed with the package.
Acknowledgements
Kasper Klitgaard Berthelsen, Ottmar Cronie, Tilman Davies, Julian Gilbey, Yongtao Guan, Ute Hahn, Kassel Hingee, Abdollah Jalilian, Marie-Colette van Lieshout, Greg McSwiggan, Tuomas Rajala, Suman Rakshit, Dominic Schuhmacher, Rasmus Waagepetersen and Hangsheng Wang made substantial contributions of code.
For comments, corrections, bug alerts and suggestions, we thank Monsuru Adepeju, Corey Anderson, Ang Qi Wei, Ryan Arellano, Jens Astrom, Robert Aue, Marcel Austenfeld, Sandro Azaele, Malissa Baddeley, Guy Bayegnak, Colin Beale, Melanie Bell, Thomas Bendtsen, Ricardo Bernhardt, Andrew Bevan, Brad Biggerstaff, Anders Bilgrau, Leanne Bischof, Christophe Biscio, Roger Bivand, Jose M. Blanco Moreno, Florent Bonneu, Jordan Brown, Ian Buller, Julian Burgos, Simon Byers, Ya-Mei Chang, Jianbao Chen, Igor Chernayavsky, Y.C. Chin, Bjarke Christensen, Lucia Cobo Sanchez, Jean-Francois Coeurjolly, Kim Colyvas, Hadrien Commenges, Rochelle Constantine, Robin Corria Ainslie, Richard Cotton, Marcelino de la Cruz, Peter Dalgaard, Mario D'Antuono, Sourav Das, Peter Diggle, Patrick Donnelly, Ian Dryden, Stephen Eglen, Ahmed El-Gabbas, Belarmain Fandohan, Olivier Flores, David Ford, Peter Forbes, Shane Frank, Janet Franklin, Funwi-Gabga Neba, Oscar Garcia, Agnes Gault, Jonas Geldmann, Marc Genton, Shaaban Ghalandarayeshi, Jason Goldstick, Pavel Grabarnik, C. Graf, Ute Hahn, Andrew Hardegen, Martin Bogsted Hansen, Martin Hazelton, Juha Heikkinen, Mandy Hering, Markus Herrmann, Maximilian Hesselbarth, Paul Hewson, Hamidreza Heydarian, Kurt Hornik, Philipp Hunziker, Jack Hywood, Ross Ihaka, Cenk Icos, Aruna Jammalamadaka, Robert John-Chandran, Devin Johnson, Mahdieh Khanmohammadi, Bob Klaver, Lily Kozmian-Ledward, Peter Kovesi, Mike Kuhn, Jeff Laake, Robert Lamb, Frederic Lavancier, Tom Lawrence, Tomas Lazauskas, Jonathan Lee, George Leser, Angela Li, Li Haitao, George Limitsios, Andrew Lister, Nestor Luambua, Ben Madin, Martin Maechler, Kiran Marchikanti, Jeff Marcus, Robert Mark, Peter McCullagh, Monia Mahling, Jorge Mateu Mahiques, Ulf Mehlig, Frederico Mestre, Sebastian Wastl Meyer, Mi Xiangcheng, Lore De Middeleer, Robin Milne, Enrique Miranda, Jesper Moller, Annie Mollie, Ines Moncada, Mehdi Moradi, Virginia Morera Pujol, Erika Mudrak, Gopalan Nair, Nader Najari, Nicoletta Nava, Linda Stougaard Nielsen, Felipe Nunes, Jens Randel Nyengaard, Jens Oehlschlaegel, Thierry Onkelinx, Sean O'Riordan, Evgeni Parilov, Jeff Picka, Nicolas Picard, Tim Pollington, Mike Porter, Sergiy Protsiv, Adrian Raftery, Ben Ramage, Pablo Ramon, Xavier Raynaud, Nicholas Read, Matt Reiter, Ian Renner, Tom Richardson, Brian Ripley, Ted Rosenbaum, Barry Rowlingson, Jason Rudokas, Tyler Rudolph, John Rudge, Christopher Ryan, Farzaneh Safavimanesh, Aila Sarkka, Cody Schank, Katja Schladitz, Sebastian Schutte, Bryan Scott, Olivia Semboli, Francois Semecurbe, Vadim Shcherbakov, Shen Guochun, Shi Peijian, Harold-Jeffrey Ship, Tammy L Silva, Ida-Maria Sintorn, Yong Song, Malte Spiess, Mark Stevenson, Kaspar Stucki, Jan Sulavik, Michael Sumner, P. Surovy, Ben Taylor, Thordis Linda Thorarinsdottir, Leigh Torres, Berwin Turlach, Torben Tvedebrink, Kevin Ummer, Medha Uppala, Andrew van Burgel, Tobias Verbeke, Mikko Vihtakari, Alexendre Villers, Fabrice Vinatier, Maximilian Vogtland, Sasha Voss, Sven Wagner, Hao Wang, H. Wendrock, Jan Wild, Carl G. Witthoft, Selene Wong, Maxime Woringer, Luke Yates, Mike Zamboni and Achim Zeileis.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
Diagnostics for random marking
Description
Estimate the summary functions E(r)
and V(r)
for
a marked point pattern, proposed by Schlather et al (2004) as diagnostics
for dependence between the points and the marks.
Usage
Emark(X, r=NULL,
correction=c("isotropic", "Ripley", "translate"),
method="density", ..., normalise=FALSE)
Vmark(X, r=NULL,
correction=c("isotropic", "Ripley", "translate"),
method="density", ..., normalise=FALSE)
Arguments
X |
The observed point pattern. An object of class "ppp", or data acceptable to as.ppp. It must be a marked point pattern with numeric marks. |
r |
Optional. Numeric vector. The values of the argument r at which E(r) and V(r) should be evaluated. |
correction |
A character vector containing any selection of the options "isotropic", "Ripley" or "translate". It specifies the edge correction(s) to be applied. |
method |
A character vector indicating the user's choice of density estimation technique to be used. Options are "density", "loess", "sm" and "smrep". |
... |
Arguments passed to the density estimation routine (density, loess or sm.density) selected by method. |
normalise |
If |
Details
For a marked point process,
Schlather et al (2004) defined the functions
E(r)
and V(r)
to be the conditional mean
and conditional variance of the mark attached to a
typical random point, given that there exists another random
point at a distance r
away from it.
More formally,
E(r) = E_{0u}[M(0)]
and
V(r) = E_{0u}[(M(0) - E(r))^2]
where E_{0u} denotes the conditional expectation given that there are points of the process at the locations 0 and u separated by a distance r, and where M(0) denotes the mark attached to the point 0.
These functions may serve as diagnostics for dependence between the points and the marks. If the points and marks are independent, then E(r) and V(r) should be constant (not depending on r). See Schlather et al (2004).
The argument X
must be a point pattern (object of class
"ppp"
) or any data that are acceptable to as.ppp
.
It must be a marked point pattern with numeric marks.
The argument r is the vector of values for the distance r at which E(r) and V(r) are estimated.
This algorithm assumes that X
can be treated
as a realisation of a stationary (spatially homogeneous)
random spatial point process in the plane, observed through
a bounded window.
The window (which is specified in X
as Window(X)
)
may have arbitrary shape.
Biases due to edge effects are
treated in the same manner as in Kest
.
The edge corrections implemented here are
- isotropic/Ripley: Ripley's isotropic correction (see Ripley, 1988; Ohser, 1983). This is implemented only for rectangular and polygonal windows (not for binary masks).
- translate: Translation correction (Ohser, 1983). Implemented for all window geometries, but slow for complex windows.
Note that the estimator assumes the process is stationary (spatially homogeneous).
The conditional expectations in the definitions above are estimated using density estimation techniques. The user can choose between
- "density": uses the standard kernel density estimation routine density, and works only for evenly-spaced r values;
- "loess": uses the function loess in the package stats;
- "sm": uses the function sm.density in the package sm, and is extremely slow;
- "smrep": uses the function sm.density in the package sm, and is relatively fast, but may require manual control of the smoothing parameter hmult.
Value
If marks(X)
is a numeric vector, the result is
an object of class "fv"
(see fv.object
).
If marks(X)
is a data frame, the result is
a list of objects of class "fv"
, one for each column of marks.
An object of class "fv"
is essentially
a data frame containing numeric columns
r |
the values of the argument r at which the function has been estimated |
theo |
the theoretical, constant value of E(r) or V(r) when the marks attached to different points are independent |
together with a column or columns named
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function E(r)
or V(r)
obtained by the edge corrections named.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Schlather, M., Ribeiro, P.J. and Diggle, P.J. (2004) Detecting dependence between marks and locations of marked point processes. Journal of the Royal Statistical Society, Series B 66, 79-83.
See Also
Mark correlation markcorr
,
mark variogram markvario
for numeric marks.
Mark connection function markconnect
and
multitype K-functions Kcross
, Kdot
for factor-valued marks.
Examples
plot(Emark(spruces))
E <- Emark(spruces, method="density", kernel="epanechnikov")
plot(Vmark(spruces))
plot(Emark(finpines))
V <- Vmark(finpines)
Extract Subset of Function Array
Description
Extract a subset of a function array (an object of class
"fasp"
).
Usage
## S3 method for class 'fasp'
x[I, J, drop=TRUE,...]
Arguments
x |
A function array. An object of class |
I |
any valid expression for a subset of the row indices of the array. |
J |
any valid expression for a subset of the column indices of the array. |
drop |
Logical.
When the selected subset consists of only one cell of the array,
if |
... |
Ignored. |
Details
A function array can be regarded as a matrix whose entries
are functions. See fasp.object
for an explanation of
function arrays.
This routine extracts a sub-array according to the usual conventions for matrix indexing.
Value
A function array (of class "fasp"
).
Exceptionally, if the array has only one cell, and
if drop=TRUE
, then the result is a function value table
(class "fv"
).
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk
See Also
Examples
online <- interactive()
# Lansing woods data - multitype points with 6 types
X <- lansing
if(!online) {
# subsample data (from 2251 to 450 points) to shorten check time
X <- X[c(FALSE,FALSE,FALSE,FALSE,TRUE)]
}
a <- alltypes(X, 'K')
# extract first three marks only
b <- a[1:3,1:3]
if(online) {plot(b)}
# subset of array pertaining to hickories
h <- a["hickory", ]
if(online) {plot(h)}
Extract or Replace Subset of Function Values
Description
Extract or replace a subset of an object of class "fv"
.
Usage
## S3 method for class 'fv'
x[i, j, ..., drop=FALSE]
## S3 replacement method for class 'fv'
x[i, j] <- value
## S3 replacement method for class 'fv'
x$name <- value
Arguments
x |
a function value object,
of class |
i |
any appropriate subset index.
Selects a subset of the rows of the data frame, i.e.
a subset of the domain of the function(s) represented by |
j |
any appropriate subset index for the columns of the data frame.
Selects some of the functions present in |
name |
the name of a column of the data frame. |
... |
Ignored. |
drop |
Logical. If |
value |
Replacement value for the column or columns selected by
|
Details
These functions extract a designated subset of an object of class
"fv"
, or replace the designated subset with other data,
or delete the designated subset.
The subset is specified by the
row index i
and column index j
, or
by the column name name
. Either i
or j
may be missing, or both may be missing.
The function [.fv
is a method for the generic operator
[
for the class "fv"
. It extracts the
designated subset of x
, and returns it as
another object of class "fv"
(if drop=FALSE
)
or as a data frame or vector (if drop=TRUE
).
The function [<-.fv
is a method for the generic operator
[<-
for the class "fv"
.
If value
is NULL
, the designated subset of x
will be
deleted from x
.
Otherwise, the designated subset of x
will be
replaced by the data contained in value
.
The return value is the modified object x
.
The function $<-.fv
is a method for the generic operator
$<-
for the class "fv"
.
If value
is NULL
, the designated column of x
will be
deleted from x
.
Otherwise, the designated column of x
will be
replaced by the data contained in value
.
The return value is the modified object x
.
Value
The result of [.fv
with drop=TRUE
is a data frame or vector.
Otherwise, the result is another object of class "fv"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Examples
K <- Kest(cells)
# discard the estimates of K(r) for r > 0.1
Ksub <- K[K$r <= 0.1, ]
# extract the border method estimates
bor <- K[ , "border", drop=TRUE]
# or equivalently
bor <- K$border
# remove the border-method estimates
K$border <- NULL
K
Empty Space Function of a Three-Dimensional Point Pattern
Description
Estimates the empty space function F_3(r)
from
a three-dimensional point pattern.
Usage
F3est(X, ..., rmax = NULL, nrval = 128, vside = NULL,
correction = c("rs", "km", "cs"),
sphere = c("fudge", "ideal", "digital"))
Arguments
X |
Three-dimensional point pattern (object of class |
... |
Ignored. |
rmax |
Optional. Maximum value of argument |
nrval |
Optional. Number of values of |
vside |
Optional. Side length of the voxels in the discrete approximation. |
correction |
Optional. Character vector specifying the edge correction(s) to be applied. See Details. |
sphere |
Optional. Character string specifying how to calculate the
theoretical value of |
Details
For a stationary point process \Phi
in three-dimensional
space, the empty space function is
F_3(r) = P(d(0,\Phi) \le r)
where d(0,\Phi)
denotes the distance from a fixed
origin 0
to the nearest point of \Phi
.
The three-dimensional point pattern X
is assumed to be a
partial realisation of a stationary point process \Phi
.
The empty space function of \Phi
can then be estimated using
techniques described in the References.
The box containing the point
pattern is discretised into cubic voxels of side length vside
.
The distance function d(u,\Phi)
is computed for
every voxel centre point
u
using a three-dimensional version of the distance transform
algorithm (Borgefors, 1986). The empirical cumulative distribution
function of these values, with appropriate edge corrections, is the
estimate of F_3(r)
.
The available edge corrections are:
- "rs": the reduced sample (aka minus sampling, border correction) estimator (Baddeley et al, 1993)
- "km": the three-dimensional version of the Kaplan-Meier estimator (Baddeley and Gill, 1997)
- "cs": the three-dimensional generalisation of the Chiu-Stoyan or Hanisch estimator (Chiu and Stoyan, 1998).
Alternatively correction="all" selects all options.
The result includes a column theo
giving the
theoretical value of F_3(r)
for
a uniform Poisson process (Complete Spatial Randomness).
This value depends on the volume of the sphere of radius r
measured in the discretised distance metric.
The argument sphere
determines how this will be calculated.
- If sphere="ideal", the calculation will use the volume of an ideal sphere of radius r, namely (4/3) \pi r^3. This is not recommended because the theoretical values of F_3(r) are then inaccurate.
- If sphere="fudge", the volume of the ideal sphere will be multiplied by 0.78, which gives the approximate volume of the sphere in the discretised distance metric.
- If sphere="digital", the volume of the sphere in the discretised distance metric is computed exactly using another distance transform. This takes longer to compute, but is exact.
Value
A function value table (object of class "fv"
) that can be
plotted, printed or coerced to a data frame containing the function values.
Warnings
A small value of vside
and a large value of nrval
are required for reasonable accuracy.
The default value of vside
ensures that the total number of
voxels is 2^22
or about 4 million.
To change the default number of voxels, see
spatstat.options("nvoxel")
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rana Moyeed.
References
Baddeley, A.J, Moyeed, R.A., Howard, C.V. and Boyde, A. Analysis of a three-dimensional point pattern with replication. Applied Statistics 42 (1993) 641–668.
Baddeley, A.J. and Gill, R.D. (1997) Kaplan-Meier estimators of interpoint distance distributions for spatial point processes. Annals of Statistics 25, 263–292.
Borgefors, G. (1986) Distance transformations in digital images. Computer Vision, Graphics and Image Processing 34, 344–371.
Chiu, S.N. and Stoyan, D. (1998) Estimators of distance distributions for spatial patterns. Statistica Neerlandica 52, 239–246.
See Also
pp3
to create a three-dimensional point
pattern (object of class "pp3"
).
G3est
,
K3est
,
pcf3est
for other summary functions of
a three-dimensional point pattern.
Fest
to estimate the empty space function of
point patterns in two dimensions.
Examples
X <- rpoispp3(42)
Z <- F3est(X)
if(interactive()) plot(Z)
Estimate the Empty Space Function or its Hazard Rate
Description
Estimates the empty space function F(r)
or its hazard rate h(r)
from a point pattern in a
window of arbitrary shape.
Usage
Fest(X, ..., eps, r=NULL, breaks=NULL,
correction=c("rs", "km", "cs"),
domain=NULL)
Fhazard(X, ...)
Arguments
X |
The observed point pattern,
from which an estimate of |
... |
Extra arguments, passed from |
eps |
Optional. A positive number. The resolution of the discrete approximation to Euclidean distance (see below). There is a sensible default. |
r |
Optional. Numeric vector. The values of the argument |
breaks |
This argument is for internal use only. |
correction |
Optional.
The edge correction(s) to be used to estimate |
domain |
Optional. Calculations will be restricted to this subset of the window. See Details. |
Details
Fest
computes an estimate of the empty space function F(r)
,
and Fhazard
computes an estimate of its hazard rate h(r)
.
The empty space function
(also called the “spherical contact distribution”
or the “point-to-nearest-event” distribution)
of a stationary point process X
is the cumulative distribution function F
of the distance
from a fixed point in space to the nearest point of X
.
An estimate of F
derived from a spatial point pattern dataset
can be used in exploratory data analysis and formal inference
about the pattern (Cressie, 1991; Diggle, 1983; Ripley, 1988).
In exploratory analyses, the estimate of F
is a useful statistic
summarising the sizes of gaps in the pattern.
For inferential purposes, the estimate of F
is usually compared to the
true value of F
for a completely random (Poisson) point process,
which is
F(r) = 1 - e^{ - \lambda \pi r^2}
where \lambda
is the intensity (expected number of points per unit area).
Deviations between the empirical and theoretical F
curves
may suggest spatial clustering or spatial regularity.
This algorithm estimates the empty space function F
from the point pattern X
. It assumes that X
can be treated
as a realisation of a stationary (spatially homogeneous)
random spatial point process in the plane, observed through
a bounded window.
The window (which is specified in X
) may have arbitrary shape.
The argument X
is interpreted as a point pattern object
(of class "ppp"
, see ppp.object
) and can
be supplied in any of the formats recognised
by as.ppp
.
The algorithm uses two discrete approximations which are controlled
by the parameter eps
and by the spacing of values of r
respectively. (See below for details.)
First-time users are strongly advised not to specify these arguments.
The estimation of F
is hampered by edge effects arising from
the unobservability of points of the random pattern outside the window.
An edge correction is needed to reduce bias (Baddeley, 1998; Ripley, 1988).
The edge corrections implemented here are the border method or
"reduced sample" estimator, the spatial Kaplan-Meier estimator
(Baddeley and Gill, 1997) and the Chiu-Stoyan estimator (Chiu and
Stoyan, 1998).
Our implementation makes essential use of the distance transform algorithm of image processing (Borgefors, 1986). A fine grid of pixels is created in the observation window. The Euclidean distance between two pixels is approximated by the length of the shortest path joining them in the grid, where a path is a sequence of steps between adjacent pixels, and horizontal, vertical and diagonal steps have length 1, 1 and \sqrt{2} respectively in pixel units. If the pixel grid is sufficiently fine then this is an accurate approximation.
The parameter eps
is the pixel width of the rectangular raster
used to compute the distance transform (see below). It must not be too
large: the absolute error in distance values due to discretisation is bounded
by eps
.
If eps
is not specified, the function
checks whether the window Window(X)
contains pixel raster
information. If so, then eps
is set equal to the
pixel width of the raster; otherwise, eps
defaults to 1/100 of the width of the observation window.
The argument r
is the vector of values for the
distance r
at which F(r)
should be evaluated.
It is also used to determine the breakpoints
(in the sense of hist
)
for the computation of histograms of distances. The
estimators are computed from histogram counts.
This introduces a discretisation
error which is controlled by the fineness of the breakpoints.
First-time users would be strongly advised not to specify r
.
However, if it is specified, r
must satisfy r[1] = 0
,
and max(r)
must be larger than the radius of the largest disc
contained in the window. Furthermore, the spacing of successive
r
values must be very fine (ideally not greater than eps/4
).
The algorithm also returns an estimate of the hazard rate function,
h(r)
of F(r)
. The hazard rate is
defined by
h(r) = - \frac{d}{dr} \log(1 - F(r))
The hazard rate of F
has been proposed as a useful
exploratory statistic (Baddeley and Gill, 1994).
The estimate of h(r)
given here
is a discrete approximation to the hazard rate of the
Kaplan-Meier estimator of F
. Note that F
is
absolutely continuous (for any stationary point process X
),
so the hazard function always exists (Baddeley and Gill, 1997).
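For example, a minimal sketch estimating the hazard rate for the cells dataset, followed by the smoothing recommended in the Warnings section below:
Fh <- Fhazard(cells)   # Kaplan-Meier-based estimate of the hazard rate h(r)
plot(Fh)
plot(Smooth(Fh))       # kernel-smoothed version, to damp discretisation effects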
If the argument domain
is given, the estimate of F(r)
will be based only on the empty space distances
measured from locations inside domain
(although their
nearest data points may lie outside domain
).
This is useful in bootstrap techniques. The argument domain
should be a window (object of class "owin"
) or something acceptable to
as.owin
. It must be a subset of the
window of the point pattern X
.
The naive empirical distribution of distances from each location
in the window to the nearest point of the data pattern, is a biased
estimate of F
. However this is also returned by the algorithm
(if correction="none"
),
as it is sometimes useful in other contexts.
Care should be taken not to use the uncorrected
empirical F
as if it were an unbiased estimator of F
.
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
The result of Fest
is
essentially a data frame containing up to seven columns:
r |
the values of the argument |
rs |
the “reduced sample” or “border correction”
estimator of |
km |
the spatial Kaplan-Meier estimator of |
hazard |
the hazard rate |
cs |
the Chiu-Stoyan estimator of |
raw |
the uncorrected estimate of |
theo |
the theoretical value of |
The result of Fhazard
contains only three columns
r |
the values of the argument |
hazard |
the spatial Kaplan-Meier estimate of the
hazard rate |
theo |
the theoretical value of |
Warnings
The reduced sample (border method)
estimator of F
is pointwise approximately
unbiased, but need not be a valid distribution function; it may
not be a nondecreasing function of r
. Its range is always
within [0,1]
.
The spatial Kaplan-Meier estimator of F
is always nondecreasing
but its maximum value may be less than 1
.
The estimate of hazard rate h(r)
returned by the algorithm is an approximately
unbiased estimate for the integral of h()
over the corresponding histogram cell.
It may exhibit oscillations due to discretisation effects.
We recommend modest smoothing, such as kernel smoothing with
kernel width equal to the width of a histogram cell,
using Smooth.fv
.
Note
Sizeable amounts of memory may be needed during the calculation.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
References
Baddeley, A.J. Spatial sampling and censoring. In O.E. Barndorff-Nielsen, W.S. Kendall and M.N.M. van Lieshout (eds) Stochastic Geometry: Likelihood and Computation. Chapman and Hall, 1998. Chapter 2, pages 37-78.
Baddeley, A.J. and Gill, R.D. The empty space hazard of a spatial pattern. Research Report 1994/3, Department of Mathematics, University of Western Australia, May 1994.
Baddeley, A.J. and Gill, R.D. Kaplan-Meier estimators of interpoint distance distributions for spatial point processes. Annals of Statistics 25 (1997) 263-292.
Borgefors, G. Distance transformations in digital images. Computer Vision, Graphics and Image Processing 34 (1986) 344-371.
Chiu, S.N. and Stoyan, D. (1998) Estimators of distance distributions for spatial patterns. Statistica Neerlandica 52, 239–246.
Cressie, N.A.C. Statistics for spatial data. John Wiley and Sons, 1991.
Diggle, P.J. Statistical analysis of spatial point patterns. Academic Press, 1983.
Ripley, B.D. Statistical inference for spatial processes. Cambridge University Press, 1988.
Stoyan, D, Kendall, W.S. and Mecke, J. Stochastic geometry and its applications. 2nd edition. Springer Verlag, 1995.
See Also
Gest
,
Jest
,
Kest
,
km.rs
,
reduced.sample
,
kaplan.meier
Examples
Fc <- Fest(cells, 0.01)
# Tip: don't use F for the left hand side!
# That's an abbreviation for FALSE
plot(Fc)
# P-P style plot
plot(Fc, cbind(km, theo) ~ theo)
# The empirical F is above the Poisson F
# indicating an inhibited pattern
if(interactive()) {
plot(Fc, . ~ theo)
plot(Fc, asin(sqrt(.)) ~ asin(sqrt(theo)))
}
Inhomogeneous Empty Space Function
Description
Estimates the inhomogeneous empty space function of a non-stationary point pattern.
Usage
Finhom(X, lambda = NULL, lmin = NULL, ...,
sigma = NULL, varcov = NULL,
r = NULL, breaks = NULL, ratio = FALSE,
update = TRUE, warn.bias=TRUE, savelambda=FALSE)
Arguments
X |
The observed data point pattern,
from which an estimate of the inhomogeneous |
lambda |
Optional.
Values of the estimated intensity function.
Either a vector giving the intensity values
at the points of the pattern |
lmin |
Optional. The minimum possible value of the intensity over the spatial domain. A positive numerical value. |
sigma , varcov |
Optional arguments passed to |
... |
Extra arguments passed to |
r |
vector of values for the argument |
breaks |
This argument is for internal use only. |
ratio |
Logical.
If |
update |
Logical. If |
warn.bias |
Logical value specifying whether to issue a warning when the inhomogeneity correction factor takes extreme values, which can often lead to biased results. This usually occurs when insufficient smoothing is used to estimate the intensity. |
savelambda |
Logical value specifying whether to save the values of
|
Details
This command computes estimates of the
inhomogeneous F
-function (van Lieshout, 2010)
of a point pattern. It is the counterpart, for inhomogeneous
spatial point patterns, of the empty space function F
for homogeneous point patterns computed by Fest
.
The argument X
should be a point pattern
(object of class "ppp"
).
The inhomogeneous F
function is computed
using the border correction, equation (6) in Van Lieshout (2010).
The argument lambda
should supply the
(estimated) values of the intensity function \lambda
of the point process. It may be either
- a numeric vector, containing the values of the intensity function at the points of the pattern X.
- a pixel image (object of class "im"), assumed to contain the values of the intensity function at all locations in the window.
- a fitted point process model (object of class "ppm" or "kppm") whose fitted trend can be used as the fitted intensity. (If update=TRUE the model will first be refitted to the data X before the trend is computed.)
- a function, which can be evaluated to give values of the intensity at any locations.
- omitted: if lambda is omitted, then it will be estimated using a 'leave-one-out' kernel smoother.
If lambda is a numeric vector, then its length should be equal to the number of points in the pattern X. The value lambda[i] is assumed to be the (estimated) value of the intensity \lambda(x_i) for the point x_i of the pattern X. Each value must be a positive number; NA's are not allowed.
If lambda
is a pixel image, the domain of the image should
cover the entire window of the point pattern. If it does not (which
may occur near the boundary because of discretisation error),
then the missing pixel values
will be obtained by applying a Gaussian blur to lambda
using
blur
, then looking up the values of this blurred image
for the missing locations.
(A warning will be issued in this case.)
If lambda
is a function, then it will be evaluated in the
form lambda(x,y)
where x
and y
are vectors
of coordinates of the points of X
. It should return a numeric
vector with length equal to the number of points in X
.
If lambda
is omitted, then it will be estimated using
a ‘leave-one-out’ kernel smoother. The estimate lambda[i]
for the
point X[i]
is computed by removing X[i]
from the
point pattern, applying kernel smoothing to the remaining points using
density.ppp
, and evaluating the smoothed intensity
at the point X[i]
. The smoothing kernel bandwidth is controlled
by the arguments sigma
and varcov
, which are passed to
density.ppp
along with any extra arguments.
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Author(s)
Original code by Marie-Colette van Lieshout. C implementation and R adaptation by Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Ege Rubak rubak@math.aau.dk.
References
Van Lieshout, M.N.M. and Baddeley, A.J. (1996) A nonparametric measure of spatial interaction in point patterns. Statistica Neerlandica 50, 344–361.
Van Lieshout, M.N.M. (2010) A J-function for inhomogeneous point processes. Statistica Neerlandica 65, 183–201.
See Also
Examples
online <- interactive()
if(online) {
plot(Finhom(swedishpines, sigma=10))
plot(Finhom(swedishpines, sigma=bw.diggle, adjust=2))
} else {
## use a coarse grid for faster computation and package testing
plot(Finhom(swedishpines, sigma=10, dimyx=32))
}
Inhomogeneous Marked F-Function
Description
For a marked point pattern,
estimate the inhomogeneous version of the multitype F
function,
effectively the cumulative distribution function of the distance from
a fixed point to the nearest point in subset J
,
adjusted for spatially varying intensity.
Usage
Fmulti.inhom(X, J,
lambda = NULL, lambdaJ = NULL, lambdamin = NULL,
...,
r = NULL)
FmultiInhom(X, J,
lambda = NULL, lambdaJ = NULL, lambdamin = NULL,
...,
r = NULL)
Arguments
X |
A spatial point pattern (object of class |
J |
A subset index specifying the subset of points to which
distances are measured. Any kind of subset index acceptable
to |
lambda |
Intensity estimates for each point of |
lambdaJ |
Intensity estimates for each point of |
lambdamin |
A lower bound for the intensity,
or at least a lower bound for the values in |
... |
Extra arguments passed to |
r |
Vector of distance values at which the inhomogeneous |
Details
See Cronie and Van Lieshout (2015).
The functions FmultiInhom
and Fmulti.inhom
are identical.
Value
Object of class "fv"
containing the estimate of the
inhomogeneous multitype F
function.
Author(s)
Ottmar Cronie and Marie-Colette van Lieshout. Rewritten for spatstat by Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Cronie, O. and Van Lieshout, M.N.M. (2015) Summary statistics for inhomogeneous marked point processes. Annals of the Institute of Statistical Mathematics DOI: 10.1007/s10463-015-0515-z
See Also
Examples
X <- amacrine
J <- (marks(X) == "off")
online <- interactive()
eps <- if(online) NULL else 0.025
if(online && require(spatstat.model)) {
mod <- ppm(X ~ marks * x, eps=eps)
lambdaX <- fitted(mod, dataonly=TRUE)
lambdaOff <- predict(mod, eps=eps)[["off"]]
lmin <- min(lambdaOff) * 0.9
} else {
## faster computation for package checker only
lambdaX <- intensity(X)[as.integer(marks(X))]
lmin <- intensity(X)[2] * 0.9
}
plot(FmultiInhom(X, J, lambda=lambdaX, lambdamin=lmin, eps=eps))
Nearest Neighbour Distance Distribution Function of a Three-Dimensional Point Pattern
Description
Estimates the nearest-neighbour distance distribution function
G_3(r)
from a three-dimensional point pattern.
Usage
G3est(X, ..., rmax = NULL, nrval = 128, correction = c("rs", "km", "Hanisch"))
Arguments
X |
Three-dimensional point pattern (object of class |
... |
Ignored. |
rmax |
Optional. Maximum value of argument |
nrval |
Optional. Number of values of |
correction |
Optional. Character vector specifying the edge correction(s) to be applied. See Details. |
Details
For a stationary point process \Phi
in three-dimensional
space, the nearest-neighbour function
is
G_3(r) = P(d^\ast(x,\Phi) \le r \mid x \in \Phi)
the cumulative distribution function of the distance
d^\ast(x,\Phi)
from a typical point x
in \Phi
to its nearest neighbour, i.e.
to the nearest other point of \Phi
.
The three-dimensional point pattern X
is assumed to be a
partial realisation of a stationary point process \Phi
.
The nearest neighbour function of \Phi
can then be estimated using
techniques described in the References. For each data point, the
distance to the nearest neighbour is computed.
The empirical cumulative distribution
function of these values, with appropriate edge corrections, is the
estimate of G_3(r)
.
The available edge corrections are:
- "rs": the reduced sample (aka minus sampling, border correction) estimator (Baddeley et al, 1993)
- "km": the three-dimensional version of the Kaplan-Meier estimator (Baddeley and Gill, 1997)
- "Hanisch": the three-dimensional generalisation of the Hanisch estimator (Hanisch, 1984).
Alternatively correction="all" selects all options.
Value
A function value table (object of class "fv"
) that can be
plotted, printed or coerced to a data frame containing the function values.
Warnings
A large value of nrval
is required in order to avoid
discretisation effects (due to the use of histograms in the
calculation).
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rana Moyeed.
References
Baddeley, A.J, Moyeed, R.A., Howard, C.V. and Boyde, A. (1993) Analysis of a three-dimensional point pattern with replication. Applied Statistics 42, 641–668.
Baddeley, A.J. and Gill, R.D. (1997) Kaplan-Meier estimators of interpoint distance distributions for spatial point processes. Annals of Statistics 25, 263–292.
Hanisch, K.-H. (1984) Some remarks on estimators of the distribution function of nearest neighbour distance in stationary spatial point patterns. Mathematische Operationsforschung und Statistik, series Statistics 15, 409–412.
See Also
pp3
to create a three-dimensional point
pattern (object of class "pp3"
).
F3est
,
K3est
,
pcf3est
for other summary functions of
a three-dimensional point pattern.
Gest to estimate the nearest neighbour distance distribution function of point patterns in two dimensions.
Examples
X <- rpoispp3(42)
Z <- G3est(X)
if(interactive()) plot(Z)
Multitype Nearest Neighbour Distance Function (i-to-j)
Description
For a multitype point pattern,
estimate the distribution of the distance
from a point of type i
to the nearest point of type j
.
Usage
Gcross(X, i, j, r=NULL, breaks=NULL, ..., correction=c("rs", "km", "han"))
Arguments
X |
The observed point pattern,
from which an estimate of the cross type distance distribution function
|
i |
The type (mark value)
of the points in |
j |
The type (mark value)
of the points in |
r |
Optional. Numeric vector. The values of the argument |
breaks |
This argument is for internal use only. |
... |
Ignored. |
correction |
Optional. Character string specifying the edge correction(s)
to be used. Options are |
Details
This function Gcross
and its companions
Gdot
and Gmulti
are generalisations of the function Gest
to multitype point patterns.
A multitype point pattern is a spatial pattern of points classified into a finite number of possible “colours” or “types”. In the spatstat package, a multitype pattern is represented as a single point pattern object in which the points carry marks, and the mark value attached to each point determines the type of that point.
The argument X
must be a point pattern (object of class
"ppp"
) or any data that are acceptable to as.ppp
.
It must be a marked point pattern, and the mark vector
X$marks
must be a factor.
The arguments i
and j
will be interpreted as
levels of the factor X$marks
. (Warning: this means that
an integer value i=3
will be interpreted as
the number 3, not the 3rd smallest level).
The “cross-type” (type i to type j) nearest neighbour distance distribution function of a multitype point process is the cumulative distribution function G_{ij}(r) of the distance from a typical random point of the process with type i to the nearest point of type j.
An estimate of G_{ij}(r)
is a useful summary statistic in exploratory data analysis
of a multitype point pattern.
If the process of type i
points
were independent of the process of type j
points,
then G_{ij}(r)
would equal F_j(r)
,
the empty space function of the type j
points.
For a multitype Poisson point process where the type j points have intensity \lambda_j, we have
G_{ij}(r) = 1 - e^{ - \lambda_j \pi r^2}
Deviations between the empirical and theoretical G_{ij}
curves
may suggest dependence between the points of types i
and j
.
This algorithm estimates the distribution function G_{ij}(r)
from the point pattern X
. It assumes that X
can be treated
as a realisation of a stationary (spatially homogeneous)
random spatial point process in the plane, observed through
a bounded window.
The window (which is specified in X
as Window(X)
)
may have arbitrary shape.
Biases due to edge effects are
treated in the same manner as in Gest
.
The argument r
is the vector of values for the
distance r
at which G_{ij}(r)
should be evaluated.
It is also used to determine the breakpoints
(in the sense of hist
)
for the computation of histograms of distances. The reduced-sample and
Kaplan-Meier estimators are computed from histogram counts.
In the case of the Kaplan-Meier estimator this introduces a discretisation
error which is controlled by the fineness of the breakpoints.
First-time users would be strongly advised not to specify r
.
However, if it is specified, r
must satisfy r[1] = 0
,
and max(r)
must be larger than the radius of the largest disc
contained in the window. Furthermore, the successive entries of r
must be finely spaced.
The algorithm also returns an estimate of the hazard rate function,
\lambda(r)
, of G_{ij}(r)
.
This estimate should be used with caution as G_{ij}(r)
is not necessarily differentiable.
The naive empirical distribution of distances from each point of
the pattern X
to the nearest other point of the pattern,
is a biased estimate of G_{ij}
.
However this is also returned by the algorithm, as it is sometimes
useful in other contexts. Care should be taken not to use the uncorrected
empirical G_{ij}
as if it were an unbiased estimator of
G_{ij}
.
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing six numeric columns
r |
the values of the argument |
rs |
the “reduced sample” or “border correction”
estimator of |
han |
the Hanisch-style estimator of |
km |
the spatial Kaplan-Meier estimator of |
hazard |
the hazard rate |
raw |
the uncorrected estimate of |
theo |
the theoretical value of |
Warnings
The arguments i
and j
are always interpreted as
levels of the factor X$marks
. They are converted to character
strings if they are not already character strings.
The value i=1
does not
refer to the first level of the factor.
The function G_{ij}
does not necessarily have a density.
The reduced sample estimator of G_{ij}
is pointwise approximately
unbiased, but need not be a valid distribution function; it may
not be a nondecreasing function of r
. Its range is always
within [0,1]
.
The spatial Kaplan-Meier estimator of G_{ij}
is always nondecreasing
but its maximum value may be less than 1
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net.
References
Cressie, N.A.C. Statistics for spatial data. John Wiley and Sons, 1991.
Diggle, P.J. Statistical analysis of spatial point patterns. Academic Press, 1983.
Diggle, P. J. (1986). Displaced amacrine cells in the retina of a rabbit : analysis of a bivariate spatial point pattern. J. Neurosci. Meth. 18, 115–125.
Harkness, R.D and Isham, V. (1983) A bivariate spatial point pattern of ants' nests. Applied Statistics 32, 293–303
Lotwick, H. W. and Silverman, B. W. (1982). Methods for analysing spatial processes of several types of points. J. Royal Statist. Soc. Ser. B 44, 406–413.
Ripley, B.D. Statistical inference for spatial processes. Cambridge University Press, 1988.
Stoyan, D, Kendall, W.S. and Mecke, J. Stochastic geometry and its applications. 2nd edition. Springer Verlag, 1995.
Van Lieshout, M.N.M. and Baddeley, A.J. (1999) Indices of dependence between types in multivariate point patterns. Scandinavian Journal of Statistics 26, 511–532.
See Also
Examples
# amacrine cells data
G01 <- Gcross(amacrine)
# equivalent to:
G01 <- Gcross(amacrine, "off", "on")
plot(G01)
# empty space function of `on' points
if(interactive()) {
F1 <- Fest(split(amacrine)$on, r = G01$r)
lines(F1$r, F1$km, lty=3)
}
# synthetic example
pp <- runifpoispp(30)
pp <- pp %mark% factor(sample(0:1, npoints(pp), replace=TRUE))
G <- Gcross(pp, "0", "1") # note: "0" not 0
Inhomogeneous Multitype G Cross Function
Description
For a multitype point pattern,
estimate the inhomogeneous version of the cross G
function,
which is the distribution of the distance
from a point of type i
to the nearest point of type j
,
adjusted for spatially varying intensity.
Usage
Gcross.inhom(X, i, j,
lambda = NULL, lambdaI = NULL, lambdaJ = NULL,
lambdamin = NULL,
...,
r = NULL,
ReferenceMeasureMarkSetI = NULL,
ratio = FALSE)
Arguments
X |
The observed point pattern,
from which an estimate of the inhomogeneous cross type |
i |
The type (mark value)
of the points in |
j |
The type (mark value)
of the points in |
lambda |
Optional.
Values of the estimated intensity of the point process.
Either a pixel image (object of class |
lambdaI |
Optional.
Values of the estimated intensity of the sub-process of
points of type |
lambdaJ |
Optional.
Values of the estimated intensity of the sub-process of
points of type |
lambdamin |
Optional. The minimum possible value of the intensity over the spatial domain. A positive numerical value. |
... |
Extra arguments passed to |
r |
vector of values for the argument |
ReferenceMeasureMarkSetI |
Optional. The total measure of the mark set. A positive number. |
ratio |
Logical value indicating whether to save ratio information. |
Details
This is a generalisation of the function Gcross
to include an adjustment for spatially inhomogeneous intensity,
in a manner similar to the function Ginhom
.
The argument lambdaI
supplies the values
of the intensity of the sub-process of points of type i
.
It may be either
- a pixel image (object of class "im") which gives the values of the type i intensity at all locations in the window containing X;
- a numeric vector containing the values of the type i intensity evaluated only at the data points of type i; the length of this vector must equal the number of type i points in X;
- a function of the form function(x,y) which can be evaluated to give values of the intensity at any locations;
- a fitted point process model (object of class "ppm", "kppm" or "dppm") whose fitted trend can be used as the fitted intensity (if update=TRUE, the model will first be refitted to the data X before the trend is computed);
- omitted: if lambdaI is omitted, it will be estimated using a ‘leave-one-out’ kernel smoother.
Similarly the argument lambdaJ
should contain
estimated values of the intensity of the points of type j
.
It may be either a pixel image, a numeric vector of length equal
to the number of points in X
, a function, or omitted.
The argument r
is the vector of values for the
distance r
at which G_{ij}(r)
should be evaluated.
The values of r
must be increasing nonnegative numbers
and the maximum r
value must not exceed the radius of the
largest disc contained in the window.
Value
An object of class "fv"
(see fv.object
)
containing estimates of the inhomogeneous cross type G
function.
Warnings
The argument i
is interpreted as
a level of the factor X$marks
. It is converted to a character
string if it is not already a character string.
The value i=1
does not
refer to the first level of the factor.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Cronie, O. and Van Lieshout, M.N.M. (2015) Summary statistics for inhomogeneous marked point processes. Annals of the Institute of Statistical Mathematics DOI: 10.1007/s10463-015-0515-z
See Also
Gcross
,
Ginhom
,
Gdot.inhom
,
Gmulti.inhom
.
Examples
X <- rescale(amacrine)
if(interactive() && require(spatstat.model)) {
## how to do it normally
mod <- ppm(X ~ marks * x)
lam <- fitted(mod, dataonly=TRUE)
lmin <- min(predict(mod)[["off"]]) * 0.9
} else {
## for package testing
lam <- intensity(X)[as.integer(marks(X))]
lmin <- intensity(X)[2] * 0.9
}
GC <- Gcross.inhom(X, "on", "off", lambda=lam, lambdamin=lmin)
Multitype Nearest Neighbour Distance Function (i-to-any)
Description
For a multitype point pattern,
estimate the distribution of the distance
from a point of type i
to the nearest other point of any type.
Usage
Gdot(X, i, r=NULL, breaks=NULL, ..., correction=c("km", "rs", "han"))
Arguments
X |
The observed point pattern,
from which an estimate of the
distance distribution function
|
i |
The type (mark value)
of the points in |
r |
Optional. Numeric vector. The values of the argument |
breaks |
This argument is for internal use only. |
... |
Ignored. |
correction |
Optional. Character string specifying the edge correction(s)
to be used. Options are |
Details
This function Gdot
and its companions
Gcross
and Gmulti
are generalisations of the function Gest
to multitype point patterns.
A multitype point pattern is a spatial pattern of points classified into a finite number of possible “colours” or “types”. In the spatstat package, a multitype pattern is represented as a single point pattern object in which the points carry marks, and the mark value attached to each point determines the type of that point.
The argument X
must be a point pattern (object of class
"ppp"
) or any data that are acceptable to as.ppp
.
It must be a marked point pattern, and the mark vector
X$marks
must be a factor.
The argument i will be interpreted as a
level of the factor X$marks
. (Warning: this means that
an integer value i=3
will be interpreted as the number 3,
not the 3rd smallest level.)
The “dot-type” (type i
to any type)
nearest neighbour distance distribution function
of a multitype point process
is the cumulative distribution function G_{i\bullet}(r)
of the distance from a typical random point of the process with type i
to the nearest other point of the process, regardless of type.
An estimate of G_{i\bullet}(r)
is a useful summary statistic in exploratory data analysis
of a multitype point pattern.
If the type i
points
were independent of all other points,
then G_{i\bullet}(r)
would equal G_{ii}(r)
,
the nearest neighbour distance distribution function of the type
i
points alone.
For a multitype Poisson point process with total intensity
\lambda
, we have
G_{i\bullet}(r) = 1 - e^{ - \lambda \pi r^2}
Deviations between the empirical and theoretical
G_{i\bullet}
curves
may suggest dependence of the type i
points on the other points.
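As an illustration, a sketch (an assumed workflow, not from the original examples) comparing the Kaplan-Meier estimate of G_{i\bullet} with the Poisson benchmark computed from the estimated total intensity:
Gd <- Gdot(amacrine, "on")
lam <- intensity(unmark(amacrine))   # estimated total intensity (points per unit area)
plot(Gd, km ~ r)
curve(1 - exp(-lam * pi * x^2), add = TRUE, lty = 2)   # theoretical Poisson curve
(The theo column of the returned object contains the same benchmark curve, so this only checks the formula.)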
This algorithm estimates the distribution function
G_{i\bullet}(r)
from the point pattern X
. It assumes that X
can be treated
as a realisation of a stationary (spatially homogeneous)
random spatial point process in the plane, observed through
a bounded window.
The window (which is specified in X
as Window(X)
)
may have arbitrary shape.
Biases due to edge effects are
treated in the same manner as in Gest
.
The argument r
is the vector of values for the
distance r
at which G_{i\bullet}(r)
should be evaluated.
It is also used to determine the breakpoints
(in the sense of hist
)
for the computation of histograms of distances. The reduced-sample and
Kaplan-Meier estimators are computed from histogram counts.
In the case of the Kaplan-Meier estimator this introduces a discretisation
error which is controlled by the fineness of the breakpoints.
First-time users would be strongly advised not to specify r
.
However, if it is specified, r
must satisfy r[1] = 0
,
and max(r)
must be larger than the radius of the largest disc
contained in the window. Furthermore, the successive entries of r
must be finely spaced.
The algorithm also returns an estimate of the hazard rate function,
\lambda(r)
, of G_{i\bullet}(r)
.
This estimate should be used with caution as
G_{i\bullet}(r)
is not necessarily differentiable.
The naive empirical distribution of distances from each point of
the pattern X
to the nearest other point of the pattern
is a biased estimate of G_{i\bullet}
.
However, this is also returned by the algorithm, as it is sometimes
useful in other contexts. Care should be taken not to use the uncorrected
empirical G_{i\bullet}
as if it were an unbiased estimator of
G_{i\bullet}
.
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing six numeric columns
r | the values of the argument r at which G_{i\bullet}(r) has been estimated |
rs | the “reduced sample” or “border correction” estimator of G_{i\bullet}(r) |
han | the Hanisch-style estimator of G_{i\bullet}(r) |
km | the spatial Kaplan-Meier estimator of G_{i\bullet}(r) |
hazard | the hazard rate \lambda(r) of G_{i\bullet}(r) by the spatial Kaplan-Meier estimate |
raw | the uncorrected estimate of G_{i\bullet}(r), i.e. the empirical distribution of distances |
theo | the theoretical value of G_{i\bullet}(r) for a marked Poisson process with the same estimated intensity |
Warnings
The argument i
is interpreted as
a level of the factor X$marks
. It is converted to a character
string if it is not already a character string.
The value i=1
does not
refer to the first level of the factor.
The function G_{i\bullet}
does not necessarily have a density.
The reduced sample estimator of G_{i\bullet}
is pointwise approximately
unbiased, but need not be a valid distribution function; it may
not be a nondecreasing function of r
. Its range is always
within [0,1]
.
The spatial Kaplan-Meier estimator of G_{i\bullet}
is always nondecreasing
but its maximum value may be less than 1
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
References
Cressie, N.A.C. Statistics for spatial data. John Wiley and Sons, 1991.
Diggle, P.J. Statistical analysis of spatial point patterns. Academic Press, 1983.
Diggle, P.J. (1986) Displaced amacrine cells in the retina of a rabbit: analysis of a bivariate spatial point pattern. J. Neurosci. Meth. 18, 115–125.
Harkness, R.D. and Isham, V. (1983) A bivariate spatial point pattern of ants' nests. Applied Statistics 32, 293–303.
Lotwick, H.W. and Silverman, B.W. (1982) Methods for analysing spatial processes of several types of points. J. Royal Statist. Soc. Ser. B 44, 406–413.
Ripley, B.D. Statistical inference for spatial processes. Cambridge University Press, 1988.
Stoyan, D., Kendall, W.S. and Mecke, J. Stochastic geometry and its applications. 2nd edition. Springer Verlag, 1995.
Van Lieshout, M.N.M. and Baddeley, A.J. (1999) Indices of dependence between types in multivariate point patterns. Scandinavian Journal of Statistics 26, 511–532.
See Also
Examples
# amacrine cells data
G0. <- Gdot(amacrine, "off")
plot(G0.)
# synthetic example
pp <- runifpoispp(30)
pp <- pp %mark% factor(sample(0:1, npoints(pp), replace=TRUE))
G <- Gdot(pp, "0")
G <- Gdot(pp, 0) # equivalent
Inhomogeneous Multitype G Dot Function
Description
For a multitype point pattern,
estimate the inhomogeneous version of the dot G
function,
which is the distribution of the distance
from a point of type i
to the nearest other point of any type,
adjusted for spatially varying intensity.
Usage
Gdot.inhom(X, i,
lambdaI = NULL, lambdadot = NULL, lambdamin = NULL,
...,
r = NULL, ReferenceMeasureMarkSetI = NULL, ratio = FALSE)
Arguments
X |
The observed point pattern,
from which an estimate of the inhomogeneous dot type |
i |
The type (mark value)
of the points in |
lambdaI |
Optional.
Values of the estimated intensity of the sub-process of
points of type |
lambdadot |
Optional.
Values of the estimated intensity of the entire point process,
Either a pixel image (object of class |
lambdamin |
Optional. The minimum possible value of the intensity over the spatial domain. A positive numerical value. |
... |
Ignored. |
r |
vector of values for the argument |
ReferenceMeasureMarkSetI |
Optional. The total measure of the mark set. A positive number. |
ratio |
Logical value indicating whether to save ratio information. |
Details
This is a generalisation of the function Gdot
to include an adjustment for spatially inhomogeneous intensity,
in a manner similar to the function Ginhom
.
The argument lambdaI
supplies the values
of the intensity of the sub-process of points of type i
.
It may be either
- a pixel image (object of class "im") which gives the values of the type i intensity at all locations in the window containing X;
- a numeric vector containing the values of the type i intensity evaluated only at the data points of type i; the length of this vector must equal the number of type i points in X;
- a function of the form function(x,y) which can be evaluated to give values of the intensity at any locations;
- a fitted point process model (object of class "ppm", "kppm" or "dppm") whose fitted trend can be used as the fitted intensity (if update=TRUE, the model will first be refitted to the data X before the trend is computed);
- omitted: if lambdaI is omitted, it will be estimated using a ‘leave-one-out’ kernel smoother.
Similarly the argument lambdadot
should contain
estimated values of the intensity of the entire point process.
It may be either a pixel image, a numeric vector of length equal
to the number of points in X
, a function, or omitted.
The argument r
is the vector of values for the
distance r
at which G_{i\bullet}(r)
should be evaluated.
The values of r
must be increasing nonnegative numbers
and the maximum r
value must not exceed the radius of the
largest disc contained in the window.
Value
An object of class "fv"
(see fv.object
)
containing estimates of the inhomogeneous dot type G
function.
Warnings
The argument i
is interpreted as
a level of the factor X$marks
. It is converted to a character
string if it is not already a character string.
The value i=1
does not
refer to the first level of the factor.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk
References
Cronie, O. and Van Lieshout, M.N.M. (2015) Summary statistics for inhomogeneous marked point processes. Annals of the Institute of Statistical Mathematics DOI: 10.1007/s10463-015-0515-z
See Also
Gdot
,
Ginhom
,
Gcross.inhom
,
Gmulti.inhom
.
Examples
X <- rescale(amacrine)
if(interactive() && require(spatstat.model)) {
## how to do it normally
mod <- ppm(X ~ marks * x)
lam <- fitted(mod, dataonly=TRUE)
lmin <- min(predict(mod)[["off"]]) * 0.9
} else {
## for package testing
lam <- intensity(X)[as.integer(marks(X))]
lmin <- intensity(X)[2] * 0.9
}
lamI <- lam[marks(X) == "on"]
GD <- Gdot.inhom(X, "on", lambdaI=lamI, lambdadot=lam, lambdamin=lmin)
Nearest Neighbour Distance Function G
Description
Estimates the nearest neighbour distance distribution
function G(r)
from a point pattern in a
window of arbitrary shape.
Usage
Gest(X, r=NULL, breaks=NULL, ...,
correction=c("rs", "km", "han"),
domain=NULL)
Arguments
X |
The observed point pattern,
from which an estimate of |
r |
Optional. Numeric vector. The values of the argument |
breaks |
This argument is for internal use only. |
... |
Ignored. |
correction |
Optional.
The edge correction(s) to be used to estimate |
domain |
Optional. Calculations will be restricted to this subset of the window. See Details. |
Details
The nearest neighbour distance distribution function
(also called the “event-to-event” or
“inter-event” distribution)
of a point process X
is the cumulative distribution function G
of the distance
from a typical random point of X
to
the nearest other point of X
.
An estimate of G
derived from a spatial point pattern dataset
can be used in exploratory data analysis and formal inference
about the pattern (Cressie, 1991; Diggle, 1983; Ripley, 1988).
In exploratory analyses, the estimate of G
is a useful statistic
summarising one aspect of the “clustering” of points.
For inferential purposes, the estimate of G
is usually compared to the
true value of G
for a completely random (Poisson) point process,
which is
G(r) = 1 - e^{ - \lambda \pi r^2}
where \lambda
is the intensity
(expected number of points per unit area).
Deviations between the empirical and theoretical G
curves
may suggest spatial clustering or spatial regularity.
This algorithm estimates the nearest neighbour distance distribution
function G
from the point pattern X
. It assumes that X
can be treated
as a realisation of a stationary (spatially homogeneous)
random spatial point process in the plane, observed through
a bounded window.
The window (which is specified in X
as Window(X)
)
may have arbitrary shape.
The argument X
is interpreted as a point pattern object
(of class "ppp"
, see ppp.object
) and can
be supplied in any of the formats recognised
by as.ppp()
.
The estimation of G
is hampered by edge effects arising from
the unobservability of points of the random pattern outside the window.
An edge correction is needed to reduce bias (Baddeley, 1998; Ripley, 1988).
The edge corrections implemented here are the border method or
“reduced sample” estimator, the spatial Kaplan-Meier estimator
(Baddeley and Gill, 1997) and the Hanisch estimator (Hanisch, 1984).
The argument r
is the vector of values for the
distance r
at which G(r)
should be evaluated.
It is also used to determine the breakpoints
(in the sense of hist
)
for the computation of histograms of distances. The
estimators are computed from histogram counts.
This introduces a discretisation
error which is controlled by the fineness of the breakpoints.
First-time users would be strongly advised not to specify r
.
However, if it is specified, r
must satisfy r[1] = 0
,
and max(r)
must be larger than the radius of the largest disc
contained in the window. Furthermore, the successive entries of r
must be finely spaced.
The algorithm also returns an estimate of the hazard rate function,
\lambda(r)
, of G(r)
. The hazard rate is
defined as the derivative
\lambda(r) = - \frac{d}{dr} \log (1 - G(r))
This estimate should be used with caution as G
is not necessarily
differentiable.
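A small sketch (assuming the default corrections, which include the Kaplan-Meier estimate, so the hazard column is present):
G <- Gest(cells)
plot(G, hazard ~ r)   # hazard rate estimate derived from the Kaplan-Meier G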
If the argument domain
is given, the estimate of G(r)
will be based only on the nearest neighbour distances
measured from points falling inside domain
(although their
nearest neighbours may lie outside domain
).
This is useful in bootstrap techniques. The argument domain
should be a window (object of class "owin"
) or something acceptable to
as.owin
. It must be a subset of the
window of the point pattern X
.
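A sketch with a hypothetical subwindow of the unit-square window of the cells data:
W <- owin(c(0, 0.5), c(0, 0.5))   # hypothetical subdomain
Gsub <- Gest(cells, domain = W)   # distances measured only from points inside W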
The naive empirical distribution of distances from each point of
the pattern X
to the nearest other point of the pattern
is a biased estimate of G
. However, it is sometimes useful.
It can be returned by the algorithm, by selecting correction="none"
.
Care should be taken not to use the uncorrected
empirical G
as if it were an unbiased estimator of G
.
To simply compute the nearest neighbour distance for each point in the
pattern, use nndist
. To determine which point is the
nearest neighbour of a given point, use nnwhich
.
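For example (a sketch of these lower-level tools):
d  <- nndist(cells)    # nearest neighbour distance for each point
id <- nnwhich(cells)   # index of the nearest neighbour of each point
summary(d)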
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Essentially a data frame containing some or all of the following columns:
r | the values of the argument r at which G(r) has been estimated |
rs | the “reduced sample” or “border correction” estimator of G(r) |
km | the spatial Kaplan-Meier estimator of G(r) |
hazard | the hazard rate \lambda(r) of G(r) by the spatial Kaplan-Meier estimate |
raw | the uncorrected estimate of G(r), i.e. the empirical distribution of distances |
han | the Hanisch correction estimator of G(r) |
theo | the theoretical value of G(r) for a stationary Poisson process with the same estimated intensity |
Warnings
The function G
does not necessarily have a density.
Any valid c.d.f. may appear as the nearest neighbour distance
distribution function of a stationary point process.
The reduced sample estimator of G
is pointwise approximately
unbiased, but need not be a valid distribution function; it may
not be a nondecreasing function of r
. Its range is always
within [0,1]
.
The spatial Kaplan-Meier estimator of G
is always nondecreasing
but its maximum value may be less than 1
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Baddeley, A.J. Spatial sampling and censoring. In O.E. Barndorff-Nielsen, W.S. Kendall and M.N.M. van Lieshout (eds) Stochastic Geometry: Likelihood and Computation. Chapman and Hall, 1998. Chapter 2, pages 37-78.
Baddeley, A.J. and Gill, R.D. Kaplan-Meier estimators of interpoint distance distributions for spatial point processes. Annals of Statistics 25 (1997) 263-292.
Cressie, N.A.C. Statistics for spatial data. John Wiley and Sons, 1991.
Diggle, P.J. Statistical analysis of spatial point patterns. Academic Press, 1983.
Hanisch, K.-H. (1984) Some remarks on estimators of the distribution function of nearest-neighbour distance in stationary spatial point patterns. Mathematische Operationsforschung und Statistik, series Statistics 15, 409–412.
Ripley, B.D. Statistical inference for spatial processes. Cambridge University Press, 1988.
Stoyan, D., Kendall, W.S. and Mecke, J. Stochastic geometry and its applications. 2nd edition. Springer Verlag, 1995.
See Also
nndist
,
nnwhich
,
Fest
,
Jest
,
Kest
,
km.rs
,
reduced.sample
,
kaplan.meier
Examples
G <- Gest(cells)
plot(G)
# P-P style plot
plot(G, cbind(km,theo) ~ theo)
# the empirical G is below the Poisson G,
# indicating an inhibited pattern
if(interactive()) {
plot(G, . ~ r)
plot(G, . ~ theo)
plot(G, asin(sqrt(.)) ~ asin(sqrt(theo)))
}
Foxall's Distance Functions
Description
Given a point pattern X
and a spatial object Y
,
compute estimates of Foxall's G
and J
functions.
Usage
Gfox(X, Y, r=NULL, breaks=NULL, correction=c("km", "rs", "han"), W, ...)
Jfox(X, Y, r=NULL, breaks=NULL, correction=c("km", "rs", "han"), W, ...,
warn.trim=TRUE)
Arguments
X |
A point pattern (object of class |
Y |
An object of class |
r |
Optional. Numeric vector. The values of the argument |
breaks |
This argument is for internal use only. |
correction |
Optional.
The edge correction(s) to be used to estimate
|
W |
Optional. A window (object of class |
... |
Extra arguments affecting the discretisation of distances.
These arguments are ignored by |
warn.trim |
Logical value indicating whether a warning should be issued
by |
Details
Given a point pattern X
and another spatial object Y
,
these functions compute two nonparametric measures of association
between X
and Y
, introduced by Foxall
(Foxall and Baddeley, 2002).
Let the random variable R
be the distance from a typical point
of X
to the object Y
.
Foxall's G
-function is the cumulative distribution function
of R
:
G(r) = P(R \le r)
Let the random variable S
be the distance from a fixed point
in space to the object Y
. The cumulative distribution function
of S
is the (unconditional) spherical contact distribution
function
H(r) = P(S \le r)
which is computed by Hest
.
Foxall's J
-function is the ratio
J(r) = \frac{1-G(r)}{1-H(r)}
For further interpretation, see Foxall and Baddeley (2002).
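As an illustration only (not the internal algorithm), the defining ratio can be reproduced from the Gfox and Hest estimates on a common distance grid:
X <- copper$SouthPoints
Y <- copper$SouthLines
G <- Gfox(X, Y)
H <- Hest(Y, r = G$r)                # spherical contact distribution of Y
Jmanual <- (1 - G$km) / (1 - H$km)   # Foxall's J from Kaplan-Meier estimates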
Accuracy of Jfox
depends on the pixel resolution,
which is controlled by the
arguments eps
, dimyx
and xy
passed to
as.mask
. For example, use eps=0.1
to specify
square pixels of side 0.1 units, and dimyx=256
to specify a
256 by 256 grid of pixels.
Value
A function value table (object of class "fv"
)
which can be printed, plotted, or converted to a data frame of values.
Author(s)
Rob Foxall and Adrian Baddeley Adrian.Baddeley@curtin.edu.au
References
Foxall, R. and Baddeley, A. (2002) Nonparametric measures of association between a spatial point process and a random set, with geological applications. Applied Statistics 51, 165–182.
See Also
Examples
X <- copper$SouthPoints
Y <- copper$SouthLines
G <- Gfox(X,Y)
J <- Jfox(X,Y, correction="km")
Inhomogeneous Nearest Neighbour Function
Description
Estimates the inhomogeneous nearest neighbour function G
of
a non-stationary point pattern.
Usage
Ginhom(X, lambda = NULL, lmin = NULL, ...,
sigma = NULL, varcov = NULL,
r = NULL, breaks = NULL, ratio = FALSE,
update = TRUE, warn.bias=TRUE, savelambda=FALSE)
Arguments
X |
The observed data point pattern,
from which an estimate of the inhomogeneous |
lambda |
Optional.
Values of the estimated intensity function.
Either a vector giving the intensity values
at the points of the pattern |
lmin |
Optional. The minimum possible value of the intensity over the spatial domain. A positive numerical value. |
sigma , varcov |
Optional arguments passed to |
... |
Extra arguments passed to |
r |
vector of values for the argument |
breaks |
This argument is for internal use only. |
ratio |
Logical.
If |
update |
Logical. If |
warn.bias |
Logical value specifying whether to issue a warning when the inhomogeneity correction factor takes extreme values, which can often lead to biased results. This usually occurs when insufficient smoothing is used to estimate the intensity. |
savelambda |
Logical value specifying whether to save the values of
|
Details
This command computes estimates of the
inhomogeneous G
-function (Van Lieshout, 2010)
of a point pattern. It is the counterpart, for inhomogeneous
spatial point patterns, of the nearest-neighbour distance
distribution function G
for homogeneous point patterns computed by Gest
.
The argument X
should be a point pattern
(object of class "ppp"
).
The inhomogeneous G
function is computed
using the border correction, equation (7) in Van Lieshout (2010).
The argument lambda
should supply the
(estimated) values of the intensity function \lambda
of the point process. It may be either
- a numeric vector containing the values of the intensity function at the points of the pattern X;
- a pixel image (object of class "im") assumed to contain the values of the intensity function at all locations in the window;
- a fitted point process model (object of class "ppm" or "kppm") whose fitted trend can be used as the fitted intensity (if update=TRUE, the model will first be refitted to the data X before the trend is computed);
- a function which can be evaluated to give values of the intensity at any locations;
- omitted: if lambda is omitted, then it will be estimated using a ‘leave-one-out’ kernel smoother.
If lambda
is a numeric vector, then its length should
be equal to the number of points in the pattern X
.
The value lambda[i]
is assumed to be
the (estimated) value of the intensity
\lambda(x_i)
for
the point x_i
of the pattern X
.
Each value must be a positive number; NA
's are not allowed.
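For example, a sketch (one reasonable workflow) supplying lambda as a numeric vector obtained by leave-one-out kernel smoothing, mirroring the default behaviour:
lam <- density(swedishpines, sigma = 10, at = "points", leaveoneout = TRUE)
plot(Ginhom(swedishpines, lambda = lam, lmin = 0.9 * min(lam)))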
If lambda
is a pixel image, the domain of the image should
cover the entire window of the point pattern. If it does not (which
may occur near the boundary because of discretisation error),
then the missing pixel values
will be obtained by applying a Gaussian blur to lambda
using
blur
, then looking up the values of this blurred image
for the missing locations.
(A warning will be issued in this case.)
If lambda
is a function, then it will be evaluated in the
form lambda(x,y)
where x
and y
are vectors
of coordinates of the points of X
. It should return a numeric
vector with length equal to the number of points in X
.
If lambda
is omitted, then it will be estimated using
a ‘leave-one-out’ kernel smoother.
The estimate lambda[i]
for the
point X[i]
is computed by removing X[i]
from the
point pattern, applying kernel smoothing to the remaining points using
density.ppp
, and evaluating the smoothed intensity
at the point X[i]
. The smoothing kernel bandwidth is controlled
by the arguments sigma
and varcov
, which are passed to
density.ppp
along with any extra arguments.
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Author(s)
Original code by Marie-Colette van Lieshout. C implementation and R adaptation by Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Ege Rubak rubak@math.aau.dk.
References
Van Lieshout, M.N.M. and Baddeley, A.J. (1996) A nonparametric measure of spatial interaction in point patterns. Statistica Neerlandica 50, 344–361.
Van Lieshout, M.N.M. (2010) A J-function for inhomogeneous point processes. Statistica Neerlandica 65, 183–201.
See Also
Examples
plot(Ginhom(swedishpines, sigma=10))
plot(Ginhom(swedishpines, sigma=bw.diggle, adjust=2))
Marked Nearest Neighbour Distance Function
Description
For a marked point pattern,
estimate the distribution of the distance
from a typical point in subset I
to the nearest point of subset J
.
Usage
Gmulti(X, I, J, r=NULL, breaks=NULL, ...,
disjoint=NULL, correction=c("rs", "km", "han"))
Arguments
X |
The observed point pattern,
from which an estimate of the multitype distance distribution function
|
I |
Subset of points of |
J |
Subset of points in |
r |
Optional. Numeric vector. The values of the argument |
breaks |
This argument is for internal use only. |
... |
Ignored. |
disjoint |
Optional flag indicating whether
the subsets |
correction |
Optional. Character string specifying the edge correction(s)
to be used. Options are |
Details
The function Gmulti
generalises Gest
(for unmarked point
patterns) and Gdot
and Gcross
(for
multitype point patterns) to arbitrary marked point patterns.
Suppose X_I
, X_J
are subsets, possibly
overlapping, of a marked point process. This function computes an
estimate of the cumulative
distribution function G_{IJ}(r)
of the distance
from a typical point of X_I
to the nearest distinct point of
X_J
.
The argument X
must be a point pattern (object of class
"ppp"
) or any data that are acceptable to as.ppp
.
The arguments I
and J
specify two subsets of the
point pattern. They may be any type of subset indices, for example,
logical vectors of length equal to npoints(X)
,
or integer vectors with entries in the range 1 to
npoints(X)
, or negative integer vectors.
Alternatively, I
and J
may be functions
that will be applied to the point pattern X
to obtain
index vectors. If I
is a function, then evaluating
I(X)
should yield a valid subset index. This option
is useful when generating simulation envelopes using
envelope
.
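For example, a sketch (using the longleaf marks, as in the example below) passing I and J as functions:
fI <- function(X) { marks(X) <= 15 }   # selects small trees
fJ <- function(X) { marks(X) >= 25 }   # selects large trees
Gm <- Gmulti(longleaf, fI, fJ)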
This algorithm estimates the distribution function G_{IJ}(r)
from the point pattern X
. It assumes that X
can be treated
as a realisation of a stationary (spatially homogeneous)
random spatial point process in the plane, observed through
a bounded window.
The window (which is specified in X
as Window(X)
)
may have arbitrary shape.
Biases due to edge effects are
treated in the same manner as in Gest
.
The argument r
is the vector of values for the
distance r
at which G_{IJ}(r)
should be evaluated.
It is also used to determine the breakpoints
(in the sense of hist
)
for the computation of histograms of distances. The reduced-sample and
Kaplan-Meier estimators are computed from histogram counts.
In the case of the Kaplan-Meier estimator this introduces a discretisation
error which is controlled by the fineness of the breakpoints.
First-time users would be strongly advised not to specify r
.
However, if it is specified, r
must satisfy r[1] = 0
,
and max(r)
must be larger than the radius of the largest disc
contained in the window. Furthermore, the successive entries of r
must be finely spaced.
The algorithm also returns an estimate of the hazard rate function,
\lambda(r)
, of G_{IJ}(r)
.
This estimate should be used with caution as G_{IJ}(r)
is not necessarily differentiable.
The naive empirical distribution of distances from each point of
the pattern X
to the nearest other point of the pattern
is a biased estimate of G_{IJ}
.
However, this is also returned by the algorithm, as it is sometimes
useful in other contexts. Care should be taken not to use the uncorrected
empirical G_{IJ}
as if it were an unbiased estimator of
G_{IJ}
.
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing six numeric columns
r | the values of the argument r at which G_{IJ}(r) has been estimated |
rs | the “reduced sample” or “border correction” estimator of G_{IJ}(r) |
han | the Hanisch-style estimator of G_{IJ}(r) |
km | the spatial Kaplan-Meier estimator of G_{IJ}(r) |
hazard | the hazard rate \lambda(r) of G_{IJ}(r) by the spatial Kaplan-Meier estimate |
raw | the uncorrected estimate of G_{IJ}(r), i.e. the empirical distribution of distances |
theo | the theoretical value of G_{IJ}(r) for a marked Poisson process with the same estimated intensity |
Warnings
The function G_{IJ}
does not necessarily have a density.
The reduced sample estimator of G_{IJ}
is pointwise approximately
unbiased, but need not be a valid distribution function; it may
not be a nondecreasing function of r
. Its range is always
within [0,1]
.
The spatial Kaplan-Meier estimator of G_{IJ}
is always nondecreasing
but its maximum value may be less than 1
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Cressie, N.A.C. Statistics for spatial data. John Wiley and Sons, 1991.
Diggle, P.J. Statistical analysis of spatial point patterns. Academic Press, 1983.
Diggle, P.J. (1986) Displaced amacrine cells in the retina of a rabbit: analysis of a bivariate spatial point pattern. J. Neurosci. Meth. 18, 115–125.
Harkness, R.D. and Isham, V. (1983) A bivariate spatial point pattern of ants' nests. Applied Statistics 32, 293–303.
Lotwick, H.W. and Silverman, B.W. (1982) Methods for analysing spatial processes of several types of points. J. Royal Statist. Soc. Ser. B 44, 406–413.
Ripley, B.D. Statistical inference for spatial processes. Cambridge University Press, 1988.
Stoyan, D., Kendall, W.S. and Mecke, J. Stochastic geometry and its applications. 2nd edition. Springer Verlag, 1995.
Van Lieshout, M.N.M. and Baddeley, A.J. (1999) Indices of dependence between types in multivariate point patterns. Scandinavian Journal of Statistics 26, 511–532.
See Also
Examples
trees <- longleaf
# Longleaf Pine data: marks represent diameter
Gm <- Gmulti(trees, marks(trees) <= 15, marks(trees) >= 25)
plot(Gm)
Inhomogeneous Marked G-Function
Description
For a marked point pattern,
estimate the inhomogeneous version of the multitype G
function,
effectively the cumulative distribution function of the distance from
a point in subset I
to the nearest point in subset J
,
adjusted for spatially varying intensity.
Usage
Gmulti.inhom(X, I, J,
lambda = NULL, lambdaI = NULL, lambdaJ = NULL,
lambdamin = NULL, ...,
r = NULL,
ReferenceMeasureMarkSetI = NULL,
ratio = FALSE)
GmultiInhom(X, I, J,
lambda = NULL, lambdaI = NULL, lambdaJ = NULL,
lambdamin = NULL, ...,
r = NULL,
ReferenceMeasureMarkSetI = NULL,
ratio = FALSE)
Arguments
X |
A spatial point pattern (object of class |
I |
A subset index specifying the subset of points from which
distances are measured. Any kind of subset index acceptable
to |
J |
A subset index specifying the subset of points to which
distances are measured. Any kind of subset index acceptable
to |
lambda |
Intensity estimates for each point of |
lambdaI |
Intensity estimates for each point of |
lambdaJ |
Intensity estimates for each point of |
lambdamin |
A lower bound for the intensity,
or at least a lower bound for the values in |
... |
Ignored. |
r |
Vector of distance values at which the inhomogeneous |
ReferenceMeasureMarkSetI |
Optional. The total measure of the mark set. A positive number. |
ratio |
Logical value indicating whether to save ratio information. |
Details
See Cronie and Van Lieshout (2015).
The functions GmultiInhom
and Gmulti.inhom
are identical.
Value
Object of class "fv"
containing the estimate of the
inhomogeneous multitype G
function.
Author(s)
Ottmar Cronie and Marie-Colette van Lieshout. Rewritten for spatstat by Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Cronie, O. and Van Lieshout, M.N.M. (2015) Summary statistics for inhomogeneous marked point processes. Annals of the Institute of Statistical Mathematics DOI: 10.1007/s10463-015-0515-z
See Also
Examples
X <- rescale(amacrine)
I <- (marks(X) == "on")
J <- (marks(X) == "off")
if(interactive() && require(spatstat.model)) {
## how to do it normally
mod <- ppm(X ~ marks * x)
lam <- fitted(mod, dataonly=TRUE)
lmin <- min(predict(mod)[["off"]]) * 0.9
} else {
## for package testing
lam <- intensity(X)[as.integer(marks(X))]
lmin <- intensity(X)[2] * 0.9
}
plot(GmultiInhom(X, I, J, lambda=lam, lambdamin=lmin))
# equivalent
plot(GmultiInhom(X, I, J, lambdaI=lam[I], lambdaJ=lam[J], lambdamin=lmin),
main="")
Spherical Contact Distribution Function
Description
Estimates the spherical contact distribution function of a random set.
Usage
Hest(X, r=NULL, breaks=NULL, ...,
W,
correction=c("km", "rs", "han"),
conditional=TRUE)
Arguments
X |
The observed random set.
An object of class |
r |
Optional. Vector of values for the argument |
breaks |
This argument is for internal use only. |
... |
Arguments passed to |
W |
Optional. A window (object of class |
correction |
Optional.
The edge correction(s) to be used to estimate |
conditional |
Logical value indicating whether to compute the conditional or unconditional distribution. See Details. |
Details
The spherical contact distribution function
of a stationary random set X
is the cumulative distribution function H
of the distance
from a fixed point in space to the nearest point of X
,
given that the point lies outside X
.
That is, H(r)
equals
the probability that X
lies closer than r
units away
from the fixed point x
, given that X
does not cover x
.
Let D = d(x,X)
be the shortest distance from an arbitrary
point x
to the set X
. Then the spherical contact
distribution function is
H(r) = P(D \le r \mid D > 0)
For a point process, the spherical contact distribution function
is the same as the empty space function F
discussed
in Fest
.
The argument X
may be a point pattern
(object of class "ppp"
), a line segment pattern
(object of class "psp"
) or a window (object of class
"owin"
). It is assumed to be a realisation of a stationary
random set.
The algorithm first calls distmap
to compute the
distance transform of X
, then computes the Kaplan-Meier
and reduced-sample estimates of the cumulative distribution
following Hansen et al (1999).
If conditional=TRUE
(the default) the algorithm
returns an estimate of the spherical contact function
H(r)
as defined above.
If conditional=FALSE
, it instead returns an estimate of the
cumulative distribution function
H^\ast(r) = P(D \le r)
which includes a jump at r=0
if X
has nonzero area.
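A sketch contrasting the two versions for a set with nonzero area:
X <- heather$coarse
Hc <- Hest(X)                        # conditional: given D > 0
Hu <- Hest(X, conditional = FALSE)   # unconditional: jump at r = 0 equal to the area fraction of X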
Accuracy depends on the pixel resolution, which is controlled by the
arguments eps
, dimyx
and xy
passed to
as.mask
. For example, use eps=0.1
to specify
square pixels of side 0.1 units, and dimyx=256
to specify a
256 by 256 grid of pixels.
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Essentially a data frame containing up to six columns:
r | the values of the argument r at which H(r) has been estimated |
rs | the “reduced sample” or “border correction” estimator of H(r) |
km | the spatial Kaplan-Meier estimator of H(r) |
hazard | the hazard rate \lambda(r) of H(r) by the spatial Kaplan-Meier estimate |
han | the spatial Hanisch-Chiu-Stoyan estimator of H(r) |
raw | the uncorrected estimate of H(r), i.e. the empirical distribution of distances |
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk with contributions from Kassel Hingee.
References
Baddeley, A.J. Spatial sampling and censoring. In O.E. Barndorff-Nielsen, W.S. Kendall and M.N.M. van Lieshout (eds) Stochastic Geometry: Likelihood and Computation. Chapman and Hall, 1998. Chapter 2, pages 37-78.
Baddeley, A.J. and Gill, R.D. The empty space hazard of a spatial pattern. Research Report 1994/3, Department of Mathematics, University of Western Australia, May 1994.
Hansen, M.B., Baddeley, A.J. and Gill, R.D. First contact distributions for spatial patterns: regularity and estimation. Advances in Applied Probability 31 (1999) 15-33.
Ripley, B.D. Statistical inference for spatial processes. Cambridge University Press, 1988.
Stoyan, D., Kendall, W.S. and Mecke, J. Stochastic geometry and its applications. 2nd edition. Springer Verlag, 1995.
See Also
Examples
X <- runifpoint(42)
H <- Hest(X)
Y <- rpoisline(10)
H <- Hest(Y)
H <- Hest(Y, dimyx=256)
X <- heather$coarse
plot(Hest(X))
H <- Hest(X, conditional=FALSE)
P <- owin(poly=list(x=c(5.3, 8.5, 8.3, 3.7, 1.3, 3.7),
y=c(9.7, 10.0, 13.6, 14.4, 10.7, 7.2)))
plot(X)
plot(P, add=TRUE, col="red")
H <- Hest(X, W=P)
Z <- as.im(FALSE, Frame(X))
Z[X] <- TRUE
Z <- Z[P, drop=FALSE]
plot(Z)
H <- Hest(Z)
Integrated Squared Error on an Envelope Object
Description
Compute integrated squared error of each of the simulated function estimates in an envelope object.
Usage
ISE.envelope(object, theo, domain, dimension=2)
Arguments
object |
Object of class |
theo |
Function in the R language that evaluates the true (theoretically expected) value of the spatial summary function. |
domain |
Numeric vector of length 2 specifying the limits of the domain of integration for the integrated squared error. |
dimension |
Integer (either 1 or 2) specifying whether to calculate the one-dimensional or two-dimensional integral of squared error. |
Details
The first argument should be an object of class "envelope"
and should contain the simulated function estimates (i.e. it should
have been computed using envelope
with savefuns=TRUE
).
The simulated function estimates are extracted from object
,
and their squared error from the true value theo
is computed pointwise. The squared errors are integrated over the
interval specified by domain
, giving one value of integrated
squared error for each simulated function estimate. These values
are returned as a numerical vector.
Value
A numeric vector of length equal to the number of simulated functions.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Martin Hazelton Martin.Hazelton@otago.ac.nz and Tilman Davies Tilman.Davies@otago.ac.nz.
See Also
Examples
E <- envelope(cells, Kest, correction="translate", nsim=20, savefuns=TRUE)
theoK <- function(r) { pi * r^2 }
dom <- c(0, 0.1)
ISE.envelope(E, theoK, dom)
Estimate the I-function
Description
Estimates the summary function I(r)
for a multitype point pattern.
Usage
Iest(X, ..., eps=NULL, r=NULL, breaks=NULL, correction=NULL)
Arguments
X |
The observed point pattern,
from which an estimate of |
... |
Ignored. |
eps |
the resolution of the discrete approximation to Euclidean distance (see below). There is a sensible default. |
r |
Optional. Numeric vector of values for the argument |
breaks |
This argument is for internal use only. |
correction |
Optional. Vector of character strings specifying the edge correction(s)
to be used by |
Details
The I
function
summarises the dependence between types in a multitype point process
(Van Lieshout and Baddeley, 1999).
It is based on the concept of the J
function for an
unmarked point process (Van Lieshout and Baddeley, 1996).
See Jest
for information about the J
function.
The I
function is defined as
I(r) = \sum_{i=1}^m p_i J_{ii}(r) - J_{\bullet\bullet}(r)
where J_{\bullet\bullet}
is the J
function for
the entire point process ignoring the marks, while
J_{ii}
is the J
function for the
process consisting of points of type i
only,
and p_i
is the proportion of points which are of type i
.
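As an illustration only (not the internal algorithm), the defining combination can be reproduced from Jest on a common distance grid:
r <- seq(0, 0.15, by = 0.001)
p <- table(marks(amacrine)) / npoints(amacrine)   # type proportions p_i
Jall <- Jest(unmark(amacrine), r = r)             # J for the unmarked pattern
Jon  <- Jest(split(amacrine)$on,  r = r)          # J_ii for type "on"
Joff <- Jest(split(amacrine)$off, r = r)          # J_ii for type "off"
Imanual <- p[["on"]] * Jon$km + p[["off"]] * Joff$km - Jall$km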
The I
function is designed to measure dependence between
points of different types, even if the points are
not Poisson. Let X
be a stationary multitype point process,
and write X_i
for the process of points of type i
.
If the processes X_i
are independent of each other,
then the I
-function is identically equal to 0
.
Deviations I(r) < 0
or I(r) > 0
typically indicate negative and positive association, respectively,
between types.
See Van Lieshout and Baddeley (1999)
for further information.
An estimate of I
derived from a multitype spatial point pattern dataset
can be used in exploratory data analysis and formal inference
about the pattern. The estimate of I(r)
is compared against the
constant function 0
.
Deviations I(r) < 0
or I(r) > 0
may suggest negative and positive association, respectively.
This algorithm estimates the I
-function
from the multitype point pattern X
.
It assumes that X
can be treated
as a realisation of a stationary (spatially homogeneous)
random spatial marked point process in the plane, observed through
a bounded window.
The argument X
is interpreted as a point pattern object
(of class "ppp"
, see ppp.object
) and can
be supplied in any of the formats recognised by
as.ppp()
. It must be a multitype point pattern
(it must have a marks
vector which is a factor
).
The function Jest
is called to
compute estimates of the J
functions in the formula above.
In fact three different estimates are computed
using different edge corrections. See Jest
for
information.
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Essentially a data frame containing
r | the vector of values of the argument r at which I(r) has been estimated |
rs | the “reduced sample” or “border correction” estimator of I(r) |
km | the spatial Kaplan-Meier estimator of I(r) |
han | the Hanisch-style estimator of I(r) |
un | the uncorrected estimate of I(r) |
theo | the theoretical value of I(r) for a stationary Poisson process: identically equal to 0 |
Note
Sizeable amounts of memory may be needed during the calculation.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Van Lieshout, M.N.M. and Baddeley, A.J. (1996) A nonparametric measure of spatial interaction in point patterns. Statistica Neerlandica 50, 344–361.
Van Lieshout, M.N.M. and Baddeley, A.J. (1999) Indices of dependence between types in multivariate point patterns. Scandinavian Journal of Statistics 26, 511–532.
See Also
Examples
Ic <- Iest(amacrine)
plot(Ic, main="Amacrine Cells data")
# values are below I= 0, suggesting negative association
# between 'on' and 'off' cells.
Multitype J Function (i-to-j)
Description
For a multitype point pattern,
estimate the multitype J
function
summarising the interpoint dependence between
points of type i
and of type j
.
Usage
Jcross(X, i, j, eps=NULL, r=NULL, breaks=NULL, ..., correction=NULL)
Arguments
X |
The observed point pattern,
from which an estimate of the multitype |
i |
The type (mark value)
of the points in |
j |
The type (mark value)
of the points in |
eps |
A positive number. The resolution of the discrete approximation to Euclidean distance (see below). There is a sensible default. |
r |
Optional. Numeric vector. The values of the argument |
breaks |
This argument is for internal use only. |
... |
Ignored. |
correction |
Optional. Character string specifying the edge correction(s)
to be used. Options are |
Details
This function Jcross
and its companions
Jdot
and Jmulti
are generalisations of the function Jest
to multitype point patterns.
A multitype point pattern is a spatial pattern of points classified into a finite number of possible “colours” or “types”. In the spatstat package, a multitype pattern is represented as a single point pattern object in which the points carry marks, and the mark value attached to each point determines the type of that point.
The argument X
must be a point pattern (object of class
"ppp"
) or any data that are acceptable to as.ppp
.
It must be a marked point pattern, and the mark vector
X$marks
must be a factor.
The argument i
will be interpreted as a
level of the factor X$marks
. (Warning: this means that
an integer value i=3
will be interpreted as the number 3,
not the 3rd smallest level).
The “type i
to type j
” multitype J
function
of a stationary multitype point process X
was introduced by Van Lieshout and Baddeley (1999). It is defined by
J_{ij}(r) = \frac{1 - G_{ij}(r)}{1 -
F_{j}(r)}
where G_{ij}(r)
is the distribution function of
the distance from a type i
point to the nearest point of type j
,
and F_{j}(r)
is the distribution
function of the distance from a fixed point in space to the nearest
point of type j
in the pattern.
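As an illustration only (not the internal algorithm), the defining ratio can be reproduced from Gcross and Fest:
Ghat <- Gcross(amacrine, "on", "off")
Fhat <- Fest(split(amacrine)$off, r = Ghat$r)   # empty space function of the type j points
Jmanual <- (1 - Ghat$km) / (1 - Fhat$km)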
An estimate of J_{ij}(r)
is a useful summary statistic in exploratory data analysis
of a multitype point pattern.
If the subprocess of type i
points is independent
of the subprocess of points of type j
,
then J_{ij}(r) \equiv 1
.
Hence deviations of the empirical estimate of
J_{ij}
from the value 1
may suggest dependence between types.
This algorithm estimates J_{ij}(r)
from the point pattern X
. It assumes that X
can be treated
as a realisation of a stationary (spatially homogeneous)
random spatial point process in the plane, observed through
a bounded window.
The window (which is specified in X
as Window(X)
)
may have arbitrary shape.
Biases due to edge effects are
treated in the same manner as in Jest
,
using the Kaplan-Meier and border corrections.
The main work is done by Gmulti
and Fest
.
The argument r
is the vector of values for the
distance r
at which J_{ij}(r)
should be evaluated.
The values of r
must be increasing nonnegative numbers
and the maximum r
value must not exceed the radius of the
largest disc contained in the window.
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing six numeric columns
J | the recommended estimator of J_{ij}(r) |
r | the values of the argument r at which J_{ij}(r) has been estimated |
km | the Kaplan-Meier estimator of J_{ij}(r) |
rs | the “reduced sample” or “border correction” estimator of J_{ij}(r) |
han | the Hanisch-style estimator of J_{ij}(r) |
un | the “uncorrected” estimator of J_{ij}(r) |
theo | the theoretical value of J_{ij}(r) for a marked Poisson process, namely 1 |
The result also has two attributes "G"
and "F"
which are respectively the outputs of Gcross
and Fest
for the point pattern.
Warnings
The arguments i
and j
are always interpreted as
levels of the factor X$marks
. They are converted to character
strings if they are not already character strings.
The value i=1
does not
refer to the first level of the factor.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Van Lieshout, M.N.M. and Baddeley, A.J. (1996) A nonparametric measure of spatial interaction in point patterns. Statistica Neerlandica 50, 344–361.
Van Lieshout, M.N.M. and Baddeley, A.J. (1999) Indices of dependence between types in multivariate point patterns. Scandinavian Journal of Statistics 26, 511–532.
See Also
Examples
# Lansing woods data: 6 types of trees
woods <- lansing
Jhm <- Jcross(woods, "hickory", "maple")
# diagnostic plot for independence between hickories and maples
plot(Jhm)
# synthetic example with two types "a" and "b"
pp <- runifpoint(30) %mark% factor(sample(c("a","b"), 30, replace=TRUE))
J <- Jcross(pp)
Inhomogeneous Multitype J function (i-to-j)
Description
For a multitype point pattern,
estimate the inhomogeneous multitype J
function
summarising the interpoint dependence between
points of type i
and of type j
.
Usage
Jcross.inhom(X, i, j,
lambda = NULL, lambdaI = NULL, lambdaJ = NULL,
lambdamin = NULL,
...,
r = NULL, ReferenceMeasureMarkSetI = NULL, ratio = FALSE)
Arguments
X |
The observed point pattern,
from which an estimate of the multitype |
i |
The type (mark value)
of the points in |
j |
The type (mark value)
of the points in |
lambda |
Optional.
Values of the estimated intensity of the point process.
Either a pixel image (object of class |
lambdaI |
Optional.
Values of the estimated intensity of the sub-process of
points of type |
lambdaJ |
Optional.
Values of the estimated intensity of the sub-process of
points of type |
lambdamin |
Optional. The minimum possible value of the intensity over the spatial domain. A positive numerical value. |
... |
Extra arguments passed to |
r |
vector of values for the argument |
ReferenceMeasureMarkSetI |
Optional. The total measure of the mark set. A positive number. |
ratio |
Logical value indicating whether to save ratio information. |
Details
This function is the counterpart of Jcross
for inhomogeneous patterns. It is computed as a special case
of Jmulti.inhom
.
Value
Object of class "fv"
containing the estimate of the
inhomogeneous multitype J
function.
Author(s)
Jonatan Gonzalez and Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Cronie, O. and Van Lieshout, M.N.M. (2015) Summary statistics for inhomogeneous marked point processes. Annals of the Institute of Statistical Mathematics DOI: 10.1007/s10463-015-0515-z
See Also
Jdot.inhom
, Jmulti.inhom
,
Jcross
.
Examples
X <- rescale(amacrine)
if(interactive() && require(spatstat.model)) {
## how to do it normally
mod <- ppm(X ~ marks * x)
lam <- fitted(mod, dataonly=TRUE)
lmin <- min(predict(mod)[["off"]]) * 0.9
dd <- NULL
} else {
## for package testing
lam <- intensity(X)[as.integer(marks(X))]
lmin <- intensity(X)[2] * 0.9
dd <- 32
}
JC <- Jcross.inhom(X, "on", "off", lambda=lam, lambdamin=lmin, dimyx=dd)
Multitype J Function (i-to-any)
Description
For a multitype point pattern,
estimate the multitype J
function
summarising the interpoint dependence between
the type i
points and the points of any type.
Usage
Jdot(X, i, eps=NULL, r=NULL, breaks=NULL, ..., correction=NULL)
Arguments
X |
The observed point pattern,
from which an estimate of the multitype |
i |
The type (mark value)
of the points in |
eps |
A positive number. The resolution of the discrete approximation to Euclidean distance (see below). There is a sensible default. |
r |
numeric vector. The values of the argument |
breaks |
This argument is for internal use only. |
... |
Ignored. |
correction |
Optional. Character string specifying the edge correction(s)
to be used. Options are |
Details
This function Jdot
and its companions
Jcross
and Jmulti
are generalisations of the function Jest
to multitype point patterns.
A multitype point pattern is a spatial pattern of points classified into a finite number of possible “colours” or “types”. In the spatstat package, a multitype pattern is represented as a single point pattern object in which the points carry marks, and the mark value attached to each point determines the type of that point.
The argument X
must be a point pattern (object of class
"ppp"
) or any data that are acceptable to as.ppp
.
It must be a marked point pattern, and the mark vector
X$marks
must be a factor.
The argument i
will be interpreted as a
level of the factor X$marks
. (Warning: this means that
an integer value i=3
will be interpreted as the number 3,
not the 3rd smallest level.)
The “type i
to any type” multitype J
function
of a stationary multitype point process X
was introduced by Van Lieshout and Baddeley (1999). It is defined by
J_{i\bullet}(r) = \frac{1 - G_{i\bullet}(r)}{1 -
F_{\bullet}(r)}
where G_{i\bullet}(r)
is the distribution function of
the distance from a type i
point to the nearest other point
of the pattern, and F_{\bullet}(r)
is the distribution
function of the distance from a fixed point in space to the nearest
point of the pattern.
An estimate of J_{i\bullet}(r)
is a useful summary statistic in exploratory data analysis
of a multitype point pattern. If the pattern is
a marked Poisson point process, then
J_{i\bullet}(r) \equiv 1
.
If the subprocess of type i
points is independent
of the subprocess of points of all types not equal to i
,
then J_{i\bullet}(r)
equals
J_{ii}(r)
, the ordinary J
function
(see Jest
and Van Lieshout and Baddeley (1996))
of the points of type i
.
Hence deviations from zero of the empirical estimate of
J_{i\bullet} - J_{ii}
may suggest dependence between types.
This algorithm estimates J_{i\bullet}(r)
from the point pattern X
. It assumes that X
can be treated
as a realisation of a stationary (spatially homogeneous)
random spatial point process in the plane, observed through
a bounded window.
The window (which is specified in X
as Window(X)
)
may have arbitrary shape.
Biases due to edge effects are
treated in the same manner as in Jest
,
using the Kaplan-Meier and border corrections.
The main work is done by Gmulti
and Fest
.
The argument r
is the vector of values for the
distance r
at which J_{i\bullet}(r)
should be evaluated.
The values of r
must be increasing nonnegative numbers
and the maximum r
value must not exceed the radius of the
largest disc contained in the window.
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing six numeric columns
J | the recommended estimator of J_{i\bullet}(r) |
r | the values of the argument r at which J_{i\bullet}(r) has been estimated |
km | the Kaplan-Meier estimator of J_{i\bullet}(r) |
rs | the “reduced sample” or “border correction” estimator of J_{i\bullet}(r) |
han | the Hanisch-style estimator of J_{i\bullet}(r) |
un | the “uncorrected” estimator of J_{i\bullet}(r) |
theo | the theoretical value of J_{i\bullet}(r) for a marked Poisson process, namely 1 |
The result also has two attributes "G"
and "F"
which are respectively the outputs of Gdot
and Fest
for the point pattern.
Warnings
The argument i
is interpreted as
a level of the factor X$marks
. It is converted to a character
string if it is not already a character string.
The value i=1
does not
refer to the first level of the factor.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net.
References
Van Lieshout, M.N.M. and Baddeley, A.J. (1996) A nonparametric measure of spatial interaction in point patterns. Statistica Neerlandica 50, 344–361.
Van Lieshout, M.N.M. and Baddeley, A.J. (1999) Indices of dependence between types in multivariate point patterns. Scandinavian Journal of Statistics 26, 511–532.
See Also
Examples
# Lansing woods data: 6 types of trees
woods <- lansing
Jh. <- Jdot(woods, "hickory")
plot(Jh.)
# diagnostic plot for independence between hickories and other trees
Jhh <- Jest(split(woods)$hickory)
plot(Jhh, add=TRUE, legendpos="bottom")
# synthetic example with two marks "a" and "b"
pp <- runifpoint(30) %mark% factor(sample(c("a","b"), 30, replace=TRUE))
J <- Jdot(pp, "a")
Inhomogeneous Multitype J function (i-to-any)
Description
For a multitype point pattern,
estimate the inhomogeneous multitype J
function
summarising the interpoint dependence between
points of type i
and points of any type.
Usage
Jdot.inhom(X, i,
lambdaI = NULL, lambdadot = NULL,
lambdamin = NULL,
...,
r = NULL, ReferenceMeasureMarkSetI = NULL, ratio = FALSE)
Arguments
X |
The observed point pattern,
from which an estimate of the inhomogeneous multitype |
i |
The type (mark value)
of the points in |
lambdaI |
Optional.
Values of the estimated intensity of the sub-process of
points of type |
lambdadot |
Optional.
Values of the estimated intensity of the point process.
Either a pixel image (object of class |
lambdamin |
Optional. The minimum possible value of the intensity over the spatial domain. A positive numerical value. |
... |
Extra arguments passed to |
r |
vector of values for the argument |
ReferenceMeasureMarkSetI |
Optional. The total measure of the mark set. A positive number. |
ratio |
Logical value indicating whether to save ratio information. |
Details
This function is the counterpart of Jdot
for inhomogeneous patterns. It is computed as a special case
of Jmulti.inhom
.
Value
Object of class "fv"
containing the estimate of the
inhomogeneous multitype J
function.
Author(s)
Jonatan Gonzalez and Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Cronie, O. and Van Lieshout, M.N.M. (2015) Summary statistics for inhomogeneous marked point processes. Annals of the Institute of Statistical Mathematics DOI: 10.1007/s10463-015-0515-z
See Also
Jcross.inhom, Jmulti.inhom, Jdot.
Examples
X <- rescale(amacrine)
if(interactive() && require(spatstat.model)) {
  ## how to do it normally
  mod <- ppm(X ~ marks * x)
  lam <- fitted(mod, dataonly=TRUE)
  lmin <- min(predict(mod)[["off"]]) * 0.9
  dd <- NULL
} else {
  ## for package testing
  lam <- intensity(X)[as.integer(marks(X))]
  lmin <- intensity(X)[2] * 0.9
  dd <- 32
}
lamI <- lam[marks(X) == "on"]
JD <- Jdot.inhom(X, "on", lambdaI=lamI, lambdadot=lam, lambdamin=lmin,
                 dimyx=dd)
Estimate the J-function
Description
Estimates the summary function J(r)
for a point pattern in a
window of arbitrary shape.
Usage
Jest(X, ..., eps=NULL, r=NULL, breaks=NULL, correction=NULL)
Arguments
X |
The observed point pattern,
from which an estimate of |
... |
Ignored. |
eps |
the resolution of the discrete approximation to Euclidean distance (see below). There is a sensible default. |
r |
vector of values for the argument |
breaks |
This argument is for internal use only. |
correction |
Optional. Character string specifying the choice of edge
correction(s) in |
Details
The J
function (Van Lieshout and Baddeley, 1996)
of a stationary point process is defined as
J(r) = \frac{1-G(r)}{1-F(r)}
where G(r)
is the nearest neighbour distance distribution
function of the point process (see Gest
)
and F(r)
is its empty space function (see Fest
).
For a completely random (uniform Poisson) point process,
the J
-function is identically equal to 1
.
Deviations J(r) < 1
or J(r) > 1
typically indicate spatial clustering or spatial regularity, respectively.
The J
-function is one of the few characteristics that can be
computed explicitly for a wide range of point processes.
See Van Lieshout and Baddeley (1996), Baddeley et al (2000),
Thonnes and Van Lieshout (1999) for further information.
An estimate of J
derived from a spatial point pattern dataset
can be used in exploratory data analysis and formal inference
about the pattern. The estimate of J(r)
is compared against the
constant function 1
.
Deviations J(r) < 1
or J(r) > 1
may suggest spatial clustering or spatial regularity, respectively.
This algorithm estimates the J
-function
from the point pattern X
. It assumes that X
can be treated
as a realisation of a stationary (spatially homogeneous)
random spatial point process in the plane, observed through
a bounded window.
The window (which is specified in X
as Window(X)
)
may have arbitrary shape.
The argument X
is interpreted as a point pattern object
(of class "ppp"
, see ppp.object
) and can
be supplied in any of the formats recognised by
as.ppp()
.
The functions Fest
and Gest
are called to
compute estimates of F(r)
and G(r)
respectively.
These estimates are then combined by simply taking the ratio
J(r) = (1-G(r))/(1-F(r))
.
In fact several different estimates are computed using different edge corrections (Baddeley, 1998).
The Kaplan-Meier estimate (returned as km
) is the ratio
J = (1-G)/(1-F)
of the Kaplan-Meier estimates of
1-F
and 1-G
computed by
Fest
and Gest
respectively.
This is computed if correction=NULL
or if correction
includes "km"
.
The Hanisch-style estimate (returned as han
) is the ratio
J = (1-G)/(1-F)
where F
is the Chiu-Stoyan estimate of
F
and G
is the Hanisch estimate of G
.
This is computed if correction=NULL
or if correction
includes "cs"
or "han"
.
The reduced-sample or border corrected estimate
(returned as rs
) is
the same ratio J = (1-G)/(1-F)
of the border corrected estimates.
This is computed if correction=NULL
or if correction
includes "rs"
or "border"
.
These edge-corrected estimators are slightly biased for J
,
since they are ratios
of approximately unbiased estimators.
The logarithm of the
Kaplan-Meier estimate is exactly unbiased for \log J
.
The uncorrected estimate (returned as un
and computed only if correction
includes "none"
)
is the ratio J = (1-G)/(1-F)
of the uncorrected (“raw”) estimates of the survival functions
of F
and G
,
which are the empirical distribution functions of the
empty space distances Fest(X,...)$raw
and of the nearest neighbour distances
Gest(X,...)$raw
. The uncorrected estimates
of F
and G
are severely biased.
However the uncorrected estimate of J
is approximately unbiased (if the process is close to Poisson);
it is insensitive to edge effects, and should be used when
edge effects are severe (see Baddeley et al, 2000).
The algorithm for Fest
uses two discrete approximations which are controlled
by the parameter eps
and by the spacing of values of r
respectively. See Fest
for details.
First-time users are strongly advised not to specify these arguments.
Note that the value returned by Jest
includes
the output of Fest
and Gest
as attributes, as shown in the sketch below.
If the user is intending to compute the F,G
and J
functions for the point pattern, it is only necessary to
call Jest
.
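A minimal sketch of retrieving these attributes, using the cells data:
# Sketch: one call to Jest() also yields the F and G estimates.
J  <- Jest(cells)
Fc <- attr(J, "F")    # the output of Fest()
Gc <- attr(J, "G")    # the output of Gest()
plot(Fc, main = "F function for cells")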
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Essentially a data frame containing
r |
the vector of values of the argument |
rs |
the “reduced sample” or “border correction”
estimator of |
km |
the spatial Kaplan-Meier estimator of |
han |
the Hanisch-style estimator of |
un |
the uncorrected estimate of |
theo |
the theoretical value of |
The data frame also has attributes
F |
the output of |
G |
the output of |
Note
Sizeable amounts of memory may be needed during the calculation.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Baddeley, A.J. Spatial sampling and censoring. In O.E. Barndorff-Nielsen, W.S. Kendall and M.N.M. van Lieshout (eds) Stochastic Geometry: Likelihood and Computation. Chapman and Hall, 1998. Chapter 2, pages 37–78.
Baddeley, A.J. and Gill, R.D. The empty space hazard of a spatial pattern. Research Report 1994/3, Department of Mathematics, University of Western Australia, May 1994.
Baddeley, A.J. and Gill, R.D. Kaplan-Meier estimators of interpoint distance distributions for spatial point processes. Annals of Statistics 25 (1997) 263–292.
Baddeley, A., Kerscher, M., Schladitz, K. and Scott, B.T. Estimating the J function without edge correction. Statistica Neerlandica 54 (2000) 315–328.
Borgefors, G. Distance transformations in digital images. Computer Vision, Graphics and Image Processing 34 (1986) 344–371.
Cressie, N.A.C. Statistics for spatial data. John Wiley and Sons, 1991.
Diggle, P.J. Statistical analysis of spatial point patterns. Academic Press, 1983.
Ripley, B.D. Statistical inference for spatial processes. Cambridge University Press, 1988.
Stoyan, D, Kendall, W.S. and Mecke, J. Stochastic geometry and its applications. 2nd edition. Springer Verlag, 1995.
Thonnes, E. and Van Lieshout, M.N.M. A comparative study on the power of Van Lieshout and Baddeley's J-function. Biometrical Journal 41 (1999) 721–734.
Van Lieshout, M.N.M. and Baddeley, A.J. A nonparametric measure of spatial interaction in point patterns. Statistica Neerlandica 50 (1996) 344–361.
See Also
Jinhom
,
Fest
,
Gest
,
Kest
,
km.rs
,
reduced.sample
,
kaplan.meier
Examples
J <- Jest(cells, eps=0.01)
plot(J, main="cells data")
# values are far above J = 1, indicating regular pattern
J <- Jest(redwood, eps=0.01)
plot(J, main="redwood data", legendpos="center")
# values are below J = 1, indicating clustered pattern
Inhomogeneous J-function
Description
Estimates the inhomogeneous J
function of
a non-stationary point pattern.
Usage
Jinhom(X, lambda = NULL, lmin = NULL, ...,
sigma = NULL, varcov = NULL,
r = NULL, breaks = NULL, ratio=FALSE,
update = TRUE, warn.bias=TRUE, savelambda=FALSE)
Arguments
X |
The observed data point pattern,
from which an estimate of the inhomogeneous |
lambda |
Optional.
Values of the estimated intensity function.
Either a vector giving the intensity values
at the points of the pattern |
lmin |
Optional. The minimum possible value of the intensity over the spatial domain. A positive numerical value. |
sigma , varcov |
Optional arguments passed to |
... |
Extra arguments passed to |
r |
vector of values for the argument |
breaks |
This argument is for internal use only. |
ratio |
Logical.
If |
update |
Logical. If |
warn.bias |
Logical value specifying whether to issue a warning when the inhomogeneity correction factor takes extreme values, which can often lead to biased results. This usually occurs when insufficient smoothing is used to estimate the intensity. |
savelambda |
Logical value specifying whether to save the values of
|
Details
This command computes estimates of the
inhomogeneous J
-function (Van Lieshout, 2010)
of a point pattern. It is the counterpart, for inhomogeneous
spatial point patterns, of the J
function
for homogeneous point patterns computed by Jest
.
The argument X
should be a point pattern
(object of class "ppp"
).
The inhomogeneous J
function is computed as
Jinhom(r) = (1 - Ginhom(r))/(1-Finhom(r))
where Ginhom, Finhom
are the inhomogeneous G
and F
functions computed using the border correction
(equations (7) and (6) respectively in Van Lieshout, 2010).
The argument lambda
should supply the
(estimated) values of the intensity function \lambda
of the point process. It may be either
- a numeric vector, containing the values of the intensity function at the points of the pattern X;
- a pixel image (object of class "im"), assumed to contain the values of the intensity function at all locations in the window;
- a fitted point process model (object of class "ppm" or "kppm") whose fitted trend can be used as the fitted intensity (if update=TRUE, the model will first be refitted to the data X before the trend is computed);
- a function, which can be evaluated to give values of the intensity at any locations;
- omitted: if lambda is omitted, then it will be estimated using a ‘leave-one-out’ kernel smoother.
If lambda
is a numeric vector, then its length should
be equal to the number of points in the pattern X
.
The value lambda[i]
is assumed to be the (estimated) value of the intensity
\lambda(x_i)
for
the point x_i
of the pattern X
.
Each value must be a positive number; NA
's are not allowed.
If lambda
is a pixel image, the domain of the image should
cover the entire window of the point pattern. If it does not (which
may occur near the boundary because of discretisation error),
then the missing pixel values
will be obtained by applying a Gaussian blur to lambda
using
blur
, then looking up the values of this blurred image
for the missing locations.
(A warning will be issued in this case.)
If lambda
is a function, then it will be evaluated in the
form lambda(x,y)
where x
and y
are vectors
of coordinates of the points of X
. It should return a numeric
vector with length equal to the number of points in X
.
If lambda
is omitted, then it will be estimated using
a ‘leave-one-out’ kernel smoother.
The estimate lambda[i]
for the
point X[i]
is computed by removing X[i]
from the
point pattern, applying kernel smoothing to the remaining points using
density.ppp
, and evaluating the smoothed intensity
at the point X[i]
. The smoothing kernel bandwidth is controlled
by the arguments sigma
and varcov
, which are passed to
density.ppp
along with any extra arguments.
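A minimal sketch of supplying lambda explicitly, which is equivalent (up to the choice of bandwidth) to the automatic leave-one-out estimate; the bandwidth sigma = 10 and the 0.9 safety factor for lmin are arbitrary illustrative choices:
# Sketch: explicit leave-one-out intensity estimate passed to Jinhom().
lam <- density(swedishpines, sigma = 10, at = "points", leaveoneout = TRUE)
Jin <- Jinhom(swedishpines, lambda = lam, lmin = 0.9 * min(lam))  # heuristic lmin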
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Author(s)
Original code by Marie-Colette van Lieshout. C implementation and R adaptation by Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Ege Rubak rubak@math.aau.dk.
References
van Lieshout, M.N.M. and Baddeley, A.J. (1996) A nonparametric measure of spatial interaction in point patterns. Statistica Neerlandica 50, 344–361.
van Lieshout, M.N.M. (2010) A J-function for inhomogeneous point processes. Statistica Neerlandica 65, 183–201.
See Also
Jest, Ginhom, Finhom.
Examples
online <- interactive()
if(online) {
  plot(Jinhom(swedishpines, sigma=10))
  plot(Jinhom(swedishpines, sigma=bw.diggle, adjust=2))
} else {
  ## use a coarse grid for faster computation and package testing
  plot(Jinhom(swedishpines, sigma=10, dimyx=32))
}
Marked J Function
Description
For a marked point pattern,
estimate the multitype J
function
summarising dependence between the
points in subset I
and those in subset J
.
Usage
Jmulti(X, I, J, eps=NULL, r=NULL, breaks=NULL, ..., disjoint=NULL,
correction=NULL)
Arguments
X |
The observed point pattern,
from which an estimate of the multitype distance distribution function
|
I |
Subset of points of |
J |
Subset of points in |
eps |
A positive number.
The pixel resolution of the discrete approximation to Euclidean
distance (see |
r |
numeric vector. The values of the argument |
breaks |
This argument is for internal use only. |
... |
Ignored. |
disjoint |
Optional flag indicating whether
the subsets |
correction |
Optional. Character string specifying the edge correction(s)
to be used. Options are |
Details
The function Jmulti
generalises Jest
(for unmarked point
patterns) and Jdot
and Jcross
(for
multitype point patterns) to arbitrary marked point patterns.
Suppose X_I
, X_J
are subsets, possibly
overlapping, of a marked point process. Define
J_{IJ}(r) = \frac{1 - G_{IJ}(r)}{1 - F_J(r)}
where F_J(r)
is the cumulative distribution function of
the distance from a fixed location to the nearest point
of X_J
, and G_{IJ}(r)
is the distribution function of the distance
from a typical point of X_I
to the nearest distinct point of
X_J
.
The argument X
must be a point pattern (object of class
"ppp"
) or any data that are acceptable to as.ppp
.
The arguments I
and J
specify two subsets of the
point pattern. They may be any type of subset indices, for example,
logical vectors of length equal to npoints(X)
,
or integer vectors with entries in the range 1 to
npoints(X)
, or negative integer vectors.
Alternatively, I
and J
may be functions
that will be applied to the point pattern X
to obtain
index vectors. If I
is a function, then evaluating
I(X)
should yield a valid subset index. This option
is useful when generating simulation envelopes using
envelope
.
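A minimal sketch of this function form of I and J, using the longleaf data (the diameter thresholds 15 and 25 are arbitrary illustrative values):
# Sketch: I and J supplied as functions that return subset indices.
smallI <- function(X) marks(X) <= 15   # selects small trees
largeJ <- function(X) marks(X) >= 25   # selects large trees
Jm <- Jmulti(longleaf, smallI, largeJ)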
It is assumed that X
can be treated
as a realisation of a stationary (spatially homogeneous)
random spatial point process in the plane, observed through
a bounded window.
The window (which is specified in X
as Window(X)
)
may have arbitrary shape.
Biases due to edge effects are
treated in the same manner as in Jest
.
The argument r
is the vector of values for the
distance r
at which J_{IJ}(r)
should be evaluated.
It is also used to determine the breakpoints
(in the sense of hist
)
for the computation of histograms of distances. The reduced-sample and
Kaplan-Meier estimators are computed from histogram counts.
In the case of the Kaplan-Meier estimator this introduces a discretisation
error which is controlled by the fineness of the breakpoints.
First-time users would be strongly advised not to specify r
.
However, if it is specified, r
must satisfy r[1] = 0
,
and max(r)
must be larger than the radius of the largest disc
contained in the window. Furthermore, the successive entries of r
must be finely spaced.
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing six numeric columns
r |
the values of the argument |
rs |
the “reduced sample” or “border correction”
estimator of |
km |
the spatial Kaplan-Meier estimator of |
han |
the Hanisch-style estimator of |
un |
the uncorrected estimate of |
theo |
the theoretical value of |
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Van Lieshout, M.N.M. and Baddeley, A.J. (1999) Indices of dependence between types in multivariate point patterns. Scandinavian Journal of Statistics 26, 511–532.
See Also
Jcross, Jdot, Jest.
Examples
trees <- longleaf
# Longleaf Pine data: marks represent diameter
Jm <- Jmulti(trees, marks(trees) <= 15, marks(trees) >= 25)
plot(Jm)
Inhomogeneous Marked J-Function
Description
For a marked point pattern,
estimate the inhomogeneous version of the multitype J
function.
Usage
Jmulti.inhom(X, I, J,
lambda = NULL, lambdaI = NULL, lambdaJ = NULL,
lambdamin = NULL,
...,
r = NULL,
ReferenceMeasureMarkSetI = NULL,
ratio = FALSE)
Arguments
X |
The observed point pattern,
from which an estimate of the inhomogeneous multitype |
I |
Subset index specifying the points of |
J |
Subset index specifying the points in |
lambda |
Optional.
Values of the estimated intensity function.
Either a vector giving the intensity values
at the points of the pattern |
lambdaI |
Optional.
Values of the estimated intensity of the sub-process |
lambdaJ |
Optional.
Values of the estimated intensity of the sub-process |
lambdamin |
Optional. The minimum possible value of the intensity over the spatial domain. A positive numerical value. |
... |
Extra arguments passed to |
r |
vector of values for the argument |
ReferenceMeasureMarkSetI |
Optional. The total measure of the mark set. A positive number. |
ratio |
Logical value indicating whether to save ratio information. |
Details
This function is the counterpart of Jmulti
for inhomogeneous patterns. It is computed by evaluating the
inhomogeneous G
function GmultiInhom
and the inhomogeneous F
function FmultiInhom
and computing the ratio J = (1-G)/(1-F)
.
Value
Object of class "fv"
containing the estimate of the
inhomogeneous multitype J
function.
Author(s)
Jonatan Gonzalez and Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Cronie, O. and Van Lieshout, M.N.M. (2015) Summary statistics for inhomogeneous marked point processes. Annals of the Institute of Statistical Mathematics DOI: 10.1007/s10463-015-0515-z
See Also
Jcross.inhom
, Jdot.inhom
for special cases.
GmultiInhom
, FmultiInhom
,
Jmulti
.
Examples
X <- rescale(amacrine)
I <- (marks(X) == "on")
J <- (marks(X) == "off")
if(interactive() && require(spatstat.model)) {
  ## how to do it normally
  mod <- ppm(X ~ marks * x)
  lam <- fitted(mod, dataonly=TRUE)
  lmin <- min(predict(mod)[["off"]]) * 0.9
  dd <- NULL
} else {
  ## for package testing
  lam <- intensity(X)[as.integer(marks(X))]
  lmin <- intensity(X)[2] * 0.9
  dd <- 32
}
JM <- Jmulti.inhom(X, I, J, lambda=lam, lambdamin=lmin, dimyx=dd)
K-function of a Three-Dimensional Point Pattern
Description
Estimates the K
-function from a three-dimensional point pattern.
Usage
K3est(X, ...,
rmax = NULL, nrval = 128,
correction = c("translation", "isotropic"),
ratio=FALSE)
Arguments
X |
Three-dimensional point pattern (object of class |
... |
Ignored. |
rmax |
Optional. Maximum value of argument |
nrval |
Optional. Number of values of |
correction |
Optional. Character vector specifying the edge correction(s) to be applied. See Details. |
ratio |
Logical.
If |
Details
For a stationary point process \Phi
in three-dimensional
space, the three-dimensional K
function
is
K_3(r) = \frac 1 \lambda E(N(\Phi, x, r) \mid x \in \Phi)
where \lambda
is the intensity of the process
(the expected number of points per unit volume) and
N(\Phi,x,r)
is the number of points of
\Phi
, other than x
itself, which fall within a
distance r
of x
. This is the three-dimensional
generalisation of Ripley's K
function for two-dimensional
point processes (Ripley, 1977).
The three-dimensional point pattern X
is assumed to be a
partial realisation of a stationary point process \Phi
.
The distance between each pair of distinct points is computed.
The empirical cumulative distribution
function of these values, with appropriate edge corrections, is
renormalised to give the estimate of K_3(r)
.
The available edge corrections are:
- "translation": the Ohser translation correction estimator (Ohser, 1983; Baddeley et al, 1993)
- "isotropic": the three-dimensional counterpart of Ripley's isotropic edge correction (Ripley, 1977; Baddeley et al, 1993).
Alternatively correction="all"
selects all options.
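For a uniform Poisson process the theoretical value is K_3(r) = \frac{4}{3} \pi r^3, which appears as the theo column of the result. A minimal sketch (the intensity 100 is an arbitrary choice):
# Sketch: estimate with all edge corrections and inspect the result.
X3 <- rpoispp3(100)                  # Poisson pattern in the unit box
K3 <- K3est(X3, correction = "all")
head(as.data.frame(K3))              # includes the Poisson value 'theo'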
Value
A function value table (object of class "fv"
) that can be
plotted, printed or coerced to a data frame containing the function values.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rana Moyeed.
References
Baddeley, A.J, Moyeed, R.A., Howard, C.V. and Boyde, A. (1993) Analysis of a three-dimensional point pattern with replication. Applied Statistics 42, 641–668.
Ohser, J. (1983) On estimators for the reduced second moment measure of point processes. Mathematische Operationsforschung und Statistik, series Statistics, 14, 63 – 71.
Ripley, B.D. (1977) Modelling spatial patterns (with discussion). Journal of the Royal Statistical Society, Series B, 39, 172 – 212.
See Also
pp3
to create a three-dimensional point
pattern (object of class "pp3"
).
pcf3est
,
F3est
,
G3est
for other summary functions of
a three-dimensional point pattern.
Kest
to estimate the K
-function of
point patterns in two dimensions or other spaces.
Examples
X <- rpoispp3(42)
Z <- K3est(X)
if(interactive()) plot(Z)
Multitype K Function (Cross-type)
Description
For a multitype point pattern,
estimate the multitype K
function
which counts the expected number of points of type j
within a given distance of a point of type i
.
Usage
Kcross(X, i, j, r=NULL, breaks=NULL, correction,
       ..., ratio=FALSE, from, to)
Arguments
X |
The observed point pattern,
from which an estimate of the cross type |
i |
The type (mark value)
of the points in |
j |
The type (mark value)
of the points in |
r |
numeric vector. The values of the argument |
breaks |
This argument is for internal use only. |
correction |
A character vector containing any selection of the
options |
... |
Ignored. |
ratio |
Logical.
If |
from , to |
An alternative way to specify |
Details
This function Kcross
and its companions
Kdot
and Kmulti
are generalisations of the function Kest
to multitype point patterns.
A multitype point pattern is a spatial pattern of points classified into a finite number of possible “colours” or “types”. In the spatstat package, a multitype pattern is represented as a single point pattern object in which the points carry marks, and the mark value attached to each point determines the type of that point.
The argument X
must be a point pattern (object of class
"ppp"
) or any data that are acceptable to as.ppp
.
It must be a marked point pattern, and the mark vector
X$marks
must be a factor.
The arguments i
and j
will be interpreted as
levels of the factor X$marks
.
If i
and j
are missing, they default to the first
and second level of the marks factor, respectively.
The “cross-type” (type i
to type j
)
K
function
of a stationary multitype point process X
is defined so that
\lambda_j K_{ij}(r)
equals the expected number of
additional random points of type j
within a distance r
of a
typical point of type i
in the process X
.
Here \lambda_j
is the intensity of the type j
points,
i.e. the expected number of points of type j
per unit area.
The function K_{ij}
is determined by the
second order moment properties of X
.
An estimate of K_{ij}(r)
is a useful summary statistic in exploratory data analysis
of a multitype point pattern.
If the process of type i
points
were independent of the process of type j
points,
then K_{ij}(r)
would equal \pi r^2
.
Deviations between the empirical K_{ij}
curve
and the theoretical curve \pi r^2
may suggest dependence between the points of types i
and j
.
This algorithm estimates the function K_{ij}(r)
from the point pattern X
. It assumes that X
can be treated
as a realisation of a stationary (spatially homogeneous)
random spatial point process in the plane, observed through
a bounded window.
The window (which is specified in X
as Window(X)
)
may have arbitrary shape.
Biases due to edge effects are
treated in the same manner as in Kest
,
using the border correction.
The argument r
is the vector of values for the
distance r
at which K_{ij}(r)
should be evaluated.
The values of r
must be increasing nonnegative numbers
and the maximum r
value must not exceed the radius of the
largest disc contained in the window.
The pair correlation function can also be applied to the
result of Kcross
; see pcf
.
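A minimal sketch (the smoothing choices method="b" and spar=0.7 are arbitrary):
# Sketch: cross-type pair correlation derived from Kcross via pcf().
K01 <- Kcross(amacrine, "off", "on")
g01 <- pcf(K01, method = "b", spar = 0.7)   # estimate g from the K estimate
plot(g01)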
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing numeric columns
r |
the values of the argument |
theo |
the theoretical value of |
together with a column or columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function K_{ij}(r)
obtained by the edge corrections named.
If ratio=TRUE
then the return value also has two
attributes called "numerator"
and "denominator"
which are "fv"
objects
containing the numerators and denominators of each
estimate of K(r)
.
Warnings
The arguments i
and j
are always interpreted as
levels of the factor X$marks
. They are converted to character
strings if they are not already character strings.
The value i=1
does not
refer to the first level of the factor.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net.
References
Cressie, N.A.C. Statistics for spatial data. John Wiley and Sons, 1991.
Diggle, P.J. Statistical analysis of spatial point patterns. Academic Press, 1983.
Harkness, R.D and Isham, V. (1983) A bivariate spatial point pattern of ants' nests. Applied Statistics 32, 293–303
Lotwick, H. W. and Silverman, B. W. (1982). Methods for analysing spatial processes of several types of points. J. Royal Statist. Soc. Ser. B 44, 406–413.
Ripley, B.D. Statistical inference for spatial processes. Cambridge University Press, 1988.
Stoyan, D, Kendall, W.S. and Mecke, J. Stochastic geometry and its applications. 2nd edition. Springer Verlag, 1995.
See Also
Kdot, Kest, Kmulti, pcf.
Examples
# amacrine cells data
K01 <- Kcross(amacrine, "off", "on")
plot(K01)
# synthetic example: point pattern with marks 0 and 1
pp <- runifpoispp(50)
pp <- pp %mark% factor(sample(0:1, npoints(pp), replace=TRUE))
K <- Kcross(pp, "0", "1")
K <- Kcross(pp, 0, 1) # equivalent
Inhomogeneous Cross K Function
Description
For a multitype point pattern,
estimate the inhomogeneous version of the cross K
function,
which counts the expected number of points of type j
within a given distance of a point of type i
,
adjusted for spatially varying intensity.
Usage
Kcross.inhom(X, i, j, lambdaI=NULL, lambdaJ=NULL, ..., r=NULL, breaks=NULL,
correction = c("border", "isotropic", "Ripley", "translate"),
sigma=NULL, varcov=NULL,
lambdaIJ=NULL,
lambdaX=NULL, update=TRUE, leaveoneout=TRUE)
Arguments
X |
The observed point pattern,
from which an estimate of the inhomogeneous cross type |
i |
The type (mark value)
of the points in |
j |
The type (mark value)
of the points in |
lambdaI |
Optional.
Values of the estimated intensity of the sub-process of
points of type |
lambdaJ |
Optional.
Values of the estimated intensity of the sub-process of
points of type |
r |
Optional. Numeric vector giving the values of the argument |
breaks |
This argument is for advanced use only. |
correction |
A character vector containing any selection of the
options |
... |
Ignored. |
sigma |
Standard deviation of isotropic Gaussian smoothing kernel,
used in computing leave-one-out kernel estimates of
|
varcov |
Variance-covariance matrix of anisotropic Gaussian kernel,
used in computing leave-one-out kernel estimates of
|
lambdaIJ |
Optional. A matrix containing estimates of the
product of the intensities |
lambdaX |
Optional. Values of the intensity for all points of |
update |
Logical value indicating what to do when
|
leaveoneout |
Logical value (passed to |
Details
This is a generalisation of the function Kcross
to include an adjustment for spatially inhomogeneous intensity,
in a manner similar to the function Kinhom
.
The inhomogeneous cross-type K
function is described by
Moller and Waagepetersen (2003, pages 48-49 and 51-53).
Briefly, given a multitype point process, suppose the sub-process
of points of type j
has intensity function
\lambda_j(u)
at spatial locations u
.
Suppose we place a mass of 1/\lambda_j(\zeta)
at each point \zeta
of type j
. Then the expected total
mass per unit area is 1. The
inhomogeneous “cross-type” K
function
K_{ij}^{\mbox{inhom}}(r)
equals the expected
total mass within a radius r
of a point of the process
of type i
.
If the process of type i
points
were independent of the process of type j
points,
then K_{ij}^{\mbox{inhom}}(r)
would equal \pi r^2
.
Deviations between the empirical K_{ij}
curve
and the theoretical curve \pi r^2
suggest dependence between the points of types i
and j
.
The argument X
must be a point pattern (object of class
"ppp"
) or any data that are acceptable to as.ppp
.
It must be a marked point pattern, and the mark vector
X$marks
must be a factor.
The arguments i
and j
will be interpreted as
levels of the factor X$marks
. (Warning: this means that
an integer value i=3
will be interpreted as the number 3,
not the 3rd smallest level).
If i
and j
are missing, they default to the first
and second level of the marks factor, respectively.
The argument lambdaI
supplies the values
of the intensity of the sub-process of points of type i
.
It may be either
- a pixel image (object of class "im") which gives the values of the type i intensity at all locations in the window containing X;
- a numeric vector containing the values of the type i intensity evaluated only at the data points of type i (the length of this vector must equal the number of type i points in X);
- a function, which can be evaluated to give values of the intensity at any locations;
- a fitted point process model (object of class "ppm", "kppm" or "dppm") whose fitted trend can be used as the fitted intensity (if update=TRUE, the model will first be refitted to the data X before the trend is computed);
- omitted: if lambdaI is omitted then it will be estimated using a leave-one-out kernel smoother.
If lambdaI
is omitted, then it will be estimated using
a ‘leave-one-out’ kernel smoother,
as described in Baddeley, Moller
and Waagepetersen (2000). The estimate of lambdaI
for a given
point is computed by removing the point from the
point pattern, applying kernel smoothing to the remaining points using
density.ppp
, and evaluating the smoothed intensity
at the point in question. The smoothing kernel bandwidth is controlled
by the arguments sigma
and varcov
, which are passed to
density.ppp
along with any extra arguments.
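A minimal sketch of computing such leave-one-out intensities explicitly (the bandwidth 0.15 is an arbitrary choice, echoing the examples below):
# Sketch: explicit leave-one-out intensities for the two sub-processes.
lamI <- density(split(lansing)$whiteoak, sigma = 0.15,
                at = "points", leaveoneout = TRUE)
lamJ <- density(split(lansing)$maple, sigma = 0.15,
                at = "points", leaveoneout = TRUE)
K <- Kcross.inhom(lansing, "whiteoak", "maple",
                  lambdaI = lamI, lambdaJ = lamJ)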
Similarly lambdaJ
should contain
estimated values of the intensity of the sub-process of points of
type j
. It may be either a pixel image, a function,
a numeric vector, or omitted.
Alternatively if the argument lambdaX
is given, then it specifies
the intensity values for all points of X
, and the
arguments lambdaI
, lambdaJ
will be ignored.
The optional argument lambdaIJ
is for advanced use only.
It is a matrix containing estimated
values of the products of these two intensities for each pair of
data points of types i
and j
respectively.
The argument r
is the vector of values for the
distance r
at which K_{ij}(r)
should be evaluated.
The values of r
must be increasing nonnegative numbers
and the maximum r
value must not exceed the radius of the
largest disc contained in the window.
The argument correction
chooses the edge correction
as explained e.g. in Kest
.
The pair correlation function can also be applied to the
result of Kcross.inhom
; see pcf
.
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing numeric columns
r |
the values of the argument |
theo |
the theoretical value of |
together with a column or columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function K_{ij}(r)
obtained by the edge corrections named.
Warnings
The arguments i
and j
are always interpreted as
levels of the factor X$marks
. They are converted to character
strings if they are not already character strings.
The value i=1
does not
refer to the first level of the factor.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Baddeley, A., Moller, J. and Waagepetersen, R. (2000) Non- and semiparametric estimation of interaction in inhomogeneous point patterns. Statistica Neerlandica 54, 329–350.
Moller, J. and Waagepetersen, R. Statistical Inference and Simulation for Spatial Point Processes Chapman and Hall/CRC Boca Raton, 2003.
See Also
Kcross
,
Kinhom
,
Kdot.inhom
,
Kmulti.inhom
,
pcf
Examples
# Lansing Woods data
woods <- lansing
ma <- split(woods)$maple
wh <- split(woods)$whiteoak
# method (1): estimate intensities by nonparametric smoothing
lambdaM <- density.ppp(ma, sigma=0.15, at="points")
lambdaW <- density.ppp(wh, sigma=0.15, at="points")
K <- Kcross.inhom(woods, "whiteoak", "maple", lambdaW, lambdaM)
# method (2): leave-one-out
K <- Kcross.inhom(woods, "whiteoak", "maple", sigma=0.15)
# method (3): fit parametric intensity model
if(require("spatstat.model")) {
fit <- ppm(woods ~marks * polynom(x,y,2))
# alternative (a): use fitted model as 'lambda' argument
online <- interactive()
K <- Kcross.inhom(woods, "whiteoak", "maple",
lambdaI=fit, lambdaJ=fit,
update=online, leaveoneout=online)
K <- Kcross.inhom(woods, "whiteoak", "maple",
lambdaX=fit,
update=online, leaveoneout=online)
# alternative (b): evaluate fitted intensities at data points
# (these are the intensities of the sub-processes of each type)
inten <- fitted(fit, dataonly=TRUE, leaveoneout=FALSE)
# split according to types of points
lambda <- split(inten, marks(woods))
K <- Kcross.inhom(woods, "whiteoak", "maple",
lambda$whiteoak, lambda$maple)
}
# synthetic example: type A points have intensity 50,
# type B points have intensity 50 + 100 * x
lamB <- as.im(function(x,y){50 + 100 * x}, owin())
X <- superimpose(A=runifpoispp(50), B=rpoispp(lamB))
K <- Kcross.inhom(X, "A", "B",
lambdaI=as.im(50, Window(X)), lambdaJ=lamB)
Multitype K Function (i-to-any)
Description
For a multitype point pattern,
estimate the multitype K
function
which counts the expected number of other points of the process
within a given distance of a point of type i
.
Usage
Kdot(X, i, r=NULL, breaks=NULL, correction, ..., ratio=FALSE, from)
Arguments
X |
The observed point pattern,
from which an estimate of the multitype |
i |
The type (mark value)
of the points in |
r |
numeric vector. The values of the argument |
breaks |
This argument is for internal use only. |
correction |
A character vector containing any selection of the
options |
... |
Ignored. |
ratio |
Logical.
If |
from |
An alternative way to specify |
Details
This function Kdot
and its companions
Kcross
and Kmulti
are generalisations of the function Kest
to multitype point patterns.
A multitype point pattern is a spatial pattern of points classified into a finite number of possible “colours” or “types”. In the spatstat package, a multitype pattern is represented as a single point pattern object in which the points carry marks, and the mark value attached to each point determines the type of that point.
The argument X
must be a point pattern (object of class
"ppp"
) or any data that are acceptable to as.ppp
.
It must be a marked point pattern, and the mark vector
X$marks
must be a factor.
The argument i
will be interpreted as a
level of the factor X$marks
.
If i
is missing, it defaults to the first
level of the marks factor, i = levels(X$marks)[1]
.
The “type i
to any type” multitype K
function
of a stationary multitype point process X
is defined so that
\lambda K_{i\bullet}(r)
equals the expected number of
additional random points within a distance r
of a
typical point of type i
in the process X
.
Here \lambda
is the intensity of the process,
i.e. the expected number of points of X
per unit area.
The function K_{i\bullet}
is determined by the
second order moment properties of X
.
An estimate of K_{i\bullet}(r)
is a useful summary statistic in exploratory data analysis
of a multitype point pattern.
If the subprocess of type i
points were independent
of the subprocess of points of all types not equal to i
,
then K_{i\bullet}(r)
would equal \pi r^2
.
Deviations between the empirical K_{i\bullet}
curve
and the theoretical curve \pi r^2
may suggest dependence between types.
This algorithm estimates the function K_{i\bullet}(r)
from the point pattern X
. It assumes that X
can be treated
as a realisation of a stationary (spatially homogeneous)
random spatial point process in the plane, observed through
a bounded window.
The window (which is specified in X
as Window(X)
)
may have arbitrary shape.
Biases due to edge effects are
treated in the same manner as in Kest
,
using the chosen edge correction(s).
The argument r
is the vector of values for the
distance r
at which K_{i\bullet}(r)
should be evaluated.
The values of r
must be increasing nonnegative numbers
and the maximum r
value must not exceed the radius of the
largest disc contained in the window.
The pair correlation function can also be applied to the
result of Kdot
; see pcf
.
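A centred transformation often makes departures easier to see; a minimal sketch using the plot.fv formula interface (the amacrine data and type "on" are arbitrary choices):
# Sketch: centred "L-dot" style plot; '.' stands for the function estimates.
Kh <- Kdot(amacrine, "on")
plot(Kh, sqrt(./pi) - r ~ r, ylab = "L(r) - r")
# the benchmark pi * r^2 becomes the horizontal line at 0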
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing numeric columns
r |
the values of the argument |
theo |
the theoretical value of |
together with a column or columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function K_{i\bullet}(r)
obtained by the edge corrections named.
If ratio=TRUE
then the return value also has two
attributes called "numerator"
and "denominator"
which are "fv"
objects
containing the numerators and denominators of each
estimate of K(r)
.
Warnings
The argument i
is interpreted as
a level of the factor X$marks
. It is converted to a character
string if it is not already a character string.
The value i=1
does not
refer to the first level of the factor.
The reduced sample estimator of K_{i\bullet} is pointwise approximately unbiased, but it need not be a nondecreasing function of r, whereas the true K_{i\bullet}(r) is always nondecreasing.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net.
References
Cressie, N.A.C. Statistics for spatial data. John Wiley and Sons, 1991.
Diggle, P.J. Statistical analysis of spatial point patterns. Academic Press, 1983.
Harkness, R.D and Isham, V. (1983) A bivariate spatial point pattern of ants' nests. Applied Statistics 32, 293–303
Lotwick, H. W. and Silverman, B. W. (1982). Methods for analysing spatial processes of several types of points. J. Royal Statist. Soc. Ser. B 44, 406–413.
Ripley, B.D. Statistical inference for spatial processes. Cambridge University Press, 1988.
Stoyan, D, Kendall, W.S. and Mecke, J. Stochastic geometry and its applications. 2nd edition. Springer Verlag, 1995.
See Also
Kcross, Kest, Kmulti, pcf.
Examples
# Lansing woods data: 6 types of trees
woods <- lansing
Kh. <- Kdot(woods, "hickory")
# diagnostic plot for independence between hickories and other trees
plot(Kh.)
# synthetic example with two marks "a" and "b"
pp <- runifpoispp(50)
pp <- pp %mark% factor(sample(c("a","b"), npoints(pp), replace=TRUE))
K <- Kdot(pp, "a")
Inhomogeneous Multitype K Dot Function
Description
For a multitype point pattern,
estimate the inhomogeneous version of the dot K
function,
which counts the expected number of points of any type
within a given distance of a point of type i
,
adjusted for spatially varying intensity.
Usage
Kdot.inhom(X, i, lambdaI=NULL, lambdadot=NULL, ..., r=NULL, breaks=NULL,
correction = c("border", "isotropic", "Ripley", "translate"),
sigma=NULL, varcov=NULL, lambdaIdot=NULL,
lambdaX=NULL, update=TRUE, leaveoneout=TRUE)
Arguments
X |
The observed point pattern,
from which an estimate of the inhomogeneous dot type |
i |
The type (mark value)
of the points in |
lambdaI |
Optional.
Values of the estimated intensity of the sub-process of
points of type |
lambdadot |
Optional.
Values of the estimated intensity of the entire point process.
Either a pixel image (object of class |
... |
Ignored. |
r |
Optional. Numeric vector giving the values of the argument |
breaks |
This argument is for internal use only. |
correction |
A character vector containing any selection of the
options |
sigma |
Standard deviation of isotropic Gaussian smoothing kernel,
used in computing leave-one-out kernel estimates of
|
varcov |
Variance-covariance matrix of anisotropic Gaussian kernel,
used in computing leave-one-out kernel estimates of
|
lambdaIdot |
Optional. A matrix containing estimates of the
product of the intensities |
lambdaX |
Optional. Values of the intensity for all points of |
update |
Logical value indicating what to do when
|
leaveoneout |
Logical value (passed to |
Details
This is a generalisation of the function Kdot
to include an adjustment for spatially inhomogeneous intensity,
in a manner similar to the function Kinhom
.
Briefly, given a multitype point process, consider the points without
their types, and suppose this unmarked point process
has intensity function
\lambda(u)
at spatial locations u
.
Suppose we place a mass of 1/\lambda(\zeta)
at each point \zeta
of the process. Then the expected total
mass per unit area is 1. The
inhomogeneous “dot-type” K
function
K_{i\bullet}^{\mbox{inhom}}(r)
equals the expected
total mass within a radius r
of a point of the process
of type i
, discounting this point itself.
If the process of type i
points
were independent of the points of other types,
then K_{i\bullet}^{\mbox{inhom}}(r)
would equal \pi r^2
.
Deviations between the empirical K_{i\bullet}
curve
and the theoretical curve \pi r^2
suggest dependence between the points of types i
and j
for
j\neq i
.
The argument X
must be a point pattern (object of class
"ppp"
) or any data that are acceptable to as.ppp
.
It must be a marked point pattern, and the mark vector
X$marks
must be a factor.
The argument i
will be interpreted as a
level of the factor X$marks
. (Warning: this means that
an integer value i=3
will be interpreted as the number 3,
not the 3rd smallest level).
If i
is missing, it defaults to the first
level of the marks factor, i = levels(X$marks)[1]
.
The argument lambdaI
supplies the values
of the intensity of the sub-process of points of type i
.
It may be either
- a pixel image (object of class "im") which gives the values of the type i intensity at all locations in the window containing X;
- a numeric vector containing the values of the type i intensity evaluated only at the data points of type i (the length of this vector must equal the number of type i points in X);
- a function of the form function(x,y) which can be evaluated to give values of the intensity at any locations;
- a fitted point process model (object of class "ppm", "kppm" or "dppm") whose fitted trend can be used as the fitted intensity (if update=TRUE, the model will first be refitted to the data X before the trend is computed);
- omitted: if lambdaI is omitted then it will be estimated using a leave-one-out kernel smoother.
If lambdaI
is omitted, then it will be estimated using
a ‘leave-one-out’ kernel smoother, as described in Baddeley,
Moller
and Waagepetersen (2000). The estimate of lambdaI
for a given
point is computed by removing the point from the
point pattern, applying kernel smoothing to the remaining points using
density.ppp
, and evaluating the smoothed intensity
at the point in question. The smoothing kernel bandwidth is controlled
by the arguments sigma
and varcov
, which are passed to
density.ppp
along with any extra arguments.
Similarly the argument lambdadot
should contain
estimated values of the intensity of the entire point process.
It may be either a pixel image, a numeric vector of length equal
to the number of points in X
, a function, or omitted.
Alternatively if the argument lambdaX
is given, then it specifies
the intensity values for all points of X
, and the
arguments lambdaI
, lambdadot
will be ignored.
(The two arguments lambdaI
, lambdadot
allow the user
to specify two different methods for calculating the intensities of
the two kinds of points, while lambdaX
ensures that the same
method is used for both kinds of points.)
For advanced use only, the optional argument lambdaIdot
is a matrix containing estimated
values of the products of these two intensities for each pair of
points, the first point of type i
and the second of any type.
The argument r
is the vector of values for the
distance r
at which K_{i\bullet}(r)
should be evaluated.
The values of r
must be increasing nonnegative numbers
and the maximum r
value must not exceed the radius of the
largest disc contained in the window.
The argument correction
chooses the edge correction
as explained e.g. in Kest
.
The pair correlation function can also be applied to the
result of Kdot.inhom
; see pcf
.
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing numeric columns
r |
the values of the argument |
theo |
the theoretical value of |
together with a column or columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function K_{i\bullet}(r)
obtained by the edge corrections named.
Warnings
The argument i
is interpreted as
a level of the factor X$marks
. It is converted to a character
string if it is not already a character string.
The value i=1
does not
refer to the first level of the factor.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk
References
Moller, J. and Waagepetersen, R. Statistical Inference and Simulation for Spatial Point Processes Chapman and Hall/CRC Boca Raton, 2003.
See Also
Kdot
,
Kinhom
,
Kcross.inhom
,
Kmulti.inhom
,
pcf
Examples
# Lansing Woods data
woods <- lansing
woods <- woods[seq(1,npoints(woods), by=10)]
ma <- split(woods)$maple
lg <- unmark(woods)
# Estimate intensities by nonparametric smoothing
lambdaM <- density.ppp(ma, sigma=0.15, at="points")
lambdadot <- density.ppp(lg, sigma=0.15, at="points")
K <- Kdot.inhom(woods, "maple", lambdaI=lambdaM,
lambdadot=lambdadot)
# Equivalent
K <- Kdot.inhom(woods, "maple", sigma=0.15)
# Fit model
if(require("spatstat.model")) {
fit <- ppm(woods ~ marks * polynom(x,y,2))
K <- Kdot.inhom(woods, "maple", lambdaX=fit,
update=FALSE, leaveoneout=FALSE)
}
# synthetic example: type A points have intensity 50,
# type B points have intensity 50 + 100 * x
lamB <- as.im(function(x,y){50 + 100 * x}, owin())
lamdot <- as.im(function(x,y) { 100 + 100 * x}, owin())
X <- superimpose(A=runifpoispp(50), B=rpoispp(lamB))
K <- Kdot.inhom(X, "B", lambdaI=lamB, lambdadot=lamdot)
K-function
Description
Estimates Ripley's reduced second moment function K(r)
from a point pattern in a window of arbitrary shape.
Usage
Kest(X, ..., r=NULL, rmax=NULL, breaks=NULL,
correction=c("border", "isotropic", "Ripley", "translate"),
nlarge=3000, domain=NULL, var.approx=FALSE, ratio=FALSE)
Arguments
X |
The observed point pattern,
from which an estimate of |
... |
Ignored. |
r |
Optional. Vector of values for the argument |
rmax |
Optional. Maximum desired value of the argument |
breaks |
This argument is for internal use only. |
correction |
Optional. A character vector containing any selection of the
options |
nlarge |
Optional. Efficiency threshold.
If the number of points exceeds |
domain |
Optional. Calculations will be restricted to this subset of the window. See Details. |
var.approx |
Logical. If |
ratio |
Logical.
If |
Details
The K
function (variously called “Ripley's K-function”
and the “reduced second moment function”)
of a stationary point process X
is defined so that
\lambda K(r)
equals the expected number of
additional random points within a distance r
of a
typical random point of X
. Here \lambda
is the intensity of the process,
i.e. the expected number of points of X
per unit area.
The K
function is determined by the
second order moment properties of X
.
An estimate of K
derived from a spatial point pattern dataset
can be used in exploratory data analysis and formal inference
about the pattern (Cressie, 1991; Diggle, 1983; Ripley, 1977, 1988).
In exploratory analyses, the estimate of K
is a useful statistic
summarising aspects of inter-point “dependence” and “clustering”.
For inferential purposes, the estimate of K
is usually compared to the
true value of K
for a completely random (Poisson) point process,
which is K(r) = \pi r^2
.
Deviations between the empirical and theoretical K
curves
may suggest spatial clustering or spatial regularity.
This routine Kest
estimates the K
function
of a stationary point process, given observation of the process
inside a known, bounded window.
The argument X
is interpreted as a point pattern object
(of class "ppp"
, see ppp.object
) and can
be supplied in any of the formats recognised by
as.ppp()
.
The estimation of K
is hampered by edge effects arising from
the unobservability of points of the random pattern outside the window.
An edge correction is needed to reduce bias (Baddeley, 1998; Ripley, 1988).
The corrections implemented here are
- border: the border method or “reduced sample” estimator (see Ripley, 1988). This is the least efficient (statistically) and the fastest to compute. It can be computed for a window of arbitrary shape.
- isotropic/Ripley: Ripley's isotropic correction (see Ripley, 1988; Ohser, 1983). This is implemented for rectangular and polygonal windows (not for binary masks).
- translate/translation: Translation correction (Ohser, 1983). Implemented for all window geometries, but slow for complex windows.
- rigid: Rigid motion correction (Ohser and Stoyan, 1981). Implemented for all window geometries, but slow for complex windows.
- none: Uncorrected estimate, i.e. setting e_{ij} = 1 in the equation below. This estimate is biased and should not be used for data analysis, unless you have an extremely large point pattern (more than 100,000 points).
- periodic: Periodic (toroidal) edge correction. Defined only for rectangular windows.
- best: Selects the best edge correction that is available for the geometry of the window. Currently this is Ripley's isotropic correction for a rectangular or polygonal window, and the translation correction for masks.
- good: Selects the best edge correction that can be computed in a reasonable time. This is the same as "best" for datasets with fewer than 3000 points; otherwise the selected edge correction is "border", unless there are more than 100,000 points, when it is "none".
The estimates of K(r)
are of the form
\hat K(r) = \frac{a}{n(n-1)} \sum_i \sum_j I(d_{ij} \le r) e_{ij}
where a
is the area of the window, n
is the number of
data points, and the sum is taken over all ordered pairs of points
i
and j
in X
.
Here d_{ij}
is the distance between the two points,
and I(d_{ij} \le r)
is the indicator
that equals 1 if the distance is less than or equal to r
.
The term e_{ij}
is the edge correction weight (which
depends on the choice of edge correction listed above).
Note that this estimator assumes the process is stationary (spatially
homogeneous). For inhomogeneous point patterns, see
Kinhom
.
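A minimal sketch of this formula, computed by hand for the uncorrected case e_{ij} = 1 and evaluated at a single arbitrary distance r0:
# Sketch: hand computation of the uncorrected estimate at r0 = 0.1.
X  <- runifpoint(100)
a  <- area(Window(X))               # window area
n  <- npoints(X)
d  <- pairdist(X)                   # matrix of pairwise distances
r0 <- 0.1
npairs <- sum(d <= r0) - n          # ordered pairs with d <= r0, excluding i = j
Khat <- a * npairs / (n * (n - 1))  # uncorrected estimate of K(r0)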
If the point pattern X
contains more than about 3000 points,
the isotropic and translation edge corrections can be computationally
prohibitive. The computations for the border method are much faster,
and are statistically efficient when there are large numbers of
points. Accordingly, if the number of points in X
exceeds
the threshold nlarge
, then only the border correction will be
computed. Setting nlarge=Inf
or correction="best"
will prevent this from happening.
Setting nlarge=0
is equivalent to selecting only the border
correction with correction="border"
.
If X
contains more than about 100,000 points,
even the border correction is time-consuming. You may want to consider
setting correction="none"
in this case.
There is an even faster algorithm for the uncorrected estimate.
Approximations to the variance of \hat K(r)
are available, for the case of the isotropic edge correction estimator,
assuming complete spatial randomness
(Ripley, 1988; Lotwick and Silverman, 1982; Diggle, 2003, pp 51-53).
If var.approx=TRUE
, then the result of
Kest
also has a column named rip
giving values of Ripley's (1988) approximation to
\mbox{var}(\hat K(r))
,
and (if the window is a rectangle) a column named ls
giving
values of Lotwick and Silverman's (1982) approximation.
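A minimal sketch:
# Sketch: request the variance approximations under CSR.
K <- Kest(cells, correction = "isotropic", var.approx = TRUE)
# columns 'rip' (Ripley) and 'ls' (Lotwick-Silverman; rectangular
# windows only) now approximate var(Khat(r))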
If the argument domain
is given, the calculations will
be restricted to a subset of the data. In the formula for K(r)
above,
the first point i
will be restricted to lie inside
domain
. The result is an approximately unbiased estimate
of K(r)
based on pairs of points in which the first point lies
inside domain
and the second point is unrestricted.
This is useful in bootstrap techniques. The argument domain
should be a window (object of class "owin"
) or something acceptable to
as.owin
. It must be a subset of the
window of the point pattern X
.
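A minimal sketch (the erosion distance 0.1 is an arbitrary choice):
# Sketch: restrict the first point of each pair to a subwindow.
W  <- Window(cells)
D  <- erosion(W, 0.1)               # a subset of the original window
KD <- Kest(cells, domain = D)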
The estimator Kest
ignores marks.
Its counterparts for multitype point patterns
are Kcross
, Kdot
,
and for general marked point patterns
see Kmulti
.
Some writers, particularly Stoyan (1994, 1995), advocate the use of the “pair correlation function”
g(r) = \frac{K'(r)}{2\pi r}
where K'(r)
is the derivative of K(r)
.
See pcf
on how to estimate this function.
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Essentially a data frame containing columns
r |
the vector of values of the argument |
theo |
the theoretical value |
together with columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function K(r)
obtained by the edge corrections
named.
If var.approx=TRUE
then the return value
also has columns rip
and ls
containing approximations
to the variance of \hat K(r)
under CSR.
If ratio=TRUE
then the return value also has two
attributes called "numerator"
and "denominator"
which are "fv"
objects
containing the numerators and denominators of each
estimate of K(r)
.
Envelopes, significance bands and confidence intervals
To compute simulation envelopes for the K
-function
under CSR, use envelope
.
To compute a confidence interval for the true K
-function,
use varblock
or lohboot
.
Warnings
The estimator of K(r)
is approximately unbiased for each fixed
r
, for point processes which do not have very strong
interaction. (For point processes with a strong clustering interaction,
the estimator is negatively biased; for point processes with a strong
inhibitive interaction, the estimator is positively biased.)
Bias increases with r
and depends on the window geometry.
For a rectangular window it is prudent to restrict the r
values to
a maximum of 1/4
of the smaller side length of the rectangle
(Ripley, 1977, 1988; Diggle, 1983).
Bias may become appreciable for point patterns consisting of
fewer than 15 points.
While K(r)
is always a non-decreasing function, the estimator
of K
is not guaranteed to be non-decreasing. This is rarely
a problem in practice, except for the border correction estimators
when the number of points is small.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Baddeley, A.J. Spatial sampling and censoring. In O.E. Barndorff-Nielsen, W.S. Kendall and M.N.M. van Lieshout (eds) Stochastic Geometry: Likelihood and Computation. Chapman and Hall, 1998. Chapter 2, pages 37–78.
Cressie, N.A.C. Statistics for spatial data. John Wiley and Sons, 1991.
Diggle, P.J. Statistical analysis of spatial point patterns. Academic Press, 1983.
Ohser, J. (1983) On estimators for the reduced second moment measure of point processes. Mathematische Operationsforschung und Statistik, series Statistics, 14, 63 – 71.
Ohser, J. and Stoyan, D. (1981) On the second-order and orientation analysis of planar stationary point processes. Biometrical Journal 23, 523–533.
Ripley, B.D. (1977) Modelling spatial patterns (with discussion). Journal of the Royal Statistical Society, Series B, 39, 172 – 212.
Ripley, B.D. Statistical inference for spatial processes. Cambridge University Press, 1988.
Stoyan, D, Kendall, W.S. and Mecke, J. (1995) Stochastic geometry and its applications. 2nd edition. Springer Verlag.
Stoyan, D. and Stoyan, H. (1994) Fractals, random shapes and point fields: methods of geometrical statistics. John Wiley and Sons.
See Also
localK
to extract individual summands in the K
function.
pcf
for the pair correlation.
Fest
,
Gest
,
Jest
for alternative summary functions.
Kcross
,
Kdot
,
Kinhom
,
Kmulti
for counterparts of the K
function
for multitype point patterns.
reduced.sample
for the calculation of reduced sample
estimators.
Examples
X <- runifpoint(50)
K <- Kest(X)
K <- Kest(cells, correction="isotropic")
plot(K)
plot(K, main="K function for cells")
# plot the L function
plot(K, sqrt(iso/pi) ~ r)
plot(K, sqrt(./pi) ~ r, ylab="L(r)", main="L function for cells")
K-function using FFT
Description
Estimates the reduced second moment function K(r)
from a point pattern in a window of arbitrary shape,
using the Fast Fourier Transform.
Usage
Kest.fft(X, sigma, r=NULL, ..., breaks=NULL)
Arguments
X |
The observed point pattern,
from which an estimate of |
sigma |
Standard deviation of the isotropic Gaussian smoothing kernel. |
r |
Optional. Vector of values for the argument |
... |
Arguments passed to |
breaks |
This argument is for internal use only. |
Details
This is an alternative to the function Kest
for estimating the K
function. It may be useful for
very large patterns of points.
Whereas Kest
computes the distance between
each pair of points analytically, this function discretises the
point pattern onto a rectangular pixel raster and applies
Fast Fourier Transform techniques to estimate K(t)
.
The hard work is done by the function Kmeasure
.
The result is an approximation whose accuracy depends on the
resolution of the pixel raster. The resolution is controlled
by the arguments ...
, or by setting the parameter npixel
in
spatstat.options
.
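A minimal sketch of controlling the raster resolution via spatstat.options (the pixel count 256 is an arbitrary choice for illustration):
op <- spatstat.options(npixel=256)
K <- Kest.fft(runifpoint(10000), sigma=0.01)
spatstat.options(op)   # restore previous settings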
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing columns
r |
the vector of values of the argument |
border |
the estimates of |
theo |
the theoretical value |
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk
References
Cressie, N.A.C. Statistics for spatial data. John Wiley and Sons, 1991.
Diggle, P.J. Statistical analysis of spatial point patterns. Academic Press, 1983.
Ohser, J. (1983) On estimators for the reduced second moment measure of point processes. Mathematische Operationsforschung und Statistik, series Statistics, 14, 63–71.
Ripley, B.D. Statistical inference for spatial processes. Cambridge University Press, 1988.
Stoyan, D, Kendall, W.S. and Mecke, J. (1995) Stochastic geometry and its applications. 2nd edition. Springer Verlag.
Stoyan, D. and Stoyan, H. (1994) Fractals, random shapes and point fields: methods of geometrical statistics. John Wiley and Sons.
See Also
Kest
,
Kmeasure
,
spatstat.options
Examples
pp <- runifpoint(10000)
Kpp <- Kest.fft(pp, 0.01)
plot(Kpp)
Inhomogeneous K-function
Description
Estimates the inhomogeneous K
function of
a non-stationary point pattern.
Usage
Kinhom(X, lambda=NULL, ..., r = NULL, breaks = NULL,
correction=c("border", "bord.modif", "isotropic", "translate"),
renormalise=TRUE,
normpower=1,
update=TRUE,
leaveoneout=TRUE,
nlarge = 1000,
lambda2=NULL, reciplambda=NULL, reciplambda2=NULL,
diagonal=TRUE,
sigma=NULL, varcov=NULL,
ratio=FALSE)
Arguments
X |
The observed data point pattern,
from which an estimate of the inhomogeneous |
lambda |
Optional.
Values of the estimated intensity function.
Either a vector giving the intensity values
at the points of the pattern |
... |
Extra arguments. Ignored if |
r |
vector of values for the argument |
breaks |
This argument is for internal use only. |
correction |
A character vector containing any selection of the
options |
renormalise |
Logical. Whether to renormalise the estimate. See Details. |
normpower |
Integer (usually either 1 or 2). Normalisation power. See Details. |
update |
Logical value indicating what to do when |
leaveoneout |
Logical value (passed to |
nlarge |
Optional. Efficiency threshold.
If the number of points exceeds |
lambda2 |
Advanced use only.
Matrix containing estimates of the products
|
reciplambda |
Alternative to |
reciplambda2 |
Advanced use only. Alternative to |
diagonal |
Do not use this argument. |
sigma , varcov |
Optional arguments passed to |
ratio |
Logical.
If |
Details
This computes a generalisation of the K
function
for inhomogeneous point patterns, proposed by
Baddeley, Moller and Waagepetersen (2000).
The “ordinary” K
function
(variously known as the reduced second order moment function
and Ripley's K
function), is
described under Kest
. It is defined only
for stationary point processes.
The inhomogeneous K
function
K_{\mbox{\scriptsize\rm inhom}}(r)
is a direct generalisation to nonstationary point processes.
Suppose x
is a point process with non-constant intensity
\lambda(u)
at each location u
.
Define K_{\mbox{\scriptsize\rm inhom}}(r)
to be the expected
value, given that u
is a point of x
,
of the sum of all terms
1/\lambda(x_j)
over all points x_j
in the process separated from u
by a distance less than r
.
This reduces to the ordinary K
function if
\lambda()
is constant.
If x
is an inhomogeneous Poisson process with intensity
function \lambda(u)
, then
K_{\mbox{\scriptsize\rm inhom}}(r) = \pi r^2
.
Given a point pattern dataset, the
inhomogeneous K
function can be estimated
essentially by summing the values
1/(\lambda(x_i)\lambda(x_j))
for all pairs of points x_i, x_j
separated by a distance less than r
.
This allows us to inspect a point pattern for evidence of
interpoint interactions after allowing for spatial inhomogeneity
of the pattern. Values
K_{\mbox{\scriptsize\rm inhom}}(r) > \pi r^2
are suggestive of clustering.
The argument lambda
should supply the
(estimated) values of the intensity function \lambda
.
It may be either
- a numeric vector
-
containing the values of the intensity function at the points of the pattern
X
. - a pixel image
-
(object of class
"im"
) assumed to contain the values of the intensity function at all locations in the window. - a fitted point process model
-
(object of class
"ppm"
,"kppm"
or"dppm"
) whose fitted trend can be used as the fitted intensity. (Ifupdate=TRUE
the model will first be refitted to the dataX
before the trend is computed.) - a function
-
which can be evaluated to give values of the intensity at any locations.
- omitted:
-
if
lambda
is omitted, then it will be estimated using a ‘leave-one-out’ kernel smoother.
If lambda
is a numeric vector, then its length should
be equal to the number of points in the pattern X
.
The value lambda[i]
is assumed to be the (estimated) value of the intensity
\lambda(x_i)
for
the point x_i
of the pattern X
.
Each value must be a positive number; NA
's are not allowed.
If lambda
is a pixel image, the domain of the image should
cover the entire window of the point pattern. If it does not (which
may occur near the boundary because of discretisation error),
then the missing pixel values
will be obtained by applying a Gaussian blur to lambda
using
blur
, then looking up the values of this blurred image
for the missing locations.
(A warning will be issued in this case.)
If lambda
is a function, then it will be evaluated in the
form lambda(x,y)
where x
and y
are vectors
of coordinates of the points of X
. It should return a numeric
vector with length equal to the number of points in X
.
If lambda
is omitted, then it will be estimated using
a ‘leave-one-out’ kernel smoother,
as described in Baddeley, Moller
and Waagepetersen (2000). The estimate lambda[i]
for the
point X[i]
is computed by removing X[i]
from the
point pattern, applying kernel smoothing to the remaining points using
density.ppp
, and evaluating the smoothed intensity
at the point X[i]
. The smoothing kernel bandwidth is controlled
by the arguments sigma
and varcov
, which are passed to
density.ppp
along with any extra arguments.
Edge corrections are used to correct bias in the estimation
of K_{\mbox{\scriptsize\rm inhom}}
.
Each edge-corrected estimate of
K_{\mbox{\scriptsize\rm inhom}}(r)
is
of the form
\widehat K_{\mbox{\scriptsize\rm inhom}}(r) = \frac{1}{A} \sum_i \sum_j \frac{1\{d_{ij} \le r\} \, e(x_i,x_j,r)}{\lambda(x_i)\lambda(x_j)}
where A
is a constant denominator,
d_{ij}
is the distance between points
x_i
and x_j
, and
e(x_i,x_j,r)
is
an edge correction factor. For the ‘border’ correction,
e(x_i,x_j,r) = \frac{1(b_i > r)}{\sum_j 1(b_j > r)/\lambda(x_j)}
where b_i
is the distance from x_i
to the boundary of the window. For the ‘modified border’
correction,
e(x_i,x_j,r) = \frac{1(b_i > r)}{\mbox{area}(W \ominus r)}
where W \ominus r
is the eroded window obtained
by trimming a margin of width r
from the border of the original
window.
For the ‘translation’ correction,
e(x_i,x_j,r) = \frac{1}{\mbox{area}(W \cap (W + (x_j - x_i)))}
and for the ‘isotropic’ correction,
e(x_i,x_j,r) = \frac{1}{\mbox{area}(W) \, g(x_i,x_j)}
where g(x_i,x_j)
is the fraction of the
circumference of the circle with centre x_i
and radius
||x_i - x_j||
which lies inside the window.
If renormalise=TRUE
(the default), then the estimates
described above
are multiplied by c^{\mbox{normpower}}
where
c = \mbox{area}(W)/\sum (1/\lambda(x_i)).
This rescaling reduces the variability and bias of the estimate
in small samples and in cases of very strong inhomogeneity.
The default value of normpower
is 1 (for consistency with
previous versions of spatstat)
but the most sensible value is 2, which would correspond to rescaling
the lambda
values so that
\sum (1/\lambda(x_i)) = \mbox{area}(W).
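For example, a sketch using the recommended power (the bandwidth sigma=0.1 is an arbitrary choice for illustration):
X <- unmark(split(lansing)$maple)
Ki <- Kinhom(X, sigma=0.1, renormalise=TRUE, normpower=2)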
If the point pattern X
contains more than about 1000 points,
the isotropic and translation edge corrections can be computationally
prohibitive. The computations for the border method are much faster,
and are statistically efficient when there are large numbers of
points. Accordingly, if the number of points in X
exceeds
the threshold nlarge
, then only the border correction will be
computed. Setting nlarge=Inf
or correction="best"
will prevent this from happening.
Setting nlarge=0
is equivalent to selecting only the border
correction with correction="border"
.
The pair correlation function can also be applied to the
result of Kinhom
; see pcf
.
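For example, a sketch (continuing with the maple pattern and an arbitrary bandwidth):
Ki <- Kinhom(unmark(split(lansing)$maple), sigma=0.1)
gi <- pcf(Ki)
plot(gi)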
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing at least the following columns,
r |
the vector of values of the argument |
theo |
vector of values of |
and containing additional columns
according to the choice specified in the correction
argument. The additional columns are named
border
, trans
and iso
and give the estimated values of
K_{\mbox{\scriptsize\rm inhom}}(r)
using the border correction, translation correction,
and Ripley isotropic correction, respectively.
If ratio=TRUE
then the return value also has two
attributes called "numerator"
and "denominator"
which are "fv"
objects
containing the numerators and denominators of each
estimate of K_{\mbox{\scriptsize\rm inhom}}(r)
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Baddeley, A., Moller, J. and Waagepetersen, R. (2000) Non- and semiparametric estimation of interaction in inhomogeneous point patterns. Statistica Neerlandica 54, 329–350.
See Also
Examples
# inhomogeneous pattern of maples
X <- unmark(split(lansing)$maple)
if(require("spatstat.model")) {
# (1) intensity function estimated by model-fitting
# Fit spatial trend: polynomial in x and y coordinates
fit <- ppm(X, ~ polynom(x,y,2), Poisson())
# (a) predict intensity values at points themselves,
# obtaining a vector of lambda values
lambda <- predict(fit, locations=X, type="trend")
# inhomogeneous K function
Ki <- Kinhom(X, lambda)
plot(Ki)
# (b) predict intensity at all locations,
# obtaining a pixel image
lambda <- predict(fit, type="trend")
Ki <- Kinhom(X, lambda)
plot(Ki)
}
# (2) intensity function estimated by heavy smoothing
Ki <- Kinhom(X, sigma=0.1)
plot(Ki)
# (3) simulated data: known intensity function
lamfun <- function(x,y) { 50 + 100 * x }
# inhomogeneous Poisson process
Y <- rpoispp(lamfun, 150, owin())
# inhomogeneous K function
Ki <- Kinhom(Y, lamfun)
plot(Ki)
# How to make simulation envelopes:
# Example shows method (2)
if(interactive()) {
smo <- density.ppp(X, sigma=0.1)
Ken <- envelope(X, Kinhom, nsim=99,
simulate=expression(rpoispp(smo)),
sigma=0.1, correction="trans")
plot(Ken)
}
Mark-Weighted K Function
Description
Estimates the mark-weighted K
function
of a marked point pattern.
Usage
Kmark(X, f = NULL, r = NULL,
correction = c("isotropic", "Ripley", "translate"), ...,
f1 = NULL, normalise = TRUE, returnL = FALSE, fargs = NULL)
markcorrint(X, f = NULL, r = NULL,
correction = c("isotropic", "Ripley", "translate"), ...,
f1 = NULL, normalise = TRUE, returnL = FALSE, fargs = NULL)
Arguments
X |
The observed point pattern.
An object of class |
f |
Optional. Test function |
r |
Optional. Numeric vector. The values of the argument |
correction |
A character vector containing any selection of the
options |
... |
Ignored. |
f1 |
An alternative to |
normalise |
If |
returnL |
Compute the analogue of the K-function if |
fargs |
Optional. A list of extra arguments to be passed to the function
|
Details
The functions Kmark
and markcorrint
are identical.
(Eventually markcorrint
will be deprecated.)
The mark-weighted K
function K_f(r)
of a marked point process (Penttinen et al, 1992)
is a generalisation of Ripley's K
function, in which the contribution
from each pair of points is weighted by a function of their marks.
If the marks of the two points are m_1, m_2
then
the weight is proportional to f(m_1, m_2)
where
f
is a specified test function.
The mark-weighted K
function is defined so that
\lambda K_f(r) = \frac{C_f(r)}{E[ f(M_1, M_2) ]}
where
C_f(r) = E\left[ \sum_{x \in X} f(m(u), m(x)) \, 1\{0 < ||u - x|| \le r\} \; \Big| \; u \in X \right]
for any spatial location u
taken to be a typical point of
the point process X
. Here ||u-x||
is the
euclidean distance between u
and x
, so that the sum
is taken over all random points x
that lie within a distance
r
of the point u
. The function C_f(r)
is
the unnormalised mark-weighted K
function.
To obtain K_f(r)
we standardise C_f(r)
by dividing by E[f(M_1,M_2)]
, the expected value of
f(M_1,M_2)
when M_1
and M_2
are
independent random marks with the same distribution as the marks in
the point process.
Under the hypothesis of random labelling, the
mark-weighted K
function
is equal to Ripley's K
function,
K_f(r) = K(r)
.
The mark-weighted K
function is sometimes called the
mark correlation integral because it is related to the
mark correlation function k_f(r)
and the pair correlation function g(r)
by
K_f(r) = 2 \pi \int_0^r s k_f(s) \, g(s) \, {\rm d}s
See markcorr
for a definition of the
mark correlation function.
Given a marked point pattern X
,
this command computes edge-corrected estimates
of the mark-weighted K
function.
If returnL=FALSE
then the estimated
function K_f(r)
is returned;
otherwise the function
L_f(r) = \sqrt{K_f(r)/\pi}
is returned.
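For example, a sketch of the L-form using the spruces data:
Ls <- Kmark(spruces, returnL=TRUE)
plot(Ls)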
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing numeric columns
r |
the values of the argument |
theo |
the theoretical value of |
together with a column or columns named
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the mark-weighted K
function K_f(r)
obtained by the edge corrections named (if returnL=FALSE
).
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
References
Penttinen, A., Stoyan, D. and Henttonen, H. M. (1992) Marked point processes in forest statistics. Forest Science 38 (1992) 806-824.
Illian, J., Penttinen, A., Stoyan, H. and Stoyan, D. (2008) Statistical analysis and modelling of spatial point patterns. Chichester: John Wiley.
See Also
markcorr
to estimate the mark correlation function.
Examples
# CONTINUOUS-VALUED MARKS:
# (1) Spruces
# marks represent tree diameter
# mark correlation function
ms <- Kmark(spruces)
plot(ms)
# (2) simulated data with independent marks
X <- rpoispp(100)
X <- X %mark% runif(npoints(X))
Xc <- Kmark(X)
plot(Xc)
# MULTITYPE DATA:
# Hughes' amacrine data
# Cells marked as 'on'/'off'
M <- Kmark(amacrine, function(m1,m2) {m1==m2},
correction="translate")
plot(M)
Reduced Second Moment Measure
Description
Estimates the reduced second moment measure \kappa
from a point pattern in a window of arbitrary shape.
Usage
Kmeasure(X, sigma, edge=TRUE, ..., varcov=NULL)
Arguments
X |
The observed point pattern,
from which an estimate of |
sigma |
Standard deviation |
edge |
Logical value indicating whether an edge correction should be applied. |
... |
Arguments passed to |
varcov |
Variance-covariance matrix of the Gaussian smoothing kernel.
Incompatible with |
Details
Given a point pattern dataset,
this command computes an estimate of the reduced second moment
measure \kappa
of the point process.
The result is a pixel image whose pixel values are estimates of
the density of the reduced second moment measure.
The reduced second moment measure \kappa
can be regarded as a generalisation of the more familiar
K
-function.
An estimate of \kappa
derived from a spatial point
pattern dataset can be useful in exploratory data analysis.
Its advantage over the K
-function is that it is also sensitive
to anisotropy and directional effects.
In a nutshell, the command Kmeasure
computes a smoothed version
of the Fry plot.
As explained under fryplot
, the Fry plot is a scatterplot of the
vectors joining all pairs of points in the pattern.
The reduced second moment measure is (essentially) defined as
the average of the Fry plot over different realisations of the point
process. The command Kmeasure
effectively smooths the Fry plot
of a dataset to obtain an estimate of the reduced second moment measure.
In formal terms, the reduced second moment measure \kappa
of a stationary point process X
is a measure defined on the
two-dimensional plane such that,
for a ‘typical’ point x
of the process,
the expected number of other points y
of the process
such that the vector y - x
lies in a region A
,
equals \lambda \kappa(A)
.
Here \lambda
is the intensity of the process,
i.e. the expected number of points of X
per unit area.
The K
-function is a special case. The function value K(t)
is
the value of the reduced second moment measure
for the disc of radius t
centred at the origin; that is,
K(t) = \kappa(b(0,t))
.
The command Kmeasure
computes an estimate of \kappa
from a point pattern dataset X
,
which is assumed to be a realisation of a stationary point process,
observed inside a known, bounded window. Marks are ignored.
The algorithm approximates the point pattern and its window by binary pixel
images, introduces a Gaussian smoothing kernel
and uses the Fast Fourier Transform fft
to form a density estimate of \kappa
. The calculation
corresponds to the edge correction known as the “translation
correction”.
The Gaussian smoothing kernel may be specified by either of the
arguments sigma
or varcov
. If sigma
is a single
number, this specifies an isotropic Gaussian kernel
with standard deviation sigma
on each coordinate axis.
If sigma
is a vector of two numbers, this specifies a Gaussian
kernel with standard deviation sigma[1]
on the x
axis,
standard deviation sigma[2]
on the y
axis, and zero
correlation between the x
and y
axes. If varcov
is
given, this specifies the variance-covariance matrix of the
Gaussian kernel. There do not seem to be any well-established rules
for selecting the smoothing kernel in this context.
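For example, a sketch using an anisotropic kernel (the standard deviations are arbitrary choices for illustration):
V <- diag(c(0.05, 0.02)^2)  # sd 0.05 in x, 0.02 in y, zero correlation
plot(Kmeasure(cells, varcov=V))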
The density estimate of \kappa
is returned in the form of a real-valued pixel image.
Pixel values are estimates of the normalised
second moment density at the centre of the pixel.
(The uniform Poisson process would have values identically equal to
1
.)
The image x
and y
coordinates are on the same scale as vector displacements in the
original point pattern window. The point x=0, y=0
corresponds
to the ‘typical point’.
A peak in the image near (0,0)
suggests clustering;
a dip in the image near (0,0)
suggests inhibition;
peaks or dips at other positions suggest possible periodicity.
If desired, the value of \kappa(A)
for a region
A
can be estimated by computing the integral of the pixel image
over the domain A
, i.e. summing the pixel values and
multiplying by pixel area, using integral.im
.
One possible application is to compute anisotropic counterparts of the
K
-function (in which the disc of radius t
is replaced
by another shape). See Examples.
Value
A real-valued pixel image (an object of class "im"
,
see im.object
) whose pixel values are estimates
of the density of the reduced second moment measure
at each location.
Warning
Some writers use the term reduced second moment measure
when they mean the K
-function. This has caused
confusion.
As originally defined, the
reduced second moment measure is a measure, obtained by modifying
the second moment measure, while the K
-function is a function
obtained by evaluating this measure for discs of increasing radius.
In spatstat, the K
-function is computed by
Kest
and the reduced second moment measure is computed
by Kmeasure
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Stoyan, D, Kendall, W.S. and Mecke, J. (1995) Stochastic geometry and its applications. 2nd edition. Springer Verlag.
Stoyan, D. and Stoyan, H. (1994) Fractals, random shapes and point fields: methods of geometrical statistics. John Wiley and Sons.
See Also
Kest
,
fryplot
,
spatstat.options
,
integral.im
,
im.object
Examples
plot(Kmeasure(cells, 0.05))
# shows pronounced dip around origin consistent with strong inhibition
plot(Kmeasure(redwood, 0.03), col=grey(seq(1,0,length=32)))
# shows peaks at several places, reflecting clustering and possible periodicity
M <- Kmeasure(cells, 0.05)
# evaluate measure on a sector
W <- Window(M)
ang <- as.im(atan2, W)
rad <- as.im(function(x,y){sqrt(x^2+y^2)}, W)
sector <- solutionset(ang > 0 & ang < 1 & rad < 0.6)
integral.im(M[sector, drop=FALSE])
Marked K-Function
Description
For a marked point pattern,
estimate the multitype K
function
which counts the expected number of points of subset J
within a given distance from a typical point in subset I
.
Usage
Kmulti(X, I, J, r=NULL, breaks=NULL, correction, ..., rmax=NULL, ratio=FALSE)
Arguments
X |
The observed point pattern,
from which an estimate of the multitype |
I |
Subset index specifying the points of |
J |
Subset index specifying the points in |
r |
numeric vector. The values of the argument |
breaks |
This argument is for internal use only. |
correction |
A character vector containing any selection of the
options |
... |
Ignored. |
rmax |
Optional. Maximum desired value of the argument |
ratio |
Logical.
If |
Details
The function Kmulti
generalises Kest
(for unmarked point
patterns) and Kdot
and Kcross
(for
multitype point patterns) to arbitrary marked point patterns.
Suppose X_I
, X_J
are subsets, possibly
overlapping, of a marked point process.
The multitype K
function
is defined so that
\lambda_J K_{IJ}(r)
equals the expected number of
additional random points of X_J
within a distance r
of a
typical point of X_I
.
Here \lambda_J
is the intensity of X_J,
i.e. the expected number of points of X_J
per unit area.
The function K_{IJ}
is determined by the
second order moment properties of X
.
The argument X
must be a point pattern (object of class
"ppp"
) or any data that are acceptable to as.ppp
.
The arguments I
and J
specify two subsets of the
point pattern. They may be any type of subset indices, for example,
logical vectors of length equal to npoints(X)
,
or integer vectors with entries in the range 1 to
npoints(X)
, or negative integer vectors.
Alternatively, I
and J
may be functions
that will be applied to the point pattern X
to obtain
index vectors. If I
is a function, then evaluating
I(X)
should yield a valid subset index. This option
is useful when generating simulation envelopes using
envelope
.
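A minimal sketch of this usage, taking random labelling as the null model (an assumption made purely for illustration):
f1 <- function(X) { marks(X) <= 15 }
f2 <- function(X) { marks(X) >= 25 }
E <- envelope(longleaf, Kmulti, I=f1, J=f2, nsim=19,
              simulate=expression(rlabel(longleaf)))
plot(E)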
The argument r
is the vector of values for the
distance r
at which K_{IJ}(r)
should be evaluated.
It is also used to determine the breakpoints
(in the sense of hist
)
for the computation of histograms of distances.
First-time users would be strongly advised not to specify r
.
However, if it is specified, r
must satisfy r[1] = 0
,
and max(r)
must be larger than the radius of the largest disc
contained in the window.
This algorithm assumes that X
can be treated
as a realisation of a stationary (spatially homogeneous)
random spatial point process in the plane, observed through
a bounded window.
The window (which is specified in X
as Window(X)
)
may have arbitrary shape.
Biases due to edge effects are
treated in the same manner as in Kest
.
The edge corrections implemented here are
- border
the border method or “reduced sample” estimator (see Ripley, 1988). This is the least efficient (statistically) and the fastest to compute. It can be computed for a window of arbitrary shape.
- isotropic/Ripley
Ripley's isotropic correction (see Ripley, 1988; Ohser, 1983). This is currently implemented only for rectangular and polygonal windows.
- translate
Translation correction (Ohser, 1983). Implemented for all window geometries.
The pair correlation function pcf
can also be applied to the
result of Kmulti
.
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing numeric columns
r |
the values of the argument |
theo |
the theoretical value of |
together with a column or columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function K_{IJ}(r)
obtained by the edge corrections named.
If ratio=TRUE
then the return value also has two
attributes called "numerator"
and "denominator"
which are "fv"
objects
containing the numerators and denominators of each
estimate of K(r)
.
Warnings
The function K_{IJ}
is not necessarily differentiable.
The border correction (reduced sample) estimator of
K_{IJ}
used here is pointwise approximately
unbiased, but need not be a nondecreasing function of r
,
while the true K_{IJ}
must be nondecreasing.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Cressie, N.A.C. Statistics for spatial data. John Wiley and Sons, 1991.
Diggle, P.J. Statistical analysis of spatial point patterns. Academic Press, 1983.
Diggle, P. J. (1986). Displaced amacrine cells in the retina of a rabbit: analysis of a bivariate spatial point pattern. J. Neurosci. Meth. 18, 115–125.
Harkness, R.D. and Isham, V. (1983) A bivariate spatial point pattern of ants' nests. Applied Statistics 32, 293–303.
Lotwick, H. W. and Silverman, B. W. (1982). Methods for analysing spatial processes of several types of points. J. Royal Statist. Soc. Ser. B 44, 406–413.
Ripley, B.D. Statistical inference for spatial processes. Cambridge University Press, 1988.
Stoyan, D, Kendall, W.S. and Mecke, J. Stochastic geometry and its applications. 2nd edition. Springer Verlag, 1995.
Van Lieshout, M.N.M. and Baddeley, A.J. (1999) Indices of dependence between types in multivariate point patterns. Scandinavian Journal of Statistics 26, 511–532.
See Also
Examples
# Longleaf Pine data: marks represent diameter
trees <- longleaf
K <- Kmulti(trees, marks(trees) <= 15, marks(trees) >= 25)
plot(K)
# functions determining subsets
f1 <- function(X) { marks(X) <= 15 }
f2 <- function(X) { marks(X) >= 25 }
K <- Kmulti(trees, f1, f2)
Inhomogeneous Marked K-Function
Description
For a marked point pattern,
estimate the inhomogeneous version of the multitype K
function
which counts the expected number of points of subset J
within a given distance from a typical point in subset I
,
adjusted for spatially varying intensity.
Usage
Kmulti.inhom(X, I, J, lambdaI=NULL, lambdaJ=NULL,
...,
r=NULL, breaks=NULL,
correction=c("border", "isotropic", "Ripley", "translate"),
lambdaIJ=NULL,
sigma=NULL, varcov=NULL,
lambdaX=NULL, update=TRUE, leaveoneout=TRUE)
Arguments
X |
The observed point pattern,
from which an estimate of the inhomogeneous multitype |
I |
Subset index specifying the points of |
J |
Subset index specifying the points in |
lambdaI |
Optional.
Values of the estimated intensity of the sub-process |
lambdaJ |
Optional.
Values of the estimated intensity of the sub-process |
... |
Ignored. |
r |
Optional. Numeric vector. The values of the argument |
breaks |
This argument is for internal use only. |
correction |
A character vector containing any selection of the
options |
lambdaIJ |
Optional. A matrix containing estimates of
the product of the intensities |
sigma , varcov |
Optional arguments passed to |
lambdaX |
Optional. Values of the intensity for all points of |
update |
Logical value indicating what to do when
|
leaveoneout |
Logical value (passed to |
Details
The function Kmulti.inhom
is the counterpart, for spatially-inhomogeneous marked point patterns,
of the multitype K
function Kmulti
.
Suppose X
is a marked point process, with marks of any kind.
Suppose X_I
, X_J
are two sub-processes, possibly
overlapping. Typically X_I
would consist of those points
of X
whose marks lie in a specified range of mark values,
and similarly for X_J
. Suppose that
\lambda_I(u)
, \lambda_J(u)
are the
spatially-varying intensity functions of X_I
and
X_J
respectively. Consider all the pairs of points
(u,v)
in the point process X
such that the first point
u
belongs to X_I
, the second point v
belongs to X_J
, and the distance between u
and v
is less than a specified distance r
. Give this pair (u,v)
the numerical weight
1/(\lambda_I(u)\lambda_J(v))
.
Calculate the sum of these weights over all pairs of points as
described. This sum (after appropriate edge-correction and
normalisation) is the estimated inhomogeneous multitype K
function.
The argument X
must be a point pattern (object of class
"ppp"
) or any data that are acceptable to as.ppp
.
The arguments I
and J
specify two subsets of the
point pattern. They may be any type of subset indices, for example,
logical vectors of length equal to npoints(X)
,
or integer vectors with entries in the range 1 to
npoints(X)
, or negative integer vectors.
Alternatively, I
and J
may be functions
that will be applied to the point pattern X
to obtain
index vectors. If I
is a function, then evaluating
I(X)
should yield a valid subset index. This option
is useful when generating simulation envelopes using
envelope
.
The argument lambdaI
supplies the values
of the intensity of the sub-process identified by index I
.
It may be either
- a pixel image
(object of class
"im"
) which gives the values of the intensity ofX[I]
at all locations in the window containingX
;- a numeric vector
containing the values of the intensity of
X[I]
evaluated only at the data points ofX[I]
. The length of this vector must equal the number of points inX[I]
.- a function
-
of the form
function(x,y)
which can be evaluated to give values of the intensity at any locations. - a fitted point process model
-
(object of class
"ppm"
,"kppm"
or"dppm"
) whose fitted trend can be used as the fitted intensity. (Ifupdate=TRUE
the model will first be refitted to the dataX
before the trend is computed.) - omitted:
-
if
lambdaI
is omitted then it will be estimated using a leave-one-out kernel smoother.
If lambdaI
is omitted, then it will be estimated using
a ‘leave-one-out’ kernel smoother, as described in Baddeley,
Moller
and Waagepetersen (2000). The estimate of lambdaI
for a given
point is computed by removing the point from the
point pattern, applying kernel smoothing to the remaining points using
density.ppp
, and evaluating the smoothed intensity
at the point in question. The smoothing kernel bandwidth is controlled
by the arguments sigma
and varcov
, which are passed to
density.ppp
along with any extra arguments.
Similarly lambdaJ
supplies the values
of the intensity of the sub-process identified by index J
.
Alternatively if the argument lambdaX
is given, then it specifies
the intensity values for all points of X
, and the
arguments lambdaI
, lambdaJ
will be ignored.
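For example, a sketch using lambdaX with a single kernel estimate of intensity (the bandwidth sigma=1 is an arbitrary choice):
X <- finpines
II <- (marks(X)$height <= 2)
JJ <- (marks(X)$height > 3)
lamX <- density(unmark(X), sigma=1, at="points")
K <- Kmulti.inhom(X, II, JJ, lambdaX=lamX)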
The argument r
is the vector of values for the
distance r
at which K_{IJ}(r)
should be evaluated.
It is also used to determine the breakpoints
(in the sense of hist
)
for the computation of histograms of distances.
First-time users would be strongly advised not to specify r
.
However, if it is specified, r
must satisfy r[1] = 0
,
and max(r)
must be larger than the radius of the largest disc
contained in the window.
Biases due to edge effects are
treated in the same manner as in Kinhom
.
The edge corrections implemented here are
- border
the border method or “reduced sample” estimator (see Ripley, 1988). This is the least efficient (statistically) and the fastest to compute. It can be computed for a window of arbitrary shape.
- isotropic/Ripley
Ripley's isotropic correction (see Ripley, 1988; Ohser, 1983). This is currently implemented only for rectangular windows.
- translate
Translation correction (Ohser, 1983). Implemented for all window geometries.
The pair correlation function pcf
can also be applied to the
result of Kmulti.inhom
.
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing numeric columns
r |
the values of the argument |
theo |
the theoretical value of |
together with a column or columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function K_{IJ}(r)
obtained by the edge corrections named.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
References
Baddeley, A., Moller, J. and Waagepetersen, R. (2000) Non- and semiparametric estimation of interaction in inhomogeneous point patterns. Statistica Neerlandica 54, 329–350.
See Also
Kmulti
,
Kdot.inhom
,
Kcross.inhom
,
pcf
Examples
# Finnish Pines data: marked by diameter and height
plot(finpines, which.marks="height")
II <- (marks(finpines)$height <= 2)
JJ <- (marks(finpines)$height > 3)
K <- Kmulti.inhom(finpines, II, JJ)
plot(K)
# functions determining subsets
f1 <- function(X) { marks(X)$height <= 2 }
f2 <- function(X) { marks(X)$height > 3 }
K <- Kmulti.inhom(finpines, f1, f2)
Locally Scaled K-function
Description
Estimates the locally-rescaled K
-function of
a point process.
Usage
Kscaled(X, lambda=NULL, ..., r = NULL, breaks = NULL,
rmax = 2.5,
correction=c("border", "isotropic", "translate"),
renormalise=FALSE, normpower=1,
sigma=NULL, varcov=NULL)
Lscaled(...)
Arguments
X |
The observed data point pattern,
from which an estimate of the locally scaled |
lambda |
Optional.
Values of the estimated intensity function.
Either a vector giving the intensity values
at the points of the pattern |
... |
Arguments passed from |
r |
vector of values for the argument |
breaks |
This argument is for internal use only. |
rmax |
maximum value of the argument |
correction |
A character vector containing any selection of the
options |
renormalise |
Logical. Whether to renormalise the estimate. See Details. |
normpower |
Integer (usually either 1 or 2). Normalisation power. See Details. |
sigma , varcov |
Optional arguments passed to |
Details
Kscaled
computes an estimate of the K
function
for a locally scaled point process.
Lscaled
computes the corresponding L
function
L(r) = \sqrt{K(r)/\pi}
.
Locally scaled point processes are a class of models for inhomogeneous point patterns, introduced by Hahn et al (2003). They include inhomogeneous Poisson processes, and many other models.
The template K
function of a locally-scaled process is a counterpart
of the “ordinary” Ripley K
function, in which
the distances between points of the process are measured
on a spatially-varying scale (such that the locally rescaled
process has unit intensity).
The template K
function is an indicator of interaction
between the points. For an inhomogeneous Poisson process, the
theoretical template K
function is approximately equal
to K(r) = \pi r^2
.
Values K_{\rm scaled}(r) > \pi r^2
are suggestive of clustering.
Kscaled
computes an estimate of the template K
function
and Lscaled
computes the corresponding L
function
L(r) = \sqrt{K(r)/\pi}
.
The locally scaled interpoint distances are computed using an approximation proposed by Hahn (2007). The Euclidean distance between two points is multiplied by the average of the square roots of the intensity values at the two points.
The argument lambda
should supply the
(estimated) values of the intensity function \lambda
.
It may be either
- a numeric vector
-
containing the values of the intensity function at the points of the pattern
X
. - a pixel image
-
(object of class
"im"
) assumed to contain the values of the intensity function at all locations in the window. - a function
-
which can be evaluated to give values of the intensity at any locations.
- omitted:
-
if
lambda
is omitted, then it will be estimated using a ‘leave-one-out’ kernel smoother.
If lambda
is a numeric vector, then its length should
be equal to the number of points in the pattern X
.
The value lambda[i]
is assumed to be the (estimated) value of the intensity
\lambda(x_i)
for
the point x_i
of the pattern X
.
Each value must be a positive number; NA
's are not allowed.
If lambda
is a pixel image, the domain of the image should
cover the entire window of the point pattern. If it does not (which
may occur near the boundary because of discretisation error),
then the missing pixel values
will be obtained by applying a Gaussian blur to lambda
using
blur
, then looking up the values of this blurred image
for the missing locations.
(A warning will be issued in this case.)
If lambda
is a function, then it will be evaluated in the
form lambda(x,y)
where x
and y
are vectors
of coordinates of the points of X
. It should return a numeric
vector with length equal to the number of points in X
.
If lambda
is omitted, then it will be estimated using
a ‘leave-one-out’ kernel smoother,
as described in Baddeley, Moller
and Waagepetersen (2000). The estimate lambda[i]
for the
point X[i]
is computed by removing X[i]
from the
point pattern, applying kernel smoothing to the remaining points using
density.ppp
, and evaluating the smoothed intensity
at the point X[i]
. The smoothing kernel bandwidth is controlled
by the arguments sigma
and varcov
, which are passed to
density.ppp
along with any extra arguments.
If renormalise=TRUE
, the estimated intensity lambda
is multiplied by c^(normpower/2)
before performing other calculations,
where c = area(W)/sum[i] (1/lambda(x[i]))
. This
renormalisation has about the same effect as in Kinhom
,
reducing the variability and bias of the estimate
in small samples and in cases of very strong inhomogeneity.
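For example, a sketch with the renormalisation switched on:
K2 <- Kscaled(unmark(bronzefilter), renormalise=TRUE, normpower=2)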
Edge corrections are used to correct bias in the estimation
of K_{\rm scaled}
. First the interpoint distances are
rescaled, and then edge corrections are applied as in Kest
.
See Kest
for details of the edge corrections
and the options for the argument correction
.
The pair correlation function can also be applied to the
result of Kscaled
; see pcf
and pcf.fv
.
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing at least the following columns,
r |
the vector of values of the argument |
theo |
vector of values of |
and containing additional columns
according to the choice specified in the correction
argument. The additional columns are named
border
, trans
and iso
and give the estimated values of
K_{\rm scaled}(r)
using the border correction, translation correction,
and Ripley isotropic correction, respectively.
Author(s)
Ute Hahn, Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Baddeley, A., Moller, J. and Waagepetersen, R. (2000) Non- and semiparametric estimation of interaction in inhomogeneous point patterns. Statistica Neerlandica 54, 329–350.
Hahn, U. (2007) Global and Local Scaling in the Statistics of Spatial Point Processes. Habilitationsschrift, Universitaet Augsburg.
Hahn, U., Jensen, E.B.V., van Lieshout, M.N.M. and Nielsen, L.S. (2003) Inhomogeneous spatial point processes by location-dependent scaling. Advances in Applied Probability 35, 319–336.
Prokesova, M., Hahn, U. and Vedel Jensen, E.B. (2006) Statistics for locally scaled point patterns. In A. Baddeley, P. Gregori, J. Mateu, R. Stoica and D. Stoyan (eds.) Case Studies in Spatial Point Pattern Modelling. Lecture Notes in Statistics 185. New York: Springer Verlag. Pages 99–123.
See Also
Examples
X <- unmark(bronzefilter)
K <- Kscaled(X)
if(require("spatstat.model")) {
fit <- ppm(X, ~x)
lam <- predict(fit)
K <- Kscaled(X, lam)
}
Sector K-function
Description
A directional counterpart of Ripley's K
function,
in which pairs of points are counted only when the
vector joining the pair happens to
lie in a particular range of angles.
Usage
Ksector(X, begin = 0, end = 360, ...,
units = c("degrees", "radians"),
r = NULL, breaks = NULL,
correction = c("border", "isotropic", "Ripley", "translate"),
domain=NULL, ratio = FALSE, verbose=TRUE)
Arguments
X |
The observed point pattern,
from which an estimate of |
begin , end |
Numeric values giving the range of angles inside which
points will be counted. Angles are measured in degrees
(if |
... |
Ignored. |
units |
Units in which the angles |
r |
Optional. Vector of values for the argument |
breaks |
This argument is for internal use only. |
correction |
Optional. A character vector containing any selection of the
options |
domain |
Optional window. The first point |
ratio |
Logical.
If |
verbose |
Logical value indicating whether to print progress reports and warnings. |
Details
This is a directional counterpart of Ripley's K
function
(see Kest
) in which, instead of counting all
pairs of points within a specified distance r
, we
count only the pairs (x_i, x_j)
for which the vector x_j - x_i
falls in a particular range of angles.
This can be used to evaluate evidence for anisotropy
in the point pattern X
.
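For example, a sketch comparing two perpendicular sectors (assuming the isotropic-correction column is named iso, as for Kest):
Kh <- Ksector(swedishpines, 0, 90)
Kv <- Ksector(swedishpines, 90, 180)
plot(Kh, iso ~ r)
plot(Kv, iso ~ r, add=TRUE, lty=2)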
Value
An object of class "fv"
containing the estimated
function.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
Rolf Turner rolfturner@posteo.net
and Ege Rubak rubak@math.aau.dk
See Also
Examples
K <- Ksector(swedishpines, 0, 90)
plot(K)
Multitype L-function (cross-type)
Description
Calculates an estimate of the cross-type L-function for a multitype point pattern.
Usage
Lcross(X, i, j, ..., from, to, correction)
Arguments
X |
The observed point pattern,
from which an estimate of the cross-type |
i |
The type (mark value)
of the points in |
j |
The type (mark value)
of the points in |
correction , ... |
Arguments passed to |
from , to |
An alternative way to specify |
Details
The cross-type L-function is a transformation of the cross-type K-function,
L_{ij}(r) = \sqrt{\frac{K_{ij}(r)}{\pi}}
where K_{ij}(r)
is the cross-type K-function
from type i
to type j
.
See Kcross
for information
about the cross-type K-function.
The command Lcross
first calls
Kcross
to compute the estimate of the cross-type K-function,
and then applies the square root transformation.
For a marked point pattern in which the points of type i
are independent of the points of type j
,
the theoretical value of the L-function is
L_{ij}(r) = r
.
The square root also has the effect of stabilising
the variance of the estimator, so that L_{ij}
is more appropriate
for use in simulation envelopes and hypothesis tests.
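For example, a sketch of an envelope under random labelling (nsim kept small purely for speed):
E <- envelope(amacrine, Lcross, i="off", j="on", nsim=19,
              simulate=expression(rlabel(amacrine)))
plot(E)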
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Essentially a data frame containing columns
r |
the vector of values of the argument |
theo |
the theoretical value |
together with columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function L_{ij}
obtained by the edge corrections
named.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Examples
L <- Lcross(amacrine, "off", "on")
plot(L)
Inhomogeneous Cross Type L Function
Description
For a multitype point pattern,
estimate the inhomogeneous version of the cross-type L
function.
Usage
Lcross.inhom(X, i, j, ..., correction)
Arguments
X |
The observed point pattern,
from which an estimate of the inhomogeneous cross type |
i |
The type (mark value)
of the points in |
j |
The type (mark value)
of the points in |
correction , ... |
Other arguments passed to |
Details
This is a generalisation of the function Lcross
to include an adjustment for spatially inhomogeneous intensity,
in a manner similar to the function Linhom
.
All the arguments are passed to Kcross.inhom
, which
estimates the inhomogeneous multitype K function
K_{ij}(r)
for the point pattern.
The resulting values are then
transformed by taking L(r) = \sqrt{K(r)/\pi}
.
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing numeric columns
r |
the values of the argument |
theo |
the theoretical value of |
together with a column or columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function L_{ij}(r)
obtained by the edge corrections named.
Warnings
The arguments i
and j
are always interpreted as
levels of the factor X$marks
. They are converted to character
strings if they are not already character strings.
The value i=1
does not
refer to the first level of the factor.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Moller, J. and Waagepetersen, R. (2003) Statistical Inference and Simulation for Spatial Point Processes. Chapman and Hall/CRC, Boca Raton.
See Also
Examples
# Lansing Woods data
woods <- lansing
ma <- split(woods)$maple
wh <- split(woods)$whiteoak
# method (1): estimate intensities by nonparametric smoothing
lambdaM <- density.ppp(ma, sigma=0.15, at="points")
lambdaW <- density.ppp(wh, sigma=0.15, at="points")
L <- Lcross.inhom(woods, "whiteoak", "maple", lambdaW, lambdaM)
# method (2): fit parametric intensity model
if(require("spatstat.model")) {
fit <- ppm(woods ~marks * polynom(x,y,2))
# evaluate fitted intensities at data points
# (these are the intensities of the sub-processes of each type)
inten <- fitted(fit, dataonly=TRUE)
# split according to types of points
lambda <- split(inten, marks(woods))
L <- Lcross.inhom(woods, "whiteoak", "maple",
lambda$whiteoak, lambda$maple)
}
# synthetic example: type A points have intensity 50,
# type B points have intensity 50 + 100 * x
lamB <- as.im(function(x,y){50 + 100 * x}, owin())
X <- superimpose(A=runifpoispp(50), B=rpoispp(lamB))
L <- Lcross.inhom(X, "A", "B",
lambdaI=as.im(50, Window(X)), lambdaJ=lamB)
Multitype L-function (i-to-any)
Description
Calculates an estimate of the multitype L-function
(from type i
to any type)
for a multitype point pattern.
Usage
Ldot(X, i, ..., from, correction)
Arguments
X |
The observed point pattern,
from which an estimate of the dot-type |
i |
The type (mark value)
of the points in |
correction , ... |
Arguments passed to |
from |
An alternative way to specify |
Details
This command computes
L_{i\bullet}(r) = \sqrt{\frac{K_{i\bullet}(r)}{\pi}}
where K_{i\bullet}(r)
is the multitype K
-function
from points of type i
to points of any type.
See Kdot
for information
about K_{i\bullet}(r)
.
The command Ldot
first calls
Kdot
to compute the estimate of the i
-to-any
K
-function, and then applies the square root transformation.
For a marked Poisson point process,
the theoretical value of the L-function is
L_{i\bullet}(r) = r
.
The square root also has the effect of stabilising
the variance of the estimator, so that L_{i\bullet}
is more appropriate
for use in simulation envelopes and hypothesis tests.
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Essentially a data frame containing columns
r |
the vector of values of the argument |
theo |
the theoretical value |
together with columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function L_{i\bullet}
obtained by the edge corrections named.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Examples
L <- Ldot(amacrine, "off")
plot(L)
Inhomogeneous Multitype L Dot Function
Description
For a multitype point pattern,
estimate the inhomogeneous version of the dot L
function.
Usage
Ldot.inhom(X, i, ..., correction)
Arguments
X |
The observed point pattern,
from which an estimate of the inhomogeneous cross type |
i |
The type (mark value)
of the points in |
correction , ... |
Other arguments passed to |
Details
This is a generalisation of the function Ldot
to include an adjustment for spatially inhomogeneous intensity,
in a manner similar to the function Linhom
.
All the arguments are passed to Kdot.inhom
, which
estimates the inhomogeneous multitype K function
K_{i\bullet}(r)
for the point pattern.
The resulting values are then
transformed by taking L(r) = \sqrt{K(r)/\pi}
.
Value
An object of class "fv"
(see fv.object
).
Essentially a data frame containing numeric columns
r |
the values of the argument |
theo |
the theoretical value of |
together with a column or columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function L_{i\bullet}(r)
obtained by the edge corrections named.
Warnings
The argument i
is interpreted as
a level of the factor X$marks
. It is converted to a character
string if it is not already a character string.
The value i=1
does not
refer to the first level of the factor.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk
References
Moller, J. and Waagepetersen, R. (2003) Statistical Inference and Simulation for Spatial Point Processes. Chapman and Hall/CRC, Boca Raton.
See Also
Ldot
,
Linhom
,
Kdot.inhom
,
Lcross.inhom
.
Examples
# Lansing Woods data
lan <- lansing
lan <- lan[seq(1,npoints(lan), by=10)]
ma <- split(lan)$maple
lg <- unmark(lan)
# Estimate intensities by nonparametric smoothing
lambdaM <- density(ma, sigma=0.15, at="points")
lambdadot <- density(lg, sigma=0.15, at="points")
L <- Ldot.inhom(lan, "maple", lambdaI=lambdaM,
lambdadot=lambdadot)
# synthetic example: type A points have intensity 50,
# type B points have intensity 50 + 100 * x
lamB <- as.im(function(x,y){50 + 100 * x}, owin())
lamdot <- as.im(function(x,y) { 100 + 100 * x}, owin())
X <- superimpose(A=runifpoispp(50), B=rpoispp(lamB))
L <- Ldot.inhom(X, "B", lambdaI=lamB, lambdadot=lamdot)
L-function
Description
Calculates an estimate of the L
-function (Besag's
transformation of Ripley's K
-function)
for a spatial point pattern.
Usage
Lest(X, ..., correction)
Arguments
X |
The observed point pattern,
from which an estimate of |
correction , ... |
Other arguments passed to |
Details
This command computes an estimate of the L
-function
for the spatial point pattern X
.
The L
-function is a transformation of Ripley's K
-function,
L(r) = \sqrt{\frac{K(r)}{\pi}}
where K(r)
is the K
-function.
See Kest
for information
about Ripley's K
-function. The transformation to L
was
proposed by Besag (1977).
The command Lest
first calls
Kest
to compute the estimate of the K
-function,
and then applies the square root transformation.
For a completely random (uniform Poisson) point pattern,
the theoretical value of the L
-function is L(r) = r
.
The square root also has the effect of stabilising
the variance of the estimator, so that L(r)
is more appropriate
for use in simulation envelopes and hypothesis tests.
See Kest
for the list of arguments.
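For example, a sketch plotting the centred function L(r) - r, which is identically zero under CSR:
L <- Lest(cells)
plot(L, . - r ~ r, ylab="L(r) - r")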
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Essentially a data frame containing columns
r |
the vector of values of the argument |
theo |
the theoretical value |
together with columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function L(r)
obtained by the edge corrections
named.
Variance approximations
If the argument var.approx=TRUE
is given, the return value
includes columns rip
and ls
containing approximations
to the variance of \hat L(r)
under CSR.
These are obtained by the delta method from the variance
approximations described in Kest
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Besag, J. (1977) Discussion of Dr Ripley's paper. Journal of the Royal Statistical Society, Series B, 39, 193–195.
See Also
Examples
L <- Lest(cells)
plot(L, main="L function for cells")
Inhomogeneous L-function
Description
Calculates an estimate of the inhomogeneous version of
the L
-function (Besag's transformation of Ripley's K
-function)
for a spatial point pattern.
Usage
Linhom(X, ..., correction)
Arguments
X |
The observed point pattern,
from which an estimate of |
correction , ... |
Other arguments passed to |
Details
This command computes an estimate of the inhomogeneous version of
the L
-function for a spatial point pattern.
The original L
-function is a transformation
(proposed by Besag) of Ripley's K
-function,
L(r) = \sqrt{\frac{K(r)}{\pi}}
where K(r)
is the Ripley K
-function of a spatially homogeneous
point pattern, estimated by Kest
.
The inhomogeneous L
-function is the corresponding transformation
of the inhomogeneous K
-function, estimated by Kinhom
.
It is appropriate when the point pattern clearly does not have a
homogeneous intensity of points. It was proposed by
Baddeley, Moller and Waagepetersen (2000).
The command Linhom
first calls
Kinhom
to compute the estimate of the inhomogeneous K-function,
and then applies the square root transformation.
For a Poisson point pattern (homogeneous or inhomogeneous),
the theoretical value of the inhomogeneous L
-function is L(r) = r
.
The square root also has the effect of stabilising
the variance of the estimator, so that L
is more appropriate
for use in simulation envelopes and hypothesis tests.
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Essentially a data frame containing columns
r |
the vector of values of the argument |
theo |
the theoretical value |
together with columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function L(r)
obtained by the edge corrections
named.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Baddeley, A., Moller, J. and Waagepetersen, R. (2000) Non- and semiparametric estimation of interaction in inhomogeneous point patterns. Statistica Neerlandica 54, 329–350.
See Also
Examples
X <- japanesepines
L <- Linhom(X, sigma=0.1)
plot(L, main="Inhomogeneous L function for Japanese Pines")
Mean Integrated Squared Error on an Envelope Object
Description
Compute the mean integrated squared error, or integrated squared bias, or integrated variance, of the simulated function estimates in an envelope object.
Usage
ISB.envelope(object, theo, domain, dimension=2)
IV.envelope(object, domain, dimension=2)
MISE.envelope(object, theo, domain, dimension=2)
Arguments
object |
Object of class |
theo |
Function in the R language that evaluates the true (theoretically expected) value of the spatial summary function. |
domain |
Numeric vector of length 2 specifying the limits of the domain of integration for the integrated squared error. |
dimension |
Integer (either 1 or 2) specifying whether to calculate the one-dimensional or two-dimensional integral of squared error. |
Details
The first argument should be an object of class "envelope"
and should contain the simulated function estimates (i.e. it should
have been computed using envelope
with savefuns=TRUE
).
MISE.envelope
computes the mean integrated squared error.
ISB.envelope
computes the integrated squared bias.
IV.envelope
computes the integrated sample variance.
The simulated function estimates are extracted from object
and their deviation from the true function theo
is computed pointwise. The squared deviations are integrated over the
interval specified by domain
, giving one value of integrated
squared error for each simulated function estimate. Then
MISE.envelope
returns the average of these values, that is,
the estimated mean integrated squared error. Similarly for the other computations.
Value
A single numeric value.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Martin Hazelton Martin.Hazelton@otago.ac.nz and Tilman Davies Tilman.Davies@otago.ac.nz.
See Also
Examples
E <- envelope(cells, Kest, nsim=20, savefuns=TRUE)
theoK <- function(r) { pi * r^2 }
dom <- c(0, 0.1)
MISE.envelope(E, theoK, dom)
ISB.envelope(E, theoK, dom)
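The integrated sample variance can be obtained from the same saved envelope object:
IV.envelope(E, dom)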
S3 Group Generic Methods for Function Arrays
Description
These are group generic methods for objects of class "fasp", which allow the usual mathematical functions and operators to be applied directly to function arrays. See Details for a list of the implemented functions.
Usage
## S3 methods for group generics have prototypes:
Math(x, ...)
Ops(e1, e2)
Complex(z)
Summary(..., na.rm=FALSE, drop=TRUE)
Arguments
x , z , e1 , e2 |
objects of class |
... |
further arguments passed to methods. |
na.rm |
Logical value specifying whether missing values should be removed. |
Details
Below is a list of the mathematical functions and operators which are defined for objects of class "fasp". The methods are implemented using eval.fasp, which tries to harmonise the functions via harmonise.fv if they are not compatible to begin with.
Group "Math":
-
abs, sign, sqrt, floor, ceiling, trunc, round, signif
-
exp, log, expm1, log1p, cos, sin, tan, cospi, sinpi, tanpi, acos, asin, atan, cosh, sinh, tanh, acosh, asinh, atanh
-
lgamma, gamma, digamma, trigamma
-
cumsum, cumprod, cummax, cummin
Group "Ops":
-
"+", "-", "*", "/", "^", "%%", "%/%"
-
"&", "|", "!"
-
"==", "!=", "<", "<=", ">=", ">"
Group "Summary":
-
all, any
-
sum, prod
-
min, max
-
range
Group "Complex":
-
Arg, Conj, Im, Mod, Re
For the Ops group, one of the arguments is permitted to be a single atomic value, or a function table, instead of a function array.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
eval.fasp
for evaluating expressions involving function arrays.
Examples
## convert array of K functions to array of L functions
K <- alltypes(amacrine, "K")
L <- sqrt(K/pi)
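As a further illustrative sketch, the Ops group also accepts a single atomic operand (as noted in Details), so every panel of the array can be rescaled in one step:
## multiply every function in the array by a constant
K2 <- 2 * K
plot(K2)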
S3 Group Generic Methods for Function Tables
Description
These are group generic methods for objects of class "fv", which allow the usual mathematical functions and operators to be applied directly to function tables. See Details for a list of the implemented functions.
Usage
## S3 methods for group generics have prototypes:
Math(x, ...)
Ops(e1, e2)
Complex(z)
Summary(..., na.rm=FALSE, drop=TRUE)
Arguments
x , z , e1 , e2 |
objects of class |
... |
further arguments passed to methods. |
na.rm |
Logical value specifying whether missing values should be removed. |
Details
Below is a list of the mathematical functions and operators which are defined for objects of class "fv". The methods are implemented using eval.fv, which tries to harmonise the functions via harmonise.fv if they are not compatible to begin with.
Group "Math":
-
abs, sign, sqrt, floor, ceiling, trunc, round, signif
-
exp, log, expm1, log1p, cos, sin, tan, cospi, sinpi, tanpi, acos, asin, atan, cosh, sinh, tanh, acosh, asinh, atanh
-
lgamma, gamma, digamma, trigamma
-
cumsum, cumprod, cummax, cummin
Group "Ops":
-
"+", "-", "*", "/", "^", "%%", "%/%"
-
"&", "|", "!"
-
"==", "!=", "<", "<=", ">=", ">"
Group "Summary":
-
all, any
-
sum, prod
-
min, max
-
range
Group "Complex":
-
Arg, Conj, Im, Mod, Re
For the Ops group, one of the arguments is permitted to be a single atomic value instead of a function table.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
eval.fv
for evaluating expressions involving function tables.
Examples
## Convert K function to L function
K <- Kest(cells)
L <- sqrt(K/pi)
## Manually calculate J function
FR <- Fest(redwood)
GR <- Gest(redwood)
suppressWarnings(JR <- (1-GR)/(1-FR))
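A further illustrative sketch of the Ops group with an atomic operand, as permitted above:
## rescale a function table by a constant
Kscaled <- 100 * Kest(cells)
plot(Kscaled)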
Transform a Function into its P-P or Q-Q Version
Description
Given a function object f containing both the estimated and theoretical versions of a summary function, these operations combine the estimated and theoretical functions into a new function. When plotted, the new function gives either the P-P plot or Q-Q plot of the original f.
Usage
PPversion(f, theo = "theo", columns = ".")
QQversion(f, theo = "theo", columns = ".")
Arguments
f |
The function to be transformed. An object of class |
theo |
The name of the column of |
columns |
Character vector, specifying the columns of |
Details
The argument f should be an object of class "fv", containing both empirical estimates \widehat f(r) and a theoretical value f_0(r) for a summary function.
The P-P version of f is the function g(x) = \widehat f(f_0^{-1}(x)) where f_0^{-1} is the inverse function of f_0. A plot of g(x) against x is equivalent to a plot of \widehat f(r) against f_0(r) for all r. If f is a cumulative distribution function (such as the result of Fest or Gest) then this is a P-P plot, a plot of the observed versus theoretical probabilities for the distribution. The diagonal line y = x corresponds to perfect agreement between the observed and theoretical distributions.
The Q-Q version of f is the function h(x) = f_0^{-1}(\widehat f(x)). If f is a cumulative distribution function, a plot of h(x) against x is a Q-Q plot, a plot of the observed versus theoretical quantiles of the distribution. The diagonal line y = x corresponds to perfect agreement between the observed and theoretical distributions. Another straight line corresponds to the situation where the observed variable is a linear transformation of the theoretical variable. For a point pattern X, the Q-Q version of Kest(X) is essentially equivalent to Lest(X).
Value
Another object of class "fv".
Author(s)
Tom Lawrence and Adrian Baddeley.
Implemented by Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Examples
opa <- par(mar=0.1+c(5,5,4,2))
G <- Gest(redwoodfull)
plot(PPversion(G))
plot(QQversion(G))
par(opa)
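The remark in Details that the Q-Q version of Kest(X) is essentially equivalent to Lest(X) can be checked visually; a minimal sketch:
K <- Kest(redwoodfull)
plot(QQversion(K))
plot(Lest(redwoodfull))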
Spatial smoothing of data
Description
Generic function to perform spatial smoothing of spatial data.
Usage
Smooth(X, ...)
Arguments
X |
Some kind of spatial data |
... |
Arguments passed to methods. |
Details
This generic function calls an appropriate method to perform spatial smoothing on the spatial dataset X.
Methods for this function include
-
Smooth.ppp for point patterns
-
Smooth.msr for measures
-
Smooth.fv for function value tables
Value
An object containing smoothed values of the input data, in an appropriate format. See the documentation for the methods.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Smooth.ppp
,
Smooth.im
,
Smooth.msr
,
Smooth.fv
.
Apply Smoothing to Function Values
Description
Applies smoothing to the values in selected columns of a function value table.
Usage
## S3 method for class 'fv'
Smooth(X, which = "*", ...,
method=c("smooth.spline", "loess"),
xinterval=NULL)
Arguments
X |
Values to be smoothed.
A function value table (object of class |
which |
Character vector identifying which columns of the table
should be smoothed. Either a vector containing names
of columns, or one of the wildcard strings |
... |
Extra arguments passed to |
method |
Smoothing algorithm. A character string, partially matched
to either |
xinterval |
Optional. Numeric vector of length 2 specifying a range of
|
Details
The command Smooth.fv applies smoothing to the function values in a function value table (object of class "fv"). Smooth.fv is a method for the generic function Smooth.
The smoothing is performed either by smooth.spline or by loess.
Smoothing is applied to every column (or to each of the selected columns) of function values in turn, using the function argument as the x coordinate and the selected column as the y coordinate. The original function values are then replaced by the corresponding smooth interpolated function values.
The optional argument which specifies which of the columns of function values in x will be smoothed. The default (indicated by the wildcard which="*") is to smooth all function values, i.e. all columns except the function argument. Alternatively which="." designates the subset of function values that are displayed in the default plot. Alternatively which can be a character vector containing the names of columns of x.
If the argument xinterval is given, then smoothing will be performed only in the specified range of x values.
Value
Another function value table (object of class "fv") of the same format.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Smooth, with.fv, fv.object, smooth.spline, loess.
Examples
G <- Gest(cells)
plot(G)
plot(Smooth(G, df=9), add=TRUE)
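A further sketch combining the which and xinterval arguments described in Details (the column name and range are chosen purely for illustration):
## smooth only the Kaplan-Meier estimate, restricted to r <= 0.1
plot(Smooth(G, which="km", xinterval=c(0, 0.1), df=9))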
Spatial smoothing of observations at irregular points
Description
Performs spatial smoothing of numeric values observed at a set of irregular locations. Uses kernel smoothing and least-squares cross-validated bandwidth selection.
Usage
## S3 method for class 'ppp'
Smooth(X, sigma=NULL,
...,
weights = rep(1, npoints(X)),
at = "pixels", leaveoneout=TRUE,
adjust = 1, varcov = NULL,
edge = TRUE, diggle = FALSE,
kernel = "gaussian",
scalekernel = is.character(kernel),
se = FALSE,
loctype = c("random", "fixed"),
wtype = c("multiplicity", "importance"),
geometric = FALSE,
shrink=0, shrinktype=c("mean", "median"))
markmean(X, ...)
markvar(X, sigma=NULL, ..., weights=NULL, varcov=NULL)
Arguments
X |
A marked point pattern (object of class |
sigma |
Smoothing bandwidth.
A single positive number, a numeric vector of length 2,
or a function that selects the bandwidth automatically.
See |
... |
Further arguments passed to
|
weights |
Optional weights attached to the observations.
A numeric vector, a |
at |
String specifying whether to compute the smoothed values
at a grid of pixel locations ( |
leaveoneout |
Logical value indicating whether to compute a leave-one-out
estimator. Applicable only when |
edge , diggle |
Arguments passed to |
adjust |
Optional. Adjustment factor for the bandwidth |
varcov |
Variance-covariance matrix. An alternative
to |
kernel |
The smoothing kernel.
A character string specifying the smoothing kernel
(current options are |
scalekernel |
Logical value.
If |
se |
Logical value specifying whether to calculate standard errors. This calculation is experimental. |
loctype |
Character string (partially matched) specifying whether the point locations are assumed to be fixed or random, in the calculation of standard error. Experimental. |
wtype |
Character string (partially matched) specifying whether the weights should be interpreted as multiplicities or as importance weights, in the calculation of standard error. Experimental. |
geometric |
Logical value indicating whether to perform geometric mean smoothing instead of arithmetic mean smoothing. See Details. |
shrink , shrinktype |
Experimental. Do Not Use. |
Details
The function Smooth.ppp performs spatial smoothing of numeric values observed at a set of irregular locations. The functions markmean and markvar are wrappers for Smooth.ppp which compute the spatially-varying mean and variance of the marks of a point pattern.
Smooth.ppp is a method for the generic function Smooth for the class "ppp" of point patterns. Thus you can type simply Smooth(X).
Smoothing is performed by kernel weighting, using the Gaussian kernel by default. If the observed values are v_1,\ldots,v_n at locations x_1,\ldots,x_n respectively, then the smoothed value at a location u is (ignoring edge corrections)
g(u) = \frac{\sum_i k(u-x_i) v_i}{\sum_i k(u-x_i)}
where k is the kernel (a Gaussian kernel by default). This is known as the Nadaraya-Watson smoother (Nadaraya, 1964, 1989; Watson, 1964). By default, the smoothing kernel bandwidth is chosen by least squares cross-validation (see below).
The argument X must be a marked point pattern (object of class "ppp", see ppp.object). The points of the pattern are taken to be the observation locations x_i, and the marks of the pattern are taken to be the numeric values v_i observed at these locations.
The marks are allowed to be a data frame (in Smooth.ppp and markmean). Then the smoothing procedure is applied to each column of marks.
The numerator and denominator are computed by density.ppp. The arguments ... control the smoothing kernel parameters and determine whether edge correction is applied. The smoothing kernel bandwidth can be specified by either of the arguments sigma or varcov which are passed to density.ppp. If neither of these arguments is present, then by default the bandwidth is selected by least squares cross-validation, using bw.smoothppp.
The optional argument weights allows numerical weights to be applied to the data. If a weight w_i is associated with location x_i, then the smoothed function is (ignoring edge corrections)
g(u) = \frac{\sum_i k(u-x_i) v_i w_i}{\sum_i k(u-x_i) w_i}
If geometric=TRUE then geometric mean smoothing is performed instead of arithmetic mean smoothing. The mark values must be non-negative numbers. The logarithm of the mark values is computed; these logarithmic values are kernel-smoothed as described above; then the exponential function is applied to the smoothed values.
An alternative to kernel smoothing is inverse-distance weighting, which is performed by idw.
Value
If X has a single column of marks:
-
If at="pixels" (the default), the result is a pixel image (object of class "im"). Pixel values are values of the interpolated function.
-
If at="points", the result is a numeric vector of length equal to the number of points in X. Entries are values of the interpolated function at the points of X.
If X has a data frame of marks:
-
If at="pixels" (the default), the result is a named list of pixel images (object of class "im"). There is one image for each column of marks. This list also belongs to the class "solist", for which there is a plot method.
-
If at="points", the result is a data frame with one row for each point of X, and one column for each column of marks. Entries are values of the interpolated function at the points of X.
The return value has attributes "sigma" and "varcov" which report the smoothing bandwidth that was used.
Very small bandwidth
If the chosen bandwidth sigma is very small, kernel smoothing is mathematically equivalent to nearest-neighbour interpolation; the result will be computed by nnmark with ties resolved by taking the average mark. This happens unless at="points" and leaveoneout=FALSE, in which case the original mark values are returned.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Nadaraya, E.A. (1964) On estimating regression. Theory of Probability and its Applications 9, 141–142.
Nadaraya, E.A. (1989) Nonparametric estimation of probability densities and regression curves. Kluwer, Dordrecht.
Watson, G.S. (1964) Smooth regression analysis. Sankhya A 26, 359–372.
See Also
density.ppp
,
bw.smoothppp
,
nnmark
,
ppp.object
,
im.object
.
See idw
for inverse-distance weighted smoothing.
To perform interpolation, see also the akima
package.
Examples
# Longleaf data - tree locations, marked by tree diameter
# Local smoothing of tree diameter (automatic bandwidth selection)
Z <- Smooth(longleaf)
# Kernel bandwidth sigma=5
plot(Smooth(longleaf, 5))
# mark variance
plot(markvar(longleaf, sigma=5))
# data frame of marks: trees marked by diameter and height
plot(Smooth(finpines, sigma=2))
head(Smooth(finpines, sigma=2, at="points"))
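A further sketch of the geometric mean smoothing option described in Details (the bandwidth is chosen purely for illustration):
# geometric mean smoothing of tree diameter (marks must be non-negative)
plot(Smooth(longleaf, sigma=10, geometric=TRUE))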
Smooth a Spatially Sampled Function
Description
Applies kernel smoothing to a spatially sampled function.
Usage
## S3 method for class 'ssf'
Smooth(X, ...)
Arguments
X |
Object of class |
... |
Arguments passed to |
Details
An object of class "ssf" represents a real-valued or vector-valued function that has been evaluated or sampled at an irregular set of points. The function values will be smoothed using a Gaussian kernel.
Value
A pixel image or a list of pixel images.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
See Also
Examples
f <- ssf(redwood, nndist(redwood))
Smooth(f, sigma=0.1)
Spatial Smoothing of Data by Diffusion
Description
Generic function to perform spatial smoothing of spatial data by diffusion.
Usage
SmoothHeat(X, ...)
Arguments
X |
Some kind of spatial data |
... |
Arguments passed to methods. |
Details
This generic function calls an appropriate method to perform spatial smoothing on the spatial dataset X using diffusion.
Methods for this function include
-
SmoothHeat.ppp for point patterns
-
SmoothHeat.im for pixel images.
Value
An object containing smoothed values of the input data, in an appropriate format. See the documentation for the methods.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
See Also
SmoothHeat.ppp
,
SmoothHeat.im
.
Spatial Smoothing of Observations using Diffusion Estimate of Density
Description
Performs spatial smoothing of numeric values observed at a set of irregular locations, using the diffusion estimate of the density.
Usage
## S3 method for class 'ppp'
SmoothHeat(X, sigma, ..., weights=NULL)
Arguments
X |
Point pattern (object of class |
sigma |
Smoothing bandwidth. A single number giving the equivalent standard deviation of the smoother. |
... |
Arguments passed to |
weights |
Optional numeric vector of weights associated with each data point. |
Details
This is the analogue of the Nadaraya-Watson smoother, using the diffusion smoothing estimation procedure (Baddeley et al, 2022). The numerator and denominator of the Nadaraya-Watson smoother are calculated using densityHeat.ppp.
Value
Pixel image (object of class "im"
) giving the smoothed
mark value.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Tilman Davies Tilman.Davies@otago.ac.nz and Suman Rakshit.
References
Baddeley, A., Davies, T., Rakshit, S., Nair, G. and McSwiggan, G. (2022) Diffusion smoothing for spatial point patterns. Statistical Science 37, 123–142.
See Also
Smooth.ppp
for the usual kernel-based
smoother (the Nadaraya-Watson smoother)
and densityHeat
for the diffusion estimate of density.
Examples
plot(SmoothHeat(longleaf, 10))
Smooth Interpolation of Marks as a Spatial Function
Description
Perform spatial smoothing of numeric values observed at a set of irregular locations, and return the result as a function of spatial location.
Usage
Smoothfun(X, ...)
## S3 method for class 'ppp'
Smoothfun(X, sigma = NULL, ...,
weights = NULL, edge = TRUE, diggle = FALSE)
Arguments
X |
Marked point pattern (object of class |
sigma |
Smoothing bandwidth, or bandwidth selection function,
passed to |
... |
Additional arguments passed to |
weights |
Optional vector of weights associated with the points of |
edge , diggle |
Logical arguments controlling the edge correction.
Arguments passed to |
Details
The commands Smoothfun and Smooth both perform kernel-smoothed spatial interpolation of numeric values observed at irregular spatial locations. The difference is that Smooth returns a pixel image, containing the interpolated values at a grid of locations, while Smoothfun returns a function(x,y) which can be used to compute the interpolated value at any spatial location. For purposes such as model-fitting it is more accurate to use Smoothfun to interpolate data.
Value
A function with arguments x,y. The function also belongs to the class "Smoothfun" which has methods for print and as.im. It also belongs to the class "funxy" which has methods for plot, contour and persp.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Examples
f <- Smoothfun(longleaf)
f
f(120, 80)
plot(f)
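Because the result has a method for as.im (see Value), the interpolating function can also be rasterised when a pixel image is needed; a minimal sketch:
## convert the interpolating function to a pixel image
Z <- as.im(f)
plot(Z)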
Spatially Weighted Median of Values at Points
Description
Given a spatial point pattern with numeric marks, compute a weighted median of the mark values, with spatially-varying weights that depend on distance to the data points.
Usage
## S3 method for class 'ppp'
SpatialMedian(X, sigma = NULL, ...,
type = 4, at = c("pixels", "points"), leaveoneout = TRUE,
weights = NULL, edge = TRUE, diggle = FALSE, verbose = FALSE)
Arguments
X |
A spatial point pattern (object of class |
sigma |
Smoothing bandwidth, passed to |
... |
Further arguments passed to |
type |
Integer specifying the type of median
(using the convention of |
at |
Character string indicating whether to compute the median
at every pixel of a pixel image ( |
leaveoneout |
Logical value indicating whether to compute a leave-one-out
estimator. Applicable only when |
weights |
Optional vector of numeric weights attached to the points of |
edge , diggle |
Arguments passed to |
verbose |
Logical value specifying whether to print progress reports during the calculation. |
Details
The argument X should be a spatial point pattern (object of class "ppp") with numeric marks.
The algorithm computes the weighted median of the mark values at each desired spatial location, using spatially-varying weights which depend on distance to the data points.
Suppose the data points are at spatial locations x_1,\ldots,x_n and have mark values y_1,\ldots,y_n. For a query location u, the smoothed median is defined as the weighted median of the mark values y_1,\ldots,y_n with weights w_1(u),\ldots,w_n(u), where
w_i(u) = \frac{k(u,x_i)}{\sum_{j=1}^n k(u,x_j)}
and k(u,v) is the smoothing kernel with bandwidth sigma.
If at="points" and leaveoneout=TRUE, then a leave-one-out calculation is performed, which means that when the query location is a data point x_i, the value at the data point is ignored, and the weighted median is computed from the values y_j for all j not equal to i.
Value
If X has a single column of marks:
-
If at="pixels" (the default), the result is a pixel image (object of class "im").
-
If at="points", the result is a numeric vector of length equal to the number of points in X.
If X has a data frame of marks:
-
If at="pixels" (the default), the result is a named list of pixel images (object of class "im"). There is one image for each column of marks. This list also belongs to the class "solist", for which there is a plot method.
-
If at="points", the result is a data frame with one row for each point of X, and one column for each column of marks. Entries are values of the interpolated function at the points of X.
The return value has attributes "sigma" and "varcov" which report the smoothing bandwidth that was used.
The calculation of the median value depends on the argument type, which is interpreted in the same way as for quantile.default. Currently, only types 1 and 4 are implemented. If type=1, the median is always one of the mark values (one of the values in marks(X)). If type=4 (the default), the median value is obtained by linearly interpolating between mark values. Note that the default values of type in SpatialMedian.ppp and SpatialQuantile.ppp are different.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
See Also
Generic function SpatialMedian
.
SpatialQuantile
and SpatialQuantile.ppp
for other quantiles.
Smooth.ppp
for the spatially weighted average.
Examples
X <- longleaf
if(!interactive()) {
## mark values rounded to nearest multiple of 10 to reduce check time
marks(X) <- round(marks(X), -1)
}
Z <- SpatialMedian(X, sigma=30)
ZX <- SpatialMedian(X, sigma=30, at="points")
Spatially Weighted Median or Quantile
Description
Compute a weighted median or weighted quantile of spatial data.
Usage
SpatialMedian(X, ...)
SpatialQuantile(X, prob = 0.5, ...)
Arguments
X |
A spatial data object. |
prob |
Probability for which the quantile is required. A single numeric value between 0 and 1. Default is to calculate the median. |
... |
Further arguments passed to methods. |
Details
The functions SpatialMedian and SpatialQuantile are generic. They calculate spatially weighted medians and quantiles of spatial data. The details depend on the class of X.
There are methods for spatial point patterns (class "ppp") and possibly for other objects.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
See Also
Methods SpatialMedian.ppp
, SpatialQuantile.ppp
.
Smooth
for the spatially weighted average.
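Since these functions are generic, a call on a marked point pattern dispatches to the "ppp" methods documented below; a minimal sketch:
Z <- SpatialMedian(longleaf, sigma=30)
Zq <- SpatialQuantile(longleaf, prob=0.25, sigma=30)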
Spatially Weighted Quantile of Values at Points
Description
Given a spatial point pattern with numeric marks, compute a weighted quantile of the mark values, with spatially-varying weights that depend on distance to the data points.
Usage
## S3 method for class 'ppp'
SpatialQuantile(X, prob = 0.5, sigma = NULL, ...,
type = 1, at = c("pixels", "points"), leaveoneout = TRUE,
weights = NULL, edge = TRUE, diggle = FALSE, verbose = FALSE)
Arguments
X |
A spatial point pattern (object of class |
prob |
Probability for which the quantile is required. A single numeric value between 0 and 1. |
sigma |
Smoothing bandwidth, passed to |
... |
Further arguments passed to |
type |
Integer specifying the type of quantile
(using the convention of |
at |
Character string indicating whether to compute the quantile
at every pixel of a pixel image ( |
leaveoneout |
Logical value indicating whether to compute a leave-one-out
estimator. Applicable only when |
weights |
Optional vector of numeric weights attached to the points of |
edge , diggle |
Arguments passed to |
verbose |
Logical value specifying whether to print progress reports during the calculation. |
Details
The argument X should be a spatial point pattern (object of class "ppp") with numeric marks.
The algorithm computes the weighted quantile of the mark values at each desired spatial location, using spatially-varying weights which depend on distance to the data points.
Suppose the data points are at spatial locations x_1,\ldots,x_n and have mark values y_1,\ldots,y_n. For a query location u, the smoothed quantile is defined as the weighted quantile of the mark values y_1,\ldots,y_n with weights w_1(u),\ldots,w_n(u), where
w_i(u) = \frac{k(u,x_i)}{\sum_{j=1}^n k(u,x_j)}
and k(u,v) is the smoothing kernel with bandwidth sigma.
If at="points" and leaveoneout=TRUE, then a leave-one-out calculation is performed, which means that when the query location is a data point x_i, the value at the data point is ignored, and the weighted quantile is computed from the values y_j for all j not equal to i.
The calculation of the quantile value depends on the argument type, which is interpreted in the same way as for quantile.default. Currently, only types 1 and 4 are implemented. If type=1 (the default), the quantile value is one of the mark values (one of the values in marks(X)). If type=4, the quantile value is obtained by linearly interpolating between mark values. Note that the default values of type in SpatialQuantile.ppp and SpatialMedian.ppp are different.
Value
If X has a single column of marks:
-
If at="pixels" (the default), the result is a pixel image (object of class "im").
-
If at="points", the result is a numeric vector of length equal to the number of points in X.
If X has a data frame of marks:
-
If at="pixels" (the default), the result is a named list of pixel images (object of class "im"). There is one image for each column of marks. This list also belongs to the class "solist", for which there is a plot method.
-
If at="points", the result is a data frame with one row for each point of X, and one column for each column of marks. Entries are values of the interpolated function at the points of X.
The return value has attributes "sigma" and "varcov" which report the smoothing bandwidth that was used.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
See Also
SpatialMedian.ppp
, SpatialMedian
.
Examples
X <- longleaf
if(!interactive()) {
## mark values rounded to nearest multiple of 10 to reduce check time
marks(X) <- round(marks(X), -1)
}
Z <- SpatialQuantile(X, prob=0.25, sigma=30)
ZX <- SpatialQuantile(X, prob=0.25, sigma=30, at="points")
Third order summary statistic
Description
Computes the third order summary statistic T(r)
of a spatial point pattern.
Usage
Tstat(X, ..., r = NULL, rmax = NULL,
correction = c("border", "translate"), ratio = FALSE, verbose=TRUE)
Arguments
X |
The observed point pattern,
from which an estimate of |
... |
Ignored. |
r |
Optional. Vector of values for the argument |
rmax |
Optional. Numeric. The maximum value of |
correction |
Optional. A character vector containing any selection of the
options |
ratio |
Logical.
If |
verbose |
Logical. If |
Details
This command calculates the third-order summary statistic T(r) for a spatial point pattern, as defined by Schladitz and Baddeley (2000).
The definition of T(r) is similar to the definition of Ripley's K function K(r), except that K(r) counts pairs of points while T(r) counts triples of points. Essentially T(r) is a rescaled cumulative distribution function of the diameters of triangles in the point pattern. The diameter of a triangle is the length of its longest side.
Value
An object of class "fv" (see fv.object), which can be plotted directly using plot.fv.
Computation time
If the number of points is large, the algorithm can take a very long time to inspect all possible triangles. A rough estimate of the total computation time will be printed at the beginning of the calculation. If this estimate seems very large, stop the calculation using the user interrupt signal, and call Tstat again, using rmax to restrict the range of r values, thus reducing the number of triangles to be inspected.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
References
Schladitz, K. and Baddeley, A. (2000) A third order point process characteristic. Scandinavian Journal of Statistics 27, 657–671.
See Also
Examples
plot(Tstat(redwood))
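Following the advice under "Computation time", the argument rmax can be used to restrict the range of r values; a sketch (the cutoff is purely illustrative):
plot(Tstat(redwood, rmax=0.1))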
Extract Window of Spatial Object
Description
Given a spatial object (such as a point pattern or pixel image) in two dimensions, these functions extract the window in which the object is defined.
Usage
## S3 method for class 'quadrattest'
Window(X, ...)
Arguments
X |
A spatial object. |
... |
Ignored. |
Details
These are methods for the generic function Window which extract the spatial window in which the object X is defined.
Value
An object of class "owin" (see owin.object) specifying an observation window.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Window
,
Window.ppp
,
Window.psp
.
Examples
A <- quadrat.test(cells, 4)
Window(A)
Subset of spatially sampled function
Description
Extract a subset of the data for a spatially sampled function.
Usage
## S3 method for class 'ssf'
x[i, j, ..., drop]
Arguments
x |
Object of class |
i |
Subset index applying to the locations where the function is sampled. |
j |
Subset index applying to the columns (variables) measured at each location. |
... , drop |
Ignored. |
Details
This is the subset operator for the class "ssf"
.
Value
Another object of class "ssf"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
See Also
Examples
f <- ssf(cells, data.frame(d=nndist(cells), i=1:42))
f
f[1:10,]
f[ ,1]
Adaptive Estimate of Intensity of Point Pattern
Description
Computes an adaptive estimate of the intensity function of a point pattern.
Usage
adaptive.density(X, ..., method=c("voronoi","kernel", "nearest"))
Arguments
X |
Point pattern (object of class |
method |
Character string specifying the estimation method |
... |
Additional arguments passed to
|
Details
This function is an alternative to density.ppp and density.lpp. It computes an estimate of the intensity function of a point pattern dataset. The result is a pixel image giving the estimated intensity.
If method="voronoi" the data are passed to the function densityVoronoi which estimates the intensity using the Voronoi-Dirichlet tessellation.
If method="kernel" the data are passed to the function densityAdaptiveKernel.ppp which estimates the intensity using a variable-bandwidth kernel estimator. (This is not yet supported when X has class "lpp".)
If method="nearest" the data are passed to the function nndensity.ppp which estimates the intensity using the distance to the k-th nearest data point. (This is not yet supported when X has class "lpp".)
Value
A pixel image (object of class "im" or "linim") whose values are estimates of the intensity of X.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk and Mehdi Moradi m2.moradi@yahoo.com.
See Also
density.ppp
,
densityVoronoi
,
densityAdaptiveKernel.ppp
,
nndensity.ppp
,
im.object
.
Examples
plot(adaptive.density(nztrees, 1), main="Voronoi estimate")
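The other estimation methods described in Details can be selected explicitly; a minimal sketch:
plot(adaptive.density(nztrees, method="kernel"), main="Adaptive kernel estimate")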
Calculate four standard summary functions of a point pattern.
Description
Calculates the F, G, J, and K summary functions for an unmarked point pattern. Returns them as a function array (of class "fasp", see fasp.object).
Usage
allstats(pp, ..., dataname=NULL, verb=FALSE)
Arguments
pp |
The observed point pattern, for which summary function
estimates are required. An object of class |
... |
Optional arguments passed to the summary functions
|
dataname |
A character string giving an optional (alternative) name for the point pattern. |
verb |
A logical value meaning “verbose”. If |
Details
This computes four standard summary statistics for a point pattern: the empty space function F(r), nearest neighbour distance distribution function G(r), van Lieshout-Baddeley function J(r) and Ripley's function K(r). The real work is done by Fest, Gest, Jest and Kest respectively. Consult the help files for these functions for further information about the statistical interpretation of F, G, J and K.
If verb is TRUE, then “progress reports” (just indications of completion) are printed out when the calculations are finished for each of the four function types.
The overall title of the array of four functions (for plotting by plot.fasp) will be formed from the argument dataname. If this is not given, it defaults to the expression for pp given in the call to allstats.
Value
A list of length 4 containing the F, G, J and K functions respectively.
The list can be plotted directly using plot (which dispatches to plot.anylist).
Each list entry retains the format of the output of the relevant estimating routine Fest, Gest, Jest or Kest. Thus each entry in the list is a function value table (object of class "fv", see fv.object).
The default formulae for plotting these functions are cbind(km,theo) ~ r for F, G, and J, and cbind(trans,theo) ~ r for K.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
plot.anylist
,
plot.fv
,
fv.object
,
Fest
,
Gest
,
Jest
,
Kest
Examples
a <- allstats(swedishpines,dataname="Swedish Pines")
if(interactive()) {
plot(a)
plot(a, subset=list("r<=15","r<=15","r<=15","r<=50"))
}
Calculate Summary Statistic for All Types in a Multitype Point Pattern
Description
Given a marked point pattern, this computes estimates of a selected summary function (F, G, J, K etc.) of the pattern, for all possible combinations of marks, and returns these functions in an array.
Usage
alltypes(X, fun="K", ...,
dataname=NULL,verb=FALSE,envelope=FALSE,reuse=TRUE)
Arguments
X |
The observed point pattern, for which summary function
estimates are required. An object of class |
fun |
The summary function. Either an R function,
or a character string indicating the summary function
required. Options for strings are
|
... |
Arguments passed to the summary function
(and to the function |
dataname |
Character string giving an optional (alternative)
name to the point pattern, different from what is given
in the call. This name, if supplied, may be used by
|
verb |
Logical value. If |
envelope |
Logical value. If |
reuse |
Logical value indicating whether the envelopes in each panel
should be based on the same set of simulated patterns
( |
Details
This routine is a convenient way to analyse the dependence between types in a multitype point pattern. It computes the estimates of a selected summary function of the pattern, for all possible combinations of marks. It returns these functions in an array (an object of class "fasp") amenable to plotting by plot.fasp().
The argument fun specifies the summary function that will be evaluated for each type of point, or for each pair of types. It may be either an R function or a character string.
Suppose that the points have possible types 1,2,\ldots,m and let X_i denote the pattern of points of type i only.
If fun="F" then this routine calculates, for each possible type i, an estimate of the Empty Space Function F_i(r) of X_i. See Fest for explanation of the empty space function. The estimate is computed by applying Fest to X_i with the optional arguments ....
If fun is "Gcross", "Jcross", "Kcross" or "Lcross", the routine calculates, for each pair of types (i,j), an estimate of the “i-to-j” cross-type function G_{ij}(r), J_{ij}(r), K_{ij}(r) or L_{ij}(r) respectively, describing the dependence between X_i and X_j. See Gcross, Jcross, Kcross or Lcross respectively for explanation of these functions. The estimate is computed by applying the relevant function (Gcross etc.) to X using each possible value of the arguments i,j, together with the optional arguments ....
If fun is "pcf" the routine calculates the cross-type pair correlation function pcfcross between each pair of types.
If fun is "Gdot", "Jdot", "Kdot" or "Ldot", the routine calculates, for each type i, an estimate of the “i-to-any” dot-type function G_{i\bullet}(r), J_{i\bullet}(r), K_{i\bullet}(r) or L_{i\bullet}(r) respectively, describing the dependence between X_i and X. See Gdot, Jdot, Kdot or Ldot respectively for explanation of these functions. The estimate is computed by applying the relevant function (Gdot etc.) to X using each possible value of the argument i, together with the optional arguments ....
The letters "G", "J", "K" and "L" are interpreted as abbreviations for Gcross, Jcross, Kcross and Lcross respectively, assuming the point pattern is marked. If the point pattern is unmarked, the appropriate function Gest, Jest, Kest or Lest is invoked instead.
If envelope=TRUE, then as well as computing the value of the summary function for each combination of types, the algorithm also computes simulation envelopes of the summary function for each combination of types. The arguments ... are passed to the function envelope to control the number of simulations, the random process generating the simulations, the construction of envelopes, and so on.
When envelope=TRUE it is possible that errors could occur because the simulated point patterns do not satisfy the requirements of the summary function (for example, because the simulated pattern is empty and fun requires at least one point). If the number of such errors exceeds the maximum permitted number maxnerr, then the envelope algorithm will give up, and will return the empirical summary function for the data point pattern, fun(X), in place of the envelope.
Value
A function array (an object of class "fasp", see fasp.object). This can be plotted using plot.fasp.
If the pattern is not marked, the resulting “array” has dimensions 1 \times 1. Otherwise the following is true:
If fun="F", the function array has dimensions m \times 1 where m is the number of different marks in the point pattern. The entry at position [i,1] in this array is the result of applying Fest to the points of type i only.
If fun is "Gdot", "Jdot", "Kdot" or "Ldot", the function array again has dimensions m \times 1. The entry at position [i,1] in this array is the result of Gdot(X, i), Jdot(X, i), Kdot(X, i) or Ldot(X, i) respectively.
If fun is "Gcross", "Jcross", "Kcross" or "Lcross" (or their abbreviations "G", "J", "K" or "L"), the function array has dimensions m \times m. The [i,j] entry of the function array (for i \neq j) is the result of applying the function Gcross, Jcross, Kcross or Lcross to the pair of types (i,j). The diagonal [i,i] entry of the function array is the result of applying the univariate function Gest, Jest, Kest or Lest to the points of type i only.
If envelope=FALSE, then each function entry fns[[i]] retains the format of the output of the relevant estimating routine Fest, Gest, Jest, Kest, Lest, Gcross, Jcross, Kcross, Lcross, Gdot, Jdot, Kdot or Ldot.
The default formulae for plotting these functions are cbind(km,theo) ~ r for F, G, and J functions, and cbind(trans,theo) ~ r for K and L functions.
If envelope=TRUE, then each function entry fns[[i]] has the same format as the output of the envelope command.
Note
Sizeable amounts of memory may be needed during the calculation.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net.
See Also
plot.fasp
,
fasp.object
,
Fest
,
Gest
,
Jest
,
Kest
,
Lest
,
Gcross
,
Jcross
,
Kcross
,
Lcross
,
Gdot
,
Jdot
,
Kdot
,
envelope
.
Examples
# bramblecanes (3 marks).
bram <- bramblecanes
bF <- alltypes(bram,"F",verb=TRUE)
plot(bF)
if(interactive()) {
plot(alltypes(bram,"G"))
plot(alltypes(bram,"Gdot"))
}
# Swedishpines (unmarked).
swed <- swedishpines
plot(alltypes(swed,"K"))
plot(alltypes(amacrine, "pcf"), ylim=c(0,1.3))
# envelopes
bKE <- alltypes(bram,"K",envelope=TRUE,nsim=19)
# global version:
bFE <- alltypes(bram,"F",envelope=TRUE,nsim=19,global=TRUE)
# extract one entry
as.fv(bKE[1,1])
Coerce Envelope to Data Frame
Description
Converts an envelope object to a data frame.
Usage
## S3 method for class 'envelope'
as.data.frame(x, ..., simfuns=FALSE)
Arguments
x |
Envelope object (class |
... |
Ignored. |
simfuns |
Logical value indicating whether the result should include the values of the simulated functions that were used to build the envelope. |
Details
This is a method for the generic function as.data.frame for the class of envelopes (see envelope).
The result is a data frame with columns containing the values of the function argument (usually named r), the function estimate for the original point pattern data (obs), the upper and lower envelope limits (hi and lo), and possibly additional columns.
If simfuns=TRUE, the result also includes columns of values of the simulated functions that were used to compute the envelope. This is possible only when the envelope was computed with the argument savefuns=TRUE in the call to envelope.
Value
A data frame.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
Examples
E <- envelope(cells, nsim=5, savefuns=TRUE)
tail(as.data.frame(E))
tail(as.data.frame(E, simfuns=TRUE))
Convert Function Value Table to Function
Description
Converts an object of class "fv"
to an R language function.
Usage
## S3 method for class 'fv'
as.function(x, ..., value=".y", extrapolate=FALSE)
Arguments
x |
Object of class |
... |
Ignored. |
value |
Optional. Character string or character vector selecting
one or more of the columns of |
extrapolate |
Logical, indicating whether to extrapolate the function
outside the domain of |
Details
A function value table (object of class "fv") is a convenient way of storing and plotting several different estimates of the same function. Objects of this class are returned by many commands in spatstat, such as Kest, which returns an estimate of Ripley's K-function for a point pattern dataset.
Sometimes it is useful to convert the function value table to a function in the R language. This is done by as.function.fv. It converts an object x of class "fv" to an R function f.
If f <- as.function(x) then f is an R function that accepts a numeric argument and returns a corresponding value for the summary function by linear interpolation between the values in the table x.
Argument values lying outside the range of the table yield an NA value (if extrapolate=FALSE) or the function value at the nearest endpoint of the range (if extrapolate=TRUE). To apply different rules to the left and right extremes, use extrapolate=c(TRUE,FALSE) and so on.
Typically the table x contains several columns of function values corresponding to different edge corrections. Auxiliary information for the table identifies one of these columns as the recommended value. By default, the values of the function f <- as.function(x) are taken from this column of recommended values. This default can be changed using the argument value, which can be a character string or character vector of names of columns of x. Alternatively value can be one of the abbreviations used by fvnames.
If value specifies a single column of the table, then the result is a function f(r) with a single numeric argument r (with the same name as the original argument of the function table). If value specifies several columns of the table, then the result is a function f(r,what) where r is the numeric argument and what is a character string identifying the column of values to be used. The formal arguments of the resulting function are f(r, what=value), which means that in a call to this function f, the permissible values of what are the entries of the original vector value; the default value of what is the first entry of value.
The command as.function.fv is a method for the generic command as.function.
Value
A function(r) or function(r,what), where r is the name of the original argument of the function table.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
as.function.rhohat
,
fv
,
fv.object
,
fvnames
,
plot.fv
,
Kest
Examples
K <- Kest(cells)
f <- as.function(K)
f
f(0.1)
g <- as.function(K, value=c("iso", "trans"))
g
g(0.1, "trans")
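A further sketch of the extrapolate argument described in Details: argument values outside the range of the table then return the value at the nearest endpoint instead of NA.
fe <- as.function(K, extrapolate=TRUE)
fe(2 * max(K$r))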
Convert Function Table to Function
Description
Converts an object of class "rhohat"
to an R language function.
Usage
## S3 method for class 'rhohat'
as.function(x, ..., value=".y", extrapolate=TRUE)
Arguments
x |
Object of class |
... |
Ignored. |
value |
Optional. Character string or character vector selecting
one or more of the columns of |
extrapolate |
Logical, indicating whether to extrapolate the function
outside the domain of |
Details
An object of class "rhohat" is essentially a data frame of estimated values of the function rho(x), as described in the help file for rhohat.
Sometimes it is useful to convert the function value table to a function in the R language. This is done by as.function.rhohat. It converts an object x of class "rhohat" to an R function f. The command as.function.rhohat is a method for the generic command as.function for the class "rhohat".
If f <- as.function(x) then f is an R function that accepts a numeric argument and returns a corresponding value for the summary function by linear interpolation between the values in the table x.
Argument values lying outside the range of the table yield an NA value (if extrapolate=FALSE) or the function value at the nearest endpoint of the range (if extrapolate=TRUE). To apply different rules to the left and right extremes, use extrapolate=c(TRUE,FALSE) and so on.
Typically the table x contains several columns of function values corresponding to different edge corrections. Auxiliary information for the table identifies one of these columns as the recommended value. By default, the values of the function f <- as.function(x) are taken from this column of recommended values. This default can be changed using the argument value, which can be a character string or character vector of names of columns of x. Alternatively value can be one of the abbreviations used by fvnames.
If value specifies a single column of the table, then the result is a function f(r) with a single numeric argument r (with the same name as the original argument of the function table). If value specifies several columns of the table, then the result is a function f(r,what) where r is the numeric argument and what is a character string identifying the column of values to be used. The formal arguments of the resulting function are f(r, what=value), which means that in a call to this function f, the permissible values of what are the entries of the original vector value; the default value of what is the first entry of value.
Value
A function(r) or function(r,what), where r is the name of the original argument of the function table.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
rhohat
,
methods.rhohat
,
as.function.fv
.
Examples
g <- rhohat(cells, "x")
f <- as.function(g)
f
f(0.1)
Convert Data To Class fv
Description
Converts data into a function table (an object of class "fv"
).
Usage
as.fv(x)
## S3 method for class 'fv'
as.fv(x)
## S3 method for class 'data.frame'
as.fv(x)
## S3 method for class 'matrix'
as.fv(x)
## S3 method for class 'fasp'
as.fv(x)
## S3 method for class 'bw.optim'
as.fv(x)
Arguments
x |
Data which will be converted into a function table |
Details
This command converts data x, that could be interpreted as the values of a function, into a function value table (object of the class "fv" as described in fv.object). This object can then be plotted easily using plot.fv.
The dataset x may be any of the following:
-
an object of class
"fv"
; -
a matrix or data frame with at least two columns;
-
an object of class
"fasp"
, representing an array of"fv"
objects. -
an object of class
"minconfit"
, giving the results of a minimum contrast fit by the commandmincontrast
.
an object of class
"kppm"
, representing a fitted Cox or cluster point process model, obtained from the model-fitting commandkppm
; -
an object of class
"dppm"
, representing a fitted determinantal point process model, obtained from the model-fitting commanddppm
; -
an object of class
"bw.optim"
, representing an optimal choice of smoothing bandwidth by a cross-validation method, obtained from commands likebw.diggle
.
The function as.fv
is generic, with methods for each of the
classes listed above. The behaviour is as follows:
-
If
x
is an object of class"fv"
, it is returned unchanged. -
If
x
is a matrix or data frame, the first column is interpreted as the function argument, and subsequent columns are interpreted as values of the function computed by different methods. -
If
x
is an object of class"fasp"
representing an array of"fv"
objects, these are combined into a single"fv"
object. -
If
x
is an object of class"minconfit"
, or an object of class"kppm"
or"dppm"
, the result is a function table containing the observed summary function and the best fit summary function. -
If
x
is an object of class"bw.optim"
, the result is a function table of the optimisation criterion as a function of the smoothing bandwidth.
Value
An object of class "fv" (see fv.object).
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk
Examples
r <- seq(0, 1, length=101)
x <- data.frame(r=r, y=r^2)
as.fv(x)
Convert Data To Class owin
Description
Converts data specifying an observation window
in any of several formats, into an object of class "owin"
.
Usage
## S3 method for class 'quadrattest'
as.owin(W, ..., fatal=TRUE)
Arguments
W |
Data specifying an observation window, in any of several formats described under Details below. |
fatal |
Logical value determining what to do if the data cannot be converted to an observation window. See Details. |
... |
Ignored. |
Details
The class "owin" is a way of specifying the observation window for a point pattern. See owin.object for an overview.
The generic function as.owin converts data in any of several formats into an object of class "owin" for use by the spatstat package. The function as.owin is generic, with methods for different classes of objects, and a default method.
The argument W may be
-
an object of class
"owin"
-
a structure with entries
xrange
,yrange
specifying thex
andy
dimensions of a rectangle -
a structure with entries named
xmin
,xmax
,ymin
,ymax
(in any order) specifying thex
andy
dimensions of a rectangle. This will accept objects of classbbox
in thesf
package. -
a numeric vector of length 4 (interpreted as
(xmin, xmax, ymin, ymax)
in that order) specifying thex
andy
dimensions of a rectangle -
a structure with entries named
xl
,xu
,yl
,yu
(in any order) specifying thex
andy
dimensions of a rectangle as(xmin, xmax) = (xl, xu)
and(ymin, ymax) = (yl, yu)
. This will accept objects of classspp
used in the Venables and Ripley spatial package. -
an object of class
"ppp"
representing a point pattern. In this case, the object'swindow
structure will be extracted. -
an object of class
"psp"
representing a line segment pattern. In this case, the object'swindow
structure will be extracted. -
an object of class
"tess"
representing a tessellation. In this case, the object'swindow
structure will be extracted. -
an object of class
"quad"
representing a quadrature scheme. In this case, the window of thedata
component will be extracted. -
an object of class
"im"
representing a pixel image. In this case, a window of type"mask"
will be returned, with the same pixel raster coordinates as the image. An image pixel value ofNA
, signifying that the pixel lies outside the window, is transformed into the logical valueFALSE
, which is the corresponding convention for window masks. -
an object of class
"ppm"
,"kppm"
,"slrm"
or"dppm"
representing a fitted point process model. In this case, iffrom="data"
(the default),as.owin
extracts the original point pattern data to which the model was fitted, and returns the observation window of this point pattern. Iffrom="covariates"
thenas.owin
extracts the covariate images to which the model was fitted, and returns a binary mask window that specifies the pixel locations. -
an object of class
"lpp"
representing a point pattern on a linear network. In this case,as.owin
extracts the linear network and returns a window containing this network. -
an object of class
"lppm"
representing a fitted point process model on a linear network. In this case,as.owin
extracts the linear network and returns a window containing this network. -
A
data.frame
with exactly three columns. Each row of the data frame corresponds to one pixel. Each row contains thex
andy
coordinates of a pixel, and a logical value indicating whether the pixel lies inside the window. -
A
data.frame
with exactly two columns. Each row of the data frame contains thex
andy
coordinates of a pixel that lies inside the window. -
an object of class
"distfun"
,"nnfun"
or"funxy"
representing a function of spatial location, defined on a spatial domain. The spatial domain of the function will be extracted. -
an object of class
"rmhmodel"
representing a point process model that can be simulated usingrmh
. The window (spatial domain) of the model will be extracted. The window may beNULL
in some circumstances (indicating that the simulation window has not yet been determined). This is not treated as an error, because the argumentfatal
defaults toFALSE
for this method. -
an object of class
"layered"
representing a list of spatial objects. Seelayered
. In this case,as.owin
will be applied to each of the objects in the list, and the union of these windows will be returned. -
An object of another suitable class from another package. For full details, see
vignette('shapefiles')
.
If the argument W is not in one of these formats and cannot be converted to a window, then an error will be generated (if fatal=TRUE) or a value of NULL will be returned (if fatal=FALSE).
When W is a data frame, the argument step can be used to specify the pixel grid spacing; otherwise, the spacing will be guessed from the data.
Value
An object of class "owin" (see owin.object) specifying an observation window.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
as.owin
,
as.owin.rmhmodel
,
as.owin.lpp
.
Additional methods for as.owin
may be provided
by other packages outside the spatstat family.
Examples
te <- quadrat.test(redwood, nx=3)
as.owin(te)
Convert Data To Tessellation
Description
Converts data specifying a tessellation,
in any of several formats, into an object of class "tess"
.
Usage
## S3 method for class 'quadrattest'
as.tess(X)
Arguments
X |
Data to be converted to a tessellation. |
Details
A tessellation is a collection of disjoint spatial regions (called tiles) that fit together to form a larger spatial region. This command creates an object of class "tess" that represents a tessellation.
This function converts data in any of several formats into an object of class "tess" for use by the spatstat package. The argument X may be
-
an object of class
"tess"
. The object will be stripped of any extraneous attributes and returned. -
a pixel image (object of class
"im"
) with pixel values that are logical or factor values. Each level of the factor will determine a tile of the tessellation. -
a window (object of class
"owin"
). The result will be a tessellation consisting of a single tile. -
a set of quadrat counts (object of class
"quadratcount"
) returned by the commandquadratcount
. The quadrats used to generate the counts will be extracted and returned as a tessellation. -
a quadrat test (object of class
"quadrattest"
) returned by the commandquadrat.test
. The quadrats used to perform the test will be extracted and returned as a tessellation. -
a list of windows (objects of class
"owin"
) giving the tiles of the tessellation.
The function as.tess
is generic, with methods for
various classes, as listed above.
Value
An object of class "tess"
specifying a tessellation.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Examples
h <- quadrat.test(nztrees, nx=4, ny=3)
as.tess(h)
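A further sketch of the factor-image conversion described above, assuming cut.im (from spatstat.geom) is available to band a numeric image into factor levels:
# tessellation from a factor-valued pixel image
D <- distmap(cells)
B <- cut(D, breaks=3)   # three distance bands
as.tess(B)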
Area Under ROC Curve for Point Pattern Data
Description
Compute the AUC (area under the Receiver Operating Characteristic curve) for a point pattern or other data.
Usage
auc(X, ...)
## S3 method for class 'ppp'
auc(X, covariate, ..., high = TRUE, subset=NULL)
## S3 method for class 'roc'
auc(X, ...)
## S3 method for class 'cdftest'
auc(X, ..., high=TRUE)
## S3 method for class 'bermantest'
auc(X, ..., high=TRUE)
## S3 method for class 'im'
auc(X, covariate, ..., high=TRUE)
Arguments
X |
Point pattern (object of class |
covariate |
Spatial covariate. Either a |
... |
Arguments passed to |
high |
Logical value indicating whether the threshold operation should favour high or low values of the covariate. |
subset |
Optional. A spatial window (object of class |
Details
This command computes the AUC, the area under the Receiver Operating
Characteristic curve. The ROC itself is computed by roc
.
The function auc
is generic. There are methods for
point patterns, fitted point process models, and many other kinds of
objects.
For a point pattern X
and a covariate Z
, the
AUC is a numerical index that measures the ability of the
covariate to separate the spatial domain
into areas of high and low density of points.
Let x_i
be a randomly-chosen data point from X
and U
a randomly-selected location in the study region.
The AUC is the probability that
Z(x_i) > Z(U)
assuming high=TRUE
.
That is, AUC is the probability that a randomly-selected data point
has a higher value of the covariate Z
than does a
randomly-selected spatial location. The AUC is a number between 0 and 1.
A value of 0.5 indicates a complete lack of discriminatory power.
Methods for calculating AUC for a point process model or
spatial logistic regression model are described in
auc.ppm
and auc.lpp
.
Some other kinds of objects in spatstat contain sufficient data to
compute the AUC. These include the objects returned by
rhohat
,
cdf.test
and berman.test
. Methods are
provided here to compute the AUC from these objects.
Value
Numeric.
For auc.ppp
, auc.cdftest
, auc.bermantest
and auc.im
, the result is a single number
giving the AUC value.
For auc.roc
, the result is a numeric vector with one entry
for each column of function values of X
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Ege Rubak rubak@math.aau.dk and Suman Rakshit Suman.Rakshit@curtin.edu.au.
References
Baddeley, A., Rubak, E., Rakshit, S. and Nair, G. (2025) ROC curves for spatial point patterns and presence-absence data. doi:10.48550/arXiv.2506.03414.
Lobo, J.M., Jimenez-Valverde, A. and Real, R. (2007) AUC: a misleading measure of the performance of predictive distribution models. Global Ecology and Biogeography 17(2) 145–151.
Nam, B.-H. and D'Agostino, R. (2002) Discrimination index, the area under the ROC curve. Pages 267–279 in Huber-Carol, C., Balakrishnan, N., Nikulin, M.S. and Mesbah, M., Goodness-of-fit tests and model validity, Birkhauser, Basel.
See Also
Examples
auc(swedishpines, "x")
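Two further sketches: recovering the AUC from a precomputed ROC object, and using a pixel image covariate (both input forms are described above):
r <- roc(swedishpines, "x")
auc(r)
D <- distmap(swedishpines)
auc(swedishpines, D)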
Berman's Tests for Point Process Model
Description
Tests the goodness-of-fit of a Poisson point process model using methods of Berman (1986).
Usage
berman.test(...)
## S3 method for class 'ppp'
berman.test(X, covariate,
which = c("Z1", "Z2"),
alternative = c("two.sided", "less", "greater"), ...)
Arguments
X |
A point pattern (object of class |
covariate |
The spatial covariate on which the test will be based.
An image (object of class |
which |
Character string specifying the choice of test. |
alternative |
Character string specifying the alternative hypothesis. |
... |
Additional arguments controlling the pixel resolution
(arguments |
Details
These functions perform a goodness-of-fit test of a Poisson point
process model fitted to point pattern data. The observed distribution
of the values of a spatial covariate at the data points,
and the predicted distribution of the same values under the model,
are compared using either of two test statistics
Z_1
and Z_2
proposed by Berman (1986).
The Z_1
test is also known as the
Lawson-Waller test.
The function berman.test
is generic, with methods for
point patterns ("ppp"
or "lpp"
)
and point process models ("ppm"
or "lppm"
).
-
If
X
is a point pattern dataset (object of class "ppp"
or "lpp"
), then berman.test(X, ...)
performs a goodness-of-fit test of the uniform Poisson point process (Complete Spatial Randomness, CSR) for this dataset. -
If
model
is a fitted point process model (object of class "ppm"
or "lppm"
), then berman.test(model, ...)
performs a test of goodness-of-fit for this fitted model. In this case, model
should be a Poisson point process.
The test is performed by comparing the observed distribution of the values of a spatial covariate at the data points, and the predicted distribution of the same covariate under the model. Thus, you must nominate a spatial covariate for this test.
The argument covariate
should be either a function(x,y)
or a pixel image (object of class "im")
containing the values
of a spatial function.
If covariate
is an image, it should have numeric values,
and its domain should cover the observation window of the
model
. If covariate
is a function, it should expect
two arguments x
and y
which are vectors of coordinates,
and it should return a numeric vector of the same length
as x
and y
.
First the original data point pattern is extracted from model
.
The values of the covariate
at these data points are
collected.
Next the values of the covariate
at all locations in the
observation window are evaluated. The point process intensity
of the fitted model is also evaluated at all locations in the window.
If
which="Z1"
, the test statistic Z_1
is computed as follows. The sum S
of the covariate values at all data points is evaluated. The predicted mean \mu
and variance \sigma^2
of S
are computed from the values of the covariate at all locations in the window. Then we compute Z_1 = (S-\mu)/\sigma
. Closely-related tests were proposed independently by Waller et al (1992) and Lawson (1993), so this test is often termed the Lawson-Waller test in the epidemiological literature. If
which="Z2"
, the test statistic Z_2
is computed as follows. The values of the covariate
at all locations in the observation window, weighted by the point process intensity, are compiled into a cumulative distribution function F
. The probability integral transformation is then applied: the values of the covariate
at the original data points are transformed by the predicted cumulative distribution function F
into numbers between 0 and 1. If the model is correct, these numbers are i.i.d. uniform random numbers. The standardised sample mean of these numbers is the statistic Z_2
.
In both cases the null distribution of the test statistic is the standard normal distribution, approximately.
The return value is an object of class "htest"
containing the
results of the hypothesis test. The print method for this class
gives an informative summary of the test outcome.
Value
An object of class "htest"
(hypothesis test)
and also of class "bermantest"
,
containing the results of the test. The return value can be
plotted (by plot.bermantest
) or printed
to give an informative summary of the test.
Warning
The meaning of a one-sided test must be carefully scrutinised: see the printed output.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Berman, M. (1986) Testing for spatial association between a point process and another stochastic process. Applied Statistics 35, 54–62.
Lawson, A.B. (1993) On the analysis of mortality events around a prespecified fixed point. Journal of the Royal Statistical Society, Series A 156 (3) 363–377.
Waller, L., Turnbull, B., Clark, L.C. and Nasca, P. (1992) Chronic Disease Surveillance and testing of clustering of disease and exposure: Application to leukaemia incidence and TCE-contaminated dumpsites in upstate New York. Environmetrics 3, 281–300.
See Also
Examples
# Berman's data
X <- copper$SouthPoints
L <- copper$SouthLines
D <- distmap(L, eps=1)
# test of CSR
berman.test(X, D)
berman.test(X, D, "Z2")
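A further sketch using the function form of the covariate and a one-sided alternative, both described above:
berman.test(X, function(x,y) { x })
berman.test(X, D, "Z2", alternative="less")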
Combine Function Value Tables
Description
Advanced Use Only.
Combine objects of class "fv"
,
or glue extra columns of data onto an existing "fv"
object.
Usage
## S3 method for class 'fv'
cbind(...)
bind.fv(x, y, labl = NULL, desc = NULL, preferred = NULL, clip=FALSE)
Arguments
... |
Any number of arguments, which are objects of class |
x |
An object of class |
y |
Either an object of class |
labl |
Plot labels (see |
desc |
Descriptions (see |
preferred |
Character string specifying the column which is to be the new recommended value of the function. |
clip |
Logical value indicating whether each object must have exactly the
same domain, that is, the same sequence of values of the function argument
( |
Details
This documentation is provided for experienced programmers who want to modify the internal behaviour of spatstat.
The function cbind.fv
is a method for the generic
R function cbind
. It combines any number of
objects of class "fv"
into a single object of
class "fv"
. The objects must be compatible, in the sense
that they have identical values of the function argument.
The function bind.fv
is a lower level
utility which glues additional columns onto an
existing object x
of class "fv"
.
It has three modes of use:
-
If the additional dataset
y
is an object of class "fv"
, then x
and y
must be compatible as described above. Then the columns of y
that contain function values will be appended to the object x
. -
Alternatively if
y
is a data frame, then y
must have the same number of rows as x
. All columns of y
will be appended to x
. -
Alternatively if
y
is a function in the R language, then this function will be evaluated at the argument values stored in the object x
, and these function values will be appended as a new column to x
.
The arguments labl
and desc
provide
plot labels and description strings (as described in fv
)
for the new columns. If y
is an object of class
"fv"
then labl
and desc
are optional, and
default to the relevant entries in the object y
.
If y
is a data frame then
labl
and desc
should be provided, but there is a
default.
For additional flexibility, cbind.fv
also accepts arguments
which are data frames or functions.
Value
An object of class "fv"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
fv
for creating objects of class "fv"
from raw data.
collapse.fv
for combining several "fv"
objects
with similar columns.
with.fv
for evaluating expressions.
fvnames
for extracting and assigning the column names
of standard components of "fv"
objects.
Undocumented functions for modifying an "fv"
object
include tweak.fv.entry
and rebadge.fv
.
Examples
K1 <- Kest(cells, correction="border")
K2 <- Kest(cells, correction="iso")
# remove column 'theo' to avoid duplication
K2 <- K2[, names(K2) != "theo"]
cbind(K1, K2)
bind.fv(K1, K2, preferred="iso")
# constrain border estimate to be monotonically increasing
bm <- cumsum(c(0, pmax(0, diff(K1$border))))
bind.fv(K1, data.frame(bmono=bm),
"%s[bmo](r)",
"monotone border-corrected estimate of %s",
"bmono")
# add a column of values defined by a function
cbind(K1, upper=function(r) { pi * r^2 + 0.1 })
Global Envelopes for Balanced Independent Two-Stage Test
Description
Computes the global envelopes corresponding to the balanced independent two-stage Monte Carlo test of goodness-of-fit.
Usage
bits.envelope(X, ...,
nsim = 19, nrank = 1,
alternative=c("two.sided", "less", "greater"),
leaveout=1, interpolate = FALSE,
savefuns=FALSE, savepatterns=FALSE,
verbose = TRUE)
Arguments
X |
Either a point pattern dataset (object of class |
... |
Arguments passed to
|
nsim |
Number of simulated patterns to be generated in each stage.
Number of simulations in each basic test. There will be |
nrank |
Integer. Rank of the envelope value amongst the |
alternative |
Character string determining whether the envelope corresponds
to a two-sided test ( |
leaveout |
Optional integer 0, 1 or 2 indicating how to calculate the deviation between the observed summary function and the nominal reference value, when the reference value must be estimated by simulation. See Details. |
interpolate |
Logical value indicating whether to interpolate the distribution of the test statistic by kernel smoothing, as described in Dao and Genton (2014, Section 5). |
savefuns |
Logical flag indicating whether to save the simulated function values (from the first stage). |
savepatterns |
Logical flag indicating whether to save the simulated point patterns (from the first stage). |
verbose |
Logical value determining whether to print progress reports. |
Details
Computes global simulation envelopes corresponding to the balanced independent two-stage Monte Carlo test of goodness-of-fit described by Baddeley et al (2017). The envelopes are described in Baddeley et al (2019).
If X
is a point pattern, the null hypothesis is CSR.
If X
is a fitted model, the null hypothesis is that model.
This command is similar to dg.envelope
which corresponds
to the Dao-Genton test of goodness-of-fit.
It was shown in Baddeley et al (2017) that
the Dao-Genton test is biased when the significance level is very small
(small p
-values are not reliable) and
we recommend bits.envelope
in this case.
Value
An object of class "fv"
.
Author(s)
Adrian Baddeley, Andrew Hardegen, Tom Lawrence, Robin Milne, Gopalan Nair and Suman Rakshit. Implemented by Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Dao, N.A. and Genton, M. (2014) A Monte Carlo adjusted goodness-of-fit test for parametric models describing spatial point patterns. Journal of Graphical and Computational Statistics 23, 497–517.
Baddeley, A., Hardegen, A., Lawrence, T., Milne, R.K., Nair, G. and Rakshit, S. (2017) On two-stage Monte Carlo tests of composite hypotheses. Computational Statistics and Data Analysis 114, 75–87.
Baddeley, A., Hardegen, A., Lawrence, L., Milne, R.K., Nair, G.M. and Rakshit, S. (2019) Pushing the envelope: extensions of graphical Monte Carlo tests. In preparation.
See Also
dg.envelope
,
bits.test
,
mad.test
,
envelope
Examples
ns <- if(interactive()) 19 else 4
E <- bits.envelope(swedishpines, Lest, nsim=ns)
E
plot(E)
Eo <- bits.envelope(swedishpines, Lest, alternative="less", nsim=ns)
Ei <- bits.envelope(swedishpines, Lest, interpolate=TRUE, nsim=ns)
Balanced Independent Two-Stage Monte Carlo Test
Description
Performs a Balanced Independent Two-Stage Monte Carlo test of goodness-of-fit for spatial pattern.
Usage
bits.test(X, ...,
exponent = 2, nsim=19,
alternative=c("two.sided", "less", "greater"),
leaveout=1, interpolate = FALSE,
savefuns=FALSE, savepatterns=FALSE,
verbose = TRUE)
Arguments
X |
Either a point pattern dataset (object of class |
... |
Arguments passed to |
exponent |
Exponent used in the test statistic. Use |
nsim |
Number of replicates in each stage of the test.
A total of |
alternative |
Character string specifying the alternative hypothesis.
The default ( |
leaveout |
Optional integer 0, 1 or 2 indicating how to calculate the deviation between the observed summary function and the nominal reference value, when the reference value must be estimated by simulation. See Details. |
interpolate |
Logical value indicating whether to interpolate the distribution of the test statistic by kernel smoothing, as described in Dao and Genton (2014, Section 5). |
savefuns |
Logical flag indicating whether to save the simulated function values (from the first stage). |
savepatterns |
Logical flag indicating whether to save the simulated point patterns (from the first stage). |
verbose |
Logical value indicating whether to print progress reports. |
Details
Performs the Balanced Independent Two-Stage Monte Carlo test proposed by Baddeley et al (2017), an improvement of the Dao-Genton (2014) test.
If X
is a point pattern, the null hypothesis is CSR.
If X
is a fitted model, the null hypothesis is that model.
The argument use.theory
passed to envelope
determines whether to compare the summary function for the data
to its theoretical value for CSR (use.theory=TRUE
)
or to the sample mean of simulations from CSR
(use.theory=FALSE
).
The argument leaveout
specifies how to calculate the
discrepancy between the summary function for the data and the
nominal reference value, when the reference value must be estimated
by simulation. The values leaveout=0
and
leaveout=1
are both algebraically equivalent (Baddeley et al, 2014,
Appendix) to computing the difference observed - reference
where the reference
is the mean of simulated values.
The value leaveout=2
gives the leave-two-out discrepancy
proposed by Dao and Genton (2014).
Value
A hypothesis test (object of class "htest")
which can be printed to show the outcome of the test.
Author(s)
Adrian Baddeley, Andrew Hardegen, Tom Lawrence, Robin Milne, Gopalan Nair and Suman Rakshit. Implemented by Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Dao, N.A. and Genton, M. (2014) A Monte Carlo adjusted goodness-of-fit test for parametric models describing spatial point patterns. Journal of Graphical and Computational Statistics 23, 497–517.
Baddeley, A., Diggle, P.J., Hardegen, A., Lawrence, T., Milne, R.K. and Nair, G. (2014) On tests of spatial pattern based on simulation envelopes. Ecological Monographs 84 (3) 477–489.
Baddeley, A., Hardegen, A., Lawrence, L., Milne, R.K., Nair, G.M. and Rakshit, S. (2017) On two-stage Monte Carlo tests of composite hypotheses. Computational Statistics and Data Analysis 114, 75–87.
See Also
Simulation envelopes: bits.envelope
.
Other tests:
dg.test
,
dclf.test
,
mad.test
.
Examples
ns <- if(interactive()) 19 else 4
bits.test(cells, nsim=ns)
bits.test(cells, alternative="less", nsim=ns)
bits.test(cells, nsim=ns, interpolate=TRUE)
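Further sketches exercising the leaveout and exponent arguments described above (exponent=Inf is assumed here to give a maximum-absolute-deviation statistic, as in mad.test):
bits.test(cells, nsim=ns, leaveout=2)
bits.test(cells, nsim=ns, exponent=Inf)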
Apply Gaussian Blur to a Pixel Image
Description
Applies a Gaussian blur to a pixel image.
Usage
blur(x, sigma = NULL, ...,
kernel="gaussian", normalise=FALSE, bleed = TRUE, varcov=NULL)
## S3 method for class 'im'
Smooth(X, sigma = NULL, ...,
kernel="gaussian",
normalise=FALSE, bleed = TRUE, varcov=NULL)
Arguments
x , X |
The pixel image. An object of class |
sigma |
Standard deviation of isotropic Gaussian smoothing kernel. |
... |
Ignored. |
kernel |
String (partially matched) specifying the smoothing kernel.
Current options are |
normalise |
Logical flag indicating whether the output values should be divided by the corresponding blurred image of the window itself. See Details. |
bleed |
Logical flag indicating whether to allow blur to extend outside the original domain of the image. See Details. |
varcov |
Variance-covariance matrix of anisotropic Gaussian kernel.
Incompatible with |
Details
This command applies a Gaussian blur to the pixel image x
.
Smooth.im
is a method for the generic Smooth
for pixel images. It is currently identical to blur
,
apart from the name of the first argument.
The blurring kernel is the isotropic Gaussian kernel with standard
deviation sigma
, or the anisotropic Gaussian kernel with
variance-covariance matrix varcov
.
The arguments sigma
and varcov
are incompatible.
Also sigma
may be a vector of length 2 giving the
standard deviations of two independent Gaussian coordinates,
thus equivalent to varcov = diag(sigma^2)
.
If the pixel values of x
include some NA
values
(meaning that the image domain does not completely fill
the rectangular frame) then these NA
values are first reset to zero.
The algorithm then computes the convolution x \ast G
of the (zero-padded) pixel
image x
with the specified Gaussian kernel G
.
If normalise=FALSE
, then this convolution x\ast G
is returned.
If normalise=TRUE
, then the convolution x \ast G
is normalised by
dividing it by the convolution w \ast G
of the image
domain w
with the same Gaussian kernel. Normalisation ensures that the result
can be interpreted as a weighted average of input pixel values,
without edge effects due to the shape of the domain.
If bleed=FALSE
, then pixel values outside the original image
domain are set to NA
. Thus the output is a pixel image with the
same domain as the input. If bleed=TRUE
, then no such
alteration is performed, and the result is a pixel image defined
everywhere in the rectangular frame containing the input image.
Computation is performed using the Fast Fourier Transform.
Value
A pixel image with the same pixel array as the input image x
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
interp.im
for interpolating a pixel image to a finer resolution,
density.ppp
for blurring a point pattern,
Smooth.ppp
for interpolating marks attached to points.
Examples
Z <- as.im(function(x,y) { 4 * x^2 + 3 * y }, letterR)
opa <- par(mfrow=c(1,3))
plot(Z)
plot(letterR, add=TRUE)
plot(blur(Z, 0.3, bleed=TRUE))
plot(letterR, add=TRUE)
plot(blur(Z, 0.3, bleed=FALSE))
plot(letterR, add=TRUE)
par(opa)
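A further sketch contrasting the normalise option described above; normalisation makes the result a weighted average of pixel values, free of edge effects:
plot(blur(Z, 0.3, normalise=TRUE))
plot(letterR, add=TRUE)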
Diffusion Blur
Description
Blur a Pixel Image by Applying Diffusion
Usage
blurHeat(X, ...)
## S3 method for class 'im'
blurHeat(X, sigma, ...,
connect = 8, symmetric = FALSE, k= 1, show = FALSE)
## S3 method for class 'im'
SmoothHeat(X, sigma, ...)
Arguments
X |
Pixel image (object of class |
sigma |
Smoothing bandwidth. A numeric value, a pixel image or
a |
... |
Ignored by |
connect |
Grid connectivity: either 4 or 8. |
symmetric |
Logical value indicating whether to force the algorithm to use a symmetric random walk. |
k |
Integer. Calculations will be performed by repeatedly multiplying
the current state by the |
show |
Logical value indicating whether to plot successive iterations. |
Details
The function blurHeat
is generic.
This help file documents the method blurHeat.im
for pixel images
(objects of class "im"
). This is currently equivalent
to SmoothHeat.im
, which is also documented here.
If sigma
is a numeric value, then
the classical time-dependent heat equation is solved
up to time t = sigma^2
starting with the initial
condition given by the image X
. This has the effect
of blurring the input image X
.
If sigma
is a function or a pixel image, then
it is treated as a spatially-variable diffusion rate,
and the corresponding heat equation is solved.
This command can be used to calculate the expected value
of the diffusion estimator of intensity (densityHeat
)
when the true intensity is known.
Value
A pixel image on the same raster as X
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
See Also
Examples
Z <- as.im(function(x,y) { sin(10*x) + sin(9*y) }, letterR)
ZZ <- blurHeat(Z, 0.2)
plot(solist(original=Z, blurred=ZZ), main="")
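A further sketch with a spatially-varying diffusion rate, supplying sigma as a function as described above; the particular function is illustrative only:
ZV <- blurHeat(Z, sigma=function(x,y) { 0.05 + 0.1 * (x > 3) })
plot(ZV)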
Boyce Index
Description
Calculate the discrete or continuous Boyce index for a spatial point pattern dataset.
Usage
boyce(X, Z, ..., breaks = NULL, halfwidth = NULL)
Arguments
X |
A spatial point pattern (object of class |
Z |
Habitat suitability classes or habitat suitability index.
Either a tessellation (object of class |
... |
Additional arguments passed to |
breaks |
The breakpoint values defining discrete bands of values
of the covariate |
halfwidth |
The half-width |
Details
Given a spatial point pattern X
and some kind of explanatory
information Z
, this function computes either the
index originally defined by Boyce et al (2002)
or the ‘continuous Boyce index’ defined by Hirzel et al (2006).
Boyce et al (2002) defined an index of habitat suitability in which
the study region W
is first divided into separate subregions
C_1,\ldots,C_m
based on appropriate scientific
considerations. Then we count the number n_j
of data
points of X
that fall in each subregion C_j
,
measure the area a_j
of each subregion C_j
,
and calculate the index
B_j = \frac{n_j/n}{a_j/a}
where a
is the total area and n
is the total number of
points in X
.
Hirzel et al (2006) defined another version of this index which is
based on a continuous spatial covariate. For each possible value z
of the covariate Z
,
consider the region C(z)
where the value of the covariate
lies between z-h
and z+h
, where h
is the
chosen ‘halfwidth’. The ‘continuous Boyce index’ is
B(z) = \frac{n(z)/n}{a(z)/a}
where n(z)
is the number of points of X
falling in C(z)
, and a(z)
is the area of C(z)
.
If Z
is a tessellation (object of class "tess"
),
the algorithm calculates the original (‘discrete’) Boyce index
(Boyce et al, 2002)
for each tile of the tessellation. The result is another tessellation,
identical to Z
except that the mark values are the
values of the discrete Boyce index.
If Z
is a pixel image whose values are categorical (i.e. factor
values), then Z
is treated as a tessellation, with one tile
for each level of the factor. The discrete Boyce index is then
calculated. The result is a tessellation with marks that are the
values of the discrete Boyce index.
Otherwise, if Z
is a spatial covariate such as a pixel image,
a function(x,y)
or one of the characters "x"
or
"y"
, then exactly one of the arguments breaks
or
halfwidth
must be given.
If
halfwidth
is given, it should be a single positive number. The continuous Boyce index (Hirzel et al, 2006) is computed using the specified halfwidth h
. The result is an object of class "fv"
that can be plotted to show B(z)
as a function of z
. If
breaks
is given, it can be either a numeric vector of possible values of Z
defining the breakpoints for the bands of values of Z
, or a single integer specifying the number of evenly-spaced breakpoints that should be created. The discrete Boyce index (Boyce et al, 2002) is computed. The result is an object of class "fv"
that can be plotted to show the discrete Boyce index as a function of z
.
When Z
is a spatial covariate (not factor-valued), the calculation is performed
using rhohat.ppp
(since the Boyce index is a special case
of rhohat
). Arguments ...
passed to
rhohat.ppp
control the accuracy of the spatial discretisation
and other parameters of the algorithm.
Value
A tessellation (object of class "tess"
)
or a function value table (object of class "fv"
)
as explained above.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
References
Boyce, M.S., Vernier, P.R., Nielsen, S.E. and Schmiegelow, F.K.A. (2002) Evaluating resource selection functions. Ecological modelling 157, 281–300.
Hirzel, A.H., Le Lay, V., Helfer, V., Randin, C. and Guisan, A. (2006) Evaluating the ability of habitat suitability models to predict species presences. Ecological Modelling 199, 142–152.
See Also
Examples
online <- interactive()
## a simple tessellation
V <- quadrats(Window(bei), 4, 3)
if(online) plot(V)
## discrete Boyce index for a simple tessellation
A <- boyce(bei, V)
if(online) {
plot(A, do.col=TRUE)
marks(A)
tilenames(A)
}
## spatial covariate: terrain elevation
Z <- bei.extra$elev
## continuous Boyce index for terrain elevation
BC <- boyce(bei, Z, halfwidth=10)
if(online) plot(BC)
## discrete Boyce index for terrain elevation steps of height 5 metres
bk <- c(seq(min(Z), max(Z), by=5), Inf)
BD <- boyce(bei, Z, breaks=bk)
if(online) plot(BD)
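A further sketch giving breaks as a single integer, as described above:
## discrete Boyce index with 10 evenly-spaced elevation bands
BE <- boyce(bei, Z, breaks=10)
if(online) plot(BE)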
Cronie and van Lieshout's Criterion for Bandwidth Selection for Kernel Density
Description
Uses Cronie and van Lieshout's criterion based on Cambell's formula to select a smoothing bandwidth for the kernel estimation of point process intensity.
Usage
bw.CvL(X, ..., srange = NULL, ns = 16, sigma = NULL, warn=TRUE)
Arguments
X |
A point pattern (object of class |
... |
Ignored. |
srange |
Optional numeric vector of length 2 giving the range of values of bandwidth to be searched. |
ns |
Optional integer giving the number of values of bandwidth to search. |
sigma |
Optional. Vector of values of the bandwidth to be searched.
Overrides the values of |
warn |
Logical. If |
Details
This function selects an appropriate bandwidth sigma
for the kernel estimator of point process intensity
computed by density.ppp
.
The bandwidth \sigma
is chosen to
minimise the discrepancy between the area of the observation window and the
sum of reciprocal estimated intensity values at the points of the point process
\mbox{CvL}(\sigma) =
(|W| - \sum_i 1/\hat\lambda(x_i))^2
where the sum is taken over all the data points x_i
,
and where \hat\lambda(x_i)
is the
kernel-smoothing estimate of the intensity at
x_i
with smoothing bandwidth \sigma
.
The value of \mbox{CvL}(\sigma)
is computed
directly, using density.ppp
,
for ns
different values of \sigma
between srange[1]
and srange[2]
.
Value
A single numerical value giving the selected bandwidth.
The result also belongs to the class "bw.optim"
(see bw.optim.object
)
which can be plotted to show the bandwidth selection criterion
as a function of sigma
.
Author(s)
Ottmar Cronie ottmar@chalmers.se and Marie-Colette van Lieshout Marie-Colette.van.Lieshout@cwi.nl. Adapted for spatstat by Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Cronie, O and Van Lieshout, M N M (2018) A non-model-based approach to bandwidth selection for kernel estimators of spatial intensity functions, Biometrika, 105, 455-462.
See Also
Alternative methods:
bw.diggle
,
bw.scott
,
bw.ppl
,
bw.frac
.
For adaptive smoothing bandwidths, use bw.CvL.adaptive
.
Examples
if(interactive()) {
b <- bw.CvL(redwood)
b
plot(b, main="Cronie and van Lieshout bandwidth criterion for redwoods")
plot(density(redwood, b))
plot(density(redwood, bw.CvL))
}
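A further sketch restricting the search range and resolution via srange and ns, as described above:
if(interactive()) {
  b2 <- bw.CvL(redwood, srange=c(0.01, 0.2), ns=32)
  plot(b2)
}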
Select Adaptive Bandwidth for Kernel Estimation Using Cronie-Van Lieshout Criterion
Description
Uses the Cronie-Van Lieshout criterion to select the global smoothing bandwidth for adaptive kernel estimation of point process intensity.
Usage
bw.CvL.adaptive(X, ...,
hrange = NULL, nh = 16, h=NULL,
bwPilot = bw.scott.iso(X),
edge = FALSE, diggle = TRUE)
Arguments
X |
A point pattern (object of class |
... |
Additional arguments passed to
|
hrange |
Optional numeric vector of length 2 giving the
range of values of global bandwidth |
nh |
Optional integer giving the number of values of
bandwidth |
h |
Optional. Vector of values of the bandwidth to be searched.
Overrides the values of |
bwPilot |
Pilot bandwidth. A scalar value in the same units as the
coordinates of |
edge |
Logical value indicating whether to apply edge correction. |
diggle |
Logical. If |
Details
This function selects an appropriate value of global bandwidth
h0
for adaptive kernel estimation of the intensity function
for the point pattern X
.
In adaptive estimation, each point in the point pattern is
subjected to a different amount of smoothing, controlled by
data-dependent or spatially-varying bandwidths.
The global bandwidth h0
is a scale factor
which is used to adjust all of the data-dependent bandwidths
according to the Abramson (1982) square-root rule.
This function considers each candidate value of bandwidth h
,
performs the smoothing steps described above, extracts the
adaptively-estimated intensity values
\hat\lambda(x_i)
at each data point x_i
,
and calculates the Cronie-Van Lieshout criterion
\mbox{CvL}(h) = \sum_{i=1}^n \frac 1 {\hat\lambda(x_i)}.
The value of h
which minimises the squared difference
LP2(h) = (CvL(h) - |W|)^2
(where |W|
is the area of the window of X
)
is selected as the optimal global bandwidth.
Bandwidths h
are physical distance values
expressed in the same units as the coordinates of X
.
Value
A single numerical value giving the selected global bandwidth.
The result also belongs to the class "bw.optim"
(see bw.optim.object
)
which can be plotted to show the bandwidth selection criterion
as a function of the global bandwidth h
.
Author(s)
Marie-Colette Van Lieshout. Modified by Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Abramson, I. (1982)
On bandwidth variation in kernel estimates — a square root law.
Annals of Statistics, 10(4), 1217-1223.
Cronie, O and Van Lieshout, M N M (2018) A non-model-based approach to bandwidth selection for kernel estimators of spatial intensity functions, Biometrika, 105, 455-462.
Van Lieshout, M.N.M. (2021) Infill asymptotics for adaptive kernel estimators of spatial intensity. Australian and New Zealand Journal of Statistics 63 (1) 159–181.
See Also
adaptive.density
,
densityAdaptiveKernel.ppp
,
bw.abram.ppp
,
density.ppp
.
To select a fixed smoothing bandwidth
using the Cronie-Van Lieshout criterion, use bw.CvL
.
Examples
online <- interactive()
if(online) {
h0 <- bw.CvL.adaptive(redwood3)
} else {
## faster computation for package checker
h0 <- bw.CvL.adaptive(redwood3, nh=8,
hrange=c(1/4, 4) * bw.diggle(redwood3))
}
plot(h0)
plot(as.fv(h0), CvL ~ h)
if(online) {
Z <- densityAdaptiveKernel(redwood3, h0)
plot(Z)
}
Bandwidth Selection for Diffusion Smoother by Cronie-van Lieshout Rule
Description
Selects an optimal bandwidth for diffusion smoothing using the Cronie-van Lieshout rule.
Usage
bw.CvLHeat(X, ..., srange=NULL, ns=16, sigma=NULL,
leaveoneout=TRUE, verbose = TRUE)
Arguments
X |
Point pattern (object of class |
... |
Arguments passed to |
srange |
Numeric vector of length 2 specifying a range of bandwidths to be considered. |
ns |
Integer. Number of candidate bandwidths to be considered. |
sigma |
Maximum smoothing bandwidth.
A numeric value, or a pixel image, or a |
leaveoneout |
Logical value specifying whether intensity values at data points should be estimated using the leave-one-out rule. |
verbose |
Logical value specifying whether to print progress reports. |
Details
This algorithm selects the optimal global bandwidth for
kernel estimation of intensity for the dataset X
using diffusion smoothing densityHeat.ppp
.
If sigma
is a numeric value, the algorithm finds the
optimal bandwidth tau <= sigma
.
If sigma
is a pixel image or function, the algorithm
finds the optimal fraction 0 < f <= 1
such that
smoothing with f * sigma
would be optimal.
Value
A numerical value giving the selected bandwidth
(if sigma
was a numeric value)
or the selected fraction of the maximum bandwidth
(if sigma
was a pixel image or function).
The result also belongs to the class "bw.optim"
which can be
plotted.
Author(s)
Adrian Baddeley.
See Also
bw.pplHeat
for an alternative method.
Examples
online <- interactive()
if(!online) op <- spatstat.options(npixel=32)
f <- function(x,y) { dnorm(x, 2.3, 0.1) * dnorm(y, 2.0, 0.2) }
X <- rpoint(15, f, win=letterR)
plot(X)
b <- bw.CvLHeat(X, sigma=0.25)
b
plot(b)
if(!online) spatstat.options(op)
Abramson's Adaptive Bandwidths For Spatial Point Pattern
Description
Computes adaptive smoothing bandwidths for a spatial point pattern, according to the inverse-square-root rule of Abramson (1982).
Usage
## S3 method for class 'ppp'
bw.abram(X, h0,
...,
at=c("points", "pixels"),
hp = h0, pilot = NULL, trim=5, smoother=density.ppp)
Arguments
X |
A point pattern (object of class |
h0 |
A scalar value giving the global smoothing bandwidth
in the same units as the coordinates of |
... |
Additional arguments passed to
|
at |
Character string (partially matched) specifying whether
to compute bandwidth values at the points of |
hp |
Optional. A scalar pilot bandwidth, used for estimation
of the pilot density if required. Ignored if |
pilot |
Optional. Specification of a pilot density
(possibly unnormalised).
If |
trim |
A trimming value required to curb excessively large bandwidths. See Details. The default is sensible in most cases. |
smoother |
Smoother for the pilot.
A function or character string, specifying the function
to be used to compute the pilot estimate when
|
Details
This function computes adaptive smoothing bandwidths using the methods of Abramson (1982) and Hall and Marron (1988).
The function bw.abram
is generic. The function
bw.abram.ppp
documented here is the method
for spatial point patterns (objects of class "ppp"
).
If at="points"
(the default) a smoothing bandwidth is
computed for each point in the pattern X
. Alternatively if
at="pixels"
a smoothing bandwidth is computed for
each spatial location in a pixel grid.
Under the Abramson-Hall-Marron rule, the bandwidth at location u
is
h(u) = \mbox{h0} \cdot \min\left[ \frac{\tilde{f}(u)^{-1/2}}{\gamma}, \mbox{trim} \right]
where \tilde{f}(u)
is a pilot estimate of the spatially varying
probability density. The variable bandwidths are rescaled by \gamma
, the
geometric mean of the \tilde{f}(u)^{-1/2}
terms evaluated at the
data; this allows the global bandwidth h0
to be considered on
the same scale as a corresponding fixed bandwidth. The trimming value
trim
has the same interpretation as the required ‘clipping’ of
the pilot density at some small nominal value (see Hall and Marron,
1988), to necessarily prevent extreme bandwidths (which
can occur at very isolated observations).
The pilot density or intensity is determined as follows:
If
pilot
is a pixel image, this is taken as the pilot density or intensity. If
pilot
is NULL
, then the pilot intensity is computed as a fixed-bandwidth kernel intensity estimate using density.ppp
applied to the data pattern X
using the pilot bandwidth hp
. -
If
pilot
is a different point pattern on the same spatial domain as X
, then the pilot intensity is computed as a fixed-bandwidth kernel intensity estimate using density.ppp
applied to pilot
using the pilot bandwidth hp
.
In each case the pilot density or intensity is renormalised to become a probability density, and then the Abramson rule is applied.
Instead of calculating the pilot as a fixed-bandwidth density
estimate, the user can specify another density estimation procedure
using the argument smoother
. This should be either a function
or the character string name of a function. It will replace
density.ppp
as the function used to calculate the
pilot estimate. The pilot estimate will be computed as
smoother(X, sigma=hp, ...)
if pilot
is NULL
,
or smoother(pilot, sigma=hp, ...)
if pilot
is a point
pattern. If smoother
does not recognise the argument name
sigma
for the smoothing bandwidth, then hp
is effectively
ignored, as shown in the Examples.
Value
Either a numeric vector of length npoints(X)
giving the Abramson bandwidth for each point
(when at = "points"
, the default),
or the entire pixel image
of the Abramson bandwidths over the relevant spatial domain
(when at = "pixels"
).
Author(s)
Tilman Davies Tilman.Davies@otago.ac.nz. Adapted by Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Abramson, I. (1982) On bandwidth variation in kernel estimates — a square root law. Annals of Statistics, 10(4), 1217-1223.
Davies, T.M. and Baddeley, A. (2018) Fast computation of spatially adaptive kernel estimates. Statistics and Computing, 28(4), 937-956.
Davies, T.M., Marshall, J.C., and Hazelton, M.L. (2018) Tutorial on kernel estimation of continuous spatial and spatiotemporal relative risk. Statistics in Medicine, 37(7), 1191-1221.
Hall, P. and Marron, J.S. (1988) Variable window width kernel density estimates of probability densities. Probability Theory and Related Fields, 80, 37-49.
Silverman, B.W. (1986) Density Estimation for Statistics and Data Analysis. Chapman and Hall, New York.
See Also
Examples
# 'ch' just 58 laryngeal cancer cases
ch <- split(chorley)[[1]]
h <- bw.abram(ch, h0=1, hp=0.7)
length(h)
summary(h)
if(interactive()) hist(h)
# calculate pilot based on all 1036 observations
h.pool <- bw.abram(ch, h0=1, hp=0.7, pilot=chorley)
length(h.pool)
summary(h.pool)
if(interactive()) hist(h.pool)
# get full image used for 'h' above
him <- bw.abram(ch, h0=1, hp=0.7, at="pixels")
plot(him); points(ch, col="grey")
# use Voronoi-Dirichlet pilot ('hp' is ignored)
hvo <- bw.abram(ch, h0=1, smoother=densityVoronoi)
Adjust Bandwidth Selection Rule to Account for Inhomogeneity
Description
Calculate a bandwidth for kernel estimation of the inhomogeneous pair correlation function. Given a bandwidth selection rule or bandwidth value which would be appropriate for a homogeneous point pattern, this function adjusts the bandwidth to account for inhomogeneity.
Usage
bw.bdh(X, lambda=NULL, ..., base=bw.stoyan, k=2)
Arguments
X |
A point pattern (object of class |
lambda |
Optional.
Values of the estimated intensity function of |
... |
Arguments passed to
|
base |
Bandwidth selection rule, or bandwidth value, that will be
adjusted. Either a single numeric value, or a function that
will be applied to the pattern |
k |
Integer exponent for calculating the adjustment factor. |
Details
This function calculates a bandwidth value for kernel estimation
of the inhomogeneous pair correlation function
using pcfinhom
.
It takes a bandwidth value or bandwidth selection rule that would be appropriate if the point process were homogeneous, adjusts the bandwidth to account for inhomogeneity, and returns the adjusted bandwidth value.
The adjusted (inhomogeneous) bandwidth is the original (homogeneous) bandwidth multiplied by the Baddeley, Davies and Hazelton (2025) variance controlling adjustment factor.
First a numerical bandwidth value, appropriate for a homogeneous process,
is calculated. The default is to apply the extrapolated
Stoyan rule-of-thumb bw.stoyan
to the point pattern
X
. If base
is specified, it may be either a
numeric value for the bandwidth, or another function that will be
applied to X
to calculate a bandwidth value.
Next the intensity value at each point of X
is evaluated.
The argument lambda
may be:
a numeric vector giving the intensity values at the points of
X
; -
a pixel image (object of class
"im"
) giving the intensity values at all locations; -
a fitted point process model (object of class
"ppm"
,"kppm"
or"dppm"
). The intensity of the fitted model will be evaluated at each point ofX
. By default, the fitted model is updated by re-fitting it toX
before the intensity is evaluated. Updating can be disabled by settingupdate=FALSE
. -
a
function(x,y)
which can be evaluated to give the intensity value at any location; -
missing or
NULL
. In this case, the intensity will be estimated from X
using density.ppp
. Arguments ...
controlling the kernel estimation include sigma
, varcov
and kernel
.
Finally the bandwidth is adjusted by multiplying it by the Baddeley, Davies and Hazelton (2025) variance-controlling factor
a = (n^{-1} \sum_i \lambda_i ) \, (n^{-1} \sum_i \lambda_i^{-k} )^{1/k}
where \lambda_i
is the value of lambda
for the i
-th data point X[i]
.
When k=2
(the default), the adjustment factor is
a = (n^{-1} \sum_i \lambda_i) \, \sqrt{n^{-1} \sum_i \lambda_i^{-2}}
Value
A finite positive numerical value giving the selected bandwidth (the standard
deviation of the smoothing kernel).
The result has an attribute "adjust"
giving the adjustment
factor a
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Tilman Davies Tilman.Davies@otago.ac.nz and Martin Hazelton Martin.Hazelton@otago.ac.nz.
References
Baddeley, A., Davies, T.M. and Hazelton, M.L. (2025) An improved estimator of the pair correlation function of a spatial point process. Biometrika, to appear.
See Also
Examples
if(require(spatstat.model)) {
fit <- ppm(japanesepines ~ x)
(b <- bw.bdh(japanesepines, fit))
attr(b, "adjust")
}
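A further sketch in which base is a fixed numeric bandwidth and lambda is omitted, so the intensity is estimated from the data by density.ppp, as described above:
b2 <- bw.bdh(japanesepines, base=0.1)
b2
attr(b2, "adjust")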
Cross Validated Bandwidth Selection for Kernel Density
Description
Uses cross-validation to select a smoothing bandwidth for the kernel estimation of point process intensity.
Usage
bw.diggle(X, ..., correction="good", hmax=NULL, nr=512, warn=TRUE)
Arguments
X |
A point pattern (object of class |
... |
Ignored. |
correction |
Character string passed to |
hmax |
Numeric. Maximum value of bandwidth that should be considered. |
nr |
Integer. Number of steps in the distance value |
warn |
Logical. If |
Details
This function selects an appropriate bandwidth sigma
for the kernel estimator of point process intensity
computed by density.ppp
.
The bandwidth \sigma
is chosen to
minimise the mean-square error criterion defined by Diggle (1985).
The algorithm uses the method of Berman and Diggle (1989) to
compute the quantity
M(\sigma) = \frac{\mbox{MSE}(\sigma)}{\lambda^2} - g(0)
as a function of bandwidth \sigma
,
where \mbox{MSE}(\sigma)
is the
mean squared error at bandwidth \sigma
,
while \lambda
is the mean intensity,
and g
is the pair correlation function.
See Diggle (2003, pages 115-118) for a summary of this method.
The result is a numerical value giving the selected bandwidth.
The result also belongs to the class "bw.optim"
which can be plotted to show the (rescaled) mean-square error
as a function of sigma
.
Value
A single numerical value giving the selected bandwidth.
The result also belongs to the class "bw.optim"
(see bw.optim.object
)
which can be plotted to show the bandwidth selection criterion
as a function of sigma
.
Definition of bandwidth
The smoothing parameter sigma
returned by bw.diggle
(and displayed on the horizontal axis of the plot)
corresponds to h/2
, where h
is the smoothing
parameter described in Diggle (2003, pages 116-118) and
Berman and Diggle (1989).
In those references, the smoothing kernel
is the uniform density on the disc of radius h
. In
density.ppp
, the smoothing kernel is the
isotropic Gaussian density with standard deviation sigma
.
When replacing one kernel by another, the usual
practice is to adjust the bandwidths so that the kernels have equal
variance (cf. Diggle 2003, page 118). This implies that sigma = h/2
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Berman, M. and Diggle, P. (1989) Estimating weighted integrals of the second-order intensity of a spatial point process. Journal of the Royal Statistical Society, series B 51, 81–92.
Diggle, P.J. (1985) A kernel method for smoothing point process data. Applied Statistics (Journal of the Royal Statistical Society, Series C) 34 (1985) 138–147.
Diggle, P.J. (2003) Statistical analysis of spatial point patterns, Second edition. Arnold.
See Also
Alternative methods:
bw.ppl
,
bw.scott
,
bw.CvL
,
bw.frac
.
Examples
attach(split(lansing))
b <- bw.diggle(hickory)
plot(b, ylim=c(-2, 0), main="Cross validation for hickories")
if(interactive()) {
plot(density(hickory, b))
}
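A further sketch using the hmax argument, described above, to bound the candidate bandwidths:
if(interactive()) {
  b2 <- bw.diggle(hickory, hmax=0.1)
  plot(b2)
}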
Bandwidth Selection Based on Window Geometry
Description
Select a smoothing bandwidth for smoothing a point pattern, based only on the geometry of the spatial window. The bandwidth is a specified quantile of the distance between two independent random points in the window.
Usage
bw.frac(X, ..., f=1/4)
Arguments
X |
A window (object of class |
... |
Arguments passed to |
f |
Probability value (between 0 and 1) determining the quantile of the distribution. |
Details
This function selects an appropriate bandwidth sigma
for the kernel estimator of point process intensity
computed by density.ppp
.
The bandwidth \sigma
is computed as a
quantile of the distance between two independent random points
in the window. The default is the lower quartile of this
distribution.
If F(r)
is the cumulative distribution function of the
distance between two independent random points uniformly distributed
in the window, then the value returned is the quantile
with probability f
. That is, the bandwidth is
the value r
such that F(r) = f
.
The cumulative distribution function F(r)
is
computed using distcdf
. We then
compute the smallest number r
such that F(r) \ge f
.
Value
A numerical value giving the selected bandwidth.
The result also belongs to the class "bw.frac"
which can be plotted to show the cumulative distribution function
and the selected quantile.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
For estimating point process intensity, see
density.ppp
,
bw.diggle
,
bw.ppl
,
bw.scott
,
bw.CvL
.
For other smoothing purposes, see
bw.stoyan
,
bw.smoothppp
,
bw.relrisk
.
Examples
h <- bw.frac(letterR)
h
plot(h, main="bw.frac(letterR)")
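A further sketch choosing a different quantile via the argument f, as described above:
h2 <- bw.frac(letterR, f=0.5)   # median interpoint distance
h2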
Class of Optimized Bandwidths
Description
An object of the class "bw.optim"
represents
a tuning parameter (usually a smoothing bandwidth)
that has been selected automatically.
The object can be used as if it were a numerical value,
but it can also be plotted to show the optimality criterion.
Details
An object of the class "bw.optim"
represents the numerical
value of a smoothing bandwidth, a threshold, or a similar
tuning parameter, that has been selected by optimising
a criterion such as cross-validation.
The object is a numerical value, with some attributes that retain information about how the value was selected.
Attributes include the vector of candidate values that were examined, the corresponding values of the optimality criterion, the name of the parameter, the name of the optimality criterion, and the units in which the parameter is measured.
There are methods for print
, plot
,
summary
, as.data.frame
and as.fv
for the class "bw.optim"
.
The print
method simply prints the numerical value of the
parameter.
The summary
method prints this value, and states how
this value was selected.
The plot
method produces a plot of the optimisation criterion
against the candidate value of the parameter. The as.data.frame
and as.fv
methods extract this graphical information as a data
frame or function table, respectively.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Functions which produce objects of class bw.optim
include
bw.CvL
,
bw.CvL.adaptive
,
bw.diggle
,
bw.lppl
,
bw.pcf
,
bw.ppl
,
bw.relrisk
,
bw.relrisk.lpp
,
bw.smoothppp
and
bw.voronoi
Examples
Ns <- if(interactive()) 32 else 3
b <- bw.ppl(redwood, srange=c(0.02, 0.07), ns=Ns)
b
summary(b)
plot(b)
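A further sketch using the as.data.frame and as.fv methods described above to extract the criterion values:
df <- as.data.frame(b)
head(df)
plot(as.fv(b))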
Cross Validated Bandwidth Selection for Pair Correlation Function
Description
Uses composite likelihood or generalized least squares cross-validation to select a smoothing bandwidth for the kernel estimation of pair correlation function.
Usage
bw.pcf(X, ..., rmax=NULL, nr=10000,
cv.method=c("compLik", "leastSQ", "oracle"),
leaveoneout=TRUE, simple=TRUE,
fast=TRUE, srange=NULL, ns=32, use.count=TRUE,
gtrue=NULL,
verbose=FALSE, warn=TRUE)
Arguments
X |
A point pattern (object of class |
... |
Additional arguments passed to |
rmax |
Optional. Numeric value. Maximum value of the spatial lag distance |
nr |
Integer. Number of subintervals for discretization of [0, rmax] to use in computing numerical integrals. |
cv.method |
Choice of cross validation method: either
|
leaveoneout |
Logical value specifying whether to use leave-one-out estimators. See Details. |
simple |
Logical. Whether to use simple removal of spatial lag distances. See Details. |
fast |
Logical value indicating whether to find the optimal value
by an optimization algorithm ( |
srange |
Optional. Numeric vector of length 2 giving the range of bandwidth values that should be searched to find the optimum bandwidth. |
ns |
Integer. Number of values of bandwidths at which to evaluate
the objective function, when |
use.count |
Logical value specifying the benchmark for the calculation
when |
gtrue |
Function in the R language giving the true pair correlation
function, when |
verbose |
Logical value indicating whether to print progress reports during the optimization procedure. |
warn |
Logical. If |
Details
This function selects an appropriate bandwidth bw
for the kernel estimator of the pair correlation function
of a point process intensity computed by pcf
.
With cv.method="leastSQ"
, the bandwidth
h
is chosen to minimise an unbiased
estimate of the integrated mean-square error criterion
M(h)
defined in equation (4) in Guan (2007a).
The code implements the fast algorithm of Jalilian and Waagepetersen
(2018).
With cv.method="compLik"
, the bandwidth
h
is chosen to maximise a likelihood
cross-validation criterion CV(h)
defined in
equation (6) of Guan (2007b).
With cv.method="oracle"
, the true pair correlation function
must be provided as the argument gtrue
. The bandwidth
h
is chosen to minimise the integrated squared difference
between the pcf estimate and the true pcf,
M(h) = \int_0^{\mbox{rmax}} (\hat{g}(r) - g(r))^2 dr
The result is a numerical value giving the selected bandwidth.
Value
A numerical value giving the selected bandwidth.
The result also belongs to the class "bw.optim"
which can be plotted.
Definition of bandwidth
The bandwidth bw
returned by bw.pcf
is the standard deviation of the smoothing kernel,
following the standard convention in R.
As mentioned in the documentation for
density.default
and pcf.ppp
,
this differs from other definitions of bandwidth that can be
found in the literature. The scale parameter
h
, which is called the bandwidth in some literature,
is defined differently.
For example for the Epanechnikov kernel, h
is the half-width
of the kernel, and bw=h/sqrt(5)
.
Author(s)
Rasmus Waagepetersen and Abdollah Jalilian. Adapted for spatstat by Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk. Further hacked by Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Martin Hazelton Martin.Hazelton@otago.ac.nz and Tilman Davies Tilman.Davies@otago.ac.nz.
References
Baddeley, A., Davies, T.M. and Hazelton, M.L. (2025) An improved estimator of the pair correlation function of a spatial point process. Biometrika, to appear.
Guan, Y. (2007a). A composite likelihood cross-validation approach in selecting bandwidth for the estimation of the pair correlation function. Scandinavian Journal of Statistics, 34(2), 336–346.
Guan, Y. (2007b). A least-squares cross-validation bandwidth selection approach in pair correlation function estimations. Statistics & Probability Letters, 77(18), 1722–1729.
Jalilian, A. and Waagepetersen, R. (2018) Fast bandwidth selection for estimation of the pair correlation function. Journal of Statistical Computation and Simulation, 88(10), 2001–2011. https://www.tandfonline.com/doi/full/10.1080/00949655.2018.1428606
See Also
Examples
b <- bw.pcf(redwood3)
plot(pcf(redwood3, bw=b))
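A further sketch selecting the bandwidth by least-squares cross-validation, via the cv.method argument described above:
if(interactive()) {
  b2 <- bw.pcf(redwood3, cv.method="leastSQ")
  plot(b2)
}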
Cross Validated Bandwidth Selection for Inhomogeneous Pair Correlation Function
Description
Uses composite likelihood or generalized least squares cross-validation to select a smoothing bandwidth for the kernel estimation of the inhomogeneous pair correlation function.
Usage
bw.pcfinhom(X, lambda=NULL, ..., rmax=NULL, nr=10000,
cv.method=c("compLik", "leastSQ", "oracle"),
leaveoneout=TRUE, simple=TRUE,
fast=TRUE, srange=NULL, ns=32, use.count=TRUE,
gtrue=NULL,
verbose=FALSE, warn=TRUE)
Arguments
X |
A point pattern (object of class |
lambda |
Optional.
Values of the estimated intensity function.
Either a vector giving the intensity values
at the points of the pattern |
... |
Additional arguments passed to |
rmax |
Optional. Numeric value. Maximum value of the spatial lag distance |
nr |
Integer. Number of subintervals for discretization of [0, rmax] to use in computing numerical integrals. |
cv.method |
Choice of cross validation method: either
|
leaveoneout |
Logical value specifying whether to use leave-one-out estimators. See Details. |
simple |
Logical. Whether to use simple removal of spatial lag distances. See Details. |
fast |
Logical value indicating whether to find the optimal value
by an optimization algorithm ( |
srange |
Optional. Numeric vector of length 2 giving the range of bandwidth values that should be searched to find the optimum bandwidth. |
ns |
Integer. Number of values of bandwidths at which to evaluate
the objective function, when |
use.count |
Logical value specifying the benchmark for the calculation
when |
gtrue |
Function in the R language giving the true pair correlation
function, when |
verbose |
Logical value indicating whether to print progress reports during the optimization procedure. |
warn |
Logical. If |
Details
This function selects an appropriate bandwidth bw
for the kernel estimator of the pair correlation function
of a point process intensity computed by pcfinhom
.
With cv.method="leastSQ"
, the bandwidth
h
is chosen to minimise an unbiased
estimate of the integrated mean-square error criterion
M(h)
defined in equation (4) in Guan (2007a).
The code implements the fast algorithm of Jalilian and Waagepetersen
(2018).
With cv.method="compLik"
, the bandwidth
h
is chosen to maximise a likelihood
cross-validation criterion CV(h)
defined in
equation (6) of Guan (2007b).
With cv.method="oracle"
, the true pair correlation function
must be provided as the argument gtrue
. The bandwidth
h
is chosen to minimise the integrated squared difference
between the pcf estimate and the true pcf,
M(h) = \int_0^{\mbox{rmax}} (\hat{g}(r) - g(r))^2 dr
The result is a numerical value giving the selected bandwidth.
Value
A numerical value giving the selected bandwidth.
The result also belongs to the class "bw.optim"
which can be plotted.
Definition of bandwidth
The bandwidth bw returned by bw.pcfinhom is the standard deviation of the smoothing kernel, following the standard convention in R. As mentioned in the documentation for density.default and pcf.ppp, this differs from other definitions of bandwidth that can be found in the literature. The scale parameter h, which is called the bandwidth in some literature, is defined differently. For example, for the Epanechnikov kernel, h is the half-width of the kernel, and bw=h/sqrt(5).
Author(s)
Rasmus Waagepetersen and Abdollah Jalilian. Adapted for spatstat by Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk. Hacked by Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Martin Hazelton Martin.Hazelton@otago.ac.nz and Tilman Davies Tilman.Davies@otago.ac.nz.
References
Baddeley, A., Davies, T.M. and Hazelton, M.L. (2025) An improved estimator of the pair correlation function of a spatial point process. Biometrika, to appear.
Guan, Y. (2007a). A composite likelihood cross-validation approach in selecting bandwidth for the estimation of the pair correlation function. Scandinavian Journal of Statistics, 34(2), 336–346.
Guan, Y. (2007b). A least-squares cross-validation bandwidth selection approach in pair correlation function estimations. Statistics & Probability Letters, 77(18), 1722–1729.
Jalilian, A. and Waagepetersen, R. (2018) Fast bandwidth selection for estimation of the pair correlation function. Journal of Statistical Computation and Simulation, 88(10), 2001–2011. https://www.tandfonline.com/doi/full/10.1080/00949655.2018.1428606
See Also
Examples
b <- bw.pcfinhom(japanesepines)
plot(pcfinhom(japanesepines, bw=b))
Likelihood Cross Validation Bandwidth Selection for Kernel Density
Description
Uses likelihood cross-validation to select a smoothing bandwidth for the kernel estimation of point process intensity.
Usage
bw.ppl(X, ..., srange=NULL, ns=16, sigma=NULL, varcov1=NULL,
weights=NULL, shortcut=TRUE, warn=TRUE)
Arguments
X |
A point pattern (object of class |
srange |
Optional numeric vector of length 2 giving the range of values of bandwidth to be searched. |
ns |
Optional integer giving the number of values of bandwidth to search. |
sigma |
Optional. Vector of values of the bandwidth to be searched.
Overrides the values of |
varcov1 |
Optional. Variance-covariance matrix of the kernel with
bandwidth |
weights |
Optional. Numeric vector of weights for the points of |
... |
Additional arguments passed to
|
shortcut |
Logical value indicating whether to speed up the calculation by omitting the integral term in the cross-validation criterion. |
warn |
Logical. If |
Details
This function selects an appropriate bandwidth sigma for the kernel estimator of point process intensity computed by density.ppp.

The bandwidth \sigma is chosen to maximise the point process likelihood cross-validation criterion

\mbox{LCV}(\sigma) = \sum_i \log\hat\lambda_{-i}(x_i) - \int_W \hat\lambda(u) \, {\rm d}u

where the sum is taken over all the data points x_i, where \hat\lambda_{-i}(x_i) is the leave-one-out kernel-smoothing estimate of the intensity at x_i with smoothing bandwidth \sigma, and \hat\lambda(u) is the kernel-smoothing estimate of the intensity at a spatial location u with smoothing bandwidth \sigma. See Loader (1999, Section 5.3).

The value of \mbox{LCV}(\sigma) is computed directly, using density.ppp, for ns different values of \sigma between srange[1] and srange[2].

The result is a numerical value giving the selected bandwidth. The result also belongs to the class "bw.optim", which can be plotted to show the (rescaled) likelihood cross-validation criterion as a function of sigma.

If shortcut=TRUE (the default), the computation is accelerated by omitting the integral term in the equation above. This is valid because the integral is approximately constant.
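As an illustration, the criterion can be evaluated directly for a single candidate bandwidth, using the leaveoneout option of density.ppp. This is a minimal sketch for teaching purposes, not the internal code of bw.ppl:

LCV <- function(X, sigma) {
  # leave-one-out estimates of the intensity at the data points
  lamx <- density(X, sigma=sigma, at="points", leaveoneout=TRUE)
  # intensity estimate on a pixel grid, integrated over the window
  lam <- density(X, sigma=sigma)
  sum(log(lamx)) - integral(lam)
}
LCV(redwood, sigma=0.1)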
Value
A numerical value giving the selected bandwidth.
The result also belongs to the class "bw.optim"
which can be plotted.
Anisotropic Smoothing
Anisotropic kernel smoothing is available in density.ppp using the argument varcov to specify the variance-covariance matrix of the anisotropic kernel. In order to choose the matrix varcov, the user can call bw.ppl using the argument varcov1 to specify a 'template' matrix. Scalar multiples of varcov1 will be considered, and the optimal scale factor will be determined. That is, bw.ppl will try smoothing the data using varcov = h^2 * varcov1 for different values of h. The result of bw.ppl will be the optimal value of h.
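A hypothetical sketch of this workflow, assuming a template matrix that makes the kernel twice as wide in the y direction as in the x direction (the matrix V1 below is an arbitrary choice for illustration):

V1 <- diag(c(1, 4))                 # template variance-covariance matrix
h <- bw.ppl(redwood, varcov1=V1)    # optimal scale factor for multiples of V1
plot(density(redwood, varcov=as.numeric(h)^2 * V1))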
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Loader, C. (1999) Local Regression and Likelihood. Springer, New York.
See Also
density.ppp
,
bw.diggle
,
bw.scott
,
bw.CvL
,
bw.frac
.
Examples
if(interactive()) {
b <- bw.ppl(redwood)
plot(b, main="Likelihood cross validation for redwoods")
plot(density(redwood, b))
}
Bandwidth Selection for Diffusion Smoother by Likelihood Cross-Validation
Description
Selects an optimal bandwidth for diffusion smoothing by point process likelihood cross-validation.
Usage
bw.pplHeat(X, ..., srange=NULL, ns=16, sigma=NULL,
leaveoneout=TRUE, verbose = TRUE)
Arguments
X |
Point pattern (object of class |
... |
Arguments passed to |
srange |
Numeric vector of length 2 specifying a range of bandwidths to be considered. |
ns |
Integer. Number of candidate bandwidths to be considered. |
sigma |
Maximum smoothing bandwidth.
A numeric value, or a pixel image, or a |
leaveoneout |
Logical value specifying whether intensity values at data points should be estimated using the leave-one-out rule. |
verbose |
Logical value specifying whether to print progress reports. |
Details
This algorithm selects the optimal global bandwidth for kernel estimation of intensity for the dataset X using diffusion smoothing densityHeat.ppp.

If sigma is a numeric value, the algorithm finds the optimal bandwidth tau <= sigma.

If sigma is a pixel image or function, the algorithm finds the optimal fraction 0 < f <= 1 such that smoothing with f * sigma would be optimal.
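A hypothetical sketch of the second case, in which the maximum bandwidth varies spatially (the image Smax below is an arbitrary choice for illustration):

lam <- function(x,y) { dnorm(x, 2.3, 0.1) * dnorm(y, 2.0, 0.2) }
X <- rpoint(15, lam, win=letterR)
Smax <- as.im(function(x,y) { 0.1 + 0.05 * x }, W=Window(X))
f <- bw.pplHeat(X, sigma=Smax)   # returns the optimal fraction 0 < f <= 1
f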
Value
A numerical value giving the selected bandwidth (if sigma was a numeric value) or the selected fraction of the maximum bandwidth (if sigma was a pixel image or function). The result also belongs to the class "bw.optim" which can be plotted.
Author(s)
Adrian Baddeley and Tilman Davies.
See Also
bw.CvLHeat
for an alternative method.
Examples
online <- interactive()
if(!online) op <- spatstat.options(npixel=32)
f <- function(x,y) { dnorm(x, 2.3, 0.1) * dnorm(y, 2.0, 0.2) }
X <- rpoint(15, f, win=letterR)
plot(X)
b <- bw.pplHeat(X, sigma=0.25)
b
plot(b)
if(!online) spatstat.options(op)
Cross Validated Bandwidth Selection for Relative Risk Estimation
Description
Uses cross-validation to select a smoothing bandwidth for the estimation of relative risk.
Usage
bw.relrisk(X, ...)
## S3 method for class 'ppp'
bw.relrisk(X, method = "likelihood", ...,
nh = spatstat.options("n.bandwidth"),
hmin=NULL, hmax=NULL, warn=TRUE)
Arguments
X |
A multitype point pattern (object of class |
method |
Character string determining the cross-validation method.
Current options are |
nh |
Number of trial values of smoothing bandwidth |
hmin , hmax |
Optional. Numeric values.
Range of trial values of smoothing bandwidth |
warn |
Logical. If |
... |
Additional arguments passed to |
Details
This function selects an appropriate bandwidth for the nonparametric estimation of relative risk using relrisk.

Consider the indicators y_{ij} which equal 1 when data point x_i belongs to type j, and equal 0 otherwise. For a particular value of smoothing bandwidth, let \hat p_j(u) be the estimated probability that a point at location u will belong to type j. Then the bandwidth is chosen to minimise either the negative likelihood, the squared error, or the approximately standardised squared error, of the indicators y_{ij} relative to the fitted values \hat p_j(x_i). See Diggle (2003) or Baddeley et al (2015).

The result is a numerical value giving the selected bandwidth sigma. The result also belongs to the class "bw.optim" allowing it to be printed and plotted. The plot shows the cross-validation criterion as a function of bandwidth.

The range of values for the smoothing bandwidth sigma is set by the arguments hmin, hmax. There is a sensible default, based on multiples of Stoyan's rule of thumb bw.stoyan.

If the optimal bandwidth is achieved at an endpoint of the interval [hmin, hmax], the algorithm will issue a warning (unless warn=FALSE). If this occurs, then it is probably advisable to expand the interval by changing the arguments hmin, hmax.

Computation time depends on the number nh of trial values considered, and also on the range [hmin, hmax] of values considered, because larger values of sigma require calculations involving more pairs of data points.
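For instance, the different cross-validation criteria can be compared on the same data. A quick sketch; the option names "likelihood", "leastsquares" and "weightedleastsquares" are assumed here to be the available choices of method:

b1 <- bw.relrisk(urkiola, method="likelihood")
b2 <- bw.relrisk(urkiola, method="leastsquares")
b3 <- bw.relrisk(urkiola, method="weightedleastsquares")
signif(c(b1, b2, b3), 3)   # compare the three selected bandwidths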
Value
A single numerical value giving the selected bandwidth. The result also belongs to the class "bw.optim" (see bw.optim.object) which can be plotted to show the bandwidth selection criterion as a function of sigma.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net.
References
Baddeley, A., Rubak, E. and Turner, R. (2015) Spatial Point Patterns: Methodology and Applications with R. Chapman and Hall/CRC Press.
Diggle, P.J. (2003) Statistical analysis of spatial point patterns, Second edition. Arnold.
Kelsall, J.E. and Diggle, P.J. (1995) Kernel estimation of relative risk. Bernoulli 1, 3–16.
See Also
Examples
b <- bw.relrisk(urkiola)
b
plot(b)
b <- bw.relrisk(urkiola, hmax=20)
plot(b)
Bandwidth Selection for Relative Risk using Diffusion
Description
Performs data-based bandwidth selection for the diffusion estimate of relative risk relriskHeat.ppp, using either likelihood cross-validation or least squares cross-validation.
Usage
bw.relriskHeatppp(X, ..., method = c("likelihood", "leastsquares"),
weights = NULL, srange = NULL, ns = 16, sigma = NULL,
leaveoneout = TRUE, verbose = TRUE)
Arguments
X |
A multitype point pattern (object of class |
... |
Arguments passed to |
method |
Character string specifying the cross-validation method.
Partially matched to |
weights |
Optional numeric vector of weights associated with each point of |
srange |
Numeric vector of length 2 specifying a range of bandwidths to be considered. |
ns |
Integer. Number of candidate bandwidths to be considered. |
sigma |
Maximum smoothing bandwidth.
A numeric value, or a pixel image, or a |
leaveoneout |
Logical value specifying whether intensity values at data points should be estimated using the leave-one-out rule. |
verbose |
Logical value specifying whether to print progress reports. |
Details
This algorithm selects the optimal global bandwidth for kernel estimation of relative risk for the dataset X using diffusion smoothing relriskHeat.

If sigma is a numeric value, the algorithm finds the optimal bandwidth tau <= sigma.

If sigma is a pixel image or function, the algorithm finds the optimal fraction 0 < f <= 1 such that smoothing with f * sigma would be optimal.
Value
A numerical value giving the selected bandwidth (if sigma was a numeric value) or the selected fraction of the maximum bandwidth (if sigma was a pixel image or function). The result also belongs to the class "bw.optim" which can be plotted.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Tilman Davies Tilman.Davies@otago.ac.nz and Suman Rakshit.
See Also
Examples
## bovine tuberculosis data
X <- subset(btb, select=spoligotype)
if(interactive()) {
smax <- 40
ns <- 16
dimyx <- NULL
} else {
## reduce data and resolution to speed up
X <- X[c(TRUE, rep(FALSE, 7))]
smax <- 9
ns <- 8
dimyx <- 32
}
b <- bw.relriskHeatppp(X, sigma=smax, ns=ns, dimyx=dimyx)
b
plot(b)
Scott's Rule for Bandwidth Selection for Kernel Density
Description
Use Scott's rule of thumb to determine the smoothing bandwidth for the kernel estimation of point process intensity.
Usage
bw.scott(X, isotropic=FALSE, d=NULL)
bw.scott.iso(X)
Arguments
X |
A point pattern (object of class |
isotropic |
Logical value indicating whether to compute a single
bandwidth for an isotropic Gaussian kernel ( |
d |
Advanced use only. An integer value that should be used in Scott's formula instead of the true number of spatial dimensions. |
Details
These functions select a bandwidth sigma for the kernel estimator of point process intensity computed by density.ppp or other appropriate functions. They can be applied to a point pattern belonging to any of the classes "ppp", "lpp", "pp3" or "ppx".

The bandwidth \sigma is computed by the rule of thumb of Scott (1992, page 152, equation 6.42). The bandwidth is proportional to n^{-1/(d+4)} where n is the number of points and d is the number of spatial dimensions.

This rule is very fast to compute. It typically produces a larger bandwidth than bw.diggle. It is useful for estimating gradual trend.

If isotropic=FALSE (the default), bw.scott provides a separate bandwidth for each coordinate axis, and the result of the function is a vector, of length equal to the number of coordinates. If isotropic=TRUE, a single bandwidth value is computed and the result is a single numeric value.

bw.scott.iso(X) is equivalent to bw.scott(X, isotropic=TRUE).

The default value of d is as follows:

class | dimension |
"ppp" | 2 |
"lpp" | 1 |
"pp3" | 3 |
"ppx" | number of spatial coordinates |

The use of d=1 for point patterns on a linear network (class "lpp") was proposed by McSwiggan et al (2016) and Rakshit et al (2019).
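For a two-dimensional pattern this amounts to scaling the sample standard deviation of each coordinate by n^{-1/6}. A hand-rolled sketch of that calculation, for comparison with bw.scott; this mimics the rule of thumb as stated above, not necessarily the package's exact implementation:

X <- cells
n <- npoints(X)
# Scott's rule in d = 2 dimensions: sigma_j = sd_j * n^(-1/(d+4))
c(sd(X$x), sd(X$y)) * n^(-1/6)
bw.scott(X)   # compare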
Value
A numerical value giving the selected bandwidth, or a numerical vector giving the selected bandwidths for each coordinate.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Scott, D.W. (1992) Multivariate Density Estimation. Theory, Practice and Visualization. New York: Wiley.
See Also
density.ppp
,
bw.diggle
,
bw.ppl
,
bw.CvL
,
bw.frac
.
Examples
hickory <- split(lansing)[["hickory"]]
b <- bw.scott(hickory)
b
if(interactive()) {
plot(density(hickory, b))
}
bw.scott.iso(hickory)
bw.scott(osteo$pts[[1]])
Cross Validated Bandwidth Selection for Spatial Smoothing
Description
Uses least-squares cross-validation to select a smoothing bandwidth for spatial smoothing of marks.
Usage
bw.smoothppp(X, ..., nh = spatstat.options("n.bandwidth"),
hmin=NULL, hmax=NULL, warn=TRUE, kernel="gaussian",
varcov1=NULL, train=NULL, test=NULL)
Arguments
X |
A marked point pattern with numeric marks. |
... |
Ignored. |
nh |
Number of trial values of smoothing bandwidth |
hmin , hmax |
Optional. Numeric values.
Range of trial values of smoothing bandwidth |
warn |
Logical. If |
kernel |
The smoothing kernel.
A character string specifying the smoothing kernel
(current options are |
varcov1 |
Optional. Variance-covariance matrix of the kernel with
bandwidth |
train , test |
Optional. Training and testing subsets for cross-validation.
Each argument is either a valid subset index in the usual R sense,
or a window (an object of class |
Details
This function selects an appropriate bandwidth for the nonparametric smoothing of mark values using Smooth.ppp.

The argument X must be a marked point pattern with a vector or data frame of marks. All mark values must be numeric.

The bandwidth is selected by least-squares cross-validation. Let y_i be the mark value at the i-th data point. For a particular choice of smoothing bandwidth, let \hat y_i be the smoothed value at the i-th data point. Then the bandwidth is chosen to minimise the squared error of the smoothed values \sum_i (y_i - \hat y_i)^2.

If the argument train is given, then spatial smoothing is calculated using only the data from the subset X[train]. If the argument test is given, then smoothed values are calculated only at the locations in the subset X[test], and squared errors are summed only over the locations in X[test].

The result of bw.smoothppp is a numerical value giving the selected bandwidth sigma. The result also belongs to the class "bw.optim" allowing it to be printed and plotted. The plot shows the cross-validation criterion as a function of bandwidth.

The range of values for the smoothing bandwidth sigma is set by the arguments hmin, hmax. There is a sensible default, based on the nearest neighbour distances.

If the optimal bandwidth is achieved at an endpoint of the interval [hmin, hmax], the algorithm will issue a warning (unless warn=FALSE). If this occurs, then it is probably advisable to expand the interval by changing the arguments hmin, hmax.

Computation time depends on the number nh of trial values considered, and also on the range [hmin, hmax] of values considered, because larger values of sigma require calculations involving more pairs of data points.
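A quick sketch of cross-validation restricted to training and testing subsets, using an arbitrary odd/even split of the points for illustration:

n <- npoints(longleaf)
odd <- (seq_len(n) %% 2 == 1)
# smooth using the odd-numbered points, assess error at the even-numbered points
b <- bw.smoothppp(longleaf, train=odd, test=!odd)
b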
Value
A single numerical value giving the selected bandwidth. The result also belongs to the class "bw.optim" (see bw.optim.object) which can be plotted to show the bandwidth selection criterion as a function of sigma.
Anisotropic Smoothing
Anisotropic smoothing is available in Smooth.ppp using the argument varcov to specify the variance-covariance matrix of the anisotropic kernel. In order to choose the matrix varcov, the user can call bw.smoothppp using the argument varcov1 to specify a 'template' matrix. Scalar multiples of varcov1 will be considered, and the optimal scale factor will be determined. That is, bw.smoothppp will try smoothing the data using varcov = h^2 * varcov1 for different values of h ranging from hmin to hmax. The result of bw.smoothppp will be the optimal value of the standard deviation scale factor h.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Examples
b <- bw.smoothppp(longleaf)
b
plot(b)
Stoyan's Rule of Thumb for Bandwidth for Estimating Pair Correlation
Description
Computes a rough estimate of the appropriate bandwidth for kernel smoothing estimators of the pair correlation function.
Usage
bw.stoyan(X, co=0.15, extrapolate=FALSE, ...)
Arguments
X |
A point pattern (object of class |
co |
Coefficient appearing in the rule of thumb. See Details. |
extrapolate |
Logical value specifying whether to use the extrapolated version of the rule. See Details. |
... |
Ignored. |
Details
Estimation of the pair correlation function (and similar quantities) by smoothing methods requires a choice of the smoothing bandwidth. Stoyan and Stoyan (1995, equation (15.16), page 285) proposed a rule of thumb for choosing the smoothing bandwidth.
For the Epanechnikov kernel, the rule of thumb is to set the kernel's half-width h to 0.15/\sqrt{\lambda} where \lambda is the estimated intensity of the point pattern, typically computed as the number of points of X divided by the area of the window containing X. For a general kernel, the corresponding rule is to set the standard deviation of the kernel to \sigma = 0.15/\sqrt{5\lambda}.

The coefficient 0.15 can be tweaked using the argument co.

To ensure the bandwidth is finite, an empty point pattern is treated as if it contained 1 point.

The original version of Stoyan's rule, stated above, was developed by experience with patterns of 30 to 100 points. For patterns with larger numbers of points, the bandwidth should be smaller: the theoretically optimal bandwidth decreases in proportion to n^{-1/5} where n is the number of points in the pattern. In the 'extrapolated' version of Stoyan's rule proposed by Baddeley, Davies and Hazelton (2025), the value \sigma calculated above is multiplied by (100/n)^{1/5}. The extrapolated rule is applied if extrapolate=TRUE.
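The extrapolated rule can be verified by hand. A sketch of the arithmetic described above:

X <- shapley
n <- npoints(X)
lambda <- n / area(Window(X))
sigma <- 0.15 / sqrt(5 * lambda)   # basic Stoyan rule
sigma * (100/n)^(1/5)              # extrapolated version
bw.stoyan(X, extrapolate=TRUE)     # compare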
Value
A finite positive numerical value giving the selected bandwidth (the standard deviation of the smoothing kernel).
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net, Tilman Davies Tilman.Davies@otago.ac.nz, Martin Hazelton Martin.Hazelton@otago.ac.nz and Ya-Mei Chang yamei628@gmail.com.
References
Baddeley, A., Davies, T.M. and Hazelton, M.L. (2025) An improved estimator of the pair correlation function of a spatial point process. Biometrika, to appear.
Stoyan, D. and Stoyan, H. (1995) Fractals, random shapes and point fields: methods of geometrical statistics. John Wiley and Sons.
See Also
Examples
bw.stoyan(shapley)
bw.stoyan(shapley, extrapolate=TRUE)
Spatial Distribution Test for Point Pattern or Point Process Model
Description
Performs a test of goodness-of-fit of a point process model. The observed and predicted distributions of the values of a spatial covariate are compared using either the Kolmogorov-Smirnov test, Cramer-von Mises test or Anderson-Darling test. For non-Poisson models, a Monte Carlo test is used.
Usage
cdf.test(...)
## S3 method for class 'ppp'
cdf.test(X, covariate, test=c("ks", "cvm", "ad"), ...,
interpolate=TRUE, jitter=TRUE)
Arguments
X |
A point pattern (object of class |
covariate |
The spatial covariate on which the test will be based.
A function, a pixel image (object of class |
test |
Character string identifying the test to be performed:
|
... |
Arguments passed to |
interpolate |
Logical flag indicating whether to interpolate pixel images.
If |
jitter |
Logical flag. If |
Details
These functions perform a goodness-of-fit test of a Poisson or Gibbs point process model fitted to point pattern data. The observed distribution of the values of a spatial covariate at the data points, and the predicted distribution of the same values under the model, are compared using the Kolmogorov-Smirnov test, the Cramer-von Mises test or the Anderson-Darling test. For Gibbs models, a Monte Carlo test is performed using these test statistics.
The function cdf.test
is generic, with methods for
point patterns ("ppp"
or "lpp"
),
point process models ("ppm"
or "lppm"
)
and spatial logistic regression models ("slrm"
).
-
If X is a point pattern dataset (object of class "ppp"), then cdf.test(X, ...) performs a goodness-of-fit test of the uniform Poisson point process (Complete Spatial Randomness, CSR) for this dataset. For a multitype point pattern, the uniform intensity is assumed to depend on the type of point (sometimes called Complete Spatial Randomness and Independence, CSRI).
-
If model is a fitted point process model (object of class "ppm" or "lppm") then cdf.test(model, ...) performs a test of goodness-of-fit for this fitted model.
-
If model is a fitted spatial logistic regression (object of class "slrm") then cdf.test(model, ...) performs a test of goodness-of-fit for this fitted model.
The test is performed by comparing the observed distribution of the values of a spatial covariate at the data points, and the predicted distribution of the same covariate under the model, using a classical goodness-of-fit test. Thus, you must nominate a spatial covariate for this test.
If X is a point pattern that does not have marks, the argument covariate should be either a function(x,y), or a pixel image (object of class "im") containing the values of a spatial function, or one of the characters "x" or "y" indicating the Cartesian coordinates. If covariate is an image, it should have numeric values, and its domain should cover the observation window of the model. If covariate is a function, it should expect two arguments x and y which are vectors of coordinates, and it should return a numeric vector of the same length as x and y.

If X is a multitype point pattern, the argument covariate can be either a function(x,y,marks), or a pixel image, or a list of pixel images corresponding to each possible mark value, or one of the characters "x" or "y" indicating the Cartesian coordinates.
First the original data point pattern is extracted from model. The values of the covariate at these data points are collected.

The predicted distribution of the values of the covariate under the fitted model is computed as follows. The values of the covariate at all locations in the observation window are evaluated, weighted according to the point process intensity of the fitted model, and compiled into a cumulative distribution function F using ewcdf.

The probability integral transformation is then applied: the values of the covariate at the original data points are transformed by the predicted cumulative distribution function F into numbers between 0 and 1. If the model is correct, these numbers are i.i.d. uniform random numbers. A goodness-of-fit test of the uniform distribution is applied to these numbers using stats::ks.test, goftest::cvm.test or goftest::ad.test.
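The probability integral transformation can be illustrated directly. The following sketch tests CSR for nztrees using the x coordinate, approximating the predicted cdf by sampling uniform random points; this is illustration only, since cdf.test computes the predicted cdf exactly using ewcdf:

X <- nztrees
# approximate predicted cdf of the covariate "x" under CSR
Fhat <- ecdf(runifpoint(10000, Window(X))$x)
U <- Fhat(X$x)        # transformed values; approximately uniform under CSR
ks.test(U, "punif")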
This test was apparently first described (in the context of spatial data, and using Kolmogorov-Smirnov) by Berman (1986). See also Baddeley et al (2005).
If model is not a Poisson process, then a Monte Carlo test is performed, by generating nsim point patterns which are simulated realisations of the model, re-fitting the model to each simulated point pattern, and calculating the test statistic for each fitted model. The Monte Carlo p-value is determined by comparing the simulated values of the test statistic with the value for the original data.
The return value is an object of class "htest" containing the results of the hypothesis test. The print method for this class gives an informative summary of the test outcome. The return value also belongs to the class "cdftest" for which there is a plot method plot.cdftest. The plot method displays the empirical cumulative distribution function of the covariate at the data points, and the predicted cumulative distribution function of the covariate under the model, plotted against the value of the covariate.

The argument jitter controls whether covariate values are randomly perturbed, in order to avoid ties. If the original data contains any ties in the covariate (i.e. points with equal values of the covariate), and if jitter=FALSE, then the Kolmogorov-Smirnov test implemented in ks.test will issue a warning that it cannot calculate the exact p-value. To avoid this, if jitter=TRUE each value of the covariate will be perturbed by adding a small random value. The perturbations are normally distributed with standard deviation equal to one hundredth of the range of values of the covariate. This prevents ties, and the p-value is still correct. There is a very slight loss of power.
Value
An object of class "htest" containing the results of the test. See ks.test for details. The return value can be printed to give an informative summary of the test. The value also belongs to the class "cdftest" for which there is a plot method.
Warning
The outcome of the test involves a small amount of random variability, because (by default) the coordinates are randomly perturbed to avoid tied values. Hence, if cdf.test is executed twice, the p-values will not be exactly the same. To avoid this behaviour, set jitter=FALSE.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Baddeley, A., Turner, R., Moller, J. and Hazelton, M. (2005) Residual analysis for spatial point processes. Journal of the Royal Statistical Society, Series B 67, 617–666.
Berman, M. (1986) Testing for spatial association between a point process and another stochastic process. Applied Statistics 35, 54–62.
See Also
plot.cdftest
,
quadrat.test
,
berman.test
,
ks.test
,
cvm.test
,
ad.test
,
ppm
Examples
op <- options(useFancyQuotes=FALSE)
# test of CSR using x coordinate
cdf.test(nztrees, "x")
cdf.test(nztrees, "x", "cvm")
cdf.test(nztrees, "x", "ad")
# test of CSR using a function of x and y
fun <- function(x,y) { 2*x + y }
cdf.test(nztrees, fun)
# test of CSR using an image covariate
funimage <- as.im(fun, W=Window(nztrees))
cdf.test(nztrees, funimage)
# multitype point pattern
cdf.test(amacrine, "x")
options(op)
Density Estimation for Circular Data
Description
Computes a kernel smoothed estimate of the probability density for angular data.
Usage
circdensity(x, sigma = "nrd0", ...,
bw = NULL,
weights=NULL, unit = c("degree", "radian"))
Arguments
x |
Numeric vector, containing angular data. |
sigma |
Smoothing bandwidth, or bandwidth selection rule, passed to
|
bw |
Alternative to |
... |
Additional arguments passed to
|
weights |
Optional numeric vector of weights for the data in |
unit |
The unit of angle in which |
Details
The angular values x are smoothed using (by default) the wrapped Gaussian kernel with standard deviation sigma.
Value
An object of class "density" (produced by density.default) which can be plotted by plot or by rose.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Examples
ang <- runif(1000, max=360)
rose(circdensity(ang, 12))
Clark and Evans Aggregation Index
Description
Computes the Clark and Evans aggregation index
R
for a spatial point pattern.
Usage
clarkevans(X, correction=c("none", "Donnelly", "cdf"),
clipregion=NULL)
Arguments
X |
A spatial point pattern (object of class |
correction |
Character vector. The type of edge correction(s) to be applied. |
clipregion |
Clipping region for the guard area correction.
A window (object of class |
Details
The Clark and Evans (1954) aggregation index R is a crude measure of clustering or ordering of a point pattern. It is the ratio of the observed mean nearest neighbour distance in the pattern to that expected for a Poisson point process of the same intensity. A value R > 1 suggests ordering, while R < 1 suggests clustering.
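The uncorrected index can be computed by hand. A sketch, using the standard fact that under a Poisson process of intensity \lambda the expected mean nearest neighbour distance is 1/(2\sqrt{\lambda}):

X <- redwood
Dobs <- mean(nndist(X))               # observed mean nearest neighbour distance
Dpois <- 1/(2*sqrt(intensity(X)))     # Poisson expectation
Dobs/Dpois                            # compare clarkevans(X, correction="none")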
Without correction for edge effects, the value of R will be positively biased. Edge effects arise because, for a point of X close to the edge of the window, the true nearest neighbour may actually lie outside the window. Hence observed nearest neighbour distances tend to be larger than the true nearest neighbour distances.

The argument correction specifies an edge correction or several edge corrections to be applied. It is a character vector containing one or more of the options "none", "Donnelly", "guard" and "cdf" (which are recognised by partial matching).
These edge corrections are:
- "none":
-
No edge correction is applied.
- "Donnelly":
-
Edge correction of Donnelly (1978), available for rectangular windows only. The theoretical expected value of mean nearest neighbour distance under a Poisson process is adjusted for edge effects by the edge correction of Donnelly (1978). The value of
R
is the ratio of the observed mean nearest neighbour distance to this adjusted theoretical mean. - "guard":
-
Guard region or buffer area method. The observed mean nearest neighbour distance for the point pattern
X
is re-defined by averaging only over those points ofX
that fall inside the sub-windowclipregion
. - "cdf":
-
Cumulative Distribution Function method. The nearest neighbour distance distribution function
G(r)
of the stationary point process is estimated byGest
using the Kaplan-Meier type edge correction. Then the mean of the distribution is calculated from the cdf.
Alternatively correction="all"
selects all options.
If the argument clipregion
is given, then the selected
edge corrections will be assumed to include correction="guard"
.
To perform a test based on the Clark-Evans index,
see clarkevans.test
.
Value
A numeric value, or a numeric vector with named components:

naive | the index with no edge correction |
Donnelly | the index with the Donnelly edge correction |
guard | the index with the guard region correction |
cdf | the index with the cdf edge correction |

(as selected by correction). The value of the Donnelly component will be NA if the window of X is not a rectangle.
Author(s)
John Rudge rudge@esc.cam.ac.uk with modifications by Adrian Baddeley Adrian.Baddeley@curtin.edu.au
References
Clark, P.J. and Evans, F.C. (1954) Distance to nearest neighbour as a measure of spatial relationships in populations Ecology 35, 445–453.
Donnelly, K. (1978) Simulations to determine the variance and edge-effect of total nearest neighbour distance. In I. Hodder (ed.) Simulation studies in archaeology, Cambridge/New York: Cambridge University Press, pp 91–95.
See Also
clarkevans.test
,
hopskel
,
nndist
,
Gest
Examples
# Example of a clustered pattern
clarkevans(redwood)
# Example of an ordered pattern
clarkevans(cells)
# Random pattern
X <- rpoispp(100)
clarkevans(X)
# How to specify a clipping region
clip1 <- owin(c(0.1,0.9),c(0.1,0.9))
clip2 <- erosion(Window(cells), 0.1)
clarkevans(cells, clipregion=clip1)
clarkevans(cells, clipregion=clip2)
Clark and Evans Test
Description
Performs the Clark-Evans test of aggregation for a spatial point pattern.
Usage
clarkevans.test(X, ...,
correction,
clipregion=NULL,
alternative=c("two.sided", "less", "greater",
"clustered", "regular"),
method=c("asymptotic", "MonteCarlo"),
nsim=999)
Arguments
X |
A spatial point pattern (object of class |
... |
Ignored. |
correction |
Character string.
The type of edge correction to be applied.
See |
clipregion |
Clipping region for the guard area correction.
A window (object of class |
alternative |
String indicating the type of alternative for the hypothesis test. Partially matched. |
method |
Character string (partially matched) specifying how to calculate
the |
nsim |
Number of Monte Carlo simulations to perform, if a Monte Carlo
|
Details
This command uses the Clark and Evans (1954) aggregation index R as the basis for a crude test of clustering or ordering of a point pattern.

The Clark-Evans aggregation index R is computed by the separate function clarkevans.

This command clarkevans.test performs a hypothesis test of clustering or ordering of the point pattern X based on the Clark-Evans index R. The null hypothesis is Complete Spatial Randomness, i.e. a uniform Poisson process. The alternative hypothesis is specified by the argument alternative:
-
alternative="less"
oralternative="clustered"
: the alternative hypothesis is thatR < 1
corresponding to a clustered point pattern; -
alternative="greater"
oralternative="regular"
: the alternative hypothesis is thatR > 1
corresponding to a regular or ordered point pattern; -
alternative="two.sided"
: the alternative hypothesis is thatR \neq 1
corresponding to a clustered or regular pattern.
The Clark-Evans index R is first computed for the point pattern dataset X using the edge correction determined by the arguments correction and clipregion. These arguments are documented in the help file for clarkevans.

If method="asymptotic" (the default), the p-value for the test is computed by standardising R as proposed by Clark and Evans (1954) and referring the standardised statistic to the standard Normal distribution. For this asymptotic test, the default edge correction is correction="Donnelly" if the window of X is a rectangle, and correction="cdf" otherwise. It is strongly recommended to avoid using correction="none", which would lead to a severely biased test.

If method="MonteCarlo", the p-value for the test is computed by comparing the observed value of R to the results obtained from nsim simulated realisations of Complete Spatial Randomness conditional on the observed number of points. This test is theoretically exact for any choice of edge correction, but may have lower power than the asymptotic test. For this Monte Carlo test, the default edge correction is correction="none" for computational efficiency.
Value
An object of class "htest"
representing the result of the test.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
References
Clark, P.J. and Evans, F.C. (1954) Distance to nearest neighbour as a measure of spatial relationships in populations. Ecology 35, 445–453.
Donnelly, K. (1978) Simulations to determine the variance and edge-effect of total nearest neighbour distance. In Simulation methods in archaeology, Cambridge University Press, pp 91–95.
See Also
Examples
# Redwood data - clustered
clarkevans.test(redwood)
clarkevans.test(redwood, alternative="clustered")
clarkevans.test(redwood, correction="cdf", method="MonteCarlo", nsim=39)
Allard-Fraley Estimator of Cluster Feature
Description
Detect high-density features in a spatial point pattern using the (unrestricted) Allard-Fraley estimator.
Usage
clusterset(X, what=c("marks", "domain"),
..., verbose=TRUE,
fast=FALSE,
exact=!fast)
Arguments
X |
A two-dimensional spatial point pattern (object of class
|
what |
Character string or character vector specifying the type of result. See Details. |
verbose |
Logical value indicating whether to print progress reports. |
fast |
Logical. If |
exact |
Logical. If |
... |
Optional arguments passed to |
Details
Allard and Fraley (1997) developed a technique for recognising features of high density in a spatial point pattern in the presence of random clutter.
This algorithm computes the unrestricted Allard-Fraley estimator.
The Dirichlet (Voronoi) tessellation of the point pattern X is computed. The smallest m Dirichlet cells are selected, where the number m is determined by a maximum likelihood criterion.
-
If fast=FALSE (the default), the areas of the tiles of the Dirichlet tessellation will be computed exactly using polygonal geometry. This ensures that the optimal selection of tiles is computed exactly.
-
If fast=TRUE, the Dirichlet tile areas will be approximated by counting pixels. This is faster, and is usually correct (depending on the pixel resolution, which is controlled by the arguments ...).

The type of result depends on the character vector what.

-
If what="marks", the result is the point pattern X with a vector of marks labelling each point with a value yes or no depending on whether the corresponding Dirichlet cell is selected by the Allard-Fraley estimator. In other words each point of X is labelled as either a cluster point or a non-cluster point.
-
If what="domain", the result is the Allard-Fraley estimator of the cluster feature set, which is the union of all the selected Dirichlet cells, represented as a window (object of class "owin").
-
If what=c("marks", "domain") the result is a list containing both of the results described above.

Computation of the Allard-Fraley set estimator depends on the argument exact.

-
If exact=TRUE (the default), the Allard-Fraley set estimator will be computed exactly using polygonal geometry. The result is a polygonal window.
-
If exact=FALSE, the Allard-Fraley set estimator will be approximated by a binary pixel mask. This is faster than the exact computation. The result is a binary mask.
Value
If what="marks"
, a multitype point pattern (object of class
"ppp"
).
If what="domain"
, a window (object of class
"owin"
).
If what=c("marks", "domain")
(the default),
a list consisting of a multitype point pattern and a window.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
References
Allard, D. and Fraley, C. (1997) Nonparametric maximum likelihood estimation of features in spatial point processes using Voronoi tessellation. Journal of the American Statistical Association 92, 1485–1493.
See Also
Examples
opa <- par(mfrow=c(1,2))
W <- grow.rectangle(as.rectangle(letterR), 1)
X <- superimpose(runifpoint(300, letterR),
runifpoint(50, W), W=W)
plot(W, main="clusterset(X, 'm')")
plot(clusterset(X, "marks", fast=TRUE), add=TRUE, chars=c(1, 3), cols=1:2)
plot(letterR, add=TRUE)
plot(W, main="clusterset(X, 'd')")
plot(clusterset(X, "domain", exact=FALSE), add=TRUE)
plot(letterR, add=TRUE)
par(opa)
Collapse Several Function Tables into One
Description
Combines several function tables (objects of class "fv"
)
into a single function table, merging columns that are identical
and relabelling columns that are different.
Usage
## S3 method for class 'fv'
collapse(object, ..., same = NULL, different = NULL)
## S3 method for class 'anylist'
collapse(object, ..., same = NULL, different = NULL)
Arguments
object |
An object of class |
... |
Additional objects of class |
same |
Character string or character vector specifying a column or columns
of function values that are identical in different |
different |
Character string or character vector specifying a column or columns
of function values, that are different in different |
Details
This is a method for the generic function collapse. It combines the data in several function tables (objects of class "fv", see fv.object) to make a single function table. It is essentially a smart wrapper for cbind.fv.

A typical application is to calculate the same summary statistic (such as the K function) for different point patterns, and then to use collapse.fv to combine the results into a single object that can easily be plotted. See the Examples.

The arguments object and ... should be function tables (objects of class "fv", see fv.object) that are compatible in the sense that they have the same values of the function argument. (This can be ensured by applying harmonise.fv to them.)

The argument same identifies any columns that are present in some or all of the function tables, and which are known to contain exactly the same values in each table that includes them. This column or columns will be included only once in the result.

The argument different identifies any columns that are present in some or all of the function tables, and which may contain different numerical values in different tables. Each of these columns will be included, with labels to distinguish them.

Columns that are not named in same or different will not be included. The function argument is always included and does not need to be specified.

The arguments same and different can be NULL, or they can be character vectors containing the names of columns of object. The argument different can be one of the abbreviations recognised by fvnames.
Value
Object of class "fv"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Examples
# generate simulated data
X <- replicate(3, rpoispp(100), simplify=FALSE)
names(X) <- paste("Simulation", 1:3)
# compute K function estimates
Klist <- anylapply(X, Kest)
# collapse
K <- collapse(Klist, same="theo", different="iso")
K
Test Whether Function Arrays Are Compatible
Description
Tests whether two or more function arrays (class "fasp"
)
are compatible.
Usage
## S3 method for class 'fasp'
compatible(A, B, ...)
Arguments
A , B , ... |
Two or more function arrays (object of class |
Details
An object of class "fasp" can be regarded as an array of functions. Such objects are returned by the command alltypes.

This command tests whether such objects are compatible (so that, for example, they could be added or subtracted). It is a method for the generic command compatible.

The function arrays are compatible if the arrays have the same dimensions, and the corresponding elements in each cell of the array are compatible as defined by compatible.fv.
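A minimal sketch (trivially, a function array is compatible with itself):

A <- alltypes(amacrine, "K")
compatible(A, A)   # TRUE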
Value
Logical value: TRUE if the objects are compatible, and FALSE if they are not.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Test Whether Function Objects Are Compatible
Description
Tests whether two or more function objects (class "fv"
)
are compatible.
Usage
## S3 method for class 'fv'
compatible(A, B, ..., samenames=TRUE)
Arguments
A , B , ... |
Two or more function value objects (class
|
samenames |
Logical value indicating whether to check for complete agreement
between the column names of the objects ( |
Details
An object of class "fv" is essentially a data frame containing several different statistical estimates of the same function. Such objects are returned by Kest and its relatives.

This command tests whether such objects are compatible (so that, for example, they could be added or subtracted). It is a method for the generic command compatible.

The functions are compatible if they have been evaluated at the same sequence of values of the argument r, and if the statistical estimates have the same names.
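For example, two K-function estimates computed on the same window are evaluated at the same r values and should be compatible. A sketch, using rjitter so that the window is unchanged:

K1 <- Kest(cells)
K2 <- Kest(rjitter(cells, radius=0.01))
compatible(K1, K2)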
Value
Logical value: TRUE if the objects are compatible, and FALSE if they are not.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Generic Calculation of Cumulative Distribution Function of Distances
Description
A low-level function which calculates the estimated cumulative distribution function of a distance variable.
Usage
compileCDF(D, B, r, ..., han.denom=NULL, check=TRUE)
Arguments
D |
A vector giving the distances from each data point to the target. |
B |
A vector giving the distances from each data point to the window boundary, or censoring distances. |
r |
An equally spaced, finely spaced sequence of distance values at which the CDF should be estimated. |
... |
Ignored. |
han.denom |
Denominator for the Hanisch-Chiu-Stoyan estimator.
A single number, or a numeric vector with the same length
as |
check |
Logical value specifying whether to check validity of the data,
for example, that the vectors |
Details
This low-level function calculates estimates of the cumulative distribution function F(r) = P(D \le r) of a distance variable D, given a vector of observed values of D and other information. Examples of this concept include the empty space distance function computed by Fest and the nearest-neighbour distance distribution function Gest.

This function compileCDF and its siblings compileK and compilepcf are useful for code development and for teaching, because they perform a common task, and do the housekeeping required to make an object of class "fv" that represents the estimated function. However, they are not very efficient.

The argument D should be a numeric vector of shortest distances measured from each 'query' point to the 'target' set. The argument B should be a numeric vector of shortest distances measured from each 'query' point to the boundary of the window of observation. All entries of D and B should be non-negative.

compileCDF calculates estimates of the cumulative distribution function F(r) using the border method (reduced sample estimator), the Kaplan-Meier estimator and, if han.denom is given, the Hanisch-Chiu-Stoyan estimator. See Chapter 8 of Baddeley, Rubak and Turner (2015).

The result is an object of class "fv" representing the estimated function. Additional columns (such as a column giving the theoretical value) must be added by the user, with the aid of bind.fv.
Value
An object of class "fv"
representing the estimated function.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
References
Baddeley, A., Rubak, E. and Turner, R. (2015) Spatial Point Patterns: Methodology and Applications with R. Chapman and Hall/CRC Press.
See Also
bind.fv
to add more columns.
Examples
## Equivalent to Gest(japanesepines)
X <- japanesepines
D <- nndist(X)
B <- bdist.points(X)
r <- seq(0, 0.25, by=0.01)
H <- eroded.areas(Window(X), r)
G <- compileCDF(D=D, B=B, r=r, han.denom=H)
G <- rebadge.fv(G, new.fname="G", new.ylab=quote(G(r)))
plot(G)
Generic Calculation of K Function and Pair Correlation Function
Description
Low-level functions which
calculate the estimated K
function
and estimated pair correlation function
(or any similar functions)
from a matrix of pairwise distances and optional weights.
Usage
compileK(D, r, weights = NULL, denom = 1,
check = TRUE, ratio = FALSE, fname = "K",
samplesize=denom)
compilepcf(D, r, weights = NULL, denom = 1,
check = TRUE, endcorrect = TRUE, ratio=FALSE,
..., fname = "g", samplesize=denom)
Arguments
D |
A square matrix giving the distances between all pairs of points. |
r |
An equally spaced, finely spaced sequence of distance values. |
weights |
Optional numerical weights for the pairwise distances.
A numeric matrix with the same dimensions as |
denom |
Denominator for the estimator.
A single number, or a numeric vector with the same length
as |
check |
Logical value specifying whether to check that |
ratio |
Logical value indicating whether to store ratio information. See Details. |
... |
Optional arguments passed to |
endcorrect |
Logical value indicating whether to apply End Correction of
the pair correlation estimate at |
fname |
Character string giving the name of the function being estimated. |
samplesize |
The sample size that should be used as the denominator when
|
Details
These low-level functions construct estimates of the K function or pair correlation function, or any similar functions, given only the matrix of pairwise distances and optional weights associated with these distances. These functions are useful for code development and for teaching, because they perform a common task, and do the housekeeping required to make an object of class "fv" that represents the estimated function. However, they are not very efficient.

compileK calculates the weighted estimate of the K function,

\hat K(r) = (1/v(r)) \sum_i \sum_j 1\{ d_{ij} \le r \} w_{ij}

and compilepcf calculates the weighted estimate of the pair correlation function,

\hat g(r) = (1/v(r)) \sum_i \sum_j \kappa( d_{ij} - r ) w_{ij}

where d_{ij} is the distance between spatial points i and j, with corresponding weight w_{ij}, and v(r) is a specified denominator. Here \kappa is a fixed-bandwidth smoothing kernel.

For a point pattern in two dimensions, the usual denominator v(r) is constant for the K function, and proportional to r for the pair correlation function. See the Examples.

The result is an object of class "fv" representing the estimated function. This object has only one column of function values. Additional columns (such as a column giving the theoretical value) must be added by the user, with the aid of bind.fv.

If ratio=TRUE, the result also belongs to class "rat" and has attributes containing the numerator and denominator of the function estimate. (If samplesize is given, the numerator and denominator are rescaled by a common factor so that the denominator is equal to samplesize.) This allows function estimates from several datasets to be pooled using pool.
Value
An object of class "fv"
representing the estimated function.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
See Also
Kest
,
pcf
for definitions of the K
function
and pair correlation function.
bind.fv
to add more columns.
compileCDF
for the corresponding low-level utility
for estimating a cumulative distribution function.
Examples
## Equivalent to Kest(japanesepines) and pcf(japanesepines)
X <- japanesepines
D <- pairdist(X)
Wt <- edge.Ripley(X, D)
lambda <- intensity(X)
a <- (npoints(X)-1) * lambda
r <- seq(0, 0.25, by=0.01)
K <- compileK(D=D, r=r, weights=Wt, denom=a)
g <- compilepcf(D=D, r=r, weights=Wt, denom= a * 2 * pi * r)
Covariance and Correlation between Images
Description
Compute the covariance or correlation between (the corresponding pixel values in) several images.
Usage
cov.im(..., use = "complete.obs", method = c("pearson", "kendall", "spearman"))
cor.im(..., use = "complete.obs", method = c("pearson", "kendall", "spearman"))
Arguments
... |
Any number of arguments, each of which is
a pixel image (object of class |
use |
Argument passed to |
method |
Argument passed to |
Details
The arguments ... should be pixel images (objects of class "im"). Their spatial domains must overlap, but need not have the same pixel dimensions.

These functions compute the covariance or correlation between the corresponding pixel values in the images given. The pixel images are converted to a common pixel resolution (by resampling). Then the corresponding pixel values of each image are extracted. Finally the correlation or covariance between the pixel values of each pair of images, at corresponding pixels, is computed.

The result is a symmetric matrix with one row and column for each image. The [i,j] entry is the correlation or covariance between the i-th and j-th images in the argument list. The row names and column names of the matrix are copied from the argument names if they were given (i.e. if the arguments were given as name=value).

The argument use specifies how to handle NA values. A pixel value of NA is assigned to any pixel falling outside the spatial domain of an image (i.e. outside the window in which the image is defined). If any one of the image arguments ... is defined on a non-rectangular window, or if the image arguments are not all defined on the same window, then the data will contain NA values. Options for the argument use are documented in the help file for cov and cor.
-
use="complete.obs"
(the default): calculations are based on those pixels which lie inside the intersection of the windows of all the images. An error occurs if the intersection is empty. -
use="na.or.complete"
: calculations are based on those pixels which lie inside the intersection of the windows of all the images. If the intersection is empty, a matrix ofNA
values is returned. -
use="pairwise.complete.obs"
: the calculation of the covariance or correlation between each pair of images is based on the pixels which lie in the intersection of the windows of those two images. Only available whenmethod="pearson"
. The resulting matrix may not be positive definite. -
use="everything"
: the calculation is based on all pixels, but any calculation of variance or covariance or correlation that includes anNA
value gives anNA
result in the corresponding entry in the matrix. -
use="all.obs"
: the calculation is based on all pixels, and an error occurs if any pixel has anNA
value in any image.
Note that cor
and cov
are not generic, so you have to type cor.im
, cov.im
.
Value
A symmetric matrix.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Examples
cor.im(bei.extra)
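# A variant with named arguments, which labels the rows and columns of the
# result matrix (bei.extra contains the pixel images elev and grad)
cor.im(elevation=bei.extra$elev, gradient=bei.extra$grad)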
Progress Plot of Test of Spatial Pattern
Description
Generates a progress plot (envelope representation) of the Diggle-Cressie-Loosmore-Ford test or the Maximum Absolute Deviation test for a spatial point pattern.
Usage
dclf.progress(X, ...)
mad.progress(X, ...)
mctest.progress(X, fun = Lest, ...,
exponent = 1, nrank = 1,
interpolate = FALSE, alpha, rmin=0)
Arguments
X |
Either a point pattern (object of class |
... |
Arguments passed to |
fun |
Function that computes the desired summary statistic for a point pattern. |
exponent |
Positive number. The exponent of the |
nrank |
Integer. The rank of the critical value of the Monte Carlo test,
amongst the |
interpolate |
Logical value indicating how to compute the critical value.
If |
alpha |
Optional. The significance level of the test.
Equivalent to |
rmin |
Optional. Left endpoint for the interval of |
Details
The Diggle-Cressie-Loosmore-Ford test and the Maximum Absolute Deviation test for a spatial point pattern are described in dclf.test. These tests depend on the choice of an interval of distance values (the argument rinterval). A progress plot or envelope representation of the test (Baddeley et al, 2014) is a plot of the test statistic (and the corresponding critical value) against the length of the interval rinterval.

The command dclf.progress performs dclf.test on X using all possible intervals of the form [0,R], and returns the resulting values of the test statistic, and the corresponding critical values of the test, as a function of R. Similarly mad.progress performs mad.test using all possible intervals and returns the test statistic and critical value.

More generally, mctest.progress performs a test based on the L^p discrepancy between the curves. The deviation between two curves is measured by the p-th root of the integral of the p-th power of the absolute value of the difference between the two curves. The exponent p is given by the argument exponent. The case exponent=2 is the Diggle-Cressie-Loosmore-Ford test, while exponent=Inf is the MAD test.

If the argument rmin is given, it specifies the left endpoint of the interval defining the test statistic: the tests are performed using intervals [r_{\mbox{\scriptsize min}},R] where R \ge r_{\mbox{\scriptsize min}}.

The result of each command is an object of class "fv" that can be plotted to obtain the progress plot. The display shows the test statistic (solid black line) and the Monte Carlo acceptance region (grey shading).

The significance level for the Monte Carlo test is nrank/(nsim+1). Note that nsim defaults to 99, so if the values of nrank and nsim are not given, the default is a test with significance level 0.01.

If X is an envelope object, then some of the data stored in X may be re-used:
-
If X is an envelope object containing simulated functions, and fun=NULL, then the code will re-use the simulated functions stored in X.
-
If X is an envelope object containing simulated point patterns, then fun will be applied to the stored point patterns to obtain the simulated functions. If fun is not specified, it defaults to Lest.
-
Otherwise, new simulations will be performed, and fun defaults to Lest.
Value
An object of class "fv" that can be plotted to obtain the progress plot.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Andrew Hardegen, Tom Lawrence, Gopal Nair and Robin Milne.
References
Baddeley, A., Diggle, P., Hardegen, A., Lawrence, T., Milne, R. and Nair, G. (2014) On tests of spatial pattern based on simulation envelopes. Ecological Monographs 84 (3) 477–489.
See Also
dclf.test
and
mad.test
for the tests.
See plot.fv
for information on plotting
objects of class "fv"
.
Examples
plot(dclf.progress(cells, nsim=19))
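# Further hedged sketches of the same idea:
# MAD progress plot, and an L^3 discrepancy via mctest.progress
plot(mad.progress(cells, nsim=19))
plot(mctest.progress(cells, fun=Lest, exponent=3, nsim=19))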
Significance Trace of Cressie-Loosmore-Ford or Maximum Absolute Deviation Test
Description
Generates a Significance Trace of the Diggle (1986) / Cressie (1991) / Loosmore and Ford (2006) test or the Maximum Absolute Deviation test for a spatial point pattern.
Usage
dclf.sigtrace(X, ...)
mad.sigtrace(X, ...)
mctest.sigtrace(X, fun=Lest, ...,
exponent=1, interpolate=FALSE, alpha=0.05,
confint=TRUE, rmin=0)
Arguments
X |
Either a point pattern (object of class |
... |
Arguments passed to |
fun |
Function that computes the desired summary statistic for a point pattern. |
exponent |
Positive number. The exponent of the |
interpolate |
Logical value specifying whether to calculate the |
alpha |
Significance level to be plotted (this has no effect on the calculation but is simply plotted as a reference value). |
confint |
Logical value indicating whether to compute a confidence interval
for the ‘true’ |
rmin |
Optional. Left endpoint for the interval of |
Details
The Diggle (1986) / Cressie (1991) / Loosmore and Ford (2006) test and the
Maximum Absolute Deviation test for a spatial point pattern
are described in dclf.test
.
These tests depend on the choice of an interval of
distance values (the argument rinterval
).
A significance trace (Bowman and Azzalini, 1997;
Baddeley et al, 2014, 2015; Baddeley, Rubak and Turner, 2015)
of the test is a plot of the p
-value
obtained from the test against the length of
the interval rinterval
.
The command dclf.sigtrace
performs
dclf.test
on X
using all possible intervals
of the form [0,R]
, and returns the resulting p
-values
as a function of R
.
Similarly mad.sigtrace
performs
mad.test
using all possible intervals
and returns the p
-values.
More generally, mctest.sigtrace
performs a test based on the
L^p
discrepancy between the curves. The deviation between two
curves is measured by the p
th root of the integral of
the p
th power of the absolute value of the difference
between the two curves. The exponent p
is
given by the argument exponent
. The case exponent=2
is the Cressie-Loosmore-Ford test, while exponent=Inf
is the
MAD test.
If the argument rmin is given, it specifies the left endpoint of the interval defining the test statistic: the tests are performed using intervals [r_{\min}, R] where R \ge r_{\min}.
The result of each command
is an object of class "fv"
that can be plotted to
obtain the significance trace. The plot shows the Monte Carlo
p
-value (solid black line),
the critical value 0.05
(dashed red line),
and a pointwise 95% confidence band (grey shading)
for the ‘true’ (Neyman-Pearson) p
-value.
The confidence band is based on the Agresti-Coull (1998)
confidence interval for a binomial proportion (when
interpolate=FALSE
) or the delta method
and normal approximation (when interpolate=TRUE
).
If X
is an envelope object and fun=NULL
then
the code will re-use the simulated functions stored in X
.
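For example, a sketch in which the simulations are generated once, saved, and then re-used:
E <- envelope(cells, Lest, nsim=19, savefuns=TRUE)
plot(mad.sigtrace(E, fun=NULL))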
Value
An object of class "fv"
that can be plotted to
obtain the significance trace.
Author(s)
Adrian Baddeley, Andrew Hardegen, Tom Lawrence, Robin Milne, Gopalan Nair and Suman Rakshit. Implemented by Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Agresti, A. and Coull, B.A. (1998) Approximate is better than “Exact” for interval estimation of binomial proportions. American Statistician 52, 119–126.
Baddeley, A., Diggle, P., Hardegen, A., Lawrence, T., Milne, R. and Nair, G. (2014) On tests of spatial pattern based on simulation envelopes. Ecological Monographs 84(3) 477–489.
Baddeley, A., Hardegen, A., Lawrence, T., Milne, R.K., Nair, G.M. and Rakshit, S. (2015) Pushing the envelope: extensions of graphical Monte Carlo tests. Unpublished manuscript.
Baddeley, A., Rubak, E. and Turner, R. (2015) Spatial Point Patterns: Methodology and Applications with R. Chapman and Hall/CRC Press.
Bowman, A.W. and Azzalini, A. (1997) Applied smoothing techniques for data analysis: the kernel approach with S-Plus illustrations. Oxford University Press, Oxford.
See Also
dclf.test
for the tests;
dclf.progress
for progress plots.
See plot.fv
for information on plotting
objects of class "fv"
.
See also dg.sigtrace
.
Examples
plot(dclf.sigtrace(cells, Lest, nsim=19))
Diggle-Cressie-Loosmore-Ford and Maximum Absolute Deviation Tests
Description
Perform the Diggle (1986) / Cressie (1991) / Loosmore and Ford (2006) test or the Maximum Absolute Deviation test for a spatial point pattern.
Usage
dclf.test(X, ..., alternative=c("two.sided", "less", "greater"),
rinterval = NULL, leaveout=1,
scale=NULL, clamp=FALSE, interpolate=FALSE)
mad.test(X, ..., alternative=c("two.sided", "less", "greater"),
rinterval = NULL, leaveout=1,
scale=NULL, clamp=FALSE, interpolate=FALSE)
Arguments
X |
Data for the test.
Either a point pattern (object of class |
... |
Arguments passed to |
alternative |
The alternative hypothesis. A character string. The default is a two-sided alternative. See Details. |
rinterval |
Interval of values of the summary function argument |
leaveout |
Optional integer 0, 1 or 2 indicating how to calculate the deviation between the observed summary function and the nominal reference value, when the reference value must be estimated by simulation. See Details. |
scale |
Optional. A function in the R language which determines the
relative scale of deviations, as a function of
distance |
clamp |
Logical value indicating how to compute deviations
in a one-sided test. Deviations of the observed
summary function from the theoretical summary function are initially
evaluated as signed real numbers, with large positive values indicating
consistency with the alternative hypothesis.
If |
interpolate |
Logical value specifying whether to calculate the |
Details
These functions perform hypothesis tests for goodness-of-fit of a point pattern dataset to a point process model, based on Monte Carlo simulation from the model.
dclf.test
performs the test advocated by Loosmore and Ford (2006)
which is also described in Diggle (1986), Cressie (1991, page 667, equation
(8.5.42)) and Diggle (2003, page 14). See Baddeley et al (2014) for
detailed discussion.
mad.test
performs the ‘global’ or
‘Maximum Absolute Deviation’ test described by Ripley (1977, 1981).
See Baddeley et al (2014).
The type of test depends on the type of argument X
.
-
If
X
is some kind of point pattern, then a test of Complete Spatial Randomness (CSR) will be performed. That is, the null hypothesis is that the point pattern is completely random. -
If
X
is a fitted point process model, then a test of goodness-of-fit for the fitted model will be performed. The model object contains the data point pattern to which it was originally fitted. The null hypothesis is that the data point pattern is a realisation of the model. -
If
X
is an envelope object generated byenvelope
, then it should have been generated withsavefuns=TRUE
orsavepatterns=TRUE
so that it contains simulation results. These simulations will be treated as realisations from the null hypothesis. -
Alternatively
X
could be a previously-performed test of the same kind (i.e. the result of callingdclf.test
ormad.test
). The simulations used to perform the original test will be re-used to perform the new test (provided these simulations were saved in the original test, by settingsavefuns=TRUE
orsavepatterns=TRUE
).
The argument alternative
specifies the alternative hypothesis,
that is, the direction of deviation that will be considered
statistically significant. If alternative="two.sided"
(the
default), both positive and negative deviations (between
the observed summary function and the theoretical function)
are significant. If alternative="less"
, then only negative
deviations (where the observed summary function is lower than the
theoretical function) are considered. If alternative="greater"
,
then only positive deviations (where the observed summary function is
higher than the theoretical function) are considered.
In all cases, the algorithm will first call envelope
to
generate or extract the simulated summary functions.
The number of simulations that will be generated or extracted,
is determined by the argument nsim
, and defaults to 99.
The summary function that will be computed is determined by the
argument fun
(or the first unnamed argument in the list
...
) and defaults to Kest
(except when X is an envelope object generated with savefuns=TRUE, in which case the stored summary functions will be used).
The choice of summary function fun
affects the power of the
test. It is normally recommended to apply a variance-stabilising
transformation (Ripley, 1981). If you are using the K
function,
the normal practice is to replace this by the L
function
(Besag, 1977) computed by Lest
. If you are using
the F
or G
functions, the recommended practice is to apply
Fisher's variance-stabilising transformation
\sin^{-1}\sqrt x
using the argument
transform
. See the Examples.
The argument rinterval
specifies the interval of
distance values r
which will contribute to the
test statistic (either maximising over this range of values
for mad.test
, or integrating over this range of values
for dclf.test
). This affects the power of the test.
General advice and experiments in Baddeley et al (2014) suggest
that the maximum r
value should be slightly larger than
the maximum possible range of interaction between points. The
dclf.test
is quite sensitive to this choice, while the
mad.test
is relatively insensitive.
It is also possible to specify a pointwise test (i.e. taking a single, fixed value of distance r) by specifying rinterval = c(r,r).
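For example, a sketch of a pointwise test at the single distance r = 0.1 (in the units of the cells dataset):
mad.test(cells, Lest, rinterval=c(0.1, 0.1), nsim=19)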
The argument use.theory
passed to envelope
determines whether to compare the summary function for the data
to its theoretical value for CSR (use.theory=TRUE
)
or to the sample mean of simulations from CSR
(use.theory=FALSE
). The test statistic T
is defined in
equations (10.21) and (10.22) respectively on page 394 of Baddeley,
Rubak and Turner (2015).
The argument leaveout
specifies how to calculate the
discrepancy between the summary function for the data and the
nominal reference value, when the reference value must be estimated
by simulation. The values leaveout=0
and
leaveout=1
are both algebraically equivalent (Baddeley et al, 2014,
Appendix) to computing the difference observed - reference
where the reference
is the mean of simulated values.
The value leaveout=2
gives the leave-two-out discrepancy
proposed by Dao and Genton (2014).
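For example, a sketch of the leave-two-out discrepancy (here use.theory=FALSE forces the reference value to be estimated by simulation, so that leaveout has an effect):
dclf.test(cells, Lest, use.theory=FALSE, leaveout=2, nsim=19)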
Value
An object of class "htest"
.
Printing this object gives a report on the result of the test.
The p
-value is contained in the component p.value
.
Handling Ties
If the observed value of the test statistic is equal to one or more of the simulated values (called a tied value), then the tied values will be assigned a random ordering, and a message will be printed.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Andrew Hardegen and Suman Rakshit.
References
Baddeley, A., Diggle, P.J., Hardegen, A., Lawrence, T., Milne, R.K. and Nair, G. (2014) On tests of spatial pattern based on simulation envelopes. Ecological Monographs 84(3) 477–489.
Baddeley, A., Rubak, E. and Turner, R. (2015) Spatial Point Patterns: Methodology and Applications with R. Chapman and Hall/CRC Press.
Besag, J. (1977) Discussion of Dr Ripley's paper. Journal of the Royal Statistical Society, Series B, 39, 193–195.
Cressie, N.A.C. (1991) Statistics for spatial data. John Wiley and Sons, 1991.
Dao, N.A. and Genton, M. (2014) A Monte Carlo adjusted goodness-of-fit test for parametric models describing spatial point patterns. Journal of Graphical and Computational Statistics 23, 497–517.
Diggle, P. J. (1986). Displaced amacrine cells in the retina of a rabbit: analysis of a bivariate spatial point pattern. J. Neuroscience Methods 18, 115–125.
Diggle, P.J. (2003) Statistical analysis of spatial point patterns, Second edition. Arnold.
Loosmore, N.B. and Ford, E.D. (2006) Statistical inference using the G or K point pattern spatial statistics. Ecology 87, 1925–1931.
Ripley, B.D. (1977) Modelling spatial patterns (with discussion). Journal of the Royal Statistical Society, Series B, 39, 172–212.
Ripley, B.D. (1981) Spatial statistics. John Wiley and Sons.
Examples
dclf.test(cells, Lest, nsim=39)
m <- mad.test(cells, Lest, verbose=FALSE, rinterval=c(0, 0.1), nsim=19)
m
# extract the p-value
m$p.value
# variance stabilised G function
dclf.test(cells, Gest, transform=expression(asin(sqrt(.))),
verbose=FALSE, nsim=19)
## one-sided test
ml <- mad.test(cells, Lest, verbose=FALSE, nsim=19, alternative="less")
## scaled
mad.test(cells, Kest, verbose=FALSE, nsim=19,
rinterval=c(0.05, 0.2),
scale=function(r) { r })
Kernel Smoothed Intensity of Point Pattern
Description
Compute a kernel smoothed intensity function from a point pattern.
Usage
## S3 method for class 'ppp'
density(x, sigma=NULL, ...,
weights=NULL, edge=TRUE, varcov=NULL,
at="pixels", leaveoneout=TRUE,
adjust=1, diggle=FALSE,
se=FALSE, wtype=c("value", "multiplicity"),
kernel="gaussian",
scalekernel=is.character(kernel),
positive=FALSE, verbose=TRUE, sameas)
Arguments
x |
Point pattern (object of class |
sigma |
The smoothing bandwidth (the amount of smoothing).
The standard deviation of the isotropic smoothing kernel.
Either a numerical value,
or a function that computes an appropriate value of |
weights |
Optional weights to be attached to the points.
A numeric vector, numeric matrix, an |
... |
Additional arguments passed to |
edge |
Logical value indicating whether to apply edge correction. |
varcov |
Variance-covariance matrix of anisotropic smoothing kernel.
Incompatible with |
at |
String specifying whether to compute the intensity values
at a grid of pixel locations ( |
leaveoneout |
Logical value indicating whether to compute a leave-one-out
estimator. Applicable only when |
adjust |
Optional. Adjustment factor for the smoothing parameter. |
diggle |
Logical. If |
kernel |
The smoothing kernel.
A character string specifying the smoothing kernel
(current options are |
scalekernel |
Logical value.
If |
se |
Logical value indicating whether to compute standard errors as well. |
wtype |
Character string (partially matched) specifying how the weights should be interpreted for the calculation of standard error. See Details. |
positive |
Logical value indicating whether to force all density values to
be positive numbers. Default is |
verbose |
Logical value indicating whether to issue warnings about numerical problems and conditions. |
sameas |
Optional. The result of a previous evaluation of |
Details
This is a method for the generic function density
.
It computes a fixed-bandwidth kernel estimate
(Diggle, 1985) of the intensity function of the point process
that generated the point pattern x
.
The amount of smoothing is controlled by sigma
if it is specified.
By default, smoothing is performed using a Gaussian kernel.
The resulting density estimate is the convolution of the
isotropic Gaussian kernel, of standard deviation sigma
,
with point masses at each of the data points in x
.
Anisotropic kernels, and non-Gaussian kernels, are also supported.
Each point has unit weight, unless the argument weights
is
given.
If edge=TRUE
(the default), the intensity estimate is corrected
for edge effect bias.
If at="pixels"
(the default), the result is a pixel image
giving the estimated intensity at each pixel in a grid.
If at="points"
, the result is a numeric vector giving the
estimated intensity at each of the original data points in x
.
Value
By default, the result is
a pixel image (object of class "im"
).
Pixel values are estimated intensity values,
expressed in “points per unit area”.
If at="points"
, the result is a numeric vector
of length equal to the number of points in x
.
Values are estimated intensity values at the points of x
.
In either case, the return value has attributes
"sigma"
and "varcov"
which report the smoothing
bandwidth that was used.
If weights
is a matrix with more than one column, then the
result is a list of images (if at="pixels"
) or a matrix of
numerical values (if at="points"
).
If se=TRUE
, the result is a list with two elements named
estimate
and SE
, each of the format described above.
Amount of smoothing
The amount of smoothing is determined by the arguments
sigma
, varcov
and adjust
.
if
sigma
is a single numerical value, this is taken as the standard deviation of the isotropic Gaussian kernel. Alternatively,
sigma
may be a function that computes an appropriate bandwidth from the data point pattern by calling sigma(x)
. To perform automatic bandwidth selection using cross-validation, it is recommended to use the functions bw.diggle
, bw.CvL
, bw.scott
or bw.ppl
.-
The smoothing kernel may be made anisotropic by giving the variance-covariance matrix
varcov
. The arguments sigma
and varcov
are incompatible. -
Alternatively
sigma
may be a vector of length 2 giving the standard deviations of thex
andy
coordinates, thus equivalent tovarcov = diag(rep(sigma^2, 2))
. if neither
sigma
norvarcov
is specified, an isotropic Gaussian kernel will be used, with a default value ofsigma
calculated by a simple rule of thumb that depends only on the size of the window.-
The argument
adjust
makes it easy for the user to change the bandwidth specified by any of the rules above. The value ofsigma
will be multiplied by the factoradjust
. The matrixvarcov
will be multiplied byadjust^2
. To double the smoothing bandwidth, setadjust=2
. -
An infinite bandwidth,
sigma=Inf
or adjust=Inf
, is permitted, and yields an intensity estimate which is constant over the spatial domain.
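For example, a sketch of the rules above:
D1 <- density(cells, sigma=0.08)                    # fixed numeric bandwidth
D2 <- density(cells, sigma=bw.scott)                # bandwidth selection rule
D3 <- density(cells, sigma=0.08, adjust=2)          # doubled bandwidth
D4 <- density(cells, varcov=diag(c(0.05, 0.07)^2))  # anisotropic Gaussian kernel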
Edge correction
If edge=TRUE
, the intensity estimate is corrected for
edge effect bias in one of two ways:
If
diggle=FALSE
(the default) the intensity estimate is corrected by dividing it by the convolution of the Gaussian kernel with the window of observation. This is the approach originally described in Diggle (1985). Thus the intensity value at a point u is
\hat\lambda(u) = e(u) \sum_i k(x_i - u) w_i
where k is the Gaussian smoothing kernel, e(u) is an edge correction factor, and w_i are the weights.
If
diggle=TRUE
then the code uses the improved edge correction described by Jones (1993) and Diggle (2010, equation 18.9). This has been shown to have better performance (Jones, 1993) but is slightly slower to compute. The intensity value at a point u is
\hat\lambda(u) = \sum_i k(x_i - u) w_i e(x_i)
where again k is the Gaussian smoothing kernel, e(x_i) is an edge correction factor, and w_i are the weights.
In both cases, the edge correction term e(u)
is the reciprocal of the
kernel mass inside the window:
\frac{1}{e(u)} = \int_W k(v-u) \, {\rm d}v
where W
is the observation window.
Smoothing kernel
By default, smoothing is performed using a Gaussian kernel.
The choice of smoothing kernel is determined by the argument kernel
.
This should be a character string giving the name of a recognised
two-dimensional kernel
(current options are "gaussian"
, "epanechnikov"
,
"quartic"
or "disc"
),
or a pixel image (object of class "im"
)
containing values of the kernel, or a function(x,y)
which
yields values of the kernel. The default is a Gaussian kernel.
If scalekernel=TRUE
then the kernel values will be rescaled
according to the arguments sigma
, varcov
and
adjust
as explained above, effectively treating
kernel
as the template kernel with standard deviation equal to 1.
This is the default behaviour when kernel
is a character string.
If scalekernel=FALSE
, the kernel values will not be altered,
and the arguments sigma
, varcov
and adjust
are ignored. This is the default behaviour when kernel
is a
pixel image or a function.
Desired output
If at="pixels"
(the default), intensity values are
computed at every location u
in a fine grid,
and are returned as a pixel image. The point pattern is first discretised
using pixellate.ppp
, then the intensity is
computed using the Fast Fourier Transform.
Accuracy depends on the pixel resolution and the discretisation rule.
The pixel resolution is controlled by the arguments
...
passed to as.mask
(specify the number of
pixels by dimyx
or the pixel size by eps
).
The discretisation rule is controlled by the arguments
...
passed to pixellate.ppp
(the default rule is that each point is allocated to the nearest
pixel centre; this can be modified using the arguments
fractional
and preserve
).
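For example, a sketch of controlling the pixel resolution:
density(cells, sigma=0.1, dimyx=256)   # 256 x 256 pixel grid
density(cells, sigma=0.1, eps=0.01)    # square pixels of side 0.01 units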
If at="points"
, the intensity values are computed
to high accuracy at the points of x
only. Computation is
performed by directly evaluating and summing the kernel
contributions without discretising the data. The result is a numeric
vector giving the density values.
The intensity value at a point x_i
is (if diggle=FALSE
)
\hat\lambda(x_i) = e(x_i) \sum_j k(x_j - x_i) w_j
or (if diggle=TRUE
)
\hat\lambda(x_i) = \sum_j k(x_j - x_i) w_j e(x_j)
If leaveoneout=TRUE
(the default), then the sum in the equation
is taken over all j
not equal to i
,
so that the intensity value at a
data point is the sum of kernel contributions from
all other data points.
If leaveoneout=FALSE
then the sum is taken over all j
,
so that the intensity value at a data point includes a contribution
from the same point.
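For example, a sketch of the two estimators at the data points:
lam  <- density(cells, sigma=0.1, at="points")                     # leave-one-out
lam0 <- density(cells, sigma=0.1, at="points", leaveoneout=FALSE)  # includes own contribution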
Weights
If weights
is a matrix with more than one column, then the
calculation is effectively repeated for each column of weights. The
result is a list of images (if at="pixels"
) or a matrix of
numerical values (if at="points"
).
The argument weights
can also be an expression
.
It will be evaluated in the data frame as.data.frame(x)
to obtain a vector or matrix of weights. The expression may involve
the symbols x
and y
representing the Cartesian
coordinates, the symbol marks
representing the mark values
if there is only one column of marks, and the names of the columns of
marks if there are several columns.
The argument weights
can also be a pixel image
(object of class "im"
). Numerical weights for the data points
will be extracted from this image (by looking up the pixel values
at the locations of the data points in x
).
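For example, a sketch using the marked pattern longleaf (tree diameters as marks):
density(longleaf, sigma=10, weights=marks(longleaf))
density(longleaf, sigma=10, weights=expression(marks))  # equivalent expression form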
Standard error
If se=TRUE
, the standard error of the estimate will also be
calculated. The calculation assumes a Poisson point process.
If weights
are given, then the calculation of standard error
depends on the interpretation of the weights. This is controlled by
the argument wtype
.
-
If
wtype="value"
(the default), the weights are interpreted as numerical values observed at the data locations. Roughly speaking, standard errors are proportional to the absolute values of the weights. -
If
wtype="multiplicity"
the weights are interpreted as multiplicities so that a weight of 2 is equivalent to having a pair of duplicated points at the data location. Roughly speaking, standard errors are proportional to the square roots of the weights. Negative weights are not permitted.
The default rule is now wtype="value"
but previous versions
of density.ppp
(in spatstat.explore versions
3.1-0
and earlier) effectively used wtype="multiplicity"
.
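For example, a sketch of requesting standard errors:
est <- density(cells, sigma=0.1, se=TRUE)
plot(est$estimate)
plot(est$SE)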
The meaning of density.ppp
This function is often misunderstood.
The result of density.ppp
is not a spatial smoothing
of the marks or weights attached to the point pattern.
To perform spatial interpolation of values that were observed
at the points of a point pattern, use Smooth.ppp
.
The result of density.ppp
is not a probability density.
It is an estimate of the intensity function of the
point process that generated the point pattern data.
Intensity is the expected number of random points
per unit area.
The units of intensity are “points per unit area”.
Intensity is usually a function of spatial location,
and it is this function which is estimated by density.ppp
.
The integral of the intensity function over a spatial region gives the
expected number of points falling in this region.
Inspecting an estimate of the intensity function is usually the first step in exploring a spatial point pattern dataset. For more explanation, see Baddeley, Rubak and Turner (2015) or Diggle (2003, 2010).
If you have two (or more) types of points, and you want a
probability map or relative risk surface (the spatially-varying
probability of a given type), use relrisk
.
Technical issue: Negative Values
Negative and zero values of the density estimate are possible
when at="pixels"
because of numerical errors in finite-precision
arithmetic.
By default, density.ppp
does not try to repair such errors.
This would take more computation time and is not always needed.
(Also it would not be appropriate if weights
include negative values.)
To ensure that the resulting density values are always positive,
set positive=TRUE
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk
References
Baddeley, A., Rubak, E. and Turner, R. (2015) Spatial Point Patterns: Methodology and Applications with R. Chapman and Hall/CRC Press.
Bithell, J.F. (1990) An application of density estimation to geographical epidemiology. Statistics in Medicine 9, 691–701.
Diggle, P.J. (1985) A kernel method for smoothing point process data. Applied Statistics (Journal of the Royal Statistical Society, Series C) 34 (1985) 138–147.
Diggle, P.J. (2003) Statistical analysis of spatial point patterns, Second edition. Arnold.
Diggle, P.J. (2010) Nonparametric methods. Chapter 18, pp. 299–316 in A.E. Gelfand, P.J. Diggle, M. Fuentes and P. Guttorp (eds.) Handbook of Spatial Statistics, CRC Press, Boca Raton, FL.
Jones, M.C. (1993) Simple boundary corrections for kernel density estimation. Statistics and Computing 3, 135–146.
See Also
To select the bandwidth sigma
automatically by
cross-validation, use
bw.diggle
,
bw.CvL
,
bw.scott
or
bw.ppl
.
To perform spatial interpolation of values that were observed
at the points of a point pattern, use Smooth.ppp
.
For adaptive nonparametric estimation, see
adaptive.density
.
For data sharpening, see sharpen.ppp
.
To compute a relative risk surface or probability map for
two (or more) types of points, use relrisk
.
For information about the data structures, see
ppp.object
,
im.object
.
Examples
if(interactive()) {
opa <- par(mfrow=c(1,2))
plot(density(cells, 0.05))
plot(density(cells, 0.05, diggle=TRUE))
par(opa)
v <- diag(c(0.05, 0.07)^2)
plot(density(cells, varcov=v))
}
# automatic bandwidth selection
plot(density(cells, sigma=bw.diggle(cells)))
# equivalent:
plot(density(cells, bw.diggle))
# evaluate intensity at points
density(cells, 0.05, at="points")
# non-Gaussian kernel
plot(density(cells, sigma=0.4, kernel="epanechnikov"))
if(interactive()) {
# see effect of changing pixel resolution
opa <- par(mfrow=c(1,2))
plot(density(cells, sigma=0.4))
plot(density(cells, sigma=0.4, eps=0.05))
par(opa)
}
# relative risk calculation by hand (see relrisk.ppp)
lung <- split(chorley)$lung
larynx <- split(chorley)$larynx
D <- density(lung, sigma=2)
plot(density(larynx, sigma=2, weights=1/D))
Kernel Smoothing of Line Segment Pattern
Description
Compute a kernel smoothed intensity function from a line segment pattern.
Usage
## S3 method for class 'psp'
density(x, sigma, ..., weights=NULL, edge=TRUE,
method=c("FFT", "C", "interpreted"),
at=NULL)
Arguments
x |
Line segment pattern (object of class |
sigma |
Standard deviation of isotropic Gaussian smoothing kernel. |
... |
Extra arguments, including arguments passed to |
weights |
Optional. Numerical weights for each line segment.
A numeric vector, of length equal to the number of segments in
|
edge |
Logical flag indicating whether to apply edge correction. |
method |
Character string (partially matched) specifying the method of
computation. Option |
at |
Optional. An object specifying the locations where density values
should be computed. Either a window (object of class |
Details
This is the method for the generic function density
for the class "psp"
(line segment patterns).
A kernel estimate of the intensity of the line segment pattern
is computed. The result is
the convolution of the isotropic Gaussian kernel, of
standard deviation sigma
, with the line segments.
The result is computed as follows:
if
method="FFT"
(the default), the line segments are discretised usingpixellate.psp
, then the Fast Fourier Transform is used to calculate the convolution. This method is the fastest, but is slightly less accurate. Accuracy can be improved by increasing pixel resolution.if
method="C"
the exact value of the convolution at the centre of each pixel is computed analytically usingC
code;if
method="interpreted"
, the exact value of the convolution at the centre of each pixel is computed analytically usingR
code. This method is the slowest.
If edge=TRUE
this result is adjusted for edge effects
by dividing it by the convolution of the same Gaussian kernel
with the observation window.
If weights
are given, then the contribution from line segment
i
is multiplied by the value of weights[i]
.
If the argument at
is given, then it specifies the locations
where density values should be computed.
If
at
is a window, then the window is converted to a binary mask using the arguments...
, and density values are computed at the centre of each pixel in this mask. The result is a pixel image.-
If
at
is a point pattern, then density values are computed at each point location, and the result is a numeric vector.
Value
A pixel image (object of class "im"
)
or a numeric vector.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
psp.object
,
im.object
,
density
Examples
L <- psp(runif(20),runif(20),runif(20),runif(20), window=owin())
D <- density(L, sigma=0.03)
plot(D, main="density(L)")
plot(L, add=TRUE)
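# Further hedged sketches: exact computation, and values at query points
D2 <- density(L, sigma=0.03, method="C")
Y <- runifpoint(10, Window(L))
density(L, sigma=0.03, at=Y)   # numeric vector of intensity values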
Kernel Smoothed Intensity of Split Point Pattern
Description
Compute a kernel smoothed intensity function for each of the components of a split point pattern, or each of the point patterns in a list.
Usage
## S3 method for class 'splitppp'
density(x, ..., weights=NULL, se=FALSE)
## S3 method for class 'ppplist'
density(x, ..., weights=NULL, se=FALSE)
Arguments
x |
Split point pattern (object of class |
... |
Arguments passed to |
weights |
Numerical weights for the points. See Details. |
se |
Logical value indicating whether to compute standard errors as well. |
Details
This is a method for the generic function density
.
The argument x
should be a list of point patterns,
and should belong to one of the classes
"ppplist"
or "splitppp"
.
Typically x
is obtained by applying
the function split.ppp
to a point pattern y
by calling split(y)
. This splits the points of y
into several
sub-patterns.
A kernel estimate of the intensity function of each of the
point patterns is computed using density.ppp
.
The return value is usually a list, each of whose entries is a
pixel image (object of class "im"
). The return value
also belongs to the class "solist"
and can be plotted
or printed.
If the argument at="points"
is given, the result is a list
of numeric vectors giving the intensity values at the data points.
If se=TRUE
, the result is a list with two elements named
estimate
and SE
, each of the format described above.
The argument weights
specifies numerical case weights
for the data points.
Normally it should be a list, with the same length as
x
. The entry weights[[i]]
will determine the
case weights for the pattern x[[i]]
, and may be given in
any format acceptable to density.ppp
.
For example, weights[[i]]
can be
a numeric vector of length equal to npoints(x[[i]])
,
a single numeric value, a numeric matrix,
a pixel image (object of class "im"
),
an expression
, or a function of class "funxy"
.
For convenience, weights
can also be a single expression
,
or a single pixel image (object of class "im"
),
or a single function of class "funxy"
.
Value
A list of pixel images (objects of class "im"
)
which can be plotted or printed;
or a list of numeric vectors giving the values at specified points.
If se=TRUE
, the result is a list with two elements named
estimate
and SE
, each of the format described above.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
Examples
Z <- density(split(amacrine), 0.05)
plot(Z)
Adaptive Kernel Estimate of Intensity of Point Pattern
Description
Computes an adaptive estimate of the intensity function of a point pattern using a variable-bandwidth smoothing kernel.
Usage
## S3 method for class 'ppp'
densityAdaptiveKernel(X, bw, ...,
weights=NULL,
at=c("pixels", "points"),
edge=TRUE, ngroups)
Arguments
X |
Point pattern (object of class |
bw |
Numeric vector of smoothing bandwidths for each point in |
... |
Arguments passed to
|
weights |
Optional vector of numeric weights for the points of |
at |
String specifying whether to compute the intensity values
at a grid of pixel locations ( |
edge |
Logical value indicating whether to perform edge correction. |
ngroups |
Number of groups into which the bandwidth values should be partitioned and discretised. |
Details
This function computes a spatially-adaptive kernel estimate of the
spatially-varying intensity from the point pattern X
using the partitioning technique of Davies and Baddeley (2018).
The function densityAdaptiveKernel
is generic.
This file documents the method for point patterns,
densityAdaptiveKernel.ppp
.
The argument bw
specifies the smoothing bandwidths to be
applied to each of the points in X
. It may be a numeric vector
of bandwidth values, or a pixel image or function yielding the
bandwidth values.
If the points of X
are x_1,\ldots,x_n
and the corresponding bandwidths are
\sigma_1,\ldots,\sigma_n
then the adaptive kernel estimate of intensity at a location u
is
\hat\lambda(u) = \sum_{i=1}^n k(u, x_i, \sigma_i)
where k(u, v, \sigma)
is the value at u
of the (possibly edge-corrected) smoothing kernel with bandwidth \sigma
induced by a data point at v
.
Exact computation of the estimate above can be time-consuming:
it takes n
times longer than fixed-bandwidth smoothing.
The partitioning method of Davies and Baddeley (2018)
accelerates this computation by partitioning the range of
bandwidths into ngroups
intervals,
correspondingly subdividing the points of the pattern X
into
ngroups
sub-patterns according to bandwidth,
and applying fixed-bandwidth smoothing to each sub-pattern.
The default value of ngroups
is the integer part of the square root of
the number of points in X
, so that the computation time is
only about \sqrt{n}
times slower than fixed-bandwidth
smoothing. Any positive value of ngroups
can be specified by the user. Specifying ngroups=Inf
enforces exact
computation of the estimate without partitioning. Specifying
ngroups=1
is the same as fixed-bandwidth smoothing with
bandwidth sigma=median(bw)
.
Value
If at="pixels"
(the default), the result is a pixel image.
If at="points"
, the result is a numeric vector with one entry
for each data point in X
.
Bandwidths and Bandwidth Selection
The function densityAdaptiveKernel
computes one adaptive estimate of the intensity,
determined by the smoothing bandwidth values bw
.
Typically the bandwidth values are computed by first computing
a pilot estimate of the intensity, then using bw.abram.ppp
to compute the vector of bandwidths according to Abramson's rule.
This involves specifying a global bandwidth h0
.
The default bandwidths may work well in many contexts, but for optimal
bandwidth selection, this calculation should be performed repeatedly with
different values of h0
to optimise the value of h0
.
This can be computationally demanding; we recommend
the function multiscale.density
in the sparr package
which supports much faster bandwidth selection, using the FFT
method of Davies and Baddeley (2018).
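For example, a sketch of the two-step procedure with an arbitrary trial value h0=0.1:
h <- bw.abram.ppp(redwood, h0=0.1)     # Abramson bandwidths from a pilot estimate
Z <- densityAdaptiveKernel(redwood, h)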
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Tilman Davies.
References
Davies, T.M. and Baddeley, A. (2018) Fast computation of spatially adaptive kernel estimates. Statistics and Computing, 28(4), 937–956.
Hall, P. and Marron, J.S. (1988) Variable window width kernel density estimates of probability densities. Probability Theory and Related Fields, 80, 37–49.
Silverman, B.W. (1986) Density Estimation for Statistics and Data Analysis. Chapman and Hall, New York.
See Also
bw.abram.ppp
,
density.ppp
,
adaptive.density
,
densityVoronoi
,
im.object
.
See the function bivariate.density
in the sparr package
for a more flexible implementation, and
multiscale.density
for an implementation that is more
efficient for bandwidth selection.
Examples
Z <- densityAdaptiveKernel(redwood, h0=0.1)
plot(Z, main="Adaptive kernel estimate")
points(redwood, col="white")
Adaptive Kernel Estimate of Intensity for Split Point Pattern
Description
Computes an adaptive estimate of the intensity function (using a variable-bandwidth smoothing kernel) for each of the components of a split point pattern, or each of the point patterns in a list.
Usage
## S3 method for class 'splitppp'
densityAdaptiveKernel(X, bw=NULL, ..., weights=NULL)
## S3 method for class 'ppplist'
densityAdaptiveKernel(X, bw=NULL, ..., weights=NULL)
Arguments
X |
Split point pattern (object of class |
bw |
Smoothing bandwidths. See Details. |
... |
Additional arguments passed to
|
weights |
Numerical weights for the points. See Details. |
Details
This function computes a spatially-adaptive kernel estimate of the
spatially-varying intensity for each of the point patterns
in the list X
, using densityAdaptiveKernel.ppp
.
The argument bw
specifies smoothing bandwidths
for the data points.
Normally it should be a list, with the same length as X. The entry bw[[i]] will determine the smoothing bandwidths for the pattern X[[i]], and may be given in any format acceptable to densityAdaptiveKernel.ppp.
For example, bw[[i]]
can be
a numeric vector of length equal to npoints(X[[i]])
,
a single numeric value,
a pixel image (object of class "im"
),
an expression
, or a function of class "funxy"
.
For convenience, bw
can also be a single expression
,
or a single pixel image, or a single function.
If bw
is missing or NULL
, the default is to compute
bandwidths using bw.abram.ppp
.
The argument weights
specifies numerical case weights
for the data points.
Normally it should be a list, with the same length as X. The entry weights[[i]] will determine the case weights for the pattern X[[i]], and may be given in any format acceptable to density.ppp.
For example, weights[[i]]
can be
a numeric vector of length equal to npoints(X[[i]])
,
a single numeric value, a numeric matrix,
a pixel image (object of class "im"
),
an expression
, or a function of class "funxy"
.
For convenience, weights
can also be a single expression
,
or a single pixel image (object of class "im"
),
or a single function of class "funxy"
.
If weights
is missing or NULL
, all weights are assumed
to be equal to 1.
Value
A list of pixel images (objects of class "im"
)
which can be plotted or printed;
or a list of numeric vectors giving the values at specified points.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
See Also
densityAdaptiveKernel.ppp
,
bw.abram.ppp
.
Examples
X <- amacrine
if(!interactive()) X <- X[c(TRUE,FALSE,FALSE,FALSE)]
Z <- densityAdaptiveKernel(split(X), h0=0.15)
plot(Z, main="Adaptive kernel estimate")
Diffusion Estimate of Point Pattern Intensity
Description
Computes a diffusion estimate of intensity for a point pattern.
Usage
densityHeat(x, sigma, ...)
Arguments
x |
Point pattern (object of class |
sigma |
Smoothing bandwidth. Usually a single number giving the equivalent standard deviation of the smoother. |
... |
Additional arguments depending on the method. |
Details
The generic function densityHeat
computes an
estimate of point process intensity using a diffusion kernel method.
Further details depend on the class of point pattern x
.
See the help file for the appropriate method.
Value
Depends on the class of x
.
Author(s)
Adrian Baddeley and Tilman Davies.
See Also
For two-dimensional point patterns (objects of class "ppp"
),
the diffusion kernel estimator is densityHeat.ppp
.
The usual kernel estimator is density.ppp
,
and the tessellation-based estimator is
adaptive.density
.
Diffusion Estimate of Point Pattern Intensity
Description
Computes the diffusion estimate of the intensity of a point pattern.
Usage
## S3 method for class 'ppp'
densityHeat(x, sigma, ..., weights=NULL,
connect=8, symmetric=FALSE,
sigmaX=NULL, k=1, show=FALSE, se=FALSE,
at=c("pixels", "points"),
leaveoneout = TRUE,
extrapolate = FALSE, coarsen = TRUE,
verbose=TRUE, internal=NULL)
Arguments
x |
Point pattern (object of class |
sigma |
Smoothing bandwidth. A single number giving the equivalent
standard deviation of the smoother.
Alternatively, a pixel image (class |
... |
Arguments passed to |
weights |
Optional numeric vector of weights associated with each point of
|
connect |
Grid connectivity: either 4 or 8. |
symmetric |
Logical value indicating whether to force the algorithm to use a symmetric random walk. |
sigmaX |
Numeric vector of bandwidths, one associated with each data point in
|
k |
Integer. Calculations will be performed by repeatedly multiplying
the current state by the |
show |
Logical value indicating whether to plot successive iterations. |
se |
Logical value indicating whether to compute standard errors. |
at |
Character string specifying whether to compute values
at a grid of pixels ( |
leaveoneout |
Logical value specifying whether to compute a leave-one-out
estimate at each data point, when |
extrapolate |
Logical value specifying whether to use Richardson extrapolation to improve the accuracy of the computation. |
coarsen |
Logical value, controlling the calculation performed when
|
verbose |
Logical value specifying whether to print progress reports. |
internal |
Developer use only. |
Details
This command computes a diffusion kernel estimate
of point process intensity from the observed point pattern x
.
The function densityHeat
is generic,
with methods for point patterns in two dimensions
(class "ppp"
) and point patterns on a linear network
(class "lpp"
). The function densityHeat.ppp
described
here is the method for class "ppp"
. Given a two-dimensional
point pattern x
, it computes a diffusion kernel estimate
of the intensity of the point process which generated x
.
Diffusion kernel estimates were developed by Botev et al (2010), Barry and McIntyre (2011) and Baddeley et al (2022).
Barry and McIntyre (2011) proposed an estimator for point process intensity based on a random walk on the pixel grid inside the observation window. Baddeley et al (2022) showed that the Barry-McIntyre method is a special case of the diffusion estimator proposed by Botev et al (2010).
The original Barry-McIntyre algorithm assumes a symmetric random walk
(i.e. each possible transition has the same probability p
)
and requires a square pixel grid (i.e. equal
spacing in the x
and y
directions). Their original
algorithm is used if symmetric=TRUE
. Use the ...
arguments to ensure a square grid: for example, the argument
eps
specifies a square grid with spacing eps
units.
The more general algorithm used here (Baddeley et al, 2022)
does not require a square grid of pixels.
If the pixel grid is not square, and if symmetric=FALSE
(the default), then the random walk is not symmetric,
in the sense that the probabilities of different jumps will be
different, in order to ensure that the smoothing is isotropic.
This implementation also includes two generalizations to the case of adaptive smoothing (Baddeley et al, 2022).
In the first version of adaptive smoothing, the bandwidth is
spatially-varying.
The argument sigma
should be a pixel image (class "im"
)
or a function(x,y)
specifying the bandwidth at each spatial
location. The smoothing is performed by solving the
heat equation with spatially-varying parameters.
In the second version of adaptive smoothing, each data point in
x
is smoothed using a separate bandwidth.
The argument sigmaX
should be a numeric vector
specifying the bandwidth for each point of x
.
The smoothing is performed using the lagged arrival algorithm.
The argument sigma
can be omitted.
If extrapolate=FALSE
(the default), calculations are performed
using the Euler scheme for the heat equation.
If extrapolate=TRUE
, the accuracy of the result will be
improved by applying Richardson extrapolation (Baddeley et al, 2022, Section
4). After computing the intensity estimate using the Euler scheme
on the desired pixel grid, another estimate is computed using the same
method on another pixel grid, and the two estimates are combined by
Richardson extrapolation to obtain a more accurate result.
The second grid is coarser than the original grid if
coarsen=TRUE
(the default), and finer than the original grid
if coarsen=FALSE
. Setting extrapolate=TRUE
increases
computation time by 35% if coarsen=TRUE
and by 400% if
coarsen=FALSE
.
Value
Pixel image (object of class "im"
) giving the estimated
intensity of the point process.
If se=TRUE
, the result has an attribute "se"
which is another pixel image giving the estimated standard error.
If at="points"
then the result is a numeric vector
with one entry for each point of x
.
Author(s)
Adrian Baddeley and Tilman Davies.
References
Baddeley, A., Davies, T., Rakshit, S., Nair, G. and McSwiggan, G. (2022) Diffusion smoothing for spatial point patterns. Statistical Science 37 (1) 123–142.
Barry, R.P. and McIntyre, J. (2011) Estimating animal densities and home range in regions with irregular boundaries and holes: a lattice-based alternative to the kernel density estimator. Ecological Modelling 222, 1666–1672.
Botev, Z.I., Grotowski, J.F. and Kroese, D.P. (2010) Kernel density estimation via diffusion. Annals of Statistics 38, 2916–2957.
See Also
density.ppp
for the usual kernel estimator,
and adaptive.density
for the
tessellation-based estimator.
Examples
online <- interactive()
if(!online) op <- spatstat.options(npixel=32)
X <- runifpoint(25, letterR)
Z <- densityHeat(X, 0.2)
if(online) {
plot(Z, main="Diffusion estimator")
plot(X, add=TRUE, pch=16)
integral(Z) # should equal 25
}
Z <- densityHeat(X, 0.2, se=TRUE)
Zse <- attr(Z, "se")
if(online) plot(solist(estimate=Z, SE=Zse), main="")
Zex <- densityHeat(X, 0.2, extrapolate=TRUE)
ZS <- densityHeat(X, 0.2, symmetric=TRUE, eps=0.125)
if(online) {
plot(ZS, main="fixed bandwidth")
plot(X, add=TRUE, pch=16)
}
sig <- function(x,y) { (x-1.5)/10 }
ZZ <- densityHeat(X, sig)
if(online) {
plot(ZZ, main="adaptive (I)")
plot(X, add=TRUE, pch=16)
}
sigX <- sig(X$x, X$y)
AA <- densityHeat(X, sigmaX=sigX)
if(online) {
plot(AA, main="adaptive (II)")
plot(X, add=TRUE, pch=16)
}
if(!online) spatstat.options(op)
Intensity Estimate of Point Pattern Using Voronoi-Dirichlet Tessellation
Description
Computes an adaptive estimate of the intensity function of a point pattern using the Dirichlet-Voronoi tessellation.
Usage
densityVoronoi(X, ...)
## S3 method for class 'ppp'
densityVoronoi(X, f = 1, ...,
counting=FALSE,
fixed=FALSE,
nrep = 1, verbose=TRUE)
Arguments
X |
Point pattern dataset (object of class |
f |
Fraction (between 0 and 1 inclusive) of the data points that will be used to build a tessellation for the intensity estimate. |
... |
Arguments passed to |
counting |
Logical value specifying the choice of estimation method. See Details. |
fixed |
Logical. If |
nrep |
Number of independent repetitions of the randomised procedure. |
verbose |
Logical value indicating whether to print progress reports. |
Details
This function is an alternative to density.ppp
. It
computes an estimate of the intensity function of a point pattern
dataset. The result is a pixel image giving the estimated intensity.
If f=1
(the default), the Voronoi estimate (Barr and Schoenberg, 2010)
is computed: the point pattern X
is used to construct
a Voronoi/Dirichlet tessellation (see dirichlet
);
the areas of the Dirichlet tiles are computed; the estimated intensity
in each tile is the reciprocal of the tile area.
The result is a pixel image
of intensity estimates which are constant on each tile of the tessellation.
If f=0
, the intensity estimate at every location is
equal to the average intensity (number of points divided by window area).
The result is a pixel image
of intensity estimates which are constant.
If f
is strictly between 0 and 1,
the estimation method is applied to a random subset of X
.
This randomised procedure is repeated nrep
times,
and the results are averaged.
The subset is selected as follows:
-
if
fixed=FALSE
, the datasetX
is randomly thinned by deleting or retaining each point independently, with probabilityf
of retaining a point. -
if
fixed=TRUE
, a random sample of fixed sizem
is taken from the datasetX
, wherem
is the largest integer less than or equal tof*n
andn
is the number of points inX
.
Then the intensity estimate is calculated as follows:
if
counting = FALSE
(the default), the thinned pattern is used to construct a Dirichlet tessellation and form the Voronoi estimate (Barr and Schoenberg, 2010), which is then adjusted by a factor 1/f or n/m as appropriate, to obtain an estimate of the intensity of X
in the tile.if
counting = TRUE
, the randomly selected subsetA
is used to construct a Dirichlet tessellation, while the complementary subsetB
(consisting of points that were not selected in the sample) is used for counting to calculate a quadrat count estimate of intensity. For each tile of the Dirichlet tessellation formed byA
, we count the number of points ofB
falling in the tile, and divide by the area of the same tile, to obtain an estimate of the intensity of the patternB
in the tile. This estimate is adjusted by1/(1-f)
orn/(n-m)
as appropriate to obtain an estimate of the intensity ofX
in the tile.
Ogata et al. (2003) and Ogata (2004) estimated intensity using the
Dirichlet-Voronoi tessellation in a modelling context.
Baddeley (2007) proposed intensity estimation by subsampling
with 0 < f < 1
, and used the technique described above
with fixed=TRUE
and counting=TRUE
.
Barr and Schoenberg (2010) described and analysed the
Voronoi estimator (corresponding to f=1
).
Moradi et al (2019) developed the subsampling technique with
fixed=FALSE
and counting=FALSE
and called it the
smoothed Voronoi estimator.
Value
A pixel image (object of class "im"
) whose values are
estimates of the intensity of X
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk and Mehdi Moradi m2.moradi@yahoo.com.
References
Baddeley, A. (2007) Validation of statistical models for spatial point patterns. In J.G. Babu and E.D. Feigelson (eds.) SCMA IV: Statistical Challenges in Modern Astronomy IV, volume 317 of Astronomical Society of the Pacific Conference Series, San Francisco, California USA, 2007. Pages 22–38.
Barr, C., and Schoenberg, F.P. (2010). On the Voronoi estimator for the intensity of an inhomogeneous planar Poisson process. Biometrika 97 (4), 977–984.
Moradi, M., Cronie, O., Rubak, E., Lachieze-Rey, R., Mateu, J. and Baddeley, A. (2019) Resample-smoothing of Voronoi intensity estimators. Statistics and Computing 29 (5) 995–1010.
Ogata, Y. (2004) Space-time model for regional seismicity and detection of crustal stress changes. Journal of Geophysical Research, 109, 2004.
Ogata, Y., Katsura, K. and Tanemura, M. (2003). Modelling heterogeneous space-time occurrences of earthquakes and its residual analysis. Applied Statistics 52 499–509.
See Also
adaptive.density
,
density.ppp
,
dirichlet
,
im.object
.
Examples
plot(densityVoronoi(nztrees, 1, f=1), main="Voronoi estimate")
nr <- if(interactive()) 100 else 5
plot(densityVoronoi(nztrees, f=0.5, nrep=nr), main="smoothed Voronoi estimate")
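# A further hedged sketch: fixed-size subsampling with quadrat counting
plot(densityVoronoi(nztrees, f=0.5, fixed=TRUE, counting=TRUE, nrep=nr),
     main="subsampled Voronoi estimate")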
Kernel Estimate of Intensity as a Spatial Function
Description
Compute a kernel estimate of intensity for a point pattern, and return the result as a function of spatial location.
Usage
densityfun(X, ...)
## S3 method for class 'ppp'
densityfun(X, sigma = NULL, ...,
weights = NULL, edge = TRUE, diggle = FALSE)
Arguments
X |
Point pattern (object of class |
sigma |
Smoothing bandwidth, or bandwidth selection function,
passed to |
... |
Additional arguments passed to |
weights |
Optional vector of weights associated with the points of |
edge , diggle |
Logical arguments controlling the edge correction.
Arguments passed to |
Details
The commands densityfun
and density
both perform kernel estimation of the intensity of a point pattern.
The difference is that density
returns a pixel image,
containing the estimated intensity values at a grid of locations, while
densityfun
returns a function(x,y)
which can be used
to compute the intensity estimate at any spatial locations
with coordinates x,y
.
For purposes such as model-fitting it is more accurate to
use densityfun
.
Value
A function
with arguments x,y,drop
.
The function also belongs to the class "densityfun"
which has
methods for print
and as.im
.
It also belongs to the class "funxy"
which has methods
for plot
, contour
and persp
.
Using the result of densityfun
If f <- densityfun(X)
, where X
is a two-dimensional
point pattern, the resulting object f
is a function
in the R language.
By calling this function, the user can evaluate the estimated intensity at any desired spatial locations.
Additionally f
belongs to other
classes which allow it to be printed and plotted easily.
The function f
has arguments x,y,drop
.
The arguments
x,y
of f
specify the query locations. They can be numeric vectors of coordinates. Alternatively x can be a point pattern (or data acceptable to as.ppp) and y is omitted. The result of f(x,y) is a numeric vector giving the values of the intensity.
The argument
drop
off
specifies how to handle query locations which are outside the window of the original data. Ifdrop=TRUE
(the default), such locations are ignored. Ifdrop=FALSE
, a value ofNA
is returned for each such location.
Note that the smoothing parameters, such as the bandwidth
sigma
, are assigned when densityfun
is executed.
Smoothing parameters are fixed inside the function f
and cannot be changed by arguments of f
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
To interpolate values observed at the points, use Smoothfun
.
Examples
f <- densityfun(swedishpines)
f
f(42, 60)
X <- runifpoint(2, Window(swedishpines))
f(X)
plot(f)
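# A further hedged sketch: convert the function to a pixel image (as.im method)
plot(as.im(f))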
Calculate Derivative of Function Values
Description
Applies numerical differentiation to the values in selected columns of a function value table.
Usage
## S3 method for class 'fv'
deriv(expr, which = "*", ...,
method=c("spline", "numeric"),
kinks=NULL,
periodic=FALSE,
Dperiodic=periodic)
Arguments
expr |
Function values to be differentiated.
A function value table (object of class |
which |
Character vector identifying which columns of the table
should be differentiated. Either a vector containing names
of columns, or one of the wildcard strings |
... |
Extra arguments passed to |
method |
Differentiation method. A character string, partially matched
to either |
kinks |
Optional vector of |
periodic |
Logical value indicating whether the function |
Dperiodic |
Logical value indicating whether the resulting derivative should be a periodic function. |
Details
This command performs numerical differentiation on the function values in
a function value table (object of class "fv"
).
The differentiation is performed either by
smooth.spline
or by
a naive numerical difference algorithm.
The command deriv
is generic. This is the
method for objects of class "fv"
.
Differentiation is applied to every column
(or to each of the selected columns) of function values in turn,
using the function argument as the x
coordinate
and the selected column as the y
coordinate.
The original function values are then replaced by the corresponding
derivatives.
The optional argument which
specifies which of the
columns of function values in expr
will be differentiated.
The default (indicated by the wildcard which="*"
)
is to differentiate all function values, i.e. all columns except the
function argument. Alternatively which="."
designates
the subset of function values that are displayed in the default plot.
Alternatively which
can be a character vector containing the
names of columns of expr
.
If the argument kinks
is given, it should be a numeric vector
giving the discontinuity points of the function: the value or values
of the function argument at which the function is
not differentiable. Differentiation will be performed separately on
intervals between the discontinuity points.
If periodic=TRUE
then the function expr
is taken to be
periodic, with period equal to the range of the function
argument in expr
. The resulting derivative is periodic.
If periodic=FALSE
but Dperiodic=TRUE
, then the
derivative is assumed to be periodic. This would be
appropriate if expr
is the cumulative distribution function
of an angular variable, for example.
Value
Another function value table (object of class "fv"
)
of the same format.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
See Also
with.fv
,
fv.object
,
smooth.spline
Examples
G <- Gest(cells)
plot(deriv(G, which=".", spar=0.5))
A <- pairorient(redwood, 0.05, 0.15)
DA <- deriv(A, spar=0.6, Dperiodic=TRUE)
Global Envelopes for Dao-Genton Test
Description
Computes the global envelopes corresponding to the Dao-Genton test of goodness-of-fit.
Usage
dg.envelope(X, ...,
nsim = 19, nsimsub=nsim-1, nrank = 1,
alternative=c("two.sided", "less", "greater"),
leaveout=1, interpolate = FALSE,
savefuns=FALSE, savepatterns=FALSE,
verbose = TRUE)
Arguments
X |
Either a point pattern dataset (object of class |
... |
Arguments passed to
|
nsim |
Number of simulated patterns to be generated in the primary experiment. |
nsimsub |
Number of simulations in each basic test. There will be |
nrank |
Integer. Rank of the envelope value amongst the |
alternative |
Character string determining whether the envelope corresponds
to a two-sided test ( |
leaveout |
Optional integer 0, 1 or 2 indicating how to calculate the deviation between the observed summary function and the nominal reference value, when the reference value must be estimated by simulation. See Details. |
interpolate |
Logical value indicating whether to interpolate the distribution of the test statistic by kernel smoothing, as described in Dao and Genton (2014, Section 5). |
savefuns |
Logical flag indicating whether to save the simulated function values (from the first stage). |
savepatterns |
Logical flag indicating whether to save the simulated point patterns (from the first stage). |
verbose |
Logical value determining whether to print progress reports. |
Details
Computes global simulation envelopes corresponding to the Dao-Genton (2014) adjusted Monte Carlo goodness-of-fit test. The envelopes were developed in Baddeley et al (2015) and described in Baddeley, Rubak and Turner (2015).
If X
is a point pattern, the null hypothesis is CSR.
If X
is a fitted model, the null hypothesis is that model.
The Dao-Genton test is biased when the significance level is very small
(small p
-values are not reliable) and
we recommend bits.envelope
in this case.
Value
An object of class "fv"
.
Author(s)
Adrian Baddeley, Andrew Hardegen, Tom Lawrence, Robin Milne, Gopalan Nair and Suman Rakshit. Implemented by Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Dao, N.A. and Genton, M. (2014) A Monte Carlo adjusted goodness-of-fit test for parametric models describing spatial point patterns. Journal of Computational and Graphical Statistics 23, 497–517.
Baddeley, A., Hardegen, A., Lawrence, T., Milne, R.K., Nair, G.M. and Rakshit, S. (2015) Pushing the envelope: extensions of graphical Monte Carlo tests. Unpublished manuscript.
Baddeley, A., Rubak, E. and Turner, R. (2015) Spatial Point Patterns: Methodology and Applications with R. Chapman and Hall/CRC Press.
Examples
ns <- if(interactive()) 19 else 4
E <- dg.envelope(swedishpines, Lest, nsim=ns)
E
plot(E)
Eo <- dg.envelope(swedishpines, Lest, alternative="less", nsim=ns)
Ei <- dg.envelope(swedishpines, Lest, interpolate=TRUE, nsim=ns)
Progress Plot of Dao-Genton Test of Spatial Pattern
Description
Generates a progress plot (envelope representation) of the Dao-Genton test for a spatial point pattern.
Usage
dg.progress(X, fun = Lest, ...,
exponent = 2, nsim = 19, nsimsub = nsim - 1,
nrank = 1, alpha, leaveout=1, interpolate = FALSE, rmin=0,
savefuns = FALSE, savepatterns = FALSE, verbose=TRUE)
Arguments
X |
Either a point pattern (object of class |
fun |
Function that computes the desired summary statistic for a point pattern. |
... |
Arguments passed to |
exponent |
Positive number. The exponent of the |
nsim |
Number of repetitions of the basic test. |
nsimsub |
Number of simulations in each basic test. There will be |
nrank |
Integer. The rank of the critical value of the Monte Carlo test,
amongst the |
alpha |
Optional. The significance level of the test.
Equivalent to |
leaveout |
Optional integer 0, 1 or 2 indicating how to calculate the deviation between the observed summary function and the nominal reference value, when the reference value must be estimated by simulation. See Details. |
interpolate |
Logical value indicating how to compute the critical value.
If |
rmin |
Optional. Left endpoint for the interval of |
savefuns |
Logical value indicating whether to save the simulated function values (from the first stage). |
savepatterns |
Logical value indicating whether to save the simulated point patterns (from the first stage). |
verbose |
Logical value indicating whether to print progress reports. |
Details
The Dao and Genton (2014) test for a spatial point pattern
is described in dg.test
.
This test depends on the choice of an interval of
distance values (the argument rinterval
).
A progress plot or envelope representation
of the test (Baddeley et al, 2014, 2015; Baddeley, Rubak and Turner, 2015) is a plot of the
test statistic (and the corresponding critical value) against the length of
the interval rinterval
.
The command dg.progress
effectively performs
dg.test
on X
using all possible intervals
of the form [0,R]
, and returns the resulting values of the test
statistic, and the corresponding critical values of the test,
as a function of R
.
The result is an object of class "fv"
that can be plotted to obtain the progress plot. The display shows
the test statistic (solid black line) and the test
acceptance region (grey shading).
If X
is an envelope object, then some of the data stored
in X
may be re-used:
- If X is an envelope object containing simulated functions, and fun=NULL, then the code will re-use the simulated functions stored in X.
- If X is an envelope object containing simulated point patterns, then fun will be applied to the stored point patterns to obtain the simulated functions. If fun is not specified, it defaults to Lest.
- Otherwise, new simulations will be performed, and fun defaults to Lest.
If the argument rmin
is given, it specifies the left endpoint
of the interval defining the test statistic: the tests are
performed using intervals [rmin, R] where R \ge rmin.
The argument leaveout
specifies how to calculate the
discrepancy between the summary function for the data and the
nominal reference value, when the reference value must be estimated
by simulation. The values leaveout=0
and
leaveout=1
are both algebraically equivalent (Baddeley et al, 2014,
Appendix) to computing the difference observed - reference
where the reference
is the mean of simulated values.
The value leaveout=2
gives the leave-two-out discrepancy
proposed by Dao and Genton (2014).
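For instance (a minimal sketch; the value rmin=0.02 is an arbitrary illustration):
ns <- if(interactive()) 19 else 5
# progress plot computed over intervals [0.02, R] instead of [0, R]
plot(dg.progress(cells, Lest, nsim=ns, rmin=0.02))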
Value
An object of class "fv"
that can be plotted to
obtain the progress plot.
Author(s)
Adrian Baddeley, Andrew Hardegen, Tom Lawrence, Robin Milne, Gopalan Nair and Suman Rakshit. Implemented by Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Baddeley, A., Diggle, P., Hardegen, A., Lawrence, T., Milne, R. and Nair, G. (2014) On tests of spatial pattern based on simulation envelopes. Ecological Monographs 84 (3) 477–489.
Baddeley, A., Hardegen, A., Lawrence, T., Milne, R.K., Nair, G.M. and Rakshit, S. (2015) Pushing the envelope: extensions of graphical Monte Carlo tests. Unpublished manuscript.
Baddeley, A., Rubak, E. and Turner, R. (2015) Spatial Point Patterns: Methodology and Applications with R. Chapman and Hall/CRC Press.
Dao, N.A. and Genton, M. (2014) A Monte Carlo adjusted goodness-of-fit test for parametric models describing spatial point patterns. Journal of Computational and Graphical Statistics 23, 497–517.
Examples
ns <- if(interactive()) 19 else 5
plot(dg.progress(cells, nsim=ns))
Significance Trace of Dao-Genton Test
Description
Generates a Significance Trace of the Dao and Genton (2014) test for a spatial point pattern.
Usage
dg.sigtrace(X, fun = Lest, ...,
exponent = 2, nsim = 19, nsimsub = nsim - 1,
alternative = c("two.sided", "less", "greater"),
rmin=0, leaveout=1,
interpolate = FALSE, confint = TRUE, alpha = 0.05,
savefuns=FALSE, savepatterns=FALSE, verbose=FALSE)
Arguments
X |
Either a point pattern (object of class |
fun |
Function that computes the desired summary statistic for a point pattern. |
... |
Arguments passed to |
exponent |
Positive number. Exponent used in the test statistic. Use |
nsim |
Number of repetitions of the basic test. |
nsimsub |
Number of simulations in each basic test. There will be |
alternative |
Character string specifying the alternative hypothesis.
The default ( |
rmin |
Optional. Left endpoint for the interval of |
leaveout |
Optional integer 0, 1 or 2 indicating how to calculate the deviation between the observed summary function and the nominal reference value, when the reference value must be estimated by simulation. See Details. |
interpolate |
Logical value indicating whether to interpolate the distribution of the test statistic by kernel smoothing, as described in Dao and Genton (2014, Section 5). |
confint |
Logical value indicating whether to compute a confidence interval
for the ‘true’ |
alpha |
Significance level to be plotted (this has no effect on the calculation but is simply plotted as a reference value). |
savefuns |
Logical flag indicating whether to save the simulated function values (from the first stage). |
savepatterns |
Logical flag indicating whether to save the simulated point patterns (from the first stage). |
verbose |
Logical flag indicating whether to print progress reports. |
Details
The Dao and Genton (2014) test for a spatial point pattern
is described in dg.test
.
This test depends on the choice of an interval of
distance values (the argument rinterval
).
A significance trace (Bowman and Azzalini, 1997;
Baddeley et al, 2014, 2015; Baddeley, Rubak and Turner, 2015)
of the test is a plot of the p
-value
obtained from the test against the length of
the interval rinterval
.
The command dg.sigtrace
effectively performs
dg.test
on X
using all possible intervals
of the form [0,R]
, and returns the resulting p
-values
as a function of R
.
The result is an object of class "fv"
that can be plotted to
obtain the significance trace. The plot shows the
Dao-Genton adjusted
p
-value (solid black line),
the critical value 0.05
(dashed red line),
and a pointwise 95% confidence band (grey shading)
for the ‘true’ (Neyman-Pearson) p
-value.
The confidence band is based on the Agresti-Coull (1998)
confidence interval for a binomial proportion.
If X
is an envelope object and fun=NULL
then
the code will re-use the simulated functions stored in X
.
If the argument rmin
is given, it specifies the left endpoint
of the interval defining the test statistic: the tests are
performed using intervals [rmin, R] where R \ge rmin.
The argument leaveout
specifies how to calculate the
discrepancy between the summary function for the data and the
nominal reference value, when the reference value must be estimated
by simulation. The values leaveout=0
and
leaveout=1
are both algebraically equivalent (Baddeley et al, 2014,
Appendix) to computing the difference observed - reference
where the reference
is the mean of simulated values.
The value leaveout=2
gives the leave-two-out discrepancy
proposed by Dao and Genton (2014).
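A minimal sketch (the choice of Gest and leaveout=2 is illustrative):
ns <- if(interactive()) 19 else 5
# significance trace using the leave-two-out discrepancy
plot(dg.sigtrace(cells, Gest, nsim=ns, leaveout=2))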
Value
An object of class "fv"
that can be plotted to
obtain the significance trace.
Author(s)
Adrian Baddeley, Andrew Hardegen, Tom Lawrence, Robin Milne, Gopalan Nair and Suman Rakshit. Implemented by Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Agresti, A. and Coull, B.A. (1998) Approximate is better than “Exact” for interval estimation of binomial proportions. American Statistician 52, 119–126.
Baddeley, A., Diggle, P., Hardegen, A., Lawrence, T., Milne, R. and Nair, G. (2014) On tests of spatial pattern based on simulation envelopes. Ecological Monographs 84(3) 477–489.
Baddeley, A., Hardegen, A., Lawrence, T., Milne, R.K., Nair, G.M. and Rakshit, S. (2015) Pushing the envelope: extensions of graphical Monte Carlo tests. Unpublished manuscript.
Baddeley, A., Rubak, E. and Turner, R. (2015) Spatial Point Patterns: Methodology and Applications with R. Chapman and Hall/CRC Press.
Bowman, A.W. and Azzalini, A. (1997) Applied smoothing techniques for data analysis: the kernel approach with S-Plus illustrations. Oxford University Press, Oxford.
Dao, N.A. and Genton, M. (2014) A Monte Carlo adjusted goodness-of-fit test for parametric models describing spatial point patterns. Journal of Computational and Graphical Statistics 23, 497–517.
See Also
dg.test
for the Dao-Genton test,
dclf.sigtrace
for significance traces of other tests.
Examples
ns <- if(interactive()) 19 else 5
plot(dg.sigtrace(cells, nsim=ns))
Dao-Genton Adjusted Goodness-Of-Fit Test
Description
Performs the Dao and Genton (2014) adjusted goodness-of-fit test of spatial pattern.
Usage
dg.test(X, ...,
exponent = 2, nsim=19, nsimsub=nsim-1,
alternative=c("two.sided", "less", "greater"),
reuse = TRUE, leaveout=1, interpolate = FALSE,
savefuns=FALSE, savepatterns=FALSE,
verbose = TRUE)
Arguments
X |
Either a point pattern dataset (object of class |
... |
Arguments passed to |
exponent |
Exponent used in the test statistic. Use |
nsim |
Number of repetitions of the basic test. |
nsimsub |
Number of simulations in each basic test. There will be |
alternative |
Character string specifying the alternative hypothesis.
The default ( |
reuse |
Logical value indicating whether to re-use the first stage simulations at the second stage, as described by Dao and Genton (2014). |
leaveout |
Optional integer 0, 1 or 2 indicating how to calculate the deviation between the observed summary function and the nominal reference value, when the reference value must be estimated by simulation. See Details. |
interpolate |
Logical value indicating whether to interpolate the distribution of the test statistic by kernel smoothing, as described in Dao and Genton (2014, Section 5). |
savefuns |
Logical flag indicating whether to save the simulated function values (from the first stage). |
savepatterns |
Logical flag indicating whether to save the simulated point patterns (from the first stage). |
verbose |
Logical value indicating whether to print progress reports. |
Details
Performs the Dao-Genton (2014) adjusted Monte Carlo goodness-of-fit test, in the equivalent form described by Baddeley et al (2014).
If X
is a point pattern, the null hypothesis is CSR.
If X
is a fitted model, the null hypothesis is that model.
The argument use.theory
passed to envelope
determines whether to compare the summary function for the data
to its theoretical value for CSR (use.theory=TRUE
)
or to the sample mean of simulations from CSR
(use.theory=FALSE
).
The argument leaveout
specifies how to calculate the
discrepancy between the summary function for the data and the
nominal reference value, when the reference value must be estimated
by simulation. The values leaveout=0
and
leaveout=1
are both algebraically equivalent (Baddeley et al, 2014,
Appendix) to computing the difference observed - reference
where the reference
is the mean of simulated values.
The value leaveout=2
gives the leave-two-out discrepancy
proposed by Dao and Genton (2014).
The Dao-Genton test is biased when the significance level is very small
(small p
-values are not reliable) and
we recommend bits.test
in this case.
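For example (a minimal sketch):
ns <- if(interactive()) 19 else 4
# explicitly request the leave-two-out discrepancy of Dao and Genton (2014)
dg.test(cells, nsim=ns, leaveout=2)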
Value
A hypothesis test (object of class "htest") which can be printed to show the outcome of the test.
Author(s)
Adrian Baddeley, Andrew Hardegen, Tom Lawrence, Robin Milne, Gopalan Nair and Suman Rakshit. Implemented by Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Dao, N.A. and Genton, M. (2014) A Monte Carlo adjusted goodness-of-fit test for parametric models describing spatial point patterns. Journal of Computational and Graphical Statistics 23, 497–517.
Baddeley, A., Diggle, P.J., Hardegen, A., Lawrence, T., Milne, R.K. and Nair, G. (2014) On tests of spatial pattern based on simulation envelopes. Ecological Monographs 84 (3) 477–489.
Baddeley, A., Hardegen, A., Lawrence, T., Milne, R.K., Nair, G.M. and Rakshit, S. (2017) On two-stage Monte Carlo tests of composite hypotheses. Computational Statistics and Data Analysis 114, 75–87.
See Also
bits.test
,
dclf.test
,
mad.test
Examples
ns <- if(interactive()) 19 else 4
dg.test(cells, nsim=ns)
dg.test(cells, alternative="less", nsim=ns)
dg.test(cells, nsim=ns, interpolate=TRUE)
Estimate Dimension of Central Subspace
Description
Given the kernel matrix that characterises a central subspace, this function estimates the dimension of the subspace.
Usage
dimhat(M)
Arguments
M |
Kernel of subspace. A symmetric, non-negative definite, numeric
matrix, typically obtained from |
Details
This function computes the maximum descent estimate of
the dimension of the central subspace with a given kernel matrix M
.
The matrix M
should be the kernel matrix of a central subspace,
which can be obtained from sdr
. It must be a symmetric,
non-negative-definite, numeric matrix.
The algorithm finds the eigenvalues
\lambda_1 \ge \ldots \ge \lambda_n
of M
,
and then determines the index k
for which
\lambda_k/\lambda_{k-1}
is greatest.
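A self-contained synthetic sketch (the eigenvalue spectrum below is an assumption, chosen so that a sharp drop occurs after the second eigenvalue):
V <- qr.Q(qr(matrix(rnorm(25), 5, 5)))              # random orthonormal basis
M <- V %*% diag(c(10, 8, 0.2, 0.1, 0.05)) %*% t(V)  # symmetric, non-negative definite
dimhat(M)   # the spectrum drops sharply after the second component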
Value
A single integer giving the estimated dimension.
Author(s)
Matlab original by Yongtao Guan, translated to R by Suman Rakshit.
References
Guan, Y. and Wang, H. (2010) Sufficient dimension reduction for spatial point processes directed by Gaussian random fields. Journal of the Royal Statistical Society, Series B, 72, 367–387.
Distribution Function of Interpoint Distance
Description
Computes the cumulative distribution function of the distance between two independent random points in a given window or windows.
Usage
distcdf(W, V=W, ..., dW=1, dV=dW, nr=1024,
regularise=TRUE, savedenom=FALSE, delta=NULL)
Arguments
W |
A window (object of class |
V |
Optional. Another window containing the second random point.
Defaults to |
... |
Arguments passed to |
dV , dW |
Optional. Probability densities (not necessarily normalised)
for the first and second random points respectively.
Data in any format acceptable
to |
nr |
Integer. The number of values of interpoint distance |
regularise |
Logical value indicating whether to smooth the results for very small distances, to avoid discretisation artefacts. |
savedenom |
Logical value indicating whether to save the denominator of the double integral as an attribute of the result. |
delta |
Optional. A positive number. The maximum permitted spacing between values of the function argument. |
Details
This command computes the Cumulative Distribution Function
CDF(r) = Prob(T \le r)
of the Euclidean distance T = \|X_1 - X_2\|
between two independent random points X_1
and X_2
.
In the simplest case, the command distcdf(W)
, the random points are
assumed to be uniformly distributed in the same
window W
.
Alternatively the two random points may be
uniformly distributed in two different windows W
and V
.
In the most general case the first point X_1
is random
in the window W
with a probability density proportional to
dW
, and the second point X_2
is random in
a different window V
with probability density proportional
to dV
. The values of dW
and dV
must be
finite and nonnegative.
The calculation is performed by numerical integration of the set covariance
function setcov
for uniformly distributed points, and
by computing the covariance function imcov
in the
general case. The accuracy of the result depends on
the pixel resolution used to represent the windows: this is controlled
by the arguments ...
which are passed to as.mask
.
For example use eps=0.1
to specify pixels of size 0.1 units.
The arguments W
or V
may also be point patterns
(objects of class "ppp"
).
The result is the cumulative distribution function
of the distance from a randomly selected point in the point pattern,
to a randomly selected point in the other point pattern or window.
If regularise=TRUE
(the default), values of the cumulative
distribution function for very short distances are smoothed to avoid
discretisation artefacts. Smoothing is applied to all distances
shorter than the width of 10 pixels.
Numerical accuracy of some calculations requires
very fine spacing of the values of the function argument r
.
If the argument delta
is given, then
after the cumulative distribution function
has been calculated, it will be interpolated onto a finer grid of r
values with spacing less than or equal to delta
.
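For instance (a minimal sketch; the two windows are illustrative choices):
W <- square(1)
V <- disc(radius=0.2, centre=c(0.5, 0.5))
# CDF of the distance between independent uniform random points in W and V
plot(distcdf(W, V))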
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
Examples
# The unit disc
B <- disc()
plot(distcdf(B))
Extract the Domain of any Spatial Object
Description
Given a spatial object such as a point pattern, in any number of dimensions, this function extracts the spatial domain in which the object is defined.
Usage
## S3 method for class 'quadrattest'
domain(X, ...)
Arguments
X |
A spatial object such as a point pattern (in any number of dimensions), line segment pattern or pixel image. |
... |
Extra arguments. They are ignored by all the methods listed here. |
Details
The function domain
is generic.
For a spatial object X
in any number of dimensions,
domain(X)
extracts the spatial domain in which X
is
defined.
For a two-dimensional object X
, typically domain(X)
is the same as Window(X)
.
Exceptions occur for methods related to linear networks.
Value
A spatial object representing the domain of X
.
Typically a window (object of class "owin"
),
a three-dimensional box ("box3"
), a multidimensional
box ("boxx"
) or a linear network ("linnet"
).
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
domain
,
domain.quadratcount
,
domain.ppm
,
domain.rmhmodel
,
domain.lpp
.
Window
,
Frame
.
Examples
domain(quadrat.test(redwood, 2, 2))
Ripley's Isotropic Edge Correction
Description
Computes Ripley's isotropic edge correction weights for a point pattern.
Usage
edge.Ripley(X, r, W = Window(X), method = c("C", "interpreted"),
maxweight = 100, internal=list())
rmax.Ripley(W)
Arguments
X |
Point pattern (object of class |
W |
Window for which the edge correction is required. |
r |
Vector or matrix of interpoint distances for which the edge correction should be computed. |
method |
Choice of algorithm. Either |
maxweight |
Maximum permitted value of the edge correction weight. |
internal |
For developer use only. |
Details
The function edge.Ripley
computes Ripley's (1977) isotropic edge correction
weight, which is used in estimating the K
function and in many
other contexts.
The function rmax.Ripley
computes the maximum value of
distance r
for which the isotropic edge correction
estimate of K(r)
is valid.
For a single point x
in a window W
,
and a distance r > 0
, the isotropic edge correction weight
is
e(u, r) = \frac{2\pi r}{\mbox{length}(c(u,r) \cap W)}
where c(u,r)
is the circle of radius r
centred at the
point u
. The denominator is the length of the overlap between
this circle and the window W
.
The function edge.Ripley
computes this edge correction weight
for each point in the point pattern X
and for each
corresponding distance value in the vector or matrix r
.
If r
is a vector, with one entry for each point in
X
, then the result is a vector containing the
edge correction weights e(X[i], r[i])
for each i
.
If r
is a matrix, with one row for each point in X
,
then the result is a matrix whose i,j
entry gives the
edge correction weight e(X[i], r[i,j])
.
For example edge.Ripley(X, pairdist(X))
computes all the
edge corrections required for the K
-function.
If any value of the edge correction weight exceeds maxweight, it is set to maxweight.
The function rmax.Ripley
computes the smallest distance r
such that it is possible to draw a circle of radius r
, centred
at a point of W
, such that the circle does not intersect the
interior of W
.
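As a further sketch (using nearest-neighbour distances as an illustrative choice of r):
# one weight per point, evaluated at that point's nearest-neighbour distance
w <- edge.Ripley(cells, nndist(cells))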
Value
A numeric vector or matrix.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
References
Ripley, B.D. (1977) Modelling spatial patterns (with discussion). Journal of the Royal Statistical Society, Series B, 39, 172 – 212.
Examples
v <- edge.Ripley(cells, pairdist(cells))
rmax.Ripley(Window(cells))
Translation Edge Correction
Description
Computes Ohser and Stoyan's translation edge correction weights for a point pattern.
Usage
edge.Trans(X, Y = X, W = Window(X),
exact = FALSE, paired = FALSE,
...,
trim = spatstat.options("maxedgewt"),
dx=NULL, dy=NULL,
give.rmax=FALSE, gW=NULL)
rmax.Trans(W, g=setcov(W))
Arguments
X , Y |
Point patterns (objects of class |
W |
Window for which the edge correction is required. |
exact |
Logical. If |
paired |
Logical value indicating whether |
... |
Ignored. |
trim |
Maximum permitted value of the edge correction weight. |
dx , dy |
Alternative data giving the |
give.rmax |
Logical. If |
g , gW |
Optional. Set covariance of |
Details
The function edge.Trans
computes Ohser and Stoyan's translation edge correction
weight, which is used in estimating the K
function and in many
other contexts.
The function rmax.Trans
computes the maximum value of
distance r
for which the translation edge correction
estimate of K(r)
is valid.
For a pair of points x
and y
in a window W
,
the translation edge correction weight
is
e(u, r) = \frac{\mbox{area}(W)}{\mbox{area}(W \cap (W + y - x))}
where W + y - x
is the result of shifting the window W
by the vector y - x
. The denominator is the area of the overlap between
this shifted window and the original window.
The function edge.Trans
computes this edge correction weight.
If paired=TRUE
, then X
and Y
should contain the
same number of points. The result is a vector containing the
edge correction weights e(X[i], Y[i])
for each i
.
If paired=FALSE
,
then the result is a matrix whose i,j
entry gives the
edge correction weight e(X[i], Y[j])
.
Computation is exact if the window is a rectangle. Otherwise,
- if exact=TRUE, the edge correction weights are computed exactly using overlap.owin, which can be quite slow;
- if exact=FALSE (the default), the weights are computed rapidly by evaluating the set covariance function setcov using the Fast Fourier Transform.
If any value of the edge correction weight exceeds trim
,
it is set to trim
.
The arguments dx
and dy
can be provided as
an alternative to X
and Y
.
If paired=TRUE
then dx,dy
should be vectors of equal length
such that the vector difference of the i
th pair is
c(dx[i], dy[i])
. If paired=FALSE
then
dx,dy
should be matrices of the same dimensions,
such that the vector difference between X[i]
and Y[j]
is
c(dx[i,j], dy[i,j])
. The argument W
is needed.
The value of rmax.Trans
is the shortest distance from the
origin (0,0)
to the boundary of the support of
the set covariance function of W
. It is computed by pixel
approximation using setcov
, unless W
is a
rectangle, when rmax.Trans(W)
is the length of the
shortest side of the rectangle.
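A brief sketch of the paired form (splitting cells into two halves is purely illustrative):
X <- cells[1:21]
Y <- cells[22:42]
# one weight e(X[i], Y[i]) for each pair of corresponding points
w <- edge.Trans(X, Y, paired=TRUE)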
Value
Numeric vector or matrix.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net.
References
Ohser, J. (1983) On estimators for the reduced second moment measure of point processes. Mathematische Operationsforschung und Statistik, series Statistics, 14, 63 – 71.
See Also
rmax.Trans
,
edge.Ripley
,
setcov
,
Kest
Examples
v <- edge.Trans(cells)
rmax.Trans(Window(cells))
Simulation Envelopes of Summary Function
Description
Computes simulation envelopes of a summary function.
Usage
envelope(Y, fun, ...)
## S3 method for class 'ppp'
envelope(Y, fun=Kest, nsim=99, nrank=1, ...,
funargs=list(), funYargs=funargs,
simulate=NULL, fix.n=FALSE, fix.marks=FALSE,
verbose=TRUE, clipdata=TRUE,
transform=NULL, global=FALSE, ginterval=NULL, use.theory=NULL,
alternative=c("two.sided", "less", "greater"),
scale=NULL, clamp=FALSE,
savefuns=FALSE, savepatterns=FALSE,
nsim2=nsim, VARIANCE=FALSE, nSD=2, Yname=NULL,
maxnerr=nsim, rejectNA=FALSE, silent=FALSE,
do.pwrong=FALSE, envir.simul=NULL)
Arguments
Y |
Object containing point pattern data.
A point pattern (object of class
|
fun |
Function that computes the desired summary statistic for a point pattern. |
nsim |
Number of simulated point patterns to be generated when computing the envelopes. |
nrank |
Integer. Rank of the envelope value amongst the |
... |
Extra arguments passed to |
funargs |
A list, containing extra arguments to be passed to |
funYargs |
Optional. A list, containing extra arguments to be passed to
|
simulate |
Optional. Specifies how to generate the simulated point patterns.
If |
fix.n |
Logical. If |
fix.marks |
Logical. If |
verbose |
Logical flag indicating whether to print progress reports during the simulations. |
clipdata |
Logical flag indicating whether the data point pattern should be
clipped to the same window as the simulated patterns,
before the summary function for the data is computed.
This should usually be |
transform |
Optional. A transformation to be applied to the function values, before the envelopes are computed. An expression object (see Details). |
global |
Logical flag indicating whether envelopes should be pointwise
( |
ginterval |
Optional.
A vector of length 2 specifying
the interval of |
use.theory |
Logical value indicating whether to use the theoretical value,
computed by |
alternative |
Character string determining whether the envelope corresponds
to a two-sided test ( |
scale |
Optional. Scaling function for global envelopes.
A function in the R language which determines the
relative scale of deviations, as a function of
distance |
clamp |
Logical value indicating how to compute envelopes when
|
savefuns |
Logical flag indicating whether to save all the simulated function values. |
savepatterns |
Logical flag indicating whether to save all the simulated point patterns. |
nsim2 |
Number of extra simulated point patterns to be generated
if it is necessary to use simulation to estimate the theoretical
mean of the summary function. Only relevant when |
VARIANCE |
Logical. If |
nSD |
Number of estimated standard deviations used to determine
the critical envelopes, if |
Yname |
Character string that should be used as the name of the
data point pattern |
maxnerr |
Maximum number of rejected patterns.
If |
rejectNA |
Logical value specifying whether to reject a simulated pattern
if the resulting values of |
silent |
Logical value specifying whether to print a report each time a simulated pattern is rejected. |
do.pwrong |
Logical. If |
envir.simul |
Environment in which to evaluate the expression |
Details
The envelope
command performs simulations and
computes envelopes of a summary statistic based on the simulations.
The result is an object that can be plotted to display the envelopes.
The envelopes can be used to assess the goodness-of-fit of
a point process model to point pattern data.
For the most basic use, if you have a point pattern X
and
you want to test Complete Spatial Randomness (CSR), type
plot(envelope(X, Kest, nsim=39))
to see the K
function
for X
plotted together with the envelopes of the
K
function for 39 simulations of CSR.
The envelope
function is generic, with methods for
the classes "ppp"
, "ppm"
, "kppm"
and "slrm"
described here. There are also methods for the classes "pp3"
,
"lpp"
and "lppm"
which are described separately
under envelope.pp3
and envelope.lpp
.
Envelopes can also be computed from other envelopes, using
envelope.envelope
.
To create simulation envelopes, the command envelope(Y, ...)
first generates nsim
random point patterns
in one of the following ways.
- If Y is a point pattern (an object of class "ppp") and simulate=NULL, then we generate nsim simulations of Complete Spatial Randomness (i.e. nsim simulated point patterns each being a realisation of the uniform Poisson point process) with the same intensity as the pattern Y. (If Y is a multitype point pattern, then the simulated patterns are also given independent random marks; the probability distribution of the random marks is determined by the relative frequencies of marks in Y.)
- If Y is a fitted point process model (an object of class "ppm" or "kppm" or "slrm") and simulate=NULL, then this routine generates nsim simulated realisations of that model.
- If simulate is supplied, then it determines how the simulated point patterns are generated. It may be either
  - an expression in the R language, typically containing a call to a random generator. This expression will be evaluated nsim times to yield nsim point patterns. For example if simulate=expression(runifpoint(100)) then each simulated pattern consists of exactly 100 independent uniform random points.
  - a function in the R language, typically containing a call to a random generator. This function will be applied repeatedly to the original data pattern Y to yield nsim point patterns. For example if simulate=rlabel then each simulated pattern is generated by evaluating rlabel(Y) and consists of a randomly-relabelled version of Y.
  - a list of point patterns. The entries in this list will be taken as the simulated patterns.
  - an object of class "envelope". This should have been produced by calling envelope with the argument savepatterns=TRUE. The simulated point patterns that were saved in this object will be extracted and used as the simulated patterns for the new envelope computation. This makes it possible to plot envelopes for two different summary functions based on exactly the same set of simulated point patterns.
The summary statistic fun
is applied to each of these simulated
patterns. Typically fun
is one of the functions
Kest
, Gest
, Fest
, Jest
, pcf
,
Kcross
, Kdot
, Gcross
, Gdot
,
Jcross
, Jdot
, Kmulti
, Gmulti
,
Jmulti
or Kinhom
. It may also be a character string
containing the name of one of these functions.
The statistic fun
can also be a user-supplied function;
if so, then it must have arguments X
and r
like those in the functions listed above, and it must return an object
of class "fv"
.
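A minimal sketch of a user-supplied summary function (myfun is a hypothetical name; here it simply delegates to Kest):
myfun <- function(X, r=NULL, ...) {
  Kest(X, r=r, ...)   # must return an object of class "fv"
}
E <- envelope(cells, myfun, nsim=19)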
Upper and lower critical envelopes are computed in one of the following ways:
- pointwise:
by default, envelopes are calculated pointwise (i.e. for each value of the distance argument r), by sorting the nsim simulated values, and taking the m-th lowest and m-th highest values, where m = nrank. For example if nrank=1, the upper and lower envelopes are the pointwise maximum and minimum of the simulated values.
The pointwise envelopes are not “confidence bands” for the true value of the function! Rather, they specify the critical points for a Monte Carlo test (Ripley, 1981). The test is constructed by choosing a fixed value of r, and rejecting the null hypothesis if the observed function value lies outside the envelope at this value of r. This test has exact significance level alpha = 2 * nrank/(1 + nsim).
- simultaneous:
if global=TRUE, then the envelopes are determined as follows. First we calculate the theoretical mean value of the summary statistic (if we are testing CSR, the theoretical value is supplied by fun; otherwise we perform a separate set of nsim2 simulations, compute the average of all these simulated values, and take this average as an estimate of the theoretical mean value). Then, for each simulation, we compare the simulated curve to the theoretical curve, and compute the maximum absolute difference between them (over the interval of r values specified by ginterval). This gives a deviation value d_i for each of the nsim simulations. Finally we take the m-th largest of the deviation values, where m=nrank, and call this dcrit. Then the simultaneous envelopes are of the form lo = expected - dcrit and hi = expected + dcrit where expected is either the theoretical mean value theo (if we are testing CSR) or the estimated theoretical value mmean (if we are testing another model). The simultaneous critical envelopes have constant width 2 * dcrit.
The simultaneous critical envelopes allow us to perform a different Monte Carlo test (Ripley, 1981). The test rejects the null hypothesis if the graph of the observed function lies outside the envelope at any value of r. This test has exact significance level alpha = nrank/(1 + nsim).
This test can also be performed using mad.test.
- based on sample moments:
if VARIANCE=TRUE, the algorithm calculates the (pointwise) sample mean and sample variance of the simulated functions. Then the envelopes are computed as mean plus or minus nSD standard deviations. These envelopes do not have an exact significance interpretation. They are a naive approximation to the critical points of the Neyman-Pearson test assuming the summary statistic is approximately Normally distributed.
The return value is an object of class "fv"
containing
the summary function for the data point pattern,
the upper and lower simulation envelopes, and
the theoretical expected value (exact or estimated) of the summary function
for the model being tested. It can be plotted
using plot.envelope
.
If VARIANCE=TRUE
then the return value also includes the
sample mean, sample variance and other quantities.
Arguments can be passed to the function fun
through
...
. This means that you simply specify these arguments in the call to
envelope
, and they will be passed to fun
.
In particular, the argument correction
determines the edge correction to be used to calculate the summary
statistic. See the section on Edge Corrections, and the Examples.
Arguments can also be passed to the function fun
through the list funargs
. This mechanism is typically used if
an argument of fun
has the same name as an argument of
envelope
. The list funargs
should contain
entries of the form name=value
, where each name
is the name
of an argument of fun
.
There is also an option, rarely used, in which different function
arguments are used when computing the summary function
for the data Y
and for the simulated patterns.
If funYargs
is given, it will be used
when the summary function for the data Y
is computed,
while funargs
will be used when computing the summary function
for the simulated patterns.
This option is only needed in rare cases: usually the basic principle
requires that the data and simulated patterns must be treated
equally, so that funargs
and funYargs
should be identical.
If Y
is a fitted cluster point process model (object of
class "kppm"
), and simulate=NULL
,
then the model is simulated directly
using simulate.kppm
.
If Y
is a fitted Gibbs point process model (object of
class "ppm"
), and simulate=NULL
,
then the model is simulated
by running the Metropolis-Hastings algorithm rmh
.
Complete control over this algorithm is provided by the
arguments start
and control
which are passed
to rmh
.
For simultaneous critical envelopes (global=TRUE
)
the following options are also useful:
- ginterval determines the interval of r values over which the deviation between curves is calculated. It should be a numeric vector of length 2. There is a sensible default (namely, the recommended plotting interval for fun(X), or the range of r values if r is explicitly specified).
- transform specifies a transformation of the summary function fun that will be carried out before the deviations are computed. Such transforms are useful if global=TRUE or VARIANCE=TRUE. The transform must be an expression object using the symbol . to represent the function value (and possibly other symbols recognised by with.fv). For example, the conventional way to normalise the K function (Ripley, 1981) is to transform it to the L function L(r) = \sqrt{K(r)/\pi} and this is implemented by setting transform=expression(sqrt(./pi)).
It is also possible to extract the summary functions for each of the
individual simulated point patterns, by setting savefuns=TRUE
.
Then the return value also
has an attribute "simfuns"
containing all the
summary functions for the individual simulated patterns.
It is an "fv"
object containing
functions named sim1, sim2, ...
representing the nsim
summary functions.
It is also possible to save the simulated point patterns themselves,
by setting savepatterns=TRUE
. Then the return value also has
an attribute "simpatterns"
which is a list of length
nsim
containing all the simulated point patterns.
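For instance (a minimal sketch):
E <- envelope(cells, Kest, nsim=19, savefuns=TRUE, savepatterns=TRUE)
fns  <- attr(E, "simfuns")      # "fv" object with columns sim1, ..., sim19
pats <- attr(E, "simpatterns")  # list of the 19 simulated point patterns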
See plot.envelope
and plot.fv
for information about how to plot the envelopes.
Different envelopes can be recomputed from the same data
using envelope.envelope
.
Envelopes can be combined using pool.envelope
.
Value
An object of class "envelope"
and "fv"
, see fv.object
,
which can be printed and plotted directly.
Essentially a data frame containing columns
r |
the vector of values of the argument |
obs |
values of the summary function for the data point pattern |
lo |
lower envelope of simulations |
hi |
upper envelope of simulations |
and either
theo |
theoretical value of the summary function under CSR (Complete Spatial Randomness, a uniform Poisson point process) if the simulations were generated according to CSR |
mmean |
estimated theoretical value of the summary function, computed by averaging simulated values, if the simulations were not generated according to CSR. |
Additionally, if savepatterns=TRUE
, the return value has an attribute
"simpatterns"
which is a list containing the nsim
simulated patterns. If savefuns=TRUE
, the return value
has an attribute "simfuns"
which is an object of class
"fv"
containing the summary functions
computed for each of the nsim
simulated patterns.
Errors and warnings
An error may be generated if one of the simulations produces a
point pattern that is empty, or is otherwise unacceptable to the
function fun
.
The upper envelope may be NA
(plotted as plus or minus
infinity) if some of the function values
computed for the simulated point patterns are NA
.
Whether this occurs will depend on the function fun
,
but it usually happens when the simulated point pattern does not contain
enough points to compute a meaningful value.
Confidence intervals
Simulation envelopes do not compute confidence intervals;
they generate significance bands.
If you really need a confidence interval for the true summary function
of the point process, use lohboot
.
See also varblock
.
Edge corrections
It is common to apply a correction for edge effects when
calculating a summary function such as the K
function.
Typically the user has a choice between several possible edge
corrections.
In a call to envelope
, the user can specify the edge correction
to be applied in fun
, using the argument correction
.
See the Examples below.
- Summary functions in spatstat:
Summary functions that are available in spatstat, such as Kest, Gest and pcf, have a standard argument called correction which specifies the name of one or more edge corrections.
The list of available edge corrections is different for each summary function, and may also depend on the kind of window in which the point pattern is recorded. In the case of Kest (the default and most frequently used value of fun) the best edge correction is Ripley's isotropic correction if the window is rectangular or polygonal, and the translation correction if the window is a binary mask. See the help files for the individual functions for more information.
All the summary functions in spatstat recognise the option correction="best" which gives the “best” (most accurate) available edge correction for that function.
In a call to envelope, if fun is one of the summary functions provided in spatstat, then the default is correction="best". This means that by default, the envelope will be computed using the “best” available edge correction.
The user can override this default by specifying the argument correction. For example the computation can be accelerated by choosing another edge correction which is less accurate than the “best” one, but faster to compute.
- User-written summary functions:
If fun is a function written by the user, then envelope has to guess what to do.
If fun has an argument called correction, or has ... arguments, then envelope assumes that the function can handle a correction argument. To compute the envelope, fun will be called with a correction argument. The default is correction="best", unless overridden in the call to envelope.
Otherwise, if fun does not have an argument called correction and does not have ... arguments, then envelope assumes that the function cannot handle a correction argument. To compute the envelope, fun is called without a correction argument.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Baddeley, A., Diggle, P.J., Hardegen, A., Lawrence, T., Milne, R.K. and Nair, G. (2014) On tests of spatial pattern based on simulation envelopes. Ecological Monographs 84 (3) 477–489.
Cressie, N.A.C. (1991) Statistics for spatial data. John Wiley and Sons.
Diggle, P.J. (2003) Statistical analysis of spatial point patterns. Arnold.
Ripley, B.D. (1981) Spatial statistics. John Wiley and Sons.
Ripley, B.D. (1988) Statistical inference for spatial processes. Cambridge University Press.
Stoyan, D. and Stoyan, H. (1994) Fractals, random shapes and point fields: methods of geometrical statistics. John Wiley and Sons.
See Also
dclf.test
,
mad.test
for envelope-based tests.
fv.object
,
plot.envelope
,
plot.fv
,
envelope.envelope
,
pool.envelope
for handling envelopes.
There are also methods for print
and summary
.
Kest
,
Gest
,
Fest
,
Jest
,
pcf
,
ppp
,
ppm
,
default.expand
Examples
X <- simdat
online <- interactive()
Nsim <- if(online) 19 else 3
# Envelope of K function under CSR
plot(envelope(X, nsim=Nsim))
# Translation edge correction (this is also FASTER):
if(online) {
plot(envelope(X, correction="translate"))
} else {
E <- envelope(X, nsim=Nsim, correction="translate")
}
# Global envelopes
if(online) {
plot(envelope(X, Lest, global=TRUE))
plot(envelope(X, Kest, global=TRUE, scale=function(r) { r }))
} else {
E <- envelope(X, Lest, nsim=Nsim, global=TRUE)
E <- envelope(X, Kest, nsim=Nsim, global=TRUE, scale=function(r) { r })
E
summary(E)
}
# Envelope of G function under CSR
if(online) {
plot(envelope(X, Gest))
} else {
E <- envelope(X, Gest, correction="rs", nsim=Nsim)
}
# Envelope of L function under CSR
# L(r) = sqrt(K(r)/pi)
if(online) {
E <- envelope(X, Kest)
} else {
E <- envelope(X, Kest, correction="border", nsim=Nsim)
}
plot(E, sqrt(./pi) ~ r)
# Simultaneous critical envelope for L function
# (alternatively, use Lest)
if(online) {
plot(envelope(X, Kest, transform=expression(sqrt(./pi)), global=TRUE))
} else {
E <- envelope(X, Kest, nsim=Nsim, correction="border",
transform=expression(sqrt(./pi)), global=TRUE)
}
## One-sided envelope
if(online) {
plot(envelope(X, Lest, alternative="less"))
} else {
E <- envelope(X, Lest, nsim=Nsim, alternative="less")
}
# How to pass arguments needed to compute the summary functions:
# We want envelopes for Jcross(X, "A", "B")
# where "A" and "B" are types of points in the dataset 'demopat'
if(online) {
plot(envelope(demopat, Jcross, i="A", j="B"))
} else {
plot(envelope(demopat, Jcross, correction="rs", i="A", j="B", nsim=Nsim))
}
# Use of `simulate' expression
if(online) {
plot(envelope(cells, Gest, simulate=expression(runifpoint(42))))
plot(envelope(cells, Gest, simulate=expression(rMaternI(100,0.02))))
} else {
plot(envelope(cells, Gest, correction="rs", simulate=expression(runifpoint(42)), nsim=Nsim))
plot(envelope(cells, Gest, correction="rs", simulate=expression(rMaternI(100, 0.02)),
nsim=Nsim, global=TRUE))
}
# Use of `simulate' function
if(online) {
plot(envelope(amacrine, Kcross, simulate=rlabel))
} else {
plot(envelope(amacrine, Kcross, simulate=rlabel, nsim=Nsim))
}
# Envelope under random toroidal shifts
if(online) {
plot(envelope(amacrine, Kcross, i="on", j="off",
simulate=expression(rshift(amacrine, radius=0.25))))
}
# Envelope under random shifts with erosion
if(online) {
plot(envelope(amacrine, Kcross, i="on", j="off",
simulate=expression(rshift(amacrine, radius=0.1, edge="erode"))))
}
# Note that the principle of symmetry, essential to the validity of
# simulation envelopes, requires that both the observed and
# simulated patterns be subjected to the same method of intensity
# estimation. In the following example it would be incorrect to set the
# argument 'lambda=red.dens' in the envelope command, because this
# would mean that the inhomogeneous K functions of the simulated
# patterns would be computed using the intensity function estimated
# from the original redwood data, violating the symmetry. There is
# still a concern about the fact that the simulations are generated
# from a model that was fitted to the data; this is only a problem in
# small datasets.
if(online) {
red.dens <- density(redwood, sigma=bw.diggle, positive=TRUE)
plot(envelope(redwood, Kinhom, sigma=bw.diggle,
simulate=expression(rpoispp(red.dens))))
}
# Precomputed list of point patterns
if(online) {
nX <- npoints(X)
PatList <- list()
for(i in 1:Nsim) PatList[[i]] <- runifpoint(nX)
E <- envelope(X, Kest, nsim=19, simulate=PatList)
} else {
PatList <- list()
for(i in 1:Nsim) PatList[[i]] <- runifpoint(10)
}
E <- envelope(X, Kest, nsim=Nsim, simulate=PatList)
# re-using the same point patterns
EK <- envelope(X, Kest, nsim=Nsim, savepatterns=TRUE)
EG <- envelope(X, Gest, nsim=Nsim, simulate=EK)
Recompute Envelopes
Description
Given a simulation envelope (object of class "envelope"
),
compute another envelope from the same simulation data
using different parameters.
Usage
## S3 method for class 'envelope'
envelope(Y, fun = NULL, ...,
transform=NULL, global=FALSE, VARIANCE=FALSE)
Arguments
Y |
A simulation envelope (object of class |
fun |
Optional. Summary function to be applied to the simulated point patterns. |
... , transform , global , VARIANCE |
Parameters controlling the type of envelope that is re-computed.
See |
Details
This function can be used to re-compute a simulation envelope from previously simulated data, using different parameter settings for the envelope: for example, a different significance level, or a global envelope instead of a pointwise envelope.
The function envelope
is generic. This is the method for
the class "envelope"
.
The argument Y
should be a simulation envelope (object of
class "envelope"
) produced by any of the methods for
envelope
. Additionally, Y
must contain either
- the simulated point patterns that were used to create the original envelope (so Y should have been created by calling envelope with savepatterns=TRUE);
- the summary functions of the simulated point patterns that were used to create the original envelope (so Y should have been created by calling envelope with savefuns=TRUE).
is given, it should be a summary function
that can be applied to the simulated point patterns that were
used to create Y
. The envelope of
the summary function fun
for these point patterns
will be computed using the parameters specified in ...
.
If fun
is not given, then:
- If Y contains the summary functions that were used to compute the original envelope, then the new envelope will be computed from these original summary functions.
- Otherwise, if Y contains the simulated point patterns, then the K function Kest will be applied to each of these simulated point patterns, and the new envelope will be based on the K functions.
will be computed using the parameters specified in ...
.
See envelope
for a full list of envelope parameters.
Frequently-used parameters include nrank
and nsim
(to change the
number of simulations used and the significance level of the
envelope), global
(to change from pointwise to global
envelopes) and VARIANCE
(to compute the envelopes from the sample
moments instead of the ranks).
Value
An envelope (object of class "envelope").
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
Examples
E <- envelope(cells, Kest, nsim=19, savefuns=TRUE, savepatterns=TRUE)
E2 <- envelope(E, nrank=2)
Eg <- envelope(E, global=TRUE)
EG <- envelope(E, Gest)
EL <- envelope(E, transform=expression(sqrt(./pi)))
Simulation Envelopes of Summary Function for 3D Point Pattern
Description
Computes simulation envelopes of a summary function for a three-dimensional point pattern.
Usage
## S3 method for class 'pp3'
envelope(Y, fun=K3est, nsim=99, nrank=1, ...,
funargs=list(), funYargs=funargs, simulate=NULL, verbose=TRUE,
transform=NULL,global=FALSE,ginterval=NULL,use.theory=NULL,
alternative=c("two.sided", "less", "greater"),
scale=NULL, clamp=FALSE,
savefuns=FALSE, savepatterns=FALSE,
nsim2=nsim, VARIANCE=FALSE, nSD=2, Yname=NULL,
maxnerr=nsim, rejectNA=FALSE, silent=FALSE,
do.pwrong=FALSE, envir.simul=NULL)
Arguments
Y |
A three-dimensional point pattern (object of class
|
fun |
Function that computes the desired summary statistic for a 3D point pattern. |
nsim |
Number of simulated point patterns to be generated when computing the envelopes. |
nrank |
Integer. Rank of the envelope value amongst the |
... |
Extra arguments passed to |
funargs |
A list, containing extra arguments to be passed to |
funYargs |
Optional. A list, containing extra arguments to be passed to
|
simulate |
Optional. Specifies how to generate the simulated point patterns.
If |
verbose |
Logical flag indicating whether to print progress reports during the simulations. |
transform |
Optional. A transformation to be applied to the function values, before the envelopes are computed. An expression object (see Details). |
global |
Logical flag indicating whether envelopes should be pointwise
( |
ginterval |
Optional.
A vector of length 2 specifying
the interval of |
use.theory |
Logical value indicating whether to use the theoretical value,
computed by |
alternative |
Character string determining whether the envelope corresponds
to a two-sided test ( |
scale |
Optional. Scaling function for global envelopes.
A function in the R language which determines the
relative scale of deviations, as a function of
distance |
clamp |
Logical value indicating how to compute envelopes when
|
savefuns |
Logical flag indicating whether to save all the simulated function values. |
savepatterns |
Logical flag indicating whether to save all the simulated point patterns. |
nsim2 |
Number of extra simulated point patterns to be generated
if it is necessary to use simulation to estimate the theoretical
mean of the summary function. Only relevant when |
VARIANCE |
Logical. If |
nSD |
Number of estimated standard deviations used to determine
the critical envelopes, if |
Yname |
Character string that should be used as the name of the
data point pattern |
maxnerr |
Maximum number of rejected patterns.
If |
rejectNA |
Logical value specifying whether to reject a simulated pattern
if the resulting values of |
silent |
Logical value specifying whether to print a report each time a simulated pattern is rejected. |
do.pwrong |
Logical. If |
envir.simul |
Environment in which to evaluate the expression |
Details
The envelope
command performs simulations and
computes envelopes of a summary statistic based on the simulations.
The result is an object that can be plotted to display the envelopes.
The envelopes can be used to assess the goodness-of-fit of
a point process model to point pattern data.
The envelope
function is generic, with methods for
the classes "ppp"
, "ppm"
and "kppm"
described in the help file for envelope
.
This function envelope.pp3
is the method for
three-dimensional point patterns (objects of class "pp3"
).
For the most basic use, if you have a 3D point pattern X
and
you want to test Complete Spatial Randomness (CSR), type
plot(envelope(X, K3est, nsim=39))
to see the three-dimensional
K
function for X
plotted together with the envelopes of
the three-dimensional K
function for 39 simulations of CSR.
To create simulation envelopes, the command envelope(Y, ...)
first generates nsim
random point patterns
in one of the following ways.
- If simulate=NULL, then we generate nsim simulations of Complete Spatial Randomness (i.e. nsim simulated point patterns each being a realisation of the uniform Poisson point process) with the same intensity as the pattern Y.
- If simulate is supplied, then it determines how the simulated point patterns are generated. See envelope for details.
The summary statistic fun
is applied to each of these simulated
patterns. Typically fun
is one of the functions
K3est
, G3est
, F3est
or pcf3est
.
It may also be a character string
containing the name of one of these functions.
For further information, see the documentation for
envelope
.
Value
A function value table (object of class "fv"
)
which can be plotted directly.
See envelope
for further details.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
References
Baddeley, A.J., Moyeed, R.A., Howard, C.V. and Boyde, A. (1993) Analysis of a three-dimensional point pattern with replication. Applied Statistics 42, 641–668.
See Also
pp3
,
rpoispp3
,
K3est
,
G3est
,
F3est
,
pcf3est
.
Examples
X <- rpoispp3(20, box3())
if(interactive()) {
plot(envelope(X, nsim=39))
}
Array of Simulation Envelopes of Summary Function
Description
Compute an array of simulation envelopes using a summary function that returns an array of curves.
Usage
envelopeArray(X, fun, ..., dataname = NULL, verb = FALSE, reuse = TRUE)
Arguments
X |
Object containing point pattern data.
A point pattern (object of class
|
fun |
Function that computes the desired summary statistic
for a point pattern. The result of |
... |
Arguments passed to |
dataname |
Optional character string name for the data. |
verb |
Logical value indicating whether to print progress reports. |
reuse |
Logical value indicating whether the envelopes in each panel
should be based on the same set of simulated patterns
( |
Details
This command is the counterpart of envelope
when the function fun
that is evaluated on each simulated point pattern
will return an object of class "fasp"
representing an array of
summary functions.
Simulated point patterns are generated according to the
rules described for envelope
. In brief,
if X
is a point pattern, the algorithm generates
simulated point patterns of the same kind, according to complete
spatial randomness. If X
is a fitted model, the algorithm
generates simulated point patterns according to this model.
For each simulated point pattern Y
, the function fun
is invoked. The result Z <- fun(Y, ...)
should be an object of
class "fasp"
representing an array of summary functions.
The dimensions of the array Z
should be the same
for each simulated pattern Y
.
This algorithm finds the simulation envelope of the summary functions in each cell of the array.
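For example, a sketch using a user-defined summary function (the wrapper Gall below is hypothetical) that returns an object of class "fasp":
Gall <- function(X, ...) alltypes(X, "G")
A <- envelopeArray(amacrine, Gall, nsim=9)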
Value
An object of class "fasp"
representing
an array of envelopes.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Examples
if(interactive()) {
Nsim <- 19
X <- finpines
co <- "best"
} else {
## smaller task to reduce check time
Nsim <- 3
X <- finpines[c(FALSE, TRUE)]
co <- "none"
}
A <- envelopeArray(X, markcrosscorr, nsim=Nsim, correction=co)
plot(A)
Evaluate Expression Involving Function Arrays
Description
Evaluates any expression involving one or more function arrays
(fasp
objects) and returns another function array.
Usage
eval.fasp(expr, envir, dotonly=TRUE)
Arguments
expr |
An expression involving the names of objects of class |
envir |
Optional. The environment in which to evaluate the expression,
or a named list containing |
dotonly |
Logical. Passed to |
Details
This is a wrapper to make it easier to perform pointwise calculations with the arrays of summary functions used in spatial statistics.
A function array (object of class "fasp"
) can be regarded as a matrix
whose entries are functions. Objects of this kind
are returned by the command alltypes
.
Suppose X
is an object of class "fasp"
.
Then eval.fasp(X+3)
effectively adds 3 to the value of
every function in the array X
, and returns
the resulting object.
Suppose X
and Y
are two objects of class "fasp"
which are compatible (for example the arrays
must have the same dimensions). Then
eval.fasp(X + Y)
will add the corresponding functions in
each cell of the arrays X
and Y
,
and return the resulting array of functions.
Suppose X
is an object of class "fasp"
and f
is an object of class "fv"
.
Then eval.fasp(X + f)
will add the function f
to the functions in each cell of the array X
,
and return the resulting array of functions.
In general, expr
can be any expression involving
(a) the names of objects of class "fasp"
or "fv"
,
(b) scalar constants, and (c) functions which are vectorised.
See the Examples.
First eval.fasp
determines which of the variable names
in the expression expr
refer to objects of class "fasp"
.
The expression is then evaluated for each cell of the array
using eval.fv
.
The expression expr
must be vectorised.
There must be at least one object of class "fasp"
in the expression.
All such objects must be compatible.
Value
Another object of class "fasp"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
See Also
Examples
K <- alltypes(amacrine, "K")
# expressions involving a fasp object
eval.fasp(K + 3)
L <- eval.fasp(sqrt(K/pi))
# expression involving two fasp objects
D <- eval.fasp(K - L)
# subtracting the unmarked K function from the cross-type K functions
K0 <- Kest(unmark(amacrine))
DK <- eval.fasp(K - K0)
## Use of 'envir'
S <- eval.fasp(1-G, list(G=alltypes(amacrine, 'G')))
Evaluate Expression Involving Functions
Description
Evaluates any expression involving one or more function value (fv) objects, and returns another object of the same kind.
Usage
eval.fv(expr, envir, dotonly=TRUE, equiv=NULL, relabel=TRUE)
Arguments
expr |
An expression. |
envir |
Optional. The environment in which to evaluate the
expression, or a named list containing |
dotonly |
Logical. See Details. |
equiv |
Mapping between column names of different objects that are deemed to be equivalent. See Details. |
relabel |
Logical value indicating whether to
compute appropriate labels for the resulting function.
This should normally be |
Details
This is a wrapper to make it easier to perform pointwise calculations with the summary functions used in spatial statistics.
An object of class "fv"
is essentially a data frame
containing several different statistical estimates of the same
function. Such objects are returned by Kest
and its
relatives.
For example, suppose X
is an object of class "fv"
containing several different estimates of the Ripley's K function K(r)
,
evaluated at a sequence of values of r
.
Then eval.fv(X+3)
effectively adds 3 to
each function estimate in X
, and returns
the resulting object.
Suppose X
and Y
are two objects of class "fv"
which are compatible (in particular they have the same vector
of r
values). Then
eval.fv(X + Y)
will add the corresponding function values in
X
and Y
, and return the resulting function.
In general, expr
can be any expression involving
(a) the names of objects of class "fv"
, (b) scalar
constants, and (c) functions which are vectorised.
See the Examples.
First eval.fv
determines which of the variable names
in the expression expr
refer to objects of class "fv"
.
Each such name is replaced by a vector containing the function values.
The expression is then evaluated. The result should be a vector;
it is taken as the new vector of function values.
The expression expr
must be vectorised.
There must be at least one object of class "fv"
in the expression.
If the objects are not compatible, they will be made compatible
by harmonise.fv
.
If dotonly=TRUE
(the default), the expression will be
evaluated only for those columns of an "fv"
object
that contain values of the function itself (rather than
values of the derivative of the function, the hazard rate, etc).
If dotonly=FALSE
, the expression will be evaluated for all columns.
For example the result of Fest
includes several columns
containing estimates of the empty space function F(r)
,
but also includes an estimate of the
hazard h(r)
of F(r)
. Transformations that are valid
for F
may not be valid for h
. Accordingly, h
would
normally be omitted from the calculation.
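For example, a sketch using Fest, whose result includes hazard columns:
Fc <- Fest(cells)
h1 <- eval.fv(2 * Fc)                 # only the columns of function values
h2 <- eval.fv(2 * Fc, dotonly=FALSE)  # every column, including the hazard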
The columns of an object x
that represent the function itself
are identified by its “dot” names, fvnames(x, ".")
.
They are the columns normally plotted by plot.fv
and identified by the symbol "."
in plot formulas
in plot.fv
.
The argument equiv
can be used to specify that
two different column names in different function objects
are mathematically equivalent or cognate.
It should be a list of name=value
pairs, or a named vector of
character strings, indicating the pairing of equivalent names.
(Without this argument, these columns would be discarded.)
See the Examples.
The argument relabel
should normally be TRUE
(the default).
It determines whether to compute appropriate mathematical labels and
descriptions for the resulting function object
(used when the object is printed or plotted).
If relabel=FALSE
then this does not occur,
and the mathematical labels and descriptions
in the result are taken from the function object
that appears first in the expression. This reduces computation time
slightly (for advanced use only).
Value
Another object of class "fv"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Examples
# manipulating the K function
X <- runifrect(42)
Ks <- Kest(X)
eval.fv(Ks + 3)
Ls <- eval.fv(sqrt(Ks/pi))
# manipulating two K functions
Y <- runifrect(20)
Kr <- Kest(Y)
Kdif <- eval.fv(Ks - Kr)
Z <- eval.fv(sqrt(Ks/pi) - sqrt(Kr/pi))
## Use of 'envir'
U <- eval.fv(sqrt(K), list(K=Ks))
## Use of 'equiv'
Fc <- Fest(cells)
Gc <- Gest(cells)
# Hanisch and Chiu-Stoyan estimators are cognate
Dc <- eval.fv(Fc - Gc, equiv=list(cs="han"))
Function Arrays for Spatial Patterns
Description
A class "fasp"
to represent a “matrix”
of functions, amenable to plotting as a matrix of plot panels.
Details
An object of this class is a convenient way of storing
(and later plotting, editing, etc)
a set of functions f_{i,j}(r)
of a real argument r
,
defined for each possible pair (i,j)
of indices
1 \le i,j \le n
. We may think of this
as a matrix or array of functions f_{i,j}
.
Function arrays are particularly useful in the
analysis of a multitype point pattern (a point pattern in which
the points are identified as belonging to separate types).
We may want to compute a summary function for the points
of type i
only, for each of the possible types i
.
This produces a 1 \times m
array of functions.
Alternatively we may compute a summary function
for each possible pair of types (i,j)
.
This produces an m \times m
array of functions.
For multitype point patterns the command alltypes
will compute arrays of summary functions for each possible
type or for each possible pair of types.
The function alltypes
returns an object of class "fasp"
.
An object of class "fasp"
is a list containing at least the
following components:
- fns
-
A list of data frames, each representing one of the functions.
- which
-
A matrix representing the spatial arrangement of the functions. If
which[i,j] = k
then the function represented byfns[[k]]
should be plotted in the panel at position(i,j)
. Ifwhich[i,j] = NA
then nothing is plotted in that position. - titles
-
A list of character strings, providing suitable plotting titles for the functions.
- default.formulae
-
A list of default formulae for plotting each of the functions.
- title
-
A character string, giving a default title for the array when it is plotted.
Functions available
There are methods for plot
, print
and "["
for this class.
The plot method displays the entire array of functions.
The method [.fasp
selects a sub-array using the natural
indices i,j
.
The command eval.fasp
can be used to apply
a transformation to each function in the array,
and to combine two arrays.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
See Also
alltypes
,
plot.fasp
,
[.fasp
,
eval.fasp
Examples
GG <- alltypes(amacrine, 'G')
plot(GG)
# select the row corresponding to cells of type "on"
Gon <- GG["on", ]
plot(Gon)
# extract the G function for i = "on", j = "off"
Gonoff <- GG["on", "off", drop=TRUE]
# Fisher variance stabilising transformation
GGfish <- eval.fasp(asin(sqrt(GG)))
plot(GGfish)
Extract or Change the Plot Formula for a Function Value Table
Description
Extract or change the default plotting formula
for an object of class "fv"
(function value table).
Usage
## S3 method for class 'fv'
formula(x, ...)
formula(x, ...) <- value
## S3 replacement method for class 'fv'
formula(x, ...) <- value
Arguments
x |
An object of class |
... |
Arguments passed to other methods. |
value |
New value of the formula. Either a |
Details
A function value table
(object of class "fv"
, see fv.object
)
is a convenient way of storing and plotting
several different estimates of the same function.
The default behaviour of plot(x)
for a function value table
x
is determined by a formula
associated with x
called its plot formula.
See plot.fv
for explanation about these formulae.
The function formula.fv
is a method for the generic command
formula
. It extracts the plot formula associated with
the object.
The function formula<-
is generic. It changes the formula
associated with an object.
The function formula<-.fv
is the method for formula<-
for the class "fv"
. It changes the plot formula associated with
the object.
Value
The result of formula.fv
is a character string containing the
plot formula. The result of formula<-.fv
is a new object of
class "fv"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Examples
K <- Kest(cells)
formula(K)
formula(K) <- (iso ~ r)
Fry Plot of Point Pattern
Description
Displays the Fry plot (Patterson plot) of a spatial point pattern.
Usage
fryplot(X, ..., width=NULL, from=NULL, to=NULL, axes=FALSE)
frypoints(X, from=NULL, to=NULL, dmax=Inf)
Arguments
X |
A point pattern (object of class |
... |
Optional arguments to control the appearance of the plot. |
width |
Optional parameter indicating the width of a box for a zoomed-in view of the Fry plot near the origin. |
from , to |
Optional. Subset indices specifying which points of |
axes |
Logical value indicating whether to draw axes, crossing at the origin. |
dmax |
Maximum distance between points. Pairs at greater distances do not contribute to the result. The default means there is no maximum distance. |
Details
The function fryplot
generates a Fry plot (or Patterson plot);
frypoints
returns the points of the Fry plot as a point pattern
dataset.
Fry (1979) and Hanna and Fry (1979) introduced a manual graphical method for
investigating features of a spatial point pattern of mineral deposits.
A transparent sheet, marked
with an origin or centre point, is placed over the point pattern.
The transparent sheet is shifted so that the origin lies over one of
the data points, and the positions of all the other data points
are copied onto the transparent sheet. This procedure is repeated for
each data point in turn. The resulting plot (the Fry plot)
is a pattern of n(n-1)
points, where n
is the original number
of data points. This procedure was previously proposed by
Patterson (1934, 1935) for studying inter-atomic distances in
crystals, and is also known as a Patterson plot.
The function fryplot
generates the Fry/Patterson plot.
Standard graphical parameters
such as main
, pch
,
lwd
, col
, bg
, cex
can be used to control
the appearance of the plot.
To zoom in (to view only a subset of the Fry plot at higher
magnification), use the argument width
to specify the width
of a rectangular field of view centred at the origin, or the standard
graphical arguments xlim
and ylim
to specify another
rectangular field of view. (The actual field of view may be slightly
larger, depending on the graphics device.)
The function frypoints
returns the points of the Fry
plot as a point pattern object. There may be a large number of points
in this pattern, so this function should be used only if further
analysis of the Fry plot is required.
Fry plots are particularly useful for recognising anisotropy in regular point patterns. A void around the origin in the Fry plot suggests regularity (inhibition between points) and the shape of the void gives a clue to anisotropy in the pattern. Fry plots are also useful for detecting periodicity or rounding of the spatial coordinates.
In mathematical terms, the Fry plot of a point pattern X
is simply a plot of the vectors X[i] - X[j]
connecting all
pairs of distinct points in X
.
The Fry plot is related to the K
function (see
Kest
) and the reduced second moment measure
(see Kmeasure
). For example, the number
of points in the Fry plot lying within a circle of given radius
is an unnormalised and uncorrected version of the K
function.
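To illustrate, a sketch counting the Fry points within distance r of the origin (an unnormalised, uncorrected analogue of the K function at r):
Y <- frypoints(cells)
r <- 0.1
sum(with(coords(Y), sqrt(x^2 + y^2)) <= r)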
The Fry plot has a similar appearance to the plot of the
reduced second moment measure Kmeasure
when the
smoothing parameter sigma
is very small.
The Fry plot does not adjust for the effect
of the size and shape of the sampling window.
The density of points in the Fry plot tapers off near the edges of the
plot. This is an edge effect, a consequence of the bounded sampling
window. In geological applications this is usually not
important, because interest is focused on the behaviour near the
origin where edge effects can be ignored.
To correct for the edge effect, use Kmeasure
or
Kest
or its relatives.
Value
fryplot
returns NULL
.
frypoints
returns a point pattern (object of class "ppp"
).
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
References
Fry, N. (1979) Random point distributions and strain measurement in rocks. Tectonophysics 60, 89–105.
Hanna, S.S. and Fry, N. (1979) A comparison of methods of strain determination in rocks from southwest Dyfed (Pembrokeshire) and adjacent areas. Journal of Structural Geology 1, 155–162.
Patterson, A.L. (1934) A Fourier series method for the determination of the component of inter-atomic distances in crystals. Physical Review 46, 372–376.
Patterson, A.L. (1935) A direct method for the determination of the components of inter-atomic distances in crystals. Zeitschrift für Kristallographie 90, 517–554.
See Also
Examples
## unmarked data
fryplot(cells)
Y <- frypoints(cells)
## numerical marks
fryplot(longleaf, width=4, axes=TRUE)
## multitype points
fryplot(amacrine, width=0.2,
from=(marks(amacrine) == "on"),
chars=c(3,16), cols=2:3,
main="Fry plot centred at an On-cell")
points(0,0)
Create a Function Value Table
Description
Advanced Use Only.
This low-level function creates an object of class "fv"
from raw numerical data.
Usage
fv(x, argu = "r", ylab = NULL, valu, fmla = NULL, alim = NULL,
labl = names(x), desc = NULL, unitname = NULL, fname = NULL, yexp = ylab)
Arguments
x |
A data frame with at least 2 columns containing the values of the function argument and the corresponding values of (one or more versions of) the function. |
argu |
String. The name of the column of |
ylab |
Either |
valu |
String. The name of the column of |
fmla |
Either |
alim |
Optional. The default range of values of the function argument for which the function will be plotted. Numeric vector of length 2. |
labl |
Optional. Plot labels for the columns of |
desc |
Optional. Descriptions of the columns of |
unitname |
Optional. Name of the unit (usually a unit of length) in which the function argument is expressed. Either a single character string, or a vector of two character strings giving the singular and plural forms, respectively. |
fname |
Optional. The name of the function itself. A character string. |
yexp |
Optional. Alternative form of |
Details
This documentation is provided
for experienced programmers who want to modify the internal
behaviour of spatstat. Other users please see fv.object
.
The low-level function fv
is used to create an object of
class "fv"
from raw numerical data.
The data frame x
contains the numerical data.
It should have one column
(typically but not necessarily named "r"
)
giving the values of the function argument for which
the function has been evaluated; and at least one other column,
containing the corresponding values of the function.
Typically there is more than one column of function values.
These columns typically give the values of different versions or estimates
of the same function,
for example, different estimates of the K
function
obtained using different edge corrections.
However they may also contain the values of related functions
such as the derivative or hazard rate.
argu
specifies the name of the column of
x
that contains the values of the function argument
(typically argu="r"
but this is not compulsory).
valu
specifies the name of another column
that contains the ‘recommended’ estimate of the function.
It will be used to provide function values in those situations where
a single column of data is required. For example,
envelope
computes its simulation envelopes
using the recommended value of the summary function.
fmla
specifies the default plotting behaviour.
It should be a formula, or a string that can be converted to a
formula. Variables in the formula are names of columns of x
.
See plot.fv
for the interpretation of this
formula.
alim
specifies the recommended range of the
function argument. This is used in situations where statistical
theory or statistical practice indicates that the computed
estimates of the function are not trustworthy outside a certain
range of values of the function argument. By default,
plot.fv
will restrict the plot to this range.
fname
is a string giving the name of the function itself.
For example, the K
function would have fname="K"
.
ylab
is a mathematical expression
for the function value, used when labelling an axis
of the plot, or when printing a description of the
function. It should be an R language object.
For example the K
function's mathematical name K(r)
is rendered
by ylab=quote(K(r))
.
If yexp
is present, then ylab
will be
used only for printing, and yexp
will be used for
annotating axes in a plot. (Otherwise yexp
defaults to ylab
).
For example the cross-type K
function
K_{1,2}(r)
is rendered by something like
ylab=quote(Kcross[1,2](r))
and
yexp=quote(Kcross[list(1,2)](r))
to get the most satisfactory behaviour.
(A useful tip: use substitute
instead of
quote
to insert values of variables into an expression,
e.g. substitute(Kcross[i,j](r), list(i=42,j=97))
yields the same as quote(Kcross[42, 97](r))
.)
labl
is a character vector specifying plot labels
for each column of x
. These labels will appear on the
plot axes (in non-default plots), legends and printed output.
Entries in labl
may contain the string "%s"
which will be replaced
by fname
. For example the border-corrected estimate
of the K
function has label "%s[bord](r)"
which
becomes "K[bord](r)"
.
desc
is a character vector containing intelligible
explanations of each column of x
. Entries in
desc
may contain the string "%s"
which will be replaced
by ylab
. For example the border correction estimate of the
K
function has description "border correction estimate of %s"
.
Value
An object of class "fv"
, see fv.object
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net.
See Also
See plot.fv
for plotting an "fv"
object.
See as.function.fv
to convert an "fv"
object
to an R function.
Use cbind.fv
to combine several "fv"
objects.
Use bind.fv
to glue additional columns onto an existing
"fv"
object.
Simple calculations such as arithmetic and mathematical operations
can be computed directly.
The range of y
values of a function f
can be computed by
typing range(f)
. These operations are dispatched to
Summary.fv
, Math.fv
and Ops.fv
.
Use eval.fv
or with.fv
for more complicated
calculations.
The functions fvnames
, fvnames<-
allow the user to
use standard abbreviations to refer to columns of an "fv"
object.
Undocumented functions for modifying an "fv"
object
include tweak.fv.entry
and rebadge.fv
.
Examples
df <- data.frame(r=seq(0,5,by=0.1))
df <- transform(df, a=pi*r^2, b=3*r^2)
X <- fv(df, "r", quote(A(r)),
"a", cbind(a, b) ~ r,
alim=c(0,4),
labl=c("r", "%s[true](r)", "%s[approx](r)"),
desc=c("radius of circle",
"true area %s",
"rough area %s"),
fname="A")
X
Function Value Table
Description
A class "fv"
to support the convenient plotting
of several estimates of the same function.
Details
An object of this class is a convenient way of storing and plotting several different estimates of the same function.
It is a data frame with extra attributes indicating the recommended way of plotting the function, and other information.
There are methods for print
and plot
for
this class.
Objects of class "fv"
are returned by
Fest, Gest, Jest and Kest
along with many other functions.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
See Also
Objects of class "fv"
are returned by
Fest, Gest, Jest and Kest
along with many other functions.
See plot.fv
for plotting an "fv"
object.
See as.function.fv
to convert an "fv"
object
to an R function.
Use cbind.fv
to combine several "fv"
objects.
Use bind.fv
to glue additional columns onto an existing
"fv"
object.
Undocumented functions for modifying an "fv"
object
include fvnames
, fvnames<-
,
tweak.fv.entry
and rebadge.fv
.
Examples
K <- Kest(cells)
class(K)
K # prints a sensible summary
plot(K)
Abbreviations for Groups of Columns in Function Value Table
Description
Groups of columns in a function value table (object
of class "fv"
) identified by standard abbreviations.
Usage
fvnames(X, a = ".")
fvnames(X, a = ".") <- value
Arguments
X |
Function value table (object of class |
a |
One of the standard abbreviations listed below. |
value |
Character vector containing names of columns of |
Details
An object of class "fv"
represents a table of
values of a function, usually a summary function for spatial data
such as the K
-function, for which several different statistical
estimators may be available. The different estimates are stored
as columns of the table.
Auxiliary information carried in the object X
specifies some
columns or groups of columns of this table that should be
used for particular purposes.
For convenience these groups can be referred to by standard
abbreviations which are recognised by various functions
in the spatstat package, such as plot.fv
.
These abbreviations are:
".x" | the function argument |
".y" | the recommended value of the function |
"." | all function values to be plotted by default |
(in order of plotting) | |
".s" | the upper and lower limits of shading |
(for envelopes and confidence intervals) | |
".a" | all function values (in column order) |
The command fvnames(X, a)
expands the abbreviation a
and returns
a character vector containing the names of the columns.
The assignment fvnames(X, a) <- value
changes the
definition of the abbreviation a
to the character string
value
(which should be the name of another column of X
).
The column names of X
are not changed.
Note that fvnames(x, ".")
lists the columns of values that will
be plotted by default, in the order that they would be plotted, not in
order of the column position. The order in which curves are plotted
affects the colours and line styles associated with the curves.
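For example, a sketch of the standard expansions (the names returned depend on which edge corrections were computed):
K <- Kest(cells)
fvnames(K, ".x")   # name of the function argument, "r"
fvnames(K, ".y")   # name of the recommended estimate
fvnames(K, ".")    # curves plotted by default, in plotting order
fvnames(K, ".a")   # all columns of function values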
Value
For fvnames
, a character vector.
For fvnames<-
, the updated object.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
See Also
Examples
K <- Kest(cells)
fvnames(K, ".y")
fvnames(K, ".y") <- "trans"
Make Function Tables Compatible
Description
Convert several objects of class "fv"
to the same values of the function argument.
Usage
## S3 method for class 'fv'
harmonise(..., strict=FALSE)
## S3 method for class 'fv'
harmonize(..., strict=FALSE)
Arguments
... |
Any number of function tables (objects of class |
strict |
Logical. If |
Details
A function value table (object of class "fv"
) is
essentially a data frame giving the values of a function f(x)
(or several alternative estimates of this value)
at equally-spaced values of the function argument x
.
The command harmonise
is generic. This is the
method for objects of class "fv"
.
This command makes any number of "fv"
objects compatible,
in the loose sense that they have the same sequence of values of
x
. They can then be combined by cbind.fv
,
but not necessarily by eval.fv
.
All arguments ...
must be function value tables
(objects of class "fv"
).
The result will be a list, of length equal to the number of
arguments ...
, containing new versions of each of these functions,
converted to a common sequence of x
values.
If the arguments were named (name=value
) then the return value
also carries these names.
The range of x
values in the resulting functions
will be the intersection of the ranges of x
values
in the original functions.
The spacing of x
values in the resulting functions
will be the finest (narrowest) of the spacings of the
x
values in the original functions.
Function values are interpolated using approxfun
.
If strict=TRUE
, each column of data will be retained only if
a column of the same name appears in all of the arguments ...
.
This ensures that the resulting objects are strictly compatible
in the sense of compatible.fv
,
and can be combined using eval.fv
or collapse.fv
.
If strict=FALSE
(the default), this does not occur,
and then the resulting objects are not guaranteed to be compatible
in the sense of compatible.fv
.
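For example, a sketch combining two different summary functions after strict harmonisation (only columns with the same name in both objects, such as the theoretical values, are retained):
H <- harmonise(K=Kest(cells), G=Gest(cells), strict=TRUE)
D <- eval.fv(K - G, envir=H)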
Value
A list, of length equal to the number of arguments ...
,
whose entries are objects of class "fv"
.
If the arguments were named (name=value
) then the return value
also carries these names.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
fv.object
,
cbind.fv
,
eval.fv
,
compatible.fv
Examples
H <- harmonise(K=Kest(cells), G=Gest(cells))
H
Hopkins-Skellam Test
Description
Perform the Hopkins-Skellam test of Complete Spatial Randomness, or simply calculate the test statistic.
Usage
hopskel(X)
hopskel.test(X, ...,
alternative=c("two.sided", "less", "greater",
"clustered", "regular"),
method=c("asymptotic", "MonteCarlo"),
nsim=999)
Arguments
X |
Point pattern (object of class |
alternative |
String indicating the type of alternative for the hypothesis test. Partially matched. |
method |
Method of performing the test. Partially matched. |
nsim |
Number of Monte Carlo simulations to perform, if a Monte Carlo p-value is required. |
... |
Ignored. |
Details
Hopkins and Skellam (1954) proposed a test of Complete Spatial Randomness based on comparing nearest-neighbour distances with point-event distances.
If the point pattern X
contains n
points, we first compute the nearest-neighbour distances
P_1, \ldots, P_n
so that P_i
is the distance from the i
th data
point to the nearest other data point. Then we
generate another completely random pattern U
with
the same number n
of points, and compute for each point of U
the distance to the nearest point of X
, giving
distances I_1, \ldots, I_n
.
The test statistic is
A = \frac{\sum_i P_i^2}{\sum_i I_i^2}
The null distribution of A
is roughly
an F
distribution with shape parameters (2n,2n)
.
(This is equivalent to using the test statistic H=A/(1+A)
and referring H
to the Beta distribution with parameters
(n,n)
).
The function hopskel
calculates the Hopkins-Skellam test statistic
A
, and returns its numeric value. This can be used as a simple
summary of spatial pattern: the value A=1 is consistent
with Complete Spatial Randomness, while values A < 1 are
consistent with spatial clustering, and values A > 1 are consistent
with spatial regularity.
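As an illustration, a sketch computing the statistic by hand (hopskel performs a similar calculation internally; the value varies with the random pattern U):
X <- redwood
U <- runifpoint(npoints(X), win=Window(X))
P <- nndist(X)                     # nearest-neighbour distances
ev <- nncross(U, X, what="dist")   # point-event distances
A <- sum(P^2) / sum(ev^2)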
The function hopskel.test
performs the test.
If method="asymptotic"
(the default), the test statistic A
is referred to the F
distribution. If method="MonteCarlo"
,
a Monte Carlo test is performed using nsim
simulated point
patterns.
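For example, a sketch of the Monte Carlo version:
hopskel.test(redwood, alternative="clustered", method="MonteCarlo", nsim=199)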
Value
The value of hopskel
is a single number.
The value of hopskel.test
is an object of class "htest"
representing the outcome of the test. It can be printed.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Hopkins, B. and Skellam, J.G. (1954) A new method of determining the type of distribution of plant individuals. Annals of Botany 18, 213–227.
See Also
clarkevans
,
clarkevans.test
,
nndist
,
nncross
Examples
hopskel(redwood)
hopskel.test(redwood, alternative="clustered")
Heat Kernel for a Two-Dimensional Rectangle
Description
Calculate values of the heat kernel in a rectangle with insulated edges.
Usage
hotbox(Xsource, Xquery, sigma,
..., W=NULL, squared=FALSE, nmax=20)
Arguments
Xsource |
Point pattern of sources of heat.
Object of class |
Xquery |
Locations where the heat kernel value is required.
An object of class |
sigma |
Bandwidth for kernel. A single number. |
... |
Extra arguments (passed to |
W |
Window (object of class |
squared |
Logical value indicating whether to take the square of each heat kernel value, before summing over the source points. |
nmax |
Number of terms to be used from the infinite-sum expression for the heat kernel. A single integer. |
Details
This function computes the sum of heat kernels associated with each of the source points, evaluating them at each query location.
The window for evaluation of the heat kernel must be a rectangle.
The heat kernel in any region can be expressed as an infinite sum of terms
associated with the eigenfunctions of the Laplacian. The heat kernel
in a rectangle is the product of heat kernels for
one-dimensional intervals on the horizontal and vertical axes. This
function uses hotrod
to compute the
one-dimensional heat kernels, truncating the infinite sum to the
first nmax
terms, and then calculates the two-dimensional heat
kernel from each source point to each query location. If
squared=TRUE
these values are squared. Finally the values are
summed over all source points to obtain a single value for each
query location.
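For example, a sketch evaluating the kernel, and its squared version, at the source points themselves:
X <- runifpoint(10)
v  <- hotbox(X, X, sigma=0.1)
v2 <- hotbox(X, X, sigma=0.1, squared=TRUE)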
Value
If Xquery
is a point pattern,
the result is a numeric vector with one entry for each query point.
If Xquery
is an image or window, the result is
a pixel image.
Author(s)
Adrian Baddeley and Greg McSwiggan.
References
Baddeley, A., Davies, T., Rakshit, S., Nair, G. and McSwiggan, G. (2021) Diffusion smoothing for spatial point patterns. Statistical Science, in press.
See Also
Examples
X <- runifpoint(10)
Y <- runifpoint(5)
hotbox(X, Y, 0.1)
plot(hotbox(X, Window(X), 0.1))
points(X, pch=16)
Inverse-distance weighted smoothing of observations at irregular points
Description
Performs spatial smoothing of numeric values observed at a set of irregular locations using inverse-distance weighting.
Usage
idw(X, power=2, at=c("pixels", "points"), ..., se=FALSE)
Arguments
X |
A marked point pattern (object of class |
power |
Numeric. Power of distance used in the weighting. |
at |
Character string specifying whether to compute the intensity values
at a grid of pixel locations ( |
... |
Arguments passed to |
se |
Logical value specifying whether to calculate a standard error. |
Details
This function performs spatial smoothing of numeric values observed at a set of irregular locations.
Smoothing is performed by inverse distance weighting. If the
observed values are v_1,\ldots,v_n
at locations x_1,\ldots,x_n
respectively,
then the smoothed value at a location u
is
g(u) = \frac{\sum_i w_i v_i}{\sum_i w_i}
where the weights are the inverse p
-th powers of distance,
w_i = \frac 1 {d(u,x_i)^p}
where d(u,x_i) = ||u - x_i||
is the Euclidean distance from u
to x_i
.
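As an illustration, a sketch of the calculation at a single query location u, using the longleaf data (whose marks are tree diameters):
X <- longleaf
u <- c(100, 100)
d <- as.numeric(crossdist(ppp(u[1], u[2], window=Window(X)), X))
w <- 1/d^2
sum(w * marks(X)) / sum(w)   # inverse-distance weighted value at u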
The argument X
must be a marked point pattern (object
of class "ppp"
, see ppp.object
).
The points of the pattern are taken to be the
observation locations x_i
, and the marks of the pattern
are taken to be the numeric values v_i
observed at these
locations.
The marks are allowed to be a data frame. Then the smoothing procedure is applied to each column of marks.
If at="pixels"
(the default), the smoothed mark value
is calculated at a grid of pixels, and the result is a pixel image.
The arguments ...
control the pixel resolution.
See as.mask
.
If at="points"
, the smoothed mark values are calculated
at the data points only, using a leave-one-out rule (the mark value
at a data point is excluded when calculating the smoothed value
for that point).
An estimate of standard error is also calculated, if se=TRUE
.
The calculation assumes that the data point locations are fixed,
that is, the standard error only takes into account the variability
in the mark values, and not the variability due to randomness of the
data point locations.
An alternative to inverse-distance weighting is kernel smoothing,
which is performed by Smooth.ppp
.
Value
If X
has a single column of marks:
-
If
at="pixels"
(the default), the result is a pixel image (object of class"im"
). Pixel values are values of the interpolated function. -
If
at="points"
, the result is a numeric vector of length equal to the number of points inX
. Entries are values of the interpolated function at the points ofX
.
If X
has a data frame of marks:
-
If
at="pixels"
(the default), the result is a named list of pixel images (object of class"im"
). There is one image for each column of marks. This list also belongs to the class"solist"
, for which there is a plot method. -
If
at="points"
, the result is a data frame with one row for each point ofX
, and one column for each column of marks. Entries are values of the interpolated function at the points ofX
.
If se=TRUE
, then the result is a list
with two entries named estimate
and SE
, which
each have the format described above.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk. Variance calculation by Andrew P Wheeler with modifications by Adrian Baddeley.
References
Shepard, D. (1968) A two-dimensional interpolation function for irregularly-spaced data. Proceedings of the 1968 ACM National Conference, 1968, pages 517–524. DOI: 10.1145/800186.810616
See Also
density.ppp
,
ppp.object
,
im.object
.
See Smooth.ppp
for kernel smoothing,
SpatialMedian.ppp
for median smoothing
and nnmark
for nearest-neighbour interpolation.
To perform other kinds of interpolation, see also the akima
package.
Examples
# data frame of marks: trees marked by diameter and height
plot(idw(finpines))
idw(finpines, at="points")[1:5,]
plot(idw(finpines, se=TRUE)$SE)
idw(finpines, at="points", se=TRUE)$SE[1:5, ]
Increments of a Function
Description
Compute the change in the value of a function f
when the function argument increases by delta
.
Usage
increment.fv(f, delta)
Arguments
f |
Object of class |
delta |
Numeric. The increase in the value of the function argument. |
Details
This command computes the new function
g(x) = f(x+h) - f(x-h)
where h = delta/2
. The value of g(x)
is
the change in the value of f
over an interval of length
delta
centred at x
.
Value
Another object of class "fv"
compatible with f
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au,
Rolf Turner rolfturner@posteo.net
and Ege Rubak rubak@math.aau.dk
See Also
Examples
plot(increment.fv(Kest(cells), 0.05))
Compute Integral of Function Object
Description
Compute the integral of a function over a specified range.
Usage
## S3 method for class 'fv'
integral(f, domain = NULL, ...)
Arguments
f |
A function value table
(object of class |
domain |
Optional. Range of values of the argument |
... |
Ignored. |
Details
This is a method for the generic function integral
.
It computes the numerical integral
I = \int f(x) dx
of the function object f
.
If domain
is specified, the integral is restricted to the
interval of x
values given by the domain
.
The result is a numeric value or numeric vector containing one entry
for each column of function values in f
.
Integrals are calculated numerically using the trapezoidal rule restricted to the domain given.
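For example, a sketch broadly reproducing the calculation for the recommended estimate by applying the trapezoidal rule directly:
g <- pcf(redwood, divisor="d")
ok <- g$r <= 0.1
r <- g$r[ok]
y <- g[[fvnames(g, ".y")]][ok]
sum(diff(r) * (head(y, -1) + tail(y, -1)) / 2)
integral(g, domain=c(0, 0.1))   # one value per column of function values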
Value
A single numerical value, or a numeric vector.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
See Also
Examples
g <- pcf(redwood, divisor="d")
integral(g, domain=c(0, 0.1))
Laslett's Transform
Description
Apply Laslett's Transform to a spatial region, returning the original and transformed regions, and the original and transformed positions of the lower tangent points. This is a diagnostic for the Boolean model.
Usage
laslett(X, ..., verbose = FALSE, plotit = TRUE, discretise = FALSE,
type=c("lower", "upper", "left", "right"))
Arguments
X |
Spatial region to be transformed.
A window (object of class |
... |
Graphics arguments to control the plot (passed to
|
verbose |
Logical value indicating whether to print progress reports. |
plotit |
Logical value indicating whether to plot the result. |
discretise |
Logical value indicating whether polygonal windows should first be
converted to pixel masks before the Laslett transform is
computed. This should be set to |
type |
Type of tangent points to be detected.
This also determines the direction of contraction in the
set transformation.
Default is |
Details
This function finds the lower tangent points of the spatial region X
,
then applies Laslett's Transform to the space,
and records the transformed positions of the lower tangent points.
Laslett's transform is a diagnostic for the Boolean Model. A test of the Boolean model can be performed by applying a test of CSR to the transformed tangent points. See the Examples.
The rationale is that, if the region X
was generated by a
Boolean model with convex grains, then the lower tangent points of
X
, when subjected to Laslett's transform,
become a Poisson point process (Cressie, 1993, section 9.3.5;
Molchanov, 1997; Barbour and Schmidt, 2001).
Intuitively, Laslett's transform is a way to account for the fact that
tangent points of X
cannot occur inside X
.
It treats the interior of X
as empty space, and collapses
this empty space so that only the exterior of X
remains.
In this collapsed space, the tangent points are completely random.
Formally, Laslett's transform is a random (i.e. data-dependent)
spatial transformation which maps each spatial
location (x,y)
to a new location (x',y)
at the same height
y
. The transformation is defined so that x'
is the total uncovered length of the line segment from (0,y)
to
(x,y)
, that is, the total length of the parts of this segment that
fall outside the region X
.
In more colourful terms, suppose we use an abacus to display a
pixellated version of X
. Each wire of the abacus represents one
horizontal line in the pixel image. Each pixel lying outside
the region X
is represented by a bead of the abacus; pixels
inside X
are represented by the absence of a bead. Next
we find any beads which are lower tangent points of X
, and
paint them green. Then Laslett's Transform is applied by pushing all
beads to the left, as far as possible. The final locations of all the
beads provide a new spatial region, inside which is the point pattern
of tangent points (marked by the green-painted beads).
If plotit=TRUE
(the default), a before-and-after plot is
generated, showing the region X
and the tangent points
before and after the transformation. This plot can also be generated
by calling plot(a)
where a
is the object returned by
the function laslett
.
If the argument type
is given, then this determines the
type of tangents that will be detected, and also the direction of
contraction in Laslett's transform. The computation is performed
by first rotating X
, applying Laslett's transform for lower
tangent points, then rotating back.
There are separate algorithms for polygonal windows and
pixellated windows (binary masks). The polygonal algorithm may be slow
for very complicated polygons. If this happens, setting
discretise=TRUE
will convert the polygonal window to a binary
mask and invoke the pixel raster algorithm.
Value
A list, which also belongs to the class "laslett"
so that it can immediately be printed and plotted.
The list elements are:
- oldX:
the original dataset
X
;- TanOld:
a point pattern, whose window is
Frame(X)
, containing the lower tangent points ofX
;- TanNew:
a point pattern, whose window is the Laslett transform of
Frame(X)
, and which contains the Laslett-transformed positions of the tangent points;- Rect:
a rectangular window, which is the largest rectangle lying inside the transformed set;
- df:
a data frame giving the locations of the tangent points before and after transformation.
- type:
character string specifying the type of tangents.
Author(s)
Kassel Hingee and Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Barbour, A.D. and Schmidt, V. (2001) On Laslett's Transform for the Boolean Model. Advances in Applied Probability 33(1), 1–5.
Cressie, N.A.C. (1993) Statistics for spatial data, second edition. John Wiley and Sons.
Molchanov, I. (1997) Statistics of the Boolean Model for Practitioners and Mathematicians. Wiley.
See Also
Examples
a <- laslett(heather$coarse)
transformedHeather <- with(a, Window(TanNew))
plot(transformedHeather, invert=TRUE)
with(a, clarkevans.test(TanNew[Rect], correction="D", nsim=39))
X <- discs(runifrect(15) %mark% 0.2, npoly=16)
b <- laslett(X, type="left")
b
Neighbourhood density function
Description
Computes the neighbourhood density function, a local version of
the K
-function or L
-function,
defined by Getis and Franklin (1987).
Usage
localK(X, ..., rmax = NULL, correction = "Ripley", verbose = TRUE, rvalue=NULL)
localL(X, ..., rmax = NULL, correction = "Ripley", verbose = TRUE, rvalue=NULL)
Arguments
X |
A point pattern (object of class |
... |
Ignored. |
rmax |
Optional. Maximum desired value of the argument |
correction |
String specifying the edge correction to be applied.
Options are |
verbose |
Logical flag indicating whether to print progress reports during the calculation. |
rvalue |
Optional. A single value of the distance argument
|
Details
The command localL
computes the neighbourhood density function,
a local version of the L
-function (Besag's transformation of Ripley's
K
-function) that was proposed by Getis and Franklin (1987).
The command localK
computes the corresponding
local analogue of the K-function.
Given a spatial point pattern X
, the neighbourhood density function
L_i(r)
associated with the i
th point
in X
is computed by
L_i(r) = \sqrt{\frac a {(n-1) \pi} \sum_j e_{ij}}
where the sum is over all points j \neq i
that lie
within a distance r
of the i
th point,
a
is the area of the observation window, n
is the number
of points in X
, and e_{ij}
is an edge correction
term (as described in Kest
).
The value of L_i(r)
can also be interpreted as one
of the summands that contributes to the global estimate of the L
function.
By default, the function L_i(r)
or
K_i(r)
is computed for a range of r
values
for each point i
. The results are stored as a function value
table (object of class "fv"
) with a column of the table
containing the function estimates for each point of the pattern
X
.
Alternatively, if the argument rvalue
is given, and it is a
single number, then the function will only be computed for this value
of r
, and the results will be returned as a numeric vector,
with one entry of the vector for each point of the pattern X
.
Inhomogeneous counterparts of localK
and localL
are computed by localKinhom
and localLinhom
.
Value
If rvalue
is given, the result is a numeric vector
of length equal to the number of points in the point pattern.
If rvalue
is absent, the result is
an object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Essentially a data frame containing columns
r |
the vector of values of the argument |
theo |
the theoretical value |
together with columns containing the values of the
neighbourhood density function for each point in the pattern.
Column i
corresponds to the i
th point.
The last two columns contain the r
and theo
values.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Getis, A. and Franklin, J. (1987) Second-order neighbourhood analysis of mapped point patterns. Ecology 68, 473–477.
See Also
Kest
,
Lest
,
localKinhom
,
localLinhom
.
Examples
X <- ponderosa
# compute all the local L functions
L <- localL(X)
# plot all the local L functions against r
plot(L, main="local L functions for ponderosa", legend=FALSE)
# plot only the local L function for point number 7
plot(L, iso007 ~ r)
# compute the values of L(r) for r = 12 metres
L12 <- localL(X, rvalue=12)
# Spatially interpolate the values of L12
# Compare Figure 5(b) of Getis and Franklin (1987)
X12 <- X %mark% L12
Z <- Smooth(X12, sigma=5, dimyx=128)
plot(Z, col=topo.colors(128), main="smoothed neighbourhood density")
contour(Z, add=TRUE)
points(X, pch=16, cex=0.5)
Local Multitype K Function (Cross-Type)
Description
For a multitype point pattern, computes the cross-type version of the local K function.
Usage
localKcross(X, from, to, ..., rmax = NULL,
correction = "Ripley", verbose = TRUE, rvalue=NULL)
localLcross(X, from, to, ..., rmax = NULL, correction = "Ripley")
Arguments
X |
A multitype point pattern (object of class |
... |
Further arguments passed from |
rmax |
Optional. Maximum desired value of the argument |
from |
Type of points from which distances should be measured.
A single value;
one of the possible levels of |
to |
Type of points to which distances should be measured.
A single value;
one of the possible levels of |
correction |
String specifying the edge correction to be applied.
Options are |
verbose |
Logical flag indicating whether to print progress reports during the calculation. |
rvalue |
Optional. A single value of the distance argument
|
Details
Given a multitype spatial point pattern X
,
the local cross-type K
function localKcross
is the local version of the multitype K
function
Kcross
.
Recall that Kcross(X, from, to)
is a sum of contributions
from all pairs of points in X
where
the first point belongs to type from
and the second point belongs to type to
.
The local cross-type K
function is defined for each point X[i]
that belongs to
type from
, and it consists of all the contributions to
the cross-type K
function that originate from point X[i]
:
K_{i,from,to}(r) = \frac{a}{n-1} \sum_j e_{ij}
where the sum is over all points j \neq i
belonging to type to
, that lie
within a distance r
of the i
th point,
a
is the area of the observation window, n
is the number
of points in X
, and e_{ij}
is an edge correction
term (as described in Kest
).
The value of K_{i,from,to}(r)
can also be interpreted as one
of the summands that contributes to the global estimate of the
Kcross
function.
By default, the function K_{i,from,to}(r)
is computed for a range of r
values
for each point i
belonging to type from
.
The results are stored as a function value
table (object of class "fv"
) with a column of the table
containing the function estimates for each point of the pattern
X
belonging to type from
.
Alternatively, if the argument rvalue
is given, and it is a
single number, then the function will only be computed for this value
of r
, and the results will be returned as a numeric vector,
with one entry of the vector for each point of the pattern X
belonging to type from
.
The local cross-type L
function localLcross
is computed by applying the transformation
L(r) = \sqrt{K(r)/\pi}
.
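For example, a sketch with explicit types:
K <- localKcross(amacrine, from="on", to="off")
L <- localLcross(amacrine, from="on", to="off")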
Value
If rvalue
is given, the result is a numeric vector
of length equal to the number of points in the point pattern
that belong to type from
.
If rvalue
is absent, the result is
an object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Essentially a data frame containing columns
r |
the vector of values of the argument |
theo |
the theoretical value |
together with columns containing the values of the
neighbourhood density function for each point in the pattern.
Column i
corresponds to the i
th point
of type from
.
The last two columns contain the r
and theo
values.
Author(s)
Ege Rubak rubak@math.aau.dk and Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
See Also
Kcross
,
Lcross
,
localK
,
localL
.
Inhomogeneous counterparts of localKcross and localLcross are computed by localKcross.inhom and localLcross.inhom.
Examples
X <- amacrine
# compute all the local Lcross functions
L <- localLcross(X)
# plot all the local Lcross functions against r
plot(L, main="local Lcross functions for amacrine", legend=FALSE)
# plot only the local L function for point number 7
plot(L, iso007 ~ r)
# compute the values of L(r) for r = 0.1 metres
L12 <- localLcross(X, rvalue=0.1)
Inhomogeneous Multitype K Function
Description
Computes spatially-weighted versions of
the local multitype K
-function or L
-function.
Usage
localKcross.inhom(X, from, to,
lambdaFrom=NULL, lambdaTo=NULL,
..., rmax = NULL,
correction = "Ripley", sigma=NULL, varcov=NULL,
lambdaX=NULL, update=TRUE, leaveoneout=TRUE)
localLcross.inhom(X, from, to,
lambdaFrom=NULL, lambdaTo=NULL, ..., rmax = NULL)
Arguments
X |
A point pattern (object of class |
from |
Type of points from which distances should be measured.
A single value;
one of the possible levels of |
to |
Type of points to which distances should be measured.
A single value;
one of the possible levels of |
lambdaFrom , lambdaTo |
Optional.
Values of the estimated intensity function
for the points of type |
... |
Extra arguments. Ignored if |
rmax |
Optional. Maximum desired value of the argument |
correction |
String specifying the edge correction to be applied.
Options are |
sigma , varcov |
Optional arguments passed to |
lambdaX |
Optional.
Values of the estimated intensity function
for all points of |
update |
Logical value indicating what to do when |
leaveoneout |
Logical value (passed to |
Details
The functions localKcross.inhom
and localLcross.inhom
are inhomogeneous or weighted versions of the
local multitype K
and L
functions implemented in
localKcross
and localLcross
.
Given a multitype spatial point pattern X
,
and two designated types from
and to
,
the local multitype K
function is
defined for each point X[i]
that belongs to type from
,
and is computed by
K_i(r) = \sum_j \frac{e_{ij}}{\lambda_j}
where the sum is over all points j \neq i
of type to
that lie
within a distance r
of the i
th point,
\lambda_j
is the estimated intensity of the
point pattern at the point j
,
and e_{ij}
is an edge correction
term (as described in Kest
).
The function
K_i(r)
is computed for a range of r
values
for each point i
. The results are stored as a function value
table (object of class "fv"
) with a column of the table
containing the function estimates for each point of the pattern
X
of type from
.
The corresponding L
function
L_i(r)
is computed by applying the
transformation
L(r) = \sqrt{K(r)/\pi}
.
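For example, a sketch supplying intensity values estimated separately for each type (assuming numeric vectors of fitted intensity values are acceptable for lambdaFrom and lambdaTo):
X <- amacrine
lamOn  <- density(split(X)$on,  at="points")
lamOff <- density(split(X)$off, at="points")
K <- localKcross.inhom(X, from="on", to="off",
                       lambdaFrom=lamOn, lambdaTo=lamOff)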
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Essentially a data frame containing columns
r |
the vector of values of the argument |
theo |
the theoretical value |
together with columns containing the values of the
neighbourhood density function for each point in the pattern
of type from
.
The last two columns contain the r
and theo
values.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Kinhom
,
Linhom
,
localK
,
localL
.
Examples
X <- amacrine
# compute all the local L functions
L <- localLcross.inhom(X)
# plot all the local L functions against r
plot(L, main="local L functions for ponderosa", legend=FALSE)
# plot only the local L function for point number 7
plot(L, iso007 ~ r)
Local Multitype K Function (Dot-Type)
Description
For a multitype point pattern, computes the dot-type version of the local K function.
Usage
localKdot(X, from, ..., rmax = NULL,
correction = "Ripley", verbose = TRUE, rvalue=NULL)
localLdot(X, from, ..., rmax = NULL, correction = "Ripley")
Arguments
X |
A multitype point pattern (object of class |
... |
Further arguments passed from |
rmax |
Optional. Maximum desired value of the argument |
from |
Type of points from which distances should be measured.
A single value;
one of the possible levels of |
correction |
String specifying the edge correction to be applied.
Options are |
verbose |
Logical flag indicating whether to print progress reports during the calculation. |
rvalue |
Optional. A single value of the distance argument
|
Details
Given a multitype spatial point pattern X
,
the local dot-type K
function localKdot
is the local version of the multitype K
function
Kdot
.
Recall that Kdot(X, from)
is a sum of contributions
from all pairs of points in X
where
the first point belongs to type from
.
The local dot-type K
function is defined for each point X[i]
that belongs to
type from
, and it consists of all the contributions to
the dot-type K
function that originate from point X[i]
:
K_{i,from}(r) = \frac{a}{n-1} \sum_j e_{ij}
where the sum is over all points j \neq i
that lie within a distance r
of the i
th point,
a
is the area of the observation window, n
is the number
of points in X
, and e_{ij}
is an edge correction
term (as described in Kest
).
The value of K_{i,from}(r)
can also be interpreted as one
of the summands that contributes to the global estimate of the
Kdot
function.
By default, the function K_{i,from}(r)
is computed for a range of r
values
for each point i
belonging to type from
.
The results are stored as a function value
table (object of class "fv"
) with a column of the table
containing the function estimates for each point of the pattern
X
belonging to type from
.
Alternatively, if the argument rvalue
is given, and it is a
single number, then the function will only be computed for this value
of r
, and the results will be returned as a numeric vector,
with one entry of the vector for each point of the pattern X
belonging to type from
.
The local dot-type L
function localLdot
is computed by applying the transformation
L(r) = \sqrt{K(r)/(2\pi)}
.
Value
If rvalue is given, the result is a numeric vector of length equal to the number of points in the point pattern that belong to type from.
If rvalue is absent, the result is an object of class "fv", see fv.object, which can be plotted directly using plot.fv.
Essentially a data frame containing columns
r | the vector of values of the argument r |
theo | the theoretical value |
together with columns containing the values of the neighbourhood density function for each point in the pattern. Column i corresponds to the i th point of type from. The last two columns contain the r and theo values.
Author(s)
Ege Rubak rubak@math.aau.dk and Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
See Also
Kdot, localK, localL.
Examples
X <- amacrine
# compute all the local Ldot functions
L <- localLdot(X)
# plot all the local Ldot functions against r
plot(L, main="local Ldot functions for amacrine", legend=FALSE)
# plot only the local L function for point number 7
plot(L, iso007 ~ r)
# compute the values of L(r) for r = 0.1
L01 <- localLdot(X, rvalue=0.1)
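The origin type can also be specified explicitly through the from argument. A minimal sketch:
# local dot-type L functions measured from the 'on' cells only
Lon <- localLdot(X, from="on")
plot(Lon, legend=FALSE)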
Inhomogeneous Neighbourhood Density Function
Description
Computes spatially-weighted versions of the local K-function or L-function.
Usage
localKinhom(X, lambda, ..., rmax = NULL,
correction = "Ripley", verbose = TRUE, rvalue=NULL,
sigma = NULL, varcov = NULL, update=TRUE, leaveoneout=TRUE)
localLinhom(X, lambda, ..., rmax = NULL,
correction = "Ripley", verbose = TRUE, rvalue=NULL,
sigma = NULL, varcov = NULL, update=TRUE, leaveoneout=TRUE)
Arguments
X |
A point pattern (object of class |
lambda |
Optional.
Values of the estimated intensity function.
Either a vector giving the intensity values
at the points of the pattern |
... |
Extra arguments. Ignored if |
rmax |
Optional. Maximum desired value of the argument |
correction |
String specifying the edge correction to be applied.
Options are |
verbose |
Logical flag indicating whether to print progress reports during the calculation. |
rvalue |
Optional. A single value of the distance argument
|
sigma , varcov |
Optional arguments passed to |
leaveoneout |
Logical value (passed to |
update |
Logical value indicating what to do when |
Details
The functions localKinhom and localLinhom are inhomogeneous or weighted versions of the neighbourhood density function implemented in localK and localL.
Given a spatial point pattern X, the inhomogeneous neighbourhood density function L_i(r) associated with the i th point in X is computed by
L_i(r) = \sqrt{\frac 1 \pi \sum_j \frac{e_{ij}}{\lambda_j}}
where the sum is over all points j \neq i that lie within a distance r of the i th point, \lambda_j is the estimated intensity of the point pattern at the point j, and e_{ij} is an edge correction term (as described in Kest). The value of L_i(r) can also be interpreted as one of the summands that contributes to the global estimate of the inhomogeneous L function (see Linhom).
By default, the function L_i(r) or K_i(r) is computed for a range of r values for each point i. The results are stored as a function value table (object of class "fv") with a column of the table containing the function estimates for each point of the pattern X.
Alternatively, if the argument rvalue is given, and it is a single number, then the function will only be computed for this value of r, and the results will be returned as a numeric vector, with one entry of the vector for each point of the pattern X.
Value
If rvalue is given, the result is a numeric vector of length equal to the number of points in the point pattern.
If rvalue is absent, the result is an object of class "fv", see fv.object, which can be plotted directly using plot.fv.
Essentially a data frame containing columns
r | the vector of values of the argument r |
theo | the theoretical value |
together with columns containing the values of the neighbourhood density function for each point in the pattern. Column i corresponds to the i th point. The last two columns contain the r and theo values.
Author(s)
Mike Kuhn, Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Kinhom, Linhom, localK, localL.
Examples
X <- ponderosa
# compute all the local L functions
L <- localLinhom(X)
# plot all the local L functions against r
plot(L, main="local L functions for ponderosa", legend=FALSE)
# plot only the local L function for point number 7
plot(L, iso007 ~ r)
# compute the values of L(r) for r = 12 metres
L12 <- localLinhom(X, rvalue=12)
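The intensity may also be supplied explicitly through lambda rather than estimated internally. A minimal sketch, in which the kernel bandwidth sigma=50 is an arbitrary illustrative choice:
# intensity values at the data points, by leave-one-out kernel smoothing
lam <- density(X, sigma=50, at="points", leaveoneout=TRUE)
Lexp <- localLinhom(X, lambda=lam)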
Local pair correlation function
Description
Computes individual contributions to the pair correlation function from each data point.
Usage
localpcf(X, ..., delta=NULL, rmax=NULL, nr=512, stoyan=0.15, rvalue=NULL)
localpcfinhom(X, ..., delta=NULL, rmax=NULL, nr=512, stoyan=0.15,
lambda=NULL, sigma=NULL, varcov=NULL,
update=TRUE, leaveoneout=TRUE, rvalue=NULL)
Arguments
X |
A point pattern (object of class |
delta |
Smoothing bandwidth for pair correlation. The halfwidth of the Epanechnikov kernel. |
rmax |
Optional. Maximum value of distance |
nr |
Optional. Number of values of distance |
stoyan |
Optional. The value of the constant |
lambda |
Optional.
Values of the estimated intensity function, for the
inhomogeneous pair correlation.
Either a vector giving the intensity values
at the points of the pattern |
sigma , varcov , ... |
These arguments are ignored by |
leaveoneout |
Logical value (passed to |
update |
Logical value indicating what to do when |
rvalue |
Optional. A single value of the distance argument
|
Details
localpcf computes the contribution, from each individual data point in a point pattern X, to the empirical pair correlation function of X. These contributions are sometimes known as LISA (local indicator of spatial association) functions based on pair correlation.
localpcfinhom computes the corresponding contribution to the inhomogeneous empirical pair correlation function of X.
Given a spatial point pattern X, the local pcf g_i(r) associated with the i th point in X is computed by
g_i(r) = \frac{a}{2 \pi n r} \sum_j k(d_{ij} - r)
where the sum is over all points j \neq i, a is the area of the observation window, n is the number of points in X, and d_{ij} is the distance between points i and j. Here k is the Epanechnikov kernel,
k(t) = \frac{3}{4\delta} \max\left(0, 1 - \frac{t^2}{\delta^2}\right).
Edge correction is performed using the border method (for the sake of computational efficiency): the estimate g_i(r) is set to NA if r > b_i, where b_i is the distance from point i to the boundary of the observation window.
The smoothing bandwidth \delta may be specified. If not, it is chosen by Stoyan's rule of thumb \delta = c/\sqrt{\hat\lambda}, where \hat\lambda = n/a is the estimated intensity and c is a constant, usually taken to be 0.15. The value of c is controlled by the argument stoyan.
For localpcfinhom, the optional argument lambda specifies the values of the estimated intensity function. If lambda is given, it should be either a numeric vector giving the intensity values at the points of the pattern X, a pixel image (object of class "im") giving the intensity values at all locations, a fitted point process model (object of class "ppm", "kppm" or "dppm") or a function(x,y) which can be evaluated to give the intensity value at any location. If lambda is not given, then it will be estimated using a leave-one-out kernel density smoother as described in pcfinhom.
Alternatively, if the argument rvalue is given, and it is a single number, then the function will only be computed for this value of r, and the results will be returned as a numeric vector, with one entry of the vector for each point of the pattern X.
Value
An object of class "fv", see fv.object, which can be plotted directly using plot.fv.
Essentially a data frame containing columns
r | the vector of values of the argument r |
theo | the theoretical value |
together with columns containing the values of the local pair correlation function for each point in the pattern. Column i corresponds to the i th point. The last two columns contain the r and theo values.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
localK, localKinhom, pcf, pcfinhom.
Examples
X <- ponderosa
g <- localpcf(X, stoyan=0.5)
colo <- c(rep("grey", npoints(X)), "blue")
a <- plot(g, main=c("local pair correlation functions", "Ponderosa pines"),
legend=FALSE, col=colo, lty=1)
# plot only the local pair correlation function for point number 7
plot(g, est007 ~ r)
# Extract the local pair correlation at distance 15 metres, for each point
g15 <- localpcf(X, rvalue=15, stoyan=0.5)
g15[1:10]
# Check that the value for point 7 agrees with the curve for point 7:
points(15, g15[7], col="red")
# Inhomogeneous
gi <- localpcfinhom(X, stoyan=0.5)
a <- plot(gi, main=c("inhomogeneous local pair correlation functions",
"Ponderosa pines"),
legend=FALSE, col=colo, lty=1)
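An explicit intensity estimate can likewise be passed to localpcfinhom through lambda. A minimal sketch, in which the bandwidth sigma=30 is an arbitrary illustrative choice:
# supply the intensity as a pixel image
gi2 <- localpcfinhom(X, stoyan=0.5, lambda=density(X, sigma=30))
plot(gi2, legend=FALSE, col=colo, lty=1)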
Bootstrap Confidence Bands for Summary Function
Description
Computes a bootstrap confidence band for a summary function of a point process.
Usage
lohboot(X,
fun=c("pcf", "Kest", "Lest", "pcfinhom", "Kinhom", "Linhom",
"Kcross", "Lcross", "Kdot", "Ldot",
"Kcross.inhom", "Lcross.inhom"),
...,
block=FALSE, global=FALSE, basicboot=FALSE, Vcorrection=FALSE,
confidence=0.95, nx = 4, ny = nx, nsim=200, type=7)
Arguments
X |
A point pattern (object of class |
fun |
Name of the summary function for which confidence intervals are
desired: one of the strings |
... |
Arguments passed to the corresponding local version of the summary function (see Details). |
block |
Logical value indicating whether to use Loh's block bootstrap
as originally proposed. Default is |
global |
Logical. If |
basicboot |
Logical value indicating whether to use the so-called basic bootstrap confidence interval. See Details. |
Vcorrection |
Logical value indicating whether to use a variance correction
when |
confidence |
Confidence level, as a fraction between 0 and 1. |
nx , ny |
Integers.
If |
nsim |
Number of bootstrap simulations. |
type |
Integer. Type of quantiles.
Argument passed to |
Details
This algorithm computes confidence bands for the true value of the summary function fun using the bootstrap method of Loh (2008) and a modification described in Baddeley, Rubak, Turner (2015).
If fun="pcf", for example, the algorithm computes a pointwise (100 * confidence)% confidence interval for the true value of the pair correlation function for the point process, normally estimated by pcf. It starts by computing the array of local pair correlation functions, localpcf, of the data pattern X. This array consists of the contributions to the estimate of the pair correlation function from each data point.
If block=FALSE, these contributions are resampled nsim times with replacement as described in Baddeley, Rubak, Turner (2015); from each resampled dataset the total contribution is computed, yielding nsim random pair correlation functions.
If block=TRUE, the calculation is performed as originally proposed by Loh (2008, 2010). The (bounding box of the) window is divided into nx * ny rectangles (blocks). The average contribution of a block is obtained by averaging the contribution of each point included in the block. Then, the average contributions on each block are resampled nsim times with replacement as described in Loh (2008) and Loh (2010); from each resampled dataset the total contribution is computed, yielding nsim random pair correlation functions. Notice that for non-rectangular windows any blocks not fully contained in the window are discarded before doing the resampling, so the effective number of blocks may be substantially smaller than nx * ny in this case.
The pointwise alpha/2 and 1 - alpha/2 quantiles of these functions are computed, where alpha = 1 - confidence. The average of the local functions is also computed as an estimate of the pair correlation function.
There are several ways to define a bootstrap confidence interval. If basicboot=TRUE, the so-called basic bootstrap confidence interval is used, as described in Loh (2008).
Loh (2010) observed that, when the intensity of the point process is unknown, the bootstrap error estimate is larger than it should be. When the K function is used, an adjustment procedure proposed in Loh (2010) is applied if Vcorrection=TRUE. In this case, the basic bootstrap confidence interval is implicitly used.
To control the estimation algorithm, use the arguments ..., which are passed to the corresponding local version of the summary function (localpcf, localK, localKinhom and so on).
For fun="Lest", the calculations are first performed as if fun="Kest", and then the square-root transformation is applied to obtain the L-function. Similarly for fun="Linhom", "Lcross", "Ldot", "Lcross.inhom".
Note that the confidence bands computed by lohboot(fun="pcf") may not contain the estimate of the pair correlation function computed by pcf, because of differences between the algorithm parameters (such as the choice of edge correction) in localpcf and pcf. If you are using lohboot, the appropriate point estimate of the pair correlation itself is the pointwise mean of the local estimates, which is provided in the result of lohboot and is shown in the default plot.
If the confidence bands seem unbelievably narrow, this may occur because the point pattern has a hard core (the true pair correlation function is zero for certain values of distance), or because of an optical illusion when the function is steeply sloping (remember that the width of the confidence bands should be measured vertically).
An alternative to lohboot is varblock.
Value
A function value table (object of class "fv") containing columns giving the estimate of the summary function, the upper and lower limits of the bootstrap confidence interval, and the theoretical value of the summary function for a Poisson process.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net, Ege Rubak rubak@math.aau.dk and Christophe Biscio.
References
Baddeley, A., Rubak, E. and Turner, R. (2015) Spatial Point Patterns: Methodology and Applications with R. Chapman and Hall/CRC Press.
Loh, J.M. (2008) A valid and fast spatial bootstrap for correlation functions. The Astrophysical Journal, 681, 726–734.
Loh, J.M. (2010) Bootstrapping an inhomogeneous point process. Journal of Statistical Planning and Inference, 140, 734–749.
See Also
Summary functions Kest, pcf, Kinhom, pcfinhom, localK, localpcf, localKinhom, localpcfinhom, localKcross, localKdot, localLcross, localLdot, localKcross.inhom, localLcross.inhom.
See varblock for an alternative bootstrap technique.
Examples
p <- lohboot(simdat, stoyan=0.5)
g <- lohboot(simdat, stoyan=0.5, block=TRUE)
g
plot(g)
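Loh's block bootstrap and simultaneous (global) bands are selected through the block and global arguments. A minimal sketch:
# block bootstrap of the K function with global confidence bands
Kg <- lohboot(simdat, fun="Kest", block=TRUE, nx=3, ny=3, global=TRUE)
plot(Kg)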
Mark Connection Function
Description
Estimate the mark connection function of a multitype point pattern.
Usage
markconnect(X, i, j, r=NULL,
correction=c("isotropic", "Ripley", "translate"),
method="density", ..., normalise=FALSE)
Arguments
X |
The observed point pattern.
An object of class |
i |
Number or character string identifying the type (mark value)
of the points in |
j |
Number or character string identifying the type (mark value)
of the points in |
r |
numeric vector. The values of the argument |
correction |
A character vector containing any selection of the
options |
method |
A character vector indicating the user's choice of
density estimation technique to be used. Options are
|
... |
Arguments passed to |
normalise |
If |
Details
The mark connection function p_{ij}(r) of a multitype point process X is a measure of the dependence between the types of two points of the process a distance r apart.
Informally p_{ij}(r) is defined as the conditional probability, given that there is a point of the process at a location u and another point of the process at a location v separated by a distance ||u-v|| = r, that the first point is of type i and the second point is of type j. See Stoyan and Stoyan (1994).
If the marks attached to the points of X are independent and identically distributed, then p_{ij}(r) \equiv p_i p_j where p_i denotes the probability that a point is of type i. Values larger than this, p_{ij}(r) > p_i p_j, indicate positive association between the two types, while smaller values indicate negative association.
The argument X must be a point pattern (object of class "ppp") or any data that are acceptable to as.ppp. It must be a multitype point pattern (a marked point pattern with factor-valued marks).
The argument r is the vector of values for the distance r at which p_{ij}(r) is estimated. There is a sensible default.
This algorithm assumes that X can be treated as a realisation of a stationary (spatially homogeneous) random spatial point process in the plane, observed through a bounded window. The window (which is specified in X as Window(X)) may have arbitrary shape. Biases due to edge effects are treated in the same manner as in Kest. The edge corrections implemented here are
- isotropic/Ripley
Ripley's isotropic correction (see Ripley, 1988; Ohser, 1983). This is implemented only for rectangular and polygonal windows (not for binary masks) and is slow for complicated polygons.
- translate
Translation correction (Ohser, 1983). Implemented for all window geometries.
- none
No edge correction.
The option correction="none" should only be used if the number of data points is extremely large (otherwise an edge correction is needed to correct bias). Note that the estimator assumes the process is stationary (spatially homogeneous).
The mark connection function is estimated using density estimation techniques. The user can choose between
- "density"
which uses the standard kernel density estimation routine density, and works only for evenly-spaced r values;
- "loess"
which uses the function loess in the stats package;
- "sm"
which uses the function sm.density in the package sm and is extremely slow;
- "smrep"
which uses the function sm.density in the package sm and is relatively fast, but may require manual control of the smoothing parameter hmult.
Value
An object of class "fv" (see fv.object).
Essentially a data frame containing numeric columns
r | the values of the argument r |
theo | the theoretical value of p_{ij}(r) when the marks are independent, namely p_i p_j |
together with a column or columns named "iso" and/or "trans", according to the selected edge corrections. These columns contain estimates of the function p_{ij}(r) obtained by the edge corrections named.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Ohser, J. (1983) On estimators for the reduced second moment measure of point processes. Mathematische Operationsforschung und Statistik, series Statistics, 14, 63–71.
Ripley, B.D. (1988) Statistical inference for spatial processes. Cambridge University Press.
Stoyan, D. and Stoyan, H. (1994) Fractals, random shapes and point fields: methods of geometrical statistics. John Wiley and Sons.
See Also
Multitype pair correlation pcfcross
and multitype K-functions Kcross
, Kdot
.
Use alltypes
to compute the mark connection functions
between all pairs of types.
Mark correlation markcorr
and
mark variogram markvario
for numeric-valued marks.
Examples
# Hughes' amacrine data
# Cells marked as 'on'/'off'
M <- markconnect(amacrine, "on", "off")
plot(M)
# Compute for all pairs of types at once
plot(alltypes(amacrine, markconnect))
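Under the independent-marks benchmark described in the Details, p_{ij}(r) is constant and equal to p_i p_j. A minimal sketch comparing the estimate with this benchmark:
plot(M)
p <- prop.table(table(marks(amacrine)))
abline(h = p[["on"]] * p[["off"]], lty=3)   # reference level p_i * p_j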
Mark Correlation Function
Description
Estimate the marked correlation function of a marked point pattern.
Usage
markcorr(X, f = function(m1, m2) { m1 * m2}, r=NULL,
correction=c("isotropic", "Ripley", "translate"),
method="density", ..., weights=NULL,
f1=NULL, normalise=TRUE, fargs=NULL, internal=NULL)
Arguments
X |
The observed point pattern.
An object of class |
f |
Optional. Test function |
r |
Optional. Numeric vector. The values of the argument |
correction |
A character vector containing any selection of the
options |
method |
A character vector indicating the user's choice of
density estimation technique to be used. Options are
|
... |
Arguments passed to the density estimation routine
( |
weights |
Optional. Numeric weights for each data point in |
f1 |
An alternative to |
normalise |
If |
fargs |
Optional. A list of extra arguments to be passed to the function
|
internal |
Do not use this argument. |
Details
By default, this command calculates an estimate of Stoyan's mark correlation k_{mm}(r) for the point pattern. Alternatively if the argument f or f1 is given, then it calculates Stoyan's generalised mark correlation k_f(r) with test function f.
Theoretical definitions are as follows (see Stoyan and Stoyan (1994, p. 262)):
- For a point process X with numeric marks, Stoyan's mark correlation function k_{mm}(r) is
k_{mm}(r) = \frac{E_{0u}[M(0) M(u)]}{E[M \, M']}
where E_{0u} denotes the conditional expectation given that there are points of the process at the locations 0 and u separated by a distance r, and where M(0), M(u) denote the marks attached to these two points. On the denominator, M, M' are random marks drawn independently from the marginal distribution of marks, and E is the usual expectation.
- For a multitype point process X, the mark correlation is
k_{mm}(r) = \frac{P_{0u}[M(0) = M(u)]}{P[M = M']}
where P and P_{0u} denote the probability and conditional probability.
- The generalised mark correlation function k_f(r) of a marked point process X, with test function f, is
k_f(r) = \frac{E_{0u}[f(M(0),M(u))]}{E[f(M,M')]}
The test function f is any function f(m_1,m_2) with two arguments which are possible marks of the pattern, and which returns a nonnegative real value. Common choices of f are: for continuous nonnegative real-valued marks, f(m_1,m_2) = m_1 m_2; for discrete marks (multitype point patterns), f(m_1,m_2) = 1(m_1 = m_2); and for marks taking values in [0,2\pi), f(m_1,m_2) = \sin(m_1 - m_2).
Note that k_f(r) is not a “correlation” in the usual statistical sense. It can take any nonnegative real value. The value 1 suggests “lack of correlation”: if the marks attached to the points of X are independent and identically distributed, then k_f(r) \equiv 1. The interpretation of values larger or smaller than 1 depends on the choice of function f.
The argument X must be a point pattern (object of class "ppp") or any data that are acceptable to as.ppp. It must be a marked point pattern.
The argument f determines the function to be applied to pairs of marks. It has a sensible default, which depends on the kind of marks in X. If the marks are numeric values, then f <- function(m1, m2) { m1 * m2 } computes the product of two marks. If the marks are a factor (i.e. if X is a multitype point pattern) then f <- function(m1, m2) { m1 == m2 } yields the value 1 when the two marks are equal, and 0 when they are unequal. These are the conventional definitions for numerical marks and multitype points respectively.
The argument f may be specified by the user. It must be an R function, accepting two arguments m1 and m2 which are vectors of equal length containing mark values (of the same type as the marks of X). (It may also take additional arguments, passed through fargs.) It must return a vector of numeric values of the same length as m1 and m2. The values must be non-negative, and NA values are not permitted.
Alternatively the user may specify the argument f1 instead of f. This indicates that the test function f should take the form f(u,v) = f_1(u) f_1(v) where f_1(u) is given by the argument f1. The argument f1 should be an R function with at least one argument. (It may also take additional arguments, passed through fargs.)
The argument r is the vector of values for the distance r at which k_f(r) is estimated.
This algorithm assumes that X can be treated as a realisation of a stationary (spatially homogeneous) random spatial point process in the plane, observed through a bounded window. The window (which is specified in X as Window(X)) may have arbitrary shape. Biases due to edge effects are treated in the same manner as in Kest. The edge corrections implemented here are
- isotropic/Ripley
Ripley's isotropic correction (see Ripley, 1988; Ohser, 1983). This is implemented only for rectangular and polygonal windows (not for binary masks).
- translate
Translation correction (Ohser, 1983). Implemented for all window geometries, but slow for complex windows.
Note that the estimator assumes the process is stationary (spatially homogeneous).
The numerator and denominator of the mark correlation function (in the expression above) are estimated using density estimation techniques. The user can choose between
- "density"
which uses the standard kernel density estimation routine density, and works only for evenly-spaced r values;
- "loess"
which uses the function loess in the stats package;
- "sm"
which uses the function sm.density in the package sm and is extremely slow;
- "smrep"
which uses the function sm.density in the package sm and is relatively fast, but may require manual control of the smoothing parameter hmult.
If normalise=FALSE then the algorithm will compute only the numerator
c_f(r) = E_{0u} f(M(0),M(u))
of the expression for the mark correlation function. In this case, negative values of f are permitted.
Value
A function value table (object of class "fv") or a list of function value tables, one for each column of marks.
An object of class "fv" (see fv.object) is essentially a data frame containing numeric columns
r | the values of the argument r |
theo | the theoretical value of k_f(r) when the marks are independent, namely 1 |
together with a column or columns named "iso" and/or "trans", according to the selected edge corrections. These columns contain estimates of the mark correlation function k_f(r) obtained by the edge corrections named.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Ohser, J. (1983) On estimators for the reduced second moment measure of point processes. Mathematische Operationsforschung und Statistik, series Statistics, 14, 63–71.
Ripley, B.D. (1988) Statistical inference for spatial processes. Cambridge University Press.
Stoyan, D. and Stoyan, H. (1994) Fractals, random shapes and point fields: methods of geometrical statistics. John Wiley and Sons.
See Also
Mark variogram markvario for numeric marks.
Mark connection function markconnect and multitype K-functions Kcross, Kdot for factor-valued marks.
Mark cross-correlation function markcrosscorr for point patterns with several columns of marks.
Kmark to estimate a cumulative function related to the mark correlation function.
Examples
# CONTINUOUS-VALUED MARKS:
# (1) Spruces
# marks represent tree diameter
# mark correlation function
ms <- markcorr(spruces)
plot(ms)
# (2) simulated data with independent marks
X <- rpoispp(100)
X <- X %mark% runif(npoints(X))
Xc <- markcorr(X)
plot(Xc)
# MULTITYPE DATA:
# Hughes' amacrine data
# Cells marked as 'on'/'off'
X <- if(interactive()) amacrine else amacrine[c(FALSE, TRUE)]
# (3) Kernel density estimate with Epanechnikov kernel
# (as proposed by Stoyan & Stoyan)
M <- markcorr(X, function(m1,m2) {m1==m2},
correction="translate", method="density",
kernel="epanechnikov")
# Note: kernel="epanechnikov" comes from help(density)
# (4) Same again with explicit control over bandwidth
M <- markcorr(X,
correction="translate", method="density",
kernel="epanechnikov", bw=0.02)
# see help(density) for correct interpretation of 'bw'
# weighted mark correlation
X <- if(interactive()) betacells else betacells[c(TRUE,FALSE)]
Y <- subset(X, select=type)
a <- marks(X)$area
v <- markcorr(Y, weights=a)
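The factorised form of the test function can be supplied through f1, as described in the Details. A minimal sketch, equivalent to the default f(m1,m2) = m1*m2 for numeric marks:
# generalised mark correlation with f(u,v) = f1(u) * f1(v), f1 = identity
ms2 <- markcorr(spruces, f1=function(m) { m })
plot(ms2)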
Mark Cross-Correlation Function
Description
Given a spatial point pattern with several columns of marks, this function computes the mark correlation function between each pair of columns of marks.
Usage
markcrosscorr(X, r = NULL,
correction = c("isotropic", "Ripley", "translate"),
method = "density", ..., normalise = TRUE, Xname = NULL)
Arguments
X |
The observed point pattern.
An object of class |
r |
Optional. Numeric vector. The values of the argument |
correction |
A character vector containing any selection of the
options |
method |
A character vector indicating the user's choice of
density estimation technique to be used. Options are
|
... |
Arguments passed to the density estimation routine
( |
normalise |
If |
Xname |
Optional character string name for the dataset |
Details
First, all columns of marks are converted to numerical values. A factor with m possible levels is converted to m columns of dummy (indicator) values.
Next, each pair of columns is considered, and the mark cross-correlation between the i th and j th columns is defined as
k_{ij}(r) = \frac{E_{0u}[M_i(0) M_j(u)]}{E[M_i \, M_j]}
where E_{0u} denotes the conditional expectation given that there are points of the process at the locations 0 and u separated by a distance r. On the numerator, M_i(0) and M_j(u) are the marks attached to locations 0 and u respectively, in the i th and j th columns of marks respectively. On the denominator, M_i and M_j are independent random values drawn from the i th and j th columns of marks, respectively, and E is the usual expectation.
Note that k_{ij}(r) is not a “correlation” in the usual statistical sense. It can take any nonnegative real value. The value 1 suggests “lack of correlation”: if the marks attached to the points of X are independent and identically distributed, then k_{ij}(r) \equiv 1.
The argument X must be a point pattern (object of class "ppp") or any data that are acceptable to as.ppp. It must be a marked point pattern.
The cross-correlations are estimated in the same manner as for markcorr.
Value
A function array (object of class "fasp") containing the mark cross-correlation functions for each possible pair of columns of marks.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
markcorr.
Examples
# The dataset 'betacells' has two columns of marks:
# 'type' (factor)
# 'area' (numeric)
if(interactive()) plot(betacells)
plot(markcrosscorr(betacells))
Mark-Mark Scatter Plot
Description
Generates the mark-mark scatter plot of a point pattern.
Usage
markmarkscatter(X, rmax, ..., col = NULL, symap = NULL, transform=I, jit=FALSE)
Arguments
X |
A point pattern (object of class |
rmax |
Maximum distance between pairs of points which contribute to the plot. |
... |
Additional arguments passed to |
transform |
Optional. A function which should be applied to the mark values. |
jit |
Logical value indicating whether mark values should be randomly
perturbed using |
col |
Optional. A vector of colour values, or a |
symap |
Optional. A |
Details
The mark-mark scatter plot (Ballani et al, 2019) is a scatterplot of the mark values of all pairs of distinct points in X which are closer than the distance rmax. The dots in the scatterplot are coloured according to the pairwise distance between the two spatial points. The plot is augmented by three curves explained by Ballani et al (2019).
If the marks only take a few different values, then it is usually appropriate to apply random perturbation (jitter) to the mark values, by setting jit=TRUE.
Value
NULL.
Author(s)
Adrian Baddeley (coded from the description in Ballani et al.)
References
Ballani, F., Pommerening, A. and Stoyan, D. (2019) Mark-mark scatterplots improve pattern analysis in spatial plant ecology. Ecological Informatics 49, 13–21.
Examples
markmarkscatter(longleaf, 10)
markmarkscatter(spruces, 10, jit=TRUE)
Tabulate Marks in Neighbourhood of Every Point in a Point Pattern
Description
Visit each point in a multitype point pattern, find the neighbouring points, and compile a frequency table of the marks of these neighbour points.
Usage
marktable(X, R, N, exclude=TRUE, collapse=FALSE)
Arguments
X |
A multitype point pattern.
An object of class |
R |
Neighbourhood radius. Incompatible with |
N |
Number of neighbours of each point. Incompatible with |
exclude |
Logical. If |
collapse |
Logical. If |
Details
This algorithm visits each point in the point pattern X, inspects all the neighbouring points within a radius R of the current point (or the N nearest neighbours of the current point), and compiles a frequency table of the marks attached to the neighbours.
The dataset X must be a multitype point pattern, that is, marks(X) must be a factor.
If collapse=FALSE (the default), the result is a two-dimensional contingency table with one row for each point in the pattern, and one column for each possible mark value. The [i,j] entry in the table gives the number of neighbours of point i that have mark j.
If collapse=TRUE, this contingency table is aggregated according to the type of point, so that the result is a contingency table with one row and one column for each possible mark value. The [i,j] entry in the table gives the number of neighbours of a point with mark i that have mark j.
To perform more complicated calculations on the neighbours of every point, use markstat or applynbd.
Value
A contingency table (object of class "table"). If collapse=FALSE, the table has one row for each point in X, and one column for each possible mark value. If collapse=TRUE, the table has one row and one column for each possible mark value.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
markstat, applynbd, Kcross, ppp.object, table.
Examples
head(marktable(amacrine, 0.1))
head(marktable(amacrine, 0.1, exclude=FALSE))
marktable(amacrine, N=1, collapse=TRUE)
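The collapsed table can be converted to row proportions, showing for each type the mark distribution among its neighbours. A minimal sketch using base R:
tab <- marktable(amacrine, R=0.1, collapse=TRUE)
prop.table(tab, margin=1)   # each row sums to 1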
Mark Variogram
Description
Estimate the mark variogram of a marked point pattern.
Usage
markvario(X, correction = c("isotropic", "Ripley", "translate"),
r = NULL, method = "density", ..., normalise=FALSE)
Arguments
X |
The observed point pattern.
An object of class |
correction |
A character vector containing any selection of the
options |
r |
numeric vector. The values of the argument |
method |
A character vector indicating the user's choice of
density estimation technique to be used. Options are
|
... |
Other arguments passed to |
normalise |
If |
Details
The mark variogram \gamma(r) of a marked point process X is a measure of the dependence between the marks of two points of the process a distance r apart. It is informally defined as
\gamma(r) = E\left[ \frac 1 2 (M_1 - M_2)^2 \right]
where E[ ] denotes expectation and M_1, M_2 are the marks attached to two points of the process a distance r apart.
The mark variogram of a marked point process is analogous, but not equivalent, to the variogram of a random field in geostatistics. See Waelder and Stoyan (1996).
Value
An object of class "fv" (see fv.object).
Essentially a data frame containing numeric columns
r | the values of the argument r |
theo | the theoretical value of \gamma(r) when the marks are independent, namely the sample variance of the marks |
together with a column or columns named "iso" and/or "trans", according to the selected edge corrections. These columns contain estimates of the function \gamma(r) obtained by the edge corrections named.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Cressie, N.A.C. (1991) Statistics for spatial data. John Wiley and Sons, 1991.
Mase, S. (1996) The threshold method for estimating annual rainfall. Annals of the Institute of Statistical Mathematics 48 (1996) 201-213.
Waelder, O. and Stoyan, D. (1996) On variograms in point process statistics. Biometrical Journal 38 (1996) 895-905.
See Also
Mark correlation function markcorr for numeric marks.
Mark connection function markconnect and multitype K-functions Kcross, Kdot for factor-valued marks.
Examples
# Longleaf Pine data
# marks represent tree diameter
# Subset of this large pattern
swcorner <- owin(c(0,100),c(0,100))
sub <- longleaf[ , swcorner]
# mark variogram
mv <- markvario(sub)
plot(mv)
Methods for Intensity Functions of Two Spatial Covariates
Description
These are methods for the class "rho2hat"
.
Usage
## S3 method for class 'rho2hat'
plot(x, ..., do.points=FALSE)
## S3 method for class 'rho2hat'
print(x, ...)
## S3 method for class 'rho2hat'
predict(object, ..., relative=FALSE)
Arguments
x , object |
An object of class |
... |
Arguments passed to other methods. |
do.points |
Logical value indicating whether to plot the observed values of the covariates at the data points. |
relative |
Logical value indicating whether to compute the
estimated point process intensity ( |
Details
These functions are methods for the generic commands print, predict and plot for the class "rho2hat".
An object of class "rho2hat" is an estimate of the intensity of a point process, as a function of two given spatial covariates. See rho2hat.
The method plot.rho2hat displays the estimated function \rho. The two axes of this plot represent possible values of the two covariates. If do.points=TRUE, the observed values of the covariates at the data points are also plotted.
The method predict.rho2hat computes a pixel image of the intensity \rho(Z_1(u), Z_2(u)) at each spatial location u, where Z_1(u) and Z_2(u) are the two spatial covariates.
Value
For predict.rho2hat the value is a pixel image (object of class "im"). For other functions, the value is NULL.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
See Also
rho2hat.
Examples
r2 <- with(bei.extra, rho2hat(bei, elev, grad))
r2
plot(r2)
plot(predict(r2))
Methods for Intensity Functions of Spatial Covariate
Description
These are methods for the class "rhohat"
.
Usage
## S3 method for class 'rhohat'
print(x, ...)
## S3 method for class 'rhohat'
plot(x, ..., do.rug=TRUE)
## S3 method for class 'rhohat'
predict(object, ..., relative=FALSE,
what=c("rho", "lo", "hi", "se"))
## S3 method for class 'rhohat'
simulate(object, nsim=1, ..., drop=TRUE)
Arguments
x , object |
An object of class |
... |
Arguments passed to other methods. |
do.rug |
Logical value indicating whether to plot the observed values of the covariate as a rug plot along the horizontal axis. |
relative |
Logical value indicating whether to compute the
estimated point process intensity ( |
nsim |
Number of simulations to be generated. |
drop |
Logical value indicating what to do when |
what |
Optional character string (partially matched) specifying which
value should be calculated: either the function estimate ( |
Details
These functions are methods for the generic commands print, plot, predict and simulate for the class "rhohat".
An object of class "rhohat" is an estimate of the intensity of a point process, as a function of a given spatial covariate. See rhohat.
The method plot.rhohat displays the estimated function \rho using plot.fv, and optionally adds a rug plot of the observed values of the covariate.
The method predict.rhohat computes a pixel image of the intensity \rho(Z(u)) at each spatial location u, where Z is the spatial covariate.
The method simulate.rhohat invokes predict.rhohat to determine the predicted intensity, and then simulates a Poisson point process with this intensity.
Value
For predict.rhohat the value is a pixel image (object of class "im" or "linim"). For simulate.rhohat the value is a point pattern (object of class "ppp" or "lpp"). For other functions, the value is NULL.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
See Also
rhohat.
Examples
X <- rpoispp(function(x,y){exp(3+3*x)})
rho <- rhohat(X, function(x,y){x})
rho
plot(rho)
Y <- predict(rho)
plot(Y)
plot(simulate(rho), add=TRUE)
#
if(require("spatstat.model")) {
fit <- ppm(X, ~x)
rho <- rhohat(fit, "y")
opa <- par(mfrow=c(1,2))
plot(predict(rho))
plot(predict(rho, relative=TRUE))
par(opa)
plot(predict(rho, what="se"))
}
Methods for Spatially Sampled Functions
Description
Methods for various generic commands, for the class
"ssf"
of spatially sampled functions.
Usage
## S3 method for class 'ssf'
marks(x, ...)
## S3 replacement method for class 'ssf'
marks(x, ...) <- value
## S3 method for class 'ssf'
unmark(X)
## S3 method for class 'ssf'
as.im(X, ...)
## S3 method for class 'ssf'
as.function(x, ...)
## S3 method for class 'ssf'
as.ppp(X, ...)
## S3 method for class 'ssf'
print(x, ..., brief=FALSE)
## S3 method for class 'ssf'
summary(object, ...)
## S3 method for class 'ssf'
range(x, ...)
## S3 method for class 'ssf'
min(x, ...)
## S3 method for class 'ssf'
max(x, ...)
## S3 method for class 'ssf'
integral(f, domain=NULL, ..., weights=attr(f, "weights"))
Arguments
x , X , f , object |
A spatially sampled function (object of class |
... |
Arguments passed to the default method. |
brief |
Logical value controlling the amount of detail printed. |
value |
Matrix of replacement values for the function. |
domain |
Optional.
Domain of integration. An object of class |
weights |
Optional. Numeric vector of quadrature weights associated with the sample points. |
Details
An object of class "ssf" represents a function (real- or vector-valued) that has been sampled at a finite set of points.
The commands documented here are methods for this class, for the generic commands marks, marks<-, unmark, as.im, as.function, as.ppp, print, summary, range, min, max and integral.
Value
marks returns a matrix. marks(x) <- value returns an object of class "ssf".
as.owin returns a window (object of class "owin"). as.ppp and unmark return a point pattern (object of class "ppp"). as.function returns a function(x,y) of class "funxy".
print returns NULL. summary returns an object of class "summary.ssf" which has a print method. range returns a numeric vector of length 2. min and max return a single numeric value.
integral returns a numeric or complex value, vector, or matrix. integral(f) returns a numeric or complex value (if f had numeric or complex values) or a numeric vector (if f had vector values). If domain is a tessellation then integral(f, domain) returns a numeric or complex vector with one entry for each tile (if f had numeric or complex values) or a numeric matrix with one row for each tile (if f had vector values).
Author(s)
Adrian Baddeley
See Also
ssf.
Examples
g <- distfun(cells[1:4])
X <- rsyst(Window(cells), 10)
f <- ssf(X, g(X))
f
summary(f)
marks(f)
as.ppp(f)
as.im(f)
integral(f)
integral(f, quadrats(Window(f), 3))
Morisita Index Plot
Description
Displays the Morisita Index Plot of a spatial point pattern.
Usage
miplot(X, ...)
Arguments
X |
A point pattern (object of class |
... |
Optional arguments to control the appearance of the plot. |
Details
Morisita (1959) defined an index of spatial aggregation for a spatial point pattern based on quadrat counts. The spatial domain of the point pattern is first divided into Q subsets (quadrats) of equal size and shape. The numbers of points falling in each quadrat are counted. Then the Morisita Index is computed as
\mbox{MI} = Q \frac{\sum_{i=1}^Q n_i (n_i - 1)}{N(N-1)}
where n_i is the number of points falling in the i-th quadrat, and N is the total number of points. If the pattern is completely random, MI should be approximately equal to 1. Values of MI greater than 1 suggest clustering.
The Morisita Index plot is a plot of the Morisita Index MI against the linear dimension of the quadrats. The point pattern dataset is divided into 2 \times 2 quadrats, then 3 \times 3 quadrats, etc, and the Morisita Index is computed each time. This plot is an attempt to discern different scales of dependence in the point pattern data.
Value
None.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
M. Morisita (1959) Measuring of the dispersion of individuals and analysis of the distributional patterns. Memoir of the Faculty of Science, Kyushu University, Series E: Biology. 2: 215–235.
See Also
quadratcount.
Examples
miplot(longleaf)
opa <- par(mfrow=c(2,3))
plot(cells)
plot(japanesepines)
plot(redwood)
miplot(cells)
miplot(japanesepines)
miplot(redwood)
par(opa)
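The index itself is easy to compute directly from quadrat counts, which can help in reading the plot. A minimal sketch of the formula above, using quadratcount from spatstat.geom:
n <- as.vector(quadratcount(cells, nx=3, ny=3))
N <- npoints(cells)
Q <- length(n)
Q * sum(n * (n - 1)) / (N * (N - 1))   # Morisita Index for 3 x 3 quadrats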
Nearest Neighbour Clutter Removal
Description
Detect features in a 2D or 3D spatial point pattern using nearest neighbour clutter removal.
Usage
nnclean(X, k, ...)
## S3 method for class 'ppp'
nnclean(X, k, ...,
edge.correct = FALSE, wrap = 0.1,
convergence = 0.001, plothist = FALSE,
verbose = TRUE, maxit = 50)
## S3 method for class 'pp3'
nnclean(X, k, ...,
convergence = 0.001, plothist = FALSE,
verbose = TRUE, maxit = 50)
Arguments
X |
A two-dimensional spatial point pattern (object of class
|
k |
Degree of neighbour: |
... |
Arguments passed to |
edge.correct |
Logical flag specifying whether periodic edge correction should be performed (only implemented in 2 dimensions). |
wrap |
Numeric value specifying the relative size of the margin
in which data will be replicated for the
periodic edge correction (if |
convergence |
Relative tolerance threshold for testing convergence of EM algorithm. |
maxit |
Maximum number of iterations for EM algorithm. |
plothist |
Logical flag specifying whether to plot a diagnostic histogram of the nearest neighbour distances and the fitted distribution. |
verbose |
Logical flag specifying whether to print progress reports. |
Details
Byers and Raftery (1998) developed a technique for recognising features in a spatial point pattern in the presence of random clutter.
For each point in the pattern, the distance to the k th nearest neighbour is computed. Then the EM algorithm is used to fit a mixture distribution to the k th nearest neighbour distances. The mixture components represent the feature and the clutter. The mixture model can be used to classify each point as belonging to one or other component.
The function nnclean is generic, with methods for two-dimensional point patterns (class "ppp") and three-dimensional point patterns (class "pp3") currently implemented.
The result is a point pattern (2D or 3D) with two additional columns of marks:
- class
A factor, with levels "noise" and "feature", indicating the maximum likelihood classification of each point.
- prob
Numeric vector giving the estimated probabilities that each point belongs to a feature.
The object also has extra information stored in attributes: "theta" contains the fitted parameters of the mixture model, "info" contains information about the fitting procedure, and "hist" contains the histogram structure returned from hist.default if plothist=TRUE.
Value
An object of the same kind as X, obtained by attaching marks to the points of X. The object also has attributes, as described under Details.
Author(s)
Original by Simon Byers and Adrian Raftery. Adapted for spatstat by Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Byers, S. and Raftery, A.E. (1998) Nearest-neighbour clutter removal for estimating features in spatial point processes. Journal of the American Statistical Association 93, 577–584.
Examples
# shapley galaxy cluster
X <- nnclean(shapley, k=17, plothist=TRUE)
plot(X, which.marks=1, chars=c(".", "+"), cols=1:2,
main="Shapley data, cluster and noise")
plot(X, which.marks=2, cols=function(x)hsv(0.2+0.8*(1-x),1,1),
main="Shapley data, probability of cluster")
Y <- split(X, un=TRUE)
plot(Y, chars="+", cex=0.5)
marks(X) <- marks(X)$prob
plot(cut(X, breaks=3), chars=c(".", "+", "+"), cols=1:3)
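The fitted mixture parameters are stored in the "theta" attribute, as described in the Details. A brief sketch:
Z <- nnclean(shapley, k=17)
attr(Z, "theta")   # fitted parameters of the mixture model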
Nearest-Neighbour Correlation Indices of Marked Point Pattern
Description
Computes nearest-neighbour correlation indices of a marked point
pattern, including the nearest-neighbour mark product index
(default case of nncorr
),
the nearest-neighbour mark index (nnmean
),
and the nearest-neighbour variogram index (nnvario
).
Usage
nncorr(X,
f = function(m1, m2) { m1 * m2 },
k = 1,
...,
use = "all.obs", method = c("pearson", "kendall", "spearman"),
denominator=NULL, na.action="warn")
nnmean(X, k=1, na.action="warn")
nnvario(X, k=1, na.action="warn")
Arguments
X |
The observed point pattern.
An object of class |
f |
Function |
k |
Integer. The |
... |
Extra arguments passed to |
use , method |
Arguments passed to the standard correlation function |
denominator |
Internal use only. |
na.action |
Character string (passed to |
Details
The nearest neighbour correlation index \bar n_f of a marked point process X is a number measuring the dependence between the mark of a typical point and the mark of its nearest neighbour.
The command nncorr computes the nearest neighbour correlation index based on any test function f provided by the user. The default behaviour of nncorr is to compute the nearest neighbour mark product index. The commands nnmean and nnvario are convenient abbreviations for other special choices of f.
In the default case, nncorr(X) computes three different versions of the nearest-neighbour correlation index: the unnormalised, normalised, and classical correlations.
- unnormalised:
The unnormalised nearest neighbour correlation (Stoyan and Stoyan, 1994, section 14.7) is defined as
\bar n_f = E[f(M, M^\ast)]
where E[] denotes mean value, M is the mark attached to a typical point of the point process, and M^\ast is the mark attached to its nearest neighbour (i.e. the nearest other point of the point process).
Here f is any function f(m_1,m_2) with two arguments which are possible marks of the pattern, and which returns a nonnegative real value. Common choices of f are: for continuous real-valued marks, f(m_1,m_2) = m_1 m_2; for discrete marks (multitype point patterns), f(m_1,m_2) = 1(m_1 = m_2); and for marks taking values in [0,2\pi), f(m_1,m_2) = \sin(m_1 - m_2). For example, in the second case, the unnormalised nearest neighbour correlation \bar n_f equals the proportion of points in the pattern which have the same mark as their nearest neighbour.
Note that \bar n_f is not a “correlation” in the usual statistical sense. It can take values greater than 1.
- normalised:
We can define a normalised nearest neighbour correlation by
\bar m_f = \frac{E[f(M,M^\ast)]}{E[f(M,M')]}
where again M is the mark attached to a typical point, M^\ast is the mark attached to its nearest neighbour, and M' is an independent copy of M with the same distribution. This normalisation is also not a “correlation” in the usual statistical sense, but is normalised so that the value 1 suggests “lack of correlation”: if the marks attached to the points of X are independent and identically distributed, then \bar m_f = 1. The interpretation of values larger or smaller than 1 depends on the choice of function f.
- classical:
Finally if the marks of X are real numbers, we can also compute the classical correlation, that is, the correlation coefficient of the two random variables M and M^\ast. The classical correlation has a value between -1 and 1. Values close to -1 or 1 indicate strong dependence between the marks.
In the default case where f is not given, nncorr(X) computes:
- If the marks of X are real numbers: the unnormalised and normalised versions of the nearest-neighbour product index E[M \, M^\ast], and the classical correlation between M and M^\ast.
- If the marks of X are factor valued: the unnormalised and normalised versions of the nearest-neighbour equality index P[M = M^\ast].
The wrapper functions nnmean and nnvario compute the correlation indices for two special choices of the function f(m_1,m_2). They are defined only when the marks are numeric.
- nnmean computes the correlation indices for f(m_1,m_2) = m_1. The unnormalised index is simply the mean value of the mark of the neighbour of a typical point, E[M^\ast], while the normalised index is E[M^\ast]/E[M], the ratio of the mean mark of the neighbour of a typical point to the mean mark of a typical point.
- nnvario computes the correlation indices for f(m_1,m_2) = (1/2)(m_1 - m_2)^2.
The argument X must be a point pattern (object of class "ppp") and must be a marked point pattern. (The marks may be a data frame, containing several columns of mark variables; each column is treated separately.)
If the argument f is given, it must be a function, accepting two arguments m1 and m2 which are vectors of equal length containing mark values (of the same type as the marks of X). It must return a vector of numeric values of the same length as m1 and m2. The values must be non-negative.
The arguments use and method control the calculation of the classical correlation using cor, as explained in the help file for cor. Other arguments may be passed to f through the ... argument.
This algorithm assumes that X can be treated as a realisation of a stationary (spatially homogeneous) random spatial point process in the plane, observed through a bounded window. The window (which is specified in X as Window(X)) may have arbitrary shape. Biases due to edge effects are treated using the ‘border method’ edge correction.
Value
Labelled vector of length 2 or 3 containing the unnormalised and normalised nearest neighbour correlations, and the classical correlation if appropriate. Alternatively a matrix with 2 or 3 rows, containing this information for each mark variable.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
References
Stoyan, D. and Stoyan, H. (1994) Fractals, random shapes and point fields: methods of geometrical statistics. John Wiley and Sons.
Examples
nnmean(finpines)
nnvario(finpines)
nncorr(finpines)
# heights of neighbouring trees are slightly negatively correlated
nncorr(amacrine)
# neighbouring cells are usually of different type
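A user-supplied test function f, and higher-order neighbours through k, can also be used. A minimal sketch in which the absolute mark difference is an arbitrary illustrative choice of f:
X <- finpines
marks(X) <- marks(X)$height
nncorr(X, f=function(m1, m2) { abs(m1 - m2) }, k=2)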
Estimate Intensity of Point Pattern Using Nearest Neighbour Distances
Description
Estimates the intensity of a point pattern
using the distance from each spatial location
to the k
th nearest data point.
Usage
nndensity(x, ...)
## S3 method for class 'ppp'
nndensity(x, k, ..., verbose = TRUE)
Arguments
x |
A point pattern (object of class |
k |
Integer. The distance to the |
... |
Arguments passed to |
verbose |
Logical. If |
Details
This function computes a quick estimate of the intensity of the point process that generated the point pattern x.
For each spatial location s, let d(s) be the distance from s to the k-th nearest point in the dataset x. If the data came from a homogeneous Poisson process with intensity \lambda, then \pi d(s)^2 would follow a Gamma distribution with mean k/\lambda; in the special case k=1 this is a negative exponential distribution with mean 1/\lambda. The maximum likelihood estimate of \lambda based on d(s) is k/(\pi d(s)^2), which reduces to 1/(\pi d(s)^2) when k=1. This is the estimate computed by nndensity, apart from an edge effect correction. See Cressie (1991, equation (8.5.14), p. 654) and Silverman (1986, p. 96).
This estimator of intensity is relatively fast to compute, and is spatially adaptive (so that it can handle wide variation in the intensity function). However, it implicitly assumes the points are independent, so it does not perform well if the pattern is strongly clustered or strongly inhibited.
In normal use, the value of k should be at least 3. (Theoretically the estimator has infinite expected value if k=1, and infinite variance if k=2. The computed intensity estimate will have infinite peaks around each data point if k=1.) The default value of k is the square root of the number of points in x, which seems to work well in many cases.
The window of x is digitised using as.mask and the values d(s) are computed using nnmap. To control the pixel resolution, see as.mask.
Value
A pixel image (object of class "im") giving the estimated intensity of the point process at each spatial location. Pixel values are intensities (number of points per unit area).
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net.
References
Cressie, N.A.C. (1991) Statistics for spatial data. John Wiley and Sons, New York.
Silverman, B.W. (1986) Density Estimation. Chapman and Hall, New York.
See Also
density.ppp, intensity.ppp for alternative estimates of point process intensity.
Examples
plot(nndensity(swedishpines))
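The value of k controls the degree of smoothing, as discussed in the Details; k=5 and k=15 below are arbitrary illustrative choices:
plot(nndensity(swedishpines, k=5))
plot(nndensity(swedishpines, k=15))   # smoother than k=5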
Nearest Neighbour Orientation Distribution
Description
Computes the distribution of the orientation of the vectors from each point to its nearest neighbour.
Usage
nnorient(X, ..., cumulative = FALSE, correction, k = 1,
unit = c("degree", "radian"),
domain = NULL, ratio = FALSE)
Arguments
X |
Point pattern (object of class |
... |
Arguments passed to |
cumulative |
Logical value specifying whether to estimate the probability density
( |
correction |
Character vector specifying edge correction or corrections.
Options are |
k |
Integer. The |
ratio |
Logical.
If |
unit |
Unit in which the angles should be expressed.
Either |
domain |
Optional window. The first point |
Details
This algorithm considers each point in the pattern X and finds its nearest neighbour (or k th nearest neighbour). The direction of the arrow joining the data point to its neighbour is measured, as an angle in degrees or radians, anticlockwise from the x axis.
If cumulative=FALSE (the default), a kernel estimate of the probability density of the angles is calculated using circdensity. This is the function \vartheta(\phi) defined in Illian et al (2008), equation (4.5.3), page 253.
If cumulative=TRUE, then the cumulative distribution function of these angles is calculated.
In either case the result can be plotted as a rose diagram by rose, or as a function plot by plot.fv.
The algorithm gives each observed direction a weight, determined by an edge correction, to adjust for the fact that some interpoint distances are more likely to be observed than others. The choice of edge correction or corrections is determined by the argument correction.
It is also possible to calculate an estimate of the probability density from the cumulative distribution function, by numerical differentiation. Use deriv.fv with the argument Dperiodic=TRUE.
Value
A function value table (object of class "fv") containing the estimates of the probability density or the cumulative distribution function of angles, in degrees (if unit="degree") or radians (if unit="radian").
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
Rolf Turner rolfturner@posteo.net
and Ege Rubak rubak@math.aau.dk
References
Illian, J., Penttinen, A., Stoyan, H. and Stoyan, D. (2008) Statistical Analysis and Modelling of Spatial Point Patterns. Wiley.
Examples
rose(nnorient(redwood, adjust=0.6), col="grey")
plot(CDF <- nnorient(redwood, cumulative=TRUE))
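As noted in the Details, a density estimate can also be recovered from the cumulative distribution by periodic numerical differentiation; a minimal sketch, reusing CDF from the line above:
# differentiate the CDF of angles, treating the function as periodic
f <- deriv(CDF, spar=0.5, Dperiodic=TRUE)
plot(f)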
Mean of a Function of Interpoint Distance
Description
Computes the mean value, or the double integral, of a specified function of the distance between two independent random points in a given window or windows.
Usage
pairMean(fun, W, V = NULL, ..., normalise = TRUE)
Arguments
fun |
A function in the R language which takes one argument. |
W |
A window (object of class "owin") containing the first random point. |
V |
Optional. Another window containing the second random point.
Defaults to V=W. |
... |
Further optional arguments passed to distcdf. |
normalise |
Logical value specifying whether to calculate the mean value (normalise=TRUE, the default) or the double integral (normalise=FALSE). |
Details
This command computes the mean value of fun(T) where T is the Euclidean distance T = \|X_1 - X_2\| between two independent random points X_1 and X_2.
In the simplest case, the command pairMean(fun, W), the random points are assumed to be uniformly distributed in the same window W. Alternatively the two random points may be uniformly distributed in two different windows W and V. Other options are described in distcdf.
The algorithm uses distcdf to compute the cumulative distribution function of T, and stieltjes to compute the mean value of fun(T).
If normalise=TRUE (the default) the result is the mean value of fun(T). If normalise=FALSE the result is the double integral.
Value
A single numeric value.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
Examples
pairMean(function(d) { d^2 }, disc())
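Two further sketches (not in the original example): using two different windows, and returning the unnormalised double integral:
# mean squared distance between points in two different windows
pairMean(function(d) { d^2 }, disc(), square(1))
# double integral rather than mean value
pairMean(function(d) { d^2 }, disc(), normalise=FALSE)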
Point Pair Orientation Distribution
Description
Computes the distribution of the orientation of vectors joining pairs of points at a particular range of distances.
Usage
pairorient(X, r1, r2, ..., cumulative=FALSE,
correction, ratio = FALSE,
unit=c("degree", "radian"), domain=NULL)
Arguments
X |
Point pattern (object of class "ppp"). |
r1 , r2 |
Minimum and maximum values of distance to be considered. |
... |
Arguments passed to |
cumulative |
Logical value specifying whether to estimate the probability density
( |
correction |
Character vector specifying edge correction or corrections.
Options are |
ratio |
Logical.
If |
unit |
Unit in which the angles should be expressed. Either "degree" (the default) or "radian". |
domain |
Optional window. The first point of each pair of points will be constrained to lie in domain. |
Details
This algorithm considers all pairs of points in the pattern X that lie more than r1 and less than r2 units apart. The direction of the arrow joining the points is measured, as an angle in degrees or radians, anticlockwise from the x axis.
If cumulative=FALSE (the default), a kernel estimate of the probability density of the orientations is calculated using circdensity. If cumulative=TRUE, then the cumulative distribution function of these directions is calculated. This is the function O_{r1,r2}(\phi) defined in Stoyan and Stoyan (1994), equation (14.53), page 271.
In either case the result can be plotted as a rose diagram by rose, or as a function plot by plot.fv.
The algorithm gives each observed direction a weight, determined by an edge correction, to adjust for the fact that some interpoint distances are more likely to be observed than others. The choice of edge correction or corrections is determined by the argument correction. See the help for Kest for details of edge corrections, and explanation of the options available. The choice correction="none" is not recommended; it is included for demonstration purposes only. The default is to compute all corrections except "none".
It is also possible to calculate an estimate of the probability density from the cumulative distribution function, by numerical differentiation. Use deriv.fv with the argument Dperiodic=TRUE.
Value
A function value table (object of class "fv") containing the estimates of the probability density or the cumulative distribution function of angles, in degrees (if unit="degree") or radians (if unit="radian").
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Stoyan, D. and Stoyan, H. (1994) Fractals, Random Shapes and Point Fields: Methods of Geometrical Statistics. John Wiley and Sons.
Examples
rose(pairorient(redwood, 0.05, 0.15, sigma=8), col="grey")
plot(CDF <- pairorient(redwood, 0.05, 0.15, cumulative=TRUE))
plot(f <- deriv(CDF, spar=0.6, Dperiodic=TRUE))
Scatterplot Matrix for Pixel Images
Description
Produces a scatterplot matrix of the pixel values in two or more pixel images.
Usage
## S3 method for class 'im'
pairs(..., plot=TRUE, drop=TRUE)
Arguments
... |
Any number of arguments, each of which is either a pixel image (object of class "im") or a named argument to be passed to pairs.default. |
plot |
Logical. If TRUE, the scatterplot matrix is plotted. |
drop |
Logical value specifying whether pixel values that are NA should be removed from the data frame. |
Details
This is a method for the generic function pairs for the class of pixel images. It produces a square array of plot panels, in which each panel shows a scatterplot of the pixel values of one image against the corresponding pixel values of another image.
At least two of the arguments ... should be pixel images (objects of class "im"). Their spatial domains must overlap, but need not have the same pixel dimensions.
First the pixel image domains are intersected, and converted to a common pixel resolution. Then the corresponding pixel values of each image are extracted. Then pairs.default is called to plot the scatterplot matrix.
Any arguments in ... which are not pixel images will be passed to pairs.default to control the plot.
The return value of pairs.im is a data frame, returned invisibly. The data frame has one column for each image. Each row contains the pixel values of the different images for one pixel in the raster. If drop=TRUE (the default), any row which contains NA is deleted. The plot is not affected by the value of drop.
Value
Invisible. A data.frame containing the corresponding pixel values for each image. The return value also belongs to the class plotpairsim, which has a plot method, so that it can be re-plotted.
Image or Contour Plots
Since the scatterplots may show very dense concentrations of points, it may be useful to set panel=panel.image or panel=panel.contour to draw a colour image or contour plot of the kernel-smoothed density of the scatterplot in each panel. The argument panel is passed to pairs.default. See the help for panel.image and panel.contour.
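For instance, a minimal sketch using simulated patterns (so results vary from run to run):
# kernel-smoothed panels instead of dense raw scatterplots
X <- density(rpoispp(100))
Y <- density(rpoispp(100))
pairs(X, Y, panel=panel.image)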
Low Level Control of Graphics
To control the appearance of the individual scatterplot panels, see pairs.default, points or par. To control the plotting symbol for the points in the scatterplot, use the arguments pch, col and bg as described under points (because the default panel plotter is the function points). To suppress the tick marks on the plot axes, type par(xaxt="n", yaxt="n") before calling pairs.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
pairs, pairs.default, panel.contour, panel.image, plot.im, cov.im, im, par
Examples
X <- density(rpoispp(30))
Y <- density(rpoispp(40))
Z <- density(rpoispp(30))
p <- pairs(X,Y,Z)
p
plot(p)
Panel Plots using Colour Image or Contour Lines
Description
These functions can be passed to pairs
or
coplot
to determine what kind of plotting is done in each panel
of a multi-panel graphical display.
Usage
panel.contour(x, y, ..., sigma = NULL)
panel.image(x, y, ..., sigma = NULL)
panel.histogram(x, ...)
Arguments
x , y |
Coordinates of points in a scatterplot. |
... |
Extra graphics arguments, passed to |
sigma |
Bandwidth of kernel smoother, on a scale where x and y have been rescaled to the unit square. |
Details
These functions can serve as one of the arguments panel, lower.panel, upper.panel or diag.panel passed to graphics commands like pairs or coplot, to determine what kind of plotting is done in each panel of a multi-panel graphical display. In particular they work with pairs.im.
The functions panel.contour and panel.image are suitable for the off-diagonal plots which involve two datasets x and y. They first rescale x and y to the unit square, then apply kernel smoothing with bandwidth sigma using density.ppp. Then panel.contour draws a contour plot while panel.image draws a colour image.
The function panel.histogram is suitable for the diagonal plots which involve a single dataset x. It displays a histogram of the data.
Value
Null.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
pairs.im, pairs.default, panel.smooth
Examples
pairs(bei.extra,
panel = panel.contour,
diag.panel = panel.histogram)
with(bei.extra,
pairs(grad, elev,
panel = panel.image,
diag.panel = panel.histogram))
pairs(marks(finpines), panel=panel.contour, diag.panel=panel.histogram)
Pair Correlation Function
Description
Estimate the pair correlation function.
Usage
pcf(X, ...)
Arguments
X |
Either the observed data point pattern, or an estimate of its K function. |
... |
Other arguments passed to the appropriate method. |
Details
The pair correlation function of a stationary point process is
g(r) = \frac{K'(r)}{2\pi r}
where K'(r) is the derivative of K(r), the reduced second moment function (aka "Ripley's K function") of the point process. See Kest for information about K(r). For a stationary Poisson process, the pair correlation function is identically equal to 1. Values g(r) < 1 suggest inhibition between points; values greater than 1 suggest clustering.
We also apply the same definition to other variants of the classical K function, such as the multitype K functions (see Kcross, Kdot) and the inhomogeneous K function (see Kinhom). For all these variants, the benchmark value of K(r) = \pi r^2 corresponds to g(r) = 1.
This routine computes an estimate of g(r) either directly from a point pattern, or indirectly from an estimate of K(r) or one of its variants.
This function is generic, with methods for the classes "ppp", "fv" and "fasp".
If X is a point pattern (object of class "ppp") then the pair correlation function is estimated using a traditional kernel smoothing method (Stoyan and Stoyan, 1994). See pcf.ppp for details.
If X is a function value table (object of class "fv"), then it is assumed to contain estimates of the K function or one of its variants (typically obtained from Kest or Kinhom). This routine computes an estimate of g(r) using smoothing splines to approximate the derivative. See pcf.fv for details.
If X is a function value array (object of class "fasp"), then it is assumed to contain estimates of several K functions (typically obtained from Kmulti or alltypes). This routine computes an estimate of g(r) for each cell in the array, using smoothing splines to approximate the derivatives. See pcf.fasp for details.
Value
Either a function value table (object of class "fv", see fv.object) representing a pair correlation function, or a function array (object of class "fasp", see fasp.object) representing an array of pair correlation functions.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk
References
Stoyan, D. and Stoyan, H. (1994) Fractals, random shapes and point fields: methods of geometrical statistics. John Wiley and Sons.
See Also
pcf.ppp, pcf.fv, pcf.fasp, Kest, Kinhom, Kcross, Kdot, Kmulti, alltypes
Examples
# ppp object
X <- simdat
p <- pcf(X)
plot(p)
# fv object
K <- Kest(X)
p2 <- pcf(K, spar=0.8, method="b")
plot(p2)
# multitype pattern; fasp object
amaK <- alltypes(amacrine, "K")
amap <- pcf(amaK, spar=1, method="b")
plot(amap)
Pair Correlation Function obtained from array of K functions
Description
Estimates the (bivariate) pair correlation functions of a point pattern, given an array of (bivariate) K functions.
Usage
## S3 method for class 'fasp'
pcf(X, ..., method="c")
Arguments
X |
An array of multitype K functions (object of class "fasp"). |
... |
Arguments controlling the smoothing spline function smooth.spline. |
method |
Letter "a", "b", "c" or "d" specifying the method of estimation. See Details. |
Details
The pair correlation function of a stationary point process is
g(r) = \frac{K'(r)}{2\pi r}
where K'(r) is the derivative of K(r), the reduced second moment function (aka "Ripley's K function") of the point process. See Kest for information about K(r). For a stationary Poisson process, the pair correlation function is identically equal to 1. Values g(r) < 1 suggest inhibition between points; values greater than 1 suggest clustering.
We also apply the same definition to other variants of the classical K function, such as the multitype K functions (see Kcross, Kdot) and the inhomogeneous K function (see Kinhom). For all these variants, the benchmark value of K(r) = \pi r^2 corresponds to g(r) = 1.
This routine computes an estimate of g(r) from an array of estimates of K(r) or its variants, using smoothing splines to approximate the derivatives. It is a method for the generic function pcf.
The argument X should be a function array (object of class "fasp", see fasp.object) containing several estimates of K functions. This should have been obtained from alltypes with the argument fun="K".
The smoothing spline operations are performed by smooth.spline and predict.smooth.spline from the stats package (formerly the modreg library).
Four numerical methods are available:
- "a": apply smoothing to K(r), estimate its derivative, and plug in to the formula above;
- "b": apply smoothing to Y(r) = \frac{K(r)}{2 \pi r}, constraining Y(0) = 0, estimate the derivative of Y, and solve;
- "c": apply smoothing to Z(r) = \frac{K(r)}{\pi r^2}, constraining Z(0) = 1, estimate its derivative, and solve;
- "d": apply smoothing to V(r) = \sqrt{K(r)}, estimate its derivative, and solve.
Method "c"
seems to be the best at
suppressing variability for small values of r
.
However it effectively constrains g(0) = 1
.
If the point pattern seems to have inhibition at small distances,
you may wish to experiment with method "b"
which effectively
constrains g(0)=0
. Method "a"
seems
comparatively unreliable.
Useful arguments to control the splines
include the smoothing tradeoff parameter spar
and the degrees of freedom df
. See smooth.spline
for details.
Value
A function array (object of class "fasp", see fasp.object) representing an array of pair correlation functions. This can be thought of as a matrix Y each of whose entries Y[i,j] is a function value table (class "fv") representing the pair correlation function between points of type i and points of type j.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
References
Stoyan, D, Kendall, W.S. and Mecke, J. (1995) Stochastic geometry and its applications. 2nd edition. Springer Verlag.
Stoyan, D. and Stoyan, H. (1994) Fractals, random shapes and point fields: methods of geometrical statistics. John Wiley and Sons.
See Also
Kest, Kinhom, Kcross, Kdot, Kmulti, alltypes, smooth.spline, predict.smooth.spline
Examples
# multitype point pattern
KK <- alltypes(amacrine, "K")
p <- pcf.fasp(KK, spar=0.5, method="b")
plot(p)
# strong inhibition between points of the same type
Pair Correlation Function obtained from K Function
Description
Estimates the pair correlation function of a point pattern, given an estimate of the K function.
Usage
## S3 method for class 'fv'
pcf(X, ..., method="c")
Arguments
X |
An estimate of the K function, given as a function value table (object of class "fv"). |
... |
Arguments controlling the smoothing spline function smooth.spline. |
method |
Letter "a", "b", "c" or "d" specifying the method of estimation. See Details. |
Details
The pair correlation function of a stationary point process is
g(r) = \frac{K'(r)}{2\pi r}
where K'(r) is the derivative of K(r), the reduced second moment function (aka "Ripley's K function") of the point process. See Kest for information about K(r). For a stationary Poisson process, the pair correlation function is identically equal to 1. Values g(r) < 1 suggest inhibition between points; values greater than 1 suggest clustering.
We also apply the same definition to other variants of the classical K function, such as the multitype K functions (see Kcross, Kdot) and the inhomogeneous K function (see Kinhom). For all these variants, the benchmark value of K(r) = \pi r^2 corresponds to g(r) = 1.
This routine computes an estimate of g(r) from an estimate of K(r) or its variants, using smoothing splines to approximate the derivative. It is a method for the generic function pcf for the class "fv".
The argument X should be an estimated K function, given as a function value table (object of class "fv", see fv.object). This object should be the value returned by Kest, Kcross, Kmulti or Kinhom.
The smoothing spline operations are performed by smooth.spline and predict.smooth.spline from the stats package (formerly the modreg library).
Four numerical methods are available:
- "a": apply smoothing to K(r), estimate its derivative, and plug in to the formula above;
- "b": apply smoothing to Y(r) = \frac{K(r)}{2 \pi r}, constraining Y(0) = 0, estimate the derivative of Y, and solve;
- "c": apply smoothing to Z(r) = \frac{K(r)}{\pi r^2}, constraining Z(0) = 1, estimate its derivative, and solve;
- "d": apply smoothing to V(r) = \sqrt{K(r)}, estimate its derivative, and solve.
Method "c"
seems to be the best at
suppressing variability for small values of r
.
However it effectively constrains g(0) = 1
.
If the point pattern seems to have inhibition at small distances,
you may wish to experiment with method "b"
which effectively
constrains g(0)=0
. Method "a"
seems
comparatively unreliable.
Useful arguments to control the splines
include the smoothing tradeoff parameter spar
and the degrees of freedom df
. See smooth.spline
for details.
Value
A function value table (object of class "fv", see fv.object) representing a pair correlation function. Essentially a data frame containing (at least) the variables
r |
the vector of values of the argument r at which g(r) has been estimated |
pcf |
vector of values of g(r) |
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Stoyan, D, Kendall, W.S. and Mecke, J. (1995) Stochastic geometry and its applications. 2nd edition. Springer Verlag.
Stoyan, D. and Stoyan, H. (1994) Fractals, random shapes and point fields: methods of geometrical statistics. John Wiley and Sons.
See Also
pcf, pcf.ppp, Kest, Kinhom, Kcross, Kdot, Kmulti, alltypes, smooth.spline, predict.smooth.spline
Examples
# univariate point pattern
X <- simdat
K <- Kest(X)
p <- pcf.fv(K, spar=0.5, method="b")
plot(p, main="pair correlation function for simdat")
# indicates inhibition at distances r < 0.3
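A further sketch comparing with method "c", which constrains g(0) = 1 rather than g(0) = 0:
pc <- pcf.fv(K, spar=0.5, method="c")
plot(pc)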
Pair Correlation Function of Point Pattern
Description
Estimates the pair correlation function of a point pattern using kernel methods.
Usage
## S3 method for class 'ppp'
pcf(X, ..., r = NULL,
adaptive=FALSE,
kernel="epanechnikov", bw=NULL, h=NULL,
bw.args=list(), stoyan=0.15, adjust=1,
correction=c("translate", "Ripley"),
divisor = c("r", "d", "a", "t"),
zerocor=c("weighted", "reflection", "convolution",
"bdrykern", "JonesFoster", "none"),
gref = NULL,
tau = 0,
fast = TRUE,
var.approx = FALSE,
domain=NULL,
ratio=FALSE, close=NULL)
Arguments
X |
A point pattern (object of class "ppp"). |
... |
Arguments passed to |
r |
Optional. Vector of values for the argument r at which g(r) should be evaluated. There is a sensible default. |
adaptive |
Logical value specifying whether to use adaptive kernel smoothing
( |
kernel |
Choice of smoothing kernel, passed to |
bw |
Bandwidth for smoothing kernel. Either a single numeric value giving the standard deviation of the kernel, or a character string specifying a bandwidth selection rule, or a function that computes the selected bandwidth. See Details. |
h |
Kernel halfwidth h (incompatible with argument bw). |
bw.args |
Optional. List of additional arguments to be passed to bw when bw is a function. |
stoyan |
Coefficient for Stoyan's bandwidth selection rule; see Details. |
adjust |
Numerical adjustment factor for the bandwidth.
The bandwidth actually used is |
correction |
Edge correction. A character vector specifying the choice (or choices) of edge correction. See Details. |
divisor |
Choice of divisor in the estimation formula:
either |
zerocor |
String (partially matched) specifying a correction for the boundary effect bias at r = 0. See Details. |
gref |
Optional. A pair correlation function that will be used as the reference for the transformation to uniformity, when divisor="t". See Details. |
tau |
Optional shrinkage coefficient. A single numeric value. |
fast |
Logical value specifying whether to compute the kernel smoothing using a Fast Fourier Transform algorithm (fast=TRUE) or not (fast=FALSE). |
var.approx |
Logical value indicating whether to compute an analytic approximation to the variance of the estimated pair correlation. |
domain |
Optional. Calculations will be restricted to this subset of the window. See Details. |
ratio |
Logical. If TRUE, the numerator and denominator of each edge-corrected estimate will also be saved, for use in analysing replicated point patterns. |
close |
Advanced use only. Precomputed data. See section on Advanced Use. |
Details
The pair correlation function g(r) is a summary of the dependence between points in a spatial point process. The best intuitive interpretation is the following: the probability p(r) of finding two points at locations x and y separated by a distance r is equal to
p(r) = \lambda^2 g(r) \,{\rm d}x \, {\rm d}y
where \lambda is the intensity of the point process. For a completely random (uniform Poisson) process, p(r) = \lambda^2 \,{\rm d}x \, {\rm d}y so g(r) = 1.
Formally, the pair correlation function of a stationary point process is defined by
g(r) = \frac{K'(r)}{2\pi r}
where K'(r) is the derivative of K(r), the reduced second moment function (aka "Ripley's K function") of the point process. See Kest for information about K(r). For a stationary Poisson process, the pair correlation function is identically equal to 1. Values g(r) < 1 suggest inhibition between points; values greater than 1 suggest clustering.
This routine computes an estimate of g(r) by kernel smoothing.
- If divisor="r" (the default), then the standard kernel estimator (Stoyan and Stoyan, 1994, pages 284–285) is used. By default, the recommendations of Stoyan and Stoyan (1994) are followed exactly.
- If divisor="d" then a modified estimator is used (Guan, 2007): the contribution from an interpoint distance d_{ij} to the estimate of g(r) is divided by d_{ij} instead of dividing by r. This usually improves the bias of the estimator when r is close to zero.
- If divisor="a" then the improved method of Baddeley, Davies and Hazelton (2025) is used. The distances d_{ij} are first converted to disc areas a_{ij} = \pi d_{ij}^2, smoothing is performed on the area scale, and the result is back-transformed to the original scale.
- If divisor="t" then the distances d_{ij} are transformed to uniformity using the reference pair correlation function gref, as described in Baddeley, Davies and Hazelton (2025).
- If divisor is a function in the R language, then it will be applied to the point pattern X and should return one of the strings "r", "d", "a" or "t" listed above. This option makes it possible to specify a rule which decides which estimator to use, based on the data.
There is also a choice of spatial edge corrections (which are needed to avoid bias due to edge effects associated with the boundary of the spatial window):
- If correction="translate" or correction="translation" then the translation correction is used. For divisor="r" the translation-corrected estimate is given in equation (15.15), page 284 of Stoyan and Stoyan (1994).
- If correction="Ripley" or correction="isotropic" then Ripley's isotropic edge correction is used. For divisor="r" the isotropic-corrected estimate is given in equation (15.18), page 285 of Stoyan and Stoyan (1994).
- If correction="none" then no edge correction is used, that is, an uncorrected estimate is computed.
Multiple corrections can be selected. The default is correction=c("translate", "Ripley"). Alternatively correction="all" selects all options; correction="best" selects the option which has the best statistical performance; correction="good" selects the option which is the best compromise between statistical performance and speed of computation.
Argument zerocor determines the correction applied to the one-dimensional kernel-smoothed estimate on the real number line, to correct bias close to the boundary r = 0. The argument zerocor is passed to densityBC. Options include:
- zerocor="none": no correction.
- zerocor="convolution": the convolution, uniform or renormalization kernel.
- zerocor="weighted": the cut-and-normalization method.
- zerocor="reflection": the reflection method.
- zerocor="bdrykern": the linear boundary kernel.
- zerocor="JonesFoster": the Jones-Foster modification of the linear boundary kernel.
The choice of smoothing kernel is controlled by the argument kernel which is passed to density.default. The default is the Epanechnikov kernel, recommended by Stoyan and Stoyan (1994, page 285).
The bandwidth of the smoothing kernel can be controlled by the argument bw. Bandwidth is defined as the standard deviation of the kernel; see the documentation for density.default. For the Epanechnikov kernel with half-width h, the argument bw is equivalent to h/\sqrt{5}.
Stoyan and Stoyan (1994, page 285) recommend using the Epanechnikov kernel with support [-h,h] chosen by the rule of thumb h = c/\sqrt{\lambda}, where \lambda is the (estimated) intensity of the point process, and c is a constant in the range from 0.1 to 0.2. See equation (15.16). If bw is missing or NULL, then this rule of thumb will be applied. The argument stoyan determines the value of c. The smoothing bandwidth that was used in the calculation is returned as an attribute of the final result.
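A worked sketch of this rule of thumb, assuming the default coefficient c = 0.15 (the result should agree with bw.stoyan, which implements the same rule):
lam <- intensity(swedishpines)   # estimated intensity
h <- 0.15/sqrt(lam)              # Epanechnikov halfwidth by the rule of thumb
h/sqrt(5)                        # equivalent bandwidth (standard deviation)
bw.stoyan(swedishpines)          # built-in implementation, for comparison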
The argument bw can be
- missing or null. In this case, the default value for bw is "stoyan" when adaptive=FALSE and "bw.abram" when adaptive=TRUE.
- a single numeric value giving the bandwidth.
- a character string specifying a bandwidth selection rule. String names of rules applicable when adaptive=FALSE include "stoyan", "fiksel" and any rules recognised by density.default. String names applicable when adaptive=TRUE include "bw.abram" and "bw.pow".
- a function that computes the selected bandwidth.
  - If adaptive=FALSE, the function bw will be applied to the point pattern X to determine the bandwidth. Examples include bw.pcf and bw.stoyan. The function bw should accept the point pattern X as its first argument. Additional arguments to bw may be specified in the list bw.args. If bw recognises any of the arguments kernel, correction, divisor, zerocor and adaptive, then these arguments will be passed to bw as well. The function bw should return a single numeric value.
  - If adaptive=TRUE, the function bw will be applied to the vector of pairwise distances between data points (or the transformed distances if divisor="a" or divisor="t"). Examples include bw.abram.default and bw.pow. The function bw should accept the vector of pairwise distances as its first argument. Additional arguments to bw may be specified in the list bw.args.
Note that if bw.args is a function, it will be applied to the point pattern X to determine the list of arguments (whether adaptive is TRUE or FALSE).
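To illustrate, a few equivalent ways of supplying the bandwidth (a sketch; the numeric value 0.03 is arbitrary):
pcf(redwood, bw=0.03)        # fixed numeric bandwidth
pcf(redwood, bw="stoyan")    # bandwidth selection rule, by name
pcf(redwood, bw=bw.stoyan)   # function applied to the point pattern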
The argument r is the vector of values for the distance r at which g(r) should be evaluated. There is a sensible default. If it is specified, r must be a vector of increasing numbers starting from r[1] = 0, and max(r) must not exceed half the diameter of the window.
If the argument domain is given, estimation will be restricted to this region. That is, the estimate of g(r) will be based on pairs of points in which the first point lies inside domain and the second point is unrestricted. The argument domain should be a window (object of class "owin") or something acceptable to as.owin. It must be a subset of the window of the point pattern X.
To compute a confidence band for the true value of the pair correlation function, use lohboot.
If var.approx = TRUE, the variance of the estimate of the pair correlation will also be calculated using an analytic approximation (Illian et al, 2008, page 234) which is valid for stationary point processes which are not too clustered. This calculation is not yet implemented when the argument domain is given.
If fast=TRUE (the default), the calculation uses the Fast Fourier Transform to the maximum extent possible for the chosen boundary correction. If fast=FALSE, the entire calculation uses analytic formulas written in C code.
Value
A function value table (object of class "fv"). Essentially a data frame containing the variables
r |
the vector of values of the argument r at which g(r) has been estimated |
theo |
vector of values equal to 1, the theoretical value of g(r) for the Poisson process |
trans |
vector of values of g(r) estimated by the translation correction |
iso |
vector of values of g(r) estimated by the Ripley isotropic correction |
v |
vector of approximate values of the variance of the estimate of g(r) |
as required.
If ratio=TRUE then the return value also has two attributes called "numerator" and "denominator" which are "fv" objects containing the numerators and denominators of each estimate of g(r). The return value also has an attribute "bw" giving the smoothing bandwidth that was used, and an attribute "info" containing details of the algorithm parameters.
Advanced Use
To perform the same computation using several different bandwidths bw, it is efficient to use the argument close. This should be the result of closepairs(X, rmax) for a suitably large value of rmax, namely rmax >= max(r) + 3 * bw.
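A sketch of this usage (the values of rmax and the bandwidths are arbitrary, chosen to satisfy the constraint above):
rmax <- 0.5                        # comfortably exceeds max(r) + 3*bw here
cl <- closepairs(redwood, rmax)    # precompute the close pairs once
p1 <- pcf(redwood, bw=0.01, close=cl)
p2 <- pcf(redwood, bw=0.03, close=cl)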
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net, Ege Rubak rubak@math.aau.dk, Martin Hazelton Martin.Hazelton@otago.ac.nz and Tilman Davies Tilman.Davies@otago.ac.nz.
References
Baddeley, A., Davies, T.M. and Hazelton, M.L. (2025) An improved estimator of the pair correlation function of a spatial point process. Biometrika, to appear.
Guan, Y. (2007) A least-squares cross-validation bandwidth selection approach in pair correlation function estimation. Statistics and Probability Letters 77 (18) 1722–1729.
Illian, J., Penttinen, A., Stoyan, H. and Stoyan, D. (2008) Statistical Analysis and Modelling of Spatial Point Patterns. Wiley.
Stoyan, D. and Stoyan, H. (1994) Fractals, random shapes and point fields: methods of geometrical statistics. John Wiley and Sons.
See Also
Kest, pcf, density.default, bw.stoyan, bw.pcf, lohboot.
Examples
pr <- pcf(redwood, divisor="r")
plot(pr, main="pair correlation function for redwoods")
# compare estimates
pd <- pcf(redwood, divisor="d")
pa <- pcf(redwood, divisor="a")
plot(pr, cbind(iso, theo) ~ r, col=c("red", "black"),
ylim.covers=0, main="Estimates of PCF",
lwd=c(2,1), lty=c(1,3), legend=FALSE)
plot(pd, iso ~ r, col="blue", lwd=2, add=TRUE)
plot(pa, iso ~ r, col="green", lwd=2, add=TRUE)
legend("center", col=c("red", "blue", "green"), lty=1, lwd=2,
legend=c("divisor=r","divisor=d", "divisor=a"))
Pair Correlation Function of a Three-Dimensional Point Pattern
Description
Estimates the pair correlation function from a three-dimensional point pattern.
Usage
pcf3est(X, ..., rmax = NULL, nrval = 128,
correction = c("translation", "isotropic"),
delta=NULL, adjust=1, biascorrect=TRUE)
Arguments
X |
Three-dimensional point pattern (object of class "pp3"). |
... |
Ignored. |
rmax |
Optional. Maximum value of argument r for which g_3(r) will be estimated. |
nrval |
Optional. Number of values of r for which g_3(r) will be estimated. |
correction |
Optional. Character vector specifying the edge correction(s) to be applied. See Details. |
delta |
Optional. Half-width of the Epanechnikov smoothing kernel. |
adjust |
Optional. Adjustment factor for the default value of delta. |
biascorrect |
Logical value. Whether to correct for underestimation due to truncation of the kernel near r = 0. |
Details
For a stationary point process \Phi in three-dimensional space, the pair correlation function is
g_3(r) = \frac{K_3'(r)}{4\pi r^2}
where K_3' is the derivative of the three-dimensional K-function (see K3est).
The three-dimensional point pattern X is assumed to be a partial realisation of a stationary point process \Phi. The distance between each pair of distinct points is computed. Kernel smoothing is applied to these distance values (weighted by an edge correction factor) and the result is renormalised to give the estimate of g_3(r).
The available edge corrections are:
"translation"
:-
the Ohser translation correction estimator (Ohser, 1983; Baddeley et al, 1993)
"isotropic"
:-
the three-dimensional counterpart of Ripley's isotropic edge correction (Ripley, 1977; Baddeley et al, 1993).
Kernel smoothing is performed using the Epanechnikov kernel with half-width delta. If delta is missing, the default is to use the rule-of-thumb \delta = 0.26/\lambda^{1/3} where \lambda = n/v is the estimated intensity, computed from the number n of data points and the volume v of the enclosing box. This default value of delta is multiplied by the factor adjust.
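A sketch of this default rule (assuming the domain of X is the enclosing box):
X <- rpoispp3(100)
lam <- npoints(X)/volume(domain(X))   # estimated intensity
0.26/lam^(1/3)                        # default halfwidth delta, before 'adjust'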
The smoothing estimate of the pair correlation g_3(r) is typically an underestimate when r is small, due to truncation of the kernel at r=0. If biascorrect=TRUE, the smoothed estimate is approximately adjusted for this bias. This is advisable whenever the dataset contains a sufficiently large number of points.
Value
A function value table (object of class "fv") that can be plotted, printed or coerced to a data frame containing the function values. Additionally the value of delta is returned as an attribute of this object.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rana Moyeed.
References
Baddeley, A.J., Moyeed, R.A., Howard, C.V. and Boyde, A. (1993) Analysis of a three-dimensional point pattern with replication. Applied Statistics 42, 641–668.
Ohser, J. (1983) On estimators for the reduced second moment measure of point processes. Mathematische Operationsforschung und Statistik, series Statistics, 14, 63 – 71.
Ripley, B.D. (1977) Modelling spatial patterns (with discussion). Journal of the Royal Statistical Society, Series B, 39, 172 – 212.
See Also
pp3 to create a three-dimensional point pattern (object of class "pp3").
F3est, G3est, K3est for other summary functions of a three-dimensional point pattern.
pcf to estimate the pair correlation function of point patterns in two dimensions or other spaces.
Examples
X <- rpoispp3(250)
Z <- pcf3est(X)
Zbias <- pcf3est(X, biascorrect=FALSE)
if(interactive()) {
opa <- par(mfrow=c(1,2))
plot(Z, ylim.covers=c(0, 1.2))
plot(Zbias, ylim.covers=c(0, 1.2))
par(opa)
}
attr(Z, "delta")
Multitype pair correlation function (cross-type)
Description
Calculates an estimate of the cross-type pair correlation function for a multitype point pattern.
Usage
pcfcross(X, i, j, ...,
r = NULL,
kernel = "epanechnikov", bw = NULL, stoyan = 0.15,
correction = c("isotropic", "Ripley", "translate"),
divisor = c("r", "d"),
ratio = FALSE)
Arguments
X |
The observed point pattern, from which an estimate of the cross-type pair correlation function g_{i,j}(r) will be computed. It must be a multitype point pattern (a marked point pattern whose marks are a factor). |
i |
The type (mark value) of the points in X from which distances are measured. A character string (or something that will be converted to a character string). |
j |
The type (mark value) of the points in X to which distances are measured. A character string (or something that will be converted to a character string). |
... |
Ignored. |
r |
Vector of values for the argument r at which g_{i,j}(r) should be evaluated. There is a sensible default. |
kernel |
Choice of smoothing kernel, passed to density.default. |
bw |
Bandwidth for smoothing kernel, passed to density.default. |
stoyan |
Coefficient for default bandwidth rule; see Details. |
correction |
Choice of edge correction. |
divisor |
Choice of divisor in the estimation formula: either "r" (the default) or "d". See Details. |
ratio |
Logical.
If |
Details
The cross-type pair correlation function is a generalisation of the pair correlation function pcf to multitype point patterns.
For two locations x and y separated by a distance r, the probability p(r) of finding a point of type i at location x and a point of type j at location y is
p(r) = \lambda_i \lambda_j g_{i,j}(r) \,{\rm d}x \, {\rm d}y
where \lambda_i is the intensity of the points of type i. For a completely random Poisson marked point process, p(r) = \lambda_i \lambda_j so g_{i,j}(r) = 1. Indeed for any marked point pattern in which the points of type i are independent of the points of type j, the theoretical value of the cross-type pair correlation is g_{i,j}(r) = 1.
For a stationary multitype point process, the cross-type pair correlation function between marks i and j is formally defined as
g_{i,j}(r) = \frac{K_{i,j}^\prime(r)}{2\pi r}
where K_{i,j}^\prime is the derivative of the cross-type K function K_{i,j}(r) of the point process. See Kest for information about K(r).
The command pcfcross computes a kernel estimate of the cross-type pair correlation function between marks i and j.
-
If
divisor="r"
(the default), then the multitype counterpart of the standard kernel estimator (Stoyan and Stoyan, 1994, pages 284–285) is used. By default, the recommendations of Stoyan and Stoyan (1994) are followed exactly. -
If
divisor="d"
then a modified estimator is used: the contribution from an interpoint distanced_{ij}
to the estimate ofg(r)
is divided byd_{ij}
instead of dividing byr
. This usually improves the bias of the estimator whenr
is close to zero.
There is also a choice of spatial edge corrections (which are needed to avoid bias due to edge effects associated with the boundary of the spatial window): correction="translate" is the Ohser-Stoyan translation correction, and correction="isotropic" or "Ripley" is Ripley's isotropic correction.
The choice of smoothing kernel is controlled by the argument kernel which is passed to density. The default is the Epanechnikov kernel.
The bandwidth of the smoothing kernel can be controlled by the argument bw. Its precise interpretation is explained in the documentation for density.default. For the Epanechnikov kernel with support [-h,h], the argument bw is equivalent to h/\sqrt{5}.
If bw is not specified, the default bandwidth is determined by Stoyan's rule of thumb (Stoyan and Stoyan, 1994, page 285) applied to the points of type j. That is, h = c/\sqrt{\lambda}, where \lambda is the (estimated) intensity of the point process of type j, and c is a constant in the range from 0.1 to 0.2. The argument stoyan determines the value of c.
The companion function pcfdot computes the corresponding analogue of Kdot.
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Essentially a data frame containing columns
r |
the vector of values of the argument |
theo |
the theoretical value |
together with columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function g_{i,j}
obtained by the edge corrections named.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Mark connection function markconnect. Multitype pair correlation pcfdot, pcfmulti.
Examples
p <- pcfcross(amacrine, "off", "on")
p <- pcfcross(amacrine, "off", "on", stoyan=0.1)
plot(p)
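A further sketch using the modified estimator divisor="d", which often reduces bias near r = 0:
pd <- pcfcross(amacrine, "off", "on", divisor="d")
plot(pd)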
Inhomogeneous Multitype Pair Correlation Function (Cross-Type)
Description
Estimates the inhomogeneous cross-type pair correlation function for a multitype point pattern.
Usage
pcfcross.inhom(X, i, j, lambdaI = NULL, lambdaJ = NULL, ...,
r = NULL, breaks = NULL,
kernel="epanechnikov", bw=NULL, adjust.bw = 1, stoyan=0.15,
correction = c("isotropic", "Ripley", "translate"),
sigma = NULL, adjust.sigma = 1, varcov = NULL)
Arguments
X |
The observed point pattern,
from which an estimate of the inhomogeneous
cross-type pair correlation function
|
i |
The type (mark value)
of the points in |
j |
The type (mark value)
of the points in |
lambdaI |
Optional.
Values of the estimated intensity function of the points of type |
lambdaJ |
Optional.
Values of the estimated intensity function of the points of type |
r |
Vector of values for the argument |
breaks |
This argument is for internal use only. |
kernel |
Choice of one-dimensional smoothing kernel,
passed to |
bw |
Bandwidth for one-dimensional smoothing kernel,
passed to |
adjust.bw |
Numeric value. |
... |
Other arguments passed to the one-dimensional kernel density estimation
function |
stoyan |
Bandwidth coefficient; see Details. |
correction |
Choice of edge correction. |
sigma , varcov |
Optional arguments passed to |
adjust.sigma |
Numeric value. |
Details
The inhomogeneous cross-type pair correlation function g_{ij}(r) is a summary of the dependence between two types of points in a multitype spatial point process that does not have a uniform density of points.
The best intuitive interpretation is the following: the probability p(r) of finding two points, of types i and j respectively, at locations x and y separated by a distance r is equal to
p(r) = \lambda_i(x) \lambda_j(y) g_{ij}(r) \,{\rm d}x \, {\rm d}y
where \lambda_i is the intensity function of the process of points of type i. For a multitype Poisson point process, this probability is p(r) = \lambda_i(x) \lambda_j(y) so g_{ij}(r) = 1.
The command pcfcross.inhom estimates the inhomogeneous pair correlation using a modified version of the algorithm in pcf.ppp. The arguments bw and adjust.bw control the degree of one-dimensional smoothing of the estimate of pair correlation.
If the arguments lambdaI and/or lambdaJ are missing or null, they will be estimated from X by spatial kernel smoothing using a leave-one-out estimator, computed by density.ppp. The arguments sigma, varcov and adjust.sigma control the degree of spatial smoothing.
Value
A function value table (object of class "fv"). Essentially a data frame containing the variables
r |
the vector of values of the argument r at which g_{ij}(r) has been estimated |
theo |
vector of values equal to 1, the theoretical value of g_{ij}(r) for the Poisson process |
trans |
vector of values of g_{ij}(r) estimated by the translation correction |
iso |
vector of values of g_{ij}(r) estimated by the Ripley isotropic correction |
as required.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
pcf.ppp, pcfinhom, pcfcross, pcfdot.inhom
Examples
plot(pcfcross.inhom(amacrine, "on", "off", stoyan=0.1),
legendpos="bottom")
Multitype pair correlation function (i-to-any)
Description
Calculates an estimate of the multitype pair correlation function
(from points of type i
to points of any type)
for a multitype point pattern.
Usage
pcfdot(X, i, ..., r = NULL,
kernel = "epanechnikov", bw = NULL, stoyan = 0.15,
correction = c("isotropic", "Ripley", "translate"),
divisor = c("r", "d"),
ratio=FALSE)
Arguments
X |
The observed point pattern, from which an estimate of the dot-type pair correlation function g_{i\bullet}(r) will be computed. It must be a multitype point pattern (a marked point pattern whose marks are a factor). |
i |
The type (mark value) of the points in X from which distances are measured. A character string (or something that will be converted to a character string). |
... |
Ignored. |
r |
Vector of values for the argument r at which g_{i\bullet}(r) should be evaluated. There is a sensible default. |
kernel |
Choice of smoothing kernel, passed to density.default. |
bw |
Bandwidth for smoothing kernel, passed to density.default. |
stoyan |
Coefficient for default bandwidth rule; see Details. |
correction |
Choice of edge correction. |
divisor |
Choice of divisor in the estimation formula: either "r" (the default) or "d". See Details. |
ratio |
Logical.
If |
Details
This is a generalisation of the pair correlation function pcf to multitype point patterns.
For two locations x and y separated by a nonzero distance r, the probability p(r) of finding a point of type i at location x and a point of any type at location y is
p(r) = \lambda_i \lambda g_{i\bullet}(r) \,{\rm d}x \, {\rm d}y
where \lambda is the intensity of all points, and \lambda_i is the intensity of the points of type i. For a completely random Poisson marked point process, p(r) = \lambda_i \lambda so g_{i\bullet}(r) = 1.
For a stationary multitype point process, the type-i-to-any-type pair correlation function is formally defined as
g_{i\bullet}(r) = \frac{K_{i\bullet}^\prime(r)}{2\pi r}
where K_{i\bullet}^\prime is the derivative of the type-i-to-any-type K function K_{i\bullet}(r) of the point process. See Kdot for information about K_{i\bullet}(r).
The command pcfdot computes a kernel estimate of the multitype pair correlation function from points of type i to points of any type.
-
If
divisor="r"
(the default), then the multitype counterpart of the standard kernel estimator (Stoyan and Stoyan, 1994, pages 284–285) is used. By default, the recommendations of Stoyan and Stoyan (1994) are followed exactly. -
If
divisor="d"
then a modified estimator is used: the contribution from an interpoint distanced_{ij}
to the estimate ofg(r)
is divided byd_{ij}
instead of dividing byr
. This usually improves the bias of the estimator whenr
is close to zero.
There is also a choice of spatial edge corrections (which are needed to avoid bias due to edge effects associated with the boundary of the spatial window): correction="translate" is the Ohser-Stoyan translation correction, and correction="isotropic" or "Ripley" is Ripley's isotropic correction.
The choice of smoothing kernel is controlled by the argument kernel which is passed to density. The default is the Epanechnikov kernel.
The bandwidth of the smoothing kernel can be controlled by the argument bw. Its precise interpretation is explained in the documentation for density.default. For the Epanechnikov kernel with support [-h,h], the argument bw is equivalent to h/\sqrt{5}.
If bw is not specified, the default bandwidth is determined by Stoyan's rule of thumb (Stoyan and Stoyan, 1994, page 285). That is, h = c/\sqrt{\lambda}, where \lambda is the (estimated) intensity of the unmarked point process, and c is a constant in the range from 0.1 to 0.2. The argument stoyan determines the value of c.
The companion function pcfcross computes the corresponding analogue of Kcross.
Value
An object of class "fv"
, see fv.object
,
which can be plotted directly using plot.fv
.
Essentially a data frame containing columns
r |
the vector of values of the argument |
theo |
the theoretical value |
together with columns named
"border"
, "bord.modif"
,
"iso"
and/or "trans"
,
according to the selected edge corrections. These columns contain
estimates of the function g_{i,j}
obtained by the edge corrections named.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Mark connection function markconnect. Multitype pair correlation pcfcross, pcfmulti.
Examples
p <- pcfdot(amacrine, "on")
p <- pcfdot(amacrine, "on", stoyan=0.1)
plot(p)
Inhomogeneous Multitype Pair Correlation Function (Type-i-To-Any-Type)
Description
Estimates the inhomogeneous multitype pair correlation function
(from type i
to any type)
for a multitype point pattern.
Usage
pcfdot.inhom(X, i, lambdaI = NULL, lambdadot = NULL, ...,
r = NULL, breaks = NULL,
kernel="epanechnikov", bw=NULL, adjust.bw=1, stoyan=0.15,
correction = c("isotropic", "Ripley", "translate"),
sigma = NULL, adjust.sigma = 1, varcov = NULL)
Arguments
X |
The observed point pattern,
from which an estimate of the inhomogeneous
multitype pair correlation function
|
i |
The type (mark value)
of the points in |
lambdaI |
Optional.
Values of the estimated intensity function of the points of type |
lambdadot |
Optional.
Values of the estimated intensity function of the point pattern |
r |
Vector of values for the argument |
breaks |
This argument is for internal use only. |
kernel |
Choice of one-dimensional smoothing kernel,
passed to |
bw |
Bandwidth for one-dimensional smoothing kernel,
passed to |
adjust.bw |
Numeric value. |
... |
Other arguments passed to the one-dimensional kernel density estimation
function |
stoyan |
Bandwidth coefficient; see Details. |
correction |
Choice of edge correction. |
sigma , varcov |
Optional arguments passed to |
adjust.sigma |
Numeric value. |
Details
The inhomogeneous multitype (type i to any type) pair correlation function g_{i\bullet}(r) is a summary of the dependence between different types of points in a multitype spatial point process that does not have a uniform density of points.
The best intuitive interpretation is the following: the probability p(r) of finding a point of type i at location x and another point of any type at location y, where x and y are separated by a distance r, is equal to
p(r) = \lambda_i(x) \lambda(y) g_{i\bullet}(r) \,{\rm d}x \, {\rm d}y
where \lambda_i is the intensity function of the process of points of type i, and where \lambda is the intensity function of the points of all types. For a multitype Poisson point process, this probability is p(r) = \lambda_i(x) \lambda(y) so g_{i\bullet}(r) = 1.
The command pcfdot.inhom estimates the inhomogeneous multitype pair correlation using a modified version of the algorithm in pcf.ppp. The arguments bw and adjust.bw control the degree of one-dimensional smoothing of the estimate of pair correlation.
If the arguments lambdaI and/or lambdadot are missing or null, they will be estimated from X by spatial kernel smoothing using a leave-one-out estimator, computed by density.ppp. The arguments sigma, varcov and adjust.sigma control the degree of spatial smoothing.
Value
A function value table (object of class "fv"). Essentially a data frame containing the variables
r |
the vector of values of the argument r at which g_{i\bullet}(r) has been estimated |
theo |
vector of values equal to 1, the theoretical value of g_{i\bullet}(r) for the Poisson process |
trans |
vector of values of g_{i\bullet}(r) estimated by the translation correction |
iso |
vector of values of g_{i\bullet}(r) estimated by the Ripley isotropic correction |
as required.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
pcf.ppp, pcfinhom, pcfdot, pcfcross.inhom
Examples
plot(pcfdot.inhom(amacrine, "on", stoyan=0.1), legendpos="bottom")
Inhomogeneous Pair Correlation Function
Description
Estimates the inhomogeneous pair correlation function of a point pattern using kernel methods.
Usage
pcfinhom(X, lambda = NULL, ...,
r = NULL, adaptive = FALSE,
kernel = "epanechnikov", bw = NULL, h = NULL,
bw.args = list(), stoyan = 0.15, adjust = 1,
correction = c("translate", "Ripley"),
divisor = c("r", "d", "a", "t"),
zerocor=c("weighted", "reflection", "convolution",
"bdrykern", "JonesFoster", "none"),
renormalise = TRUE, normpower = 1,
update = TRUE, leaveoneout = TRUE,
reciplambda = NULL, sigma = NULL, adjust.sigma = 1, varcov = NULL,
gref = NULL, tau = 0, fast = TRUE, var.approx = FALSE,
domain = NULL, ratio = FALSE, close = NULL)
Arguments
X |
A point pattern (object of class "ppp"). |
lambda |
Optional.
Values of the estimated intensity function.
Either a vector giving the intensity values
at the points of the pattern |
... |
Arguments passed to |
r |
Vector of values for the argument r at which g_{\rm inhom}(r) should be evaluated. There is a sensible default. |
adaptive |
Logical value specifying whether to use adaptive kernel smoothing
( |
kernel |
Choice of smoothing kernel, passed to |
bw |
Bandwidth for smoothing kernel. Either a single numeric value giving the standard deviation of the kernel, or a character string specifying a bandwidth selection rule, or a function that computes the selected bandwidth. See Details. |
h |
Kernel halfwidth h (incompatible with argument bw). |
bw.args |
Optional. List of additional arguments to be passed to bw when bw is a function. |
stoyan |
Coefficient for Stoyan's bandwidth selection rule; see Details. |
adjust |
Numerical adjustment factor for the bandwidth.
The bandwidth actually used is |
correction |
Edge correction. A character vector specifying the choice (or choices) of edge correction. See Details. |
divisor |
Choice of divisor in the estimation formula:
either |
zerocor |
String (partially matched) specifying a correction for the boundary effect bias at r = 0. See Details. |
renormalise |
Logical. Whether to renormalise the estimate. See Details. |
normpower |
Integer (usually either 1 or 2). Normalisation power. See Details. |
update |
Logical. If |
leaveoneout |
Logical value (passed to |
reciplambda |
Alternative to |
sigma , varcov |
Optional arguments passed to |
adjust.sigma |
Numeric value. |
gref |
Optional. A pair correlation function that will be used as the reference for the transformation to uniformity, when divisor="t". See Details. |
tau |
Optional shrinkage coefficient. A single numeric value. |
fast |
Logical value specifying whether to compute the kernel smoothing using a Fast Fourier Transform algorithm (fast=TRUE) or not (fast=FALSE). |
var.approx |
Logical value indicating whether to compute an analytic approximation to the variance of the estimated pair correlation. |
domain |
Optional. Calculations will be restricted to this subset of the window. See Details. |
ratio |
Logical. If TRUE, the numerator and denominator of each edge-corrected estimate will also be saved, for use in analysing replicated point patterns. |
close |
Advanced use only. Precomputed data. See section on Advanced Use. |
Details
The inhomogeneous pair correlation function g_{\rm inhom}(r) is a summary of the dependence between points in a spatial point process that does not have a uniform density of points.
The best intuitive interpretation is the following: the probability p(r) of finding two points at locations x and y separated by a distance r is equal to
p(r) = \lambda(x) \lambda(y) g_{\rm inhom}(r) \,{\rm d}x \, {\rm d}y
where \lambda is the intensity function of the point process. For a Poisson point process with intensity function \lambda, this probability is p(r) = \lambda(x) \lambda(y) so g_{\rm inhom}(r) = 1.
The inhomogeneous pair correlation function is related to the inhomogeneous K function through
g_{\rm inhom}(r) = \frac{K'_{\rm inhom}(r)}{2\pi r}
where K'_{\rm inhom}(r) is the derivative of K_{\rm inhom}(r), the inhomogeneous K function. See Kinhom for information about K_{\rm inhom}(r).
The command pcfinhom estimates the inhomogeneous pair correlation using a modified version of the algorithm in pcf. In this modified version, the contribution from each pair of points X[i], X[j] is weighted by 1/(\lambda(X[i]) \lambda(X[j])).
The arguments divisor, correction and zerocor are interpreted as described in the help file for pcf.
If renormalise=TRUE (the default), then the estimates are multiplied by c^{\mbox{normpower}} where
c = \mbox{area}(W)/\sum (1/\lambda(x_i)).
This rescaling reduces the variability and bias of the estimate in small samples and in cases of very strong inhomogeneity. The default value of normpower is 1 but the most sensible value is 2, which would correspond to rescaling the lambda values so that
\sum (1/\lambda(x_i)) = \mbox{area}(W).
Value
A function value table (object of class "fv"). Essentially a data frame containing the variables
r |
the vector of values of the argument r at which g_{\rm inhom}(r) has been estimated |
theo |
vector of values equal to 1, the theoretical value of g_{\rm inhom}(r) for the Poisson process |
trans |
vector of values of g_{\rm inhom}(r) estimated by the translation correction |
iso |
vector of values of g_{\rm inhom}(r) estimated by the Ripley isotropic correction |
v |
vector of approximate values of the variance of the estimate of g_{\rm inhom}(r) |
as required.
If ratio=TRUE then the return value also has two attributes called "numerator" and "denominator" which are "fv" objects containing the numerators and denominators of each estimate of g_{\rm inhom}(r). The return value also has an attribute "bw" giving the smoothing bandwidth that was used, and an attribute "info" containing details of the algorithm parameters.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net, Ege Rubak rubak@math.aau.dk, Martin Hazelton Martin.Hazelton@otago.ac.nz and Tilman Davies Tilman.Davies@otago.ac.nz.
References
Baddeley, A., Davies, T.M. and Hazelton, M.L. (2025) An improved estimator of the pair correlation function of a spatial point process. Biometrika, to appear.
Examples
g <- pcfinhom(japanesepines, divisor="a")
Marked pair correlation function
Description
For a marked point pattern, estimate the multitype pair correlation function using kernel methods.
Usage
pcfmulti(X, I, J, ..., r = NULL,
kernel = "epanechnikov", bw = NULL, stoyan = 0.15,
correction = c("translate", "Ripley"),
divisor = c("r", "d"),
Iname = "points satisfying condition I",
Jname = "points satisfying condition J",
ratio = FALSE)
Arguments
X |
The observed point pattern,
from which an estimate of the cross-type pair correlation function
|
I |
Subset index specifying the points of X from which distances are measured. |
J |
Subset index specifying the points in X to which distances are measured. |
... |
Ignored. |
r |
Vector of values for the argument |
kernel |
Choice of smoothing kernel,
passed to |
bw |
Bandwidth for smoothing kernel,
passed to |
stoyan |
Coefficient for default bandwidth rule. |
correction |
Choice of edge correction. |
divisor |
Choice of divisor in the estimation formula:
either |
Iname , Jname |
Optional. Character strings describing the members of
the subsets |
ratio |
Logical.
If |
Details
This is a generalisation of pcfcross to arbitrary collections of points. The algorithm measures the distance from each data point in subset I to each data point in subset J, excluding identical pairs of points. The distances are kernel-smoothed and renormalised to form a pair correlation function.
- If divisor="r" (the default), then the multitype counterpart of the standard kernel estimator (Stoyan and Stoyan, 1994, pages 284–285) is used. By default, the recommendations of Stoyan and Stoyan (1994) are followed exactly.
- If divisor="d" then a modified estimator is used: the contribution from an interpoint distance d_{ij} to the estimate of g(r) is divided by d_{ij} instead of dividing by r. This usually improves the bias of the estimator when r is close to zero.
There is also a choice of spatial edge corrections
(which are needed to avoid bias due to edge effects
associated with the boundary of the spatial window):
correction="translate"
is the Ohser-Stoyan translation
correction, and correction="isotropic"
or "Ripley"
is Ripley's isotropic correction.
The arguments I
and J
specify two subsets of the
point pattern X
. They may be any type of subset indices, for example,
logical vectors of length equal to npoints(X)
,
or integer vectors with entries in the range 1 to
npoints(X)
, or negative integer vectors.
Alternatively, I
and J
may be functions
that will be applied to the point pattern X
to obtain
index vectors. If I
is a function, then evaluating
I(X)
should yield a valid subset index. This option
is useful when generating simulation envelopes using
envelope
.
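For example (a sketch using hypothetical selector functions on the longleaf data; each function returns a logical subset index when applied to the pattern):
adultfun <- function(X) { marks(X) >= 30 }
juvenilefun <- function(X) { marks(X) < 30 }
p2 <- pcfmulti(longleaf, adultfun, juvenilefun)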
The choice of smoothing kernel is controlled by the
argument kernel
which is passed to density
.
The default is the Epanechnikov kernel.
The bandwidth of the smoothing kernel can be controlled by the
argument bw
. Its precise interpretation
is explained in the documentation for density.default
.
For the Epanechnikov kernel with support [-h,h]
,
the argument bw
is equivalent to h/\sqrt{5}
.
If bw is not specified, the default bandwidth is determined by Stoyan's rule of thumb (Stoyan and Stoyan, 1994, page 285) applied to the points in subset J. That is, h = c/\sqrt{\lambda}, where \lambda is the (estimated) intensity of the point process of points in J, and c is a constant in the range from 0.1 to 0.2. The argument stoyan determines the value of c.
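For instance, a smaller value of stoyan gives a narrower bandwidth and hence a less smooth estimate (a sketch; "biweight" is one of the kernels accepted by density.default):
a <- marks(longleaf) >= 30
p3 <- pcfmulti(longleaf, a, !a, kernel="biweight", stoyan=0.1)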
Value
An object of class "fv"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
See Also
Examples
adult <- (marks(longleaf) >= 30)
juvenile <- !adult
p <- pcfmulti(longleaf, adult, juvenile)
Plot Result of Berman Test
Description
Plot the result of Berman's test of goodness-of-fit
Usage
## S3 method for class 'bermantest'
plot(x, ...,
lwd=par("lwd"), col=par("col"), lty=par("lty"),
lwd0=lwd, col0=2, lty0=2)
Arguments
x | Object to be plotted. An object of class "bermantest". |
... | Extra arguments that will be passed to the plotting function. |
col , lwd , lty |
The width, colour and type of lines used to plot the empirical distribution curve. |
col0 , lwd0 , lty0 |
The width, colour and type of lines used to plot the predicted (null) distribution curve. |
Details
This is the plot
method for the class "bermantest"
.
An object of this class represents the outcome of Berman's test
of goodness-of-fit of a spatial Poisson point process model,
computed by berman.test
.
For the Z1 test (i.e. if x was computed using berman.test(..., which="Z1")),
the plot displays the two cumulative distribution functions
that are compared by the test: namely the empirical cumulative distribution
function of the covariate at the data points, \hat F
,
and the predicted
cumulative distribution function of the covariate under the model,
F_0
, both plotted against the value of the covariate.
Two vertical lines show the mean values of these two distributions.
If the model is correct, the two curves should be close; the test is
based on comparing the two vertical lines.
For the Z2 test (i.e. if x was computed using berman.test(..., which="Z2")), the plot displays the empirical
cumulative distribution function of the values
U_i = F_0(Y_i)
where Y_i
is the
value of the covariate at the i
-th data point. The diagonal line
with equation y=x
is also shown. Two vertical lines show the
mean of the values U_i
and the value 1/2
. If the
model is correct, the two curves should be close. The test is based on
comparing the two vertical lines.
Value
NULL
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Examples
plot(berman.test(cells, "x"))
if(require("spatstat.model")) {
# synthetic data: nonuniform Poisson process
X <- rpoispp(function(x,y) { 100 * exp(-x) }, win=square(1))
# fit uniform Poisson process
fit0 <- ppm(X ~1)
# test covariate = x coordinate
xcoord <- function(x,y) { x }
# test wrong model
k <- berman.test(fit0, xcoord, "Z1")
# plot result of test
plot(k, col="red", col0="green")
# Z2 test
k2 <- berman.test(fit0, xcoord, "Z2")
plot(k2, col="red", col0="green")
}
Plot a Spatial Distribution Test
Description
Plot the result of a spatial distribution test
computed by cdf.test
.
Usage
## S3 method for class 'cdftest'
plot(x, ...,
style=c("cdf", "PP", "QQ"),
lwd=par("lwd"), col=par("col"), lty=par("lty"),
lwd0=lwd, col0=2, lty0=2,
do.legend)
Arguments
x | Object to be plotted. An object of class "cdftest". |
... | Extra arguments that will be passed to the plotting function. |
style | Style of plot. See Details. |
col , lwd , lty | The width, colour and type of lines used to plot the empirical curve (the empirical distribution, or PP plot or QQ plot). |
col0 , lwd0 , lty0 | The width, colour and type of lines used to plot the reference curve (the predicted distribution, or the diagonal). |
do.legend | Logical value indicating whether to add an explanatory legend. Applies only when style="cdf". |
Details
This is the plot
method for the class "cdftest"
.
An object of this class represents the outcome of
a spatial distribution test, computed by cdf.test
,
and based on either the Kolmogorov-Smirnov,
Cramer-von Mises
or Anderson-Darling test.
If style="cdf"
(the default),
the plot displays the two cumulative distribution functions
that are compared by the test: namely the empirical cumulative distribution
function of the covariate at the data points, and the predicted
cumulative distribution function of the covariate under the model,
both plotted against the value of the covariate. The
Kolmogorov-Smirnov test statistic (for example)
is the maximum vertical separation
between the two curves.
If style="PP"
then the P-P plot is drawn. The
x
coordinates of the plot are cumulative
probabilities for the covariate under the model.
The y
coordinates are cumulative probabilities
for the covariate at the data points. The diagonal line
y=x
is also drawn for reference. The Kolmogorov-Smirnov
test statistic is the maximum vertical separation
between the P-P plot and the diagonal reference line.
If style="QQ"
then the Q-Q plot is drawn. The
x
coordinates of the plot are quantiles
of the covariate under the model.
The y
coordinates are quantiles of the
covariate at the data points. The diagonal line
y=x
is also drawn for reference. The Kolmogorov-Smirnov
test statistic cannot be read off the Q-Q plot.
Value
NULL
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Examples
op <- options(useFancyQuotes=FALSE)
plot(cdf.test(cells, "x"))
if(require("spatstat.model")) {
# synthetic data: nonuniform Poisson process
X <- rpoispp(function(x,y) { 100 * exp(x) }, win=square(1))
# fit uniform Poisson process
fit0 <- ppm(X ~1)
# test covariate = x coordinate
xcoord <- function(x,y) { x }
# test wrong model
k <- cdf.test(fit0, xcoord)
# plot result of test
plot(k, lwd0=3)
plot(k, style="PP")
plot(k, style="QQ")
}
options(op)
Plot a Simulation Envelope
Description
Plot method for the class "envelope"
.
Usage
## S3 method for class 'envelope'
plot(x, ..., main)
Arguments
x | An object of class "envelope". |
main | Main title for plot. |
... | Extra arguments passed to plot.fv. |
Details
This is the plot
method for the class "envelope"
of simulation envelopes. Objects of this class are
created by the command envelope
.
This plot method is currently identical to plot.fv
.
Its default behaviour is to shade the region
between the upper and lower envelopes in a light grey colour.
To suppress the shading and plot the upper and lower envelopes
as curves, set shade=NULL
.
To change the colour of the shading, use the argument shadecol
which is passed to plot.fv
.
See plot.fv
for further information on how to
control the plot.
Value
Either NULL
, or a data frame giving the meaning of the
different line types and colours.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Examples
E <- envelope(cells, Kest, nsim=19)
plot(E)
plot(E, sqrt(./pi) ~ r)
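# a sketch of the shading options described in the Details:
plot(E, shade=NULL)       # draw the upper and lower envelopes as curves
plot(E, shadecol="pink")  # shade the envelope region in a different colour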
Plot a Function Array
Description
Plots an array of summary functions, usually associated with a
point pattern, stored in an object of class "fasp"
.
A method for plot
.
Usage
## S3 method for class 'fasp'
plot(x,formule=NULL, ...,
subset=NULL, title=NULL, banner=TRUE,
transpose=FALSE,
samex=FALSE, samey=FALSE,
mar.panel=NULL,
outerlabels=TRUE, cex.outerlabels=1.25,
legend=FALSE)
Arguments
x | An object of class "fasp". |
formule | A formula or list of formulae indicating what variables are to be plotted against what variable. Each formula is either an R language formula object, or a string that can be parsed as a formula. If NULL, a default formula will be used. |
... | Arguments passed to plot.fv to control the individual plot panels. |
subset | A logical vector, or a vector of indices, or an expression or a character string, or a list of such, indicating a subset of the data to be included in each plot. If NULL, all the data are included. |
title | Overall title for the plot. |
banner | Logical. If TRUE, the overall title is plotted. |
transpose | Logical. If TRUE, rows and columns will be exchanged. |
samex , samey | Logical values indicating whether all individual plot panels should have the same x axis limits and the same y axis limits, respectively. This makes it easier to compare the plots. |
mar.panel | Vector of length 4 giving the value of the graphics parameter mar controlling the margins of each panel. |
outerlabels | Logical. If TRUE, the row and column names of the array of functions are plotted in the margins of the array of plot panels. If FALSE, each individual plot panel is labelled by its row and column name. |
cex.outerlabels | Character expansion factor for row and column labels of array. |
legend | Logical flag determining whether to plot a legend in each panel. |
Details
An object of class "fasp"
represents
an array of summary functions, usually associated with a point
pattern. See fasp.object
for details.
Such an object is created, for example,
by alltypes
.
The function plot.fasp
is
a method for plot
. It calls plot.fv
to plot the
individual panels.
For information about the interpretation of the
arguments formule
and subset
,
see plot.fv
.
Arguments that are often passed through ...
include
col
to control the colours of the different lines in a panel,
and lty
and lwd
to control the line type and line width
of the different lines in a panel. The argument shade
can also be used to display confidence intervals or significance bands
as filled grey shading. See plot.fv
.
The argument title
, if present, will determine the
overall title of the plot. If it is absent, it defaults to x$title
.
Titles for the individual plot panels will be taken from
x$titles
.
Value
None.
Warnings
(Each component of) the subset
argument may be a
logical vector (of the same length as the vectors of data which
are extracted from x
), or a vector of indices, or an
expression such as expression(r<=0.2)
, or a text string,
such as "r<=0.2"
.
Attempting a syntax such as subset = r<=0.2
(without
wrapping r<=0.2
either in quote marks or in expression()
)
will cause this function to fall over.
Variables referred to in any formula must exist in the data frames
stored in x
. What the names of these variables are will
of course depend upon the nature of x
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
alltypes
,
plot.fv
,
fasp.object
Examples
if(interactive()) {
X.G <- alltypes(amacrine,"G")
plot(X.G)
plot(X.G,subset="r<=0.2")
plot(X.G,formule=asin(sqrt(cbind(km,theo))) ~ asin(sqrt(theo)))
plot(X.G,fo=cbind(km,theo) - theo~r, subset="theo<=0.9")
}
Plot Function Values
Description
Plot method for the class "fv"
.
Usage
## S3 method for class 'fv'
plot(x, fmla, ..., subset=NULL, lty=NULL, col=NULL, lwd=NULL,
xlim=NULL, ylim=NULL, xlab=NULL, ylab=NULL,
clip.xlim=TRUE, ylim.covers=NULL,
legend=!add, legendpos="topleft", legendavoid=missing(legendpos),
legendmath=TRUE, legendargs=list(),
shade=fvnames(x, ".s"), shadecol="grey",
add=FALSE, log="",
mathfont=c("italic", "plain", "bold", "bolditalic"),
limitsonly=FALSE, do.plot=TRUE)
Arguments
x | An object of class "fv". |
fmla | An R language formula determining which variables or expressions are plotted. Either a formula object, or a string that can be parsed as a formula. See Details. |
subset | (optional) subset of rows of the data frame that will be plotted. |
lty | (optional) numeric vector of values of the graphical parameter lty controlling the line style of each curve. |
col | (optional) numeric vector of values of the graphical parameter col controlling the colour of each curve. |
lwd | (optional) numeric vector of values of the graphical parameter lwd controlling the line width of each curve. |
xlim | (optional) range of x axis |
ylim | (optional) range of y axis |
xlab | (optional) label for x axis |
ylab | (optional) label for y axis |
... | Extra arguments passed to the underlying plotting function. |
clip.xlim | Logical value specifying whether the range of the horizontal axis should be restricted to the recommended range of values (see the section on controlling the horizontal axis limits). |
ylim.covers | Optional vector of y values that must be included in the y axis. For example ylim.covers=0 will ensure that the y axis includes the origin. |
legend | Logical flag or NULL. If TRUE, a legend explaining the meaning of each curve will be added to the plot. The default is to include a legend unless the plot is being added to an existing plot (add=TRUE). |
legendpos | The position of the legend. Either a character string keyword (see legend for keyword options) or a pair of coordinates in the format list(x,y). |
legendavoid | Whether to avoid collisions between the legend and the graphics. Logical value. If TRUE, the code will check for collisions between the legend box and the plotted curves, and will relocate the legend if necessary. |
legendmath | Logical. If TRUE, the legend will display the mathematical notation for each curve. |
legendargs | Named list containing additional arguments to be passed to legend, controlling the appearance of the legend. |
shade | A character vector giving the names of two columns of x, or another type of index that identifies two columns. The region between the corresponding two curves will be shaded. The default is to shade between the upper and lower limits designated by fvnames(x, ".s"), if these exist. |
shadecol | The colour to be used in the shading. |
add | Logical. Whether the plot should be added to an existing plot. |
log | A character string which contains "x" if the x axis is to be logarithmic, "y" if the y axis is to be logarithmic, and "xy" or "yx" if both axes are to be logarithmic. |
mathfont | Character string. The font to be used for mathematical expressions in the axis labels and the legend. |
limitsonly | Logical. If TRUE, no plot is drawn; the function returns only the recommended x and y axis limits. See Details. |
do.plot | Logical value indicating whether to actually plot the graph of x. |
Details
This is the plot
method for the class "fv"
.
An object of class "fv"
is a convenient way of storing several different
statistical estimates of a summary function; see fv.object
.
The default behaviour, executed by plot(x)
, displays these
different estimates as curves with different colours and line styles,
and plots a legend explaining them.
The use of the argument fmla
is like plot.formula
, but offers
some extra functionality.
The left and right hand sides of fmla
are evaluated,
and the results are plotted against each other
(the left side on the y
axis
against the right side on the x
axis).
The left and right hand sides of fmla
may be
the names of columns of the data frame x
,
or expressions involving these names. If a variable in fmla
is not the name of a column of x
, the algorithm will search for
an object of this name in the environment where plot.fv
was
called, and then in the enclosing environment, and so on.
Multiple curves may be specified by a single formula
of the form
cbind(y1,y2,...,yn) ~ x
, where x,y1,y2,...,yn
are
expressions involving the variables in the data frame.
Each of the variables y1,y2,...,yn
in turn will be plotted
against x
.
See the examples.
Convenient abbreviations which can be used in the formula are:
- the symbol . which represents all the columns in the data frame that will be plotted by default;
- the symbol .x which represents the function argument;
- the symbol .y which represents the recommended value of the function.
For further information, see fvnames
.
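For example (a sketch illustrating these abbreviations):
K <- Kest(cells)
plot(K, . ~ .x)   # all default curves against the function argument
plot(K, .y ~ .x)  # only the recommended estimate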
The value returned by this plot function indicates the meaning of the line types and colours in the plot. It can be used to make a suitable legend for the plot if you want to do this by hand. See the examples.
The argument shade
can be used to display critical bands
or confidence intervals. If it is not NULL
, then it should be
a subset index for the columns of x
, that identifies exactly
2 columns. When the corresponding curves are plotted, the region
between the curves will be shaded in light grey. See the Examples.
The default values of lty
, col
and lwd
can
be changed using spatstat.options("plot.fv")
.
Use type = "n"
to create the plot region and draw the axes
without plotting any curves.
Use do.plot=FALSE
to suppress all plotting.
The return value is a data frame giving the meaning of the
different line types and colours which would have been plotted.
Use limitsonly=TRUE
to suppress all plotting
and just compute the x
and y
limits. This can be used
to calculate common x
and y
scales for several plots.
To change the kind of parenthesis enclosing the explanatory text about the unit of length, use spatstat.options('units.paren').
Value
Invisible: either NULL
, or a data frame giving the meaning of the
different line types and colours.
Controlling the horizontal axis limits
The plot generated by plot(x)
does not necessarily display all the data that is contained in the object.
The range of values of the function argument r
displayed in the plot may be narrower than the
range of values actually contained in the data frame.
To override this behaviour and display all the available data,
set clip.xlim=FALSE
.
Statistical literature for summary functions of spatial data
recommends that, when the function is plotted,
the values of the function argument on the horizontal axis
should be restricted to a limited range of values.
For example, Ripley recommends that the K-function K(r)
should be plotted only for
values of distance r
between 0
and b/4
where b
is the shortest side of the enclosing rectangle of the data.
This may be desirable so that the interesting detail is clearly visible in the plot. It may be necessary because values outside the recommended range are theoretically invalid, or unreliable due to high variance or large bias.
To support this standard practice, each object of class "fv"
may include data specifying a “recommended range” of values of
the function argument. The object produced by Kest
includes a recommended range following Ripley's recommendation above.
Similarly for Gest
, Fest
and many other
commands.
When plot(x)
is executed, the horizontal axis is restricted
to the recommended range of values. This recommendation can be
overridden by setting clip.xlim=FALSE
or by specifying the numerical limits xlim
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Examples
K <- Kest(cells)
# K is an object of class "fv"
plot(K, iso ~ r) # plots iso against r
plot(K, sqrt(iso/pi) ~ r) # plots sqrt(iso/pi) against r
plot(K, cbind(iso,theo) ~ r) # plots iso against r AND theo against r
plot(K, . ~ r) # plots all available estimates of K against r
plot(K, sqrt(./pi) ~ r) # plots all estimates of L-function
# L(r) = sqrt(K(r)/pi)
plot(K, cbind(iso,theo) ~ r, col=c(2,3))
# plots iso against r in colour 2
# and theo against r in colour 3
plot(K, iso ~ r, subset=quote(r < 0.2))
# plots iso against r for r < 0.2
# Can't remember the names of the columns? No problem..
plot(K, sqrt(./pi) ~ .x)
# making a legend by hand
v <- plot(K, . ~ r, legend=FALSE)
legend("topleft", legend=v$meaning, lty=v$lty, col=v$col)
# significance bands
KE <- envelope(cells, Kest, nsim=19)
plot(KE, shade=c("hi", "lo"))
# how to display two functions on a common scale
Kr <- Kest(redwood)
a <- plot(K, limitsonly=TRUE)
b <- plot(Kr, limitsonly=TRUE)
xlim <- range(a$xlim, b$xlim)
ylim <- range(a$ylim, b$ylim)
opa <- par(mfrow=c(1,2))
plot(K, xlim=xlim, ylim=ylim)
plot(Kr, xlim=xlim, ylim=ylim)
par(opa)
# For a shortcut, try plot(anylist(K, Kr), equal.scales=TRUE)
Plot Laslett Transform
Description
Plot the result of Laslett's Transform.
Usage
## S3 method for class 'laslett'
plot(x, ...,
Xpars = list(box = TRUE, col = "grey"),
pointpars = list(pch = 3, cols = "blue"),
rectpars = list(lty = 3, border = "green"))
Arguments
x | Object of class "laslett", produced by the function laslett. |
... | Additional plot arguments passed to plot.solist. |
Xpars | A list of plot arguments passed to plot.owin or plot.im to display the original region X. |
pointpars | A list of plot arguments passed to plot.ppp to display the tangent points. |
rectpars | A list of plot arguments passed to plot.owin to display the maximal rectangle. |
Details
This is the plot
method for the class "laslett"
.
The function laslett
applies Laslett's Transform
to a spatial region X
and returns an object of class
"laslett"
representing the result of the transformation.
The result is plotted by this method.
The plot function plot.solist
is used to align
the before-and-after pictures. See plot.solist
for
further options to control the plot.
Value
None.
Author(s)
Kassel Hingee and Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
See Also
Examples
b <- laslett(heather$coarse, plotit=FALSE)
plot(b, main="Heather Data")
Display the result of a quadrat counting test.
Description
Given the result of a quadrat counting test, graphically display the quadrats that were used, the observed and expected counts, and the residual in each quadrat.
Usage
## S3 method for class 'quadrattest'
plot(x, ..., textargs=list())
Arguments
x | Object of class "quadrattest", produced by quadrat.test. |
... | Additional arguments passed to plot.tess to control the plot of the quadrats. |
textargs | List of additional arguments passed to text.default to control the appearance of the text. |
Details
This is the plot method for objects
of class "quadrattest"
. Such an object is produced by
quadrat.test
and represents the result of
a \chi^2
test for a spatial point pattern.
The quadrats are first plotted using plot.tess
.
Then in each quadrat, the observed and expected counts
and the Pearson residual are displayed as text using
text.default
.
Observed count is displayed at top left; expected count at top right;
and Pearson residual at bottom.
Value
NULL.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
quadrat.test
,
plot.tess
,
text.default
,
plot.quadratcount
Examples
plot(quadrat.test(swedishpines, 3))
Plot Result of Scan Test
Description
Computes or plots an image showing the likelihood ratio test statistic for the scan test, or the optimal circle radius.
Usage
## S3 method for class 'scan.test'
plot(x, ..., what=c("statistic", "radius"),
do.window = TRUE)
## S3 method for class 'scan.test'
as.im(X, ..., what=c("statistic", "radius"))
Arguments
x , X | Result of a scan test. An object of class "scan.test" produced by scan.test. |
... | Arguments passed to plot.im to control the appearance of the plot. |
what | Character string indicating whether to produce an image of the (profile) likelihood ratio test statistic (what="statistic", the default) or an image of the optimal circle radius (what="radius"). |
do.window |
Logical value indicating whether to plot the original window of the data as well. |
Details
These functions extract, and plot, the spatially-varying value of the likelihood ratio test statistic which forms the basis of the scan test.
If the test result X
was based on circles of
the same radius r
, then as.im(X)
is a pixel image
of the likelihood ratio test statistic as a function of the
position of the centre of the circle.
If the test result X
was based on circles of
several different radii r
, then as.im(X)
is a pixel image
of the profile (maximum value over all radii r
)
likelihood ratio test statistic as a function of the
position of the centre of the circle, and
as.im(X, what="radius")
is a pixel image giving
for each location u
the value of r
which maximised
the likelihood ratio test statistic at that location.
The plot
method plots the corresponding image.
Value
The value of as.im.scan.test
is a pixel image (object of
class "im"
). The value of plot.scan.test
is NULL
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Examples
online <- interactive()
Nsim <- if(online) 19 else 2
r <- if(online) seq(0.04, 0.1, by=0.01) else c(0.05, 0.1)
a <- scan.test(redwood, r=r, method="poisson", nsim=Nsim)
plot(a)
as.im(a)
plot(a, what="radius")
Plot a Spatially Sampled Function
Description
Plot a spatially sampled function object.
Usage
## S3 method for class 'ssf'
plot(x, ...,
how = c("smoothed", "nearest", "points"),
style = c("image", "contour", "imagecontour"),
sigma = NULL, contourargs=list())
## S3 method for class 'ssf'
image(x, ...)
## S3 method for class 'ssf'
contour(x, ..., main, sigma = NULL)
Arguments
x | Spatially sampled function (object of class "ssf"). |
... | Arguments passed to plot.im or plot.ppp, as appropriate, to control the plot. |
how | Character string determining whether to display the function values at the data points (how="points"), a kernel-smoothed interpolation (how="smoothed", the default), or a nearest-neighbour interpolation (how="nearest"). |
style |
Character string indicating whether to plot the smoothed function as a colour image, a contour map, or both. |
contourargs |
Arguments passed to |
sigma |
Smoothing bandwidth for smooth interpolation. |
main |
Optional main title for the plot. |
Details
These are methods for the generic
plot
,
image
and
contour
for the class "ssf"
.
An object of class "ssf"
represents a
function (real- or vector-valued) that has been
sampled at a finite set of points.
For plot.ssf
there are three types of display.
If how="points"
the exact function values
will be displayed as circles centred at the locations where they
were computed. If how="smoothed"
(the default) these
values will be kernel-smoothed using Smooth.ppp
and displayed as a pixel image.
If how="nearest"
the values will be interpolated
by nearest neighbour interpolation using nnmark
and displayed as a pixel image.
For image.ssf
and contour.ssf
the values are
kernel-smoothed before being displayed.
Value
NULL
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Baddeley, A. (2017) Local composite likelihood for spatial point processes. Spatial Statistics 22, 261–295.
Baddeley, A., Rubak, E. and Turner, R. (2015) Spatial Point Patterns: Methodology and Applications with R. Chapman and Hall/CRC Press.
See Also
Examples
a <- ssf(cells, nndist(cells, k=1:3))
plot(a, how="points")
plot(a, how="smoothed")
plot(a, how="nearest")
Plot a Studentised Permutation Test
Description
Plot the result of the studentised permutation test.
Usage
## S3 method for class 'studpermutest'
plot(x, fmla, ...,
lty = NULL, col = NULL, lwd = NULL,
lty.theo = NULL, col.theo = NULL, lwd.theo = NULL,
lwd.mean = if (meanonly) 1 else NULL,
lty.mean = lty, col.mean = col,
separately = FALSE, meanonly = FALSE,
main = if (meanonly) "group means" else NULL,
xlim = NULL, ylim = NULL, ylab = NULL,
legend = !add, legendpos = "topleft", lbox = FALSE, add = FALSE)
Arguments
x | An object of class "studpermutest". |
fmla | Plot formula used in plot.fv. |
... | Additional graphical arguments passed to plot.fv. |
lty , col , lwd |
Line type, colour, and line width of the curves plotting the summary function for each point pattern in the original data. Either a single value or a vector of length equal to the number of point patterns. |
lty.theo , col.theo , lwd.theo |
Line type, colour, and line width of the curve representing the theoretical value of the summary function. |
lty.mean , col.mean , lwd.mean | Line type, colour, and line width (as a multiple of lwd) of the curves showing the group means of the summary function. |
separately |
Logical value indicating whether to plot each group of data in a separate panel. |
meanonly |
Logical value indicating whether to plot only the group means of the summary function. |
main |
Character string giving a main title for the plot. |
xlim , ylim | Numeric vectors of length 2 giving the limits for the x and y axes of the plot. |
ylab | Character string or expression to be used for the label on the y axis. |
legend |
Logical value indicating whether to plot a legend explaining the meaning of each curve. |
legendpos | Position of legend. See plot.fv. |
lbox |
Logical value indicating whether to plot a box around the plot. |
add | Logical value indicating whether the plot should be added to the existing plot (add=TRUE) or whether a new plot should be created (add=FALSE, the default). |
Details
This is the plot
method for objects of class
"studpermutest"
which represent the result of a studentised
permutation test applied to several point patterns. The test is
performed by studpermu.test
.
The plot shows the summary functions for each point pattern, coloured according to group. Optionally it can show the different groups in separate plot panels, or show only the group means in a single panel.
Value
NULL.
Author(s)
Ute Hahn.
Modified for spatstat
by
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Examples
np <- if(interactive()) 99 else 19
testpyramidal <- studpermu.test(pyramidal, Neurons ~ group, nperm=np)
plot(testpyramidal)
plot(testpyramidal, meanonly=TRUE)
plot(testpyramidal, col.theo=8, lwd.theo=4, lty.theo=1)
plot(testpyramidal, . ~ pi * r^2)
op <- par(mfrow=c(1,3))
plot(testpyramidal, separately=TRUE)
plot(testpyramidal, separately=TRUE, col=2, lty=1, lwd.mean=2, col.mean=4)
par(op)
Pool Data
Description
Pool the data from several objects of the same class.
Usage
pool(...)
Arguments
... |
Objects of the same type. |
Details
The function pool
is generic. There are methods for several
classes, listed below.
pool
is used to combine the data from several objects of the same type,
and to compute statistics based on the combined dataset.
It may be used to pool the estimates obtained from replicated datasets.
It may also be used in high-performance computing applications,
when the objects ...
have been computed on different processors
or in different batch runs, and we wish to combine them.
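For example (a sketch of combining two envelopes that could have been computed in separate runs; see pool.envelope):
E1 <- envelope(cells, Kest, nsim=10, savefuns=TRUE)
E2 <- envelope(cells, Kest, nsim=10, savefuns=TRUE)
E <- pool(E1, E2)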
Value
An object of the same class as the arguments ...
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
See Also
pool.envelope
,
pool.fasp
,
pool.rat
,
pool.fv
Pool Data from a List of Objects
Description
Pool the data from the objects in a list.
Usage
## S3 method for class 'anylist'
pool(x, ...)
Arguments
x | A list, belonging to the class "anylist", containing objects of the same type. |
... | Optional additional objects which can be pooled with the elements of x. |
Details
The function pool
is generic. Its purpose is to combine
data from several objects of the same type (typically computed
from different datasets) into a common, pooled estimate.
The function pool.anylist is the method for the class "anylist". It is used when the objects to be pooled are given in a list x.
Each of the elements of the list x
, and each of the
subsequent arguments ...
if provided, must be an object of the same
class.
Value
An object of the same class as each of the entries in x
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Examples
Keach <- anylapply(waterstriders, Kest, ratio=TRUE, correction="iso")
K <- pool(Keach)
Pool Data from Several Envelopes
Description
Pool the simulation data from several simulation envelopes
(objects of class "envelope"
)
and compute a new envelope.
Usage
## S3 method for class 'envelope'
pool(..., savefuns=FALSE, savepatterns=FALSE)
Arguments
... | Objects of class "envelope". |
savefuns |
Logical flag indicating whether to save all the simulated function values. |
savepatterns |
Logical flag indicating whether to save all the simulated point patterns. |
Details
The function pool
is generic. This is the method for the
class "envelope"
of simulation envelopes. It is used to
combine the simulation data from several simulation envelopes
and to compute an envelope based on the combined data.
Each of the arguments ...
must be an object of class
"envelope"
. These envelopes must be compatible,
in that they are envelopes for the same function,
and were computed using the same options.
- In normal use, each envelope object will have been created by running the command envelope with the argument savefuns=TRUE. This ensures that each object contains the simulated data (summary function values for the simulated point patterns) that were used to construct the envelope. The simulated data are extracted from each object and combined. A new envelope is computed from the combined set of simulations.
- Alternatively, if each envelope object was created by running envelope with VARIANCE=TRUE, then the saved functions are not required. The sample means and sample variances from each envelope will be pooled. A new envelope is computed from the pooled mean and variance.
Warnings or errors will be issued if the envelope objects ...
appear to be incompatible. Apart from these basic checks,
the code is not smart enough to decide whether it is sensible
to pool the data.
To modify the envelope parameters or the type of envelope that is
computed, first pool the envelope data using pool.envelope
,
then use envelope.envelope
to modify the envelope
parameters.
Value
An object of class "envelope"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
See Also
envelope
,
envelope.envelope
,
pool
,
pool.fasp
Examples
E1 <- envelope(cells, Kest, nsim=10, savefuns=TRUE)
E2 <- envelope(cells, Kest, nsim=20, savefuns=TRUE)
pool(E1, E2)
V1 <- envelope(E1, VARIANCE=TRUE)
V2 <- envelope(E2, VARIANCE=TRUE)
pool(V1, V2)
Pool Data from Several Function Arrays
Description
Pool the simulation data from several function arrays
(objects of class "fasp"
)
and compute a new function array.
Usage
## S3 method for class 'fasp'
pool(...)
Arguments
... | Objects of class "fasp" representing function arrays. |
Details
The function pool
is generic. This is the method for the
class "fasp"
of function arrays. It is used to
combine the simulation data from several arrays of simulation envelopes
and to compute a new array of envelopes based on the combined data.
Each of the arguments ...
must be a function array
(object of class "fasp"
) containing simulation envelopes.
This is typically created by running the command
alltypes
with the arguments
envelope=TRUE
and savefuns=TRUE
.
This ensures that each object is an array of simulation envelopes,
and that each envelope contains the simulated data
(summary function values) that were used to construct the envelope.
The simulated data are extracted from each object and combined. A new array of envelopes is computed from the combined set of simulations.
Warnings or errors will be issued if the objects ...
appear to be incompatible. However, the code is not smart enough to
decide whether it is sensible to pool the data.
Value
An object of class "fasp"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
fasp
,
alltypes
,
pool.envelope
,
pool
Examples
A1 <- alltypes(amacrine,"K",nsim=9,envelope=TRUE,savefuns=TRUE)
A2 <- alltypes(amacrine,"K",nsim=10,envelope=TRUE,savefuns=TRUE)
pool(A1, A2)
Pool Several Functions
Description
Combine several summary functions into a single function.
Usage
## S3 method for class 'fv'
pool(..., weights=NULL, relabel=TRUE, variance=TRUE)
Arguments
... | Objects of class "fv". |
weights |
Optional numeric vector of weights for the functions. |
relabel |
Logical value indicating whether the columns of the resulting function should be labelled to show that they were obtained by pooling. |
variance |
Logical value indicating whether to compute the sample variance and related terms. |
Details
The function pool
is generic. This is the method for the
class "fv"
of summary functions. It is used to
combine several estimates of the same function into a single function.
Each of the arguments ...
must be an object of class
"fv"
. They must be compatible,
in that they are estimates of the same function,
and were computed using the same options.
The sample mean and sample variance of the corresponding estimates will be computed.
Value
An object of class "fv"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Examples
K <- lapply(waterstriders, Kest, correction="iso")
Kall <- pool(K[[1]], K[[2]], K[[3]])
Kall <- pool(as.anylist(K))
plot(Kall, cbind(pooliso, pooltheo) ~ r,
shade=c("loiso", "hiiso"),
main="Pooled K function of waterstriders")
Pool Several Quadrat Tests
Description
Pool several quadrat tests into a single quadrat test.
Usage
## S3 method for class 'quadrattest'
pool(..., df=NULL, df.est=NULL, nsim=1999,
Xname=NULL, CR=NULL)
Arguments
... | Any number of objects, each of which is a quadrat test (object of class "quadrattest"). |
df | Optional. Number of degrees of freedom of the test statistic. Relevant only for chi-squared tests. |
df.est | Optional. The number of fitted parameters, or the number of degrees of freedom lost by estimation of parameters. Relevant only for chi-squared tests. |
nsim | Number of simulations, for Monte Carlo test. |
Xname | Optional. Name of the original data. |
CR | Optional. Numeric value of the Cressie-Read exponent. See quadrat.test. |
Details
The function pool
is generic. This is the method for the
class "quadrattest"
.
An object of class "quadrattest"
represents a
\chi^2
test or Monte Carlo test
of goodness-of-fit for a point process model, based on quadrat counts.
Such objects are created by the command quadrat.test
.
Each of the arguments ...
must be an object of class
"quadrattest"
. They must all be the same type of test
(chi-squared test or Monte Carlo test, conditional or unconditional)
and must all have the same type of alternative hypothesis.
The test statistic of the pooled test is the Pearson X^2
statistic taken over all cells (quadrats) of all tests.
The p
value of the pooled test is then computed using
either a Monte Carlo test or a \chi^2
test.
For a pooled \chi^2 test, the number of degrees of freedom of the combined test is computed by adding the degrees of freedom of all the tests (equivalent to assuming the tests are independent) unless it is determined by the arguments df or df.est. The p value of the pooled test is then obtained by referring the pooled test statistic to the \chi^2 distribution with these degrees of freedom.
For a pooled Monte Carlo test, new simulations are performed
to determine the pooled Monte Carlo p
value.
Value
Another object of class "quadrattest"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
See Also
Examples
Y <- split(humberside)
test1 <- quadrat.test(Y[[1]])
test2 <- quadrat.test(Y[[2]])
pool(test1, test2, Xname="Humberside")
Pool Data from Several Ratio Objects
Description
Pool the data from several ratio objects
(objects of class "rat"
)
and compute a pooled estimate.
Usage
## S3 method for class 'rat'
pool(..., weights=NULL, relabel=TRUE, variance=TRUE)
Arguments
... | Objects of class "rat". |
weights |
Numeric vector of weights. |
relabel |
Logical value indicating whether the result should be relabelled to show that it was obtained by pooling. |
variance |
Logical value indicating whether to compute the sample variance and related terms. |
Details
The function pool
is generic. This is the method for the
class "rat"
of ratio objects. It is used to
combine several estimates of the same quantity
when each estimate is a ratio.
Each of the arguments ...
must be an object of class
"rat"
representing a ratio object (basically a
numerator and a denominator; see rat
).
We assume that these ratios are all estimates of the same quantity.
If the objects are called R_1, \ldots, R_n
and if R_i
has numerator Y_i
and
denominator X_i
, so that notionally
R_i = Y_i/X_i
, then the pooled estimate is the
ratio-of-sums estimator
R = \frac{\sum_i Y_i}{\sum_i X_i}.
The standard error of R
is computed using the delta method
as described in Baddeley et al. (1993)
or Cochran (1977, pp 154, 161).
If the argument weights
is given, it should be a numeric vector
of length equal to the number of objects to be pooled.
The pooled estimator is the ratio-of-sums estimator
R = \frac{\sum_i w_i Y_i}{\sum_i w_i X_i}
where w_i
is the i
th weight.
This calculation is currently implemented only for objects which also belong to the class "fv" (function value tables), where the necessary arithmetic can be performed.
For example, if Kest
is called with argument
ratio=TRUE
, the result is a suitable object (belonging to the classes
"rat"
and "fv"
).
Warnings or errors will be issued if the ratio objects ...
appear to be incompatible. However, the code is not smart enough to
decide whether it is sensible to pool the data.
Value
An object of the same class as the input.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Baddeley, A.J, Moyeed, R.A., Howard, C.V. and Boyde, A. (1993) Analysis of a three-dimensional point pattern with replication. Applied Statistics 42, 641–668.
Cochran, W.G. (1977) Sampling techniques, 3rd edition. New York: John Wiley and Sons.
See Also
Examples
K1 <- Kest(runifpoint(42), ratio=TRUE, correction="iso")
K2 <- Kest(runifpoint(42), ratio=TRUE, correction="iso")
K3 <- Kest(runifpoint(42), ratio=TRUE, correction="iso")
K <- pool(K1, K2, K3)
plot(K, pooliso ~ r, shade=c("hiiso", "loiso"))
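# a sketch with hypothetical unequal weights:
Kw <- pool(K1, K2, K3, weights=c(2, 1, 1))
plot(Kw, pooliso ~ r)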
Pointwise Statistics on an Envelope Object
Description
Compute pointwise statistics from the simulated function values in an envelope object.
Usage
ptwise.envelope(object,
stats=c("mean", "median", "bias",
"var", "sd", "se", "mse", "rmse",
"confint", "predint"),
...,
level=0.95, transform=NULL, theo=NULL)
bias.envelope(object, theo, CI=TRUE, level=0.95)
RMSE.envelope(object, theo)
Arguments
object | An object of class "envelope" containing simulated function values (generated by envelope with savefuns=TRUE). |
stats | Summary statistic(s) to be calculated. A character string or character vector (partially matched) selected from the options given, or a function in the R language. |
... | Arguments passed to the summary function, when stats is a function. |
level | Confidence level required for stats="confint" or stats="predint". A number between 0 and 1. |
transform | Optional expression specifying a transformation to be applied to the function values before the statistics are computed. |
theo | Function in the R language that evaluates the true (theoretically expected) value of the spatial summary function. This is required if stats includes "bias", "mse" or "rmse". |
CI | Logical value specifying whether to calculate confidence interval as well as bias. |
Details
These functions compute pointwise summary statistics
from n
spatial summary functions which were
obtained from n
simulated point patterns.
The object
should have been generated by the
function envelope
with the argument savefuns=TRUE
specified.
The function envelope
is normally used to
generate simulation envelopes for a particular spatial summary function,
such as the K
function, by simulating n
realisations
of Complete Spatial Randomness or another model.
However, when envelope
is called with
the argument savefuns=TRUE
, it returns all the individual
summary functions for the n
simulated point patterns.
These individual functions are extracted by ptwise.envelope
which then computes the desired summary statistics.
The argument stats
specifies the desired summary statistics.
It can be a character string, or vector of character strings,
containing any of the following (partially matched):
- mean: the pointwise sample mean of the functions
- median: the pointwise sample median of the functions
- bias: the pointwise bias of the functions
- var: the pointwise sample variance of the functions
- sd: the pointwise sample standard deviation of the functions
- se: the standard error of the pointwise sample mean
- mse: the pointwise mean squared error
- rmse: the pointwise root-mean-squared error
- confint: a confidence interval for the true mean
- predint: a prediction interval for the function value
For confint
or predint
the argument level
specifies the confidence level.
Alternatively, the argument stats
can be a user-specified
function in the R language, which computes the summary statistic.
It should accept a vector argument and return a single numerical value.
The result is an object of class "fv"
that can be plotted
directly. See the Examples for different styles of plot.
The functions bias.envelope
and RMSE.envelope
are
wrappers for ptwise.envelope
which calculate the bias
and root-mean-square error respectively.
Value
A function value table (object of class "fv"
) containing
some or all of the
following columns
r | Distance argument r |
mean | Pointwise sample mean |
median | Pointwise sample median |
bias | Pointwise estimated bias |
var | Pointwise sample variance |
sd | Pointwise sample standard deviation |
se | Pointwise standard error of pointwise mean |
mse | Pointwise estimated mean squared error |
rmse | Pointwise estimated root-mean-squared error |
loci , hici | Pointwise confidence interval for the true mean |
lopi , hipi | Pointwise prediction interval for the function value |
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk, Tilman Davies Tilman.Davies@otago.ac.nz and Martin Hazelton Martin.Hazelton@otago.ac.nz.
See Also
ISB.envelope
,
IV.envelope
,
ISE.envelope
,
MISE.envelope
.
Examples
E <- envelope(cells, Kest, nsim=20, savefuns=TRUE)
plot(ptwise.envelope(E, c("mean", "confint"), level=0.9))
plot(ptwise.envelope(E, max, na.rm=TRUE))
## statistics for L function
plot(ptwise.envelope(E, c("mean","confint"),
transform=quote(sqrt(./pi)), level=0.9))
## calculate pointwise bias and RMSE
## using the theoretical true value K(r) = pi * r^2
trueK <- function(r) { pi * r^2 }
plot(bias.envelope(E, theo=trueK))
plot(RMSE.envelope(E, theo=trueK))
Dispersion Test for Spatial Point Pattern Based on Quadrat Counts
Description
Performs a test of Complete Spatial Randomness for a given point pattern, based on quadrat counts. Alternatively performs a goodness-of-fit test of a fitted inhomogeneous Poisson model. By default performs chi-squared tests; can also perform Monte Carlo based tests.
Usage
quadrat.test(X, ...)
## S3 method for class 'ppp'
quadrat.test(X, nx=5, ny=nx,
alternative=c("two.sided", "regular", "clustered"),
method=c("Chisq", "MonteCarlo"),
conditional=TRUE, CR=1,
lambda=NULL, df.est=NULL,
...,
xbreaks=NULL, ybreaks=NULL, tess=NULL,
nsim=1999)
## S3 method for class 'quadratcount'
quadrat.test(X,
alternative=c("two.sided", "regular", "clustered"),
method=c("Chisq", "MonteCarlo"),
conditional=TRUE, CR=1,
lambda=NULL, df.est=NULL,
...,
nsim=1999)
Arguments
X | A point pattern (object of class "ppp") to be subjected to the goodness-of-fit test, or a quadrat count table (object of class "quadratcount"). |
nx , ny | Numbers of quadrats in the x and y directions. Incompatible with xbreaks and ybreaks. |
alternative | Character string (partially matched) specifying the alternative hypothesis. |
method | Character string (partially matched) specifying the test to use: either method="Chisq" for the chi-squared test (the default), or method="MonteCarlo" for a Monte Carlo test. |
conditional | Logical. Should the Monte Carlo test be conducted conditionally upon the observed number of points of the pattern? Ignored if method="Chisq". |
CR | Optional. Numerical value. The exponent for the Cressie-Read test statistic. See Details. |
lambda | Optional. Pixel image (object of class "im") or function giving the intensity of the Poisson null hypothesis, if this is not Complete Spatial Randomness. |
df.est | Optional. Advanced use only. The number of fitted parameters, or the number of degrees of freedom lost by estimation of parameters. |
... | Ignored. |
xbreaks | Optional. Numeric vector giving the x coordinates of the boundaries of the quadrats. Incompatible with nx. |
ybreaks | Optional. Numeric vector giving the y coordinates of the boundaries of the quadrats. Incompatible with ny. |
tess | Tessellation (object of class "tess") determining the quadrats. Incompatible with nx, ny, xbreaks and ybreaks. |
nsim | The number of simulated samples to generate when method="MonteCarlo". |
Details
These functions perform \chi^2
tests or Monte Carlo tests
of goodness-of-fit for a point process model, based on quadrat counts.
The function quadrat.test
is generic, with methods for
point patterns (class "ppp"
), split point patterns
(class "splitppp"
), point process models
(class "ppm"
or "slrm"
)
and quadrat count tables (class "quadratcount"
).
- If X is a point pattern, we test the null hypothesis that the data pattern is a realisation of Complete Spatial Randomness (the uniform Poisson point process). Marks in the point pattern are ignored. (If lambda is given then the null hypothesis is the Poisson process with intensity lambda.)
- If X is a split point pattern, then for each of the component point patterns (taken separately) we test the null hypotheses of Complete Spatial Randomness. See quadrat.test.splitppp for documentation.
- If X is a fitted point process model, then it should be a Poisson point process model. The data to which this model was fitted are extracted from the model object, and are treated as the data point pattern for the test. We test the null hypothesis that the data pattern is a realisation of the (inhomogeneous) Poisson point process specified by X.
In all cases, the window of observation is divided
into tiles, and the number of data points in each tile is
counted, as described in quadratcount
.
The quadrats are rectangular by default, or may be regions of arbitrary shape
specified by the argument tess
.
The expected number of points in each quadrat is also calculated,
as determined by CSR (in the first case) or by the fitted model
(in the second case).
Then the Pearson X^2 statistic
X^2 = \sum \frac{(\mbox{observed} - \mbox{expected})^2}{\mbox{expected}}
is computed.
If method="Chisq"
then a \chi^2
test of
goodness-of-fit is performed by comparing the test statistic
to the \chi^2
distribution
with m-k
degrees of freedom, where m
is the number of
quadrats and k
is the number of fitted parameters
(equal to 1 for quadrat.test.ppp
). The default is to
compute the two-sided p
-value, so that the test will
be declared significant if X^2
is either very large or very
small. One-sided p
-values can be obtained by specifying the
alternative
. An important requirement of the
\chi^2
test is that the expected counts in each quadrat
be greater than 5.
If method="MonteCarlo"
then a Monte Carlo test is performed,
obviating the need for all expected counts to be at least 5. In the
Monte Carlo test, nsim
random point patterns are generated
from the null hypothesis (either CSR or the fitted point process
model). The Pearson X^2
statistic is computed as above.
The p
-value is determined by comparing the X^2
statistic for the observed point pattern, with the values obtained
from the simulations. Again the default is to
compute the two-sided p
-value.
If conditional
is TRUE
then the simulated samples are
generated from the multinomial distribution with the number of “trials”
equal to the number of observed points and the vector of probabilities
equal to the expected counts divided by the sum of the expected counts.
Otherwise the simulated samples are independent Poisson counts, with
means equal to the expected counts.
If the argument CR is given, then instead of the Pearson X^2 statistic, the Cressie-Read (1984) power divergence test statistic
2nI = \frac{2}{CR(CR+1)} \sum_i X_i \left[ \left( \frac{X_i}{E_i} \right)^{CR} - 1 \right]
is computed, where X_i is the i th observed count and E_i is the corresponding expected count.
The value CR=1
gives the Pearson X^2
statistic;
CR=0
gives the likelihood ratio test statistic G^2
;
CR=-1/2
gives the Freeman-Tukey statistic T^2
;
CR=-1
gives the modified likelihood ratio test statistic GM^2
;
and CR=-2
gives Neyman's modified statistic NM^2
.
In all cases the asymptotic distribution of this test statistic is
the same \chi^2
distribution as above.
The return value is an object of class "htest"
.
Printing the object gives comprehensible output
about the outcome of the test.
The return value also belongs to the special class "quadrattest". Plotting the object will display the quadrats, annotated by their observed and expected counts and the Pearson residuals. See the examples.
Value
An object of class "htest"
. See chisq.test
for explanation.
The return value is also an object of the special class
"quadrattest"
, and there is a plot method for this class.
See the examples.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
References
Cressie, N. and Read, T.R.C. (1984) Multinomial goodness-of-fit tests. Journal of the Royal Statistical Society, Series B 46, 440–464.
See Also
quadrat.test.splitppp
,
quadratcount
,
quadrats
,
quadratresample
,
chisq.test
,
cdf.test
.
To test a Poisson point process model against a specific alternative,
use anova.ppm
.
Examples
quadrat.test(simdat)
quadrat.test(simdat, 4, 3)
quadrat.test(simdat, alternative="regular")
quadrat.test(simdat, alternative="clustered")
## Likelihood ratio test
quadrat.test(simdat, CR=0)
## Power divergence tests
quadrat.test(simdat, CR=-1)$p.value
quadrat.test(simdat, CR=-2)$p.value
# Using Monte Carlo p-values
quadrat.test(swedishpines) # Get warning, small expected values.
Nsim <- if(interactive()) 4999 else 9
quadrat.test(swedishpines, method="M", nsim=Nsim)
quadrat.test(swedishpines, method="M", nsim=Nsim, conditional=FALSE)
# quadrat counts
qS <- quadratcount(simdat, 4, 3)
quadrat.test(qS)
te <- quadrat.test(simdat, 4)
residuals(te) # Pearson residuals
plot(te)
plot(simdat, pch="+", cols="green", lwd=2)
plot(te, add=TRUE, col="red", cex=1.4, lty=2, lwd=3)
sublab <- eval(substitute(expression(p[chi^2]==z),
list(z=signif(te$p.value,3))))
title(sub=sublab, cex.sub=3)
# quadrats of irregular shape
B <- dirichlet(runifpoint(6, Window(simdat)))
qB <- quadrat.test(simdat, tess=B)
plot(simdat, main="quadrat.test(simdat, tess=B)", pch="+")
plot(qB, add=TRUE, col="red", lwd=2, cex=1.2)
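# a sketch of testing an inhomogeneous Poisson null hypothesis via 'lambda';
# here the intensity is estimated from the same data, so the degrees of
# freedom may need adjustment via 'df.est'
lam <- density(simdat)
quadrat.test(simdat, lambda=lam)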
Dispersion Test of CSR for Split Point Pattern Based on Quadrat Counts
Description
Performs a test of Complete Spatial Randomness for each of the component patterns in a split point pattern, based on quadrat counts. By default performs chi-squared tests; can also perform Monte Carlo based tests.
Usage
## S3 method for class 'splitppp'
quadrat.test(X, ..., df=NULL, df.est=NULL, Xname=NULL)
Arguments
X | A split point pattern (object of class "splitppp"). |
... | Arguments passed to quadrat.test.ppp. |
df , df.est , Xname | Arguments passed to pool.quadrattest. |
Details
The function quadrat.test
is generic, with methods for
point patterns (class "ppp"
), split point patterns
(class "splitppp"
) and point process models
(class "ppm"
).
If X
is a split point pattern, then for each of the
component point patterns (taken separately) we test
the null hypotheses of Complete Spatial Randomness,
then combine the result into a single test.
The method quadrat.test.ppp
is applied to each
component point pattern. Then the results are pooled using
pool.quadrattest
to obtain a single test.
Value
An object of class "quadrattest"
which can be printed and
plotted.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
quadrat.test
,
quadratcount
,
quadrats
,
quadratresample
,
chisq.test
,
cdf.test
.
To test a Poisson point process model against a specific Poisson alternative,
use anova.ppm
.
Examples
qH <- quadrat.test(split(humberside), 2, 3)
plot(qH)
qH
Radial Cumulative Integral
Description
Compute the cumulative integral of an image over increasing radial distances from the origin.
Usage
radcumint(X, ..., origin, Xname, result = c("fv", "im"))
Arguments
X | A pixel image (object of class "im") with numeric values. |
... | Ignored. |
origin | Optional. Origin about which the rotations should be performed. Either a numeric vector or a character string as described in the help for shift.owin. |
Xname | Optional name for X, to be used in the function labels. |
result |
Character string specifying the kind of result required: either a function object or a pixel image. |
Details
This command computes, for each possible distance r
,
the integral of the pixel values lying inside the disc of radius
r
centred at the origin.
If result="fv"
(the default) the result is a function
object f
of class "fv"
. For each value of radius r
,
the function value f(r)
is the integral of X
over the disc of radius r
.
If result="im"
the result is a pixel image, with the same
dimensions as X
. At a given pixel, the result is
equal to f(r)
where r
is the distance from the given
pixel to the origin. That is, at any given pixel, the resulting value
is the integral of X
over the disc
centred at the origin whose boundary passes through the given pixel.
Value
An object of class "fv"
or "im"
,
with the same coordinate units as X
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Examples
D <- density(redwood)
plot(radcumint(D))
plot(radcumint(D, result="im"))
Ratio object
Description
Stores the numerator, denominator, and value of a ratio as a single object.
Usage
rat(ratio, numerator, denominator, check = TRUE)
Arguments
ratio , numerator , denominator |
Three objects belonging to the same class. |
check | Logical. Whether to check that the objects are compatible (according to compatible). |
Details
The class "rat"
is a simple mechanism for keeping track of
the numerator and denominator when calculating a ratio. Its main
purpose is simply to signal that the object is a ratio.
The function rat
creates an object of class "rat"
given the numerator, the denominator and the ratio.
No calculation is performed;
the three objects are simply stored together.
The arguments ratio
, numerator
, denominator
can be objects of any kind. They should belong to the same class.
It is assumed that the relationship
\mbox{ratio} = \frac{\mbox{numerator}}{\mbox{denominator}}
holds in some version of arithmetic. However, no calculation is performed.
By default the algorithm checks
whether the three arguments ratio
, numerator
,
denominator
are compatible objects, according to
compatible
.
The result is equivalent to ratio
except for the
addition of extra information.
Value
An object equivalent to the object ratio
except that it also belongs to the class "rat"
and has additional attributes numerator
and denominator
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net.
See Also
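Examples
## a sketch: Kest(..., ratio=TRUE) creates an object of class "rat";
## here we reassemble one by hand, skipping the compatibility check
K <- Kest(cells, ratio=TRUE)
num <- attr(K, "numerator")
den <- attr(K, "denominator")
R <- rat(K, num, den, check=FALSE)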
Contact Distribution Function using Rectangular Structuring Element
Description
Computes an estimate of the contact distribution function of a set, using a rectangular structuring element.
Usage
rectcontact(X, ..., asp = 1, npasses=4,
eps = NULL, r = NULL, breaks = NULL, correction = c("rs", "km"))
Arguments
X | Logical-valued image. The TRUE values in the image form the set whose contact distribution function should be estimated. |
... | Ignored. |
asp | Aspect ratio for the rectangular metric. A single positive number. See rectdistmap. |
npasses | Number of passes to perform in the distance algorithm. A positive integer. See rectdistmap. |
eps |
Pixel size, if the image should be converted to a finer grid. |
r |
Optional vector of distance values. Do Not Use This. |
breaks |
Do Not Use This. |
correction |
Character vector specifying the edge correction. |
Details
To be written.
Value
Object of class "fv"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
See Also
Examples
## make an image which is TRUE/FALSE inside/outside the letter R
V <- letterR
Frame(V) <- grow.rectangle(Frame(V), 0.5)
Z <- as.im(V, value=TRUE, na.replace=FALSE)
## analyse
plot(rectcontact(Z))
Perform Computations or Retrieve Results From File
Description
This utility either performs computations and saves the results in a file, or retrieves the results of previous computations stored in a file. If the designated file does not yet exist, the expression will be evaluated, and the results will be saved in the file. If the file already exists, the results will be re-loaded from the file.
Usage
reload.or.compute(filename, expr, objects = NULL,
context = parent.frame(),
destination = parent.frame(),
force=FALSE, verbose=TRUE, exclude=NULL)
Arguments
filename |
Name of data file. A character string. |
expr |
R language expression to be evaluated. |
objects |
Optional character vector of names of objects to be saved
in |
exclude |
Optional character vector of names of objects
that should not be saved in |
context |
Environment containing objects that are mentioned in |
destination |
Environment into which the resulting objects should be assigned. |
force |
Logical value indicating whether to perform the computation in any case. |
verbose |
Logical value indicating whether to print a message indicating whether the data were recomputed or reloaded from the file. |
Details
This facility is useful for saving, and later re-loading, the results of
time-consuming computations. It would typically be
used in an R script file or an Sweave
document.
If the file called filename
does not yet exist
(or if force=TRUE
),
then expr
will be evaluated
and the results will be saved in filename
using save
.
By default, all objects that were created by evaluating the expression
will be saved in the file.
The optional argument objects
specifies which results should be saved
to the file. The optional argument exclude
specifies results which should not be saved.
If the file called filename
already exists
(and if force=FALSE
, the default), then this file
will be loaded into R using load
.
The optional argument objects
specifies the names
of objects that must be present in the file; a warning is issued
if any of them are missing.
The resulting objects (either evaluated or loaded from file)
can be assigned into any desired
destination
environment.
The default behaviour is equivalent to evaluating expr
in the current environment.
If force=TRUE
then expr
will be evaluated
(regardless of whether the file already exists or not)
and the results will be saved in filename
, overwriting
any previously-existing file with that name. This is a convenient
way to force the code to re-compute everything
in an R script file or Sweave
document.
Value
Character vector (invisible) giving the names of the objects computed or loaded.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net.
Examples
## Demonstration using a temporary file
## (For real applications, use a permanent file in your own filespace)
myfile <- paste0(tempdir(), .Platform$file.sep, "mydata.rda")
reload.or.compute(myfile, {
# some very long computation ending with ..
x <- 42
intermediateWorking <- 12345
y <- sqrt(x)
}, exclude="intermediateWorking")
## the values x and y are saved
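## sketch, not from the original example: setting force=TRUE would
## re-evaluate the expression and overwrite the previously saved file
reload.or.compute(myfile, {
  x <- 42
  y <- sqrt(x)
}, force=TRUE)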
Estimate of Spatially-Varying Relative Risk
Description
Generic command to estimate the spatially-varying probability of each type of point, or the ratios of such probabilities.
Usage
relrisk(X, ...)
Arguments
X |
Either a point pattern (class |
... |
Additional arguments appropriate to the method. |
Details
In a point pattern containing several different types of points, we may be interested in the spatially-varying probability of each possible type, or the relative risks which are the ratios of such probabilities.
The command relrisk
is generic and can be used to
estimate relative risk in different ways.
The function relrisk.ppp
is the method for point pattern
datasets. It computes nonparametric estimates of relative risk
by kernel smoothing.
The function relrisk.ppm
is the method for fitted point
process models (class "ppm"
). It computes parametric
estimates of relative risk, using the fitted model.
Value
A pixel image, or a list of pixel images, or a numeric vector or matrix, containing the requested estimates of relative risk.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
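Examples
## minimal sketch, not from the original help page: dispatch to the
## point pattern method; amacrine is a bivariate pattern in spatstat.data
plot(relrisk(amacrine))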
Nonparametric Estimate of Spatially-Varying Relative Risk
Description
Given a multitype point pattern, this function estimates the spatially-varying probability of each type of point, or the ratios of such probabilities, using kernel smoothing. The default smoothing bandwidth is selected by cross-validation.
Usage
## S3 method for class 'ppp'
relrisk(X, sigma = NULL, ...,
at = c("pixels", "points"),
weights = NULL, varcov = NULL,
relative=FALSE, normalise=FALSE,
adjust=1, edge=TRUE, diggle=FALSE,
se=FALSE, wtype=c("value", "multiplicity"),
casecontrol=TRUE, control=1, case, shrink=0, fudge=0)
Arguments
X |
A multitype point pattern (object of class |
sigma |
Optional. The numeric value of the smoothing bandwidth
(the standard deviation of isotropic
Gaussian smoothing kernel).
Alternatively |
... |
Arguments passed to |
at |
Character string specifying whether to compute the probability values
at a grid of pixel locations ( |
weights |
Optional. Weights for the data points of |
varcov |
Optional. Variance-covariance matrix of anisotropic Gaussian
smoothing kernel. Incompatible with |
relative |
Logical.
If |
normalise |
Logical value specifying whether the results should be normalised so that constant risk corresponds to the value 1. |
adjust |
Optional. Adjustment factor for the bandwidth |
edge |
Logical value indicating whether to apply edge correction. |
diggle |
Logical. If |
se |
Logical value indicating whether to compute standard errors as well. |
wtype |
Character string (partially matched) specifying how the weights should be interpreted for the calculation of standard error. See Details. |
casecontrol |
Logical. Whether to treat a bivariate point pattern as consisting of cases and controls, and return only the probability or relative risk of a case. Ignored if there are more than 2 types of points. See Details. |
control |
Integer, or character string, identifying which mark value corresponds to a control. |
case |
Integer, or character string, identifying which mark value
corresponds to a case (rather than a control)
in a bivariate point pattern.
This is an alternative to the argument |
shrink , fudge |
Optional factors for shrinkage estimation as proposed by Bithell (1991). Numeric values, or numeric vectors with one entry for each type of point. See Details. |
Details
The command relrisk
is generic and can be used to
estimate relative risk in different ways.
This function relrisk.ppp
is the method for point pattern
datasets. It computes nonparametric estimates of relative risk
by kernel smoothing (Bithell, 1990, 1991; Diggle, 2003; Baddeley,
Rubak and Turner, 2015).
If X
is a bivariate point pattern
(a multitype point pattern consisting of two types of points)
then by default,
the points of the first type (the first level of marks(X)
)
are treated as controls or non-events, and points of the second type
are treated as cases or events. Then by default this command computes
the spatially-varying probability of a case,
i.e. the probability p(u)
that a point at spatial location u
will be a case. If relative=TRUE
, it computes the
spatially-varying relative risk of a case relative to a
control, r(u) = p(u)/(1- p(u))
.
If X
is a multitype point pattern with m > 2
types,
or if X
is a bivariate point pattern
and casecontrol=FALSE
,
then by default this command computes, for each type j
,
a nonparametric estimate of
the spatially-varying probability of an event of type j
.
This is the probability p_j(u)
that a point at spatial location u
will belong to type j
.
If relative=TRUE
, the command computes the
relative risk of an event of type j
relative to a control,
r_j(u) = p_j(u)/p_k(u)
,
where events of type k
are treated as controls.
The argument control
determines which type k
is treated as a control.
If at = "pixels"
the calculation is performed for
every spatial location u
on a fine pixel grid, and the result
is a pixel image representing the function p(u)
or a list of pixel images representing the functions
p_j(u)
or r_j(u)
for j = 1,\ldots,m
.
An infinite value of relative risk (arising because the
probability of a control is zero) will be returned as NA
.
If at = "points"
the calculation is performed
only at the data points x_i
. By default
the result is a vector of values
p(x_i)
giving the estimated probability of a case
at each data point, or a matrix of values
p_j(x_i)
giving the estimated probability of
each possible type j
at each data point.
If relative=TRUE
then the relative risks
r(x_i)
or r_j(x_i)
are
returned.
An infinite value of relative risk (arising because the
probability of a control is zero) will be returned as Inf
.
Estimation is performed by a simple Nadaraya-Watson type kernel smoother (Bithell, 1990, 1991; Diggle, 2003; Baddeley, Rubak and Turner, 2015, section 14.4). The smoothing bandwidth can be specified in any of the following ways (a brief sketch follows the list):
-
sigma
is a single numeric value, giving the standard deviation of the isotropic Gaussian kernel. -
sigma
is a numeric vector of length 2, giving the standard deviations in thex
andy
directions of a Gaussian kernel. -
varcov
is a 2 by 2 matrix giving the variance-covariance matrix of the Gaussian kernel. -
sigma
is afunction
which selects the bandwidth. Bandwidth selection will be applied separately to each type of point. An example of such a function isbw.diggle
. -
sigma
andvarcov
are both missing or null. Then a common smoothing bandwidthsigma
will be selected by cross-validation usingbw.relrisk
. -
An infinite smoothing bandwidth,
sigma=Inf
, is permitted and yields a constant estimate of relative risk.
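As an illustrative sketch (not part of the original text), these specifications could be invoked as follows, using the urkiola data from spatstat.data:
relrisk(urkiola, sigma=20)                  ## fixed isotropic bandwidth
relrisk(urkiola, varcov=diag(c(20, 40)^2))  ## anisotropic Gaussian kernel
relrisk(urkiola, sigma=bw.diggle)           ## selector applied to each type
relrisk(urkiola)                            ## common bandwidth by cross-validation
relrisk(urkiola, sigma=Inf)                 ## constant estimate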
If se=TRUE
then standard errors will also be computed,
based on asymptotic theory, assuming a Poisson process.
The optional argument weights
may provide numerical weights
for the points of X
. It should be a numeric vector of length
equal to npoints(X)
.
The argument weights
can also be an expression
.
It will be evaluated in the data frame as.data.frame(X)
to obtain a vector of weights. The expression may involve
the symbols x
and y
representing the Cartesian
coordinates, and the symbol marks
representing the mark values.
The argument weights
can also be a pixel image
(object of class "im"
). Numerical weights for the data points
will be extracted from this image (by looking up the pixel values
at the locations of the data points in X
).
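For example (an illustrative sketch with hypothetical weights, not from the original text):
relrisk(urkiola, 20, weights=runif(npoints(urkiola)))  ## numeric vector
relrisk(urkiola, 20, weights=expression(x/max(x)))     ## expression in coordinates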
Value
If se=FALSE
(the default), the format is described below.
If se=TRUE
, the result is a list of two entries,
estimate
and SE
, each having the format described below.
If X
consists of only two types of points,
and if casecontrol=TRUE
,
the result is a pixel image (if at="pixels"
)
or a vector (if at="points"
).
The pixel values or vector values
are the probabilities of a case if relative=FALSE
,
or the relative risk of a case (probability of a case divided by the
probability of a control) if relative=TRUE
.
If X
consists of more than two types of points,
or if casecontrol=FALSE
, the result is:
-
(if
at="pixels"
) a list of pixel images, with one image for each possible type of point. The result also belongs to the class"solist"
so that it can be printed and plotted.-
(if
at="points"
) a matrix of probabilities, with rows corresponding to data pointsx_i
, and columns corresponding to typesj
.
The pixel values or matrix entries
are the probabilities of each type of point if relative=FALSE
,
or the relative risk of each type (probability of each type divided by the
probability of a control) if relative=TRUE
.
If relative=FALSE
, the resulting values always lie between 0
and 1. If relative=TRUE
, the results are either non-negative
numbers, or the values Inf
or NA
.
Shrinkage estimate
If the argument shrink
is given, the shrinkage
estimate proposed by Bithell (1991) is calculated.
In this method, a constant value is
added to the estimated intensity of the points of each type, before
the relative risk calculation is performed.
The argument shrink
should be a numeric value, or a numeric vector
with one entry for each type of point. All values should be
non-negative.
The constant added to the estimated intensity of the points of type
j
is shrink * K0 * pbar[j]
if
relative=FALSE
and shrink * K0
if relative=TRUE
,
where K0
is the value of the smoothing kernel
at the origin, and pbar[j]
is the fraction of points of type
j
in the data pattern X
.
Bithell's original proposal corresponds to
shrink=4, relative=TRUE
.
The argument fudge
is rarely used,
but is retained for research purposes.
It is added to the estimate of the intensity without any rescaling,
before the relative risk calculation.
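As a sketch (not part of the original text), Bithell's original proposal could be requested by:
relrisk(urkiola, 20, relative=TRUE, shrink=4)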
Standard error
If se=TRUE
, the standard error of the estimate will also be
calculated. The calculation assumes a Poisson point process.
If weights
are given, then the calculation of standard error
depends on the interpretation of the weights. This is controlled by
the argument wtype
.
-
If
wtype="value"
(the default), the weights are interpreted as numerical values observed at the data locations. Roughly speaking, standard errors are proportional to the absolute values of the weights. -
If
wtype="multiplicity"
the weights are interpreted as multiplicities so that a weight of 2 is equivalent to having a pair of duplicated points at the data location. Roughly speaking, standard errors are proportional to the square roots of the weights. Negative weights are not permitted.
The default rule is now wtype="value"
but previous versions
of relrisk.ppp
(in spatstat.explore versions
3.1-0
and earlier) effectively used wtype="multiplicity"
.
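For example (an illustrative sketch with hypothetical multiplicities):
w <- sample(1:3, npoints(urkiola), replace=TRUE)
relrisk(urkiola, 20, weights=w, wtype="multiplicity", se=TRUE)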
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net.
References
Baddeley, A., Rubak, E. and Turner, R. (2015) Spatial Point Patterns: Methodology and Applications with R. Chapman and Hall/CRC Press.
Bithell, J.F. (1990) An application of density estimation to geographical epidemiology. Statistics in Medicine 9, 691–701.
Bithell, J.F. (1991) Estimation of relative risk functions. Statistics in Medicine 10, 1745–1751.
Diggle, P.J. (2003) Statistical analysis of spatial point patterns, Second edition. Arnold.
Diggle, P.J., Zheng, P. and Durr, P. (2005) Non-parametric estimation of spatial segregation in a multivariate point process: bovine tuberculosis in Cornwall, UK. Applied Statistics 54, 645–658.
See Also
There is another method relrisk.ppm
for point process
models which computes parametric
estimates of relative risk, using the fitted model.
See also
bw.relrisk
,
density.ppp
,
Smooth.ppp
,
eval.im
Examples
p.oak <- relrisk(urkiola, 20)
if(interactive()) {
plot(p.oak, main="proportion of oak")
plot(eval.im(p.oak > 0.3), main="More than 30 percent oak")
plot(split(lansing), main="Lansing Woods")
p.lan <- relrisk(lansing, 0.05, se=TRUE)
plot(p.lan$estimate, main="Lansing Woods species probability")
plot(p.lan$SE, main="Lansing Woods standard error")
wh <- im.apply(p.lan$estimate, which.max)
types <- levels(marks(lansing))
wh <- eval.im(types[wh])
plot(wh, main="Most common species")
}
Diffusion Estimate of Conditional Probabilities
Description
Computes the conditional probability estimator of relative risk based on a multitype point pattern using the diffusion estimate of the type-specific intensities.
Usage
relriskHeat(X, ...)
## S3 method for class 'ppp'
relriskHeat(X, ..., sigmaX=NULL, weights=NULL)
Arguments
X |
A multitype point pattern (object of class |
... |
Arguments passed to |
sigmaX |
Optional.
Numeric vector of bandwidths, one associated with each data point in
|
weights |
Optional numeric vector of weights associated with each point of
|
Details
The function relriskHeat
is generic. This file documents the
method relriskHeat.ppp
for spatial point patterns (objects of
class "ppp"
).
This function estimates the spatially-varying conditional probability that a random point (given that it is present) will belong to a given type.
The algorithm separates X
into
the sub-patterns consisting of points of each type.
It then applies densityHeat
to each sub-pattern,
using the same bandwidth and smoothing regimen for each sub-pattern,
as specified by the arguments ...
.
If weights
is specified, it should be a numeric vector
of length equal to the number of points in X
, so that
weights[i]
is the weight for data point X[i]
.
Similarly when performing lagged-arrival smoothing,
the argument sigmaX
must be a numeric vector of the same length
as the number of points in X
, and thus contain the
point-specific bandwidths in the order corresponding to each of these
points regardless of mark.
Value
A named list (of class solist
)
containing pixel images (objects of class "im"),
giving the estimated conditional probability surfaces for each type.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Tilman Davies Tilman.Davies@otago.ac.nz.
References
Agarwal, N. and Aluru, N.R. (2010) A data-driven stochastic collocation approach for uncertainty quantification in MEMS. International Journal for Numerical Methods in Engineering 83, 575–597.
Baddeley, A., Davies, T., Rakshit, S., Nair, G. and McSwiggan, G. (2022) Diffusion smoothing for spatial point patterns. Statistical Science 37, 123–142.
Barry, R.P. and McIntyre, J. (2011) Estimating animal densities and home range in regions with irregular boundaries and holes: a lattice-based alternative to the kernel density estimator. Ecological Modelling 222, 1666–1672.
Botev, Z.I. and Grotowski, J.F. and Kroese, D.P. (2010) Kernel density estimation via diffusion. Annals of Statistics 38, 2916–2957.
See Also
relrisk.ppp
for the
traditional convolution-based kernel estimator of
conditional probability surfaces,
and the function risk
in the sparr package for the
density-ratio-based estimator.
Examples
## bovine tuberculosis data
X <- subset(btb, select=spoligotype)
plot(X)
P <- relriskHeat(X, sigma=9)
plot(P)
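## further sketch, not from the original example: point-specific
## bandwidths (hypothetical values), one for each point of X
sigX <- rep(9, npoints(X))
plot(relriskHeat(X, sigmaX=sigX))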
Smoothed Relative Density of Pairs of Covariate Values
Description
Given a point pattern and two spatial covariates Z_1
and
Z_2
, construct a smooth estimate of the relative risk of
the pair (Z_1,Z_2)
.
Usage
rho2hat(object, cov1, cov2, ..., method=c("ratio", "reweight"))
Arguments
object |
A point pattern (object of class |
cov1 , cov2 |
The two covariates.
Each argument is either a |
... |
Additional arguments passed to |
method |
Character string determining the smoothing method. See Details. |
Details
This is a bivariate version of rhohat
.
If object
is a point pattern, this command
produces a smoothed version of the scatterplot of
the values of the covariates cov1
and cov2
observed at the points of the point pattern.
The covariates cov1,cov2
must have continuous values.
If object
is a fitted point process model, suppose X
is
the original data point pattern to which the model was fitted. Then
this command assumes X
is a realisation of a Poisson point
process with intensity function of the form
\lambda(u) = \rho(Z_1(u), Z_2(u)) \kappa(u)
where \kappa(u)
is the intensity of the fitted model
object
, and \rho(z_1,z_2)
is a function
to be estimated. The algorithm computes a smooth estimate of the
function \rho
.
The method
determines how the density estimates will be
combined to obtain an estimate of \rho(z_1, z_2)
:
-
If
method="ratio"
, then\rho(z_1, z_2)
is estimated by the ratio of two density estimates. The numerator is a (rescaled) density estimate obtained by smoothing the points(Z_1(y_i), Z_2(y_i))
obtained by evaluating the two covariates Z_1, Z_2
at the data pointsy_i
. The denominator is a density estimate of the reference distribution of(Z_1,Z_2)
. -
If
method="reweight"
, then\rho(z_1, z_2)
is estimated by applying density estimation to the points(Z_1(y_i), Z_2(y_i))
obtained by evaluating the two covariates Z_1, Z_2
at the data pointsy_i
, with weights inversely proportional to the reference density of(Z_1,Z_2)
.
Value
A pixel image (object of class "im"
). Also
belongs to the special class "rho2hat"
which has a plot method.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
References
Baddeley, A., Chang, Y.-M., Song, Y. and Turner, R. (2012) Nonparametric estimation of the dependence of a point process on spatial covariates. Statistics and Its Interface 5 (2), 221–236.
See Also
Examples
attach(bei.extra)
plot(rho2hat(bei, elev, grad))
if(require("spatstat.model")) {
fit <- ppm(bei ~elev, covariates=bei.extra)
plot(rho2hat(fit, elev, grad))
plot(rho2hat(fit, elev, grad, method="reweight"))
}
Nonparametric Estimate of Intensity as Function of a Covariate
Description
Computes a nonparametric estimate of the intensity of a point process, as a function of a (continuous) spatial covariate.
Usage
rhohat(object, covariate, ...)
## S3 method for class 'ppp'
rhohat(object, covariate, ...,
baseline=NULL, weights=NULL,
method=c("ratio", "reweight", "transform"),
horvitz=FALSE,
smoother=c("kernel", "local", "decreasing", "increasing",
"mountain", "valley", "piecewise"),
subset=NULL,
do.CI=TRUE,
jitter=TRUE, jitterfactor=1, interpolate=TRUE,
dimyx=NULL, eps=NULL,
rule.eps = c("adjust.eps", "grow.frame", "shrink.frame"),
n = 512, bw = "nrd0", adjust=1, from = NULL, to = NULL,
bwref=bw,
covname, confidence=0.95, positiveCI, breaks=NULL)
## S3 method for class 'quad'
rhohat(object, covariate, ...,
baseline=NULL, weights=NULL,
method=c("ratio", "reweight", "transform"),
horvitz=FALSE,
smoother=c("kernel", "local", "decreasing", "increasing",
"mountain", "valley", "piecewise"),
subset=NULL,
do.CI=TRUE,
jitter=TRUE, jitterfactor=1, interpolate=TRUE,
dimyx=NULL, eps=NULL,
rule.eps = c("adjust.eps", "grow.frame", "shrink.frame"),
n = 512, bw = "nrd0", adjust=1, from = NULL, to = NULL,
bwref=bw,
covname, confidence=0.95, positiveCI, breaks=NULL)
Arguments
object |
A point pattern (object of class |
covariate |
Either a |
weights |
Optional weights attached to the data points.
Either a numeric vector of weights for each data point,
or a pixel image (object of class |
baseline |
Optional baseline for intensity function.
A |
method |
Character string determining the estimation method. See Details. |
horvitz |
Logical value indicating whether to use Horvitz-Thompson weights. See Details. |
smoother |
Character string determining the smoothing algorithm and the type of curve that will be estimated. See Details. |
subset |
Optional. A spatial window (object of class |
do.CI |
Logical value specifying whether to calculate standard errors and confidence bands. |
jitter |
Logical value. If |
jitterfactor |
Numeric value controlling the scale of noise added to the
covariate values at the data points when |
interpolate |
Logical value specifying whether to use spatial interpolation
to obtain the values of the covariate at the data points,
when the covariate is a pixel image
(object of class |
dimyx , eps , rule.eps |
Arguments controlling the pixel resolution at which the covariate will be evaluated. See Details. |
bw |
Smoothing bandwidth or bandwidth rule
(passed to |
adjust |
Smoothing bandwidth adjustment factor
(passed to |
n , from , to |
Arguments passed to |
bwref |
Optional. An alternative value of |
... |
Additional arguments passed to |
covname |
Optional. Character string to use as the name of the covariate. |
confidence |
Confidence level for confidence intervals. A number between 0 and 1. |
positiveCI |
Logical value.
If |
breaks |
Breakpoints for the piecewise-constant function
computed when |
Details
This command estimates the relationship between point process intensity and a given spatial covariate. Such a relationship is sometimes called a resource selection function (if the points are organisms and the covariate is a descriptor of habitat) or a prospectivity index (if the points are mineral deposits and the covariate is a geological variable). This command uses nonparametric methods which do not assume a particular form for the relationship.
If object
is a point pattern, and baseline
is missing or
null, this command assumes that object
is a realisation of a
point process with intensity function
\lambda(u)
of the form
\lambda(u) = \rho(Z(u))
where Z
is the spatial
covariate function given by covariate
, and
\rho(z)
is the resource selection function
or prospectivity index.
A nonparametric estimator of the function \rho(z)
is computed.
If object
is a point pattern, and baseline
is given,
then the intensity function is assumed to be
\lambda(u) = \rho(Z(u)) B(u)
where B(u)
is the baseline intensity at location u
.
A nonparametric estimator of the relative intensity \rho(z)
is computed.
If object
is a fitted point process model, suppose X
is
the original data point pattern to which the model was fitted. Then
this command assumes X
is a realisation of a Poisson point
process with intensity function of the form
\lambda(u) = \rho(Z(u)) \kappa(u)
where \kappa(u)
is the intensity of the fitted model
object
. A nonparametric estimator of
the relative intensity \rho(z)
is computed.
The nonparametric estimation procedure is controlled by the
arguments smoother
, method
and horvitz
.
The argument smoother
selects the type of estimation technique.
-
If
smoother="kernel"
(the default), the nonparametric estimator is a kernel smoothing estimator of\rho(z)
(Guan, 2008; Baddeley et al, 2012). The estimated function\rho(z)
will be a smooth function ofz
which takes nonnegative values. Ifdo.CI=TRUE
(the default), confidence bands are also computed, assuming a Poisson point process. See the section on Smooth estimates. -
If
smoother="local"
, the nonparametric estimator is a local regression estimator of\rho(z)
(Baddeley et al, 2012) obtained using local likelihood. The estimated function\rho(z)
will be a smooth function ofz
. Ifdo.CI=TRUE
(the default), confidence bands are also computed, assuming a Poisson point process. See the section on Smooth estimates. -
If
smoother="increasing"
, we assume that\rho(z)
is an increasing function ofz
, and use the nonparametric maximum likelihood estimator of\rho(z)
described by Sager (1982). The estimated function will be a step function, that is increasing as a function ofz
. Confidence bands are not computed. See the section on Monotone estimates. -
If
smoother="decreasing"
, we assume that\rho(z)
is a decreasing function ofz
, and use the nonparametric maximum likelihood estimator of\rho(z)
described by Sager (1982). The estimated function will be a step function, that is decreasing as a function ofz
. Confidence bands are not computed. See the section on Monotone estimates. -
If
smoother="mountain"
, we assume that\rho(z)
is a function with an inverted U shape, with a single peak at a valuez_0
, so that\rho(z)
is an increasing function ofz
forz < z_0
and a decreasing function ofz
forz > z_0
. We compute the nonparametric maximum likelihood estimator. The estimated function will be a step function, which is increasing and then decreasing as a function ofz
. Confidence bands are not computed. See the section on Unimodal estimates. -
If
smoother="valley"
, we assume that\rho(z)
is a function with a U shape, with a single minimum at a valuez_0
, so that\rho(z)
is a decreasing function ofz
forz < z_0
and an increasing function ofz
forz > z_0
. We compute the nonparametric maximum likelihood estimator. The estimated function will be a step function, which is decreasing and then increasing as a function ofz
. Confidence bands are not computed. See the section on Unimodal estimates. -
If
smoother="piecewise"
, the estimate of\rho(z)
is piecewise constant. The range of covariate values is divided into several intervals (ranges or bands). The endpoints of these intervals are the breakpoints, which may be specified by the argumentbreaks
; there is a sensible default. The estimate of\rho(z)
takes a constant value on each interval. The estimate of\rho(z)
in each interval of covariate values is simply the average intensity (number of points per unit area) in the relevant sub-region. Ifdo.CI=TRUE
(the default), confidence bands are computed assuming a Poisson process.
See Baddeley (2018) for a comparison of these estimation techniques
(except for "mountain"
and "valley"
).
If the argument weights
is present, then the contribution
from each data point X[i]
to the estimate of \rho
is
multiplied by weights[i]
.
If the argument subset
is present, then the calculations are
performed using only the data inside this spatial region.
This technique assumes that covariate
has continuous values.
It is not applicable to covariates with categorical (factor) values
or discrete values such as small integers.
For a categorical covariate, use
intensity.quadratcount
applied to the result of
quadratcount(X, tess=covariate)
.
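For instance (an illustrative sketch, not from the original text, using a hypothetical pattern P and a covariate cut into bands):
P <- rpoispp(100)
Zband <- cut(as.im(function(x,y) x, W=Window(P)), breaks=4)
intensity(quadratcount(P, tess=tess(image=Zband)))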
The argument covariate
should be a pixel image, or a function,
or one of the strings "x"
or "y"
signifying the
cartesian coordinates. It will be evaluated on a fine grid of locations,
with spatial resolution controlled by the arguments
dimyx,eps,rule.eps
which are passed to as.mask
.
Value
A function value table (object of class "fv"
)
containing the estimated values of \rho
(and confidence limits) for a sequence of values of Z
.
Also belongs to the class "rhohat"
which has special methods for print
, plot
and predict
.
Smooth estimates
Smooth estimators of \rho(z)
were proposed by Baddeley and Turner (2005) and Baddeley et al (2012).
Similar estimators were proposed by Guan (2008) and in the literature
on relative distributions (Handcock and Morris, 1999).
The estimated function \rho(z)
will be a smooth function
of z
.
The smooth estimation procedure involves computing several density estimates
and combining them. The algorithm used to compute density estimates is
determined by smoother
:
-
If
smoother="kernel"
, the smoothing procedure is based on fixed-bandwidth kernel density estimation, performed bydensity.default
. -
If
smoother="local"
, the smoothing procedure is based on local likelihood density estimation, performed bylocfit
.
The argument method
determines how the density estimates will be
combined to obtain an estimate of \rho(z)
:
-
If
method="ratio"
, then\rho(z)
is estimated by the ratio of two density estimates, The numerator is a (rescaled) density estimate obtained by smoothing the valuesZ(y_i)
of the covariateZ
observed at the data pointsy_i
. The denominator is a density estimate of the reference distribution ofZ
. See Baddeley et al (2012), equation (8). This is similar but not identical to an estimator proposed by Guan (2008). -
If
method="reweight"
, then\rho(z)
is estimated by applying density estimation to the valuesZ(y_i)
of the covariateZ
observed at the data pointsy_i
, with weights inversely proportional to the reference density ofZ
. See Baddeley et al (2012), equation (9). -
If
method="transform"
, the smoothing method is variable-bandwidth kernel smoothing, implemented by applying the Probability Integral Transform to the covariate values, yielding values in the range 0 to 1, then applying edge-corrected density estimation on the interval[0,1]
, and back-transforming. See Baddeley et al (2012), equation (10).
If horvitz=TRUE
, then the calculations described above
are modified by using Horvitz-Thompson weighting.
The contribution to the numerator from
each data point is weighted by the reciprocal of the
baseline value or fitted intensity value at that data point;
and a corresponding adjustment is made to the denominator.
Pointwise confidence intervals for the true value of \rho(z)
are also calculated for each z
,
and will be plotted as grey shading.
The confidence intervals are derived using the central limit theorem,
based on variance calculations which assume a Poisson point process.
If positiveCI=FALSE
, the lower limit of the confidence
interval may sometimes be negative, because the confidence intervals
are based on a normal approximation to the estimate of \rho(z)
.
If positiveCI=TRUE
, the confidence limits are always
positive, because the confidence interval is based on a normal
approximation to the estimate of \log(\rho(z))
.
For consistency with earlier versions, the default is
positiveCI=FALSE
for smoother="kernel"
and positiveCI=TRUE
for smoother="local"
.
Monotone estimates
The nonparametric maximum likelihood estimator
of a monotone function \rho(z)
was described by Sager (1982).
This method assumes that
\rho(z)
is either an increasing
function of z
, or a decreasing function of z
.
The estimated function will be a step function,
increasing or decreasing as a function of z
.
This estimator is chosen by specifying
smoother="increasing"
or smoother="decreasing"
.
The argument method
is ignored in this case.
To compute the estimate of \rho(z)
, the algorithm first
computes several primitive step-function estimates, and then takes
the maximum of these primitive functions.
If smoother="decreasing"
, each primitive step function
takes the form \rho(z) = \lambda
when z \le t
,
and \rho(z) = 0
when z > t
, where \lambda
is a primitive estimate of intensity
based on the data for Z \le t
. The jump location t
will be the value of the covariate Z
at one of the
data points. The primitive estimate \lambda
is the average intensity (number of points divided by area)
for the region of space where the covariate value is less than
or equal to t
.
If horvitz=TRUE
, then the calculations described above
are modified by using Horvitz-Thompson weighting.
The contribution to the numerator from
each data point is weighted by the reciprocal of the
baseline value or fitted intensity value at that data point;
and a corresponding adjustment is made to the denominator.
Confidence intervals are not available for the monotone estimators.
Unimodal estimators
If smoother="valley"
then we estimate a U-shaped function.
A function \rho(z)
is U-shaped if it is
decreasing when z < z_0
and
increasing when z > z_0
, where z_0
is
called the critical value. The nonparametric maximum likelihood
estimate of such a function can be computed by profiling over z_0
.
The algorithm considers all possible candidate values of the critical value
z_0
, and estimates the function \rho(z)
separately on the left and right of z_0
using the monotone
estimators described above. These function estimates are combined into
a single function, and the Poisson point process likelihood is
computed. The optimal value of z_0
is the one which maximises the Poisson point process likelihood.
If smoother="mountain"
then we estimate a function which has
an inverted U shape. A function \rho(z)
is
inverted-U-shaped if it is
increasing when z < z_0
and
decreasing when z > z_0
. The nonparametric maximum likelihood
estimate of such a function can be computed by profiling over
z_0
using the same technique mutatis mutandis.
Confidence intervals are not available for the unimodal estimators.
Randomisation
By default, rhohat
adds a small amount of random noise to the
data. This is designed to suppress the effects of
discretisation in pixel images.
This strategy means that rhohat
does not produce exactly the same result when the computation is
repeated. If you need the results to be exactly reproducible, set
jitter=FALSE
.
By default, the values of the covariate at the data points
will be randomly perturbed by adding a small amount
of noise using the function jitter
. To reduce this
effect, set jitterfactor
to a number smaller than 1. To
suppress this effect entirely, set jitter=FALSE
.
Author(s)
Smoothing algorithm by Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Ya-Mei Chang, Yong Song, and Rolf Turner rolfturner@posteo.net.
Nonparametric maximum likelihood algorithm by Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Baddeley, A., Chang, Y.-M., Song, Y. and Turner, R. (2012) Nonparametric estimation of the dependence of a point process on spatial covariates. Statistics and Its Interface 5 (2), 221–236.
Baddeley, A. and Turner, R. (2005) Modelling spatial point patterns in R. In: A. Baddeley, P. Gregori, J. Mateu, R. Stoica, and D. Stoyan, editors, Case Studies in Spatial Point Pattern Modelling, Lecture Notes in Statistics number 185. Pages 23–74. Springer-Verlag, New York, 2006. ISBN: 0-387-28311-0.
Baddeley, A. (2018) A statistical commentary on mineral prospectivity analysis. Chapter 2, pages 25–65 in Handbook of Mathematical Geosciences: Fifty Years of IAMG, edited by B.S. Daya Sagar, Q. Cheng and F.P. Agterberg. Springer, Berlin.
Guan, Y. (2008) On consistent nonparametric intensity estimation for inhomogeneous spatial point processes. Journal of the American Statistical Association 103, 1238–1247.
Handcock, M.S. and Morris, M. (1999) Relative Distribution Methods in the Social Sciences. Springer, New York.
Sager, T.W. (1982) Nonparametric maximum likelihood estimation of spatial patterns. Annals of Statistics 10, 1125–1136.
See Also
rho2hat
,
methods.rhohat
,
parres
.
See ppm
for a parametric method for the same problem.
Examples
X <- rpoispp(function(x,y){exp(3+3*x)})
rho <- rhohat(X, "x")
rho <- rhohat(X, function(x,y){x})
plot(rho)
curve(exp(3+3*x), lty=3, col=4, lwd=2, add=TRUE)
rhoB <- rhohat(X, "x", method="reweight")
rhoC <- rhohat(X, "x", method="transform")
rhoI <- rhohat(X, "x", smoother="increasing")
rhoM <- rhohat(X, "x", smoother="mountain")
plot(rhoI, add=TRUE, .y ~ .x, col=6)
legend("top", lty=c(3, 1), col=c(4, 6), lwd=c(2, 1),
legend=c("true", "increasing"))
Receiver Operating Characteristic
Description
Computes the Receiver Operating Characteristic curve for a point pattern or a fitted point process model.
Usage
roc(X, ...)
## S3 method for class 'ppp'
roc(X, covariate,
...,
baseline = NULL, high = TRUE, weights = NULL,
observations=c("exact", "presence"),
method = "raw",
CI = "none", alpha=0.05,
subset=NULL)
## S3 method for class 'cdftest'
roc(X, ..., high=TRUE)
## S3 method for class 'bermantest'
roc(X, ..., high=TRUE)
## S3 method for class 'im'
roc(X, covariate, ..., high=TRUE)
Arguments
X |
Point pattern (object of class |
covariate |
Spatial covariate. Either a |
... |
Arguments passed to |
baseline |
Optional. A spatial object giving a baseline intensity.
Usually a |
high |
Logical value indicating whether the threshold operation should favour high or low values of the covariate. |
weights |
Optional. Numeric vector of weights attached to the data points. |
observations |
Character string (partially matched)
specifying whether to compute the ROC curve using the
exact point coordinates ( |
method |
The method or methods that should be used to estimate the ROC curve.
A character vector: current choices are
|
CI |
Character string (partially matched) specifying whether confidence intervals should be computed, and for which method. See Details. |
alpha |
Numeric value between 0 and 1. The confidence intervals will have
confidence level |
subset |
Optional. A spatial window (object of class |
Details
This command computes the Receiver Operating Characteristic (ROC)
curve. The area under the ROC is computed by auc
.
The function roc
is generic, with methods for point patterns,
fitted point process models, and other kinds of data.
For a point pattern X
and a spatial covariate Z
, the
ROC is a plot showing the ability of the
covariate
to separate the spatial domain
into areas of high and low density of points.
For each possible threshold z
, the algorithm calculates
the fraction a(z)
of area in the study region where the
covariate takes a value greater than z
, and the
fraction b(z)
of data points for which the covariate value
is greater than z
. The ROC is a plot of b(z)
against
a(z)
for all thresholds z
. This is called the ‘raw’
ROC curve.
There are currently three methods to estimate the ROC curve:
"raw"
-
uses the raw empirical spatial cumulative distribution function of the covariate.
"monotonic"
-
uses a monotonic regression to estimate the relation between the covariate and the point process intensity and then calculates the ROC from that. This corresponds to either a convex minorant or a concave majorant of the raw ROC curve.
"smooth"
-
uses a smooth estimate of the relation between the covariate and the point process intensity and then calculates the ROC from that. See
roc.rhohat
for details. "all"
-
uses all of the above methods.
If CI
is one of the strings 'raw'
,
'monotonic'
or 'smooth'
, then
pointwise 95% confidence intervals for the true ROC curve
will be computed based on the raw
, monotonic
or
smooth
estimates, respectively.
The confidence level is 1-alpha
, so that for example
alpha=0.01
would give 99% confidence intervals.
By default, confidence bands for the ROC curve are not computed.
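For example (an illustrative sketch, not from the original text):
plot(roc(swedishpines, "x", CI="raw", alpha=0.01))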
Some other kinds of objects in spatstat contain sufficient data to
compute the ROC curve. These include the objects returned by
rhohat
,
cdf.test
and berman.test
. Methods are
provided here to compute the ROC curve from these objects.
The method for pixel images (objects of class "im"
)
assumes that X
represents a density or intensity function,
and that the objective is to segregate the spatial region into
subregions of high and low total density by thresholding the
covariate
.
Value
Function value table (object of class "fv"
)
which can be plotted to show the ROC curve.
Also belongs to class "roc"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Ege Rubak rubak@math.aau.dk and Suman Rakshit Suman.Rakshit@curtin.edu.au.
References
Baddeley, A., Rubak, E., Rakshit, S. and Nair, G. (2025) ROC curves for spatial point patterns and presence-absence data. doi:10.48550/arXiv.2506.03414.
Lobo, J.M., Jimenez-Valverde, A. and Real, R. (2007) AUC: a misleading measure of the performance of predictive distribution models. Global Ecology and Biogeography 17(2) 145–151.
Nam, B.-H. and D'Agostino, R. (2002) Discrimination index, the area under the ROC curve. Pages 267–279 in Huber-Carol, C., Balakrishnan, N., Nikulin, M.S. and Mesbah, M., Goodness-of-fit tests and model validity, Birkhauser, Basel.
See Also
Examples
gold <- rescale(murchison$gold, 1000, "km")
faults <- rescale(murchison$faults, 1000, "km")
dfault <- distfun(faults)
if(interactive()) {
plot(roc(gold, dfault, method = "all", high=FALSE))
} else {
## reduce sample resolution to save computation time in test
plot(roc(gold, dfault, method = "all", high=FALSE, eps=8))
}
# Using either an image or reference population as baseline
cases <- split(chorley)$larynx
controls <- split(chorley)$lung
covar <- distfun(as.ppp(chorley.extra$incin, W = Window(chorley)))
if(interactive()) {
population <- density(controls, sigma=0.15, eps=0.1)
} else {
## reduce resolution to save computation time in test
population <- density(controls, sigma=0.3, eps=0.25)
}
population <- eval.im(pmax(population, 1e-10))
roc1 <- roc(cases, covar, baseline = population, high = FALSE, method="all")
roc2 <- roc(cases, covar, baseline = controls, high = FALSE, method="all")
plot(anylist(roc1=roc1, roc2=roc2), main = "")
Receiver Operating Characteristic
Description
Computes the Receiver Operating Characteristic curve for a point pattern from a given intensity function.
Usage
## S3 method for class 'rhohat'
roc(X, ..., high = TRUE)
Arguments
X |
Estimate of rho function (object of class |
... |
Ignored. |
high |
Logical value indicating whether the threshold operation should favour high or low values of the covariate. |
Details
This command computes Receiver Operating
Characteristic curve from an estimate of rhohat
.
See roc
for general information about ROC curves.
It is assumed that the rhohat function X
describes the true functional
dependency of the point process intensity on the given covariate (included in
the rhohat
-object). The ROC curve is estimated by integrating this
function as described in Baddeley et al (2025).
Value
Function value table (object of class "fv"
and "roc"
)
which can be plotted to show the ROC curve.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Ege Rubak rubak@math.aau.dk and Suman Rakshit Suman.Rakshit@curtin.edu.au.
References
Baddeley, A., Rubak, E., Rakshit, S. and Nair, G. (2025) ROC curves for spatial point patterns and presence-absence data. doi:10.48550/arXiv.2506.03414.
See Also
Examples
rh <- rhohat(swedishpines, "x")
plot(roc(rh))
Rose Diagram
Description
Plots a rose diagram (rose of directions), the analogue of a histogram or density plot for angular data.
Usage
rose(x, ...)
## Default S3 method:
rose(x, breaks = NULL, ...,
weights=NULL,
nclass = NULL,
unit = c("degree", "radian"),
start=0, clockwise=FALSE,
main)
## S3 method for class 'histogram'
rose(x, ...,
unit = c("degree", "radian"),
start=0, clockwise=FALSE,
main, labels=TRUE, at=NULL, do.plot = TRUE)
## S3 method for class 'density'
rose(x, ...,
unit = c("degree", "radian"),
start=0, clockwise=FALSE,
main, labels=TRUE, at=NULL, do.plot = TRUE)
## S3 method for class 'fv'
rose(x, ...,
unit = c("degree", "radian"),
start=0, clockwise=FALSE,
main, labels=TRUE, at=NULL, do.plot = TRUE)
Arguments
x |
Data to be plotted.
A numeric vector containing angles,
or a |
breaks , nclass |
Arguments passed to |
... |
Additional arguments passed to |
unit |
The unit in which the angles are expressed. |
start |
The starting direction for measurement of angles,
that is, the spatial direction which corresponds to a measured angle
of zero. Either a character string giving a compass direction
( |
clockwise |
Logical value indicating whether angles increase in the clockwise
direction ( |
weights |
Optional vector of numeric weights associated with |
main |
Optional main title for the plot. |
labels |
Either a logical value indicating whether to plot labels next to the tick marks, or a vector of labels for the tick marks. |
at |
Optional vector of angles at which tick marks should be plotted.
Set |
do.plot |
Logical value indicating whether to really perform the plot. |
Details
A rose diagram or rose of directions is the analogue of a histogram or bar chart for data which represent angles in two dimensions. The bars of the bar chart are replaced by circular sectors in the rose diagram.
The function rose
is generic, with a default method
for numeric data, and methods for histograms and function tables.
If x
is a numeric vector, it must contain angular values
in the range 0 to 360 (if unit="degree"
)
or in the range 0 to 2 * pi
(if unit="radian"
).
A histogram of the data will first be computed using
hist
. Then the rose diagram of this histogram
will be plotted by rose.histogram
.
If x
is an object of class "histogram"
produced by
the function hist
, representing the histogram
of angular data, then the rose diagram of the densities
(rather than the counts) in this histogram object will be plotted.
If x
is an object of class "density"
produced by
circdensity
or density.default
,
representing a kernel smoothed density estimate of angular data,
then the rose diagram of the density estimate will be plotted.
If x
is a function value table (object of class "fv"
)
then the argument of the function will be interpreted as an angle,
and the value of the function will be interpreted as the radius.
By default, angles are interpreted using the mathematical convention
where the zero angle is the horizontal x
axis, and angles
increase anti-clockwise. Other conventions can be specified
using the arguments start
and clockwise
.
Standard compass directions are obtained by setting unit="degree"
,
start="N"
and clockwise=TRUE
.
Value
A window (class "owin"
) containing the plotted region.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
Rolf Turner rolfturner@posteo.net
and Ege Rubak rubak@math.aau.dk
See Also
fv
, hist
,
circdensity
,
density.default
.
Examples
ang <- runif(1000, max=360)
rose(ang, col="grey")
rose(ang, col="grey", start="N", clockwise=TRUE)
Rotational Average of a Pixel Image
Description
Compute the average pixel value over all rotations of the image about the origin, as a function of distance from the origin.
Usage
rotmean(X, ..., origin, padzero=TRUE, Xname, result=c("fv", "im"), adjust=1)
Arguments
X |
A pixel image. |
... |
Ignored. |
origin |
Optional. Origin about which the rotations should be performed.
Either a numeric vector or a character string as described
in the help for |
padzero |
Logical. If |
Xname |
Optional name for |
result |
Character string specifying the kind of result required: either a function object or a pixel image. |
adjust |
Adjustment factor for bandwidth used in kernel smoothing. |
Details
This command computes, for each possible distance r
,
the average pixel value of the pixels lying at
distance r
from the origin. Kernel smoothing is used
to obtain a smooth function of r
.
If result="fv"
(the default) the result is a function
object of class "fv"
giving the mean pixel value of X
as a function of distance from the origin.
If result="im"
the result is a pixel image, with the same
dimensions as X
, giving the mean value of X
over all pixels lying at the same distance from the origin
as the current pixel.
If padzero=TRUE
(the default), the value of X
is assumed to be zero outside the window of X
. The rotational
mean at a given distance r
is the average value of the image
X
over the entire circle of radius r
,
including zero values outside the window if the circle
lies partly outside the window.
If padzero=FALSE
, the value of X
is taken to be
undefined outside the window of X
. The rotational mean
is the average of the X
values over the subset of the circle
of radius r
that lies entirely inside the window.
Value
An object of class "fv"
or "im"
,
with the same coordinate units as X
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Examples
online <- interactive()
resolution <- if(online) 128 else 32
Z <- setcov(square(1), dimyx=resolution)
f <- rotmean(Z)
if(online) {
plot(rotmean(Z))
plot(rotmean(Z, result="im"))
}
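## further sketch, not from the original example: restrict the average
## to the part of each circle lying inside the window
f0 <- rotmean(Z, padzero=FALSE)
if(online) plot(f0)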
Spatial Scan Test
Description
Performs the Spatial Scan Test for clustering in a spatial point pattern, or for clustering of one type of point in a bivariate spatial point pattern.
Usage
scan.test(X, r, ...,
method = c("poisson", "binomial"),
nsim = 19,
baseline = NULL,
case = 2,
alternative = c("greater", "less", "two.sided"),
verbose = TRUE)
Arguments
X |
A point pattern (object of class |
r |
Radius of circle to use. A single number or a numeric vector. |
... |
Optional. Arguments passed to |
method |
Either |
nsim |
Number of simulations for computing Monte Carlo p-value. |
baseline |
Baseline for the Poisson intensity, if |
case |
Which type of point should be interpreted as a case,
if |
alternative |
Alternative hypothesis: |
verbose |
Logical. Whether to print progress reports. |
Details
The spatial scan test (Kulldorff, 1997) is applied
to the point pattern X
.
In a nutshell,
-
If
method="poisson"
then a significant result would mean that there is a circle of radiusr
, located somewhere in the spatial domain of the data, which contains a significantly higher than expected number of points ofX
. That is, the patternX
exhibits spatial clustering. -
If
method="binomial"
thenX
must be a bivariate (two-type) point pattern. By default, the first type of point is interpreted as a control (non-event) and the second type of point as a case (event). A significant result would mean that there is a circle of radiusr
which contains a significantly higher than expected number of cases. That is, the cases are clustered together, conditional on the locations of all points.
Following is a more detailed explanation.
-
If
method="poisson"
then the scan test based on Poisson likelihood is performed (Kulldorff, 1997). The dataset X
is treated as an unmarked point pattern. By default (ifbaseline
is not specified) the null hypothesis is complete spatial randomness CSR (i.e. a uniform Poisson process). The alternative hypothesis is a Poisson process with one intensity\beta_1
inside some circle of radiusr
and another intensity\beta_0
outside the circle. Ifbaseline
is given, then it should be a pixel image or afunction(x,y)
. The null hypothesis is an inhomogeneous Poisson process with intensity proportional tobaseline
. The alternative hypothesis is an inhomogeneous Poisson process with intensitybeta1 * baseline
inside some circle of radiusr
, andbeta0 * baseline
outside the circle. -
If
method="binomial"
then the scan test based on binomial likelihood is performed (Kulldorff, 1997). The dataset X
must be a bivariate point pattern, i.e. a multitype point pattern with two types. The null hypothesis is that all permutations of the type labels are equally likely. The alternative hypothesis is that some circle of radiusr
has a higher proportion of points of the second type, than expected under the null hypothesis.
The result of scan.test
is a hypothesis test
(object of class "htest"
) which can be plotted to
report the results. The component p.value
contains the
p
-value.
The result of scan.test
can also be plotted (using the plot
method for the class "scan.test"
). The plot is
a pixel image of the Likelihood Ratio Test Statistic
(2 times the log likelihood ratio) as a function
of the location of the centre of the circle.
This pixel image can be extracted from the object
using as.im.scan.test
.
The Likelihood Ratio Test Statistic is computed by
scanLRTS
.
Value
An object of class "htest"
(hypothesis test)
which also belongs to the class "scan.test"
.
Printing this object gives the result of the test.
Plotting this object displays the Likelihood Ratio Test Statistic
as a function of the location of the centre of the circle.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
References
Kulldorff, M. (1997) A spatial scan statistic. Communications in Statistics — Theory and Methods 26, 1481–1496.
See Also
plot.scan.test
,
as.im.scan.test
,
relrisk
,
scanLRTS
Examples
nsim <- if(interactive()) 19 else 2
rr <- if(interactive()) seq(0.5, 1, by=0.1) else c(0.5, 1)
scan.test(redwood, 0.1 * rr, method="poisson", nsim=nsim)
scan.test(chorley, rr, method="binomial", case="larynx", nsim=nsim)
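## sketch, not from the original example: plot the result and extract
## the image of the test statistic
st <- scan.test(redwood, 0.1, method="poisson", nsim=nsim)
plot(st)
Z <- as.im(st)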
Likelihood Ratio Test Statistic for Scan Test
Description
Calculate the Likelihood Ratio Test Statistic for the Scan Test, at each spatial location.
Usage
scanLRTS(X, r, ...,
method = c("poisson", "binomial"),
baseline = NULL, case = 2,
alternative = c("greater", "less", "two.sided"),
saveopt = FALSE,
Xmask = NULL)
Arguments
X |
A point pattern (object of class |
r |
Radius of circle to use. A single number or a numeric vector. |
... |
Optional. Arguments passed to |
method |
Either |
baseline |
Baseline for the Poisson intensity, if |
case |
Which type of point should be interpreted as a case,
if |
alternative |
Alternative hypothesis: |
saveopt |
Logical value indicating whether to save the optimal value of |
Xmask |
Internal use only. |
Details
This command computes, for all spatial locations u
,
the Likelihood Ratio Test Statistic \Lambda(u)
for a test of homogeneity at the location u
, as described
below. The result is a pixel image giving the values of
\Lambda(u)
at each pixel.
The maximum value of \Lambda(u)
over all locations
u
is the scan statistic, which is the basis of
the scan test performed by scan.test
.
-
If
method="poisson"
then the test statistic is based on Poisson likelihood. The datasetX
is treated as an unmarked point pattern. By default (ifbaseline
is not specified) the null hypothesis is complete spatial randomness CSR (i.e. a uniform Poisson process). At the spatial locationu
, the alternative hypothesis is a Poisson process with one intensity\beta_1
inside the circle of radiusr
centred atu
, and another intensity\beta_0
outside the circle. Ifbaseline
is given, then it should be a pixel image or afunction(x,y)
. The null hypothesis is an inhomogeneous Poisson process with intensity proportional tobaseline
. The alternative hypothesis is an inhomogeneous Poisson process with intensitybeta1 * baseline
inside the circle, andbeta0 * baseline
outside the circle. -
If
method="binomial"
then the test statistic is based on binomial likelihood. The datasetX
must be a bivariate point pattern, i.e. a multitype point pattern with two types. The null hypothesis is that all permutations of the type labels are equally likely. The alternative hypothesis is that the circle of radiusr
centred atu
has a higher proportion of points of the second type, than expected under the null hypothesis.
If r is a vector containing more than one value of the radius, the calculations described above are performed for every value of r, and the maximum over r is taken at each spatial location u. The resulting pixel value of scanLRTS at a location u is thus the profile maximum of the Likelihood Ratio Test Statistic: the maximum of the Likelihood Ratio Test Statistic over circles of all radii centred at the same location u.
If you have already performed a scan test using scan.test, the Likelihood Ratio Test Statistic can be extracted from the test result using the function as.im.scan.test.
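A minimal sketch of this relationship (the maximum pixel value of the LRTS image is the scan statistic for that radius, up to discretisation error; the value of nsim is illustrative):
lam <- scanLRTS(redwood, r = 0.1, method = "poisson")
max(lam)                     # the scan statistic for radius 0.1
st <- scan.test(redwood, 0.1, method = "poisson", nsim = 19)
plot(as.im.scan.test(st))    # the same LRTS image, recovered from the test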
Value
A pixel image (object of class "im"
) whose pixel values
are the values of the (profile) Likelihood Ratio Test Statistic at each
spatial location.
Warning: window size
Note that the result of scanLRTS
is a pixel image
on a larger window than the original window of X
.
The expanded window contains the centre of any circle
of radius r
that has nonempty intersection with the original window.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
References
Kulldorff, M. (1997) A spatial scan statistic. Communications in Statistics — Theory and Methods 26, 1481–1496.
See Also
Examples
plot(scanLRTS(redwood, 0.1, method="poisson"))
sc <- scanLRTS(chorley, 1, method="binomial", case="larynx")
plot(sc)
scanstatchorley <- max(sc)
Sufficient Dimension Reduction
Description
Given a point pattern and a set of predictors, find a minimal set of new predictors, each constructed as a linear combination of the original predictors.
Usage
sdr(X, covariates, ...)
## S3 method for class 'ppp'
sdr(X, covariates,
method = c("DR", "NNIR", "SAVE", "SIR", "TSE"),
Dim1 = 1, Dim2 = 1, predict=FALSE, ...)
Arguments
X |
A point pattern (object of class |
covariates |
A list of pixel images (objects of class |
method |
Character string indicating which method to use. See Details. |
Dim1 |
Dimension of the first order Central Intensity Subspace
(applicable when |
Dim2 |
Dimension of the second order Central Intensity Subspace
(applicable when |
predict |
Logical value indicating whether to compute the new predictors as well. |
... |
Additional arguments (ignored by |
Details
Given a point pattern X and predictor variables Z_1, \dots, Z_p, Sufficient Dimension Reduction methods (Guan and Wang, 2010) attempt to find a minimal set of new predictor variables, each constructed by taking a linear combination of the original predictors, which explain the dependence of X on Z_1, \dots, Z_p. The methods do not assume any particular form of dependence of the point pattern on the predictors. The predictors are assumed to be Gaussian random fields.
Available methods are:
method="DR" | directional regression |
method="NNIR" | nearest neighbour inverse regression |
method="SAVE" | sliced average variance estimation |
method="SIR" | sliced inverse regression |
method="TSE" | two-step estimation |
The result includes a matrix B whose columns are estimates of the basis vectors of the space of new predictors. That is, the j-th column of B expresses the j-th new predictor as a linear combination of the original predictors.
If predict=TRUE, the new predictors are also evaluated. They can also be evaluated using sdrPredict.
Value
A list with components B, M or B, M1, M2, where B is a matrix whose columns are estimates of the basis vectors for the space, and M or M1, M2 are matrices containing estimates of the kernel.
If predict=TRUE, the result also includes a component Y which is a list of pixel images giving the values of the new predictors.
Author(s)
Matlab original by Yongtao Guan, translated to R by Suman Rakshit.
References
Guan, Y. and Wang, H. (2010) Sufficient dimension reduction for spatial point processes directed by Gaussian random fields. Journal of the Royal Statistical Society, Series B, 72, 367–387.
See Also
sdrPredict to compute the new predictors from the coefficient matrix.
dimhat to estimate the subspace dimension.
Examples
A <- sdr(bei, bei.extra, predict=TRUE)
A
Y1 <- A$Y[[1]]
plot(Y1)
points(bei, pch=".", cex=2)
# investigate likely form of dependence
plot(rhohat(bei, Y1))
Compute Predictors from Sufficient Dimension Reduction
Description
Given the result of a Sufficient Dimension Reduction method, compute the new predictors.
Usage
sdrPredict(covariates, B)
Arguments
covariates |
A list of pixel images (objects of class |
B |
Either a matrix of coefficients for the covariates, or the result of
a call to |
Details
This function assumes that sdr
has already been used to
find a minimal set of predictors based on the covariates
.
The argument B
should be either the result of sdr
or the coefficient matrix returned as one of the
results of sdr
. The columns of this matrix define linear
combinations of the covariates
. This function evaluates those
linear combinations, and returns a list of pixel images containing the
new predictors.
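As an illustration, the evaluation can be written out by hand for the first column of B (a sketch only, ignoring any internal normalisation; sdrPredict handles the general case):
A <- sdr(bei, bei.extra)
B <- A$B
# first new predictor: linear combination of the covariate images
Y1 <- Reduce("+", mapply("*", bei.extra, B[,1], SIMPLIFY=FALSE))
plot(Y1)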
Value
A list of pixel images (objects of class "im"
)
with one entry for each column of B
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
See Also
Examples
A <- sdr(bei, bei.extra)
Y <- sdrPredict(bei.extra, A)
Y
Test of Spatial Segregation of Types
Description
Performs a Monte Carlo test of spatial segregation of the types in a multitype point pattern.
Usage
segregation.test(X, ...)
## S3 method for class 'ppp'
segregation.test(X, ..., nsim = 19,
permute = TRUE, verbose = TRUE, Xname)
Arguments
X |
Multitype point pattern (object of class |
... |
Additional arguments passed to |
nsim |
Number of simulations for the Monte Carlo test. |
permute |
Argument passed to |
verbose |
Logical value indicating whether to print progress reports. |
Xname |
Optional character string giving the name of the dataset |
Details
The Monte Carlo test of spatial segregation of types,
proposed by Kelsall and Diggle (1995)
and Diggle et al (2005), is applied to the point pattern X
.
The test statistic is
T = \sum_i \sum_m \left( \widehat p(m \mid x_i) - \overline p_m \right)^2
where \widehat p(m \mid x_i) is the leave-one-out kernel smoothing estimate of the probability that the i-th data point has type m, and \overline p_m is the average fraction of data points which are of type m.
The statistic T is evaluated for the data and for nsim randomised versions of X, generated by randomly permuting or resampling the marks.
Note that, by default, automatic bandwidth selection will be
performed separately for each randomised pattern. This computation
can be very time-consuming but is necessary for the test to be
valid in most conditions. A short-cut is to specify the value of
the smoothing bandwidth sigma
as shown in the examples.
Value
An object of class "htest"
representing the result of the test.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Bithell, J.F. (1991) Estimation of relative risk functions. Statistics in Medicine 10, 1745–1751.
Kelsall, J.E. and Diggle, P.J. (1995) Kernel estimation of relative risk. Bernoulli 1, 3–16.
Diggle, P.J., Zheng, P. and Durr, P. (2005) Non-parametric estimation of spatial segregation in a multivariate point process: bovine tuberculosis in Cornwall, UK. Applied Statistics 54, 645–658.
See Also
Examples
segregation.test(hyytiala, 5)
if(interactive()) segregation.test(hyytiala, hmin=0.05)
Data Sharpening of Point Pattern
Description
Performs Choi-Hall data sharpening of a spatial point pattern.
Usage
sharpen(X, ...)
## S3 method for class 'ppp'
sharpen(X, sigma=NULL, ...,
varcov=NULL, edgecorrect=FALSE)
Arguments
X |
A marked point pattern (object of class |
sigma |
Standard deviation of isotropic Gaussian smoothing kernel. |
varcov |
Variance-covariance matrix of anisotropic Gaussian kernel.
Incompatible with |
edgecorrect |
Logical value indicating whether to apply edge effect bias correction. |
... |
Arguments passed to |
Details
Choi and Hall (2001) proposed a procedure for data sharpening of spatial point patterns. This procedure is appropriate for earthquake epicentres and other point patterns which are believed to exhibit strong concentrations of points along a curve. Data sharpening causes such points to concentrate more tightly along the curve.
If the original data points are X_1, \ldots, X_n, then the sharpened points are
\hat X_i = \frac{\sum_j X_j k(X_j - X_i)}{\sum_j k(X_j - X_i)}
where k is a smoothing kernel in two dimensions. Thus, each new point \hat X_i is a vector average of the nearby points X_j.
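A direct transcription of this formula for an isotropic Gaussian kernel, with no edge correction (a sketch only; sharpen.ppp is the real implementation, and the value of sigma here is illustrative):
X <- redwood
xy <- coords(X)
sigma <- 0.05
D <- as.matrix(dist(xy))            # pairwise distances between data points
K <- exp(-D^2 / (2 * sigma^2))      # Gaussian kernel weights k(X_j - X_i)
W <- K / rowSums(K)                 # each row sums to 1
Xsharp <- ppp(as.numeric(W %*% xy$x), as.numeric(W %*% xy$y),
              window = Window(X))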
The function sharpen
is generic. It currently has only one
method, for two-dimensional point patterns (objects of class
"ppp"
).
If sigma
is given, the smoothing kernel is the
isotropic two-dimensional Gaussian density with standard deviation
sigma
in each axis. If varcov
is given, the smoothing
kernel is the Gaussian density with variance-covariance matrix
varcov
.
The data sharpening procedure tends to cause the point pattern to contract away from the boundary of the window. That is, points X_i that lie close to the edge of the window tend to be displaced inwards. If edgecorrect=TRUE, the algorithm is modified to correct this bias.
Value
A point pattern (object of class "ppp"
) in the same window
as the original pattern X
, and with the same marks as X
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk
References
Choi, E. and Hall, P. (2001) Nonparametric analysis of earthquake point-process data. In M. de Gunst, C. Klaassen and A. van der Vaart (eds.) State of the art in probability and statistics: Festschrift for Willem R. van Zwet, Institute of Mathematical Statistics, Beachwood, Ohio. Pages 324–344.
See Also
Examples
X <- unmark(shapley)
Y <- sharpen(X, sigma=0.5)
Z <- sharpen(X, sigma=0.5, edgecorrect=TRUE)
opa <- par(mar=rep(0.2, 4))
plot(solist(X, Y, Z), main= " ",
main.panel=c("data", "sharpen", "sharpen, correct"),
pch=".", equal.scales=TRUE, mar.panel=0.2)
par(opa)
Estimate the Spatial Covariance Function of a Random Field
Description
Given a pixel image, calculate an estimate of the spatial covariance function. Given two pixel images, calculate an estimate of their spatial cross-covariance function.
Usage
spatcov(X, Y=X, ..., correlation=FALSE, isotropic = TRUE,
clip = TRUE, pooling=TRUE)
Arguments
X |
A pixel image (object of class |
Y |
Optional. Another pixel image. |
correlation |
Logical value specifying whether to standardise so that the spatial correlation function is returned. |
isotropic |
Logical value specifying whether to assume the covariance is isotropic, so that the result is a function of the lag distance. |
clip |
Logical value specifying whether to restrict the results to the range of spatial lags where the estimate is reliable. |
pooling |
Logical value specifying the estimation method when |
... |
Ignored. |
Details
In normal usage, only the first argument X
is given.
Then the pixel image X
is treated as a realisation of a stationary
random field, and its spatial covariance function is estimated.
Alternatively if Y
is given,
then X
and Y
are assumed to be
jointly stationary random fields, and their spatial cross-covariance
function is estimated.
For any random field X, the spatial covariance is defined for any two spatial locations u and v by
C(u,v) = \mbox{cov}(X(u), X(v))
where X(u) and X(v) are the values of the random field at those locations. Here \mbox{cov} denotes the statistical covariance, defined for any random variables A and B by
\mbox{cov}(A,B) = E(AB) - E(A) E(B)
where E(A) denotes the expected value of A.
If the random field is assumed to be stationary (at least second-order stationary), then the spatial covariance C(u,v) depends only on the lag vector v-u:
C(u,v) = C_2(v-u)
where C_2 is a function of a single vector argument.
If the random field is stationary and isotropic, then the spatial covariance depends only on the lag distance \| v - u \|:
C_2(v-u) = C_1(\|v-u\|)
where C_1 is a function of distance.
The function spatcov computes estimates of the covariance function C_1 or C_2 as follows:
- If isotropic=FALSE, an estimate of the covariance function C_2 is computed, assuming the random field is stationary, using the naive moment estimator C2 = imcov(X-mean(X))/setcov(Window(X)). The result is a pixel image.
- If isotropic=TRUE (the default), an estimate of the covariance function C_1 is computed, assuming the random field is stationary and isotropic.
  - When pooling=FALSE, the estimate of C_1 is the rotational average of the naive estimate of C_2.
  - When pooling=TRUE (the default), the estimate of C_1 is the ratio of the rotational averages of the numerator and denominator which form the naive estimate of C_2.
  The result is a function object (class "fv").
If the argument Y is given, it should be a pixel image compatible with X. An estimate of the spatial cross-covariance function between X and Y will be computed.
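In the anisotropic case the naive moment estimator quoted above can be transcribed directly (a sketch):
D <- density(cells)
C2 <- imcov(D - mean(D)) / setcov(Window(D))
plot(C2)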
Value
If isotropic=TRUE
(the default), the result is a function value
table (object of class "fv"
) giving the estimated values of the
covariance function or spatial correlation function
for a sequence of values of the spatial lag
distance r
.
If isotropic=FALSE
, the result is a pixel image
(object of class "im"
) giving the estimated values of the
spatial covariance function or spatial correlation function
for a grid of values of the spatial lag vector.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
See Also
Examples
if(offline <- !interactive()) op <- spatstat.options(npixel=32)
D <- density(cells)
plot(spatcov(D))
if(offline) spatstat.options(op)
Spatial Cumulative Distribution Function
Description
Compute the spatial cumulative distribution function of a spatial covariate, optionally using spatially-varying weights.
Usage
spatialcdf(Z, weights = NULL, normalise = FALSE, ..., W = NULL, Zname = NULL)
Arguments
Z |
Spatial covariate.
A pixel image or a |
weights |
Spatial weighting for different locations.
A pixel image, a |
normalise |
Logical. Whether the weights should be normalised so that they sum to 1. |
... |
Arguments passed to |
W |
Optional window (object of class |
Zname |
Optional character string for the name of the covariate |
Details
If weights is missing or NULL, it defaults to 1. The values of the covariate Z are computed on a grid of pixels. The weighted cumulative distribution function of Z values is computed, taking each value with weight equal to the pixel area. The resulting function F is such that F(t) is the area of the region of space where Z \le t.
If weights is a pixel image or a function, then the values of weights and of the covariate Z are computed on a grid of pixels. The weights are multiplied by the pixel area. Then the weighted empirical cumulative distribution function of Z values is computed using ewcdf. The resulting function F is such that F(t) is the total weight (or weighted area) of the region of space where Z \le t.
If weights is a fitted point process model, then it should be a Poisson process. The fitted intensity of the model, and the value of the covariate Z, are evaluated at the quadrature points used to fit the model. The weights are multiplied by the weights of the quadrature points. Then the weighted empirical cumulative distribution of Z values is computed using ewcdf. The resulting function F is such that F(t) is the expected number of points in the point process that will fall in the region of space where Z \le t.
If normalise=TRUE, the function is normalised so that its maximum value equals 1, so that it gives the cumulative fraction of weight or cumulative fraction of points.
The result can be printed, plotted, and used as a function.
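For example (a sketch), with normalise=TRUE the value F(t) is the fraction of the window area where Z <= t:
F <- spatialcdf(bei.extra$grad, normalise=TRUE)
F(0.1)     # fraction of the window where the slope is at most 0.1
plot(F)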
Value
A cumulative distribution function object
belonging to the classes "spatialcdf"
,
"ewcdf"
, "ecdf"
(only if normalise=TRUE
)
and "stepfun"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
See Also
Examples
with(bei.extra, {
plot(spatialcdf(grad))
if(require("spatstat.model")) {
fit <- ppm(bei ~ elev)
plot(spatialcdf(grad, predict(fit)))
A <- spatialcdf(grad, fit)
A(0.1)
}
})
plot(spatialcdf("x", W=letterR))
Deprecated spatstat.explore functions
Description
Deprecated spatstat.explore functions.
Usage
evalCovar(model, covariate, ...)
which.max.im(x)
Details
These functions are deprecated, and will eventually be deleted from the spatstat.explore package.
which.max.im(x)
is replaced by
im.apply(x, which.max)
.
The internal function evalCovar
is replaced by the internal function spatialCovariateEvidence
.
Value
which.max.im
returns an integer.
Internal spatstat.explore functions
Description
Internal spatstat.explore functions.
Usage
## S3 method for class 'localpcfmatrix'
x[i, ...]
## S3 method for class 'rat'
x[...]
adjust.ratfv(f, columns, numfactor, denfactor)
ang2rad(ang, unit, start, clockwise)
## S3 method for class 'bw.optim'
as.data.frame(x, ...)
## S3 method for class 'fv'
as.data.frame(x, ...)
as.roc.data.frame(x, method, covtype, CI, leftoneout)
assemble.plot.objects(xlim, ylim, ..., lines, polygon)
aucData(covariate, nullmodel, ...,
high, interpolate, jitter, subset,
covariateAtPoints, discrimAtPoints)
## S3 method for class 'spatialCDFframe'
auc(X, ..., high)
bandwidth.is.infinite(sigma)
BartCalc(fY, fK)
bermantestCalc(fram, which, alternative, ...)
bermantestEngine(model, covariate, which, alternative, ...,
modelname, covname, dataname)
bind.ratfv(x, numerator, denominator, labl, desc, preferred,
ratio, quotient)
bw.optim(cv, h, iopt, ..., cvname, hname, criterion,
optimum, warnextreme, hargnames, yexp, unitname,
template, exponent, hword)
calc.DR(COV, z, Dim)
calc.NNIR(COV, z, pos, Dim)
calc.SAVE(COV, z, Dim)
calc.SIR(COV, z)
calc.TSE(COV, z, pos, Dim1, Dim2)
censtimeCDFest(o, cc, d, breaks, ...,
KM, RS, HAN, RAW, han.denom, tt, pmax, fname, fexpr)
check.testfun(f, f1, X)
circticks(R, at, unit, start, clockwise, labels)
clarkevansCalc(X, correction, clipregion, working)
## S3 method for class 'rat'
compatible(A, B, ...)
conform.ratfv(x)
CressieReadStatistic(OBS,EXP,lambda,normalise,named)
CressieReadSymbol(lambda)
CressieReadName(lambda)
cutoff2Dkernel(kernel, sigma, varcov, ..., scalekernel, cutoff, fatal)
CVforPCF(BW, stuff)
DeltaMethodVarOfRatio(num, den, varnum, varden, covnumden, positive)
Deviation(x, ref, leaveout, n, xi)
densitycrossEngine(Xdata, Xquery, sigma, ...,
kernel, scalekernel,
weights, edge, varcov,
diggle, sorted, cutoff,
se, kerpow)
densitypointsEngine(x, sigma, ...,
kernel, scalekernel, kerpow,
weights, edge, varcov,
leaveoneout, diggle, sorted, spill, cutoff,
debug)
digestCovariates(..., W)
digital.volume(range, nval, vside)
## S3 method for class 'fasp'
dim(x)
## S3 method for class 'fasp'
dimnames(x)
## S3 replacement method for class 'fasp'
dimnames(x) <- value
distributecbind(x)
ensure.listarg(x, n, singletypes, xtitle, things)
envelopeEngine(X, fun, simul,
nsim, nrank, ..., funargs, funYargs,
verbose, clipdata,
transform, global, ginterval, use.theory,
alternative, scale, clamp,
savefuns, savepatterns, saveresultof,
weights,
nsim2, VARIANCE, nSD,
Yname,
maxnerr, rejectNA, silent, maxerr.action,
internal, cl,
envir.user, expected.arg, do.pwrong,
foreignclass, collectrubbish)
envelopeProgressData(X, fun, ..., exponent,
alternative, leaveout, scale, clamp,
normalize, deflate, rmin,
save.envelope, savefuns, savepatterns)
envelopeTest(X, ..., exponent, alternative,
rinterval, leaveout, scale, clamp, tie.rule,
interpolate, save.interpolant,
save.envelope, savefuns, savepatterns,
Xname, badXfatal, verbose)
## S3 method for class 'hasenvelope'
envelope(Y, ..., Yname)
## S3 method for class 'matrix'
envelope(Y, ..., argvals, rvals,
observed, theory, funX, nsim, nsim2, jsim, jsim.mean,
type, alternative, scale, clamp, csr, use.theory, nrank, ginterval, nSD,
savefuns, check, Yname, argname, arg.desc,
fname.orig, transform,
do.pwrong, weights, precomputed, gaveup)
evaluateCovariate(covariate, locations, ...)
evaluateCovariateAtPixels(covariate, locations, ...,
types, eps, dimyx, rule.eps)
evaluateCovariateAtPoints(covariate, locations, ..., allow.column)
evaluate2Dkernel(kernel, x, y, sigma, varcov, ..., scalekernel)
ExpSmoothLog(X, ..., at, weights, se)
exactppm(X, baseline, ..., subset, eps, dimyx, rule.eps)
extractAtomicQtests(x)
fasp(fns, which, formulae, dataname, title, rowNames, colNames, checkfv)
f3engine(x, y, z, box, vside, range, nval, correction)
f3Cengine(x, y, z, box, vside, rmax, nrval)
findbestlegendpos(...)
findcbind(root, depth, maxdepth)
flatfname(x)
FormatFaspFormulae(f, argname)
fvexprmap(x)
fvlabels(x, expand=FALSE)
fvlabels(x) <- value
fvlabelmap(x, dot=TRUE)
fvlegend(object, elang)
g3engine(x, y, z, box, rmax, nrval, correction)
g3Cengine(x, y, z, box, rmax, nrval)
getSumFun(abbreviation, classname, ismarked, fatal)
good.correction.K(X)
hasenvelope(X, E)
implemented.for.K(correction, windowtype, explicit)
is.atomicQtest(x)
## S3 method for class 'exactppm'
is.poisson(x)
is.scov(x)
## S3 method for class 'exactppm'
is.stationary(x)
k3engine(x, y, z, box, rmax, nrval, correction)
Kborder.engine(X, rmax, nr, correction, weights, ratio)
Knone.engine(X, rmax, nr, weights, ratio)
Krect.engine(X, rmax, nr, correction, weights,
ratio, fname, use.integers)
Kount(dIJ, bI, b, breaks)
Kwtsum(dIJ, bI, wIJ, b, w, breaks, fatal)
localKengine(X, ..., wantL, lambda, rmax, correction, verbose, rvalue)
localKmultiEngine(X, from, to,
lambdaFrom, lambdaTo, ..., rmax, wantL,
correction, verbose, rvalue, sigma, varcov,
lambdaX, update, leaveoneout,
Iexplain, Jexplain, Ikey, Jkey)
localpcfengine(X, ..., delta, rmax, nr, stoyan, lambda, rvalue)
localpcfmatrix(X, i, ..., lambda, delta, rmax, nr, stoyan)
lookup2DkernelInfo(kernel)
makefvlabel(op, accent, fname, sub, argname, pre, post)
makeRocDesc(desc, leftoneout)
makeRocFlabel(sub, leftoneout, super, f, argu)
makeRocTag(sub, leftoneout)
maskLaslett(X, ..., eps, dimyx, xy, rule.eps, oldX, verbose, plotit)
match2DkernelName(kernel)
mctestSigtraceEngine(R, devdata, devsim, ...,
interpolate, confint, alpha, exponent, unitname)
meanlistfv(z, ...)
monotonicRhoFun(X, Z, increasing, ..., weights, subset, baseline)
monotonicRhoFunCalc(x, z, massx, weightz, increasing)
## S3 replacement method for class 'fv'
names(x) <- value
needROC(...)
nncleanEngine(kthNND, k, d, ..., tol, maxit,
plothist, lineargs, verbose, Xname)
## S3 method for class 'listof'
pairs(..., plot=TRUE)
## S3 method for class 'solist'
pairs(..., plot=TRUE)
pcf3engine(x, y, z, box, rmax, nrval, correction, delta)
pcfmulti.inhom(X, I, J, lambdaI, lambdaJ, ...,
lambdaX, r, breaks, kernel,
bw, adjust.bw, stoyan, correction,
sigma, adjust.sigma, varcov, update, leaveoneout,
Iname, Jname)
## S3 method for class 'bw.frac'
plot(x, ...)
## S3 method for class 'bw.optim'
plot(x, ..., showopt, optargs)
## S3 method for class 'localpcfmatrix'
plot(x, ...)
## S3 method for class 'plotpairsim'
plot(x, ...)
## S3 method for class 'roc'
plot(x, fmla, ..., main, threshold)
## S3 method for class 'spatialcdf'
plot(x, ..., xlab, ylab, do.points)
polyLaslett(X, ..., oldX, verbose, plotit)
posify(x, eps)
## S3 method for class 'exactppm'
predict(object, locations, ..., eps, dimyx, rule.eps)
prefixfv(x, tagprefix, descprefix, lablprefix, whichtags)
prettyweird(x, p, pieces, neach)
## S3 method for class 'bw.frac'
print(x, ...)
## S3 method for class 'bw.optim'
print(x, ...)
## S3 method for class 'densityfun'
print(x, ...)
## S3 method for class 'envelope'
print(x, ...)
## S3 method for class 'exactppm'
print(x, ...)
## S3 method for class 'fasp'
print(x, ...)
## S3 method for class 'fv'
print(x, ..., tight)
## S3 method for class 'fvfun'
print(x, ...)
## S3 method for class 'hasenvelope'
print(x, ...)
## S3 method for class 'laslett'
print(x, ...)
## S3 method for class 'localpcfmatrix'
print(x, ...)
## S3 method for class 'plotpairsim'
print(x, ...)
## S3 method for class 'quadrattest'
print(x, ...)
## S3 method for class 'rat'
print(x, ...)
## S3 method for class 'Smoothfun'
print(x, ...)
## S3 method for class 'summary.bw.optim'
print(x, ..., digits)
## S3 method for class 'summary.ssf'
print(x, ...)
quadrat.testEngine(X, nx, ny, alternative, method, conditional, CR,
..., nsim, Xcount, xbreaks, ybreaks, tess, fit, df.est, Xname, fitname)
ratfv(df, numer, denom, ..., ratio)
rebadge.as.crossfun(x, main, sub, i, j)
rebadge.as.dotfun(x, main, sub, i)
rebadge.fv(x, new.ylab, new.fname, tags, new.desc, new.labl, new.yexp,
new.dotnames, new.preferred, new.formula, new.tags)
rebadgeLabels(x, new.fname)
reconcile.fv(...)
RelevantDeviation(x, alternative, clamp, scaling)
rename.fv(x, fname, ylab, yexp)
resolve.2D.kernel(...,
sigma, varcov, x, mindist, adjust, bwfun, allow.zero)
resolve.foxall.window(X, Y, W, warn.trim)
resolve.lambda(X, lambda, ...)
## S3 method for class 'ppp'
resolve.lambda(X, lambda, ...,
sigma, varcov, leaveoneout, update, check)
resolve.lambdacross(X, I, J, lambdaI, lambdaJ, ...)
## S3 method for class 'ppp'
resolve.lambdacross(X, I, J, lambdaI, lambdaJ, ...,
lambdaX, sigma, varcov, leaveoneout, update, lambdaIJ,
Iexplain, Jexplain)
resolve.reciplambda(X, lambda, reciplambda, ...)
## S3 method for class 'ppp'
resolve.reciplambda(X, lambda, reciplambda, ...,
sigma, varcov, leaveoneout, update, check)
resolveEinfo(x, what, fallback, warn, atomic)
resolveNullModel(baseline, X, observations, ...)
rhohatEngine(model, covariate, reference, volume, ...,
subset, weights, method, horvitz, smoother,
resolution, spatCovarArgs,
n, bw, adjust, from, to,
bwref, covname, covunits, confidence,
breaks,
modelcall, callstring)
rhohatCalc(ZX, Zvalues, lambda, denom, ...,
weights, lambdaX,
method, horvitz, smoother, do.CI,
n, bw, adjust, from, to,
bwref, covname, confidence, breaks, positiveCI, markovCI,
covunits, modelcall, callstring, savestuff)
rmax.Rigid(X, g)
rmax.rule(fun, W, lambda)
## S3 method for class 'function'
roc(X, covariate, ..., high, tp, method, nsteps)
## S3 method for class 'spatialCDFframe'
roc(X, ..., high, plength)
rocData(covariate, nullmodel, ..., high, p)
rocModel(lambda, nullmodel, ..., high, p, lambdatype)
rocDummy(X, U, covariate, ...,
high, method, subset, weights, weightsU,
p, plength, bw, adjust, CI, alpha, degfreefun)
rocEngine(discrim, nullmodel, ...,
covtype, fittedmodel,
method, high, weights, discrimAtPoints,
p, plength, interpolate, jitter, subset,
bw, adjust, CI, alpha, degfreefun, leftoneout)
rocIm(X, covariate, ..., high, p, plength)
rocSmoothCalc(ZX, ZU,..., weightsX, weightsU,
high, kernel, bw, adjust, alpha, nGrid, p, plength,
doCI, degfreeU)
roseContinuous(ang, rad, unit, ...,
start, clockwise, main, labels, at, do.plot)
scanmeasure(X, ...)
## S3 method for class 'ppp'
scanmeasure(X, r, ..., method)
## S3 method for class 'im'
scanmeasure(X, r, ...)
scanPoisLRTS(nZ, nG, muZ, muG, alternative)
scanBinomLRTS(nZ, nG, muZ, muG, alternative)
second.moment.calc(x, sigma, edge, what, ...,
varcov, expand, obswin, npts, debug)
second.moment.engine(x, sigma, edge, what, ...,
kernel, scalekernel, kerpow,
obswin, varcov, npts, debug, fastgauss)
sewpcf(d, w, denargs, lambda2area, divisor,
zerocor, fast, convert, adaptive, tau, gref, Transform)
sewsmod(d, ff, wt, Ef, rvals, method="smrep", ..., nwtsteps=500)
## S3 method for class 'quadrattest'
shift(X, ...)
simulrecipe(type, expr, envir, csr, pois, constraints)
## S3 method for class 'fv'
StieltjesCalc(M, f, ...)
## S3 method for class 'solist'
Smooth(X, ...)
smoothcrossEngine(Xdata, Xquery, values, sigma, ...,
weights, varcov,
kernel, scalekernel, sorted, cutoff)
smoothpointsEngine(x, values, sigma, ...,
kernel, scalekernel,
weights, varcov, leaveoneout, sorted, cutoff, debug,
shrinknumer, shrinkdenom)
spatialCDFframe(model, covariate, ..., jitter, covariateAtPoints,
make.quantile.function)
spatialCDFtest(model, covariate, test, ...,
dimyx, eps, rule.eps, interpolate, jitter,
nsim, verbose, modelname, covname, dataname)
spatialCDFtestCalc(fra, test, ..., details)
spatialCovariateEvidence(model, covariate, ...)
## S3 method for class 'exactppm'
spatialCovariateEvidence(model, covariate, ..., lambdatype,
dimyx, eps, rule.eps, interpolate, jitter, jitterfactor,
modelname, covname, dataname, subset, clip.predict, raster.action)
## S3 method for class 'ppp'
spatialCovariateEvidence(model, covariate, ..., lambdatype,
dimyx, eps, rule.eps, interpolate, jitter, jitterfactor,
modelname, covname, dataname, subset, clip.predict, raster.action)
sphere.volume(range, nval = 10)
## S3 method for class 'bw.optim'
summary(object, ...)
## S3 method for class 'envelope'
summary(object,...)
tweak.fv.entry(x, current.tag, new.labl, new.desc, new.tag)
tweak.ratfv.entry(x, ...)
twostage.test(X, ..., exponent, nsim, nsimsub,
alternative, reuse, leaveout, interpolate,
savefuns, savepatterns, verbose, badXfatal, testblurb)
twostage.envelope(X, ..., nsim, nsimsub, nrank,
alternative, reuse, leaveout, interpolate,
savefuns, savepatterns, verbose, badXfatal, testlabel)
updateData(model, X, ...)
## Default S3 method:
updateData(model, X, ..., warn)
validate2Dkernel(kernel, fatal)
validate.angles(angles, unit, guess)
validate.weights(x, recip, how, allowzero, allowinf)
vanilla.fv(x)
VarOfWtdMean(marx, weights)
weightedclosepairs(X, r, correction, what)
X2testEngine(OBS, EXP, ..., method, CR, df, nsim,
conditional, alternative, testname, dataname)
Details
These internal spatstat.explore functions should not be called directly by the user. Their names and capabilities may change without warning from one version of spatstat.explore to the next.
Value
The return values of these functions are not documented, and may change without warning.
Spatially Sampled Function
Description
Create an object that represents a spatial function which has been evaluated or sampled at an irregular set of points.
Usage
ssf(loc, val)
Arguments
loc |
The spatial locations at which the function has been evaluated.
A point pattern (object of class |
val |
The function values at these locations.
A numeric vector with one entry for each point of |
Details
An object of class "ssf" represents a real-valued or vector-valued function that has been evaluated or sampled at an irregular set of points. An example would be a spatial covariate that has only been measured at certain locations.
An object of this class also inherits the class "ppp", and is essentially the same as a marked point pattern, except for the class membership, which enables it to be handled in a different way.
There are methods for plot, print etc; see plot.ssf and methods.ssf.
Use unmark to extract only the point locations, and marks.ssf to extract only the function values.
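For example (a sketch):
f <- ssf(cells, nndist(cells))
head(marks(f))     # the sampled function values
unmark(f)          # just the sample locations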
Value
Object of class "ssf"
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
See Also
plot.ssf, methods.ssf, Smooth.ssf, with.ssf, [.ssf.
Examples
ssf(cells, nndist(cells, k=1:3))
Stienen Diagram
Description
Draw the Stienen diagram of a point pattern, or compute the region covered by the Stienen diagram.
Usage
stienen(X, ..., bg = "grey", border = list(bg = NULL))
stienenSet(X, edge=TRUE)
Arguments
X |
Point pattern (object of class |
... |
Arguments passed to |
bg |
Fill colour for circles. |
border |
Either a list of arguments passed to |
edge |
Logical value indicating whether to include the circles at the border of the diagram. |
Details
The Stienen diagram of a point pattern (Stienen, 1982) is formed by drawing a circle around each point of the pattern, with diameter equal to the nearest-neighbour distance for that point. These circles do not overlap. If two points are nearest neighbours of each other, then the corresponding circles touch.
stienenSet(X)
computes the union of these circles and
returns it as a window (object of class "owin"
).
stienen(X)
generates a plot of the Stienen diagram of
the point pattern X
. By default, circles are shaded in grey
if they lie inside the window of X
, and are not shaded
otherwise.
Value
The plotting function stienen
returns NULL
.
The return value of stienenSet
is a window (object of class
"owin"
).
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Stienen, H. (1982) Die Vergroeberung von Karbiden in reinen Eisen-Kohlenstoff Staehlen. Dissertation, RWTH Aachen.
See Also
Examples
Y <- stienenSet(cells)
stienen(redwood)
stienen(redwood, border=list(bg=NULL, lwd=2, cols="red"))
Studentised Permutation Test
Description
Perform a studentised permutation test for a difference between groups of point patterns.
Usage
studpermu.test(X, formula, summaryfunction = Kest,
..., rinterval = NULL, nperm = 999,
use.Tbar = FALSE, minpoints = 20, rsteps = 128,
r = NULL, arguments.in.data = FALSE)
Arguments
X |
Data. Either a |
formula |
Formula describing the grouping, when |
summaryfunction |
Summary function applicable to point patterns. |
... |
Additional arguments passed to |
rinterval |
Interval of distance values |
nperm |
Number of random permutations for the test. |
use.Tbar |
Logical value indicating choice of test statistic.
If |
minpoints |
Minimum permissible number of points in a point pattern for inclusion in the test calculation. |
rsteps |
Number of discretisation steps in the |
r |
Optional vector of distance values as the argument for
|
arguments.in.data |
Logical. If |
Details
This function performs the studentized permutation test of Hahn (2012) for a difference between groups of point patterns.
The first argument X should be either
- a list of lists of point patterns: each element of X will be interpreted as a group of point patterns, assumed to be replicates of the same point process; or
- a hyperframe: one column of the hyperframe should contain point patterns, and another column should contain a factor indicating the grouping. In this case the argument formula should be a formula in the R language specifying the grouping: it should be of the form P ~ G where P is the name of the column of point patterns, and G is the name of the factor.
A group needs to contain at least two point patterns with at least minpoints points in each pattern.
The function returns an object of class "htest" and "studpermutest" that can be printed and plotted. The printout shows the test result and p-value. The plot shows the summary functions for the groups (and the group means if requested).
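A sketch of the list-of-lists form, using simulated patterns purely for illustration:
G1 <- replicate(3, runifrect(50), simplify=FALSE)
G2 <- replicate(3, runifrect(50), simplify=FALSE)
studpermu.test(list(G1, G2), nperm=19)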
Value
Object of class "studpermutest"
.
Author(s)
Ute Hahn.
Modified for spatstat
by
Adrian Baddeley Adrian.Baddeley@curtin.edu.au, Rolf Turner rolfturner@posteo.net and Ege Rubak rubak@math.aau.dk.
References
Hahn, U. (2012) A studentized permutation test for the comparison of spatial point patterns. Journal of the American Statistical Association 107 (498), 754–764.
See Also
Examples
np <- if(interactive()) 99 else 19
testpyramidal <- studpermu.test(pyramidal, Neurons ~ group, nperm=np)
testpyramidal
Distance Between Linear Spaces
Description
Evaluate the distance between two linear subspaces using the measure proposed by Li, Zha and Chiaromonte (2005).
Usage
subspaceDistance(B0, B1)
Arguments
B0 |
Matrix whose columns are a basis for the first subspace. |
B1 |
Matrix whose columns are a basis for the second subspace. |
Details
This algorithm calculates the maximum absolute value of the
eigenvalues of P1-P0
where P0,P1
are the projection
matrices onto the subspaces generated by B0,B1
.
This measure of distance was proposed by Li, Zha and Chiaromonte
(2005). See also Xia (2007).
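Equivalently (a sketch), the distance can be computed from the projection matrices directly:
B0 <- matrix(rnorm(20), 10, 2)
B1 <- matrix(rnorm(20), 10, 2)
P0 <- B0 %*% solve(crossprod(B0)) %*% t(B0)   # projection onto span of B0
P1 <- B1 %*% solve(crossprod(B1)) %*% t(B1)   # projection onto span of B1
max(abs(eigen(P1 - P0, symmetric=TRUE)$values))
subspaceDistance(B0, B1)                      # should agree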
Value
A single numeric value.
Author(s)
Matlab original by Yongtao Guan, translated to R by Suman Rakshit.
References
Guan, Y. and Wang, H. (2010) Sufficient dimension reduction for spatial point processes directed by Gaussian random fields. Journal of the Royal Statistical Society, Series B, 72, 367–387.
Li, B., Zha, H. and Chiaromonte, F. (2005) Contour regression: a general approach to dimension reduction. Annals of Statistics 33, 1580–1616.
Xia, Y. (2007) A constructive approach to the estimation of dimension reduction directions. Annals of Statistics 35, 2654–2690.
Confidence Interval for Threshold of Numerical Predictor
Description
Given a point pattern and a spatial covariate that has some predictive value for the point pattern, compute a confidence interval for the optimal value of the threshold that should be used to convert the covariate to a binary predictor.
Usage
thresholdCI(X, Z, confidence = 0.95, nsim = 1000, parametric = FALSE)
Arguments
X |
Point pattern (object of class |
Z |
Spatial covariate with numerical values.
Either a pixel image (object of class |
confidence |
Confidence level. A number between 0 and 1. |
nsim |
Number of bootstrap simulations to perform. |
parametric |
Logical value specifying whether to use the parametric bootstrap. |
Details
The spatial covariate Z
is assumed to have some utility as a
predictor of the point pattern X
.
This code computes a bootstrap confidence interval
for the best threshold value z
for converting the
numerical predictor to a binary predictor, for use in
techniques such as Weights of Evidence.
Value
A matrix containing upper and lower limits for the
threshold z
and the corresponding upper and lower limits for
the fraction of area of the study region.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Baddeley, A., Brown, W., Milne, R.K., Nair, G., Rakshit, S., Lawrence, T., Phatak, A. and Fu, S.C. (2021) Optimal thresholding of predictors in mineral prospectivity analysis. Natural Resources Research 30 923–969.
See Also
Examples
gold <- rescale(murchison$gold, 1000, "km")
faults <- rescale(murchison$faults, 1000, "km")
distfault <- distfun(faults)
Nsim <- if(interactive()) 250 else 25
thresholdCI(gold, distfault, nsim=Nsim)
Select Threshold to Convert Numerical Predictor to Binary Predictor
Description
Given a point pattern and a spatial covariate that has some predictive value for the point pattern, determine the optimal value of the threshold for converting the covariate to a binary predictor.
Usage
thresholdSelect(X, Z, method = c("Y", "LL", "AR", "t", "C"), Zname)
Arguments
X |
Point pattern (object of class |
Z |
Spatial covariate with numerical values.
Either a pixel image (object of class |
method |
Character string (partially matched) specifying the method to be used to select the optimal threshold value. See Details. |
Zname |
Optional character string giving a short name for the covariate. |
Details
The spatial covariate Z
is assumed to have some utility as a
predictor of the point pattern X
.
This code chooses the best threshold value v
for converting the
numerical predictor Z
to a binary predictor, for use in
techniques such as Weights of Evidence.
The best threshold is selected by maximising the criterion
specified by the argument method
. Options are:
-
method="Y"
(the default): the Youden criterion -
method="LL"
: log-likelihood -
method="AR"
: the Akman-Raftery criterion -
method="t"
: the Studentised Weights-of-Evidence contrast -
method="C"
: the Weights-of-Evidence contrast
These criteria are explained in Baddeley et al (2021).
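For instance (a sketch; the data setup is the same as in the Examples below), the criterion can be switched via method:
gold <- rescale(murchison$gold, 1000, "km")
faults <- rescale(murchison$faults, 1000, "km")
distfault <- distfun(faults)
thresholdSelect(gold, distfault, method="LL")   # log-likelihood criterion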
Value
A single numerical value giving the selected threshold.
The result also belongs to the class "bw.optim"
(see bw.optim.object
)
which can be plotted to show the criterion used to select
the threshold.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Baddeley, A., Brown, W., Milne, R.K., Nair, G., Rakshit, S., Lawrence, T., Phatak, A. and Fu, S.C. (2021) Optimal thresholding of predictors in mineral prospectivity analysis. Natural Resources Research 30 923–969.
See Also
Examples
gold <- rescale(murchison$gold, 1000, "km")
faults <- rescale(murchison$faults, 1000, "km")
distfault <- distfun(faults)
z <- thresholdSelect(gold, distfault)
z
plot(z, xlim=c(0, 20))
Pixel Values Along a Transect
Description
Extract the pixel values of a pixel image at each point along a linear transect.
Usage
transect.im(X, ..., from="bottomleft", to="topright",
nsample=512, click=FALSE, add=FALSE, curve=NULL)
Arguments
X |
A pixel image (object of class |
... |
Ignored. |
from , to |
Optional. Start point and end point of the transect.
Pairs of |
nsample |
Integer. Number of sample locations along the transect. |
click |
Optional.
Logical value.
If |
add |
Logical. If |
curve |
Optional. A specification of a curved transect. See the section on Curved Transect. |
Details
The pixel values of the image X
along a line segment
will be extracted. The result is a function table ("fv"
object)
which can be plotted directly.
If click=TRUE
, then the user is prompted to click two points on
the plot of X
. These endpoints define the transect.
Otherwise, the transect is defined by the endpoints
from
and to
. The default is a diagonal transect from
bottom left to top right of the frame.
Value
An object of class "fv"
which can be plotted.
Curved Transect
If curve
is given, then the transect will be a curve.
The argument curve should be a list with the following entries:
- f: a function in the R language with one argument t.
- tlim: a numeric vector of length 2 giving the range of values of the argument t.
- tname: (optional) a character string giving the symbolic name of the function argument t; defaults to "t".
- tdescrip: (optional) a character string giving a short description of the function argument t; defaults to "curve parameter".
The function f must return a 2-column matrix or data frame specifying the spatial coordinates (x,y) of locations along the curve, determined by the values of the input argument t.
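A sketch of a curved transect: a circular path through the elevation image (centre and radius chosen arbitrarily to fit inside the window):
Z <- bei.extra$elev
circ <- list(f = function(t) cbind(x = 500 + 200 * cos(t),
                                   y = 250 + 200 * sin(t)),
             tlim = c(0, 2 * pi),
             tname = "t", tdescrip = "angle (radians)")
plot(transect.im(Z, curve = circ))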
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au and Rolf Turner rolfturner@posteo.net
See Also
Examples
Z <- bei.extra$elev
plot(transect.im(Z))
Estimate Variance of Summary Statistic by Subdivision
Description
This command estimates the variance of
any summary statistic (such as the K
-function)
by spatial subdivision of a single point pattern dataset.
Usage
varblock(X, fun = Kest,
blocks = quadrats(X, nx = nx, ny = ny),
...,
nx = 3, ny = nx,
confidence=0.95)
Arguments
X |
Point pattern dataset (object of class |
fun |
Function that computes the summary statistic. |
blocks |
Optional. A tessellation that specifies the division of the space into blocks. |
... |
Arguments passed to |
nx , ny |
Optional. Number of rectangular blocks
in the |
confidence |
Confidence level, as a fraction between 0 and 1. |
Details
This command computes an estimate of the variance of
the summary statistic fun(X)
from a single point pattern
dataset X
using a subdivision method.
It can be used to plot confidence intervals
for the true value of a summary function such as the K
-function.
The window containing X
is divided into pieces by
an nx * ny
array of rectangles
(or is divided into pieces of more general shape,
according to the argument blocks
if it is present).
The summary statistic fun
is applied to each of the
corresponding sub-patterns of X
as described below.
The pointwise sample mean, sample variance and sample standard deviation of these summary statistics are then computed, and pointwise confidence intervals are constructed at the specified confidence level (defaulting to 95 percent).
The variance is estimated by equation (4.21) of Diggle (2003, page 52).
This assumes that the point pattern X
is stationary.
For further details see Diggle (2003, pp 52–53).
The estimate of the summary statistic from each block is computed as follows. For most functions fun, the estimate from block B is computed by finding the subset of X consisting of points that fall inside B, and applying fun to these points, by calling fun(X[B]).
However, if fun is the K-function Kest, or any function which has an argument called domain, the estimate for each block B is computed by calling fun(X, domain=B). In the case of the K-function this means that the estimate from block B is computed by counting pairs of points in which the first point lies in B, while the second point may lie anywhere.
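The distinction can be seen directly for a single block (a sketch):
B <- tiles(quadrats(amacrine, nx=2, ny=2))[[1]]
K1 <- Kest(amacrine, domain=B)    # first point of each pair lies in B
K2 <- Kest(amacrine[B])           # both points of each pair lie in B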
Value
A function value table (object of class "fv"
)
that contains the result of fun(X)
as well as
the sample mean, sample variance and sample standard deviation
of the block estimates, together with
the upper and lower two-standard-deviation confidence limits.
Errors
If the blocks are too small, there may be insufficient data
in some blocks, and the function fun
may report an error.
If this happens, you need to take larger blocks.
An error message about incompatibility may occur.
The different function estimates may be incompatible in some cases,
for example, because they use different default edge corrections
(typically because the tiles of the tessellation are not the same kind
of geometric object as the window of X
, or because the default
edge correction depends on the number of points). To prevent
this, specify the choice of edge correction,
in the correction
argument to fun
, if it has one.
An alternative to varblock
is Loh's mark bootstrap
lohboot
.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
References
Diggle, P.J. (2003) Statistical analysis of spatial point patterns, Second edition. Arnold.
See Also
tess, quadrats for basic manipulation.
lohboot for an alternative bootstrap technique.
Examples
v <- varblock(amacrine, Kest, nx=4, ny=2)
v <- varblock(amacrine, Kcross, nx=4, ny=2)
if(interactive()) plot(v, iso ~ r, shade=c("hiiso", "loiso"))
Evaluate an Expression in a Function Table
Description
Evaluate an R expression in a
function value table (object of class "fv"
).
Usage
## S3 method for class 'fv'
with(data, expr, ..., fun = NULL, enclos=NULL)
Arguments
data |
A function value table (object of class |
expr |
The expression to be evaluated. An R language
expression, which may involve the names of columns in |
... |
Ignored. |
fun |
Logical value, specifying whether the result
should be interpreted as another function ( |
enclos |
An environment in which to search for variables that are
not found in |
Details
This is a method for the generic command with
for an object of class "fv"
(function value table).
An object of class "fv"
is a convenient way of storing and
plotting several different estimates of the same function. It is
effectively a data frame with extra attributes.
See fv.object
for further explanation.
This command makes it possible to perform computations that involve
different estimates of the same function. For example we use it to compute
the arithmetic difference between two different edge-corrected
estimates of the K
function of a point pattern.
The argument expr should be an R language expression. The expression may involve
- the name of any column in data, referring to one of the estimates of the function;
- the symbol . which stands for all the available estimates of the function;
- the symbol .y which stands for the recommended estimate of the function (in an "fv" object, one of the estimates is always identified as the recommended estimate);
- the symbol .x which stands for the argument of the function;
- global constants or functions.
See the Examples. The expression should be capable of handling vectors and matrices.
The interpretation of the argument fun
is as follows:
-
If
fun=FALSE
, the result of evaluating the expressionexpr
will be returned as a numeric vector, matrix or data frame. -
If
fun=TRUE
, then the result of evaluatingexpr
will be interpreted as containing the values of a new function. The return value will be an object of class"fv"
. (This can only happen if the result has the right dimensions.) -
The default is
fun=TRUE
if the result of evaluatingexpr
has more than one column, andfun=FALSE
otherwise.
To perform calculations involving several objects of
class "fv"
, use eval.fv
.
Value
A function value table (object of class "fv"
)
or a numeric vector or data frame.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au
and Rolf Turner rolfturner@posteo.net
See Also
with, fv.object, eval.fv, Kest
Examples
# compute 4 estimates of the K function
X <- runifrect(42)
K <- Kest(X)
plot(K)
# derive 4 estimates of the L function L(r) = sqrt(K(r)/pi)
L <- with(K, sqrt(./pi))
plot(L)
# compute 4 estimates of V(r) = L(r)/r
V <- with(L, ./.x)
plot(V)
# compute the maximum absolute difference between
# the isotropic and translation correction estimates of K(r)
D <- with(K, max(abs(iso - trans)))
Evaluate Expression in a Spatially Sampled Function
Description
Given a spatially sampled function, evaluate an expression involving the function values.
Usage
apply.ssf(X, ...)
## S3 method for class 'ssf'
with(data, ...)
Arguments
X , data |
A spatially sampled function (object of class |
... |
Arguments passed to |
Details
An object of class "ssf"
represents a
function (real- or vector-valued) that has been
sampled at a finite set of points.
It contains a data frame
which provides the function values
at the sample points.
In with.ssf
, the expression specified by ...
will be evaluated in this dataframe.
In apply.ssf
, the dataframe will be subjected to
the apply
operator using the additional arguments
...
.
If the result of evaluation
is a data frame with one row for each data point,
or a numeric vector with one entry for each data point,
then the result will be an object of class "ssf"
containing this information. Otherwise, the result will be
a numeric vector.
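A sketch of the apply.ssf form (the margin and summary function are passed straight to apply):
a <- ssf(cells, data.frame(d=nndist(cells), i=1:npoints(cells)))
apply.ssf(a, 1, sum)    # row sums: one value per data point, so an "ssf" is returned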
Value
An object of class "ssf"
or a numeric vector.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
See Also
Examples
a <- ssf(cells, data.frame(d=nndist(cells), i=1:npoints(cells)))
with(a, i/d)
Youden Statistic
Description
Calculate the Youden statistic for an ROC curve.
Usage
youden(X, sign = c("positive", "absolute", "negative"))
Arguments
X |
ROC curve (object of class |
sign |
Character string indicating which version of the statistic to calculate. |
Details
For a receiver operating characteristic (ROC) curve, the
Youden statistic is the maximum vertical deviation between the curve
and the diagonal line y=x
.
Suppose R(p)
denotes the ROC curve as a function of the
horizontal coordinate p
.
If sign="positive"
(the default), deviation is defined as the
positive part \max(0, R(p)-p)
.
If sign="absolute"
, deviation is defined as the
absolute value | R(p) - p|
.
If sign="negative"
, deviation is defined as the
negative part \max(0, p - R(p))
.
The maximum deviation over all values of p
is determined.
The result is always nonnegative.
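Equivalently (a sketch), the default statistic can be computed from the curve with with.fv, using .y for the recommended estimate and .x for the horizontal coordinate:
a <- with(split(mucosa), roc(ECL, "y", baseline=other, high=FALSE))
max(pmax(0, with(a, .y - .x)))    # positive-part deviation; cf. youden(a)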
Value
Numeric value or vector, containing nonnegative values.
Author(s)
Adrian Baddeley Adrian.Baddeley@curtin.edu.au.
References
Youden, W.J. (1950) Index for rating diagnostic tests. Cancer 3, 32–35.
See Also
Examples
a <- with(split(mucosa), roc(ECL, "y", baseline=other, high=FALSE))
youden(a)