Threshold selection via coefficient of variation

This function computes the empirical coefficient of variation and computes a weighted statistic comparing the squared distance with the theoretical coefficient variation corresponding to a specific shape parameter (estimated from the data using a moment estimator as the value minimizing the test statistic, or using maximum likelihood). The procedure stops if there are no more than 10 exceedances above the highest threshold.

Usage

thselect.cv(
  xdat,
  thresh,
  method = c("mle", "wcv", "cv"),
  nsim = 999L,
  nthresh = 10L,
  level = 0.05,
  lazy = FALSE,
  plot = FALSE
)

Arguments

xdat: [vector] vector of observations
thresh: [vector] vector of threshold. If missing, set to $p^k$ for $k=0$ to $k=$nthresh
method: [string], either moment estimator for the (weighted) coefficient of variation (wcv and cv) or maximum likelihood (mle)
nsim: [integer] number of bootstrap replications
nthresh: [integer] number of thresholds, if thresh is not supplied by the user
level: [numeric] probability level for sequential testing procedure
lazy: [logical] compute the bootstrap p-value until the test stops rejecting at level level? Default to FALSE
plot: [logical] if TRUE, returns a plot of the p-value path

Value

a list with elements

thresh: value of threshold returned by the procedure, NA if the hypothesis is rejected at all thresholds
thresh0: sorted vector of candidate thresholds
cindex: index of selected threshold among thresh0 or NA if none returned
pval: bootstrap p-values, with NA if lazy and the p-value exceeds level at lower thresholds
shape: shape parameter estimates
nexc: number of exceedances of each threshold thresh0
method: estimation method for the shape parameter

Note

The authors suggest transformation of $$Y = -1/(X + c) + 1/c,$$ where $X$ are exceedances and $c=\sigma/\xi$ is the ratio of estimated scale and shape parameters. For heavy-tailed distributions with $\xi > 0.25$, this may be preferable, but must be conducted outside of the function.

References

del Castillo, J. and M. Padilla (2016). Modelling extreme values by the residual coefficient of variation, SORT, 40(2), pp. 303–320.

Examples

thselect.cv(
 xdat = rgp(1000),
 thresh = qgp(seq(0,0.9, by = 0.1)),
 nsim = 99,
 lazy = TRUE,
 plot = TRUE)