Help for package ReIns

Version:

1.0.15

Date:

2024-11-30

Title:

Functions from "Reinsurance: Actuarial and Statistical Aspects"

Description:

Functions from the book "Reinsurance: Actuarial and Statistical Aspects" (2017) by Hansjoerg Albrecher, Jan Beirlant and Jef Teugels https://www.wiley.com/en-us/Reinsurance%3A+Actuarial+and+Statistical+Aspects-p-9780470772683.

Maintainer:

Tom Reynkens <tomreynkens.r@gmail.com>

Depends:

R (≥ 3.4)

Imports:

stats, graphics, methods, survival, utils, parallel, foreach (≥ 1.4.1), doParallel (≥ 1.0.10), Rcpp (≥ 0.12.12)

Suggests:

testthat, knitr, rmarkdown, interval, Icens

LinkingTo:

Rcpp

License:

GPL-2 | GPL-3 [expanded from: GPL (≥ 2)]

URL:

https://github.com/TReynkens/ReIns

BugReports:

https://github.com/TReynkens/ReIns/issues

VignetteBuilder:

knitr

Encoding:

UTF-8

NeedsCompilation:

yes

ByteCompile:

yes

Packaged:

2024-11-30 13:02:34 UTC; tom

Author:

Tom Reynkens

[aut, cre], Roel Verbelen

[aut] (R code for Mixed Erlang distribution), Anastasios Bardoutsos [ctb] (Original R code for cEPD estimator), Dries Cornilly [ctb] (Original R code for EVT estimators for truncated data), Yuri Goegebeur [ctb] (Original S-Plus code for basic EVT estimators), Klaus Herrmann [ctb] (Original R code for GPD estimator)

Repository:

CRAN

Date/Publication:

2024-12-02 09:30:05 UTC

The Burr distribution

Description

Density, distribution function, quantile function and random generation for the Burr distribution (type XII).

Usage

dburr(x, alpha, rho, eta = 1, log = FALSE)
pburr(x, alpha, rho, eta = 1, lower.tail = TRUE, log.p = FALSE)
qburr(p, alpha, rho, eta = 1, lower.tail = TRUE, log.p = FALSE)
rburr(n, alpha, rho, eta = 1)

Arguments

x

Vector of quantiles.

p

Vector of probabilities.

n

Number of observations.

alpha

The \alpha parameter of the Burr distribution, a strictly positive number.

rho

The \rho parameter of the Burr distribution, a strictly negative number.

eta

The \eta parameter of the Burr distribution, a strictly positive number. The default value is 1.

log

Logical indicating if the densities are given as \log(f), default is FALSE.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

The Cumulative Distribution Function (CDF) of the Burr distribution is equal to F(x) = 1-((\eta+x^{-\rho\times\alpha})/\eta)^{1/\rho} for all x \ge 0 and F(x)=0 otherwise. We need that \alpha>0, \rho<0 and \eta>0.

Beirlant et al. (2004) uses parameters \eta, \tau, \lambda which correspond to \eta, \tau=-\rho\times\alpha and \lambda=-1/\rho.

Value

dburr gives the density function evaluated in x, pburr the CDF evaluated in x and qburr the quantile function evaluated in p. The length of the result is equal to the length of x or p.

rburr returns a random sample of length n.

Author(s)

Tom Reynkens.

References

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

# Plot of the PDF
x <- seq(0, 10, 0.01)
plot(x, dburr(x, alpha=2, rho=-1), xlab="x", ylab="PDF", type="l")

# Plot of the CDF
x <- seq(0, 10, 0.01)
plot(x, pburr(x, alpha=2, rho=-1), xlab="x", ylab="CDF", type="l")

Conditional Tail Expectation

Description

Compute Conditional Tail Expectation (CTE) CTE_{1-p} of the fitted spliced distribution.

Usage

CTE(p, splicefit)

ES(p, splicefit)

Arguments

p

The probability associated with the CTE (we estimate CTE_{1-p}).

splicefit

A SpliceFit object, e.g. output from SpliceFitPareto, SpliceFiticPareto or SpliceFitGPD.

Details

The Conditional Tail Expectation is defined as

CTE_{1-p} = E(X | X>Q(1-p)) = E(X | X>VaR_{1-p}) = VaR_{1-p} + \Pi(VaR_{1-p})/p,

where \Pi(u)=E((X-u)_+) is the premium of the excess-loss insurance with retention u.

If the CDF is continuous in p, we have CTE_{1-p}=TVaR_{1-p}= 1/p \int_0^p VaR_{1-s} ds with TVaR the Tail Value-at-Risk.

See Reynkens et al. (2017) and Section 4.6 of Albrecher et al. (2017) for more details.

The ES function is the same function as CTE but is deprecated.

Value

Vector with the CTE corresponding to each element of p.

Author(s)

Tom Reynkens with R code from Roel Verbelen for the mixed Erlang quantiles.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Reynkens, T., Verbelen, R., Beirlant, J. and Antonio, K. (2017). "Modelling Censored Losses Using Splicing: a Global Fit Strategy With Mixed Erlang and Extreme Value Distributions". Insurance: Mathematics and Economics, 77, 65–77.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758

Examples

## Not run: 

# Pareto random sample
X <- rpareto(1000, shape = 2)

# Splice ME and Pareto
splicefit <- SpliceFitPareto(X, 0.6)

p <- seq(0.01, 0.99, 0.01)
# Plot of CTE
plot(p, CTE(p, splicefit), type="l", xlab="p", ylab=bquote(CTE[1-p]))

## End(Not run)

EPD estimator

Description

Fit the Extended Pareto Distribution (GPD) to the exceedances (peaks) over a threshold. Optionally, these estimates are plotted as a function of k.

Usage

EPD(data, rho = -1, start = NULL, direct = FALSE, warnings = FALSE, 
    logk = FALSE, plot = FALSE, add = FALSE, main = "EPD estimates of the EVI", ...)

Arguments

data

Vector of n observations.

rho

A parameter for the \rho-estimator of Fraga Alves et al. (2003) when strictly positive or choice(s) for \rho if negative. Default is -1.

start

Vector of length 2 containing the starting values for the optimisation. The first element is the starting value for the estimator of \gamma and the second element is the starting value for the estimator of \kappa. This argument is only used when direct=TRUE. Default is NULL meaning the initial value for \gamma is the Hill estimator and the initial value for \kappa is 0.

direct

Logical indicating if the parameters are obtained by directly maximising the log-likelihood function, see Details. Default is FALSE.

warnings

Logical indicating if possible warnings from the optimisation function are shown, default is FALSE.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates of \gamma should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of \gamma should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "EPD estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

We fit the Extended Pareto distribution to the relative excesses over a threshold (X/u). The EPD has distribution function F(x) = 1-(x(1+\kappa-\kappa x^{\tau}))^{-1/\gamma} with \tau = \rho/\gamma <0<\gamma and \kappa>\max(-1,1/\tau).

The parameters are determined using MLE and there are two possible approaches: maximise the log-likelihood directly (direct=TRUE) or follow the approach detailed in Beirlant et al. (2009) (direct=FALSE). The latter approach uses the score functions of the log-likelihood.

See Section 4.2.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma

Vector of the corresponding estimates for the \gamma parameter of the EPD.

kappa

Vector of the corresponding MLE estimates for the \kappa parameter of the EPD.

tau

Vector of the corresponding estimates for the \tau parameter of the EPD using Hill estimates and values for \rho.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant, J., Joossens, E. and Segers, J. (2009). "Second-Order Refined Peaks-Over-Threshold Modelling for Heavy-Tailed Distributions." Journal of Statistical Planning and Inference, 139, 2800–2815.

Fraga Alves, M.I. , Gomes, M.I. and de Haan, L. (2003). "A New Class of Semi-parametric Estimators of the Second Order Parameter." Portugaliae Mathematica, 60, 193–214.

Examples

data(secura)

# EPD estimates for the EVI
epd <- EPD(secura$size, plot=TRUE)

# Compute return periods
ReturnEPD(secura$size, 10^10, gamma=epd$gamma, kappa=epd$kappa, 
          tau=epd$tau, plot=TRUE)

Fit EPD using MLE

Description

Fit the Extended Pareto Distribution (EPD) to data using Maximum Likelihood Estimation (MLE).

Usage

EPDfit(data, tau, start = c(0.1, 1), warnings = FALSE)

Arguments

data

Vector of n observations.

tau

Value for the \tau parameter of the EPD.

start

warnings

Logical indicating if possible warnings from the optimisation function are shown, default is FALSE.

Details

See Section 4.2.1 of Albrecher et al. (2017) for more details.

Value

A vector with the MLE estimate for the \gamma parameter of the EPD as the first component and the MLE estimate for the \kappa parameter of the EPD as the second component.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

data(soa)

# Look at last 500 observations of SOA data
SOAdata <- sort(soa$size)[length(soa$size)-(0:499)]

# Fit EPD to last 500 observations
res <- EPDfit(SOAdata/sort(soa$size)[500], tau=-1)

EVT fit

Description

Create an S3 object using an EVT (Extreme Value Theory) fit.

Usage

EVTfit(gamma, endpoint = NULL, sigma = NULL)

Arguments

gamma

Vector of estimates for \gamma.

endpoint

Vector of endpoints (with the same length as gamma). When NULL (default), a vector containing Inf for each value of gamma will be used.

sigma

Vector of scale estimates for the GPD (with the same length as gamma). When NULL (default), not included in the object.

Details

See Reynkens et al. (2017) and Section 4.3 of Albrecher et al. (2017) for details.

Value

An S3 object containing the above input arguments.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

# Create MEfit object
mefit <- MEfit(p=c(0.65,0.35), shape=c(39,58), theta=16.19, M=2)

# Create EVTfit object
evtfit <- EVTfit(gamma=c(0.76,0.64), endpoint=c(39096, Inf))

# Create SpliceFit object
splicefit <- SpliceFit(const=c(0.5,0.996), trunclower=0, t=c(1020,39096), type=c("ME","TPa","Pa"),
                       MEfit=mefit, EVTfit=evtfit)

# Show summary
summary(splicefit)

Estimates for excess-loss premiums using EPD estimates

Description

Estimate premiums of excess-loss reinsurance with retention R and limit L using EPD estimates.

Usage

ExcessEPD(data, gamma, kappa, tau, R, L = Inf, warnings = TRUE, plot = TRUE, add = FALSE, 
          main = "Estimates for premium of excess-loss insurance", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI, obtained from EPD.

kappa

Vector of n-1 estimates for \kappa, obtained from EPD.

tau

Vector of n-1 estimates for \tau, obtained from EPD.

R

The retention level of the (re-)insurance.

L

The limit of the (re-)insurance, default is Inf.

warnings

Logical indicating if warnings are displayed, default is TRUE.

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates for premium of excess-loss insurance".

...

Additional arguments for the plot function, see plot for more details.

Details

We need that u \ge X_{n-k,n}, the (k+1)-th largest observation. If this is not the case, we return NA for the premium. A warning will be issued in that case if warnings=TRUE.

The premium for the excess-loss insurance with retention R and limit L is given by

E(\min{(X-R)_+, L}) = \Pi(R) - \Pi(R+L)

where \Pi(u)=E((X-u)_+)=\int_u^{\infty} (1-F(z)) dz is the premium of the excess-loss insurance with retention u. When L=\infty, the premium is equal to \Pi(R).

We estimate \Pi by

\hat{\Pi}(u) = (k+1)/(n+1) \times (X_{n-k,n})^{1/\hat{\gamma}} \times ((1-\hat{\kappa}/\hat{\gamma})(1/\hat{\gamma}-1)^{-1}u^{1-1/\hat{\gamma}} + \hat{\kappa}/(\hat{\gamma}X_{n-k,n}^{\hat{\tau}})(1/\hat{\gamma}-\hat{\tau}-1)^{-1}u^{1+\hat{\tau}-1/\hat{\gamma}})

with \hat{\gamma}, \hat{\kappa} and \hat{\tau} the estimates for the parameters of the EPD.

See Section 4.6 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

premium

The corresponding estimates for the premium.

R

The retention level of the (re-)insurance.

L

The limit of the (re-)insurance.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

data(secura)

# EPD estimator
epd <- EPD(secura$size)

# Premium of excess-loss insurance with retention R
R <- 10^7
ExcessEPD(secura$size, gamma=epd$gamma, kappa=epd$kappa, tau=epd$tau, R=R, ylim=c(0,2*10^4))

Estimates for excess-loss premiums using GPD-MLE estimates

Description

Estimate premiums of excess-loss reinsurance with retention R and limit L using GPD-MLE estimates.

Usage

ExcessGPD(data, gamma, sigma, R, L = Inf, warnings = TRUE, plot = TRUE, add = FALSE, 
          main = "Estimates for premium of excess-loss insurance", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI obtained from GPDmle.

sigma

Vector of n-1 estimates for \sigma obtained from GPDmle.

R

The retention level of the (re-)insurance.

L

The limit of the (re-)insurance, default is Inf.

warnings

Logical indicating if warnings are displayed, default is TRUE.

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates for premium of excess-loss insurance".

...

Additional arguments for the plot function, see plot for more details.

Details

We need that u \ge X_{n-k,n}, the (k+1)-th largest observation. If this is not the case, we return NA for the premium. A warning will be issued in that case if warnings=TRUE. One should then use global fits: ExcessSplice.

The premium for the excess-loss insurance with retention R and limit L is given by

E(\min{(X-R)_+, L}) = \Pi(R) - \Pi(R+L)

where \Pi(u)=E((X-u)_+)=\int_u^{\infty} (1-F(z)) dz is the premium of the excess-loss insurance with retention u. When L=\infty, the premium is equal to \Pi(R).

We estimate \Pi by

\hat{\Pi}(u) = (k+1)/(n+1) \times \hat{\sigma}_k/ (1-\hat{\gamma}_k) \times (1+\hat{\gamma}_k/\hat{\sigma}_k (u-X_{n-k,n}))^{1-1/\hat{\gamma}_k},

with \hat{\gamma}_k and \hat{\sigma}_k the estimates for the parameters of the GPD.

See Section 4.6 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

premium

The corresponding estimates for the premium.

R

The retention level of the (re-)insurance.

L

The limit of the (re-)insurance.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

data(secura)

# GPDmle estimator
mle <- GPDmle(secura$size)

# Premium of excess-loss insurance with retention R
R <- 10^7
ExcessGPD(secura$size, gamma=mle$gamma, sigma=mle$sigma, R=R, ylim=c(0,2*10^4))

Estimates for excess-loss premiums using a Pareto model

Description

Estimate premiums of excess-loss reinsurance with retention R and limit L using a (truncated) Pareto model.

Usage

ExcessPareto(data, gamma, R, L = Inf, endpoint = Inf, warnings = TRUE, plot = TRUE, 
        add = FALSE, main = "Estimates for premium of excess-loss insurance", ...)
        
ExcessHill(data, gamma, R, L = Inf, endpoint = Inf, warnings = TRUE, plot = TRUE, 
        add = FALSE, main = "Estimates for premium of excess-loss insurance", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI, obtained from Hill or trHill.

R

The retention level of the (re-)insurance.

L

The limit of the (re-)insurance, default is Inf.

endpoint

Endpoint for the truncated Pareto distribution. When Inf, the default, the ordinary Pareto model is used.

warnings

Logical indicating if warnings are displayed, default is TRUE.

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates for premium of excess-loss insurance".

...

Additional arguments for the plot function, see plot for more details.

Details

The premium for the excess-loss insurance with retention R and limit L is given by

E(\min{(X-R)_+, L}) = \Pi(R) - \Pi(R+L)

where \Pi(u)=E((X-u)_+)=\int_u^{\infty} (1-F(z)) dz is the premium of the excess-loss insurance with retention u. When L=\infty, the premium is equal to \Pi(R).

We estimate \Pi (for the untruncated Pareto distribution) by

\hat{\Pi}(u) = (k+1)/(n+1) / (1/H_{k,n}-1) \times (X_{n-k,n}^{1/H_{k,n}} u^{1-1/H_{k,n}}),

with H_{k,n} the Hill estimator.

The ExcessHill function is the same function but with a different name for compatibility with old versions of the package.

See Section 4.6 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

premium

The corresponding estimates for the premium.

R

The retention level of the (re-)insurance.

L

The limit of the (re-)insurance.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

data(secura)

# Hill estimator
H <- Hill(secura$size)

# Premium of excess-loss insurance with retention R
R <- 10^7
ExcessPareto(secura$size, H$gamma, R=R)

Estimates for excess-loss premiums using splicing

Description

Estimate premiums of excess-loss reinsurance with retention R and limit L using fitted spliced distribution.

Usage

ExcessSplice(R, L=Inf, splicefit)

Arguments

R

The retention level of the (re-)insurance or a vector of retention levels for the (re-)insurance.

L

The limit for the (re-)insurance or a vector of limits for the (re-)insurance, default is Inf.

splicefit

A SpliceFit object, e.g. output from SpliceFitPareto, SpliceFiticPareto or SpliceFitGPD.

Details

The premium for the excess-loss insurance with retention R and limit L is given by

E(\min{(X-R)_+, L}) = \Pi(R) - \Pi(R+L)

where \Pi(u)=E((X-u)_+)=\int_u^{\infty} (1-F(z)) dz is the premium of the excess-loss insurance with retention u. When L=\infty, the premium is equal to \Pi(R).

See Reynkens et al. (2017) and Section 4.6 of Albrecher et al. (2017) for more details.

Value

An estimate for the premium is returned (for every value of R).

Author(s)

Tom Reynkens with R code from Roel Verbelen for the estimates for the excess-loss premiums using the mixed Erlang distribution.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758

Examples

## Not run: 

# Pareto random sample
X <- rpareto(1000, shape = 2)

# Splice ME and Pareto
splicefit <- SpliceFitPareto(X, 0.8)

# Excess-loss premium 
ExcessSplice(R=2, splicefit=splicefit)

## End(Not run)

Exponential quantile plot

Description

Computes the empirical quantiles of a data vector and the theoretical quantiles of the standard exponential distribution. These quantiles are then plotted in an exponential QQ-plot with the theoretical quantiles on the x-axis and the empirical quantiles on the y-axis.

Usage

ExpQQ(data, plot = TRUE, main = "Exponential QQ-plot", ...)

Arguments

data

Vector of n observations.

plot

Logical indicating if the quantiles should be plotted in an Exponential QQ-plot, default is TRUE.

main

Title for the plot, default is "Exponential QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The exponential QQ-plot is defined as

( -\log(1-i/(n+1)), X_{i,n} )

for i=1,...,n, with X_{i,n} the i-th order statistic of the data.

Note that the mean excess plot is the derivative plot of the Exponential QQ-plot.

See Section 4.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

eqq.the

Vector of the theoretical quantiles from a standard exponential distribution.

eqq.emp

Vector of the empirical quantiles from the data.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(norwegianfire)

# Exponential QQ-plot for Norwegian Fire Insurance data for claims in 1976.
ExpQQ(norwegianfire$size[norwegianfire$year==76])

# Pareto QQ-plot for Norwegian Fire Insurance data for claims in 1976.
ParetoQQ(norwegianfire$size[norwegianfire$year==76])

The Extended Pareto Distribution

Description

Density, distribution function, quantile function and random generation for the Extended Pareto Distribution (EPD).

Usage

depd(x, gamma, kappa, tau = -1, log = FALSE)
pepd(x, gamma, kappa, tau = -1, lower.tail = TRUE, log.p = FALSE)
qepd(p, gamma, kappa, tau = -1, lower.tail = TRUE, log.p = FALSE)
repd(n, gamma, kappa, tau = -1)

Arguments

x

Vector of quantiles.

p

Vector of probabilities.

n

Number of observations.

gamma

The \gamma parameter of the EPD, a strictly positive number.

kappa

The \kappa parameter of the EPD. It should be larger than \max\{-1,1/\tau\}.

tau

The \tau parameter of the EPD, a strictly negative number. Default is -1.

log

Logical indicating if the densities are given as \log(f), default is FALSE.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

The Cumulative Distribution Function (CDF) of the EPD is equal to F(x) = 1-(x(1+\kappa-\kappa x^{\tau}))^{-1/\gamma} for all x > 1 and F(x)=0 otherwise.

Note that an EPD random variable with \tau=-1 and \kappa=\gamma/\sigma-1 is GPD distributed with \mu=1, \gamma and \sigma.

Value

depd gives the density function evaluated in x, pepd the CDF evaluated in x and qepd the quantile function evaluated in p. The length of the result is equal to the length of x or p.

repd returns a random sample of length n.

Author(s)

Tom Reynkens.

References

Examples

# Plot of the PDF
x <- seq(0, 10, 0.01)
plot(x, depd(x, gamma=1/2, kappa=1, tau=-1), xlab="x", ylab="PDF", type="l")

# Plot of the CDF
x <- seq(0, 10, 0.01)
plot(x, pepd(x, gamma=1/2, kappa=1, tau=-1), xlab="x", ylab="CDF", type="l")

The Frechet distribution

Description

Density, distribution function, quantile function and random generation for the Fréchet distribution (inverse Weibull distribution).

Usage

dfrechet(x, shape, loc = 0, scale = 1, log = FALSE)
pfrechet(x, shape, loc = 0, scale = 1, lower.tail = TRUE, log.p = FALSE)
qfrechet(p, shape, loc = 0, scale = 1, lower.tail = TRUE, log.p = FALSE)
rfrechet(n, shape, loc = 0, scale = 1)

Arguments

x

Vector of quantiles.

p

Vector of probabilities.

n

Number of observations.

shape

Shape parameter of the Fréchet distribution.

loc

Location parameter of the Fréchet distribution, default is 0.

scale

Scale parameter of the Fréchet distribution, default is 1.

log

Logical indicating if the densities are given as \log(f), default is FALSE.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

The Cumulative Distribution Function (CDF) of the Fréchet distribution is equal to F(x) = \exp(-((x-loc)/scale)^{-shape}) for all x \ge loc and F(x)=0 otherwise. Both shape and scale need to be strictly positive.

Value

dfrechet gives the density function evaluated in x, pfrechet the CDF evaluated in x and qfrechet the quantile function evaluated in p. The length of the result is equal to the length of x or p.

rfrechet returns a random sample of length n.

Author(s)

Tom Reynkens.

Examples

# Plot of the PDF
x <- seq(1,10,0.01)
plot(x, dfrechet(x, shape=2), xlab="x", ylab="PDF", type="l")

# Plot of the CDF
x <- seq(1,10,0.01)
plot(x, pfrechet(x, shape=2), xlab="x", ylab="CDF", type="l")

The generalised Pareto distribution

Description

Density, distribution function, quantile function and random generation for the Generalised Pareto Distribution (GPD).

Usage

dgpd(x, gamma, mu = 0, sigma, log = FALSE)
pgpd(x, gamma, mu = 0, sigma, lower.tail = TRUE, log.p = FALSE)
qgpd(p, gamma, mu = 0, sigma, lower.tail = TRUE, log.p = FALSE)
rgpd(n, gamma, mu = 0, sigma)

Arguments

x

Vector of quantiles.

p

Vector of probabilities.

n

Number of observations.

gamma

The \gamma parameter of the GPD, a real number.

mu

The \mu parameter of the GPD, a strictly positive number. Default is 0.

sigma

The \sigma parameter of the GPD, a strictly positive number.

log

Logical indicating if the densities are given as \log(f), default is FALSE.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

The Cumulative Distribution Function (CDF) of the GPD for \gamma \neq 0 is equal to F(x) = 1-(1+\gamma (x-\mu)/\sigma)^{-1/\gamma} for all x \ge \mu and F(x)=0 otherwise. When \gamma=0, the CDF is given by F(x) = 1-\exp((x-\mu)/\sigma) for all x \ge \mu and F(x)=0 otherwise.

Value

dgpd gives the density function evaluated in x, pgpd the CDF evaluated in x and qgpd the quantile function evaluated in p. The length of the result is equal to the length of x or p.

rgpd returns a random sample of length n.

Author(s)

Tom Reynkens.

References

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

# Plot of the PDF
x <- seq(0, 10, 0.01)
plot(x, dgpd(x, gamma=1/2, sigma=5), xlab="x", ylab="PDF", type="l")

# Plot of the CDF
x <- seq(0, 10, 0.01)
plot(x, pgpd(x, gamma=1/2, sigma=5), xlab="x", ylab="CDF", type="l")

Fit GPD using MLE

Description

Fit the Generalised Pareto Distribution (GPD) to data using Maximum Likelihood Estimation (MLE).

Usage

GPDfit(data, start = c(0.1, 1), warnings = FALSE)

Arguments

data

Vector of n observations.

start

warnings

Logical indicating if possible warnings from the optimisation function are shown, default is FALSE.

Details

See Section 4.2.2 in Albrecher et al. (2017) for more details.

Value

A vector with the MLE estimate for the \gamma parameter of the GPD as the first component and the MLE estimate for the \sigma parameter of the GPD as the second component.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur and R code from Klaus Herrmann.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(soa)

# Look at last 500 observations of SOA data
SOAdata <- sort(soa$size)[length(soa$size)-(0:499)]

# Fit GPD to last 500 observations
res <- GPDfit(SOAdata-sort(soa$size)[500])

GPD-ML estimator

Description

Fit the Generalised Pareto Distribution (GPD) to the exceedances (peaks) over a threshold using Maximum Likelihood Estimation (MLE). Optionally, these estimates are plotted as a function of k.

Usage

GPDmle(data, start = c(0.1,1), warnings = FALSE, logk = FALSE, 
       plot = FALSE, add = FALSE, main = "POT estimates of the EVI", ...)

POT(data, start = c(0.1,1), warnings = FALSE, logk = FALSE, 
    plot = FALSE, add = FALSE, main = "POT estimates of the EVI", ...)

Arguments

data

Vector of n observations.

start

warnings

Logical indicating if possible warnings from the optimisation function are shown, default is FALSE.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates of \gamma should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of \gamma should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "POT estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

The POT function is the same function but with a different name for compatibility with the old S-Plus code.

For each value of k, we look at the exceedances over the (k+1)th largest observation: X_{n-k+j,n}-X_{n-k,n} for j=1,...,k, with X_{j,n} the jth largest observation and n the sample size. The GPD is then fitted to these k exceedances using MLE which yields estimates for the parameters of the GPD: \gamma and \sigma.

See Section 4.2.2 in Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma

Vector of the corresponding MLE estimates for the \gamma parameter of the GPD.

sigma

Vector of the corresponding MLE estimates for the \sigma parameter of the GPD.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur and R code from Klaus Herrmann.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(soa)

# Look at last 500 observations of SOA data
SOAdata <- sort(soa$size)[length(soa$size)-(0:499)]

# Plot GPD-ML estimates as a function of k
GPDmle(SOAdata, plot=TRUE)

GPD residual plot

Description

Residual plot to check GPD fit for peaks over a threshold.

Usage

GPDresiduals(data, t, gamma, sigma, plot = TRUE, 
             main = "GPD residual plot", ...)

Arguments

data

Vector of n observations.

t

The used threshold.

gamma

Estimate for the EVI obtained from GPDmle.

sigma

Estimate for \sigma obtained from GPDmle.

plot

Logical indicating if the residuals should be plotted, default is FALSE.

main

Title for the plot, default is "GPD residual plot".

...

Additional arguments for the plot function, see plot for more details.

Details

Consider the POT values Y=X-t and the transformed variable

R= 1/\gamma \log(1+\gamma/\sigma Y),

when \gamma \neq 0 and

R = Y/\sigma,

otherwise. We can assess the goodness-of-fit of the GPD when modelling POT values Y=X-t by constructing an exponential QQ-plot of the transformed variable R since R is standard exponentially distributed if Y follows the GPD.

See Section 4.2.2 in Albrecher et al. (2017) for more details.

Value

A list with following components:

res.the

Vector of the theoretical quantiles from a standard exponential distribution.

res.emp

Vector of the empirical quantiles of R, see Details.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

data(soa)

# Look at last 500 observations of SOA data
SOAdata <- sort(soa$size)[length(soa$size)-(0:499)]

# Plot POT-MLE estimates as a function of k
pot <- GPDmle(SOAdata, plot=TRUE)

# Residual plot
k <- 200
GPDresiduals(SOAdata, sort(SOAdata)[length(SOAdata)-k], pot$gamma[k], pot$sigma[k])

Hill estimator

Description

Computes the Hill estimator for positive extreme value indices (Hill, 1975) as a function of the tail parameter k. Optionally, these estimates are plotted as a function of k.

Usage

Hill(data, k = TRUE, logk = FALSE, plot = FALSE, add = FALSE, 
     main = "Hill estimates of the EVI", ...)

Arguments

data

Vector of n observations.

k

Logical indicating if the Hill estimates are plotted as a function of the tail parameter k (k=TRUE) or as a function of \log(X_{n-k}). Default is TRUE.

logk

Logical indicating if the Hill estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k (logk=FALSE) when k=TRUE. Default is FALSE.

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Hill estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

The Hill estimator can be seen as the estimator of slope in the upper right corner (k last points) of the Pareto QQ-plot when using constrained least squares (the regression line has to pass through the point (-\log((k+1)/(n+1)),\log X_{n-k})). It is given by

H_{k,n}=1/k\sum_{j=1}^k \log X_{n-j+1,n}- \log X_{n-k,n}.

See Section 4.2.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma

Vector of the corresponding Hill estimates.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Hill, B. M. (1975). "A simple general approach to inference about the tail of a distribution." Annals of Statistics, 3, 1163–1173.

Examples

data(norwegianfire)

# Plot Hill estimates as a function of k
Hill(norwegianfire$size[norwegianfire$year==76],plot=TRUE)

Bias-reduced MLE (Quantile view)

Description

Computes bias-reduced ML estimates of gamma based on the quantile view.

Usage

Hill.2oQV(data, start = c(1,1,1), warnings = FALSE, logk = FALSE, 
          plot = FALSE, add = FALSE, main = "Estimates of the EVI", ...)

Arguments

data

Vector of n observations.

start

A vector of length 3 containing starting values for the first numerical optimisation (see Details). The elements are the starting values for the estimators of \gamma, \mu and \sigma, respectively. Default is c(1,1,1).

warnings

Logical indicating if possible warnings from the optimisation function are shown, default is FALSE.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates of \gamma should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of \gamma should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

See Section 4.2.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma

Vector of the ML estimates for the EVI for each value of k.

b

Vector of the ML estimates for the parameter b in the regression model for each value of k.

beta

Vector of the ML estimates for the parameter \beta in the regression model for each value of k.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur and R code from Klaus Herrmann.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Dierckx, G., Goegebeur Y. and Matthys, G. (1999). "Tail Index Estimation and an Exponential Regression Model." Extremes, 2, 177–200.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(norwegianfire)

# Plot bias-reduced MLE (QV) as a function of k
Hill.2oQV(norwegianfire$size[norwegianfire$year==76],plot=TRUE)

Select optimal threshold for Hill estimator

Description

Select optimal threshold for the Hill estimator by minimising the Asymptotic Mean Squared Error (AMSE).

Usage

Hill.kopt(data, start = c(1, 1, 1), warnings = FALSE, logk = FALSE, 
          plot = FALSE, add = FALSE, main = "AMSE plot", ...)

Arguments

data

Vector of n observations.

start

A vector of length 3 containing starting values for the first numerical optimisation (see Hill.2oQV for more details). Default is c(1,1,1).

warnings

Logical indicating if possible warnings from the optimisation function are shown, default is FALSE.

logk

Logical indicating if the AMSE values are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the AMSE values should be plotted as a function of k, default is FALSE.

add

Logical indicating if the optimal value for k should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "AMSE plot".

...

Additional arguments for the plot function, see plot for more details.

Details

See Section 4.2.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

AMSE

Vector of the AMSE values for each value of k.

kopt

Optimal value of k corresponding to minimal AMSE value.

gammaopt

Optimal value of the Hill estimator corresponding to minimal AMSE value.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Beirlant J., Vynckier, P. and Teugels, J. (1996). "Tail Index Estimation, Pareto Quantile Plots, and Regression Diagnostics." Journal of the American Statistical Association, 91, 1659–1667.

Examples

data(norwegianfire)

# Plot Hill estimator as a function of k
Hill(norwegianfire$size[norwegianfire$year==76],plot=TRUE)

# Add optimal value of k
Hill.kopt(norwegianfire$size[norwegianfire$year==76],add=TRUE)

Kaplan-Meier estimator

Description

Computes the Kaplan-Meier estimator for the survival function of right censored data.

Usage

KaplanMeier(x, data, censored, conf.type="plain", conf.int = 0.95)

Arguments

x

Vector with points to evaluate the estimator in.

data

Vector of n observations.

censored

Vector of n logicals indicating if an observation is right censored.

conf.type

Type of confidence interval, see survfit.formula. Default is "plain".

conf.int

Confidence level of the two-sided confidence interval, see survfit.formula. Default is 0.95.

Details

We consider the random right censoring model where one observes Z = \min(X,C) where X is the variable of interest and C is the censoring variable.

This function is merely a wrapper for survfit.formula from survival.

This estimator is only suitable for right censored data. When the data are interval censored, one can use the Turnbull estimator implemented in Turnbull.

Value

A list with following components:

surv

A vector of length length(x) containing the Kaplan-Meier estimator evaluated in the elements of x.

fit

The output from the call to survfit.formula, an object of class survfit.

Author(s)

Tom Reynkens

References

Kaplan, E. L. and Meier, P. (1958). "Nonparametric Estimation from Incomplete Observations." Journal of the American Statistical Association, 53, 457–481.

Examples

data <- c(1, 2.5, 3, 4, 5.5, 6, 7.5, 8.25, 9, 10.5)
censored <- c(0, 1, 0, 0, 1, 0, 1, 1, 0, 1)

x <- seq(0, 12, 0.1)

# Kaplan-Meier estimator
plot(x, KaplanMeier(x, data, censored)$surv, type="s", ylab="Kaplan-Meier estimator")

Least Squares tail estimator

Description

Computes the Least Squares (LS) estimates of the EVI based on the last k observations of the generalised QQ-plot.

Usage

LStail(data, rho = -1, lambda = 0.5, logk = FALSE, plot = FALSE, add = FALSE, 
       main = "LS estimates of the EVI", ...)
            
TSfraction(data, rho = -1, lambda = 0.5, logk = FALSE, plot = FALSE, add = FALSE, 
           main = "LS estimates of the EVI", ...)

Arguments

data

Vector of n observations.

rho

Estimate for \rho, or NULL when \rho needs to be estimated using the method of Beirlant et al. (2002). Default is -1.

lambda

Parameter used in the method of Beirlant et al. (2002), only used when rho=NULL. Default is 0.5.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates of \gamma should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of \gamma should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "LS estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

We estimate \gamma (EVI) and b using least squares on the following regression model (Beirlant et al., 2005): Z_j = \gamma + b(n/k) (j/k)^{-\rho} + \epsilon_j with Z_j = (j+1) \log(UH_{j,n}/UH_{j+1,n}) and UH_{j,n}=X_{n-j,n}H_{j,n}, where H_{j,n} is the Hill estimator with threshold X_{n-j,n}.

See Section 5.8 of Beirlant et al. (2004) for more details.

The function TSfraction is included for compatibility with the old S-Plus code.

Value

k

Vector of the values of the tail parameter k.

gamma

Vector of the corresponding LS estimates for the EVI.

b

Vector of the corresponding LS estimates for b.

rho

Vector of the estimates for \rho when rho=NULL or the given input for rho otherwise.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur.

References

Beirlant, J., Dierckx, G. and Guillou, A. (2005). "Estimation of the Extreme Value Index and Regression on Generalized Quantile Plots." Bernoulli, 11, 949–970.

Beirlant, J., Dierckx, G., Guillou, A. and Starica, C. (2002). "On Exponential Representations of Log-spacing of Extreme Order Statistics." Extremes, 5, 157–180.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(soa)

# LS tail estimator
LStail(soa$size, plot=TRUE, ylim=c(0,0.5))

Log-normal quantile plot

Description

Computes the empirical quantiles of the log-transform of a data vector and the theoretical quantiles of the standard normal distribution. These quantiles are then plotted in a log-normal QQ-plot with the theoretical quantiles on the x-axis and the empirical quantiles on the y-axis.

Usage

LognormalQQ(data, plot = TRUE, main = "Log-normal QQ-plot", ...)

Arguments

data

Vector of n observations.

plot

Logical indicating if the quantiles should be plotted in a log-normal QQ-plot, default is TRUE.

main

Title for the plot, default is "Log-normal QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

By definition, a log-transformed log-normal random variable is normally distributed. We can thus obtain a log-normal QQ-plot from a normal QQ-plot by replacing the empirical quantiles of the data vector by the empirical quantiles from the log-transformed data. We hence plot

(\Phi^{-1}(i/(n+1)), \log(X_{i,n}) )

for i=1,\ldots,n, where \Phi is the standard normal CDF.

See Section 4.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

lnqq.the

Vector of the theoretical quantiles from a standard normal distribution.

lnqq.emp

Vector of the empirical quantiles from the log-transformed data.

Author(s)

Tom Reynkens.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(norwegianfire)

# Log-normal QQ-plot for Norwegian Fire Insurance data for claims in 1976.
LognormalQQ(norwegianfire$size[norwegianfire$year==76])

Derivative plot of the log-normal QQ-plot

Description

Computes the derivative plot of the log-normal QQ-plot. These values can be plotted as a function of the data or as a function of the tail parameter k.

Usage

LognormalQQ_der(data, k = FALSE, plot = TRUE, 
                main = "Derivative plot of log-normal QQ-plot", ...)

Arguments

data

Vector of n observations.

plot

Logical indicating if the derivative values should be plotted, default is TRUE.

k

Logical indicating if the derivative values are plotted as a function of the tail parameter k (k=TRUE) or as a function of the logarithm of the data (k=FALSE). Default is FALSE.

main

Title for the plot, default is "Derivative plot of log-normal QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The derivative plot of a log-normal QQ-plot is

(k, H_{k,n}/N_{k,n})

(\log X_{n-k,n}, H_{k,n}/N_{k,n})

with H_{k,n} the Hill estimates and

N_{k,n} = (n+1)/(k+1) \phi(\Phi^{-1}(a)) - \Phi^{-1}(a).

Here is a=1-(k+1)/(n+1), \phi the standard normal PDF and \Phi the standard normal CDF.

See Section 4.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

xval

Vector of the x-values of the plot (k or \log X_{n-k,n}).

yval

Vector of the derivative values.

Author(s)

Tom Reynkens.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

data(norwegianfire)

# Log-normal QQ-plot for Norwegian Fire Insurance data for claims in 1976.
LognormalQQ(norwegianfire$size[norwegianfire$year==76])

# Derivate plot
LognormalQQ_der(norwegianfire$size[norwegianfire$year==76])

Mixed Erlang fit

Description

Create an S3 object using a Mixed Erlang (ME) fit.

Usage

MEfit(p, shape, theta, M, M_initial = NULL)

Arguments

p

Vector of mixing weights, denoted by \alpha in Verbelen et al. (2015).

shape

Vector of shape parameters r.

theta

Scale parameter \theta.

M

Number of mixture components.

M_initial

Initial value provided for M. When NULL (default), not included in the object.

Details

The rate parameter \lambda used in Albrecher et al. (2017) is equal to 1/\theta.

See Reynkens et al. (2017) and Section 4.3 of Albrecher et al. (2017) for more details

Value

An S3 object which contains the input arguments in a list.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758

Examples

# Create MEfit object
mefit <- MEfit(p=c(0.65,0.35), shape=c(39,58), theta=16.19, M=2)

# Create EVTfit object
evtfit <- EVTfit(gamma=c(0.76,0.64), endpoint=c(39096, Inf))

# Create SpliceFit object
splicefit <- SpliceFit(const=c(0.5,0.996), trunclower=0, t=c(1020,39096), type=c("ME","TPa","Pa"),
                       MEfit=mefit, EVTfit=evtfit)

# Show summary
summary(splicefit)

Mean excess function

Description

Computes the mean excess values for a vector of observations. These mean excess values can then be plotted as a function of the data or as a function of the tail parameter k.

Usage

MeanExcess(data, plot = TRUE, k = FALSE, main = "Mean excess plot", ...)

Arguments

data

Vector of n observations.

plot

Logical indicating if the mean excess values should be plotted in a mean excess plot, default is TRUE.

k

Logical indicating if the mean excess scores are plotted as a function of the tail parameter k (k=TRUE) or as a function of the data (k=FALSE). Default is FALSE.

main

Title for the plot, default is "Mean excess plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The mean excess plot is

(k,e_{k,n})

(X_{n-k,n}, e_{k,n})

with

e_{k,n}=1/k\sum_{j=1}^k X_{n-j+1,n}-X_{n-k,n}.

Note that the mean excess plot is the derivative plot of the Exponential QQ-plot.

See Section 4.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

X

Vector of the order statistics data[n-k] corresponding to the tail parameters in k.

e

Vector of the mean excess values corresponding to the tail parameters in k.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(norwegianfire)

# Mean excess plots for Norwegian Fire Insurance data for claims in 1976.

# Mean excess values as a function of k
MeanExcess(norwegianfire$size[norwegianfire$year==76], k=TRUE)

# Mean excess values as a function of the data
MeanExcess(norwegianfire$size[norwegianfire$year==76], k=FALSE)

Mean excess function using Turnbull estimator

Description

Computes mean excess values using the Turnbull estimator. These mean excess values can then be plotted as a function of the empirical quantiles (computed using the Turnbull estimator) or as a function of the tail parameter k.

Usage

MeanExcess_TB(L, U = L, censored, trunclower = 0, truncupper = Inf, 
              plot = TRUE, k = FALSE, intervalpkg = TRUE, 
              main = "Mean excess plot", ...)

Arguments

L

Vector of length n with the lower boundaries of the intervals for interval censored data or the observed data for right censored data.

U

Vector of length n with the upper boundaries of the intervals. By default, they are equal to L.

censored

A logical vector of length n indicating if an observation is censored.

trunclower

Lower truncation point, default is 0.

truncupper

Upper truncation point, default is Inf.

plot

Logical indicating if the mean excess values should be plotted in a mean excess plot, default is TRUE.

k

Logical indicating if the mean excess values are plotted as a function of the tail parameter k (k=TRUE) or as a function of the empirical quantiles computed using the Turnbull estimator (k=FALSE). Default is FALSE.

intervalpkg

Logical indicating if the Turnbull estimator is computed using the implementation in the interval package if this package is installed. Default is TRUE.

main

Title for the plot, default is "Mean excess plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The mean excess values are given by

\hat{e}^{TB}(v)=(\int_v^{\infty} 1-\hat{F}^{TB}(u) du)/(1-\hat{F}^{TB}(v))

where \hat{F}^{TB} is the Turnbull estimator for the CDF. More specifically, we use the values v=\hat{Q}^{TB}(p) for p=1/(n+1), \ldots, (n-1)/(n+1) where \hat{Q}^{TB}(p) is the empirical quantile function corresponding to the Turnbull estimator.

Right censored data should be entered as L=l and U=truncupper, and left censored data should be entered as L=trunclower and U=u.

If the interval package is installed and intervalpkg=TRUE, the icfit function is used to compute the Turnbull estimator. Otherwise, survfit.formula from survival is used.

Use MeanExcess for non-censored data.

See Section 4.3 in Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

X

Vector of the empirical quantiles, computed using the Turnbull estimator, corresponding to (n-k)/(n+1)=1-(k+1)/(n+1).

e

Vector of the mean excess values corresponding to the tail parameters in k.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# Right boundary
U <- Z
U[censored] <- Inf

# Mean excess plot
MeanExcess_TB(Z, U, censored, k=FALSE)

Moment estimator

Description

Compute the moment estimates for real extreme value indices as a function of the tail parameter k. Optionally, these estimates are plotted as a function of k.

Usage

Moment(data, logk = FALSE, plot = FALSE, add = FALSE, 
       main = "Moment estimates of the EVI", ...)

Arguments

data

Vector of n observations.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Moment estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

The moment estimator for the EVI is introduced by Dekkers et al. (1989) and is a generalisation of the Hill estimator.

See Section 4.2.2 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma

Vector of the corresponding moment estimates.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Dekkers, A.L.M, Einmahl, J.H.J. and de Haan, L. (1989). "A Moment Estimator for the Index of an Extreme-value Distribution." Annals of Statistics, 17, 1833–1855.

Examples

data(soa)

# Hill estimator
H <- Hill(soa$size, plot=FALSE)
# Moment estimator
M <- Moment(soa$size)
# Generalised Hill estimator
gH <- genHill(soa$size, gamma=H$gamma)

# Plot estimates
plot(H$k[1:5000], M$gamma[1:5000], xlab="k", ylab=expression(gamma), type="l", ylim=c(0.2,0.5))
lines(H$k[1:5000], gH$gamma[1:5000], lty=2)
legend("topright", c("Moment", "Generalised Hill"), lty=1:2)

The Pareto distribution

Description

Density, distribution function, quantile function and random generation for the Pareto distribution (type I).

Usage

dpareto(x, shape, scale = 1, log = FALSE)
ppareto(x, shape, scale = 1, lower.tail = TRUE, log.p = FALSE)
qpareto(p, shape, scale = 1, lower.tail = TRUE, log.p = FALSE)
rpareto(n, shape, scale = 1)

Arguments

x

Vector of quantiles.

p

Vector of probabilities.

n

Number of observations.

shape

The shape parameter of the Pareto distribution, a strictly positive number.

scale

The scale parameter of the Pareto distribution, a strictly positive number. Its default value is 1.

log

Logical indicating if the densities are given as \log(f), default is FALSE.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

The Cumulative Distribution Function (CDF) of the Pareto distribution is equal to F(x) = 1-(x/scale)^{-shape} for all x \ge scale and F(x)=0 otherwise. Both shape and scale need to be strictly positive.

Value

dpareto gives the density function evaluated in x, ppareto the CDF evaluated in x and qpareto the quantile function evaluated in p. The length of the result is equal to the length of x or p.

rpareto returns a random sample of length n.

Author(s)

Tom Reynkens.

Examples

# Plot of the PDF
x <- seq(1, 10, 0.01)
plot(x, dpareto(x, shape=2), xlab="x", ylab="PDF", type="l")

# Plot of the CDF
x <- seq(1, 10, 0.01)
plot(x, ppareto(x, shape=2), xlab="x", ylab="CDF", type="l")

Pareto quantile plot

Description

Computes the empirical quantiles of the log-transform of a data vector and the theoretical quantiles of the standard exponential distribution. These quantiles are then plotted in a Pareto QQ-plot with the theoretical quantiles on the x-axis and the empirical quantiles on the y-axis.

Usage

ParetoQQ(data, plot = TRUE, main = "Pareto QQ-plot", ...)

Arguments

data

Vector of n observations.

plot

Logical indicating if the quantiles should be plotted in a Pareto QQ-plot, default is TRUE.

main

Title for the plot, default is "Pareto QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

It can be easily seen that a log-transformed Pareto random variable is exponentially distributed. We can hence obtain a Pareto QQ-plot from an exponential QQ-plot by replacing the empirical quantiles from the data vector by the empirical quantiles from the log-transformed data. We hence plot

( -\log(1-i/(n+1)), \log X_{i,n} )

for i=1,...,n, with X_{i,n} the i-th order statistic of the data.

See Section 4.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

pqq.the

Vector of the theoretical quantiles from a standard exponential distribution.

pqq.emp

Vector of the empirical quantiles from the log-transformed data.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(norwegianfire)

# Exponential QQ-plot for Norwegian Fire Insurance data for claims in 1976.
ExpQQ(norwegianfire$size[norwegianfire$year==76])

# Pareto QQ-plot for Norwegian Fire Insurance data for claims in 1976.
ParetoQQ(norwegianfire$size[norwegianfire$year==76])

Derivative plot of the Pareto QQ-plot

Description

Computes the derivative plot of the Pareto QQ-plot. These values can be plotted as a function of the data or as a function of the tail parameter k.

Usage

ParetoQQ_der(data, k = FALSE, plot = TRUE, 
             main = "Derivative plot of Pareto QQ-plot", ...)

Arguments

data

Vector of n observations.

plot

Logical indicating if the derivative values should be plotted, default is TRUE.

k

Logical indicating if the derivative values are plotted as a function of the tail parameter k (k=TRUE) or as a function of the logarithm of the data (k=FALSE). Default is FALSE.

main

Title for the plot, default is "Derivative plot of Pareto QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The derivative plot of a Pareto QQ-plot is

(k,H_{k,n})

(\log X_{n-k,n}, H_{k,n})

with H_{k,n} the Hill estimates.

See Section 4.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

xval

Vector of the x-values of the plot (k or \log X_{n-k,n}).

yval

Vector of the derivative values H_{k,n}.

Author(s)

Tom Reynkens.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(norwegianfire)

# Pareto QQ-plot for Norwegian Fire Insurance data for claims in 1976.
ParetoQQ(norwegianfire$size[norwegianfire$year==76])

# Derivate plot
ParetoQQ_der(norwegianfire$size[norwegianfire$year==76])

Weissman estimator of small exceedance probabilities and large return periods

Description

Compute estimates of a small exceedance probability P(X>q) or large return period 1/P(X>q) using the approach of Weissman (1978).

Usage

Prob(data, gamma, q, plot = FALSE, add = FALSE, 
     main = "Estimates of small exceedance probability", ...)
          
Return(data, gamma, q, plot = FALSE, add = FALSE, 
       main = "Estimates of large return period", ...)
        
Weissman.p(data, gamma, q, plot = FALSE, add = FALSE, 
           main = "Estimates of small exceedance probability", ...)
          
Weissman.r(data, gamma, q, plot = FALSE, add = FALSE, 
           main = "Estimates of large return period", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI, typically Hill estimates are used.

q

The used large quantile (we estimate P(X>q) or 1/P(X>q) for q large).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of extreme quantile" for Prob and "Estimates of large return period" for Return.

...

Additional arguments for the plot function, see plot for more details.

Details

See Section 4.2.1 of Albrecher et al. (2017) for more details.

Weissman.p and Weissman.r are the same functions as Prob and Return but with a different name for compatibility with the old S-Plus code.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

P

Vector of the corresponding probability estimates, only returned for Prob.

R

Vector of the corresponding estimates for the return period, only returned for Return.

q

The used large quantile.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Weissman, I. (1978). "Estimation of Parameters and Large Quantiles Based on the k Largest Observations." Journal of the American Statistical Association, 73, 812–815.

Examples

data(soa)

# Look at last 500 observations of SOA data
SOAdata <- sort(soa$size)[length(soa$size)-(0:499)]

# Hill estimator
H <- Hill(SOAdata)

# Exceedance probability
q <- 10^6
# Weissman estimator
Prob(SOAdata,gamma=H$gamma,q=q,plot=TRUE)

# Return period
q <- 10^6
# Weissman estimator
Return(SOAdata,gamma=H$gamma,q=q,plot=TRUE)

Estimator of small exceedance probabilities and large return periods using EPD

Description

Computes estimates of a small exceedance probability P(X>q) or large return period 1/P(X>q) using the parameters from the EPD fit.

Usage


ProbEPD(data, q, gamma, kappa, tau, plot = FALSE, add = FALSE, 
        main = "Estimates of small exceedance probability", ...)

ReturnEPD(data, q, gamma, kappa, tau, plot = FALSE, add = FALSE, 
          main = "Estimates of large return period", ...)

Arguments

data

Vector of n observations.

q

The used large quantile (we estimate P(X>q) or 1/P(X>q) for q large).

gamma

Vector of n-1 estimates for the EVI obtained from EPD.

kappa

Vector of n-1 estimates for \kappa obtained from EPD.

tau

Vector of n-1 estimates for \tau obtained from EPD.

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of small exceedance probability" for ProbEPD and "Estimates of large return period" for ReturnEPD.

...

Additional arguments for the plot function, see plot for more details.

Details

See Section 4.2.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

P

Vector of the corresponding probability estimates, only returned for ProbEPD.

R

Vector of the corresponding estimates for the return period, only returned for ReturnEPD.

q

The used large quantile.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

data(secura)

# EPD estimates for the EVI
epd <- EPD(secura$size, plot=TRUE)

# Compute exceedance probabilities
q <- 10^7
ProbEPD(secura$size, q=q, gamma=epd$gamma, kappa=epd$kappa, tau=epd$tau, plot=TRUE)

# Compute return periods
ReturnEPD(secura$size, q=q, gamma=epd$gamma, kappa=epd$kappa, tau=epd$tau, 
          plot=TRUE, ylim=c(0,10^4))

Estimator of small exceedance probabilities and large return periods using generalised Hill

Description

Computes estimates of a small exceedance probability P(X>q) or large return period 1/P(X>q) using the generalised Hill estimates for the EVI.

Usage

ProbGH(data, gamma, q, plot = FALSE, add = FALSE, 
       main = "Estimates of small exceedance probability", ...)

ReturnGH(data, gamma, q, plot = FALSE, add = FALSE, 
         main = "Estimates of large return period", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-2 estimates for the EVI obtained from genHill.

q

The used large quantile (we estimate P(X>q) or 1/P(X>q) for q large).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of small exceedance probability" for ProbGH and "Estimates of large return period" for ReturnGH.

...

Additional arguments for the plot function, see plot for more details.

Details

See Section 4.2.2 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

P

Vector of the corresponding probability estimates, only returned for ProbGH.

R

Vector of the corresponding estimates for the return period, only returned for ReturnGH.

q

The used large quantile.

Author(s)

Tom Reynkens.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Beirlant, J., Vynckier, P. and Teugels, J.L. (1996). "Excess Function and Estimation of the Extreme-value Index". Bernoulli, 2, 293–318.

Examples

data(soa)

# Look at last 500 observations of SOA data
SOAdata <- sort(soa$size)[length(soa$size)-(0:499)]

# Hill estimator
H <- Hill(SOAdata)
# Generalised Hill estimator
gH <- genHill(SOAdata, H$gamma)

# Exceedance probability
q <- 10^7
ProbGH(SOAdata, gamma=gH$gamma, q=q, plot=TRUE)

# Return period
q <- 10^7
ReturnGH(SOAdata, gamma=gH$gamma, q=q, plot=TRUE)

Estimator of small exceedance probabilities and large return periods using GPD-MLE

Description

Computes estimates of a small exceedance probability P(X>q) or large return period 1/P(X>q) using the GPD fit for the peaks over a threshold.

Usage

ProbGPD(data, gamma, sigma, q, plot = FALSE, add = FALSE, 
        main = "Estimates of small exceedance probability", ...)

ReturnGPD(data, gamma, sigma, q, plot = FALSE, add = FALSE, 
          main = "Estimates of large return period", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI obtained from GPDmle.

sigma

Vector of n-1 estimates for \sigma obtained from GPDmle.

q

The used large quantile (we estimate P(X>q) or 1/P(X>q) for q large).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of small exceedance probability" for ProbGPD and "Estimates of large return period" for ReturnGPD.

...

Additional arguments for the plot function, see plot for more details.

Details

See Section 4.2.2 in Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

P

Vector of the corresponding probability estimates, only returned for ProbGPD.

R

Vector of the corresponding estimates for the return period, only returned for ReturnGPD.

q

The used large quantile.

Author(s)

Tom Reynkens.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(soa)

# Look at last 500 observations of SOA data
SOAdata <- sort(soa$size)[length(soa$size)-(0:499)]

# GPD-ML estimator
pot <- GPDmle(SOAdata)

# Exceedance probability
q <- 10^7
ProbGPD(SOAdata, gamma=pot$gamma, sigma=pot$sigma, q=q, plot=TRUE)

# Return period
q <- 10^7
ReturnGPD(SOAdata, gamma=pot$gamma, sigma=pot$sigma, q=q, plot=TRUE)

Estimator of small exceedance probabilities and large return periods using MOM

Description

Computes estimates of a small exceedance probability P(X>q) or large return period 1/P(X>q) using the Method of Moments estimates for the EVI.

Usage

ProbMOM(data, gamma, q, plot = FALSE, add = FALSE, 
        main = "Estimates of small exceedance probability", ...)

ReturnMOM(data, gamma, q, plot = FALSE, add = FALSE, 
          main = "Estimates of large return period", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI obtained from Moment.

q

The used large quantile (we estimate P(X>q) or 1/P(X>q) for q large).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of small exceedance probability" for ProbMOM and "Estimates of large return period" for ReturnMOM.

...

Additional arguments for the plot function, see plot for more details.

Details

See Section 4.2.2 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

P

Vector of the corresponding probability estimates, only returned for ProbMOM.

R

Vector of the corresponding estimates for the return period, only returned for ReturnMOM.

q

The used large quantile.

Author(s)

Tom Reynkens.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Dekkers, A.L.M, Einmahl, J.H.J. and de Haan, L. (1989). "A Moment Estimator for the Index of an Extreme-value Distribution." Annals of Statistics, 17, 1833–1855.

Examples

data(soa)

# Look at last 500 observations of SOA data
SOAdata <- sort(soa$size)[length(soa$size)-(0:499)]

# MOM estimator
M <- Moment(SOAdata)

# Exceedance probability
q <- 10^7
ProbMOM(SOAdata, gamma=M$gamma, q=q, plot=TRUE)

# Return period
q <- 10^7
ReturnMOM(SOAdata, gamma=M$gamma, q=q, plot=TRUE)

Estimator of small tail probability in regression

Description

Estimator of small tail probability 1-F_i(q) in the regression case where \gamma is constant and the regression modelling is thus only solely placed on the scale parameter.

Usage

ProbReg(Z, A, q, plot = FALSE, add = FALSE, 
        main = "Estimates of small exceedance probability", ...)

Arguments

Z

Vector of n observations (from the response variable).

A

Vector of n-1 estimates for A(i/n) obtained from ScaleReg.

q

The used large quantile (we estimate P(X_i>q)) for q large).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of small exceedance probability".

...

Additional arguments for the plot function, see plot for more details.

Details

The estimator is defined as

1-\hat{F}_i(q) = \hat{A}(i/n) (k+1)/(n+1) (q/Z_{n-k,n})^{-1/H_{k,n}},

with H_{k,n} the Hill estimator. Here, it is assumed that we have equidistant covariates x_i=i/n.

See Section 4.4.1 in Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

P

Vector of the corresponding probability estimates.

q

The used large quantile.

Author(s)

Tom Reynkens.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

data(norwegianfire)

Z <- norwegianfire$size[norwegianfire$year==76]

i <- 100
n <- length(Z)

# Scale estimator in i/n
A <- ScaleReg(i/n, Z, h=0.5, kernel = "epanechnikov")$A

# Small exceedance probability
q <- 10^6
ProbReg(Z, A, q, plot=TRUE)

# Large quantile
p <- 10^(-5)
QuantReg(Z, A, p, plot=TRUE)

Weissman estimator of extreme quantiles

Description

Compute estimates of an extreme quantile Q(1-p) using the approach of Weissman (1978).

Usage

Quant(data, gamma, p, plot = FALSE, add = FALSE, 
      main = "Estimates of extreme quantile", ...)
            
Weissman.q(data, gamma, p, plot = FALSE, add = FALSE, 
           main = "Estimates of extreme quantile", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI, typically Hill estimates are used.

p

The exceedance probability of the quantile (we estimate Q(1-p) for p small).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates as a function of k should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of extreme quantile".

...

Additional arguments for the plot function, see plot for more details.

Details

See Section 4.2.1 of Albrecher et al. (2017) for more details.

Weissman.q is the same function but with a different name for compatibility with the old S-Plus code.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

Q

Vector of the corresponding quantile estimates.

p

The used exceedance probability.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Weissman, I. (1978). "Estimation of Parameters and Large Quantiles Based on the k Largest Observations." Journal of the American Statistical Association, 73, 812–815.

Examples

data(soa)

# Look at last 500 observations of SOA data
SOAdata <- sort(soa$size)[length(soa$size)-(0:499)]

# Hill estimator
H <- Hill(SOAdata)
# Bias-reduced estimator (QV)
H_QV <- Hill.2oQV(SOAdata)

# Exceedance probability
p <- 10^(-5)
# Weissman estimator
Quant(SOAdata, gamma=H$gamma, p=p, plot=TRUE)

# Second order Weissman estimator (QV)
Quant.2oQV(SOAdata, gamma=H_QV$gamma, beta=H_QV$beta, b=H_QV$b, p=p,
           add=TRUE, lty=2)

Second order refined Weissman estimator of extreme quantiles (QV)

Description

Compute second order refined Weissman estimator of extreme quantiles Q(1-p) using the quantile view.

Usage

Quant.2oQV(data, gamma, b, beta, p, plot = FALSE, add = FALSE, 
           main = "Estimates of extreme quantile", ...)
                
Weissman.q.2oQV(data, gamma, b, beta, p, plot = FALSE, add = FALSE, 
                main = "Estimates of extreme quantile", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI obtained from Hill.2oQV.

b

Vector of n-1 estimates for b obtained from Hill.2oQV.

beta

Vector of n-1 estimates for \beta obtained from Hill.2oQV.

p

The exceedance probability of the quantile (we estimate Q(1-p) for p small).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of extreme quantile".

...

Additional arguments for the plot function, see plot for more details.

Details

See Section 4.2.1 of Albrecher et al. (2017) for more details.

Weissman.q.2oQV is the same function but with a different name for compatibility with the old S-Plus code.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

Q

Vector of the corresponding quantile estimates.

p

The used exceedance probability.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(soa)

# Look at last 500 observations of SOA data
SOAdata <- sort(soa$size)[length(soa$size)-(0:499)]

# Hill estimator
H <- Hill(SOAdata)
# Bias-reduced estimator (QV)
H_QV <- Hill.2oQV(SOAdata)

# Exceedance probability
p <- 10^(-5)
# Weissman estimator
Quant(SOAdata, gamma=H$gamma, p=p, plot=TRUE)

# Second order Weissman estimator (QV)
Quant.2oQV(SOAdata, gamma=H_QV$gamma, beta=H_QV$beta, b=H_QV$b, p=p,
           add=TRUE, lty=2)

Estimator of extreme quantiles using generalised Hill

Description

Compute estimates of an extreme quantile Q(1-p) using generalised Hill estimates of the EVI.

Usage

QuantGH(data, gamma, p, plot = FALSE, add = FALSE, 
        main = "Estimates of extreme quantile", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-2 estimates for the EVI obtained from genHill.

p

The exceedance probability of the quantile (we estimate Q(1-p) for p small).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of extreme quantile".

...

Additional arguments for the plot function, see plot for more details.

Details

See Section 4.2.2 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

Q

Vector of the corresponding quantile estimates.

p

The used exceedance probability.

Author(s)

Tom Reynkens.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Beirlant, J., Vynckier, P. and Teugels, J.L. (1996). "Excess Function and Estimation of the Extreme-value Index". Bernoulli, 2, 293–318.

Examples

data(soa)

# Look at last 500 observations of SOA data
SOAdata <- sort(soa$size)[length(soa$size)-(0:499)]

# Hill estimator
H <- Hill(SOAdata)
# Generalised Hill estimator
gH <- genHill(SOAdata, H$gamma)

# Large quantile
p <- 10^(-5)
QuantGH(SOAdata, p=p, gamma=gH$gamma, plot=TRUE)

Estimator of extreme quantiles using GPD-MLE

Description

Computes estimates of an extreme quantile Q(1-p) using the GPD fit for the peaks over a threshold.

Usage

QuantGPD(data, gamma, sigma, p, plot = FALSE, add = FALSE, 
         main = "Estimates of extreme quantile", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI obtained from GPDmle.

sigma

Vector of n-1 estimates for \sigma obtained from GPDmle.

p

The exceedance probability of the quantile (we estimate Q(1-p) for p small).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of extreme quantile".

...

Additional arguments for the plot function, see plot for more details.

Details

See Section 4.2.2 in Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

Q

Vector of the corresponding quantile estimates.

p

The used exceedance probability.

Author(s)

Tom Reynkens.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(soa)

# Look at last 500 observations of SOA data
SOAdata <- sort(soa$size)[length(soa$size)-(0:499)]

# GPD-ML estimator
pot <- GPDmle(SOAdata)

# Large quantile
p <- 10^(-5)
QuantGPD(SOAdata, p=p, gamma=pot$gamma, sigma=pot$sigma, plot=TRUE)

Estimator of extreme quantiles using MOM

Description

Compute estimates of an extreme quantile Q(1-p) using the Method of Moments estimates of the EVI.

Usage

QuantMOM(data, gamma, p, plot = FALSE, add = FALSE, 
         main = "Estimates of extreme quantile", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI obtained from Moment.

p

The exceedance probability of the quantile (we estimate Q(1-p) for p small).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of extreme quantile".

...

Additional arguments for the plot function, see plot for more details.

Details

See Section 4.2.2 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

Q

Vector of the corresponding quantile estimates.

p

The used exceedance probability.

Author(s)

Tom Reynkens.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Dekkers, A.L.M, Einmahl, J.H.J. and de Haan, L. (1989). "A Moment Estimator for the Index of an Extreme-value Distribution." Annals of Statistics, 17, 1833–1855.

Examples

data(soa)

# Look at last 500 observations of SOA data
SOAdata <- sort(soa$size)[length(soa$size)-(0:499)]

# MOM estimator
M <- Moment(SOAdata)

# Large quantile
p <- 10^(-5)
QuantMOM(SOAdata, p=p, gamma=M$gamma, plot=TRUE)

Estimator of extreme quantiles in regression

Description

Estimator of extreme quantile Q_i(1-p) in the regression case where \gamma is constant and the regression modelling is thus only solely placed on the scale parameter.

Usage

QuantReg(Z, A, p, plot = FALSE, add = FALSE, 
         main = "Estimates of extreme quantile", ...)

Arguments

Z

Vector of n observations (from the response variable).

A

Vector of n-1 estimates for A(i/n) obtained from ScaleReg.

p

The exceedance probability of the quantile (we estimate Q_i(1-p) for p small).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of extreme quantile".

...

Additional arguments for the plot function, see plot for more details.

Details

The estimator is defined as

\hat{Q}_i(1-p) = Z_{n-k,n} ((k+1)/((n+1)\times p) \hat{A}(i/n))^{H_{k,n}},

with H_{k,n} the Hill estimator. Here, it is assumed that we have equidistant covariates x_i=i/n.

See Section 4.4.1 in Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

Q

Vector of the corresponding quantile estimates.

p

The used exceedance probability.

Author(s)

Tom Reynkens.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

data(norwegianfire)

Z <- norwegianfire$size[norwegianfire$year==76]

i <- 100
n <- length(Z)

# Scale estimator in i/n
A <- ScaleReg(i/n, Z, h=0.5, kernel = "epanechnikov")$A

# Small exceedance probability
q <- 10^6
ProbReg(Z, A, q, plot=TRUE)

# Large quantile
p <- 10^(-5)
QuantReg(Z, A, p, plot=TRUE)

Scale estimator

Description

Computes the estimator for the scale parameter as described in Beirlant et al. (2016).

Usage

Scale(data, gamma = NULL, logk = FALSE, plot = FALSE, add = FALSE, 
      main = "Estimates of scale parameter", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI. When NULL (the default value), Hill estimates are computed.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of scale parameter".

...

Additional arguments for the plot function, see plot for more details.

Details

The scale estimates are computed based on the following model for the CDF: 1-F(x) = A x^{-1/\gamma}, where A:= C^{1/\gamma} is the scale parameter:

\hat{A}_{k,n}=(k+1)/(n+1) X_{n-k,n}^{1/H_{k,n}}

where H_{k,n} are the Hill estimates.

See Section 4.2.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

A

Vector of the corresponding scale estimates.

C

Vector of the corresponding estimates for C, see Details.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant, J., Schoutens, W., De Spiegeleer, J., Reynkens, T. and Herrmann, K. (2016). "Hunting for Black Swans in the European Banking Sector Using Extreme Value Analysis." In: Jan Kallsen and Antonis Papapantoleon (eds.), Advanced Modelling in Mathematical Finance, Springer International Publishing, Switzerland, pp. 147–166.

Examples

data(secura)

# Hill estimator
H <- Hill(secura$size)

# Scale estimator
S <- Scale(secura$size, gamma=H$gamma, plot=FALSE)

# Plot logarithm of scale          
plot(S$k,log(S$A), xlab="k", ylab="log(Scale)", type="l")

Bias-reduced scale estimator using second order Hill estimator

Description

Computes the bias-reduced estimator for the scale parameter using the second-order Hill estimator.

Usage

Scale.2o(data, gamma, b, beta, logk = FALSE, plot = FALSE, add = FALSE, 
         main = "Estimates of scale parameter", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI obtained from Hill.2oQV.

b

Vector of n-1 estimates for B obtained from Hill.2oQV.

beta

Vector of n-1 estimates for \beta obtained from Hill.2oQV.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of scale parameter".

...

Additional arguments for the plot function, see plot for more details.

Details

The scale estimates are computed based on the following model for the CDF: 1-F(x) = A x^{-1/\gamma} ( 1+ bx^{-\beta}(1+o(1)) ), where A:= C^{1/\gamma} is the scale parameter.

See Section 4.2.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

A

Vector of the corresponding scale estimates.

C

Vector of the corresponding estimates for C, see details.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

data(secura)

# Hill estimator
H <- Hill(secura$size)
# Bias-reduced Hill estimator
H2o <- Hill.2oQV(secura$size)

# Scale estimator
S <- Scale(secura$size, gamma=H$gamma, plot=FALSE)
# Bias-reduced scale estimator
S2o <- Scale.2o(secura$size, gamma=H2o$gamma, b=H2o$b, 
          beta=H2o$beta, plot=FALSE)

# Plot logarithm of scale             
plot(S$k,log(S$A), xlab="k", ylab="log(Scale)", type="l")
lines(S2o$k,log(S2o$A), lty=2)

Bias-reduced scale estimator using EPD estimator

Description

Computes the bias-reduced estimator for the scale parameter using the EPD estimator (Beirlant et al., 2016).

Usage

ScaleEPD(data, gamma, kappa, logk = FALSE, plot = FALSE, add = FALSE, 
         main = "Estimates of scale parameter", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI obtained from EPD.

kappa

Vector of n-1 estimates for \kappa obtained from EPD.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of scale parameter".

...

Additional arguments for the plot function, see plot for more details.

Details

The scale estimates are computed based on the following model for the CDF: 1-F(x) = A x^{-1/\gamma} ( 1+ bx^{-\beta}(1+o(1)) ), where A:= C^{1/\gamma} is the scale parameter. Using the EPD approach we replace bx^{-\beta} by \kappa/\gamma.

See Section 4.2.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

A

Vector of the corresponding scale estimates.

C

Vector of the corresponding estimates for C, see details.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

data(secura)

# Hill estimator
H <- Hill(secura$size)
# EPD estimator
epd <- EPD(secura$size)

# Scale estimator
S <- Scale(secura$size, gamma=H$gamma, plot=FALSE)
# Bias-reduced scale estimator
Sepd <- ScaleEPD(secura$size, gamma=epd$gamma, kappa=epd$kappa, plot=FALSE)

# Plot logarithm of scale             
plot(S$k,log(S$A), xlab="k", ylab="log(Scale)", type="l")
lines(Sepd$k,log(Sepd$A), lty=2)

Scale estimator in regression

Description

Estimator of the scale parameter in the regression case where \gamma is constant and the regression modelling is thus placed solely on the scale parameter.

Usage

ScaleReg(s, Z, kernel = c("normal", "uniform", "triangular", "epanechnikov", "biweight"), 
         h, plot = TRUE, add = FALSE, main = "Estimates of scale parameter", ...)

Arguments

s

Point to evaluate the scale estimator in.

Z

Vector of n observations (from the response variable).

kernel

The kernel used in the estimator. One of "normal" (default), "uniform", "triangular", "epanechnikov" and "biweight".

h

The bandwidth used in the kernel function.

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of scale parameter".

...

Additional arguments for the plot function, see plot for more details.

Details

The scale estimator is computed as

\hat{A}(s) = 1/(k+1) \sum_{i=1}^n 1_{Z_i>Z_{n-k,n}} K_h(s-i/n)

with K_h(x)=K(x/h)/h, K the kernel function and h the bandwidth. Here, it is assumed that we have equidistant covariates x_i=i/n.

See Section 4.4.1 in Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

A

Vector of the corresponding scale estimates.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

data(norwegianfire)

Z <- norwegianfire$size[norwegianfire$year==76]

i <- 100
n <- length(Z)

# Scale estimator in i/n
A <- ScaleReg(i/n, Z, h=0.5, kernel = "epanechnikov")$A

# Small exceedance probability
q <- 10^6
ProbReg(Z, A, q, plot=TRUE)

# Large quantile
p <- 10^(-5)
QuantReg(Z, A, p, plot=TRUE)

Spliced distribution

Description

Density, distribution function, quantile function and random generation for the fitted spliced distribution.

Usage

dSplice(x, splicefit, log = FALSE)

pSplice(x, splicefit, lower.tail = TRUE, log.p = FALSE)

qSplice(p, splicefit, lower.tail = TRUE, log.p = FALSE)

rSplice(n, splicefit)

Arguments

x

Vector of points to evaluate the CDF or PDF in.

p

Vector of probabilities.

n

Number of observations.

splicefit

A SpliceFit object, e.g. output from SpliceFitPareto, SpliceFiticPareto or SpliceFitGPD.

log

Logical indicating if the densities are given as \log(f), default is FALSE.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

See Reynkens et al. (2017) and Section 4.3 in Albrecher et al. (2017) for details.

Value

dSplice gives the density function evaluated in x, pSplice the CDF evaluated in x and qSplice the quantile function evaluated in p. The length of the result is equal to the length of x or p.

rSplice returns a random sample of length n.

Author(s)

Tom Reynkens with R code from Roel Verbelen for the mixed Erlang PDF, CDF and quantiles.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758.

Examples

## Not run: 

# Pareto random sample
X <- rpareto(1000, shape = 2)

# Splice ME and Pareto
splicefit <- SpliceFitPareto(X, 0.6)



x <- seq(0, 20, 0.01)

# Plot of spliced CDF
plot(x, pSplice(x, splicefit), type="l", xlab="x", ylab="F(x)")

# Plot of spliced PDF
plot(x, dSplice(x, splicefit), type="l", xlab="x", ylab="f(x)")



p <- seq(0, 1, 0.01)

# Plot of splicing quantiles
plot(p, qSplice(p, splicefit), type="l", xlab="p", ylab="Q(p)")

# Plot of VaR
plot(p, VaR(p, splicefit), type="l", xlab="p", ylab=bquote(VaR[p]))



# Random sample from spliced distribution
x <- rSplice(1000, splicefit)


## End(Not run)

Plot of fitted and empirical survival function

Description

This function plots the fitted survival function of the spliced distribution together with the empirical survival function (determined using the Empirical CDF (ECDF)). Moreover, 100(1-\alpha)\% confidence bands are added.

Usage

SpliceECDF(x, X, splicefit, alpha = 0.05, ...)

Arguments

x

Vector of points to plot the functions at.

X

Data used for fitting the distribution.

splicefit

A SpliceFit object, e.g. output from SpliceFitPareto or SpliceFitGPD.

alpha

100(1-\alpha)\% is the confidence level for the confidence bands. Default is \alpha=0.05.

...

Additional arguments for the plot function, see plot for more details.

Details

Use SpliceTB for censored data.

Confidence bands are determined using the Dvoretzky-Kiefer-Wolfowitz inequality (Massart, 1990).

See Reynkens et al. (2017) and Section 4.3.1 in Albrecher et al. (2017) for more details.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Massart, P. (1990). The Tight Constant in the Dvoretzky-Kiefer-Wolfowitz Inequality. Annals of Probability, 18, 1269–1283.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758.

Examples

## Not run: 

# Pareto random sample
X <- rpareto(1000, shape = 2)

# Splice ME and Pareto
splicefit <- SpliceFitPareto(X, 0.6)



x <- seq(0, 20, 0.01)

# Plot of spliced CDF
plot(x, pSplice(x, splicefit), type="l", xlab="x", ylab="F(x)")

# Plot of spliced PDF
plot(x, dSplice(x, splicefit), type="l", xlab="x", ylab="f(x)")



# Fitted survival function and empirical survival function 
SpliceECDF(x, X, splicefit)

# Log-log plot with empirical survival function and fitted survival function
SpliceLL(x, X, splicefit)

# PP-plot of empirical survival function and fitted survival function
SplicePP(X, splicefit)

# PP-plot of empirical survival function and 
# fitted survival function with log-scales
SplicePP(X, splicefit, log=TRUE)

# Splicing QQ-plot
SpliceQQ(X, splicefit)

## End(Not run)

Splicing fit

Description

Create an S3 object using ME-Pa or ME-GPD splicing fit obtained from SpliceFitPareto, SpliceFiticPareto or SpliceFitGPD.

Usage

SpliceFit(const, trunclower, t, type, MEfit, EVTfit, loglik = NULL, IC = NULL)

Arguments

const

Vector of splicing constants or a single splicing constant.

trunclower

Lower truncation point.

t

Vector of splicing points or a single splicing point.

type

Vector of types of the distributions: "ME" and then for each fitted EVT distribution: "Pa" (Pareto), "TPa" (truncated Pareto) or "GPD" (GPD).

MEfit

MEfit object with details on the mixed Erlang fit.

EVTfit

EVTfit object with details on the EVT fit.

loglik

Log-likelihood of the fitted model. When NULL (default), not included in the object.

IC

Information criteria of the fitted model. When NULL (default), not included in the object. This vector should have length 1, 2 or 3 when included.

Details

See Reynkens et al. (2017) and Section 4.3 in Albrecher et al. (2017) for details.

Value

An S3 object containing the above input arguments and values for \pi, the splicing weights. These splicing weights are equal to

\pi_1=const_1, \pi_2=const_2-const_1, ...,\pi_{l+1}=1-const_l=1-(\pi_1+...+\pi_l)

when l\ge 2 and

\pi_1=const_1, \pi_2=1-const_1=1-\pi_1

when l=1, where l is the length of const.

A summary method is available.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758

Examples

# Create MEfit object
mefit <- MEfit(p=c(0.65,0.35), shape=c(39,58), theta=16.19, M=2)

# Create EVTfit object
evtfit <- EVTfit(gamma=c(0.76,0.64), endpoint=c(39096, Inf))

# Create SpliceFit object
splicefit <- SpliceFit(const=c(0.5,0.996), trunclower=0, t=c(1020,39096), type=c("ME","TPa","Pa"),
                       MEfit=mefit, EVTfit=evtfit)

# Show summary
summary(splicefit)

Splicing of mixed Erlang and GPD using POT-MLE

Description

Fit spliced distribution of a mixed Erlang distribution and a Generalised Pareto Distribution (GPD). The parameters of the GPD are determined using the POT-MLE approach.

Usage

SpliceFitGPD(X, const = NULL, tsplice = NULL, M = 3, s = 1:10, trunclower = 0, 
             ncores = NULL, criterium = c("BIC","AIC"), reduceM = TRUE,
             eps = 10^(-3), beta_tol = 10^(-5), maxiter = Inf)

Arguments

X

Data used for fitting the distribution.

const

The probability of the quantile where the ME distribution will be spliced with the GPD distribution. Default is NULL meaning the input from tsplice is used.

tsplice

The point where the ME distribution will be spliced with the GPD distribution. Default is NULL meaning the input from const is used.

M

Initial number of Erlang mixtures, default is 3. This number can change when determining an optimal mixed Erlang fit using an information criterion.

s

Vector of spread factors for the EM algorithm, default is 1:10. We loop over these factors when determining an optimal mixed Erlang fit using an information criterion, see Verbelen et al. (2016).

trunclower

Lower truncation point. Default is 0.

ncores

Number of cores to use when determining an optimal mixed Erlang fit using an information criterion. When NULL (default), max(nc-1,1) cores are used where nc is the number of cores as determined by detectCores.

criterium

Information criterion used to select the number of components of the ME fit and s. One of "AIC" and "BIC" (default).

reduceM

Logical indicating if M should be reduced based on the information criterion, default is TRUE.

eps

Covergence threshold used in the EM algorithm (ME part). Default is 10^(-3).

beta_tol

Threshold for the mixing weights below which the corresponding shape parameter vector is considered neglectable (ME part). Default is 10^(-5).

maxiter

Maximum number of iterations in a single EM algorithm execution (ME part). Default is Inf meaning no maximum number of iterations.

Details

See Reynkens et al. (2017), Section 4.3.1 of Albrecher et al. (2017) and Verbelen et al. (2015) for details. The code follows the notation of the latter. Initial values follow from Verbelen et al. (2016).

Value

A SpliceFit object.

Author(s)

Tom Reynkens with R code from Roel Verbelen for fitting the mixed Erlang distribution.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758.

Verbelen, R., Antonio, K. and Claeskens, G. (2016). "Multivariate Mixtures of Erlangs for Density Estimation Under Censoring." Lifetime Data Analysis, 22, 429–455.

Examples

## Not run: 

# GPD random sample
X <- rgpd(1000, gamma = 0.5, sigma = 2)

# Splice ME and GPD
splicefit <- SpliceFitGPD(X, 0.6)



x <- seq(0, 20, 0.01)

# Plot of spliced CDF
plot(x, pSplice(x, splicefit), type="l", xlab="x", ylab="F(x)")

# Plot of spliced PDF
plot(x, dSplice(x, splicefit), type="l", xlab="x", ylab="f(x)")



# Fitted survival function and empirical survival function 
SpliceECDF(x, X, splicefit)

# Log-log plot with empirical survival function and fitted survival function
SpliceLL(x, X, splicefit)

# PP-plot of empirical survival function and fitted survival function
SplicePP(X, splicefit)

# PP-plot of empirical survival function and 
# fitted survival function with log-scales
SplicePP(X, splicefit, log=TRUE)

# Splicing QQ-plot
SpliceQQ(X, splicefit)

## End(Not run)

Splicing of mixed Erlang and Pareto

Description

Fit spliced distribution of a mixed Erlang distribution and Pareto distribution(s). The shape parameter(s) of the Pareto distribution(s) is determined using the Hill estimator.

Usage

SpliceFitPareto(X, const = NULL, tsplice = NULL, M = 3, s = 1:10, trunclower = 0, 
                truncupper = Inf, EVTtruncation = FALSE, ncores = NULL, 
                criterium = c("BIC","AIC"), reduceM = TRUE,
                eps = 10^(-3), beta_tol = 10^(-5), maxiter = Inf)
              
SpliceFitHill(X, const = NULL, tsplice = NULL, M = 3, s = 1:10, trunclower = 0, 
              truncupper = Inf, EVTtruncation = FALSE, ncores = NULL,
              criterium = c("BIC","AIC"), reduceM = TRUE,
              eps = 10^(-3), beta_tol = 10^(-5), maxiter = Inf)

Arguments

X

Data used for fitting the distribution.

const

Vector of length l containing the probabilities of the quantiles where the distributions will be spliced (splicing points). The ME distribution will be spliced with l Pareto distributions. Default is NULL meaning the input from tsplice is used.

tsplice

Vector of length l containing the splicing points. The ME distribution will be spliced with l Pareto distributions. Default is NULL meaning the input from const is used.

M

Initial number of Erlang mixtures, default is 3. This number can change when determining an optimal mixed Erlang fit using an information criterion.

s

Vector of spread factors for the EM algorithm, default is 1:10. We loop over these factors when determining an optimal mixed Erlang fit using an information criterion, see Verbelen et al. (2016).

trunclower

Lower truncation point. Default is 0.

truncupper

Upper truncation point. Default is Inf (no upper truncation). When truncupper=Inf and EVTtruncation=TRUE, the truncation point is estimated using the approach of Beirlant et al. (2016).

EVTtruncation

Logical indicating if the l-th Pareto distribution is a truncated Pareto distribution. Default is FALSE.

ncores

criterium

Information criterion used to select the number of components of the ME fit and s. One of "AIC" and "BIC" (default).

reduceM

Logical indicating if M should be reduced based on the information criterion, default is TRUE.

eps

Covergence threshold used in the EM algorithm (ME part). Default is 10^(-3).

beta_tol

Threshold for the mixing weights below which the corresponding shape parameter vector is considered neglectable (ME part). Default is 10^(-5).

maxiter

Maximum number of iterations in a single EM algorithm execution (ME part). Default is Inf meaning no maximum number of iterations.

Details

The SpliceFitHill function is the same function but with a different name for compatibility with old versions of the package.

Use SpliceFiticPareto when censoring is present.

Value

A SpliceFit object.

Author(s)

Tom Reynkens with R code from Roel Verbelen for fitting the mixed Erlang distribution.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant, J., Fraga Alves, M.I. and Gomes, M.I. (2016). "Tail fitting for Truncated and Non-truncated Pareto-type Distributions." Extremes, 19, 429–462.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758.

Verbelen, R., Antonio, K. and Claeskens, G. (2016). "Multivariate Mixtures of Erlangs for Density Estimation Under Censoring." Lifetime Data Analysis, 22, 429–455.

Examples

## Not run: 

# Pareto random sample
X <- rpareto(1000, shape = 2)

# Splice ME and Pareto
splicefit <- SpliceFitPareto(X, 0.6)



x <- seq(0, 20, 0.01)

# Plot of spliced CDF
plot(x, pSplice(x, splicefit), type="l", xlab="x", ylab="F(x)")

# Plot of spliced PDF
plot(x, dSplice(x, splicefit), type="l", xlab="x", ylab="f(x)")



# Fitted survival function and empirical survival function 
SpliceECDF(x, X, splicefit)

# Log-log plot with empirical survival function and fitted survival function
SpliceLL(x, X, splicefit)

# PP-plot of empirical survival function and fitted survival function
SplicePP(X, splicefit)

# PP-plot of empirical survival function and 
# fitted survival function with log-scales
SplicePP(X, splicefit, log=TRUE)

# Splicing QQ-plot
SpliceQQ(X, splicefit)

## End(Not run)

Splicing of mixed Erlang and Pareto for interval censored data

Description

Fit spliced distribution of a mixed Erlang distribution and a Pareto distribution adapted for interval censoring and truncation.

Usage

SpliceFiticPareto(L, U, censored, tsplice, M = 3, s = 1:10, trunclower = 0, 
                  truncupper = Inf, ncores = NULL, criterium = c("BIC", "AIC"), 
                  reduceM = TRUE, eps = 10^(-3), beta_tol = 10^(-5), maxiter = Inf, 
                  cpp = FALSE)

Arguments

L

Vector of length n with the lower boundaries of the intervals for interval censored data or the observed data for right censored data.

U

Vector of length n with the upper boundaries of the intervals.

censored

A logical vector of length n indicating if an observation is censored.

tsplice

The splicing point t.

M

Initial number of Erlang mixtures, default is 3. This number can change when determining an optimal mixed Erlang fit using an information criterion.

s

Vector of spread factors for the EM algorithm, default is 1:10. We loop over these factors when determining an optimal mixed Erlang fit using an information criterion, see Verbelen et al. (2016).

trunclower

Lower truncation point. Default is 0.

truncupper

Upper truncation point. Default is Inf (no upper truncation).

ncores

criterium

Information criterion used to select the number of components of the ME fit and s. One of "AIC" and "BIC" (default).

reduceM

Logical indicating if M should be reduced based on the information criterion, default is TRUE.

eps

Covergence threshold used in the EM algorithm. Default is 10^(-3).

beta_tol

Threshold for the mixing weights below which the corresponding shape parameter vector is considered neglectable (ME part). Default is 10^(-5).

maxiter

Maximum number of iterations in a single EM algorithm execution. Default is Inf meaning no maximum number of iterations.

cpp

Use C++ implementation (cpp=TRUE) or R implementation (cpp=FALSE) of the algorithm. Default is FALSE meaning the plain R implementation is used.

Details

See Reynkens et al. (2017), Section 4.3.2 of Albrecher et al. (2017) and Verbelen et al. (2015) for details. The code follows the notation of the latter. Initial values follow from Verbelen et al. (2016).

Right censored data should be entered as L=l and U=truncupper, and left censored data should be entered as L=trunclower and U=u.

Value

A SpliceFit object.

Author(s)

Tom Reynkens based on R code from Roel Verbelen for fitting the mixed Erlang distribution (without splicing).

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758.

Verbelen, R., Antonio, K. and Claeskens, G. (2016). "Multivariate Mixtures of Erlangs for Density Estimation Under Censoring." Lifetime Data Analysis, 22, 429–455.

Examples

## Not run: 

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X,Y)

# Censoring indicator
censored <- (X>Y)

# Right boundary
U <- Z
U[censored] <- Inf

# Splice ME and Pareto
splicefit <- SpliceFiticPareto(L=Z, U=U, censored=censored, tsplice=quantile(Z,0.9))



x <- seq(0,20,0.1)

# Plot of spliced CDF
plot(x, pSplice(x, splicefit), type="l", xlab="x", ylab="F(x)")

# Plot of spliced PDF
plot(x, dSplice(x, splicefit), type="l", xlab="x", ylab="f(x)")


# Fitted survival function and Turnbull survival function 
SpliceTB(x, L=Z, U=U, censored=censored, splicefit=splicefit)


# Log-log plot with Turnbull survival function and fitted survival function
SpliceLL_TB(x, L=Z, U=U, censored=censored, splicefit=splicefit)


# PP-plot of Turnbull survival function and fitted survival function
SplicePP_TB(L=Z, U=U, censored=censored, splicefit=splicefit)

# PP-plot of Turnbull survival function and 
# fitted survival function with log-scales
SplicePP_TB(L=Z, U=U, censored=censored, splicefit=splicefit, log=TRUE)

# QQ-plot using Turnbull survival function and fitted survival function
SpliceQQ_TB(L=Z, U=U, censored=censored, splicefit=splicefit)

## End(Not run)

LL-plot with fitted and empirical survival function

Description

This function plots the logarithm of the empirical survival function (determined using the Empirical CDF (ECDF)) versus the logarithm of the data. Moreover, the logarithm of the fitted survival function of the spliced distribution is added.

Usage

SpliceLL(x = sort(X), X, splicefit, plot = TRUE, main = "Splicing LL-plot", ...)

Arguments

x

Vector of points to plot the fitted survival function at. By default we plot it at the data points.

X

Data used for fitting the distribution.

splicefit

A SpliceFit object, e.g. output from SpliceFitPareto or SpliceFitGPD.

plot

Logical indicating if the splicing LL-plot should be made, default is TRUE.

main

Title for the plot, default is "Splicing LL-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The LL-plot consists of the points

(\log(x_{i,n}), \log(1-\hat{F}(x_{i,n})))

for i=1,\ldots,n with n the length of the data, x_{i,n} the i-th smallest observation and \hat{F} the empirical distribution function. Then, the line

(\log(x), \log(1-\hat{F}_{spliced}(x))),

with \hat{F}_{spliced} the fitted spliced distribution function, is added.

Use SpliceLL_TB for censored data.

See Reynkens et al. (2017) and Section 4.3.1 in Albrecher et al. (2017) for more details.

Value

A list with following components:

logX

Vector of the logarithms of the sorted data.

sll.the

Vector of the theoretical log-probabilities \log(1-\hat{F}_{spliced}(x)).

logx

Vector of the logarithms of the points to plot the fitted survival function at.

sll.emp

Vector of the empirical log-probabilities \log(1-\hat{F}(x_{i,n})).

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758

Examples

## Not run: 

# Pareto random sample
X <- rpareto(1000, shape = 2)

# Splice ME and Pareto
splicefit <- SpliceFitPareto(X, 0.6)



x <- seq(0, 20, 0.01)

# Plot of spliced CDF
plot(x, pSplice(x, splicefit), type="l", xlab="x", ylab="F(x)")

# Plot of spliced PDF
plot(x, dSplice(x, splicefit), type="l", xlab="x", ylab="f(x)")



# Fitted survival function and empirical survival function 
SpliceECDF(x, X, splicefit)

# Log-log plot with empirical survival function and fitted survival function
SpliceLL(x, X, splicefit)

# PP-plot of empirical survival function and fitted survival function
SplicePP(X, splicefit)

# PP-plot of empirical survival function and 
# fitted survival function with log-scales
SplicePP(X, splicefit, log=TRUE)

# Splicing QQ-plot
SpliceQQ(X, splicefit)

## End(Not run)

LL-plot with fitted and Turnbull survival function

Description

This function plots the logarithm of the Turnbull survival function (which is suitable for interval censored data) versus the logarithm of the data. Moreover, the logarithm of the fitted survival function of the spliced distribution is added.

Usage

SpliceLL_TB(x = sort(L), L, U = L, censored, splicefit, plot = TRUE,
            main = "Splicing LL-plot", ...)

Arguments

x

Vector of points to plot the fitted survival function at. By default we plot it at the points L.

L

Vector of length n with the lower boundaries of the intervals for interval censored data or the observed data for right censored data.

U

Vector of length n with the upper boundaries of the intervals. By default, they are equal to L.

censored

A logical vector of length n indicating if an observation is censored.

splicefit

A SpliceFit object, e.g. output from SpliceFiticPareto.

plot

Logical indicating if the splicing LL-plot should be made, default is TRUE.

main

Title for the plot, default is "Splicing LL-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The LL-plot consists of the points

(\log(L_{i,n}), \log(1-\hat{F}^{TB}(L_{i,n})))

for i=1,\ldots,n with n the length of the data, x_{i,n} the i-th smallest observation and \hat{F}^{TB} the Turnbull estimator for the distribution function. Then, the line

(\log(x), \log(1-\hat{F}_{spliced}(x))),

with \hat{F}_{spliced} the fitted spliced distribution function, is added.

Right censored data should be entered as L=l and U=truncupper, and left censored data should be entered as L=trunclower and U=u. The limits trunclower and truncupper are obtained from the SpliceFit object.

If the interval package is installed, the icfit function is used to compute the Turnbull estimator. Otherwise, survfit.formula from survival is used.

Use SpliceLL for non-censored data.

See Reynkens et al. (2017) and Section 4.3.2 in Albrecher et al. (2017) for more details.

Value

A list with following components:

logX

Vector of the logarithms of the sorted left boundaries of the intervals.

sll.the

Vector of the theoretical log-probabilities \log(1-\hat{F}_{spliced}(x).

logx

Vector of the logarithms of the points to plot the fitted survival function at.

sll.emp

Vector of the empirical log-probabilities \log(1-\hat{F}^{TB}(x_{i,n})).

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758

Examples

## Not run: 

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X,Y)

# Censoring indicator
censored <- (X>Y)

# Right boundary
U <- Z
U[censored] <- Inf

# Splice ME and Pareto
splicefit <- SpliceFiticPareto(L=Z, U=U, censored=censored, tsplice=quantile(Z,0.9))



x <- seq(0,20,0.1)

# Plot of spliced CDF
plot(x, pSplice(x, splicefit), type="l", xlab="x", ylab="F(x)")

# Plot of spliced PDF
plot(x, dSplice(x, splicefit), type="l", xlab="x", ylab="f(x)")


# Fitted survival function and Turnbull survival function 
SpliceTB(x, L=Z, U=U, censored=censored, splicefit=splicefit)


# Log-log plot with Turnbull survival function and fitted survival function
SpliceLL_TB(x, L=Z, U=U, censored=censored, splicefit=splicefit)


# PP-plot of Turnbull survival function and fitted survival function
SplicePP_TB(L=Z, U=U, censored=censored, splicefit=splicefit)

# PP-plot of Turnbull survival function and 
# fitted survival function with log-scales
SplicePP_TB(L=Z, U=U, censored=censored, splicefit=splicefit, log=TRUE)

# QQ-plot using Turnbull survival function and fitted survival function
SpliceQQ_TB(L=Z, U=U, censored=censored, splicefit=splicefit)

## End(Not run)

PP-plot with fitted and empirical survival function

Description

This function plots the fitted survival function of the spliced distribution versus the empirical survival function (determined using the Empirical CDF (ECDF)).

Usage

SplicePP(X, splicefit, x = sort(X), log = FALSE, plot = TRUE, 
         main = "Splicing PP-plot", ...)

Arguments

X

Data used for fitting the distribution.

splicefit

A SpliceFit object, e.g. output from SpliceFitPareto or SpliceFitGPD.

x

Vector of points to plot the functions at. By default we plot them at the data points.

log

Logical indicating if minus the logarithms of the survival probabilities are plotted versus each other, default is FALSE.

plot

Logical indicating if the splicing PP-plot should be made, default is TRUE.

main

Title for the plot, default is "Splicing PP-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The PP-plot consists of the points

(1-\hat{F}(x_{i,n}), 1-\hat{F}_{spliced}(x_{i,n})))

for i=1,\ldots,n with n the length of the data, x_{i,n} the i-th smallest observation, \hat{F} the empirical distribution function and \hat{F}_{spliced} the fitted spliced distribution function. The minus-log version of the PP-plot consists of

(-\log(1-\hat{F}(x_{i,n})), -\log(1-\hat{F}_{spliced}(x_{i,n})))).

Use SplicePP_TB for censored data.

See Reynkens et al. (2017) and Section 4.3.1 in Albrecher et al. (2017) for more details.

Value

A list with following components:

spp.the

Vector of the theoretical probabilities 1-\hat{F}_{spliced}(x_{i,n}) (when log=FALSE) or -\log(1-\hat{F}_{spliced}(x_{i,n})) (when log=TRUE).

spp.emp

Vector of the empirical probabilities 1-\hat{F}(x_{i,n}) (when log=FALSE) or -\log(1-\hat{F}(x_{i,n})) (when log=TRUE).

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758

Examples

## Not run: 

# Pareto random sample
X <- rpareto(1000, shape = 2)

# Splice ME and Pareto
splicefit <- SpliceFitPareto(X, 0.6)



x <- seq(0, 20, 0.01)

# Plot of spliced CDF
plot(x, pSplice(x, splicefit), type="l", xlab="x", ylab="F(x)")

# Plot of spliced PDF
plot(x, dSplice(x, splicefit), type="l", xlab="x", ylab="f(x)")



# Fitted survival function and empirical survival function 
SpliceECDF(x, X, splicefit)

# Log-log plot with empirical survival function and fitted survival function
SpliceLL(x, X, splicefit)

# PP-plot of empirical survival function and fitted survival function
SplicePP(X, splicefit)

# PP-plot of empirical survival function and 
# fitted survival function with log-scales
SplicePP(X, splicefit, log=TRUE)

# Splicing QQ-plot
SpliceQQ(X, splicefit)

## End(Not run)

PP-plot with fitted and Turnbull survival function

Description

This function plots the fitted survival function of the spliced distribution versus the Turnbull survival function (which is suitable for interval censored data).

Usage

SplicePP_TB(L, U = L, censored, splicefit, x = NULL, log = FALSE, plot = TRUE,
            main = "Splicing PP-plot", ...)

Arguments

L

Vector of length n with the lower boundaries of the intervals for interval censored data or the observed data for right censored data.

U

Vector of length n with the upper boundaries of the intervals. By default, they are equal to L.

censored

A logical vector of length n indicating if an observation is censored.

splicefit

A SpliceFit object, e.g. output from SpliceFiticPareto.

x

Vector of points to plot the functions at. When NULL, the default, the empirical quantiles for 1/(n+1), \ldots, n/(n+1), obtained using the Turnbull estimator, are used.

log

Logical indicating if minus the logarithms of the survival probabilities are plotted versus each other, default is FALSE.

plot

Logical indicating if the splicing PP-plot should be made, default is TRUE.

main

Title for the plot, default is "Splicing PP-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The PP-plot consists of the points

(1-\hat{F}^{TB}(x_i), 1-\hat{F}_{spliced}(x_i)))

for i=1,\ldots,n with n the length of the data, x_i=\hat{Q}^{TB}(p_i) where p_i=i/(n+1), \hat{Q}^{TB} is the quantile function obtained using the Turnbull estimator, \hat{F}^{TB} the Turnbull estimator for the distribution function and \hat{F}_{spliced} the fitted spliced distribution function. The minus-log version of the PP-plot consists of

(-\log(1-\hat{F}^{TB}(x_i)), -\log(1-\hat{F}_{spliced}(x_i))).

If the interval package is installed, the icfit function is used to compute the Turnbull estimator. Otherwise, survfit.formula from survival is used.

Use SplicePP for non-censored data.

See Reynkens et al. (2017) and Section 4.3.2 in Albrecher et al. (2017) for more details.

Value

A list with following components:

spp.the

Vector of the theoretical probabilities 1-\hat{F}_{spliced}(x_i) (when log=FALSE) or -\log(1-\hat{F}_{spliced}(x_i)) (when log=TRUE).

spp.emp

Vector of the empirical probabilities 1-\hat{F}^{TB}(x_i) (when log=FALSE) or -\log(1-\hat{F}^{TB}(x_i)) (when log=TRUE).

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758

Examples

## Not run: 

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X,Y)

# Censoring indicator
censored <- (X>Y)

# Right boundary
U <- Z
U[censored] <- Inf

# Splice ME and Pareto
splicefit <- SpliceFiticPareto(L=Z, U=U, censored=censored, tsplice=quantile(Z,0.9))



x <- seq(0,20,0.1)

# Plot of spliced CDF
plot(x, pSplice(x, splicefit), type="l", xlab="x", ylab="F(x)")

# Plot of spliced PDF
plot(x, dSplice(x, splicefit), type="l", xlab="x", ylab="f(x)")


# Fitted survival function and Turnbull survival function 
SpliceTB(x, L=Z, U=U, censored=censored, splicefit=splicefit)


# Log-log plot with Turnbull survival function and fitted survival function
SpliceLL_TB(x, L=Z, U=U, censored=censored, splicefit=splicefit)


# PP-plot of Turnbull survival function and fitted survival function
SplicePP_TB(L=Z, U=U, censored=censored, splicefit=splicefit)

# PP-plot of Turnbull survival function and 
# fitted survival function with log-scales
SplicePP_TB(L=Z, U=U, censored=censored, splicefit=splicefit, log=TRUE)

# QQ-plot using Turnbull survival function and fitted survival function
SpliceQQ_TB(L=Z, U=U, censored=censored, splicefit=splicefit)

## End(Not run)

Splicing quantile plot

Description

Computes the empirical quantiles of a data vector and the theoretical quantiles of the fitted spliced distribution. These quantiles are then plotted in a splicing QQ-plot with the theoretical quantiles on the x-axis and the empirical quantiles on the y-axis.

Usage

SpliceQQ(X, splicefit, p = NULL, plot = TRUE, main = "Splicing QQ-plot", ...)

Arguments

X

Vector of n observations.

splicefit

A SpliceFit object, e.g. output from SpliceFitPareto or SpliceFitGPD.

p

Vector of probabilities used in the QQ-plot. If NULL, the default, we take p equal to 1/(n+1),...,n/(n+1).

plot

Logical indicating if the quantiles should be plotted in a splicing QQ-plot, default is TRUE.

main

Title for the plot, default is "Splicing QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

This QQ-plot is given by

(Q(p_j), \hat{Q}(p_j)),

for j=1,\ldots,n where Q is the quantile function of the fitted splicing model and \hat{Q} is the empirical quantile function and p_j=j/(n+1).

See Reynkens et al. (2017) and Section 4.3.1 in Albrecher et al. (2017) for more details.

Value

A list with following components:

sqq.the

Vector of the theoretical quantiles of the fitted spliced distribution.

sqq.emp

Vector of the empirical quantiles from the data.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758

Examples

## Not run: 

# Pareto random sample
X <- rpareto(1000, shape = 2)

# Splice ME and Pareto
splicefit <- SpliceFitPareto(X, 0.6)



x <- seq(0, 20, 0.01)

# Plot of spliced CDF
plot(x, pSplice(x, splicefit), type="l", xlab="x", ylab="F(x)")

# Plot of spliced PDF
plot(x, dSplice(x, splicefit), type="l", xlab="x", ylab="f(x)")



# Fitted survival function and empirical survival function 
SpliceECDF(x, X, splicefit)

# Log-log plot with empirical survival function and fitted survival function
SpliceLL(x, X, splicefit)

# PP-plot of empirical survival function and fitted survival function
SplicePP(X, splicefit)

# PP-plot of empirical survival function and 
# fitted survival function with log-scales
SplicePP(X, splicefit, log=TRUE)

# Splicing QQ-plot
SpliceQQ(X, splicefit)

## End(Not run)

Splicing quantile plot using Turnbull estimator

Description

This function plots the fitted quantile function of the spliced distribution versus quantiles based on the Turnbull survival function (which is suitable for interval censored data).

Usage

SpliceQQ_TB(L, U = L, censored, splicefit, p = NULL,
            plot = TRUE, main = "Splicing QQ-plot", ...)

Arguments

L

Vector of length n with the lower boundaries of the intervals for interval censored data or the observed data for right censored data.

U

Vector of length n with the upper boundaries of the intervals. By default, they are equal to L.

censored

A logical vector of length n indicating if an observation is censored.

splicefit

A SpliceFit object, e.g. output from SpliceFiticPareto.

p

Vector of probabilities used in the QQ-plot. If NULL, the default, we take p equal to 1/(n+1),...,n/(n+1).

plot

Logical indicating if the quantiles should be plotted in a splicing QQ-plot, default is TRUE.

main

Title for the plot, default is "Splicing QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

This QQ-plot is given by

(Q(p_j), \hat{Q}^{TB}(p_j)),

for j=1,\ldots,n where Q is the quantile function of the fitted splicing model, \hat{Q}^{TB} the quantile function obtained using the Turnbull estimator and p_j=j/(n+1).

If the interval package is installed, the icfit function is used to compute the Turnbull estimator. Otherwise, survfit.formula from survival is used.

Use SpliceQQ for non-censored data.

See Reynkens et al. (2017) and Section 4.3.2 in Albrecher et al. (2017) for more details.

Value

A list with following components:

sqq.the

Vector of the theoretical quantiles of the fitted spliced distribution.

sqq.emp

Vector of the empirical quantiles from the data (based on the Turnbull estimator).

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758

Examples

## Not run: 

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X,Y)

# Censoring indicator
censored <- (X>Y)

# Right boundary
U <- Z
U[censored] <- Inf

# Splice ME and Pareto
splicefit <- SpliceFiticPareto(L=Z, U=U, censored=censored, tsplice=quantile(Z,0.9))



x <- seq(0,20,0.1)

# Plot of spliced CDF
plot(x, pSplice(x, splicefit), type="l", xlab="x", ylab="F(x)")

# Plot of spliced PDF
plot(x, dSplice(x, splicefit), type="l", xlab="x", ylab="f(x)")


# Fitted survival function and Turnbull survival function 
SpliceTB(x, L=Z, U=U, censored=censored, splicefit=splicefit)


# Log-log plot with Turnbull survival function and fitted survival function
SpliceLL_TB(x, L=Z, U=U, censored=censored, splicefit=splicefit)


# PP-plot of Turnbull survival function and fitted survival function
SplicePP_TB(L=Z, U=U, censored=censored, splicefit=splicefit)

# PP-plot of Turnbull survival function and 
# fitted survival function with log-scales
SplicePP_TB(L=Z, U=U, censored=censored, splicefit=splicefit, log=TRUE)

# QQ-plot using Turnbull survival function and fitted survival function
SpliceQQ_TB(L=Z, U=U, censored=censored, splicefit=splicefit)

## End(Not run)

Plot of fitted and Turnbull survival function

Description

This function plots the fitted survival function of the spliced distribution together with the Turnbull survival function (which is suitable for interval censored data). Moreover, 100(1-\alpha)\% confidence intervals are added.

Usage

SpliceTB(x = sort(L), L, U = L, censored, splicefit, alpha = 0.05, ...)

Arguments

x

Vector of points to plot the functions at. By default we plot it at the points L.

L

Vector of length n with the lower boundaries of the intervals for interval censored data or the observed data for right censored data.

U

Vector of length n with the upper boundaries of the intervals. By default, they are equal to L.

censored

A logical vector of length n indicating if an observation is censored.

splicefit

A SpliceFit object, e.g. output from SpliceFiticPareto.

alpha

100(1-\alpha)\% is the confidence level for the confidence intervals. Default is \alpha=0.05.

...

Additional arguments for the plot function, see plot for more details.

Details

Use SpliceECDF for non-censored data.

See Reynkens et al. (2017) and Section 4.3.2 in Albrecher et al. (2017) for more details.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758

Examples

## Not run: 

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X,Y)

# Censoring indicator
censored <- (X>Y)

# Right boundary
U <- Z
U[censored] <- Inf

# Splice ME and Pareto
splicefit <- SpliceFiticPareto(L=Z, U=U, censored=censored, tsplice=quantile(Z,0.9))



x <- seq(0,20,0.1)

# Plot of spliced CDF
plot(x, pSplice(x, splicefit), type="l", xlab="x", ylab="F(x)")

# Plot of spliced PDF
plot(x, dSplice(x, splicefit), type="l", xlab="x", ylab="f(x)")


# Fitted survival function and Turnbull survival function 
SpliceTB(x, L=Z, U=U, censored=censored, splicefit=splicefit)


# Log-log plot with Turnbull survival function and fitted survival function
SpliceLL_TB(x, L=Z, U=U, censored=censored, splicefit=splicefit)


# PP-plot of Turnbull survival function and fitted survival function
SplicePP_TB(L=Z, U=U, censored=censored, splicefit=splicefit)

# PP-plot of Turnbull survival function and 
# fitted survival function with log-scales
SplicePP_TB(L=Z, U=U, censored=censored, splicefit=splicefit, log=TRUE)

# QQ-plot using Turnbull survival function and fitted survival function
SpliceQQ_TB(L=Z, U=U, censored=censored, splicefit=splicefit)

## End(Not run)

Turnbull estimator

Description

Computes the Turnbull estimator for the survival function of interval censored data.

Usage

Turnbull(x, L, R, censored, trunclower = 0, truncupper = Inf,
         conf.type = "plain", conf.int = 0.95)

Arguments

x

Vector with points to evaluate the estimator in.

L

Vector of length n with the lower boundaries of the intervals.

R

Vector of length n with the upper boundaries of the intervals.

censored

Vector of n logicals indicating if an observation is interval censored.

trunclower

Lower truncation point, default is 0.

truncupper

Upper truncation point, default is Inf.

conf.type

Type of confidence interval, see survfit.formula. Default is "plain".

conf.int

Confidence level of the two-sided confidence interval, see survfit.formula. Default is 0.95.

Details

We consider the random interval censoring model where one observes L \le R and where the variable of interest X lies between L and R.

Right censored data should be entered as L=l and R=truncupper, and right censored data should be entered as L=trunclower and R=r.

This function calls survfit.formula from survival.

See Section 4.3.2 in Albrecher et al. (2017) for more details.

Value

A list with following components:

surv

A vector of length length(x) containing the Turnbull estimator evaluated in the elements of x.

fit

The output from the call to survfit.formula, an object of class survfit.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Turnbull, B. W. (1974). "Nonparametric Estimation of a Survivorship Function with Doubly Censored Data." Journal of the American Statistical Association, 69, 169–173.

Turnbull, B. W. (1976). "The Empirical Distribution Function with Arbitrarily Grouped, Censored and Truncated Data." Journal of the Royal Statistical Society: Series B (Methodological), 38, 290–295.

Examples

L <- 1:10
R <- c(1, 2.5, 3, 4, 5.5, 6, 7.5, 8.25, 9, 10.5)
censored <- c(0, 1, 0, 0, 1, 0, 1, 1, 0, 1)

x <- seq(0, 12, 0.1)

# Turnbull estimator
plot(x, Turnbull(x, L, R, censored)$cdf, type="s", ylab="Turnbull estimator")

VaR of splicing fit

Description

Compute Value-at-Risk (VaR_{1-p}=Q(1-p)) of the fitted spliced distribution.

Usage

VaR(p, splicefit)

Arguments

p

The exceedance probability (we estimate VaR_{1-p}=Q(1-p)).

splicefit

A SpliceFit object, e.g. output from SpliceFitPareto, SpliceFiticPareto or SpliceFitGPD.

Details

See Reynkens et al. (2017) and Section 4.6 of Albrecher et al. (2017) for details.

Note that VaR(p, splicefit) corresponds to qSplice(p, splicefit, lower.tail = FALSE).

Value

Vector of quantiles VaR_{1-p}=Q(1-p).

Author(s)

Tom Reynkens with R code from Roel Verbelen for the mixed Erlang quantiles.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Verbelen, R., Gong, L., Antonio, K., Badescu, A. and Lin, S. (2015). "Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm." Astin Bulletin, 45, 729–758

Examples

## Not run: 

# Pareto random sample
X <- rpareto(1000, shape = 2)

# Splice ME and Pareto
splicefit <- SpliceFitPareto(X, 0.6)



p <- seq(0,1,0.01)

# Plot of quantiles
plot(p, qSplice(p, splicefit), type="l", xlab="p", ylab="Q(p)")

# Plot of VaR
plot(p, VaR(p, splicefit), type="l", xlab="p", ylab=bquote(VaR[1-p]))

## End(Not run)

Weibull quantile plot

Description

Computes the empirical quantiles of the log-transform of a data vector and the theoretical quantiles of the standard Weibull distribution. These quantiles are then plotted in a Weibull QQ-plot with the theoretical quantiles on the x-axis and the empirical quantiles on the y-axis.

Usage

WeibullQQ(data, plot = TRUE, main = "Weibull QQ-plot", ...)

Arguments

data

Vector of n observations.

plot

Logical indicating if the quantiles should be plotted in a Weibull QQ-plot, default is TRUE.

main

Title for the plot, default is "Weibull QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The Weibull QQ-plot is given by

(\log(-\log(1-i/(n+1))),\log X_{i,n})

for i=1,...,n, with X_{i,n} the i-th order statistic of the data.

See Section 4.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

wqq.the

Vector of the theoretical quantiles from a standard Weibull distribution.

wqq.emp

Vector of the empirical quantiles from the log-transformed data.

Author(s)

Tom Reynkens.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

data(norwegianfire)

# Weibull QQ-plot for Norwegian Fire Insurance data for claims in 1976.
WeibullQQ(norwegianfire$size[norwegianfire$year==76])

# Derivative of Weibull QQ-plot for Norwegian Fire Insurance data for claims in 1976.
WeibullQQ_der(norwegianfire$size[norwegianfire$year==76])

Derivative plot of the Weibull QQ-plot

Description

Computes the derivative plot of the Weibull QQ-plot. These values can be plotted as a function of the data or as a function of the tail parameter k.

Usage

WeibullQQ_der(data, k = FALSE, plot = TRUE, 
              main = "Derivative plot of Weibull QQ-plot", ...)

Arguments

data

Vector of n observations.

plot

Logical indicating if the derivative values should be plotted, default is TRUE.

k

Logical indicating if the derivative values are plotted as a function of the tail parameter k (k=TRUE) or as a function of the logarithm of the data (k=FALSE). Default is FALSE.

main

Title for the plot, default is "Derivative plot of Weibull QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The derivative plot of a Weibull QQ-plot is

(k, H_{k,n}/W_{k,n})

(\log X_{n-k,n}, H_{k,n}/W_{k,n})

with H_{k,n} the Hill estimates and

W_{k,n}=1/k\sum_{j=1}^k \log(\log((n+1)/j)) - \log(\log((n+1)/(k+1))).

See Section 4.1 of Albrecher et al. (2017) for more details.

Value

A list with following components:

xval

Vector of the x-values of the plot (k or \log X_{n-k,n}).

yval

Vector of the derivative values.

Author(s)

Tom Reynkens.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

data(norwegianfire)

# Weibull QQ-plot for Norwegian Fire Insurance data for claims in 1976.
WeibullQQ(norwegianfire$size[norwegianfire$year==76])

# Derivative of Weibull QQ-plot for Norwegian Fire Insurance data for claims in 1976.
WeibullQQ_der(norwegianfire$size[norwegianfire$year==76])

EPD estimator for right censored data

Description

Computes the EPD estimates adapted for right censored data.

Usage

cEPD(data, censored, rho = -1, beta = NULL, logk = FALSE, 
     plot = FALSE, add = FALSE, main = "EPD estimates of the EVI", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

rho

A parameter for the \rho-estimator of Fraga Alves et al. (2003) when strictly positive or choice(s) for \rho if negative. Default is -1.

beta

Parameter for EPD (\beta=-\rho/\gamma). If NULL (default), beta is estimated by -\rho/H_{k,n} with H_{k,n} the Hill estimator.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates of \gamma should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of \gamma should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "EPD estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

The function EPD uses \tau which is equal to -\beta.

This estimator is only suitable for right censored data.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma1

Vector of the corresponding estimates for the \gamma parameter of the EPD.

kappa1

Vector of the corresponding MLE estimates for the \kappa parameter of the EPD.

beta

Vector of estimates for (or values of) the \beta parameter of the EPD.

Delta

Difference between gamma1 and the Hill estimator for censored data.

Author(s)

Tom Reynkens based on R code from Anastasios Bardoutsos.

References

Beirlant, J., Bardoutsos, A., de Wet, T. and Gijbels, I. (2016). "Bias Reduced Tail Estimation for Censored Pareto Type Distributions." Statistics & Probability Letters, 109, 78–88.

Fraga Alves, M.I. , Gomes, M.I. and de Haan, L. (2003). "A New Class of Semi-parametric Estimators of the Second Order Parameter." Portugaliae Mathematica, 60, 193–214.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# EPD estimator adapted for right censoring
cepd <- cEPD(Z, censored=censored, plot=TRUE)

Exponential quantile plot for right censored data

Description

Exponential QQ-plot adapted for right censored data.

Usage

cExpQQ(data, censored, plot = TRUE, main = "Exponential QQ-plot", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

plot

Logical indicating if the quantiles should be plotted in an exponential QQ-plot, default is TRUE.

main

Title for the plot, default is "Exponential QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The exponential QQ-plot adapted for right censoring is given by

( -\log(1-F_{km}(Z_{j,n})), Z_{j,n} )

for j=1,\ldots,n-1, with Z_{i,n} the i-th order statistic of the data and F_{km} the Kaplan-Meier estimator for the CDF. Hence, it has the same empirical quantiles as an ordinary exponential QQ-plot but replaces the theoretical quantiles -\log(1-j/(n+1)) by -\log(1-F_{km}(Z_{j,n})).

This QQ-plot is only suitable for right censored data.

In Beirlant et al. (2007), only a Pareto QQ-plot adapted for right-censored data is proposed. This QQ-plot is constructed using the same ideas, but is not described in the paper.

Value

A list with following components:

eqq.the

Vector of the theoretical quantiles, see Details.

eqq.emp

Vector of the empirical quantiles from the data.

Author(s)

Tom Reynkens

References

Beirlant, J., Guillou, A., Dierckx, G. and Fils-Villetard, A. (2007). "Estimation of the Extreme Value Index and Extreme Quantiles Under Random Censoring." Extremes, 10, 151–174.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# Exponential QQ-plot adapted for right censoring
cExpQQ(Z, censored=censored)

GPD-ML estimator for right censored data

Description

Computes ML estimates of fitting GPD to peaks over a threshold adapted for right censoring.

Usage

cGPDmle(data, censored, start = c(0.1,1), warnings = FALSE, logk = FALSE, 
        plot = FALSE, add = FALSE, main = "POT estimates of the EVI", ...)
      
cPOT(data, censored, start = c(0.1,1), warnings = FALSE, logk = FALSE, 
     plot = FALSE, add = FALSE, main = "POT estimates of the EVI", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

start

Vector of length 2 containing the starting values for the optimisation. The first element is the starting value for the estimator of \gamma_1 and the second element is the starting value for the estimator of \sigma_1. Default is c(0.1,1).

warnings

Logical indicating if possible warnings from the optimisation function are shown, default is FALSE.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates of \gamma_1 should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of \gamma_1 should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "POT estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

The GPD-MLE estimator for the EVI adapted for right censored data is equal to the ordinary GPD-MLE estimator for the EVI divided by the proportion of the k largest observations that is non-censored. The estimates for \sigma are the ordinary GPD-MLE estimates for \sigma.

This estimator is only suitable for right censored data.

cPOT is the same function but with a different name for compatibility with POT.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma1

Vector of the corresponding MLE estimates for the \gamma_1 parameter of the GPD.

sigma1

Vector of the corresponding MLE estimates for the \sigma_1 parameter of the GPD.

Author(s)

Tom Reynkens

References

Einmahl, J.H.J., Fils-Villetard, A. and Guillou, A. (2008). "Statistics of Extremes Under Random Censoring." Bernoulli, 14, 207–227.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# GPD-ML estimator adapted for right censoring
cpot <- cGPDmle(Z, censored=censored, plot=TRUE)

Hill estimator for right censored data

Description

Computes the Hill estimator for positive extreme value indices, adapted for right censoring, as a function of the tail parameter k (Beirlant et al., 2007). Optionally, these estimates are plotted as a function of k.

Usage

cHill(data, censored, logk = FALSE, plot = FALSE, add = FALSE, 
      main = "Hill estimates of the EVI", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates of \gamma_1 should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of \gamma_1 should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Hill estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

The Hill estimator adapted for right censored data is equal to the ordinary Hill estimator H_{k,n} divided by the proportion of the k largest observations that is non-censored.

This estimator is only suitable for right censored data, use icHill for interval censored data.

See Section 4.3.2 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma1

Vector of the corresponding Hill estimates.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant, J., Guillou, A., Dierckx, G. and Fils-Villetard, A. (2007). "Estimation of the Extreme Value Index and Extreme Quantiles Under Random Censoring." Extremes, 10, 151–174.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# Hill estimator adapted for right censoring
chill <- cHill(Z, censored=censored, plot=TRUE)

Log-normal quantile plot for right censored data

Description

Log-normal QQ-plot adapted for right censored data.

Usage

cLognormalQQ(data, censored, plot = TRUE, main = "Log-normal QQ-plot", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

plot

Logical indicating if the quantiles should be plotted in a log-normal QQ-plot, default is TRUE.

main

Title for the plot, default is "Log-normal QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The log-normal QQ-plot adapted for right censoring is given by

( \Phi^{-1}(F_{km}(Z_{j,n})), \log(Z_{j,n}) )

for j=1,\ldots,n-1, with Z_{i,n} the i-th order statistic of the data, \Phi^{-1} the quantile function of the standard normal distribution and F_{km} the Kaplan-Meier estimator for the CDF. Hence, it has the same empirical quantiles as an ordinary log-normal QQ-plot but replaces the theoretical quantiles \Phi^{-1}(j/(n+1)) by \Phi^{-1}(F_{km}(Z_{j,n})).

This QQ-plot is only suitable for right censored data.

In Beirlant et al. (2007), only a Pareto QQ-plot adapted for right-censored data is proposed. This QQ-plot is constructed using the same ideas, but is not described in the paper.

Value

A list with following components:

lqq.the

Vector of the theoretical quantiles, see Details.

lqq.emp

Vector of the empirical quantiles from the log-transformed data.

Author(s)

Tom Reynkens

References

Beirlant, J., Guillou, A., Dierckx, G. and Fils-Villetard, A. (2007). "Estimation of the Extreme Value Index and Extreme Quantiles Under Random Censoring." Extremes, 10, 151–174.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# Log-normal QQ-plot adapted for right censoring
cLognormalQQ(Z, censored=censored)

MOM estimator for right censored data

Description

Computes the Method of Moment estimates adapted for right censored data.

Usage

cMoment(data, censored, logk = FALSE, plot = FALSE, add = FALSE, 
        main = "Moment estimates of the EVI", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates of \gamma_1 should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of \gamma_1 should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Moment estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

The moment estimator adapted for right censored data is equal to the ordinary moment estimator divided by the proportion of the k largest observations that is non-censored.

This estimator is only suitable for right censored data.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma1

Vector of the corresponding moment estimates.

Author(s)

Tom Reynkens

References

Einmahl, J.H.J., Fils-Villetard, A. and Guillou, A. (2008). "Statistics of Extremes Under Random Censoring." Bernoulli, 14, 207–227.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# Moment estimator adapted for right censoring
cmom <- cMoment(Z, censored=censored, plot=TRUE)

Pareto quantile plot for right censored data

Description

Pareto QQ-plot adapted for right censored data.

Usage

cParetoQQ(data, censored, plot = TRUE, main = "Pareto QQ-plot", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

plot

Logical indicating if the quantiles should be plotted in a Pareto QQ-plot, default is TRUE.

main

Title for the plot, default is "Pareto QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The Pareto QQ-plot adapted for right censoring is given by

( -\log(1-F_{km}(Z_{j,n})), \log Z_{j,n} )

for j=1,\ldots,n-1, with Z_{i,n} the i-th order statistic of the data and F_{km} the Kaplan-Meier estimator for the CDF. Hence, it has the same empirical quantiles as an ordinary Pareto QQ-plot but replaces the theoretical quantiles -\log(1-j/(n+1)) by -\log(1-F_{km}(Z_{j,n})).

This QQ-plot is only suitable for right censored data, use icParetoQQ for interval censored data.

Value

A list with following components:

pqq.the

Vector of the theoretical quantiles, see Details.

pqq.emp

Vector of the empirical quantiles from the log-transformed data.

Author(s)

Tom Reynkens

References

Beirlant, J., Guillou, A., Dierckx, G. and Fils-Villetard, A. (2007). "Estimation of the Extreme Value Index and Extreme Quantiles Under Random Censoring." Extremes, 10, 151–174.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# Pareto QQ-plot adapted for right censoring
cParetoQQ(Z, censored=censored)

Estimator of small exceedance probabilities and large return periods using censored Hill

Description

Computes estimates of a small exceedance probability P(X>q) or large return period 1/P(X>q) using the estimates for the EVI obtained from the Hill estimator adapted for right censoring.

Usage

cProb(data, censored, gamma1, q, plot = FALSE, add = FALSE, 
      main = "Estimates of small exceedance probability", ...)
      
cReturn(data, censored, gamma1, q, plot = FALSE, add = FALSE, 
        main = "Estimates of large return period", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

gamma1

Vector of n-1 estimates for the EVI obtained from cHill.

q

The used large quantile (we estimate P(X>q) or 1/P(X>q) for q large).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of small exceedance probability" for cProb and "Estimates of large return period" for cReturn.

...

Additional arguments for the plot function, see plot for more details.

Details

The probability is estimated as

\hat{P}(X>q)=(1-km) \times (q/Z_{n-k,n})^{-1/H_{k,n}^c}

with Z_{i,n} the i-th order statistic of the data, H_{k,n}^c the Hill estimator adapted for right censoring and km the Kaplan-Meier estimator for the CDF evaluated in Z_{n-k,n}.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

P

Vector of the corresponding probability estimates, only returned for cProb.

R

Vector of the corresponding estimates for the return period, only returned for cReturn.

q

The used large quantile.

Author(s)

Tom Reynkens

References

Beirlant, J., Guillou, A., Dierckx, G. and Fils-Villetard, A. (2007). "Estimation of the Extreme Value Index and Extreme Quantiles Under Random Censoring." Extremes, 10, 151–174.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# Hill estimator adapted for right censoring
chill <- cHill(Z, censored=censored, plot=TRUE)

# Small exceedance probability
q <- 10
cProb(Z, censored=censored, gamma1=chill$gamma1, q=q, plot=TRUE)

# Return period
cReturn(Z, censored=censored, gamma1=chill$gamma1, q=q, plot=TRUE)

Estimator of small exceedance probabilities and large return periods using censored EPD

Description

Computes estimates of a small exceedance probability P(X>q) or large return period 1/P(X>q) using the parameters from the EPD fit adapted for right censoring.

Usage

cProbEPD(data, censored, gamma1, kappa1, beta, q, plot = FALSE, add = FALSE,
         main = "Estimates of small exceedance probability", ...)

cReturnEPD(data, censored, gamma1, kappa1, beta, q, plot = FALSE, add = FALSE,
           main = "Estimates of large return period", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

gamma1

Vector of n-1 estimates for the EVI obtained from cEPD.

kappa1

Vector of n-1 estimates for \kappa_1 obtained from cEPD.

beta

Vector of n-1 estimates for \beta obtained from cEPD.

q

The used large quantile (we estimate P(X>q) or 1/P(X>q) for q large).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of small exceedance probability" for cProbEPD and "Estimates of large return period" for cReturnEPD.

...

Additional arguments for the plot function, see plot for more details.

Details

The probability is estimated as

\hat{P}(X>q)= (1-km) \times (1-F(q))

with F the CDF of the EPD with estimated parameters \hat{\gamma}_1, \hat{\kappa}_1 and \hat{\tau}=-\hat{\beta} and km the Kaplan-Meier estimator for the CDF evaluated in Z_{n-k,n} (the (k+1)-th largest data point).

Value

A list with following components:

k

Vector of the values of the tail parameter k.

P

Vector of the corresponding probability estimates, only returned for cProbEPD.

R

Vector of the corresponding estimates for the return period, only returned for cReturnEPD.

q

The used large quantile.

Author(s)

Tom Reynkens.

References

Beirlant, J., Bardoutsos, A., de Wet, T. and Gijbels, I. (2016). "Bias Reduced Tail Estimation for Censored Pareto Type Distributions." Statistics & Probability Letters, 109, 78–88.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# EPD estimator adapted for right censoring
cepd <- cEPD(Z, censored=censored, plot=TRUE)

# Small exceedance probability
q <- 10
cProbEPD(Z, censored=censored, gamma1=cepd$gamma1,
        kappa1=cepd$kappa1, beta=cepd$beta, q=q, plot=TRUE)

# Return period
cReturnEPD(Z, censored=censored, gamma1=cepd$gamma1,
        kappa1=cepd$kappa1, beta=cepd$beta, q=q, plot=TRUE)

Estimator of small exceedance probabilities and large return periods using censored generalised Hill

Description

Computes estimates of a small exceedance probability P(X>q) or large return period 1/P(X>q) using the estimates for the EVI obtained from the generalised Hill estimator adapted for right censoring.

Usage

cProbGH(data, censored, gamma1, q, plot = FALSE, add = FALSE, 
        main = "Estimates of small exceedance probability", ...)

cReturnGH(data, censored, gamma1, q, plot = FALSE, add = FALSE, 
          main = "Estimates of large return period", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

gamma1

Vector of n-1 estimates for the EVI obtained from cgenHill.

q

The used large quantile (we estimate P(X>q) or 1/P(X>q) for q large).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of small exceedance probability" for cProbGH and "Estimates of large return period" for cReturnGH.

...

Additional arguments for the plot function, see plot for more details.

Details

The probability is estimated as

\hat{P}(X>q)=(1-km) \times (1+ \hat{\gamma}_1/a_{k,n} \times (q-Z_{n-k,n}))^{-1/\hat{\gamma}_1}

with Z_{i,n} the i-th order statistic of the data, \hat{\gamma}_1 the generalised Hill estimator adapted for right censoring and km the Kaplan-Meier estimator for the CDF evaluated in Z_{n-k,n}. The value a is defined as

a_{k,n} = Z_{n-k,n} H_{k,n} (1-\min(\hat{\gamma}_1,0)) / \hat{p}_k

with H_{k,n} the ordinary Hill estimator and \hat{p}_k the proportion of the k largest observations that is non-censored.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

P

Vector of the corresponding probability estimates, only returned for cProbGH.

R

Vector of the corresponding estimates for the return period, only returned for cReturnGH.

q

The used large quantile.

Author(s)

Tom Reynkens

References

Einmahl, J.H.J., Fils-Villetard, A. and Guillou, A. (2008). "Statistics of Extremes Under Random Censoring." Bernoulli, 14, 207–227.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# Generalised Hill estimator adapted for right censoring
cghill <- cgenHill(Z, censored=censored, plot=TRUE)

# Small exceedance probability
q <- 10
cProbGH(Z, censored=censored, gamma1=cghill$gamma1, q=q, plot=TRUE)

# Return period
cReturnGH(Z, censored=censored, gamma1=cghill$gamma1, q=q, plot=TRUE)

Estimator of small exceedance probabilities and large return periods using censored GPD-MLE

Description

Computes estimates of a small exceedance probability P(X>q) or large return period 1/P(X>q) using the GPD-ML estimator adapted for right censoring.

Usage

cProbGPD(data, censored, gamma1, sigma1, q, plot = FALSE, add = FALSE,
         main = "Estimates of small exceedance probability", ...)

cReturnGPD(data, censored, gamma1, sigma1, q, plot = FALSE, add = FALSE,
           main = "Estimates of large return period", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

gamma1

Vector of n-1 estimates for the EVI obtained from cGPDmle.

sigma1

Vector of n-1 estimates for \sigma_1 obtained from cGPDmle.

q

The used large quantile (we estimate P(X>q) or 1/P(X>q) for q large).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of small exceedance probability" for cProbGPD and "Estimates of large return period" for cReturnGPD.

...

Additional arguments for the plot function, see plot for more details.

Details

The probability is estimated as

\hat{P}(X>q)=(1-km) \times (1+ \hat{\gamma}_1/a_{k,n} \times (q-Z_{n-k,n}))^{-1/\hat{\gamma}_1}

a_{k,n} = \hat{\sigma}_1 / \hat{p}_k

with \hat{\sigma}_1 the ML estimate for \sigma_1 and \hat{p}_k the proportion of the k largest observations that is non-censored.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

P

Vector of the corresponding probability estimates, only returned for cProbGPD.

R

Vector of the corresponding estimates for the return period, only returned for cReturnGPD.

q

The used large quantile.

Author(s)

Tom Reynkens

References

Einmahl, J.H.J., Fils-Villetard, A. and Guillou, A. (2008). "Statistics of Extremes Under Random Censoring." Bernoulli, 14, 207–227.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# GPD-MLE estimator adapted for right censoring
cpot <- cGPDmle(Z, censored=censored, plot=TRUE)

# Exceedance probability
q <- 10
cProbGPD(Z, gamma1=cpot$gamma1, sigma1=cpot$sigma1,
         censored=censored, q=q, plot=TRUE)
         
# Return period
cReturnGPD(Z, gamma1=cpot$gamma1, sigma1=cpot$sigma1,
         censored=censored, q=q, plot=TRUE)

Estimator of small exceedance probabilities and large return periods using censored MOM

Description

Computes estimates of a small exceedance probability P(X>q) or large return period 1/P(X>q) using the Method of Moments estimates for the EVI adapted for right censoring.

Usage

cProbMOM(data, censored, gamma1, q, plot = FALSE, add = FALSE, 
         main = "Estimates of small exceedance probability", ...)

cReturnMOM(data, censored, gamma1, q, plot = FALSE, add = FALSE, 
           main = "Estimates of large return period", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

gamma1

Vector of n-1 estimates for the EVI obtained from cMoment.

q

The used large quantile (we estimate P(X>q) or 1/P(X>q) for q large).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of small exceedance probability" for cProbMOM and "Estimates of large return period" for cReturnMOM.

...

Additional arguments for the plot function, see plot for more details.

Details

The probability is estimated as

\hat{P}(X>q)=(1-km) \times (1+ \hat{\gamma}_1/a_{k,n} \times (q-Z_{n-k,n}))^{-1/\hat{\gamma}_1}

with Z_{i,n} the i-th order statistic of the data, \hat{\gamma}_1 the MOM estimator adapted for right censoring and km the Kaplan-Meier estimator for the CDF evaluated in Z_{n-k,n}. The value a is defined as

a_{k,n}= Z_{n-k,n} H_{k,n} (1-\min(\hat{\gamma}_1,0)) /\hat{p}_k

with H_{k,n} the ordinary Hill estimator and \hat{p}_k the proportion of the k largest observations that is non-censored.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

P

Vector of the corresponding probability estimates, only returned for cProbMOM.

R

Vector of the corresponding estimates for the return period, only returned for cReturnMOM.

q

The used large quantile.

Author(s)

Tom Reynkens

References

Einmahl, J.H.J., Fils-Villetard, A. and Guillou, A. (2008). "Statistics of Extremes Under Random Censoring." Bernoulli, 14, 207–227.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# Moment estimator adapted for right censoring
cmom <- cMoment(Z, censored=censored, plot=TRUE)

# Small exceedance probability
q <- 10
cProbMOM(Z, censored=censored, gamma1=cmom$gamma1, q=q, plot=TRUE)

# Return period
cReturnMOM(Z, censored=censored, gamma1=cmom$gamma1, q=q, plot=TRUE)

Estimator of large quantiles using censored Hill

Description

Computes estimates of large quantiles Q(1-p) using the estimates for the EVI obtained from the Hill estimator adapted for right censoring.

Usage

cQuant(data, censored, gamma1, p, plot = FALSE, add = FALSE, 
       main = "Estimates of extreme quantile", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

gamma1

Vector of n-1 estimates for the EVI obtained from cHill.

p

The exceedance probability of the quantile (we estimate Q(1-p) for p small).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of extreme quantile".

...

Additional arguments for the plot function, see plot for more details.

Details

The quantile is estimated as

\hat{Q}(1-p)=Z_{n-k,n} \times ( (1-km)/p)^{H_{k,n}^c}

with Z_{i,n} the i-th order statistic of the data, H_{k,n}^c the Hill estimator adapted for right censoring and km the Kaplan-Meier estimator for the CDF evaluated in Z_{n-k,n}.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

Q

Vector of the corresponding quantile estimates.

p

The used exceedance probability.

Author(s)

Tom Reynkens.

References

Beirlant, J., Guillou, A., Dierckx, G. and Fils-Villetard, A. (2007). "Estimation of the Extreme Value Index and Extreme Quantiles Under Random Censoring." Extremes, 10, 151–174.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# Hill estimator adapted for right censoring
chill <- cHill(Z, censored=censored, plot=TRUE)

# Large quantile
p <- 10^(-4)
cQuant(Z, gamma1=chill$gamma, censored=censored, p=p, plot=TRUE)

Estimator of large quantiles using censored Hill

Description

Computes estimates of large quantiles Q(1-p) using the estimates for the EVI obtained from the generalised Hill estimator adapted for right censoring.

Usage

cQuantGH(data, censored, gamma1, p, plot = FALSE, add = FALSE, 
         main = "Estimates of extreme quantile", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

gamma1

Vector of n-1 estimates for the EVI obtained from cgenHill.

p

The exceedance probability of the quantile (we estimate Q(1-p) for p small).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of extreme quantile".

...

Additional arguments for the plot function, see plot for more details.

Details

The quantile is estimated as

\hat{Q}(1-p)= Z_{n-k,n} + a_{k,n} ( ( (1-km)/p)^{\hat{\gamma}_1} -1 ) / \hat{\gamma}_1)

a_{k,n} = Z_{n-k,n} H_{k,n} (1-S_{Z,k,n}) / \hat{p}_k

with H_{k,n} the ordinary Hill estimator and \hat{p}_k the proportion of the k largest observations that is non-censored, and

S_{Z,k,n} = 1 - (1-M_1^2/M_2)^(-1) / 2

with

M_l = =1/k\sum_{j=1}^k (\log X_{n-j+1,n}- \log X_{n-k,n})^l.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

Q

Vector of the corresponding quantile estimates.

p

The used exceedance probability.

Author(s)

Tom Reynkens

References

Einmahl, J.H.J., Fils-Villetard, A. and Guillou, A. (2008). "Statistics of Extremes Under Random Censoring." Bernoulli, 14, 207–227.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# Generalised Hill estimator adapted for right censoring
cghill <- cgenHill(Z, censored=censored, plot=TRUE)

# Large quantile
p <- 10^(-4)
cQuantGH(Z, gamma1=cghill$gamma, censored=censored, p=p, plot=TRUE)

Estimator of large quantiles using censored GPD-MLE

Description

Computes estimates of large quantiles Q(1-p) using the estimates for the EVI obtained from the GPD-ML estimator adapted for right censoring.

Usage

cQuantGPD(data, censored, gamma1, sigma1, p, plot = FALSE, add = FALSE, 
          main = "Estimates of extreme quantile", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

gamma1

Vector of n-1 estimates for the EVI obtained from cGPDmle.

sigma1

Vector of n-1 estimates for \sigma_1 obtained from cGPDmle.

p

The exceedance probability of the quantile (we estimate Q(1-p) for p small).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of extreme quantile".

...

Additional arguments for the plot function, see plot for more details.

Details

The quantile is estimated as

\hat{Q}(1-p)= Z_{n-k,n} + a_{k,n} ( ( (1-km)/p)^{\hat{\gamma}_1} -1 ) / \hat{\gamma}_1)

ith Z_{i,n} the i-th order statistic of the data, \hat{\gamma}_1 the generalised Hill estimator adapted for right censoring and km the Kaplan-Meier estimator for the CDF evaluated in Z_{n-k,n}. The value a is defined as

a_{k,n} = \hat{\sigma}_1 / \hat{p}_k

with \hat{\sigma}_1 the ML estimate for \sigma_1 and \hat{p}_k the proportion of the k largest observations that is non-censored.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

Q

Vector of the corresponding quantile estimates.

p

The used exceedance probability.

Author(s)

Tom Reynkens

References

Einmahl, J.H.J., Fils-Villetard, A. and Guillou, A. (2008). "Statistics of Extremes Under Random Censoring." Bernoulli, 14, 207–227.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# GPD-MLE estimator adapted for right censoring
cpot <- cGPDmle(Z, censored=censored, plot=TRUE)

# Large quantile
p <- 10^(-4)
cQuantGPD(Z, gamma1=cpot$gamma1, sigma1=cpot$sigma1,
         censored=censored, p=p, plot=TRUE)

Estimator of large quantiles using censored MOM

Description

Computes estimates of large quantiles Q(1-p) using the estimates for the EVI obtained from the MOM estimator adapted for right censoring.

Usage

cQuantMOM(data, censored, gamma1, p, plot = FALSE, add = FALSE,
          main = "Estimates of extreme quantile", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

gamma1

Vector of n-1 estimates for the EVI obtained from cMoment.

p

The exceedance probability of the quantile (we estimate Q(1-p) for p small).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of extreme quantile".

...

Additional arguments for the plot function, see plot for more details.

Details

The quantile is estimated as

\hat{Q}(1-p)= Z_{n-k,n} + a_{k,n} ( ( (1-km)/p)^{\hat{\gamma}_1} -1 ) / \hat{\gamma}_1)

ith Z_{i,n} the i-th order statistic of the data, \hat{\gamma}_1 the MOM estimator adapted for right censoring and km the Kaplan-Meier estimator for the CDF evaluated in Z_{n-k,n}. The value a is defined as

a_{k,n} = Z_{n-k,n} H_{k,n} (1-\min(\hat{\gamma}_1,0)) /\hat{p}_k

with H_{k,n} the ordinary Hill estimator and \hat{p}_k the proportion of the k largest observations that is non-censored.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

Q

Vector of the corresponding quantile estimates.

p

The used exceedance probability.

Author(s)

Tom Reynkens

References

Einmahl, J.H.J., Fils-Villetard, A. and Guillou, A. (2008). "Statistics of Extremes Under Random Censoring." Bernoulli, 14, 207–227.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# Moment estimator adapted for right censoring
cmom <- cMoment(Z, censored=censored, plot=TRUE)

# Large quantile
p <- 10^(-4)
cQuantMOM(Z, censored=censored, gamma1=cmom$gamma1, p=p, plot=TRUE)

Weibull quantile plot for right censored data

Description

Weibull QQ-plot adapted for right censored data.

Usage

cWeibullQQ(data, censored, plot = TRUE, main = "Weibull QQ-plot", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

plot

Logical indicating if the quantiles should be plotted in a Weibull QQ-plot, default is TRUE.

main

Title for the plot, default is "Weibull QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The Weibull QQ-plot adapted for right censoring is given by

( \log(-\log(1-F_{km}(Z_{j,n}))), \log(Z_{j,n}) )

for j=1,\ldots,n-1, with Z_{i,n} the i-th order statistic of the data and F_{km} the Kaplan-Meier estimator for the CDF. Hence, it has the same empirical quantiles as an ordinary Weibull QQ-plot but replaces the theoretical quantiles \log(-\log(1-j/(n+1))) by \log(-\log(1-F_{km}(Z_{j,n}))).

This QQ-plot is only suitable for right censored data.

In Beirlant et al. (2007), only a Pareto QQ-plot adapted for right-censored data is proposed. This QQ-plot is constructed using the same ideas, but is not described in the paper.

Value

A list with following components:

wqq.the

Vector of the theoretical quantiles, see Details.

wqq.emp

Vector of the empirical quantiles from the log-transformed data.

Author(s)

Tom Reynkens

References

Beirlant, J., Guillou, A., Dierckx, G. and Fils-Villetard, A. (2007). "Estimation of the Extreme Value Index and Extreme Quantiles Under Random Censoring." Extremes, 10, 151–174.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# Weibull QQ-plot adapted for right censoring
cWeibullQQ(Z, censored=censored)

Generalised Hill estimator for right censored data

Description

Computes the generalised Hill estimates adapted for right censored data.

Usage

cgenHill(data, censored, logk = FALSE, plot = FALSE, add = FALSE, 
         main = "Generalised Hill estimates of the EVI", ...)

Arguments

data

Vector of n observations.

censored

A logical vector of length n indicating if an observation is censored.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates of \gamma_1 should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of \gamma_1 should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Generalised Hill estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

The generalised Hill estimator adapted for right censored data is equal to the ordinary generalised Hill estimator divided by the proportion of the k largest observations that is non-censored.

This estimator is only suitable for right censored data.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma1

Vector of the corresponding generalised Hill estimates.

Author(s)

Tom Reynkens

References

Einmahl, J.H.J., Fils-Villetard, A. and Guillou, A. (2008). "Statistics of Extremes Under Random Censoring." Bernoulli, 14, 207–227.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X, Y)

# Censoring indicator
censored <- (X>Y)

# Generalised Hill estimator adapted for right censoring
cghill <- cgenHill(Z, censored=censored, plot=TRUE)

Hill-type estimator for the conditional EVI

Description

Hill-type estimator for the conditional Extreme Value Index (EVI) adapted for censored data.

Usage

crHill(x, Xtilde, Ytilde, censored, h, 
       kernel = c("biweight", "normal", "uniform", "triangular", "epanechnikov"), 
       logk = FALSE, plot = FALSE, add = FALSE, main = "", ...)

Arguments

x

Value of the conditioning variable X to estimate the EVI at.

Xtilde

Vector of length n containing the censored sample of the conditioning variable X.

Ytilde

Vector of length n containing the censored sample of the variable Y.

censored

A logical vector of length n indicating if an observation is censored.

h

Bandwidth of the non-parametric estimator.

kernel

Kernel of the non-parametric estimator. One of "biweight" (default), "normal", "uniform", "triangular" and "epanechnikov".

logk

Logical indicating if the Hill-type estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "" (no title).

...

Additional arguments for the plot function, see plot for more details.

Details

This is a Hill-type estimator of the EVI of Y given X=x. The estimator uses the censored sample (\tilde{X}_i, \tilde{Y}_i), for i=1,\ldots,n, where X and Y are censored at the same time. We assume that Y and the censoring variable are conditionally independent given X.

See Section 4.4.3 in Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma

Vector of the corresponding Hill-type estimates.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
Y <- rpareto(200, shape=2)

# Censoring variable
C <- rpareto(200, shape=1)

# Observed (censored) sample of variable Y
Ytilde <- pmin(Y, C)

# Censoring indicator
censored <- (Y>C)

# Conditioning variable
X <- seq(1, 10, length.out=length(Y))

# Observed (censored) sample of conditioning variable
Xtilde <- X
Xtilde[censored] <- X[censored] - runif(sum(censored), 0, 1)


# Conditional Pareto QQ-plot
crParetoQQ(x=1, Xtilde=Xtilde, Ytilde=Ytilde, censored=censored, h=2)

# Plot Hill-type estimates
crHill(x=1, Xtilde, Ytilde, censored, h=2, plot=TRUE)

Conditional Pareto quantile plot for right censored data

Description

Conditional Pareto QQ-plot adapted for right censored data.

Usage

crParetoQQ(x, Xtilde, Ytilde, censored, h, 
           kernel = c("biweight", "normal", "uniform", "triangular", "epanechnikov"), 
           plot = TRUE, add = FALSE, main = "Pareto QQ-plot", type = "p", ...)

Arguments

x

Value of the conditioning variable X at which to make the conditional Pareto QQ-plot.

Xtilde

Vector of length n containing the censored sample of the conditioning variable X.

Ytilde

Vector of length n containing the censored sample of the variable Y.

censored

A logical vector of length n indicating if an observation is censored.

h

Bandwidth of the non-parametric estimator for the conditional survival function (crSurv).

kernel

Kernel of the non-parametric estimator for the conditional survival function (crSurv). One of "biweight" (default), "normal", "uniform", "triangular" and "epanechnikov".

plot

Logical indicating if the quantiles should be plotted in a Pareto QQ-plot, default is TRUE.

add

Logical indicating if the quantiles should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Pareto QQ-plot".

type

Type of the plot, default is "p" meaning points are plotted, see plot for more details.

...

Additional arguments for the plot function, see plot for more details.

Details

We construct a Pareto QQ-plot for Y conditional on X=x using the censored sample (\tilde{X}_i, \tilde{Y}_i), for i=1,\ldots,n, where X and Y are censored at the same time. We assume that Y and the censoring variable are conditionally independent given X.

The conditional Pareto QQ-plot adapted for right censoring is given by

( -\log(1-\hat{F}_{Y|X}(\tilde{Y}_{j,n}|x)), \log \tilde{Y}_{j,n} )

for j=1,\ldots,n-1, with \tilde{Y}_{i,n} the i-th order statistic of the censored data and \hat{F}_{Y|X}(y|x) the non-parametric estimator for the conditional CDF of Akritas and Van Keilegom (2003), see crSurv.

See Section 4.4.3 in Albrecher et al. (2017) for more details.

Value

A list with following components:

pqq.the

Vector of the theoretical quantiles, see Details.

pqq.emp

Vector of the empirical quantiles from the log-transformed Y data.

Author(s)

Tom Reynkens

References

Akritas, M.G. and Van Keilegom, I. (2003). "Estimation of Bivariate and Marginal Distributions With Censored Data." Journal of the Royal Statistical Society: Series B, 65, 457–471.

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
Y <- rpareto(200, shape=2)

# Censoring variable
C <- rpareto(200, shape=1)

# Observed (censored) sample of variable Y
Ytilde <- pmin(Y, C)

# Censoring indicator
censored <- (Y>C)

# Conditioning variable
X <- seq(1, 10, length.out=length(Y))

# Observed (censored) sample of conditioning variable
Xtilde <- X
Xtilde[censored] <- X[censored] - runif(sum(censored), 0, 1)


# Conditional Pareto QQ-plot
crParetoQQ(x=1, Xtilde=Xtilde, Ytilde=Ytilde, censored=censored, h=2)

# Plot Hill-type estimates
crHill(x=1, Xtilde, Ytilde, censored, h=2, plot=TRUE)

Non-parametric estimator of conditional survival function

Description

Non-parametric estimator of the conditional survival function of Y given X for censored data, see Akritas and Van Keilegom (2003).

Usage

crSurv(x, y, Xtilde, Ytilde, censored, h, 
       kernel = c("biweight", "normal", "uniform", "triangular", "epanechnikov"))

Arguments

x

The value of the conditioning variable X to evaluate the survival function at. x needs to be a single number or a vector with the same length as y.

y

The value(s) of the variable Y to evaluate the survival function at.

Xtilde

Vector of length n containing the censored sample of the conditioning variable X.

Ytilde

Vector of length n containing the censored sample of the variable Y.

censored

A logical vector of length n indicating if an observation is censored.

h

Bandwidth of the non-parametric estimator.

kernel

Kernel of the non-parametric estimator. One of "biweight" (default), "normal", "uniform", "triangular" and "epanechnikov".

Details

We estimate the conditional survival function

1-F_{Y|X}(y|x)

using the censored sample (\tilde{X}_i, \tilde{Y}_i), for i=1,\ldots,n, where X and Y are censored at the same time. We assume that Y and the censoring variable are conditionally independent given X.

The estimator is given by

1-\hat{F}_{Y|X}(y|x) = \prod_{\tilde{Y}_i \le y} (1-W_{n,i}(x;h_n)/(\sum_{j=1}^nW_{n,j}(x;h_n) I\{\tilde{Y}_j \ge \tilde{Y}_i\}))^{\Delta_i}

where \Delta_i=1 when (\tilde{X}_i, \tilde{Y}_i) is censored and 0 otherwise. The weights are given by

W_{n,i}(x;h_n) = K((x-\tilde{X}_i)/h_n)/\sum_{\Delta_j=1}K((x-\tilde{X}_j)/h_n)

when \Delta_i=1 and 0 otherwise.

See Section 4.4.3 in Albrecher et al. (2017) for more details.

Value

Estimates for 1-F_{Y|X}(y|x) as described above.

Author(s)

Tom Reynkens

References

Akritas, M.G. and Van Keilegom, I. (2003). "Estimation of Bivariate and Marginal Distributions With Censored Data." Journal of the Royal Statistical Society: Series B, 65, 457–471.

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

# Set seed
set.seed(29072016)

# Pareto random sample
Y <- rpareto(200, shape=2)

# Censoring variable
C <- rpareto(200, shape=1)

# Observed (censored) sample of variable Y
Ytilde <- pmin(Y, C)

# Censoring indicator
censored <- (Y>C)

# Conditioning variable
X <- seq(1, 10, length.out=length(Y))

# Observed (censored) sample of conditioning variable
Xtilde <- X
Xtilde[censored] <- X[censored] - runif(sum(censored), 0, 1)

# Plot estimates of the conditional survival function
x <- 5
y <- seq(0, 5, 1/100)
plot(y, crSurv(x, y, Xtilde=Xtilde, Ytilde=Ytilde, censored=censored, h=5), type="l", 
     xlab="y", ylab="Conditional survival function")

Generalised Hill estimator

Description

Computes the generalised Hill estimator for real extreme value indices as a function of the tail parameter k. Optionally, these estimates are plotted as a function of k.

Usage

genHill(data, gamma, logk = FALSE, plot = FALSE, add = FALSE, 
        main = "Generalised Hill estimates of the EVI", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI, typically Hill estimates are used.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Generalised Hill estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

The generalised Hill estimator is an estimator for the slope of the k last points of the generalised QQ-plot:

\hat{\gamma}^{GH}_{k,n}=1/k\sum_{j=1}^k \log UH_{j,n}- \log UH_{k+1,n}

with UH_{j,n}=X_{n-j,n}H_{j,n} the UH scores and H_{j,n} the Hill estimates. This is analogous to the (ordinary) Hill estimator which is the estimator of the slope of the k last points of the Pareto QQ-plot when using constrained least squares.

See Section 4.2.2 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma

Vector of the corresponding generalised Hill estimates.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Beirlant, J., Vynckier, P. and Teugels, J.L. (1996). "Excess Function and Estimation of the Extreme-value Index". Bernoulli, 2, 293–318.

Examples

data(soa)

# Hill estimator
H <- Hill(soa$size, plot=FALSE)
# Moment estimator
M <- Moment(soa$size)
# Generalised Hill estimator
gH <- genHill(soa$size, gamma=H$gamma)

# Plot estimates
plot(H$k[1:5000], M$gamma[1:5000], xlab="k", ylab=expression(gamma), type="l", ylim=c(0.2,0.5))
lines(H$k[1:5000], gH$gamma[1:5000], lty=2)
legend("topright", c("Moment", "Generalised Hill"), lty=1:2)

Generalised quantile plot

Description

Computes the empirical quantiles of the UH scores of a data vector and the theoretical quantiles of the standard exponential distribution. These quantiles are then plotted in a generalised QQ-plot with the theoretical quantiles on the x-axis and the empirical quantiles on the y-axis.

Usage

genQQ(data, gamma, plot = TRUE, main = "Generalised QQ-plot", ...)

generalizedQQ(data, gamma, plot = TRUE, main = "Generalised QQ-plot", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI, typically Hill estimates are used.

plot

Logical indicating if the quantiles should be plotted in a generalised QQ-plot, default is TRUE.

main

Title for the plot, default is "Generalised QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The generalizedQQ function is the same function but with a different name for compatibility with the old S-Plus code.

The UH scores are defined as UH_{j,n}=X_{n-j,n}H_{j,n} with H_{j,n} the Hill estimates, but other positive estimates for the EVI can also be used. The appropriate positive estimates for the EVI need to be specified in gamma. The generalised QQ-plot then plots

(\log((n+1)/(k+1)), \log(X_{n-k,n}H_{k,n}))

for k=1,\ldots,n-1.

See Section 4.2.2 of Albrecher et al. (2017) for more details.

Value

A list with following components:

gqq.the

Vector of the theoretical quantiles from a standard exponential distribution.

gqq.emp

Vector of the empirical quantiles from the logarithm of the UH scores.

Author(s)

Tom Reynkens based on S-Plus code from Yuri Goegebeur.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Beirlant, J., Vynckier, P. and Teugels, J.L. (1996). "Excess Function and Estimation of the Extreme-value Index." Bernoulli, 2, 293–318.

Examples

data(soa)

# Compute Hill estimator
H <- Hill(soa$size[1:5000], plot=FALSE)$gamma

# Generalised QQ-plot
genQQ(soa$size[1:5000], gamma=H)

Hill estimator for interval censored data

Description

Computes the Hill estimator for positive extreme value indices, adapted for interval censoring, as a function of the tail parameter k. Optionally, these estimates are plotted as a function of k.

Usage

icHill(L, U, censored, trunclower = 0, truncupper = Inf, 
       logk = FALSE, plot = TRUE, add = FALSE, main = "Hill estimates of the EVI", ...)

Arguments

L

Vector of length n with the lower boundaries of the intervals for interval censored data or the observed data for right censored data.

U

Vector of length n with the upper boundaries of the intervals.

censored

A logical vector of length n indicating if an observation is censored.

trunclower

Lower truncation point. Default is 0.

truncupper

Upper truncation point. Default is Inf (no upper truncation).

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates of \gamma should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of \gamma should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Hill estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

This estimator is given by

H^{TB}(x)=(\int_x^{\infty} (1-\hat{F}^{TB}(u))/u du)/(1-\hat{F}^{TB}(x)),

where \hat{F}^{TB} is the Turnbull estimator for the CDF. More specifically, we use the values x=\hat{Q}^{TB}(p) for p=1/(n+1), \ldots, (n-1)/(n+1) where \hat{Q}^{TB}(p) is the empirical quantile function corresponding to the Turnbull estimator. We then denote

H^{TB}_{k,n}=H^{TB}(x_{n-k,n})

with

x_{n-k,n}=\hat{Q}^{TB}((n-k)/(n+1))=\hat{Q}^{TB}(1-(k+1)/(n+1)).

Right censored data should be entered as L=l and U=truncupper, and left censored data should be entered as L=trunclower and U=u.

If the interval package is installed, the icfit function is used to compute the Turnbull estimator. Otherwise, survfit.formula from survival is used.

Use Hill for non-censored data or cHill for right censored data.

See Section 4.3 in Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma

Vector of the corresponding Hill estimates.

X

Vector of thresholds x_{n-k,n} used when estimating \gamma.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X,Y)

# Censoring indicator
censored <- (X>Y)

# Right boundary
U <- Z
U[censored] <- Inf

# Hill estimator adapted for interval censoring
icHill(Z, U, censored, ylim=c(0,1))

# Hill estimator adapted for right censoring
cHill(Z, censored, lty=2, add=TRUE)

# True value of gamma
abline(h=1/2, lty=3, col="blue")

# Legend
legend("topright", c("icHill", "cHill"), lty=1:2)

Pareto quantile plot for interval censored data

Description

Pareto QQ-plot adapted for interval censored data using the Turnbull estimator.

Usage

icParetoQQ(L, U = L, censored, trunclower = 0, truncupper = Inf, 
           plot = TRUE, main = "Pareto QQ-plot", ...)

Arguments

L

Vector of length n with the lower boundaries of the intervals for interval censored data or the observed data for right censored data.

U

Vector of length n with the upper boundaries of the intervals.

censored

A logical vector of length n indicating if an observation is censored.

trunclower

Lower truncation point. Default is 0.

truncupper

Upper truncation point. Default is Inf (no upper truncation).

plot

Logical indicating if the quantiles should be plotted in a Pareto QQ-plot, default is TRUE.

main

Title for the plot, default is "Pareto QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The Pareto QQ-plot adapted for interval censoring is given by

( -\log(1-F^{TB}(x_{j,n})), \log x_{j,n} )

for j=1,\ldots,n-1, where \hat{F}^{TB} is the Turnbull estimator for the CDF and x_{i,n}=\hat{Q}^{TB}(i/(n+1)) with \hat{Q}^{TB}(p) the empirical quantile function corresponding to the Turnbull estimator.

Right censored data should be entered as L=l and U=truncupper, and left censored data should be entered as L=trunclower and U=u.

If the interval package is installed, the icfit function is used to compute the Turnbull estimator. Otherwise, survfit.formula from survival is used.

Use ParetoQQ for non-censored data or cParetoQQ for right censored data.

See Section 4.3 in Albrecher et al. (2017) for more details.

Value

A list with following components:

pqq.the

Vector of the theoretical quantiles, see Details.

pqq.emp

Vector of the empirical quantiles from the log-transformed data.

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

# Pareto random sample
X <- rpareto(500, shape=2)

# Censoring variable
Y <- rpareto(500, shape=1)

# Observed sample
Z <- pmin(X,Y)

# Censoring indicator
censored <- (X>Y)

# Right boundary
U <- Z
U[censored] <- Inf

# Pareto QQ-plot adapted for interval censoring
icParetoQQ(Z, U, censored)

# Pareto QQ-plot adapted for right censoring
cParetoQQ(Z, censored)

Norwegian fire insurance data

Description

Fire insurance claims for a Norwegian insurance company for the period 1972 to 1992 as studied in Beirlant et al. (1996).
A priority of 500 units was in force.

Usage

data("norwegianfire")

Format

A data frame with 9181 observations on the following 2 variables:

size: Size of fire insurance claim (in 1000 NOK).
year: Year of claim occurence (expressed as yy instead of 19yy).

Source

Beirlant, J., Teugels, J. L. and Vynckier, P. (1996). Practical Analysis of Extreme Values, Leuven University Press.

References

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(norwegianfire)

# Exponential QQ-plot for Norwegian Fire Insurance data for claims in 1976.
ExpQQ(norwegianfire$size[norwegianfire$year==76])

# Pareto QQ-plot for Norwegian Fire Insurance data for claims in 1976.
ParetoQQ(norwegianfire$size[norwegianfire$year==76])

Classical estimators for the CDF

Description

Compute approximations of the CDF using the normal approximation, normal-power approximation, shifted Gamma approximation or normal approximation to the shifted Gamma distribution.

Usage

pClas(x, mean = 0, variance = 1, skewness = NULL, 
      method = c("normal", "normal-power", "shifted Gamma", "shifted Gamma normal"), 
      lower.tail = TRUE, log.p = FALSE)

Arguments

x

Vector of points to approximate the CDF in.

mean

Mean of the distribution, default is 0.

variance

Variance of the distribution, default is 1.

skewness

Skewness coefficient of the distribution, this argument is not used for the normal approximation. Default is NULL meaning no skewness coefficient is provided.

method

Approximation method to use, one of "normal", "normal-power", "shifted Gamma" or "shifted Gamma normal". Default is "normal".

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

The normal approximation for the CDF of the r.v. X is defined as

F_X(x) \approx \Phi((x-\mu)/\sigma)

where \mu and \sigma^2 are the mean and variance of X, respectively.
This approximation can be improved when the skewness parameter

\nu=E((X-\mu)^3)/\sigma^3

is available. The normal-power approximation of the CDF is then given by

F_X(x) \approx \Phi(\sqrt{9/\nu^2 + 6z/\nu+1}-3/\nu)

for z=(x-\mu)/\sigma\ge 1 and 9/\nu^2 + 6z/\nu+1\ge 0.
The shifted Gamma approximation uses the approximation

X \approx \Gamma(4/\nu^2, 2/(\nu\times\sigma)) + \mu -2\sigma/\nu.

Here, we need that \nu>0.
The normal approximation to the shifted Gamma distribution approximates the CDF of X as

F_X(x) \approx \Phi(\sqrt{16/\nu^2 + 8z/\nu}-\sqrt{16/\nu^2-1})

for z=(x-\mu)/\sigma\ge 1. We need again that \nu>0.

See Section 6.2 of Albrecher et al. (2017) for more details.

Value

Vector of estimates for the probabilities F(x)=P(X\le x).

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Examples

# Chi-squared sample
X <- rchisq(1000, 2)

x <- seq(0, 10, 0.01)

# Classical approximations
p1 <- pClas(x, mean(X), var(X))
p2 <- pClas(x, mean(X), var(X), mean((X-mean(X))^3)/sd(X)^3, method="normal-power")
p3 <- pClas(x, mean(X), var(X), mean((X-mean(X))^3)/sd(X)^3, method="shifted Gamma")
p4 <- pClas(x, mean(X), var(X), mean((X-mean(X))^3)/sd(X)^3, method="shifted Gamma normal")

# True probabilities
p <- pchisq(x, 2)


# Plot true and estimated probabilities
plot(x, p, type="l", ylab="F(x)", ylim=c(0,1), col="red")
lines(x, p1, lty=2)
lines(x, p2, lty=3, col="green")
lines(x, p3, lty=4)
lines(x, p4, lty=5, col="blue")

legend("bottomright", c("True CDF", "normal approximation", "normal-power approximation",
                        "shifted Gamma approximation", "shifted Gamma normal approximation"), 
      lty=1:5, col=c("red", "black", "green", "black", "blue"), lwd=2)

Edgeworth approximation

Description

Edgeworth approximation of the CDF using the first four moments.

Usage

pEdge(x, moments = c(0, 1, 0, 3), raw = TRUE, lower.tail = TRUE, log.p = FALSE)

Arguments

x

Vector of points to approximate the CDF in.

moments

The first four raw moments if raw=TRUE. By default the first four raw moments of the standard normal distribution are used. When raw=FALSE, the mean \mu=E(X), variance \sigma^2=E((X-\mu)^2), skewness (third standardised moment, \nu=E((X-\mu)^3)/\sigma^3) and kurtosis (fourth standardised moment, k=E((X-\mu)^4)/\sigma^4).

raw

When TRUE (default), the first four raw moments are provided in moments. Otherwise, the mean, variance, skewness and kurtosis are provided in moments.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

Denote the standard normal PDF and CDF respectively by \phi and \Phi. Let \mu be the first moment, \sigma^2=E((X-\mu)^2) the variance, \mu_3=E((X-\mu)^3) the third central moment and \mu_4=E((X-\mu)^4) the fourth central moment of the random variable X. The corresponding cumulants are given by \kappa_1=\mu, \kappa_2=\sigma^2, \kappa_3=\mu_3 and \kappa_4=\mu_4-3\sigma^4.

Now consider the random variable Z=(X-\mu)/\sigma, which has cumulants 0, 1, \nu=\kappa_3/\sigma^3 and k=\kappa_4/\sigma^4=\mu_4/\sigma^4-3.

The Edgeworth approximation for the CDF of X (F(x)) is given by

\hat{F}_{E}(x) = \Phi(z) + \phi(z) (-\nu/6 h_2(z)- (3k\times h_3(z)+\gamma_3^2h_5(z))/72)

with h_2(z)=z^2-1, h_3(z)=z^3-3z, h_5(z)=z^5-10z^3+15z and z=(x-\mu)/\sigma.

See Section 6.2 of Albrecher et al. (2017) for more details.

Value

Vector of estimates for the probabilities F(x)=P(X\le x).

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Cheah, P.K., Fraser, D.A.S. and Reid, N. (1993). "Some Alternatives to Edgeworth." The Canadian Journal of Statistics, 21(2), 131–138.

Examples

# Chi-squared sample
X <- rchisq(1000, 2)


x <- seq(0, 10, 0.01)

# Empirical moments
moments = c(mean(X), mean(X^2), mean(X^3), mean(X^4))

# Gram-Charlier approximation
p1 <- pGC(x, moments)

# Edgeworth approximation
p2 <- pEdge(x, moments)

# Normal approximation
p3 <- pClas(x, mean(X), var(X))

# True probabilities
p <- pchisq(x, 2)


# Plot true and estimated probabilities
plot(x, p, type="l", ylab="F(x)", ylim=c(0,1), col="red")
lines(x, p1, lty=2)
lines(x, p2, lty=3)
lines(x, p3, lty=4, col="blue")

legend("bottomright", c("True CDF", "GC approximation", 
                        "Edgeworth approximation", "Normal approximation"), 
       col=c("red", "black", "black", "blue"), lty=1:4, lwd=2)

Gram-Charlier approximation

Description

Gram-Charlier approximation of the CDF using the first four moments.

Usage

pGC(x, moments = c(0, 1, 0, 3), raw = TRUE, lower.tail = TRUE, log.p = FALSE)

Arguments

x

Vector of points to approximate the CDF in.

moments

raw

When TRUE (default), the first four raw moments are provided in moments. Otherwise, the mean, variance, skewness and kurtosis are provided in moments.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

Now consider the random variable Z=(X-\mu)/\sigma, which has cumulants 0, 1, \nu=\kappa_3/\sigma^3 and k=\kappa_4/\sigma^4=\mu_4/\sigma^4-3.

The Gram-Charlier approximation for the CDF of X (F(x)) is given by

\hat{F}_{GC}(x) = \Phi(z) + \phi(z) (-\nu/6 h_2(z)- k/24h_3(z))

with h_2(z)=z^2-1, h_3(z)=z^3-3z and z=(x-\mu)/\sigma.

See Section 6.2 of Albrecher et al. (2017) for more details.

Value

Vector of estimates for the probabilities F(x)=P(X\le x).

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Cheah, P.K., Fraser, D.A.S. and Reid, N. (1993). "Some Alternatives to Edgeworth." The Canadian Journal of Statistics, 21(2), 131–138.

Examples

# Chi-squared sample
X <- rchisq(1000, 2)


x <- seq(0, 10, 0.01)

# Empirical moments
moments = c(mean(X), mean(X^2), mean(X^3), mean(X^4))

# Gram-Charlier approximation
p1 <- pGC(x, moments)

# Edgeworth approximation
p2 <- pEdge(x, moments)

# Normal approximation
p3 <- pClas(x, mean(X), var(X))

# True probabilities
p <- pchisq(x, 2)


# Plot true and estimated probabilities
plot(x, p, type="l", ylab="F(x)", ylim=c(0,1), col="red")
lines(x, p1, lty=2)
lines(x, p2, lty=3)
lines(x, p3, lty=4, col="blue")

legend("bottomright", c("True CDF", "GC approximation", 
                        "Edgeworth approximation", "Normal approximation"), 
       col=c("red", "black", "black", "blue"), lty=1:4, lwd=2)

Secura dataset

Description

Automobile claims from 1988 to 2001, gathered from several European insurance companies, exceeding 1 200 000 Euro. Note that the data were, among others, corrected for inflation.

Usage

data("secura")

Format

A data frame with 371 observations on the following 2 variables:

year: Year of claim occurence.
size: Size of automobile insurance claim (in EUR).

References

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(secura)

# Exponential QQ-plot of Secura data
ExpQQ(secura$size)

# Pareto QQ-plot of Secura data
ParetoQQ(secura$size)

# Mean excess plot of Secura data (function of k)
MeanExcess(secura$size, k=TRUE)

# Mean excess plot of Secura data (function of order statistics)
MeanExcess(secura$size, k=FALSE)

SOA Group Medical Insurance Large Claims Database

Description

The SOA Group Medical Insurance Large Claims Database records, among others, all the claim amounts exceeding 25,000 USD in the year 1991.

Usage

data("soa")

Format

A data frame with 75789 observations on the following variable:

size: Claim size (in USD).

Source

Grazier, K. L. and G'Sell Associates (1997). Group Medical Insurance Large Claims Database Collection and Analysis. SOA Monograph M-HB97-1, Society of Actuaries, Schaumburg.

Society of Actuaries, https://www.soa.org/resources/experience-studies/2000-2004/91-92-group-medical-claims/.

References

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

data(soa)

# Histogram of log-claim amount
hist(log(soa$size),breaks=seq(10,16,0.2),xlab="log(Claim size)")

# Exponential QQ-plot of claim amount
ExpQQ(soa$size)

# Mean excess plot of claim amount (function of k)
MeanExcess(soa$size, k=TRUE)

# Mean excess plot of claim amount (function of order statistics)
MeanExcess(soa$size, k=FALSE)

Non-parametric estimators of the STDF

Description

Non-parametric estimators of the stable tail dependence function (STDF): \hat{l}_k(x) and \tilde{l}_k(x).

Usage

stdf(x, k, X, alpha = 0.5)

stdf2(x, k, X)

Arguments

x

A d-dimensional point to estimate the STDF in.

k

Value of the tail index k.

X

A data matrix of dimensions n by d with observations in the rows.

alpha

The parameter \alpha of the estimator \hat{l}_k(x) (stdf), default is 0.5. This argument is not used in stdf2.

Details

The stable tail dependence function in x can be estimated by

\hat{l}_k(x) = 1/k \sum_{i=1}^n 1_{\{\exists j\in\{1,\ldots, d\}: \hat{F}_j(X_{i,j})>1-k/n x_j\}}

with

\hat{F}_j(X_{i,j})=(R_{i,j}-\alpha)/n

where R_{i,j} is the rank of X_{i,j} among the n observations in the j-th dimension:

R_{i,j}=\sum_{m=1}^n 1_{\{X_{m,j}\le X_{i,j}\}}.

This estimator is implemented in stdf.

The second estimator is given by

\tilde{l}_k(x) = 1/k \sum_{i=1}^n 1_{\{X_{i,1}\ge X^{(1)}_{n-[kx_1]+1,n} or \ldots or X_{i,d}\ge X^{(d)}_{n-[kx_d]+1,n}\}}

where X_{i,n}^{(j)} is the i-th smallest observation in the j-th dimension. This estimator is implemented in stdf2.

See Section 4.5 of Beirlant et al. (2016) for more details.

Value

stdf returns the estimate \hat{l}_k(x) and stdf2 returns the estimate \tilde{l}_k(x).

Author(s)

Tom Reynkens

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant J., Goegebeur Y., Segers, J. and Teugels, J. (2004). Statistics of Extremes: Theory and Applications, Wiley Series in Probability, Wiley, Chichester.

Examples

# Generate data matrix
X <- cbind(rpareto(100,2), rpareto(100,3))

# Tail index
k <- 20

# Point to evaluate the STDF in
x <- c(2,3)

# First estimate
stdf(x, k, X)

# Second estimate
stdf2(x, k, X)

The truncated Burr distribution

Description

Density, distribution function, quantile function and random generation for the truncated Burr distribution (type XII).

Usage

dtburr(x, alpha, rho, eta = 1, endpoint = Inf, log = FALSE)
ptburr(x, alpha, rho, eta = 1, endpoint = Inf, lower.tail = TRUE, log.p = FALSE)
qtburr(p, alpha, rho, eta = 1, endpoint = Inf, lower.tail = TRUE, log.p = FALSE)
rtburr(n, alpha, rho, eta = 1, endpoint = Inf)

Arguments

x

Vector of quantiles.

p

Vector of probabilities.

n

Number of observations.

alpha

The \alpha parameter of the truncated Burr distribution, a strictly positive number.

rho

The \rho parameter of the truncated Burr distribution, a strictly negative number.

eta

The \eta parameter of the truncated Burr distribution, a strictly positive number. The default value is 1.

endpoint

Endpoint of the truncated Burr distribution. The default value is Inf for which the truncated Burr distribution corresponds to the ordinary Burr distribution.

log

Logical indicating if the densities are given as \log(f), default is FALSE.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

The Cumulative Distribution Function (CDF) of the truncated Burr distribution is equal to F_T(x) = F(x) / F(T) for x \le T where F is the CDF of the ordinary Burr distribution and T is the endpoint (truncation point) of the truncated Burr distribution.

Value

dtburr gives the density function evaluated in x, ptburr the CDF evaluated in x and qtburr the quantile function evaluated in p. The length of the result is equal to the length of x or p.

rtburr returns a random sample of length n.

Author(s)

Tom Reynkens.

Examples

# Plot of the PDF
x <- seq(0, 10, 0.01)
plot(x, dtburr(x, alpha=2, rho=-1, endpoint=9), xlab="x", ylab="PDF", type="l")

# Plot of the CDF
x <- seq(0, 10, 0.01)
plot(x, ptburr(x, alpha=2, rho=-1, endpoint=9), xlab="x", ylab="CDF", type="l")

The truncated exponential distribution

Description

Density, distribution function, quantile function and random generation for the truncated exponential distribution.

Usage

dtexp(x, rate = 1, endpoint = Inf, log = FALSE)
ptexp(x, rate = 1, endpoint = Inf, lower.tail = TRUE, log.p = FALSE)
qtexp(p, rate = 1, endpoint = Inf, lower.tail = TRUE, log.p = FALSE)
rtexp(n, rate = 1, endpoint = Inf)

Arguments

x

Vector of quantiles.

p

Vector of probabilities.

n

Number of observations.

rate

The rate parameter for the exponential distribution, default is 1.

endpoint

Endpoint of the truncated exponential distribution. The default value is Inf for which the truncated exponential distribution corresponds to the ordinary exponential distribution.

log

Logical indicating if the densities are given as \log(f), default is FALSE.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

The Cumulative Distribution Function (CDF) of the truncated exponential distribution is equal to F_T(x) = F(x) / F(T) for x \le T where F is the CDF of the ordinary exponential distribution and T is the endpoint (truncation point) of the truncated exponential distribution.

Value

dtexp gives the density function evaluated in x, ptexp the CDF evaluated in x and qtexp the quantile function evaluated in p. The length of the result is equal to the length of x or p.

rtexp returns a random sample of length n.

Author(s)

Tom Reynkens.

Examples

# Plot of the PDF
x <- seq(0, 10, 0.01)
plot(x, dtexp(x, rate = 2, endpoint=5), xlab="x", ylab="PDF", type="l")

# Plot of the CDF
x <- seq(0, 10, 0.01)
plot(x, ptexp(x, rate = 2, endpoint=5), xlab="x", ylab="CDF", type="l")

The truncated Frechet distribution

Description

Density, distribution function, quantile function and random generation for the truncated Fréchet distribution.

Usage

dtfrechet(x, shape, loc = 0, scale = 1, endpoint = Inf, log = FALSE)
ptfrechet(x, shape, loc = 0, scale = 1, endpoint = Inf, lower.tail = TRUE, log.p = FALSE)
qtfrechet(p, shape, loc = 0, scale = 1, endpoint = Inf, lower.tail = TRUE, log.p = FALSE)
rtfrechet(n, shape, loc = 0, scale = 1, endpoint = Inf)

Arguments

x

Vector of quantiles.

p

Vector of probabilities.

n

Number of observations.

shape

Shape parameter of the Fréchet distribution.

loc

Location parameter of the Fréchet distribution, default is 0.

scale

Scale parameter of the Fréchet distribution, default is 1.

endpoint

Endpoint of the truncated Fréchet distribution. The default value is Inf for which the truncated Fréchet distribution corresponds to the ordinary Fréchet distribution.

log

Logical indicating if the densities are given as \log(f), default is FALSE.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

The Cumulative Distribution Function (CDF) of the truncated Fréchet distribution is equal to F_T(x) = F(x) / F(T) for x \le T where F is the CDF of an ordinary Fréchet distribution and T is the endpoint (truncation point) of the truncated Fréchet distribution.

Value

dtfrechet gives the density function evaluated in x, ptfrechet the CDF evaluated in x and qtfrechet the quantile function evaluated in p. The length of the result is equal to the length of x or p.

rtfrechet returns a random sample of length n.

Author(s)

Tom Reynkens.

Examples

# Plot of the PDF
x <- seq(1, 10, 0.01)
plot(x, dtfrechet(x, shape=2, endpoint=5), xlab="x", ylab="PDF", type="l")

# Plot of the CDF
x <- seq(1, 10, 0.01)
plot(x, ptfrechet(x, shape=2, endpoint=5), xlab="x", ylab="CDF", type="l")

The truncated generalised Pareto distribution

Description

Density, distribution function, quantile function and random generation for the truncated Generalised Pareto Distribution (GPD).

Usage

dtgpd(x, gamma, mu = 0, sigma, endpoint = Inf, log = FALSE)
ptgpd(x, gamma, mu = 0, sigma, endpoint = Inf, lower.tail = TRUE, log.p = FALSE)
qtgpd(p, gamma, mu = 0, sigma, endpoint = Inf, lower.tail = TRUE, log.p = FALSE)
rtgpd(n, gamma, mu = 0, sigma, endpoint = Inf)

Arguments

x

Vector of quantiles.

p

Vector of probabilities.

n

Number of observations.

gamma

The \gamma parameter of the GPD, a real number.

mu

The \mu parameter of the GPD, a strictly positive number. Default is 0.

sigma

The \sigma parameter of the GPD, a strictly positive number.

endpoint

Endpoint of the truncated GPD. The default value is Inf for which the truncated GPD corresponds to the ordinary GPD.

log

Logical indicating if the densities are given as \log(f), default is FALSE.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

The Cumulative Distribution Function (CDF) of the truncated GPD is equal to F_T(x) = F(x) / F(T) for x \le T where F is the CDF of the ordinary GPD and T is the endpoint (truncation point) of the truncated GPD.

Value

dtgpd gives the density function evaluated in x, ptgpd the CDF evaluated in x and qtgpd the quantile function evaluated in p. The length of the result is equal to the length of x or p.

rtgpd returns a random sample of length n.

Author(s)

Tom Reynkens

Examples

# Plot of the PDF
x <- seq(0, 10, 0.01)
plot(x, dtgpd(x, gamma=1/2, sigma=5, endpoint=8), xlab="x", ylab="PDF", type="l")

# Plot of the CDF
x <- seq(0, 10, 0.01)
plot(x, ptgpd(x, gamma=1/2, sigma=5, endpoint=8), xlab="x", ylab="CDF", type="l")

The truncated Pareto distribution

Description

Density, distribution function, quantile function and random generation for the truncated Pareto distribution.

Usage

dtpareto(x, shape, scale = 1, endpoint = Inf, log = FALSE)
ptpareto(x, shape, scale = 1, endpoint = Inf, lower.tail = TRUE, log.p = FALSE)
qtpareto(p, shape, scale = 1, endpoint = Inf, lower.tail = TRUE, log.p = FALSE)
rtpareto(n, shape, scale = 1, endpoint = Inf)

Arguments

x

Vector of quantiles.

p

Vector of probabilities.

n

Number of observations.

shape

The shape parameter of the truncated Pareto distribution, a strictly positive number.

scale

The scale parameter of the truncated Pareto distribution, a strictly positive number. Its default value is 1.

endpoint

Endpoint of the truncated Pareto distribution. The default value is Inf for which the truncated Pareto distribution corresponds to the ordinary Pareto distribution.

log

Logical indicating if the densities are given as \log(f), default is FALSE.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

The Cumulative Distribution Function (CDF) of the truncated Pareto distribution is equal to F_T(x) = F(x) / F(T) for x \le T where F is the CDF of an ordinary Pareto distribution and T is the endpoint (truncation point) of the truncated Pareto distribution.

Value

dtpareto gives the density function evaluated in x, ptpareto the CDF evaluated in x and qtpareto the quantile function evaluated in p. The length of the result is equal to the length of x or p.

rtpareto returns a random sample of length n.

Author(s)

Tom Reynkens

Examples

# Plot of the PDF
x = seq(1,10,0.01)
plot(x, dtpareto(x, shape=2, endpoint=10), xlab="x", ylab="PDF", type="l")

# Plot of the CDF
x = seq(1,10,0.01)
plot(x, ptpareto(x, shape=2, endpoint=10), xlab="x", ylab="CDF", type="l")

The truncated Weibull distribution

Description

Density, distribution function, quantile function and random generation for the truncated Weibull distribution.

Usage

dtweibull(x, shape, scale = 1, endpoint = Inf, log = FALSE)
ptweibull(x, shape, scale = 1, endpoint = Inf, lower.tail = TRUE, log.p = FALSE)
qtweibull(p, shape, scale = 1, endpoint = Inf, lower.tail = TRUE, log.p = FALSE)
rtweibull(n, shape, scale = 1, endpoint = Inf)

Arguments

x

Vector of quantiles.

p

Vector of probabilities.

n

Number of observations.

shape

The shape parameter of the Weibull distribution, a strictly positive number.

scale

The scale parameter of the Weibull distribution, a strictly positive number, default is 1.

endpoint

Endpoint of the truncated Weibull distribution. The default value is Inf for which the truncated Weibull distribution corresponds to the ordinary Weibull distribution.

log

Logical indicating if the densities are given as \log(f), default is FALSE.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

The Cumulative Distribution Function (CDF) of the truncated Weibull distribution is equal to F_T(x) = F(x) / F(T) for x \le T where F is the CDF of the ordinary Weibull distribution and T is the endpoint (truncation point) of the truncated Weibull distribution.

Value

dtweibull gives the density function evaluated in x, ptweibull the CDF evaluated in x and qtweibull the quantile function evaluated in p. The length of the result is equal to the length of x or p.

rtweibull returns a random sample of length n.

Author(s)

Tom Reynkens.

Examples

# Plot of the PDF
x <- seq(0, 10, 0.01)
plot(x, dtweibull(x, shape=2, scale=0.5, endpoint=1), xlab="x", ylab="PDF", type="l")

# Plot of the CDF
x <- seq(0, 10, 0.01)
plot(x, ptweibull(x, shape=2, scale=0.5, endpoint=1), xlab="x", ylab="CDF", type="l")

The truncated log-normal distribution

Description

Density, distribution function, quantile function and random generation for the truncated log-normal distribution.

Usage

dtlnorm(x, meanlog = 0, sdlog = 1, endpoint = Inf, log = FALSE)
ptlnorm(x, meanlog = 0, sdlog = 1, endpoint = Inf, lower.tail = TRUE, log.p = FALSE)
qtlnorm(p, meanlog = 0, sdlog = 1, endpoint = Inf, lower.tail = TRUE, log.p = FALSE)
rtlnorm(n, meanlog = 0, sdlog = 1, endpoint = Inf)

Arguments

x

Vector of quantiles.

p

Vector of probabilities.

n

Number of observations.

meanlog

Mean of the distribution on the log scale, default is 0.

sdlog

Standard deviation of the distribution on the log scale, default is 1.

endpoint

Endpoint of the truncated log-normal distribution. The default value is Inf for which the truncated log-normal distribution corresponds to the ordinary log-normal distribution.

log

Logical indicating if the densities are given as \log(f), default is FALSE.

lower.tail

Logical indicating if the probabilities are of the form P(X\le x) (TRUE) or P(X>x) (FALSE). Default is TRUE.

log.p

Logical indicating if the probabilities are given as \log(p), default is FALSE.

Details

The Cumulative Distribution Function (CDF) of the truncated log-normal distribution is equal to F_T(x) = F(x) / F(T) for x \le T where F is the CDF of the ordinary log-normal distribution and T is the endpoint (truncation point) of the truncated log-normal distribution.

Value

dtlnorm gives the density function evaluated in x, ptlnorm the CDF evaluated in x and qtlnorm the quantile function evaluated in p. The length of the result is equal to the length of x or p.

rtlnorm returns a random sample of length n.

Author(s)

Tom Reynkens.

Examples

# Plot of the PDF
x <- seq(0, 10, 0.01)
plot(x, dtlnorm(x, endpoint=9), xlab="x", ylab="PDF", type="l")

# Plot of the CDF
x <- seq(0, 10, 0.01)
plot(x, ptlnorm(x, endpoint=9), xlab="x", ylab="CDF", type="l")

Truncation odds

Description

Estimates of truncation odds of the truncated probability mass under the untruncated distribution using truncated Hill.

Usage

trDT(data, r = 1, gamma, plot = FALSE, add = FALSE, main = "Estimates of DT", ...)

Arguments

data

Vector of n observations.

r

Trimming parameter, default is 1 (no trimming).

gamma

Vector of n-1 estimates for the EVI obtained from trHill.

plot

Logical indicating if the estimates of D_T should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of D_T should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of DT".

...

Additional arguments for the plot function, see plot for more details.

Details

The truncation odds is defined as

D_T=(1-F(T))/F(T)

with T the upper truncation point and F the CDF of the untruncated distribution (e.g. Pareto distribution).

We estimate this truncation odds as

\hat{D}_T=\max\{ (k+1)/(n+1) ( R_{r,k,n}^{1/\gamma_k} - 1/(k+1) ) / (1-R_{r,k,n}^{1/\gamma_k}), 0\}

with R_{r,k,n} = X_{n-k,n} / X_{n-r+1,n}.

See Beirlant et al. (2016) or Section 4.2.3 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

DT

Vector of the corresponding estimates for the truncation odds D_T.

Author(s)

Tom Reynkens based on R code of Dries Cornilly.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant, J., Fraga Alves, M.I. and Gomes, M.I. (2016). "Tail fitting for Truncated and Non-truncated Pareto-type Distributions." Extremes, 19, 429–462.

Examples

# Sample from truncated Pareto distribution.
# truncated at 99% quantile
shape <- 2
X <- rtpareto(n=1000, shape=shape, endpoint=qpareto(0.99, shape=shape))

# Truncated Hill estimator
trh <- trHill(X, plot=TRUE, ylim=c(0,2))

# Truncation odds
dt <- trDT(X, gamma=trh$gamma, plot=TRUE, ylim=c(0,0.05))

Truncation odds

Description

Estimates of truncation odds of the truncated probability mass under the untruncated distribution using truncated MLE.

Usage

trDTMLE(data, gamma, tau, plot = FALSE, add = FALSE, main = "Estimates of DT", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI obtained from trMLE.

tau

Vector of n-1 estimates for the \tau obtained from trMLE.

plot

Logical indicating if the estimates of D_T should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of D_T should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of DT".

...

Additional arguments for the plot function, see plot for more details.

Details

The truncation odds is defined as

D_T=(1-F(T))/F(T)

with T the upper truncation point and F the CDF of the untruncated distribution (e.g. GPD).

We estimate this truncation odds as

\hat{D}_T=\max\{ (k+1)/(n+1) ( (1+\hat{\tau}_k E_{1,k})^{-1/\hat{\xi}_k} - 1/(k+1) ) / (1-(1+\hat{\tau}_k E_{1,k})^{-1/\hat{\xi}_k}), 0\}

with E_{1,k} = X_{n,n}-X_{n-k,n}.

See Beirlant et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

DT

Vector of the corresponding estimates for the truncation odds D_T.

Author(s)

Tom Reynkens.

References

Beirlant, J., Fraga Alves, M. I. and Reynkens, T. (2017). "Fitting Tails Affected by Truncation". Electronic Journal of Statistics, 11(1), 2026–2065.

Examples

# Sample from GPD truncated at 99% quantile
gamma <- 0.5
sigma <- 1.5
X <- rtgpd(n=250, gamma=gamma, sigma=sigma, endpoint=qgpd(0.99, gamma=gamma, sigma=sigma))

# Truncated ML estimator
trmle <- trMLE(X, plot=TRUE, ylim=c(0,2))

# Truncation odds
dtmle <- trDTMLE(X, gamma=trmle$gamma, tau=trmle$tau, plot=TRUE, ylim=c(0,0.05))

Estimator of endpoint

Description

Estimator of endpoint using truncated Hill estimates.

Usage

trEndpoint(data, r = 1, gamma, plot = FALSE, add = FALSE,
           main = "Estimates of endpoint", ...)

Arguments

data

Vector of n observations.

r

Trimming parameter, default is 1 (no trimming).

gamma

Vector of n-1 estimates for the EVI obtained from trHill.

plot

Logical indicating if the estimates of T should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of T should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of endpoint".

...

Additional arguments for the plot function, see plot for more details.

Details

The endpoint is estimated as

\hat{T}_{k,n} = \max\{X_{n-k,n} ( ((X_{n-k,n}/X_{n,n})^{1/\hat{\gamma}_k} - 1/(k+1)) / (1-1/(k+1)))^{-\hat{\gamma}_k}, X_{n,n}\}

with \hat{\gamma}_k the Hill estimates adapted for truncation.

See Beirlant et al. (2016) or Section 4.2.3 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

Tk

Vector of the corresponding estimates for the endpoint T.

Author(s)

Tom Reynkens based on R code of Dries Cornilly.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant, J., Fraga Alves, M.I. and Gomes, M.I. (2016). "Tail fitting for Truncated and Non-truncated Pareto-type Distributions." Extremes, 19, 429–462.

Examples

# Sample from truncated Pareto distribution.
# truncated at 99% quantile
shape <- 2
X <- rtpareto(n=1000, shape=shape, endpoint=qpareto(0.99, shape=shape))

# Truncated Hill estimator
trh <- trHill(X, plot=TRUE, ylim=c(0,2))

# Endpoint
trEndpoint(X, gamma=trh$gamma, plot=TRUE, ylim=c(8,12))
abline(h=qpareto(0.99, shape=shape), lty=2)

Estimator of endpoint

Description

Estimator of endpoint using truncated ML estimates.

Usage

trEndpointMLE(data, gamma, tau, plot = FALSE, add = FALSE, 
              main = "Estimates of endpoint", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI obtained from trMLE.

tau

Vector of n-1 estimates for the \tau obtained from trMLE.

plot

Logical indicating if the estimates of T should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of T should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of endpoint".

...

Additional arguments for the plot function, see plot for more details.

Details

The endpoint is estimated as

\hat{T}_{k} = X_{n-k,n} + 1/\hat{\tau}_k[( (1-1/k)/((1+ \hat{\tau}_k (X_{n,n}-X_{n-k,n}))^{-1/\hat{\xi}_k}-1/k))^{\hat{\xi}_k} -1]

with \hat{\gamma}_k and \hat{\tau}_k the truncated ML estimates for \gamma and \tau.

See Beirlant et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

Tk

Vector of the corresponding estimates for the endpoint T.

Author(s)

Tom Reynkens.

References

Beirlant, J., Fraga Alves, M. I. and Reynkens, T. (2017). "Fitting Tails Affected by Truncation". Electronic Journal of Statistics, 11(1), 2026–2065.

Examples

# Sample from GPD truncated at 99% quantile
gamma <- 0.5
sigma <- 1.5
X <- rtgpd(n=250, gamma=gamma, sigma=sigma, endpoint=qgpd(0.99, gamma=gamma, sigma=sigma))

# Truncated ML estimator
trmle <- trMLE(X, plot=TRUE, ylim=c(0,2))

# Endpoint
trEndpointMLE(X, gamma=trmle$gamma, tau=trmle$tau, plot=TRUE, ylim=c(0,50))
abline(h=qgpd(0.99, gamma=gamma, sigma=sigma), lty=2)

Hill estimator for upper truncated data

Description

Computes the Hill estimator for positive extreme value indices, adapted for upper truncation, as a function of the tail parameter k (Aban et al. 2006; Beirlant et al., 2016). Optionally, these estimates are plotted as a function of k.

Usage

trHill(data, r = 1, tol = 1e-08, maxiter = 100, logk = FALSE,
       plot = FALSE, add = FALSE, main = "Estimates of the EVI", ...)

Arguments

data

Vector of n observations.

r

Trimming parameter, default is 1 (no trimming).

tol

Numerical tolerance for stopping criterion used in Newton-Raphson iterations, default is 1e-08.

maxiter

Maximum number of Newton-Raphson iterations, default is 100.

logk

Logical indicating if the estimates are plotted as a function of \log(k) (logk=TRUE) or as a function of k. Default is FALSE.

plot

Logical indicating if the estimates of \gamma should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of \gamma should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

The truncated Hill estimator is the MLE for \gamma under the truncated Pareto distribution.

To estimate the EVI using the truncated Hill estimator an equation needs to be solved. Beirlant et al. (2016) propose to use Newton-Raphson iterations to solve this equation. We take the trimmed Hill estimates as starting values for this algorithm. The trimmed Hill estimator is defined as

H_{r,k,n} = 1/(k-r+1) \sum_{j=r}^k \log(X_{n-j+1,n})-\log(X_{n-k,n})

for 1 \le r < k < n and is a basic extension of the Hill estimator for upper truncated data (the ordinary Hill estimator is obtained for r=1).

The equation that needs to be solved is

H_{r,k,n} = \gamma + R_{r,k,n}^{1/\gamma} \log(R_{r,k,n}) / (1-R_{r,k,n}^{1/\gamma})

with R_{r,k,n} = X_{n-k,n} / X_{n-r+1,n}.

See Beirlant et al. (2016) or Section 4.2.3 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma

Vector of the corresponding estimates for \gamma.

H

Vector of corresponding trimmed Hill estimates.

Author(s)

Tom Reynkens based on R code of Dries Cornilly.

References

Aban, I.B., Meerschaert, M.M. and Panorska, A.K. (2006). "Parameter Estimation for the Truncated Pareto Distribution." Journal of the American Statistical Association, 101, 270–277.

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant, J., Fraga Alves, M.I. and Gomes, M.I. (2016). "Tail fitting for Truncated and Non-truncated Pareto-type Distributions." Extremes, 19, 429–462.

Examples

# Sample from truncated Pareto distribution.
# truncated at 99% quantile
shape <- 2
X <- rtpareto(n=1000, shape=shape, endpoint=qpareto(0.99, shape=shape))

# Truncated Hill estimator
trh <- trHill(X, plot=TRUE, ylim=c(0,2))

MLE estimator for upper truncated data

Description

Computes the ML estimator for the extreme value index, adapted for upper truncation, as a function of the tail parameter k (Beirlant et al., 2017). Optionally, these estimates are plotted as a function of k.

Usage

trMLE(data, start = c(1, 1), eps = 10^(-10), 
      plot = TRUE, add = FALSE, main = "Estimates for EVI", ...)

Arguments

data

Vector of n observations.

start

Starting values for \gamma and \tau for the numerical optimisation.

eps

Numerical tolerance, see Details. By default it is equal to 10^(-10).

plot

Logical indicating if the estimates of \gamma should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates of \gamma should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of the EVI".

...

Additional arguments for the plot function, see plot for more details.

Details

We compute the MLE for the \gamma and \sigma parameters of the truncated GPD. For numerical reasons, we compute the MLE for \tau=\gamma/\sigma and transform this estimate to \sigma.

The log-likelihood is given by

(k-1) \ln \tau - (k-1) \ln \xi- ( 1 + 1/\xi)\sum_{j=2}^k \ln (1+\tau E_{j,k}) -(k-1) \ln( 1- (1+ \tau E_{1,k})^{-1/\xi})

with E_{j,k} = X_{n-j+1,n}-X_{n-k,n}.

In order to meet the restrictions \sigma=\xi/\tau>0 and 1+\tau E_{j,k}>0 for j=1,\ldots,k, we require the estimates of these quantities to be larger than the numerical tolerance value eps.

See Beirlant et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

gamma

Vector of the corresponding estimates for \gamma.

tau

Vector of the corresponding estimates for \tau.

sigma

Vector of the corresponding estimates for \sigma.

conv

Convergence indicator of optim.

Author(s)

Tom Reynkens.

References

Beirlant, J., Fraga Alves, M. I. and Reynkens, T. (2017). "Fitting Tails Affected by Truncation". Electronic Journal of Statistics, 11(1), 2026–2065.

Examples

# Sample from GPD truncated at 99% quantile
gamma <- 0.5
sigma <- 1.5
X <- rtgpd(n=250, gamma=gamma, sigma=sigma, endpoint=qgpd(0.99, gamma=gamma, sigma=sigma))

# Truncated ML estimator
trmle <- trMLE(X, plot=TRUE, ylim=c(0,2))

Truncated Pareto quantile plot

Description

Extension of the Pareto QQ-plot as described in Beirlant et al. (2016).

Usage

trParetoQQ(data, r = 1, DT, kstar = NULL, plot = TRUE, main = "TPa QQ-plot", ...)

Arguments

data

Vector of n observations.

r

Trimming parameter, default is 1 (no trimming).

DT

Vector of n-1 estimates for the truncation odds D_T obtained from trDT.

kstar

Value for k used to construct the plot. When NULL (default), a value will be chosen by maximising the correlation between the empirical and theoretical quantiles (see Details).

plot

Logical indicating if the quantiles should be plotted in a Pareto QQ-plot, default is TRUE.

main

Title for the plot, default is "TPa QQ-plot".

...

Additional arguments for the plot function, see plot for more details.

Details

The Pareto QQ-plot for truncated data plots

(-\log(\hat{D}_{T,r,k^*,n}+j/(n+1)), \log(X_{n-j+1,n}) )

for j=1,\ldots,n.

The value for k^* can be be given by the user or can be determined automatically. In the latter case, we use the k^* that maximises the absolute value of the correlation between -\log(\hat{D}_{T,r,k^*,n}+j/(n+1)) and \log(X_{n-j+1,n}) for j=1,\ldots,k and k^*>10.

When taking D_T=0, one obtains the ordinary Pareto QQ-plot.

Note that the definition here differs slightly from the one in Beirlant et al. (2016). We plot the empirical and theoretical quantiles the other way around and therefore have to add a minus (before the log).

See Beirlant et al. (2016) for more details.

Value

A list with following components:

pqq.the

Vector of theoretical quantiles -\log(\hat{D}_{T,r,k^*,n}+j/(n+1)), see Details.

pqq.emp

Vector of the empirical quantiles from the log-transformed data.

kstar

Optimal value for k or input argument kstar, see Details.

DTstar

Estimate of D_T corresponding to kstar.

Author(s)

Tom Reynkens.

References

Beirlant, J., Fraga Alves, M.I. and Gomes, M.I. (2016). "Tail fitting for Truncated and Non-truncated Pareto-type Distributions." Extremes, 19, 429–462.

Examples

# Endpoint of truncated Pareto distribution
endpoint <- qpareto(0.99, shape=2)

# Generate sample from truncated Pareto distribution
X <- rtpareto(1000, shape=2, endpoint=endpoint)

# Ordinary Pareto QQ-plot
ParetoQQ(X)

# Truncated Hill estimates
gamma <- trHill(X)$gamma

# Estimates for truncation odds
dt <- trDT(X, gamma=gamma)$DT

# Truncated Pareto QQ-plot
trParetoQQ(X, DT=dt)

Estimator of small exceedance probabilities using truncated Hill

Description

Computes estimates of a small exceedance probability P(X>q) using the estimates for the EVI obtained from the Hill estimator adapted for upper truncation.

Usage

trProb(data, r = 1, gamma, q, warnings = TRUE, plot = FALSE, add = FALSE, 
       main = "Estimates of small exceedance probability", ...)

Arguments

data

Vector of n observations.

r

Trimming parameter, default is 1 (no trimming).

gamma

Vector of n-1 estimates for the EVI obtained from trHill.

q

The used large quantile (we estimate P(X>q) for q large).

warnings

Logical indicating if warnings are shown, default is TRUE.

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of small exceedance probability".

...

Additional arguments for the plot function, see plot for more details.

Details

The probability is estimated as

\hat{P}(X>q)=(k+1)/(n+1) ( (q/X_{n-k,n})^{-1/\gamma_k} - R_{r,k,n}^{1/\hat{\gamma}_k} ) / (1- R_{r,k,n}^{1/\hat{\gamma}_k})

with R_{r,k,n} = X_{n-k,n} / X_{n-r+1,n} and \hat{\gamma}_k the Hill estimates adapted for truncation.

See Beirlant et al. (2016) or Section 4.2.3 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

P

Vector of the corresponding probability estimates.

q

The used large quantile.

Author(s)

Tom Reynkens based on R code of Dries Cornilly.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant, J., Fraga Alves, M.I. and Gomes, M.I. (2016). "Tail fitting for Truncated and Non-truncated Pareto-type Distributions." Extremes, 19, 429–462.

Examples

# Sample from truncated Pareto distribution.
# truncated at 99% quantile
shape <- 2
X <- rtpareto(n=1000, shape=shape, endpoint=qpareto(0.99, shape=shape))

# Truncated Hill estimator
trh <- trHill(X, plot=TRUE, ylim=c(0,2))

# Small probability
trProb(X, gamma=trh$gamma, q=8, plot=TRUE)

Estimator of small exceedance probabilities using truncated MLE

Description

Computes estimates of a small exceedance probability P(X>q) using the estimates for the EVI obtained from the ML estimator adapted for upper truncation.

Usage

trProbMLE(data, gamma, tau, DT, q, plot = FALSE, add = FALSE, 
          main = "Estimates of small exceedance probability", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI obtained from trMLE.

tau

Vector of n-1 estimates for the \tau obtained from trMLE.

DT

Vector of n-1 estimates for the truncation odds obtained from trDTMLE.

q

The used large quantile (we estimate P(X>q) for q large).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of small exceedance probability".

...

Additional arguments for the plot function, see plot for more details.

Details

The probability is estimated as

\hat{p}_{T,k}(q) = (1+ \hat{D}_{T,k}) (k+1)/(n+1) (1+\hat\tau _k(q-X_{n-k,n}))^{-1/\hat{\xi}_k} -\hat{D}_{T,k}

with \hat{\gamma}_k and \hat{\tau}_k the ML estimates adapted for truncation and \hat{D}_T the estimates for the truncation odds.

See Beirlant et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

P

Vector of the corresponding probability estimates.

q

The used large quantile.

Author(s)

Tom Reynkens.

References

Beirlant, J., Fraga Alves, M. I. and Reynkens, T. (2017). "Fitting Tails Affected by Truncation". Electronic Journal of Statistics, 11(1), 2026–2065.

Examples

# Sample from GPD truncated at 99% quantile
gamma <- 0.5
sigma <- 1.5
X <- rtgpd(n=250, gamma=gamma, sigma=sigma, endpoint=qgpd(0.99, gamma=gamma, sigma=sigma))

# Truncated ML estimator
trmle <- trMLE(X, plot=TRUE, ylim=c(0,2))

# Truncation odds
dtmle <- trDTMLE(X, gamma=trmle$gamma, tau=trmle$tau, plot=FALSE)

# Small exceedance probability
trProbMLE(X, gamma=trmle$gamma, tau=trmle$tau, DT=dtmle$DT, plot=TRUE, q=26, ylim=c(0,0.005))

Estimator of large quantiles using truncated Hill

Description

trQuant computes estimates of large quantiles Q(1-p) of the truncated distribution using the estimates for the EVI obtained from the Hill estimator adapted for upper truncation. trQuantW computes estimates of large quantiles Q_W(1-p) of the parent distribution W which is unobserved.

Usage

trQuant(data, r = 1, rough = TRUE, gamma, DT, p, plot = FALSE, add = FALSE, 
        main = "Estimates of extreme quantile", ...)
        
trQuantW(data, gamma, DT, p, plot = FALSE, add = FALSE, 
         main = "Estimates of extreme quantile", ...)

Arguments

data

Vector of n observations (truncated data).

r

Trimming parameter, default is 1 (no trimming).

rough

Logical indicating if rough truncation is present, default is TRUE.

gamma

Vector of n-1 estimates for the EVI obtained from trHill.

DT

Vector of n-1 estimates for the truncation odds obtained from trDT.

p

The exceedance probability of the quantile (we estimate Q(1-p) for p small).

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of extreme quantile".

...

Additional arguments for the plot function, see plot for more details.

Details

We observe the truncated r.v. X=_d W | W<T where T is the truncation point and W the untruncated r.v.

Under rough truncation, the quantiles for X are estimated using

\hat{Q}(1-p)=X_{n-k,n} ((\hat{D}_T + (k+1)/(n+1))/(\hat{D}_T+p))^{\hat{\gamma}_k},

with \hat{\gamma}_k the Hill estimates adapted for truncation and \hat{D}_T the estimates for the truncation odds.

Under light truncation, the quantiles are estimated using the Weissman estimator with the Hill estimates replaced by the truncated Hill estimates:

\hat{Q}(1-p)=X_{n-k,n} ((k+1)/((n+1)p))^{\hat{\gamma}_k}.

To decide between light and rough truncation, one can use the test implemented in trTest.

The quantiles for W are estimated using

\hat{Q}_W(1-p)=X_{n-k,n} ( (\hat{D}_T + (k+1)/(n+1)) / (p(1+\hat{D}_T))^{\hat{\gamma}_k}.

See Beirlant et al. (2016) or Section 4.2.3 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

Q

Vector of the corresponding quantile estimates.

p

The used exceedance probability.

Author(s)

Tom Reynkens based on R code of Dries Cornilly.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant, J., Fraga Alves, M.I. and Gomes, M.I. (2016). "Tail fitting for Truncated and Non-truncated Pareto-type Distributions." Extremes, 19, 429–462.

Examples

# Sample from truncated Pareto distribution.
# truncated at 99% quantile
shape <- 2
X <- rtpareto(n=1000, shape=shape, endpoint=qpareto(0.99, shape=shape))

# Truncated Hill estimator
trh <- trHill(X, plot=TRUE, ylim=c(0,2))

# Truncation odds
dt <- trDT(X, gamma=trh$gamma, plot=TRUE, ylim=c(0,2))

# Large quantile
p <- 10^(-5)
# Truncated distribution
trQuant(X, gamma=trh$gamma, DT=dt$DT, p=p, plot=TRUE)
# Original distribution
trQuantW(X, gamma=trh$gamma, DT=dt$DT, p=p, plot=TRUE, ylim=c(0,1000))

Estimator of large quantiles using truncated MLE

Description

This function computes estimates of large quantiles Q(1-p) of the truncated distribution using the ML estimates adapted for upper truncation. Moreover, estimates of large quantiles Q_Y(1-p) of the original distribution Y, which is unobserved, are also computed.

Usage

trQuantMLE(data, gamma, tau, DT, p, Y = FALSE, plot = FALSE, add = FALSE, 
           main = "Estimates of extreme quantile", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI obtained from trMLE.

tau

Vector of n-1 estimates for the \tau obtained from trMLE.

DT

Vector of n-1 estimates for the truncation odds obtained from trDTMLE.

p

The exceedance probability of the quantile (we estimate Q(1-p) or Q_Y(1-p) for p small).

Y

Logical indicating if quantiles from the truncated distribution (Q(1-p)) or from the parent distribution (Q_Y(1-p)) are computed. Default is TRUE.

plot

Logical indicating if the estimates should be plotted as a function of k, default is FALSE.

add

Logical indicating if the estimates should be added to an existing plot, default is FALSE.

main

Title for the plot, default is "Estimates of extreme quantile".

...

Additional arguments for the plot function, see plot for more details.

Details

We observe the truncated r.v. X=_d Y | Y<T where T is the truncation point and Y the untruncated r.v.

Under rough truncation, the quantiles for X are estimated using

\hat{Q}_{T,k}(1-p) = X_{n-k,n} +1/(\hat{\tau}_k)([(\hat{D}_{T,k} + (k+1)/(n+1))/(\hat{D}_{T,k}+p)]^{\hat{\xi}_k} -1),

with \hat{\gamma}_k and \hat{\tau}_k the ML estimates adapted for truncation and \hat{D}_T the estimates for the truncation odds.

The quantiles for Y are estimated using

\hat{Q}_{Y,k}(1-p)=X_{n-k,n} +1/(\hat{\tau}_k)([(\hat{D}_{T,k} + (k+1)/(n+1))/(p(\hat{D}_{T,k}+1))]^{\hat{\xi}_k} -1).

See Beirlant et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

Q

Vector of the corresponding quantile estimates.

p

The used exceedance probability.

Author(s)

Tom Reynkens.

References

Beirlant, J., Fraga Alves, M. I. and Reynkens, T. (2017). "Fitting Tails Affected by Truncation". Electronic Journal of Statistics, 11(1), 2026–2065.

Examples

# Sample from GPD truncated at 99% quantile
gamma <- 0.5
sigma <- 1.5
X <- rtgpd(n=250, gamma=gamma, sigma=sigma, endpoint=qgpd(0.99, gamma=gamma, sigma=sigma))

# Truncated ML estimator
trmle <- trMLE(X, plot=TRUE, ylim=c(0,2))

# Truncation odds
dtmle <- trDTMLE(X, gamma=trmle$gamma, tau=trmle$tau, plot=FALSE)

# Large quantile of X
trQuantMLE(X, gamma=trmle$gamma, tau=trmle$tau, DT=dtmle$DT, plot=TRUE, p=0.005, ylim=c(15,30))

# Large quantile of Y
trQuantMLE(X, gamma=trmle$gamma, tau=trmle$tau, DT=dtmle$DT, plot=TRUE, p=0.005, ylim=c(0,300), 
          Y=TRUE)

Test for truncated Pareto-type tails

Description

Test between non-truncated Pareto-type tails (light truncation) and truncated Pareto-type tails (rough truncation).

Usage

trTest(data, alpha = 0.05, plot = TRUE, main = "Test for truncation", ...)

Arguments

data

Vector of n observations.

alpha

The used significance level, default is 0.05.

plot

Logical indicating if the P-values should be plotted as a function of k, default is FALSE.

main

Title for the plot, default is "Test for truncation".

...

Additional arguments for the plot function, see plot for more details.

Details

We want to test H_0: X has non-truncated Pareto tails vs. H_1: X has truncated Pareto tails. Let

E_{k,n}(\gamma) = 1/k \sum_{j=1}^k (X_{n-k,n}/X_{n-j+1,n})^{1/\gamma},

with X_{i,n} the i-th order statistic. The test statistic is then

T_{k,n}=\sqrt{12k} (E_{k,n}(H_{k,n})-1/2) / (1-E_{k,n}(H_{k,n}))

which is asymptotically standard normally distributed. We reject H_0 on level \alpha if

T_{k,n}<-z_{\alpha}

where z_{\alpha} is the 100(1-\alpha)\% quantile of a standard normal distribution. The corresponding P-value is thus given by

\Phi(T_{k,n})

with \Phi the CDF of a standard normal distribution.

See Beirlant et al. (2016) or Section 4.2.3 of Albrecher et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

testVal

Corresponding test values.

critVal

Critical value used for the test, i.e. qnorm(1-alpha/2).

Pval

Corresponding P-values.

Reject

Logical vector indicating if the null hypothesis is rejected for a certain value of k.

Author(s)

Tom Reynkens.

References

Albrecher, H., Beirlant, J. and Teugels, J. (2017). Reinsurance: Actuarial and Statistical Aspects, Wiley, Chichester.

Beirlant, J., Fraga Alves, M.I. and Gomes, M.I. (2016). "Tail fitting for Truncated and Non-truncated Pareto-type Distributions." Extremes, 19, 429–462.

Examples

# Sample from truncated Pareto distribution.
# truncated at 95% quantile
shape <- 2
X <- rtpareto(n=1000, shape=shape, endpoint=qpareto(0.95, shape=shape))

# Test for truncation
trTest(X)


# Sample from truncated Pareto distribution.
# truncated at 99% quantile
shape <- 2
X <- rtpareto(n=1000, shape=shape, endpoint=qpareto(0.99, shape=shape))

# Test for truncation
trTest(X)

Test for truncated GPD tails

Description

Test between non-truncated GPD tails (light truncation) and truncated GPD tails (rough truncation).

Usage

trTestMLE(data, gamma, tau, alpha = 0.05, plot = TRUE, main = "Test for truncation", ...)

Arguments

data

Vector of n observations.

gamma

Vector of n-1 estimates for the EVI obtained from trMLE.

tau

Vector of n-1 estimates for the \tau obtained from trMLE.

alpha

The used significance level, default is 0.05.

plot

Logical indicating if the P-values should be plotted as a function of k, default is FALSE.

main

Title for the plot, default is "Test for truncation".

...

Additional arguments for the plot function, see plot for more details.

Details

We want to test H_0: X has non-truncated GPD tails vs. H_1: X has truncated GPD tails. Let \hat{\gamma}_k and \hat{\tau}_k be the truncated MLE estimates for \gamma and \tau. The test statistic is then

T_{k,n}=k (1+\hat{\tau} (X_{n,n}-X_{-k,n}) )^{-1/\hat{\xi}_k}

which is asymptotically standard exponentially distributed. We reject H_0 on level \alpha if T_{k,n}>\ln (1/{\alpha)}. The corresponding P-value is given by \exp(-T_{k,n}).

See Beirlant et al. (2017) for more details.

Value

A list with following components:

k

Vector of the values of the tail parameter k.

testVal

Corresponding test values.

critVal

Critical value used for the test, i.e. \ln(1/\alpha).

Pval

Corresponding P-values.

Reject

Logical vector indicating if the null hypothesis is rejected for a certain value of k.

Author(s)

Tom Reynkens.

References

Beirlant, J., Fraga Alves, M. I. and Reynkens, T. (2017). "Fitting Tails Affected by Truncation". Electronic Journal of Statistics, 11(1), 2026–2065.

Examples

# Sample from GPD truncated at 99% quantile
gamma <- 0.5
sigma <- 1.5
X <- rtgpd(n=250, gamma=gamma, sigma=sigma, endpoint=qgpd(0.99, gamma=gamma, sigma=sigma))

# Truncated ML estimator
trmle <- trMLE(X, plot=TRUE, ylim=c(0,2))

# Test for truncation
trTestMLE(X, gamma=trmle$gamma, tau=trmle$tau)

The Burr distribution

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples

Conditional Tail Expectation

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples

EPD estimator

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples

Fit EPD using MLE

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples

EVT fit

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples

Estimates for excess-loss premiums using EPD estimates

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples

Estimates for excess-loss premiums using GPD-MLE estimates

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples

Estimates for excess-loss premiums using a Pareto model

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples