Help for package R2ROC

Title:

AUC Statistics

Version:

1.0.1

Description:

Area under the receiver operating characteristic curves (AUC) statistic for significance test. Variance and covariance of AUC values used to assess the 95% Confidence interval (CI) and p-value of the AUC difference for both nested and non-nested model.

License:

GPL (≥ 3)

URL:

https://github.com/mommy003/R2ROC

Encoding:

UTF-8

RoxygenNote:

7.1.2

Depends:

R (≥ 2.10)

LazyData:

true

NeedsCompilation:

Packaged:

2023-11-26 02:30:46 UTC; alh-admmdm

Author:

Hong Lee [aut, cph], Moksedul Momin [aut, cre, cph]

Maintainer:

Moksedul Momin <cvasu.momin@gmail.com>

Repository:

CRAN

Date/Publication:

2023-11-26 03:10:02 UTC

auc_diff function

Description

This function estimates var(AUC(y~x[,v1]) - AUC(y~x[,v2])) where AUC is the Area Under ROC curve of the model, y is N by 1 matrix having the dependent variable, and x is N by M matrix having M explanatory variables. v1 or v2 indicates the ith column in the x matrix (v1 or v2 can be multiple values between 1 - M, see Arguments below)

Usage

auc_diff(dat, v1, v2, nv, kv)

Arguments

dat

N by (M+1) matrix having variables in the order of cbind(y,x)

v1

This can be set as v1=c(1) or v1=c(1,2)

v2

This can be set as v2=c(2), v2=c(3), v2=c(1,3) or v2=c(3,4)

nv

Sample size

kv

Population prevalence

Value

This function will estimate significant difference between two PRS (either dependent or independent and joint or single). To get the test statistics for the difference between AUC(y~x[,v1]) and AUC(y~x[,v2]) (here we define AUC1=AUC(y~x[,v1])) and AUC2=AUC(y~x[,v2]))). The outputs are listed as follows.

mean_diff

AUC differences between AUC1 and AUC2

var

Variances of AUC differences

upper_diff

Upper value of the differences

lower_diff

Upper value of the differences

p

Two tailed P-value for significant difference between AUC1 and AUC2

p_one_tail

One tailed P-value for significant difference

heller_p

P-value based on Heller's test for significant difference

heller_upper_diff

Upper limit of 95% CI for the difference basedon Heller's test

heller_lower_diff

Lower limit of 95% CI for the difference basedon Heller's test

Examples

#To get the test statistics for the difference between AUC(y=x[,1]) 
#and AUC(y=x[,2])
dat=dat1 #(this example embedded within the package)
nv=length(dat$V1)
kv=sum(dat$V1)/length(dat$V1)# pop. prevalence estimated from data
#R2ROC also allows users to estimate AUC using pre-adjusted phenotype
#In that case, users need to specify kv
#eg. kv=0.10 for dat2 (dat2 embedded within the package) 
v1=c(1)
v2=c(2)
output=auc_diff(dat,v1,v2,nv,kv)

#R2ROC output
#output$mean_diff (mean difference of AUC1 and AUC2)
#0.1756046

#output$var (variance of AUC difference)
#9.274356e-05

#output$upper_diff (upper limit of 95% CI for difference)
#0.1944801

#output$lower_diff (lower limit of 95% CI for difference)
#0.1567292

#output$p (two-tailed P-value for the differences is
#significantly different from zero)
#2.747031e-74

#output$p_one_tail (one-tailed P-value for the differences
#is significantly different from zero)
#1.373515e-74


#To get the test statistics for the difference between
#AUC(y=x[,1]+x[,2]) and AUC(y=x[,2])
dat=dat1 #(this example embedded within the package)
nv=length(dat$V1)
kv=sum(dat$V1)/length(dat$V1)# pop. prevalence estimated from data
#R2ROC also allows users to estimate AUC using pre-adjusted phenotype
#In that case, users need to specify kv
#eg. kv=0.10 for dat2 (dat2 embedded within the package) 
v1=c(1,2)
v2=c(2)
output=auc_diff(dat,v1,v2,nv,kv)

#R2ROC output
#output$mean_diff (mean difference of AUC1 and AUC2)
#0.1793682

#output$var (variance of AUC difference)
#0.0001190366

#output$upper_diff (upper limit of 95% CI for difference)
#0.2007526

#output$lower_diff (lower limit of 95% CI for difference)
#0.1579839

#output$p (two-tailed P-value for the differences is
#significantly different from zero)
#9.87014e-61

#output$p_one_tail (one-tailed P-value for the differences
#is significantly different from zero)
#4.93507e-61

#output$heller_p (two-tailed P-value based on Hellers test
#for the differences is significantly different from zero)
#4.2085e-237

#output$heller_upper_diff (upper limit of 95% CI for
#difference based on Hellers test)
#0.2013899

#output$heller_lower_diff (lower limit of 95% CI for
#difference based on Hellers test)
#0.1586212

auc_trf function

Description

This function transforms the observed scale predictive ability (R2) and its standard error (SE) to AUC with its SE

Usage

auc_trf(R2, se, kv)

Arguments

R2

R2 or coefficient of determination on the observed scale

se

Standard error of R2

kv

Population prevalence

Value

This function will transform the observed R2 and its s.e between to AUC. Output from the command is the lists of outcomes.

auc

Transformed AUC

se

SE of transformed AUC

References

Wray, Naomi R., et al. "The genetic interpretation of area under the ROC curve in genomic profiling." PLoS genetics 6.2 (2010): e1000864.

Lee, Sang Hong, et al. "A better coefficient of determination for genetic profile analysis." Genetic epidemiology 36.3 (2012): 214-224.

Examples

#To get the transformed AUC

output=auc_trf(0.04, 0.002, 0.05)
output

#output$auc (transformed AUC)
#0.7522887

#output$se (se of transformed AUC)
#0.005948364

auc_var function

Description

This function estimates var(AUC(y~x[,v1])) where AUC is the Area Under ROC curve of the model, y is N by 1 matrix having the dependent variable, and x is N by M matrix having M explanatory variables. v1 indicates the ith column in the x matrix (v1 can be multiple values between 1 - M, see Arguments below)

Usage

auc_var(dat, v1, nv, kv)

Arguments

dat

N by (M+1) matrix having variables in the order of cbind(y,x)

v1

This can be set as v1=c(1), v1=c(1,2) or possibly with more values

nv

Sample size

kv

Population prevalence

Value

This function will test the null hypothesis for AUC. To get the test statistics for AUC(y~x[,v1]). The outputs are listed as follows.

auc

AUC

var

Variance of AUC

upper_auc

Upper limit of 95% CI for AUC

lower_auc

Lower limit of 95% CI for AUC

p

two tailed p-value

p_one_tail

one tailed p-value

Examples

#To get the AUC for AUC(y=x[,1]) 

dat=dat1 #(this example embedded within the package)
nv=length(dat$V1)
kv=sum(dat$V1)/length(dat$V1)# pop. prevalence estimated from data
#R2ROC also allows users to estimate AUC using pre-adjusted phenotype
#In that case, users need to specify kv
#eg. kv=0.10 for dat2 (dat2 embedded within the package) 
v1=c(1)
output=auc_var(dat,v1,nv,kv)

#R2ROC output
#output$auc (AUC)
#0.7390354

#output$var (variance of AUC)
#7.193337e-05

#output$upper_auc (upper limit of 95% CI for AUC)
#0.7556589

#output$lower_auc (lower limit of 95% CI for AUC)
#0.7224119

#output$p
#9.28062e-175 (two tailed P-value for the AUC is significantly
#different from 0.5)

#output$p_one_tail (one tailed P-value for the AUC is significantly 
#different from 0.5)
#4.64031e-175

Raw phenotypes and 2 sets of discovery PGSs

Description

A dataset containing raw phenotypes and multiple PGSs estimated two independent discovery population

Usage

dat1

Format

A data frame with 10000 rows and 3 variables:

V1: Phenotype, raw case-caontrol data
V2: PGS1, for discovery population 1
V3: PGS2, for discovery population 2

Pre-adjusted phenotypes and 2 sets of discovery PGSs

Description

A dataset containing preadjusted phenotypes and multiple PGSs estimated two independent discovery population

Usage

dat2

Format

A data frame with 10000 rows and 3 variables:

V1: Phenotype, preadjustde case-caontrol data
V2: PGS1, for discovery population 1
V3: PGS2, for discovery population 2

olkin_auc1 function

Description

olkin_auc1 function

Usage

olkin_auc1(omat, nv, kv)

Arguments

omat

3 by 3 matrix having the correlation coefficients between y, x1 and x2, i.e. omat=cor(dat) where dat is N by 3 matrix having variables in the order of cbind (y,x1,x2)

nv

Sample size

kv

Population prevalance

Value

This function will be used as source code

olkin_auc12 function

Description

olkin_auc12 function

Usage

olkin_auc12(omat, nv, kv)

Arguments

omat

3 by 3 matrix having the correlation coefficients between y, x1 and x2, i.e. omat=cor(dat) where dat is N by 3 matrix having variables in the order of cbind (y,x1,x2)

nv

Sample size

kv

Population prevalance

Value

This function will be used as source code

olkin_auc12_1 function

Description

olkin_auc12_1 function

Usage

olkin_auc12_1(omat, nv, kv)

Arguments

omat

3 by 3 matrix having the correlation coefficients between y, x1 and x2, i.e. omat=cor(dat) where dat is N by 3 matrix having variables in the order of cbind (y,x1,x2)

nv

Sample size

kv

Population prevalance

Value

This function will be used as source code

olkin_auc12_13 function

Description

olkin_auc12_13 function

Usage

olkin_auc12_13(omat, nv, kv)

Arguments

omat

3 by 3 matrix having the correlation coefficients between y, x1 and x2, i.e. omat=cor(dat) where dat is N by 3 matrix having variables in the order of cbind (y,x1,x2)

nv

Sample size

kv

Population prevalance

Value

This function will be used as source code

olkin_auc12_3 function

Description

olkin_auc12_3 function

Usage

olkin_auc12_3(omat, nv, kv)

Arguments

omat

3 by 3 matrix having the correlation coefficients between y, x1 and x2, i.e. omat=cor(dat) where dat is N by 3 matrix having variables in the order of cbind (y,x1,x2)

nv

Sample size

kv

Population prevalance

Value

This function will be used as source code

olkin_auc12_34 function

Description

olkin_auc12_34 function

Usage

olkin_auc12_34(omat, nv, kv)

Arguments

omat

3 by 3 matrix having the correlation coefficients between y, x1 and x2, i.e. omat=cor(dat) where dat is N by 3 matrix having variables in the order of cbind (y,x1,x2)

nv

Sample size

kv

Population prevalance

Value

This function will be used as source code

olkin_auc1_2 function

Description

olkin_auc1_2 function

Usage

olkin_auc1_2(omat, nv, kv)

Arguments

omat

3 by 3 matrix having the correlation coefficients between y, x1 and x2, i.e. omat=cor(dat) where dat is N by 3 matrix having variables in the order of cbind (y,x1,x2)

nv

Sample size

kv

Population prevalance

Value

This function will be used as source code