Title: | Marginal Proportional Hazards Mixture Cure Models with Generalized Estimating Equations |
---|---|
Description: | Features the marginal parametric and semi-parametric proportional hazards mixture cure models for analyzing clustered survival data with a possible cure fraction. A reference is Yi Niu and Yingwei Peng (2014) <doi:10.1016/j.jmva.2013.09.003>. |
Authors: | Yi Niu [aut, cre], Hui Song [ctb], Xiaoguang Wang [ctb], Yingwei Peng [aut, ctb] |
Maintainer: | Yi Niu <[email protected]> |
License: | GPL (>= 2) |
Version: | 1.0-6 |
Built: | 2025-03-26 03:47:20 UTC |
Source: | https://github.com/cran/geecure |
A package that uses generalized estimating equations (GEE) approach to estimate marginal proportional hazards mixture cure (PHMC) models. This package implements recently developed inference procedures for the marginal PHMC models with the expectation-solution (ES) algorithm. The package includes the parametric PHMC model with Weibull baseline distribution in the latency part and the semiparametric PHMC model for fitting the multivariate survival data with a cure fraction.
Package: geecure
Type: Package
Version: 1.0-6
Date: 2018-03-28
License: GPL(>=2)
LazyData: TRUE
Yi Niu <[email protected]>, Hui Song, Xiaoguang Wang, Yingwei Peng
Liang, K.-Y. and Zeger, S. L. (1986) Longitudinal data analysis using generalized linear models. Biometrika, 73: 13-22.
Niu, Y. and Peng, Y. (2013) A semiparametric marginal mixture cure model for clustered survival data. Statistics in Medicine, 32: 2364-2373.
Niu, Y. and Peng, Y. (2014) Marginal regression analysis of clustered failure time data with a cure fraction. Journal of Multivariate Analysis, 123: 129-142.
Niu, Y., Song, L., Liu, Y, and Peng, Y. (2018) Modeling clustered long-term survivors using marginal mixture cure model. Biometrical Journal, doi: 10.1002/bjmj.201700114.
Peng, Y., Taylor, J. M. G, and Yu, B. (2007) A marginal regression model for multivariate failure time data with a surviving fraction. Lifetime Data Analysis, 13: 351-369
Rosen, O., Jiang, W., and Tanner, M. A. (2000) Mixtures of marginal models. Biometrika, 87: 391-404.
Yu, B. and Peng, Y. (2008) Mixture cure models for multivariate survival data. Computational Statistics & Data Analysis, 52: 1524-1532.
The estimated baseline survival function based on the product-limit estimator (Kalbfleisch and Prentice, 2002), which is uesd to update the E-step in the ES algorithm.
basesurv(Time, Status, X, beta, Lambda, w, model)
basesurv(Time, Status, X, beta, Lambda, w, model)
Time |
right censored data which is the follow up time. |
Status |
the censoring indicator, normally 1 = event of interest happens, and 0 = censoring. |
X |
a matrix of covariates corresponding to the latency part. |
beta |
initial beta from the GEE for the latency part. |
Lambda |
initial cumulative baseline hazard function from the GEE with independence working corrlation matrix. |
w |
conditional probability of a patient remains uncured at the mth iteration. We use Status as initial value. |
model |
specifies your model, it can be |
Kalbfleisch, J. D. and Prentice, R. L. (2002) The Statistical Analysis of Failure Time Data. John Wiley & Sons, New York, 2nd edition.
This multi-center acute leukemia study consists of 137 patients with acute myelocytic leukemia (AML) or acute lymphoblastic leukemia (ALL) aged 7 to 52 from March 1, 1984 to June 30, 1989 at four institutions (Klein and Moeschberger, 2003). The failure time on study is defined at time (in days) to relapse or death.
data(bmt)
data(bmt)
The variables represented in the data set are as follows:
g
Disease group: 1 - All, 2 - AML Low Risk, 3 - AML High Risk.
T1
Time to death or on study time.
T2
Disease free survival time (time to relapse, death or end of study).
d1
Death indicator: 1 - Dead, 0 - Alive.
d2
Relapse indicator: 1 - Relapsed, 0 - Disease Free.
d3
Disease free survival indicator: 1 - Dead or Relapsed, 0 - Alive Disease Free.
TA
Time to Acute Graft-Versus-Host Disease.
da
Acute GVHD indicator: 1 - Developed Acute GVHD, 0 - Never Developed Acute GVHD.
TC
Time to Chronic Graft-Versus-Host Disease.
dc
Chronic GVHD Indicator: 1 - Developed Chronic GVHD, 0 - Never Developed Chronic GVHD.
TP
Time to return of platelets to normal levels.
dp
Platelet recovery indicator: 1 - platelets returned to normal, 0 - platelets never returned to normal.
Z1
Patient age in years.
Z2
Donor age in years.
Z3
Patient sex: 1 - Male, 0 - Female.
Z4
Doner sex: 1 - Male, 0 - Female.
Z5
Patient CMV status: 1 - CMV positive, 0 - CMV negative.
Z6
Donor CMV status: 1 - CMV positive, 0 - CMV negative.
Z7
Waiting time to transplant in days.
Z8
FAB: 1 - FAB Grade 4 or 5 and AML, 0 - otherwise.
Z9
Hospital: 1 - The Ohio State University, 2 - Alferd , 3 - St. Vincent, 4 - Hahnemann.
Z10
MTX: used as a Graft-Versus-Host-Prophylactic 1 - Yes, 0 - No.
Klein, J. P. and Moeschberger, M. L. (2003) Survival Analysis: Techniques for Censored and Truncated Data. Springer, New York, 2nd edition.
EM algorithm is based on Peng et al. (2007) and ES algorithm is based on Niu and Peng (2013). ES algorithm is an estension of the EM algorithm where the M-step of the EM algorithm is replaced by a step requiring the solution of a series of generalised estimating equations. Both algorithm are used for the analysis of survival cure data with potential correlation.
emes(Time, Status, X, Z, id, corstr, stdz, esmax, eps)
emes(Time, Status, X, Z, id, corstr, stdz, esmax, eps)
Time |
right censored data which is the follow up time. |
Status |
the censoring indicator, normally 0 = event of interest happens, and 0 = censoring. |
X |
a matrix of covariates corresponding to the latency part. |
Z |
a matrix of covariates corresponding to the incidence part. |
id |
a vector which identifies the clusters. The length of |
corstr |
a character string specifying the correlation structure. The following are permitted: |
stdz |
If it is TRUE, all the covariates in the |
esmax |
specifies the maximum iteration number. If the convergence criterion is not met, the ES iteration will be stopped after |
eps |
tolerance for convergence. The default is |
ES algorithm is an estension of the EM algorithm where the M-step of the EM algorithm is replaced by a step requiring the solution of a series of generalised estimating equations. We use the ES algorithm for the analysis of survival cure data with potential correlation.
es(Time, Status, X, Z, id, model, corstr, stdz, esmax, eps)
es(Time, Status, X, Z, id, model, corstr, stdz, esmax, eps)
Time |
right censored data which is the follow up time. |
Status |
the censoring indicator, normally 0 = event of interest happens, and 0 = censoring. |
X |
a matrix of covariates corresponding to the latency part. |
Z |
a matrix of covariates corresponding to the incidence part. |
id |
a vector which identifies the clusters. The length of |
model |
specifies your model, it can be |
corstr |
a character string specifying the correlation structure. The following are permitted: |
stdz |
If it is TRUE, all the covariates in the |
esmax |
specifies the maximum iteration number. If the convergence criterion is not met, the ES iteration will be stopped after |
eps |
tolerance for convergence. The default is |
Fit the PH model in the latency part with the GEE approach.
geebt(Status, Lambda, X, beta, w, id, corstr)
geebt(Status, Lambda, X, beta, w, id, corstr)
Status |
the censoring indicator, normally 0 = event of interest happens, and 0 = censoring |
Lambda |
initial cumulative baseline hazard function from the GEE with independence working corrlation matrix. |
X |
a matrix of covariates corresponding to the latency part. |
beta |
initial beta for the GEE for the latency part. We use 0 as the initial value. |
w |
conditional probability of a patient remains uncured at the mth iteration. We use Status as initial value. |
id |
a vector which identifies the clusters. The length of |
corstr |
a character string specifying the correlation structure. The following are permitted: |
Fit the marginal proportional hazards mixture cure (PHMC) model with the generalized estimating equations (GEE). GEE approach is generalized to the marginal PHMC model through the expectation-solution (ES) algorithm to account for the correlation among the cure statuses and the dependence among the failure times of uncured patients to improve the estimation efficiency.
geecure(formula, cureform, data, id, model = c("para", "semi"), corstr = c("independence", "exchangeable"), Var = TRUE, stdz = FALSE, boots = FALSE, nboot = 100, esmax = 100, eps = 1e-06)
geecure(formula, cureform, data, id, model = c("para", "semi"), corstr = c("independence", "exchangeable"), Var = TRUE, stdz = FALSE, boots = FALSE, nboot = 100, esmax = 100, eps = 1e-06)
formula |
a formula expression, of the form |
cureform |
a formula expression, of the form |
data |
a data frame in which to interpret the variables named in the |
id |
a vector which identifies the clusters. The length of |
model |
specifies your model, it can be |
corstr |
a character string specifying the correlation structure. The following are permitted: |
Var |
If it is TRUE, the program returns Std.Error by the sandwich method. By default, |
stdz |
If it is TRUE, all the covariates in the |
boots |
If it is TRUE, the program returns Std.Error by the bootstrap method. By default, |
nboot |
the number of bootstrap samples. The default is |
esmax |
specifies the maximum iteration number. If the convergence criterion is not met, the ES iteration will be stopped after |
eps |
tolerance for convergence. The default is |
The marginal PHMC model is considered in this function. For cure rate, a logistic regression model is employed and the probability of being cured is given by . For uncured subject, the failure time is modeled by either the parametric proportional hazards model with Weibull baseline distributions or the semiparametric proportional hazards model. A covariate can be used either in
formula
or in cureform
or in both. The model parameters are estimated by the expectation-solution (ES) algorithm and the standard error estimates are obtained from sandwich variance formula based on Niu and Peng (2014) and Niu et al. (2018).
An object of class geecure
is returned. It can be examined by print.geecure()
.
Niu, Y. and Peng, Y. (2014) Marginal regression analysis of clustered failure time data with a cure fraction. Journal of Multivariate Analysis, 123: 129-142.
Niu, Y., Song, L., Liu, Y, and Peng, Y. (2018) Modeling clustered long-term survivors using marginal mixture cure model. Biometrical Journal, doi: 10.1002/bjmj.201700114.
# Be patient, the following examples may take several minites on a faster computer. # Example 1. Fit the marginal parametric PHMC model for the smoking cessation data. data(smoking) smoking$Time <- ifelse(smoking$Relapse == 0, smoking$Timept1, (smoking$Timept1 + smoking$Timept2)/2) # plot the KM survival curve of smoking cessation data plot(survfit(Surv(Time, Relapse) ~ SexF + (SI.UC), data = smoking), ylab = "Survival function", xlab = "Years", ylim = c(0.5, 1), xlim = c(0, 6), lty = 1:4, col = 1:4) legend(0.5, 0.63, c("SI/Male", "SI/Female", "UC/Female", "UC/Male"), cex = 1, col = c(2, 4, 3, 1), lty = c(2, 4, 3, 1)) geesmokingind <- geecure(Surv(Time, Relapse) ~ SexF + Duration + SI.UC + F10Cigs + SexF * SI.UC, cureform = ~ SexF + Duration + SI.UC + F10Cigs + SexF * SI.UC, data = smoking, model = "para", id = smoking$Zip, corstr = "independence") geesmokingexch <- geecure(Surv(Time, Relapse) ~ SexF + Duration + SI.UC + F10Cigs + SexF * SI.UC, cureform = ~ SexF + Duration + SI.UC + F10Cigs + SexF * SI.UC, data = smoking, model = "para", id = smoking$Zip, corstr = "exchangeable") # Example 2. Fit the marginal semiparametric PHMC model for the bmt data. data(bmt) bmt$g <- factor(bmt$g, label = c("ALL", "AML low risk", "AML high risk")) bmt$Z8 <- factor(bmt$Z8, label = c("Otherwise", "FAB")) # plot the KM survival curve of bmt data plot(survfit(Surv(T2, d3) ~ 1, data = bmt), xlab = "Days", ylab = "Survival Probability", cex.lab = 1.7, cex.axis = 2, cex.main = 1.7, mark.time = TRUE) geebmtind <- geecure(Surv(T2, d3) ~ factor(g) + Z8, cureform = ~ factor(g) + Z8, data = bmt, model = "semi", id = bmt$Z9, corstr= "independence") geebmtexch <- geecure(Surv(T2, d3) ~ factor(g) + Z8, cureform = ~ factor(g) + Z8, data = bmt, model = "semi", id = bmt$Z9, corstr= "exchangeable", stdz = TRUE, boots = TRUE) # Example 3. Fit the marginal semiparametric PHMC model for the tonsil data. data(tonsil) tonsil<-tonsil[-c(141,136,159),] tonsil$Sex <- ifelse(tonsil$Sex == 1, 0, 1) tonsil$Cond <- ifelse(tonsil$Cond == 1, 0, 1) tonsil$T <- ifelse(tonsil$T < 4, 0, 1) # plot the KM survival curve of tonsil data plot(survfit(Surv(Time, Status) ~ 1, data = tonsil), xlab = "Days", ylab = "Survival Probability", cex.lab = 1.7, cex.axis = 2, cex.main = 1.7, mark.time = TRUE) geetonsilind <- geecure2(Surv(Time, Status) ~ Sex + factor(Grade) + Age + Cond + T, cureform = ~ Sex + factor(Grade) + Age + Cond + T, data = tonsil, id = tonsil$Inst, corstr = "independence") geetonsilexch <- geecure2(Surv(Time, Status) ~ Sex + factor(Grade) +Age + Cond + T, cureform = ~ Sex + factor(Grade) + Age + Cond + T, data = tonsil, id = tonsil$Inst, corstr = "exchangeable", stdz = TRUE, Var = FALSE)
# Be patient, the following examples may take several minites on a faster computer. # Example 1. Fit the marginal parametric PHMC model for the smoking cessation data. data(smoking) smoking$Time <- ifelse(smoking$Relapse == 0, smoking$Timept1, (smoking$Timept1 + smoking$Timept2)/2) # plot the KM survival curve of smoking cessation data plot(survfit(Surv(Time, Relapse) ~ SexF + (SI.UC), data = smoking), ylab = "Survival function", xlab = "Years", ylim = c(0.5, 1), xlim = c(0, 6), lty = 1:4, col = 1:4) legend(0.5, 0.63, c("SI/Male", "SI/Female", "UC/Female", "UC/Male"), cex = 1, col = c(2, 4, 3, 1), lty = c(2, 4, 3, 1)) geesmokingind <- geecure(Surv(Time, Relapse) ~ SexF + Duration + SI.UC + F10Cigs + SexF * SI.UC, cureform = ~ SexF + Duration + SI.UC + F10Cigs + SexF * SI.UC, data = smoking, model = "para", id = smoking$Zip, corstr = "independence") geesmokingexch <- geecure(Surv(Time, Relapse) ~ SexF + Duration + SI.UC + F10Cigs + SexF * SI.UC, cureform = ~ SexF + Duration + SI.UC + F10Cigs + SexF * SI.UC, data = smoking, model = "para", id = smoking$Zip, corstr = "exchangeable") # Example 2. Fit the marginal semiparametric PHMC model for the bmt data. data(bmt) bmt$g <- factor(bmt$g, label = c("ALL", "AML low risk", "AML high risk")) bmt$Z8 <- factor(bmt$Z8, label = c("Otherwise", "FAB")) # plot the KM survival curve of bmt data plot(survfit(Surv(T2, d3) ~ 1, data = bmt), xlab = "Days", ylab = "Survival Probability", cex.lab = 1.7, cex.axis = 2, cex.main = 1.7, mark.time = TRUE) geebmtind <- geecure(Surv(T2, d3) ~ factor(g) + Z8, cureform = ~ factor(g) + Z8, data = bmt, model = "semi", id = bmt$Z9, corstr= "independence") geebmtexch <- geecure(Surv(T2, d3) ~ factor(g) + Z8, cureform = ~ factor(g) + Z8, data = bmt, model = "semi", id = bmt$Z9, corstr= "exchangeable", stdz = TRUE, boots = TRUE) # Example 3. Fit the marginal semiparametric PHMC model for the tonsil data. data(tonsil) tonsil<-tonsil[-c(141,136,159),] tonsil$Sex <- ifelse(tonsil$Sex == 1, 0, 1) tonsil$Cond <- ifelse(tonsil$Cond == 1, 0, 1) tonsil$T <- ifelse(tonsil$T < 4, 0, 1) # plot the KM survival curve of tonsil data plot(survfit(Surv(Time, Status) ~ 1, data = tonsil), xlab = "Days", ylab = "Survival Probability", cex.lab = 1.7, cex.axis = 2, cex.main = 1.7, mark.time = TRUE) geetonsilind <- geecure2(Surv(Time, Status) ~ Sex + factor(Grade) + Age + Cond + T, cureform = ~ Sex + factor(Grade) + Age + Cond + T, data = tonsil, id = tonsil$Inst, corstr = "independence") geetonsilexch <- geecure2(Surv(Time, Status) ~ Sex + factor(Grade) +Age + Cond + T, cureform = ~ Sex + factor(Grade) + Age + Cond + T, data = tonsil, id = tonsil$Inst, corstr = "exchangeable", stdz = TRUE, Var = FALSE)
Fit the semiparametric marginal proportional hazards mixture cure (PHMC) model for clustered failure time data. The function is based on the methods proposed by Peng et al. (2007) and Niu and Peng (2013).
geecure2(formula, cureform, data, id, corstr = c("independence", "exchangeable"), Var = TRUE, stdz = FALSE, boots = FALSE, nboot = 100, esmax = 100, eps = 1e-06)
geecure2(formula, cureform, data, id, corstr = c("independence", "exchangeable"), Var = TRUE, stdz = FALSE, boots = FALSE, nboot = 100, esmax = 100, eps = 1e-06)
formula |
a formula expression, of the form |
cureform |
a formula expression, of the form |
data |
a data frame in which to interpret the variables named in the |
id |
a vector which identifies the clusters. The length of |
corstr |
a character string specifying the correlation structure. The following are permitted: |
Var |
If it is TRUE, the program returns Std.Error by the sandwich method. By default, |
stdz |
If it is TRUE, all the covariates in the |
boots |
If it is TRUE, the program returns Std.Error by the bootstrap method. By default, |
nboot |
the number of bootstrap samples. The default is |
esmax |
specifies the maximum iteration number. If the convergence criterion is not met, the ES iteration will be stopped after |
eps |
tolerance for convergence. The default is |
The semiparametric marginal PHMC model is considered in this function. For cure rate, a logistic regression model is employed and the probability of being cured is given by . For uncured subject, the failure time is modeled by the semiparametric proportional hazards model. A covariate can be used either in
formula
or in cureform
or in both. When corstr = independence
, the model parameters are estimated by the expectation-maximization (EM) algorithm and the standard error estimates are obtained from sandwich variance formula based on Peng et al. (2007). When corstr = exchangeable
, stdz = TRUE
and boots = TRUE
, the model parameters are estimated by the expectation-solution (ES) algorithm and the standard error estimates are obtained from bootstrap variance formula based on and Niu et al. (2013).
An object of class geecure2
is returned. It can be examined by print.geecure2()
.
Peng, Y., Taylor, J. M. G., and Yu, B. (2007) A marginal regression model for multivariate failure time data with a surviving fraction. Lifetime Data Analysis, 13: 351-369.
Niu, Y. and Peng, Y. (2013) A semiparametric marginal mixture cure model for clustered survival data. Statistics in Medicine, 32: 2364-2373.
# Example. Fit the marginal semiparametric PHMC model for the bmt data. data(bmt) geebmtind2 <- geecure2(Surv(T2, d3) ~ Z8, cureform = ~ Z8, data = bmt, id = bmt$Z9, corstr= "independence") geebmtexch2 <- geecure2(Surv(T2, d3) ~ Z8, cureform = ~ Z8, data = bmt, id = bmt$Z9, corstr= "exchangeable", stdz = TRUE, Var = FALSE)
# Example. Fit the marginal semiparametric PHMC model for the bmt data. data(bmt) geebmtind2 <- geecure2(Surv(T2, d3) ~ Z8, cureform = ~ Z8, data = bmt, id = bmt$Z9, corstr= "independence") geebmtexch2 <- geecure2(Surv(T2, d3) ~ Z8, cureform = ~ Z8, data = bmt, id = bmt$Z9, corstr= "exchangeable", stdz = TRUE, Var = FALSE)
Fit the logistic model in the incidence part with the GEE approach
geega(w, Z, gamma, id, corstr)
geega(w, Z, gamma, id, corstr)
w |
conditional probability of a patient remains uncured at the mth iteration. We use Status as initial value. |
Z |
a matrix of covariates corresponding to the incidence part. |
gamma |
initial beta for the GEE for the latency part. We use 0 as the initial value. |
id |
a vector which identifies the clusters. The length of |
corstr |
a character string specifying the correlation structure. The following are permitted: |
Obtain the initial value of the cumulative baseline hazard function in the latency part through the GEE with the independence
working correlation matrix.
initial_Lambda(Time, Status, X, Z, id, model, corstr)
initial_Lambda(Time, Status, X, Z, id, model, corstr)
Time |
right censored data which is the follow up time. |
Status |
the censoring indicator, normally 0 = event of interest happens, and 0 = censoring. |
X |
a matrix of covariates corresponding to the latency part. |
Z |
a matrix of covariates corresponding to the incidence part. |
id |
a vector which identifies the clusters. The length of |
model |
specifies your model, it can be |
corstr |
a character string specifying the correlation structure. The following are permitted: |
Output of geecure
object.
## S3 method for class 'geecure' print(x, ...)
## S3 method for class 'geecure' print(x, ...)
x |
an object of |
... |
further arguments to be added in the |
Output of geecure2
object.
## S3 method for class 'geecure2' print(x, ...)
## S3 method for class 'geecure2' print(x, ...)
x |
an object of |
... |
further arguments to be added in the |
The original data consist of 223 people enrolled in the study between November 1986 and February 1989 from 51 zip codes in the southeastern corner of Minnesota in the United States (Banerjee and Carlin, 2004). In this study, smokers were randomly assigned to one of two treatment groups: smoking intervention (SI) group or usual care (UC) group. The survival time is defined as the time (in years) required for a failed quitter to resume smoking. The people residing in the area with the same zip code form a cluster and may be spatially correlated due to the shared environment.
data(smoking)
data(smoking)
Observed covariates include
SexF
0 = male, 1 = female.
Duration
duration as smoker in years.
SI.UC
intervention type: 1 = smoking intervention (SI), 0 = usual care (UC).
F10Cigs
the average number of cigarettes smoked per day over the last 10 years (rounded).
Relapse
1 = relapse, 0 = no relapse.
Timept1
the time of study entry.
Timept2
the time of resume smoking.
Zip
51 zip codes in the southeastern corner of Minnesota.
Banerjee, S. and Carlin, B. P. (2004) Parametric spatial cure rate models for interval-censored time-to-relapse data. Biometrics, 60: 268-275.
A tonsil cancer clinical trial study conducted by the Radiation Therapy Oncology Group in the United States. The survival time is defined as the time (in days) from diagnosis to death. In this study, patients in one institution were randomly assigned to one of two treatment groups: radiation therapy alone or radiation therapy together with a chemotherapeutic agent. A part of the data from the study is available in Kalbfleisch and Prentice (2002).
data(tonsil)
data(tonsil)
A part of the data from the study is available in Kalbfleisch and Prentice (2002), which includes times (in days) from diagnosis to death of 195 patients with squamous cell carcinoma of three sites in the oropharynx between 1968 and 1972 in six participating institutions. Other variables include
Inst
institution code, from 1 to 6, represents six participating institutions
Sex
1 = male, 2 = female.
Trt
treatment: 1 = standard, 2 = test.
Grade
1 = well differentiated, 2 = moderately differentiated, 3 = poorly differentiated.
Age
in years at time of diagnosis.
Cond
condition: 1 = no disability, 2 = restricted work, 3 = requires assistance with self care, 4 = bed confined.
Site
1 = faucial arch, 2 = tonsillar fossa, 3 = posterior pillar, 4 = pharyngeal tongue, 5 = posterior wall.
T
T staging: 1 = primary tumor measuring 2 cm or less in largest diameter; 2 = primary tumor measuring 2 to 4 cm in largest diameter, minimal infiltration in depth; 3 = primary tumor measuring more than 4 cm; 4 = massive invasive tumor.
N
N staging: 0 = no clinical evidence of node metastases; 1 = single positive node 3 cm or less in diameter, not fixed; 2 = single positive node more than 3 cm in diameter, not fixed; 3 = multiple positive nodes or fixed positive nodes.
EntryDate
Date of entry: Day of year and year.
Status
0 = censored, 1 = dead.
Time
in days from day of diagnosis.
Kalbfleisch, J. D. and Prentice, R. L. (2002) The Statistical Analysis of Failure Time Data. John Wiley & Sons, New York, 2nd edition.
A bootstrap sample of tonsil data by sampling Zip
with replacement.
data(tonsil_bootsample)
data(tonsil_bootsample)
Calculate the variance estimates using the sandwich formula based on the ES algorithm.
varest(Time, Status, X, Z, id, gamma, beta, kappa, gphi, gcor, bphi, bcor, Lambda, w, model)
varest(Time, Status, X, Z, id, gamma, beta, kappa, gphi, gcor, bphi, bcor, Lambda, w, model)
Time |
right censored data which is the follow up time. |
Status |
the censoring indicator, normally 0 = event of interest happens, and 0 = censoring. |
X |
a matrix of covariates corresponding to the latency part. |
Z |
a matrix of covariates corresponding to the incidence part. |
id |
a vector which identifies the clusters. The length of |
gamma |
the estimates for the incidence part. |
beta |
the estimates for the latency part. |
kappa |
the estimate of the shape parameter in the Weibull baseline hazard function when |
gphi |
the estimate of the scale parameter |
gcor |
the estimate of the correlation parameter |
bphi |
the estimate of the scale parameter |
bcor |
the estimate of the correlation parameter |
Lambda |
the estimate of the cumulative baseline hazard function in the GEE for the latency part. |
w |
conditional probability of a patient remains uncured. |
model |
specifies your model, it can be |
Calculate the variance estimates using the sandwich formula based on the EM algorithm.
varest2(Time, Status, X, Z, id, gamma, beta, bsurv, w)
varest2(Time, Status, X, Z, id, gamma, beta, bsurv, w)
Time |
right censored data which is the follow up time. |
Status |
the censoring indicator, normally 0 = event of interest happens, and 0 = censoring. |
X |
a matrix of covariates corresponding to the latency part. |
Z |
a matrix of covariates corresponding to the incidence part. |
id |
a vector which identifies the clusters. The length of |
gamma |
the estimates for the incidence part. |
beta |
the estimates for the latency part. |
bsurv |
the estimate of the baseline survival function for the latency part. |
w |
conditional probability of a patient remains uncured. |