Title: | Modeling Time-to-Event Data with Functional Predictors |
---|---|
Description: | A collection of methods for modeling time-to-event data using both functional and scalar predictors. It implements functional data analysis techniques for estimation and inference, allowing researchers to assess the impact of functional covariates on survival outcomes, including time-to-single event and recurrent event outcomes. |
Authors: | Zifang Kong [aut, cre], Sy Han (Steven) Chiou [aut], Yu-Lun Liu [aut] |
Maintainer: | Zifang Kong <[email protected]> |
License: | MIT + file LICENSE |
Version: | 1.0.0 |
Built: | 2025-03-18 06:12:41 UTC |
Source: | https://github.com/zifangkong/funsurv |
Construct an AR(1) correlation matrix
ar1_cor(n, rho)
ar1_cor(n, rho)
n |
number of events for each subject |
rho |
autoregressive correlation |
A n by n matrix
## Generate AR(1) structure ar1_cor(n = 5, rho = 0.3) ## first derivative of the AR(1) structure with respect to rho dar1_cor.drho(n = 5, rho = 0.3)
## Generate AR(1) structure ar1_cor(n = 5, rho = 0.3) ## first derivative of the AR(1) structure with respect to rho dar1_cor.drho(n = 5, rho = 0.3)
Jointly model longitudinal measurements and recurrent events, accommodating both scalar and functional predictors while capturing time-dependent correlations among events. The FRAILTY method employs a two-step estimation procedure. First, functional principal component analysis through conditional expectation (PACE) is applied to extract key temporal features from sparse, irregular longitudinal data. Second, the obtained scores are incorporated into a dynamic recurrent frailty model with an autoregressive structure to account for within-subject correlations across recurrent events. This function works only for univariate functional data.
AR1_FRAILTY( formula, sdat, fdat, para0 = c(0.5, 0.5), nbasis = 10, pve = 0.9, npc = NULL, makePD = FALSE, cov.weight.type = c("none", "counts"), iter.max = 50, eps = 1e-06 )
AR1_FRAILTY( formula, sdat, fdat, para0 = c(0.5, 0.5), nbasis = 10, pve = 0.9, npc = NULL, makePD = FALSE, cov.weight.type = c("none", "counts"), iter.max = 50, eps = 1e-06 )
formula |
A formula, with the response on the left of a ~ operator being a |
sdat |
A data frame containing subject IDs, time-to-event outcomes (starting time, end point, censoring time and event status), and scalar covariates |
fdat |
A data frame containing subject IDs, longitudinal measurements, and the corresponding time points for each measurement. |
para0 |
A vector of initial values for |
nbasis |
An integer, representing the number of B-spline basis functions
used for estimation of the mean function and bivariate smoothing of the covariance surface.
Defaults to |
pve |
A numeric value between 0 and 1, the proportion of variance
explained: used to choose the number of principal components. Defaults to
|
npc |
An integer, giving a prespecified value for the number of
principal components. Defaults to |
makePD |
Logical: should positive definiteness be enforced for the
covariance surface estimate? Defaults to |
cov.weight.type |
The type of weighting used for the smooth covariance
estimate. Defaults to |
iter.max |
Maximum number of iterations for both inner iteration and outer iteration. Defaults to |
eps |
Tolerance criteria for a possible infinite coefficient value. Defaults to |
Model specification:
Let denote the time of the
th event for subject
,
and let
represent the censoring time.
The observed event time, accounting for right censoring, is
,
and
serves as an indicator of whether the
th event for subject
is observed.
The hazard function is specified as
where is the baseline hazard function,
and
.
Here,
is the previous event time with
.
is the fixed effect parameter associated with the time-invariant covariates
,
and
is a time-varying coefficient that captures the effect of functional predictor
on the hazard rate of recurrent events.
A funsurv object containing the following components:
beta |
Estimation of coefficients of scalar covariates and FPC scores. Including estimated values, standard errors, and p-values |
beta_vcov |
Estimated variance-covariance of the estimates of beta |
eAR |
Estimation of variance components ( |
eAR_vcov |
Estimated variance of estimates of |
frailties |
Estimated frailty terms (random effects) |
basesurv |
Estimated baseline survival probability |
time |
Time points associated with baseline survival probability |
FPC |
Functional principal components |
data(simDat) fit <- AR1_FRAILTY(Recur(t_start %to% t_stop, id, status) ~ z1, sdat = sdat, fdat = fdat, iter.max = 30) summary(fit)
data(simDat) fit <- AR1_FRAILTY(Recur(t_start %to% t_stop, id, status) ~ z1, sdat = sdat, fdat = fdat, iter.max = 30) summary(fit)
A function to obtain the baseline survival function
basesurv(object)
basesurv(object)
object |
A funsurv object |
A data frame including time and baseline survival
data(simDat) fit <- AR1_FRAILTY(Recur(t_start %to% t_stop, id, status) ~ z1, sdat = sdat, fdat = fdat, iter.max = 30) plot(fit, what = "basesurv")
data(simDat) fit <- AR1_FRAILTY(Recur(t_start %to% t_stop, id, status) ~ z1, sdat = sdat, fdat = fdat, iter.max = 30) plot(fit, what = "basesurv")
First derivative of AR(1) correlation matrix with respect to the auto-regressive coefficient
dar1_cor.drho(n, rho)
dar1_cor.drho(n, rho)
n |
number of events for each subject |
rho |
autoregressive correlation |
A n by n inverse matrix
Plot method for 'funsurv' objects
## S3 method for class 'funsurv' plot(x, what = c("beta", "fpc", "basesurv"), ...)
## S3 method for class 'funsurv' plot(x, what = c("beta", "fpc", "basesurv"), ...)
x |
A funsurv object |
what |
A character string specifying what to be plotted.
Use |
... |
additional graphical parameters to be passed to methods. |
A ggplot object ...
data(simDat) fit <- AR1_FRAILTY(Recur(t_start %to% t_stop, id, status) ~ z1, sdat = sdat, fdat = fdat, iter.max = 30) plot(fit, what = "beta") plot(fit, what = "fpc") plot(fit, what = "basesurv")
data(simDat) fit <- AR1_FRAILTY(Recur(t_start %to% t_stop, id, status) ~ z1, sdat = sdat, fdat = fdat, iter.max = 30) plot(fit, what = "beta") plot(fit, what = "fpc") plot(fit, what = "basesurv")
The dataset was generated based on the proposed model ,
where
is the baseline hazard function generated from a Weibull distribution.
.
is the fixed effect parameter associated with the time-invariant covariates
,
and
is a time-varying coefficient that captures the effect of functional predictor
on the hazard rate of recurrent events.
The simulated dataset is organized into two data frames:
a survival data frame (
sdat
) and a functional data frame (fdat
).
The variables in each data frame are listed below:
data(simDat)
data(simDat)
A list with two data frame:
Survival data; a data frame with xxx rows and xxx variables:
Subjects identification
A sequence of the number of events per subject
Event starting time
Event end time
Event censoring time
Event status: status=1
if event is observed and status=0
if event is censored
A univariate scalar covariates. Can be extended to multiple scalar covariates
Functional data; a data frame with xxx rows and xxx variables:
Subjects identification
Time points for each longitudinal measurement
Longitudinal measurements at distinct time points
Simulated data