International Journal of Preventive Medicine

: 2021  |  Volume : 12  |  Issue : 1  |  Page : 9-

Prognostic Factors Associated with Curing in Patients with Breast Cancer: A Joint Frailty Model

Freshteh Osmani1, Ebrahim Hajizadeh2, Mohammad Esmaeil Akbari3,  
1 Infectious Diseases Research Center, Birjand University of Medical Sciences, Birjand, Iran
2 Department of Biostatistics, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
3 Cancer Research Center, Shahid Beheshti University of Medical Sciences, Tehran, Iran

Correspondence Address:
Freshteh Osmani
Infectious Diseases Research Center, Birjand University of Medical Sciences, Birjand


Background: Recurrent event data arise frequently in longitudinal medical studies. In many situations, there are a large portion of patients without any recurrences, manifesting the “zero-inflated” nature of the data. Moreover, there often exists a terminal event which may be correlated with the recurrent events. The goal of this study is to extend the application of joint frailty model to identify the prognostic factors associated with curing in patients with breast cancer. Methods: As a prospective study, medical records of women who had been attended to Cancer Research Center, Shahid Beheshti University of Medical Sciences from January 1998 to February 2016 were reviewed. Finally, after an initial review of medical records, 711 patients were included in the study and analyzed. A checklist that included items drawn from the demographic background of patients was provided in the study. Two joint frailty models for zero-inflated recurrent events, combining a logistic model for “structural zero” status (Yes/No) and a joint frailty proportional hazards model for recurrent and terminal event times were performed to identify factors associated with BCS. Results: The mean age of patients was 38.2 years. The numbers of subjects with 1, 2, 3, and 4 recurrent events were 392, 207, 97, and 15, respectively. The median follow-up time was 6.87 years. There were 137 (19.2%) deaths from cancer during the follow-up. Among the 574 patients who were censored, 418 had no tumor recurrence. Thus, there may exist a large portion of “cured” subjects. We can see that the radiation (OR = 6.02, CI = (3.87, 8.61)) and tumor size interaction with radiation (OR = 1.065, CI = (1.002–1.26)) were significant in the cure model (P < 0.05) which means that patients with smaller tumor sizes were more likely to be cured by radiation therapy. Conclusions: Our proposed models can help investigators to evaluate which treatment will result in a higher fraction of cured subjects. This is usually an important research question in biomedical studies.

How to cite this article:
Osmani F, Hajizadeh E, Akbari ME. Prognostic Factors Associated with Curing in Patients with Breast Cancer: A Joint Frailty Model.Int J Prev Med 2021;12:9-9

How to cite this URL:
Osmani F, Hajizadeh E, Akbari ME. Prognostic Factors Associated with Curing in Patients with Breast Cancer: A Joint Frailty Model. Int J Prev Med [serial online] 2021 [cited 2021 Feb 26 ];12:9-9
Available from:

Full Text


Breast cancer is the most common cancer among women worldwide comprising 16% of all female cancers and it is estimated that 519,000 women died in 2004 because of breast cancer.[1] Although, the incidence of breast cancer in developed countries is more than developing countries, about 50% of all diagnosis of breast cancer[2] and majority (69%) of all breast cancer deaths occur in developing countries.[2] The incidence rate of breast cancer is rising in Iran too. Based on a review of published articles from January 1998 to December 2005, the incidence rate of breast cancer was reported as 22/100,000 in Iran. About 72% of the breast cancers diagnosed included a tumor over 2 cm.[3],[4],[5] Most patients present with an advanced stage of the disease are younger than the patients in Western countries.[6]

Breast cancer can be detected at early stages and it has been shown that if breast cancer discovered early, it can often be cured.

In many situations, a substantial portion of subjects has no recurrent events. i.e., we have zero-inflated recurrent event data. Zero-inflated models have been proposed for many types of data, e.g., continuous[7] and count data.[8]

In these models, a portion of the data has an outcome of Y = 0 with a probability P, while the remaining subjects have a specific distribution hence, zero values can come from either the “structural zero” or from (“random zero”). These models can be classified as special cases of finite mixture models of two distributions.[9] It is possible that we observe zero recurrent events for some subjects. Clinically, it is an extension of cure models for single event data.[10] For a cured case, a zero means that none of the events of interest occurs, which maps to waiting until infinity. For a noncured case, the event of interest will occur eventually if the follow-up time is long enough. Although given a finite maximum of follow-up, we may not observe its occurrence, i.e. their follow-up is stopped before the occurrence of the first event.[11] For a subject who has no events being observed, we cannot determine whether he is “cured” or “not cured.” However, the aforementioned cure models can provide an estimate of the probability of “cure”. By doing this, we can identify the treatment that results in a high proportion of “cured” subjects, and make the distinction between these two situations. More recently, Rondeau extended the “cure” model to recurrent events in the frailty model framework.[12]

On the other hand, there is an increasing interest in the joint analysis of recurrent and terminal events, e.g., death, as the recurrent and terminal events are often correlated. Other related work includes Ghosh and Lin (2000), Ye et al. (2007), Huang and Liu (2007), Liu and Huang (2008, 2009).[13],[14],[15],[16],[17] As many of these articles showed that ignoring the informative terminal events could lead to biased estimates and there may be different impacts of “cure” on recurrent events.[18] Thus, it is important to consider recurrent and terminal events in the “cure” model framework.[19] Accordingly, our purpose in this study was to extend the applications of joint frailty model to identify the prognostic factors associated with curing in patients with breast cancer


Study design and participants

This prospective cohort study was conducted at the Shahid Beheshti Breast Cancer Research Center, using medical records. Eligibility criteria required subjects to be female; the follow-up period was at least 18 years after initial surgery from January 1998 until February 2016. In this follow-up period, we identified 711 recurrent breast cancers and included them in the current study. Length of follow-up was not considered as inclusion criteria. Based on the time protocol of the Cancer Research Center, the patient's information was reviewed and updated. Criteria for entering each patient in the study: All patients with a definite diagnosis of breast cancer who were followed up at Shohada Tajrish Hospital for at least 6 months after surgery. Exit criteria in this study were including incomplete information for each patient that these observations were made due to a defect in medical records and patient pathology reports. Moreover, patients who have been followed up for up to 5 months and variables that overlap with the results of the research were removed. Patients from the time of breast-conserving surgery were considered at risk of recurrence or death.

All procedures performed in the study were in accordance with the ethical standards of the institutional ethics committee approval from the medical science of Tarbiat Modares University. The current study was extracted from a Ph.D. thesis, which was checked and approved by the Ethics Committee of the Tarbiat Modares University of Medical Sciences (IR.TMU.PHNS.REC.1396.91).

Variables assessment

Data on the original breast cancer (diagnosis date, surgery type, tumor stage, the number of involved lymph nodes tumor grade, family history, and treatment) were obtained and confirmed via medical records. Reported breast cancer recurrences or new primary breast cancers diagnosed after study enrollment were adjudicated by breast oncologist. Recurrent cancers were classified as any type of recurrence: local/regional or distant metastasis and patients with no recurrence. Follow-up time was censored at the time of death (if not from breast cancer), at the last documented staff contact date, or at study completion (Feb 1, 2016)

Statistical analysis

To examine modeling recurrent event data, we fitted the frailty proportional hazards model that is specified as

λi (t|νi) = λ0(t) exp (βTZi + νi)

Where, β is the regression coefficient, λ0(u) is the baseline hazard function. When there exist a high portion of zero events, there may be patients who have no probability to experience the events, or they are or “cured” denote by A = 1 for “cured,” and A = 2 otherwise (“not cured”). Notably, subjects with zero recurrent events could fall in either class. A cure frailty model was proposed by Rondeau.[12]

In part I, a logistic model is used to describe the probability of cure, namely pi = P (Ai = 1):

logit (pi) = αTZi

Part II is a frailty proportional hazards model for the recurrent event among those “not cured”

λi (t|νi, Ai = 2) = λ0(t) exp(βTZi + νi)

Also, there exists a terminal event (death duo cancer) correlated with recurrences, for patients who not cured, the hazard for death is:

h (t|νi, Ai = 2) = h0(t) exp(ηTZi + νi)

By combining the above equations, we have a joint model of recurrence and survival with a cure fraction. A “cured subject cannot experience any recurrence, nor death due to the disease and death due to other causes will be considered as censoring.

In this article, we present two zero-inflated models for the recurrent event data with a large portion of zero events, both models have three parts: (1) a logistic model for “cure” which comes from those patients with no probability to experience the relapses; (2) a frailty proportional hazard model for the recurrent events of other subjects who are not cured; and (3) a Cox model for death with shared frailty from the recurrent event model, clinically, on whether there is “cure” for the terminal event (death). The analysis was performed using R software (version 10.3.2), and the estimation has been implemented in Proc NLMIXED of SAS software.


[Table 1] gives demographic, medical, and treatment characteristics of a total of 711 patients receiving surgery to remove tumors as the study sample. Patient's diagnosis age in this study was between 22 to 84 years with mean and standard deviation 47.84, 11.75, respectively. The most prevalence of stage tumors and grade were 49.1% and 55.8% in patients with stage tumor and grade II, respectively. Among all of the study samples, 68.1% have no family history. Patients may also receive adjuvant treatments such as chemotherapy and/or radiotherapy. Regarding treatment, 64% of women underwent a BSC (the remainder underwent a mastectomy radiation therapy), 96% received adjuvant chemotherapy. To find out the effect of these adjuvant treatments on tumor recurrences, two types of recurrences were recorded: 138 local recurrences, 413 distant recurrences, and 332 (49%) patients had no recurrence, demonstrating the zero-inflated nature of the data. The median follow-up time was 5.96 years. The numbers of patients with 1, 2, 3, and 4 recurrences were 183, 118, 39, and 5, respectively.{Table 1}

There were 273 (39%) deaths from cancer. Among the 407 patients who were censored, 308 had no tumor recurrence, illustrating a large portion of “cured” subjects in the dataset, who had no recurrences and did not die from cancer. Then this joint model was applied. Covariates, such as chemotherapy (Yes/No), radiotherapy (Yes/No), age at baseline, and the maximum tumor size at baseline were included in all three models. We also considered interactions between adjuvant treatments (chemotherapy or radiation) in these models. As a result, only radiation and tumor size interaction was significant in the logistic cure model (2). We included this interaction term in all three sub-models. The estimation results are shown in [Table 2].{Table 2}

The interaction of radiation and tumor size was significant in the cure model (P = 0.005). Radiation therapy was obtained as a more effective treatment for patients with smaller tumor sizes; we calculated the radiation effect on the ORs of “cure” for different tumor sizes.

The ORs of radiation effect were 3.73, 1.96, and 0.62 at 5, 10, and 20 cm, respectively, i.e. when the tumor size was large, the radiation decrease the cure probability, However, the chemotherapy did not have a significant effect on the cure (P = 0.13). For not cured patients, the interaction of radiation and tumor size was not significant. And only the main effect of tumor size was significant on tumor recurrences and death. Patients with larger tumors were more likely to have disease recurrences HR = exp (0.048) = 1.049, P = 0.008), as well as higher mortality rates (HR = exp (0.059) =1.061, P = 0.006). We also fit a joint model without cure fraction for tumor recurrences and death, [Table 2]. However, the interaction between tumor size and radiation was significant on both recurrent and death in this reduced model. But the sign of these effects were different from those in another model. As aresult, not cured patients would not benefit from radiation therapy. Hence, after identifying patients to be cured more likely by radiation by applying a cure model, we would not use radiation therapy to those not likely to be cured.

The estimate of frailty variance in the model with a cure fraction was smaller than that in the reduced model. This suggests that the logistic cure model alone effectively captured the heterogeneity. As a result, there was much less variation in recurrent events for those no cured patients.


In this paper, we presented two joint frailty models for zero-inflated recurrent and survival. Our proposed models can help investigators to evaluate which treatment will result in a higher fraction of cured subjects. This is usually an important research question in biomedical studies. The “cure” component in the proposed models also helps explain a large portion of heterogeneity among study subjects. Accordingly, this model can provide more precise analysis and offer in-depth insight. For example, we detected a large fraction of cured subjects in this data set. In the breast cancer study, chemotherapy had no significant effect while radiation therapy could lead to a higher probability of cure for small tumor size, the proposed model should be applied when there exist a large fraction of zero recurrent events. Furthermore, we present an estimation method which can be conveniently implemented in SAS Proc NLMIXED. This model can be extended in different directions.[20] Firstly, we can use other frailty distribution, e.g., Gamma.

Secondly, we can model multiple types of recurrent events separately, e.g., local and distant recurrences with two correlated zero-inflated frailty models. Lastly, there may exist nonlinear or time-varying covariate effects in the recurrent or terminal event models, nonparametric methods can be used to capture these effects.

One key advantage of this proposed approach is that different covariate effects can be assessed by the two parts of recurrence hazard functions or death hazard function, these covariates can be time-dependent or independent. In addition, they also provide information on whether one or both types of recurrences can be used as substitute endpoints for overall survival.[21],[22] We noticed that locoregional relapses and death are related events. We did the same joint analysis with distant metastasis and death and we conclude that these two events are also associated. We may conclude that if we omit to consider the distant metastasis events Convergence of the model could be hard to obtain in data with few events. However, convergence could be reached when the number of parameters to estimate decrease.[23],[24],[25],[26] This study gives estimates of incidence and mortality rates in epidemiology, which are meaningful for clinicians. In analyses of the natural history of cancer, there is great interest in a dynamic prediction of death, that is, in the computation of the predictive distribution of death at a certain moment of time, given the history of events (local or distant relapses) and covariates until that moment.[27] As the strength of this study, the results obtained from the proposed model in this study are highly valued due to controlling the zero events by handling them, unlike other survival models that considered them as a censor. These predictions and a measure of their accuracy are in progress, they may provide valuable insight for future research. Notably, some limitations of this study should be considered. First, these results should be considered with caution, given the small number of recurrent events. It may be as a limitation for this study. Incomplete information of some of these patients was another limitation of our study.


In conclusion, in this sample of breast cancer women, this model can be extended in several directions. Firstly, we can model multiple types of recurrent events separately, e.g., local and distant recurrences in this study, with two correlated zero-inflated frailty models. Secondly, there may exist nonlinear or time-varying covariate effects in the recurrent or terminal event models, nonparametric methods can be used to capture these effects. In the other hand, our findings provide a deep insight into challenging the clinician to evaluate which treatment will result in a higher fraction of cured subjects. This is usually an important research question in biomedical studies.

Availability of data

The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.


We would like to thank Cancer Research Center for their wholehearted cooperation for data gathering.

Financial support and sponsorship

This work was supported by the Cancer Research Center of Shahid Beheshti University of Medical Sciences.

Conflicts of interest

There are no conflicts of interest.


1Cook RJ, Lawless J. The Statistical Analysis of Recurrent Events. New York: Springer Science & Business Media; 2007.
2Osmani F, Hajizadeh E, Mansouri P. Use of smoothing methods for estimating the coefficients of time dependent rate models in survival analysis and its application in psoriasis disease. Iran J Epidemiol 2016;15;12:36-46.
3Liu L, Huang X. Joint analysis of correlated repeated measures and recurrent events processes in the presence of death, with application to a study on acquired immune deficiency syndrome. J R Stat Soc 2009;58:65-81.
4Cancer Genome Atlas Network. Comprehensive molecular portraits of human breast tumors. Nature 2012;490:61-70.
5Montagna E, Bagnardi V, Rotmensz N, Viale G, Renne G, Cancello G, et al. Breast cancer subtypes and outcome after local and regional relapse. Ann Oncol 2012;23:324-31.
6O'Shaughnessy J. Extending survival with chemotherapy in metastatic breast cancer. Oncologist 2005;10(Suppl 3):20-9.
7Prentice RL, Williams BJ, Peterson AV. On the regression analysis of multivariate failure time data. Biometrika 1981;68:373-9.
8Andersen PK, Gill RD. Cox's regression model for counting processes: A large sample study. Ann Stat 1982;10:1100-20.
9Zeng D, Lin DY. Efficient estimation of semiparametric transformation models for counting processes. Biometrika 2006;93:627-40.
10Wei LJ, Lin DY, Weissfeld L. Regression analysis of multivariate incomplete failure time data by modeling marginal distributions. J Am Stat Assoc 1989;84:1065-73.
11Osmani F, Hajizadeh E, Mansouri P. Kernel and regression spline smoothing techniques to estimate coefficient in rates model and its application in psoriasis. Med J Islam Repub Iran 2019;33:546-50.
12Huang CY, Wang MC. Joint modeling and estimation for recurrent event processes and failure time data. J Am Stat Assoc 2004;99:1153-65.
13Ye Y, Kalbfleisch JD, Schaubel DE. Semiparametric analysis of correlated recurrent and terminal events. Biometrics 2007;63:78-87.
14Cook RJ, Lawless JF, Lakhal-Chaieb L, Lee KA. Robust estimation of mean functions and treatment effects for recurrent events under event-dependent censoring and termination: Application to skeletal complications in cancer metastatic to bone. J Am Statt Assoc 2009;104:60-75.
15Park HS, Kim S, Kim K, Yoo H, Chae BJ, Bae JS, et al. Pattern of distant recurrence according to the molecular subtypes in Korean women with breast cancer. World J Surg Oncol 2012;10:4.
16Kulkarni SV, Mishra GA, Dusane RR. Determinants of compliance to breast cancer screening and referral in low socio-economic regions of urban India. Int J Prev Med 2019;10:84.
17Wu X, Baig A, Kasymjanova G, Kafi K, Holcroft C, Mekouar H, et al. Pattern of local recurrence and distant metastasis in breast cancer by molecular subtype. Cureus 2016;8:e924.
18Dignam JJ, Dukic V, Anderson SJ, Mamounas EP, Wickerham DL, Wolmark N. Hazard of recurrence and adjuvant treatment effects over time in lymph node-negative breast cancer. Breast Cancer Res Treat 2009;116:595-602.
19Ghaffari M, Rad TN, Mohammadi S, Rakhshanderou S. Effect of an intervention on the breast cancer screening behavior in women: Application of integrated behavioral model. Int J Prev Med 2018;9:99.
20Rondeau V, Mazroui Y, Gonzalez JR. frailtypack: An R package for the analysis of correlated survival data with frailty models using penalized likelihood estimation or parametrical estimation. J Stat Softw 2012;47:1-28.
21Osmani F, Hajizadeh E, Rasekhi A, Akbari ME. Multivariate joint frailty models for modeling multiple recurrent events and its application in breast cancer. RJMS 2019;25:71-9.
22Osmani F, Hajizadeh E, Rasekhi A. Association between multiple recurrent events with multivariate modeling: A retrospective cohort study. J Res Health Sci 2018;18:e00433.
23Shirzadi A, Mahmoodzadeh H, Qorbani M. Assessment of sentinel lymph node biopsy after neoadjuvant chemotherapy for breast cancer in two subgroups: Initially node negative and node positive converted to node negative – A systemic review and meta-analysis. J Res Med Sci 2019;24:18.
24Osmani F, Hajizadeh E, Rasekhi A, Akbari ME. Analyzing relationship between local and metastasis relapses with survival of patients with breast cancer: A study using joint frailty model. Int J Cancer Manag 2018;11:e81783.
25Tanis E, van de Velde CJ, Bartelink H, van de Vijver MJ, Putter H, van der Hage JA. Locoregional recurrence after breast-conserving therapy remains an independent prognostic factor even after an event free interval of 10 years in early stage breast cancer. Eur J Cancer 2012;48:1751-6.
26Osmani F, Hajizadeh E, Mansoori P. Estimation of seasonal effect on the psoriasis recurrence using time dependent coefficient rates model for recurrent events. J Dermatol Cosmet 2015;6:23-30.
27Osmani F, Hajizadeh E, Rasekhi A, Akbari M. Multivariate frailty modeling in joint analyzing of recurrent events with terminal event and its application in medical data. IRJE 2019;15:162-71.