Survival analyses in cardiovascular research, part II: statistical methods in challenging situations

doi:10.1016/j.rec.2021.07.001

Abstract

This article is the second of a series of 2 educational articles. In the first article, we described the basic concepts of survival analysis, summarizing the common statistical methods and providing a set of recommendations to guide the strategy of survival analyses in randomized clinical trials and observational studies. Here, we introduce stratified Cox models and frailty models, as well as the immortal time bias arising from a poor assessment of time-dependent variables. To address the issue of multiplicity of outcomes, we provide several modelling strategies to deal with other types of time-to-event data analyses, such as competing risks, multistate models, and recurrent-event methods. This review is illustrated with examples from previous cardiovascular research publications, and each statistical method is discussed alongside its main strengths and limitations. Finally, we provide some general observations about alternative statistical methods with less restrictive assumptions, such as the win ratio method, the restricted mean survival time, and the accelerated failure time model.

Keywords

Survival analysis

Cardiovascular disease

Methodology

Based on the fundamentals covered in the previous review article,1 this second paper explores more complex and challenging situations in survival analyses. We now present some extensions to the Cox proportional hazards (CPH) model, such as the stratified and frailty models, and the use of time-dependent variables. We explore the problems faced by researchers in the cardiovascular field due to multiplicity of outcomes, and present some approaches to tackle this issue, such as the use of composite outcomes, competing risks, multistate models, and recurrent-event methods. An increasingly popular topic is the use of the win ratio approach. To provide a comprehensive overview of the most common statistical approaches in survival analyses, some other methods are briefly introduced, such as the restricted mean survival time and accelerated failure time model approaches. For a better understanding, we will illustrate how these methods have been applied to data from cardiovascular studies. This review is primarily descriptive in content, and therefore no prerequisite mathematical or statistical knowledge is necessary.

EXTENSIONS OF THE COX PROPORTIONAL HAZARDS MODEL

Stratified Cox proportional hazards model

The CPH model is by far the most commonly used model in survival analysis. Some extensions to this model can be considered when it does not provide a good fit to our data. In the stratified CPH model, instead of assuming that the proportional hazards (PH) model holds for the overall cohort, we assume that the PH model holds within groups (or strata) of individuals. Study variables that are assumed to satisfy the PH assumption are included in the model, whereas the factor being stratified is not included, and is controlled by stratification. Hence, this method does not provide an estimate of the effect of the factor (or factors) defining the groups on the hazard (ie, it does not provide a hazard ratio of the stratifying variable) and is therefore not a suitable approach if the factor exhibiting nonproportionality is of primary interest. To evaluate the effect of mineralocorticoid receptor antagonists in preventing sudden cardiac death in patients with heart failure (HF) with reduced ejection fraction, a stratified CPH model was needed to address the inevitable baseline differences across 11 032 patients recruited from 3 placebo-controlled randomized trials (RCTs): RALES (Randomized Aldactone Evaluation Study), EPHESUS (Eplerenone Post–Acute Myocardial Infarction Heart Failure Efficacy and Survival Study), and EMPHASIS-HF (NCT00232180). Although all patients had in common the fact that they had HF with reduced ejection fraction, participants from EMPHASIS-HF were in New York Heart Association class II, whereas those from RALES were in New York Heart Association class III-IV, and participants from EPHESUS had a recent myocardial infarction (MI).2 In other cases, CPH models are stratified by geographic region and baseline renal function.3

Frailty models

In a stratified CPH model, the baseline hazard functions of different strata are unrelated. This is based on the assumption that the study population is homogeneous within strata. However, individuals may differ greatly within strata (eg, in an RCT, with respect to the treatment effect, whereas in an observational study, with respect to the influence of covariates in a given association). The presence of unobserved individual-specific risk factors leads to unobserved heterogeneity in the hazard, which is also referred to as frailty, or a random effect. Importantly, in many situations the population cannot be assumed to be homogeneous (eg, a mixture of participants with different hazards). In this case, in contrast to the stratified CPH model, a frailty model is useful as it implies that baseline hazard functions are proportional to each other.

Frailty models are random effects models for time-to-event data,4 in which the random effect has a multiplicative effect on the baseline hazard function. In the context of survival models, this random effect is called “frailty” for historical reasons, as the term simply refers to the fact that some individuals are intrinsically more “frail” than others. The classic example occurs when a study involves the recruitment of patients from different hospitals. Survival times from participants at the same hospital tend to be similar (eg, due to treatment practices, level of tertiary activity, etc) and there is a greater between-hospital variability than within-hospital variability. These clustered (or hierarchical) data need a model accounting for the clustering. A natural way to model dependence of clustered event times is through the introduction of a cluster-specific random effect, the frailty.4 This random effect explains the dependence in the sense that, had we known the frailty, the events would be independent. The use of frailty models is relatively popular. They have been applied in some studies using the EPICOR (Long-term Follow-up of Antithrombotic Management Patterns in Acute Coronary Syndrome Patients) registry,5 where, in addition to adjusting for age, sex, and other relevant confounders, the model had a random effect (shared frailty) at the hospital level.5
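The contrast between the 2 extensions can be written compactly. The notation below is an assumption introduced for illustration (it does not appear in the cited studies): s indexes strata, i indexes clusters such as hospitals, j indexes patients within a cluster, x denotes the covariates, and z_i is the unobserved frailty of cluster i.

```latex
% Stratified CPH model: a separate, unrelated baseline hazard h_{0s}(t) for each stratum s
h_{s}(t \mid x) = h_{0s}(t)\,\exp(\beta^{\top} x), \qquad s = 1, \dots, S

% Shared frailty model: one common baseline hazard h_{0}(t),
% multiplied by the cluster-specific random effect (frailty) z_i
h_{ij}(t \mid x_{ij}, z_i) = z_i\, h_{0}(t)\,\exp(\beta^{\top} x_{ij}), \qquad z_i > 0
```

In the first expression no hazard ratio is estimated for the stratifying factor, whereas in the second the cluster-level hazards z_i h_0(t) are proportional to each other by construction.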

Time-dependent variables

Sometimes explanatory variables change over time in an individual (eg, treatment, blood pressure, or smoking status). These variables are known as time-dependent, time-updated, or time-varying variables. If changes over time in these variables are not taken into account, the results yielded by a survival model may be affected by a bias known as “immortal time” bias, or survivorship bias.6

Immortal time refers to a period of follow-up where the event of interest cannot occur because the subject has not yet started the exposure.6 A subject is not literally immortal during this period, but remains event-free until classified as exposed. An incorrect consideration of this unexposed time period in the analysis will lead to immortal time bias.6 If the unexposed follow-up time is misclassified as exposed, patients in the exposed group are inherently given a survival advantage. Consequently, immortal time bias of this type results in spurious protective effects of the exposure. Classic immortal time bias examples in the literature have been found in the Texas7 and Stanford Heart Transplant data,8 with both studies concluding that a heart transplant prolongs survival in those patients on a transplant waiting list. However, data were poorly analyzed because heart transplant was not treated as a time-varying variable. The waiting time of all patients who were alive until they received a transplant was classified as exposed to transplant (instead of unexposed), which gave a survival advantage to the transplanted group. Deaths that occurred while waiting for a transplant were categorized into the nontransplant cohort. By not being correctly classified, the immortal time increased the mortality rate of the nontransplant group, suggesting a benefit of transplant.9 However, the major survival advantage of the intervention disappeared when the follow-up times were properly accounted for.10
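The mechanism can be illustrated with a small numerical sketch. The 6 patients below are invented for illustration only (they are not taken from the Texas or Stanford data), and the function name `death_rates` is hypothetical. The sketch compares crude death rates per person-month when waiting time is wrongly attributed to the transplant group and when it is attributed to the nontransplant group.

```python
# Hypothetical waiting-list patients (invented data):
# (months waited before transplant, or None if never transplanted;
#  total follow-up in months; died during follow-up)
patients = [
    (None, 2.0, True),
    (None, 3.0, True),
    (None, 10.0, False),
    (6.0, 12.0, True),
    (4.0, 20.0, False),
    (8.0, 18.0, True),
]

def death_rates(patients, time_dependent):
    """Return deaths per person-month as (nontransplant, transplant)."""
    deaths = [0, 0]
    person_time = [0.0, 0.0]
    for wait, total, died in patients:
        if wait is None:
            person_time[0] += total
            deaths[0] += died
        else:
            if time_dependent:
                # Correct: waiting time is unexposed, later time is exposed.
                person_time[0] += wait
                person_time[1] += total - wait
            else:
                # Biased: the whole follow-up is counted as exposed.
                person_time[1] += total
            deaths[1] += died
    return deaths[0] / person_time[0], deaths[1] / person_time[1]

naive = death_rates(patients, time_dependent=False)
correct = death_rates(patients, time_dependent=True)
print("naive rate ratio:  ", naive[1] / naive[0])
print("correct rate ratio:", correct[1] / correct[0])
```

In this toy dataset the misclassified analysis suggests a strongly protective effect of transplant, whereas the rate ratio is close to 1 once the waiting time is assigned to the unexposed group.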

When estimating the effect of time-dependent covariates, the follow-up period has to be divided into subintervals (time until exposure and time from exposure onwards), which means that a subject might have more than one subinterval (ie, more than one row in the dataset, one for each subinterval). Subjects enter the study alive, awaiting the exposure, then are censored when they become exposed, and start a new subinterval of time (new row in the dataset) where the entry time is the censored time from the first subinterval, with a new covariate value indicating postexposure. Using the EPICOR cohort, Bueno et al.11 evaluated the impact on mortality of dual antiplatelet therapy (DAPT) duration in acute coronary syndrome patients, and of the change from DAPT to single antiplatelet therapy (SAPT). DAPT was entered into the model as a time-updated categorical variable (0 meant being on DAPT, 1 meant a change to SAPT). For patients who were always on DAPT, never receiving SAPT, the value of the time-updated variable was 0 throughout their follow-up. Those changing from DAPT to SAPT (usually after 1 year of follow-up) contributed follow-up time to both groups (time exposed to DAPT, and time exposed to SAPT).
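The row-splitting described above can be sketched as follows. This is a minimal illustration and not the code used in the EPICOR analysis: the function name `split_follow_up`, the tuple layout, and the 2 example patients are assumptions made for the example, and the switch time is assumed to be greater than 0.

```python
def split_follow_up(patient_id, end_time, event, switch_time=None):
    """Return one row per subinterval: (id, start, stop, exposed, event).

    exposed = 0 before the switch (eg, on DAPT), 1 afterwards (eg, on SAPT).
    """
    # No switch during follow-up: a single row with the covariate fixed at 0.
    if switch_time is None or switch_time >= end_time:
        return [(patient_id, 0.0, end_time, 0, event)]
    # Switch during follow-up: the first row is censored at the switch,
    # and the second row starts where the first one stopped.
    return [
        (patient_id, 0.0, switch_time, 0, 0),
        (patient_id, switch_time, end_time, 1, event),
    ]

rows = []
# Hypothetical patient A: always on DAPT, censored at 24 months.
rows += split_follow_up("A", end_time=24.0, event=0)
# Hypothetical patient B: switched to SAPT at 12 months, died at 20 months.
rows += split_follow_up("B", end_time=20.0, event=1, switch_time=12.0)

for row in rows:
    print(row)
```

Rows in this (start, stop] layout are what survival software generally expects when a Cox model is fitted with time-updated covariates.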

Two approaches can be used to estimate the impact of time-dependent covariates12: the CPH model, which can accommodate time-dependent variables, and the landmarking approach. The latter involves setting a landmark time point and using the value of the time-dependent covariate at this landmark point as a time-fixed covariate. By using this approach, participants with an event before the landmark time point are excluded from the survival analysis, which starts from the landmark time point onwards, in the subset of participants at risk at that given time.
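A minimal sketch of how a landmark dataset could be built is shown below. The field names, the 6-month landmark, and the 4 patients are hypothetical; the only logic taken from the text is that participants no longer at risk at the landmark are excluded and that the covariate is frozen at its landmark value.

```python
def landmark_dataset(patients, landmark):
    """Keep participants still at risk at the landmark and fix the exposure
    at its value at that time; follow-up is re-timed to start at the landmark."""
    rows = []
    for p in patients:
        if p["end_time"] <= landmark:
            continue  # event or censoring before the landmark: excluded
        start = p["exposure_start"]
        exposed = start is not None and start <= landmark
        rows.append({
            "id": p["id"],
            "time": p["end_time"] - landmark,
            "event": p["event"],
            "exposed": int(exposed),
        })
    return rows

# Hypothetical patients (times in months)
patients = [
    {"id": "A", "end_time": 3.0, "event": 1, "exposure_start": None},
    {"id": "B", "end_time": 24.0, "event": 0, "exposure_start": 2.0},
    {"id": "C", "end_time": 18.0, "event": 1, "exposure_start": 9.0},
    {"id": "D", "end_time": 12.0, "event": 1, "exposure_start": None},
]

landmark_rows = landmark_dataset(patients, landmark=6.0)
for row in landmark_rows:
    print(row)
```

Patient C, who became exposed after the landmark, is analyzed as unexposed, which is the price this approach pays for using a time-fixed covariate.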

ALTERNATIVE MODELS FOR SURVIVAL DATA

The CPH model is the most common approach used for the analysis of time-to-event data. However, this regression model may not be appropriate in some situations, such as when the hazard ratio is not constant over time.13,14 Furthermore, one limitation of the CPH is that the hazard ratio is a relative measure that does not quantify absolute effects or associations. Other approaches may overcome some of the limitations of PH analysis. However, when PH are satisfied, the CPH model is the most statistically powerful method.13

Restricted mean survival time

Restricted mean survival time (RMST) is a measure of average survival from time 0 to a specified time point, and may be estimated as the area under the survival curve up to that point.13,14 Associations are expressed as the difference in RMST between groups at a suitable follow-up time, which is easy to interpret by both clinicians and patients (eg, if the outcome of interest is mortality, the estimate would be loss of life expectancy). In addition, the difference between RMST provides an absolute measure (eg, in a 2-arm RCT, RMST provides the absolute benefit or harm). This approach does not require assumptions about hazards and has the advantage of being valid under any distribution of survival time, or when it is expected for an association to vary over time, such as an intervention with either early or late treatment effects.
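The definition of RMST as an area under the survival curve can be sketched in a few lines. The Kaplan-Meier implementation below is a simplified illustration, the function names are hypothetical, and the 2 small arms are invented data; the truncation time tau is chosen by the analyst.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct event time (event = 1, censored = 0)."""
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    surv = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = removed = 0
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            surv *= 1 - deaths / n_at_risk
            curve.append((t, surv))
        n_at_risk -= removed
    return curve

def rmst(times, events, tau):
    """Area under the Kaplan-Meier step function from 0 to tau."""
    area, last_t, last_s = 0.0, 0.0, 1.0
    for t, s in kaplan_meier(times, events):
        if t >= tau:
            break
        area += last_s * (t - last_t)
        last_t, last_s = t, s
    return area + last_s * (tau - last_t)

# Hypothetical 2-arm example (times in months)
control_rmst = rmst([2, 4, 6, 8], [1, 1, 1, 1], tau=10)
treated_rmst = rmst([4, 6, 8, 10], [1, 1, 1, 0], tau=10)
print("RMST difference (months):", treated_rmst - control_rmst)
```

The difference between the 2 areas is the absolute measure described above: the average number of event-free months gained (or lost) over the first tau months.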

RMST analysis captures the entire survival history, does not change with extended follow-up time, and is routinely associated with a clinically meaningful time point.15 In HF RCTs, RMST seems to add value to traditional PH analyses by providing clinically relevant estimates of treatment effects, in line with the findings yielded by other statistical methods.16

Accelerated failure time

This approach is known as the accelerated failure time model because the term “failure” indicates the death or event, while the term “accelerated” indicates the factor by which the rate of failure is increased. That factor is called the “acceleration factor”.17 Instead of the hazard ratio, the key measure of the association between the study variable and survival time is the acceleration factor, which is a ratio of survival times. Similar to the CPH model, the accelerated failure time model describes the relationship between survival probabilities and a set of covariates, estimating a relative (not an absolute) association. The accelerated failure time model provides an estimate of the ratio of the median event times, which can be translated to clinicians as the expected reduction in the duration of illness with treatment.17
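For readers who prefer a formula, a common way of writing the model for a single binary covariate is sketched below. The symbols are assumptions introduced here (x = 1 for treated and 0 for control, ε is an error term whose distribution defines the specific model, and γ denotes the acceleration factor).

```latex
% Log-linear form of the accelerated failure time model
\log T = \beta_0 + \beta_1 x + \sigma \varepsilon

% The covariate stretches or shrinks the time scale by the acceleration factor \gamma
S(t \mid x = 1) = S\!\left(t / \gamma \mid x = 0\right), \qquad \gamma = \exp(\beta_1)

% Consequently, the ratio of median event times equals the acceleration factor
\frac{t_{0.5}(x = 1)}{t_{0.5}(x = 0)} = \gamma
```

Under this parameterization, γ greater than 1 means that event times are longer in the treated group, and γ less than 1 means that they are shorter.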

MULTIPLICITY OF OUTCOMES IN SURVIVAL ANALYSIS

Clinical studies may evaluate multiple outcomes to maximize the information they provide. In the field of cardiovascular research, the outcomes of interest might include stroke, HF, MI, sudden death, cardiovascular death, or all-cause death. To avoid inflation of the type I error rate by testing each outcome separately, a potential solution is to use a composite endpoint by including all the outcomes based on the time-to-first-event principle. Composite outcomes have several advantages,18 such as accounting for both fatal and nonfatal events, and hence leading to higher event rates and power (thus requiring smaller sample sizes or shorter follow-ups).19 Nevertheless, they also have some weaknesses,20 such as the underlying assumption that each individual outcome involved in a composite is of similar importance to patients.21 It is also common to have higher event rates and larger treatment effects associated with less important components.22 Hence, the use of composite outcomes is not always optimal. There are some situations that require more sophisticated statistical approaches than simply using a composite outcome on a time-to-first-event basis, such as: a) the use of a competing risk assessment in the evaluation of nonfatal events, where the occurrence of fatal events can bias the findings; b) the use of multistate models to take into account intermediate states (eg, an HF hospitalization is common before an HF-related death)23; c) the use of recurrent-event methods to fully capture the burden of chronic diseases, which may involve several hospitalizations over the follow-up period; and d) the win ratio approach to provide a hierarchical assessment of the individual components of a composite outcome.

COMPETING RISKS

The censoring assumption

Uninformative or independent censoring is assumed for the most popular approaches in survival analyses: those who are censored have the same hazard of the event of interest as those who are not censored.24,25 In other words, the uncensored individuals who remain under follow-up should be representative of the survival experience in the censored individuals. However, if censoring occurs due to another known event taking place, the assumption of uninformative censoring is violated. Competing risks occur when the event of interest is a particular cause of failure (eg, cardiovascular death), which can take place alongside other causes of failure (eg, noncardiovascular death due to cancer). The competing risk may prevent the event of interest from taking place: a person who dies of cancer is no longer at risk of cardiovascular death (figure 1).

Figure 1.

Graphical representation of the competing risks model. A: a classic competing risk challenge with 2 fatal outcomes. B: a more challenging and realistic situation, where several risks are competing in patients with a severe disease.


Competing risk bias: impact on the cumulative incidence of events

A competing risk bias happens when censoring is informative due to multiple causes of failure. This bias has been reported in almost half of Kaplan-Meier analyses published in medical journals.26 If we estimate the probability of surviving free of sudden death in patients with HF with reduced ejection fraction and censor the other causes of death (eg, HF-related death), the cumulative incidence of events over time (which is 1 minus the survival probability) will overestimate the probability of sudden death.2 Indeed, by using the Kaplan-Meier estimator and censoring the other causes of death, we assume that those censored due to an HF-related death have the same future hazard of sudden death as those who have not yet had any event. Since those who have already died from other causes can never experience sudden death, this can never be true. By assuming that those already dead from other causes are still at risk of sudden death, and that they can be represented by those not yet experiencing any event, the Kaplan-Meier approach overestimates the probability of failure, and therefore underestimates the probability of surviving at a given time. Another classic example, for patients with implantable cardioverter-defibrillators, can be found elsewhere.27

Addressing statistical analysis in competing risks

Two different hazard regression models are available in scenarios where competing risks are present28,29: modelling the cause-specific hazard, or the subdistribution hazard function.

A) The cause-specific hazard function (use of cumulative incidence function [CIF]). The CIF estimates the incidence of an event of interest while allowing for a competing risk. Individuals experiencing the competing event are no longer considered at risk of the event of interest. In the simplest case, when there is only 1 event of interest and no competing risks, the CIF equals 1 minus the Kaplan-Meier estimate. The CIF combines the probability of experiencing the event of interest at a given time, conditional on not having experienced either event (primary or competing) until that time, with the probability of remaining free of both events up to that time. The sum of the CIF estimates for each outcome individually equals the CIF estimate of the composite outcome consisting of all competing events. Unlike 1 minus the survival function in the absence of competing risks, the CIF of the event of interest will not necessarily approach unity with time, because of the occurrence of competing events that preclude the occurrence of the event of interest.28 The cause-specific hazard can be interpreted as the instantaneous rate of the primary event in those participants who are currently event free.
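The properties of the CIF listed above can be checked numerically. The sketch below uses a simplified nonparametric (Aalen-Johansen type) estimator written for illustration; the function name and the 8 patients are hypothetical, with causes coded as 0 = censored, 1 = sudden death, and 2 = HF-related death.

```python
def cumulative_incidence(times, causes, cause_of_interest):
    """Nonparametric CIF at the end of follow-up (0 = censored in `causes`)."""
    data = sorted(zip(times, causes))
    n_at_risk = len(data)
    event_free = 1.0  # probability of being free of ANY event just before t
    cif = 0.0
    i = 0
    while i < len(data):
        t = data[i][0]
        d_interest = d_any = removed = 0
        while i < len(data) and data[i][0] == t:
            cause = data[i][1]
            d_any += cause != 0
            d_interest += cause == cause_of_interest
            removed += 1
            i += 1
        cif += event_free * d_interest / n_at_risk
        event_free *= 1 - d_any / n_at_risk
        n_at_risk -= removed
    return cif

# Hypothetical data: 0 = censored, 1 = sudden death, 2 = HF-related death
times = [1, 2, 3, 4, 5, 6, 7, 8]
causes = [2, 1, 2, 1, 0, 2, 1, 0]

cif_sudden = cumulative_incidence(times, causes, 1)
cif_hf = cumulative_incidence(times, causes, 2)

# Composite outcome: any death counts as the event.
cif_any = cumulative_incidence(times, [int(c != 0) for c in causes], 1)

# Naive approach: competing deaths are censored, which gives 1 minus Kaplan-Meier.
naive_sudden = cumulative_incidence(times, [int(c == 1) for c in causes], 1)

print("CIF sudden death:      ", cif_sudden)
print("1 - KM (naive):        ", naive_sudden)
print("CIF sum vs composite:  ", cif_sudden + cif_hf, cif_any)
```

With no competing events the same function reduces to 1 minus the Kaplan-Meier estimate, which is why the naive estimate can be obtained simply by recoding the competing deaths as censored.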

B) Fine and Gray model (use of subdistribution hazard function). Fine and Gray modified the CPH model to allow for the presence of competing risks.30 The subdistribution hazard function for a given type of event is defined as the instantaneous rate of occurrence of the given type of event in participants who have not yet experienced an event of that type. Hence, in this model, we are considering the rate of the event in those participants who are either currently event-free or who have previously experienced a competing event (although it feels unnatural to keep dead participants at risk for other events). This differs from the risk set for the cause-specific hazard function, which only includes those who are currently event free. In this way, there is a subdistribution hazard function for each outcome (eg, one for sudden death, and another for HF-related death).
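The difference between the 2 risk sets can be made explicit with the usual definitions. The notation is an assumption introduced here: T is the time to the first event of any type, D is the type of that event, k is the event of interest, and S(u) is the probability of being free of any event at time u.

```latex
% Cause-specific hazard: risk set = participants free of ANY event at time t
\lambda^{cs}_{k}(t) = \lim_{\Delta t \to 0}
  \frac{P(t \le T < t + \Delta t,\; D = k \mid T \ge t)}{\Delta t}

% Subdistribution hazard: risk set also keeps those who already had a competing event
\lambda^{sd}_{k}(t) = \lim_{\Delta t \to 0}
  \frac{P\big(t \le T < t + \Delta t,\; D = k \mid T \ge t \,\cup\, (T < t,\; D \ne k)\big)}{\Delta t}

% Link with the cumulative incidence function of event k
\mathrm{CIF}_{k}(t) = \int_{0}^{t} S(u)\,\lambda^{cs}_{k}(u)\,du
  = 1 - \exp\!\left(-\int_{0}^{t} \lambda^{sd}_{k}(u)\,du\right)
```

The last line shows why covariate effects on the subdistribution hazard translate directly into effects on the CIF, whereas the CIF depends on the cause-specific hazards of all event types through S(u).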

The cause-specific hazard model estimates the impact of covariates on the cause-specific hazard function, while the Fine-Gray subdistribution hazard model estimates the impact of covariates on the subdistribution hazard function.27 Because the cause-specific hazard model relies on participants actually at risk (event-free participants), hazard ratios from this model should be interpreted among individuals who have not yet experienced the event of interest or the competing event, and therefore this approach is better suited to answering etiological research questions. In contrast, by keeping at risk those individuals who have experienced the competing risk, the subdistribution hazard model may be of greater interest if the focus is on the overall impact of covariates on the incidence of the event of interest, and is better suited to risk prediction and the development of risk-scoring systems.31

MULTISTATE MODELS

Definition of absorbing and nonabsorbing events

An absorbing event prevents the outcome of interest from subsequently taking place (eg, a cardiovascular death prevents a cancer death). Sometimes, there is an intermediate event, which may occur before the absorbing event, known as a nonabsorbing event. These intermediate events are of particular interest when their occurrence substantially changes the likelihood of the outcome of interest happening, and hence, may provide more detailed information on the natural history of the disease. The intermediate event can be interpreted as a deterioration or improvement step in the disease process. This step was illustrated by Solomon et al.,23 who assessed the influence of nonfatal hospitalizations for HF on subsequent mortality in patients with chronic HF. In contrast to the relatively stable mortality risk observed over time in patients with HF from the CHARM (Candesartan in Heart failure: Assessment of Reduction in Mortality and morbidity) program, these authors found a higher likelihood of dying in the immediate postdischarge period of an HF hospitalization, which was directly associated with the duration and frequency of HF hospitalizations.23 Having an HF hospitalization (nonabsorbing event) changed the hazard of the outcome of interest (mortality).

Nonabsorbing events can be modelled using multistate models,32 in which the focus is on the change of status over time (eg, change from baseline status to HF hospitalization, and from there to cardiovascular death).33

Multistate models: an extension of competing risks models

Multistate models provide a framework that allows analysis of the natural history of a disease. These models are an extension of competing risks models (multistate model with 1 initial state and several mutually exclusive absorbing states), since they extend the analysis to what happens after the intermediate event.34 This review will consider only continuous time models allowing changes of state at any time. These models are more realistic and can be seen as an extension of the standard survival model, as they describe how an individual moves between a series of discrete states in continuous time.

Multistate models are appropriate when a disease involves transitions between several well-defined distinct states. A 2-state survival model is defined by a living state and a dead state. The 2 main features of the standard survival model are: a) there is 1 event of interest (the transition from alive to dead), which is unidirectional; and b) the timing of this event may be right-censored, in which case it is known that the event has not happened yet. A Kaplan-Meier curve can be thought of as a simple multistate model with 2 states, and 1 transition between those 2 states. The situation becomes more complex when nonabsorbing events are included in the model. In the HF setting, HF hospitalization can be defined as a transient event (nonabsorbing event). A 3-state survival model would be defined by an HF-free state, an HF state, and a dead state. This sets 3 transitions: death from state 1 (HF-free state), death from state 2 (HF state), and transition from the HF-free state to the HF state through an HF hospitalization (figure 2). The hazard rates defining movement from one state to another are defined as transition intensities, the instantaneous risk of moving from one state to another at a given time. These transition intensities are equivalent to the cause-specific hazards described for the competing risks approach, this situation being a particular case of multistate models. There are as many hazards to model as there are transitions.

Figure 2.

Graphical representation of the illness-death model in a heart failure example. λ, transition intensity function (eg, λ12 is the transition intensity function from state 1 to state 2). HF, heart failure.


To run a multistate model, a counting process data structure is used to frame the data. Hence, each time there is a transition, another row of information for that individual is needed in the dataset. In contrast, in a traditional time-to-first event survival analysis, there is only 1 row of information per patient in the dataset, including the status and the survival time (time-to-event or time to censoring).
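A possible layout for such a dataset in the 3-state HF example is sketched below, using one row per transition for which the patient is at risk. The function names, the tuple layout, and the 3 patients are hypothetical, and the crude intensities at the end assume, purely for illustration, that each transition intensity is constant over time (states: 1 = HF-free, 2 = HF, 3 = dead).

```python
from collections import defaultdict

def transition_rows(patient_id, hf_time, end_time, died):
    """Rows (id, from_state, to_state, start, stop, status) for one patient."""
    if hf_time is None:
        # Stayed HF-free: at risk of 1->2 and 1->3 for the whole follow-up.
        return [
            (patient_id, 1, 2, 0.0, end_time, 0),
            (patient_id, 1, 3, 0.0, end_time, int(died)),
        ]
    # Hospitalized for HF at hf_time: leaves state 1, then at risk of 2->3.
    return [
        (patient_id, 1, 2, 0.0, hf_time, 1),
        (patient_id, 1, 3, 0.0, hf_time, 0),
        (patient_id, 2, 3, hf_time, end_time, int(died)),
    ]

def crude_intensities(rows):
    """Transitions per unit of person-time at risk (constant-intensity sketch)."""
    events = defaultdict(int)
    person_time = defaultdict(float)
    for _, origin, destination, start, stop, status in rows:
        events[(origin, destination)] += status
        person_time[(origin, destination)] += stop - start
    return {key: events[key] / person_time[key] for key in person_time}

# Hypothetical patients (times in months)
rows = []
rows += transition_rows("P1", hf_time=None, end_time=10.0, died=False)
rows += transition_rows("P2", hf_time=4.0, end_time=9.0, died=True)
rows += transition_rows("P3", hf_time=None, end_time=6.0, died=True)

intensities = crude_intensities(rows)
for transition, value in sorted(intensities.items()):
    print(transition, value)
```

In practice each transition would be modelled with its own hazard (for example, a Cox model per transition) rather than a single crude rate.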

Recurrences of nonfatal HF or MI are nonabsorbing events that occur over time. Although they can be taken into account in standard survival models by including the nonabsorbing event as a binary time-dependent covariate for the risk of death, the best approach to tackle these patients’ transitions between states is multistate modelling.35 To improve understanding of prognosis, a comprehensive model should include both death and nonfatal clinical events. CPH models are not strictly appropriate since observations are not independent. Multistate models overcome this limitation by separately assessing time-to-death and time-to-disease-related hospitalizations.36

Multistate models in cardiology

Several studies have used multistate models in the field of cardiovascular research.37 Beyond the classic example of chronic HF,36 we can find other examples in patients with multivessel coronary disease. Using data from the BARI trial (NCT00000462), Zhang et al.38 fitted a multistate model, where the initial state was patients after randomization and before intervention, the intermediate state was nonfatal MI, and the final state was death. Standard survival analyses with Cox regression and Kaplan-Meier estimation for both mortality and the composite outcome of death or nonfatal MI showed no differences between coronary artery bypass grafting and percutaneous coronary angioplasty after a 10-year follow-up. Of note, this approach did not take into account the intermediate state (nonfatal MI). In contrast, multistate modelling broke the process into 3 transitions, and found significant differences in outcomes favoring coronary artery bypass grafting for patients in a transition path of nonfatal MI to death, whereas for patients without MI, there was no difference in terms of survival between patients who underwent coronary artery bypass grafting and those who underwent percutaneous coronary angioplasty. This study illustrates that the use of composite outcomes may not capture as much prognostic information as the use of a multistate model.

RECURRENT-EVENT METHODS

Definition of recurrent event

Most studies evaluate time-to-first-event endpoints, so that all subsequent events of the same type occurring after the first one are ignored in the analysis. In HF, this problem becomes even more important as “time-to-first” event analyses do not fully reflect the true burden of the disease39: in patients admitted to Spanish emergency departments due to acute HF, 24% revisited the emergency department within 30 days, and 16% were rehospitalized in the same follow-up period.40 Of note, each subsequent HF hospitalization heralds a substantial worsening of the long-term prognosis.41 In contrast to conventional methods, the use of recurrent-event methods may capture the burden of disease (figure 3).

Figure 3.

Graphical representation of the recurrent-risks model. Several situations in the context of recurrent events are illustrated: patient A is hospitalized 4 times during follow-up (only the first event would be used in conventional survival analyses), patient B is hospitalized twice before dying (only the first hospitalization would be taken into account using traditional methods), and patient C dies during follow-up without any previous hospitalizations (using a composite endpoint, information about the disease burden would be lost in this situation).


Recurrent-event methods have been generally assumed to improve statistical precision and provide greater statistical power than more conventional time-to-first methods. For instance, in the CHARM-Preserved trial,42 a borderline result for time-to-first composite event analysis was achieved. However, post hoc analyses with a recurrent-event method led to a gain in statistical power and showed significant evidence of efficacy.19

Statistical analysis in recurrent-event methods

There are several statistical methods addressing the issue of recurrent events, but there is some controversy as to which of them is the most appropriate. There are 2 main approaches: through counts (or event rates), and through times between subsequent events. Noninformative censoring is assumed in both cases.

For the first approach, based on measuring the number of events (eg, number of hospitalizations due to worsening HF), there are 2 main methods.43 The Poisson distribution is the most popular count model and can be used to determine if event rates differ between groups, whereas the negative binomial distribution is an alternative approach that allows for different individual tendencies (frailties). The latter has been retrospectively used to evaluate recurrent hospitalizations in the EMPHASIS-HF trial.44 In the TOPCAT (NCT00094302) trial, the prespecified Poisson regression model was eventually replaced with a negative binomial model to allow for correlated events.45
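The count-based approach can be illustrated with a small sketch. The hospitalization counts below are invented, every patient is assumed to have the same 2 years of follow-up, and the function names are hypothetical. The dispersion index (sample variance divided by mean) is used here only as a rough check: under a Poisson model it is expected to be close to 1, and clearly larger values point to between-patient heterogeneity, which is what the negative binomial model allows for.

```python
from statistics import mean, variance

def event_rate(counts, follow_up_years):
    """Events per patient-year: total events divided by total follow-up."""
    return sum(counts) / sum(follow_up_years)

def dispersion_index(counts):
    """Sample variance divided by the mean of the per-patient counts."""
    return variance(counts) / mean(counts)

# Hypothetical numbers of HF hospitalizations per patient
control_counts = [0, 0, 1, 0, 5, 0, 2, 0]
treated_counts = [0, 1, 0, 0, 2, 0, 1, 0]
follow_up = [2.0] * 8  # years, identical for every patient (assumption)

rate_control = event_rate(control_counts, follow_up)
rate_treated = event_rate(treated_counts, follow_up)
rate_ratio = rate_treated / rate_control

print("rate ratio (treated vs control):", rate_ratio)
print("dispersion index, control arm:  ", dispersion_index(control_counts))
```

In the control arm a single patient accounts for most of the events, so the variance of the counts is well above their mean, the pattern that would motivate replacing a Poisson model with a negative binomial one.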

There are many methods for the second approach, based on the time-to-event principle. The Andersen-Gill approach is an extension of the CPH model, which analyses recurrent events as gap times (ie, the times between consecutive events).46 The Lin-Wei-Yang-Ying method is a modified Andersen-Gill model, with a robust variance estimator to account for the correlation between events, which is useful when covariates are considered time dependent. This approach was used for the primary outcome of total (first and recurrent) HF hospitalizations and cardiovascular deaths in the PARAGON-HF trial.47 The Ghosh and Lin method offers a nonparametric estimate of the cumulative number of recurrent events through time, which incorporates death as a competing risk.48

Final remarks about recurrent-event methods

Recurrent-event models seem to improve statistical precision and to provide greater statistical power than time-to-first-event approaches.49 However, the relative width of the 95% confidence intervals associated with recurrent-event analyses can sometimes be greater than that from time-to-first-event analyses, suggesting a loss of precision.19 Using trial-based data (CHARM, TOPCAT, PARADIGM-HF [NCT01035255]), Claggett et al. found that the increasing heterogeneity of patient risk, a parameter not included in conventional power and sample size formulae, might explain the differences between time-to-first and recurrent-event analyses in terms of treatment effect estimation, precision, and statistical power.49 In that study, they concluded that the greatest statistical gains from using recurrent-event methods occur in the presence of high patient heterogeneity and low rates of study drug discontinuation.49

THE WIN RATIO METHOD

The win ratio was introduced in 2012 by Pocock et al.50 as a new method for examining composite endpoints, and it is becoming increasingly popular in cardiovascular RCTs.51,52 Unlike traditional methods evaluating composite endpoints, the win ratio accounts for the relative priorities of their components, and even allows different types of components. For example, the win ratio can combine the time to death with the number of occurrences of a nonfatal outcome such as cardiovascular-related hospitalizations (CVHs) in a single hierarchical composite endpoint. It can also include quantitative outcomes such as quality-of-life scores.

Based on the principle of the Finkelstein-Schoenfeld test, the win ratio approach provides an estimate of the treatment effect (the win ratio) and confidence interval, in addition to a P value.53 In a simple 2-arm RCT, the application of the win ratio can be summarized as: a) forming every possible patient-to-patient pair (each patient in the treatment arm is compared with each patient in the control arm); and b) within each pair, evaluating the component outcomes in descending order of importance until one of the pair shows a better outcome than the other. If the patient on the treatment has the better outcome, it is called a “win”; if the control patient does better, it is called a “loss”; and if neither patient does better on any component, it is a “tie”. The win ratio is the total number of wins divided by the total number of losses.

This approach was used in the ATTR-ACT trial,52 a double-blind trial that randomized 441 patients with transthyretin amyloid cardiomyopathy to tafamidis (80 mg or 20 mg) or matching placebo for 30 months. In the primary analysis, the investigators hierarchically assessed all-cause mortality, followed by the frequency of CVHs, using the Finkelstein-Schoenfeld53 and win ratio50 methods. For each pair, they determined whether the patient receiving tafamidis “won” or “lost” compared with the patient receiving placebo. The hierarchical assessment determined: a) who died first (the “loser”); and then, b) if neither died, who had the most CVHs (again the “loser”), both being assessed over their shared follow-up time. Summing over all pairs gave a total of 8595 wins and 5071 losses. Hence, the win ratio was 8595/5071 = 1.70, with a 95% confidence interval of 1.26-2.29 and P = .0006.54 A traditional analysis of a composite outcome of first CVH or death would have ignored repeat CVHs after the first CVH, as well as any death occurring after a CVH. By evaluating each component of a composite outcome hierarchically, the win ratio can provide greater statistical power to detect treatment differences.
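The arithmetic behind the reported estimate can be written out as a short worked equation. The symbols \(N_{\text{wins}}\) and \(N_{\text{losses}}\) are introduced here only for illustration. The second line is a plausibility check that assumes the confidence interval was built on the log scale, which is common for ratio measures but is not stated in the text above.

```latex
\[
\mathrm{WR} \;=\; \frac{N_{\text{wins}}}{N_{\text{losses}}}
            \;=\; \frac{8595}{5071} \;\approx\; 1.70
\]
\[
\sqrt{1.26 \times 2.29} \;\approx\; 1.70
\quad \text{(geometric mean of the confidence limits)}
\]
\[
\frac{N_{\text{wins}}}{N_{\text{wins}} + N_{\text{losses}}}
  \;=\; \frac{8595}{13\,666} \;\approx\; 0.63
\]
```

Read this way, the tafamidis patient had the better outcome in about 63% of the pairs that could be decided; tied pairs are not counted in these figures.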

OTHER APPROACHES: LOGISTIC AND POISSON REGRESSION MODELLING

In addition to CPH models, survival data are often evaluated using logistic and Poisson regression models.55,56 The choice between these models depends on the study design and the nature of the research question.57 Table 1 summarizes the main differences between these models in the setting of survival analysis.

Table 1.

Summary of differences between the Cox proportional hazards, Poisson and logistic regression models

	Cox proportional hazards model	“Interval” Poisson regression model	Logistic regression model
Research question	How long before the event occurs in a defined time endpoint?	How many times does the event occur in a defined time endpoint?	Does a subject reach the event in a defined timeframe?
Modelling	Models survival times	Models the rate at which the event occurs independently over time	Models whether an event occurs or not
Use of survival time	Analysis of individual survival times	Analysis of aggregated patient mortality rates	Analysis only of events, without taking into account when they happen. Does not use survival times
Outcome type	Time-to-event	Event-count data	Dichotomous event data
Association type	Hazard ratio	Incidence rate ratio	Odds ratio
Main assumption	The hazard functions (death rates) are proportional between groups	The event rate (or rate ratio) remains constant within each specified time interval, or the rates are proportional to one another	Does not require the dependent and independent variables to be related linearly, but requires the independent variables to be linearly related to the log odds
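To make the “Association type” row more concrete, the sketch below computes a crude incidence rate ratio and a crude odds ratio from the same toy dataset. The data and the helper names `incidence_rate` and `odds` are invented for illustration. These are unadjusted estimates rather than fitted Poisson or logistic regression models, and the hazard ratio is not shown because it requires fitting a Cox model with a dedicated survival library.

```python
from typing import List, Tuple

# Each tuple is (follow-up in years, event indicator: 1 = event, 0 = censored).
# Invented values, for illustration only.
Arm = List[Tuple[float, int]]
treated: Arm = [(2.0, 0), (1.5, 1), (2.0, 0), (0.5, 1)]
control: Arm = [(1.0, 1), (0.5, 1), (2.0, 0), (0.5, 1)]


def incidence_rate(arm: Arm) -> float:
    """Events per person-year: uses both event counts and follow-up time."""
    events = sum(event for _, event in arm)
    person_years = sum(time for time, _ in arm)
    return events / person_years


def odds(arm: Arm) -> float:
    """Odds of the event: ignores when events happen and how long patients were followed."""
    events = sum(event for _, event in arm)
    non_events = len(arm) - events
    return events / non_events


# Poisson-type summary: ratio of event rates per unit of person-time.
incidence_rate_ratio = incidence_rate(treated) / incidence_rate(control)

# Logistic-type summary: ratio of odds of ever having the event.
odds_ratio = odds(treated) / odds(control)

print(f"Incidence rate ratio: {incidence_rate_ratio:.2f}")
print(f"Odds ratio: {odds_ratio:.2f}")
```

The two summaries differ even though they come from the same patients, because only the rate-based measure takes the amount of follow-up into account.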

CONCLUSIONS

In this second educational review, we have focused on stratified CPH models, frailty models, and time-dependent variables. Competing risks, multistate models, recurrent-event methods, and the win ratio approach have been presented to tackle the issue of multiplicity of outcomes when the use of a composite outcome and a time-to-first-event analysis might not be optimal. The use of the restrictive mean survival time and accelerated failure time model approaches has also been illustrated. Adequately modelling survival data is not a straightforward exercise. This review has offered practical advice on what should be considered before choosing the most appropriate model for survival data, as well as some guidance on interpreting the findings yielded by more complex statistical approaches.

FUNDING

None declared.

AUTHORS’ CONTRIBUTIONS

X. Rossello and M. González-Del-Hoyo conceived the review. X. Rossello led the writing process, and M. González-Del-Hoyo was in charge of finding most of the examples to illustrate the methods. X. Rossello drafted the article, although both authors contributed substantially to its revision.

CONFLICTS OF INTEREST

None declared.

References

[1]

X. Rossello, M. González-Del-Hoyo.

Survival analyses in cardiovascular research, part I: the essentials.

Rev Esp Cardiol., (2021),

http://dx.doi.org/10.1016/j.rec.2021.06.003

[2]

X. Rossello, C. Ariti, S.J. Pocock, et al.

Impact of mineralocorticoid receptor antagonists on the risk of sudden cardiac death in patients with heart failure and left-ventricular systolic dysfunction: an individual patient-level meta-analysis of three randomized-controlled trials.

Clin Res Cardiol., (2019), 108 pp. 477-486

[3]

X. Rossello, J.P. Ferreira, F. Caimari, et al.

Influence of sex, age and race on coronary and heart failure events in patients with diabetes and post-acute coronary syndrome.

Clin Res Cardiol., (2021),

[4]

O.O. Aalen.

Effects of frailty in survival analysis.

Stat Methods Med Res., (1994), 3 pp. 227-243

http://dx.doi.org/10.1177/096228029400300303 | Medline

[5]

H. Bueno, S. Pocock, J. Medina, et al.

Association between clinical pathways leading to medical management and prognosis in patients with NSTEACS.

Rev Esp Cardiol., (2017), 70 pp. 817-824

http://dx.doi.org/10.1016/j.rec.2016.12.031 | Medline

[6]

S. Suissa.

Immortal time bias in pharmacoepidemiology.

Am J Epidemiol., (2008), 167 pp. 492-499

[7]

B. Messmer, R. Leachman, J. Nora, D. Cooley.

Survival-times after cardiac allografts.

Lancet., (1969), 293 pp. 954-956

[8]

D.A. Clark, E.B. Stinson, R.B. Griepp, et al.

Cardiac transplantation in man VI. Prognosis of patients selected for cardiac transplantation.

Ann Intern Med., (1971), 75 pp. 15-21

http://dx.doi.org/10.7326/0003-4819-75-1-15 | Medline

[9]

M.H. Gail.

Does cardiac transplantation prolong life? A reassessment.

Ann Intern Med., (1972), 76 pp. 815-817

http://dx.doi.org/10.7326/0003-4819-76-5-815 | Medline

[10]

N. Mantel, D.P. Byar.

Evaluation of Response-Time Data Involving Transient States: An Illustration Using Heart-Transplant Data.

J Am Stat Assoc., (1974), 69 pp. 81-86

[11]

H. Bueno, S. Pocock, N. Danchin, et al.

International patterns of dual antiplatelet therapy duration after acute coronary syndromes.

Heart., (2017), 103 pp. 132-138

http://dx.doi.org/10.1136/heartjnl-2016-309509 | Medline

[12]

H. Putter, H.C. van Houwelingen.

Understanding Landmarking and Its Relation with Time-Dependent Cox Regression.

Stat Biosci., (2017), 9 pp. 489-503

http://dx.doi.org/10.1007/s12561-016-9157-9 | Medline

[13]

J. Gregson, L. Sharples, G.W. Stone, et al.

Nonproportional Hazards for Time-to-Event Outcomes in Clinical Trials: JACC Review Topic of the Week.

J Am Coll Cardiol., (2019), 74 pp. 2102-2112

http://dx.doi.org/10.1016/j.jacc.2019.08.1034 | Medline

[14]

Z.R. McCaw, G. Yin, L.-J. Wei.

Using the Restricted Mean Survival Time Difference as an Alternative to the Hazard Ratio for Analyzing Clinical Cardiovascular Studies.

Circulation., (2019), 140 pp. 1366-1368

http://dx.doi.org/10.1161/CIRCULATIONAHA.119.040680 | Medline

[15]

P. Royston, M.K.B. Parmar.

Restricted mean survival time: an alternative to the hazard ratio for the design and analysis of randomized trials with a time-to-event outcome.

BMC Med Res Methodol., (2013), 13 pp. 152

http://dx.doi.org/10.1186/1471-2288-13-152 | Medline

[16]

C. Perego, M. Sbolli, C. Specchia, et al.

Utility of Restricted Mean Survival Time Analysis for Heart Failure Clinical Trial Evaluation and Interpretation.

JACC Heart Fail., (2020), 8 pp. 973-983

http://dx.doi.org/10.1016/j.jchf.2020.07.005 | Medline

[17]

K. Patel, R. Kay, L. Rowell.

Comparing proportional hazards and accelerated failure time models: an application in influenza.

Pharm Stat., (2006), 5 pp. 213-224

http://dx.doi.org/10.1002/pst.213 | Medline

[18]

N.M. Heddle, R.J. Cook.

Composite outcomes in clinical trials: What are they and when should they be used?

Transfusion., (2011), 51 pp. 11-13

http://dx.doi.org/10.1111/j.1537-2995.2010.02930.x | Medline

[19]

J.K. Rogers, S.J. Pocock, J.J.V. McMurray, et al.

Analysing recurrent hospitalizations in heart failure: A review of statistical methodology, with application to CHARM-preserved.

Eur J Heart Fail., (2014), 16 pp. 33-40

http://dx.doi.org/10.1002/ejhf.29 | Medline

[20]

I. Ferreira-González, J.W. Busse, D. Heels-Ansdell, et al.

Problems with use of composite end points in cardiovascular trials: systematic review of randomised controlled trials.

BMJ., (2007), 334 pp. 786

http://dx.doi.org/10.1136/bmj.39136.682083.AE | Medline

[21]

J.M. Stolker, J.A. Spertus, D.J. Cohen, et al.

Rethinking composite end points in clinical trials insights from patients and trialists.

Circulation., (2014), 130 pp. 1254-1261

http://dx.doi.org/10.1161/CIRCULATIONAHA.113.006588 | Medline

[22]

I. Ferreira-González, G. Permanyer-Miralda, J.W. Busse, et al.

Methodologic discussions for using and interpreting composite endpoints are limited, but still identify major concerns.

J Clin Epidemiol., (2007), 60 pp. 651-657

http://dx.doi.org/10.1016/j.jclinepi.2006.10.020 | Medline

[23]

S.D. Solomon, J. Dobson, S. Pocock, et al.

Influence of Nonfatal Hospitalization for Heart Failure on Subsequent Mortality in Patients With Chronic Heart Failure.

Circulation., (2007), 116 pp. 1482-1487

http://dx.doi.org/10.1161/CIRCULATIONAHA.107.696906 | Medline

[24]

J.M. Bland, D.G. Altman.

Survival probabilities (the Kaplan-Meier method).

BMJ., (1998), 317 pp. 1572

[25]

S.R. Rao, D.A. Schoenfeld.

Survival Methods.

Circulation., (2007), 115 pp. 109-113

http://dx.doi.org/10.1161/CIRCULATIONAHA.106.614859 | Medline

[26]

C. Van Walraven, F.A. McAlister.

Competing risk bias was common in Kaplan-Meier risk estimates published in prominent medical journals.

J Clin Epidemiol., (2016), 69 pp. 170-173

http://dx.doi.org/10.1016/j.jclinepi.2015.07.006 | Medline

[27]

M. Wolbers, M.T. Koller, V.S. Stel, et al.

Competing risks analyses: objectives and approaches.

Eur Heart J., (2014), pp. 2936-2941

[28]

M. Pintilie.

An introduction to competing risks analysis.

Rev Esp Cardiol., (2011), 64 pp. 599-605

http://dx.doi.org/10.1016/j.recesp.2011.03.017 | Medline

[29]

A. Latouche, A. Allignol, J. Beyersmann, et al.

A competing risks analysis should report results on all cause-specific hazards and cumulative incidence functions.

J Clin Epidemiol., (2013), 66 pp. 648-653

http://dx.doi.org/10.1016/j.jclinepi.2012.09.017 | Medline

[30]

J.P. Fine, R.J. Gray.

A Proportional Hazards Model for the Subdistribution of a Competing Risk.

J Am Stat Assoc., (1999), 94 pp. 496-509

[31]

P.C. Austin, J.P. Fine.

Practical recommendations for reporting Fine-Gray model analyses for competing risk data.

Stat Med., (2017), 36 pp. 4391-4400

http://dx.doi.org/10.1002/sim.7501 | Medline

[32]

P.K. Andersen, S.Z. Abildstrom, S. Rosthøj.

Competing risks as a multi-state model.

Stat Methods Med Res., (2002), 11 pp. 203-215

http://dx.doi.org/10.1191/0962280202sm281ra | Medline

[33]

L.F. Meira-Machado, J. de Uña-Álvarez, C. Cadarso-Suárez, P.K. Andersen.

Multi-state models for the analysis of time-to-event data.

Stat Methods Med Res., (2009), 18 pp. 195-222

http://dx.doi.org/10.1177/0962280208092301 | Medline

[34]

H. Putter, M. Fiocco, R.B. Geskus.

Tutorial in biostatistics: Competing risk and multi-state models.

Stat Med., (2007), 26 pp. 2389-2430

http://dx.doi.org/10.1002/sim.2712 | Medline

[35]

J.N. Upshaw, M.A. Konstam, D. Van Klaveren, F. Noubary, G.S. Huggins, D.M. Kent.

Multistate model to predict heart failure hospitalizations and all-cause mortality in outpatients with heart failure with reduced ejection fraction.

Circ Heart Fail., (2016), 9 pp. e003146

[36]

F. Ieva, C.H. Jackson, L.D. Sharples.

Multi-state modelling of repeated hospitalisation and death in patients with heart failure: The use of large administrative databases in clinical epidemiology.

Stat Methods Med Res., (2017), 26 pp. 1350-1372

[37]

F. Gasperoni, F. Ieva, G. Barbati, et al.

Multi-state modelling of heart failure care path: A population-based investigation from Italy.

PLoS One., (2017), 12 pp. e0179176

http://dx.doi.org/10.1371/journal.pone.0179176 | Medline

[38]

X. Zhang, Q. Li, A. Rogatko, et al.

Analysis of the bypass angioplasty revascularization investigation trial using a multistate model of clinical outcomes.

Am J Cardiol., (2015), 115 pp. 1073-1079

http://dx.doi.org/10.1016/j.amjcard.2015.01.543 | Medline

[39]

S.D. Anker, J.J.V. McMurray.

Time to move on from “time-to-first”: Should all events be included in the analysis of clinical trials?

Eur Heart J., (2012), 33 pp. 2764-2765

http://dx.doi.org/10.1093/eurheartj/ehs277 | Medline

[40]

X. Rossello, H. Bueno, V. Gil, et al.

MEESSI-AHF risk score performance to predict multiple post-index event and post-discharge short-term outcomes.

Eur Heart J Acute Cardiovasc Care., (2021), 10 pp. 142-152

[41]

S.D. Solomon, N. Anavekar, H. Skali, et al.

Influence of ejection fraction on cardiovascular outcomes in a broad spectrum of heart failure patients.

Circulation., (2005), 112 pp. 3738-3744

http://dx.doi.org/10.1161/CIRCULATIONAHA.105.561423 | Medline

[42]

S. Yusuf, M.A. Pfeffer, K. Swedberg, et al.

Effects of candesartan in patients with chronic heart failure and preserved left-ventricular ejection fraction: The CHARM-preserved trial.

Lancet., (2003), 362 pp. 777-781

http://dx.doi.org/10.1016/S0140-6736(03)14285-7 | Medline

[43]

R.J. Glynn, J.E. Buring.

Ways of measuring rates of recurrent events.

Br Med J., (1996), 312 pp. 364-367

[44]

J.K. Rogers, J.J.V. McMurray, S.J. Pocock, et al.

Eplerenone in patients with systolic heart failure and mild symptoms: Analysis of repeat hospitalizations.

Circulation., (2012), 126 pp. 2317-2323

[45]

B. Pitt, M.A. Pfeffer, S.F. Assmann, et al.

Spironolactone for Heart Failure with Preserved Ejection Fraction.

N Engl J Med., (2014), 370 pp. 1383-1392

http://dx.doi.org/10.1056/NEJMoa1313731 | Medline

[46]

P.K. Andersen, R.D. Gill.

Cox's Regression Model for Counting Processes: A Large Sample Study.

Ann Stat., (1982), 10 pp. 1100-1120

[47]

S.D. Solomon, J.J.V. McMurray, I.S. Anand, et al.

Angiotensin–Neprilysin Inhibition in Heart Failure with Preserved Ejection Fraction.

N Engl J Med., (2019), 381 pp. 1609-1620

http://dx.doi.org/10.1056/NEJMoa1908655 | Medline

[48]

D. Ghosh, D.Y. Lin.

Nonparametric analysis of recurrent events and death.

Biometrics., (2000), 56 pp. 554-562

http://dx.doi.org/10.1111/j.0006-341x.2000.00554.x | Medline

[49]

B. Claggett, S. Pocock, L.J. Wei, et al.

Comparison of time-to-first event and recurrent-event methods in randomized clinical trials.

Circulation., (2018), 138 pp. 570-577

http://dx.doi.org/10.1161/CIRCULATIONAHA.117.033065 | Medline

[50]

S.J. Pocock, C.A. Ariti, T.J. Collier, D. Wang.

The win ratio: a new approach to the analysis of composite endpoints in clinical trials based on clinical priorities.

Eur Heart J., (2012), 33 pp. 176-182

http://dx.doi.org/10.1093/eurheartj/ehr352 | Medline

[51]

B. Redfors, J. Gregson, A. Crowley, et al.

The win ratio approach for composite endpoints: practical guidance based on previous experience.

Eur Heart J., (2020), 41 pp. 4391-4399

[52]

M.S. Maurer, J.H. Schwartz, B. Gundapaneni, et al.

Tafamidis Treatment for Patients with Transthyretin Amyloid Cardiomyopathy.

N Engl J Med., (2018), 379 pp. 1007-1016

http://dx.doi.org/10.1056/NEJMoa1805689 | Medline

[53]

D.M. Finkelstein, D.A. Schoenfeld.

Combining mortality and longitudinal measures in clinical trials.

Stat Med., (1999), 18 pp. 1341-1354

http://dx.doi.org/10.1002/(sici)1097-0258(19990615)18:11<1341::aid-sim129>3.0.co;2-7 | Medline

[54]

S.J. Pocock, T.J. Collier.

Statistical Appraisal of 6 Recent Clinical Trials in Cardiology: JACC State-of-the-Art Review.

J Am Coll Cardiol., (2019), 73 pp. 2740-2755

http://dx.doi.org/10.1016/j.jacc.2019.03.484 | Medline

[55]

H. Bueno, X. Rossello, S.J. Pocock, et al.

In-Hospital Coronary Revascularization Rates and Post-Discharge Mortality Risk in Non–ST-Segment Elevation Acute Coronary Syndrome.

J Am Coll Cardiol., (2019), 74 pp. 1454-1461

http://dx.doi.org/10.1016/j.jacc.2019.06.068 | Medline

[56]

Ò. Miró, X. Rosselló, V. Gil, et al.

The Usefulness of the MEESSI Score for Risk Stratification of Patients With Acute Heart Failure at the Emergency Department.

Rev Esp Cardiol., (2019), 72 pp. 198-207

http://dx.doi.org/10.1016/j.rec.2018.05.002 | Medline

[57]

D.R. Cox.

Regression Models and Life-Tables.

J R Stat Soc Ser B., (1972), 34 pp. 187-220

REVISTA ESPAÑOLA DE

CARDIOLOGÍA

Focus on: survival analyses in cardiovascular research, a practical guide
Survival analyses in cardiovascular research, part II: statistical methods in challenging situations

Análisis de supervivencia en investigación cardiovascular (II): metodología estadística en situaciones complejas
