Validation of the EuroSCORE Probabilistic Model in Patients Undergoing Coronary Bypass Grafting

doi:10.1016/S1885-5857(08)60182-6

Introduction and objectives. EuroSCORE utilizes a probabilistic model for predicting the risk of in-hospital mortality in patients undergoing cardiac surgery. It is a useful instrument for evaluating quality of care. The model has two variants: the logistic EuroSCORE and the additive EuroSCORE. The aim of this study was to validate the EuroSCORE model in patients undergoing surgery at Hospital Clínic in Barcelona, Spain, and to compare the results obtained with the two variants. Methods. The study included all patients who received a coronary artery bypass graft (CABG) at Hospital Clínic in Barcelona in two consecutive years. The model's validity was assessed on the basis of its calibration (using the Hosmer-Lemeshow test) and its discrimination (using the receiver operating characteristic [ROC] curve). The two models were compared by carrying out a descriptive analysis of mortality for the whole group and for different risk groups, and by determining the models¿ discriminative power. Results. A total of 498 patients underwent CABG surgery and were included in the study. The Hosmer-Lemeshow test showed that the model¿s calibration was satisfactory (P=.32) and the area under the ROC curve was 0.83. The observed in-hospital mortality rate was 5.8%. The predicted rate was 4.2% with the logistic EuroSCORE and 3.9% with the additive EuroSCORE. Large differences were observed in high-risk patients. In these patients, the mortality predicted by the logistic variant was closer to the actual mortality. Conclusions. EuroSCORE¿s validity was found to be satisfactory and the model can be used to evaluate quality of care. In high-risk patients, mortality estimated using the logistic model was closer to the actual mortality.

Keywords

Probabilistic model

Quality evaluation

Inhospital mortality

Coronary artery bypass grafting

INTRODUCTION

Health-care providers need to be able to reliably assess their activities in terms of outcomes, quality, and cost-effectiveness. This is, in part, due to the constant rise in health care costs and the fact that resources are limited, though other drivers include the increased demand for health services and the need to compare clinical outcomes between centers. As all institutions require information on quality of care they need to be able to summarize their activities in terms of outcomes adjusted by the center's specific characteristics.1

In the area of heart surgery, both in Europe2 and the United States,3 the periodic publication of outcomes reports is now widespread and adequately regulated.4,5 These reports incorporate information from mathematical models which are used to predict the likelihood of a certain event, such as death, occurring in a given individual based on a group of risk factors attributed to that particular patient.6

The European System for Cardiac Operative Risk Evaluation (EuroSCORE) is a logistic model which is used to predict hospital mortality in patients undergoing a cardiac intervention. Using 18 risk variables and a beta coefficient associated with each variable (Table 1), the model provides the likelihood of death for any individual. The model was created and initially validated in a cross-sectional study7,8 of 19 030 European patients in 1999. Since then, it has become the most widely used model worldwide in this type of patient.

A much simpler variant of the logistic model is the additive EuroSCORE, which assigns a weight to each risk factor presented by the patient. The sum of the weights provides the likelihood of dying for that patient. The widespread and uniform use of a single probabilistic model allows for internal and external comparisons over time and can help to minimize risk adverse behavior which might be fomented if comparisons are made using unadjusted outcomes.9,10

The primary objective of the present study was to validate this predictive model of mortality in a large teaching hospital in Barcelona, Spain. The model was assessed in terms of fit and discriminatory capacity.11A second study objective was to compare the additive and logistic versions of the model and to determine which was the most appropriate for use in different groups of patients defined by their level of risk. The additive version has been the most widely used of the model variants because, although it is less precise, it is much easier to calculate and it can be calculated at the bedside. Nevertheless, the logistic equation has been shown to better predict mortality, particularly in high risk patients, and is recommended for use in those patients.12-14

METHODS

Patients

This was a validation study performed in the Hospital Clínic of Barcelona (HCB) using a retrospective design. The hospital is a 720-740 bed center which deals with 40 000 admissions annually. Patients with cardiovascular disease are attended in the Clinical Institute for Diseases of the Thorax which includes, among others, the Cardiovascular Surgery and Cardiology services. The study was approved by the Institutional Review Board of the HCB.

The center's computerized database (SAP®) was used to obtain data on all patients who underwent procedures defined in the CIE-9-CM15 and who were assigned codes 36.10 to 36.17 and 36.19. The SAP is used for both clinical and administrative purposes and has high reliability. All discharges are coded. The study population consisted of patients derived for coronary surgery and who underwent surgery between July 1, 2004 and July 1, 2006.

Variables and Statistical Analysis

The 18 variables included in the EuroSCORE predictive model were identified for all patients included in the study (Table 1) together with administrative variables (date of admission and discharge) and data on deaths occurring while patients were hospitalized in relation to the intervention. These data were obtained from computerized clinical records which included discharge history as well as surgery, pre-anesthesia, and laboratory reports. If no value was recorded for a specific risk factors, it was assumed that the risk factor was absent.

Both the logistic and additive versions of the Euro-SCORE12 were used to predict mortality. In the case of the additive EuroSCORE, the probability of dying was calculated by summing the relative weights for each risk factor for all individuals.

In order to calculate predicted mortality using the logistic model, the following equation was used

where b0 takes a value of -4.789594 (logistic regression constant) and bi is the regression coefficient for the variable ci in Table 1. For the age variable, in the logistic method b was multiplied by the number of years that the patient exceeded 60 years of age. In the additive method, a weight of 1 was assigned for every 5 years (or part of 5 years) over 60.

The validity of the logistic regression model was analyzed by examining its goodness of fit and discriminatory capacity. Goodness of fit was assessed using the Hosmer-Lemeshow test16-19 which estimates a C statistic from the difference between observed and expected values for mortality in different risk groups. The lower the C statistic the better the model's fit. A value of P>.05 indicates that the model fits the data well and that it therefore accurately predicts mortality. The test is most frequently used to validate recently created models but it is equally useful in validating an existing model which has been applied in a new set of data, as in the present study.

The model's ability to discriminate is assessed in terms of its capacity to distinguish between patients who died during hospitalization from those who did not. Discriminatory capacity was analyzed by calculating the area under the ROC curve. A value of 0.5 indicates that the model is equivalent to pure chance and a value of 1 indicates perfect discrimination.17

The logistic and additive models were compared by calculating the mortality predicted by each in both the overall sample and in 2 sub-groups defined by level of risk. The high and low risk groups were defined using a cut-off point on the additive EuroSCORE of 6 points,6 a cut point which had previously been used for this purpose.8,12 ROC curves were calculated for both models.

The validation analysis was carried out using the STATA® v.8 statistical package and the comparative analysis using SPSS® v.12.0.

RESULTS

A total of 498 patients underwent heart surgery in HCB between July 1, 2004 and June 30, 2006. The distribution of EuroSCORE risk variables in these patients is summarized in Table 2.

For the logistic model, a C statistic of 11.51 (P=.32) was obtained on the Hosmer-Lemeshow test and the area under the ROC curve was 0.83 (Figure 1).

Figure 1. ROC curve for EuroSCORE logistic model.

Among the patients who underwent surgery during the study period, there were 29 hospital deaths, giving an overall mortality rate of 5.8%. Total predicted mortality was 3.9% using the additive model and 4.2% using the logistic model (Table 3).

In the low risk group (n=412), observed mortality was very similar to that predicted by both models; however, in the high risk group (n=86) the values predicted by the logistic model were closer to observed values (Table 3). Both models showed good discriminatory power, with an area under the ROC curve of 0.84 for the additive model and 0.83 for the logistic model (Figures 1 and 2).

Figure 2. ROC curve for EuroSCORE additive model.

DISCUSSION

In order to evaluate the quality of health care services and to be able to adequately inform patients on the likely outcomes of the health care process, crude values for overall observed or expected outcomes are often not sufficient. Prognostic models which take into account patients' specific characteristics and which provide risk-adjusted outcomes for interventions are required and more useful.6

There are many risk-adjusted models available to predict mortality in cardiac surgery interventions, though in recent years the EuroSCORE8 has become one of the most widely used in western countries.

The HCB is a reference center for this type of heart surgery and performs a large number of coronary interventions. In this type of center, reliable and comparable data are required in order to assess the quality of care.

Before using a probabilistic model in a context other than that for which it was created,20 the model should be validated to ensure that it does not generate erroneous probabilities. The aim of the present study was to validate21 the EuroSCORE in the HCB.

The C statistic obtained with the Hosmer-Lemeshow was P=.32, which indicates satisfactory model fit for patients undergoing surgery in our center. The model's discriminatory power was also adequate, as indicated by an area under the ROC curve of 0.83. Given these results, we can conclude that the EuroSCORE model has been validated for use in this center and that it has proven to be a reliable instrument. This signifies that the model's predictions of the probability of dying are valid and appropriately risk-adjusted for surgery patients in HCB. It should be pointed out that mortality during hospitalization is a very favorable measure of mortality as it does not incorporate mortality after discharge. Nevertheless, we believe that intervention-related mortality after discharge was practically null in this series.

The results of this study are in line with those reported in the earlier article on model validation in six countries in the European Union,22 to which Spain contributed 2422 patients. In the earlier study, the Hosmer-Lemeshow C statistic was P =.38 and the area under the ROC curve was 0.87. Nevertheless, the earlier validation in Spain took place in very diverse settings and conditions, which meant there was a need for further validation in specific contexts before the model could be used with confidence.

The additive, or standard, model has been the most widely used because it is easy to use. It is a simplified version of the logistic model and the weights it uses are derived from that model. A previous study12 indicated that the additive model tended to underestimate the probability of death in high risk patients.

In the comparison of the 2 models, only low and high risk groups were studied because of the relatively low mortality in the study population. Creating a larger number of risk groups would have led to very broad confidence intervals for the predicted mortality rates and would have hindered comparisons. The cut point for defining the 2 groups was a EuroSCORE value on the additive model of >6. In this study, we found that the logistic model more accurately predicted the probability of death in the high risk group, a result in line with previous studies which have compared the 2 models.12

These results indicate that the logistic model is more accurate and more appropriate for use in daily practice in heart surgery units. The fact that most such units now have the technical resources to rapidly calculate a score using the logistic model further supports its use.

A study in a larger sample would generate a larger number of events (deaths) and would provide more solid results as the Hosmer-Lemeshow test is based on contrasting predicted events with observed events.

A direct comparison of real and expected events was not carried out as they are two distinct variables which provide information about events. True mortality describes the event death, whether observed or otherwise, for each patient (dichotomous variable). Predicted mortality, on the other hand, indicates the likelihood of dying for each patient based on specific characteristics included in the model (quantitative continuous variable). The analysis of the relationship between these 2 variables is precisely what constitutes the validation of the model. The validated model is useful because it allows us to perform risk assessments for patients which can then be compared with observed outcomes, while taking into account the level of risk.

The publication of the outcomes of health care has been a reality in the United States and the United Kingdom for over 15 years. The impact of such publications has been extensively analyzed, with the results showing a clear improvement in the quality of health care.23 The instrument validated in the present study could be useful in providing systematic information on the outcome of interventions in centers providing the relevant services. The information could be published and made available both to citizens and the purchasers of health care.

CONCLUSIONS

The results of this study allow us to conclude that the EuroSCORE is a useful probabilistic model in this public, teaching hospital. It can be used to estimate the probability of death in patients scheduled for heart surgery and to assess the outcomes of health care. The logistic model is the most reliable of the 2 versions, particularly in high risk patients.

ABBREVIATIONS
HCB: Hospital Clínic of Barcelona
EU-A: additive EuroSCORE
EU-L: logistic EuroSCORE
EuroSCORE: European System for Cardiac Operative Risk Evaluation
ROC: receiver operating characteristic

SEE EDITORIAL ON PAGES 567-71

Correspondence:
Dr. A. Trilla.
Servicio de Medicina Preventiva y Epidemiología. Hospital Clínic. Villarroel, 170. 08036 Barcelona. España.
E-mail: atrilla@clinic.ub.es

Received August 11, 2007.
Accepted for publication December 19, 2007.

Bibliography

[1]

Chassin MR, Hannan EL, DeBuono BA..

Benefits and hazards of reporting medical outcomes publicly..

N Engl J Med, (1996), 334 pp. 394-8

http://dx.doi.org/10.1056/NEJM199602083340611 | Medline

[2]

Rowan K, Harrison D, Brady A, Black N..

Hospitals' star ratings and clinical outcomes: ecological study..

BMJ, (2004), 328 pp. 924-5

http://dx.doi.org/10.1136/bmj.38007.694745.F7 | Medline

[3]

Davies HT, Marshall MN..

Public disclousure of performance data..

Lancet, (1999), 353 pp. 1639-40

Medline

[4]

Keogh B, Spiegelhalter D, Bailey A, Roxburgh J, Magee P, Hilton C..

The legacy of Bristol: public disclosure of individual surgeons' results..

BMJ, (2004), 329 pp. 450-4

http://dx.doi.org/10.1136/bmj.329.7463.450 | Medline

[5]

Keogh BE, Dussek J, Watson D, Magee P..

Public confidence and cardiac surgical outcomes..

BMJ, (1998), 316 pp. 1759-60

Medline

[6]

Asimakopoulos G, Al-Ruzzeh S, Ambler G, Omar RZ, Punjabi P, Amrani M, et al..

An evaluation of existing risk stratification models as a tool for comparison of surgical performances for coronary artery bypass grafting between institutions..

Eur J Cardiothorac Surg, (2003), 23 pp. 935-41

Medline

[7]

Roques F, Nashef SA, Michel P, Gauducheau E, de Vincentiis C, Baudet E, et al..

Risk factors and outcome in European cardiac surgery: analysis of the EuroSCORE multinational database of 19030 patients..

Eur J Cardiothorac Surg, (1999), 15 pp. 816-22

Medline

[8]

Nashef SA, Roques F, Michel P, Gauducheau E, Lemeshow S, Salamon R..

European system for cardiac operative risk evaluation (EuroSCORE)..

Eur J Cardiothorac Surg, (1999), 6 pp. 9-13

[9]

Bridgewater B, Grayson AD, Au J, Hassan R, Dihmis WC, Munsch C, et al..

Improving mortality of coronary surgery over first four years of independent practice: retrospective examination of prospectively collected data from 15 surgeons..

BMJ, (2004), 329 pp. 421

http://dx.doi.org/10.1136/bmj.38173.577697.55 | Medline

[10]

Treasure T..

Lessons from the Bristol case..

BMJ, (1998), 163 pp. 1685-6

[11]

Wade A..

Derivation versus validation..

Arch Dis Child, (2000), 83 pp. 459-60

Medline

[12]

Michel P, Roques F, Nashef SA; EuroSCORE Project Group..

Logistic or additive EuroSCORE for high-risk patients? Eur J Cardiothorac Surg, (2003), 23 pp. 684-7

http://dx.doi.org/10.1136/emj.2006.035220 | Medline

[13]

Bridgewater B, Grayson AD, Jackson M, Brooks N, Grotte GJ, Keenan J, et al..

North West Quality Improvement Programme in Cardiac Interventions. Surgeon specific mortality in adult cardiac surgery: comparison between crude and risk stratified data..

BMJ, (2003), 327 pp. 13-7

http://dx.doi.org/10.1136/bmj.327.7405.13 | Medline

[14]

Sergeant P, de Worm E, Meyns B..

Single centre, single domain validation of the EuroSCORE on a consecutive sample of primary and repeat CABG..

Eur J Cardiothorac Surg, (2001), 20 pp. 1176-82

Medline

[15]

Free online searchable 2004 ICD-9-CM and Medical Terminology Dictionary [citado 28 Ago 2005]. Disponible en: http://icd9cm.chrisendres.com/index.php

[16]

Lemeshow S, Hosmer D..

A review of goodness of fit statistic for use in the development of logistic regresión models..

Am J Epidemiol, (1982), pp. 92-106

[17]

Applied logistic regression. 2nd ed. New York: John Wiley & Sons; 2000.

[18]

Rué M..

Apunts metodològics sobre els models probabilistics..

Annals de Medicina, (2004), 87 pp. 10-1

[19]

Lemeshow S, Klar J, Teres D..

Outcome prediction for individual intensive care patients:useful, misused, or abused? Intensive Care Med, (1995), 21 pp. 770-6

Medline

[20]

Bhatti F, Grayson AD, Grotte G, Fabri BM, Au J, Jones MT, et al..

The logistic EuroSCORE in cardiac surgery:how well does it predict operative risk? Heart, (2006), 92 pp. 1817-20

[21]

Hosmer DW, Taber S, Lemeshow S..

The importance of assessing the fit of logistic regression models: a case study..

Am J Public Health, (1991), 81 pp. 1630-5

Medline

[22]

Roques F, Nashef SA, Michel P, Pinna Pintor P, David M, Baudet E; The EuroSCORE Study Group..

Does EuroSCORE work in individual European countries? Eur J Cardiothorac Surg, (2000), 18 pp. 27-30

[23]

Hannan EL, Kilburn H, Racz M, Shields E, Chassin MR..

Improving the outcomes of coronary artery bypass grafting surgery in New York State..

JAMA, (1994), 271 pp. 761-6

Medline

REVISTA ESPAÑOLA DE

CARDIOLOGÍA

Validation of the EuroSCORE Probabilistic Model in Patients Undergoing Coronary Bypass Grafting

Validación del modelo probabilístico EuroSCORE en pacientes intervenidos de injerto coronario

Options

Keywords

Year/month	Html	Pdf	Total
2025 July	30	3	33
2025 June	118	14	132
2025 May	117	16	133
2025 April	68	10	78
2025 March	98	11	109
2025 February	83	16	99
2025 January	76	31	107
2024 December	52	17	69
2024 November	81	39	120
2024 October	67	34	101
2024 September	97	18	115
2024 August	50	39	89
2024 July	31	17	48
2024 June	48	11	59
2024 May	43	16	59
2024 April	56	25	81
2024 March	104	19	123
2024 February	83	20	103
2024 January	90	16	106
2023 December	99	21	120
2023 November	109	35	144
2023 October	151	25	176
2023 September	82	12	94
2023 August	87	7	94
2023 July	118	19	137
2023 June	109	14	123
2023 May	145	18	163
2023 April	82	16	98
2023 March	111	13	124
2023 February	105	25	130
2023 January	87	8	95
2022 December	92	23	115
2022 November	102	20	122
2022 October	71	16	87
2022 September	57	22	79
2022 August	39	18	57
2022 July	46	26	72
2022 June	57	31	88
2022 May	51	29	80
2022 April	76	29	105
2022 March	66	30	96
2022 February	53	19	72
2022 January	73	15	88
2021 December	45	31	76
2021 November	49	27	76
2021 October	77	18	95
2021 September	62	18	80
2021 August	51	31	82
2021 July	29	21	50
2021 June	27	15	42
2021 May	70	35	105
2021 April	79	60	139
2021 March	73	18	91
2021 February	77	12	89
2021 January	60	8	68
2020 December	72	17	89
2020 November	28	15	43
2020 October	60	12	72
2020 September	21	8	29
2020 August	44	13	57
2020 July	34	14	48
2020 June	34	17	51
2020 May	48	11	59
2020 April	38	10	48
2020 March	42	10	52
2020 February	51	8	59
2020 January	49	13	62
2019 December	78	36	114
2019 November	53	50	103
2019 October	80	29	109
2019 September	122	22	144
2019 August	77	24	101
2019 July	180	66	246
2019 June	98	71	169
2019 May	83	56	139
2019 April	46	26	72
2019 March	58	19	77
2019 February	95	21	116
2019 January	69	23	92
2018 December	89	28	117
2018 November	62	24	86
2018 October	57	16	73
2018 September	33	12	45
2018 August	54	12	66
2018 July	76	18	94
2018 June	64	11	75
2018 May	49	18	67
2018 April	63	11	74
2018 March	39	5	44
2018 February	61	7	68
2018 January	60	4	64
2017 December	49	8	57
2017 November	50	12	62
2017 October	38	7	45
2017 September	52	10	62
2017 August	27	12	39
2017 July	39	5	44
2017 June	67	12	79
2017 May	87	13	100
2017 April	32	8	40
2017 March	69	19	88
2017 February	118	5	123
2017 January	108	7	115
2016 December	45	12	57
2016 November	71	4	75
2016 October	108	4	112
2016 September	206	9	215
2016 August	87	6	93
2016 July	91	9	100
2016 June	99	18	117
2016 May	67	17	84
2016 April	91	21	112
2016 March	100	23	123
2016 February	99	25	124
2016 January	100	16	116
2015 December	102	10	112
2015 November	78	14	92
2015 October	109	14	123
2015 September	105	24	129
2015 August	106	21	127
2015 July	100	18	118
2015 June	55	12	67
2015 May	111	12	123
2015 April	68	9	77
2015 March	62	12	74
2015 February	67	13	80
2015 January	53	9	62
2014 December	60	8	68
2014 November	71	10	81
2014 October	114	12	126
2014 September	101	8	109
2014 August	91	3	94
2014 July	58	9	67
2014 June	62	11	73
2014 May	80	12	92
2014 April	72	16	88
2014 March	73	8	81
2014 February	67	9	76
2014 January	76	8	84
2013 December	81	18	99
2013 November	76	13	89
2013 October	61	22	83
2013 September	54	21	75
2013 August	71	40	111
2013 July	66	29	95
2013 June	68	42	110
2013 May	57	22	79
2013 April	41	20	61
2013 March	51	18	69
2013 February	44	15	59
2013 January	48	19	67
2012 December	37	15	52
2012 November	24	18	42
2012 October	11	7	18
2012 September	1321	0	1321