JACC: HEART FAILURE
VOL.
ª 2014 BY THE AMERICAN COLLEGE OF CARDIOLOGY FOUNDATION PUBLISHED BY ELSEVIER INC.
-, NO. -, 2014
ISSN 2213-1779/$36.00 http://dx.doi.org/10.1016/j.jchf.2014.04.006
STATE-OF-THE-ART PAPER
Factors Influencing the Predictive Power of Models for Predicting Mortality and/or Heart-Failure Hospitalization in Patients With Heart Failure Wouter Ouwerkerk, MSC,* Adriaan A. Voors, MD, PHD,y Aeilko H. Zwinderman, PHD*
ABSTRACT The present paper systematically reviews and compares existing prediction models in order to establish the strongest variables, models, and model characteristics in patients with heart failure predicting outcome. To improve decision making accurately predicting mortality and heart-failure hospitalization in patients with heart failure can be important for selecting patients with a poorer prognosis or nonresponders to current therapy, to improve decision making. MEDLINE/ PubMed was searched for papers dealing with heart failure prediction models. To identify similar models on the basis of their variables hierarchical cluster analysis was performed. Meta-analysis was used to estimate the mean predictive value of the variables and models; meta-regression was used to find characteristics that explain variation in discriminating values between models. We identified 117 models in 55 papers. These models used 249 different variables. The strongest predictors were blood urea nitrogen and sodium. Four subgroups of models were identified. Mortality was most accurately predicted by prospective registry-type studies using a large number of clinical predictor variables. Mean C-statistic of all models was 0.66 0.0005, with 0.71 0.001, 0.68 0.001 and 0.63 0.001 for models predicting mortality, heart failure hospitalization, or both, respectively. There was no significant difference in discriminating value of models between patients with chronic and acute heart failure. Prediction of mortality and in particular heart failure hospitalization in patients with heart failure remains only moderately successful. The strongest predictors were blood urea nitrogen and sodium. The highest C-statistic values were achieved in a clinical setting, predicting short-term mortality with the use of models derived from prospective cohort/registry studies with a large number of predictor variables. (J Am Coll Cardiol HF 2014;-:-–-) © 2014 by the American College of Cardiology Foundation.
H
eart failure (HF) is a major cause of cardio-
Accurately predicting prognosis can be of benefit
vascular mortality and morbidity (1), and
for patients with heart failure. First, patients with a
its prevalence and incidence is increasing
poorer prognosis might benefit more from aggressive
(2). Despite wider use of evidence-based medical
treatment and a closer follow-up (5). Prediction rules
therapy and preventive device therapy, the prognosis
like the CHA 2DS2-VASc (Congestive heart failure [or
remains poor (3,4).
left ventricular systolic dysfunction]; Hypertension:
From the *Department of Clinical Epidemiology, Biostatistics, and Bioinformatics, Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands; and the yDepartment of Cardiology, University of Groningen, University Medical Center, Groningen, the Netherlands. Prof. Voors received consultancy fees and/or research grants from Alere, Bayer HealthCare, Cardio3Biosciences, Celladon, Novartis, Servier, Torrent, Trevena Vifor Pharmaceuticals; is supported by a grant (#FP7-242209-BIOSTAT-CHF) from the European Commission; and is a clinical established investigator (2006T37) of the Dutch Heart Foundation. All other authors have reported that they have no relationships relevant to the contents of this paper to disclose. Manuscript received December 3, 2013; revised manuscript received April 14, 2014, accepted April 15, 2014.
2
Ouwerkerk et al.
JACC: HEART FAILURE VOL.
-, NO. -, 2014 - 2014:-–-
Discriminating Power of Models Predicting Outcome in HF
ABBREVIATIONS
blood pressure consistently above 140/90
and “(re-)hospitalization.” These terms had to be used
AND ACRONYMS
mm Hg [or treated hypertension on medica-
in either title or abstract. We drew a distinction
tion]; Age $75 years; Diabetes mellitus; pre-
between papers including patients diagnosed with
vious stroke or transient ischemic attack or
CHF and those with ADHF.
ADHF = acute decompensated heart failure
CHF = chronic heart failure HF = heart failure HR = hazard ratio OR = odds ratio
thromboembolism; Vascular disease [e.g., peripheral artery disease, myocardial infarction, aortic plaque]; Age 65 to 74 years; Sex category [i.e., female sex]) and the TIMI (Thrombolysis In Myocardial Infarction) risk
score are widely used in clinical practice to identify high-risk patients who require medical/surgical treatment (6,7). These prediction rules are used to justify medical treatment (like antithrombotic therapy for patients with atrial fibrillation). However, the clinical value of the HF risk predictors remains limited. Second, accurately predicting mortality and morbidity might help patients in their decision making (8). Third, identifying patients who are at risk and do not respond to currently recommended therapies for HF might lead to personalized medicine aimed at targeted treatments for patients with HF. Finally, improved prognostic
risk models may help
in
designing trials by choosing population characteristics with higher event rates. Many studies on predictive markers for outcome in
DATA EXTRACTION. Papers were only included in
the present analysis when the predictor variables were reported and the predicted value was quantified using the C-statistics or receiver-operating characteristic curve value. Of these papers, all reported models were included, even if only variable or C-statistics were available. We collected the predicted outcome variable of each model and grouped these according to the following: 1) mortality; 2) mortality or HF hospitalization; or 3) HF hospitalization. We documented all predictor variables and, when published, their predictive power (odds ratio [OR] and hazard ratio [HR]). We reduced the multitude of different variables to generalized variables. (See the Online Appendix for details on the data extraction, data reduction, and statistical analysis.) For each model—derivation or validation (whether the paper described a new model or validated an existing one)— we collected the following information: Total number of variables in the model
patients with HF and several reviews on prediction
Time of prediction in days (the period for which the
models have been published. Most of these models
model makes its prediction [e.g., in 1-year mortal-
focused on prediction of HF hospitalization and were found to perform poorly or only averagely on a specific patient population (9–12). Nutter et al. (13) evaluated
ity, the time of prediction would be 365 days]) Study design (randomized controlled trial, cohort study or registry)
6 prognostic models predicting mortality. They
Retrospective or prospective data collection
concluded that the prediction models used were
Type of prediction model (subjective prediction, a
adequate in discriminating patients, but they might
risk score– or regression-based model)
underestimate absolute risk of mortality in elderly
The form of statistical analysis used to derive the
patients (14). All these reviews focused on descriptive
model (classification and regression tree analysis,
analyses to explain the predictive power of the models.
Cox proportional hazards regression, generalized
The goal of this paper is to provide an overview of different prediction models developed in the recent years. We compared models with respect to the number, type, and predictive power of predictor
linear model or hierarchical modified Poisson regression) Source of data (medical records or administrative registry data)
variables used. We performed a meta-analysis to
Total number of patients studied in the paper
detect the predictive and discriminating value of
Mean age
variables and models and analyzed which model
Percent of male patients
characteristics were associated with the highest C-statistic value.
STATISTICAL ANALYSIS. First, models were com-
pared with respect to the predictor variables. We
METHODS
therefore performed hierarchical cluster analysis to identify those subgroups of models that were com-
SEARCH STRATEGY. MEDLINE/PubMed was searched
parable (15–17). We counted the models in each sub-
for relevant English-language papers using established
group incorporating more than 5 models. Next, the
methods. Search terms used were: “heart failure,”
predictive weights of the variables for predicting
“chronic heart failure” (CHF), “acute decompensated
mortality
heart failure” (ADHF), “risk scores,” “prediction,”
analyzed, using fixed and random effect models. We
“prognosis,” “models,” “mortality,” “(re-)admission,”
separately analyzed the z-scores, which are the OR/SE
and
HF
hospitalization
were
meta-
JACC: HEART FAILURE VOL.
-, NO. -, 2014
Ouwerkerk et al.
- 2014:-–-
Discriminating Power of Models Predicting Outcome in HF
(OR) and the HR/SE (HR), of the OR and HR of pre-
of the papers there were 10 additional models for
diction variables depending on whether they were
which no C-statistics were provided. These models
obtained from case-control or cohort studies.
were included in the next section comparing models
Third, we estimated the mean C-statistics by fixedand random-effects meta-analysis and used meta-
on their predictor variables, but were excluded from the meta-analysis and meta-regression.
regression to estimate the association between
We excluded 5 models from the variables analysis
study and model characteristics and C-statistics (18).
because these constituted models developed for measuring activities of daily life (19,20) or used only
RESULTS
subjective predictions by the physician and nurse (21).
We identified 117 different models in 55 papers, all
PREDICTORS OF OUTCOME. The prediction models
published between 1994 and 2012 (details are pre-
showed a great variety in number and type of vari-
sented in the Online Appendix). These models were
ables. The numbers varied from 1 (22–24) to 65 (25). A
also divided into CHF and ADHF groups, consisting of
total of 249 different variables were used in 117
10 and 111 models, respectively. Four models were
models. An OR/HR was for 140 variables; no OR/HR,
used in predicting events in both patient groups. In 8
therefore, was mentioned for 109 variables. Most
T A B L E 1 The Most Frequently Used Variables, Along With the Number of Times Used in the Different Models, and Their Predictive Scores
Odds Ratio
Hazard Ratio
n
z-Score
Mean
95% CI
z-Score
Mean
95% CI
Age, yrs
78
17,656
1.04
1.03 to 1.04
20,728
1.06
1.06 to 1.07
Sex
53
15,542
1.12
1.11 to 1.14
10,813
1.22
1.19 to 1.26
Systolic blood pressure
51
31,799
1.30
1.29 to 1.32
19,353
1.16
1.15 to 1.18
Sodium
46
33,502
1.41
1.39 to 1.43
30,343
1.09
1.08 to 1.09
Diabetes
41
15,849
1.14
1.12 to 1.15
21,227
1.44
1.41 to 1.47
Creatinine
38
34,191
1.12
1.11 to 1.13
10,411
1.07
1.05 to 1.08
CHF and ADHF
New York Heart Association functional class
35
15,109
4.14
3.96 to 4.33
17,683
1.41
1.37 to 1.45
Blood urea nitrogen
28
60,698
2.28
2.26 to 2.31
12,495
1.07
1.06 to 1.09
12,843
1.08
1.07 to 1.10
15,346
1.12
1.11 to 1.14
8,526
1.08
1.06 to 1.10
15,327
1.44
1.39 to 1.49
Ejection fraction
23
Hemoglobin
23
(N-terminal pro) B-type natriuretic peptide
23
CHF Age, yrs
75
17,566
1.04
1.03 to 1.04
20,650
1.06
1.06 to 1.07
Sex
53
15,542
1.12
1.11 to 1.14
10,813
1.22
1.19 to 1.26
Systolic blood pressure
46
54,716
1.16
1.15 to 1.16
18,647
1.18
1.16 to 1.19
Sodium
44
33,502
1.41
1.39 to 1.43
36,527
1.07
1.06 to 1.07
Diabetes
41
15,849
1.14
1.12 to 1.15
21,227
1.44
1.41 to 1.47
Creatinine
37
34,191
1.12
1.11 to 1.13
10,411
1.07
1.05 to 1.08
New York Heart Association functional class
34
15,109
4.14
3.96 to 4.33
17,506
1.41
1.37 to 1.44
Blood urea nitrogen
25
61,497
2.37
2.34 to 2.39
12,014
1.07
1.06 to 1.08
Ejection fraction
23
12,843
1.08
1.07 to 1.10
(N-terminal pro) B-type natriuretic peptide
23
27,196
1.11
1.1 to 1.12
Etiology
22
6,795
1.23
1.17 to 1.29 -0.01 to 2.57
ADHF Systolic blood pressure
7
2,181
1.22
1.04 to 1.41
0.378
1.28
Age, yrs
6
8,454
1.46
1.37 to 1.55
0.254
1.27
-0.57 to 3.11
Blood urea nitrogen
4
0.250
1.26
-0.55 to 3.07
0.339
1.33
-0.32 to 2.98
Heart failure admissions
4
12,443
1.44
1.39 to 1.5
0.737
2.00
Sodium
4
0.530
1.53
-0.04 to 3.1
1,663
1.33
Dementia/Alzheimer disease or senility
3
6,306
2.42
2.15 to 2.70
Cancer
2
0.827
1.86
0.39 to 3.33
Cerebrovascular disease
2
0.410
1.43
-0.28 to 3.14
Creatinine
2
Sex
2
0.636
1.63
0.13 to 3.13
0.438
1.33
ADHF ¼ acute decompensated heart failure; CHF ¼ chronic heart failure; CI ¼ confidence interval.
0.99 to 1.67
3
4
Ouwerkerk et al.
JACC: HEART FAILURE VOL.
T A B L E 2 The 10 Variables With the Highest z-Score Used >5 in Models
n
z-Score
-, NO. -, 2014 - 2014:-–-
Discriminating Power of Models Predicting Outcome in HF
Mean
95% CI
10 highest standardized OR
different models, and their predictive power, are shown in Table 1. Table 2 summarizes the value of the strongest predictor variables used most frequently measured by
Blood urea nitrogen
28
60.698
2.28
2.26-2.31
Cancer
10
37,997
1.84
1.81-1.88
Troponin
8
34,585
1.73
1.69-1.76
the z-score of OR and HR. The predictive value of predictor variables for CHF; ADHF; and for mortality, mortality or HF hospitalization, or HF hospitalization
Creatinine
38
34,191
1.12
1.11-1.13
Sodium
46
33,502
1.41
1.39-1.43
Systolic blood pressure
51
31,799
1.3
1.29-1.32
The most frequently used variables with the high-
Heart failure
9
27,924
1.28
1.26-1.30
est predictive values were blood urea nitrogen and
models are presented in the Online Appendix.
Arterial pH
6
27,753
1.87
1.83-1.92
sodium. There were 3 variables with a high predictive
Diastolic blood pressure
11
22,013
1.15
1.14-1.16
value of both OR and HR: sodium; blood urea nitro-
Renal failure
8
21,113
1.26
1.24-1.28
Sodium
46
30.343
1.09
1.08-1.09
Race
10
27,027
1.11
1.1-1.12
control studies, but not in prognostic cohort studies
Diabetes
41
21,227
1.44
1.41-1.47
(high OR, but low or no HR). The opposite situation
Age, yrs
77
20,728
1.06
1.06-1.07
was seen with ejection fraction and (N-terminal pro)
Systolic blood pressure
51
19,353
1.16
1.15-1.18
B-type natriuretic peptide, which were found to be
New York Heart Association functional class
35
17,683
1.41
1.37-1.45
highly prognostic in cohort studies but not in case-
(N-terminal pro) B-type natriuretic peptide
23
15,327
1.44
1.39-1.49
Angiotensin-converting enzyme inhibitor/angiotensin II receptor blockers
8
14,008
1.19
1.17-1.21
10 highest standardized HR
gen; and systolic blood pressure. Cancer, arterial pH, and renal failure were highly predictive in case-
control studies. In the ADHF models, the strongest predictor variable was HF admissions. Hierarchical clustering was performed on the prediction variables of the various models to identify
Ejection fraction
23
12,843
1.08
1.07-1.1
subgroups of similar models. The subgroups created
Blood urea nitrogen
28
12,495
1.07
1.06-1.09
by the hierarchical clustering are presented in Figure 1. There were 4 subgroups of models that
CI ¼ confidence interval; HR ¼ hazard ratio; OR ¼ odds ratio.
incorporated more than 5 models. The largest subgroup (purple) consisted of models using relatively few predictor variables. Variables used vary between
models used a combination of demographic, clinical,
these models; age, sodium, and systolic blood pres-
and easily obtainable data to achieve the highest
sure were the 3 most common variables used.
predictive power. The most frequently used vari-
A second subgroup (red) consisted of models on the
ables, along with the number of times used in the
basis of the Seattle Heart Failure Model. These models used the Seattle Heart Failure Model with addition of 1 or 2 predictor variables. The green subgroup contained
models
using
medication
(beta-blocking
agents), glomerular filtration rate, left ventricular ejection fraction, and New York Heart Association Class as predictor variables. The blue subgroup contained models that used such clinical variables as renal failure, weight, blood pressure, and body mass index, as distinct prediction variables different from the green and the other clusters. META-ANALYSIS OF THE PREDICTIVE VALUE FOR MORTALITY AND HF HOSPITALIZATION. There were
260 C-statistic values reported for 103 models: 181 comprised models predicting mortality; 32 were for HF hospitalization; and 47 for mortality or HF hospitalization. F I G U R E 1 Hierarchical Clustering Dendrogram
The models were ordered according to their similarity as a result of hierar-
There were more C-statistic values reported than there were models; 14 models were validated more
chical clustering. There were 4 subgroups (red, blue, green, and purple)
than once (up to 15). Eight models to predict mor-
found incorporated more than 5 models.
tality were also validated in predicting HF hospitalization. Although models were validated multiple
JACC: HEART FAILURE VOL.
-, NO. -, 2014
Ouwerkerk et al.
- 2014:-–-
Discriminating Power of Models Predicting Outcome in HF
times, and 79% of the C-statistic values were the
had larger weights in the overall mean. This resulted
result of derivation models, 69 models were as yet not
in the mean of 0.71 0.0010, whereas the raw un-
validated in separate patient cohorts.
corrected C-index mean was 0.74.
The highest C-statistic value (0.9) was achieved by
There
was
a
highly
significant
difference
Selker et al. (26) who used multivariable logistic
(p < 0.0001) in C-statistics for models predicting
regression to predict in-hospital mortality in patients
mortality, mortality or HF hospitalization, and HF
with CHF. The lowest C-statistic value (0.52) was
hospitalization: 0.71 0.001, 0.63 0.001, and 0.68
found in the multivariate model of Yamokoski et al.
0.001.
(21) (model 41 (21)) predicting 6-month HF hospitali-
The green cluster, with mean C-statistic values
zation. This value was lower than the C-statistic
of 0.83 0.003, had the highest predictive power of
values of the subjective prediction for HF hospitali-
all 4 clusters (blue: 0.82 0.018, red: 0.74 0.004,
zation by physicians and nurses (0.579 and 0.566,
and purple: 0.61 0.001). META-REGRESSION. Table 3 shows the results of
respectively) in the same paper. In Figure 2, we illustrate the C-statistics of models
multivariable meta-regression analysis of the relation
that reported a standard error or confidence interval.
between C-statistic and model/study characteristics.
The mean C-statistic, the black triangle in Figure 2,
Models predicting mortality had significantly higher
was 0.66 0.0005 for the models reporting a standard
C-statistic values than models predicting HF hospi-
error. Standard errors were reported in 3 models (17,
talization (D C-statistic ¼ 0.03, SE ¼ 0.001).
105, and 111) predicting mortality in ADHF. These
Papers describing the derivation of models re-
models had a mean C-statistic value of 0.71 0.0154,
ported lower C-statistic values than papers vali-
which was not statistically different from the mean
dating results ( D C-statistic ¼ -0.01, SE ¼ 0.0084).
statistic value for predicting mortality in CHF
Prospective cohort/registry studies yielded models
patients (0.71 0.0010). Keep in mind that these
with higher C-statistic values than models on the basis of data of randomized trials ( D C-statistic ¼
model, weighted for the SE-squared. All mortality
0.03, SE ¼ 0.037, DC-statistic ¼ 0.11, SE ¼ 0.042) or
models with a C-statistic value of |t|)
0.7035
0.1676
4.20
0.0000
Mortality
0.0296
0.0091
3.27
0.0012
181
Mortality or readmission
-0.0165
0.0106
-1.56
0.1207
47
in the characteristics of prediction models. The risk
0
-
-
-
32
factors found in these reviews (Ross et al. [9], Gia-
Validation
0
-
-
-
115
similar to the predictor variables found in this paper.
Derivation
-0.0095
0.0084
-1.14
0.2565
145
Ross et al. (9), Giamouzis et al. (10), and Betihavas
Number of variables
0.0036
0.0005
7.71
0.0000
256
et al. (12) also found that the C-statistic values of
Time of prediction
-0.0001
0.0000
-1.42
0.0003
260
(Intercept)
n
Outcome
Readmission
hospital HF hospitalization and compared differences
mouzis et al. [10], and Betihavas et al. [12]) were
Derivation or validation study
Study design Randomized controlled trial
prediction variables to the model. The previously published reviews focused only on
models predicting mortality was higher than that of models predicting HF hospitalization. They suggest
0
-
-
-
50
Cohort
0.0337
0.0374
0.90
0.3682
119
that either important predictors of HF hospitalization
Registry
0.1145
0.0422
2.71
0.0072
91
are lacking in the relevant models or that non-
0
-
-
-
124
talization risk. Developing a model using a systems
0.1025
0.0312
3.29
0.0012
136
biology approach, by incorporating information from
0
-
-
-
4
Point-based additive risk score
-0.0717
0.0376
-1.91
0.0579
44
Regression model
-0.0174
0.0321
-0.54
0.5885
209
medical factors may play a larger role in HF hospi-
Prospective or retrospective study Retrospective Prospective Type prediction model Subjective prediction
demographic, biomarker, genomic, proteomic, and the initial response to therapy might create a more effective prediction model and hopefully aid in understanding HF prognosis, as described by Giamouzis
Statistical method Generalized linear model
0
-
-
-
17
et al. (10). An additional advantage of this approach is
Cox proportional hazards
0.1078
0.0131
8.20
0.0000
134
that it will at least identify patients with a poor
Hierarchical modified Poisson
0.0940
0.0905
1.04
0.3000
4
outcome on currently recommended therapy, which
Multivariate logistic regression
0.1399
0.0205
6.82
0.0000
98
might lead to further development of targeted ther-
0
-
-
-
185
-0.1358
0.0198
-6.85
0.0000
75
-0.0000
0.0000
-3.54
0.0005
260
outcome for patients with HF. Although mortality models had the best discrimi-
Data source Medical records Claims data Sample size
apies,
eventually
leading
to
improvements
in
Age, yrs
-0.0013
0.0017
-0.78
0.4384
252
nating values, these models also have important
Male
-0.0010
0.0005
-1.92
0.0557
258
limitations. Nutter et al. (13) demonstrated that the
0
-
-
-
243
elderly cohort at or approaching the end of life.
-0.0014
0.0127
-0.11
0.9104
17
Nutter et al. (13) compared mortality prediction from
models underestimated the mortality risk in an
Patients’ diagnoses CHF ADHF Abbreviations as in Table 1.
6 prediction models in a retrospective cohort with a mean age of 82.7 8.2 and in-hospital death of 28.8%. The differences in predicted 1-year mortality between
Models using data from medical records had significantly better C-statistic values than models
the models was very high; the predicted mortality varied between 11.1 8.5% and 55.3 17.6% (13).
using claims data. Also models using more predictor
In addition to previous reviews (9,11), which
variables had better predictive values; C-statistic
enumerate the variables in prediction models, we
increased 0.0036 (SE ¼ 0.0005) with each added
quantified the predictive capabilities of each variable.
predictor variable.
We also explained variations in C-statistic values by
There was no significant difference in C-statistic values between patients diagnosed with either CHF or ADHF.
DISCUSSION
meta-analyzing model characteristics mentioned in the published reviews. PREDICTORS OF OUTCOME. The 10 models predict-
ing events in ADHF patients are not grouped into 1 subgroup in the dendrogram, as might be expected in
The present review shows that risk prediction in pa-
the event of the use of identical variables, but are
tients with CHF remains difficult. The best predictors
spread throughout the entire dendrogram. Similar to
of outcome were sodium and blood urea nitrogen. In
models developed for CHF patients, the models
addition, some characteristics of the prediction
developed for ADHF patients are inconclusive on
JACC: HEART FAILURE VOL.
-, NO. -, 2014
Ouwerkerk et al.
- 2014:-–-
Discriminating Power of Models Predicting Outcome in HF
which variables to use. There is no consensus as to
These models want to measure quality-of-care in
which variables are to be used to achieve the highest
contrast to patient disease prognosis. Krumholtz
predictive values.
et al. (27) shows that using claims data has its limi-
DISCRIMINATIVE POWER OF MODELS. Sixty-nine
tations but can be used to compare hospital-specific
derivation models were not yet validated. These
rehospitalization rates. Despite the reduced discrim-
models probably overestimate the prediction capa-
inating power, the risk of rehospitalization may be
bilities. Most of these models used internal valida-
more dependent on quality-of-care and system char-
tion, utilizing a bootstrap method for validation.
acteristics. It is important, therefore, to keep the
This, however, does not account for varying patient
objective in mind when creating a prediction model.
populations.
STUDY LIMITATIONS. Our meta-analysis was limited
In the meta-analysis, we only used the C-statistics
to published data only. From the 113 prediction
values of the models reporting standard errors. This
models found only 103 reported C-statistics, of which
might result in an underestimation of the mean
only 50 models in turn incorporated standard errors
values in the analysis; the raw mean C-statistic value
with their C-statistic values.
was, after all, higher than the mean C-statistic in the
We could not include all variations mentioned by
meta-analysis. Nevertheless, mean C-statistics indi-
Ross et al. (9) and Giamouzis et al. (10) in the meta-
cate only moderate predictive capacity.
regression. These variations might result in higher
PREDICTION MODELS. It is less difficult to predict
DC-statistic values than variations currently included
mortality, which had significantly higher C-statistic
in the meta-regression.
values, than to predict HF hospitalization. As expected, models developed in a derivation set reported
CONCLUSIONS
higher C-statistics than papers validating these models in a different patient population. In addition,
There are still difficulties associated with predicting
we found that cohort and prospective studies pro-
mortality and/or HF hospitalization in HF patients
duced higher C-statistics than models on the basis of
Prediction models need to be improved before they
data of randomized trials or retrospective data. Ran-
can be helpful to physicians and patients. Devel-
domized controlled trial studies had lower C-statistics
oping a model using a systems biology approach,
because randomized controlled trials were not pri-
incorporating information from demographic, bio-
marily created for model development, and the pop-
marker, genomic, proteomic, and initial responses to
ulation is more homogeneous (with therefore less
therapy might create a more effective model. An
discriminating capacity), relatively healthy (less
additional advantage of this approach is that it may
comorbidities), and highly controlled.
serve to identify patients with a poor outcome with
Models using data from medical records had
currently recommended therapy, thereby leading to
significantly better C-statistic values than models
the further development of targeted therapies and
using claims data did. This would suggest that pre-
eventually to improvements in outcome for patients
diction models are most accurate when created with
with HF.
data from patients followed prospectively in a cohort study using data from medical records. Models pre-
REPRINT REQUESTS AND CORRESPONDENCE: Dr.
dicting rehospitalization and mortality rates, how-
Wouter Ouwerkerk, Department of Clinical Epidemi-
ever, often use claims data instead of data from
ology, Biostatistics and Bioinformatics, Academic
medical records. Models predicting rehospitalization
Medical Center, P.O. 22660, room J1B-207, Meiberg-
and mortality rates are developed for purposes
dreef 9, 1105 AZ Amsterdam, the Netherlands. E-mail:
different from those for predicting disease prognosis.
[email protected].
REFERENCES 1. McCullough PA, Philbin EF, Spertus JA, et al., for the REACH Study Investigators. Confirmation of a
Force for the Diagnosis and Treatment of Acute and Chronic Heart Failure 2012 of the European
4. Thom T, Haase N, Rosamond W, et al. Heart disease and stroke statistics–2006 update: a
heart failure epidemic: findings from the Resource Utilization Among Congestive Heart Failure (REACH) study. J Am Coll Cardiol 2002;39:60–9.
Society of Cardiology. Eur Heart J 2012;33: 1787–847.
report from the American Heart Association Statistics Committee and Stroke Statistics Subcommittee. Circulation 2006;113:e85–151.
2. McMurray JJ, Adamopoulos S, Anker SD, et al. ESC guidelines for the diagnosis and treatment of acute and chronic heart failure 2012: the Task
3. Roger VL, Weston SA, Redfield MM, et al. Trends in heart failure incidence and survival in a community-based population. JAMA 2004;292: 344–50.
5. Hunt SA, Abraham WT, Chin MH, et al. 2009 focused update incorporated into the ACC/AHA 2005 Guidelines for the Diagnosis and
7
8
Ouwerkerk et al.
JACC: HEART FAILURE VOL.
Management of Heart Failure in Adults: a report of the American College of Cardiology Foundation/ American Heart Association Task Force on Practice Guidelines. J Am Coll Cardiol 2009;53:e1–90. 6. Lip GY, Nieuwlaat R, Pisters R, Lane DA, Crijns HJ. Refining clinical risk stratification for predicting stroke and thromboembolism in atrial fibrillation using a novel risk factor-based
mortality rates in elderly heart failure patients with a fatal heart failure admission. Congest Heart Fail 2010;16:196–201. 14. Kalogeropoulos AP, Georgiopoulou VV, Giamouzis G, et al. Utility of the Seattle Heart Failure Model in patients with advanced heart failure. J Am Coll Cardiol 2009;53:334–42.
approach: the Euro Heart Survey on Atrial Fibrillation. Chest 2010;137:263–72.
15. Hartigan JA. Clustering Algorithms. New York,
7. Antman EM, Cohen M, Bernink PJ, et al. The TIMI risk score for unstable angina/non-ST eleva-
16. Everitt BS, Landau S, Leese M, Stahl D. Cluster Analysis. 5th ed. New York: Halsted, 1980.
tion MI: a method for prognostication and therapeutic decision making. JAMA 2000;284:835–42.
17. Ploner A. Heatplus: Heatmaps with row and/or
8. Steyerberg EW. Applications of prediction models. In: Clinical Prediction Models: A Practical Approach to Development Validation and Updating. New York, NY: Springer, 2009:500. 9. Ross JS, Mulvey GK, Stauffer B, et al. Statistical models and patient predictors of readmission for heart failure: a systemic review. Arch Intern Med 2008;168:1371–86.
NY: Wiley, 1975:365.
column covariates and colored clusters. 2012; R package version 2.6.0. Available at: http://www. bioconductor.org/packages/release/bioc/html/ Heatplus.html. 18. Borenstein M, Hedges LV, Higgins JPT, Rothstein HR. A basic introduction to fixed-effect and random-effects models for meta-analysis. Res Synth Methods 2010;1:97–111. 19. Shah S, Vanclay F, Cooper B. Improving the
10. Giamouzis G, Kalogeropoulos A, Georgiopoulou V, et al. Hospitalization epidemic in patients with heart failure: risk factors, risk prediction, knowledge gaps, and future directions. J Card Fail 2011; 17:54–75. 11. Kansagara D, Englander H, Salanitro A, et al. Risk prediction models for hospital readmission a systematic review. JAMA 2011;306:1688–98. 12. Betihavas V,
Davidson PM,
Newton PJ,
Frost SA, Macdonald PS, Stewart S. What are the factors in risk prediction models for rehospitalisation for adults with chronic heart failure? Aust Crit Care 2012;25:31–40. 13. Nutter AL, Tanawuttiwat T, Silver MA. Evaluation of 6 prognostic models used to calculate
-, NO. -, 2014 - 2014:-–-
Discriminating Power of Models Predicting Outcome in HF
sensitivity of the Barthel Index for stroke rehabilitation. J Clin Epidemiol 1989;42:703–9.
22. Tabak YP, Johannes RS, Silber JH. Using automated clinical data for risk adjustment: development and validation of six diseasespecific mortality predictive models for pay-forperformance. Med Care 2007;45:789–805. 23. Ky B, French B, McCloskey K, et al. Highsensitivity ST2 for prediction of adverse outcomes in chronic heart failure. Circ Heart Fail 2011;4: 180–7. 24. Ketchum ES, Jacobson AF, Caldwell JH, et al. Selective improvement in Seattle Heart Failure Model risk stratification using iodine-123 metaiodobenzylguanidine imaging. J Nucl Cardiol 2012; 19:1007–16. 25. Wang L, Porter B, Maynard C, et al. Predicting risk of hospitalization or death among patients with heart failure in the Veterans Health Administration. Am J Cardiol 2012;110:1342–9. 26. Selker HP, Griffith JL, D’Agostino RB. A TimeInsensitive Predictive Instrument for Acute Hospital mortality Due to Congestive Heart Failure: Development testing, and use for comparing hospitals: A multicenter study. Med Care 1994;32: 1040–52. 27. Krumholz HM, Wang Y, Mattera JA, et al. An
20. Martín-Sánchez FJ, Gil V, Llorens P, et al., for the Acute Heart Failure Working Group of the Spanish Society of Emergency Medicine Investigation Group. Barthel Index-Enhanced Feedback for Effective Cardiac Treatment (BI-EFFECT) study: contribution of the Barthel Index to the Heart Failure Risk Scoring System model in elderly adults with acute heart failure in the emergency department. J Am Geriatr Soc 2012;60:493–8.
administrative claims model suitable for profiling hospital performance based on 30-day mortality rates among patients with heart failure. Circulation 2006;113:1693–701.
KEY WORDS heart failure, outcome, prognosis, risk factor, risk prediction
21. Yamokoski LM, Hasselblad V, Moser DK, et al. Prediction of rehospitalization and death in severe
A PPE NDI X For supplemental data extraction
heart failure by physicians and nurses of the ESCAPE trial. J Card Fail 2007;13:8–13.
and reduction and statistical analysis information, please see the online version of this paper.