ATLS Hypovolemic Shock Classification by Prediction of Blood Loss in Rats Using Regression Models.

SHOCK, Vol. 46, No. 1, pp. 92–98, 2016

ATLS HYPOVOLEMIC SHOCK CLASSIFICATION BY PREDICTION OF BLOOD LOSS IN RATS USING REGRESSION MODELS Soo Beom Choi, * † Joon Yul Choi, ‡ Jee Soo Park, * § and Deok Won Kim * † *Department of Medical Engineering, Yonsei University College of Medicine; † Graduate Program in Biomedical Engineering, Yonsei University; ‡ Department of Electrical and Computer Engineering, Seoul National University; and §Department of Medicine, Yonsei University College of Medicine, Seoul, Korea

Received 26 Sep 2015; first review completed 29 Oct 2015; accepted in final form 20 Jan 2016 ABSTRACT—In our previous study, our input data set consisted of 78 rats, the blood loss in percent as a dependent variable, and 11 independent variables (heart rate, systolic blood pressure, diastolic blood pressure, mean arterial pressure, pulse pressure, respiration rate, temperature, perfusion index, lactate concentration, shock index, and new index (lactate concentration/perfusion)). The machine learning methods for multicategory classification were applied to a rat model in acute hemorrhage to predict the four Advanced Trauma Life Support (ATLS) hypovolemic shock classes for triage in our previous study. However, multicategory classification is much more difficult and complicated than binary classification. We introduce a simple approach for classifying ATLS hypovolaemic shock class by predicting blood loss in percent using support vector regression and multivariate linear regression (MLR). We also compared the performance of the classification models using absolute and relative vital signs. The accuracies of support vector regression and MLR models with relative values by predicting blood loss in percent were 88.5% and 84.6%, respectively. These were better than the best accuracy of 80.8% of the direct multicategory classification using the support vector machine one-versus-one model in our previous study for the same validation data set. Moreover, the simple MLR models with both absolute and relative values could provide possibility of the future clinical decision support system for ATLS classification. The perfusion index and new index were more appropriate with relative changes than absolute values. KEYWORDS—Lactate concentration, linear regression, multicategory, perfusion, support vector regression, triage

INTRODUCTION

method showed accuracy of 80.8%, relative classifier information of 0.629, and a kappa index of 0.732 (4). However, studies indicate that direct multicategory classification is much more difficult than binary classification, and the classification accuracy may drop dramatically when the number of classes increases (5). Therefore, we suggest a relatively simple approach with higher accuracy to determine ATLS hypovolemic shock class. The prediction model for ATLS shock class could be reproduced by predicting blood loss in percent of subjects, and then by determining the class based on the criterion for ATLS shock. Support vector regression (SVR) and multivariate linear regression (MLR) are suitable to predict blood loss in percent, and have been used as prediction mechanisms for continuous variables in clinical conditions such as spinal cord injury, stroke, and Parkinson disease (6). Our hypothesis is that this approach for classifying ATLS shock class by predicting blood loss in percent is more accurate and simpler than the methods of our previous study, which was multicategory classification. Therefore, in this study, to improve the decision support tools for predicting ATLS hypovolemic shock class for rats in acute hemorrhage, we introduced a new simpler approach to classify ATLS shock class by predicting blood loss in percent using SVR and MLR. And we compared the accuracy between the multicategory classification in our previous study and the proposed model. Moreover, the data in our previous study were analyzed on the basis of the relative changes between rest (set to 100%) and immediately after the cessation of bleeding to reduce inter-subject differences. However, it is difficult to determine each individual’s resting values in emergency situations. Therefore, we analyzed the absolute data after

Predicting blood loss in percent is really important to determine treatments of patients with traumatic injury, which can cause to death by hypovolemic hemorrhagic shock. Hemorrhagic deaths typically occur very early, usually within the first 6 h of admission, and early hypo perfusion or shock has been demonstrated to promote coagulopathy (1). The Advanced Trauma Life Support (ATLS) suggests four classes of hypovolemic shock based on the percentage of estimated blood loss, and includes guides for appropriate treatments according to the classes (2). The importance of diagnosing hemorrhage at initial patient contact by on-scene first responders has been greatly emphasized, as more accurate diagnosis of hemorrhage severity and shock has been shown to lead to better treatment for these patients (3). Moreover, a study mentioned that prediction tools for massive transfusion in trauma patients have the advantage to immediately determine upon admission which patients will require resuscitation (1). In our previous study, to predict the four ATLS hypovolemic shock classes for triage, machine learning algorithms with feature selection methods for multicategory classification were applied to a rat model in acute hemorrhage. The previous prediction model using the direct multicategory classification Address reprint requests to Deok Won Kim, PhD, Department of Medical Engineering, Yonsei University College of Medicine, CPO Box 8044, Seoul, Republic of Korea: E-mail: [email protected] This study was supported by the National Research Foundation of Korea (NRF) Grant funded by the Korean Government (MEST) (NRF-2012R1A2A2A03045612). The authors report no conflicts of interest DOI: 10.1097/SHK.0000000000000574 Copyright ß 2016 by the Shock Society

92

Copyright © 2016 by the Shock Society. Unauthorized reproduction of this article is prohibited.

SHOCK JULY 2016 the cessation of bleeding instead of the relative data, and compared the performance of classification models between the absolute and relative change data. MATERIALS AND METHODS Data source We utilized the data, which were measured in our previous study (4). Blood loss in percent of 78 male Sprague-Dawley rats (Orient, Seongnamsi, Korea) was calculated for ATLS classification based on total volume withdrawn. Detailed information of data acquisition can be referred to our previous study (4). Our previous input data set consisted of 78 rats, the blood loss in percent of the dependent variables, and 11 independent variables (heart rate, systolic blood pressure (SBP), diastolic blood pressure (DBP), mean arterial pressure (MAP), pulse pressure, respiration rate, temperature, perfusion index, lactate concentration, shock index, and new index). Blood sampling for lactate concentration was repetitively acquired from the left femoral vein using a 31-gauge insulin syringe (BD Ultra-Fine; Becton Dickinson Korea, Seoul, Korea). Lactate concentration was measured by a portable blood lactate analyzer (Lactate Pro LT-1710; ARKRAY Inc, Kyoto, Japan) with a detection range of 0.8 to 23.3 mmol/L and a precision of 3% (3). Blood samples of approximately 5 mL were required, and the measuring time was 60 s. The perfusion index was continuously measured using a laser Doppler perfusion monitor (PeriFlux System 5000; Perimed, Stockholm, Sweden) with a probe (Probe 407; Perimed) that was attached to the right front sole. The principle of this method was to measure the Doppler shift changes in frequency that light undergoes when reflected by moving objects. A monochromatic laser with a 780-nm wavelength from the probe illuminated the skin through an optical fiber, and the light frequency changed when it was reflected by red blood cells 0.5 mm to 1.0 mm under the skin. Thus, frequency changes were based on the blood’s local speed (3). The shock index was defined as a ratio of heart rate to SBP. The new index is defined as a ratio of lactate concentration to perfusion, and suggested as a good indicator for severity of hypovolemic hemorrhagic shock in our previous studies (3,4). To validate the model for ATLS shock class, the data were randomly separated into two independent data sets: training and validation data sets (Fig. 1). The training data set comprised 66.7% (n ¼ 52) of the overall data set, was used to construct a model. The validation data set comprised 33.3% (n ¼ 26) of the overall data set, was used to assess each model’s performance to categorize samples into the correct four ATLS hypovolaemic shock classes. To compare the performance of the classification model using the vital signs in the previous study and the present model by predicting blood loss in percent in this study, we used the same training and validation

ATLS HYPOVOLEMIC SHOCK CLASSIFICATION

93

sets, including the blood loss in percent of dependent variable as mentioned above.

Machine learning for predicting blood loss in percent The 3-fold cross-validation method was used to construct an optimal SVR model (Fig. 1). We determined the ranking of the input variables using feature selection method for prediction of blood loss in percent, and then found the optimal variables to construct the classification models by increasing the number of variables in the ranking of their importance using the so-called sequential forward selection as the wrapper method (7). Spearman’s correlation coefficient was used as feature selection because nonparametric analysis was more suitable for this data set. To predict blood loss in percent, we used SVR method instead of artificial neural network or random forest. Support vector machine showed best performance among the several machine learning methods in our previous study. SVR is an extension of the support vector machine. SVR projects the descriptor matrix from the input space into a high-dimensional feature space via kernel functions. In this feature space, linear regressions of nonlinear problems can be conducted. The main characteristic of SVR is that instead of minimizing the observed training error, SVR attempts to minimize the generalized error bound so as to achieve generalized performance (8). In this study, we included three kernel functions such as linear, polynomial, and radial basis for SVR (9). The optimal parameters for SVR were selected by the 3-fold cross-validation method, and then the SVR model for prediction of blood loss in percent was constructed with the parameters using the training data set. The validation set were divided into four groups corresponding to the predicted blood loss in percent according to the ATLS classification criterion of shock of Classes I, II, III, and IV, which are 40% of blood loss in percent, respectively (10). Although there are validation methods for regression model such as root mean square error and mean absolute error, we chose accuracy, relative classifier information, and the kappa index for the validation dataset to compare with the results of our previous study (4).

Statistical analysis The absolute and relative data were summarized as mean (standard deviation) for continuous variables, respectively in Tables 1 and 2. Each continuous variable was tested for significant difference between the training (n ¼ 52) and validation sets (n ¼ 26) using Mann–Whitney U test. The MLR evaluated the relationship between the blood loss in percent and 11 input variables, and MLR predicted the dependent variable, blood loss in percent using regression equation. To optimize feature selection, backward stepwise elimination with a threshold of P value ¼ 0.10 was used to select variables for the final model. Collinearity is related to stability problems in a regression model. The collinearity of the variables in the created MLR model was tested by calculating the variance inflation factor (VIF). The existence of collinearity can be determined if the largest VIF is greater than 10, and we excluded the independent variables with collinearity in the MLR model (11). We used MATLAB 2012a (Mathworks Inc, Natick, MA) to analyze machine learning, and all statistical analyses were performed using SPSS 20.0 (IBM Corp, Armonk, NY). All reported P values were two-sided and P < 0.05 was considered statistically significant.

RESULTS Data set characteristics

The characteristics of the training and the validation sets for the absolute and the relative values are shown in Tables 1 and 2, respectively. The P values in Tables 1 and 2 show insignificant difference of all variables between the training and validation sets obtained by Mann–Whitney U test. Feature selection for support vector regression

FIG. 1. Flowchart for ATLS hypovolaemic shock classification by predicting blood loss in percent using regression models. ATLS indicates Advanced Trauma Life Support; SVR, support vector regression.

Table 3 shows the ranking of feature selection for the training set of the absolute and relative values using Spearman’s correlation coefficient between each variable and blood loss in percent. New index, MAP, DBP, and perfusion index were placed top ranking from first to fourth in the relative values. All of these variables were also selected by the direct multicategory


7.5 358.3 101.7 47.5 65.6 54.2 36.0 36.9 82.0 0.9 3.6 1.6

(3.9) (40.2) (18.9) (8.4) (11.4) (12.8) (17.0) (0.9) (62.2) (0.3) (0.6) (0.8)

21.5 322.7 72.8 35.8 48.1 37.0 37.2 36.4 48.3 1.1 4.6 2.6

35.6 311.9 45.8 23.3 30.8 22.5 40.9 35.3 32.9 1.3 7.0 7.2

(3.4) (38.1) (8.6) (5.6) (5.7) (7.6) (14.9) (1.0) (17.8) (0.5) (1.5) (9.2)

III (n ¼ 11)

Class

(3.8) (49.1) (16.1) (8.5) (10.8) (9.0) (9.7) (0.9) (17.7) (0.4) (1.0) (1.3)

II (n ¼ 10) 49.3 319.8 33.6 17.2 22.8 16.5 34.8 34.8 19.7 2.0 12.5 12.9

(4.7) (40.5) (20.7) (9.3) (12.7) (12.9) (16.5) (1.9) (10.2) (0.8) (5.8) (7.7)

IV (n ¼ 20) 32.2 326.9 58.1 28.5 38.4 29.7 36.8 35.7 41.2 1.5 7.9 7.3

(16.9) (43.9) (31.8) (14.5) (20.0) (18.4) (14.9) (1.6) (38.5) (0.8) (5.3) (7.9)

Total (n ¼ 52) 7.3 351.8 97.0 45.9 63.0 51.1 37.2 36.3 119.0 1.1 3.6 1.6

(2.5) (36.2) (5.6) (12.1) (8.1) (14.0) (7.8) (0.9) (63.4) (0.6) (0.4) (2.1)

I (n ¼ 5) 23.0 340.7 70.8 37.0 48.3 33.7 37.2 36.5 63.5 1.0 4.9 3.1

(3.4) (15.8) (21.7) (8.9) (13.0) (13.6) (14.4) (1.4) (24.1) (1.0) (2.0) (6.0)

III (n ¼ 6) 35.9 332.0 54.0 23.2 33.5 30.7 43.0 36.1 25.8 1.8 6.8 10.2

Class

(4.3) (45.9) (16.8) (9.5) (10.3) (14.8) (5.0) (0.8) (42.8) (0.1) (0.6) (2.9)

II (n ¼ 5)

Validation set

49.4 316.6 29.6 12.0 17.8 17.5 31.8 35.3 17.2 1.9 11.9 20.4

(6.1) (60.6) (9.3) (4.3) (5.6) (6.7) (11.7) (1.0) (10.8) (0.7) (4.9) (23.7)

IV (n ¼ 10) 33.1 331.5 56.1 25.9 36.0 30.1 36.5 35.9 47.7 1.6 7.8 11.1

(16.8) (45.9) (28.9) (15.6) (19.5) (16.5) (11.1) (1.1) (51.5) (0.8) (4.7) (16.6)

Total (n ¼ 26)

0.775 0.488 0.836 0.388 0.571 0.811 0.885 0.874 0.535 0.570 0.865 0.487

P-value

II (n ¼ 10)

III (n ¼ 11)

Total (n ¼ 52)

II (n ¼ 5)

(6.1) (12.5) (13.7) (8.5) (10.3) (38.2) (22.2) (0.7) (10.9) (58.2) (211.1) (1733.7)

33.1 92.4 52.3 45.2 48.5 63.8 77.7 97.6 40.5 141.2 244.8 785.1

(16.8) (9.7) (25.7) (25.7) (25.4) (36.4) (21.0) (1.3) (30.3) (61.4) (177.9) (1226.7)

Total (n ¼ 26)

0.775 0.750 0.992 0.335 0.589 0.488 0.417 0.448 0.528 0.853 0.992 0.641

P-value

SHOCK VOL. 46, No. 1


Relative values are ratios between rest (set to 100%) and immediately after the cessation of bleeding. Data are the mean (standard deviation). Blood loss (%) ¼ (bleeding/total blood volume) 100. Shock index ¼ heart rate/SBP. New index ¼ lactate concentration/perfusion index. P values represent significant difference of total values between the training and validation sets, obtained by Mann–Whitney U test. DBP indicates diastolic blood pressure; MAP, mean arterial pressure; SBP, systolic blood pressure.

(3.4) 49.4 (10.8) 88.6 (17.1) 28.1 (12.3) 20.3 (14.3) 23.9 (24.1) 43.6 (17.9) 65.3 (0.9) 97.0 (8.1) 18.5 (76.5) 180.4 (52.4) 392.5 (410.4) 1533.3

III (n ¼ 6)

IV (n ¼ 10)

Validation set (%) Class

(2.5) 23.0 (4.3) 35.9 (3.4) 94.2 (4.5) 95.3 (7.4) 65.7 (10.4) 52.7 (9.2) 57.4 (11.6) 44.6 (6.9) 61.0 (7.8) 48.6 (11.9) 82.1 (43.5) 61.4 (11.0) 93.6 (12.0) 73.3 (0.9) 98.9 (2.1) 97.5 (20.1) 45.6 (9.7) 29.1 (13.3) 102.3 (15.9) 145.4 (10.7) 146.2 (23.3) 193.3 (36.0) 238.5 (94.6) 556.0

I (n ¼ 5)

(4.7) 32.2 (16.9) 7.3 (7.7) 92.5 (6.3) 94.7 (20.3) 52.8 (26.3) 86.8 (14.7) 50.5 (24.8) 83.4 (16.5) 51.7 (25.2) 85.4 (38.3) 56.5 (33.0) 88.8 (20.7) 82.2 (21.0) 91.9 (1.1) 97.6 (1.1) 97.9 (15.1) 44.7 (29.9) 93.1 (48.2) 133.3 (44.4) 96.7 (150.6) 237.7 (146.3) 109.8 (829.9) 602.3 (683.9) 109.9

IV (n ¼ 20)

Blood loss (%) 7.5 (3.9) 21.5 (3.8) 35.6 (3.4) 49.3 Heart rate (%) 93.7 (2.1) 93.4 (5.0) 89.2 (6.7) 93.1 SBP (%) 84.0 (9.2) 71.8 (12.6) 42.2 (10.1) 32.0 DBP (%) 82.0 (10.0) 67.9 (10.5) 41.9 (12.9) 29.3 MAP (%) 83.0 (9.5) 69.8 (11.3) 42.1 (10.8) 30.6 Pulse pressure (%) 85.8 (9.5) 76.1 (16.7) 41.9 (15.0) 38.5 Respiration rate (%) 89.1 (8.3) 87.7 (25.6) 93.5 (15.2) 69.3 Temperature (%) 98.3 (0.6) 98.0 (0.9) 98.0 (1.2) 96.9 Perfusion index (%) 84.3 (22.8) 61.9 (16.0) 30.2 (10.4) 22.3 Lactate concentration (%) 103.6 (22.2) 118.0 (26.8) 113.3 (19.7) 168.4 Shock index (%) 112.5 (10.9) 133.5 (22.5) 223.3 (63.1) 366.6 New index (%) 130.6 (43.0) 217.4 (123.0) 442.9 (241.3) 1141.9

I (n ¼ 11)

Class

Training set (%)

TABLE 2. Characteristics of the relative values for the training and the validation sets

Data are the mean (standard deviation). Blood loss (%) ¼ (bleeding/total blood volume) 100. Shock index ¼ heart rate/SBP. New index ¼ lactate concentration/perfusion index 100. P values represent significant differences in total values between the training and validation sets, obtained by the Mann–Whitney U test. DBP indicates diastolic blood pressure; MAP, mean arterial pressure; PU, perfusion unit; SBP, systolic blood pressure.

Blood loss (%) Heart rate (beats/min) SBP (mm Hg) DBP (mm Hg) MAP (mm Hg) Pulse pressure (mm Hg) Respiration rate (breaths/min) Temperature (8C) Perfusion index (PU) Lactate (mmol/L) Shock index New index

I (n ¼ 11)

Training set

TABLE 1. Characteristics of the absolute values for the training and the validation sets

94 CHOI ET AL.

SHOCK JULY 2016


95

TABLE 3. Feature selection rankings for support vector regression models with absolute and relative values in the training set using Spearman’s correlation coefficient (n ¼ 52) Ranking

Absolute values

1 2 3 4 5 6 7 8 9 10 11

Relative values 0.877* 0.877* 0.871* 0.866* 0.835* 0.828* 0.767* 0.663* 0.559* 0.257 0.238

MAP SBP DBP Shock index Pulse pressure New index Perfusion index Lactate concentration Temperature Heart rate Respiration rate

0.898* 0.878* 0.871* 0.861* 0.848* 0.848* 0.752* 0.600* 0.572* 0.365* 0.041

New index MAP DBP Perfusion index SBP Shock index Pulse pressure Lactate concentration Temperature Respiration rate Heart rate

DBP indicates diastolic blood pressure; MAP, mean arterial pressure; SBP, systolic blood pressure. *P < 0.05.

machine learning method using relative values in our previous study. In the comparison of the absolute and relative values, the correlation coefficients of the new index, perfusion index, temperature, and respiration rate in absolute values had lower value than the correlation coefficients in the relative values. However, the correlation coefficients of the SBP, shock index, pulse pressure, lactate concentration, and heart rate in relative values had lower values than the correlation coefficients in the absolute values. Moreover, DBP and MAP showed almost the same values of the correlation coefficients for the absolute and relative values. Multivariate linear regression (MLR)

The final results of the MLR models for blood loss in percent with the absolute and relative values are shown in Table 4. The adjusted R2 of 0.873 for the MLR model with the relative values was slightly better than that of 0.844 with the absolute values. The MLR model with absolute values included lactate concentration, DBP, pulse pressure, and perfusion index. The MLR model with relative values included DBP, lactate concentration, perfusion index, heart rate, and pulse pressure. While the heart rate was excluded in the final MLR model with absolute values, the heart rate of the MLR model with relative values had the smallest value of standardized beta indicating the least effect on the performance of the model. Therefore, these results suggest that DBP, lactate concentration, perfusion index, and pulse

pressure are important variables for predicting blood loss in percent of rats in hypovolaemic shock. Performance of SVR and MLR models for the validation set

We obtained the accuracy, relative classifier information (RCI), and kappa index for the validation dataset to evaluate the performance of the three SVR models and an MLR one with the number of variables determined (Table 5). SVR model with linear kernel with relative values showed the best performance with accuracy of 88.5%, RCI of 0.754, and Kappa index of 0.839 with eight input variables. The SVR models with polynomial and radial basis kernel with both the absolute and relative values showed poor performance. The MLR models with relative values also showed quite a good performance with accuracy of 84.6%, RCI of 0.672, and Kappa index of 0.782 with five variables. The performance of the models with relative values was much better than that with absolute values because large individual difference of perfusion was reduced for the models with the relative values (3, 4). Scatter plots of predicted blood loss in percent

Figure 2 shows the four scatter plots of predicted and actual blood loss in percent for the validation set, and criterion for ATLS shock class. The four boxes in the scatter plots indicate criterion of four ATLS shock classes. The alphabets in the four

TABLE 4. The results of the multivariate linear regression analysis for absolute and relative values predicting blood loss in percent (n ¼ 52) Adjusted R2

Model Absolute value model Lactate concentration DBP Pulse pressure Perfusion index Constant Relative value model DBP Lactate concentration Perfusion index Heart rate Pulse pressure Constant

Beta

Standardized beta

P-value

VIF

5.679 0.448 0.319 0.058 48.462

0.255 0.383 0.347 0.133

0.000 0.002 0.003 0.065

1.214 4.552 3.957 1.612

0.420 0.087 0.194 0.251 0.072 23.243

0.617 0.228 0.343 0.093 0.141

0.000 0.000 0.003 0.076 0.099

5.375 1.264 4.622 1.061 2.818

0.844

0.873

Backward stepwise elimination with a threshold P value of 0.10 was used to select variables for the final model. DBP indicates diastolic blood pressure; VIF, variance inflation factors.


96


CHOI

ET AL.

TABLE 5. Classification performance of the four methods for the validation set (n ¼ 26) Absolute values

Linear SVR Polynomial SVR RBF SVR MLR

Relative values

Accuracy (%)

RCI

Kappa index

No. of features

Accuracy (%)

RCI

Kappa index

No. of features

76.9 42.3 26.9 69.2

0.569 – 0.125 0.479

0.684 0.156 0.093 0.569

8 6 1 4

88.5 69.2 65.4 84.6

0.754 0.506 0.489 0.672

0.839 0.574 0.521 0.782

8 2 3 5

MLR indicates multivariate linear regression; RBF, radial basis function kernel; RCI, relative classifier information; SVR, support vector regression.

plots indicated wrong categorized rats. Figure 2A and B shows predicted blood losses in percent using SVR and MLR for absolute values, respectively while Figure 2C and D shows those using SVR and MLR for relative values. Figure 2C shows best performance with only three errors in the validation set, which are estimated one class higher than the actual class by SVR model with relative values. As error between predicted and actual blood loss in percent decreases, the accuracy of the model for ATLS shock class increases. The incorrectly categorized rats in class 3 are relatively more than other classes because its criterion box is smaller than others. DISCUSSION This study successfully discriminated four ATLS hypovolaemic shock classes in an animal model by predicting blood loss in percent using the regression models. The results demonstrated SVR with linear kernel with relative values as the best model for accurately predicting ATLS class. Moreover, in comparison of results between absolute and relative values, the classification model with relative values showed better performance than those with absolute values. By statistical

analysis with MLR models, we found close associations between blood loss in percent and the four variables such as DBP, lactate concentration, perfusion index, and pulse pressure. The SVR model with linear kernel of machine learning showed better performance than the MLR model. In clinical domain, superiority of machine learning was revealed by several studies (12, 13). SVR is known for the ability to tackle the standard problem of over-fitting, especially in multivariate settings (8). These characteristics of SVR of machine learning could lead to higher performance than the traditional MLR method. However, machine learning approaches have a distinct disadvantage over the MLR model. The variables are difficult to interpret in the machine learning model because of their ‘‘black box’’ nature (14). The regression coefficients from MLR models can be interpreted in a straightforward manner, and it would be important to see improvements in the interpretability of results from machine learning methods (15). In our previous study, to predict the four ATLS classes, three popular machine learning algorithms with four feature selection methods for multicategory classification were applied to a rat model in acute hemorrhage (4). In the present study, classifying four ATLS classes by predicting blood loss in

FIG. 2. Scatter plots of predicted and actual blood loss in percent, and criterion for ATLS shock class in the validation set (n ¼ 26). The scatter plots showed predicted blood loss in percent using (A) SVR for absolute values, (B) MLR for absolute values, (C) SVR for relative values, and (D) MLR for relative values. The four boxes in scatter plots indicated criterion of ATLS shock class. The alphabets in four plots indicated wrong categorized rats. MLR indicates multivariate linear regression; SVR, support vector regression.


SHOCK JULY 2016 percent could perform the same objects of direct multicategory classification methods by using the variables such as vital signs instead of blood loss in the previous study. In the comparisons of performance between the present and previous studies, predicting blood loss in percent using SVR and MLR models (accuracy: 88.5%, 84.6%, respectively) was better than the direct multicategory classification method using the SVMOVO (one versus one) model (accuracy: 80.8%) for the same validation data set. Information loss occurs when continuous data are grouped into discrete intervals (16). The rats in our previous study were grouped according to ATLS criterion based on the blood loss in percent to determine ATLS shock class, so the characteristics information loss of the blood loss in percent occurred. Moreover, the extension of SVM to multicategory problems is more complicated because of the iteration of binary classifiers (15). Therefore, the model for classifying ATLS shock class by predicting blood loss in percent using SVR or MLR model constructed in this study is more accurate and simpler. DBP, lactate concentration, perfusion index, and pulse pressure were selected as important variables for predicting blood loss (in percent) in rats in hypovolemic shock via MLR. DBP and pulse pressure are closely related to blood loss and have already been utilized by ATLS. As a product of anaerobic glycolysis, lactate concentration indirectly indicates oxygen debt. Many studies demonstrated significant increases in serum lactate concentration in response to hemorrhage (3). Moreover, a new portable blood lactate analyzer (Lactate Pro 2 LT-1730; ARKRAY Inc, Kyoto, Japan) can measure blood samples of only 0.3 mL within 15 s in the field (4). Therefore, the measurement of lactate concentration as a good predictor of blood loss in the field is simple and easy, as well as noninvasive. Perfusion index was significantly correlated with blood loss in Table 4, and the perfusion index decreased in response to hemorrhage. Choi et al. (3) and Kaiser et al. (17) reported that perfusion index responded to severe hemorrhage earlier than blood pressure. Noninvasive monitoring of the perfusion index could be an early and sensitive marker of vital tissue hypoperfusion, considering that, in circulatory failure, blood flow is diverted from less important tissues (e.g., skin, subcutaneous tissue, muscle, gastrointestinal tract) to vital organs (e.g., heart, brain, kidneys) (18). The advantages of laser Doppler flowmetry include less complexity, noninvasiveness, and the ability to continuously monitor microcirculatory blood flow in real time, although it is expensive. In Table 3, the ranks of new index and perfusion index with relative values were first and fourth, respectively, but those with absolute values decreased to sixth and seventh. The perfusion index was recommended for relative changes rather than absolute values, due to large individual differences, as mentioned in our previous study (4). This disadvantage of the perfusion index in absolute values also deteriorates the new index, because the new index is defined as the ratio of lactate concentration to perfusion index. In the meanwhile, the DBP and MAP were not almost influenced by absolute or relative values, and the SBP and pulse pressure in absolute values had higher correlation coefficients than those in relative values. This indicates that the analysis of the relative change values


97

caused information loss of SBP for association with blood loss in percent. Therefore, the shock index in absolute values showed higher correlation with blood loss in percent than the new index. Although the perfusion index showed the relatively low correlation with blood loss in percent, they were selected by the MLR model with both absolute and relative values (Table 4). Moreover, the classification performances of SVR with linear kernel and MLR with relative values were quite higher than those with absolute ones. However, as it is difficult to determine each individual’s resting values in emergency situations for relative values, we proposed replacing resting variables with mean values for humans (4). This study demonstrated possibility of classifying multiple outcomes by predicting continuous variable (blood loss) using the regression models, and showed superiority of these methods compared with direct multicategory classification such as support vector machine in our previous study (4). Predicting continuous variable and discriminating multicategory group by standard criterion for diagnosis could be simple and perform the same purpose of direct multicategory classification. This method could apply other disease prediction, such as classifying normal, osteopenia, or osteoporosis by predicting T-score of bone mineral density, and using diagnostic criterion stated by the World Health Organization (19). Moreover, it could also discriminate the prediabetes from diabetes by predicting a fasting plasma glucose level or HbA1c level with diagnostic criterion (20). Early diagnosis and intervention for prediabetes could prevent complications, prevent the transition to diabetes, and be cost-effective (21, 22). However, this approach for multicategory classification also has disadvantages when compared with the direct multicategory classification models. First, this method cannot apply all multicategory classifications because it needs special conditions, which are continuous dependant variables and obvious diagnostic criterion. Second, the variables that were selected by the regression models to predict blood loss in percent were not directly associated with ATLS shock class. However, the blood loss in percent could be a mediating variable between four ATLS shock classes and the selected variables in this study, because the blood loss is a diagnostic criterion. Lu et al. (23) investigated the buccal partial pressure of carbon dioxide (PCO2) in rats with hemorrhagic shock and compared the data with traditional vital signs and perfusion index. The buccal PCO2 differed significantly among four groups (no bleeding, 25%, 35%, and 45% blood loss) and approximately 10 min earlier than shock index, heart rate, SBP, and MAP; additionally, PCO2 correlated with perfusion index. Jefferson et al. (24) investigated a prediction model of hemorrhagic blood loss using mean blood pressure, PaO2, SBP, and base excess in 33 rats using machine learning. The model included PaO2 and base excess of biochemical responses, which are difficult to measure in the field, and did not investigate perfusion index or lactate concentration. In contrast, our study predicted blood loss and determined four ATLS shock classes using two times the number of rats that were in Jefferson’s study. Our study warrants to verify the reproducibility of perfusion index and lactate concentration measurements in prehospital


98


setting in humans. Perfusion index and lactate concentration measurements in humans are simple and readily available because these require no special measurement skill, and that the measurement time is less than 2 min, including equipment setup time (4). Reproducibility of perfusion index measurement was obtained in our previous human study for diabetic neuropathy (9.5%, n ¼ 125, reproducibility ¼ standard deviation/ mean 100%) (25). This study has four main limitations. First, the sample size was relatively small, particularly in the validation set. Second, we did not include measurements of coagulation or hemostasis using the Prothrombin Time or Partial Thromboplastin Time methods in the present study, which would be associated with blood loss. International normalized ratio that is a derivative of prothrombin time can be available in the prehospital setting with portable selfmonitoring devices. In the future study, it would be useful to investigate the prediction model for blood loss with variables for functional coagulation. Third, we propose to use resting variables with normally distributed data ranges for humans in the model with relative changes, because resting values are not known, especially in emergency situations. Fourth, a larger animal model is warranted to provide more clinical relevance in the future. Given that we used rats to strictly control the hemorrhagic shock model, our experiment cannot be repeated for humans. However, this study showed the potential of perfusion index and lactate concentration for hemorrhage, which are not currently measured in emergency situations. In conclusion, we introduced a new approach for discriminating ATLS shock class using regression models for predicting blood loss in percent. The regression model showed better performance than the direct multicategory classification method, which was shown in our previous study. Moreover, the simple MLR models with both absolute and relative values could give possibility of the clinical decision support system for ATLS shock class, and provide association between the independent variables and blood loss. The perfusion index and the new index are suggested as new variables in relative changes for classifying ATLS classes. REFERENCES 1. Spinella PC, Holcomb JB: Resuscitation and transfusion principles for traumatic hemorrhagic shock. Blood Rev 23(6):231–240, 2009. 2. Mutschler M, Nienaber U, Brockamp T, Wafaisade A, Wyen H, Peiniger S, Paffrath T, Bouillon B, Maegele M, TraumaRegister DGU. A critical reappraisal of the ATLS classification of hypovolaemic shock: does it really reflect clinical reality? Resuscitation 84(3):309–313, 2013. 3. Choi JY, Lee WH, Yoo TK, Park I, Kim DW: A new severity predicting index for hemorrhagic shock using lactate concentration and peripheral perfusion in a rat model. Shock 38(6):635–641, 2012. 4. Choi SB, Park JS, Chung JW, Kim SW, Kim DW: Prediction of ATLS hypovolemic shock class in rats using the perfusion index and lactate concentration. Shock 43(4):361–368, 2015.

CHOI

ET AL.

5. Zhang R, Huang GB, Sundararajan N, Saratchandran P: Multicategory classification using an extreme learning machine for microarray gene expression cancer diagnosis. IEEE/ACM Trans Comput Biol Bioinform 4(3):485–495, 2007. 6. Hoffman H, Lee SI, Garst JH, Lu DS, Li CH, Nagasawa DT, Ghalehsari N, Jahanforouz N, Razaghy M, Espinal M, et al.: Use of multivariate linear regression and support vector regression to predict functional outcome after surgery for cervical spondylotic myelopathy. J Clin Neurosci 22(9):1444–1449, 2015. 7. Saeys Y, Inza I, Larran˜aga P: A review of feature selection techniques in bioinformatics. Bioinformatics 23(19):2507–2517, 2007. 8. Basak D, Pal S, Patranabis DC: Support vector regression. Neural Inf Process Lett Rev 11(10):203–224, 2007. 9. Parrella F: Online support vector regression. Department of Information Science, University of Genoa, Italy, 2007. Available at: http://onlinesvr.altervista. org/. Accessed August 18, 2015. 10. Guly HR, Bouamra O, Spiers M, Dark P, Coats T, Lecky FE: Vital signs and estimated blood loss in patients with major trauma: testing the validity of the ATLS classification of hypovolaemic shock. Resuscitation 82(5):556–559, 2011. 11. Cho KH, Kang JH, Ki SJ, Park Y, Cha SM, Kim JH: Determination of the optimal parameters in regression models for the prediction of chlorophyll-a: a case study of the Yeongsan Reservoir, Korea. Sci Total Environ 407(8):2536– 2545, 2009. 12. Yoo TK, Kim SK, Kim DW, Choi JY, Lee WH, Oh E, Park EC: Osteoporosis risk prediction for bone mineral density assessment of postmenopausal women using machine learning. Yonsei Med J 54(6):1321–1330, 2013. 13. Hsieh CH, Lu RH, Lee NH, Chiu WT, Hsu MH, Li YC: Novel solutions for an old disease: diagnosis of acute appendicitis with random forest, support vector machines, and artificial neural networks. Surgery 149(1):87–93, 2011. 14. Goetz JN, Brenning A, Petschko H, Leopold P: Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling. Comput Geosci 81(1):1–11, 2015. 15. Kruppa J, Liu Y, Biau G, Kohler M, Ko¨nig IR, Malley JD, Ziegler A: Probability estimation with machine learning methods for dichotomous and multicategory outcome: theory. Biom J 56(4):534–563, 2014. 16. Shaw DG, Huffman MD, Haviland MG: Grouping continuous data in discrete intervals: information loss and recovery. JEM 24(2):167–173, 1987. 17. Kaiser ML, Kong AP, Steward E, Whealon M, Patel M, Hoyt DB, Cinat ME: Laser Doppler imaging for early detection of hemorrhage. J Trauma 71(2):401– 406, 2011. 18. Lima A, Bakker J: Noninvasive monitoring of peripheral perfusion. Intensive Care Med 31(10):1316–1326, 2005. 19. Brown TT, Qaqish RB: Antiretroviral therapy and the prevalence of osteopenia and osteoporosis: a meta-analytic review. AIDS 20(17):2165–2174, 2006. 20. Tuomilehto J, Lindstro¨m J, Eriksson JG, Valle TT, Ha¨ma¨la¨inen H, IlanneParikka P, Keina¨nen-Kiukaanniemi S, Laakso M, Louheranta A, Rastas M, et al.: Finnish Diabetes Prevention Study Group: prevention of type 2 diabetes mellitus by changes in lifestyle among subjects with impaired glucose tolerance. N Engl J Med 344(18):1343–1350, 2001. 21. Bertram MY, Lim SS, Barendregt JJ, Vos T: Assessing the cost-effectiveness of drug and lifestyle intervention following opportunistic screening for pre-diabetes in primary care. Diabetologia 53(5):875–881, 2010. 22. Choi SB, Kim WJ, Yoo TK, Park JS, Chung JW, Lee YH, Kang ES, Kim DW: Screening for prediabetes using machine learning models. Comput Math Methods Med 2014:618976, 2014. 23. Lu H, Zheng J, Zhao P, Zhang G, Wu T: Buccal partial pressure of carbon dioxide outweighs traditional vital signs in predicting the severity of hemorrhagic shock in a rat model. J Surg Res 187(1):262–269, 2014. 24. Jefferson MF, Pendleton N, Mohamed S, Kirkman E, Little RA, Lucas SB, Horan MA: Prediction of hemorrhagic blood loss with a genetic algorithm neural network. J Appl Physiol 84(1):357–361, 1998. 25. Kim SW, Kim SC, Nam KC, Kang ES, Im JJ, Kim DW: A new method of screening for diabetic neuropathy using laser Doppler and photoplethysmography. Med Biol Eng Comput 46(1):61–67, 2008.


Hypovolemic shock.

Management of hypovolemic shock.

Pediatric readmission classification using stacked regularized logistic regression models.

Regional gastric mucosal blood flow in hypovolemic shock.

Individualized Prediction of Heat Stress in Firefighters: A Data-Driven Approach Using Classification and Regression Trees.

Vital Sign Prediction of Adverse Maternal Outcomes in Women with Hypovolemic Shock: The Role of Shock Index.

Hypovolemic Shock Caused by a Ruptured Hemorrhagic Hepatic Cyst.

Whole pancreatic blood flow and islet blood flow in hypovolemic hypotension in rats.

Prediction of early breast cancer metastasis from DNA microarray data using high-dimensional cox regression models.

Spinal cord ischemia secondary to hypovolemic shock.

Measuring familial aggregation by using odds-ratio regression models.

Genomic prediction based on data from three layer lines using non-linear regression models.

Hypovolemic shock in acute lethal T-2 mycotoxicosis.

Non-puerperal Uterine Inversion Presented with Hypovolemic Shock.

An analysis of first-time blood donors return behaviour using regression models.

Hypertonic saline in the traumatic hypovolemic shock: meta-analysis.

[Propranolol and lactatemia during hypovolemic shock: a case report].

An Atypical Case of Eosinophilic Gastroenteritis Presenting as Hypovolemic Shock.

Vagal hypotension after percutaneous biopsy: possible confusion with hypovolemic shock.

Developing EHR-driven heart failure risk prediction models using CPXR(Log) with the probabilistic loss function.

Classification Models to Predict Survival of Kidney Transplant Recipients Using Two Intelligent Techniques of Data Mining and Logistic Regression.

Texture classification and retrieval using shearlets and linear regression.

Electrocardiogram classification using reservoir computing with logistic regression.

Comparing in-patient classification systems: a problem of non-nested regression models.