There is insufficient evidence to support a strong recommendation about the diagnostic accuracy of instruments and models in predicting survival in pTBI.
Level IIIn the absence of direct scientific evidence, EXPERT CONSENSUS concluded that:
No evidence or expert opinion supported distinct recommendations based on patient gender.
The St. Louis Scale for Pediatric Gunshot Wounds to the Head specifically investigated patients under the age of 19.
Only the WHIS was specifically evaluated in military and civilian populations and stratified by wounding mechanism.
Penetrating TBI has one of the highest case fatality rates of all types of injury. Mortality rates in excess of 90% for patients with gunshot wounds to the head have been reported in some series (2053059). However, true natural history data for pTBI is lacking. Given the issues of therapeutic nihilism in patients with pTBI, it is not surprising that reported rates are so high as the perception of a poor prognosis can become a self-fulfilling prophecy. Tools that allow health care teams to accurately predict mortality, or survivability, are badly needed. The ability to accurately predict who may survive their injury allows for better allocation of resources and ability to communicate accurately with families. Over approximately the past ten years, an increased body of literature has developed and validated a variety of instruments and models that might help predict survivability following pTBI. The various models use a combination of patient characteristics, injury characteristics, physical exam, radiographic findings, and laboratory values to derive prognostic models for survival following pTBI.
Previous versions of the guidelines did not investigate instruments and models to predict morbidity and mortality following pTBI. However, an extensive section of the first edition of these guidelines, published in 2001, compiled evidence supporting a wide variety of variables as prognosticators in pTBI (11505200). Age, intentionality (i.e. self-inflicted versus assault/homicide), anatomic wounding trajectory (i.e. perforating versus tangential tracts), hypotension, coagulopathy, respiratory distress, low GCS score, presence of fixed dilated pupils, elevated intracranial pressure, and anatomical features of injury on CT were all found to correlate with mortality and/or poor functional outcome. Many of these features are utilized in the instruments and models described above as variables used to calculate the predictive scoring.
Fourteen studies (15 publications, N=1,988 with penetrating injuries) assessed the diagnostic accuracy of instruments and models for predicting survival or mortality in penetrating traumatic brain injury (pTBI).
As is generally accepted, an AUC of 0.5 suggests no discrimination 0.7 to 0.8 is considered acceptable, 0.8 to 0.9 is considered excellent, and more than 0.9 is considered outstanding (PMID: 20736804). All studies included civilian subjects, while one study also included a separately analyzed military group.
The Baylor score (range 1 to 5 with higher scores indicate greater likelihood of mortality) is based on GCS, pupil response, age, and bullet trajectory. One study (N=297) reported discriminative ability of 0.8756,
The ABC score, based on admission blood pressure (A), brain spillage (B), and presence or absence of consciousness (C), yielded an outstanding discriminative ability (AUROC 0.968), in one study reporting AUROC (N=102).
The REMS is based on GCS, age, heart rate, respiratory rate, and mean arterial pressure (MAP) (range 0 to 26 with higher scores indicating greater likelihood of mortality, cutoff not reported) performed much less well in one study (N=134), with AUROC 0.583 indicating little discriminative ability to determine likelihood of mortality versus survival.
The WHIS is based on the GCS and ISS score, and was used in one study that reported results for a military population with ballistic pTBI (N=43) and a civilian population with non-ballistic pTBI (N=41).
The Maritzburg score, based on the motor GCS and brain spillage, was discussed in the same population as the ABC score above that reported AUROC (N=102), and had outstanding discriminative ability to determine those likely to survive versus those likely not to survive (AUROC 0.94).
The Rotterdam score, the Marshall score, the Stockholm score, and the Helsinki score rely solely on findings from head CT. The Marshall score is based on the degree of cerebral swelling (i.e., midline shift, compression of basal cisterns) and the presence/size of contusions/ hemorrhages. The Rotterdam score also includes midline shift and the degree of basal cistern compression but does not include the presence of contusion and limits other criteria to the presence of an epidural mass lesion and intraventricular blood/traumatic subarachnoid hemorrhage (SAH). The Stockholm score is based on SAH in convexities, basal cisterns, intraventricular hemorrhage, midline shift, epidural hemorrhage, diffuse axonal injury, and dual-sided subdural hematoma. The Helsinki score is based on the presence of subdural hematoma, intracerebral hematoma, epidural hematoma, hematoma volume, intraventricular hemorrhage, and compression of basal cisterns. One small study (N=87) reported the discriminative ability of the Marshall and the Rotterdam scores (Marshal: AUROC 0.874; Rotterdam: AUROC 0.905) for determining the likelihood of survival or not.
The St. Louis scale for pediatric gunshot wounds to the head is based on pupil response, bullet trajectory, and lobes of injury. One study (N=71) did not report discrimination ability of the St Louis scale, but reported that with a cutoff score of 5 or above the scale had a sensitivity of 94.12%, specificity of 75.68%, positive predictive value (PPV) of 78.05%, and a negative predictive value (NPV) of 93.33%.
Another study (N=101) reported outstanding discriminative ability using the continuous motor GCS plus ISS score (AUROC 0.951).
A novel scoring system developed by Gressot et al., (N=119) using age (>35 years), GCS score (3 or 4), nonreactive pupils, and posterior fossa or bihemispheric trajectory was found to correlate with mortality outcomes; while measures of diagnostic accuracy were not reported, patients with scores of 3 to 5 had a 75% mortality rate and patients with scores of 0 to 1 had a 25% mortality rate (50% mortality in patients with score of 2).
Although many of the instruments described above have good accuracy for determination of survivability following pTBI, all are imperfect, especially when predictions for a single patient are considered. The SPIN score is currently the most studied and validated model although numerous models performed well in initial investigations.
When using these prediction models, their inherent limitations must be considered. Predictive modeling is based on probabilities across a population and therefore carry inherent inaccuracies when applied to individual patients especially in the acute post-injury time-frame. Additionally, the type of data used to generate these models must be considered. As most modern pTBI patients die from withdrawal of life-sustaining therapy, prognostic models may not be based on natural history data but instead a perception of a bad outcome. It is also important to consider the relationship between prediction models and nihilism. The SPIN score was developed with an interest in reducing nihilism, demonstrating that patients with pTBI can achieve acceptable outcomes with aggressive care. By contrast, however, the use of prognostic calculators has been associated with nihilism (19218017, 29239321).
It should also be noted that this key question focused only on survival and not on functional outcome which is clearly a subject of critical importance to patients and their families. It is critical to frame family meetings with discussions of functional outcomes which are as important, if not more important, than just survival in determining the extent of the care rendered. Care should be taken in discussion with families when issues of survival or functional outcome are considered. As a general rule, the terms survivability - which indicates likelihood of whether the patient will live or die - and salvageability - which is related to a likelihood of a poor functional outcome - may be considered.
The use of prognostic calculators generated a substantial amount of discussion amongst our panelists and, in particular, there was great concern that the predictions they generate could be used or misused to limit care of patients who might have a chance of an acceptable outcome. Our panelists expressed concern about nihilism across the breadth of neurotrauma but felt that problems with nihilism were more significant as it relates to pTBI. Concern was expressed that many clinicians currently judge care for those with a gunshot wound to the head as automatically futile despite the fact that many of these patients can have good neurological outcomes. Indeed, in considering the fact that military patients have better outcomes than civilian patients despite higher caliber weapons in combat, it is believed that the highly aggressive approach taken to pTBI in the military is likely the key factor underlying this difference.
Admittedly, the high lethality of suicide-related pTBI is likely also a factor. Although there is very little published data regarding nihilism, our group felt a strong need to combat inappropriate therapeutic nihilism despite our limited ability to write evidence-based guidelines on the topic. Acknowledging that the TQIP guidelines recommend at least 72h of aggressive care before making withdrawal of life sustaining therapy decisions, our group felt that this timepoint may be premature given that important neurological improvements are often seen after this time point.
Our group ultimately felt that existing prognostic calculators should not be used to drive care-limiting decisions. While the information inherent to prognostic calculations can provide a helpful benchmark and can have utility is educating substitute decision-makers, the imprecision of these tools in the care of an individual patient must be considered.
Prognostication has been a major recent advance in TBI care. There will be a perpetual push to develop increasingly accurate yet simpler models that are accurate earlier and earlier in the course of care. As models become more accurate it may be reasonable for them to play a greater role in patient care decisions. Machine learning algorithms, for example, are starting to out-perform regression models and are anticipated to advance pTBI prognostication in the near future.
The enthusiasm to develop prediction models has not been equaled by efforts to delineate how to use prognostic information. What is a good outcome? What is an acceptable chance of achieving a good outcome? How accurate does a prognostic calculator need to be to support routine clinical use? A tremendous amount of additional research is needed in this regard. Implicit in the term nihilism is the assertion that physicians play a key role in medical decision-making in critically ill patients. The extent to which decision-making in these patients is made by physicians vs. well-informed substitute decision-makers is also very deserving of discussion. It would also be beneficial for prognostic models to incorporate longer-term functional outcomes as opposed to only short-term survival.