In a large (N = 2545) sample of level 1 trauma center patients aged 17 years and older with GCS 3–15 TBI, we used IRT to model the continuum of TBI severity from 23 clinical, head CT, and blood-based biomarker features assessable soon after injury. Our findings were fourfold. First, finding good fit of a 1-factor model provided empirical support for the widespread assumptions of a dimension of TBI severity that can be indexed, in part, by the individual GCS components, duration of altered consciousness (LOC, PTA), and specific clinical head CT findings. Second, IRT information curves provided a novel view of the level of TBI severity indexed by each indicator and the relative ability of the indicators to characterize individuals’ positions along the severity continuum. Third, we demonstrated that blood-based biomarkers collected within 24 hours of injury, especially GFAP, contribute to indexing the entire continuum of injury. Finally, we demonstrated the validity of novel IRT-based TBI severity scores by showing that these related as expected with traditional GCS-based severity categories and incrementally improved prediction of functional outcome beyond GCS-based severity categories and IMPACT scores.
This study fills a need to integrate data from multiple measurement domains to develop evidence-based, pragmatic TBI severity grading approaches. It also provides an initial demonstration of the utility of IRT for addressing longstanding challenges in the classification and staging of TBI, a problem that could be further addressed through additional applications of this tool–for example, by investigating whether adding other acute severity indicators (e.g., pupillary reactivity, hypotension, hypoxia) 14, or variables reflecting patients’ evolving clinical course, improves characterization of different levels of TBI severity41. IRT also provides a quantitative framework for developing simpler classification systems, through examination of effects on model precision of omitting redundant or lower-performing variables, to balance goals of precision and parsimony.
Our results contribute to growing evidence that incorporating blood-based biomarkers may improve characterization of TBI severity41. For example, our findings suggest that GFAP, currently FDA-cleared to assist in the decision to pursue clinical neuroimaging19, may also indicate level of TBI severity–by contributing to differentiation especially (relative to clinical and CT variables) in the lower half of the severity spectrum, where most individuals show no other objective (e.g., CT) biomarkers of injury. Two other widely used TBI biomarkers, UCH-L1 and S100B, improved characterization of severity but less so than GFAP. However, we caution against drawing firm conclusions about the relative performance of biomarkers from this study alone, as the markers we used vary widely in their half-lives, and TRACK-TBI blood samples were collected at a more optimal timepoint for detecting GFAP.
A strength of this study was that IRT analyses included the full TRACK-TBI sample, a feature enabled by limited missingness on most variables and the use of full-information methods for IRT parameter estimation. This, alongside the diverse 18-site sample, theoretically maximizes generalizability of the model to U.S. level 1 trauma centers. Moreover, in emphasizing the most widely used variables in current TBI severity classification approaches (e.g., GCS, LOC/PTA, head CT findings), our findings directly inform current classification approaches.
We recognize several limitations. Additional investigations will inform the utility of adding other candidate severity indicators to the model. Future studies may also verify model fit to important subgroups. Further, measurements of some variables may not have been optimal. Blood for the current analyses was sampled at one timepoint within 24 hours of injury (12 hours or later for many samples), often missing the early peak of UCH-L1 and S100B and preceding the peak of hsCRP. The relative information provided by the blood-based biomarkers should be interpreted with this limitation in mind. Additionally, assessments of LOC and PTA duration were not standardized, and PTA was not assessed through serial formal assessment. While this underscores their value for measuring TBI severity, it also raises the possibility of that these clinical signs could be more informative if assessed under more controlled conditions.
It is worth noting that there are a multitude of quantitative approaches to staging, or grading, disease severity. For example, IMPACT scores were developed to predict functional outcome and, in turn, provide an early estimate of TBI severity. In contrast, our IRT-based scores were developed without regard to outcomes, instead relying on the assumption of a latent dimension underlying observed associations among TBI indicators and modeling that dimension using the observed indicators in a manner that best reproduces covariation among them. Of note, the finding that the IRT-based severity scores incrementally predicted functional outcomes when combined with IMPACT scores suggests that IRT methods can complement these validated approaches aimed at maximizing prognostication in conceptualizing and quantifying TBI severity.
Taken together, our study provides novel, clinically interpretable findings regarding the manner in which diverse clinical, neuroimaging, and blood-based biomarker variables contribute to indexing the continuum of TBI severity. The IRT approach is sufficiently flexible to enable future integration of other measures related to physiological brain injury and outcome, such as lesion laterality and location, post-acute clinical decline and radiographic progression of intracranial injury, markers of secondary injury, and potentially psychosocial or environmental factors. IRT methodology can help delineate the contributions of diverse variables to a composite, comprehensive TBI severity classification, while improving the understanding of the varied premorbid and injury factors that have limited traditional approaches to severity classification. A number of these indicators are currently under curation in TRACK-TBI and could be included in follow-up studies. In summary, this study demonstrates the potential utility of IRT to contribute to developing and validating practical, empirically-supported TBI severity grading systems.