3.1. Descriptive Statistics-Quantitative Data
Descriptive statistics for all data are freely available online at https://rpubs.com/R-Minator/heart [62]. Roll-up for the quantitative data are provided in Table 2. The average hospital observation during any given year had about 1,600 observations of DRG 291, 292, and 293 (median of 383). That same hospital had about 147 staffed beds (median of 86), 7 thousand discharges (median of 2.8 thousand), and about 6.4 thousand surgeries (median of 4.5 thousand). The average hospital had positive income (in millions) of $17.3 (median of $2.03), significant cash-on-hand ($20.3 thousand, median of $1.95 thousand), and positive equity. The typical hospital had just over 1,000 employees (median of 436) with 232 affiliated physicians (median of 104) and was reimbursed 45% by Medicare (median of 42%). Only 9% reimbursement was from Medicaid (median of 6%).
Table 2. Descriptive statistics for the study (dollars in millions)
|
|
|
|
|
|
n=40,257
|
Mean
|
SD
|
Median
|
Min
|
Max
|
|
Number DRGs
|
1,640.258
|
3,334.942
|
385
|
11
|
57,461
|
|
Staffed Beds
|
146.507
|
172.468
|
86
|
2
|
2,753
|
|
Affiliated Physicians
|
231.786
|
353.461
|
104
|
1
|
4,328
|
|
Employees
|
1,008.034
|
1,683.991
|
436
|
4
|
26,491
|
|
Percent Medicare
|
0.448
|
0.186
|
0.422
|
0
|
0.983
|
|
Percent Medicaid
|
0.087
|
0.091
|
0.063
|
0
|
0.869
|
|
Discharges
|
7,014.259
|
9,908.036
|
2,811
|
1
|
129,339
|
|
ER Visits
|
32,864.497
|
3,3976.188
|
25,085
|
0
|
543,457
|
|
Surgeries
|
6,349.317
|
7,987.273
|
4,464
|
0
|
130,741
|
|
Net Income ($ in M)
|
$17.23
|
$117.65
|
$2.04
|
-$1.21
|
$3,31
|
|
Cash on Hand ($ in M)
|
$20.28
|
$120.24
|
$1.99
|
-$2.51
|
$3.88
|
|
Profit Margin
|
-0.03
|
1.25
|
-0.02
|
-15.45
|
62.07
|
|
Equity ($ in M)
|
$174.11
|
$625.76
|
$33.94
|
-$3.25
|
$10.24
|
|
Year over year, both DRGs and rates of DRGs per 1000 population increased as illustrated in Figure 2 and Figure 3, respectively. The significance of the DRG increase is the financial consideration. The significance of the rate of DRG increase is the epidemiological consideration. If the DRG rate is considered a proxy for incidence rate, then there is either a significant increase, a coding issue, or something else. These considerations are found in the discussion section. One might expect the DRG rate graph to remain horizontal (static). Independent variables remained relatively constant year-over-year likely due to repeated measures on the same facilities.
3.3. Descriptive Statistics-Categorical Data
California, Texas, and Florida had the largest number of diagnoses for all years and year-over-year, largely due to population size, with averages of 1.7 million, 1.6 million, and 1.5 million, respectively. When adjusted per 1000 population, the District of Columbia, West Virginia, and Delaware dominated the with total rates per 1,000 population of 109, 103, and 94, respectively. Utah, Hawaii, and Colorado had the smallest average rates, 26, 29, and 35, respectively.
Most hospitals were in urban settings (58%). Fifty-two percent were voluntary non-profits with 29% proprietary and 18.7% governmental. The vast majority ( 75%) had no affiliation with a medical school and were short-term care facilities (60%). Nearly no hospitals were classified as Department of Defense (DoD) or children’s hospitals.
3.4. Descriptive Statistics-Financial Estimates
In FY 2008, the Centers for Medicare and Medicaid (CMS) estimated that heart failure DRGs 291, 292, and 293 national average total costs per case were $10.235, $6.882, and $5.038 thousand, respectively. By FY 2012, CMS increased those estimates to $11.437, $7.841, $5.400 thousand, respectively. In four years, the accumulation rates (1 plus the inflation rate) were 1.139, 1.117, and 1.072 for the DRGs in ascending order. Using these accumulation rates, estimates for 2016, 2017, and 2018 were generated. Table 3 shows these extrapolated estimates.
Table 3. Estimated total costs for heart failure by DRG in thousands, linear extrapolation method
DRG
|
2016
|
2017
|
2018
|
DRG 291
|
$12,780
|
$13,155
|
$13,243
|
DRG 292
|
$8,934
|
$9,245
|
$9,257
|
DRG 293
|
$5,788
|
$5,891
|
$5,998
|
Another method for estimating these costs involved the use of the Federal Reserve Bank of Saint Louis (FRED) producer price index for general medical and surgical hospitals [63]. The annual accumulation rates for 2013 through 2018 were estimated as 1.022, 1.012, 1,007, 1.013, 1.018, and 1.023, respectively. Applying these to the 2012 total costs from CMS results in Table 4 estimates for 2016 through 2018.
Table 4. Estimated total costs for heart failure by DRG in thousands, medical inflation rate method
DRG
|
2016
|
2017
|
2018
|
DRG 291
|
$12,058
|
$12,273
|
$12,582
|
DRG 292
|
$8,267
|
$8,414
|
$8,626
|
DRG 293
|
$5,693
|
$5,795
|
$5,491
|
Both estimates are reasonably close. To estimate costs, we used both of these tables separately as upper and lower bounds. Since these total costs represent only CMS costs, the actual financial burden across all payers is likely underestimated as commercial third-party insurers can reimburse up to 90% more than Medicare for the same diagnosis [64]. Figure 4 illustrates the number of DRGs by year, while Figure 5 shows the associated aggregate cost estimates.
In Figure 4, it is clear that DRG 291, the DRG with the highest average reimbursement rate per case, has increased nonlinearly, while DRG2 292 has seen a small drop, and DRG 293 is flat. In Figure 5, the total cost estimates for 2018 are nearly $66 billion more than 2016 on average. DRG 291, the most expensive DRG, has seen reimbursement increases of $92 billion on average. Reasons for such an increase are explored in the discussion section.
3.4. Descriptive Statistics-Correlational Analysis
Hierarchical clustered correlation analysis of quantitative variables (Figure 6) illustrates tight relationships among many variables. This type of correlation analysis clusters variables based on distance measures (e.g., Euclidean), so that those which are most highly correlated are close in location. These variables are then placed into a correlation plot or correlogram. Figure 6 illustrates that discharges and staffed beds are most closely associated with the number of diagnoses, our primary variable of interest. More importantly, the workload variables appear to have significant collinearity that must be addressed for regression-based models.
Analysis of the relationship between some categorical variables and the number of diagnoses also proved interesting. Notched boxplots by year and medical school affiliation reveal that hospitals with major medical school affiliations experience a larger number of diagnoses at the .05 level, a result that is to be expected. (See Figure 7). Further, voluntary not-for-profits see a larger number of diagnoses (Figure 8).
3.5. Explanatory Models for Heart Failure Diagnoses, Hospital Unit of Analysis
3.5.1. Regression Models
Linear, lasso, and elastic net regression evaluated the number of diagnoses as a function of all other variables. Models built on the training set and applied to the training and test sets resulted in predicted R2 values of 0.501, 0.328, and 0.417 (training) and 0.454, 0.323, 0.348 (test) for the OLS, lasso, and elastic net models, respectively. The OLS model predicted better than the constrained regression models and did not overfit. Table 5 provides the coefficient estimates for all variables after fitting on the entire dataset
Table 5. Results of regression analyses for the number of diagnoses, hospital unit of analysis
Variable
|
Linear
|
|
Lasso
|
Elastic Net
|
Variable
|
Linear
|
|
Lasso
|
Elastic Net
|
Workload
|
-0.439
|
***
|
-0.298
|
-0.323
|
State_MA
|
0.010
|
**
|
0.000
|
0.000
|
Net Income
|
-0.043
|
***
|
0.000
|
0.000
|
State_MD
|
0.013
|
***
|
0.000
|
0.000
|
Profit Margin
|
0.026
|
***
|
0.000
|
0.000
|
State_ME
|
-0.002
|
|
0.000
|
0.000
|
Cash on Hand
|
-0.082
|
*
|
0.000
|
0.000
|
State_MI
|
0.013
|
***
|
0.000
|
0.000
|
Equity
|
0.012
|
***
|
0.000
|
0.000
|
State_MN
|
0.003
|
|
0.000
|
0.000
|
% Medicare
|
0.029
|
**
|
0.000
|
0.000
|
State_MO
|
0.005
|
|
0.000
|
0.000
|
% Medicaid
|
-0.002
|
***
|
0.000
|
0.000
|
State_MS
|
0.003
|
|
0.000
|
0.000
|
Proprietary Ownership
|
0.003
|
|
0.000
|
0.000
|
State_MT
|
0.004
|
|
0.000
|
0.000
|
Non-profit Ownership
|
0.006
|
***
|
0.000
|
0.000
|
State_NC
|
0.016
|
***
|
0.000
|
0.000
|
Limited Med Sch Aff
|
0.002
|
***
|
0.000
|
0.000
|
State_ND
|
0.004
|
|
0.000
|
0.000
|
Major Med Sch Aff
|
-0.009
|
|
0.000
|
0.000
|
State_NE
|
0.001
|
|
0.000
|
0.000
|
No Med Sch Aff
|
0.000
|
***
|
0.000
|
0.000
|
State_NH
|
0.002
|
|
0.000
|
0.000
|
Unknown Med Sch Aff
|
-0.005
|
|
0.000
|
0.000
|
State_NJ
|
0.012
|
***
|
0.000
|
0.000
|
Critical Access Hospital
|
0.074
|
|
0.000
|
0.000
|
State_NM
|
-0.002
|
|
0.000
|
0.000
|
DoD Hospital
|
0.000
|
***
|
0.000
|
0.000
|
State_NV
|
0.007
|
|
0.000
|
0.000
|
LTAC Hospital
|
0.048
|
***
|
0.000
|
0.000
|
State_NY
|
-0.005
|
|
0.000
|
0.000
|
Psych Hospital
|
0.067
|
***
|
0.000
|
0.000
|
State_OH
|
0.007
|
*
|
0.000
|
0.000
|
Rehab Hospital
|
0.058
|
***
|
0.000
|
0.000
|
State_OK
|
0.001
|
|
0.000
|
0.000
|
STAC Hospital
|
0.084
|
***
|
0.000
|
0.006
|
State_OR
|
-0.002
|
|
0.000
|
0.000
|
State_AL
|
0.005
|
|
0.000
|
0.000
|
State_PA
|
0.002
|
|
0.000
|
0.000
|
State_AR
|
0.003
|
|
0.000
|
0.000
|
State_RI
|
0.002
|
|
0.000
|
0.000
|
State_AZ
|
-0.002
|
|
0.000
|
0.000
|
State_SC
|
0.006
|
|
0.000
|
0.000
|
State_CA
|
0.000
|
|
0.000
|
0.000
|
State_SD
|
-0.002
|
|
0.000
|
0.000
|
State_CO
|
-0.003
|
|
0.000
|
0.000
|
State_TN
|
0.003
|
|
0.000
|
0.000
|
State_CT
|
0.013
|
**
|
0.000
|
0.000
|
State_TX
|
0.004
|
|
0.000
|
0.000
|
State_DC
|
0.006
|
|
0.000
|
0.000
|
State_UT
|
-0.004
|
|
0.000
|
0.000
|
State_DE
|
0.018
|
**
|
0.000
|
0.000
|
State_VA
|
0.014
|
***
|
0.000
|
0.000
|
State_FL
|
0.006
|
|
0.000
|
0.000
|
State_VT
|
-0.001
|
|
0.000
|
0.000
|
State_GA
|
0.009
|
**
|
0.000
|
0.000
|
State_WA
|
0.005
|
|
0.000
|
0.000
|
State_HI
|
-0.003
|
|
0.000
|
0.000
|
State_WI
|
0.003
|
|
0.000
|
0.000
|
State_IA
|
0.001
|
|
0.000
|
0.000
|
State_WV
|
0.002
|
|
0.000
|
0.000
|
State_ID
|
0.000
|
|
0.000
|
0.000
|
State_WY
|
0.002
|
|
0.000
|
0.000
|
State_IL
|
0.009
|
**
|
0.000
|
0.000
|
Urban
|
0.004
|
***
|
0.000
|
0.000
|
State_IN
|
0.006
|
|
0.000
|
0.000
|
Year 2017
|
0.003
|
***
|
0.000
|
0.000
|
State_KS
|
0.002
|
|
0.000
|
0.000
|
Year 2018
|
0.004
|
***
|
0.000
|
0.000
|
State_KY
|
0.005
|
|
0.000
|
0.000
|
DRG 292
|
-0.040
|
***
|
0.000
|
-0.016
|
State_LA
|
0.006
|
|
0.000
|
0.000
|
DRG 293
|
-0.056
|
***
|
-0.013
|
-0.029
|
*p<.05, **p<.01, ***p<.001
Very few coefficients are recommended by the lasso and elastic net models. The lasso model suggest that the workload principal component and DRG 293 are important predictors, both of which are associated with reduced diagnoses ceterus parabus. Elastic net was similar in recommending inclusion of workload as well as DRG 292 and DRG 293, all associated with reduced diagnoses. The OLS model had a larger array of variables that were statistically significant, and the coefficients of the largest magnitude for the min-max scaled variables were associated with workload (-0.439), Short-Term Acute Care hospitals (STAC, 0.084), cash-on-hand (0.082), and Critical Access Hospitals (CAH, 0.082). When evaluated by categorical groups, the most significant variables were workload (0.302 additional R2), DRGs (0.162 additional R2), and hospital type (0.011 additional R2).
3.5.2. Tree Ensemble Models
Random forests, extra trees regression, gradient boosting, and bagging regressors after some hyperparameter tuning on the training set predicted heart failure diagnoses on the test set with reasonable accuracy (R2 = 0.829, 0.862, 0.821, and 0.830, respectively) . The number of trees used for each estimator was tuned along with the maximum depth of the trees (number of branches). A pseudo-random number ensured that any model improvements were not due to the random number stream. All models accounted for more variance than any regression model.
The best performing tree ensemble was the extra trees regression. This model ensembled 50 trees and resulted in variable importance shown in Figure 9. Similar to the regression models, hospital type (STAC / LTAC), workload (PC), and DRG (DRG 293) were important along with the state of Utah.
The conclusion for both the regression and tree models is that hospital-level diagnoses by DRG are forecastable and that workload along with hospital type are important in doing so. Further, the models indicated that geography might be important, as individual state variables and urban / rural status were important to the OLS and the tree models. These models were evaluated on the hospital unit of analysis for raw diagnoses numbers. Rate-based admission models were then evaluated for the states and counties.
3.6. State Level Geospatial Analysis
A descriptive analysis of heart failure from 2016-2018 using geographical informat systems was conducted to evaluate regional differences. Primarily, we were interested in rates per standardized unit in the population of the geographical area. Populations were based on Census Bureau estimates for each geographic region [7, 46]. The state level analysis was limited in that only 50 states and Washington, D.C. were included (n=51).
Results of the state GIS analysis are available here: https://rpubs.com/R-Minator/HeartState [65]). There is a clear bifurcation in the center of the United States separating high and low rates. That bifurcation suggests a clear West-East difference, favoring the West Coast. Washington, D.C. has experienced the highest average admission rate for diagnoses of heart failure (109.5 per 1000), which might be due to the large presence of military and veteran care facilities) followed by West Virginia (102.8 per 1000), Mississippi (98.2 per 1000), Michigan (94.3 per 1000), Delaware (94.2 per 1000), Kentucky (93.8 per 1000), North Dakota (90.6 per 1000), North Carolina (88.7 per 1000), Virginia (88.0 per 1000), and Missouri (87.5 per 1000) . Of interest is that previous studies indicate these states also see many admissions due to the opioid crisis [43].
From 2016 through 2018, the average rate of diagnoses per 1,000 population increased for nearly all states. A Friedman rank sum test (paired, non-parametric ANOVA) of rates by state by year revealed significantly different rates by year by state (c22=70.941, p<.001). Figure 10 illustrates the changes by year and by state.
Further, evaluating obesity prevalence intensity from the Centers for Disease Control and Prevention (CDC) shows significant correlation between obesity and DRGs per 1000 [66]. A Spearman’s test for correlation of obesity prevalence and 2018 DRGs per 1000 was statistically significant with r=0.689, S=6,867.7, p<.001.
Ordinary Least Square regression was performed on the state admission rate as a function of the quantitative, aggregated variables. While the model was statistically significant and accounted for reasonable variability
only the proportion Medicare was significant at the 0.05 level. Most important to this preliminary analysis was whether state-level spatial data were important to evaluating admission rates. The spatial map of the standardized residuals [58] as well as residuals associated separate linear models for all included variables is available as an interactive GIS map here: https://rpubs.com/R-Minator/heart [62]. The spatial residuals shows some spatial correlation. The visual check was confirmed by a global test for both Queen and Rook neighbors suggest that spatial relationships exist, (I=0.309, p<.001 and I=0.306, p<.001, respectively) [67]. Lagrangian Multiplier Diagnostics (non-robust and robust) suggested that the preferred models would be spatial lag rather than spatial error, as robust tests for error models were insignificant while lag models remained significant (see [62]).
Generalized spatial two-stage least squares estimated Queen, and Rook models, while a comparison linear model was estimated in traditional fashion. R2 for OLS, Queen, and Rook models were 0.529, 0.816, and 0.809, respecctively. Queen and Rook models performed better on the state-aggregated data. The coefficient results of the spatial models were nearly identical to each other, while the OLS was obviously needed geospatial data to improve its performance (see Table 6). The geographic component for Queen and Rook models were statistically significant along with mean profit margin and the proportion of facilities with major medical school affiliation were important in predicting diagnoses rates.
Table 6. Results of the state-level regression
Variable
|
Linear Model
|
|
Queen Model
|
|
Rook Model
|
|
Rho
|
|
|
0.993
|
***
|
0.993
|
***
|
(Intercept)
|
-0.221
|
|
-0.101
|
|
-0.123
|
|
Income
|
-0.055
|
|
0.011
|
|
0.01
|
|
Profit Margin
|
-0.418
|
|
-0.458
|
**
|
-0.458
|
**
|
Cash on Hand
|
-0.162
|
|
0.015
|
|
0.023
|
|
Equity
|
0.183
|
|
0.060
|
|
0.049
|
|
% Medicare
|
0.842
|
***
|
0.221
|
|
0.24
|
|
% Medicaid
|
-0.163
|
|
0.058
|
|
0.061
|
|
% Non-Profit
|
0.129
|
|
-0.128
|
|
-0.122
|
|
% Med School
|
0.386
|
|
0.398
|
***
|
0.408
|
***
|
% STAC
|
0.483
|
**
|
-0.016
|
|
-0.015
|
|
Workload
|
-0.004
|
|
-0.162
|
|
-0.152
|
|
*p<.05, **p<.01, ***p<.001
3.7. County-Level Spatial Analysis
3.7.1. Interactive, Online Maps
The average three-year heart failure admissions per 1000 county population are shown in the interactive map online [68]. These county maps show that the admissions are generally (as expected) in large metropolitan areas, e.g., Dekalb, Illinois (0.65 per 1000). There are exceptions, however. For example, Montour, Pennsylvania is a small county that is home to a large Geisinger facility and thus has a higher than expected admission rate (1.00 per 1000).
The top ten counties for average rates per 1000 over three-years were Winchester, Virginia (3.33 per 1000); Norton, Virginia (3.21 per 1000), Montour, Pennsylvania (3.01 per 1000); Fredericksburg, Virginia (2.13 per 1000); DeKalb, Illinois (1.95 per 1000); Harrisonburg, Virginia (1.58 per 1000); Petersburg, Virginia (1.57 per 1000); Boyd, Kentucky (1.45 per 1000); St. Francois, Missouri (1.34 per 1000); and Adams, North Dakota (1.21 per 1000). Of interest is that half of these counties are in the state of Virginia, perhaps due to the large military and veteran medical centers located in the area. Many of these counties (e.g., Winchester) are small but have large healthcare facilities.
3.7.2. Regression Models, County-Level of Analysis
Similar to what was done at the state level, an exploratory spatial regression model using first-order Queen and Rook contiguity criterion to evaluate the importance of geography was performed using rolled, Z-scaled, county-level independent variables on the county-level admission rate variable (admissions per population in each county). Moran’s I global test suggested that the OLS model was probably sufficient (I=0.02, p=0.100); however, we explored further with Lagrangian multiplier diagnostics. The robust from of these statistics slightly favored a lag model. Results of the regression are in Table 7, and the residual maps for the global model and the individual variables are available online [62] The OLS, Queen, and Rook regression models accounted for only a small fraction of the sum of the squares (R2 = .169, R2 = .132, R2 = .132, respectively).
Table 7. Regression table for county analysis
Variable
|
Linear Model
|
|
Queen Model
|
|
Rook Model
|
|
Rho
|
-0.539
|
***
|
-0.538
|
***
|
(Intercept)
|
0.019
|
0.048
|
***
|
0.047
|
***
|
Income
|
0.010
|
0.015
|
0.015
|
Profit Margin
|
-0.007
|
*
|
-0.002
|
-0.002
|
Cash on Hand
|
-0.063
|
*
|
-0.058
|
***
|
-0.057
|
***
|
Equity
|
0.090
|
*
|
0.081
|
***
|
0.081
|
***
|
% Medicare
|
0.049
|
**
|
0.050
|
***
|
0.050
|
***
|
% Medicaid
|
0.012
|
*
|
-0.001
|
-0.001
|
% Non-Profit
|
0.013
|
**
|
0.016
|
***
|
0.016
|
***
|
Mean Affiliated Providers
|
0.045
|
**
|
0.044
|
***
|
0.044
|
***
|
% STAC
|
0.041
|
**
|
0.044
|
***
|
0.044
|
***
|
Workload
|
0.084
|
*
|
0.079
|
***
|
0.079
|
***
|
Moran's I favors the linear model, but all coefficients are similar.
|
*p<.05, **p<.01, ***p<.001
|
|
|
|
|
|
Most variables in all models are statistically significant largely due to the sample size, but the coefficients are of small magnitude. Every variable except for the proportion Medicaid was statistically significant in the best-fiting OLS model, and yet the magnitude of the coefficients across the three models (OLS, Queen, Rook) was quite similar. Profit margin, and cash-on-hand were negatively associated with admission rates in the OLS model, ceterus parabus. All other variables had positive coefficients in the OLS model. Interpretation of directionality must be done cautiously, as the variables act together in prediction.
Interactive maps of the admission rate, model residuals (OLS, Queen, and Rook), as well as residuals for individual variables are provided online [62]. The residual maps are not suggestive of spatial autocorrelation given the residual dispersion by county. Future explanatory models can omit spatial correlation.
Given the small contribution of the OLS, Queen and Rook models to estimating county-level admission rates, ensemble models were investigated at the county level. With 2,431 valid observations, sufficient power existed to split the data into training and test sets (80% / 20%). Results of hyperparameter tuned models suggested that extreme gradient boosting was the best model as the predictive R2 for random forests, extra trees, extreme gradient boosting, and bagging regressors was 0.242, 0.292, 0.264, and 0.130, respectively. The first three models performed much better on the 20% withhold set than regression models. The variable importance analysis suggested that workload, cash-on-hand, and mean equity were the most important variables with importance scores of 0.35, 0.13, and 0.13, respectively.