The graphs of the calculated state-level GCI and EFI values by years are given below. In Figure 1, the left panel shows the five states with the highest average EFI values between 2002 and 2018; the right panel gives the five states with the lowest average fitness for this period.
The left panel of Figure (1) shows us that New Jersey stands out as the state with the highest EFI value. In addition, it is noteworthy that while the EFI values of states such as Florida and California have increased over the years, the state of Pennsylvania has experienced a slight decrease. In Figure 1, we can also observe that states with high EFI values show a more stable development path over the years. Alaska stands out as the state that lags behind in terms of EFI, i.e. productive capabilities, due to its economy based largely on unsophisticated products such as fishing and forestry. One interesting point in Figure (1) is that, as one of the states with the highest per capita income, DC, is ranked among the lowest EFI states. The main reason for this is that the DC’s economic structure is business service oriented rather than production.
In addition to EFI, state level visual inspection can be made in terms of GCI from the Figure (2) below. Figure 2 demonstrates the states with the highest (left panel) and the lowest (right panel) average GCI values for the period of 2002-2018.
In Figure (2), three states with the highest EFI, namely Illinois, Florida, and Pennsylvania, are also among the states with the highest GCI. In addition, Alaska, Hawaii, North Dakota, and DC, which are among the lowest EFI values, are also among the states with the lowest GCI values. It is noteworthy that states with high EFI such as New Jersey and California are not among states with high GCI. This shows us that while New Jersey and California have a high accumulation of productive capabilities, this accumulation seems not to be concentrated on green products.
Correlations and descriptive statistics of the variables we used for our analysis are given in Table 1 and Table 2, respectively.
Table 1
|
GCI
|
EFI
|
GDP pc
|
Electric Cons. pc
|
Population Density
|
GCI
|
1.000
|
|
|
|
|
EFI
|
0.883
|
1.000
|
|
|
|
GDP pc
|
-0.172
|
-0.213
|
1.000
|
|
|
Electric Cons. pc
|
-0.141
|
-0.247
|
-0.087
|
1.000
|
|
Population Density
|
0.442
|
0.566
|
0.387
|
-0.269
|
1.000
|
N
|
866
|
|
|
|
|
In Table 1, the high correlation between GCI and EFI is remarkable. For this reason, we preferred not to include the GCI and EFI variables together in an estimated model. Apart from this, the correlations between other variables are considered to be reasonable. In Table 2, descriptive statistics for logarithmic transformations of the variables are given. Alaska, is the only state with a negative logarithmic value due to its low population density.
Table 2
|
Mean
|
Standard Deviation
|
Min
|
Max
|
GCI
|
7.308
|
0.822
|
1.975
|
8.701
|
EFI
|
9.938
|
0.778
|
6.970
|
11.273
|
GDP pc
|
10.824
|
0.256
|
10.336
|
12.123
|
Electric Cons. pc
|
9.442
|
0.305
|
8.775
|
10.332
|
Population Density
|
3.631
|
1.527
|
-0.833
|
8.398
|
The models we run for SO2, PM10 and CO2 dependent variables are given in Table 3. First, we questioned the significance of the GCI and EFI variables separately. Both variables are not significant for three different air pollution variables.
Table 3
Univariate Fixed Effect Estimations for GCI and EFI
|
SO2pc
|
PM10pc
|
CO2pc
|
|
Model 1
|
Model 2
|
Model 3
|
Model 4
|
Model 5
|
Model 6
|
GCI
|
-0.009
|
|
0.001
|
|
0.007
|
|
|
(0.068)
|
|
(0.037)
|
|
(0.018)
|
|
EFI
|
|
-0.166
|
|
0.025
|
|
-0.033
|
|
|
(0.222)
|
|
(0.102)
|
|
(0.051)
|
Constant
|
-3.046***
|
-1.459
|
-2.559***
|
-2.799***
|
2.994***
|
3.372***
|
|
(0.495)
|
(2.180)
|
(0.274)
|
(0.991)
|
(0.127)
|
(0.502)
|
R-squared within
|
0.771
|
0.770
|
0.282
|
0.276
|
0.686
|
0.687
|
F stat.
|
49.295
|
45.017
|
20.328
|
24.104
|
24.151
|
31.866
|
prob.
|
0.000
|
0.000
|
0.000
|
0.000
|
0.000
|
0.000
|
Number of obs.
|
866
|
867
|
866
|
867
|
815
|
816
|
Heteroskedasticity and autocorrelation robust standard errors are in parentheses. |
* p < 0.10, ** p < 0.05, *** p < 0.01. Year dummies are not reported. |
We expand the results in Table 3 with our control variables and that are reported in Table 4. Accordingly, there is no linear relationship between GCI and EFI variables. However, considering the fact that the analysis results using linear fixed effect estimation may contain functional form bias, alternative estimations based on fractional polynomial recession have been made.
From Table 3 and Table 4, we can see that there is no statistical relationship between GCI, EFI, and local air pollutants. However, it should be kept in mind that the fixed effect estimation is based on the assumption that the relationship between the variables is linear. Considering that polynomial models are used to avoid functional form bias in the EKC literature, estimation over alternative functional forms will yield more reliable estimation results. In this context, the test results called fractional polynomial selection procedure or function selection procedure proposed by Royston (2017), separately for both our GCI and EFI variables, are given in Table 5 and Table 6, respectively.
Table 4
Multivariate Fixed Effect Estimations for GCI and EFI
|
SO2pc
|
PM10pc
|
CO2pc
|
|
Model 1
|
Model 2
|
Model 3
|
Model 4
|
Model 5
|
Model 6
|
GCI
|
0.029
|
|
0.003
|
|
0.018
|
|
|
(0.057)
|
|
(0.035)
|
|
(0.013)
|
|
EFI
|
|
-0.057
|
|
0.006
|
|
0.002
|
|
|
(0.216)
|
|
(0.104)
|
|
(0.035)
|
GDP pc
|
0.317
|
0.243
|
-0.417
|
-0.465
|
0.061
|
0.061
|
|
(0.914)
|
(0.923)
|
(0.389)
|
(0.408)
|
(0.104)
|
(0.099)
|
Electric.Cons pc.
|
2.223**
|
2.229**
|
0.382
|
0.404
|
0.646***
|
0.640***
|
|
(1.018)
|
(1.026)
|
(0.375)
|
(0.378)
|
(0.111)
|
(0.113)
|
Population Density
|
0.698
|
0.695
|
0.727
|
0.709
|
-0.399**
|
-0.399**
|
|
(1.069)
|
(1.093)
|
(0.594)
|
(0.598)
|
(0.192)
|
(0.197)
|
Constant
|
-30.204***
|
-28.678**
|
-4.294
|
-3.960
|
-2.407
|
-2.250
|
|
(9.302)
|
(10.767)
|
(4.229)
|
(4.581)
|
(1.454)
|
(1.365)
|
R-squared within
|
0.793
|
0.791
|
0.292
|
0.287
|
0.776
|
0.775
|
F stat.
|
32.569
|
35.282
|
15.249
|
14.063
|
43.650
|
39.047
|
prob.
|
0.000
|
0.000
|
0.000
|
0.000
|
0.000
|
0.000
|
Number of obs.
|
866
|
867
|
866
|
867
|
815
|
816
|
The first column of Table 5 shows the null hypotheses of FSP. Accordingly, the null hypothesis of omitting the GCI variable from the model cannot be rejected for both the second order (M=2) and third order (M=3) polynomial functional form specifications for SO2pc and PM10pc. This shows that, as reported in our fixed effect estimations, GCI is statistically unrelated with the SO2 and PM10 variables for both linear and nonlinear functional forms.
In the FSP, if the null hypothesis for omitting the variable in question is significant, we can say that there is an association between the variables, and moreover, statistically significant null hypothesis for linearity (second row), refers to a nonlinear relationship (Royston, 2017). According to Table 5, the functional form of the relationship between the GCI and CO2 should be linear. However, the linear functional specification in our fixed effect estimation shows us that the relationship between GCI and CO2 is not statistically significant.
Table 5
Function Selection Procedure for GCI
|
SO2pc
|
PM10pc
|
CO2pc
|
|
Second order fractional polynomial form
|
Third order fractional polynomial form
|
Second order fractional polynomial form
|
Third order fractional polynomial form
|
Second order fractional polynomial form
|
Third order fractional polynomial form
|
Omitted
|
0.897
|
0.714
|
0.859
|
0.937
|
0.047**
|
0.030**
|
Linear
|
0.863
|
0.617
|
0.765
|
0.848
|
0.158
|
0.116
|
M=1
|
0.777
|
0.435
|
0.637
|
0.679
|
0.179
|
0.121
|
M=2
|
0.946
|
--
|
0.413
|
--
|
0.357
|
--
|
M=3
|
--
|
|
--
|
|
--
|
|
Number of models tested
|
164
|
44
|
164
|
44
|
164
|
44
|
* p < 0.10, ** p < 0.05, *** p < 0.01 |
When the functional selection procedure is applied for second (M=2) and third (M=3) order polynomial forms for the EFI variable, the results are as in Table 6.
Table 6
Function Selection Procedure for EFI
|
SO2pc
|
PM10pc
|
CO2pc
|
|
Second order fractional polynomial form
|
Third order fractional polynomial form
|
Second order fractional polynomial form
|
Third order fractional polynomial form
|
Second order fractional polynomial form
|
Third order fractional polynomial form
|
Omitted
|
0.044**
|
0.036**
|
0.000***
|
0.002***
|
0.005***
|
0.014**
|
Linear
|
0.027**
|
0.019**
|
0.000***
|
0.001***
|
0.003***
|
0.006***
|
M=1
|
0.019**
|
0.010***
|
0.000***
|
0.000***
|
0.002***
|
0.004***
|
M=2
|
0.263
|
--
|
0.003***
|
--
|
0.057*
|
--
|
M=3
|
--
|
|
--
|
|
|
|
Number of models tested
|
164
|
44
|
164
|
44
|
164
|
44
|
* p < 0.10, ** p < 0.05, *** p < 0.01 |
In Table 6, the null hypothesis for omitting the EFI, along with the linear and first order fractional specifications of EFI, are all statistically significant and thus, should be rejected. Table 6 shows us that the EFI variable should not be linearly estimated, as in our fixed effect models, but rather second order fractional polynomial specification of EFI should be used to test the relationship between SO2, PM10, and CO2 variables.
While the best specification for SO2 and CO2 variables for EFI is the second order fractional polynomial form, third order fractional polynomial form is the best specification for PM10.
However, as stated in Royston (2017), the probability of falling into type II error increases as the degrees of variables tested in the functional selection procedure are increased. For this reason, it would be appropriate to choose the most parsimonious model. From this point of view, for the PM10pc variable, the second-order fractional polynomial of the EFI variable is preferred and included in the model.
According to these test results, it can be said that fixed effect estimations, may yield biased results due to functional form misspecification. Therefore, the fractional polynomial fixed effect estimation results we estimated based on the test results above are given in Table 7 below.
Table 7
Second Order Fractional Polynomial Estimation
|
SO2
|
PM10
|
CO2
|
EFI-1
|
0.036*
|
-1474.680***
|
-303.435**
|
|
(0.018)
|
(447.972)
|
(126.274)
|
EFI-2
|
-0.014*
|
868.063***
|
178.526**
|
|
(0.007)
|
(279.143)
|
(80.125)
|
GDP pc
|
0.251
|
-0.456
|
0.061
|
|
(0.919)
|
(0.406)
|
(0.098)
|
Electric Cons. pc
|
2.199**
|
0.387
|
0.636***
|
|
(1.017)
|
(0.370)
|
(0.112)
|
Population Density
|
0.889
|
0.869
|
-0.362*
|
|
(1.138)
|
(0.603)
|
(0.199)
|
Constant
|
-33.283***
|
-9.623**
|
-3.396*
|
|
(9.872)
|
(4.190)
|
(1.747)
|
R-squared within
|
0.793
|
0.301
|
0.778
|
F stat.
|
44.420
|
14.700
|
71.167
|
prob.
|
0.000
|
0.000
|
0.000
|
Number of obs.
|
867
|
867
|
816
|
Heteroskedasticity and autocorrelation robust standart errors are in parentheses. |
* p < 0.10, ** p < 0.05, *** p < 0.01. Year dummies are not reported. |
Since the EFI is added to the model as a second order fractional polynomial, two variables included to the model additionally as EFI-1 and EFI-2. Both variables are significant for SO2pc, PM10pc and CO2pc. Graphs showing predicted values and observations for estimated fractional polynomial models for EFI are given below.
According to these graphs, it is seen that the EFI variable is not only significant, but also exhibits a similar structure to the inverted U-shaped for US. This is especially evident for the SO2 variable.
Overall, the results indicate no evidence of a relationship between GCI and air pollution at the regional level. This finding seems not compatible with the findings of Mealy and Teytelboym (2020). In addition, a non-linear relationship found between EFI and air pollution. Moreover, this relationship is in the form of an inverted U shape for SO2. Dinda (2004) states that inverted U shape is significant between GDP and especially SO2 and PM10, but controversial for CO2 data. He makes this assertion with reference to Holtz-Eakin and Selden (1995), Roberts and Grimes (1997) and Dinda (2001). Our results contain similar findings for the EFI variable as Dinda (2004) put forward for GDP. In the same vein, Pata (2020), in his study for US, found that there is an inverted U shape between ECI and CO2.
Concluding Remarks
Institutional and public environmental awareness stimulates global demand for environmental products and services. Governments and corporations are getting more sensitive against ecologically hazardous production. For instance, recently, the European Union agreed on the European Green Deal (EGD) which promotes environmentally friendly product markets and sets new product standards to eliminate the adverse effects of environmentally hazardous production.
We attempt to analyze the nexus between green production and environmental quality by exploiting sub-national data for US States. The analysis consists of two stages. First, we developed a green product complexity index dataset for each state. Later, environmental data and green and overall product complexity indices are estimated by fixed effect and the fractional polynomial regression method, which allows more flexible functional forms.
We find that higher green complexity index levels have an insignificant effect on emission levels in the US States. Contrary to Mealy and Teytelboym (2020), our findings indicate that exporting more sophisticated green products does not yield a better air quality. This may be due to the current green product classifications which fail to incorporate the production and end-use stages of goods or services of green-labeled products. Thus, green-labeled products may have adverse environmental effects from their production to final consumption.
In contrast to the GCI, findings suggest that economic complexity index which includes all production regardless of green or non-green classifications has significantly reduces the Sulfur dioxide, Particulate Matter 10, and CO2 levels. In line with the existing literature (e.g., Neagu, 2019; Chu, 2020; and Pata, 2020), we find an inverted U-shape relationship between EFI and emission levels particularly for SO2. This paper extends the literature in many folds. First, we provide a new dataset (i.e., green product complexity index) for each US states. The GCI data can be used for future research on green production. Furthermore, we outline the link between green production and environmental quality at the sub-national level. Sub-national analysis provides more robust estimation in environmental studies as the significant differences between emission measurement methods across countries create cross-country data inconsistency.
Although subnational analysis offers a more homogeneous environment for researchers compared to cross-country studies, unfortunately they do not allow for a completely homogeneous research sample. In addition, it should be kept in mind that subnational studies are less generalizable. For this reason, more subnational studies for different countries are needed for a more reliable results.