3.1. The Dataset
Data on ESG were retrieved from the Refivitiv Eikon platform powered by Thomson Reuters. Refinitiv Eikon is an open-technology solution for academics who would like to exam deeply the ESG performance and capacity of businesses in different sectors globally, as provides access to industry-leading data, insights, and exclusive and trusted news (Refinitiv Eikon, 2021; Gaganis et al., 2021). The database of Thomson Reuters captures and calculates ESG measures, of which a subset of the most comparable and material per industry, power the overall company assessment and scoring process (Milner, Ham, and Hur 2014; Refinitiv Eikon 2021). These are grouped into different categories that reformulate the three pillar scores and the final ESG score, which reflects the company’s ESG performance, commitment and effectiveness based on publicly reported information. Based on the objective of this study, ESG score is used as the dependent variable (Landis and Skouras 2021; Refinitiv Eikon 2021).
The category scores are rolled up into three pillar scores – environmental, social, and corporate governance (Achim and Borlea 2015; Paltrinieri et al. 2020). The ESG pillar score is a relative sum of the category weights, which vary per industry for the environmental and social categories (Mohammad and Wasiuzzaman 2021).
A company's ESG scoring is the numerical expression of the way that its performance is perceived over a wide range of environmental, social and governance (ESG) issues. An ESG score can be characterized as a tool that: (i) helps businesses to be alert regards the continuous changes in the market and (ii) motivate them to reconsider their corporate strategy by setting in the core ESG performance. There are many reasons besides understanding why a business needs to know its ESG score. One of the most critical refers to the rapid growth of ESG investments, with investors looking for portfolios of sustainable assets. With a reliable link between strong performance on key ESG issues and financial performance, ESG score is used by institutional and independent investors to identify companies that may offer good returns. Executives equate a good ESG score with healthy earnings (Giannarakis, Konteos, and Sariannidis 2014). In terms of ESG score reliability, the more reliable an ESG rating is, the more consistently it is calculated and reported, the greater the impact it will have on long-term performance, by managing ESG risks and opportunities, encouraging impact investment and pushing corporate governance to create a more sustainable business. The Refinitiv Eikon ESG scoring is calculated on a scale between 0.0 to 1 and can provide comparable scores for businesses across sectors and regions. The calculation of the scores was defined by Thomson Reuters as following (Refinitiv Eikon 2021):

A relative percentile ranking is only applied if a numeric data point is reported by a company, while all the companies in an industry group report that respective data point. Each measure has a polarity indicating whether a higher value is positive or negative. For instance, more water recycled is positive, but more emissions are negative (Landis and Skouras 2021; Refinitiv Eikon 2021). Percentile rank scoring methodology is adopted to calculate scores of the categories included in the three pillars of ESG.
3.2. The Variables
The main objective of the study is to highlight the factors that are related to ESG score and contribute to the prediction of fraud on supply chain. Regression analysis was used in this study, as the common tool to use for forecasting and prediction (Topliss and Costello 1972). Specific, a multiple log-log regression model was developed to determine if exists a relationship between at least two or more explanatory variables (Clifford et al. 2013; DeFries and Fulker 1985). Furthermore, the first step of creating a multiple regression model is to choose the factors (Pan et al. 2021; Wilkie and Galasso 2021). The factors that were chosen in the current study was ESG score, Resource Use Score, Emissions Score, Environmental Innovation Score, Workforce Score, Human Rights Score, Management Score, Policy Water Efficiency Score, Policy Environmental Supply Chain Score, where the ESG score is the dependent variable. Both the dependent variable and the independents are listed and fully described in Table 1.
Global complexity is increasing as supply chains become more interconnected, economies grow and develop, weather patterns change, and societies experience and acquire more sophisticated technology. ESG factors is an attempt to capture more of this complexity in business decision-making and to assess potential for continued viability in a world that increasingly requires more sustainable outcomes. Based on that point, dependent variable ESG score was selected as it plays a key role on the prediction of fraud in supply chain.
The independent variables were defined as in Table 1 The choice was due to the strong relationship between these factors and the ESG score. In addition, this group of indicators are part of the three main pillars of ESG and present the highest contribution on the prediction of the supply chain from disruptions (Alda 2021). Firstly, the way that a company uses the resource to achieve a better performance and capacity leads to the improvement and sustainability of its supply chain (Alonso-Fradejas 2021; Tseng, Bui, and Lim 2021). So, the role of Resource Use Score plays a vital role in the study. Moreover, organizations are progressively thoughtful and responsive to the carbon emission in today's world, which relates to their organizational operations (Molthan-Hill et al. 2020; Tseng et al. 2021). In their main priorities is to calculate their Carbon Footprint, which is called as CFP, because they want to maintain and reduce it (Firoozi Nejad et al. 2021). This can be act as the initial step for any firm to maintain carbon emissions and create an efficient environmental management system and as a result a sustainable supply chain. Emissions Score which measures a company’s commitment to and effectiveness in reducing environmental emission in the production and operational processes was selected as independent variable in this study based on the above statement (Magerakis and Habib 2021). Following to the Emissions Score, Environmental Innovation Score was included too, as independent variable, because it reflects a company’s capacity to reduce the environmental costs and burdens for its customers, and thus creating new market opportunities through new environmental technologies and processes or eco-designed products (Fuente, Ortiz, and Velasco 2021). Management Score was selected for the development of the model of the current study and measures a company’s commitment to and effectiveness in following best practice corporate governance principles, while Workforce score was included too (DasGupta 2021). Workforce is vital for any business and is one of the most important assets of it. Providing a range of growth opportunities to employees can positively impact wellbeing (Rajesh and Rajendran 2020). Purpose is one of the most powerful drivers of engagement. An engaged employee will feel as though they are contributing towards something that matters to them. Similarly, promoting sustainable behavior at work can indirectly impact on wellbeing (Sakun et al. 2020). For example, by encouraging sustainable travel (eg walking and cycling) this can improve workforce’s health, reduce stress and decrease air pollution. Based on that, workforce Score was selected as independent variable in this study as it measures a company’s effectiveness towards job satisfaction, healthy and safe workplace, maintaining diversity and equal opportunities, and development opportunities for its workforce (Dorfleitner, Kreuzer, and Laschinger 2021). Also, Human Rights Score was added and measures a company’s effectiveness in respecting the fundamental human rights conventions. Lastly, policy factors were selected for the model like the Policy Environmental Supply Chain Score and the Policy Water Efficiency Score (Naffa and Fain 2020). The first reflects a company’s efforts to include in the supply chain measures to reduce their environmental impact, while the second reflects a company’s policy to improve its water efficiency by using various forms of processes/mechanisms/procedures and a system or a set of formal documented processes for efficient use of water and driving continuous improvement.
Table 1
Selected variables that have been used in the model
Variable
|
Type
|
Description
|
ESG Score
|
Dependent
|
ESG score reflects the overall score of companies based on information from their internal environment and focusing on the pillars of environmental, social and corporate governance.
|
Resource Use Score
|
Independent
|
Resource Use Score highlights the ability of a businesses to rationally manage their materials and energy and to focus on solutions that are more environmentally friendly, thus improving supply chain management.
|
Emissions Score
|
Emission Score represents the degree of commitment and efficiency of a company, in terms of reducing its environmental emissions that come of its production and operation processes.
|
Environmental Innovation Score
|
Environmental Innovation Score reflects a company’s ability to reduce its environmental footprint as well as its customer burdens, thus creating new market opportunities through new environmental technologies and processes or eco-friendly products.
|
Management Score
|
Management Score measures a company's commitment and effectiveness in terms of the best practices of corporate governance principles.
|
Workforce Score
|
Workforce Score measures the effectiveness of a company in terms of Human Resources. This rating expresses employee’s satisfaction with the work, the implementation of safety and quality systems, respect for diversity of individuals, while it ensures equal development opportunities.
|
Human Rights Score
|
Human Rights Score measures a company's effectiveness in respecting fundamental human rights principles.
|
Policy Environmental Supply Chain Score
|
|
Supply Chain Environmental Policy Score highlights all the actions of the company to integrate in the supply chain measures and practices, regarding the reduction of their environmental impact.
|
Policy Water Efficiency Score
|
|
Policy Water Efficiency Score indicates the intention of businesses to improve water efficiency by adopting various forms of processes and systems.
|
3.3. Descriptive Statistics
In Table 2 are presented the descriptive statistics of the dependent and all independent variables that are included in the regression analysis. As it can be seen, there are 681 observations were retrieved from the Eikon database for the Fiscal Year 2020 and included in the sample. These observations represented companies that are headquartered in Europe. The company with the highest score has an ESG Score of 94.073, and the company with the lowest ESG Score rating has a score of 21.359. The mean of ESG Score for companies is 68.107. This indicates that companies in Europe have a good relative ESG performance and above-average degree of transparency in reporting material ESG data publicly.
Table 2
Descriptive statistics of all variables included in the regression analysis
Variable
|
N
|
Minimum
|
Maximum
|
Mean
|
Std. Deviation
|
ESG Score
|
681
|
21.359
|
94.073
|
68.107
|
14.120
|
Rersource Use Score
|
681
|
18.508
|
99.895
|
78.095
|
18.275
|
Emissions Score
|
681
|
0.215
|
99.876
|
73.603
|
21.207
|
Environmental Innovation Score
|
681
|
0.811
|
99.865
|
55.608
|
26.110
|
Workforce Score
|
681
|
14.486
|
99.940
|
78.220
|
18.319
|
Human Rights Score
|
681
|
3.438
|
98.264
|
72.561
|
21.817
|
Management Score
|
681
|
1.190
|
99.919
|
61.746
|
27.385
|
Policy Water Efficiency Score
|
681
|
57.692
|
95.652
|
72.381
|
7.971
|
Policy Environmental Supply Chain Score
|
681
|
57.143
|
90.385
|
72.366
|
7.013
|
The maximum scores observed on Resource Use Score, Emissions Score, Environmental Innovation Score, Workforce Score and Management Score were exceeded 99 points, which was supposed to be close to the maximum possible score. This indicates that variables are compatible with the scoring method. As for the minimum score, Emission Score, Environmental Innovation Score, Management Score and Human Rights Score were far lower than the minimum scores for the other variables of the model. Regards the average, that of Resource Use Score and Workforce Score, as well as, Human Rights Score, Policy Water Efficiency Score and Policy Environmental Supply Chain Score were very close. Environmental Innovation Score had the lowest average score of the independent variables but had the second highest standard deviation in scores.
3.4. Multiple linear regression analysis
The main purpose of the multiple linear regression analysis is to investigate the relationship between a dependent variable (in the current research the dependent variable is ESG score) and two or more independent variables in the following form:
y = βo + β1χ1 + β2χ2 + …+ βnχn + εi
In the above equation the terms of β0 ... βn are called as the coefficients of the regression and their estimation is based on a record of observations. This is done by curve fitting based on the least square method with the aim of minimizing the difference between the observed and estimated values. The predictors should have little or no correlation with each other. For example, the correlation coefficient should be less than 0.7 to evade from problems like multicollinearity. The last term of the equation is εi and is mainly referred to as the residual. Also, residuals can be used about testing the general significance (F-test) of the equation and the significance of each regression coefficient (t-test). For obtaining valid results from the above tests, the residual εi should be distributed independently, with a mean of zero and a constant variance of σ2. This is described by a residual analysis and can also lead to the elimination of the data outliers. Another way of estimating the interpretive power of a linear model is the coefficient of determination, which is called R2. This coefficient measures which part of the variance of the dependent variable can interpret by independent variables Essentially, it is a simpler coefficient that measures the ability of a set of factors to interpret a phenomenon.
However, a regression model will have unit changes between the χ and y variables, where a single unit change in χ will coincide with a constant change in y. Taking the log of both variables will effectively change the case from a unit change to a percent change. This is especially important when using medium to large datasets, as happens in the current research. Usually, logarithmic transformation is a convenient means of transforming a highly skewed variable into a more normalized dataset. In theory, we want to produce the smallest error possible when making a prediction, while also considering that we should not be overfitting the model. Overfitting occurs when there are too many dependent variables in play that it does not have enough generalization of the dataset to make a valid prediction. By using the logarithm of one or more variables this can improve the fit of the model by transforming the distribution of the features to a more normally shaped bell curve. Based on the reasoning above, the second-order model was adopted as the following:
log(y) = βo + β1logχ1 + β2logχ2 + …+ βnlogχn + εi
Moreover, by taking the logarithm of both dependent variable and all independent variables and creating a log-log functional form can contribute to overcome the problem of non-linearity. A log-log function is suitable when a unit percentage change in one of the independent variables is expected to respond to a percentage change in dependent variable. Also, another problem that can occur in multiple regression analysis is that of the imperfect multicollinearity (Shrestha 2020). Perfect multicollinearity occurs when an independent variable is a perfect linear relationship of one or more independent variables and is something that can happen very rare. However, the occurrence of severe imperfect multicollinearity is more common (Mansfield and Helms 1982). When severe imperfect multicollinearity occurs, there is a linear functional relationship between two of more independent variables, which is so strong that the estimation of coefficients of the variables in the regression model is significantly affected (Haitovsky 1969). The VIF test examines the degree to which an independent variable can be explained by the other independent variables in the model. The VIF test reflects the degree to which multicollinearity has increased the variance of the estimated coefficient (Jou, Huang, and Cho 2014). If the VIF value range between 1-10, then there is no multicollinearity. On the other side, if the VIF <1 or VIF > 10, then there is the problem of multicollinearity (Dias Curto and Castro Pinto 2011; Jou et al. 2014). In the current study, results from VIF test that displayed in Table 3 shows no sign of multicollinearity in the regression model.
Table 3
VIF test showing no signs of multicollinearity
|
Collinearity Statistics
|
|
Tolerance
|
VIF
|
ESG score (Dependent Variable)
|
|
|
Policy Water Efficiency Score
|
0.485
|
2.063
|
Human Rights Score
|
0.848
|
1.179
|
Environmental Innovation Score
|
0.897
|
1.114
|
Management Score
|
0.92
|
1.087
|
Emissions Score
|
0.578
|
1.729
|
Workforce Score
|
0.66
|
1.516
|
Rersource Use Score
|
0.547
|
1.828
|
Policy Water Efficiency Score
|
0.511
|
1.959
|
Regards the interpretation of the results of the model, this can be given as an expected percentage change in y when χ increases by some percentage (Hinckson and Hopkins 2005). Such relationships where both y and χ are log-transformed are commonly referred to as elastic in econometrics and the coefficient of log χ is referred to as an elasticity (Kitali et al. 2018). So in terms of effects of changes in χ on y (both unlogged): (i) multiplying χ by e will multiply expected value of y by eβ and (ii) to get the proportional change in y associated with a p percent increase in χ calculate βo = log([100 +p]/100)and take eβοβ. These treated data sets yielded the following equation:
log (ESG Score) = 1.393 + 0.160 log(Resource Use Score) + 0.091 log(Emissions Score) + 0.069
(0.165) (0.017) (0.018) (0.007)
log(Environmental Innovation Score) + 0.194 log(Workforce Score) + 0.127 log(Human Rights
(0.021) (0.013)
Score) + 0.132 log(Management Score) + (-0.206) log(Policy Water Efficiency Score) + 0.103
(0.008) (0.044) (0.05)
log(Policy Environmental Supply Chain Score) + εi