4.1) The dataset
This research uses the dataset provided by the GII report 2019 (Dutta et al., 2019), which consists of 80 indicators (53 inputs and 27 outputs) collected from 129 countries. The data is available only in PDF format, so we had to transfer it into an MS-Excel sheet manually. It is important to acknowledge that the reliability of the data combines with the element of being presented by well-known international agencies (WIPO, INSEAD, Cornell University) and recognized by institutes and established bodies such as the UN Economic and Social Council as a reliable benchmark for measuring innovation (WIPO, 2021). Furthermore, GII indicators are derived from distinguished socio-economic data such as government effectiveness, tertiary education enrolment, global R&D companies’ expenditure, and ICT usage.
When analyzing the data of the 2019 GII, there are only sixteen indicators without missing data for all countries. The remaining indicators have different percentages of missing data points, where the highest percentage of missing data for one indicator (High-tech net exports) is 51.2%. Among the 64 indicators with missing data, 27 indicators have less than 5% missing data points, 11 indicators have 5–10% missing data points, 12 indicators have between 10–20% missing data points, 7 indicators have 20–30% missing data points, and the remaining 7 indicators have more than 30% missing data points.
To deal with the missing data, several steps have been taken. Firstly, the countries have been divided into four groups based on the level of income: High-, Upper-middle-, Lower-middle-, and Low-income according to the World Bank (Dutta et al., 2019). Following this, the mean of each indicator was calculated for each group. Secondly, the countries have been divided into four groups again based on the Human Development Index (UNDP, 2019). Following this, the mean of each indicator was calculated for each group. Thirdly, each country was grouped with the five nearest neighboring countries, then the mean of each indicator was calculated for each group. Finally, the mean of the three means for each indicator was taken as an estimation for the missing data point. However, the previous steps did not solve the problem entirely, leaving the data with 40 missing data points, which were finally imputed by linear regression modeling.
4.2) Data Envelopment Analysis
DEA is a linear nonparametric programming model developed by Charnes et al. (1978) in the field of operations research, which is often referred to as the CCR Model. The idea behind this model is to evaluate the relative efficiency for homogenous Decision-Making Units (DMUs) (i.e., companies, banks, universities, countries, etc.) by giving them scores between 0 and 1. Specifically, DMUs with a score = 1 are considered as “efficient” and DMUs with a score < 1 are considered as “inefficient”. In the 1980s, Banker et al. (1984) extended the method to develop a model that deals with multiple inputs and several outputs, which the literature refers to as the BCC-Model. DEA relies on a frontier created from the observed DMUs by utilizing the so-called best-practice, based on the minimum extrapolation principle (Thanassoulis, 2001).
Shen et al.(2013) conclude that DEA provides several features to the field of CIs such as: (1) it is a means to combine multiple indicators for countries without any prior awareness about the tradeoffs (i.e., the weights), (2) the country itself obtains its own best possible indicators weights, (3) if a country is underperforming compared to other countries, this cannot be attributed to the unfair weighting scheme, since every country has been put in its most beneficial position vis a vis all the other countries, (4) and any other weighting scheme would have generated lower weighting scores for that particular country. Additionally, DEA evaluates the relative efficiency of every country, taking into consideration the performance of all other countries (Cherchye et al., 2008). For the above-mentioned features, DEA has been broadly utilized to examine CIs to name but a few: Technology Achievement Index (Cherchye et al., 2008), the Macro-economic Performance Index (Ramanathan, 2006), the Human Development Index (Despotis, 2005) and the Knowledge Economy composite indicator (Guaita Martínez et al., 2021).
In the context of CI construction, the literature has broadly suggested an adjustment for the classic DEA formulation by considering all the indicators to be treated as outputs (Hermans et al., 2008; Cherchye et al., 2008; Martin et al., 2017; Guaita Martínez et al., 2021). This adjustment is known as the “Benefit of doubt” approach (Cherchye et al., 2008), and it shifts all input variables to become outputs, compromising the inputs with a dummy variable equal to one. It was initially adopted by Melyn & Moesen (1991) as a method to construct CIs to evaluate macroeconomic performance. This approach is to be considered if the underlying structure of the evaluated composite phenomenon is not definitive or if there is disagreement regarding the construction methodology, or if the input indicators are considered to be “achievements” (Cherchye et al., 2007). All these concerns are valid for any CI that endeavors to measure innovation performance. For example, Crespo & Crespo (2016) by applying a fuzzy-set qualitative comparative analysis conclude that none of the GII input pillars is a necessary condition for anticipating high innovation performance. Meanwhile, in the high-income countries, only two of the pillars (Infrastructure and Human capital and research) are sufficient to secure better innovation performance. Over and above, Jankowska et al. (2017), Edquist et al. (2018), and Barbero et al. (2021) among others, evidence that the common assumption that the higher GII input indicators, the higher GII output indicators is not confirmed.
For the above-stated, this paper relies on the DEA, using the benefit of doubt approach (see Eq. 1), with one dummy input variable equal to 1, and 21 output variables. Particularly, these 21 output variables will be generated by considering the linear combination of the indicators under each sub-pillar of the GII (i.e.
\({V}_{q}=\left({u}_{1}. {I}_{1}+{u}_{2}. {I}_{2}+\dots +{u}_{n}. {I}_{n}\right)\) , \({I}_{n}\)= indicators under sub-pillar \({V}_{q}\), \(q= (1,\dots ,21)\), \(n=\) number of indictors under each sub-pillar, \({u}_{n}\)= the weight generated for indicator \(n\) by using the PCA one component loadings for the sub-pillar that the indicator belongs to. Eventually, the linear combination of indicators under the sub-pillar Political environment will produce variable 1, and the linear combination of indicators under the sub-pillar Regulatory environment will produce variable 2, etc. (See Table 1). This PCA-DEA approach has been introduced by (Adler & Yazhemsky, 2010).
Where \({I}_{qc}\) is the normalized value of the \(q\)th individual variable \((q=1,\dots ,Q)\) for the country \(c\) \((c=1,\dots M)\) and \({w}_{qc}\) the corresponding weight (Cherchye et al., 2004). Whilst \({I}^{*}\) is the “benchmark performance” (i.e., the hypothetical country that maximizes the overall performance (OECD, 2008)).
However, due to the nature of the DEA, all efficient countries will obtain the same efficiency score equal to one (i.e., DMUs that lay at the frontier). Consequently, at least for these countries, the ultimate desired ranking will not be entirely discriminating. This limitation of DEA is known in the literature as the “discrimination power problem” (Adler & Yazhemsky, 2010; Barbero et al., 2021; Hatefi & Torabi, 2010). To address this problem, a sequence of sub-DEAs will be executed over the efficient countries only, by dividing the output variables for these countries into subsets according to the GII pillars. For example, the first sub-DEA will be performed over the output variables: Political environment, Regulatory environment, and Business environment, with a dummy input equal to 1. This will be repeated seven times for the seven pillars. Finally, the total efficiency score for each country will be the average of the seven sub-DEAs scores.
Table 1
Sub-pillar
|
Weight
|
Variable
|
|
Sub-pillar
|
Weight
|
Variable
|
Political environment
|
|
V1
|
|
Trade, competition, & market scale
|
|
V12
|
Political and operational stability
|
0.969
|
|
Applied tariff rate, weighted avg.
|
0.716
|
Government effectiveness
|
0.969
|
|
Intensity of local competition
|
0.809
|
|
|
|
Domestic market scale, bn PPP$
|
0.552
|
Regulatory environment
|
|
V2
|
|
|
|
|
Regulatory quality
|
0.964
|
|
Knowledge workers
|
|
V13
|
Rule of law
|
0.958
|
|
Knowledge-intensive employment, %
|
0.914
|
Cost of redundancy dismissal, salary
|
0.478
|
|
Firms offering formal training, % firms
|
0.332
|
|
|
|
|
GERD performed by business, % GDP
|
0.817
|
Business environment
|
|
V3
|
|
GERD financed by business, %
|
0.831
|
Ease of starting a business
|
0.842
|
|
Females employed w/advanced degrees
|
0.883
|
Ease of resolving insolvency
|
0.842
|
|
Innovation linkages
|
|
V14
|
|
|
|
University/industry research collaboration
|
0.92
|
Education
|
|
|
|
State of cluster development
|
0.873
|
Expenditure on education, % GDP
|
0.488
|
V4
|
|
GERD financed by abroad,
|
0.098
|
Government funding/pupil, secondary
|
0.453
|
|
JV-strategic alliance deals/bn PPP$ GDP
|
0.624
|
School life expectancy, years
|
0.894
|
|
Patent families 2 + offices/bn PPP$ GDP
|
0.76
|
PISA scales in reading, maths, & science
|
0.881
|
|
Knowledge absorption
|
|
V15
|
Pupil-teacher ratio, secondary
|
0.761
|
|
Intellectual property payments
|
0.718
|
Tertiary education
|
|
V5
|
|
High-tech imports, % total trade
|
0.343
|
Tertiary enrolment, % gross
|
0.807
|
|
ICT services imports, % total trade
|
0.514
|
Graduates in science & engineering, %
|
0.725
|
|
FDI net inflows, % GDP
|
0.736
|
Tertiary inbound mobility, %
|
0.437
|
|
Research talent, % in business enterprise
|
0.623
|
Research & development
|
|
V6
|
|
Knowledge creation
|
|
V16
|
Researchers, FTE/mn pop
|
0.915
|
|
Patents by origin/bn PPP$ GDP
|
0.82
|
Gross expenditure on R&D, %
|
0.942
|
|
PCT patents by origin/bn PPP$ GDP
|
0.871
|
GDP Global R&D companies, avg. exp. top 3
|
0.927
|
|
Utility models by origin/bn PPP$ GDP
|
0.325
|
QS university ranking, average score top 3
|
0.881
|
|
Scientific & technical articles/bn PPP$ GDP
|
0.559
|
|
|
|
Citable documents H-index
|
0.798
|
(ICTs)
|
|
V7
|
|
|
|
ICT access
|
0.914
|
|
Knowledge impact
|
|
V17
|
ICT use
|
0.917
|
|
Growth rate of PPP$ GDP/worker
|
0.074
|
Government’s online service
|
0.928
|
|
New businesses/th pop. 15–64
|
0.409
|
E-participation
|
0.918
|
|
Computer software spending, % GDP
|
0.71
|
|
|
|
ISO 9001 quality certificates/bn PPP$ GDP
|
0.691
|
General infrastructure
|
|
V8
|
|
High- & medium-high-tech manufactures,
|
0.801
|
Electricity output, kWh/mn pop
|
0.849
|
|
Knowledge diffusion
|
|
V18
|
Logistics performance
|
0.853
|
|
Intellectual property receipts
|
0.731
|
Gross capital formation, % GDP
|
0.117
|
|
High-tech net exports, % total trade
|
0.377
|
|
|
|
ICT services exports, % total trade
|
0.697
|
Ecological sustainability
|
|
V9
|
|
FDI net outflows, % GDP
|
0.776
|
GDP/unit of energy use
|
0.646
|
|
|
|
Environmental performance
|
0.885
|
|
Intangible assets
|
|
V19
|
ISO 14001 environmental certificates
|
0.692
|
|
Trademarks by origin/bn PPP$ GDP
|
0.496
|
|
|
|
Industrial designs by origin/bn PPP$ GDP
|
0.553
|
Credit
|
|
V10
|
|
ICTs & business model creation
|
0.899
|
Ease of getting credit
|
0.726
|
|
ICTs & organizational model creation
|
0.881
|
Domestic credit to private sector, % GDP
|
0.77
|
|
|
|
Microfinance gross loans, % GDP
|
0.257
|
|
Creative goods & services
|
|
V20
|
|
|
|
Cultural & creative services exports
|
0.834
|
Investment
|
|
V11
|
|
National feature films/mn pop
|
0.607
|
Ease of protecting minority investors
|
0.645
|
|
Entertainment & Media market/th pop
|
0.621
|
Market capitalization, % GDP
|
0.61
|
|
Printing & other media, % manufacturing
|
0.63
|
Venture capital deals/bn PPP$ GDP
|
0.738
|
|
Creative goods exports, % total trade
|
0.082
|
|
|
|
|
Online creativity
|
0.816
|
V21
|
|
|
|
|
Generic top-level domains (TLDs)/th pop
|
0.792
|
|
|
|
|
Country-code TLDs/th pop
|
0.878
|
|
|
|
|
Wikipedia edits/mn pop
|
0.709
|
|
|
|
|
Mobile app creation/bn PPP$ GDP
|
0.816
|
4.3) Random Forests
RF is a non-parametric supervised learning statistical method, introduced by Breiman (2001). It has been proved to be a reliable method for classification problems (Hastie et al., 2009; Hamidi & Berrado, 2018). RF develops a random bootstrap of a set of data, performing multiple decision trees according to identified features (variables), eventually by so-called ‘Bagging’ to vote for the best classification (Hastie et al., 2009). The relationship between CIs and RF emerged recently in the fields of Data mining and Machine learning, to examine the robustness of the classifiers (Setiawan et al., 2019). In this paper, the idea behind the use of RF is to assure the robustness of the PCA-DEA results, by using the 21 variables in Table 1 to classify the countries and see to what level this classification matches the PCA-DEA results. Another quality RF can provide, is the ability to assess the ‘importance’ of every variable in the production of the classification (i.e., what are the variables that played an effective role in the classification of the countries?).
Before running the RF, all countries have been divided into three groups: (1) Countries in the top quartile of the PCA-DEA ranking and labelled as ‘Efficient’. (2) Countries in the lowest quartile of the PCA-DEA ranking and labelled as ‘Highly inefficient’. (3) Countries in the two remaining middle quartiles labelled as ‘Inefficient”.