2.1 Data sources
Data used in this study was obtained from the China Drug Supply Information Platform (CDSIP) database. The CDSIP is a national drug database constructed and operated by the Statistical Information Center of the National Health Commission of the PRC, and was officially launched on October 22, 2015. Since then, health facilities upload information daily on ordering, storing and delivering, and medicines settlement to the CDSIP; the government can monitor prices, quantities, distribution and warehousing, and can organize medications purchased by medical institutions and strengthen management based on relevant information . Thus, the CDSIP database covered drug procurement order data of all provincial drug centralized procurement platforms from 31 provinces (autonomous regions and municipalities) in mainland China. Under the zero-markup drug policy in China [22], the drug purchase prices in public medical institutions are the same as the prices used by patients. Since 2015, it was required that all drugs used by public medical institutions should be purchased through the provincial-level drug centralized procurement platform [23]. Therefore, in mainland China, the drug purchase data of public medical institutions in the CDSIP database is generally consistent with the drug use data.
In the CDSIP database, each drug purchase order record included the name of the medical institution, purchase date, drug YPID (Yao Pin Identifier) code, drug generic name, dosage form, specification, conversion factor, pharmaceutical manufacturer, price per unit, purchasing unit (by box, bottle, or branch), purchase volume, purchase expenditures, etc. Details of the CDSIP database are available elsewhere [18,24]. A total of 7,647 drugs (by generic name) and 141,624 products (by YPID code) were contained in this national database.
2.2 Sample selection
The inclusion criteria of study samples was as follows: (a) The drug scope was “4+7” policy-related drugs, including 25 drugs (by generic name) in the “4+7” List and the alternative drugs (supplementary table 1). The alternative drugs referred to drugs that had an alternative relationship with “4+7” List drugs in clinical use, and was determined following the Monitoring Plan for the Pilot Work of National Centralized Drug Procurement and Use issued by the NHSA of the PRC [25]. The “4+7” List drugs were then divided into bid-winning products and non-winning products according to the "4+7" city centralized drug procurement bid-winning results announced by the Joint Procurement Office [10]. (b) The time period was 23 months from January 2018 to November 2019. (c) The scope of regions was pilot cities (i.e. pilot group) and non-pilot provinces (i.e. control group). The pilot group involved nine “4+7” pilot cities, including Beijing, Shanghai, Chongqing, Tianjin, Chengdu, Xi'an, Shenyang, Dalian, and Xiamen. For two (Guangzhou and Shenzhen) of the eleven “4+7” pilot cities that were not included in this study, their purchase order data in the CDSIP database was incomplete. The control group involved 12 provinces without the implementation of “4+7” pilot policy, including Hubei, Hunan, Guizhou, Inner Mongolia, Jilin, Heilongjiang, Anhui, Hainan, Gansu, Qinghai, Ningxia, and Xinjiang. (d) The scope of health facilities were all the public medical institutions in the included 9 pilot cities and 12 non-pilot provinces, and were divided into tertiary hospitals, secondary hospitals, and primary healthcare centers (PHCs). Purchase order records with incomplete information were excluded.
Aggregated monthly drug procurement records of 4663 public medical institutions from 9 pilot cities and 13,973 public medical institutions from 12 non-pilot provinces were retrospectively analysed (Table 1). A total of 108 policy-related drugs (by generic name) were included in this study, including 25 “4+7” List drugs and 83 alternative drugs. The flow chart of the sample selection process is shown in Figure 1.
Table 1. Distribution of sample medical institutions.
|
Pilot group
|
Control group
|
Overall
|
Tertiary
|
356
|
613
|
969
|
Secondary
|
488
|
1749
|
2237
|
Primary
|
3819
|
11611
|
15430
|
Total
|
4663
|
13973
|
18636
|
2.3 Outcome variables
This study employed the Drug Structure Index (DSI) as the main outcome variable. In 2002, Antonio Addis and Nicola Magrini of the Italian Centre for Effectiveness Evaluation of Health Services constructed a method for analyzing the drug cost variance factors, known as A.M. index method [26]. This method divided drug cost changes into three parts: quantity factor, price factor, and structural factor, in which the structural factor represents the concept of “price structure” and refers to the combined structure of drugs at different prices. Based on the classical A.M. index system method, previous scholars constructed a drug structure index system, and has been applied in relevant studies [27,28]. The DSI represents the changes in the combination of drugs with different Defined Daily Drug cost (DDDc) in the same drug group, showing the transfer of use from drugs with lower (higher) prices to drugs with higher (lower) prices [29]. DSI is calculated as follows:

In formula (1), Q represents the used quantity of drugs. The used quantity of each drug was measured based on its Defined Daily Dose (DDD), which is developed by WHO to compare drug consumptions and refers to the average maintenance dose per day for a drug used for its main indication in adults [30]. In this study, the DDD of the drugs which could not be coded in WHO’s ATC/DDD Index 2021 system was determined based on the dosage regimen recommended in the manufacturers’ instructions, as approved by China Food and Drug Administration. P represents the prices of drugs, which is expressed as DDDc. DDDc was calculated as drugs costs per unit DDD (DDDc=expenditures/DDDs).
The numbers of subscripts represent periods, with 0 representing the baseline period and 1 representing the reporting period. The higher the DSI, the higher the use proportion of drugs with higher DDDc in a “basket” of drugs. DSI>1 means that the use proportion of higher DDDc drugs increased in the reporting period compared with the baseline period; DSI<1 means that the use proportion of higher DDDc drugs decreased in the reporting period compared with the baseline period; DSI=1 means that the use proportion of higher DDDc drugs remains stable between reporting-period and baseline period.
This study observed the change of DSI in the pilot group and control group between the pre- and post-“4+7” policy periods. In this study, January to March 2018 was assigned as the baseline period, and April 2018 to November 2019 was assigned as reporting periods, involving a total of 20 reporting periods. Finally, the time series of DSI for policy-related drugs were constructed containing 20 monthly time points.
2.4 Statistical analysis
Descriptive analysis was used to describe the distribution of DSI in the pilot group and control group, pre- and post-“4+7” policy periods, different drug categories, and different levels of medical institutions. The line chart of DSI was drawn to describe the overall trend of drug structure changes from July 2018 to November 2019.
Difference-in-difference (DID) is a method commonly used for the quantitative effect evaluation of public policies or projects. By effectively combining “the difference before and after intervention” with “the difference with or without intervention”, this method to a certain extent can control the influence of some factors other than intervention, so as to estimate the net impacts of the intervention on the outcome variable [31-33]. In order to eliminate the net effect of “4+7” policy on the DSI change of policy-related drugs, we conducted DID models by using the DSI time series data in the pilot group and control group constructed above. The DID model is expressed as follows:

In formula (2), Y is the outcome variable, i.e. DSI in this study. Tt refers to “4+7” policy intervention with the value of 0 and 1, and 0 represents the pre-“4+7” policy period (from April 2018 to February 2019) and 1 represents the post-“4+7” policy period (from March 2019 to November 2019). Gt represents groups with the value of 0 and 1, and 0 represents the control group and 1 represents the pilot group. εt is the error term, representing random errors that cannot be explained by variables in the model. β0 represents the constant term. β1 estimates the change of DSI in the post-“4+7” policy period compared with the pre-“4+7” policy period. β2 estimates the change of DSI in the pilot group compared with the control group. β3 represents DID value between the Pilot Group and the Control Group, is the DID value, which represents the interaction item between intervention measures and groups, that is, the net impacts of NCDP policy on DSI. The relative change of DSI after “4+7” policy was expressed as β3/β0 [34]. In this study, we observed the monthly trends of outcome variable between pilot group and control group before the policy intervention, so as to verify if the DID model met the parallel trend conditions (supplementary figure 1) [35].
To further explore the change of DSI in different periods after the implementation of “4+7” policy, this study divided the post-“4+7” policy period into three stages: Period 1 (March to May 2019), Period 2 (June to August 2019), and Period 3 (September to November 2019). The net impact of “4+7” policy on the change of DSI was estimated in each of the three post-intervention periods compared with the pre-“4+7” policy period. Stata version 16.0 was used to perform the analyses above. A p-value <0.05 was considered statistically significant.