In this article we describe the network analysis method to analyze SUS patients’ intermunicipal journeys for healthcare services. Similar analyses had been conducted in other studies [15, 16], however, these studies focus on analysis at the State level. The difference of this article is the application of the network analysis method at the municipality level, with a detailed description of the method, so that it can be replicated by healthcare managers and other professionals working with healthcare data. After all, the positive impacts observed in State and National data is a result of the organization of the healthcare system in the municipalities. This method can be used for different types of healthcare systems.
To detail the proposed method, the flowchart (Figure 1) describes all the steps followed in this study.
2.1 Definition of the network of interest
There are various databases in Brazil that form the Healthcare Information System (Sistema de Informação em Saúde - SIS). This variety of databases, such as the Hospital Information System (Sistema de Informação Hospitalar - SIH), Ambulatorial Information System (Sistema de Informação Ambulatorial - SIA), Minimum Data Set (Conjunto Mínimo de Dados - CMD), among others, can be used in network analyses.
First, it is necessary to define which healthcare service to analyze, because the ideal spatial distribution of the healthcare units in the territory may be different according to the type of service under study. It is also important to know if the information system used has the necessary data to generate a network. The system must have at least the municipality of origin and the destination of SUS patients.
To demonstrate the application of the network theory in the present study, the journeys for intermunicipal cardiac hospitalizations were analyzed, thus representing a high complexity modality of hospital care. Cardiac problems deserve emphasis because of the relevance of cardiovascular disease in the national scenario, considering that it is the main cause of death in Brazil [17-20] and the cause of hospitalizations with the highest expenses. To demonstrate the cardiovascular surgery network [17], the municipality of Vitória da Conquista, Bahia, Brazil, was selected. This municipality is located in the Southwest macroregion of the state of Bahia (Figure 2), being the municipality of reference of the macroregion that showed the highest decrease in the number of cardiovascular surgeries as well as the lowest proportion of municipalities without records about cardiovascular surgery.
2.2 Data collection
To construct the networks, it is necessary to prepare two types of files, one for the arcs, and other for the vertices. The file with the arcs contains the patients' journeys, from the municipality of origin to the destination in which the procedure would be performed. The file with information about the vertices contains the spatial locations of the municipalities in the territory that will be analyzed.
2.2.1 SIH / List of hospitalizations
To create the file containing the arcs, data from the Hospital Information System (SIH/SUS) were used. The SIH/SUS is the system in which the records of the patients hospitalized in the units that are part of SUS (public or associated units) are processed. These units send hospitalization information collected through the Hospitalization Authorization Form (Autorização de Internação Hospitalar - AIH). After data is sent to Datasus, SUS Department of Informatics, this information becomes part of the national database, for further dissemination [8]. These files about the AIH are generated monthly by Brazilian municipalities. In the present study the files from Bahia (RDBA) from 2008 to 2020 were downloaded. The downloaded files are organized in the format RDBAaamm.dbc, in which “aa” is the year of the file and “mm” is the month. The variables “municipality of origin” and “destination” are some of the information available in the hospitalization list generated by the SIH/SUS, which are necessary to analyze the flow of patients.
2.2.2 INDE / Coordinates
Data of the National Infrastructure of Spatial Data (Infraestrutura Nacional de Dados Espaciais[1] - INDE) were used for the creation of the file containing the information about the network vertices. This web page makes geospatial data available through a network of servers connected to the internet. This web page provides files of type “shape” for download of the cities in the 417 municipalities in Bahia. To do so, the option “Geo serviços” (Geo services) was selected, then the INDE visualizer (VINDE), then “Adicionar camadas” (Add layers) in the option “Temas” (Themes). In this stage, two maps were selected: one to get the location of the city of Salvador and the other to get the locations of the cities in Bahia municipalities. The map containing Brazilian capitals (BCIM Capital - Ponto) was used to get Salvador’s location and the map of cities (BCIM Cidade - Ponto) was used to get the locations of the other cities in Bahia municipalities.
2.3 Data organization
2.3.1 List of municipalities of origin and destinations
The intermunicipal journeys by SUS patients were organized as matrices of municipality of origin and destination. This matrix is analogous to the matrix of migration, or cost matrix, used to represent a graph/network. In this matrix, the municipalities in Bahia were organized in a way so that the lines contained the municipalities of residence (municipalities of origin) and the columns contained the municipalities of occurrence of the group of procedures under study. The matrices of origin and destination were constructed with the software Tabwin [21].
To generate the matrix of cardiovascular surgery, SUS codes of procedures listed in the Covenanted and Integrated Plan (Programação Pactuada e Integrada - PPI) about cardiovascular surgery were filtered, in the intermunicipal hospital plan of high complexity [22], along with the municipalities of residence and occurrence in Bahia. To visualize the temporal evolution of the municipalities’ intermunicipal networks, a matrix of origin and destination was generated for each year. Subsequently, these data were reorganized so that they could be imported in a network analysis tool. In this study, the tool used was Gephi. Each matrix of origin and destination generated another file, here called list of origin and destination, containing the columns origin, destination and weight. In this newly generated file, the origin column is the municipality of residence, the destiny column is the municipality in which the procedure was conducted, and the weight column represents the number of people that traveled from the municipality of residence to the municipality of the procedure.
2.3.2 Coordinates in Lat/Long and UTM
The software ArcGis was used to reorganize the data for the construction of the file with the vertices of the network. The file with the vertices was generated to contain two different formats of coordinates: (1) latitude and longitude that must be in decimal degrees (lat/long), and (2) coordinates in UTM (Universal Transverse Mercator). To do the spatialization of the municipalities’ cities in Gephi, the coordinates used were in lat/long, and to calculate the mean length of the arcs it is important that the coordinates used are in UTM.
2.4 Construction of the networks
With the data organized, two types of report were generated with the software Gephi and R.
With Gephi, the spatial distributions of the networks were generated and the network indices were calculated, which in this study were called migration indices, as they are linked to people's journeys. The two migration indices generated were in-degree and out-degree. With R, four statistical indices were generated, namely: incoming flow, outgoing flow, mean length of the incoming arcs and mean length of the outgoing arcs. Based on the indices generated from the networks’ spatial distributions, it is possible to characterize the hospitalization network of a municipality and to know how many people and from how many different municipalities people are looking for healthcare services in the municipality under analysis. On the other hand, it is also possible to verify how many people are traveling out of the municipality, the municipality from which they are traveling and the number of different municipalities to which they are traveling for procedures.
2.5 Analyses of the networks
The intermunicipal journeys organized as networks can be considered as a directed graph. A graph is a structure 𝐺=(𝑉,𝐴) in which 𝑉 is the set of vertices or nodes and 𝐴 is the set of arcs of the network, considering that the networks in this study are directed [9].
The indices used in the analyses of the networks in this study were: in-degree, out-degree, incoming flow, outgoing flow, mean length of incoming arcs and mean length of outgoing arcs [15, 16].
In the present study, the in-degree index is the number of arcs coming into a vertex (𝑘𝐼). In this study, the interpretation of this index is similar to the immigration of patients to a given municipality. The municipality in which the procedure is conducted is the reference, thus this index quantifies the number of different municipalities from which people traveled in search for a procedure in the municipality under analysis. The out-degree is the number or arcs going outwards of a vertex (𝑘𝑂), in this study its interpretation is similar to the emigration of patients, representing the municipality of residence and thus quantifies the number of different municipalities to which people travel in search of the procedure. The incoming flow is the weight of the incoming arcs of a vertex (𝐹𝐼). The interpretation of this index in this study is the number of people that arrive at a municipality for the procedure in question. The outgoing flow is the weight of the outgoing arcs (𝐹𝑂), and in this study it measures the number of people that traveled out of their municipality of residence for a procedure in another municipality.
The mean length of the outgoing arcs, initially presented by Sousa et al. (2017) [15] and used in Sousa et al. (2020) [16] is represented by Equation 01:
in which 𝑘𝑂 is the out-degree, 𝐷𝑎 is the distance of the journey, in kilometers, from the municipality of residence to the destination municipality, and 𝐹𝑎 is the number of people that traveled.
The equation above represents the weighted average of the distances between the municipalities in which outgoing flow of patients were observed [15]. It is important to emphasize that this index considers the straight-line distance, in other words, the shortest distance between two municipalities, with the possibility of longer real
distances depending on the road layout and the means of transportation used. Notwithstanding, considering the processes of analysis and decision making, this index turned out to be appropriate to measure the distances that citizens are traveling to have access to the healthcare service in question.
Finally, the mean length of the incoming arcs, whose value represents the weighted average of the distances between the municipalities in which incoming flow of people for the procedure under analysis was observed, is similar to Equation 01, replacing the out-degree (𝑘𝑂) with the in-degree (𝑘𝐼).