3.3. Analysis of Core Journal and Journal Distribution
The frequency of articles per journal shows how attractive it is for researchers to publish their research results in certain sources. The histogram of the distribution of articles published by journals is shown in Fig. 3a, embedded figure. The results show that a total of 75 different journals have published articles in the field. Most of the publications were concentrated in a few sources. Overall, 45 journals published one to two papers, and only eight journals published more than ten papers.
On the other hand, Bradford's law was used to identify core sources (Brookes, 1985). This law describes a quantitative relationship between the journals and the articles contained in a collection on a given subject and states that the production of articles in journals is established by a highly uneven distribution, where most of the articles are concentrated in a small population of journals, while a small proportion of articles are spread over a large number of journals, following a distribution proportional to a ratio of 1 (zone 1): n (zone 2): n2 (zone 3). where zone 1 is the number of core journals preferred by researchers and therefore most specific to a domain of knowledge. The results obtained by applying Bradford's law, Fig. 3a, to the article collection show that the "core zone" is composed of 3 sources with a total of 134 articles published, corresponding to 36.4% of the total number of articles; the second and third zones are composed of 11 and 64 sources, respectively.
The top 10 journals preferred by the authors were ranked according to the number of articles published (NP) in each journal (Fig. 3b), followed by the total local citations (TC) of the journals (Fig. 3c), the impact factor (IF, Fig. 3d) and the H-index (Fig. 3e) associated with the article collection. This group of journals includes, as expected, the core group that predicts Bradford's law (Fig. 3a). The ATMOS ENVIRON journal stands out with a total of 15.2% of published articles (NP: 56), making it the journal chosen by the majority of authors in this field. It is followed by ENVIRON SCI TECHNOL (NP: 40), SCI TOTAL ENVIRON (NP: 38) and ENVIRON POLLUT (NP: 23). In terms of NP, the top 10 journals present an average of 22 published articles (CI95%: 12 to 32) and an average of 14, their contribution representing almost 60% of the articles in the collection (Fig. 3b). The TC of this group of journals also stands out with an average of 707 citations (CI95%: 348 to 1066, Fig. 3b). It should be noted that an important aspect in the choice of the journal to be published is its IF. The top 10 journals have an average IF of 9 (CI95%: 7 to 11) and a mean of 10 (Fig. 3d). Regarding the H-index, the top 10 journals have an average of 13 (CI95%: 8 to 17) and a mean of 10 (Fig. 3e). The journal with the highest H-index is ATMOS ENVIRON (H-index: 25). The indexing categorization of the top 10 journals is mainly in the fields of "Environmental Sciences" and "Meteorology and Atmospheric Sciences", and all of them are in the top five categories. In general, given their scope, the journals have published a large number of articles on atmospheric pollution, which, together with their high citation counts and IF and H indices, makes them attractive to researchers in this field and makes them the reference sources in this area of research.
3.4. Most relevant authors and cocitation network
In this section, we present the authors who have contributed the most to the knowledge in the field. First, the productivity of the authors is assessed using Lotka's law of scientific productivity in Fig. 4a. Lotka's law considers that, regardless of the field analyzed, most of the production is concentrated in a limited number of authors (López-Fernández et al., 2016) and is described as the number of articles = C x (Nº Authors)-2, where C is an arbitrary constant. In the present study, the data adjusted for Lotka's law show a good fit (Chung et al., 1992). Therefore, we conclude that the results follow the inverse of Lotka's quadratic law (R2 = 0.9995). In our case, production is diversified, with many authors having only one publication (1243 out of 1724), representing 72.1% of total authors. The top 10 authors represent 0.58% of the total number of authors. This implies that they contribute at least 16 articles on the topic.
The 10 top authors by productivity are shown in Figs. 4b-d. They are presented in descending order according to the number of published articles (NP, Fig. 4b) and the number of local citations (LC, Fig. 4c), which measures how often an author included in this collection has been cited by articles also included in the collection. Finally, the H-index is presented (Fig. 4d). The results show that, considering the NP, Schauer JJ is the most prolific with a total of 36 published articles, followed by Sioutas C (NP: 30) and Kelly FJ (NP: 30). However, the predominance of authors is not uniform across the different indicators. Considering the publication impact in terms of citations (LC), Verma V is the most important author with 505 citations, followed by Schauer JJ (LC: 419), Weber RJ (LC: 364) and Cassee FR (LC: 357). In terms of the H-index, Schauer JJ is the top author (H-index: 25), followed by Sioutas C (H-index: 22) and Kelly FJ (H-index: 21).
Figure 4e shows the coauthorship network among the authors who have published on the topic studied. In coauthorship networks, the authors of the network are connected as nodes by coauthored scientific publications. The colors represent the working groups as clusters. Authors coauthoring more than one paper are grouped in the same cluster. In this case, there are clusters of 4 to 7 authors with a high level of coauthorship. For example, Verma V, Fang T, Gao D, and Weber RJ have a strong collaborative relationship, as shown by the orange cluster. Verma V seems to be the central author of this community of researchers. The topology of the collaborative network is as follows. There are groups with high collaboration (yellow, blue, and red clusters) and others that are more dispersed. This is an indication that there is not much collaborative activity among the other clusters or working groups, so there is a need for an increase in collaborative activity to advance research in this area.
3.7. Research hotspot analysis
Two types of analysis were performed for the global evaluation of OP-PM research hotspots. The first was based on the cooccurrence method between thematic WoS categories (Fig. 6a) and author keywords (Fig. 6b) of the article collection, with at least 3 interactions between them, grouped by the leading eigenvalue algorithm. The second analysis consisted of a detailed examination of each article in the collection, using a questionnaire to obtain and classify the information explicitly mentioned by the authors in relation to the main characteristics of the scope of the study, the objectives, the methods used, and the potential impact of the results obtained. The results are expressed in terms of the percentage of occurrence of each response. These percentages are weighted by the total number of articles in the collection. Both analyses allow us to understand the main characteristics of the research that has been conducted worldwide in relation to OP-PM and to identify possible research questions that have not yet been answered.
The results of analyzing the WoS subject categories indicate that the five main subject categories in which the articles in the collection were published correspond to the following (Fig. 6a): "Environmental Sciences", "Meteorology & Atmospheric Sciences", "Engineering, Environmental", "Public, Environmental & Occupational Health", and "Toxicology". The categories have been grouped into 4 clusters. These clusters can be characterized by the following topics: Cluster 1—Biological, chemical and health sciences (red cluster in Fig. 6a), this is the group with the largest number of items and includes mainly categories related to chemical and biological health sciences such as 'biochemistry & molecular biology', 'chemistry, multidisciplinary', 'nanoscience & nanotechnology', 'chemistry, medical', pharmacology & pharmacy, 'toxicology' and 'public, environmental & occupational health'; Cluster 2—Engineering (green cluster in Fig. 6a), this group includes categories such as 'Engineering, Chemical, 'Engineering, Mechanical' and 'Energy and Fuel'; Cluster 3—Environmental Science and Chemical Analysis (blue cluster in Fig. 6a), with categories such as Environmental Science, Meteorology and Atmospheric Science and Chemistry, Analytical. Finally, Cluster 4—Sustainable infrastructure (yellow color cluster in Fig. 6a) includes categories such as 'Construction & Building Technology', 'Engineering, Civil', and building technology, engineering, civil engineering, and engineering, environmental engineering.
However, the keyword analysis of the author shows that the five main keywords in the collection are particulate matter, oxidative potential, dithiothreitol assay, reactive oxygen species, and air pollution (Fig. 6b). Keywords have been grouped into 7 clusters. These clusters can be characterized as Cluster 1: Fuels and toxicological health effects (red color cluster in Fig. 6b), this group represents the cluster with the highest number of elements and includes keywords related to the emission of pollutants from the use of fuels such as "PM emission", "gasoline direct injection" or "diesel/biodiesel" and health effects such as "cytotoxicity", "genotoxicity", "toxicity", "inflammation", "free radicals" and "oxidative stress"; Cluster 2—Oxidative potential assays and spatial variability (green color cluster in Fig. 6b), correspond to the second largest group and includes keywords mainly related to assays to measure oxidative potential such as "ascorbic acid assay", "dithiothreitol assay", "esr" (electron spin resonance) or "epr" (electron paramagnetic resonance) and spatial distribution variation models such as "land use regression model" or "spatial variation"; Cluster 3—Biomass burning and its impacts (blue color cluster in Fig. 6b), includes keywords mainly related to biomass burning and its emissions such as "biomass burning", "source apportionment", "black carbon", and "PAH" (polycyclic aromatic hydrocarbons), and its impacts such as "air quality" and "health"; Cluster 4—The physical characterization of particulate matter (yellow color cluster in Fig. 6b), includes keywords related to the characterization of particulate matter, such as "chemical composition" and "physicochemical properties"; Cluster 5—Air pollution and respiratory health (purple color cluster in Fig. 6b), this group includes keywords such as "air pollution" and "asthma", on the other hand, it includes "antioxidants" and "metals", which are keywords that can be related through the mechanism of oxidative stress; Cluster 6—Oxidative potential due to exposure to particulate matter (cyan color cluster in Fig. 6b), including keywords such as "exposure", "particulate matter" and "oxidative potential". Finally, Cluster 7—Indoor pollution (orange color cluster in Fig. 6b) includes keywords related mainly to indoor or workplace pollution, such as "indoor air quality" and "office building".
The main findings from the application of the questionnaire to each of the articles collected are described below, as are the results of our analysis. Figures supporting this analysis and a brief description of the categories used are provided in the supplementary material (see Tables S1 – S12 and Figures S2 – S14):
Main characteristics of the scope of the study (see Table S3 and Figure S3 in the supplementary materials). The primary emphasis of the research encapsulated in the article collection centers on evaluating the OP by utilizing environmental PM samples, which constitute a significant 70.65% of the entire dataset, referred to here as "Ambient". Succeeding this, artificially created samples under regulated conditions, labeled "laboratory-generated samples", account for a substantial 27.72% of the research. Last, the analysis includes samples procured from enclosed residential or occupational settings, denoted "Indoor", and this forms 10.33% of the overall investigation. Together, these three aspects provide the structural foundation for field research. Recognizing the different proportions of these data sources is important, as it underscores the distinct importance of each in this field. The focus of the study on unraveling the OP in real-world situations is highlighted by the heavy reliance on environmental samples. In addition, laboratory and indoor study samples provide valuable controlled data and contextual information, respectively.
Objectives of the study (see Table S4 and Figure S4 in the supplementary materials). Regarding the objectives of the studies, the first and most common is to "analyze the causes of air pollution and/or oxidizing potential (OP)" (70.92%). It involves the evaluation and characterization of zones where air pollution, air quality, or OP is affected or driven by comparable parameters (e.g., emission sources, dispersion conditions, deposition, or atmospheric chemistry that explain a spatial or temporal variability of OP in a specific area) or emission sources. Delimitation of areas where air pollution/air quality or OP is influenced/triggered by similar parameters or emission sources (e.g., sources with similar temporal variations, meteorological factors). The second most common focus is on "exposure assessment" (38.32%), which often involves determining the amount, duration, and frequency of human exposure by employing both mass-based exposure estimates and various OP metrics. Finally, a significant number of studies focus on the "evaluation of OP measurement methods" (26.36%). This could involve comparing the reliability and accuracy of different measurement techniques or establishing new methodologies to improve the accuracy of OP measurement.
Methods used, OP assays and chemical characterization (see Tables S5 – S8 and Figures S5 – S8 in the supplementary materials). In terms of the type of OP assay used, acellular assays have emerged as the predominant method used by researchers in article collections, accounting for 86.14% of all tests, significantly outweighing the use of cellular assays, which accounts for only 21.47%. The five most commonly used OP assays, listed in descending order of prevalence, include the dithiothreitol assay (DTT: 57.07%), the ascorbic acid assay (AA: 24.73%), the dichlorofluorescein assay (DFCH: 21.20%), the reduced glutathione assay (GSH: 14.40%), and the electron paramagnetic/spin resonance assay (EPR/ESR: 11.14%). This preference for acellular assays is not surprising given their distinct advantages over cellular assays. They are more efficient and cost-effective because they provide rapid results, require less stringent environmental conditions, and lend themselves to automation. However, despite the current dominance of acellular assays, research is beginning to shift toward the use of in vivo models. This transition is primarily driven by an increased understanding of the biological mechanisms related to PM. By authentically replicating complex biological interactions, in vivo models provide a more holistic and accurate assessment of the effects of PM.
All studies quantify the concentration of various air pollutants, such as PM or others (with 100% "concentration"), and a significant proportion evaluate their chemical composition (85.05%) as a complementary assessment method to OP. Toxicity is of tertiary concern, with a total toxicity of 38.86%. In general, these studies are aimed at correlating OP with chemical or physical parameters of the concentration of PM and/or gases, meteorological variables, analysis of emission sources, and others. However, of the large number of pollutants or variables that have been prominently featured in OP studies, five stand out due to their relative prevalence. These include PM, which tops the list with an overwhelming 89.67% representation. Next, in descending order, are "Elemental Composition" with 61.14% and "Carbon Compounds" with a substantial frequency of 51.36%. Further down the list, "Anions and Cations" appear with 30.16%. Finally, the "Biological Endpoint" completes our list with 25.27%. Each of these elements shows a significant presence within this environmental context. This underscores its relevance in the overall analytical framework and in understanding the drivers of OP.
PM samples Geographical origin and spatiotemporal scales of the studies (see Tables S9 – S11 and Figures S9 – S13 in supplementary materials). According to the results described in Section 3.2, the three main continents are Europe with 35.05% of the samples, Asia with 32.61% and North America with 31.79% of the samples, regarding the geographical origin of the PM samples. Among them, the United States, China, and Italy stand out, representing 24.73%, 21.20% and 11.14% of the total sample, respectively. This suggests that there is a growing interest in this area of research, especially in developed countries, compared to other geographical areas. In addition, an analysis of the emission sources that contribute to the oxidative potential (OP) emerges, and five major contributors emerge. First and foremost, "traffic" emerges as the most significant, accounting for 61.41% of OP. This is followed by "domestic heating - wood burning" with 26.09%. This shows the influence of domestic activities on the OP. The category "industrial and commercial emissions" accounts for 22.28%. This underlines the role of the industrial emission source in the OP. Interestingly, "unspecified biomass burning" accounts for 21.47% of the OP. This suggests a significant influence of uncharacterized biomass burning. Finally, indicating the significant role of the environment itself in PM emissions and impacts, "natural sources" contribute 18.75%.
Regarding the spatiotemporal variation of the studies, we categorize the temporal scales from most extended to least. We find that 13.59% of the studies cover "More than a year – 1 year", 33.42% cover "Year - Season", 16.03% are conducted over "3 months − 1 month", and 11.41% range from "three weeks to a few hours". For the spatial scale, ordered from the most extensive to the least extensive, we find that 0.82% of the studies are conducted at a "global" scale, 5.43% at a "regional" scale, a substantial 59.51% at a "local" scale, and 34.78% at a "micro" scale. By systematically categorizing temporal and spatial scales, this observation provides a clear picture of how studies are distributed across different spatiotemporal scales.
The geographic origin and spatiotemporal expansion of PM sample studies is important in understanding the relationship between composition, temporal variability, and oxidative potential (OP), which varies significantly from region to region. These scales also suggest that the impact and implications of OP can differ markedly depending on both the time frame and the spatial context, reinforcing the need for multidimensional research approaches. Therefore, insightful information on pollutant sources can be obtained by understanding the geographical origin and providing the ability to assess the long- and short-term effects of the source and composition of PM in the OP.
Potential impact of the study (see Table S12 and Figure S14 in supplementary materials): The study primarily reveals valuable, application-oriented perspectives in the health sector, a component here referred to as "human health," which accounts for an estimated 75.27% of the study's impacts. These results underscore a substantial correlation of OP with health outcomes on the usefulness of the exposure metric, over mass concentration. Subsequently, the regulatory and legal sphere, referred to as "regulatory and legal", gains substantial impact, comprising approximately 25.27% of the potential ramifications. This facet relates to the usefulness of OP as a surrogate measure for PM exposure in the formulation and implementation of mitigation strategies for the reduction in PM pollution. It also promotes the establishment of air quality standards that are more health-based, as opposed to being based solely on physicochemical parameters such as mass concentration. Finally, we address the elucidation of the intrinsic mechanisms associated with the OP metric, an area referred to as "understanding the OP mechanism", which accounts for approximately 14.67% of the research implications. These insights are related to understanding the driving factors of OP in relation to the chemical-physical characteristics of PM, including emission sources, chemical composition and size, as well as meteorological and orographic variables related to the dynamics and transport of pollutants.
The results of these studies deeply emphasize the potential implications of OP in a variety of sectors. The breadth of the scope of OPs and their implications in these areas imply a complex interweaving of diverse disciplines. This underscores the essential role of interdisciplinary collaboration and understanding in the pursuit of a deeper understanding and application of the results of OP research in the real world.