Design and participants
Data from cross-sectional surveys carried out from 2014 to 2018 in four regions of Cameroon were used. The subjects were recruited in the city of Yaoundé (urban area, Center region) from December 2014 to April 2015, in the health district of Bandjoun (semi-urban and rural area, Western region) from November 2015 to April 2016, in the city of Douala (urban area, Coastal region) from November 2016 to April 2017, in the city of Garoua (urban area, North region) and in Figuil (semi-urban and rural area, North region) from December 2017 to April 2018. Ethical clearance was obtained from the institutional ethics committees of the Faculty of Medicine and Biomedical Sciences of the University of Yaoundé 1, and of the Faculty of Medicine and Pharmaceutical Sciences of the University of Douala.
The sampling method used in this study has been published elsewhere [14, 15]. In summary, a 3-level stratified sampling method was applied in each recruitment area. At the 1st level, the enumeration areas corresponding to those used for national immunization days were selected by random sampling. At the 2nd level, households were selected by systematic sampling with variable sampling intervals depending on the size of each enumeration area. At the last level, all subjects from households selected at the second level and meeting both the inclusion and exclusion criteria were invited to participate in the study.
Baseline data collection
Data was collected by final year medical student trained on the standardized questionnaire and the realization of spirometry. Demographic data including gender (male or female), age (calculated to the nearest month for children and adolescents) and ethnic group (Bantu, Sudano-Sahelian, mixed) were noted. Height and weight were measured for each subject and the body mass index (BMI) calculated as the ratio of the weight (kg) to the square of the height (m). "Healthy" subjects were selected using the American Thoracic Society (ATS) and National Heart and Lung -1978 respiratory questionnaire[16]. Subjects with the following conditions were excluded: recent respiratory symptoms (<1 month), history of respiratory disease that may interfere with lung function [asthma, chronic bronchitis, chronic obstructive pulmonary disease (COPD)], tuberculosis and any other chronic respiratory disease), cardiovascular disease (heart failure, angina pectoris, myocardial infarction, severe uncontrolled hypertension), diabetes mellitus, stroke, smokers and ex-smokers, treatment with beta-blockers or bronchodilators, obesity (BMI ≥ 30 kg/m2), underweight (BMI <18.5 kg/m2) and incorrect realization of the spirometric curves.
Measure of spirometric parameters
Spirometry was performed in patients meeting the above inclusion and exclusion criteria. The methods for producing the flow-volume curve were those recommended by the American Thoracic Society/European Respiratory Society (ATS/ERS) in 2005[17]. The spirometric tests were carried out using a turbine pneumotachograph complying with 1994-ATS standards including Spiro USB, Care fusion, Yorba Linda-USA or Spirobank II, MIR France, Langlade-France. The measurements were carried out under the supervision of a pulmonologist who regularly performs and interprets LFTs.
All spirometric measurements were obtained from patients after a minimum of 15 minutes rest in a seated position, their backs straight with a nose clip to allow air movement only through the mouth. Full instructions on the realization of the test were clearly explained to each participant before the maneuver was performed.
All measurements were automatically corrected for body temperature and saturation pressure. The acceptability and reproducibility criteria recommended by the ATS/ERS were used[17]. Three to eight maneuvers were performed by each subject for the realization of the forced vital capacity (FVC) curve, with a resting period of at least one minute between each maneuver. The spirometric indices selected were: forced expiratory volume in 1s (FEV1), forced vital capacity (FVC), FEV1/FVC ratio and forced mid-expiratory flow (FEF25-75%). The best FEV1 and FVC values among the three tests meeting the acceptability criteria were selected (maximum difference less than or equal to 5% or 150 ml compared to the other values). The maneuver with the highest FEV1 + FVC sum was kept for the derivation of the FEF25-75%.
Data analysis
Our data were analyzed using R software version 4.0.3 [18]. The baseline characteristics and the spirometric parameters were analyzed separately for the male and female subjects and according to the different age groups (children: 4-12 years, adolescents: 13-18 years and adults: 19-89 years). Qualitative variables were summarized in terms of counts and proportions. The quantitative variables were summarized by their mean (standard deviation), median (25th -75th percentiles) and range. Scatterplots and box-plot (after discretization of age in 2-year increments and height in 5-year increments) were used to graphically represent the relationship between the spirometric and anthropometric parameters without and after logarithmic transformation. These graphical representations showed that the relationship between all the spirometric parameters and the anthropometric parameters was not linear regardless of the age group. Drawing inspiration from the latest studies on normative spirometric values and the complex effects of anthropometric parameters (explanatory factors, independent variables) on spirometric parameters (dependent variables), prediction models were developed using the generalized additive models for location, scale and shape, LMS imbedded in GAMLSS. Spirometric parameters (FEV1, FVC, FEV1/FVC, FEF25-75%) can be characterized by their mean (location, M or mu), coefficient of variation (scale, S or sigma) and their skewness coefficient (shape, L or lambda). These characteristics are summarized by the acronym LMS. The prediction analysis were done by the gamlss package of the R software [19]. The complex effects of the explanatory variables on the dependent variable can thus be modeled in a smooth and non-linear way using splines and thus make it possible to obtain a smooth modeling over all age groups. We used the Box-Cox-Cole-Green distribution described by Cole et al. to estimate the best prediction model of the spirometric parameters while avoiding over-modeling [20]. Thus, the models with the LMS indices giving the smallest Schwarz Bayesian criterion (SBC) were selected for each spirometric parameter separately in male and female subjects. The general form of the equation for each spirometric parameter was of the form:
Y = a + b * height + c * age + spline (spline is an age-specific contribution from the spline function). The best models were obtained after the logarithmic transformation of the parameters, thus giving an equation of the final form: ln (Y) = a1 + a2 * ln (height) + a3 * ln (age) + spline or Y = exp (a1 + a2 * ln (height) + a3 * ln (age) + spline). For ages whose splines were not obtained directly from the table of spline values, a linear interpolation was carried out according to the formula:
Xspline (age) = [(age2 - age) * Xspline (age1) + (age - age1) * Xspline (age2)] / (age2 - age1); X: L, M or S; Xspline (age): spline corresponding to a given age, age2: upper limit of the interpolation age, age1: lower limit of the interpolation age. The lower limit of normal (LLN) corresponds to the 5th percentile of M. The splines and the corresponding coefficients as well as the formulas for calculating the predicted values were recorded in the lookup tables. The following formula was used to calculate deviation indices (% difference): % difference= (predicted parameter according to other references - predicted parameter in our equation)/ predicted parameter in our equation.