Study Area
The site chosen for the application of the model is the main headwaters of the Barigui River basin, which is in the municipalities of Almirante Tamandaré and Colombo, in the State of Paraná, in the southern region of Brazil. The climate in the region is classified as Humid Subtropical (Cfb) in accordance with the Köppen classification system, without any well-defined dry season and a mild summer. The average annual rainfall is around 1,500 mm and the average temperature ranges from 16.1ºC to 18ºC (Froehner & Martins, 2008; Kozak, 2020).
The soil occupation in this part of the basin is predominantly rural with some agriculture and dense vegetation, although there are a few scattered urban communities. The main crops that are grown in the two municipalities are black beans and maize. Regarding the index of services provided through the sewage and drainage systems, according to the SNIS [National Sanitation Information System] of 2020 in Almirante Tamandaré, the coverage is approximately 58% and 74% in Colombo. However, according to the data provided by IBGE [Brazilian Institute of Geography and Statistics] and SNIS there is also a large treatment plant for individuals, which in 2010 was around 44% of the municipality of Almirante Tamandaré and 23% of the population of Colombo had septic tank sludge in their dwellings (IBGE, 2012; SNIS, 2021). Figure 4 and Table 1 show the location of the study area and the collection points.
Table 1
Location and drainage area of the collection points
Collection Point
|
x(m)
|
y (m)
|
Drainage Area (km2)
|
P1
|
671,633
|
7,200,115
|
43.08
|
P2
|
671,559
|
7,199,321
|
58.43
|
Source: Authors' elaboration
When obtaining input data from the model, water samples were collected along a stretch between two points of the main river of the Barigui River basin, in five sampling sessions (C1 to C5) on the following dates: 04/16 (autumn), 07/09 (winter) and 11/05 (spring) in 2018 and 02/18 (summer) and 05/20 (autumn) of 2019 to allow an assessment of the seasonal effects on organic matter. The climatic conditions, temperature and rainfall, on the dates of the collection are summarized at Figure 5. The red lines indicate the sampling dates.
The first collection (C1) was conducted in a period soon after a period of mild rainfall after a short dry season. The average temperature on the collection day was 15ºC, although on the preceding days it was higher and reached an average of 20ºC. C2 took place on a day when there was a sharp fall in temperature and the average on the preceding days was above 15ºC. It was also a period of drought when there had been little rainfall in the previous months.
In contrast, C3 took place after heavy rainfall followed by a short dry period. The average temperatures in the period shortly before the collection ranged from 15ºC to 20ºC. C4 was conducted soon after a rainfall of about 2 mm in a period of high temperatures and in the preceding days, the temperatures were above 20ºC. Finally, C5 took place in a period of constant rainfall but it was not very heavy, and the average temperatures ranged from 15ºC to 20ºC. All the collections took place during the morning. Table 2 shows the temperature data (average over a period of 3 days) and rainfall (total) for the 36h preceding the collections.
Table 2
Average Temperature and Total Precipitation in the previous 36 h before collection campaigns
Campaign
|
Average Temperature 36h (°C)
|
Total Precipitation 36h (mm)
|
C1
|
17.4
|
7.20
|
C2
|
17.3
|
0.00
|
C3
|
17.4
|
7.40
|
C4
|
20.5
|
60.40
|
C5
|
15.0
|
22.60
|
Source: National Institute of Meteorology (INMET), 2022
It is necessary to obtain the flow from the loading points at the time of the collection to calculate the load of organic matter at the collection points. However, there is only a fluviometric station at the P2 point. In view of this, the flow at this point was obtained from the depth of the river and its key curve. as shown in Equation 5 (Kozak, 2020).
where Q is the flow in m3/s and h the depth in cm. Following this, the flow at the P1 point was obtained by regularizing the flow conducted by means of the specific flow method. Table 3 displays the values at the level of the reference-point obtained from the data of the sampling sessions that were conducted.
Table 3
Reference-point levels at the time of collection
P2
|
C1
|
C2
|
C3
|
C4
|
C5
|
h (cm)
|
12.5
|
2
|
8
|
13
|
8
|
Source: Authors' elaboration with data from field
Based on the data obtained at the level of P2, the flows were calculated for this point by means of Equation 5, while the flows were calculated at Point P1 in a proportional way to their contributing areas, as was noted in Table 4.
Table 4
Flow rate at the time of collection
Collection Point
|
Drainage Area (km2)
|
C1 Flow (m3/s)
|
C2 Flow (m3/s)
|
C3 Flow (m3/s)
|
C4 Flow (m3/s)
|
C5 Flow (m3/s)
|
P1
|
43.08
|
0.41
|
0.19
|
0.31
|
0.42
|
0.31
|
P2
|
58.43
|
0.56
|
0.25
|
0.43
|
0.57
|
0.43
|
Source: Authors' elaboration with data from field and Equation 5 (Kozak, 2020)
Laboratory Parameters
After the collection, the samples were stored in glass amber flasks and then packed in thermal ice boxes until they reached the laboratory. The analysis of the DOC involved making use of approximately 50 mL of the samples filtered in cellulose acetate membrane with a porosity of 0.45 µm, and previously washed in ultra-pure water. Following this, it was transferred to an amber flask (calcined to 550°C) and acidified with 0.5% of the volume of PA sulphuric acid. Before the analyses, the acidified samples were purged with nitrogen gas to eliminate the dissolved inorganic carbon in the sample and thus avoid analytical errors. The DOC was measured in TOC V-CPH, Shimadzu brand equipment, through the high temperature combustion method (5310 B of Standard Methods) (APHA, 2005).
In the case of the analysis of UV-visible absorbance. about 4 mL of the filtered sample (but not acidified) was analyzed in a Spectrophotometer for Varian Absorbance (Cary 50 Conc model), in a quartz cell with 1 cm of an optical path. The reading of the absorbance spectrum was carried out between 200 and 600 nm with an interval of 1.0 nm and a scanning rate of 4800 nm/min. Ultrapure water was used like blank. The SUVA254 Index was obtained to standardize the absorbance in 254 nm by the DOC and the correction by the optical path (l in m), having as the unit L/(mg m) (Westerhoff & Anning, 2000), as expressed in Equation 6.
The values of SUVA254 were thus used for the separation of the fractions of organic matter and Equation 3 for calculating the aromaticity of the samples and following this, Equation 4 to obtain the concentration of refractory organic matter (ROM). The difference between the concentration of ROM and DOC, made it possible to obtain the concentration of labile organic matter (LOM). After this, the loads of ROM and LOM were obtained, based on the flows obtained at the time of the collection. Table 5 and Table 6 show the data referring to the C1 to C5 campaigns.
Table 5
Organic matter fractions at collection points
Campaign
|
Point
|
DOC
(mg/L)
|
Absorbance
254 nm
|
SUVA254
(L/(mg m))
|
%
aromaticity
|
%
ROM
|
%
LOM
|
C1
|
P1
|
1,79
|
0,026410
|
1,47
|
13,24%
|
40,96%
|
59,04%
|
P2
|
1,91
|
0,024093
|
1,26
|
11,87%
|
36,74%
|
63,26%
|
C2
|
P1
|
1,62
|
0,028361
|
1,75
|
15,04%
|
46,55%
|
53,45%
|
P2
|
1,61
|
0,044374
|
2,76
|
21,60%
|
66,83%
|
33,17%
|
C3
|
P1
|
5,37
|
0,077288
|
1,44
|
13,01%
|
40,27%
|
59,73%
|
P2
|
4,765
|
0,064227
|
1,35
|
12,42%
|
38,42%
|
61,58%
|
C4
|
P1
|
4,258
|
0,110656
|
2,60
|
20,57%
|
63,66%
|
36,34%
|
P2
|
4,106
|
0,126874
|
3,09
|
23,78%
|
73,57%
|
26,43%
|
C5
|
P1
|
3,247
|
0,066275
|
2,04
|
16,94%
|
52,41%
|
47,59%
|
P2
|
3,601
|
0,041710
|
1,16
|
11,18%
|
34,60%
|
65,40%
|
Table 6
Refractory and labile organic matter loads
Campaign
|
Point
|
DOC (mg/L)
|
% ROM
|
% LOM
|
ROM (mg/L)
|
LOM (mg/L)
|
Flow (m3/s)
|
Load ROM
(kg/day)
|
Load LOM
(kg/day)
|
C1
|
P1
|
1.79
|
40.96%
|
59.04%
|
0.73
|
1.06
|
0.4113
|
26
|
38
|
P2
|
1.91
|
36.74%
|
63.26%
|
0.70
|
1.21
|
0.5578
|
34
|
58
|
C2
|
P1
|
1.62
|
46.55%
|
53.45%
|
0.75
|
0.87
|
0.1874
|
12
|
14
|
P2
|
1.61
|
66.83%
|
33.17%
|
1.08
|
0.53
|
0.2541
|
24
|
12
|
C3
|
P1
|
5.37
|
40.27%
|
59.73%
|
2.16
|
3.21
|
0.3136
|
59
|
87
|
P2
|
4.765
|
38.42%
|
61.58%
|
1.83
|
2.93
|
0.4253
|
67
|
108
|
C4
|
P1
|
4.258
|
63.66%
|
36.34%
|
2.71
|
1.55
|
0.4223
|
99
|
56
|
P2
|
4.106
|
73.57%
|
26.43%
|
3.02
|
1.09
|
0.5727
|
149
|
54
|
C5
|
P1
|
3.247
|
52.41%
|
47.59%
|
1.70
|
1.55
|
0.3136
|
46
|
42
|
P2
|
3.601
|
34.60%
|
65.40%
|
1.25
|
2.36
|
0.4253
|
46
|
87
|
Satellite images
The land cover can be obtained through the classification of satellite images by means of GIS tools. Thus, when correlating the data on the organic matter obtained in the collections, satellite images were selected on the dates closest to the collections conducted. The satellite chosen was Sentinel 2 which has free images and a high spectral resolution since the bands in the visible and ultraviolet range were close to 10 m (bands 2 – blue, 3 – green, 4 – red, 8 – close to infrared), and 20 m for the red edge bands (5, 6, 7 and 8A bands) and short waves (11 and 12 bands) (USGS, 2021).
The study watershed is included in the T22JFT scene of the Sentinel-2 series. The data from the images were chosen (provided the images were available with few clouds) from the site of the United States Geological Service (USGS), in periods close to the time when the collections were conducted. After the analysis of the current images, the data obtained were 05/15, 07/09 and 09/07 in 2018 and 02/09 and 06/09 in 2019. The shape of the Barigüi River basin was acquired at the IAT (Water and Land Institute) site (IAT, 2020). However, the sub-basins contributed by the points in the analysis, were obtained by means of the contour lines available in the AMBDATA program (INPE, 2021) for the region in question.
The projection system used for the images was the same shape as the basin that was made available – SAD 69 UTM 22S. The bands were stacked to make it possible to proceed with the classification of the soil cover and following this, the cutting of the sub-basin was conducted in the shape of the basin formed by the shape of the P1 and P2 points. These stages were prepared in QGIS free software. Figure 6 presents the images used in the land cover classification.
The classes of coverage were determined by analyzing a file with stacked bands in MultiSpec software by employing different combinations of bands available. Each set of combinations highlights certain features of the classes. Together with this, images from Google Earth Pro and Street View were looked at so that the visualization of a better coverage resolution could be obtained in their analysis and the class of each sample that belonged to it could be determined. The choice of samples took account of the fact that a uniform analysis would be required for all the types of coverage found in study area. Thus, the classification of the images was supervised, and the following classes of coverage were identified: Forest (fo), Grass (gr), Agriculture (ag), Bared Land (bl), Urban Area (ua) and Water (wa). Figure 7 shows the classified images.
After this supervised classification, images were created with a much higher number of features including tiny areas that were not needed for the designed model (for example of a pixel, or 100 m2). Hence, the features with an area less than a 0,026 km2 (which is equivalent to 0.05% of the study basin) were combined with the adjoining features to make it easier to conduct the subsequent data processing of the existing coverage. At the same time, any corrections regarded as necessary, were automatically made to the classified features. This process was aided by comparing the features obtained with the images of Google Earth for the next dates available. The final classification of the soil cover for the area under study can be found in Figure 8.
The LUPC 1.0 program
The model was put into effect by creating a Python language code (shown in the Supplementary Material). The model was then employed for the data described earlier to ensure its applicability, as well as allowing the PCI parameters to be calculated for the several types of land cover analyzed and the bandwidth b regarding each collection.
When determining the PCI and b values, sets of PCI values were applied to each type of landcover with a pre-established range of values (each PCI ranging from 1 to 10, for example). It also involved applying Equation 1 with the resulting value equal to the load at the monitoring point of the river and the Gaussian (Kernel) function (Equation 2) to determine the weight of each feature. The value of b was found by applying the bisection method to determine the roots of an equation. The set of values that obtains the lowest error show the PCI and b values that are most suitable for this session.
The calculation was made for a load of 102 g/day because the program only uses whole numbers of PCI with a view to improving the precision of the values obtained. However, the load is displayed in terms of kg/day and PCI in kg/(day km2), which are the values calculated for the PCI to one decimal place.
LUCP Model Validation
At the end of the calculation for the first four sessions, a set of PCI values and b was obtained for each collection, which represent the four seasons of the year. The validation of the model with data from the C5 session made use of both the PCI and b values obtained for the same season (autumn), as well as the average of the values obtained in the four seasons for verification. At the same time, the weights for each feature were calculated based on the average bandwidth b that was found, and the loads produced by ROM and LOM were calculated. They were then added to the product of each load through its equivalent weight to obtain the load values of ROM and LOM for the P1 and P2 points. These values were compared with the real-world values of the load available, and the errors were calculated to evaluate the applicability of the model on screen.