Denoising of the backscattered signal is crucial for suppressing the random and unwanted changes in the Differential Absorption LIDAR signal profile and thereby improving the worth of the DIAL signal to a achieve longer detection distance. The Lidar signal for far-off distances has lower signal strength and lower SNR, which poses a limitation on the detection of chemicals at longer distances; thereby introducing errors and false values of estimated concentration. Therefore, the noise suppression in the signal is crucial to reduce these errors and to increase the detection range.
In the reported work following three signal processing techniques have been considered for denoising LIDAR signal,
-
Spatial average technique
-
Pulse topulse temporal average technique
-
Empirical Mode Decomposition (EMD) technique
Spatial averaging of Lidar signal involves the integration and averaging of certain number of neighboring LIDAR data points and the resultant value is overwritten in the current reference data point. The pulse-to-pulse temporal average technique involves the averaging of number backscatter profiles corresponding to each transmitted laser pulse. The temporary variations like gaussian white noise gets cancelled and averaged out after the integration of the number of backscatter profiles. Empirical Mode Decomposition (EMD) technique involves the splitting of the original signal into numerous components which are called Intrinsic Mode Functions (IMF’s). IMFs are estimated in the signal in the order of decreasing frequency of components. The signal's different frequency components can be selectively filtered out and the signal can be de-noised by analyzing the noise component of the signal. The methodology of implementation of these techniques is described in following sections.
3.1 Spatial Average Technique
The spatial average technique [13] is one of the techniques used in the Lidar signal processing field. In order to calculate the average value using the approach, a number of consecutive data points from a lidar profile are combined and averaged. The number of data points can be variable and can be decided based on the user requirement and range resolution requirement. Mathematically, the operation can be expressed as Eq. 3
$$y\left(i\right)=\frac{1}{N}\sum _{j=0}^{N-1}x\left(i-j\right)$$
3
where x is the array of digital values representing the lidar signal data at different range bins. N denotes the size of the spatial averaging window. The processed resultant value at a range bin is obtained by averaging N data points in a shifting window method.
The spatial averaging technique replaces each data value at a range bin with the averaged surrounding value in order to smooth out the sudden fast changes in the lidar signal profile. The procedure lasts for all the data values at different range bins. As the spatial averaging technique only performs the smoothening operation on the signal; it is not able to reject the non-useful transient values and those negative voltage values caused by the noise. Another disadvantage is that as the higher number of pulses are used for the spatial averaging, the necessary useful signal amplitude is also suppressed along with the reduction of noise levels. Although the SNR may increase in a good manner, but the atmospheric characteristics may get camouflaged when moving average is executed. This is a drawback of the spatial averaging technique due to which this technique can be used for only small number of averaging points.
3.2 Pulse to Pulse Temporal Average Technique
Pulse to Pulse temporal average technique [13] is the most broadly used technique for enhancement of SNR of the Lidar signals. This technique involves recording in a memory the lidar return signal data values at every range bin associated with a transmitted laser pulse. Similar to this, sets of data corresponding to the required pulses to be summed are acquired and recorded in other memory locations. The processed resultant value at a range bin is obtained by adding and averaging each individual data bin value at all pulses. Eq. 4 illustrates the mathematical expression or this methodology.
$$\text{y}\left(j\right)=\frac{1}{\text{K}}{\sum }_{k=0}^{K}{\text{x}}_{\text{k}}\left(\text{j}\right)$$
4
Where x1(j) is the Lidar signal value for first backscatter profile at jth range bin. For calculating the resultant value y(j) at jth bin, each pulse's associated Lidar data value at the jth bin is summed, and then the average is calculated.
The advantage of temporal averaging is that the signal values are not lost in the averaging process and range resolution is also not compromised. Disadvantages of temporal averaging include requirement of large integration time for low prf systems and change in atmospheric phenomena in between the averaging process during large integration times.
3.3 Empirical Mode Decomposition Technique
Empirical Mode Decomposition (EMD) technique was first described by Huang et. al. (1998) [14]. The analysis of signals that are non-stationary and nonlinear was suggested using this method. EMD is the signal data-adaptive method in which the base function is determined in real time from the signal itself. It is an iterative method in which the procedure is repeated until the convergence criteria are satisfied. The essence of the EMD technique lies in the fact that a non-stationary signal can be fragmented into a limited number of functions called Intrinsic Mode Functions (IMFs) and one can approximate the original signal by adding these IMFs as given in Eq. 5.
$$y[n]=\sum\limits_{{k=1}}^{Z} {{I_k}} [n]+r[n]$$
5
Where y[n] is discrete time signal of length N under measurement, \({I_k}\) is the kth IMF, Z is the total number of IMFs and r[n] is the residue at the end of the process.
The calculated function is considered as IMF if the following requirements are fulfilled [15]:
-
The total number of extreme locations (maxima and minima) and the total number of zero crossings of the signal in the time domain should be equal, or there should be a maximum difference of one in the number of positions.
-
The average of the envelope of all maxima points and envelope of all minima points should be zero.
The sifting process, an iterative procedure used to calculate IMFs and residue, is explained as follows.
1) Look for maxima and minima and their locations in time series discrete signal y[n].
2) Generate the maxima envelope Smax[n] by joining all the maxima points using some interpolation method. Repeat the similar procedure to generate minima envelope Smin[n]
3) Determine the average of maxima envelope and minima envelope as following
Smean1[n]= \(\frac{{{S_{\hbox{max} }}[n]+{S_{\hbox{min} }}[n]}}{2}\) (6)
4) Determine the difference d[n] = y[n]-S [n], checked for the criteria of being IMF. If it
a. satisfies both the conditions, then it is denoted as IMF (I1[n]). Further residue is calculated as r1[n] = y[n]-I1[n].
b. does not satisfy the IMF criteria, then consider d1[n] as signal and compute d11[n]
as
d11[n] = d1[n]-Smean11 [n] (7)
repeat the above procedure until the difference satisfies the condition of being IMF.
5) The procedure in steps 1 to 4 is repeated by considering residue as calculated in step 4a as the signal.
6) The finite numbers of IMFs are computed in this way and the procedure is terminated as per stopping criteria of EMD, which indicates that there are no more extrema (maxima/minima) or having one extremain the residue.
Every IMF calculated using the aforementioned method represents oscillations at a specific frequency. Following IMFs indicate lower frequency components in decreasing order of frequency, with the first IMF representing the greatest frequency oscillation present in the signal. Effectively, the residue at the end of iterations does not have any oscillations.