6.1. The power ratio of the two sources
Some array-analysis methods, such as MUSIC and CLEAN beamforming, are expected to resolve multiple sources. However, even in the analyses of the SI-array correlation, the array analyses identified only the ISC fumarole signals. Namely, the analyses failed to distinguish the ISC and IWC fumaroles simultaneously. One of the reasons for this failure should be the significantly larger power of the ISC fumarole signal than the IWC’s. To know the individual source contributions to the observed waveforms, we estimate their powers, assuming simple monopole sources (Woulff and McGetchin 1976).
The monopole source amplitudes can be calculated from the peak values of the cross-correlation functions and the source-receiver distances. We model the waveforms \({F}^{a}\left(t\right)\) at the array and \({F}^{b}\left(t\right)\) at the SI microphone as follows:
$${F}^{a}\left(t\right)=\frac{{S}_{1}\left(t-{\tau }_{1}^{a}\right)}{\left|{{y}}_{1}-{{x}}^{a}\right|}+\frac{{S}_{2}\left(t-{\tau }_{2}^{a}\right)}{\left|{{y}}_{2}-{{x}}^{a}\right|}+{N}^{a}\left(t\right), \left(1\right)$$
$${F}^{b}\left(t\right)=\frac{{S}_{1}\left(t-{\tau }_{1}^{b}\right)}{\left|{{y}}_{1}-{{x}}^{b}\right|}+\frac{{S}_{2}\left(t-{\tau }_{2}^{b}\right)}{\left|{{y}}_{2}-{{x}}^{b}\right|}+{N}^{b}\left(t\right), \left(2\right)$$
where \({{x}}^{n}\) and \({N}^{n}\left(t\right)\) indicate the location and noise at the station \(n\) (\(a\) or \(b\)), \({S}_{m}\left(t\right)\) and \({{y}}_{m}\) indicate the source time function and location of the source \(m\) (1 or 2), and \({\tau }_{m}^{n}\) indicates the travel time from the source \(m\) to the station \(n\). For the convenience, we define \(\varDelta {\tau }_{m}={\tau }_{m}^{b}-{\tau }_{m}^{a}\) and \(\varDelta {\tau }^{n}={\tau }_{2}^{n}-{\tau }_{1}^{n}\). We assume that the correlations between the two sources, source and noise, and noise at the two stations are negligible, namely, \(E\left[{S}_{1}\left(t\right){S}_{2}\left(t\right)\right]=E\left[{S}_{m}\left(t\right){N}^{n}\left(t\right)\right]=E\left[{N}^{a}\left(t\right){N}^{b}\left(t\right)\right]=0\), where \(E\left[\bullet \right]\) is the ensemble average. In addition, since \(\varDelta {\tau }_{1}\) and \(\varDelta {\tau }_{2}\) are significantly different (the locations of the colored dotted lines and colored dashed lines in Fig. 4b, respectively), the correlation between \({S}_{m}\left(t-\varDelta {\tau }_{1}\right)\) and \({S}_{m}\left(t-\varDelta {\tau }_{2}\right)\) is assumed to be negligible compared to the power of sources, namely, \(E\left[{S}_{1}\left(t\right){S}_{1}\left(t\right)\right]\gg E\left[{S}_{2}\left(t-\varDelta {\tau }_{1}\right){S}_{2}\left(t-\varDelta {\tau }_{2}\right)\right]\) and \(E\left[{S}_{2}\left(t\right){S}_{2}\left(t\right)\right]\gg E\left[{S}_{1}\left(t-\varDelta {\tau }_{1}\right){S}_{1}\left(t-\varDelta {\tau }_{2}\right)\right]\). Considering the general relation that \(E\left[{f}_{1}\left(t\right){f}_{2}\left(t+T\right)\right]=E\left[{f}_{1}\left(t-T\right){f}_{2}\left(t\right)\right]\) for arbitrary functions \({f}_{1}\) and \({f}_{2}\), we get the relation that \(E\left[{S}_{m}\left(t-\varDelta {\tau }_{1}\right){S}_{m}\left(t-\varDelta {\tau }_{2}\right)\right]=E\left[{S}_{m}\left(t-\varDelta {\tau }^{a}\right){S}_{m}\left(t-\varDelta {\tau }^{b}\right)\right]\). Then, we can calculate the power of the sources \(P{S}_{1}\) and \(P{S}_{2}\) as follows:
$${PS}_{m}=E\left[{S}_{m}\left(t\right){S}_{m}\left(t\right)\right] \simeq \left|{{y}}_{m}-{{x}}_{ }^{a}\right| \left|{{y}}_{m}-{{x}}_{ }^{b}\right| E\left[{F}_{ }^{a}\left(t+{\tau }_{m}^{a}\right){F}_{ }^{b}\left(t+{\tau }_{m}^{b}\right)\right], \left(3\right)$$
where \(E\left[{F}^{a}\left(t+{\tau }_{m}^{a}\right){F}^{b}\left(t+{\tau }_{m}^{b}\right)\right]\) could be evaluated by the peak value of the cross-correlation (Fig. 4b). Once \(P{S}_{1}\) and \(P{S}_{2}\) are obtained, we can calculate the noise power at the \(n\)-th station \(P{N}^{n}\) as follows:
$${PN}_{ }^{n}=E\left[{F}_{ }^{n}\left(t\right){F}_{ }^{n}\left(t\right)\right]-\frac{{PS}_{1}}{{\left|{{y}}_{1}-{{x}}_{ }^{n}\right|}^{2}}-\frac{{PS}_{2}}{{\left|{{y}}_{2}-{{x}}_{ }^{n}\right|}^{2}} . \left(4\right)$$
Since the propagation effect or the directionality of the source might lose the correlation values, this method using the cross-correlation peak values might underestimate the source power. Nevertheless, it should give us approximate powers of the sources.
The results are shown in Table 1. The source amplitude of the ISC fumarole is estimated to be larger than the IWC fumarole by one or two orders of magnitude. Although the SI microphone was near the IWC fumarole (~ 15 m distance), the ISC fumarole’s power was comparable to the IWC fumarole. It emphasizes that it is crucial to identify the multiple sources, even when we put a microphone very close to a target source.
6.2. Correlation signal
The ambient noise around two stations is known to generate peaks in the stacked correlation function, which represents Green’s functions between the stations (Shapiro and Campillo 2004; Ortiz et al. 2021). In other words, the stacked correlation function of a station pair can hold two minor but significant peaks without any particular signals. Since the IWC peak in SI-array CC is pretty small, it is not easy to distinguish whether it corresponds to Green’s function or the IWC fumarole signal. In the former case, there should be another minor peak around + 0.8 sec in the SI-array CC because two correlation signals should emerge symmetrically with respect to the zero lag time. However, the correlation peak of the ISC fumarole signal may mask such a peak (Fig. 3b2). Besides, since the IWC fumarole and the SI microphone are close, it is not easy to identify which is the source by the array analysis (Fig. 3d1).
To solve this problem, we conducted some synthetic tests. The locations of stations are simplified that all the stations and fumaroles are on a single plane (Fig. 4a). The waveform, \({F}^{n}\left(t\right)\), recorded at \(n\)-th microphone at \({{x}}^{n}\) consists of four types of waves:
$${F}_{ }^{n}\left(t\right)={A}_{1}\frac{{s}_{1}\left(t-{\tau }_{1}^{n}\right)}{\left|{{y}}_{1}-{{x}}_{ }^{n}\right|}+{A}_{2}\frac{{s}_{2}\left(t-{\tau }_{2}^{n}\right)}{\left|{{y}}_{2}-{{x}}_{ }^{n}\right|}+{A}_{\text{n}\text{o}\text{i}\text{s}\text{e}}^{n}{s}_{\text{n}\text{o}\text{i}\text{s}\text{e}}^{n}\left(t\right)+\sum _{k}^{ }{A}_{k}\frac{{s}_{k}\left(t-{\tau }_{k}^{n}\right)}{\left|{{y}}_{k}-{{x}}_{ }^{n}\right|}, \left(5\right)$$
where \({s}_{1}\) and \({s}_{2}\) are the normalized synthetic waveforms from the ISC and IWC fumaroles, \({s}_{\text{n}\text{o}\text{i}\text{s}\text{e}}^{n}\) is the uncorrelated noise at \(n\)-th microphone, and \({s}_{k}\) is the normalized noise from the \(k\)-th of the numerous background acoustic sources. Their amplitudes are denoted by \({A}_{1}\), \({A}_{2}\), \({A}_{\text{n}\text{o}\text{i}\text{s}\text{e}}^{n}\), and \({A}_{k}\), respectively. The source-to-station propagation times of the acoustic waves are \({\tau }_{1}^{n}\), \({\tau }_{2}^{n}\), and \({\tau }_{k}^{n}\), and the source locations are \({y}_{1}\), \({y}_{2}\), and \({y}_{k}\), respectively. We put 7,000–8,000 ambient sources within 2 km from the stations but excluded 400 m from the mid-point to avoid ambient sources having a larger power than the fumarolic signals (Fig. 4a). I mainly tested the ambient noise sources in the same plane as the microphones because they made more distinct correlation signals than those out of the plane. Gaussian noise in 5–80 Hz is used as the synthetic waveforms. The correlation coefficients are calculated in 1 min time windows with 60 min stacking.
Several conditions are tested (Table 2) and compared for both SI-array CC and array CC between the observation (Figs. 4b and 4c) and synthetic tests (Fig. 4b1–b5 and 4c1–c5). The comparison results are summarized in Table 2. As was expected, the ambient noise generated the correlation signals without particular sources (Test 1: Fig. 4b1 and 4c1). The peak values of the correlation signals in the SI-array CC were as small as the minor peak of the observation (Fig. 4c and 4c). In the test using a synthetic ISC signal with an adequate amplitude and ambient noise, the peak values of the array CC in the synthetic test were smaller than the observation, though the peak values of the SI-array CC were similar to the observation (Test 2: Fig. 4b2 and 4c2). A larger synthetic ISC signal could reproduce the observed peak values of the array CC. However, it also made a too large ISC peak value of the SI-array CC (Test 3: Fig. 4b3 and 4c3) while weakening the peak of the correlation signal at -0.8 s. Adding uncorrelated noise on the SI microphone could suppress the SI-array CC but erased the correlation signal at -0.8 s (Test 4: Fig. 4b4 and 4c4). The existence of the IWC fumarole signal was necessary to explain all the peak values of the correlation coefficients (Test 5: Fig. 4b5 and 4c5, Table 2), at least in these synthetic tests. From above, we concluded that the IWC fumarole’s signal formed the minor peak in the SI-array CC.
6.3. Power spectrum
At the array, the contribution of the ISC fumarole is significant (Table 1). Thus, the power spectrum of the array elements (the green line in Fig. 3e) is expected to represent the infrasound signal character from the ISC fumarole. The distinct feature is the persistent power in ~ 10 Hz and ~ 35 Hz. Although both correlation functions corresponding to the IWC and ISC signals have significant power in ~ 35 Hz (the red line in Fig. 3e1 and the blue line in Fig. 3e2, respectively), the numerical tests using synthetic waveforms confirmed that the power in ~ 35 Hz should be from the ISC signal (Supporting Information S6).
The mechanism generating the power in ~ 10 Hz or ~ 35 Hz would be important for the investigation of the fumarole. In volcanoes, the harmonic monochromatic sources are often observed and modeled (e.g., Fee et al. 2010; Ripepe et al. 2010; Goto et al. 2011; Yokoo et al. 2019). The topographical resonance under the ground is often discussed, such as the Helmholtz resonance (Fee et al. 2010; Goto et al. 2011) or the resonance of the conduit (Watson et al. 2019). Similarly, the power of the ISC fumarole in ~ 10 Hz or ~ 35 Hz might be the resonance frequency that relates to the topography around the source. On the other hand, the ground topography is also affect the power spectral shape significantly (Yamakawa 2022). There is a small wall behind the ISC fumarole (Fig. 2c), and it might have an influence. Further investigation is needed to understand the fumarolic infrasound generation mechanisms.
6.4. The VSA array and SI microphone
In the estimation of IWC fumarole, the SI microphone close to the source played an essential role. The SI-Array CC revealed the contribution of the IWC fumarole, though the array CC failed to identify it. From the peak of the SI-Array CC, the source regions were constrained. One might question whether the VSA array was necessary or not. Here we summarize the contribution of the array. Although the peak of the IWC fumarole in a single pair of the SI-array CC was recognizable by the stacking (Fig. S5), such a peak as small as 0.01 would be difficult to conclude as a signal without the confirmation by the array analysis using all pairs of the SI-array CC. Namely, the VSA array confirmed the very small peak of the SI-array CC as a signal. Then, the array CC helped distinguish the signal from the correlation peak made of ambient noise through the synthetic tests (Section 6.2). The identification of signals from noise is essential in monitoring. In addition, the combination of the source hyperbola obtained by the two stations and the back azimuth range obtained from the array analysis on the SI-array CC constrained the source location. From above, we conclude that the VSA array could significantly improve the monitoring network.