Fault diagnosis of belt conveyors is crucial for coal mine production, but audio-based fault diagnosis in underground coal mines remains challenging due to the strong noise environment. To address this problem, a method for audio fault diagnosis of belt conveyors based on improved variational modal decomposition and improved adaptive noise reduction convolutional networks in a strong noise environment is proposed. Firstly, the improved beluga whale optimization is designed by introducing the non-linear balance factor and non-linear probability and combining them with the proposed cyclical shock factor to optimize the variational modal decomposition parameters to achieve noise reduction and signal reconstruction. Secondly, an improved adaptive noise reduction convolutional network is developed using an adaptive threshold activation function and an improved loss function to enhance noise robustness and fault diagnosis accuracy. Finally, the proposed method's effectiveness is evaluated in low and strong noise environments, with experimental results demonstrating superior fault diagnosis performance. In low noise environments, the fault diagnosis accuracy is 98.61%, and in strong noise environments, it is 98.96%, outperforming existing fault diagnosis methods.