Seismic signal detection is a key technology for improving the efficiency of earthquake early warning systems. However, existing deep learning-based seismic signal detection models demand substantial computational resources, which limits their use in resource-constrained seismic monitoring environments. To address this challenge, we introduce ICAT-net, a seismic signal detection network that integrates Coordinate Attention modules with Transformer attention mechanisms. It aims to reduce computational resource consumption while maintaining or improving multitask performance on seismic waveform detection and phase picking. Specifically, ICAT-net employs a Downsampling module to reduce data dimensions, while the Coordinate Attention module controls the spatial relationships among features. Combined with the capacity of the Transformer to capture long-range dependencies, this significantly improves the accuracy of earthquake event detection and phase picking. Concatenation operations between the encoders and decoders allow the model to retain rich contextual information and gradually restore the spatial resolution of the signal during decoding. We trained ICAT-net on the global Stanford Earthquake Dataset (STEAD) and evaluated it with multiple performance metrics: precision, recall, F1-score, mean absolute error, floating-point operations, and number of model parameters. Extensive experiments demonstrate that ICAT-net produces more accurate results across a range of seismic scenarios, achieving higher detection accuracy at lower computational cost, which makes it a valuable tool for earthquake monitoring and disaster risk assessment.
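
To illustrate the detection and picking metrics named above, the following minimal Python sketch computes precision, recall, F1-score, and mean absolute error from per-trace phase pick times. The function names, the 0.5 s tolerance, and the matching convention (a pick outside the tolerance counts as both a false positive and a false negative) are assumptions made for illustration; they are not taken from the ICAT-net evaluation protocol.

```python
from typing import List, Optional, Tuple


def detection_scores(tp: int, fp: int, fn: int) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) from true/false positive and false negative counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def pick_scores(
    predicted: List[Optional[float]],
    reference: List[Optional[float]],
    tolerance: float = 0.5,  # seconds; hypothetical value chosen for this sketch
) -> Tuple[int, int, int, float]:
    """Compare one predicted pick time per trace with one reference arrival time.

    None means "no pick" (predicted) or "no arrival" (reference).
    Returns (tp, fp, fn, mae), where mae is averaged over matched picks only.
    """
    tp = fp = fn = 0
    errors = []
    for pred, ref in zip(predicted, reference):
        if pred is None and ref is None:
            continue  # true negative: not needed for these metrics
        if pred is None:
            fn += 1  # arrival present but not picked
        elif ref is None:
            fp += 1  # pick on a trace with no arrival
        elif abs(pred - ref) <= tolerance:
            tp += 1
            errors.append(abs(pred - ref))
        else:
            # Assumed convention: a pick outside the tolerance is a spurious
            # pick (fp) and the true arrival is counted as missed (fn).
            fp += 1
            fn += 1
    mae = sum(errors) / len(errors) if errors else 0.0
    return tp, fp, fn, mae


if __name__ == "__main__":
    # Made-up pick times in seconds, for demonstration only.
    predicted = [10.1, None, 5.0, 7.9]
    reference = [10.0, 3.0, None, 9.0]
    tp, fp, fn, mae = pick_scores(predicted, reference)
    precision, recall, f1 = detection_scores(tp, fp, fn)
    print(f"tp={tp} fp={fp} fn={fn} mae={mae:.3f} s")
    print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```

The efficiency metrics (floating-point operations and parameter count) depend on the network definition itself and are therefore not covered by this sketch.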