3.1 Overview
This paper presents an Intelligent Mobile Data Collection (MDC) framework tailored for Internet of Things (IoT) based sensor networks. Central to this framework is the use of Federated Reinforcement Learning (FRL) to learn and adapt to data generation patterns, including the time intervals between transmissions, the number of packets generated, and the packet types. Within the FRL paradigm, each IoT sensor or device autonomously trains a local model using Reinforcement Learning (RL), defined in terms of states, actions, and rewards. This localized learning enables sensors to efficiently capture and adapt to the characteristics of their operational environment. Following the training phase, IoT sensors transmit their locally trained model parameters to a central gateway for aggregation. At the gateway, these individual parameters are combined into a comprehensive global model that leverages collective insights from the entire sensor network. The aggregated global model is then disseminated back to the IoT sensors, empowering them with refined decision-making capabilities informed by network-wide data.
Building upon the insights gleaned from the global model, the framework dynamically adjusts key parameters such as Time Division Multiple Access (TDMA) slots, sleep durations for sensors, and the visiting schedule of the MDC. These adjustments are contingent upon the categorization of sensor clusters, ensuring optimized resource allocation and data collection efficiency tailored to the specific needs and dynamics of each cluster. By integrating FRL-based learning, distributed RL techniques, and adaptive scheduling mechanisms, the proposed framework offers a comprehensive solution to the challenges inherent in IoT-based sensor networks. This approach not only enhances data collection efficiency but also fosters adaptability and resilience in the face of evolving network conditions and requirements. Ultimately, the framework holds promise for facilitating more intelligent and responsive IoT deployments, with implications spanning various domains, from smart cities to industrial automation.
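To make the cluster-driven scheduling concrete, the following is a minimal sketch of how cluster categories could be mapped to TDMA slots, sleep durations, and MDC visiting intervals. The category names come from the framework; the function name and all numeric values are illustrative assumptions, not values taken from this paper.

```python
# Hypothetical mapping from a cluster's learned category to scheduling
# parameters.  The numeric values (slot counts, sleep and visit times
# in seconds) are illustrative assumptions only.
SCHEDULE = {
    "Frequent":      {"tdma_slots": 8, "sleep_s": 1,   "mdc_visit_s": 60},
    "Less Frequent": {"tdma_slots": 4, "sleep_s": 5,   "mdc_visit_s": 300},
    "Rare":          {"tdma_slots": 2, "sleep_s": 30,  "mdc_visit_s": 1800},
    "Very Rare":     {"tdma_slots": 1, "sleep_s": 120, "mdc_visit_s": 7200},
}

def schedule_for(category: str) -> dict:
    """Return the scheduling parameters assigned to a cluster category."""
    return SCHEDULE[category]
```

In this sketch, busier clusters receive more TDMA slots, shorter sleep periods, and more frequent MDC visits, while quieter clusters conserve energy with longer sleep and sparser visits.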
3.2 System Model
Figure 1 illustrates the system model of the proposed framework, delineating the interplay between IoT sensors, gateways, and the Mobile Data Collector (MDC). Central to this model is clustering: IoT devices and sensors are organized into cohesive groups based on their geographical proximity or regional affiliation. Each cluster is headed by a designated gateway node responsible for collecting, aggregating, and transmitting data from its members to the MDC. Cluster formation is driven by geographical location, ensuring that each cluster encapsulates sensors in close physical proximity to optimize data collection efficiency. To facilitate this, each cluster is assigned a unique identifier (ID), enabling seamless communication and management within the network architecture. Crucially, the selection of gateway nodes within each cluster is guided by two factors: residual energy and node degree. Residual energy serves as a vital metric for gauging the operational capacity and longevity of potential gateway candidates, ensuring sustainable data transmission, while node degree, reflecting the connectivity and centrality of each sensor within the cluster, ensures robust network coverage and resilience. By integrating geographical clustering, cluster identification, and gateway selection based on energy and connectivity metrics, the framework lays the groundwork for a resilient and efficient data collection ecosystem. This holistic approach optimizes resource utilization and enhances the scalability and adaptability of the system to dynamic environmental conditions. Fig. 1 thus offers a visual representation of how these components interact to drive the efficacy and functionality of the proposed architecture.
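The gateway selection described above can be sketched as a simple scoring rule that combines normalized residual energy and node degree. The weighting parameter `alpha` and its default value are illustrative assumptions; the paper specifies the two criteria but not a particular combining formula.

```python
def select_gateway(nodes, alpha=0.6):
    """Pick the cluster member with the best weighted combination of
    normalized residual energy and node degree.

    `nodes` is a list of dicts with keys 'id', 'energy', 'degree'.
    `alpha` weights energy against connectivity; 0.6 is an
    illustrative choice, not a value from the paper."""
    max_e = max(n["energy"] for n in nodes) or 1  # avoid division by zero
    max_d = max(n["degree"] for n in nodes) or 1

    def score(n):
        return alpha * n["energy"] / max_e + (1 - alpha) * n["degree"] / max_d

    return max(nodes, key=score)["id"]
```

A node with both high remaining energy and many neighbors scores highest, matching the stated goals of sustainable transmission and robust coverage.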
3.3 Basics of FL
A consortium of users leveraging IoT devices such as smartphones, laptops, or tablets collaboratively implements Federated Learning (FL) algorithms to execute IoT tasks. FL stands as a cornerstone in the evolution of next-generation IoT networks, where its significance is paramount for unlocking the full potential of intelligence at the network edge. This is especially crucial as a centralized Base Station (BS) often lacks the capability to gather all data generated by distributed IoT devices for training Artificial Intelligence/Machine Learning (AI/ML) models. FL revolutionizes the conventional approach by enabling IoT users and the BS to jointly train a global model while preserving raw data privacy at the users' devices. Through FL, each IoT user actively contributes to the training process by leveraging their individual datasets to train a localized ML model. Subsequently, these locally trained models are uploaded to the BS, which orchestrates the aggregation process to construct a comprehensive global model. This collaborative FL process ensures that the intelligence gleaned from IoT data remains decentralized and distributed, reflecting the diverse contexts and environments in which IoT devices operate. By allowing users to retain control over their data while still contributing to the collective intelligence, FL strikes a delicate balance between privacy preservation and model accuracy. Furthermore, FL facilitates continual model refinement and adaptation to evolving data distributions without necessitating data centralization. This distributed approach not only mitigates privacy concerns but also enhances scalability and robustness, as the global model reflects the collective insights from a diverse array of IoT devices. In essence, FL empowers IoT ecosystems to harness the collective intelligence of distributed devices while respecting data privacy and security. 
As IoT applications continue to proliferate across various domains, FL emerges as a pivotal enabler for unlocking the full potential of intelligence at the network edge, fostering innovation and efficiency in IoT-driven endeavors.
A standard Federated Learning (FL) system comprises an FL server (S) and a cohort of participating clients C, where each client c ∈ C possesses a private dataset dc. Each client leverages its local dataset to train a local model (mc) and subsequently transmits the local model parameters as an update to the FL server (S). The FL server (S) then aggregates all received local models to derive the global model (MG) according to a specified aggregation protocol. It is important to note that this approach differs from conventional cloud-centric training, where the model is trained by aggregating and processing data centrally from all clients.
The training process of FL, as illustrated in Fig. 1, entails the following three steps:
Step 1 (Initialization and Model Distribution): During the initial round (Round 0), the FL server (S) defines the training task, specifying the target model, data requirements, and hyperparameters (e.g., batch size). Subsequently, it broadcasts the initial global model and task settings to all participating clients.
Step 2 (Local Model Training and Update): In subsequent rounds (Round t), each client (c) updates its local model parameters based on the global model received from the FL server (S). The objective is to optimize local parameters to minimize the loss function associated with the training data. Upon completion, the updated local parameters are uploaded to the FL server (S).
Step 3 (Global Model Aggregation and Update): In the same round (Round t), the FL server (S) aggregates all received local models with the aim of minimizing the global loss function. The aggregation process combines insights from all participating clients to refine the global model (MG).
The FL server (S) then broadcasts the updated global model to all clients for training in the subsequent round (t + 1). This iterative process continues until convergence of the global model (MG) or until a desired level of accuracy is achieved.
In summary, the FL framework facilitates collaborative model training across distributed clients while preserving data privacy. By leveraging local datasets and iterative model updates, FL enables the development of robust and accurate global models without centralized data aggregation.
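The aggregation in Step 3 is commonly realized as a weighted average of the clients' parameters (FedAvg-style). As a minimal sketch under that assumption, with each local model represented as a flat parameter vector:

```python
import numpy as np

def fedavg(local_models, sizes):
    """Weighted average of local model parameter vectors (FedAvg-style).

    `local_models` is a list of 1-D numpy arrays w_c, one per client;
    `sizes` holds the number of local training samples |d_c| used as
    aggregation weights, so larger datasets contribute more."""
    total = sum(sizes)
    return sum((n / total) * w for w, n in zip(local_models, sizes))
```

The server would broadcast the returned global vector back to the clients for round t + 1, repeating until convergence.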
3.4 Deep Reinforcement Learning (DRL)
Reinforcement Learning (RL) serves as a powerful mathematical framework that empowers computing devices to autonomously learn and make decisions based on experiences garnered from interacting with their environment. At the heart of RL lies the concept of learning through interactions, where an agent navigates through a dynamic environment by selecting actions, observing outcomes, and receiving rewards in return. In RL, the agent's decision-making process revolves around selecting actions according to a predefined policy and executing them within the environment. Subsequently, the agent receives feedback in the form of rewards, which reflect the outcomes of its actions within the evolving environment. Through iterative cycles of action-selection, observation, and reward-feedback, the agent continually refines its policy to optimize its decision-making strategy and maximize cumulative rewards. The ultimate goal of the agent is to learn an optimal policy that guides it towards actions yielding the highest expected rewards. This entails discerning the most favorable sequence of actions based on the rewards provided by the environment. The methodology or algorithm employed by the agent to learn and update its policy varies depending on the specific RL method utilized. Deep Reinforcement Learning (DRL) represents a significant advancement in RL by integrating deep neural networks into the learning process. By leveraging the expressive power of deep learning architectures, DRL enables agents to learn complex decision-making strategies directly from raw sensory inputs. The DRL framework trains neural networks to map environmental states to optimal actions, leveraging rich representations of state-action spaces learned through layers of abstraction. In recent years, the field of DRL has witnessed an explosion of research activity, resulting in the development of a diverse array of algorithms and techniques. 
These advancements have led to significant breakthroughs in a wide range of domains, including robotics, gaming, finance, and healthcare, among others. In summary, RL and its variant, DRL, represent cutting-edge approaches to autonomous learning and decision-making in dynamic environments. By enabling agents to learn optimal strategies through interaction and experience, these frameworks hold immense potential for advancing the capabilities of intelligent systems across various domains.
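The iterative cycle of action, observation, and reward-feedback described above is captured most simply by the tabular Q-learning update, the classical precursor to DRL (where a neural network replaces the table). A minimal sketch, with illustrative learning-rate and discount values:

```python
import numpy as np

def q_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """One tabular Q-learning update:
        Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))
    `Q` is a (num_states, num_actions) array; `alpha` is the learning
    rate and `gamma` the discount factor (values here are illustrative)."""
    td_target = r + gamma * np.max(Q[s_next])   # best estimated future value
    Q[s, a] += alpha * (td_target - Q[s, a])    # move toward the TD target
    return Q
```

In DRL the same temporal-difference target is used, but as a loss for training a network that maps raw state inputs to action values.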
3.5 Federated Reinforcement Learning (FRL) Process
Federated Reinforcement Learning (FRL) represents a fusion of Federated Learning (FL) and Reinforcement Learning (RL) techniques, leveraging the strengths of both approaches [13]. FRL offers a unique advantage by enabling the aggregation of observations from diverse environments, enhancing learning compared to traditional Deep Reinforcement Learning (DRL) methods that rely solely on partial observations from a single environment. Integrating FL principles into RL frameworks empowers FRL to harness collective intelligence from distributed sources, facilitating more robust and comprehensive learning. By pooling observations from various environments, FRL can overcome the limitations of individual datasets and extract valuable insights from a broader spectrum of experiences. One notable advantage of FRL lies in its ability to outperform standard DRL approaches when confronted with scenarios characterized by partial observations: its capacity to integrate observations from multiple environments enables it to derive more accurate and generalized models, enhancing performance across a range of tasks and environments. In the context of IoT data collection, FRL emerges as a potent tool for training on and classifying data generation patterns among IoT devices. By leveraging insights from diverse clusters, FRL enables the classification of data generation patterns into categories such as Frequent, Less Frequent, Rare, and Very Rare. This categorization facilitates more nuanced and effective resource allocation strategies, tailored to the specific characteristics and requirements of each cluster. In summary, FRL bridges the gap between FL and RL techniques, offering enhanced learning capabilities by leveraging observations from multiple environments.
In the realm of IoT data collection, FRL holds promise for optimizing resource allocation, improving classification accuracy, and ultimately advancing the efficiency and effectiveness of data-driven decision-making processes.
Let K = {1, 2, ..., K} denote the set of participants who use IoT devices to collaboratively implement an FRL algorithm for performing an IoT task.
The key processes involved are as follows:
- In this framework, data generation patterns such as the time interval between transmissions, the number of packets generated, and the type of packets are learned using FRL.
- This learning involves two kinds of participants:
  - Data clients: the IoT devices.
  - Aggregation server: located at the base station (BS) or access point.
- FRL allows IoT users and the BS to train a shared global model while the raw data remain at the users' devices.
- Each IoT user k participates in training the shared model using its own dataset Dk, k ∈ K. Hereinafter, the FL model trained at the IoT device is called the local model wk.
- After local training, IoT users upload their local model updates to the BS, which aggregates them to build a shared model, called the global model wG.
- By relying on distributed data training at the IoT devices, the aggregation server at the BS can enrich the training performance without significantly compromising user data privacy.
- After learning, the clusters are classified into four categories:
  - Frequent,
  - Less Frequent,
  - Rare, and
  - Very Rare.
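As a minimal sketch, the mapping from a cluster's learned transmission pattern to one of the four categories might look like the following. The threshold values are purely illustrative assumptions; in the framework they would emerge from the FRL training rather than being fixed by hand.

```python
def classify_cluster(mean_interval_s: float) -> str:
    """Map a cluster's learned mean time between transmissions (seconds)
    to one of the four categories.  The thresholds below are
    illustrative assumptions, not values from the framework."""
    if mean_interval_s < 60:
        return "Frequent"
    if mean_interval_s < 600:
        return "Less Frequent"
    if mean_interval_s < 3600:
        return "Rare"
    return "Very Rare"
```

The resulting label is what drives the per-cluster TDMA, sleep, and MDC-visit adjustments described earlier.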
3.5.1 FRL Algorithm
The Federated Reinforcement Learning (FRL) system entails several pivotal steps to effectively train models and optimize performance. These steps are as follows:
1. **Initial Global Model Distribution**:
The gateway initiates the training process by disseminating the initial global model to all devices within the network. This serves as the foundation upon which local models will be built and refined.
2. **Local Model Training**:
Each device engages in training its own local model using locally available information, encompassing states, actions, and rewards.
- **States**:
To enable optimal decision-making, the state representation includes pertinent information tailored to the context. The initial value set incorporates cluster information, providing crucial insights into the network topology.
- **Actions**:
Devices are endowed with the ability to execute movements in four directions on the traffic map, namely left, right, down, and up. These actions facilitate dynamic navigation and adaptation to changing traffic conditions.
- **Rewards**:
Proper incentivization is essential for effective learning. Rewards are bestowed based on fluctuations in traffic volume: a positive reward is conferred upon traffic reduction, while an increase in traffic warrants a negative reward. Additionally, rewards are allocated for the efficient utilization of network service capability, promoting optimal resource management.
3. **Transmission of Local Model Parameters**:
Following local model training, devices transmit their respective local model parameters (W1, ..., Wn) back to the gateway. This exchange facilitates the aggregation of individual insights into a unified global model.
4. **Aggregation of Model Parameters**:
The gateway aggregates received local model parameters into the global model, consolidating diverse insights and refinements contributed by individual devices.
5. **Global Model Distribution**:
Subsequently, the parameters of the aggregated global model (WG) are disseminated to all devices once again. This iterative process continues until the global model reaches a satisfactory level of training, ensuring continual refinement and optimization of performance.
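The steps above can be sketched as a single training round, assuming tabular Q-learning for the local models and simple parameter averaging at the gateway. Both choices, along with the learning-rate and discount values, are illustrative assumptions rather than specifics given in this section.

```python
import numpy as np

def frl_round(global_Q, local_experience, alpha=0.1, gamma=0.9):
    """One FRL round: each device copies the global Q-table (steps 1/5),
    applies its own (s, a, r, s') transitions locally (step 2), and the
    gateway averages the resulting tables into the next global model
    (steps 3/4).  Q-tables and averaging are illustrative assumptions."""
    local_models = []
    for transitions in local_experience:        # one entry per device
        Q = global_Q.copy()                     # start from the global model
        for s, a, r, s_next in transitions:     # local Q-learning updates
            Q[s, a] += alpha * (r + gamma * np.max(Q[s_next]) - Q[s, a])
        local_models.append(Q)                  # W1, ..., Wn
    return np.mean(local_models, axis=0)        # gateway aggregation -> WG
```

Repeating `frl_round` until the global table stabilizes corresponds to the iterative refinement described in step 5.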
Through these coordinated steps, the FRL system harnesses the collective intelligence of distributed devices, enabling adaptive decision-making and optimization in dynamic environments. By iteratively refining the global model based on local insights, the system achieves enhanced performance and adaptability, ultimately driving efficiency and effectiveness in managing traffic congestion.