Big data for IoV’s data structuring enhance the query language process in a multi-dimensional analysis, interoperability, and semantic intelligibility for heterogeneous data.
3.1 The paradigm architecture for big data and IoV’s system
Leveraging big data for an intelligent road system is the essence of support from one to another. Big data for sensor data acquisition, transmission, structuring, modeling, and analytics is a methodological approach to enhance the data value from the internal and external IoV’s environment. The architecture addresses the existing data processing and scales up to the global needs of the core of the data exploratory and management contexts (Jairo et al., 2021; Ying, 2013). The current IoV data analysis algorithm considers the sensor data and its dynamism in line with the ever-growing technologies, data deluge, which is the emerging paradigm of big data to apply for sensor data structuring and analytics. It performs in a multi-aspect, including (i) the need for the reliable design of popular algorithms makes integration with other techniques. (ii) Sequential design of the algorithms to parallel computing infrastructures to the optimal performance and sensor devices’ requirements. (iii) Enhancing technological development by integrating with intelligent cloud computing systems (Li et al., 2019; Gebeyehu et al., 2014). In parallel, the process gives secure information access and data trust to its sources–via vehicles and sensor devices (Mahmood et al., 2018; Wenchao et al., 2018) (Fig. 2).
Figure 2 is an architecture for a vehicular network that encompasses federated sensor data transformed into valuable information for the optimal intelligent system services and safety. It reveals the integrity of big data applications and IoV sensor data structure towards intelligent systems enhance performance. It is a verge of Big data as a core component of the future Internet of Vehicles to optimize large-scale data computation to extract valuable information (James, 2018; Diansheng and Jeremy, 2009). Big data for searching knowledge involves various steps, including feature extraction and selection to improve the intelligent system performances (Shen-Bin et al., 2010).
3.2 The success of the vehicular system and Big Data application
Big data for IoV data structuring and analytics is a notion that has a fundamental strength to the industrial efforts of devising intelligently connected vehicles for developing safe road environments. The paradigm application of big data is to enhance the ITS's digital ecosystem for IoV service privacy and safety (Longhua et al., 2017; Kate and Jason, 2014; Omer and Jules, 2013). Big data for IoV is a scientific and dynamic way to drive data patterns that gain from sensor devices. The data is processed using big data technologies from a data vault and event-driven approaches. The data received from multiple sensor devices, including vehicles, are a meaningful insight into the events, and bandwidth (Zhenyu et al., 2017; João, 2013), as shown in Fig. 3a. It applies the process information to optimize the response in the services triggered to the optimal vehicular system interface (Fig. 3b).
The paradigm of Big Data in an ITS applies to predict the Internet of Vehicles safety and optimal operability by the data analysis using machine learning techniques for safe and proper vehicle trajectories that include location and vehicle speeds (Elmustafa et al., 2021). This data flow in the network is pertinent to estimating the traffic circumstances along the road sectors using longitudinal and latitude coordination. The traffic flow, vehicle speeds, and direction are defined in spots of the specified time using (Shridevi and Manjunath, 2020) models (Eq. 7–9).
\(q\left( A \right)=\frac{{d\left( A \right)}}{{\left| A \right|}}\) (7) \(k\left( A \right)=\frac{{t\left( A \right)}}{{\left| A \right|}}\) (8) \(v\left( A \right)=\frac{{d\left( A \right)}}{{t\left( A \right)}}\) (9)
Where q(A) is the flow of vehicles, k(A) is vehicle density, and v(A) is the vehicle speed. Annotations by d(A), t(A), and |A| represent the total distance traveled and vehicle time spent in that location and area covered by the Vehicles, respectively. Therefore, the integrity of big data for the Internet of Vehicles is to enhance road safety and vehicle trajectories time efficiency (Fei et al., 2017). As shown in Fig. 3, big data is associated with the fundamentals of the Internet of Vehicles for sensor data structuring and exploring vehicle movement safety (Noor et al., 2020). It ensures the Internet of Vehicle system security by learning and identifying the vehicles.
3.2 The Big data approach to exploit the hidden pattern of IoV’s
The big data for the vehicular system is the essence of effective intelligent technology for sequential vehicles’ information flow from one to another. A vehicle on the road will have signal information for its distance and road length, and the traffic controller can have the vehicle’s start and end distance in between (Sanghyun and Jonghwa, 2019; Young-Sik et al., 2019). The current signal information presents using Eq. 10, as follows:
$$\mathop T\nolimits_{M} =\mathop T\nolimits_{{current}} +\tau \times \frac{{\mathop R\nolimits_{{len}} - \mathop V\nolimits_{{dist}} }}{{\mathop R\nolimits_{{len}} }}$$
10
The Vehicle’s information message (M) transmitted at time t (TM) is the current time t (Current) and tau τ) of the unit time with the quotient of road length (Rlen) minus Vehicle distance from its start (Vdist) to Rlen. Big data for the traffic control system is methodological support for analyzing vehicle information as to their sequence or queues (Q) by Eq. 11. It allows the transformation of valuable information on the existing circumstance of road services. The roadside sensor devices or from the Vehicle work automatically to turn on the system and get notifications from or for the other sensor devices to remind the Vehicle of safety in its path and scenario at the car’s moment (Alesandra et al., 2018).
$$\mathop Q\nolimits_{{len}} =\left\{ {\begin{array}{*{20}{c}} {\mathop {\hbox{max} }\limits_{{\forall \mathop M\nolimits^{i} }} \left[ {\left( {\left\lceil {\frac{{\mathop V\nolimits_{{dist}}^{i} }}{{\mathop V\nolimits_{{len}} +\mathop V\nolimits_{{\operatorname{int} r - dist}} }}} \right\rceil +1} \right) \times k} \right],if\mathop V\nolimits_{{dist}}^{i} >0} \\ {1 \times k,otherwise} \end{array}} \right.$$
11
Where k is the number of lanes, Mi is the i− th message for Vehicle information, and Vdist, i is the distance of the Vehicle sending Mi. Vlen is the length of the Vehicle, and Vdist is the distance between two vehicles.
Big data for the Internet of Vehicles are generic to concatenate the techniques of simulations, visualization, ontologies, and data-driven into one form of architecture (dashboard). It is an advanced approach to managing and exploring large-scale sensor data phenomena and technology that blends the issues and applications to allow organizations to get value from the ever-increasing data sets (Gang, 2015; Stefano et al., 2013). The analytic process in agents and the network or infrastructure learn and understand the real-world using wearable and sensor devices (Husen et al., 2013). It reveals the combination of the intelligent cloud system explosive, as shown in Fig. 4. Integration is a clear insight into the IoV technological revolution that passes the new ubiquitous connectivity, computing, and communication, which depends on technical innovations in several fields, from wireless sensors to nanotechnology (Razi, 2018; Ning et al., 2013).
3.3.1 Data structure and big data technology for The Internet of Vehicles data
Big data technology is a large-scale data structuring and handling using the techniques of data summarization and analytics. It can handle the massive data structures to extract value and new patterns through a dynamic analytic process. Big data for IoV’s data are sequential structuring approaches to know the speed and bytes associated with the data phenomena and technology in optimal performance. It allows a computer-based data model development and architecture (Ifeyinwa and Henry, 2019; Tasneem and Kamalrulnizam, 2018). Big data technology is a systematic approach to parametric data representation and correlation in the context of the intelligent systems shown in Fig. 4. It is vital to view the facts of the sensor device operation in a readable data model, as shown in Fig. 5.
The integrated architecture (Fig. 5) is a structuring of data in line with the system performance to a pre-hand solution to vehicular intelligent system functionality in the context of sensor nodes and smart objects (Abdul et al., 2020). Therefore, Hive-based structured data is pertinent to performing an integrated computational model in optimal disk spacing and processing time. The architecture is essential to demystify searching for a specific object within the group that processes the data items in any desired order (Fei, 2010).
3.3.2 Big data for the verge of IoV cloud system
Big data for IoV is technology-driven to a range of decision processes (Shridevi and Manjunath, 2020). The advantage includes enhancing the knowledge sharing for efficient system infrastructures, cost optimizations, and future predictions to a profound of intelligent cloud systems (Mohamed et al., 2018). It empowers system performance and service safety by performing an optimal precision of the Internet of Vehicle system functionality. Therefore, Big data analytics for the Internet of Vehicles is a technology for real-time sensor device connectivity to prove easy access, information flows security, and data analytics (Alesandra et al., 2018; Diebold, 2003).
The intelligent transportation industry is dynamic and incremental sensor technology, and the sensor data is pertinent to augment big data on the Internet of Vehicles. The data sets in terms of volume, velocity, variability, etc., revealed as big data are capable and scalable to handle challenges by considering the issues to propose various analytical tools, including Hadoop, Alteryx, distributed databases, complex event-processing frameworks, and others.
3.3.3 Big data for IoV data virtualization
Big data for IoV data visualization is the process of federating systems' relational databases, applications, file repositories, and website data sources integrity into single data access via the dashboard (Gebeyehu et al., 2019; Mingming et al., 2017). Big data applications need a cluster of nodes and communicate with defined protocols to solve large-scale data handling and transformation problems in a given network or connectivity (Kate and Jason, 2014). The distributed system works with the delegated agent or machine to use the federated large-scale sensor data (João, 2013). The data visualization framework allows for the data replication on a given Virtual Machine (VM) (Rateb et al., 2020; Hedi et al., 2019). The performance of various steps alters the system functionality depending on the frequency of data loading, replication, backups, etc. (Robert et al., 2014). As Zhou et al., 2020). IoV’s storage optimizations and access proposed correlation probability model demystify using Big data analytics and integrity for connected wearable devices. For example, a big data analytics tool such as Tableau uses data from connectivity than transforming or importing itself, as shown in Fig. 6.
Data visualization is an advanced methodological approach to sensor data virtualization through various steps, including virtualizing data sources into an abstract format. The physical and combined federated sources into virtual data and deploy to the web-based application via the networks, as shown in Fig. 6. The advantage of big data is to enhance a wide range of information access between devices.
3.3.4 Big data inside and outside the IoV’s data center
The Internet of Vehicle’s data structuring and analytics is a dynamic process, which evolves sensor data from different sources. The data captured in the central database and the data servers are sensor data, which comes from dedicated sensor devices in the system (Vijay, 2012) for information generating and flow across various sensor devices and the real-time process perform in connecting agents. Big data is a capable and advanced way of handling large-scale sensor data as data structuring and modeling (Robert et al., 2014). The data management system is a Big data application with its entire routing standards for a systematic enhancement of the Internet of Vehicle system performances. The sensor data handling is supported through big data, as showed it in Table 2.
Table 2
IOV intelligent data descriptions
Inside | Outside |
The IoV’s data center is coming from applications and connected sensor devices -via the intelligent system, which includes: ♣ Clickstreams ♣ Application events ♣ Transaction records ♣ IoVlog/Monitoring data ♣ Virtual Machine (VM) Infrastructure monitoring | The data sources include phones, distributed sensors, and partners from outside of the center, which is massive, and endpoints and connections become the challenge, including: • Roadside and remote Telemetry • Sensors devices and nets • Smart Phones and wearable devices • RFID and barcode scanners • GPS Position Tracking • Point-of-Sale Terminals |
3.3.5 IoV’s data modeling using big data analytics
Big data technology for IoV data modeling promotes novel approaches and technologies for structured and unstructured sensor data handling. An intelligent system sensor data is large and complex to manage and conduct a critical analysis using the classic approach (Wei and Albert, 2013). Dynamic processing support users’ queries repeatedly and improves the search results. Therefore, big data stand for putting analytics to search for valuable information or knowledge. The multi-layer platform agglomerated different tasks and protocols the integrating data, time, and other factors. The big data analytics approach provides advanced large-scale data processing (Eugen et al., 2017; Shen et al., 2010), as shown in Fig. 7, which is pertinent to analyzing events in IoV systems based on the given query and analysis in the layer.
AS the data processing in a multi-layer architecture of the Vehicle data, the big data technology allows being built data management and event processing of various event-based analyzing techniques for multiple applications, such as optimizations of managing data center and system performance. The data in IoV are always massive, heterogeneous, distributed, and time and position-related, which is challenging for classical or centralized mining architecture to extract valuable information (Mingming et al., 2017; Yang et al., 2017). Based on these facts, the advantage of big data over the traditional analysis tools presents the following fundamental issues. (i) Large-scale data of IoV are almost everywhere on its infrastructure and stations. (ii) The IoV data always needs preprocessing in real-time events. (iii) The complexity of the sensor data and sensor devices’ variables to analyze the data using traditional analytic tools. (iv) The structure of IoV is complex in architecture, which requires advanced sensor devices. (v) Moreover, the strategy of sending all data to the central nodes does not optimize the performance of the services.
Big data for the intelligent system is essential to carry out advanced and dynamic techniques to be scalable up large-scale data handling and analysis via federated data. The big data approach allows undertaking a preprocessing of the raw data in the distributed nodes (agents) (Paul et al., 2012), and users can send/receive information through distributed IoV’s system infrastructure. The proposed approach is a methodological augmentation of big data for ITS performance optimizations. Therefore, the big data approach is an agent-based Big Data Mining (BDM) mechanism of distributing the load among various sensor devices to improve the response time. And also it gives a clear insight into a situation where the devices are heavily loaded while others are idle or doing very little work (Shen et al., 2010). The data process and analysis are the core of the whole activity. An experiment is a methodological approach to combining tasks and navigation processes, as shown in the following pseudo-codes.
Algorithm 1
Step 1: defining the data sets for data analysis purposes,
Step 2: Data cleaning or preprocessing and filtering from its destination and real situations,
Step 3: Navigate the specific root system using its task,
Step 4: Assigning the devices, including the subelements to receive information or facts from various sensor objects,
Step 5: Saved data in the local data warehouse and the data model obtained from the data event filtering.
The Big data-based IoV’s data modeling is scalable to adopt large data sets from multiple sensor devices and data sources by accessing the infrastructure or the networks. It is the most advanced data analytics model to connect various sensor objects via the Internet. It becomes intelligent, context-awareness, and a long-range operable approach (Ovidiu and Peter, 2013). The model can also adopt the ubiquitous ways to access the digital system (Li et al., 2018; Kate and Jason, 2014) via the Internet, sensor devices such as RFID, WLAN/WiMAX, and others.