In this section, several social network methods are utilized, and their advantages and disadvantages are analyzed subsequently.
A. Social Network Graph
A social network graph is a visual representation of social relationships and interactions between individuals or groups, often displayed as nodes (representing individuals or groups) and edges (representing relationships or connections between them). Social network graphs can be used to analyze and understand social networks, including identifying key actors, communities or groups, and patterns of interaction and communication [7]. The interactive relationships of individuals of both two ant groups can be plotted through the social network graph of sphere and random. Figures 1 and 2 portray these connections through straight lines that illustrate the relationships of each member. The size of nodes in the figure is represented by degree/4. The position and importance of each subject in the social network can be roughly understood in the social network graphs.
B. Structural Analysis
1) Density
The concept of network density pertains to the proximity or relationship between nodes in a network structure [8]. A higher level of closeness among network members leads to a more tightly-knit ant community. A network in which every node is linked to every other node has a density of 1. The densities of the group 1 and 2 are 0.3594343 and 0.3869642, respectively.
2) Centrality analysis
Centrality is a metric that evaluates how central an individual is within a network [9], which is a powerful tool for identifying key individuals or nodes in a network, helping to understand the structure and functioning of the network. Different centrality measures reveal different aspects of a network’s importance, and it's important to consider multiple measures to get a more complete picture. This paper examines four types of centrality measures: degree centrality, betweenness centrality, closeness centrality, and eigenvector centrality. These measures describe the node’s influence, control over resources, speed of information spread, and overall influence, respectively [10]. Tables 1 and 2 below show the descriptive statistics of these centrality indicators for both groups.
However, two important limitations of centrality indices exist [11]. Firstly, what works best for one network may not work well for other networks. Additionally, while identifying the most important nodes in a network is helpful, it may not apply to all other nodes in the network.
Table 1
shows centrality analysis of group 1
|
Group 1
|
|
degree
|
betweenness
|
closeness
|
eigenvector
|
Mean
|
80.51
|
15.39
|
7.03e-03
|
0.75
|
Std.Dev
|
13.58
|
12.12
|
6.67e-04
|
0.12
|
Min
|
41.00
|
0.00
|
5.46e-03
|
0.37
|
Q1
|
72.00
|
7.63
|
6.58e-03
|
0.68
|
Median
|
79.00
|
11.79
|
6.90e-03
|
0.75
|
Q3
|
92.00
|
20.74
|
7.58e-03
|
0.86
|
Max
|
109.00
|
61.20
|
8.70e-03
|
1.00
|
Skewness
|
-0.20
|
1.22
|
2.54e-01
|
-0.38
|
Table 2
shows centrality analysis of group 2
|
Group 2
|
|
degree
|
betweenness
|
closeness
|
eigenvector
|
Mean
|
100.61
|
14.42
|
6.34e-03
|
0.80
|
Std.Dev
|
17.05
|
9.90
|
6.01e-04
|
0.14
|
Min
|
25.00
|
0.00
|
4.26e-03
|
0.17
|
Q1
|
94.00
|
7.30
|
6.02e-03
|
0.75
|
Median
|
104.00
|
13.47
|
6.41e-03
|
0.84
|
Q3
|
112.00
|
20.83
|
6.76e-03
|
0.89
|
Max
|
128.00
|
50.22
|
7.58e-03
|
1.00
|
Skewness
|
-1.66
|
0.86
|
-7.89e-01
|
-1.88
|
C. Community Detection
1) Community detection based on edge betweenness (Newman-Girvan
Edge C. Community Detectionbetweenness is a widely used measure for detecting communities in a network. The edge betweenness measure is effective in detecting both small and large communities within a network. Using this approach, edges with high betweenness are identified as bridges between highly connected clusters of nodes. Thus, by iteratively removing the edge with the highest betweenness, we can break down a network into a hierarchy of nested communities. Figures 3 and 4 display the community detection results based on edge betweenness for both group 1 and 2.
While the method may improve the speed of computer processing, as only intermediate betweenness values are recalculated after edge removal, it only results in a continuous process of breaking down the network into smaller and smaller communities [12]. It does not indicate which partition is best, which means that edge betweenness may not always be the best measure for detecting communities in a network, as other measures such as modularity or conductance may better capture the characteristics of a particular network, which will be introduced in the next section. Despite its limitations, the edge betweenness method remains a useful tool for community detection in social network analysis.
2) Community detection based on greedy optimization of modularity
Different nodes can belong to different communities, and we aim to enhance the modularity by including nodes that can contribute the most to a single community. We ultimately prefer a partition with a higher modularity score [13]. As it shows in Figs. 5 and 6, the community detection results based on greedy optimization of modularity for both two groups are presented.
The greedy optimization of modularity is a precise and computationally efficient method of detecting communities in networks [14]. However, it has two shortcomings. The method tends to merge communities that are connected through a single link if their size is below a certain threshold. Additionally, as the number of nodes in the network increases, obtaining the best partition becomes increasingly challenging [15].
3) K-means clustering
The K-means clustering algorithm is commonly used in social network analysis due to its simplicity and efficiency in detecting clusters or communities in the network. K-means clustering divides the data set into K predefined different clusters, so that each node comes into to the cluster with the nearest mean. In addition, the average distance from the center is plotted as a function of K, and the “elbow point” where the rate of descent changes violently can be used to roughly determine K [16]. Figures 7 and 8 reveal the K-means clustering results for both group 1 and 2.
As for its strengths, K-means clustering is a useful technique for identifying distinct groups or communities within a social network, it can scale to large data sets and easily generalize to clusters of different shapes and sizes [17]. However, K-means clustering is limited in its ability to capture the complex dynamics and interconnections within social networks. The number of clusters needs to be assigned and cannot handle noisy data and outliers [18].
D. Link Analysis
1) PageRank
PageRank is a metric that measures the importance of a node based on the concept that a node is considered important if other important nodes link to it. The algorithm calculates a score for each node by looking at the incoming links to that node, and then normalizes those scores to add up to 1. The nodes that have higher PageRank scores are viewed as more crucial within the network. This metric can identify key individuals that are highly connected and influential within the social structures of both two ant groups [19]. As it shows in Table 3, the descriptive statistics of the PageRank results for both ant groups are listed.
On the positive side, PageRank offers a straightforward and simple way to evaluate the importance of nodes in a network. It is commonly used and well-understood by many researchers in the area of social network analysis. Additionally, it is successful at identifying nodes that serve as important connectors between various groups within the social network. However, PageRank also has certain limitations. It may not differentiate between incoming and outgoing links in the network, giving equal importance to both kinds of links. Therefore, this metric could be susceptible to manipulation by individuals who want to increase their perceived importance within the network. Lastly, it may not consider the impact of other significant features of the network, such as the strength or quality of the connections between nodes [20].
Table 3
shows PageRank of group 1 and 2
|
Group 1
|
Group 2
|
Mean
|
8.85e-03
|
7.63e-03
|
Std.Dev
|
1.60e-02
|
1.24e-02
|
Min
|
2.39e-03
|
2.14e-03
|
Q1
|
2.94e-03
|
2.69e-03
|
Median
|
4.15e-03
|
3.81e-03
|
Q3
|
7.21e-03
|
6.93e-03
|
Max
|
1.41e-01
|
1.08e-01
|
Skewness
|
5.78e + 00
|
5.20e + 00
|
E. Proximity Measures
1) Neumann kernel
The Neumann kernel, also known as the Laplacian smoothing function, is a mathematical function that is applied to a network in social network analysis to measure nodal centrality. It is a smoothing function that is based on the average of the neighboring nodes’ importance scores. The function assumes that more central nodes will have more high-quality neighbors, and that a node's score should be related to the scores of its neighbors. By applying the Neumann kernel to a social network, researchers can identify important nodes based on the concept of shared influence or centrality. The correlation of nodes in the graph based on the immediate connections and remote connections are modeled through Neumann Kernel. It employs a customizable parameter to adjust the weight assigned to connections that are further away. As a result, two matrices can be generated, \({K}_{{\gamma }}\) and \({T}_{{\gamma }}\). By examining the diagonal values in the \(K\) matrix, the relative ranking of nodes can be determined [21]. Here in Table 4, the descriptive statistics of the Neumann kernel ranking results of both group 1 and 2 are presented.
One benefit of the Neumann kernel is that it takes into account both relevance and importance. Nevertheless, it might not consider other important network characteristics, such as the quality of the connections between nodes or the existence of dominant individuals who govern social interactions [22].
Table 4
shows Neumann kernel ranking of group 1 and 2
NK ranking
|
Group 1
|
Scores
|
NK ranking
|
Group 2
|
Scores
|
1
|
Individual 8
|
5.66e + 04
|
1
|
Individual 113
|
2.21e + 05
|
2
|
Individual 71
|
5.25e + 04
|
2
|
Individual 114
|
2.05e + 05
|
3
|
Individual 68
|
4.47e + 04
|
3
|
Individual 112
|
1.90e + 05
|
4
|
Individual 60
|
4.07e + 04
|
4
|
Individual 120
|
1.88e + 05
|
5
|
Individual 89
|
3.93e + 04
|
5
|
Individual 108
|
1.78e + 05
|
…
|
…
|
…
|
…
|
…
|
…
|
112
|
Individual 3
|
0
|
130
|
Individual 3
|
0
|
113
|
Individual 7
|
0
|
131
|
Individual 130
|
0
|