This article offers techniques for identifying cyberbullying across a variety of social media platforms, including WhatsApp, Facebook, Instagram, TikTok, and YouTube.
The cyberbullying detection framework presented in the block diagram comprises two major components: Natural Language Processing (NLP) and Machine Learning (ML). Real-time data, extracted from platforms such as Twitter, WhatsApp, Facebook, Instagram, TikTok, and YouTube, undergoes a meticulous preprocessing phase to eliminate unnecessary characters. This involves tasks like removing hashtags, stopwords, numeric data, and hexadecimal patterns, as well as converting text to lowercase. The subsequent NLP techniques, including tokenization, lemmatization, and vectorization, prepare the data for ML algorithms. The machine learning pipeline involves data collection from diverse sources, text cleaning to remove noise, tokenization for analysis, and stopword removal to reduce dataset noise. Feature extraction incorporates TF-IDF vectorization and word embeddings for comprehensive representation. For multimodal analysis, image processing and feature concatenation combine textual and visual features. Model selection includes supervised learning models such as SVM and Logistic Regression, as well as deep learning architectures such as RNNs or Transformers. Ensemble methods enhance model robustness. Training involves dataset splitting, hyperparameter tuning, and validation [12].
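As a concrete illustration, the following is a minimal sketch of the vectorization-plus-classifier stage of such a pipeline, assuming scikit-learn; the example comments, labels, and hyperparameters are hypothetical placeholders, not the study's actual configuration.

```python
# Minimal sketch of a TF-IDF + supervised-classifier pipeline (assumed setup).
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline

comments = ["have a great day", "nobody likes you, loser"]  # placeholder data
labels = [0, -1]  # 0 = non-toxic, -1 = toxic, matching the labeling scheme below

pipeline = Pipeline([
    ("tfidf", TfidfVectorizer(lowercase=True, stop_words="english")),
    ("clf", LogisticRegression(max_iter=1000)),
])
pipeline.fit(comments, labels)
print(pipeline.predict(["you are such a loser"]))  # likely [-1] (toxic)
```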
Model evaluation employs metrics like accuracy, precision, recall, and the confusion matrix. Ethical considerations address bias mitigation and privacy concerns. Deployment involves real-time monitoring and scalability for handling large data volumes. A feedback mechanism allows users to contribute to model improvement, promoting continuous learning to adapt to evolving cyberbullying patterns. This integrated approach aims to create a safer online environment by effectively identifying and mitigating cyberbullying through a systematic and ethical machine learning-based detection system.
A. Dataset Description
Data collection is a foundational aspect of research, enabling the study of specific variables and facilitating predictions. In the context of cyberbullying, constructing representative models relies on a reliable dataset. This study employs its own diverse dataset, comprising 8,455 comments from various social media platforms, to examine instances of cyberbullying and harassment.
Notably, the dataset reveals a higher prevalence of cyberbullying directed at women (68.1%) compared to men (31.9%), with comments directed at actors, politicians, singers, and sports personalities.
The labeled dataset designates comments as toxic (-1), i.e., bullying, or non-toxic (0), i.e., non-bullying; 57.2% of the comments are classified as bullying sentences and 42.8% as non-bullying sentences, reflecting the diversity of negative language commonly used in daily life, as shown in Fig. 5.
The study underscores the significance of a representative dataset in understanding cyberbullying patterns, highlighting notable instances of bullying towards well-known individuals. Following data extraction, preprocessing becomes imperative to clean and prepare the dataset for effective detection. Real-world data often contains unnecessary characters, necessitating thorough preprocessing for improved testing and training outcomes. This holistic approach contributes to a nuanced understanding of cyberbullying propagation, emphasizing the importance of addressing gender-specific trends and negative language prevalent in online interactions.
B. Data Cleaning
Particularly vital for cleaning social media user data, this stage removes various irrelevant elements. Punctuation, special characters, retweet symbols, hashtags, numeric and hexadecimal values, and URLs are removed, as they do not contribute to sentence meaning. Additionally, words with fewer than three letters are eliminated, and all text is converted to lowercase for consistency. This meticulous cleaning process ensures a refined dataset for subsequent classification tasks, enhancing the models' effectiveness in extracting meaningful patterns from social media content [13].
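The sketch below shows one possible implementation of these cleaning rules in Python; the regular-expression patterns are illustrative, not the study's exact implementation.

```python
import re

def clean_comment(text: str) -> str:
    """Apply the cleaning rules described above; patterns are illustrative."""
    text = re.sub(r"http\S+|www\.\S+", " ", text)       # URLs
    text = re.sub(r"\bRT\b", " ", text)                 # retweet symbols
    text = re.sub(r"#\w+", " ", text)                   # hashtags
    text = re.sub(r"0[xX][0-9a-fA-F]+", " ", text)      # hexadecimal values
    text = re.sub(r"\d+", " ", text)                    # numeric values
    text = re.sub(r"[^A-Za-z\s]", " ", text)            # punctuation / special chars
    words = [w for w in text.lower().split() if len(w) >= 3]  # drop short words
    return " ".join(words)

# e.g. "user check this"
print(clean_comment("RT @user: Check this!! https://t.co/abc #toxic 0xFF 123"))
```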
C. Dataset Preprocessing
After the initial data cleaning phase, Natural Language Processing (NLP) techniques are applied to transform raw text into a format suitable for machine learning algorithms. This involves three key processes outlined in Fig. 7. First, tokenization is employed to split each phrase in the tweet into smaller chunks, such as sentences, words, and symbols, known as tokens. Following tokenization, lemmatization is implemented to reduce words to their root forms, enhancing the algorithm's comprehension by standardizing inflectional variations. Subsequently, vectorization is performed to convert the text into numerical vectors of real numbers. This process is crucial for enabling machine learning models to process and understand textual information effectively. After data cleaning and preprocessing (as depicted in Fig. 6 and Fig. 7), the dataset is split into training and testing sets. The testing dataset, crucial for real-time system usage, is extracted from platforms via text mining.
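A minimal sketch of the tokenization and lemmatization steps, assuming NLTK as the toolkit (the paper does not name a specific library); the example sentence is hypothetical.

```python
import nltk
from nltk.corpus import stopwords
from nltk.stem import WordNetLemmatizer
from nltk.tokenize import word_tokenize

# One-time resource downloads (punkt_tab is needed on newer NLTK releases).
for pkg in ("punkt", "punkt_tab", "stopwords", "wordnet"):
    nltk.download(pkg, quiet=True)

lemmatizer = WordNetLemmatizer()
stop_words = set(stopwords.words("english"))

def preprocess(text: str) -> list[str]:
    tokens = word_tokenize(text.lower())                       # tokenization
    tokens = [t for t in tokens if t.isalpha() and t not in stop_words]
    return [lemmatizer.lemmatize(t) for t in tokens]           # lemmatization

# e.g. ['bully', 'sending', 'hateful', 'message', 'repeatedly']
print(preprocess("The bullies were sending hateful messages repeatedly"))
```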
Both datasets undergo preprocessing techniques and are fed into various machine-learning models to facilitate effective classification tasks. This comprehensive approach ensures the model's ability to generalize and make accurate predictions on unseen data.
D. Feature Extraction and Feature Selection
Word embedding is a pivotal technique in machine learning that represents words as vectors in multi-dimensional spaces, crucial for addressing natural language processing (NLP) challenges. The Word2Vec model, featuring a vocabulary of 13,507 non-bullying and 17,259 bullying words and an embedding dimension of 16, is employed for this purpose. This method proves invaluable for NLP tasks, including identifying related words, semantic grouping, and text classification. Word2Vec, functioning as a two-layer neural network, translates human language into machine language, creating word embeddings applicable to tasks such as text similarity and sentiment analysis. Leveraging approaches like Continuous Bag of Words (CBOW) and Skip-Gram, it predicts target words based on context or vice versa. During training, the model learns embeddings by representing a corpus as an N-dimensional vector. To address variance issues common in neural network models like Convolutional Neural Networks (CNNs), an ensemble learning approach is implemented. Multiple models run concurrently with varying hyperparameters, and their outputs are aggregated using a Random Forest classifier and the Max Voting technique [14]. This ensemble method significantly enhances accuracy while mitigating the trade-off between bias and variance, making it a powerful tool.
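As a brief, hedged illustration of the embedding step, the following trains a tiny Word2Vec model with gensim; the toy sentences stand in for the study's tokenized comments, and only the embedding dimension of 16 is taken from the text above.

```python
from gensim.models import Word2Vec

# Toy pre-tokenized corpus standing in for the study's comment dataset.
sentences = [
    ["you", "are", "worthless"],
    ["nobody", "likes", "you"],
    ["have", "a", "great", "day"],
]

# vector_size=16 matches the embedding dimension stated above; sg=0 selects
# CBOW (sg=1 would select Skip-Gram). Other hyperparameters are assumed.
w2v = Word2Vec(sentences, vector_size=16, window=3, min_count=1, sg=0)
vec = w2v.wv["worthless"]        # 16-dimensional embedding for one token
```

And a generic stand-in for the max-voting aggregation, using scikit-learn's VotingClassifier rather than the paper's exact CNN ensemble; the features and labels are placeholders.

```python
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC

X = [[0.1, 0.9], [0.8, 0.2], [0.2, 0.8], [0.9, 0.1]]   # placeholder features
y = [-1, 0, -1, 0]                                      # -1 = toxic, 0 = non-toxic

# Hard voting returns the majority (max-vote) class across the base models.
voter = VotingClassifier(
    estimators=[
        ("rf", RandomForestClassifier(n_estimators=50)),
        ("lr", LogisticRegression()),
        ("svc", SVC()),
    ],
    voting="hard",
)
voter.fit(X, y)
print(voter.predict([[0.15, 0.85]]))    # likely [-1]
```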
E. Metrics and Evaluation
Measuring the ratio of correctly predicted observations to all observations, this metric is the most basic way to assess performance. Accuracy is especially informative when working with symmetric samples, where false positives and false negatives occur in nearly equal numbers. The accuracy of the model is calculated using the classification results collected during each test phase. Writing $n_c$ for the number of correctly classified samples and $n$ for the total number of samples, it is stated as: Accuracy (%) = $\frac{n_c}{n} \times 100\%$. The following complementary metrics are also used; a brief computation of all of them appears after the list.
• Precision: The positive predicted value is another name for precision. It is the percentage of truly positive predicted positives:
$$\text{Precision}=\frac{TP}{TP+FP}$$
• Recall: The percentage of real positives that are anticipated to be positive is known as recall:
$$\text{Recall}=\frac{TP}{TP+FN}$$
• F1 Score: The F1 score serves as a comprehensive metric, combining precision and recall, offering a holistic measure of a categorization system's accuracy. It is calculated as the harmonic mean of precision and recall. For binary and multiclass classification, F1 scores, ranging from 0 to 1, are commonly employed to assess predictive performance. The ROC curve visually illustrates the true positive rate (TPR) against the false positive rate (FPR). In this study, the F1 score, together with the ROC curve, was utilized to determine the most effective classification model:
$$\text{F1 Score}=\frac{2\times P\times R}{P+R}$$
• Accuracy: Accuracy is the proportion of correctly classified instances (true positives and true negatives) among all instances:
$$\text{Accuracy}=\frac{TP+TN}{TP+TN+FP+FN}$$
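The sketch below computes these metrics with scikit-learn on hypothetical label arrays; the values are placeholders, not results from the study.

```python
from sklearn.metrics import (accuracy_score, confusion_matrix, f1_score,
                             precision_score, recall_score)

y_true = [-1, 0, -1, -1, 0, 0, -1, 0]   # ground-truth labels (-1 = toxic)
y_pred = [-1, 0, 0, -1, 0, -1, -1, 0]   # hypothetical model predictions

print("Accuracy :", accuracy_score(y_true, y_pred))                  # 0.75
print("Precision:", precision_score(y_true, y_pred, pos_label=-1))   # 0.75
print("Recall   :", recall_score(y_true, y_pred, pos_label=-1))      # 0.75
print("F1 score :", f1_score(y_true, y_pred, pos_label=-1))          # 0.75
print(confusion_matrix(y_true, y_pred))
```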
F. Applying Machine Learning Algorithm
Following these preprocessing and feature selection stages, a variety of machine learning models were examined, leading to the identification of seven models to be functionally compared. These models were selected based on findings reported by multiple authors as well as factors like popularity, usability, and back-end functionality. The classifiers employed in the study are listed below; a brief comparison sketch follows the list:
• Logistic Regression: Logistic regression is a classification model employing a logistic function to represent a binary outcome. In mathematical terms, it utilizes the logistic function to compress the result of a linear equation, constraining it to a range between 0 and 1: $P(x)=\frac{1}{1+e^{-(\beta_0+\beta_1 x)}}$.
• Decision Tree Classifier: Decision Trees serve dual purposes in addressing classification and regression challenges. Conceptually, they form a tree-shaped structure, employing tuned parameters for predictions. Employing a top-down approach during training, decision trees effectively analyze datasets, making them versatile tools in both classification and regression scenarios.
• Random Forest Classifier: Leverages the collective decision-making of numerous decision trees, with the majority vote determining the model's prediction. This approach ensures robustness, scalability, and resistance to overfitting. While fast and easy to interpret, its real-time prediction capability may diminish with a higher number of trees.
• Multinomial Naive Bayes (NB): A probabilistic algorithm widely applied in Natural Language Processing (NLP). Leveraging Bayes' theorem, it predicts text tags, such as those for newspaper articles, by calculating and returning the tag with the highest probability, assuming features are independent.
• KNeighbors Classifier: K-Nearest Neighbors (KNN) is a straightforward text classification algorithm that determines the class of new data based on similarity measures with existing data. It uses distance metrics to identify the K nearest neighbors and assigns the most frequent class among them to the new sample [15].
• Support Vector Machines (SVM): Powerful for text classification due to their ability to find a hyperplane in n-dimensional space, effectively classifying data points. Linear SVMs are often applied to text classification problems with many features. The decision boundary is defined by the equation $f(x)=w^{T}x+b$, where $w$ is the weight vector, $x$ is the input feature vector, and $b$ is the bias term.
• Stochastic Gradient Descent (SGD) Classifier: An optimization algorithm employed for minimizing cost functions, commonly used to train linear classifiers such as SVM and Logistic Regression. It facilitates discriminative learning and optimization, particularly effective for large-scale linear models [16].
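To make the comparison concrete, here is a minimal sketch that trains all seven classifiers on TF-IDF features with scikit-learn; the toy comments, labels, and hyperparameters are illustrative placeholders rather than the study's actual data or settings.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression, SGDClassifier
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import MultinomialNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import LinearSVC
from sklearn.tree import DecisionTreeClassifier

# Toy stand-in for the 8,455-comment dataset; -1 = toxic, 0 = non-toxic.
comments = [
    "you are a pathetic loser", "nobody wants you here", "go away you idiot",
    "everyone hates you", "you are so stupid",
    "great game last night", "congratulations on the award",
    "what a lovely photo", "thanks for sharing this", "have a wonderful day",
]
labels = [-1, -1, -1, -1, -1, 0, 0, 0, 0, 0]

X = TfidfVectorizer().fit_transform(comments)

models = {
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Decision Tree": DecisionTreeClassifier(random_state=0),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=0),
    "Multinomial NB": MultinomialNB(),
    "KNeighbors (k=3)": KNeighborsClassifier(n_neighbors=3),
    "Linear SVM": LinearSVC(),
    "SGD Classifier": SGDClassifier(random_state=0),
}

# Cross-validated accuracy; cv=2 only because the toy dataset is tiny.
for name, model in models.items():
    scores = cross_val_score(model, X, labels, cv=2, scoring="accuracy")
    print(f"{name}: mean accuracy = {scores.mean():.2f}")
```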