Cyberbullying Detection using Ensemble Method

Public Deposited
Resource Type
  • Cyberbullying can be defined as a form of bullying that occurs across social media platforms using electronic messages.These platforms provide a ground for the cyberbullies to engage in bullying activities.State-of-the-art technologies such as Machine learning, NLP and Deep learning can be used to develop models that can detect cyberbullying.This dissertation proposes three different approaches and five models based on these technologies to identify cyberbullying using a newly generated email dataset.Our initial approach consists in using a traditional supervised machine learning.Our second approach is based on DistilBERT.Our last approach employs an ensemble technique. Our initial approach led to the implementation of two SVM models, one using TF-IDF feature extraction, the other using a combination of different tokens of TF-IDF vectors. Our third model was implemented using DistilBERT word embeddings. The highest accuracy was obtained using an ensemble model and the lowest accuracy was obtained using the SVM model with simple TF-IDF.

Thesis Degree Level
Thesis Degree Name
Thesis Degree Discipline
Rights Notes
  • Copyright © 2022 the author(s). Theses may be used for non-commercial research, educational, or related academic purposes only. Such uses include personal study, research, scholarship, and teaching. Theses may only be shared by linking to Carleton University Institutional Repository and no part may be used without proper attribution to the author. No part may be used for commercial purposes directly or indirectly via a for-profit platform; no adaptation or derivative works are permitted without consent from the copyright owner.
Date Created
  • 2022


In Collection: