Improving Hate Speech Detection Using Machine and Deep Learning Techniques: A Preliminary Study

Jawaid Ahmed  Siddiqui; Siti Sophiayati  Yuhaniz; Zulfiqar Ali Memon; Yumna  Amin

doi:10.11113/oiji2021.9nSpecial Issue 2.143

Authors

Jawaid Ahmed Siddiqui
Siti Sophiayati Yuhaniz
Zulfiqar Ali Memon
Yumna Amin

DOI:

https://doi.org/10.11113/oiji2021.9nSpecial%20Issue%202.143

Keywords:

Hate speech, machine learning, Classification, Categorization, Random Forest, Logistic Regression and Multinomial Naïve Bayes

Abstract

The increasing use of social media and information sharing has given major benefits to humanity. However, this has also given rise to a variety of challenges including the spreading and sharing of hate speech messages. Thus, to solve this emerging issue in social media, recent studies employed a variety of feature engineering techniques and machine learning or deep learning algorithms to automatically detect the hate speech messages on different datasets. However, most of the studies classify the hate speech related message using existing feature engineering approaches and suffer from the low classification results. This is because, the existing feature engineering approaches suffer from the word order problem and word context problem. In this research, identifying hateful content from latest tweets of twitter and classify them into several categories is studied. The categories identified are; Ethnicity, Nationality, Religion, Gender, Sexual Orientation, Disability and Other. These categories are further classified to identify the targets of hate speech such as Black, White, Asian belongs to Ethnicity and Muslims, Jews, Christians can be classified from Religion Category. An evaluation will be performed among the hateful content identified using deep learning model LSTM and traditional machine learning models which includes Linear SVC, Logistic Regression, Random Forest and Multinomial Naïve Bayes to measure their accuracy and precision and their comparison on the live extracted tweets from twitter which will be used as our test dataset.

Improving Hate Speech Detection Using Machine and Deep Learning Techniques: A Preliminary Study

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

OIJI

Article Template