Separating Hate Speech from Abusive Language on Indonesian Twitter

Ibrahim, Muhammad Amien and Sagala, Noviyanti Tri Maretta and Arifin, Samsul and Nariswari, Rinda and Murnaka, Nerru Pranuta and Prasetyo, Puguh Wahyu Separating Hate Speech from Abusive Language on Indonesian Twitter. 2022 International Conference on Data Science and Its Applications (ICoDSA), 2022. pp. 187-191. ISSN 978-166548665-1

[thumbnail of Hasil Cek Similarity] Text (Hasil Cek Similarity)
Separating_Hate_Speech_from_Abusive_Language_on_Indonesian_Twitter-turnitin.pdf

Download (1MB)
[thumbnail of Dokumen Publikasi] Text (Dokumen Publikasi)
Separating_Hate_Speech_from_Abusive_Language_on_Indonesian_Twitter.pdf

Download (956kB)

Abstract

Social media is an effective tool for connecting with people and distributing information. However, many people often use social media to spread hate speech and abusive languages. In contrast to hate speech, abusive languages are frequently used as jokes with no purpose of offending individuals or groups, even though they may contain profanities. As a result, the distinction between hate speech and abusive language is often blurred. In many cases, individuals who spread hate speech may be prosecuted as it has legal implications. Previous research has focused on binary classification of hate speech and normal tweets. This study aims to classify hate speech, abusive language, and normal messages on Indonesian Twitter. Several machine learning models, such as logistic regression and BERT models, are utilized to accomplish text classification tasks. The model's performance is assessed using the F1-Score evaluation metric. The results show that BERT models outperform other models in terms of F1-Score, with the BERT-indobenchmark model, which was pretrained on social media text data, achieving the highest F1-Score of 85.59. This also demonstrates that pretraining the BERT model using social media data improves the classification model significantly. Developing such classification model that can distinguish between hate speech and abusive language would help individuals in preventing the spread of hate speech that has legal implications.

Item Type: Artikel Umum
Subjects: Q Science > QA Mathematics
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisi / Prodi: Faculty of Teacher Training and Education (Fakultas Keguruan dan Ilmu Pendidikan) > S1-Mathematics Education (S1-Pendidikan Matematika)
Depositing User: puguh prasetyo
Date Deposited: 18 Nov 2022 02:07
Last Modified: 18 Nov 2022 02:07
URI: http://eprints.uad.ac.id/id/eprint/37552

Actions (login required)

View Item View Item