Cahyani, Andharini Dwi and Fathoni, Moh. Wildan and Rachman, Fika Hastarita and Basuki, Ari and Amin, Salman and Khotimah, Bain Khusnul (2025) Automatic essay scoring: leveraging Jaccard coefficient and Cosine similarity with n-gram variation in vector space model approach. IAES International Journal of Artificial Intelligence (IJ-AI), 14 (5). pp. 3599-3612.
![]() |
Text
23521-62412-1-PB.pdf Download (712kB) |
Abstract
Automated essay scoring (AES) is a vital area of research aiming to provide efficient and accurate assessment tools for evaluating written content. This study investigates the effectiveness of two popular similarity metrics, Jaccard coefficient, and Cosine similarity, within the context of vector space models (VSM) employing unigram, bigram, and trigram representations. The data used in this research was obtained from the formative essay of the citizenship education subject in a junior high school. Each essay undergoes preprocessing to extract features using n-gram models, followed by vectorization to transform text data into numerical representations. Then, similarity scores are computed between essays using both Jaccard coefficient and Cosine similarity. The performance of the system is evaluated by analyzing the root mean square error (RMSE), which measures the difference between the scores given by human graders and those generated by the system. The result shows that the Cosine similarity outperformed the Jaccard coefficient. In terms of n-gram, unigrams have lower RMSE compared to bigrams and trigrams.
Item Type: | Artikel Umum |
---|---|
Subjects: | T Technology > TK Electrical engineering. Electronics Nuclear engineering |
Divisi / Prodi: | Faculty of Industrial Technology (Fakultas Teknologi Industri) > S1-Electrical Engineering (S1-Teknik Elektro) |
Depositing User: | M.Eng. Alfian Ma'arif |
Date Deposited: | 17 Oct 2025 08:46 |
Last Modified: | 17 Oct 2025 08:46 |
URI: | http://eprints.uad.ac.id/id/eprint/88351 |
Dosen Pembimbing: | UNSPECIFIED | [error in script] |
Actions (login required)
![]() |
View Item |