similarity- Similarity Identification Based on Word-Trigrams Using Exact String Matching Algorithms

Sunardi, Sunardi (2022) similarity- Similarity Identification Based on Word-Trigrams Using Exact String Matching Algorithms. INTENSIF: Jurnal Ilmiah Penelitian dan Penerapan Teknologi Sistem Informasi, 6 (2). ISSN 2549-6824

[thumbnail of Similarity Identification Based on Word-Trigrams Using Exact String Matching Algorithms.pdf] Text
Similarity Identification Based on Word-Trigrams Using Exact String Matching Algorithms.pdf

Download (3MB)

Abstract

Several studies regarding excellent exact string matching algorithms can be used to identify similarity, including the Rabin-Karp, Winnowing, and Horspool Boyer-Moore algorithms. In determining similarities, the Rabin-Karp and Winnowing algorithms use fingerprints, while the Horspool Boyer-Moore algorithm uses a bad-character table. However, previous research focused on identifying similarities using these algorithms based on character n-gram. In contrast, identification based on the wordn-gram to determine the similarity based on its linguistic meaning, especially for longer strings, had not been covered yet. Therefore, a word-level trigram was proposed to identify similarities based on the word trigrams using the three algorithms and compare each performance. Based on precision, recall, and running time comparison, the Rabin-Karp algorithm results were 100%, 100%, and 0.19 ms, respectively; the Winnowing algorithm results with the smallest window were 100%, 56%, and 0.18 ms, respectively; and the Horspool algorithm results were 100%, 100%, and 0.06 ms. From these results, it can be concluded that the performance of the Horspool Boyer-Moore algorithm is better in terms of precision, recall, and running time.

Item Type: Artikel Umum
Subjects: T Technology > T Technology (General)
Divisi / Prodi: Faculty of Industrial Technology (Fakultas Teknologi Industri) > S1-Electrical Engineering (S1-Teknik Elektro)
Depositing User: SUNARDI
Date Deposited: 23 Aug 2022 02:31
Last Modified: 23 Aug 2022 02:36
URI: http://eprints.uad.ac.id/id/eprint/36442

Actions (login required)

View Item View Item