Similarity-AF-Similarity Identification Based on Word Trigrams Using Exact String Matching Algorithms

Fadlil, Abdul and Sunardi, Sunardi and Ramdhani, Rezki (2022) Similarity-AF-Similarity Identification Based on Word Trigrams Using Exact String Matching Algorithms. INTENSIF : Jurnal Ilmiah Penelitian dan penerapan Teknologi Sistem Informasi, 6 (2). ISSN 2580-409X

[thumbnail of Similarity-AF-Similarity Identification Based on Word Trigrams.pdf] Text
Similarity-AF-Similarity Identification Based on Word Trigrams.pdf

Download (3MB)

Abstract

Several studies regarding excellent exact string matching algorithms can be used to identify similarity, including the Rabin-Karp, Winnowing, and Horspool Boyer-Moore algorithms. In determining similarities, the Rabin-Karp and Winnowing algorithms use fingerprints, while the Horspool Boyer-Moore algorithm uses a bad-character table. However, previous research focused on identifying similarities using
these algorithms based on character n-gram. In contrast, identification based on the word n-gram to determine the similarity based on its linguistic meaning, especially for longer strings, had not been covered yet. Therefore, a word-level trigram was proposed to identify similarities based on the word trigrams using the three algorithms and compare each performance. Based on precision, recall, and running time comparison, the Rabin-Karp algorithm results were 100%, 100%, and 0.19 ms, respectively; the Winnowing algorithm results with the smallest window were 100%, 56%, and 0.18 ms, respectively; and the Horspool algorithm results were 100%, 100%, and 0.06 ms. From these results, it can be concluded that the performance of the Horspool Boyer-Moore algorithm is better in terms of precision, recall, and running time.

Item Type: Artikel Umum
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
T Technology > T Technology (General)
Divisi / Prodi: Master (Magister) > Master of Technology Informatica (Magister Teknologi Informatika)
Depositing User: Drs. Abdul Fadlil, M.T., Ph.D.
Date Deposited: 22 Aug 2022 03:47
Last Modified: 22 Aug 2022 03:51
URI: http://eprints.uad.ac.id/id/eprint/36405

Actions (login required)

View Item View Item