Film Recommendation System Using Content-Based Filtering and the Convolutional Neural Network (CNN) Classification Methods

Arliyanna, Nilla and Setiawan, Erwin Budi (2024) Film Recommendation System Using Content-Based Filtering and the Convolutional Neural Network (CNN) Classification Methods. Jurnal Ilmiah Teknik Elektro Komputer dan Informatika (JITEKI), 10 (1). pp. 17-29.

Abstract

Managing large amounts of data is a challenge for users, so a recommendation system is needed as an information filter that suggests relevant items. Twitter is often used to find movie reviews, which can serve as a basis for developing recommendation systems. This research contributes by combining content-based filtering with Convolutional Neural Network (CNN) classification. To the best of the researchers' knowledge, no prior research has addressed this combination of method and classification. The main focus is to evaluate the development of a recommendation system by integrating and comparing two similarity identification approaches, RoBERTa and TF-IDF. RoBERTa and TF-IDF are used as vectorizers, and a classification method is applied to form a model that can recognize patterns in the data and produce accurate predictions based on its features. The data comprise 854 movies and 34,086 film reviews from 44 Twitter accounts. The SMOTE method was applied to overcome data imbalance. Three experiments were conducted, with accuracy increasing in each. The first experiment used TF-IDF as the baseline with SMOTE on CNN classification. The second experiment applied the baseline, SMOTE, and embedding to CNN classification. The third experiment applied the baseline, SMOTE, embedding, and an optimizer to CNN classification. The experimental results show that TF-IDF as the baseline, combined with SMOTE, embedding, and the SGD optimizer with the best learning rate on CNN classification, gives the best result, with an accuracy of 86.41%. Thus, the system can provide relevant movie recommendations with good prediction accuracy and performance.
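
As a rough illustration of the TF-IDF similarity step that the abstract names as the baseline, the following minimal sketch ranks items by cosine similarity between TF-IDF vectors. It is not the authors' implementation: the function names (tfidf_vectors, cosine, recommend), the whitespace tokenizer, the smoothed IDF formula, and the toy movie descriptions are all assumptions made for this example, and the SMOTE, embedding, and CNN stages are not shown.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per document."""
    # Assumption: naive lowercase whitespace tokenization.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        # Smoothed IDF (an assumed variant) keeps every weight positive.
        vectors.append({
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + df[term])) + 1)
            for term, count in tf.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def recommend(query_index, docs, top_k=2):
    """Return indices of the top_k documents most similar to the query."""
    vectors = tfidf_vectors(docs)
    scores = [
        (cosine(vectors[query_index], vec), i)
        for i, vec in enumerate(vectors)
        if i != query_index
    ]
    scores.sort(reverse=True)
    return [i for _, i in scores[:top_k]]


# Hypothetical toy descriptions, not taken from the paper's dataset.
movies = [
    "space adventure with robots and aliens",
    "romantic comedy set in paris",
    "aliens invade earth in a space war",
    "comedy about a romantic wedding in paris",
]

print(recommend(0, movies, top_k=1))
```

In this toy setup, an item is recommended because its description shares weighted terms with the query item; a production system would more likely rely on a library vectorizer and a proper tokenizer.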

Item Type: General Article
Subjects: T Technology > TK Electrical engineering. Electronics. Nuclear engineering
Division / Study Program: Faculty of Industrial Technology (Fakultas Teknologi Industri) > S1-Electrical Engineering (S1-Teknik Elektro)
Depositing User: M.Eng. Alfian Ma'arif
Date Deposited: 22 Apr 2024 02:20
Last Modified: 22 Apr 2024 02:20
URI: http://eprints.uad.ac.id/id/eprint/61852
