Journal Article

Sentiment Analysis Classification of ChatGPT on Twitter Big Data in Indonesia Using Fast R-CNN

Sio Jurnalis Pipin, Frans Mikael Sinaga, Sunaryo Winardi, Muhammad Noor Hakim

MATRIK: Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer, Vol. 7 No. 4 (2024) • 2024

15 citations

Abstract

The advent of OpenAI's ChatGPT, a large language model (LLM) proficient in various fields including artificial intelligence (AI) and natural language processing (NLP), has sparked extensive discussions and diverse opinions on social media platforms such as Twitter in Indonesia. This research aims to explore these dynamics by mapping both praises and criticisms related to ChatGPT's technological advancements and potential negative impacts. Utilizing deep learning-based sentiment analysis techniques, the study applies Convolutional Neural Network (CNN) and Fast Region-based Convolutional Neural Network (Fast R-CNN) models to a dataset of 7,604 tweets categorized into Positive, Negative, and Neutral sentiments. The methodology covers data collection from Twitter, data cleaning and pre-processing, followed by model training and evaluation. The results show that the Fast R-CNN model outperforms the CNN model, achieving an accuracy of 94.5% compared to 86% for CNN. These findings underscore the effectiveness of Fast R-CNN for sentiment analysis in the Indonesian context and highlight its potential to support more informed strategies for addressing the opportunities and challenges posed by AI technologies like ChatGPT.

Overview

Penelitian ini menganalisis persepsi masyarakat Indonesia terhadap ChatGPT dengan memanfaatkan data besar Twitter yang diklasifikasikan ke dalam sentimen positif, negatif, dan netral.

Model deep learning berbasis CNN dan Fast R-CNN digunakan untuk melakukan klasifikasi sentimen, dengan serangkaian tahapan pengumpulan data, pembersihan, dan prapemrosesan teks sebelum pelatihan model.

Key Contributions

Mengumpulkan dan mengolah dataset berisi 7.604 tweet berbahasa Indonesia terkait ChatGPT yang diklasifikasikan ke dalam tiga kelas sentimen: Positive, Negative, dan Neutral.
Menerapkan dan membandingkan model Convolutional Neural Network (CNN) dan Fast R-CNN untuk tugas klasifikasi sentimen pada data Twitter skala besar.
Menunjukkan bahwa Fast R-CNN mencapai akurasi 94,5% dan melampaui performa model CNN yang berada di kisaran 86%, sehingga lebih efektif untuk analisis sentimen ChatGPT di konteks Indonesia.
Memberikan wawasan tentang persepsi publik terhadap teknologi AI seperti ChatGPT dan menunjukkan potensi integrasi Fast R-CNN dalam analisis sentimen media sosial di masa depan.