Representasi Bidirectional Berbasis Deef Learning Dengan Metode Bert Dan Lstm Untuk Menganalisis Ujaran Kebencian Pada Pengguna Media Sosial Di Indonesia

Agustina, Irma (2024) Representasi Bidirectional Berbasis Deef Learning Dengan Metode Bert Dan Lstm Untuk Menganalisis Ujaran Kebencian Pada Pengguna Media Sosial Di Indonesia. Masters thesis, Institut Teknoogi Sepuluh Nopember.

	Text cover tesis irma agustina.pdf Download (353kB)
	Text pernyataan keaslian laporan tesis irma agustina.pdf Download (229kB)
	Text halaman persetujuan tesis irma agustina.pdf Download (363kB)
	Text halaman pengesahan tesis irma agustina.pdf Download (428kB)
	Text persetujuan percetaka laporan tesis irma agustina.pdf Download (216kB)
	Text daftar riwayat hidup irma agustina.pdf Download (205kB)
	Text halaman persembahan.pdf Download (174kB)
	Text halaman motto.pdf Download (155kB)
	Text daftar isi.pdf Download (477kB)
	Text daftar tabel.pdf Download (158kB)
	Text daftar gambar.pdf Download (216kB)
	Text kata pengantar.pdf Download (376kB)
	Text abstract.pdf Download (515kB)
	Text Bab I.pdf Download (1MB)
	Text Bab II.pdf Download (6MB)
	Text Bab III.pdf Download (1MB)
	Text Bab IV.pdf Download (3MB)
	Text Bab V.pdf Download (222kB)
	Text Daftar Pustaka.pdf Download (837kB)
	Text Lampiran.pdf Download (866kB)

Abstract

Penggunaan media sosial untuk berkomunikasi sesama warga pengguna internet (netizen) sudah sangat umum dan menyebar ke seluruh lapisan masyarakat Indonesia. Pada penelitian ini, akan diimplementasikan algoritma Long Short-Term Memory (LSTM) pada kasus ujaran kebencian, khususnya yang berkaitan dengan ujaran kebencian dengan pemerintah dan juga menggunakan BERT. Teknik pencarian data set menggunakan web crawler yang diambil dari Twitter API untuk 5000 kata yang akan dianalisis apakah mengandung Hate Speech (HS) atau Non Hate Speech (NHS) dengan pemrograman python. Hasil penelitian menujukkan bahwa Analisis terhadap ujaran kebencian (hate speech) dengan metode Bidirectional Encoder Representations dari Transformers 86% menghasilkan tingkat akurasi yang lebih tinggi sedikit dibanding Long Short Term Memory yaitu 85,2%. Hal ini disebabkan oleh penggunaan algoritma TD-IDF dan GloVe untuk menentukan pembobotan nilai yang direpresentasikan dalam vektor pada setiap kata dalam penyelesaian klasifikasi dataset spam dan non-spam sangat efektif dan efisien. Untuk penelitian selanjutnya, hasil pada penelitian ini berpeluang untuk ditingkatkan dengan menambah jumlah data latih maupun data uji serta khususnya untuk kalimat yang bermakna ambigu, atau berupa bahasa informal dan singkatan yang umum digunakan di media sosial. Penelitian selanjutnya juga dapat menggunakan metode k-fold cross validation untuk menentukan akurasinya. Kata Kunci : representasi bidirectional, deep learning, BERT dan LSTM, ujaran kebencian, media sosial ==============================================================================The use of social media to communicate with fellow internet users (netizens) is very common and has spread to all levels of Indonesian society. In this research, the Long Short-Term Memory (LSTM) algorithm will be implemented in cases of hate speech, especially those related to hate speech against the government and also using BERT. The data set search technique uses a web crawler taken from the Twitter API for 5000 words which will be analyzed whether they contain Hate Speech (HS) or Non Hate Speech (NHS) using Python programming. The research results show that analysis of hate speech using the Bidirectional Encoder Representations method from Transformers 86% produces a slightly higher level of accuracy than Long Short Term Memory, namely 85.2%. produces a slightly higher level of accuracy than Long Short Term Memory, namely 85.2%. This is caused by the selection of hyperparameters namely batch size 32, learning rate 3e-6, and epoch 5. For further research, the results of this study have the opportunity to be improved by increasing the amount of training data and test data, especially for sentences that have ambiguous meanings, or in the form of informal language and abbreviations that are commonly used on social media. Future research can also use the k-fold cross validation method to determine its accuracy. Keywords: bidirectional representation, deep learning, BERT and LSTM, hate speech, social media

Item Type:	Thesis (Masters)
Uncontrolled Keywords:	representasi bidirectional, deep learning, BERT dan LSTM, ujaran kebencian, media sosial
Subjects:	Ilmu Komputer eTheses
Divisions:	Pasca Sarjana > Magister Teknik Informatika
Depositing User:	M.T.I IRMA AGUSTINA
Date Deposited:	16 Jan 2024 01:30
Last Modified:	16 Jan 2024 01:30
URI:	http://repo.darmajaya.ac.id/id/eprint/14569

Actions (login required)

View Item