Post on 25-Jan-2020
PERBANDINGAN METODE TERM WEIGHTING
UNTUK KLASIFIKASI EMOSI PADA LIRIK LAGU
TUGAS AKHIR
Sebagai Persyaratan Guna Meraih Gelar Sarjana Strata 1
Teknik Informatika Universitas Muhammadiyah Malang
Oleh :
LU’LU’UL MUKARROMAH
201010370311074
JURUSAN TEKNIK INFORMATIKA
FAKULTAS TEKNIK
UNIVERSITAS MUHAMMADIYAH MALANG
2015
KATA PENGANTAR
Alhamdullilah segala puja dan puji syukur senantiasa penulis panjatkan kehadirat
Allah SWT yang telah memberikan rahmat, taufiq serta hidayahnya, sehingga penulis
dapat menyelesaikan pembuatan Laporan Tugas Akhir (TA) dengan judul
”Perbandingan Metode Term Weighting untuk Klasifikasi Emosi pada Lirik Lagu”
yang diajukan sebagai salah satu syarat untuk meraih gelar Sarjana Strata 1.
Dalam penyusunan Tugas Akhir ini penulis berusaha untuk menerapkan ilmu
yang telah didapat selama menjalani perkuliahan dengan tidak terlepas dari petunjuk,
bimbingan, bantuan dan dukungan dari berbagai pihak.
Penulis menyampaikan rasa terima kasih yang tidak mungkin terlupakan kepada
pihak-pihak yang telah memberikan bantuan moral maupun material secara langsung
maupun tidak langsung kepada :
1. Allah SWT, atas terselesaikannya Laporan Tugas Akhir (TA).
2. Kedua orang tuaku, H.Abdul Hafidz Alwi dan Hj. Chofifah Alwi, terima kasih
atas kasih sayang, do’a, kesabaran dan pengorbanan tak terhingga yang telah
tercurahkan selama ini.
3. Adekku Ulul Izzatul Jannah dan Muhammad Akmal Firdaus terima kasih atas
doanya.
4. Bapak Yuda Munarko, S.Kom, M.Sc selaku Dosen Pembimbing 1.
5. Bapak Yufis Azhar S.Kom, M.kom selaku Dosen Pembimbing 2.
6. Dosen Pengajar yang telah banyak memberikan ilmunya untuk Kami.
7. Ibu Gita Indah M., S.T, M.Kom selaku Dosen Penguji 1, dan Bapak Aminudin,
S.Kom selaku Dosen Penguji 2 yang telah dengan sabar memberikan saran demi
perbaikan tugas akhir ini.
8. Terima kasih buat Nurul Solechah, Linda Nur Wulansari, Marcellina Ratna,
kalian teman yang luar biasa.
9. Teman-teman seangkatan dan seperjuangan, terima kasih atas persahabatan,
kebersamaan dan semangat kekeluargaan yang telah terjalin selama ini.
10. Semua pihak yang tidak dapat disebutkan satu persatu oleh penulis terima kasih
atas bantuannya.
Jazakumullah khoiron katsiron atas semuanya, penulis menyadari bahwa
pembuatan Tugas Akhir ini masih banyak kekurangan karena keterbatasan dan
kemampuan penulis.
Akhirnya penulis berharap semoga memberikan manfaat bagi penulis khususnya,
pembaca pada umumnya.
Malang, Januari 2015
Penulis
DAFTAR ISI
Abstrak ................................................................................................................ i
Kata Pengantar ..................................................................................................... iii
Daftar Isi .............................................................................................................. v
Daftar Gambar ..................................................................................................... vii
Daftar Tabel .......................................................................................................... viii
BAB I PENDAHULUAN .................................................................................. 1
1.1 Latar Belakang ........................................................................................... 1
1.2 Rumusan Masalah ...................................................................................... 2
1.3 Tujuan Penelitian ....................................................................................... 2
1.4 Batasan Masalah ......................................................................................... 2
1.5 Metodologi Penelitian ................................................................................ 2
1.5.1 Studi Pustaka .................................................................................. 3
1.5.2 Analisa Sistem ................................................................................ 3
1.5.3 Implementai Sistem ........................................................................ 3
1.5.4 Pengujian Sistem ............................................................................ 3
1.5.5 Penulisan Laporan .......................................................................... 3
1.6 Sistematika Penulisan ................................................................................ 3
BAB II LANDASAN TEORI ........................................................................... 5
2.1 Lirik Lagu ................................................................................................... 5
2.2 Emosi ........................................................................................................ 5
2.3 Klasifikasi .................................................................................................. 6
2.4 Text Mining ................................................................................................ 6
2.4.1 Case Folding ................................................................................. 7
2.4.2 Tokenizing ..................................................................................... 8
2.4.3 Filtering ......................................................................................... 9
2.4.4 Stemming ...................................................................................... 9
2.5 Pembobotan ................................................................................................. 10
2.5.1 Term Frequency .............................................................................. 10
2.5.2 Term Frequency – Inverse Document Frequency ........................... 11
2.5.3 Term Presence ................................................................................. 12
2.5.4 Okapi BM25 .................................................................................... 12
2.6 K-Nearest Neighbour ................................................................................... 13
2.7 Pengujian .................................................................................................... 14
BAB III ANALISA DAN PERANCANGAN SISTEM .................................. 15
3.1 Analisa Masalah .......................................................................................... 15
3.2 Analisa Kebutuhan Sistem .......................................................................... 15
3.2.1 Kebutuhan Fungsional .................................................................... 16
3.2.2 Kebutuhan Non Fungsional ............................................................ 16
3.3 Perancangan Sistem ................................................................................... 16
3.3.1 Preprocessing ................................................................................. 17
3.3.2 Pembobotan (Term Weighting) ...................................................... 18
3.3.3 K-Nearest Neighbour ...................................................................... 20
3.4 Perancangan Database ................................................................................ 22
3.5 Perancangan Antar Muka / Interface .......................................................... 22
BAB IV IMPLEMENTASI DAN PENGUJIAN ............................................. 24
4.1 Implementasi Kebutuhan Hardware dan Software .................................... 24
4.2 Implementasi Sistem .................................................................................. 24
4.2.1 Implementasi Tahap Pembuatan Data Training ............................. 25
4.2.2 Implementasi Tahap Preprocessing ............................................... 28
4.2.3 Implementasi Tahap Term Weighting ............................................ 30
4.2.4 Implementasi K-Nearest Neighbour ................................................ 33
4.2.5 Tampilan / User Interface ................................................................ 34
4.3 Pengujian Sistem ......................................................................................... 36
BAB V PENUTUP ............................................................................................. 43
5.1 Kesimpulan ................................................................................................ 43
5.2 Saran ........................................................................................................... 43
DAFTAR PUSTAKA ........................................................................................ 44
DAFTAR GAMBAR
Gambar 2.1 Tahapan Text Mining ................................................................. 7
Gambar 2.2 Proses Case Folding .................................................................. 8
Gambar 2.3 Proses Tokenizing ...................................................................... 8
Gambar 2.4 Proses Filtering ......................................................................... 9
Gambar 2.5 Proses Stemming ........................................................................ 10
Gambar 3.1 Alur Perancangan Sistem............................................................ 17
Gambar 3.2 Relasi Antar Tabel ..................................................................... 22
Gambar 3.3 Interface Menu Utama ............................................................... 23
Gambar 3.4 Interface Halaman Hasil Pembobotan ....................................... 23
Gambar 3.5 Interface Halaman Hasil Keseluruhan ....................................... 23
Gambar 4.1 Implementasi Tahap Casefolding .............................................. 28
Gambar 4.2 Implementasi Tahap Tokenizing ................................................ 29
Gambar 4.3 Implementasi Tahap Filtering .................................................. 29
Gambar 4.4 Implementasi Tahap Remove Non-Alphanumeric ..................... 29
Gambar 4.5 Implementasi Tahap Pemanggilan Porter Stemmer .................. 30
Gambar 4.6 Implementasi Perhitungan OKAPI BM25 ................................. 30
Gambar 4.7 Implementasi Perhitungan Term Frequence ............................. 31
Gambar 4.8 Implementasi Perhitungan Term Presence ................................ 32
Gambar 4.9 Implementasi Perhitungan TF – IDF ......................................... 32
Gambar 4.10 Implementasi Perhitungan Cosine Similarity ............................ 33
Gambar 4.11 Interface Halaman Utama Sistem .............................................. 34
Gambar 4.12 Interface Hasil Pembobotan ....................................................... 35
Gambar 4.13 Interface Hasil Klasifikasi Emosi Keseluruhan ......................... 35
Gambar 4.14 Hasil Klasifikasi Sistem ............................................................ 38
Gambar 4.15 Hasil KNN dari Masing – Masing Metode Pembobotan (Term
Weighting) ................................................................................... 39
DAFTAR TABEL
Tabel 3.1 Contoh Dokumen ........................................................................ 18
Tabel 3.2 Perhitungan TF dan IDF ............................................................. 19
Tabel 3.3 Perhitungan Bobot TF – IDF ...................................................... 19
Tabel 3.4 Perhitungan Bobot Term Presence ............................................ 19
Tabel 3.5 Perhitungan Perkalian Q dan D ................................................. 20
Tabel 3.6 Perhitungan Panjang Vektor ...................................................... 21
Tabel 3.7 Hasil Cosine Similarity .............................................................. 21
Tabel 3.8 Urutan Hasil Cosine Similarity ................................................... 21
Tabel 4.1 Data Training Emosi Marah (Angry) ........................................ 25
Tabel 4.2 Data Training Emosi Senang (Joy) ............................................ 26
Tabel 4.3 Data Training Emosi Sedih (Sad) .............................................. 27
Tabel 4.4 Perhitungan Akurasi dengan Data Training Berjumlah 50 buah lirik
lagu ............................................................................................ 36
Tabel 4.5 Perhitungan Akurasi dengan Data Training Berjumlah 100 buah lirik
lagu ............................................................................................ 37
Tabel 4.6 Perhitungan Akurasi dengan Data Training Berjumlah 150 buah lirik
lagu ............................................................................................ 37
Tabel 4.7 Perhitungan Rata – Rata Presentase Akurasi ............................. 37
Tabel 4.8 Perhitungan Bobot OKAPI BM25 ............................................. 39
Tabel 4.9 Perhitungan Bobot TF ............................................................... 40
Tabel 4.10 Perhitungan Bobot TP ............................................................... 41
Tabel 4.11 Perhitungan Bobot TF-IDF ........................................................ 42
DAFTAR PUSTAKA
[1] Lailatus Sofiana, Zainal Abidin, Hani Nurhayati, 2012 “Klasifikasi Emosi untuk
Teks Berbahasa Indonesia dengan Menggunakan K-Nearest Neighbor”, Jurnal
Mahasiswa UIN Maliki Malang.
[2] Oxford English Dictionary 1st ed. lyric, adj. and n." 1903, diakses tanggal:
15 Jan 2014.
[3] Liddell, Henry & al. A Greek–English Lexicon 9th ed., "λυρικός". Clarendon
Press (Oxford), 1996. Hosted at the Perseus Project. Diakses tanggal:
15 Jan 2014.
[4] Sidney, Philip. An Apologie for Poetrie op. cit. OED (1903).
[5] Haryanto S.Pd, 2009, Pengertian Emosi, [Online],
http://belajarpsikologi.com/pengertian-emosi/
[6] Frieda, N.H. (Inggris) Moods, Emotion Episodes and Emotions, New York:
Guilford Press, 1993, hal. 381-403.
[7] Frijda, (Inggris) Moods, Emotion Episodes and Emotions," hal. 381.
[8] Kamus Besar Bahasa Indonesia (KBBI), Klasifikasi, [Online],
http://kbbi.web.id/klasifikasi
[9] Wikipedia, 2013, Klasifikasi, [Online], http://id.wikipedia.org/wiki/Klasifikasi ,
diakses tanggal: 20 Agustus 2014
[10] Central Statistics Office, 2014, What is a classification?, [Online],
http://www.cso.ie/en/surveysandmethodology/classifications/whatisaclassificatio
n/
[11] Majumapan Corp, 2012, Pengertian Text Minning, [Online],
http://retrievalsystem.blogspot.com/2012/07/pengertian-text-mining.html
[12] Berry, M.W.; Kogan, J. (2010). Text Mining: Application and Theory.
Chichester: John Wiley & Sons, Ltd.
[13] Feldman, R.; Sanger, J. (2007). The Text Mining Handbook: Advanced
Approaches in Analyzing Unstructured Data. New York: Cambridge University
Press.
[14] Ratna Maria, 2013, Pengertian Data Mining, Text Mining Dan Web Mining,
[Online], http://analisis-proses-bisnis-
koperasi.blogspot.com/2013/04/pengertian-data-mining-text-mining-dan.html
[15] Ashil Fairuz, 2012, Text Mining, [Online],
http://ashilfairuz.blogspot.com/2012/07/text-mining.html#more
[16] Hullyati, 2012, Tugas Information Retrieval, [Online],
http://hullyati260.wordpress.com/2012/10/24/tugas-information-retriefal/
[17] Nurul Hidayati, 2012, Tahapan Text Processing, [Online],
http://nurulhidayati41991.blogspot.com/2012_10_01_archive.html
[18] Karyono G, Fandy Setyo Utomo, (2012), “Temu Balik Informasi Pada Dokumen
Teks Berbahasa Indonesia Dengan Metode Vector Space Retrieval Model”,
[Online], Tersedia:
http://publikasi.dinus.ac.id/index.php/semantik/article/viewFile/141/102
[19] Yugianus P, Harry Soekotjo Dachlan, dan Rini Nur Hasanah, (2013),
"Pengembangan Sistem Penelusuran Katalog Perpustakaan Dengan Metode
Rocchio Relevance Feedback”, [Online], Jurnal EECCIS Vol. 7, No. 1, Juni
2013, Tersedia:
http://jurnaleeccis.ub.ac.id/index.php/eeccis/article/viewFile/201/174
[20] Sasono Dimas Prasetyo Adi, Lailil Muflikhah, Achmad Ridok, (2014),
“Klasifikasi Genre Film Berdasarkan Judul Dan Sinopsis Menggunakan Fuzzy
K-Nearest Neighbour (Fuzzy K-NN)”, [Online], Jurnal Mahasiswa PTIIK UB
Volume 3 - Number 4, Tersedia:
https://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=
rja&uact=8&ved=0CB4QFjAA&url=http%3A%2F%2Fptiik.ub.ac.id%2Fdoro%
2Fdownload%2Farticle%2Ffile%2FDR00040201406&ei=RwjyU_bWGM6NuA
T9z4KYDA&usg=AFQjCNFLdjIg_c6A1_xeas_6c7U5Xpt96g&bvm=bv.73231
344,d.c2E
[21] Budy, 2012, Automatic Text Summarization Menggunakan Metode TF-IDF,
[Online], http://budyblog9.wordpress.com/2012/12/12/automatic-text-
summarization-menggunakan-metode-tf-idf/
[22] Putranti Noviah Dwi, Edi Winarko, (2014) , “Analisis Sentimen Twitter untuk
Teks Berbahasa Indonesia dengan Maximum Entropy dan Support Vector
Machine”, [Online], IJCCS, Vol.8, No.1, January 2014, pp. 91~100, Tersedia:
http://jurnal.ugm.ac.id/ijccs/article/viewFile/3499/3025
[23] Budi Prasetyo, Susana, (2012) “Pengembangan Aplikasi Cerdas Berbasis
Intelegensiabuatan Untuk Perbandingan Karya Ilmiah Hasil Penelitiandalam
Upaya Mencegah Plagiasi”, [Online], Tersedia:
http://digilib.batan.go.id/ppin/katalog/index.php/searchkatalog/downloadDatabyI
d/989/0853-9812-2012-157.pdf
[24] Renaldi Primaswara P, Rekyan Regasari MP., ST., MT., Dewi Yanti Liliana
S.Kom., M.Kom, “Simulasi Pengontrolan Nyala Lampu Berbasis Sistem
Pengenalan Suara (Voice Recognition) Dengan Menggunakan Metode Fast
Fourier Transform Dan K-Nearest Neighbor”, [Online], Jurnal Mahasiswa
PTIIK UB Volume 1 - Number 8 2013, Tersedia:
https://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=
rja&uact=8&ved=0CB4QFjAA&url=http%3A%2F%2Fptiik.ub.ac.id%2Fdoro%
2Farchives%2Fdetail%2FDR00072201306&ei=tBTyU--
dLobJuAT3z4GoDA&usg=AFQjCNFl2yjUa7RKezNekngnDXBVOhs05Q&bv
m=bv.73231344,d.c2E .
[25] Bagus Puji, 2014, K – Nearest Neighbor Information Retrieval, [Online],
http://bagus-puji-fst12.web.unair.ac.id/artikel_detail-101546-Pendidikan-
K%20%E2%80%93%20NEAREST%20NEIGHBOR%20INFORMATION%20
RETRIEVAL.html