Rancang Bangun Service Application Program Interface Sistem Machine Learning Klasifikasi Teks Menggunakan Algoritma Support Vector Machine

Authors

  • Ottoh Hidayatullah Universitas Islam Negeri Syarif Hidayatullah
  • Victor Amrizal Universitas Islam Negeri Syarif Hidayatullah
  • Arini Universitas Islam Negeri Syarif Hidayatullah

DOI:

https://doi.org/10.29080/systemic.v6i1.920

Keywords:

Support Vector Machine, Knowledge Acquistion, Machine Learning

Abstract

Data shows very large numbers for Internet use in Indonesia. In the field of education, online libraries are an effort to facilitate researchers to search for references to research documents. Based on observations, UIN Jakarta already has a good repository of research documents, but the online research document repository does not fulfill the Knowledge Acquisition feature. This capability allows users to obtain knowledge information that is not easily accessible to users. This research build a machine learning system using the Support Vector Machine algorithm so that the system built can categorize documents based on the informatics research fields. This research also builds a system services API (Application Program Interface) so that data output from machine learning systems can be used by a variety of platforms and different operating system environments. The accuracy of the machine learning system in this study resulted in a percentage of classification accuracy of 73.2% with a parameter value of 0.9. At the preprocessing stage the selection of unigram-bigram is the best in this study. Preprocessing affects the level of classification of machine learning systems. Preprocessing using stemming improves the results of ability accuracy. The amount of data affects the accuracy of the machine learning classification ability, it can be seen when the data is increased to 488 accuracy increases to 74.49. When the experiment was done again so that the data increased to 492 data, the accuracy increased again to 77.78%.

Downloads

Download data is not yet available.

References

C. X. Zhai, and S. Massung, "Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining." Penerbit New York : Association for Computing Machinery and Morgan & Claypool , New York. Vol 1, p.7-9, 2016.

S. M. H. Dadgar, M. S. Araghi , and M. M. Farahani, " A Novel Text Mining Approach Based on TF-IDF and Support Vector Machine for News Classification," IEEE International Conference on Engineering and Technology (ICETECH), 2016, DOI: 10.1109/ICETECH.2016.7569223

R. Wongso, " News Article Text Classification in Indonesian Language,", Procedia Computer Science 116:137-143 · December 2017, DOI: 10.1016/j.procs.2017.10.039

M. Kretchmar , and Y. Zhao, "Text Message Authorship Classification Using kernel Support Vector Machines," IEEE International Conference on Computational Science and Computational Intelligence, 2014, DOI: 10.1109/CSCI.2014.121

S. N. Asiyah, and K. Fithriasari , Klasifikasi Berita online menggunakan Support Vector Machine dan K-nearest Neighbor, Jurnal Sains dan Seni ITS, Vol 5, No 2 (2016) DOI: 10.12962/j23373520.v5i2.16643, http://ejurnal.its.ac.id/index.php/sains_seni/article/view/16643

D. Ariadi, and K. Fithriasari, "Klasifikasi Berita Indonesia menggunakan Naive Bayesian dan Support Vector Machine dengan Confix Stripping Stemmer," Jurnal Sains dan Seni ITS, Vol 4, No 2, 2015, DOI: 10.12962/j23373520.v4i2.10966

A. D. Putri, and J. Adisantoso, " Klasifikasi dokumen teks menggunakan metode Support Vector Machine dengan pemilihan fitur CHI-Square," 2013, http://repository.ipb.ac.id/handle/123456789/65199

A.Z. Arifin, I.P.A.K. Mahendra, and H.T. Ciptaningtyas, "Enhanced Confix Stripping Stemmer and Ants Algorithm for Classifying News Document in Indonesian Language,", Proceeding of International Conference on Information & Communication Technology and Systems (ICTS), p. 149-157, 2009

A. D. Tahitoe, and D. Purwitasari, "Implementasi Modifikasi Enhanced Confix Stripping Stemmer Untuk Bahasa Indonesia dengan Metode Corpus Based Stemming," Institut Teknologi Sepuluh Nopember (ITS) – Surabaya, 60111, Indonesia, 2010, http://digilib.its.ac.id/public/ITS-Undergraduate-14255-paperpdf.pdf

D.N. Chandra, G. Indrawan, & I.N. Sukajaya, "Klasifikasi Berita Lokal Radar Malang Menggunakan Metode Naïve Bayes Dengan Fitur N-Gram, " Jurnal Ilmiah Teknologi Dan Informasi ASIA (JITIKA), 10(1),p.11–19. 2016 http://lp3m.asia.ac.id/wpcontent/uploads/2016/02/2.-jurnal-Denny.pdf

E. Indrayuni, M. Wahyudi, S. Informasi, J. Selatan, I. Komputer, & J. Selatan, "Penerapan Charachter N-Gram Untuk Sentiment Review Hotel Menggunakan Algoritma Naive Bayes, " Konfrensi Nasional Ilmu Pengetahuan dan Teknologi (KNIT) (pp. 88–93). 2015.

G. Dwyer, S. Aggarwal, J. Stouffer, "Flask: Building Python Web Services. Packt, ", Publishing Ltd, Birmingham, 2017

Haroon, Danish and Karachi, "Python Machine Learning Case Studies, " Penerbit Apress, Pakistan, 2017

R. Lawson, " Web Scraping with Python. Packt," Publishing Ltd, Birmingham, 2015

MongoDB Compass. 2016. Dari : https://www.mongodb.com/blog/post/getting-startedwith-mongodb-compass. Diakses pada 25 Mei 2018 pukul 20.30 WIB.

Gartner, "Preparing and Architecting for machine learning," 2017. https://www.gartner.com/binaries/content/assets/events/keywords/catalyst/catus8/preparing_and_architecting_for_machine_learning.pdf .

Additional Files

Published

2021-01-25

How to Cite

Hidayatullah, O. . ., Amrizal, V., & Arini. (2021). Rancang Bangun Service Application Program Interface Sistem Machine Learning Klasifikasi Teks Menggunakan Algoritma Support Vector Machine . Systemic: Information System and Informatics Journal, 6(1), 13–21. https://doi.org/10.29080/systemic.v6i1.920

Issue

Section

Articles