Comparison of Logistic Regression and Random Forest Methods in Predicting Vehicle Tax Payment Compliance
DOI:
https://doi.org/10.31294/informatika.v13i1.11944
Keywords:
Tax; Motor Vehicle Tax; Logistic Regression; Random Forest; Classification
Abstract
Motor vehicle tax is a major source of Regional Original Income (PAD). However, motor vehicle tax payment compliance in North Aceh Regency remains suboptimal, particularly with respect to late payments. A data-driven approach is needed to predict and understand taxpayer compliance patterns more accurately. This study compares the performance of Logistic Regression and Random Forest in predicting motor vehicle tax payment compliance and identifies the factors that influence taxpayer compliance behavior at the North Aceh Samsat (Sistem Administrasi Manunggal Satu Atap, the one-stop vehicle administration office). The study uses secondary data consisting of 100,000 motor vehicle tax payment transactions recorded at the North Aceh Samsat for the 2022–2024 period. The response variable is tax payment compliance status (compliant or non-compliant), and the predictor variables are vehicle age, type of ownership, vehicle type, and vehicle brand. The data are split into 70% training data and 30% testing data. Model performance is evaluated using accuracy, precision, recall, and Area Under the Curve (AUC). The results show that Random Forest predicts better than Logistic Regression, with higher accuracy and AUC values. Vehicle age and type of ownership are the most influential variables in predicting tax payment compliance, while vehicle brand has a relatively small influence. Logistic Regression offers a clear interpretation of the relationships between variables but discriminates between the two classes less well than Random Forest. Random Forest is therefore the more effective prediction model for motor vehicle tax payment compliance at the North Aceh Samsat. Machine learning-based predictive models have the potential to support more targeted policy making aimed at improving motor vehicle tax payment compliance, especially at reducing late payments.
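The four evaluation metrics named in the abstract can be illustrated with a short, dependency-free Python sketch. It is not the authors' code, and the labels and scores below are invented toy values, not the study's data. It assumes that "compliant" is coded as the positive class (1), that predicted probabilities are thresholded at 0.5, and that AUC is computed as the share of (compliant, non-compliant) pairs in which the compliant case receives the higher score. The function names are hypothetical.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p != positive)
    return tp, fp, fn, tn


def accuracy_precision_recall(y_true, y_pred, positive=1):
    """Return (accuracy, precision, recall) for hard class predictions."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    accuracy = (tp + tn) / len(y_true)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, precision, recall


def auc_score(y_true, scores, positive=1):
    """AUC as the fraction of positive/negative pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    if not pos or not neg:
        raise ValueError("AUC needs at least one example of each class")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy example (assumed values): 1 = compliant, 0 = non-compliant.
y_true = [1, 1, 1, 0, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.7, 0.2, 0.5]  # predicted P(compliant)
y_pred = [1 if s >= 0.5 else 0 for s in scores]     # assumed 0.5 threshold

acc, prec, rec = accuracy_precision_recall(y_true, y_pred)
auc = auc_score(y_true, scores)
print(f"accuracy={acc:.3f} precision={prec:.3f} recall={rec:.3f} auc={auc:.3f}")
# prints: accuracy=0.625 precision=0.600 recall=0.750 auc=0.875
```

Accuracy, precision, and recall depend on the chosen threshold, whereas AUC is computed from the raw scores and summarizes ranking quality across all thresholds. That is why it is a natural measure of the discrimination ability on which the two models are compared.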
License
Copyright (c) 2026 Khairul Fuadi, Taufiq, Arnawan Hasibuan, Dahlan Abdullah, Nurdin Nurdin (Author)

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.





