Analisis Kinerja Logistic Regression dan Random Forest pada Deteksi Fraud E-Commerce Menggunakan SMOTE dan PCA
DOI:
https://doi.org/10.31294/justian.v6i2.11566Keywords:
E-commerce Fraud Detection, SMOTE, PCA, Logistic Regression, Random ForestAbstract
The rapid growth of e-commerce platforms has increased the volume and complexity of digital transactions, which is accompanied by a rising risk of fraudulent activities. This study aims to apply and evaluate the performance of Logistic Regression and Random Forest algorithms for fraud detection in e-commerce transactions. To address the class imbalance problem, the Synthetic Minority Over-sampling Technique (SMOTE) is employed, while dimensionality reduction is performed using Principal Component Analysis (PCA). The dataset is divided into training and testing sets using an 80:20 ratio. Model evaluation is conducted under four scenarios: baseline without additional preprocessing, SMOTE only, PCA only, and a combination of SMOTE and PCA. The results indicate that Random Forest consistently outperforms Logistic Regression across most evaluation metrics, including Recall, F1-Score, and Area Under the Curve (AUC). The application of SMOTE significantly improves the model’s ability to identify fraudulent transactions, achieving the highest Recall of 80.79% using Random Forest. In contrast, the use of PCA, either alone or combined with SMOTE, tends to degrade model performance. This study concludes that Random Forest combined with SMOTE provides the most effective approach for fraud detection in highly imbalanced e-commerce transaction data.
References
[1] W. Priatna, S. Yulianto, J. Prasetyo, S. Wijono, E. Maria, and D. Manongga, “Deteksi Anomali dalam Penipuan E-commerce Menggunakan Hybrid Autoencoder-Transformer Frameworks,” JEPIN (Jurnal Edukasi dan Penelit. Inform., vol. 11, no. 1, pp. 33–40, 2025.
[2] J. K. Afriyie et al., “A supervised machine learning algorithm for detecting and predicting fraud in credit card transactions,” Decis. Anal. J., vol. 6, no. November 2022, p. 100163, 2023, doi: 10.1016/j.dajour.2023.100163.
[3] P. Sundaravadivel, R. A. Isaac, D. Elangovan, D. KrishnaRaj, V. V. L. Rahul, and R. Raja, “Optimizing credit card fraud detection with random forests and SMOTE,” Sci. Rep., vol. 15, no. 1, 2025, doi: 10.1038/s41598-025-00873-y.
[4] K. S. Roy, P. B. Udas, B. Alam, and K. Paul, “Unveiling Hidden Patterns: A Deep Learning Framework Utilizing PCA for Fraudulent Scheme Detection in Supply Chain Analytics,” Int. J. Intell. Syst. Appl., vol. 17, no. 2, pp. 14–30, 2025, doi: 10.5815/ijisa.2025.02.02.
[5] Z. Salekshahrezaee, J. L. Leevy, and T. M. Khoshgoftaar, “The effect of feature extraction and data sampling on credit card fraud detection,” J. Big Data, vol. 10, no. 1, 2023, doi: 10.1186/s40537-023-00684-w.
[6] F. Ernawan, K. Handayani, M. Fakhreldin, and Y. Abbker, “Light Gradient Boosting with Hyper Parameter Tuning Optimization for COVID-19 Prediction,” Int. J. Adv. Comput. Sci. Appl., vol. 13, no. 8, pp. 514–523, 2022, doi: 10.14569/IJACSA.2022.0130859.
[7] A. C. Purba and T. Handhayani, “Perbandingan Algoritma K-Means, Affinity Clustering, Dan Minibatch K-Means Untuk Analisis Segmentasi Pasar,” vol. 13, no. 1, pp. 54–63, 2024.
[8] K. Handayani and E. Erni, “Penerapan Light Gradient Boosting Dalam Prediksi Rasio Klik Tayang,” JATI (Jurnal Mhs. Tek. Inform., vol. 7, no. 1, pp. 13–18, 2023, doi: 10.36040/jati.v7i1.6010.
[9] R. Rianti et al., “Penerapan PCA dan Algoritma Clustering untuk Analisis Mutu Perguruan Tinggi di LLDIKTI Wilayah IV,” vol. 18, 2024.
[10] S. Khoiriyah and Z. Fatah, “Penerapan Algoritma Linear Regression dalam Memprediksi Harga Rumah Menggunakan RapidMiner,” vol. 3, no. 2, pp. 107–115, 2024.
[11] M. M. Alvanof and R. K. Dinata, “Penerapan Algoritma Random Forest dalam Deteksi dan Klasifikasi Ransomware,” vol. 5, no. 2, 2024.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Kartika Handayani, Erni, Ardiyansyah, Agung Sasongko

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.







