CLASSIFICATION OF HEART DISEASE RISK WITH MULTILAYER PERCEPTRON: A MACHINE LEARNING APPROACH
Abstract
This research explores the integration of Principal Component Analysis (PCA) with Support Vector Machine (SVM) classification to identify the key factors influencing customer satisfaction using questionnaire data. Leveraging SVM's effectiveness in questionnaire-based classification and PCA's feature reduction capabilities, the study examines the impact of PCA on model performance. From a dataset comprising responses from 100 participants regarding product quality, price, and service level, PCA was employed to reduce the questionnaire features to three components. Results revealed Product Quality as the most significant factor affecting Customer Satisfaction, while Service Level exhibited the lowest influence. Comparative analysis of SVM models with and without PCA integration demonstrated improved performance metrics, including reduced error rates and enhanced accuracy. PCA mitigated overfitting risks, enhancing model generalization and interpretability. The findings underscore the practical significance of PCA in optimizing SVM classification models for customer satisfaction analysis. Future research may focus on optimal component selection and the robustness of PCA-SVM models across diverse domains, advancing machine learning applications in real-world scenarios.
Keyword : Classification, Customer satisfaction analysis, Feature reduction, Principal Component Analysis, Support Vector Machine
References
Basak, H., Roy, A., Lahiri, J. B., Bose, S., & Patra, S. (2021). SVM and ANN based Classification of EMG signals by using PCA and LDA. I. http://arxiv.org/abs/2110.15279
Bramhe, M. V, Gohade, S., Kapse, R., Thamke, S., & Mehar, A. (2023). Student Performance Prediction Using Regression Analysis & Feature- Based Opinion Mining on Student Feedback. Journal of Harbin Engineering University, 44(7), 360–366.
Cateni, S., Colla, V., & Vannucci, M. (2023). Improving the Stability of the Variable Selection with Small Datasets in Classification and Regression Tasks. Neural Processing Letters, 55(5), 5331–5356. https://doi.org/10.1007/s11063-022-10916-4
Chen, J., Gong, F., Xiang, S., & Yu, T. (2021). Application of principal component analysis in evaluation of epidemic situation policy implementation. Journal of Physics: Conference Series, 1903(1). https://doi.org/10.1088/1742-6596/1903/1/012056
Dabhade, P., Agarwal, R., Alameen, K. P., Fathima, A. T., Sridharan, R., & Gopakumar, G. (2021). Educational data mining for predicting students’ academic performance using machine learning algorithms. Materials Today: Proceedings, 47(xxxx), 5260–5267. https://doi.org/10.1016/j.matpr.2021.05.646
Geche, F., Mulesa, O., Hrynenko, V., & Smolanka, V. (2019). Search for impact factor characteristics in construction of linear regression models. Technology Audit and Production Reserves, 3(2(47)), 20–25. https://doi.org/10.15587/2312-8372.2019.175020
Honest, N. (2020). A survey on Feature Selection Techniques. GIS SCIENCE JOURNAL, 7(6), 353–358. https://www.researchgate.net/publication/344121693
Kirchner, A., & Signorino, C. S. (2018). Using Support Vector Machines for Survey Research. Survey Practice, 11(1), 1–14. https://doi.org/10.29115/sp-2018-0001
Kumar, M., Sharma, C., Sharma, S., Nidhi, N., & Islam, N. (2022). Analysis of Feature Selection and Data Mining Techniques to Predict Student Academic Performance. 2022 International Conference on Decision Aid Sciences and Applications, DASA 2022, March, 1013–1017. https://doi.org/10.1109/DASA54658.2022.9765236
Li, T. (2021). Application of slack variable-optimized SVM on piano teaching reform. Journal of Physics: Conference Series, 1941(1), 012084. https://doi.org/10.1088/1742-6596/1941/1/012084
Lubis, A. H., Sihombing, P., & Nababan, E. B. (2020). Analysis of Accuracy Improvement in K-Nearest Neighbor using Principal Component Analysis (PCA). Journal of Physics: Conference Series, 1566(1), 2–10. https://doi.org/10.1088/1742-6596/1566/1/012062
M, S. S., & M, J. (2022). Prediction of Student’s Wellbeing from Stress and Sleep Questionnaire data using Machine Learning Approach. 2022 IEEE Bombay Section Signature Conference (IBSSC), 1–6. https://doi.org/10.1109/IBSSC56953.2022.10037549
Nakao, H., Imaoka, M., Hida, M., Imai, R., Nakamura, M., Matsumoto, K., & Kita, K. (2023). Determination of individual factors associated with hallux valgus using SVM-RFE. BMC Musculoskeletal Disorders, 24(1), 1–7. https://doi.org/10.1186/s12891-023-06303-2
Nasution, M. Z. F., Sitompul, O. S., & Ramli, M. (2018). PCA based feature reduction to improve the accuracy of decision tree c4.5 classification. Journal of Physics: Conference Series, 978(1), 0–6. https://doi.org/10.1088/1742-6596/978/1/012058
Odhiambo Omuya, E., Onyango Okeyo, G., & Waema Kimwele, M. (2021). Feature Selection for Classification using Principal Component Analysis and Information Gain. Expert Systems with Applications, 174(November 2020), 114765. https://doi.org/10.1016/j.eswa.2021.114765
Pardede, D., Wanayumini, & Rosnelly, R. (2023). A Combination Of Support Vector Machine And Inception-V3 In Face-Based Gender Classification. International Conference on Information Science and Technology Innovation (ICoSTEC), 2(1), 34–39. https://doi.org/10.35842/icostec.v2i1.30
Samudra, J. T., Rosnelly, R., & Situmorang, Z. (2023). Comparative Analysis of SVM and Perceptron Algorithms in Classification of Work Programs. MATRIK : Jurnal Manajemen, Teknik Informatika Dan Rekayasa Komputer, 22(2), 285–298. https://doi.org/10.30812/matrik.v22i2.2479
Taherdoost, H. (2022). Designing a Questionnaire for a Research Paper: A Comprehensive Guide to Design and Develop an Effective Questionnaire. Asian Journal of Managerial Science, 11(1), 8–16. https://doi.org/10.51983/ajms-2022.11.1.3087
Yadav, A., Jamir, I., Jain, R. R., & Sohani, M. (2019). Breast Cancer Prediction using SVM with PCA Feature Selection Method. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 5(2), 969–978. https://doi.org/10.32628/cseit1952277
Zhu, X., Dong, H., Salvo Rossi, P., & Landrø, M. (2021). Feature Selection Based on Principal Component Regression for Underwater Source Localization by Deep Learning. Remote Sensing, 13(8), 1486. https://doi.org/10.3390/rs13081486
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Arief Rahman Hakim, Irwan Daniel, Novendra Adisaputra Sinaga

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.