Analisis Data Mining Dalam Pemilihan Smartphone dan Klasifikasi di Berbagai Perangkat Menggunakan Random Forest
Abstract
Abstract− Smartphone technology continues to develop rapidly, driving the need for effective analysis methods to assist users in selecting devices that suit their needs. This research aims to implement data mining using the Random Forest method in the process of selecting smartphones and classifying devices based on their technical specifications. The Random Forest method was chosen because of its reliable ability to handle data with a large number of attributes, produce an accurate classification model, and minimize the risk of overfitting.
The dataset used includes technical specifications of various smartphones, such as camera resolution, chipset, RAM capacity, screen resolution, and support for 4K video recording. The research process involved data collection, pre-processing to handle missing values and data transformation, as well as model training using the Random Forest algorithm.
The research results show that the Random Forest method is able to classify devices with high accuracy, helping users determine smartphones that meet their criteria, such as support for 4K video recording and overall performance. Additionally, this research provides insight into the importance of certain attributes in smartphone selection. Thus, implementing data mining using Random Forest can be an effective solution in supporting data-based decision making in the field of consumer technology.
Keywords: Data Mining, Random Forest, Smartphone, Classification, Technical Specifications
References
[1] R. Meng and W. He, “Data Mining,” in Flavoromics, Boca Raton: CRC Press, 2023, pp. 83–122. doi: 10.1201/9781003268758-8.
[2] D. Yates and M. Z. Islam, “Data Mining on Smartphones: An Introduction and Survey,” ACM Comput Surv, vol. 55, no. 5, 2022, doi: 10.1145/3529753.
[3] M. J. Hamid Mughal, “Data mining: Web data mining techniques, tools and algorithms: An overview,” International Journal of Advanced Computer Science and Applications, vol. 9, no. 6, 2018, doi: 10.14569/IJACSA.2018.090630.
[4] S. R. Durugkar, R. Raja, K. K. Nagwanshi, and S. Kumar, “Introduction to data mining,” 2022. doi: 10.1002/9781119792529.ch1.
[5] M. A. Ibrahim, “Jenis, Klasifikasi dan Karakteristik Media Pembelajaran,” Journal of Economic Perspectives, vol. 2, no. 1, 2022.
[6] A. P. Wibawa, M. Guntur, A. Purnama, M. Fathony Akbar, and F. A. Dwiyanto, “Metode-metode Klasifikasi,” Prosiding Seminar Ilmu Komputer dan Teknologi Informasi, vol. 3, no. 1, 2018.
[7] M. Wibowo and Muh. R. F. Djafar, “Perbandingan Metode Klasifikasi Untuk Deteksi Stress Pada Mahasiswa di Perguruan Tinggi,” JURNAL MEDIA INFORMATIKA BUDIDARMA, vol. 7, no. 1, 2023, doi: 10.30865/mib.v7i1.5182.
[8] S. Jamil, M. Ur Rahman, and Fawad, “A Comprehensive Survey of Digital Twins and Federated Learning for Industrial Internet of Things (IIoT), Internet of Learning for Industrial Internet of Things (IIoT), Internet of Vehicles (IoV) and Internetofof Drones (IoD),” Applied System Innovation, vol. 5, no. 3, 2022, doi: 10.3390/asi5030056.
[9] R. L. Kumar, F. Khan, S. Kadry, and S. Rho, “A Survey on blockchain for industrial Internet of Things: Blockchain for Internet of Things,” Alexandria Engineering Journal, vol. 61, no. 8, 2022, doi: 10.1016/j.aej.2021.11.023.
[10] S. Villamil, C. Hernández, and G. Tarazona, “An overview of internet of things,” Telkomnika (Telecommunication Computing Electronics and Control), vol. 18, no. 5, 2020, doi: 10.12928/TELKOMNIKA.v18i5.15911.
[11] X. Shu and Y. Ye, “Knowledge Discovery: Methods from data mining and machine learning,” Soc Sci Res, vol. 110, 2023, doi: 10.1016/j.ssresearch.2022.102817.
[12] X. Wu, X. Zhu, G. Q. Wu, and W. Ding, “Data mining with big data,” IEEE Trans Knowl Data Eng, vol. 26, no. 1, 2014, doi: 10.1109/TKDE.2013.109.
[13] L. Breiman, “Random forests,” Mach Learn, vol. 45, no. 1, 2001, doi: 10.1023/A:1010933404324.
[14] J. L. Speiser, M. E. Miller, J. Tooze, and E. Ip, “A comparison of random forest variable selection methods for classification prediction modeling,” 2019. doi: 10.1016/j.eswa.2019.05.028.
[15] M. Schonlau and R. Y. Zou, “The random forest algorithm for statistical learning,” Stata Journal, vol. 20, no. 1, 2020, doi: 10.1177/1536867X20909688.
[16] A. Nugroho, I. Asror, and Y. F. A. Wibowo, “Klasifikasi Tingkat Kualitas Udara DKI Jakarta Berdasarkan Open Government Data Menggunakan Algoritma Random Forest,” eProceedings of Engineering, vol. 10, No. 2, no. 2, 2023.
Copyright (c) 2024 Ananda Aulia

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).


.png)
.png)


