Handling Missing Data in Bivariate Gamma Generation Data Using the Random Forest Method

Authors

  • Muhammad Arib Alwansyah Arib Universitas Negeri Jakarta
  • Ramya Rachmawati Universitas Bengkulu

DOI:

https://doi.org/10.21009/JKOMA.082.02

Keywords:

Bivariate Gamma, Correlation, Mean Absolute Percentage Error, Random Forest Imputations, Root Mean Square Error

Abstract

Missing data is a common problem in data analysis that can reduce the quality and accuracy of study results if not handled properly. This study aims to evaluate the performance of the Random Forest (RF) imputation method at various levels of missing value proportions, namely 5%, 10%, 15%, and 20%. The data used are Bivariate Gamma data of 200 observations with two variables, generated using RStudio software. Evaluation of imputation performance is carried out by considering the correlation value between the imputed data and the original data, the p-value as an indicator of the significance of the difference, and the error measures Mean Absolute Percentage Error (MAPE) and Root Mean Square Error (RMSE).

Downloads

Published

2025-12-26

How to Cite

Arib, M. A. A., & Ramya, R. R. (2025). Handling Missing Data in Bivariate Gamma Generation Data Using the Random Forest Method. J-KOMA : Jurnal Ilmu Komputer Dan Aplikasi, 8(02), 9–15. https://doi.org/10.21009/JKOMA.082.02