DETERMINATION OF IMPORTANT VARIABLES IN DIVORCE TYPE CLASSIFICATION USING THE RANDOM FOREST METHOD WITH SMOTE

Authors

  • Dania Siregar Statistics, Universitas Negeri Jakarta
  • Bintang Mahesa Wardana Statistics, Universitas Negeri Jakarta
  • Ahmad Syauqi Baihaqy Statistics, Universitas Negeri Jakarta
  • Liswatun Naimah Statistics, Universitas Negeri Jakarta
  • Almira Nindya Putri Statistics, Universitas Negeri Jakarta
  • Qory Meidianingsih Mathematics Education Study Program, Universitas Negeri Jakarta
  • Dini Safitri Communication Science, Universitas Negeri Jakarta

DOI:

https://doi.org/10.21009/JSA.08209

Keywords:

Divorce, Importance Level Measure, Random Forest, SMOTE, Importance Features

Abstract

Central Jakarta is highly strategic area situated at the heart of the Indonesian capital. It serves as the central hub for the government, history, tourism, and elite shopping sectors with convenient access to various buffer areas surrounding the capital. However, the availability of these facilities does not necessarily ensure the continuity of domestic life within the community. This can be observed from the increasing divorce rate in the region since 2017. Notably, a higher proportion of divorce suits are filed by wives than by husbands. There are various factors that can trigger divorce lawsuits such as continuous disputes and arguments, economic factors, and domestic violence. And these factors certainly cannot be separated from the individual profiles of married couples such as age, occupation, education level, and duration of marriage. The purpose of this study is to determine the level of importance of the variables used in the classification of wife-initiated divorce and husband-initiated divorce of married couples in the Central Jakarta area through the Random Forest method. Random Forest is a development of the CART (Classification and Regression Tree) method obtained through the application of bootstrap aggregating and random feature selection methods to the standard CART method. The number of wife-initiated divorce cases exceeds that of husband-initiated divorce cases, necessitating the use of SMOTE technique to address the imbalance in the data set. The results showed that the most important variable used to classify divorce cases was the plaintiff's age followed by the defendant's occupation, the defendant's age, and the plaintiff's occupation.

Downloads

Published

2024-12-31

How to Cite

Siregar, D., Bintang Mahesa Wardana, Ahmad Syauqi Baihaqy, Liswatun Naimah, Almira Nindya Putri, Qory Meidianingsih, & Dini Safitri. (2024). DETERMINATION OF IMPORTANT VARIABLES IN DIVORCE TYPE CLASSIFICATION USING THE RANDOM FOREST METHOD WITH SMOTE. Jurnal Statistika Dan Aplikasinya, 8(2), 229–244. https://doi.org/10.21009/JSA.08209