Influence of Class Imbalance on the Quality of Hydrocracking Unit Failure Prediction Models
Table 3: Evaluation metrics of the balanced models
| Balancing approach (metric) | Decision tree | Random forest | Logistic regression | Gaussian Bayesian |
| Balanced (F-score) | 0.569 | 0.601 | 0.513 | - |
| Downsampled (F-score) | 0.575 | 0.605 | 0.475 | 0.501 |
| Upsampled (F-score) | 0.569 | 0.635 | 0.478 | 0.507 |
| Upsampled (AUC-ROC) | 0.824 | 0.852 | 0.729 | 0.755 |
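
The table reports F-score and AUC-ROC after downsampling and upsampling, but this section does not show how those quantities are obtained. The following is a minimal sketch in plain Python, not the authors' implementation. The function names `resample`, `f_score`, and `auc_roc` are hypothetical, the F-score is assumed to be F1 (the table does not state a beta), and the resampling is assumed to be purely random with both classes present.

```python
import random


def resample(rows, labels, mode, seed=0):
    """Balance a binary dataset by random resampling (illustrative only).

    mode="up":   repeat minority rows, drawn with replacement, until both
                 classes have the same number of rows.
    mode="down": keep a random subset of the majority rows whose size
                 equals the minority class.
    Assumes labels are 0/1 and that both classes occur at least once.
    """
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = sorted((pos, neg), key=len)
    if mode == "up":
        extra = rng.choices(minority, k=len(majority) - len(minority))
        minority = minority + extra
    elif mode == "down":
        majority = rng.sample(majority, len(minority))
    else:
        raise ValueError("mode must be 'up' or 'down'")
    idx = minority + majority
    rng.shuffle(idx)
    return [rows[i] for i in idx], [labels[i] for i in idx]


def f_score(y_true, y_pred):
    """F1: harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def auc_roc(y_true, scores):
    """AUC-ROC as the share of (positive, negative) pairs in which the
    positive example gets the higher score; ties count as one half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

In a setup like this, resampling would normally be applied to the training split only, with both metrics computed on an untouched test split so that the scores reflect the original class distribution.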
