Assessment of the environmental impact of hard coal mining waste disposal sites using machine learning algorithms, with an indication of important features influencing the selection of learning algorithms

Authors

  • Joanna Fajfer Polish Geological Institute – National Research Institute, Rakowiecka 4, 00-975 Warszawa, Poland https://orcid.org/0000-0002-7382-5383
  • Michał Rolka Polish Geological Institute – National Research Institute, Upper Silesian Branch, Królowej Jadwigi 1, 41-200 Sosnowiec, Poland

DOI:

https://doi.org/10.7306/gq.1838

Keywords:

machine learning algorithms, hard coal mining waste, environmental impact, important features, disposal sites

Abstract

This paper presents the potential application of supervised machine learning algorithms to assess the environmental impact of mine disposal sites. Algorithms available for Python from the scikit-learn and pandas libraries were applied to a group of sites representing mine waste dumps used for the disposal of hard coal mining waste. Each disposal site was described with 11 attributes (site characteristics, waste characteristics, groundwater, surface water, air, soil, atmospheric factors, geology, geohazards, nature, and human environment) and 73 features (categorical, numerical, and descriptive) detailing the sites’ environmental impact. As a result of applying the learning process to training data and verifying it on test data, prediction results of at least 80% were obtained for all algorithms tested. The results indicate that the best algorithm for determining the environmental impact of the waste dumps would be the BernouliNB algorithm (86% prediction accuracy), followed by the RidgeClassifier algorithm (87% prediction accuracy), with the currently available training dataset.  Potential extension of the dataset could improve the results of the MLPClassifier, Support Vector Machine, and LogisticRegression algorithms.

Downloads

Published

2026-01-09

Issue

Section

Articles