Abstract
The widespread impact of thyroid disease and its diagnosis is a challenging task for healthcare experts. The conventional technique for predicting such a vital disease is complex and time-consuming. A data-driven approach may offer predictive solutions, but it relies on all relevant attributes, which are computationally expensive. Hence, we propose a novel machine learning (ML) based disease prediction system that could potentially predict it by considering three crucial steps. First, to reduce the dimension of the dataset, three feature selection techniques were employed, including feature importance (FIS), information gain selections (IGS), and least absolute shrinkage and selection operator (LAS). Moreover, recommended medical references were considered while developing a feature set having the identical attributes as high-risk factors (HRF). Second, the models, including the three stage hybrid classifier (3SHC) and the three stage hybrid artificial neural network (3SHANN), are used as classifiers on the training data set. Third, a local interpretable model-agnostic explanations (LIME) to the 3SHC with the HRF samples was applied to individually explain the predictions. Then, the overall behaviors of both gender and age categories were explored with the help of a partial dependence plot (PDP). Finally, the proposed system is validated with extensive experiments, where the 3SHC achieves an accuracy (ACC) of 99.29%, which can play a crucial role in preventing thyroid disease and alleviating stress in the healthcare sector.
View more >>