Machine Learning Approach to Predict AXL Kinase Inhibitor Activity for Cancer Drug Discovery Using XGBoost and Bayesian Optimization

Authors

  • Teuku Rizky Noviandy, Department of Informatics, Faculty of Mathematics and Natural Sciences, Universitas Syiah Kuala, Banda Aceh 23111, Indonesia
  • Ghalieb Mutig Idroes, Interdisciplinary Innovation Research Unit, Graha Primera Saintifika, Aceh Besar 23771, Indonesia
  • Irsan Hardi, Interdisciplinary Innovation Research Unit, Graha Primera Saintifika, Aceh Besar 23771, Indonesia

Keywords:

Supervised learning, QSAR, molecular descriptors, ChEMBL, hyperparameter tuning

Abstract

Cancer remains a major global health challenge, characterized by uncontrolled cell growth and the potential for metastasis. In recent years, machine learning has made notable contributions to cancer drug discovery. This study uses the XGBoost algorithm, tuned with Bayesian optimization, to classify AXL kinase inhibitor activity. A dataset of 1074 compounds and their IC50 values was obtained from the ChEMBL database, and molecular descriptors were calculated for each compound using the Mordred Python library. The XGBoost model optimized by Bayesian optimization achieved an accuracy of 86.24%, precision of 89.52%, recall of 89.52%, and an F1-score of 89.52%. A comparative analysis with other machine learning models further supported XGBoost's effectiveness. A Principal Component Analysis (PCA) plot was used to assess the model's applicability domain and indicated that it is broad, with predictions considered reliable for compounds that fall within its boundaries. In practice, the model can serve as a screening tool to prioritize compounds for synthesis and testing, potentially streamlining the drug development pipeline.
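
The abstract reports identical values for precision, recall, and F1-score. As a minimal sketch of how the four reported metrics relate, the Python snippet below computes them from confusion-matrix counts. The counts and the function name are hypothetical, chosen only for illustration, and are not taken from the study. It shows that precision and recall coincide whenever the numbers of false positives and false negatives are equal, in which case the F1-score, their harmonic mean, takes the same value.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, recall, and F1 from confusion-matrix counts."""
    total = tp + fp + fn + tn
    return {
        "accuracy": (tp + tn) / total,
        "precision": tp / (tp + fp),
        "recall": tp / (tp + fn),
        # Equivalent to the harmonic mean of precision and recall.
        "f1": 2 * tp / (2 * tp + fp + fn),
    }


# Hypothetical counts (NOT from the study): equal false positives and
# false negatives, so precision, recall, and F1 all take the same value.
metrics = classification_metrics(tp=45, fp=5, fn=5, tn=20)

for name, value in metrics.items():
    print(f"{name}: {value:.2%}")
```

Accuracy can still differ from the other three metrics, because it also depends on the number of true negatives.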

Published

21-06-2024

Section

Articles

How to Cite

Noviandy, T. R., Idroes, G. M., & Hardi, I. (2024). Machine Learning Approach to Predict AXL Kinase Inhibitor Activity for Cancer Drug Discovery Using XGBoost and Bayesian Optimization. Journal of Soft Computing and Data Mining, 5(1), 46-56. https://publisher.uthm.edu.my/ojs/index.php/jscdm/article/view/16427