An Evaluation of Machine Learning Models coupled with Powerful Big Data Techniques in the Case of Pancreatic Cancer

Eleftheria Kouremenou, George Manias, Shabbir Syed-Abdul, Dimosthenis Kyriazis

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This study proposes an optimized machine learning (ML) methodology and workflow to examine pancreatic cancer factors, taking advantage of real-world data collected from three different hospitals. The overall proposed processing and analysis pipeline incorporates data transformation, cleaning, and mapping techniques such as translating specific values into a common language and calculating average blood result tests per patient. The ML models utilized under the scope of this research work are supervised learning techniques, such as Random Forest, LightGBM, XGBoost, SVM, and Gradient Boosting, by also considering and analyzing various risk factors such as demographic characteristics, drug use, surgeries, organ removal, blood values, and disease history of the patient. The models were evaluated and compared in terms of performance, considering important characteristics such as age, marriage, gender, and pre- existing diseases as risk factors for pancreatic cancer. The results indicate that the utilization of ML models offers a robust and comprehensive solution for pancreatic cancer risk prediction, considering a broad range of variables and risk factors. These models enhance the understanding and identification of the key risk factors associated with the development and progression of this rare type of cancer and can act as powerful tools in the hands of healthcare professionals in the fight against pancreatic cancer.

Original languageEnglish
Title of host publicationProceedings - 2023 International Conference on Applied Mathematics and Computer Science, ICAMCS 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages42-49
Number of pages8
ISBN (Electronic)9798350324266
DOIs
Publication statusPublished - 2023
Event3rd International Conference on Applied Mathematics and Computer Science, ICAMCS 2023 - Lefkada Island, Greece
Duration: Aug 8 2023Aug 10 2023

Publication series

NameProceedings - 2023 International Conference on Applied Mathematics and Computer Science, ICAMCS 2023

Conference

Conference3rd International Conference on Applied Mathematics and Computer Science, ICAMCS 2023
Country/TerritoryGreece
CityLefkada Island
Period8/8/238/10/23

Keywords

  • big data processing
  • machine learning
  • pancreatic cancer risk prediction
  • riskfactors analysis

ASJC Scopus subject areas

  • Artificial Intelligence
  • Information Systems and Management
  • Applied Mathematics
  • Computational Mathematics
  • Modelling and Simulation
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'An Evaluation of Machine Learning Models coupled with Powerful Big Data Techniques in the Case of Pancreatic Cancer'. Together they form a unique fingerprint.

Cite this