Abstract
This study proposes an optimized machine learning (ML) methodology and workflow for examining pancreatic cancer risk factors, using real-world data collected from three hospitals. The proposed processing and analysis pipeline incorporates data transformation, cleaning, and mapping techniques, such as translating specific values into a common language and calculating average blood test results per patient. The ML models used in this work are supervised learning techniques (Random Forest, LightGBM, XGBoost, SVM, and Gradient Boosting), applied to a range of risk factors such as demographic characteristics, drug use, surgeries, organ removal, blood values, and the patient's disease history. The models were evaluated and compared in terms of performance, considering important characteristics such as age, marital status, gender, and pre-existing diseases as risk factors for pancreatic cancer. The results indicate that ML models offer a robust and comprehensive solution for pancreatic cancer risk prediction across a broad range of variables and risk factors. These models enhance the understanding and identification of the key risk factors associated with the development and progression of this rare type of cancer and can act as powerful tools for healthcare professionals in the fight against pancreatic cancer.
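The two preprocessing steps named in the abstract, mapping hospital-specific values to a common vocabulary and averaging blood test results per patient, can be illustrated with a minimal sketch. This is not the authors' pipeline: the `GENDER_MAP` table, the function names, the record layout, and the toy values are all assumptions made for illustration.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical mapping from hospital-specific codes to one shared vocabulary.
GENDER_MAP = {"m": "male", "male": "male", "f": "female", "female": "female"}


def harmonize_gender(raw):
    """Map a hospital-specific gender code to the shared vocabulary."""
    return GENDER_MAP.get(raw.strip().lower(), "unknown")


def average_blood_tests(records):
    """Average every blood test per patient.

    `records` is an iterable of (patient_id, test_name, value) tuples.
    Returns {patient_id: {test_name: mean_value}}.
    """
    grouped = defaultdict(lambda: defaultdict(list))
    for patient_id, test_name, value in records:
        grouped[patient_id][test_name].append(value)
    return {
        pid: {test: mean(values) for test, values in tests.items()}
        for pid, tests in grouped.items()
    }


# Toy records, invented for illustration only (not data from the study).
records = [
    ("p1", "glucose", 90.0),
    ("p1", "glucose", 110.0),
    ("p1", "bilirubin", 1.0),
    ("p2", "glucose", 100.0),
]
features = average_blood_tests(records)
```

The resulting per-patient dictionary is one plausible shape for the feature table that a supervised model would later consume; the study itself may organize its data differently.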
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2023 International Conference on Applied Mathematics and Computer Science, ICAMCS 2023 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 42-49 |
| Number of pages | 8 |
| ISBN (Electronic) | 9798350324266 |
| DOIs | |
| Publication status | Published - 2023 |
| Event | 3rd International Conference on Applied Mathematics and Computer Science, ICAMCS 2023 - Lefkada Island, Greece. Duration: Aug 8 2023 → Aug 10 2023 |
Publication series
| Name | Proceedings - 2023 International Conference on Applied Mathematics and Computer Science, ICAMCS 2023 |
|---|---|
Conference
| Conference | 3rd International Conference on Applied Mathematics and Computer Science, ICAMCS 2023 |
|---|---|
| Country/Territory | Greece |
| City | Lefkada Island |
| Period | 8/8/23 → 8/10/23 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
- SDG 3: Good Health and Well-being
Keywords
- big data processing
- machine learning
- pancreatic cancer risk prediction
- risk factors analysis
ASJC Scopus subject areas
- Artificial Intelligence
- Information Systems and Management
- Applied Mathematics
- Computational Mathematics
- Modelling and Simulation
- Computer Science Applications
Fingerprint
Dive into the research topics of 'An Evaluation of Machine Learning Models coupled with Powerful Big Data Techniques in the Case of Pancreatic Cancer'. Together they form a unique fingerprint.