Author: Mshaly, Mohamed Elsayed Abdellah Ahmed./ Title: Supervised Machine Learning for Classification of Odors of Biological Samples from Different Cancers using Python Framework/

Search In this Thesis

العنوان

Supervised Machine Learning for Classification of Odors of Biological Samples from Different Cancers using Python Framework/

المؤلف

Mshaly, Mohamed Elsayed Abdellah Ahmed.

هيئة الاعداد

باحث / محمد السيد عبداللاه أحمد مشالى

مشرف / إيهاب إبراهيم عبده محمد

مشرف / رضوى عبد الرحمن مشر ف

مناقش / سامي حسن درويش

مناقش / ممدوح محمد شوقى

الموضوع

Biophysics. Medical Biophysics.

تاريخ النشر

2021.

عدد الصفحات

95 p. :

اللغة

الإنجليزية

الدرجة

ماجستير

التخصص

Biophysics

تاريخ الإجازة

26/8/2021

مكان الإجازة

جامعة الاسكندريه - معهد البحوث الطبية - الفيزياء الحيوية الطبية

الفهرس

Only 14 pages are availabe for public view

from

Abstract

Cancer is the second leading cause of death worldwide. Cancer can be detected early, which lowers mortality rates and increases the likelihood of a successful therapy. The goal of this research is to provide a method for early cancer detection using machine learning. This is done by using machine learning algorithms for predicting various forms of cancer from scents in biological data. Based on electronic nose measurements, odors of biological substances.
The study comprised Egyptian men and women who were referred to the Medical Research Institute’s Hospital for diagnosis and/or treatment. All the subjects had a comprehensive medical examination, as well as demographic information and electronic nose measurements. The E-Nose employed in this investigation was a movable PEN3 (Airsense Analytics GmbH, Schwerin, Germany) with ten nonspecific metal-oxide sensors. All experiment measurements and sensor array patterns were stored in files for subsequent investigation. The data was divided into five categories depending on their health state. Controlled health participants were among the participants who showed no indications or pathologies: brain cancer, breast cancer, lung cancer, and CLL. The information came from three biological samples: blood, urine, and a biopsy.
Data is preprocessed before being split into training and testing datasets for machine learning classification algorithm training and evaluation. Support vector machine with optimized kernel and C parameter was the machine learning algorithm used in this investigation. For these data, the Optimal Training Parameter Combination (OTPC) of the SVM model was C = 200 and linear kernel.
For the blood dataset, the proposed classifier had precision, recall, F1 scores, and specificity of 98.6, 99.0, 98.6, and 98.0%, respectively. While the precision, recall, F1 scores, and specificity of the urine dataset were 99.5, 99.3, 99.3, and 99.8%, respectively. The precision, recall, F1 scores, and specificity of the biopsy dataset were 94.3, 96.0, 94.6, and 98.3%, respectively. Using the urine dataset, the proposed classifier had an accuracy of up to 99.0%.