Author: Malak Hassan Mostafa Elbakry/ Title: A comparative study to enhance big data classification using fuzzy technique /

Search In this Thesis

العنوان

A comparative study to enhance big data classification using fuzzy technique /

الناشر

Malak Hassan Mostafa Elbakry ,

المؤلف

Malak Hassan Mostafa Elbakry

هيئة الاعداد

باحث / Malak Hassan Mostafa Elbakry

مشرف / Osman Hegazy

مشرف / Soha Safwat Labib

مناقش / Soha Safwat Labib

تاريخ النشر

2016

عدد الصفحات

85 Leaves :

اللغة

الإنجليزية

الدرجة

ماجستير

التخصص

Information Systems

تاريخ الإجازة

7/3/2017

مكان الإجازة

جامعة القاهرة - كلية الحاسبات و المعلومات - Information Systems

الفهرس

Only 14 pages are availabe for public view

from

Abstract

Due to the huge increase in size of the data it becomes troublesome to perform efficient analysis using the current traditional techniques. Big data put forward a lot of challenges due to its several characteristics like volume, velocity, variety, variability, value and complexity. Today, it is not only a necessity for efficient data mining techniques to process large volume of data but also a need for means to meet the computational requirements to process such huge volume of data. The objective of this research is to implement classification techniques using the map reduce framework using fuzzy and crisp methods. Further it provides a comparative study between the results of the proposed systems and the methods reviewed in the literature. In this thesis we implemented the fuzzy K-Nearest Neighbor method as a fuzzy technique. It will be compared to both the support vector machine and the k-nearest neighbor as non-fuzzy techniques using the map reduce paradigm to process on big data. We proposed an integrated system using the support vector machine with the fuzzy soft label and gaussian fuzzy membership. Results show that fuzzy k-nearest neighbor classifier gives higher accuracy but it takes a lot of time in classification compared to the other techniques. But results on different data sets show the proposed method that used fuzzy logic in the reducer function gives higher accuracy and lower time than the crisp proposed methods and the methods reviewed in the literature