Search In this Thesis
   Search In this Thesis  
العنوان
Automatic act classification for Arabic dialogue context /
الناشر
Abdelrahim Abdelsabour Abdelhalim Mohamed Elmadany ,
المؤلف
Abdelrahim Abdelsabour Abdelhalim Mohamed Elmadany
هيئة الاعداد
باحث / Abdelrahim Abdelsabour Abdelhalim Mohamed Elmadany
مشرف / Mervat Gheith
مشرف / Sherif M. Abdou
مشرف / Mervat Gheith
تاريخ النشر
2016
عدد الصفحات
135 Leaves :
اللغة
الإنجليزية
الدرجة
الدكتوراه
التخصص
Computer Science (miscellaneous)
تاريخ الإجازة
20/6/2017
مكان الإجازة
جامعة القاهرة - المكتبة المركزية - Computer and Information Science
الفهرس
Only 14 pages are availabe for public view

from 152

from 152

Abstract

Recognition of dialogue acts is essential to a dialogue understanding system because dialogue acts have closely tied with the speaker{u2019}s intention. However, it has been difficult to infer a dialogue act from a surface utterance because they highly depend on the context of the utterance and his linguistic background. Building dialogues systems interaction has recently gained considerable attention, but most of the resources and systems built so far have tailored to English and other Indo-European languages. The need for designing systems for other languages is increasing such as the Arabic language. For these reasons, there is more interest for Arabic dialogue acts classification task because of it a key player in Arabic language understanding to building these systems . In this work, a novel language understanding component for Arabic spontaneous dialogues and Instant Messages at utterance level namely 2BASEER3 is presented, which designed to works on inquiry-answer dialogues. BASEER tested on Egyptian Arabic corpus, which collected manually from Egyptian call-centers. The BASEER component is composed of three parts: Pre-treatment module, segmentation turns into utterances model (i.e. USeg), and utterances labeling model (i.e. YOSR) . USeg is a machine learning approach based on context without relying on punctuation, text diacritization or lexical cues. Whereas, USeg depends on a set of features from the annotated data that include morphological features. The results obtained that USeg classifier achieved 0.907, and 0.933 without using cues and with using cues respectively, and it is very promising and to the best of my knowledge, these are the first results reported for turn segmentation into utterances task for Egyptian dialect