Improving Afaan Oromo Question Answering System: Definition, List and Description Question Types for Non-factoid Questions

Daba, Endale

Full metadata record

DC Field	Value	Language
dc.contributor.author	Daba, Endale	-
dc.date.accessioned	2021-09-24T07:34:09Z	-
dc.date.accessioned	2021-09-24T07:34:10Z	-
dc.date.available	2021-09-24T07:34:09Z	-
dc.date.available	2021-09-24T07:34:10Z	-
dc.date.issued	2021-07	-
dc.identifier.uri	.	-
dc.identifier.uri	http://hdl.handle.net/123456789/6240	-
dc.description.abstract	Question Answering (QA) can go beyond the retrieval of relevant documents, it is an option for efficient information access to such text data. The task of QA is to find the accurate and precise answer to a natural language question from a source text. The existing Afaan Oromo QA systems handle questions that usually take named entities as the answers. A different type of Afaan Oromo Question answer such as list, definition and description. The goal of this study is to propose approaches that tackle important problems in Afaan Oromo non-factoid QA, specifically in list, definition and description questions. The proposed QA system comprises of document preprocessing, question analysis, document analysis, and answer extraction components. Rule based techniques are used for the question classification. The approach in the document analysis component retrieves relevant documents and filters the retrieved documents using filtering patterns for list, definition and description questions a retrieved document is only retained if it contains all terms in the target in the same order as in the question. The answer extraction component works in type by type manner. The extracted sentences are scored and ranked, and then the answer selection algorithm selects top 5 non-redundant sentences from the candidate answer set. Finally the sentences are ordered to keep their coherence. The system is tested using evaluation metrics and used percentage ratio for evaluating question classification which classified 98.3% correctly. The document retrieval component is tested on two data sets that are analyzed by a stemmer and morphological analyzer. The F-score on the stemmed documents is 0.729 and on the other data it set is 0.764. Moreover, the average Fscore of the answer extraction component is 0.592.	en_US
dc.language.iso	en	en_US
dc.publisher	ST. MARY’S UNIVERSITY	en_US
dc.subject	Non-factoid Question-Answering, Afaan Oromo Question Answering System, Description Question types, Question Classification, Document Filtering, Sentence Extraction, Answer,Selection,RuleBased.	en_US
dc.title	Improving Afaan Oromo Question Answering System: Definition, List and Description Question Types for Non-factoid Questions	en_US
dc.type	Thesis	en_US
Appears in Collections:	Master of computer science Master of computer science