DC Field | Value | Language |
dc.contributor.author | Daba, Endale | - |
dc.date.accessioned | 2021-09-24T07:34:09Z | - |
dc.date.accessioned | 2021-09-24T07:34:10Z | - |
dc.date.available | 2021-09-24T07:34:09Z | - |
dc.date.available | 2021-09-24T07:34:10Z | - |
dc.date.issued | 2021-07 | - |
dc.identifier.uri | . | - |
dc.identifier.uri | http://hdl.handle.net/123456789/6240 | - |
dc.description.abstract | Question Answering (QA) can go beyond the retrieval of relevant documents, it is an option
for efficient information access to such text data. The task of QA is to find the accurate and
precise answer to a natural language question from a source text. The existing Afaan Oromo
QA systems handle questions that usually take named entities as the answers.
A different type of Afaan Oromo Question answer such as list, definition and description. The
goal of this study is to propose approaches that tackle important problems in Afaan Oromo
non-factoid QA, specifically in list, definition and description questions. The proposed QA
system comprises of document preprocessing, question analysis, document analysis, and
answer extraction components.
Rule based techniques are used for the question classification. The approach in the document
analysis component retrieves relevant documents and filters the retrieved documents using
filtering patterns for list, definition and description questions a retrieved document is only
retained if it contains all terms in the target in the same order as in the question. The answer
extraction component works in type by type manner.
The extracted sentences are scored and ranked, and then the answer selection algorithm selects
top 5 non-redundant sentences from the candidate answer set. Finally the sentences are ordered
to keep their coherence.
The system is tested using evaluation metrics and used percentage ratio for evaluating question
classification which classified 98.3% correctly. The document retrieval component is tested on
two data sets that are analyzed by a stemmer and morphological analyzer. The F-score on the
stemmed documents is 0.729 and on the other data it set is 0.764. Moreover, the average Fscore
of the answer extraction component is 0.592. | en_US |
dc.language.iso | en | en_US |
dc.publisher | ST. MARY’S UNIVERSITY | en_US |
dc.subject | Non-factoid Question-Answering, Afaan Oromo Question Answering System, Description Question types, Question Classification, Document Filtering, Sentence Extraction, Answer,Selection,RuleBased. | en_US |
dc.title | Improving Afaan Oromo Question Answering System: Definition, List and Description Question Types for Non-factoid Questions | en_US |
dc.type | Thesis | en_US |
Appears in Collections: | Master of computer science Master of computer science
|