Please use this identifier to cite or link to this item:
http://dspace.univ-guelma.dz/jspui/handle/123456789/15019
Title: | Dynamic QA System for Nosql Databases |
Authors: | Mihoubi, Zakaria |
Keywords: | Question answering systems, NoSQL databases, T5, dataset construction, Data augmentation techniques, Fine-tuning. |
Issue Date: | 2023 |
Publisher: | University of Guelma |
Abstract: | Question-answering (QA) systems are crucial for retrieving information from large databases. While traditional systems focus on structured relational databases, the rise of NoSQL databases with a variety of query languages like MongoDB requires adapting these systems. In this study, we propose a NoT5QL (NoSQL T5) model for dynamic question answering of MongoDB based on fine-tuning of T5, a pre-trained transformer model for natural language processing (NLP) tasks. The developed dy- namic question-answering system is specifically tailored for generating queries for MongoDB from natural language questions. To accomplish this task, a dataset called "MongoQpedia" (Mongodb Query pedia) was created using diverse questions from the Movies domain and annotated with MongoDB document-derived answers. The construction of MongoQpedia is based on data augmentation via paraphrasing, back translation, and named entity replacement techniques. The evaluation of the NoT5QL model was performed through various metrics such as BLEU and ROUGE, comparing fine-tuned T5 small and base models. This provided insights into the impact of model size and complexity on MongoDB question- answering capabilities. The results of these experiments demonstrate the effective- ness of our approach in achieving high accuracy in answering questions related to MongoDB databases. |
URI: | http://dspace.univ-guelma.dz/jspui/handle/123456789/15019 |
Appears in Collections: | Master |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
MIHOUBI_ZAKARYA_F5.pdf | 2,5 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.