dc.contributor.advisor: Park, Deokgun
dc.creator: Thapa, Sanjay
dc.date.accessioned: 2020-01-10T22:56:52Z
dc.date.available: 2020-01-10T22:56:52Z
dc.date.created: 2019-12
dc.date.issued: 2019-12-06
dc.date.submitted: December 2019
dc.identifier.uri: http://hdl.handle.net/10106/28870
dc.description.abstract: Advances in Natural Language Processing and Machine Learning have played a significant role in the considerable improvement of conversational Artificial Intelligence (AI). The use of text-based conversational AI such as chatbots has increased significantly for everyday communication with real people across a variety of tasks. Chatbots are deployed on almost all popular messaging platforms and channels. The rise of chatbot development frameworks based on machine learning makes it possible to deploy chatbots easily and promptly. These frameworks use machine learning and natural language understanding (NLU) to understand users' messages and intents and to respond appropriately to users' utterances. Since most chatbots are developed for domain-specific purposes, the performance of a chatbot is directly related to its training data. To increase the domain knowledge and knowledge base of a chatbot via training data, the chatbot needs to know similar words or phrases for a user's message. Furthermore, there is no guarantee that a user will spell a word correctly; in written conversation, users often misspell at least some words. Thus, to include semantically similar words and misspellings in the training data, I have used word embedding to generate misspellings and similar words. These generated similar words and misspellings will be used as training data to train the model for chatbot development.
dc.format.mimetype: application/pdf
dc.language.iso: en_US
dc.subject: Chatbots
dc.subject: Conversational artificial intelligence
dc.subject: Machine learning
dc.subject: Rasa
dc.subject: Misspellings
dc.subject: Word embedding
dc.subject: Similar words
dc.title: USE OF WORD EMBEDDING TO GENERATE SIMILAR WORDS AND MISSPELLINGS FOR TRAINING PURPOSE IN CHATBOT DEVELOPMENT
dc.type: Thesis
dc.degree.department: Computer Science and Engineering
dc.degree.name: Master of Science in Computer Science
dc.date.updated: 2020-01-10T22:59:02Z
thesis.degree.department: Computer Science and Engineering
thesis.degree.grantor: The University of Texas at Arlington
thesis.degree.level: Masters
thesis.degree.name: Master of Science in Computer Science
dc.type.material: text
dc.creator.orcid: 0000-0002-2031-1186

