Machine learning and deep learning techniques for natural language processing with application to audio recordings

Motitswane, Olorato Glendah

Machine learning and deep learning techniques for natural language processing with application to audio recordings

dc.contributor.advisor	Montshiwa, T.V.
dc.contributor.author	Motitswane, Olorato Glendah
dc.contributor.researchID	22297812 - Montshiwa, Volition Tlhalitshi (Supervisor)
dc.date.accessioned	2023-11-23T07:33:24Z
dc.date.available	2023-11-23T07:33:24Z
dc.date.issued	2023
dc.description	MCur (Statistics), North-West University, Mahikeng Campus	en_US
dc.description.abstract	Many debt collection companies need to rely on research focusing on data analysis methods that can assist them to analyse their unstructured data which holds information that could help them to better assign their collection agents to high repayment probable accounts. These types of accounts are characterised by the debtor’s ability to repay which comprise their employment status among many other driving factors. Unfortunately, analysing unstructured data is extremely challenging as it comes in natural forms such as audio recordings, videos and images, to mention a few. The aim of this study was to seek for data analysis methods that can accurately predict the employment status of the debtor using audio call recordings. Transcription of the recordings to text was done using Automatic Speech Recognition (ASR), followed by data cleaning and the transcribed text was represented in numerical form using the Term Frequency-Inverse Document Frequency (TF- IDF) and the Count Vectorizer. The study then compared the accuracy of Artificial Neural Network (ANN) and Naïve Bayes classifiers in predicting the employment status of the debtor. To evaluate the performance of the ASR transcription method, word error rate (WER) was used, for text and to compare ANN and Naïve Bayes, the accuracy, recall and F1-Score were used. An overall WER of 106.93 was archived by the speech recognition ASR method. ANN with TF-IDF was identified as the best model for predicting employment status from transcribed audio recordings.	en_US
dc.description.thesistype	Masters	en_US
dc.identifier.uri	https://orcid.org/0000.0003.3905.1633
dc.identifier.uri	http://hdl.handle.net/10394/42346
dc.language.iso	en	en_US
dc.publisher	North-West University (South Africa)	en_US
dc.subject	Natural Language Processing	en_US
dc.subject	Automatic Speech Recognition	en_US
dc.subject	Term Frequency-Inverse Document Frequency Vectorizer	en_US
dc.subject	Count Vectorizer	en_US
dc.subject	Data Augmentation	en_US
dc.subject	Naïve Bayes	en_US
dc.subject	Artificial Neural Network	en_US
dc.title	Machine learning and deep learning techniques for natural language processing with application to audio recordings	en_US
dc.type	Thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Motitswane_OG.pdf
Size:: 1.82 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.61 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Economic and Management Sciences