Machine learning and deep learning techniques for natural language processing with application to audio recordings
dc.contributor.advisor | Montshiwa, T.V. | |
dc.contributor.author | Motitswane, Olorato Glendah | |
dc.contributor.researchID | 22297812 - Montshiwa, Volition Tlhalitshi (Supervisor) | |
dc.date.accessioned | 2023-11-23T07:33:24Z | |
dc.date.available | 2023-11-23T07:33:24Z | |
dc.date.issued | 2023 | |
dc.description | MCur (Statistics), North-West University, Mahikeng Campus | en_US |
dc.description.abstract | Many debt collection companies need to rely on research focusing on data analysis methods that can assist them to analyse their unstructured data which holds information that could help them to better assign their collection agents to high repayment probable accounts. These types of accounts are characterised by the debtor’s ability to repay which comprise their employment status among many other driving factors. Unfortunately, analysing unstructured data is extremely challenging as it comes in natural forms such as audio recordings, videos and images, to mention a few. The aim of this study was to seek for data analysis methods that can accurately predict the employment status of the debtor using audio call recordings. Transcription of the recordings to text was done using Automatic Speech Recognition (ASR), followed by data cleaning and the transcribed text was represented in numerical form using the Term Frequency-Inverse Document Frequency (TF- IDF) and the Count Vectorizer. The study then compared the accuracy of Artificial Neural Network (ANN) and Naïve Bayes classifiers in predicting the employment status of the debtor. To evaluate the performance of the ASR transcription method, word error rate (WER) was used, for text and to compare ANN and Naïve Bayes, the accuracy, recall and F1-Score were used. An overall WER of 106.93 was archived by the speech recognition ASR method. ANN with TF-IDF was identified as the best model for predicting employment status from transcribed audio recordings. | en_US |
dc.description.thesistype | Masters | en_US |
dc.identifier.uri | https://orcid.org/0000.0003.3905.1633 | |
dc.identifier.uri | http://hdl.handle.net/10394/42346 | |
dc.language.iso | en | en_US |
dc.publisher | North-West University (South Africa) | en_US |
dc.subject | Natural Language Processing | en_US |
dc.subject | Automatic Speech Recognition | en_US |
dc.subject | Term Frequency-Inverse Document Frequency Vectorizer | en_US |
dc.subject | Count Vectorizer | en_US |
dc.subject | Data Augmentation | en_US |
dc.subject | Naïve Bayes | en_US |
dc.subject | Artificial Neural Network | en_US |
dc.title | Machine learning and deep learning techniques for natural language processing with application to audio recordings | en_US |
dc.type | Thesis | en_US |