Show simple item record

dc.contributor.authorRamalepe, Simon P
dc.contributor.authorModipa, Thipe I
dc.contributor.authorDavel, Marelie H
dc.date.accessioned2023-07-31T07:42:56Z
dc.date.available2023-07-31T07:42:56Z
dc.date.issued2022
dc.identifier.citationRamalepe, SM et.al.2022.The Development of a Sepedi Text Generation Model Using Transformersen_US
dc.identifier.urihttp://hdl.handle.net/10394/41890
dc.description.abstractText generation is one of the important sub-tasks of natural language generation (NLG), and aims to produce humanly readable text given some input text. Deep learning approaches based on neural networks have been proposed to solve text generation tasks. Although these models can generate text, they do not necessarily capture long-term dependencies accurately, making it difficult to coherently generate longer sentences. Transformer-based models have shown significant improvement in text generation. However, these models are computationally expensive and data hungry. In this study, we develop a Sepedi text generation model using a Transformer based approach and explore its performance. The developed model has one Transformer block with causal masking on the attention layers and two separate embedding layers. To train the model, we use the National Centre for Human Language Technology (NCHLT) Sepedi text corpus. Our experimental setup varied the model embedding size, batch size and the sequence length. The final model was able to reconstruct unseen test data with 75% accuracy: the highest accuracy achieved to date, using a Sepedi corpus.en_US
dc.description.sponsorshipSouthern Africa Telecommunication Networks and Applications Conference (SATNAC) 2022en_US
dc.language.isoenen_US
dc.publisherSouthern Africa Telecommunication Networks and Applications Conference (SATNAC) 2022en_US
dc.subjectTransformersen_US
dc.subjectGenerative pre-trained Trans formeren_US
dc.subjectNatural Language Generationen_US
dc.subjectText generationen_US
dc.titleThe Development of a Sepedi Text Generation Model Using Transformersen_US
dc.typeArticleen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record