Show simple item record

dc.contributor.authorRamalepe, Simon Phetole
dc.contributor.authorModipa, Thipe I.
dc.contributor.authorMarelie, H. Davel
dc.date.accessioned2025-05-05T08:43:54Z
dc.date.available2025-05-05T08:43:54Z
dc.date.issued2025-01-25
dc.identifier.citationRamalepe, S.P. et al. Pre-training a Transformer-Based Generative Model Using a Small Sepedi Dataset. arXiv:2501.15281v1 [cs.CL] 25 Jan 2025en_US
dc.identifier.urihttp://hdl.handle.net/10394/42870
dc.description.abstractDue to the scarcity of data in low-resourced languages, the development of language models for these languages has been very slow. Currently, pre-trained language models have gained popularity in natural language processing, especially, in developing domain-specific models for low-resourced languages. In this study, we experiment with the impact of using occlusion-based techniques when training a language model for a text generation task. We curate 2 new datasets, the Sepedi monolingual (SepMono) dataset from several South African resources and the Sepedi radio news (SepNews) dataset from the radio news domain. We use the SepMono dataset to pre-train transformer-based models using the occlusion and non-occlusion pre-training techniques and compare performance. The SepNews dataset is specifically used for fine-tuning. Our results show that the non-occlusion models perform better compared to the occlusion-based models when measuring validation loss and perplexity. However, analysis of the generated text using the BLEU score metric, which measures the quality of the generated text, shows a slightly higher BLEU score for the occlusion-based models compared to the nonocclusion models.en_US
dc.language.isoenen_US
dc.subjectTransformersen_US
dc.subjectText generationen_US
dc.subjectPre-trainingen_US
dc.subjectOcclusionbased trainingen_US
dc.subjectDatasetsen_US
dc.titlePre-training a Transformer-Based Generative Model Using a Small Sepedi Dataseten_US
dc.typeArticleen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record