Show simple item record

dc.contributor.authorFourie, W.
dc.contributor.authorDu Toit, J.V.
dc.contributor.authorSnyman, D.P.
dc.date.accessioned2016-02-09T05:42:36Z
dc.date.available2016-02-09T05:42:36Z
dc.date.issued2014
dc.identifier.citationFourie, w. et al. 2014. Comparing support vector machine and multinomial naive Bayes for named entity classification of South African languages. Proceedings of the 2014 PRASA, RobMech and AfLaT international joint symposium. 27-28 Nov, Cape Town. [http://www.prasa.org/proceedings/2014/prasa2014-32.pdf]en_US
dc.identifier.isbn978-0-620-62617-0
dc.identifier.urihttp://hdl.handle.net/10394/16239
dc.identifier.urihttp://www.prasa.org/proceedings/2014/prasa2014-32.pdf
dc.description.abstractIn this study, two classical machine learning algorithms, multinomial naive Bayes and support vector machines, are compared when applied to named entity recognition for two South African languages, Afrikaans and English. The definition of a named entity was based on previous definitions and deliberations in literature as well as the intended purpose of classifying sensitive personal information in textual data. For the purpose of this study, the best algorithm should be able to deliver accurate results while requiring the least amount of time to train the classification model. A binary nominal class was selected for the classifiers and the standard implementation of the algorithms were utilised; no parameter optimisation was done. All the models achieved remarkable results in both ten-fold cross validation and independent evaluations with the support vector machine models outperforming the multinomial naive Bayes models. The multinomial naive Bayes models, however, required less time to train and would be more suited to low resource implementationsen_US
dc.language.isoenen_US
dc.publisherPRASAen_US
dc.subjectBinary classen_US
dc.subjectcross-domainen_US
dc.subjectnamed entity classificationen_US
dc.subjectmultilingualen_US
dc.subjectmultinomial naive Bayesen_US
dc.subjectsupport vector machinesen_US
dc.titleComparing support vector machine and multinomial naive Bayes for named entity classification of South African languagesen_US
dc.typePresentationen_US
dc.contributor.researchID10789901 - Du Toit, Jan Valentine
dc.contributor.researchID20570856 - Snyman, Dirk Petrus


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record