Multivariate statistical process monitoring and diagnostic analysis of compositional data

Liebenberg, Jennifer Leigh

dc.contributor.advisor	Coetzer, R.L.J.
dc.contributor.advisor	Cockeran, M.
dc.contributor.advisor	Nombebe, T.
dc.contributor.author	Liebenberg, Jennifer Leigh
dc.date.accessioned	2025-05-12T12:24:05Z
dc.date.available	2025-05-12T12:24:05Z
dc.date.issued	2024
dc.identifier.uri	https://orcid.org/0000-0002-3811-4957
dc.identifier.uri	http://hdl.handle.net/10394/42907
dc.description	Master of Science in Mathematical Statistics, North-West University, Potchefstroom Campus	en_US
dc.description.abstract	Compositional data arise in many fields. Examples include particle size distributions, alloy compositions, chemical milling bath compositions, colour compositions of paintings, chemical compositions of basalt specimens, and household expenditure. In industry, process monitoring of compositional data is of interest for feed and product compositions, and for detecting and diagnosing potential deviations from expected performance. However, compositional data are subject to certain properties and constraints that complicate their analysis, such as a large number of variables and a unit-sum constraint. In this study, the multivariate statistical analysis of compositional data is reviewed and discussed for purposes of data interpretation and for application to multivariate statistical process monitoring. Log-transformations are applied to the compositional data, followed by a reduction in the number of variables using principal components analysis (PCA). PCA biplots are used for visual inspection of the data, providing a way to estimate certain properties of the compositional data through geometric features of the biplots. Specifically, it is shown how correlations and relationships between the components are quantified from the biplot properties. The log-transformed and PCA-reduced data are then used to perform multivariate statistical process monitoring. The T², SPE and combined statistics are used to illustrate multivariate process monitoring of compositional data. In addition, variable contributions are calculated from the various monitoring statistics for faulty data. Using simulated compositional data and the well-known Tennessee Eastman Process, it is illustrated that faults can be accurately detected, together with the correct variable contributions, for compositional data.	en_US
dc.language.iso	en	en_US
dc.publisher	North-West University (South Africa)	en_US
dc.subject	Compositional data	en_US
dc.subject	Multivariate data	en_US
dc.subject	Principal components analysis	en_US
dc.subject	Log-transformation	en_US
dc.subject	Multivariate statistical process monitoring	en_US
dc.subject	Compositional biplots	en_US
dc.title	Multivariate statistical process monitoring and diagnostic analysis of compositional data	en_US
dc.type	Thesis	en_US
dc.description.thesistype	Masters	en_US
dc.contributor.researchID
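
The workflow summarised in the abstract (log-transformation, PCA reduction, then T² and SPE monitoring with variable contributions) can be illustrated with a minimal sketch. This is not the thesis's implementation. It assumes the centred log-ratio (clr) transform as the log-transformation, an empirical 99th-percentile control limit on SPE, and a small simulated data set; the function names `clr`, `fit_pca`, `monitor` and `simulate` are hypothetical and chosen only for this example.

```python
import numpy as np

def clr(X):
    """Centred log-ratio transform (assumed choice of log-transformation)."""
    logX = np.log(X)
    return logX - logX.mean(axis=1, keepdims=True)

def fit_pca(Z, n_components):
    """Fit PCA by SVD; return the mean, loadings and score variances."""
    mean = Z.mean(axis=0)
    _, s, Vt = np.linalg.svd(Z - mean, full_matrices=False)
    P = Vt[:n_components].T                          # loadings, shape (D, k)
    var = s[:n_components] ** 2 / (Z.shape[0] - 1)   # variance of each score
    return mean, P, var

def monitor(Z, mean, P, var):
    """Return T^2, SPE and per-variable SPE contributions for each row."""
    Zc = Z - mean
    T = Zc @ P                             # scores in the retained subspace
    t2 = np.sum(T ** 2 / var, axis=1)      # Hotelling-type T^2 statistic
    resid = Zc - T @ P.T                   # part not explained by the model
    contrib = resid ** 2                   # squared residual per variable
    return t2, contrib.sum(axis=1), contrib

# Simulated in-control data (an assumption for illustration only):
# two latent factors drive components 0-3; component 4 carries only noise.
rng = np.random.default_rng(0)
W = np.array([[1.0, -1.0, 0.0, 0.0, 0.0],
              [0.0, 0.0, 1.0, -1.0, 0.0]])

def simulate(n):
    logs = rng.normal(size=(n, 2)) @ W + 0.05 * rng.normal(size=(n, 5))
    raw = np.exp(logs)
    return raw / raw.sum(axis=1, keepdims=True)      # closure to unit sum

X_train = simulate(500)
mean, P, var = fit_pca(clr(X_train), n_components=2)
t2_train, spe_train, _ = monitor(clr(X_train), mean, P, var)
spe_limit = np.quantile(spe_train, 0.99)   # empirical limit, not a theoretical one

# Inject a fault: inflate component 4 and re-close the composition.
x_fault = simulate(1)
x_fault[0, 4] *= 5.0
x_fault /= x_fault.sum(axis=1, keepdims=True)
t2_f, spe_f, contrib_f = monitor(clr(x_fault), mean, P, var)

print("SPE above limit:", bool(spe_f[0] > spe_limit))
print("largest SPE contribution at component:", int(np.argmax(contrib_f[0])))
```

In this sketch the fault is built to lie outside the two retained principal components, so it shows up in SPE rather than T². A theoretical control limit or a combined statistic, as mentioned in the abstract, would replace the empirical quantile used here.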

Files in this item

Name: Liebenberg_JL_2024.pdf
Size: 3.234Mb
Format: PDF
Description: Thesis (Masters)

This item appears in the following Collection(s)

Natural and Agricultural Sciences [2777]
