dc.contributor.advisor | Coetzer, R.L.J. | |
dc.contributor.advisor | Cockeran, M. | |
dc.contributor.advisor | Nombebe, T. | |
dc.contributor.author | Liebenberg, Jennifer Leigh | |
dc.date.accessioned | 2025-05-12T12:24:05Z | |
dc.date.available | 2025-05-12T12:24:05Z | |
dc.date.issued | 2024 | |
dc.identifier.uri | https://orcid.org/0000-0002-3811-4957 | |
dc.identifier.uri | http://hdl.handle.net/10394/42907 | |
dc.description | Master of Science in Mathematical Statistics, North-West University, Potchefstroom Campus | en_US |
dc.description.abstract | Compositional data are found in many types of data. Some examples of compositional data are particle size distributions, alloy composition and chemical milling bath composition, colour compositions of paintings, chemical compositions of basalt specimens, as well as household expenditure. In industry, process monitoring of compositional data are of interest for feed and product compositions, and to detect and diagnose potential deviations from expected performance. However, compositional data are subject to certain properties and constraints that complicate the analysis thereof, such as a large number of variables and a unit-sum constraint. In this study, the multivariate statistical analysis of compositional data is reviewed and discussed for purposes of data interpretation, and for application to multivariate statistical process monitoring. Log-transformations are applied to the compositional data, followed by a reduction in the number of variables using principal components analysis (PCA). PCA biplots are used as a visual inspection of the data, providing a way to estimate certain properties of the compositional data through geometric features of the biplots. Specifically, it is shown how correlations and relationships between the components are quantified from the biplot properties. The log-transformed and PCA-reduced data are used to perform multivariate statistical process monitoring. The T2, SPE and combined statistics are used to illustrate multivariate process monitoring of compositional data. In addition, variable contributions are calculated based on the various monitoring statistics for faulty data. Using simulated compositional data, and the well-known Tennessee Eastman Process, it is illustrated that faults
can be accurately detected, together with the correct variable contributions, for compositional data. | en_US |
dc.language.iso | en | en_US |
dc.publisher | North-West University (South Africa) | en_US |
dc.subject | Compositional data | en_US |
dc.subject | Multivariate data | en_US |
dc.subject | Principal components analysis | en_US |
dc.subject | Log-transformation | en_US |
dc.subject | Multivariate statistical process monitoring | en_US |
dc.subject | Compositional biplots | en_US |
dc.title | Multivariate statistical process monitoring and diagnostic analysis of compositional data | en_US |
dc.type | Thesis | en_US |
dc.description.thesistype | Masters | en_US |
dc.contributor.researchID | | |
dc.contributor.researchID | | |