Proceedings of the 31st International Congress and Exhibition on Condition Monitoring and Diagnostic Engineering Management 2-5 July 2018 Sun City, Rustenburg, South Africa 31st International Congress and Exhibition on Condition Monitoring and Diagnostic Engineering Management Proceedings Editors P.S. Heyns, P.A. van Vuuren, G. van Schoor, R.B.K.N. Rao Published by: North-West University, South Africa 2018 ISBN: 978-1-86822-691-7 i Organizing committee Prof Raj B.K.N. Rao Comadem International United Kingdom Prof George van Schoor North-West University South Africa Prof Stephan P. Heyns University of Pretoria South Africa Prof Pieter A. van Vuuren North-West University South Africa Prof Alwyn J. Hoffman North-West University South Africa Prof Kenny R. Uren North-West University South Africa Dr. Henri Marais North-West University South Africa Technical programme committee Prof Stephan P. Heyns University of Pretoria South Africa Prof Pieter A. van Vuuren North-West University South Africa Dr. Fengshou Gu University of Huddersfield United Kingdom Prof David Baglee University of Sunderland United Kingdom Prof Viliam Makis University of Toronto Canada Dr. Nicholas West University of the Witwatersrand South Africa Prof Jerry Walker Vaal University of Technology South Africa Mr. Christo van der Walt Engineering Dynamics South Africa Dr. Philip Loveday Council for Scientific and Industrial Research (CSIR) South Africa Additional reviewers Prof Mehdi Behzad Sharif University of Technology Iran Prof Cuthbert Nyamupangedengu University of the Witwatersrand South Africa Dr. Guojin Feng Brunel University United Kingdom Dr. Xiange Tian University of Portsmouth United Kingdom Dr. Dong Zhen Hebei University of Technology China Dr. Hao Zhang Hebei University of Technology China Dr. Benji Gwashavanhu University of Pretoria South Africa Dr. Erfan Asaadi University of Massachussets, Dartmouth United States of America Dr. Sylvester Aye University of Pretoria South Africa Dr. Anriëtte Bekker Stellenbosch University South Africa Dr. Stephan Schmidt University of Pretoria South Africa Dr. Logan Reddy Eskom South Africa Mr. Henry Omoregbee University of Pretoria South Africa Mr. Dawie Diamond University of Pretoria South Africa Mr. Amin Jami University of Pretoria South Africa ii Sponsored by: Supported by: iii Table of contents 1. Foreword viii 2. Keynote lecture Driving the digital enterprise Leinen 1 3. Presented papers 3.1. Mechanical systems 3.1.1. Modelling and Simulation of a Two Stage Reciprocating Compressor for Condition Monitoring Based on Motor Current Signature Analysis Haba, Brethee, Alabied, Mondal, Gu, Ball 2 3.1.2. Experimental Study on Vibration Reduction Characteristics of a Helical Gear Coupling System Based on ISFD Zhang, He, Lu 12 3.1.3. Modelling the Vibration Response of a Journal Bearing for Condition Monitoring Hassin, Ma, Gu, Ball 20 3.1.4. A Study of the Influence of Time-varying Meshing Stiffness on Dynamic Response in Gear Transmission Systems Liu, Zhang, Wang, Zhen, Zhang, Shi 29 3.1.5. An Outer Ring Fault Quantitative Diagnosis Method of Ball Bearing Based on the Detail Impact Characteristic Cui, Huang, Wang, Wang 38 3.2. Electrical Systems 3.2.1. Ageing Models for Lightly Loaded Distribution Power Transformer Mukuddem, Jooste 44 3.2.2. Measurement and Visualization of the Corona Inception Gradient on an HVDC Transmission Line under the Effect of Wet Conditions Djeumen, Walker, West 53 3.2.3. A New Fuzzy Logic-based Approach to Predict Fault in Transformer Oil Based on Health Index using Dissolved Gas Analysis Mulyodinoto, Suwarno, Abu-Siada 61 3.2.4. Condition monitoring thermal properties of a 20A hydraulic-magnetic MCB Kleynhans, Van Vuuren, Thomas 69 iv 3.2.5. A Comparative Study of Partial Discharge Measurement using Electrical and Inductive Coupling Sensors Kyere, Becker, Walker 79 3.2.6. Review of Transmission Line Fault Location using Travelling Wave Method Wijaya, Suwarno, Abu-Siada 86 3.2.7. Off Line Partial Discharge as an essential Condition Based Monitoring (CBM) Tool for busbar insulation monitoring on Air Insulated Switchgear (AIS) Bisset, Van Vuuren 95 3.3. Integrated Maintenance Management 3.3.1. A Condition Based Reliability Simulator Framework based on a Heuristic Fault Model Swanepoel, Wichers 107 3.3.2. Cloud based Real-time Condition Monitoring Model for Effective Maintenance of Machines Dhandapani, Veilumuthu 118 3.3.3. Arc Tangent Failure Rate Distribution Method Hu, Yang, Zhang, Qiu, Liu, Li 125 3.4. Renewable energy systems and energy storage systems 3.4.1. Deep online analysis of dielectric parameters for lubricants and insulation oils with an innovative oil sensor system: Identification of critical operation conditions of industrial gearboxes and high voltage transformers for reduction of failure rates and live time enhancement Mauntz, Peuser 132 3.4.2. Real-Time Monitoring of Temperature Distribution in a Lithium-Ion Battery Pack Aucamp, Janse van Rensburg 140 3.5. Asset Management 3.5.1. A perspective on rotating equipment technology trends and maintenance in mining Amadi-Echendu, Matheta 148 3.5.2. A Policy Framework for Integrating Smart Asset Management within Operating Theatres in a Private Healthcare Group to Mitigate Critical System Failure Hirschowitz, Jooste 155 3.5.3. Identifying current challenges of data-based maintenance management: a case study Marttonen-Arola, Baglee 165 v 3.5.4. Investigations on augmented reality based maintenance practices within SMEs Müller, Stegelmeyer, Mishra 173 3.6. Signal processing and pattern recognition 3.6.1. Discrepancy analysis for gearbox condition monitoring: A comparison of different healthy data models Schmidt, Heyns, Gryllias 181 3.6.2. A Generalized Synchroextracting Transform for Fast and Strong Frequency Modulated Signal Analysis Chen, Wang, Zuo 189 3.6.3. A Deep Statistical Feature Learning Method Based on Stacked Auto-Encoder for Intelligent Diagnosis of Rolling Bearing Han, Long, Liu, Jiang 197 3.6.4. Research and Application of Weak Fault Diagnosis Method Based on Asymmetric Potential Stochastic Resonance in Strong Noise Background Li, Wang, Han, Kang, Li, Shi, Liu 205 3.6.5. Fault diagnosis method of rolling element bearing based on relative wavelet packet energy and Hilbert envelope analysis Guo, Li, Zhen, Zhang, Shi, Gu 213 3.6.6. Application of Adaptive Variable Scale Stochastic Resonance in Bearing Fault Diagnosis Wang, Zhang, He, Zhu 221 3.6.7. Characterization and modelling of a customs operation Hoffman 226 3.6.8. The Improvement of Instantaneous Angular Speed Estimation using Signals from a Dual Read Head for Monitoring Planetary Gearboxes Zang, Alqataweh, Xu, Shao, Gu, Ball 234 3.6.9. Novel Bearing Fault Detection using Generative Adversarial Networks Baggeröhr, Booyse, Heyns, Wilke 243 3.7. Diagnosis and prognosis 3.7.1. Condition Monitoring of a Fan using Neural Network Passi, Zhang, Timusk 251 3.7.2. Time-frequency domain analysis of varying speed vibration response of dual- rotor system Yang, Liu, Jiang, Yang 259 vi 3.7.3. Remaining Useful Life Prediction and Uncertainty Modelling with Bayesian Deep Learning Louw, Heyns 267 3.7.4. Rolling Element Bearings Prognostics Using High-Frequency Spectrum of Offline Vibration Condition Monitoring Data Behzad, Arghand, Bastami 276 3.8. Structural health monitoring 3.8.1. Investigation of Infrared Thermography as a Dual Online Diagnostic Tool for Dynamic Structural Health Monitoring Dasai, Talai, Heyns 285 3.8.2. Experimentally validated numerical simulation of prediction of structural vibration frequencies from interfacial frictional temperature signature Talai, Desai, Heyns 293 3.8.3. Corrosion monitoring with Acoustic Emission of steel embedded in concrete that is subjected to different environmental conditions Sigoba, Howse, Sikakana 301 3.8.4. Variations in vibration responses of an ice-going vessel during wave slamming Van Zijl, Bekker 312 3.8.5. Remote Monitoring of Wind Turbine Blades based on High-speed Photogrammetry Li, Wang, Liu, Tang, Gu, Ball 321 3.8.6. An improved resonance demodulation technique based on spectral kurtosis and fault characteristic harmonic-to-noise ratio Yan, Lin, Zhao, Zeng 330 3.9. Fault detection and localization 3.9.1. A vision of energy-based visualisation of large scale industrial systems for the purposes of condition monitoring Van Schoor, Uren 337 3.9.2. A low-cost condition monitoring solution for industrial bakery equipment Marais, Black 347 3.9.3. Online performance monitoring of discrete legs in a convective heat exchanger of a coal fired power plant boiler Prinsloo, Rousseau, Gosai 355 3.9.4. Simulation Based LOX/Methane Rocket Engine Fault Features Analysis Xiong, Wu, Cheng 363 vii 3.9.5. Pipe network leak detection: Sensor placement optimization using Support Vector Machines and a model-based leak detection technique Van der Walt, Heyns, Wilke 370 3.9.6. Performance visualisation of a transcritical CO2 heat pump under fault conditions De Bruin, Uren, Van Schoor, Van Eldik 380 viii Foreword Welcome to the 31st International Congress and Exhibition on Condition Monitoring and Diagnostic Engineering Management (Comadem 2018). In a certain sense the name says it all - this is the 31st edition of this established conference series. Since 1988 this annual conference has brought together influential role players in the field of condition monitoring to share the newest research developments, practical experience and industry offerings with the global condition monitoring community. This year is no different with exciting technology showcased by industry and cutting edge research presented by leading experts as well as budding condition monitoring researchers. As one author of popular fiction has written: we are living in interesting times. Mankind is faced with numerous challenges in 2018 ranging from climate change, extreme poverty and hunger in developing countries to isolationism. The main theme of the 2018 edition of the Comadem conference series was therefore aptly chosen as: energy and environmental issues facing the 21st century and beyond. This volume contains 46 accepted papers written by active researchers in the broader condition monitoring community. The authors of the accepted papers hail from ten different countries, namely: Canada, China, Germany, India, Indonesia, Iran, Kenya, Namibia, South Africa and the United Kingdom. The papers included in the conference proceedings are original research papers that have been subjected to a double-blind peer review process under the guidance of a panel of internationally recognized experts in condition monitoring. Initially each abstract was reviewed by a member of the technical programme committee. The authors of all accepted abstracts were invited to submit full length papers, which were then reviewed by two further reviewers (once again in a double-blind manner). Eighty-five percent of all the submissions were accepted. We want to thank each and every author, reviewer, speaker and exhibitor for their valuable contribution to this congress. A special word of thanks for the sponsors for their financial contribution. Last, but definitely not the least, thank you Andreas Alberts, Maggie Faku and Helen van Vuuren for your help without which this conference wouldn’t have become a practical reality. Pieter van Vuuren Chair of the Comadem 2018 organizing committee 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 1 Driving the digital enterprise Ralf Leinen1 1 Siemens, South Africa ABSTRACT The keynote will address the theme from the 4th Industrial Revolution, the German initiative "Industry 4.0" and how Digitalization together with the cloud can bring benefits to energy generation, energy consumption by industry, energy efficiency and the effect that this will have on the environment. The five pillars of Digitalization in industry also applies to energy: Reducing Time-to-market, Enhancing Flexibility, Increasing Quality, Increasing Efficiency with Increasing Security. . Keywords: Industry 4.0, digitalization, energy efficiency BIOGRAPHY Leinen is Vice President for Siemens Digital Factory & Process Industries & Drives for Southern & Eastern Africa. He studied engineering and business administration at the University of Applied Sciences Cologne (Rheinische Fachhochschule Köln) and is an experienced senior level manager in various industries related to manufacturing and processes. Leinen joined Siemens in 1990 has been in international sales and management roles at the company’s headquarters and regions in Germany, the USA and South Africa. He is currently driving Digitalization in key vertical markets in both Manufacturing and Process Industries 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Modelling and Simulation of a Two Stage Reciprocating Compressor for Condition Monitoring Based on Motor Current Signature Analysis Usama Haba1, 2, Khaldoon Brethee1, Samir Alabied1,2, Debanjan Mondal1, Fengshou Gu1 and Andrew Ball1 1Centre of Efficiency and Performance Engineering, University of Huddersfield, Queensgate, Huddersfield HD1 3DH, UK. 2University of Al-Jabal Al-Gharbi, Gharyan, Libya. ABSTRACT Reciprocating compressors (RC) are widely used in various industrial processes such as petrochemical production for compressing and moving gases. It is necessary to have accurate and cost-effective detection and diagnosis approaches for ensuring process efficiency and safety. This paper investigates into utilising motor current responses for monitoring RCs. A comprehensive numerical simulation studies were carried out when the reciprocating compressor is under a wide range of operations along with different fault cases. An extended dynamic model has been developed, which allows comment faults to be studied in five physical processes including AC induction motor, crankshaft dynamics, the mass flow rate through valves, cylinder pressure variation, and valve dynamics. The changes in motor current signatures due to the faults such as valve leakage, intercooler leakage, broken rotor bars and stator winding asymmetries can then be understood with great insights, leading to the establishment of effective and reliable diagnostic features. Simulation results show that two sideband peaks, normalised speed and current magnitude calculated using a modulation signal bispectrum are significantly correlated with compressor dynamics and can be used as feature parameters for separating between different faults over the entire operating range of the compressor. Moreover, it shows that motor current signals contain sufficient information for diagnosing different faults. Keywords: Reciprocating compressor model; Motor current signature analysis; Modulation signal bispectrum; Condition monitoring Corresponding author: Usama Haba (Usama.haba@hud.ac.uk) 1. INTRODUCTION Reciprocating compressor and its driving motor is one of the most popular machine in-use in various industrial systems. This is mainly because of its simple powerful construction and flexibility which offer high value of reliability. However, many of its components work in unsuitable working conditions like high-temperature and high- pressure environment [1], which in turns reduce the compressor efficiency. Consequently, this machine subjected to different faults related to its functionalities. Such faults not only degrade performance but cause production losses and consume additional energy resulting in an additional cost and even lead to severe damages to other parts [2], resulting in a reduction of the performance of machines. Hence their conditions need to be monitored. Damages to compressor valves are considered to be the main cause of a serious machine breakdown and led to unscheduled shutdown [2, 3]. Similarly, intercooler leakage is also considered as a common fault, caused by deterioration of pipe seals or loose joints. Additionally, faults occurring in its driving motor such as stator winding asymmetry can also affect the performance of a compressor. Increases in motor stator resistance can lead to voltage imbalances, thus the increase in motor temperature and reduction in the motor efficiency [4]. Therefore, efficient and effective condition monitoring techniques are actively studied to detect machine faults as early as possible in order to avoid catastrophic failures, reduction of maintenance cost and improve system availability [3, 5, 6]. 2 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management More attention is paid to the diagnosis of compressor faults based on vibration and current analysis, in which a large amount of research work has been conducted with state of the art technologies. Gu and Ball [7] presented a new method based on the use of Wigner–Ville distribution, wavelet transform [8], support vector machines [9], induction motor current signature analysis [10] and a high-resolution spectral analysis [11] for the detection of different RC faults. Lin et al. [12] used artificial neural network (ANN) for valve diagnostic combining three indices of time, frequency and amplitude. Instantaneous angular speed [3] and dynamic cylinder pressures [13], relevant vector machines based on envelope spectrum analysis [2], and time-frequency representation under different operating conditions [14] were used as analysis tools for diagnosis of various leakages in compressor valve. The motor current signatures can also be used as a mean for detecting abnormalities in the downstream machines such as compressors systems [15]. The rotor disturbances due to mechanical problems can also be detected by analysing the induction stator current [16]. Mathematical modelling is the way to study and understand the behaviour of the machine, the interaction of its components and its performance characteristic. In order to develop more accurate fault detection and diagnosis techniques, researchers have developed different mathematical models on the compressor. The first compressor model was developed by Costagliola [17] to study the dynamics of the valve with one degree-of-freedom. In [18] different degree-of-freedom was considered in the model for valve dynamics. A similar model was developed for compressor performance prediction [19], it studies in-cylinder pressure and the effect of piston rings leakage. Similarly, Winandy et al. [20] presented a numerical model to estimate the performance of the compressor which was able to determine discharge temperature and the ambient losses considering the compression as an isentropic process. Elhaj et al. [21] developed a numerical simulation for compressor condition monitoring under different operating conditions and was validated by experimental results. In [22] a similar numerical simulation a semi- hermetic CO2 was presented which included compressor leakages and friction loss. In all the previous studies, only a few models focused on the diagnostic of different compressor faults and their effects on compressor performance. However, most of the presented RC models have not been extended to investigate the influences of the stator winding faults on the compressor performances and have not shown the potential of using the induction motor as a mean of assessing the condition of downstream driven equipment such as reciprocating compressors. For model development and simulation study, the mathematical model offers a number of advantages. In which, modelling provides useful fault signatures with less time and equipment, particularly in the case of simulating faults which cannot be introduced to a real compressor for the purpose of experimental studies. Secondly, it allows researchers to study different compressor performances under the same conditions [21]. The main objective of this paper is to study and analyse the dynamic responses of a compressor in the context of compressor condition monitoring. On this ground, a mathematical model is developed for RC to simulate different operating conditions. The model describes the compressor working principles, valve movements, crankshaft motions and compressor pressure for the 1st and 2nd stages. The simulation considers both healthy and faulty conditions which include valve leakage, intercooler leakage, the effects of stator asymmetry and broken rotor bar. In addition, in-cylinder pressure and motor current signals are analysed to identify the fault features. Finally, to evaluate the model and simulation results, this paper presents a comparative result of measured and predicted in-cylinder pressure signals and motor current signals. 2. EXPERIMENTAL SETUP An experimental study was employed based on a two-stage, single acting V-shape Broom Wade TS9 reciprocating compressor. It can deliver compressed air up to 0.8MPa (120 psi) to a horizontal air tank. The compressor is driven by a three-phase KX-C184, 2.5 kw induction motor via a transmission belt as shown in Figure 1. The test data was recorded using the Power 1401 Plus CED high-speed data acquisition system with a sampling rate of 49 kHz; the data of each test was recorded for 3.6-second intervals. In addition, a shaft encoder and a pressure sensor were also used to measure the output rotating speed and in-cylinder pressure respectively. In this work, to evaluate the performance of MCSA, the current data were collected for four working conditions performed one by one. Starting with normal condition (BL), then discharge valve leakage on the second stage (DVL) which was introduced by drilling a 2mm hole in the valve plate to simulate the leakage as shown in Figure 1(b). 3 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Leakage on the intercooler (ICL) was introduced to the intercooler pipe by loosening the nut holding the pipeline into second stage suction valve to simulate the intercooler leakage Figure 1(c). Stator winding asymmetry (Rfs) introduced by phase winding by increasing resistance of 1.0 Ω using an external resistor bank, shown in Figure 1(d). Figure 1: A photograph of the test rig facility 3. MODEL EVALUATION Different conditions were simulated during modelling under various compressor discharge pressure to obtain features for compressor fault diagnosis at an early stage. The model dynamic responses were obtained under healthy and faulty operating conditions. Both model and experimental results are obtained and compared for in-cylinder pressure and motor current signals to evaluate the developed model with the experimental study. 3.1. Measured and predicted cylinder pressures comparison The measured and predicted cylinder pressure for both stages is illustrated in Figure 2. The 1st stage cylinder pressure (the dashed blue line) Figure 2 clearly shows the four working processes (expanding, suction, compression and discharge) for each compression cycle. From the figure representation, a rapid drop in cylinder pressure can be shown during the re-expansion due to the downwards piston movement for the range of 0o to 30o crank angle allowing air enters the cylinder (suction) when the cylinder pressure drops sufficiently below the suction manifold, which lasts from 30o to 190o. Then the piston starts to move upwards, compressing the trapped air and pressure increases gradually up to the maximum which can be seen in the range from 190o to 285o. The discharge valve opens when the pressure exceeds the discharge manifold, allowing the compressed air to be pushed into the intercooler. 0 100 200 300 Crank Angle(deg.) 0 0.5 1 Pr es su re (M Pa ) (a) Measured cylinder Pressure at 120psi First Stage Second Stage 0 100 200 300 Crank Angle(deg.) 0 0.5 1 (b) Predicted Cylinder Pressure at 120psi First Stage Second Stage Figure 2: Pressure for low and high cylinders: (a) Measured, (b) predicted For the 2nd stage cylinder pressure as shown by a dotted red line in Figure 2 shows the features of the compressor processes in the pressure profiles. However, each process occurs 90o earlier in terms of crank angle whereas, the 2nd stage suction process occurs within the discharge process of the first stage cylinder. Comparing both waveforms, the experimental and the predicted result generally show a good agreement for both cylinders and have the same trend 4 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management which confirms that the pressure model is reliable. Yet, the measured results show a slight oscillation especially in the second stage during the discharge process and this likely caused by valve flutter. 3.2. Measured and predicted current waveform comparison The measured and predicted motor current signal for different compressor discharge pressures are illustrated in Figure 3. The signals presented are of more than three cycles for comparison purpose. During the compressor working cycle, the motor is under a dynamic load fluctuation of about 7.3 Hz (rotation frequency of the compressor) which leads to a motor current modulation. A clear amplitude modulation (AM) effect can be clearly observed from the current waveform. Moreover, the amplitude increases as the pressure increases. Thus, the motor current signal can be used to obtain useful information from the compressor conditions. A comparison between the measured and predicted current signals shows that the basic features are the same and consistent over different compressor discharge pressures. This confirms that the model is valid for different discharge pressures and it can be used for fault simulation study. 0 200 400 600 800 1000 Crank Angle(deg) -5 0 5 C ur re nt (A ) (a) Measured Current Signal 60 90 120 0 200 400 600 800 1000 Crank Angle(deg) -5 0 5 (b) Predicted Current Signal Figure 3: Current waveform for healthy compressor: (a) measured, (b) predicted 4. COMPRESSOR FAULT DETECTION Valve leakages are considered as the most common fault in RC and the most frequent root cause of unplanned compressor shutdowns [3]. At high-pressure stages, both suction and discharge valves are exposed to a high number of impacts per second and also to high-speed flow and high temperature which cause non-uniform wear of the sealing surfaces between the valve seat and its plates which cause leakage in the valve system. Moreover, loose connection to the intercooler due to compressor vibration and pipe deterioration is also considered as a common fault affecting the compressor efficiency. Furthermore, failures to its associated motor parts such as broken rotor bar or stator winding asymmetry cause a reduction of motor efficiency. As a result, the temperature of motor goes up. The oscillatory running conditions can also affect the performance of a compressor. Therefore, this paper examines the leakage through the 2nd stage discharge valve, motor winding asymmetry and intercooler leakage. Simulation is used in each case to predict the signatures of each fault. 4.1. Valve leakage modelling Valve leakage was simulated by drilling a small hole (2mm in diameter) into the valve plate Figure 1(b) which cover 2% of the plate. The leakage was introduced into the 2nd stage discharge valve where the valve works under high pressure and high temperature. During the simulation, valve leakage was simulated as an additional flow through an orifice in parallel to the normal flow. The cylinder pressure Equation was modified to consider the leakage through the discharge valve, in which additional term for the mass flow ),( HLvdLm was included. 2 2 2 ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) 1 c H i H vi H c H vd H c H c H c H vdL H c H P c m c m p v c mv γ = − − −       (1) The discharge valve leakage mass flow rate ),( HLvdLm can be calculated as )()()()()()()( 2).( Hc e HdHcHdLHddHdLHvdL PPAxcm −= ρβ (2) 5 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management where ( ) H HHdl d c sign P Pβ = − is the direction of the flow (+1 leakage flow from the cylinder into discharge valve and -1 flow from discharge valve into the cylinder), )(HdLA is the leakage flow area of the 2nd stage discharge valve. 4.2. Intercooler leakage modelling The assumption is that there is a small leakage in the compressor’s intercooler pipes due to deterioration or loose nut. This fault was simulated by loosening the nut that connects the intercooler pipe into the second stage suction valve Figure 1(c). The intercooler pressure Equation was modified to consider the leakage through the intercooler pipe in which additional term icm was included. [ ]icicHviHicvdlicL ic ic mcmcmcvP   2 )( 2 )( 21 −−= (3) The intercooler leakage mass flow rate icm can be determined as ( ). 2ic ic ic ic ic ic om c x A P P Pβ= − (4) where )( oinic PP −=β is the direction of the flow and icA is the leakage flow area of the intercooler. 4.3. Motor fault modelling Stator faults are modelled by varying the resistance matrices and inductance matrices associating with stators correspondingly. The resistance matrices in the qd0 reference frame for the healthy motor is defined as [23]: 0 1 0 0 0 1 0 0 0 1 qd sr srr r    =      (5) For the simplicity, the assumption is that the stator fault occurs only in phase a in the stator. Resistance matrix of the stator with shorted turns in the qd0 reference frame can be derived as [23]: (11) (13) 0 (22) (31) (33) 0 0 0 0 s s qd sr sr s s s r r r r r r r    =      (6) where 0qdsfr is the stator winding resistance matrix for the faulty motor. Let gas be the percentage of the remaining un-shorted winding in stator phase a, and the non-zero elements of the matrix 0qdsfr are being defined as: (11) (13) (22) (31) (33) 1 2 1 1(2 1) & ( 1) & 1& ( 1) & ( 1)3 3 3 3s as s as s s as s asr g r g r r g r g= + = − = = − = + (7) In the same way, the rotor faults (broken rotor bars) are modelled by changing the resistance matrices and inductance matrices associating with rotor [23]. 6 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 5. SIMULATION AND MODEL VALIDATION 5.1. Cylinder pressures Figure 4 illustrates both the measured and simulated cylinder pressure under 120 psi discharge pressure load, for the different operating condition. During the expansion and suction strokes, a noticeable deviation of the pressure waveform for valve leakage can be seen due to a high-pressure air backflow through the discharge valve which raises the in-cylinder pressure above the normal conditions and hence the discharge valve opens earlier compared to healthy condition. The intercooler leakage shows small decrement of in-cylinder pressure due to the fact that the pressure escapes through the intercooler leak causes a slight delay in valve opening time. It is hardly distinguishable between the stator winding asymmetry and the healthy condition. Comparing both the measured and the predicted results for high-pressure cylinder shows a good agreement and this agreement is considered to be acceptable. 0 50 100 150 200 250 300 350 Crank Angle(deg.) 0 0.5 1 Pr es su re (M Pa ) (a)Measured Second Stage Pressure at 120psi BL R fs = 1.0 ICL DVL 0 50 100 150 200 250 300 350 Crank Angle(deg.) 0 0.5 1 (b)Predicted Second Stage Pressure at 120psi Figure 4: 2nd stage cylinder pressure for different conditions: (a) measured results, (b) predicted results 5.2. Motor current waveforms The measured and predicted motor current results for the healthy and faulty conditions under 120 psi discharge pressure load are depicted in Figure 5. A clear amplitude modulation (AM) effect can be observed from the current waveform for both measured and predicted signals as mentioned earlier. The current amplitude for the crank angles from 250o to 350o is higher than that from 100o to 200o due to the load fluctuation caused by the increasing pressure during the compression stroke. The measured and the predicted results for current waveform also show good agreement. 0 200 400 600 800 1000 1200 1400 Crank Angle(deg) -5 0 5 C ur re nt (A ) (a) Measured Current Signal at 120psi BL R fs = 1.0 ICL DVL 0 200 400 600 800 1000 1200 1400 Crank Angle(deg) -5 0 5 (b) Predicted Current Signal at 120psi Figure 5: Current waveform for different conditions 5.3. Motor current spectra Figure 6 depicted the measured and predicted current spectra for normal operation (healthy) and different faulty conditions under 120 psi discharge pressure load. It can be seen that signals for both measured and predicted signals exhibit a clear AM effect that causes a series of sidebands at 50 7.3s rf nf n Hz± = ± . Especially, the amplitudes of sidebands show differences for different fault cases. 7 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 35 40 45 50 55 60 65 Frequency (Hz) 0 1 2 A m pl itu de (A ) Measured Current Spectrum BL R fs = 1.0 ICL DVL 35 40 45 50 55 60 65 Frequency (Hz) 0 0.5 1 1.5 Predicted Current Signal Figure 6: Current spectrum for different operating conditions As shown in Figure 7 the sideband amplitudes increase as the compressor pressure increases. Moreover, when the sidebands amplitude for both measured and predicted signals are compared, it shows that both have the same increasing trends with discharge pressures. In addition, it also shows high consistency for different fault cases. 60 70 80 90 100 110 120 Discharge Pressure (psi) 1.5 2 2.5 A m pl itu de s ( A ) Measured Sidbands vs Pd BL R fs = 1.0 ICL DVL 60 70 80 90 100 110 120 Discharge Pressure (psi) 0.8 1 1.2 1.4 Predicted Sidbands vs Pd Figure 7: Measured and predicted sidebands vs pressure 6. DIAGNOSTIC FEATURES As MSB has the unique properties of combining sidebands and noise suppression simultaneously [10, 24-26], it was applied to simulated current signals to obtain reliable diagnostic features. Figure 8 shows a typical MSB slice at supply frequency of 50 Hz. With its much simpler representation, MSB peaks can be observed clearly at various orders of harmonics associating with the rotational frequency 7.3rf Hz= of the compressor. Moreover, these peaks decrease slightly when there is a fault. In addition, the fault also induces additional frequencies components at the twice slip frequency: 2 3.2brb sf sf Hz= = due to the presence of broken rotor bar (BRB) fault. 7.3310 14.6620 21.9929 29.3239 36.6549 43.9859 Frequency(Hz) 10 -4 10 -2 10 0 10 2 M SB M ag .(A . 2 ) 1f r 2f r 3f r 3f br b Baseline SWA+BRB Figure 8: A typical MSB slice at 50Hz 8 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Based on these observations, four primary diagnostic features including 1) the speed normalised by rated speed; 2) current amplitude normalised by the rated current of AC motor; and 3) two MSB peaks are extracted from the MSB slice to differentiate different fault cases, which allows diagnostics to be implemented without the need of additional measurements such as pressure, temperature, etc. 6.1.Diagnostics of compressor faults Figure 9 shows the feature characteristics for different faults simulated on the compressor. Figure 9 (a1), (a2) and (a3) shows that all the four features exhibit distinctive differences for the inter-cooler leakage (ICL). It means that ICL cause significant influences on the pressures of both stages and hence on the dynamics of torque, and thus large deviation from the baseline operation. So this fault can be easily separated. For the suction valve leakages (SVL) and discharge valve leakage (DVL), shown in Figure 9 (b1-b3) and (c1-c3) respectively, smaller changes are induced to the pressure and hence the dynamics of torque. Consequently, only smaller deviations are shown in the current signals by the MSB peaks at 2 rf . Nevertheless, these changes can be acceptable to make differences from the baseline and between the two faults as their changing trends are different with the normalised current values. For the losing belt transmission (LBT), the deviated speed pattern in Figure 9 (d1) can be based to separate it from others. Although the deviation values look much smaller, the high accuracy of speed estimation by higher order harmonics can ensure the diagnostic accuracy. In general, these feature parameters can be sufficient for the diagnostics of these common faults. 0.2 0.4 0.6 0.8 1 Norm. Current 0.96 0.97 0.98 0.99 1 N or m . S pe ed (a1) ICL Baseline ICL 0.2 0.4 0.6 0.8 1 Norm. Current 0 0.2 0.4 0.6 M SB P ea k at 1f r (a2) ICL 0.2 0.4 0.6 0.8 1 Norm. Current 0 0.005 0.01 0.015 0.02 M SB P ea k at 2f r (a3) ICL 0.7 0.8 0.9 1 Norm. Current 0.98 0.985 0.99 0.995 1 N or m . S pe ed (b1) DVL Baseline DVL 0.7 0.8 0.9 1 Norm. Current 0.2 0.4 0.6 0.8 M SB P ea k at 1f r (b2) DVL 0.7 0.8 0.9 1 Norm. Current 0 0.01 0.02 0.03 0.04 M SB P ea k at 2f r (b3) DVL 0.7 0.8 0.9 1 Norm. Current 0.98 0.985 0.99 0.995 1 N or m . S pe ed (c1) SVL Baseline SVL 0.7 0.8 0.9 1 Norm. Current 0.2 0.4 0.6 0.8 M SB P ea k at 1f r (c2) SVL 0.75 0.8 0.85 0.9 0.95 Norm. Current 0.005 0.01 0.015 0.02 0.025 M SB P ea k at 2f r (c3) SVL 0.7 0.8 0.9 1 Norm. Current 0.98 0.985 0.99 0.995 1 N or m . S pe ed (d1) LBT Baseline LBT 0.7 0.8 0.9 1 Norm. Current 0.2 0.4 0.6 0.8 M SB P ea k at 1f r (d2) LBT 0.75 0.8 0.85 0.9 0.95 Norm. Current 0.005 0.01 0.015 0.02 M SB P ea k at 2f r (d3) LBT Figure 9: Characteristics of feature parameters for different mechanical faults 6.2.Diagnostics of motor faults Figure 10 presents the feature characteristics for different faults simulated on the driving motor. For the stator winding asymmetry (SWA) which is of 10% increase of the stator phase resistance, shown in Figure 10 (a1-a3), the deviations in speed, MSB peaks are very significant and different from that of ICL in Figure 9. Therefore, this fault can be identified easily. For the BRB case (1 bar over 20 of them), only the MSB peaks at brbf is significantly different from the baseline, so this unique feature can be based to separate this fault. The combination of SWA and BRB show the deviation characteristics of both, so it is can be differentiated using these features completely. In 9 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management addition, as these feature patterns along with the additional one at brbf are exhibit very different characteristics from that of compressor faults in Figure 9, these two types of faults can be diagnosed without any ambiguity. 0.2 0.4 0.6 0.8 1 Norm. Current 0.98 0.99 1 N or m . S pe ed (a1) SWA 0.2 0.4 0.6 0.8 1 Norm. Current 0.2 0.4 0.6 M SB P ea k at f r (a2) SWA 0.2 0.4 0.6 0.8 1 Norm. Current 0 0.01 0.02 M SB P ea k at f br b (a3) SWA 0.2 0.4 0.6 0.8 1 Norm. Current 0.98 0.99 1 N or m . S pe ed (b1) BRB 0.75 0.8 0.85 0.9 0.95 Norm. Current 0.2 0.4 0.6 M SB P ea k at f r (b2) BRB 0.2 0.4 0.6 0.8 1 Norm. Current 0 0.01 0.02 M SB P ea k at f br b (b3) BRB 0.2 0.4 0.6 0.8 1 Norm. Current 0.98 0.99 1 N or m . S pe ed (c1) SWA+BRB 0.2 0.4 0.6 0.8 1 Norm. Current 0.2 0.4 0.6 M SB P ea k at f r (c2) SWA+BRB 0.2 0.4 0.6 0.8 1 Norm. Current 0 0.01 0.02 M SB P ea k at f br b (c3) SWA+BRB Figure 10: Characteristics of feature parameters for electrical faults 7. CONCLUSIONS Simulation studies along with experimental verification have shown that motor current waveforms contain sufficient information and have the capability of characterising reciprocating compressor working process under various common faults for a wide range of discharge pressure. Thus, fault detection and diagnosis can be performed for reciprocating compressor condition monitoring based on the motor current analysis. It can be concluded that the agreement between the measured and predicted results for the first and second stages is considered to be acceptable. The comparative study illustrates that the diagnostic features of normalised speed, normalised current allows reflect the basic operating conditions of reciprocating compressors, and the MSB sideband peaks allow the dynamics of the torques induced by pressure changes to be accurately reflected. The combinations of these multiple current features allow various common faults (valve leakages, inter-cooler leakage, loose belt, stator winding asymmetry and broken rotor bar) to be diagnosed over a wide range of operating conditions. REFERENCES [1] AlThobiani, F. and A. Ball, An approach to fault diagnosis of reciprocating compressor valves using Teager–Kaiser energy operator and deep belief networks. Expert Systems with Applications, 2014. 41(9): p. 4113-4122. [2] Ahmed, M., et al. Fault diagnosis of reciprocating compressors using revelance vector machines with a genetic algorithm based on vibration data. in Automation and Computing (ICAC), 2014 20th International Conference on. 2014. IEEE. [3] Elhaj, M., et al., A combined practical approach to condition monitoring of reciprocating compressors using IAS and dynamic pressure. World Academy of Science, Engineering and Technology, 2010. 4(3): p. 287-293. [4] Sridhar, S. and K.U. Rao. Detection of simultaneous unbalanced under-voltage and broken rotor fault in induction motor. in Condition Assessment Techniques in Electrical Systems (CATCON), 2013 IEEE 1st International Conference on. 2013. IEEE. 10 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [5] N. A. Hussein, D.Y.M., E. M. Abdul-Baki, 3-phase Induction Motor Bearing Fault Detection and Isolation using MCSA Technique based on neural network Algorithm. Journal of Engineering and Development, 2012. 16(3): p. 15. [6] Haba, U., et al. Diagnosis of Compound Faults in Reciprocating Compressors Based on Modulation Signal Bispectrum of Current Signals. in Proceedings of the 2nd International Conference on Maintenance Engineering, IncoME-II 2017,(University of Manchester, 5-6 September 2017). 2017. University of Manchester. [7] Gu, F. and A. Ball, Use of the Smoothed Pseudo-Wigner-Ville Distribution. Maintenance & Asset Management Journal, 1995. 10(2): p. 16-23. [8] Wang, Y., M. Liao, and T. Zhao, Application of the wavelet transform to fault diagnosis in compressor valves. Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2003. 14(12): p. 1046-1048. [9] Cui, H., et al., Research on fault diagnosis for reciprocating compressor valve using information entropy and SVM method. Journal of loss prevention in the process industries, 2009. 22(6): p. 864-867. [10] Gu, F., et al., Electrical motor current signal analysis using a modified bispectrum for fault diagnosis of downstream mechanical equipment. Mechanical Systems and Signal Processing, 2011. 25(1): p. 360-372. [11] Garcia-Perez, A., et al., The application of high-resolution spectral analysis for identifying multiple combined faults in induction motors. Industrial Electronics, IEEE Transactions on, 2011. 58(5): p. 2002-2010. [12] Lin, Y.-H., H.-C. Wu, and C.-Y. Wu, Automated condition classification of a reciprocating compressor using time–frequency analysis and an artificial neural network. Smart materials and structures, 2006. 15(6): p. 1576. [13] Pichler, K., et al. Detecting broken reciprocating compressor valves in the PV diagram. in Advanced Intelligent Mechatronics (AIM), 2013 IEEE/ASME International Conference on. 2013. IEEE. [14] Pichler, K., et al., Fault detection in reciprocating compressor valves under varying load conditions. Mechanical Systems and Signal Processing, 2016. 70: p. 104-119. [15] Haynes, H. and R. Kryter, Condition monitoring of machinery using motor current signature analysis. 1989, Oak Ridge National Lab., TN (USA). [16] El Hachemi Benbouzid, M., A review of induction motors signature analysis as a medium for faults detection. Industrial Electronics, IEEE Transactions on, 2000. 47(5): p. 984-993. [17] Costagliola, M., The theory of spring-loaded valves for reciprocating compressors. JOURNAL OF APPLIED MECHANICS- TRANSACTIONS OF THE ASME, 1950. 17(4): p. 415-420. [18] WAMBSGANSS, M., MATHEMATICAL MODELING AND DESIGN EVALUATION OF HIGH-SPEED RECIPROCATING COMPRESSORS. 1968. [19] Manepatil, S., G. Yadava, and B. Nakra, Modelling and computer simulation of reciprocating compressor with faults. JOURNAL- INSTITUTION OF ENGINEERS INDIA PART MC MECHANICAL ENGINEERING DIVISION, 2000: p. 108-116. [20] Winandy, E., C. Saavedra, and J. Lebrun, Simplified modelling of an open-type reciprocating compressor. International journal of thermal sciences, 2002. 41(2): p. 183-192. [21] Elhaj, M., et al., Numerical simulation and experimental study of a two-stage reciprocating compressor for condition monitoring. Mechanical Systems and Signal Processing, 2008. 22(2): p. 374-389. [22] Yang, B., C.R. Bradshaw, and E.A. Groll, Modeling of a semi-hermetic CO2 reciprocating compressor including lubrication submodels for piston rings and bearings. International Journal of Refrigeration, 2013. 7(36): p. 1925-1937. [23] Mustafa, M.O., Faults detection and diagnosis for three phase induction machines. 2012, Luleå tekniska universitet. [24] Naid, A., et al. Bispectrum Analysis of Motor Current Signals for Fault Diagnosis of Reciprocating Compressors. in Key Engineering Materials. 2009. Trans Tech Publ. [25] Alwodai, A., et al. Modulation signal bispectrum analysis of motor current signals for stator fault diagnosis. in Automation and Computing (ICAC), 2012 18th International Conference on. 2012. IEEE. [26] Gu, F., et al., A new method of accurate broken rotor bar diagnosis based on modulation signal bispectrum analysis of motor current signals. Mechanical Systems and Signal Processing, 2015. 50: p. 400-413. 11 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Experimental Study on Vibration Reduction Characteristics of a Helical Gear Coupling System Based on ISFD Zhang Yipeng1 , He Lidong2, Lu Kaihua3 Beijing Key Laboratory of Health Monitoring and Self-recovery for High end Mechanical Equipment , Beijing University of Chemical Technology, Beijing, 100029, China ABSTRACT In view of the characteristics of helical gear coupling systems and their complex vibration characteristics, a new type of integral squeeze film damper (ISFD) was designed to reduce the vibration transmission of the coupling process of a pair of helical gears to the bearing and foundation by squeezing film. For the experiment, two pairs of helical gears needed to be designed: one is a normal helical gear, and the other is a faulty helical gear worn on the tooth surface. The first-stage helical gear test bench was set up for the experiments. The rigid support and ISFD support were each installed on the four bearings to study the ISFD vibration reduction of the driving shaft and the driven shaft of two pairs of helical gear systems under the effects of different rotational speeds. The experimental results show that, under the same conditions, the vibration amplitude of the faulty helical gear is much larger than that of the normal gear. The ISFD can improve the vibration of the helical gear coupling system. When the system increased the damping, the resonant modulation phenomenon of the faulty helical gears was reduced. Also, ISFD has a frequency shift effect, and this can effectively shift away from the resonance region. In addition, ISFD can significantly reduce the meshing frequency of the helical gear coupling system caused by the resonance phenomenon. Keywords: Vibration control; Helical gear; Integral squeeze film damper (ISFD); faulty gear; Time domain analysis; Frequency domain analysis . 1. INTRODUCTION With the rapid development of modern science and technology, rotating machinery is developed quickly. Gear transmission systems are widely used in the fields of aviation, energy, petrochemicals, shipbuilding, and other fields for power and transmission in rotating machinery. However, the meshing transmission process of the gear system is complicated. The complexity is not only affected by the external factors of the prime mover and the load but also by the influence of internal excitation, such as time-varying meshing stiffness. These factors can cause strong vibration of the system, and, in severe cases, they will lead to faults in the vulnerable components, such as bearings. Helical gears are a relatively stable type of gear for transmission but vibration still exists. When the gears are worn out, glued, and fractured, the vibrations are aggravated. A large amplitude will cause fatigue damage to the helical gears. It will be transmitted to the foundation and other parts through shafts, bearings, etc., resulting in vibration of the entire mechanical system, causing safety hazards [1]. Helical gear vibration excitation is diverse and complex, mainly affecting the modulation part and the additional pulse part. The modulation part mainly for the meshing frequency and its higher harmonics, the additional pulse part for the frequency and its lower harmonic components [2]. For helical gear vibration, domestic and foreign scholars often use the following ways to control it: (1) by adjusting the structural parameters of the helical gear [3], so that the meshing situation is more reasonable, thereby reducing the excitation force and controlling the vibration source; (2) installing a damping element on a helical gear structure or system and dissipating the vibration energy through damping [4]; and (3) installing an isolation device between the vibration source and the foundation to further cut off the transmission of the vibration. This local vibration reduction measure can reduce the vibration transmitted during meshing of the helical gear along the shaft and bearing to the entire gearbox [5]. At present, the most commonly studied method can be found in (1) and (2). The method reported in (1) belongs to the active design and combines the helical gear dynamics and vibration theory to reduce the vibration by optimizing the design of the helical gear parameters. The method in (2) involves 12 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management passive damping, which is commonly used when viscoelastic dampers are affected by the thickness of the damping layer [6], and the friction dampers are limited to a narrower frequency band for damping [7]. ISFD is a type of squeeze film damper (SFD) used in conjunction with bearings to reduce rotor vibration problems and improve system stability. ISFD has low stiffness and high damping. Low stiffness reduces the dynamic force that helical gears transmit to the foundation and other components through bearings. High damping dissipates vibrational energy. However, there are some problems [8-9] in SFD, such as the nonlinearity of the oil film, large occupied space, difficult installation, and nonlinear change of the stiffness and damping characteristics [10]. A new type of ISFD was designed, and it can overcome the above problems of SFD. In this research, targeting the vibration problem of the helical gear coupling system, the new ISFD was designed to isolate the vibration source from the foundation to reduce the vibration energy transferred to the bearing and the foundation. A pair of normal helical gears and faulty helical gears was designed, and a first-order helical gear coupling test bench was set up. The vibration amplitude of ISFD was compared with that of rigid support to verify the damping effect of ISFD. A simple and convenient helical gear vibration control method was obtained. 2. HELICAL GEAR VIBRATION MODEL AND ANALYSIS First, we establish a helical gear foundation single-degree-of-freedom vibration model, as shown in Figure 1. The helical gear is simplified as a cylindrical mass. ISFD is between the bearing and foundation, expressed in terms of the stiffness coefficient K and the damping coefficient C. According to the vibration isolation principle [11], K at the bearing is properly reduced, and C is increased; thus, the natural frequency of the system is reduced and the vibration transmission coefficient is reduced. Figure 1. Single-degree-of-freedom helical gear vibration model [12]. Figure 2. Single-degree-of-freedom system vibration isolation [13]. According to Fig. 1 and Fig. 2, the simplified mechanism diagram of the vibration isolation of the helical gear foundation system is analyzed. The dynamic equation of this system is: 0 j tMx Cx Kx F e ω+ + =  (1) where K, M, and C, denote the stiffness, mass and damping of the vibration isolation system, respectively. The vibration displacement is, (2) where 0 K Mω = , ( )2C KMζ = , . The transmission force transmitted to the foundation by the vibration-isolating spring is (3) 13 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The magnitude of transmission force transferred to the foundation is given by (4) (4) The vibration transfer coefficient is (5) Through Fig. 1 and 2 and Eq. (5), it can be found that, when the vibration frequency ω is much larger than the resonance frequency ω0 of the entire system, the vibration transmission coefficient is less than 1. The helical gear meshing frequency is higher, it is often larger than the natural frequency of the helical gear coupling system. At this point, the system has the vibration isolation function. The higher the vibration frequency is at the resonance frequency of the entire system, the smaller the vibration transmission coefficient and the better the vibration isolation effect. When reducing the stiffness in the system and increasing the damping of the system, the resonance of the system can be avoided, and the stability of the system can be improved. 3. CHARACTERISTICS OF ISFD 3.1. Structural features ISFD adopts EDM technology to process the slits of the squeezed film to form a four area integrated structure, which can provide high damping and low stiffness for the whole system. This integrated structure has more- precise concentricity and can improve the installation accuracy. Figure 3 shows the experimental ISFD. The S- shaped spring structure can evenly distribute pressure and provide low radial static stiffness. It can also provide significant damping for the system in the process of squeezing the oil film. Figure 3. S-shaped integral squeeze film damper. Fig. 4. Comparison of the mechanical model between the rigid support and the ISFD. Figure 4 is a comparison of the schematics during the experiment. The left figure shows the rigid support structure, and the right figure shows the ISFD structure. In the process of vibration transmission, the vibration deformation mainly concentrates on the elastic support part, part of the vibration energy is dissipated during the movement of the spring, and there is also the process of squeezing the oil film. The damping provided can dissipate the vibration energy in the form of thermal energy. 3.2. Energy Dissipation Mechanism The stiffness coefficient K and the damping coefficient C in the ISFD structure are relatively independent, and the spring structure and damping can decouple. When the vibration is transmitted to the ISFD, the dynamic pressure Δp generated by the squeeze film causes the lubricant to move circumferentially along the gap. Because the ISFD has a division, it has a linear relationship and consumes energy during the flow friction. The resulting damping will effectively reduce the vibration of the helical gear system and increase the effective damping ratio of the system [14-15]. 14 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 4. EXPERIMENT To study the vibration of the normal helical gear pair and the faulty helical gear pair under the same conditions, two pairs of helical gear pair were designed: the normal helical gear pair shown in Figure 5(left one) and the faulty helical gear pair in Figure 5(right one), where the fault in the helical gear pair is mainly tooth surface wear. Figure 5. Normal helical gear and faulty helical gear The two types of comparative support in this experiment are rigid sleeve support and ISFD support, The rigid sleeve support consists of a rigid sleeve, rolling bearing, and foundation; ISFD support consists of the foundation, the S-shaped elastic body, the rolling bearing, the lubricating oil, the O-shaped sealing ring, and the end seal, where the O-shaped sealing ring includes the small O-shaped sealing ring between the end seal and the shaft, and the foundation and the end seal are the large O-shaped seal. The helical gear coupled transmission test bench is shown in Figure 6. Driven by a permanent-magnet-type DC servo motor. The speed of the motor can be adjusted by the speed regulator. Before the experiment, the helical gear pair was lubricated. The photoelectric sensor was used to collect the rotational speed, and the sensor that collects the vibration was an acceleration sensor. An acceleration sensor was installed in the radial x-direction of the driving shaft and in the driven shaft. Both these sensors were connected with an LC-8008 vibration monitoring and diagnosis system. The time domain waveforms, frequency domain waveforms, and axial trajectory of the vibration could be read in real time on the computer interface. Figure 6. Experimental device 5. ANALYSIS OF RESULTS Experimental results of the following three groups of experiments were strictly analyzed using the control variable method, and accelerometers were used to collect the acceleration and time domain signals on the master and slave axes, respectively, and the speed ranges were n1 = 300~1500 r/min. The data were recorded every 100 r/min. The LC-8008 acquisition data were set to an analysis frequency of 2 kHz and 1024 sampling points. Tooth surface wear 15 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 5.1. Experiment of Vibration of Faulty Helical Gear Pairs and Normal Helical Gear Pairs This experiment was used to investigate the difference in vibration between the normal helical gear pair and the faulty helical gear pair under the same conditions. Figure 7 is the acceleration peak change curve of the driving shaft and the driven shaft of the normal helical gear pair and the faulty helical gear pair during the speed- increasing process. Figure 7. Curves of the amplitude at various speeds of a normal helical gear pair and a faulty helical gear pair under a rigid support From Fig. 7, it can be observed that the vibrations of the faulty helical gear and the normal helical gear during transmission are very different, and the vibration difference at 300 r/min to 600 r/min is small, which indicates the helical gear transmission. With high stability, but with the continuous increase in speed, the faulty helical gear has a larger vibration, and the normal gear is basically maintained in a small range of vibration and is more stable. Although the smoothness of the helical gear transmission is good, the vibration value increases at high speeds. 5.2. Experiment of ISFD in Damping Test of Faulty Helical Gear pair In Experiment 5.1, it was found that the vibration amplitude of the faulty gear increases rapidly when the speed is increased. Therefore, the following experiment was conducted to perform the vibration reduction experiment: replace the rigid sleeve for ISFD to repeat the experiment, record the data, and compare the faulty helical gear pair in the rigid sleeve Support and ISFD support. The vibration situation, as shown in Fig. 8 for the faulty helical gear pair of the DRIVING shaft and the driven shaft in the process of increasing the peak acceleration curve. Figure 8. Acceleration curve of amplitude at various speeds with ISFD and rigid support of the faulty helical gear 16 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management From the experimental results shown in Fig. 8, the vibration values of the driving shaft and the driven shaft of the rigid sleeve support and the ISFD were reduced compared with those at the respective rotational speeds, and the highest reductions were 36.3% and 52.7%, respectively. From this, it can be concluded that the squeeze film damper can have a better damping effect on the faulty helical gear shaft. The 1200-r/min time domain signal extraction was analyzed. Figure 9 shows the time domain signal diagram of the 1200r/min driving shaft and driven shaft. The time domain signal was converted into a frequency domain signal by fast Fourier transform, as shown in Figure 10. Figure 9. Comparison of two support time domain waveforms for the driving and driven shaft at 1200 r/min Figure 10. Comparison of two support time domain waveforms for the driving and driven shaft at 1200 r/min From the time domain waveforms in Fig. 9, it is seen that, under the condition of a rigid sleeve, the periodic phenomenon is not obvious, and a high amplitude appears, mostly during the meshing of helical gears. Impact vibration resulted in waveform modulation. From the structural parameters of the helical gear and the analyzed speed, the meshing frequency of the helical gear pair is found to be f = f1Z1 = f2Z2 = 640 Hz. In the case of the ISFD, the vibration phenomenon of the faulty helical gear pair was significantly improved. The maximum peak value of the driving shaft was reduced from 30 m/s2 to 20 m/s2, the reduction rate was 33.3%, and the maximum peak value of the driven shaft decreased by 60 m/s2. To 30 m/s2, the drop is 50%. In Fig. 10 show that the frequency components are complex, the vibration value is relatively large near the meshing frequency, the frequency band is relatively wide, and a large vibration occurs near 765 Hz. This situation is due to frequency resonance modulation of the basic and gear shafts. According to Fig. 11 and the analysis, the vibration amplitude 17 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management of the above-mentioned characteristic frequency is extracted, and the vibration reduction effects of the rigid sleeve support and the ISFD of the faulty helical gear are compared, as shown in Table 1. Table 1: Vibration amplitude of faulty helical gear pairs under different supports Frequency (Hz) Acceleration amplitude of driving shaft measuring point (m/s2) Acceleration amplitude of driven shaft measuring point (m/s2) Rigid support ISFD support Decreasing amplitude (%) Rigid support ISFD support Decreasing amplitude (%) 640 0.42 0.11 73.8 1.52 0.8 47.3 765 4.75 0.53 88.8 4.66 0.56 88 Table 1 shows the vibration attenuation at the measurement points of the active and driven shaft at frequencies of 640 Hz and 765 Hz. The ISFD has a good damping effect on the faulty helical gear pair, which can significantly improve the resonance modulation phenomenon caused by the characteristic frequency of the faulty helical gear. 5.3. Experiment of ISFD in Damping Test of Normal Helical Gear pair In the same way, the experiment bench was used to perform an extensive experiment on the normal helical gear pair. As shown in Fig. 11, the amplitude of the driving shaft and the driven shaft of the normal helical gear pair. Figure 11. Acceleration curve of amplitude at various speeds with ISFD and rigid support of the normal helical gear Figure11 shows that the vibration values of the driving shaft and driven shaft of the rigid sleeve support and the ISFD are lower than those of the ISFD at various rotation speeds, with the highest reductions reaching 65.8% and 62.9%, respectively. The squeeze film damper can have a good damping effect on the normal helical gear shaft.Table 3 shows the vibration attenuation at the measuring points of the driving and driven shaft near the three frequencies. The ISFD has a good damping effect on the normal helical gear pair and is insensitive to the type of frequency. It can achieve a good damping effect and can greatly attenuate the modulation phenomenon. Table 3. Vibration attenuation of different normal helical gear pairs under different supports 18 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Frequency (Hz) Acceleration amplitude of driving shaft measuring point (m/s2) Acceleration amplitude of driven shaft measuring point (m/s2) Rigid support ISFD support Decreasing amplitude (%) Rigid support ISFD support Decreasing amplitude (%) 500 0.12 0.043 64.2 0.71 0.1 85.9 640 0.064 0.015 76.5 0.22 0.03 86.3 1000 0.13 0.04 69.2 0.22 0.06 72.7 6. CONCLUSION The above experiments showed that ISFD can effectively reduce the transmission of helical gear meshing excitation, and the following conclusions were made: (1) The helical gear has stability. Under the same conditions, the vibration of the helical gear with low-speed rotation is not much different from the vibration of the normal helical gear, but the amplitude of the faulty helical gear rises significantly after higher-speed rotation. (2) In the process of increasing the speed, the ISFD can produce damping effects on both the faulty helical gear and the normal helical gear. (3) ISFD can significantly reduce the modulation of the helical gear coupling system. (4) ISFD has the function of frequency shifting to the characteristic frequency and can effectively avoid the resonance region. (5) ISFD is not sensitive to frequency, and it can reduce the vibration of the characteristic frequency in a wide frequency band and achieve good results. REFERENCES [1] Liu, F.H., Jiang, H.J., Zhang, L.. Analysis of vibration characteristic for helical gear under hydrodynamic conditions[J]. Advances in Mechanical Engineering, 2017, 9(1):1-9. [2] Ding, K., Zhu, X.Y, Chen, Y.H.. Typical fault vibration characteristics and diagnosis strategy of gearbox[J]. Journal of Vibration and Shock, 2001, 20(3):7-12. [3] PG Parker. Dynamic modeling and analysis of tooth profile modification for multimesh gear vibration [J]. Journal of Mechanical Design, 2008, 130(12):1500-8. [4] RJ Wojcikowski. Gear wheel with vibration damping rings[J]. Acoustical Society of America, 1982, 72(6):2054. [5] Wang Guozhi, Sun Peilin, Fang Kaixiang. Research on Application of Damping Vibration Reduction Technology in Transmission Gear Device[J]. Journal of Jiangsu University of Science and Technology, 2009,8(3):14-20. [6] Huang, X.J., He, L.D., Xia, X.R.. Experimental Study on Vibration Suppression of Gear Shaft Systems with Viscous Damper Installation of Driven Shafts[J]. Mechatronic Engineering, 2014, 31(12):1551-1554. [7] E Guglielmino, KA Edge. A controlled friction damper for vehicle applications[J]. J Control Engineering Practice, 2004, 12(4):431- 443. [8] Chen C S, Natsiavas S, Nelson H D. Coupled lateral-torsional vibration of a gear-pair system supported by a squeeze film damper[J]. Journal of Vibration and Acoustics, 1998, 120(4):860-867. [9] Chen C S, Natsiavas S, Nelson H D. Stability analysis and complex dynamics of a gear-pair system supported by a squeeze film damper[J]. Journal of Vibration and Acoustics, 1997,119(1):85-88. [10] K Gjika, C Groves, LS AndréS, GD Larue. Nonlinear Dynamic Behavior of Turbocharger Rotor-Bearing Systems with Hydrodynamic Oil Film and Squeeze Film Damper in Series: Prediction and Experiment[J]. Journal of Computational and Nonlinear Dynamics, 2010,5(4):2040-2049. [11] Zhu, S.J., Lou, J.J., He, Q.W.. Vibration Theory and Vibration Isolation Technology [M]. Beijing: National Defense Industry Press, 2006. [12] Chang, L.H.. Research on the General Modeling Method and Dynamic Incentive Influence Law of Parallel Shaft Gear Transmission System Dynamics[D]. Northwestern Polytechnical University, 2014 [13] J Vance, F Zeidan, B Murphy. Machinery Vibration and Rotordynamics[M]. Untied States:Wiley, 2010. [14] Cai-Wan, Chang-Jian. Bifurcation and chaos analysis of the porous squeeze film damper mounted gear-bearing system[J]. Computers and Mathematics with Applications, 2012, 64(64):798-812. [15] Santiago D O, Andrés S L. Imbalance response and damping force coefficients of a rotor supported on end sealed Integral Squeeze Film Dampers[C]. International Gas Turbine and Aeroengine Congress and Exhibition. Indianapolis, USA, 1999:1-6. 19 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa and Diagnostic Engineering Management Modelling the Vibration Response of a Journal Bearing for Condition Monitoring Osama Hassin1, Jiaojiao Ma2,3, Fengshou Gu3 & Andrew D. Ball3 1University of Gharyan, Gharyan, Libya. 2Hebei University of Technology,China 3University of Huddersfield, Queens gate, Huddersfield, UK ABSTRACT Journal bearings are widely used to undertake heavy dynamic and static loads in many mission- critical rotating machines such as compressors, engines, turbines and centrifugal pumps. Vibration responses are often used for monitoring their healthy conditions. Vibration characteristics due to mechanical problems such as misalignment, looseness and unbalanced forces are sufficiently understood. However, fluid flow related sources such as fluid instable flows and hydrodynamic forces are less known. Especially the interaction between mechanical and hydrodynamic sources is rarely reported. Aimed at promoting fault diagnosis, this paper presents a comprehensive vibration model for studying vibration response from a journal bearing. It takes into account both conventional hydrodynamic effects and the also self-excitations of asperity collisions and fluid churns. This study successfully calculates both natural frequencies and mode shapes by considering fluid and asperity forces, which are validated based on an in-house test system. Keywords: journal bearing, mathematical model, internal and external excitations, natural frequencies and condition monitoring. Corresponding author: Osama Hassin (osamingo@yahoo.com) 1. INTRODUCTION In journal bearings, the vibration response is often caused by mechanical problems such as misalignment, looseness and unbalanced forces. These problems can be easily identified at shaft frequencies and usefully indicate bearing conditions at late lifespan. To assess bearing condition at different operational phases and identify any early signs of wear or degraded lubricants, vibration mechanisms associated with hydrodynamic forces need to be understood sufficiently. Previous studies show that dynamic forces of oil film can make the shaft oscillate violently. Furthermore, asperity contact between the journal and bearing surfaces can cause random vibrations spreading over a wide frequency band. To understand the vibration phenomena in a journal bearing, a vibration model is developed to include variable excitation forces. Unbalance (Fu) creates an external excitation force. When unbalance occurs, the journal fluctuates inside the bearing which causes low frequency correlated to shaft frequency and its harmonics. The hydrodynamic (Fo) generates self-excitation force correlated to oil coefficients. Asperity contact (Fa) produces an internal self- excitation force. This random force causes corresponding high frequency vibration responses. External and internal excitation forces generate vibration of the journal. In Figure 1, brown, blue, green and red arrows present unbalanced forces, fluid forces, asperity churn forces and asperity collision forces, respectively. The equations for small amplitude motions of the rotor-bearing system in X direction and Y direction are [1, 2]. 20 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa and Diagnostic Engineering Management   Static Force e Viscous Friction Force Unbalance Forces ox oy xF y 0 x F F ux uy F m F                          (1) where m is the mass of the shaft, oxF and oyF are oil film forces at x and y directions, eF is static load at x direction only, uxF and uyF are unbalanced forces at x and y direction. By adding asperity collision forces axF and ayF , Equation (1) becomes:    Viscous Friction Force Asperity Friction Force Unbalance Forces ox ax o Stat y ay ic Force exF F F Fx F 0y ux uy F m F                                 (2) Shaft Fo Fluid; Fa Asperity Collision; Fp Asperity Churns Forces Fu Unbalance ForceStatic Radial Force Fr t r Ob Os θ ϕ Figure 1. Free body diagram of journal bearing 2. EXTERNAL EXCITATION Of different external excitations, unbalanced force is the most common one. As illustrated in Figure 2, unbalance can occur when the journal centre mass is not at the centre of the journal Oj. ux 2 uy F cos( ωt ) m ω F sin( ωt )            (3) Figure 2. Vibration caused by external excitation and out of shaft roundness [3] 21 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa and Diagnostic Engineering Management 3. INTERNAL EXCITATION 3.1 Fluid stiffness and damping coefficients Fluid-film forces may be expanded in a Taylor series about the static equilibrium position[2]. x x0 xx xy xx xy y y0 yx yy yx yy F F K x K y C x C y F F K x K y C x C y                       (4) Fluid film bearing stiffness ijK , ij X ,Y and damping ijC , ij X ,Y force coefficients are defined as i ij j F K x   iij j F C ; i, j x, y x    (5) For example, xy xK F y/   corresponds to a stiffness produced by a fluid force in the X direction due to a journal static displacement in the Y direction. By definition, this coefficient is evaluated at the equilibrium position with other journal centre displacements and velocities set to zero [4]. 3.2 Asperity collision excitation A journal bearing is designed to operate under the hydrodynamic regime. Unfortunately, it often operates under mixed or boundary regimes due to abnormal operating conditions during transient operations and fault cases like oil leakages, oil degradations and worn surfaces [5], etc. In these two regimes asperity collisions occur and result in increased vibration. Figure 3 illustrates that asperity contact can be expressed as a source of vibrations in journal bearing. It shows that one of the asperity pairs has a larger degree of collisions, which produces higher transient vibrations; whereas the smaller contact produces lower transient responses. The stiffness of a single asperity contact pair is built and calculated based on shoulder-shoulder asperity contact [6], as shown in Figure 4. Figure 3. Asperity deformation caused by collisions [3] 22 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa and Diagnostic Engineering Management Figure 4. Vibration caused by asperity collisions [3, 6] 3.3 Asperity churn excitation For high speeds and high oil viscosities but low radial loads, a bearing operates under hydrodynamic lubricant regime in which the bearing surfaces are separated by oil film and under this condition the wear is minimal. However, the fluid shear stress is high and consequently can cause high friction in the forms of heat generation and vibration. This type of vibration can be understood to be the effect of fluid shear stress which causes an alternation between surface asperity deformation and reclamation because the shear stress fields are not uniform i.e. random turbulent close to bearing surfaces [5]. To simplify the modelling of asperity churns, all the asperities are assumed to be as a cantilever beam. Figure 5 shows the vibration modes and maximum deflections of asperities while the oil flows at zone load. Figure 5 Asperity churns [3] In the mixed lubrication regime, vibration is caused mainly by asperity-asperity collisions and asperity churns. The total elastic energy of asperities is gathering asperity collisions and asperity churns [3]. total asperity churn asperity collisionsU U U (6) 23 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa and Diagnostic Engineering Management 4. EQUATION OF MOTION In designed operating conditions, mainly there is not any asperity contact between the journal spins and the bearing because the oil film completely separates their surfaces. By decreasing the oil film thickness, metal to metal contact occurs and asperity stiffness affects the motion. Shaft bx Kyx CyxKxy Cxy Kxx Cyy Kyy Cxx (a) Shaft sx bx sy Ka-i (b) yy s bk ( y y ) yy s bc ( y y )  xx s bk ( x x ) xx s bc ( x x )  n a i s b i 1 k ( x x )   ah b hk ( x x ) hh xc hh xk ah s bk ( y y ) hh yc  hh yk ss xm  ss ym  bb xm  hh xm  bb ym  hhym  r uxF ( t ) F ( t ) (c) Figure 6. Vibration model. a) Considers fluid coefficients (hydrodynamic regimes). b) Considers fluid coefficients and asperity contact (boundary regimes). c) Free body diagram of a journal bearing 1 s s xx s b xx s b xy s b xy s b n n a _ i s b a _ i s b r ux i 1 i 1 m x c ( x x ) k ( x x ) c (y y ) k (y y ) c ( x x ) k ( x x ) F ( t ) F ( t )                         (7) 2 b b xx s b xx s b xy s b xy s b n n a _i s b a _i s b xah b h xah b h i 1 i 1 m x c ( x x ) k ( x x ) c (y y ) k (y y ) c ( x x ) k ( x x ) c ( x x ) k ( x x ) 0                              (8) 3 h h xah b h xah b h h h h hm x c ( x x ) k ( x x ) c x k x 0          (9) 4 s s yy s b yy s b yx s b yx s b n n a _i s b a _i s b uy i 1 i 1 m y c (y y ) k (y y ) c (x x ) k (x x ) c (y y ) k (y y ) F ( t )                        (10) 5 b b yy s b yy s b yx s b yx s b n n a _i s b a _i s b yah b h yah b h i 1 i 1 m y c (y y ) k (y y ) c (x x ) k (x x ) c (y y ) k (y y ) c (y y ) k (y y ) 0                              (11) 6 h h yah b h yah b h h h h hm y c (y y ) k (y y ) k y c y 0          (12) where sm , bm and hm are masses of shaft, bearing and housing, xxk , yyk , xyk and yxk are fluid film bearing stiffness coefficients, xxc , yyc , xyc and yxc are damping force coefficients of fluid film, a _ ik and a _ ic are stiffness and damping coefficients of asperity collisions between bearing and shaft, xahk and xahc are stiffness and damping coefficients of asperities between bearing and its house, and hk and hc are house stiffness and damping coefficients. State space has been used to estimate the linear system, and its equations are presents as 24 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa and Diagnostic Engineering Management ( ) ( ) ( ) ( ) ( ) ( ) x t Ax t Bu t y t Cx t Du t      1 1 1 n n n 1 1 1 n n n x x u A B x x u y x u C D y x u                                                                       1 1 0 I A [ K( M )] [C( M )]               1 0 0 B 0 M            In x direction, 1 sx x , 2 bx x and 3 hx x are positions of shaft, bearing and house masses, respectively. And 7 sx x  , 8 bx x  and 9 hx x  are velocities of shaft, bearing and house, respectively. In y direction, 4 sx y , 5 bx y and 6 hx y are positions of shaft, bearing and house masses, respectively. And 10 sx y  , 11 bx y  and 12 hx y  are velocities of shaft, bearing and house masses, respectively. 5. MODEL RESULTS Based on journal bearing parameters and state space technique, eigenvalues have been calculated that determine the nature frequencies of the system. As the result of Eigen analysis, the natural frequencies of the shaft and the bearing are around 6 kHz and 13 kHz. The result bellow, in Figure 7, presents vibration response of a journal bearing under multiple operating conditions, including speed 1500 rpm, six different radial loads (1, 5, 10, 20, 40 bar) and three different oil viscosities (5, 15, 37). The vibration model has been simplified based on the value of eccentricity ration. At low eccentricity ratio, the excitation force is only correlated to fluid. At high eccentricity ratio, the excitations become more significant and correlation to both fluid and asperity forces, when the oil film thickness becomes much smaller than the size of the surface roughness. 25 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa and Diagnostic Engineering Management Figure 7. Natural frequency modes of multiple operating conditions Figure 8 presents six modes, each mode shows the vibration shapes of the shaft, bearing and house masses in X and Y axis. This also presents the nature mode of vibration under different loads and oil viscosities. Figure 9 presents natural frequency for each mode under different operating conditions. Natural frequencies increase while the load increase but oil viscosity decrease. Figure 8. Natural modes of vibration Figure 9. Natural frequency Vs radial load for three different oil viscosities 6. EXPERIMENTAL RESULTS The test rig of journal bearing consisted two self-aligning spherical journal bearings (SA35M), electrical motor and load system exerted radial load on the shaft supported between bearings. Also measurement instrumentation transducers were mounted. Four accelerometer sensors are fixed to X-m1 X-m2 X-m3 Y-m1 Y-m2 Y-m3 Cordinates 0 0.2 0.4 0.6 0.8 1 M ag ni tu de Mode 1 56.493Hz - 1bar / oil5 363.17Hz - 5bar / oil5 524.79Hz - 10bar / oil5 743.68Hz - 20bar / oil5 1045Hz - 40bar / oil5 170.38Hz - 1bar / oil15 262.34Hz - 5bar / oil15 488.25Hz - 10bar / oil15 735.41Hz - 20bar / oil15 1045.6Hz - 40bar / oil15 303.84Hz - 1bar / oil37 174.55Hz - 5bar / oil37 310.09Hz - 10bar / oil37 656.13Hz - 20bar / oil37 1028.5Hz - 40bar / oil37 X-m1 X-m2 X-m3 Y-m1 Y-m2 Y-m3 Cordinates 0 0.2 0.4 0.6 0.8 1 Mode 2 229.3Hz - 1bar / oil5 589.17Hz - 5bar / oil5 1006Hz - 10bar / oil5 1713.2Hz - 20bar / oil5 2833.6Hz - 40bar / oil5 274.58Hz - 1bar / oil15 502.84Hz - 5bar / oil15 760.13Hz - 10bar / oil15 1263.8Hz - 20bar / oil15 2136.1Hz - 40bar / oil15 369.52Hz - 1bar / oil37 534.5Hz - 5bar / oil37 711.89Hz - 10bar / oil37 1037.2Hz - 20bar / oil37 1681.6Hz - 40bar / oil37 X-m1 X-m2 X-m3 Y-m1 Y-m2 Y-m3 Cordinates 0 0.2 0.4 0.6 0.8 1 M ag ni tu de Mode 3 5735.8Hz - 1bar / oil5 5737.7Hz - 5bar / oil5 5739.9Hz - 10bar / oil5 5744.1Hz - 20bar / oil5 5752.6Hz - 40bar / oil5 5735.3Hz - 1bar / oil15 5736.8Hz - 5bar / oil15 5739.3Hz - 10bar / oil15 5743.9Hz - 20bar / oil15 5752.6Hz - 40bar / oil15 5734.4Hz - 1bar / oil37 5735.3Hz - 5bar / oil37 5737.2Hz - 10bar / oil37 5742.2Hz - 20bar / oil37 5752Hz - 40bar / oil37 X-m1 X-m2 X-m3 Y-m1 Y-m2 Y-m3 Cordinates 0 0.2 0.4 0.6 0.8 1 Mode 4 5736.5Hz - 1bar / oil5 5741Hz - 5bar / oil5 5751.3Hz - 10bar / oil5 5784.2Hz - 20bar / oil5 5899.6Hz - 40bar / oil5 5736.9Hz - 1bar / oil15 5739.5Hz - 5bar / oil15 5744.5Hz - 10bar / oil15 5760.8Hz - 20bar / oil15 5815.9Hz - 40bar / oil15 5737.8Hz - 1bar / oil37 5740Hz - 5bar / oil37 5743.4Hz - 10bar / oil37 5752.3Hz - 20bar / oil37 5782.2Hz - 40bar / oil37 X-m1 X-m2 X-m3 Y-m1 Y-m2 Y-m3 Cordinates 0 0.2 0.4 0.6 0.8 1 M ag ni tu de Mode 5 14329Hz - 1bar / oil5 14331Hz - 5bar / oil5 14334Hz - 10bar / oil5 14338Hz - 20bar / oil5 14347Hz - 40bar / oil5 14328Hz - 1bar / oil15 14330Hz - 5bar / oil15 14333Hz - 10bar / oil15 14338Hz - 20bar / oil15 14347Hz - 40bar / oil15 14327Hz - 1bar / oil37 14328Hz - 5bar / oil37 14331Hz - 10bar / oil37 14336Hz - 20bar / oil37 14347Hz - 40bar / oil37 X-m1 X-m2 X-m3 Y-m1 Y-m2 Y-m3 Cordinates 0 0.2 0.4 0.6 0.8 1 Mode 6 14330Hz - 1bar / oil5 14335Hz - 5bar / oil5 14346Hz - 10bar / oil5 14380Hz - 20bar / oil5 14480Hz - 40bar / oil5 14330Hz - 1bar / oil15 14333Hz - 5bar / oil15 14339Hz - 10bar / oil15 14356Hz - 20bar / oil15 14410Hz - 40bar / oil15 14331Hz - 1bar / oil37 14334Hz - 5bar / oil37 14337Hz - 10bar / oil37 14347Hz - 20bar / oil37 14378Hz - 40bar / oil37 26 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa and Diagnostic Engineering Management collect the vibration signals. And, an encoder and a pressure sensor are placed to measure the output rotating speed and radial load, respectively. The journal bearing has been tested to validate the vibration response model. The experiments were carried out to test the spectrum of three different lubrication types under different radial loads at 1500 rpm rotating speed. Figure 10. Self-aligning journal bearing companies and test rig [5] Figure 11 presents spectrums of all cases of experiments. The spectrum in the frequency domain of different lubrications can be regarded to have three sub-bands: 3.5kHz-5.5kHz, 5.5kHz-7.5kHz and 7.5kHz-11kHz, according to the distinction of the spectral amplitudes. In each band, spectrum exhibit continuous profiles due to the random excitations of asperity churns and collisions. In addition, these bands are magnified by the nonlinear transfer paths of structural resonances, showing that they can be validated by mathematical model. Moreover, the frequency amplitude in each band increases with the increasing radial loads. This change clearly agrees with the vibration response model results in Figure 9. Figure 11. Experimental spectrums of three lubrications (5, 15, 37) under different radial loads 7. CONCLUSION A mathematical model is established to carry out an in-depth bearing vibration responses. This model considers not only conventional fluid forces but also asperity collisions and churn effects, allowing explaining high frequency responses of the lubricated journal bearing to be the effect of micro DE-Journal bearing Pressure sensor Accelerometer Encode Radial load system 27 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa and Diagnostic Engineering Management asperity elastic deformations. The models with six degrees of freedom are solved by using the state space technique, each eigenvalue and volume of a certain mass in one direction present natural and shape modes. The natural frequencies of the system are found to be around 6 kHz and 13 kHz. Additionally, as a result of asperity collisions which increase with higher radial loads or lower viscous oil, the natural frequencies of the journal bearing can shift higher. An experimental comparison was made by vibration signal spectrums with different oil types and different radial loads. It has found that the structural resonances in frequency range bands can reflect the excitations of asperity churns and asperity collisions, and the vibration amplitudes exhibit significant increase. These results are validated based on lab experimental results. REFERENCES [1] Singhal, S., A Simplified Thermohydrodynamic Stability Analysis of the Plain Cylindrical Hydrodynamic Journal Bearings. 2004, Louisiana State University. [2] Javorova, J., B. Sovilj, and I. Sovilj-Nikic, On the derivation of dynamic force coefficients in fluid film bearings. 2009, Monograph of FTS “Machine design”, Novi Sad. [3] Hassin, O., Condition Monitoring of Journal Bearings for Predictive Maintenance Management Based on High Frequency Vibration Analysis. 2017, Huddersfield. [4] Andres, L.S., Hydrodynamic fluid film bearings and their effect on the stability of rotating machinery. 2006, DTIC Document. [5] Hassin, Osama, Yao, Aiying, Wei, Nasha, Gu, Fengshou and Ball, Andrew. Monitoring Oil Levels Of Journal Bearings Based On The Analysis Of Vibration Signals. 2016. [6] Bin Zhao, Song Zhang, Jia Man, Qing Zhang and Yan Chen. A modified normal contact stiffness model considering effect of surface topography. Proceedings of the Institution of Mechanical Engineers, Part J: Journal of Engineering Tribology, 2014: p. 1350650114558099. 28 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management A Study of the Influence of Time-varying Meshing Stiffness on Dynamic Response in Gear Transmission Systems Yinghui Liu1, Chen Zhang1, Zuolu Wang1, Dong Zhen1,*, Hao Zhang1, Zhanqun Shi1 1. School of Mechanical Engineering, Hebei University of Technology, Tianjin, 300140, P.R.China ABSTRACT Broken teeth are one of the main faults in the gear transmission system. The existing models could not accurately reflect the dynamic features of broken teeth in the fault diagnosis of gear transmission system. When broken teeth appear, the broken teeth will influence on the time-varying meshing stiffness. Therefore, optimizing the parameter of dynamics model system can improve the accuracy of the fault feature extraction for the fault diagnosis of gear transmission system. This paper mainly studies the influences of broken teeth on the time-varying meshing stiffness. Through optimizing the time-varying mesh stiffness, a gear mechanism system dynamic model is built for the simulation analysis. It aims to validate the optimized parameter setting method can more accurately reflect the characteristics of the gear fault. The analysis results show that when the failure of the broken tooth occurs, the time-varying meshing stiffness presents instantaneous shock, which is the function of the meshing frequency, and is also related to the size of the broken tooth. By studying the time-varying meshing stiffness of the broken teeth, it can provide more reliable fault information for fault diagnosis of gear transmission system. Key words: broken gear teeth; time-varying stiffness; dynamical model; gear-mesh frequency; fault diagnosis 1. Introduction It is well known that the undulation of the time-varying mesh stiffness is one of the main motivations that cause unwished vibration and noise in gear transmission systems. The influence of meshing stiffness on vibration has been analyzed in previous study for different types of gear system. And the influence of these factors on the vibration characteristics of the system is also studied when the nonlinearity associated with radial, dynamic backlash and time-varying meshing stiffness considered simultaneously. When the gears fail, such as broken teeth, the vibration of the system will aggravate and it will also affect the time-varying meshing stiffness. Therefore, in order to better diagnose the gear failure, it is necessary to study the influence rule on the vibration characteristics of the system at different speeds when the teeth are broken based on the consideration of time-varying meshing stiffness. The study of the time-varying meshing stiffness is an important research field in gear system with gear failure. Many researchers have conducted various mathematical methods to calculate the time-varying meshing stiffness, and these models can be classified into four categories: finite element (FE) method, analytical method (AM), analytical -FE method and experimental method [1]. Based on the model of the time-varying meshing stiffness of the health gear system established by Yang and Lin, many time-varying meshing stiffness models for fault gear system are optimized. For the time-varying meshing stiffness of gear pair with tooth faults like root crack, spalling, breakage and pitting, Chaari et al. [2] revealed the influences of spalling and breakage on mesh stiffness and dynamic response of spur gear transmission by potential energy method. Saxena and Parey [3] revealed the influences of spalling and friction on the time-varying meshing stiffness of spur gear. Kahraman et al.[4,5] put forward a gear system nonlinear dynamic model for the first time ,which the backlash and radial clearance were both considered, and then the model was validated by experiment, and analyzed the characteristics of the dynamic response of the gear system in the meantime. Byrtus et al. [6]. studied the influence of clearance and time-varying meshing stiffness on the vibration characteristics of the gear system. Suohuai Zhang et al. [7,8] analyzed the vibration characteristics of the gear system by using the multi-gap gear model. Yahui Cui et al. [9,10] studied the influence rule of backlash and time-varying meshing stiffness on the gear rotor system by using the experimental facility with adjustable backlash. Later, Huibo Zhang et al. [11,12] established the model of radial clearance of bearing by using the two-state model, proposed the modeling method of dynamic backlash and dynamic meshing force, and took the influence of time-varying meshing stiffness into consideration. 29 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management In order to better simulate the actual state of gear system and provide more reliable theoretical support for the fault diagnosis of gear transmission system, it is necessary to study the vibration characteristics of gear transmission system when there is a broken-tooth gear. When the gear has broken teeth, it will have a great influence on the time-varying meshing stiffness function of the gear. Because the changing of radial clearance of the bearing will affect the backlash, it is necessary to build the radial clearance model. In this study, the radial clearance model of bearing is established by using the two-state model, at the same time, the broken tooth fault is added in the model and the influence of dynamic backlash is considered. 2.Multi-gap fault gear - rotor system dynamics model In the multi-clearance gear rotor system, the clearance mainly includes the backlash of gears and radial clearance of the bearing. In the actual gear transmission system, these gaps are inevitable, which can help lubricate gears and bearings, and also change the topology of the system, thus changing the vibration state of the gear system. In the actual gear transmission system, the broken teeth are a main fault, so it is necessary to study the vibration characteristics of the dynamic system when the gear tooth is broken. In this paper, both of the gear tooth failure and multi-clearance are considered. Firstly, a two-state model is used to establish the radial clearance model of the bearing and the dynamic backlash model of the gear. Finally, the dynamic model of gear rotor system considering gear tooth broken, backlash, radial clearance and time-varying meshing stiffness is established. 2.1 Bearing radial clearance model The radial clearance of the bearing is the movement gap in the radial direction of the bearing as shown in fig. 1. In the model, it is simplified as shaft and sleeve. Ob is the center of the sleeve. Or is the center of the shaft. ebr represents the vector from Ob to Or . Supposing the radial clearance between the shaft and the sleeve is  r . When the gap exists, there are two states of contact and separation between the shaft and the sleeve. The radial clearance function is introduced to describe the current state of the system :            rbr rbr rbrrbr brr e e ee ef ||0 ||0 |||| ,....................................................................(1) In the formula (1),ebr is the radial displacement between the shaft center and the sleeve center. If ebr is within the range of  rbre || , there is a free movement between the shaft and the sleeve; If ebr is within the range of  rbre || , there is a critical contact between the shaft and the sleeve. If ebr is within the range of  rbre || , there is a collision force between the shaft and the sleeve[11]. 2.2 Dynamic backlash model Or ob  r ebr Figure 1. The bearing radial clearance model sleeve shaft 30 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The backlash is the gap between the two gears in the direction of the meshing line when the gears mesh. When there is a radial clearance, it causes the radial run-out of the gear shaft, and the backlash will change accordingly as shown in fig. 2. When the two gears mesh at point P, the distance between M and N at the direction of the meshing line is the dynamic backlash. According to the geometrical relationship of involute, the dynamic backlash can be expressed as: bss Z R b gp p p t 0 ' cos 2       ,...........................................................(2) In the formula (2),R p represents the radius of the active gear pitch circle. Z p represents the number of the active gear tooth. S p and S g represent the pitch circle tooth thickness of the active gear and driven gear respectively. b0 is the initial backlash.  ' is the actual mesh Angle and can be expressed as:      00' cosarccos AAR ,.....................................................................(3) In the formula (3), A0 represents ideal center distance for gear standard meshing. AR is the actual center distance when there is a radial beat.  0 is the presdsure angle for gear standard meshing. Combining (2) with (3), the dynamic backlash can be written as: bAb invinvtt 00 ' 00 ))()()(cos(2)(   ,.................................................(4) In the formula (4), inv is the involute function which can be expressed as  ''' tan inv and  000 tan inv . The relative displacement of the gear along the meshing line can be defined as:  '''' cossin)(  eeRRg brybrxggPPt t ,................................................(5) In the formula (5), R p ' is the actual radius of the active gear pitch circle.  p is the active gear rotation angle. Rg' is the actual radius of the driven gear pitch circle.  g is the driven gear rotation angle. ebrx is the component on the X- axis of ebr . ebry is the component on the Y-axis of ebr . Therefore, combining (4) and (5), the function of the backlash can be expressed as: Figure 2. The backlash model 31 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management )()( 0)()( 0)( )()( 0 )( )( tt tt t tt t t bg gb g bg g f tt tt t tt t t         ,....................................................(6) If )(tg t is within the range of 0)( tg t , the two gears are in the contact state of the teeth surface, belonging to normal meshing. If )(tg t is within the range of 0)()(  tt gb tt , the two gears are in the separate state of the teeth surface. If )(tg t is within the range of )()( tt bg tt  , the two gears are in the contact state of the back surface of the teeth[12]. When the contact ratio is less than 2, the meshing cycle of the gears contains two teeth meshing period and single tooth meshing period. The relationship between  d and  m can be defined as:  md / ,...........................................................................(7) In the formula (7),  d is the two teeth meshing period and  m is the whole the meshing cycle of the gears. When time is contained between 0 and  m , the gears keep two teeth meshing. When time is contained between  mand m ,the gears keep single teeth meshing [13]. The relationship between  and the contact ratio  can be defined as: 1  ,............................................................................(8) The ratio of the time of the gear fracture part in the double tooth meshing period can be written as: )1( 2222 1   p rrrr b bxbai ,...........................................................(9) In the formula, ra1 is the tooth tip radius of active gear. rb is base circle radius of active gear. r x is broken tooth effective circle radius of active gear. p b is the normal pitch.  is gear coincide degree. Therefore, the time of gear fracture part in the double tooth meshing period can be calculated by: i dit  ,...........................................................................(10) In the process of meshing, the dynamic meshing force can be expressed as : )()()()( tgfKF tCttt ttt   ,........................................................(11) In the formula (11), )(tK t is time-varying meshing stiffness. C is meshing damping. )(t tg  is the relative velocity along the meshing direction. Time-varying meshing stiffness changes along time and when there is broken teeth, )(tK t can be calculated by:                    n i n inam nn i n nam t tt ttt t ttKK tKK K 2)1(2)1( ))(cos( 22 , 2)1( 0)cos( )( ,...................................(12) 32 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management In the formula, K m is average meshing stiffness. K m' is time-varying meshing stiffness amplitude.n is meshing frequency.  is meshing phase angle.  is the contact ratio. 2.3 Dynamic model of multi-gap gear-rotor system. The dynamic model of multi-gap coupling gear rotor system was established by using the radial clearance model and dynamic backlash model. In the model, the active gear is connected with the motor without considering its radial run out. The function of the dynamic model of the gear rotor system can be expressed as:                             0))(( 0))(( )( )( ' ' )()()( )()()(     yrbrg n ryry xrbrg n rxrx ggt ttt t tttp CefKm CefKm TIRtgCfK TRtgCfKI g g ggtgt PPtgtp ,................................................(13) In the formula (13), pand  g respectively represent the angular displacement of the active and driven gears. I p and I g respectively represent the moment of inertia of the active and driven gears. T p is the driving torque. T q is the load torque. )(tK t is time-varying meshing stiffness. C is meshing damping.  x  and  y  are the acceleration component of X and Y in the global coordinate system of the radial acceleration. mg is the driven gear mass.  x  and  y  are the speed component of X and Y in the global coordinate system of the radial speed. K r is the contact stiffness between shaft and sleeve. N is force index. C r is nonlinear damping between shaft and sleeve. 3. Analysis of vibration characteristics of the gear-rotor system. Because of the existence of the radial clearance and the backlash, the gear rotor system will produce radial vibration and torsional vibration. The torsional vibration is caused by the meshing of the gears which is related to the gear speed and the number of the gear teeth. The radial vibration is due to the collision between the shaft and the sleeve that exist the radial clearance between them. In this model, a broken tooth is added and the time-varying meshing stiffness is adopted to characterize the torsional vibration. The gear parameters used in the model is presented in table 1. Table 1:Gear parameters name value name value Active gear teeth 20 Active gear mass(kg) 0.784 Driven gear teeth 40 Driven gear mass(kg) 3.137 Module of gear(mm) 4 Driven gear moment of inertia(kg*m2) 0.00339 Gear pressure angle(°) 20 Active gear moment of inertia(kg*m2) 0.05123 3.1 The influence of gear speed on vibration 33 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management According to the above parameters, the dynamic model of gear rotor system with multi-gap coupling was established. But the dynamic equation is a high dimensional nonlinear equation. Therefore, the dynamic equation is solved by Newmark method. In this model, the radial clearance is 1mm, and the backlash is 0.16mm. When the active gear speed is 60r/min, 90r/min and 120r/min, the vibration acceleration spectrum of the driven gear is shown in figure 3. In the vibration acceleration spectrum of the driven gear, the x-coordinate is the frequency, and the y-coordinate is the acceleration amplitude. It can be seen from the figure that the spectrum has two peaks at every speed. The first peak that caused by radial vibration of the bearing is the same. The peak f1 which appears at the speed of 60r/min is due to the broken tooth, and the corresponding frequency is 3.05 Hz. The peak f2 which appears at the speed of 90r/min is due to the broken tooth, and the corresponding frequency is 4.88 Hz. The peak f3 which appears at the speed of 120r/min is also due to the broken tooth, and the corresponding frequency is 6.71 Hz. Therefore, the frequency caused by the broken tooth is approximately proportional to the speed of the gear. But because of the low frequency resolution, the frequency and speed are not strictly proportional. In terms of numerical value of the spectrum, the amplitude of radial vibration is much larger than the vibration caused by meshing and the vibration caused by the broken tooth. The vibration caused by the broken tooth is great larger than that of the meshing vibration. Therefore, the meshing frequency and its octave are not recognized. It is indicated that the radial clearance of the bearing has the greatest influence on the vibration of the system, and the vibration of the system will increase sharply when the teeth break. And when the speed increases, the frequency caused by the broken tooth would increase correspondingly. 3.2 The influence of time-varying meshing stiffness on vibration In order to study the influence of time-varying meshing stiffness on vibration, keep the active gear speed unchanged, and when the meshing stiffness was increased into 10^7N/m, the vibration spectrum of the driven gear was shown in figure 4. Figure 3. The vibration acceleration spectrum of the driven gear at different speed 34 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management It can be seen from the figure 4 that the spectrum has two peaks at every speed. The first peak that caused by radial vibration of the bearing is the same. The peak f4 is due to the broken tooth at the speed of 60r/min, and the corresponding frequency is 3.05 Hz. The peak f5 is due to the broken tooth at the speed of 90r/min, and the corresponding frequency is 4.88 Hz. The peak f6 is also due to the broken tooth at the speed of 120r/min, and the corresponding frequency is 6.71 Hz. When the meshing stiffness was increased, the frequency caused by the broken tooth is also approximately proportional to the speed of the gear. In terms of numerical value of the spectrum, the amplitude of radial vibration is much larger than the vibration caused by meshing and the vibration caused by the broken tooth. The vibration caused by the broken tooth is great larger than that of the meshing vibration. Therefore, the meshing frequency and its octave are not recognized. It is indicated that the radial clearance of the bearing has the greatest influence on the vibration of the system, and the vibration of the system will increase sharply when the teeth break. And when the speed increases, the frequency caused by the broken tooth would increase proportionally. According to the figure3 and figure4, the acceleration amplitude of the driven gear will decrease when the stiffness increases. 3.3 The comparison of different mesh stiffness at the same speed When the active gear speed is 60r/min and 90r/min, the vibration at different mesh stiffness was compared in figure5 and figure6. Figure 4. The vibration acceleration spectrum of the driven gear at different speed Figure 5. The vibration acceleration spectrum of the driven gear at the speed of 60r/min 35 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management It can be seen from the figure 5 and figure 6 that when the meshing stiffness is increased, the acceleration amplitude of the driven gear will decrease greatly. 4. Conclusion The dynamic model of gear rotor system with multi-gap coupling is established in this study. In addition, the broken tooth fault is added to the system. By changing the time-varying meshing stiffness, reflect the gear meshing condition when the broken tooth exists. Then the relationship between the vibration characteristics of the fault system and the time-varying meshing stiffness is studied. By comparing the simulation results under different gear speed it can be know that there is a relationship between the fault frequency and speed. Also it shows that the time- varying meshing stiffness function is reasonable, but the parameters choice of the model is still exists room for improvement and needs further research. ACKNOWLEDGMENT This research was supported by the National Natural Science Foundation of China (Grant no. 51605133; 51705127), Hebei Science and Technology Projects of China (Grant no. 17394303D) and Hebei Provincial High- level Personnel Funding (Grant no. E2014100015). REFERENCES [1] Hui Ma, Xu Pang, Ranjiao Feng, Jin Zeng, Bangchun Wen.Improved time-varying mesh stiffness model of cracked spur gear[J].Engineering Failure Analysis,2015,55:271-287. [2] Fakher Chaari, Walid Baccar, Mohamed Slim Abbes, et al..Effect of spalling or tooth breakage on gear mesh stiffness and dynamic response of a one-stage spur gear transmission. Eur. J. Mech. A.Solids 2008,27:691-705. [3] Ankur Saxena, Anand Parey, Manoj Chouksey. Time varying mesh stiffness calculation of spur gear pair considering sliding friction and spalling defects. Eng. Fail. Anal. 2016,70:200-211. [4] Kahraman A, Singh R. Non-linear dynamics of a geared rotor-bearing system with multiple clearances[J].Journal of Sound and Vibration, 1991, 144(3):469-506. [5] Blankenship G W, Kahraman A. Steady state forced response of a mechanical oscillator with combined parametric excitation and clearance type non-linearity[J].Journal of Sound and Vibration, 1995, 185(5):743- 765. [6] Byrtus M, Zeman V. On modeling and vibration of gear drives influence by nonlinear couplings[J].Mechanism and Meshine Theory, 2011, 46:375-397. [7] Suohuai Zhang, Yiping Li, Damou Qiu. Study on nonlinear dynamic characteristics of a geared rotor bearing system[J].Chinese Journal of Mechanical Engineering, 2001,37(9)53-61. [8] Suohuai Zhang, Yunwen Shen, Damou Qiu.On response of mass unbalance in a geared rotor bearing system[J].Chinese Journal of Mechanical Engineering, 2002, 38(6):51-55. Figure 6. The vibration acceleration spectrum of the driven gear at the speed of 90r/min 36 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [9] Yahui Cui, Zhansheng Liu, Jianhuai Ye.Dynamic response of geared roter system and the effect of clearance on jump characteristics of amplitude[J].Journal of Mechanical Engineering, 2009, 45(7):7-15. [10] Yahui Cui, Zhansheng Liu, Jianhuai Ye, et al..Bifurcation and chaos of gear transmission system with clearance subjected to internal and external excitation[J].Journal of Mechanical Engineering, 2010, 46(11):129-136. [11] Huibo Zhang, Jian Tian, Jun Zhou, et al..Dynamic and Experimental Investigation of Gear-rotor System with Multiple Clearances Coupled[J].Journal of Mechanical Engineering, 2017, 53(11). [12] Huibo Zhang, Ran Wang, Zikun Chen, et al..Nonlnear dynamic analysis of a gear-rotor system with coupled multi-clearance[J].Journal of Vibration and Shock, 2015, 34(8). [13] J.-H.Kuang, A.-D.Lin. Theoretical Aspect of Torque Responses in Spur Gearing due to Meshing Stiffness Vatriation[J]. Mechanical Systems and Processing, 2003, 17(2):255-271. 37 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management An Outer Ring Fault Quantitative Diagnosis Method of Ball Bearing Based on the Detail Impact Characteristic Lingli Cui 1, Jinfeng Huang 2, Huaqing Wang3, Xin Wang2 1 Key Laboratory of Advanced Manufacturing Technology, Beijing University of Technology, Beijing 100124, China 2 Beijing Engineering Research Center of Precision Measurement Technology and Instruments, Beijing 100124, China 3 Beijing University of Chemical Technology, Beijing 100029, China ABSTRACT Outer ring faults that differ in size by natural multiples of the ball angular spacing will have the same time interval, which may lead to misdiagnosis. Aiming at this problem, the dynamic model of ball bearing is established in this paper. And the vibration acceleration of the bearing system with outer ring fault that differ in size by natural multiples of the ball angular spacing is simulated. Analysis of the simulation signal shows that the bearing systems with outer ring fault that differ in size by natural multiples of the ball angular spacing have the similar impact characteristics. This phenomenon will easily lead to misdiagnosis of the quantitative diagnosis method based on the time interval. Therefore, the dynamic model is optimized. In this model, the effect of the degree of freedom of balls on the vibration acceleration is considered, and then the new features which can distinguish these two faults is theoretically predicted and proposed. More importantly, experimental results show that the proposed new features can distinguish these two faults effectively, thus expanding the applicability of the diagnosis method of the fault of the bearing outer ring. Keywords: Ball bearing; outer ring fault; quantitative diagnosis; dynamic model; impact characteristic Corresponding author: Ling li Cui (acuilingli@163.com) 1. INTRODUCTION Ball bearings are critical mechanical components that determine the health of machinery and its remaining lifetime in modern production machinery [1]. A defect in such a bearing, unless detected in time, causes malfunction and may even lead to catastrophic failure of the machinery [2]. From this reason, it is significant to study the fault diagnosis of a ball bearing [3]. With the development of bearing fault diagnosis, bearing fault diagnosis technology and methods have made great breakthroughs. In terms of signal processing, most researches on bearing fault diagnosis focus on the presence or absence of faults and types of faults. Zhu et al. [4] used the Laplace wavelet correlation filtering method to effectively identify the cyclic period of fault characteristic. Jia et al. [5] achieved the accurate identification of bearing fault types based on a morphological demodulation method of improved singular spectral decomposition. This is not enough for preventive and maintenance of bearings. Only by mastering the severity of the failure can it be possible to effectively prevent the occurrence of equipment downtime and save production costs and maintenance costs. The Ref. [6] also mentioned the need to achieve a breakthrough from qualitative diagnosis to quantitative diagnosis. Therefore, it is of great significance to realize quantitative diagnosis of faults. Domestic and foreign scholars have achieved certain results in quantitative diagnosis. In order to obtain the bearing fault diagnosis mechanism, scholars established a dynamic model of the bearing, simulated the vibration response of the bearing system under different conditions, and analyzed the fault mechanism. Cao et al. [7] used the dynamic model of roller bearings to simulate the vibration response of the inner and outer rings of the bearing under different conditions. Yuan et al. [8] established a multi-body dynamics model of defective bearings, and analyzed the vibration signal of local fault and composite faults. Singh et al. [9] established the finite element model of bearing and obtained the corresponding relationship between contact force and defects. Therefore, this 38 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management paper established the dynamic model of the bearing system and simulated the vibration acceleration response of the bearing system under different sizes. 2. NONLINEAR DYNAMIC MODEL OF BEARING SYSTEM To obtain the rolling bearing vibration response characteristics in different severities and to explore its mechanism, this section from the dynamic and kinematic point of view, a dynamic model of the rolling bearing system is built. Basing on the Hertz contact theory, the contact load Fj can be derived: n j jF K (1) where K represents the load deflection factor of bearing, the contact deformation δj of the jth ball can be expressed as:  0( ( ))- , 00, 0=j j jj A t A   > ≤ (2) where A0 and A are the unloaded and loaded relative displacement between the outer and inner ring groove curvature centers, respectively. When a bearing has an rectangular outer ring fault of circumferential ∆ϕf (as shown in Figure1), depth h, and the fault center position at ϕf, in this instance, the effective depth d(ϕj) for the fault can be derived as[11]: 2 2 2( ) min( , 0.25 (0.5 ) ) j b b O f j fd h r r R         (3) Figure1. The schematic diagram of rolling bearing with outer ring fault where Ro is the outer ring radius, rb is the radius of balls. The loaded relative displacement A can be expressed as:     2 2 0 0 0 0( ( )) cos sinj rj zjA t A A        (4) where radial δrj and axial δzj deformations are calculated as follows:     0cos sin ( )cosrj i o j i o j L jx x y y r d          (5) 0( sin cos ) ( )sinzj z d x j y j jr d           (6) where rl is the bearing radial clearance. (xi, yi) and (xo, yo ) represent the displacement of the inner ring and outer ring, respectively. αo is the unloaded contact angle. The total contact forces in the x, y and z directions between balls and rings can be given by: 1 1 cos cos cos sin sin b b j j xj N N j j j yj j j j zj F F F F                              (7) where the load contact angle αj is calculated as follows: 39 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management     0 0 0 0 sin arctan cos zj j rj A A             (8) Figure 2.The nonlinear dynamic model of rolling bearing with outer ring fault The nonlinear dynamic model of the rolling bearing system is shown in Figure 2. Based on the nonlinear dynamic model of the bearing, the dynamic differential equation of the rolling bearing is given by: 1 1 1 1 b b b b N o o ox o ox o xj j N o o oy o oy o yj j N i i ix i ix i x xj j N i i iy i iy i y yj j m x c x k x F m y c y k y F m x c x k x F F m y c y k y F F                                 (9) where mi, ki and ci are the mass, the stiffness and damping of inner ring and shaft, xi and yi are the displacements. mo, ko and co are the mass, the stiffness and damping of outer ring and bearing pedestal, xo and yo are the displacements. 3. VIBRATION RESPONSE OF BEARING SYSTEM The bearings studied in this paper are NSK6308 ball bearings with eight balls. The angle extents of the outer ring faults are set to ∆ϕf =7.4o and 52.4o, these faults differ in size by an integer number of the ball angular spacing. The Hertz contact load factor K=5453MN/m1.5. The radial load in the horizontal and vertical directions of the bearing is Fx=0N, Fy=-100N respectively, set the shaft rotation speed fs=7Hz and the sampling frequency Fs=65536Hz. 40 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 3 (a) and (b) shows the simulated vertical vibration response for ∆ϕf = 7.4° and 52.4°, respectively. Combining with the theoretical analysis, it can be seen that when the ball enters or leaves the fault area, the acceleration will abruptly change and there will be shock oscillating, namely the impact 1 and the impact 2 in Figure 3(a). From the comparison of the results in Figure 3, it can be seen that the time interval between the impact 1 and impact 2 in the vibration response of ∆ϕf =7.4° is almost equal to the time interval of ∆ϕf =52.4°. Therefore, the two faults have the same impact characteristics in the time domain. So the quantitative diagnosis method based on the time interval may be misdiagnosed. Ref. [10] proposed a third feature method to deal with this problem. This method can effectively distinguish these faults with the same impact time interval located at 270o. Considering that in the actual engineering case, the outer ring fault angle position may appear near 270o, so this paper carries on further research to this question. 0.1 0.15 0.2 0.25 0.3 -20 -10 0 10 Time (s) A cc e le ra ti o n ( m /s 2 ) 0.1 0.15 0.2 0.25 0.3 -20 -10 0 10 20 A cc e le ra ti o n ( m /s 2 ) Impact 1 Impact 2 (a) (b) Impact 1 Impact 2 Figure 3. Simulated vertical vibration response for ∆ϕf=7.4o and 52.4o: (a) ∆ϕf =7.4o; (b) ∆ϕf =52.4o 4. MECHANISM ANALYSIS BASED ON IMPROVED BEARING SYSTEM MODEL 4.1. Analysis of bearing simulated vibration response Based on the concern of above problem, firstly, this part will optimize the nonlinear dynamic model of bearing system shown in Figure 2. The dynamic model of Figure 2 does not consider the degree of freedom of balls, but in the actual system, the ball has independent freedom and inertia force. In Ref. [11], an improved dynamic model of the bearing system is introduced. The model is considered the degree of freedom and inertia of the ball based on the model of equation (9), and its nonlinear dynamic equation is as follows:   , , , , , , , , , , , , 2 , , , 0 0 0 sin 0 i x di xi i i y di yi s x o s x o o x do xo o s y o s y o o y do yo b i r b i r c b i r r F Fx m F Fy g W c x k x F Fx m c y k y F Fy g m m w m g f                                                             ....................................... (10) where fr is the contact force, and pi is the radial coordinates of each ball. Based on the optimized dynamics model of bearing system, the simulated vibration acceleration signal is shown in Figure 4. The results shown in Figure 4 indicates that the impact 1 and the impact 4 correspond respectively to the moment of the ball entry and exit fault area, and the two impacts are similar to the impact of the entry and exit fault area obtained by the equation 1. In addition, except for the two impacts, there are small impacts between the 41 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management two impacts, such as the impact 2 and impact 3 shown in Figure 4, which cannot be obtained by the equation (9). Due to the existence of faults, the gap between the ball and the inner and outer raceway increases, resulting in the bouncing phenomenon of the ball in the raceway, causing the small impacts in the vibration response. This phenomenon will only appear in larger faults, while small faults such as 7.4o will not appear. For the small size fault, the ball does not contact with the bottom of the raceway in the fault area, and then rolled out the edge of the fault, so there is no bouncing phenomenon. According to the impact characteristics, it can further be used to distinguish these outer ring faults that differ in size by an integer number of the ball angular spacing. Figure 4. Simulated vibration response for Ref.[11] 4.2. Analysis of experimental data of bearing In order to verify the feasibility of the impact characteristic proposed in this paper for distinguishing these outer ring faults that differ in size by an integer number of the ball angular spacing, the experiment was carried out. The test rig is shown in Figure 5. In this experiment, the outer ring fault of ∆ϕf =7.4o and 52.4o in NSK6308 deep groove ball bearing is processed by wire cutting, and the fault bearing is shown in Figure 6. Rotational frequency is 7Hz, the sampling frequency is set to 65536Hz. The measurement system is NI USB4432, the sensors is HD- YD-232, and the Butterworth filter is adopted. The experimental signals of ∆ϕf =7.4o and 52.4o are shown in Figure 7. The results shown in Figure 7 show that only two impacts (impact 1 and impact 2 in Figure a) appear in the signal of the small size fault ∆ϕf =7.4o, and four impacts appear in the signal of ∆ϕf =52.4o, which are similar to the simulation conclusions in the last section. Therefore, the detail impact characteristics can be used to distinguish these outer ring faults that differ in size by an integer number of the ball angular spacing from the time domain signal. Figure 5. Photos of the test rig Figure 6. Defective bearing with fault size of ∆ϕf=7.4o and 52.4o: (a) ∆ϕf=7.4o; (b) ∆ϕf=52.4o Impact 2 Impact 1 Impact 3 Impact 4 42 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 0.8 0.805 0.81 -20 -10 0 10 A cc e le ra ti o n ( m /s 2 ) Time (s) (a) x:0.7967 y: -0.058Impact 1 Impact 2 x:0.7997 y:8.13 1.7 1.705 1.71 1.715 -5 0 5 A cc e le ra ti o n ( m /s 2 ) Time (s) Impact 1 Impact 2 x:1.705 y:2.86 x:1.702 y:0.278 Impact 3 Impact 4 (b) Figure 7. Measurement vibration response for Δϕf=7.4° and 52.4°: (a) Δϕf=7.4°; (b) Δϕf=52.4° 5. CONCLUSION In this paper, a dynamic model of bearing system with outer ring faults is established. Based on this model, the vibration acceleration of the ball bearing system is simulated with outer ring faults that differ in size by an integer number of the ball angular spacing. Through the comparison of the simulation signals, it is known that these outer ring faults that differ in size by an integer number of the ball angular spacing have the same impact interval. This phenomenon will lead to the diagnosis method based on the impact time interval. To solve this problem, the dynamic model is further optimized. From the simulation results, it is known that there are small impacts in the vibration response of the large size fault except for the double impact, and the double impact phenomenon only occurs in the signal of small size fault. Then the new characteristics can be used to distinguish these faults that differ in size by an integer number of the ball angular spacing theoretically. The experimental results show that the new features can effectively distinguish these faults that differ in size by an integer number of the ball angular spacing. ACKNOWLEDGMENTS This work is supported by the National Natural Science Foundation of China (Grant No. 51575007 and No. 51675035). The authors would like to thank the editors and anonymous reviewers for their helpful comments and constructive suggestions. REFERENCES [1] El-Thalji I., Jantunen E., A summary of fault modelling and predictive health monitoring of rolling element bearings[J]. Mechanical Systems & Signal Processing, 2015, 60-61(1):252-272. [2] Tandon N, Choudhury A. A review of vibration and acoustic measurement methods for the detection of defects in rolling element bearings [J]. Tribology International, 1999, 32(8):469-480. [3] Cui L., Huang J., Zhang F., Quantitative and Localization Diagnosis of a Defective Ball Bearing Based on Vertical-Horizontal Synchronization Signal Analysis [J]. IEEE Transactions on Industrial Electronics, 2017, 64(11): 8695 - 8706. [4] Wang S., Zhu Z., Wang A., Bearing fault feature detection based on parameter identification of transient impulse response [J]. JOURNAL OF VIBRATION ENGINEERING, 2010, 23(4):445-449. [5] Yan X. Jia M., Morphological Demodulation Method Based on Improved Singular Spectrum Decomposition and Its Application in Rolling Bearing Fault Diagnosis [J]. Journal of Mechanical Engineering,2017, 53 (7):104-112. [6] Wang G., He Z., Chen X., Lai Y., Basic Research on Machinery Fault Diagnosis—What is the Prescription [J]. Journal of Mechanical Engineering,2013,49(1): 63-72. [7] Cao M., Xiao J., A comprehensive dynamic model of double-row spherical roller bearing—Model development and case studies on surface defects, preloads, and radial clearance[J]. Mechanical Systems & Signal Processing, 2008, 22(2):467-489. [8] Xing Y., Zhang Y., Zhu Y., et al. Fault degree evaluation for rolling bearing combining backward inference with forward inference[J]. Journal of Zhejiang University, 2012, 46(11):1960-1967. [9] Singh S., Köpke U., Howard C., et al. Analyses of contact forces and vibration response for a defective rolling element bearing using an explicit dynamics finite element model [J]. Journal of Sound & Vibration, 2014, 333(21):5356-5377. [10] Petersen D., Howard C., Prime Z., Varying stiffness and load distributions in defective ball bearings: Analytical formulation and application to defect size estimation [J]. Journal of Sound & Vibration, 2015, 337:284-300. [11] Ahmadi A., Petersen D., Howard C., A nonlinear dynamic vibration model of defective bearings – The importance of modelling the finite size of rolling elements [J]. Mechanical Systems & Signal Processing, 2015, 52-53(1):309-326. 43 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Ageing Models for Lightly Loaded Distribution Power Transformers M.Z. Mukuddem1, J.L. Jooste 2 1 Department of Industrial Engineering, Stellenbosch University, South Africa 2 Department of Industrial Engineering, Stellenbosch University, South Africa ABSTRACT Power transformers form an integral part of an electricity network. These assets allow for power to be transformed from different voltage levels in an efficient manner. In order to extract maximum return on investment, asset owners are required to ensure transformers are fully utilized prior to being disposed of. In line with this, there are various existing ageing models available which are used to predict the loss of life for a transformer. Some of these models include those presented by IEC [1], IEEE [2], Emsley [3], Lungaard [4] and Martin [5] These models attempt to predict loss of life based on a combination of parameters such as hot-spot temperature, moisture in paper content and oxygen content. These three parameters account for the three different degradation mechanics of transformer solid insulation (cellulose paper). In this paper, four frequently used ageing models are applied for ten pairs of distribution power transformers, which operate at an average loading of below 50%. The data used covers a three year period. The results were verified against Furanic oil results of the transformers and it was found that the ageing models constantly produce high variations in their predicted versus measured loss of life. To reduce the variation between predicted and measured loss of life values, three new ageing models are developed in this paper. These models are based on hot spot temperature, oxygen content in the oil and moisture content in the solid insulation. A genetic algorithm is used to determine the model parameters from available data from in-service transformers. The results show that these new ageing models produce ageing estimations with less variation between the predicted and measured loss of life in comparison to the results of the initial models investigated. The modified ageing models is shown to be easily implementable on in-service transformers, even when limited condition data is available. Condition Monitoring, Power Transformer, Genetic Algorithm, Ageing Model, Loss of Life. Corresponding author: Mohamed Ziyaad Mukuddem (mzmukuddem@gmail.com) 1. INTRODUCTION Power transformers form an integral part of present day electrical networks. They allow for power to be transmitted over vast distances while minimizing losses and costs [6]. They are however one of the most expensive assets within the distribution network [7, 8, 9]. To maximize the return on investment, the transformers should operate, at minimum, for the duration of their designed life time. Power transformers are used throughout the electricity network, as part of large power stations to small pole mount distribution transformers. The distribution networks with redundancy typically share the total load of a substation across two or more transformers [9]. Should one of the transformers not be operational (N-1 condition), the remaining transformers are still capable of supplying the full load of the substation [9]. This results in transformers operating at below full load capacity under normal network conditions [9]. A lightly loaded transformer can be considered one which operates at below 50% average loading [10]. A reduction in load factor results in a reduction of hot-spot temperature below the rated operating hot-spot temperature [1]. In the South African context distribution power transformers are commonly built with network redundancy. This typically results in two transformers sharing a total load which can be supplied by each individual unit. As a result, these transformers operate at below 50% loading. The operating hot spot temperature which they operate at is therefore below that of its designed rating. 44 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management A transformer consists of two main components, namely the winding and the core. The core is covered with solid insulation. Kraft paper is the most commonly used solid insulation. The loss of life of a transformer is directly affected by the condition of the solid insulation within the transformer windings [1, 2], since it is not ordinarily feasible to replace the solid insulation during the life of a transformer. The strength of solid insulation is measured via the degree of polymerization of the paper (DP), which is directly related to the physical strength of the solid insulation. Ageing mechanics such as hydrolytic degradation, oxidative degradation and thermal degradation degrade the strength of the solid insulation inside a transformer. A transformer degrades as a result of these mechanics until it reaches it end of life criteria. A widely accepted physical end-of-life criteria is when the degree of polymerization of the paper reaches 200 or less [1, 2, 3, 4, 5]. There are various models for calculating this rate of loss-of-life in solid insulation [1, 2, 3, 4, 5]. These models use different permutations of the parameters, hot-spot temperature, moisture and oxygen content, to calculate degradation. The accuracy of the ageing models is important for the effective management of transformers. The norm is to use a reference hot-spot temperature of 98°C for calculating loss-of-life of kraft paper insulated transformers. Transformers in distribution networks operating below average loading, therefore also operate below this reference temperature. The problem is that it is unknown how accurate the existing ageing models are for estimated loss-of-life of lightly loaded transformers. The need is therefore to identify a suitable ageing model which is sufficiently accurate to effectively manage lightly loaded transformers throughout their lifetime. 2. METHODOLOGY The data available in support of this study is limited to transformer loading and oil sampling data. The study is therefore approached in three phases to cater for the data limitations. The existing ageing models require hot-spot temperature data for estimating loss-of-life. Hot-spot temperature data therefore needs to be firstly derived from the loading data. Secondly, the data is applied to the existing ageing models to estimate the loss-of-life and to compare the accuracy of the estimates. Finally, three new ageing models are derived and evaluated to improve on the accuracy of loss-of-life estimation for lightly loaded transformers. Since the data available is limited to transformer loading data, an algorithm is required to simulate the operating hot-spot temperature of the transformers. IEC [1] provides an algorithm, which is capable of using loading and ambient temperature data for determining the theoretical hot-spot temperature. The algorithm is developed in Matlab and validated in two ways. Against a dataset provided by IEC [1] and against data of a pair of in-service transformers which have digital temperature probes installed. The validation results produce a deviation of 0.11% against IEC [1]. The second validation against in-service transformers produces an average variation of 6%. The variation between the measured and calculated values can be attributed to the lack of ambient temperature which is not available in the given dataset. The algorithm is expanded for calculating the estimated loss of life based on the existing ageing models. The algorithm logic is presented in Figure 1. For each time interval the hot-spot temperature and ageing is calculated until the full time period is reached, followed by the reporting of the total loss of life for the transformer over the full time period. The existing ageing models investigated as part of this study are the models by IEC [1], Emsley and Stevens [3], Lungaard et al. [4] and Martin et al. [5]. These ageing models vary from single to multi-parameter models. The simplest model [1] only uses the hot-spot temperature of the transformer to determine the loss of life, while the most complex model [5] utilizes hot-spot temperature, oxygen content and moisture content. The ageing models under investigation come in two forms, either in relative ageing rate form or in Arrhenius form. For simplification the Arrhenius form models [3, 4, 5] are converted to the relative ageing rate form by using equation (1): !"!#" . 𝑒( (# #")*+, - (# ")*+, ),………..………………………………. (1) where AT is a constant at the temperature, moisture and oxygen for a specific interval, ART is a constant at the reference temperature, moisture and oxygen, E is the activation energy, R is the ideal gas constant, 8.314 J mol-1 45 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management K-1, T is the hot-spot temperature (°C), and RT is the reference temperature (°C). The parameters values used for the conversion are provided in Tables 1 to 3. Figure 1: Graphical representation of algorithm used to determine hot-spot temperature and transformer ageing Table 1: Constant, ART, for the ageing models Ageing Model ART (Kraft paper, Moisture < 0.5%, Oxygen < 7000 ppm) Emsley & Stevens [3] 1.07 × 108 Lungaard, et al. [4] 2.00 × 108 Martin, et al. [5] 0.52 × (1.78 × 10 12) + 0.5 × (1.10 × 1010) + 5.28 × 107 46 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 2: Constant, AT, for the Emsley and Stevens [4] and Lungaard et al. [5] models Ageing Model Kraft Paper + 1% Moisture Kraft Paper + 2% Moisture Kraft Paper + 3% Moisture Kraft Paper + 4% Moisture Emsley & Stevens [3] 3.5 × 108 7.8 × 108 35 × 108 35 × 108 Lungaard, et al. [4] 6.2 × 108 6.2 × 108 21 × 108 21 × 108 Table 3: Constant, AT, for the Martin et al. model, were w is the moisture content as a percentage Martin, et al. [5] AT Low oxygen (<7000ppm) (1.78×1012)w2 + (1.10×1010)w + 5.28×107 Medium Oxygen ( 7 000 – 14 000 ppm) (2.07×1012)w2 + (5.61×1010)w + 2.31×108 High oxygen (> 14 000 ppm) (2.29×1012)w2 + (9.78×1010)w + 3.86×108 3. LOSS OF LIFE ESTIMATION WITH EXISTING AGEING MODELS Furanic oil sampling is non-intrusive and a universally accepted method for determining the DP value of a given transformer. The results from the ageing models provide a loss-of-life of the transformer in minutes. To compare the models to the actual Furanic oil samples of the twenty transformers under investigation, a loss of degree of polymerization needs to be determined. The conversion from loss of life minutes to degree of polymerization is derived from IEC [1] and IEEE [2]: 𝐷𝑃 𝑙𝑜𝑠𝑠 = 7.41×10-:×𝑙𝑜𝑠𝑠 𝑜𝑓 𝑙𝑖𝑓𝑒 (𝑖𝑛 𝑚𝑖𝑛𝑢𝑡𝑒𝑠),……………………………………. (2) To compare these values a variation factor is calculated, which indicate how closely the ageing models compare to the measured Furanic oil sample results. A variation percentage of zero indicates that the ageing model and the measured results are identical. A value less than zero indicates an underestimation, and values greater than zero indicate overestimation. Table 4 presents a scale used to classify variation as low, medium and high and Figure 2 shows the frequency plot for the variation categories. Table 4: Variation classification scale in percentage Var ≤ -100% -100% < Var < -20% -20% ≤ Var ≤ 20% 20% < Var < 100% Var ≥ 100% High Medium Low Medium High From the results it is evident that the ageing models produce results which tend to underestimate the loss of life of the sample of transformers. This could be due to the low operating temperature which the transformers operate at. The transformers operate at low hot-spot temperatures due to shared load and therefore could explain the underestimation. The shared load is due to the distribution network configuration which caters for contingencies, where two transformers share a load, although one transformer is able to handle the full load, if required. Based on the ageing models, for approximately every decrease of 6 °C of the hot-spot temperature, the ageing rate of paper halves. The highest average temperature from the sample of transformers is 66°C. This is well below the rated operating temperature of 98°C. Based on the relationship above, the ageing of the paper would be at 0.03125 of the rated ageing. The results in Figure 2 show that the ageing models investigated produce results of predominantly medium variation compared to the actual Furanic oil sample values. If these models are used by assets owners who use a shared load distribution network configuration, the transformers would be replaced prematurely. To improve on the 47 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management large variations, three modified ageing models are proposed and assessed against the existing models’ result in the next section. Figure 2: Frequency plot of the classifications for the sample of transformers 4. MODIFIED AGEING MODELS & RESULTS The ageing models presented previously have three variables which can be selected by the user, namely, the activation energy (E), pre-exponent constant (A) and the reference temperature for ageing. The activation energy has been accepted by several authors as 111 000 J/Mol and is independent of the quantity of moisture, oxygen or hot-spot temperature [1, 2, 3, 4, 5]. Thus, the aim is to determine the value of the pre-exponent constant, A. The constant, A, is used to account for accelerated ageing depending on the moisture and oxygen content within the transformer. The second variable which is investigated is the reference temperature (RT). In the case of kraft paper, the reference temperature is 98°C, which increases to 110°C for thermally upgraded paper. To improve the accuracy of the ageing models two variables, namely the reference temperature, RT, and the constant, A, are investigated. The process involves the investigation of three alternative cases of ageing models. Case 1 only considers the hot- spot temperature, Case 2 only oxygen content, and Case 3 only moisture content. The same basic form of the ageing model as in (1) is used for the investigation. Case 1 aims to identify whether the reference ageing temperature should be altered to decrease the percentage variation of the loss of life estimate. In Case 1 a single variable is considered for optimization, namely the reference temperature, RT. The parameters, ART and AT, are kept constant at a value of one for this case, reducing equation (1) to equation (3). The genetic algorithm previously presented is used to solve for variable RT which produces the most accurate ageing estimation. 𝑒( (# #")*+, - (# ")*+, ),………..………………………………. (3) Case 2 incorporates oxygen content. The reference temperature in this case is kept constant at 98°C. This case uses a form similar to Martin et al. [5]. The values for the pre-exponents presented by Martin et al. [5] are based on a combination of moisture and oxygen content. In this case, the pre-exponent, AT, is based only on the oxygen content during a given time period. The values for AT and the limits used are presented in Table 5. For Case 2 equation (1) is used and the three variables, AA, AB, and AC, is optimized. ART is kept constant at a value of one which is the reference for this case. Low Variation Medium Variation High Variation IEC [1] 0 20 0 Esmley [3] 1 16 3 Lungaard [4] 1 17 2 Martin [5] 2 15 3 0 5 10 15 20 25 N um be r o f Tr an sf or m er s 48 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Case 3 examines the moisture content in isolation to the reference temperature and oxygen content. The reference temperature in this case is held constant at 98°C. This case uses a similar approach to Emsley & Stevens [3] and Lungaard et al. [4]. Case 3 considers the optimisation of four pre-exponent variables and are based on the moisture content of the paper. The moisture limits are specified in Table 5. For Case 3 equation (1) is used and the four variables, Aw, Ax, Ay and Az, is optimized. ART is kept constant at a value of one which is the reference for this case. Table 5: Pre-exponents, AT, used in Case 2 with their oxygen limits and Case 3 with their moisture limits Case 2 Oxygen Content [PPM] AT Case 3 Moisture in paper [%] AT <1000 ART <1% ART >1000; <= 7000 AA >=1%; <2% Aw >7000; <=16500 AB >=2%; <3% Ax > 16500 AC >=3%; <4% Ay >4% Az The optimization is performed with the premise that the ageing models are black-box problems. Values for the variables is passed into the ageing model and the model produces an accuracy value. This accuracy value is used as the cost function for the optimization model. The number of variables to be determined for each of the three cases are, one, three and four, respectively. An approximate optimization method is utilized as the problem does not exist within the scope of a simple mathematical model. In order to carry out the optimization, the algorithms for Cases 1, 2 and 3 are passed through the genetic algorithm. The genetic algorithm is able to vary the specified number of variables in each case algorithm. The genetic algorithm evaluates the accuracy of the model at each iteration to determine its next step and terminates when it finds a solution which satisfies the exit requirements. This algorithm operates with the specified cost function in (4) and it aims to minimize the cost function to a specified fitness limit. 𝐶𝑜𝑠𝑡 = 𝐷𝑃BCDEFGHDE − 𝐷𝑃!GHJKL ,……………………………………. (4) A genetic algorithm by nature may produce slightly different result each time it is executed. To overcome this, the genetic algorithm is executed five times for each case. The results are averaged to obtain a final value. Case 1 was executed five times with the genetic algorithm. The output of each iteration produced a reference temperature for the transformers which produce a predicted DP value equal to the measured DP value. These results are averaged out across all the iterations to produce a specific reference temperature for each transformer. The two variables, average reference temperature and average operating hot-spot temperature, are analyzed using correlation analysis to determine if the two variables are correlated. The results from the correlation indicate a positive correlation with a value of 0.912, which is indicative that the reference temperature for ageing can be based on the average operating hot-spot temperature. A three-year average is used to carry out regression analysis for defining the function between the variables. Hypothesis testing, at 95% significance, for the slope as well as the intercept of the regression line concludes that equation (5) is an acceptable representation of the relationship. Equations (3) in conjunction with (5) forms the modified ageing model for Case 1. 𝑅N = 1.682 × 𝐴𝑣𝑒𝑟𝑎𝑔𝑒 ℎ𝑖𝑠𝑡𝑜𝑟𝑖𝑐 𝑜𝑝𝑒𝑟𝑎𝑡𝑖𝑛𝑔 ℎ𝑜𝑡𝑠𝑝𝑜𝑡 𝑡𝑒𝑚𝑝𝑒𝑟𝑎𝑡𝑢𝑟𝑒 − 19.996,……………. (5) Case 2 was executed five times using the genetic algorithm. Correlation analysis revealed no significant correlation (at 95% significance) between the variables AA, AB and AC and the input parameters; load, hot-spot temperature or moisture. A weak correlation is found between AA, AC and average oxygen content. To utilize these results correctly, the minimum and maximum oxygen values for each transformer is derived. The minimum and maximum value is used to classify which results from the genetic algorithm should be accepted. If the oxygen content of a 49 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management transformer never enters any of the three categorization limits, the results are neglected. If a transformer operates within the range of a category, a one is attributed, otherwise a zero is assigned. The complete set of results, in conjunction with the operating oxygen range, is used to determine the final variables AA, AB and AC. To determine the value of these variables, the following steps are carried out: The average variable value for each of AA, AB and AC are calculated per transformer from the five iterations’ outputs. The overall average of the variable is calculated if the transformer operates within the given categories and finally the results are substituted into the (1). The final variable values for Case 2 is AA = 159.47, AB = 235.27 and AC = 128.63. These variables in conjunction with (1) forms the modified ageing model for Case 2. The results for Case 3 were obtained in a similar manner than for Case 2. For this case, the correlation analysis also did not show significant correlation. The final variable values obtained for Case 3 are: Aw = 243.15, Ax = 133.08, Ay = 143.13, and Az = 1058.60. These variables in conjunction with (1) forms the modified ageing model for Case 3. The three modified ageing models are executed through the same loss of life algorithm presented in section 2. The results are compared with the results of the initial four ageing model in section 3 to establish whether the modified ageing model produce more accurate loss of life estimations. From the results in Figure 3 it is evident that the modified ageing models produce more results in the low variation range in comparison to the initial four models used. Case 1 and Case 2 present the highest number in this category, accounting for 25% of the sample, respectively. Case 3 also have results in this category which account for 20% of the sample. Case 1 does however have more results within the medium variation category in comparison to Case 2. In addition, the new modified ageing models have a higher tendency to overestimate the age in comparison to the original four models which predominantly underestimated ageing. To gain further insight, the results are categorised. The categories consider whether the predicted value is an over or under estimation of the measured value. The categories are defined based on the percentage variation, where less than -50% is considered an underestimation, between -50% and 50% is considered acceptable and more than 50% is considered an overestimation. The categorized results are shown in Table 7. Table 7: Categorised ageing model results Variation IEC[1] Esmley [3] Lungaard [4] Martin [5] Case 1 Case 2 Case 3 Underestimation 19 14 17 10 2 2 1 Acceptable 0 3 1 5 12 7 8 Overestimation 1 3 2 5 6 11 11 When considering the three modified ageing models, Case 1 produces the largest improvement in the age estimation results. Its improvement lies in the number of prediction in the low variation category when compared to the IEC [1], Emsley and Stevens [3], Lungaard et al. [4] and Martin et al. [5] ageing models. In addition, the ageing model described by Case 1 is the simplest of the three modified models. It only accounts for hot-spot temperature, but it however reduces the reference temperature for ageing. This makes the model easily implementable in practice, with the equation using the reduction of the reference ageing temperature to account for moisture and oxygen content. 5. CONCLUSIONS The Case 1 ageing model is recommended to be better suited for lightly loaded transformers when compared to the original four ageing models investigated. The ageing estimation are to be used in the asset management of power transformers. It is evident that while IEC, Emsley and Stevens and Lungaard et al. primarily produce results which underestimate the ageing of a transformer, Case 1 shifts the results into the categories of reasonable and overestimation. In terms of asset management, an underestimation may result in no action being taken on the specified asset which may result in an unplanned breakdown. This can be due to lack of action as it is assumed the asset does not require any intervention. An overestimation, will however start off a further investigation process. In 50 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management terms of power transformers, further investigation can entail oil and paper Furanic tests. This will confirm whether action is required or not. Figure 3: Frequency plot of the classifications for the sample of transformers including modified ageing models This study set out to investigate the accuracy of ageing models when applied to lightly loaded distribution power transformers. The results indicate that the available ageing models produce results with a high degree of variation between predicted and measured DP values, which is an indication of loss of life. Three cases for modified ageing models is proposed, which generate ageing estimations with a lower variation rate in comparison to the existing models available. The modified ageing model which produces the best improvement results is based only on the operating hot-spot temperature of a transformer. Opportunities for future research is to investigate the existing and modified models with a larger sample of data, to determine what the optimal time period for historic hotspot temperature should be, and to use technology to digitally record hotspot temperature, moisture and oxygen measurements more accurately. REFERENCES [1] IEC (2005). Power Transformers – Part 7: Loading Guide for oil-immersed power transformers [2] IEEE (2011). IEEE Guide for Loading Mineral Oil-Immersed transformers and step-voltage regulators, [3] Emsley, A. M. & Stevens, G. C. (1994). Review of chemical indicators of degradation of cellulosic electrical paper insulation in oil-filled transformers. IEE Proc-Sci, Volume 141. [4] Lundgaard, L., Hansen, W., Linhjell, D. & Painter, T. (2002). Ageing of oil-impregnated paper in power transformers. IEEE Transactions on Power Delivery, 19, pp.230-239 [5] Martin, D. et al. (2015). An updated model to determine the life remaining of transformer insulation. IEEE Transactions on Power Delivery, Volume 30(1). [6] Heathcote, M. J. (2007). The J & P Transformer Book. 13 ed. Oxford: Elsevier. [7] Abu-Elanien, A. E. & Salama, M. M. (2010). Asset Management Technique for Transformers. Electric Power Systems Research, Volume 80, pp. 456-464. [8] Brandtzaeg, G. (2015). Health Indexing of Norwegian Power Transformers, Masters Thesis, Norwegian University of Science and Technology, Trondheim, Norway. Low Variation Medium Variation High Variation IEC [1] 0 20 0 Esmley [3] 1 16 3 Lungaard [4] 1 17 2 Martin [5] 2 15 3 Case 1 5 12 3 Case 2 5 3 12 Case 3 4 4 12 0 5 10 15 20 25 N um be r o f Tr an sf or m er s 51 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [9] Eskom Holdings Ltd and ABB Powertech, 2008. Theory, Design, Maintenance and Life Management fo Power Transformers. Johannesburg: Y-Land Design, Print and Promotions. [10] Felber Engineering GMBH, 2016. Transformer Design and Technology Training Course, Weiz: Felber Engineering. 52 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Measurement and visualization of the corona inception gradient on an HVDC transmission line under the effect of wet conditions J. S. Djeumen1*, J. J. Walker1 and N. J. West2 1Department of Electrical Engineering, Vaal University of Technology, Private Bag X021, 1Andries Potgieter Blvd, Vanderbijlpark, South Africa, 1900 2School of Electrical. and Info. Eng. University of the Witwatersrand, Johannesburg Private Bag 2, 2050 2Jan Smuts Avenue, Braamfontein, Johannesburg, 2000 1*julesd@vut.ac.za; 1jerryw@vut,ac,za and 2nicholas.west@wits.ac.za ABSTRACT Due to the increasing demand to transmit a high amount of electric power over very long distances, High Voltage Direct Current (HVDC) transmission is gaining increasingly in importance across the world. South Africa has important coal resources. Moreover, with the implementation of the Cohora Bassa to Apollo line transmission project and many others, transmission lines will inevitably pass through high altitude regions with complex atmospheric conditions. These complex atmospheric conditions are one of the main causes of power loss on transmission lines resulting in adverse economic and environmental effects. Therefore, understanding the atmospheric conditions is crucial in designing HVDC transmission lines. Typical environmental conditions include temperature, wind or air pressure and rain. This paper focuses on the measurement and the visualization of the corona inception voltage in a corona cage at Vaal University Technology high voltage laboratory for the positive and negative HVDC applications on the transmission line under wet conditions. The focus of these measurements and visualizations were to investigate and present an overall picture of corona discharge performance and the influence of the water mist on the transmission line in the corona cage at ±100 kV with the variation of the room’s humidity and the temperature. Keywords: Corona Discharge, CoroCam8, Partial Discharge, HVDC, Fault detection 1. INTRODUCTION In modern developed or developing societies and in South Africa in particular, due to the growth of the industrial sector and world population, the demand for electricity is ever increasing. A large amount of power is generated, transmitted and distributed. As an example, in South Africa, the total official unit sentout in 2010 was 237,073,293 MWH and reached 216,473,767 MWH in 2017 [1, 2]. The centres of power generation are distant from each other, thus extensive transmission systems are needed to transmit energy over long distances. Transmission systems are the backbone of the power system that carries the large amount of power at high voltage, very high voltage and ultra- high voltage. South Africa's climatic conditions generally range from Mediterranean in the southwestern corner of South Africa to temperate in the interior plateau, and subtropical in the northeast. These diverse climatic conditions pose a problem for the power system transmission networks as lines often have to travel for many kilometers and cross-different climatic regions. ESKOM, the South African power utility, is planning in the near future to build new HVDC transmission lines, convert some of the existing HVAC lines into HVDC as well as moving towards hybrid AC-DC towers. This will only be possible of course, with a good capital expenditure plan, which is the key of investments, and with specific environmental studies [1]. Weather conditions such as rain, temperature and snow easily cause the corona loss rather than normal air weather condition [3-4]. In addition, several natural factors such as weather, conductor size, location, altitude above mean 53 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management sea level and wet atmospheric conditions affect corona discharge phenomena on transmission lines. In this study, the influence of moisture on DC corona discharge pulses was measured and visualized experimentally while varying the applied voltage and the atmospheric conditions in the room. Artificial rain which aimed at increasing humidity and dropping the temperature in the HV laboratory room was introduced. A corona camera was used to visualize the corona discharge patterns and distribution. A digital partial discharge detection system through its control screen, enabled measurement of the intensity of the corona discharge. 2. FACTORS AFFECTING AND CORONA LOSS EQUATION The corona inception voltage is defined as the lowest voltage at which continuous corona discharge of a specified pulse amplitude occurs as the applied voltage is gradually increased [5]. Corona discharge is one of the main problems encountered in the design and operation of transmission lines. This is a well-known phenomenon that can cause damage and losses in a transmission or power system [5-6]. Corona loss can be described as the ionization of air surrounding the power transmission line [7]. Consequently, the corona becomes non-uniform along the transmission line; for instance, if the transmission line is rough, then the glow will be relatively brighter. In addition, positive DC corona will be observed as having a ‘smooth’ glow while negative DC corona will tend to appear erratic. 2.1. Factors affecting Corona discharge MATLAB simulations have shown that under wet conditions, there is more corona loss than during the dry season [7]. Besides the wet conditions as factors affecting the corona loss, the surface roughness of the conductor can also increase corona loss as the roughness of the surface decreases the value of breakdown voltage [6]. Among the factors, which affect the corona loss mentioned [8], the atmospheric conditions were one of the parameters emphasized. From Peek’s equation, the conductor surface field intensity is proportional to the size of conductor. Also, parameters such as the supply frequency, size of conductor bundles, spacing between conductors, mean load current at sea level, conductivity of the atmosphere and temperature of the conductor from load current may have an effect on cause corona loss. 2.2. Corona Loss Equation According to Peek’s equation [7], the power loss due to corona under fair weather conditions can be expressed as: ……………………………………..…… (1) The normalized air density correction factor, ⸹ can be elaborated as: ………………………………………………………….…………………………………….. ……. (2) Where: Pl: Corona loss under fair weather conditions f: Frequency t: Temperature of the surroundings L: Length of the conductor Ed: Disruptive critical voltage ⸹: Air density correction factor R: Radius of the conductor P: Atmospheric pressure K: Fixed Constant d: Conductor spacing VL: Line to line voltage 54 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3. EXPERIMENTAL PROCEDURE AND SETUP 3.1. Corona Cage Experimental setup The corona cage used for this research has a diameter of 1.5 m, a height of 2.4 m and is 4.7 m in length, with two outer or protective segments of 0.6 m each and one inner segment of 1.2 m usually called measured segment. The whole structure of the cage has a total length of 4.7 m. A single TERN aluminium conductor with a diameter of 30 mm runs through the centre of the cage. The conductor was energised with a HVDC modular system power supply with a maximum output of 270 kV. The shield segments are electrically isolated from the measuring segment(s) to measure the corona discharge on the conductor’ surface and to eliminate the corona discharge on the cage [9-10]. A high conductor surface electric field is generated when a sufficient voltage is applied between the conductor and the cage. Figure 1: Test system configuration 3.2. Experimental setup The atmospheric condition, which in this case was the “mist”, has been simulated by a mist cooling system. This system was installed just outside of the cage’s surface on the circular mesh of the cage in order to prevent corona discharges being generated on the mist nozzles The water were sprayed through the outer mesh electrode on to the inner energized conductor. Preassembled flexible tubing was installed around the cage, one on top of the conductor and two on the sides of the conductor. The tubing was attached to the water supply system, making it possible to control the flow of water during the measurements. The mist spray system also helps to reduce the ambient temperature around the conductor up to 15˚C [11, 12, 13] have also studied the influence of humidity on the first corona inception. It can be observed that the humidity is a parameter to consider during the design of the transmission lines. Figure 2: The installation of the mist cooling in the cage Resistance HV Transformer C c Z ICM Measurement Segment 1 Protec. Seg. 1 Protec. Seg. 1 Corona Cage Diode ca p a cito r Mist water Aluminum ACSR TERN Conductor Brass and Stainless Steel Nozzles mist water 55 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 4. TEST ARRANGEMENT AND INCEPTION VOLTAGE MEASUREMENT 4.1. Test Arrangement The positive and negative corona inception voltage of the aluminum ACSR stranded conductor (TERN) is obtained by using the corona cage test platform as shown in figure 1 above. The conductor is placed concentrically inside the corona cage. With a total length of approximatively 4.7 m, the cross section of the cage is circular in its dimension. The inception is affected by many factors, such as the temperature, the air pressure and the relative humidity. 4.2. Inception Voltage measurement The positive inception voltage of the test system configuration was measured using the partial discharge detector with the setting at the normal condition of 20.5˚C and 41% relative humidity. The background noise was 0.87 pC and the system was calibrated at 5 pC and the scale of the monitor was 12 pC/Div. The positive inception voltage was observed around a voltage of +24 kV, the first pulses appeared on the negative half cycle of the phase resolved-pattern and the negative inception voltage was found to be -19.5 kV. The temperature in the room at the time of measurement was 20˚C and the humidity 49%. The same calibration of 5 pC was used and a background noise of 0.59 pC was measured. Here the first appearance of corona pulses on the pattern was observed on the positive half cycle of the phase-resolved pattern with the aid of the ICM monitor PD detector. Figure 3 and 4 below show the measurement and magnitudes of the positive and negative inception voltages respectively. It must be noted that the sine wave do not represent the supply voltage as this stage we are using a DC supply, but the sine wave only represent the internal voltage used for the triggering purposes. Figure 3. Inception voltage at +24 kV for Positive voltage. Figure 4. Inception voltage at -19.5 kV for Negative voltage. As can be seen in Figure 3 and Figure 4, the magnitude of the positive inception voltage, which is 8.04 pC at 10 pC/DIV, is greater than the negative inception’s magnitude, which is 4.94 pC at a voltage of -19.5 kV with a 12 pC/DIV. 56 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 4.3. Visualization with CoroCam8 Before the measurements were performed visual detection has been used to determine if the cage was corona free. The CoroCam8 has been used to visualize the entire cage and the sharp edges. The CoroCAM8 is a combined thermal IR, solar blind corona and video camera. The combination of these three components makes it possible to do both thermal IR and UV inspections at the same time, saving time and effort. The co-location of electrical discharges and hot spots gives the inspector more insights into the cause of a fault. The CoroCam8 can be used to inspect HV infrastructure operating at 3.3 kV and above. Figure 5 below displays the visualization of an artificial corona between the ring and the stand from transparent figure 5a to UV visualization only figure 5d. Figure 5. Process from visible to UV visualization of corona. 5. EXPERIMENTAL RESULTS, VISUALIZATION AND MEASUREMENTS 5.1. Virtualization results with CoroCam8 The first step of the experiment consisted of visualizing the entire corona cage using the corona camera to make sure that the cage was corona free. Some discharge sources were found on the cage. After investigation, the main source of the discharge was found and fixed. The second step was the visualization of the corona on the conductor. Four different scenarios were observed: Dry, wet with one tap (top spray nozzles), two taps (top and one side spray nozzles) and all three taps (top and both sides spray nozzles) open. The supply voltage was constant at 100 kV DC. Figure 6. Visualization at normal condition -100 kV Figure 7. Visualization with one pipe on at -100 kV Figure 6 and 7 represent the case without mist water with the ambient temperature at 23˚C, 41% humidity and “the top spray nozzles” on with 63% humidity and the ambient temperature at 19˚C respectively. With the opening of the left side spray nozzles, the humidity increased to 78% and the temperature dropped to 18˚C and when the right side spray nozzles was opened, the humidity still increased to 85% and the temperature dropped to 17˚C. For good accuracy in the results, the corona camera was set at a fixed position during the visualization at a distance of 3.5 m a) b) c) d) 57 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management from the corona source and the test voltage at – 100 kV. Figure 8 and 9 below illustrate the two above scenarios where the mist water was having more pressure than the conditions in figures 6 and 7. Figure 8. Visualization with two pipes on at -100 kV Figure 9. Visualization with three pipes on at -100 kV When the voltage polarity was reversed (+100 kV), the partial discharge unit was again calibrated to 5.0 pC. The same set of four experiments were conducted. The temperature and humidity readings were 21˚C, humidity of 45% and 19˚C, humidity of 55% respectively. Figure 10 and 11 display the presence of corona during the measurements. Figure 10. Visualization at normal condition +100 kV Figure 11. Visualization with one pipe on at +100 kV Figure 12. Visualization with two pipes on at +100 kV Figure 13. Visualization with three pipes on at +100 kV 58 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 5.2. Measurement results with ICM monitor The third step of the experiment was the measurement of the corona discharge using the partial discharge detector at similar atmospheric conditions as mentioned earlier above. The corona magnitude was measured at negative supply voltage with the sensitivity of the instrument at 200 pC/DIV (figures 14 and 15) with a change in humidity and temperature from 63% to 85% and from 19˚C to 17˚C respectively In addition, the magnitude of the corona discharge increased from 562.9 pC to 851.1 pC. Figure 14. Measurement with two pipes on at -100 kV Figure 15. Measurement with three pipes on at -100 kV The corona magnitude was measured at positive voltage with the sensitivity of the instrument at 2 nC/DIV. Figure 17 and 18 display the patterns of the similar conditions mentioned above. An increase in the magnitude was observed from 8.54 nC to 9.39 nC with a variation in the atmospheric conditions. Figure 16. Measurement with two pipes on at +100 kV Figure 17. Measurement with three pipes on at +100kV 5.3. Summary of the measurement results The summary of the results can be summarized into table 1 below, by bringing the sensitivity of the instrument at 200 pC/DIV for all the measurements, table 1 below clearly show the difference between the positive and negative corona at the wet condition. Table 1: Results summary at different wet condition level Inception Voltage T=20˚C & H=45% T=19˚C & H=63% T=17˚C & H=85% Voltages Value +24 kV -19.5 kV 50 kV 100 kV 100 kV Positive Voltage 8.04 pC 1800 pC 8540 pC 9390 pC Negative Voltage 4.17 pC 15.7 pC 562.9 pC 851.1 pC 59 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 6. CONCLUSION In conclusion, the main objective of this experiment was to study corona on the conductor by understanding the physical mechanisms involved in corona discharge as well as the resulting corona effects under wet conditions. This study presented the corona test results obtained during the visualization and measurement of the inception voltage using the conventional method of apparent charge measurement in a corona cage having three sections under positive and negative voltages. The results show that corona inception gradient under the effect of wet conditions on HVDC Transmission line, visualization and patterns magnitude of the corona discharge increase very little with the increase of the humidity and the decrease of the temperature under the DC voltage supply. Moreover, the negative inception voltage is lower than the positive inception voltage. REFERENCES [1] T. Govender. (2015) Transmission development plan 2016-2025. ESKOM TDP. Pp.1-103. [2] S. Bisnath, A. Britten, D. Cretchley, D. Muftic, T. Pillay, and R. Vajeth, The planning, design and construction of overhead power lines. Johannesburg: Crown Publication cc, 2005. [3] L. Youwei, L. Jihong, and L. Bin, Influence of air density and humidity on the corona performance of conductor, Power System Technology, vol. 14(4), pp 46-50, 1990. [4] P. S. Maruvada, Corona Performance of High-Voltage Transmission Lines. India: Narinder Kumar Lijhara for Overseas Press India Private Limited, 2006, p. 310. [5] J.S. Djeumen & J.J. Walker, “Environmental influence on corona inception with HVDC application”, International Symposium on High Voltage Engineering, (ISH 2017), Buenos Aires, Argentina, pp. 1-6, 2017. [6] E.A. Yahaya, T. Jacob, M. Nwohu, A. Abubakar, "Power Loss due to Corona on High Voltage Transmission Lines," Journal of Electrical and Electronics Engineering (IOSR-JEEE), vol. 8, no. 3, pp. 14-19, Dec. 2013. [7] M. C. Anumaka, "Influence of Corona and Skin Effect on the Nigerian 330 kV Interconnected Power System," IJRRAS, vol. 12, no. 2, pp. 328-333, August 2012. [8] J.S. Djeumen & J.J. Walker, “Visualization of Positive and Negative Corona under Environment Influence with HVDC Application”, South African Universities of Power Engineering Conference, (SAUPEC 2017), Johannesburg, South Africa, pp. 1-6, 2017. [9] R. Urban, H. Reader, and J. P. Holtzhausen, "AC transmission line corona noise issues in a small corona cage," in Africon Conference in Africa, 2002. IEEE AFRICON. 6th, 2002, pp. 639-644 vol.2. [10] Y.-p. Liu, L.-j. Ren, y. Chen, Q.-f. Wan, S.-h. You, and W. Chen, "Research on High Altitude Corona Loss Measurement System Based on Small Corona Cage," in High Voltage Engineering and Application, 2008. ICHVE 2008. International Conference on, 2008, pp. 144-147. [11] P. Ortega, R. Dıaz, F. Heilbronner and F Ruhling, “Influence of negative ions on the humidity effect on the first corona inception”, Journal of Physics D: Applied Physics, vol. 40, pp. 7000–7007, 2007. [12] P. A. Calva and F. Espino , "Effect of the humidity in the ionic mobility in reduced air-density," in Annual Report Conference on Electrical Insulation and Dielectric Phenomena (Cat. No.98CH36257), pp. 508-511 vol. 2, 1998. [13] T. Soban, R. Ohyama, "A Study on the Humidity Effect of AC Corona Discharge for a Thin-Wire Electrode Arrangement," in Proceedings of the School of Engineering Tokai University Tokyo, Japan, 2016, pp. 1-7. 60 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management A New Fuzzy Logic-based Approach to Predict Fault in Transformer Oil Based on Health Index using Dissolved Gas Analysis Kharisma Utomo Mulyodinoto 1,2 , Suwarno 2,3 A. Abu-Siada 3,4 1 PT PLN (Persero), Jl. Trunojoyo Blok M I/135 Kebayoran Baru, 12160, Jakarta, Indonesia 2 Senior Member, IEEE 3 School of Electrical and Informatics, Institut Teknologi Bandung, Indonesia 4 Electrical and Computer Engineering Department, Curtin University, Australia ABSTRACT Dissolved gas-in-oil analysis has been widely accepted as a powerful technique to detect such incipient faults. While the measurement of dissolved gases within transformer oil samples has been standardized over the past two decades, analysis of the results is not always straight forward as it depends on personnel expertise more than mathematical formulas. IEEE C57.104-2008 standards has classified the health condition of the transformer based on the absolute value of individual dissolved gases along with the Total Dissolved Combustible Gas (TDCG) within transformer oil into 4 conditions. While the technique is easy to implement, it is considered as a very conservative technique and is not widely accepted as a reliable interpretation tool. Moreover, measured gases for the same oil sample can be within various conditions limits and hence, misinterpretation of the data is expected. To overcome this limitation, this paper introduces a fuzzy logic approach to predict the health condition of the transformer oil based on IEEE C57.104-2008 standards along with Roger ratio and IEC ratio-based methods. DGA results of 31 chosen oil samples from 469 transformer oil samples of normal transformers and pre-known fault-type transformers that were collected from Indonesia Electrical Utility Company, PT. PLN (Persero), from different voltage rating: 500/150 kV, 150/20 kV and 70/20 kV; different capacity: 500 MVA, 60 MVA, 50 MVA, 30 MVA, 20 MVA, 15 MVA and 10 MVA; and different life span, are used to test and establish the fuzzy logic model. Results show that the proposed approach is of good accuracy and can be considered as a platform toward the standardization of the dissolved gas interpretation process. Keywords — Dissolved gas analysis, Transformer, Roger ratio method, IEC ratio method, Fuzzy logic, Health Index, IEEE C57.104-2008 I. INTRODUCTION Power transformers are critical components in any transmission and primary distribution networks. Effective monitoring and diagnostic techniques must be adopted to decrease possibility of transformer catastrophic failures and increase the reliability of this asset. Transformer dielectric oil is considered as a key source to detect incipient and fast developing faults within a transformer [1]. There are several chemical and electrical diagnostic techniques to assess the health condition of transformer [2]. Dissolved gas analysis (DGA) is one of the most effective techniques to detect incipient and fast developing faults within power transformers. This technique uses the concentration of various key gases which are produced due to the decomposition of oil and paper insulation [3]. Gases such as Hydrogen (H2), Methane (CH4), Acetylene (C2H4), and Ethane (C2H6) are mainly generated due to oil decomposition [4]-[6]. On the other side, Carbon oxides (CO and CO2) may be generated due to paper decomposition, oil oxidation or due to atmospheric leak. Faults that can be detected using this technique include thermal, arcing and partial discharge faults. These faults produce particular characteristic gases, which can be used to assess and estimate the health index of the transformer [7], [8]. H2 and CH4 gases are generated due to low level energy discharge / partial discharge fault while arcing and high level energy discharge generate all gas including traceable amount of C2H2. C2H4 and C2H6 gases are often generated as a result of high thermal faults [4]. While the fault type can be identified based on the amount and type of dissolved gases, the analysis may lead to misinterpretation since there is a possibility of multiple faults cases [9]. From the expert personal experience, it is difficult to determine whether a transformer operates normally or not if it does not have history of dissolved gases along with a trending of each gas. Four criteria as given in Table 1 have been identified in the IEEE C57.104-2008 standards to classify the risk of a transformer without DGA history [12]. Based on individual gases concentrations, the transformer can be grouped into one of the four conditions listed in Table 1. The limits for all key gases in each condition group are listed in Table 2 [12]. 61 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 1. IEEE C57.104-2008 standard [12]. LEVEL 1 TDCG below this level indicates that the transformer is operating properly. If any of the gas value exceeds this level limit, investigation must be done immediately LEVEL 2 TDCG at this level indicates the level of flammable gas has exceeded the normal limit. If one of the gas values exceeds these levels, it should be investigated immediately. Possible, fault has occured. LEVEL 3 TDCG at this level indicates a high level of decomposition. If any of the gas values exceeds this level limit, an investigation must be done immediately. Possible, fault has occured. LEVEL 4 TDCG exceeds this value limit indicates a very high level of deterioration. Continuing the transformer operation may result in damage to the transformer. Table 2. Key gases limits for each condition [12] Condition 1 Condition 2 Condition 3 Condition 4 Hidrogen (H2) 100 101-700 701-1800 >1800 Methane (CH4) 120 121-400 401-1000 >1000 Acetylhene (C2H2) 1 2-9 10-35 >35 Ethylene (C2H4) 50 51-100 101-200 >200 Ethane (C2H6) 65 66-100 101-150 >150 Carbon Monoxyde (CO) 350 351-570 571-1400 >1400 Carbon Dioxyde (CO2) 2500 2500-4000 4001-10000 >10000 TDCG 720 721-1920 1921-4630 >4630 Limit Concentration of Dissolved Gas (ppm) STATUS Unfortunately, this technique can only identify the health condition of the transformer without identifying the exact type of fault e of fault within the transformer. Therefore, ratio-based methods such as Roger and IEC were developed to overcome this limitation. However, ratio-based methods cannot be used unless the concentration of the gases used in the analysis is more than certain limits [13-21]. This paper presents a fuzzy logic-based approach that combines the features of IEEE C57.104-2008 standard, Roger and IEC Ratio-based methods. II. PROPOSED SCORING INDEX AND FUZZY LOGIC MODEL The proposed DGA-based health scoring index comprises 3 scoring indexes: good with 9 scoring index, fair with 6 scoring index, and poor with 1 scoring index as shown in Table 3. The scoring index reflects the health condition of a transformer based on the measured individual dissolved gases in the oil sample. The simple workflow of this method is shown in Figure 1. The measured concentration of each gas is to be compared with the limits in Table 2. If the health index is good, no further investigation will be required. On the other hand, if the health index is fair or poor, further investigation using Roger and IEC ratio methods are used to accurately identify and quantify the fault type and level. Table 3. Proposed transformer dissolved gas health index Condition Scoring Index Explanation Good 9 Indicates that transformer is working and operating properly Fair 6 Indicates that transformer oil should be investigated immediately using dissolved gas analysis interpretation techniques such as ratio methods (Doernenberg, Roger ratio, IEC), Duval triangle method to accurately identify the fault type. The oil must be re-tested and re-sampled to obtain the trending and dissolve combustible gas growth rate. Poor 1 Indicates that the transformer have problems especially with oil decomposition and high level of deterioration. The transformer should be investigated immediately using dissolved gas analysis interpretation techniques such as ratio methods (Doernenberg, Roger ratio, IEC), Duval triangle method to accurately identify the fault type. The transformer’s operation must be stopped for further analysis. 62 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 1. Flowchart of first assessment using IEEE C57.104-2008 The health index score should be 1 (Poor) in the following cases: -If there are minimum of two key gases (H2, CH4, C2H4, and C2H2) not in condition 1 and at least one gas (H2, CH4, C2H4, C2H2, C2H6, and CO) within conditions 2, 3 or 4. -If there are minimum of two gases ((H2, CH4, C2H4, C2H2, C2H6, and CO) not in condition 1 and at least one gas (H2, CH4, C2H4, C2H2, C2H6, and CO) within condition 3 or 4. The health index score should be 6 (Fair) in the following cases: -If there are minimum of two dissolved combustible gases (H2, CH4, C2H4, C2H2, C2H6, and CO) not in condition 1 and at least one gas (H2, CH4, C2H4, C2H2, C2H6, and CO) within condition 2. -If there are minimum of one gas (H2, CH4, C2H4, C2H2, C2H6, and CO) not in condition 1 and at least one gas (H2, CH4, C2H4, C2H2, C2H6, and CO) within reach condition 2, 3 or 4. The health index score should be 9 (Good) in the following cases: -If all of dissolved gases (H2, CH4, C2H4, C2H2, C2H6, and CO) in condition 1. Fuzzy logic rules have been developed using MATLAB fuzzy logic toolbox based on the above as listed in Table 4. Table 4. The logic bit for the input and output of the proposed fuzzy logic approach 1 (High) 1 (High) 1 (High) 0 (Low) 0 (Low) 1 POOR 1 (High) 1 (High) 1 (High) 1 (High) 1 (High) 1 POOR 1 (High) 1 (High) 1 (High) 0 (Low) 1 (High) 1 POOR 0 (Low) 1 (High) 1 (High) 0 (Low) 0 (Low) 6 FAIR 0 (Low) 1 (High) 1 (High) 1 (High) 1 (High) 1 POOR 0 (Low) 1 (High) 1 (High) 0 (Low) 1 (High) 1 POOR 0 (Low) 0 (Low) 1 (High) 0 (Low) 0 (Low) 6 FAIR 0 (Low) 0 (Low) 1 (High) 1 (High) 1 (High) 6 FAIR 0 (Low) 0 (Low) 1 (High) 0 (Low) 1 (High) 6 FAIR 0 (Low) 0 (Low) 0 (Low) 0 (Low) 0 (Low) 9 GOOD 2 keygas >condition 1 state 2 dissolved combustible gas >condition 1 state 1 dissolved combustible gas >condition 1 state 1 dissolved combustible gas condition 4 state 1 dissolved combustible gas condition 4 state Output of Fuzzy Logic Output Result Inter- pretation Logic Bit Input The fuzzy logic output of the IEEE C57.104-2008 comprises three triangle membership functions that reflect the health index of the transformer based on Table 4. 63 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Among the ratio-based methods, Roger ratio method (RRM) and IEC ratio method (IRM) are most widely used by utilities companies, including the Indonesian State Own Electric Utility Company, PT. PLN (Persero). Tables 5 shows proposed fuzzy logic outputs of these two methods. The output membership functions are shown in Figure 2. Figure 2. Output membership function of RRM and IRM Table 5. Fuzzy logic output of RRM and IRM Inputs of RRM fuzzy logic model are the three ratios CH4/H2, C2H2/C2H6 and C2H4/C2H6 with member ship functions based on Table 6. Inputs of IRM are the same ratios as in RRM with different ranges and fault types as per Table 7. Table 6. Fault Diagnosis Interpretation of Roger Ratio Method [4] RATIO FAULT DIAGNOSIS R2 R1 R5 C2H2 / C2H4 CH4 / H2 C2H4 / C2H6 < 0,1 0,1 - 1 < 1 0 : Normal < 0,1 < 0,1 < 1 1 : Partial Discharge 0,1 - 3 0,1 - 1 > 3 2 : Arcing - high energy discharge < 0,1 0,1 - 1 1 – 3 3 : Low temperature thermal < 0,1 > 1 1 – 3 4 : Thermal < 700°C < 0,1 > 1 > 3 5 : Thermal > 700°C Table 7. Fault Diagnosis Interpretation based on IEC 60599 [22] RATIO FAULT DIAGNOSIS R1 R2 R5 CH4 / H2 C2H2 / C2H4 C2H4 / C2H6 < 0,1 NS < 0,2 PD : Partial Discharge 0,1 – 0,5 > 1 > 1 D1 : Low Energy Discharge 0,1 - 1 0,6 – 2,5 > 2 D2 : High Energy Discharge > 1 (NS) NS < 1 T1 : Thermal fault < 300°C > 1 < 0,1 1 – 4 T2 : Thermal fault 300°C-700°C > 1 < 0,2 > 4 T3 : Thermal fault > 700°C All fuzzy logic models are developed in accordance to the fuzzy inference flowchart shown in Figure 3. Figure 3. Fuzzy Logic Inference Flowchart M eth o d F1 Thermal in Cellulose Fault F2 Thermal in Oil Fault F3 Partial Discharge Fault F4 Arcing Fault F5 (N o rm al co n d itio n ) F6 (O u t O f C o d e C o n d itio n ) R o ger R atio M eth o d - Low Temperature Thermal - Thermal Fault <700°C Low Energy Density Arcing High Energy Discharge (Arcing) - Thermal Fault >700°C IEC R atio M eth o d - Thermal Fault <300°C - Thermal Fault between 300°-700°C Low Energy Discharge (Partial DIscharge) High Energy Discharge (Arcing) - Thermal Fault >700°C 64 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 5 The overall proposed fuzzy logic model is a combination of the three methods discussed above as shown in in Figure 4. The logic table of this model is given in Table 8. Individual Concentration Limit of TDCG output Total Dissolved Combustible Gas (TDCG) value Roger Ratio Method (RRM) output value IEC Ratio Method (IRM) output value Output 9 (Good) All Condition All Condition All Condition Normal normal not identified not identified not identified normal not identified not identified not identified not identified Partial Discharge Partial Discharge Fault not identified Arcing Arcing Fault Partial Discharge not identified Partial Discharge Fault Arcing not identified Arcing Fault normal Partial Discharge Partial Discharge Fault normal Arcing Arcing Fault Partial Discharge Partial Discharge Partial Discharge Fault Partial Discharge Arcing Arcing Fault Arcing Partial Discharge Arcing Fault Arcing Arcing Arcing Fault not identified thermal in cellulose fault Thermal in Cellulose Fault not identified thermal in oil fault Thermal in Oil Fault thermal in cellulose fault not identified Thermal in Cellulose Fault thermal in oil fault not identified Thermal in Oil Fault normal thermal in cellulose fault Thermal in Cellulose Fault normal thermal in oil fault Thermal in Oil Fault thermal in cellulose fault thermal in cellulose fault Thermal in Cellulose Fault thermal in cellulose fault thermal in oil fault Thermal in Oil Fault thermal in oil fault thermal in cellulose fault Thermal in Oil Fault thermal in oil fault thermal in oil fault Thermal in Oil Fault Partial Discharge thermal in cellulose fault Partial Discharge thermal in oil fault Arcing thermal in cellulose fault Arcing thermal in oil fault thermal in cellulose fault Partial Discharge thermal in oil fault Partial Discharge thermal in cellulose fault Arcing thermal in oil fault Arcing not identified Partial Discharge Partial Discharge Fault not identified Arcing Arcing Fault Partial Discharge not identified Partial Discharge Fault Arcing not identified Arcing Fault normal Partial Discharge Partial Discharge Fault normal Arcing Arcing Fault Partial Discharge Partial Discharge Partial Discharge Fault Partial Discharge Arcing Arcing Fault Arcing Partial Discharge Arcing Fault Arcing Arcing Arcing Fault not identified thermal in cellulose fault Thermal in Cellulose Fault not identified thermal in oil fault Thermal in Oil Fault thermal in cellulose fault not identified Thermal in Cellulose Fault thermal in oil fault not identified Thermal in Oil Fault normal thermal in cellulose fault Thermal in Cellulose Fault normal thermal in oil fault Thermal in Oil Fault thermal in cellulose fault thermal in cellulose fault Thermal in Cellulose Fault thermal in cellulose fault thermal in oil fault Thermal in Oil Fault thermal in oil fault thermal in cellulose fault Thermal in Oil Fault thermal in oil fault thermal in oil fault Thermal in Oil Fault not identified not identified normal not identified Partial Discharge thermal in cellulose fault Partial Discharge thermal in oil fault Arcing thermal in cellulose fault Arcing thermal in oil fault thermal in cellulose fault Partial Discharge thermal in oil fault Partial Discharge thermal in cellulose fault Arcing thermal in oil fault Arcing 6 (Fair) Condition 1 / Condition 2 / Condition 3 / Condition 4 Discharge Thermal Fault 1 (Poor) Condition 1 / Condition 2 / Condition 3 / Condition 4 Discharge Thermal Fault 6 (Fair) Condition 1 Normal 6 (Fair) Condition 2 / Condition 3 / Condition 4 Discharge Thermal Fault Table 8. Inputs and outputs of the proposed overall model 65 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 9. Data Simulation Result of Dissolved Gas Transformer Oil Samples Figure 4. Overall fuzzy logic-based model Sample Number Power Voltage Rating Operational Ages (Years) (CO) (H2) (CH4) (C2H4) (C2H6) (C2H2) Simulation Individual gas TDCG Simulation TDCG each gas Result Simulation TDCG Result Simulation Roger Ratio Result Simulation IEC Ratio Result Output Propose d Fuzzy Output Result Real Condition 1 500 MVA 500/150 kV 7 0 96,896 84,038 3,675 3,675 0 9 GOOD CONDITION 1 Thermal Fault in Cellulose Not Identified 10,99813 Normal Normal 2 20 MVA 150/20 kV 29 244,946 42,664 18,543 1,717 37,789 0 9 GOOD CONDITION 1 Normal Thermal Fault in Cellulose 10,99846 Normal Normal 3 60 MVA 150/20 kV 12 16,36 36,24 6,45 4,26 12,36 1,42 9 GOOD CONDITION 1 Not Identified Thermal Fault in Cellulose 10,99846 Normal Normal 4 20 MVA 150/20 kV 35 91,16 47,49 0 3,75 6,89 0 9 GOOD CONDITION 1 Partial Discharge Not Identified 10,99846 Normal Normal 5 60 MVA 150/20 kV 6 357,855 156,3 10,72 8,9 5,21 0 6 FAIR CONDITION 1 Not Identified Not Identified 10,99846 Normal Normal 6 30 MVA 150/20 kV 25 124,03 481,82 9,03 5,72 37,04 0 6 FAIR CONDITION 1 Partial Discharge Partial Discharge 4,998462 Partial Discharge Fault Fault, High Temperature in Winding 7 50 MVA 150/20 kV 12 492,34 20 0 1,2 3,19 0 6 FAIR CONDITION 1 Partial Discharge Not Identified 4,998462 Partial Discharge Fault Fault, Partial Discharge 8 30 MVA 150/20 kV 8 36,92 20 0 2,23 46,34 14,3 6 FAIR CONDITION 1 Not Identified Partial Discharge 4,998462 Partial Discharge Fault Fault, Partial Discharge 9 60 MVA 150/20 kV 12 412,8 54,1 63,8 77 60,9 0 6 FAIR CONDITION 1 Thermal Fault in Oil Thermal Fault in Oil 7,001538 Thermal in Oil Fault Fault, overheat in oil 10 60 MVA 150/20 kV 25 128,212 51,026 74,558 60,841 62,067 1 6 FAIR CONDITION 1 Arcing Thermal Fault in Cellulose 1,004054 Discharge and Thermal Fault Fault, arcing and overheat 11 10 MVA 150/20 kV 17 0 20 50,32 4,67 154,93 32,63 1 POOR CONDITION 1 Not Identified Thermal Fault in Cellulose 9 Thermal in Cellulose Fault Fault, overheat-high temperature 12 60 MVA 150/20 kV 15 154,665 79,16 0 78,825 106,898 0 1 POOR CONDITION 1 Partial Discharge Not Identified 4,998462 Partial Discharge Fault Fault, Partial Discharge and overheat 13 30 MVA 150/20 kV 22 0 20 223,26 98,05 5,67 236,6 1 POOR CONDITION 1 Not Identified Not Identified 1,001538 Discharge and Thermal Fault Fault, Partial Discharge and overheat 14 30 MVA 150/20 kV 21 16,41 20 0 4,94 206,71 14,67 1 POOR CONDITION 1 Not Identified Partial Discharge 4,998462 Partial Discharge Fault Fault, Partial Discharge 15 500 MVA 500/150 kV 7 640 46 21 13 5 1 6 FAIR CONDITION 2 Thermal Fault in Cellulose Not Identified 9 Thermal in Cellulose Fault Fault, overheat-high temperature 16 60 MVA 150/20 kV 19 623,379 67,019 19,603 49,183 1 1 6 FAIR CONDITION 2 Not Identified Not Identified 1,001538 Discharge and Thermal Fault Fault, Partial Discharge and overheat 17 30 MVA 150/20 kV 23 883,07 20 22,55 42,37 195,19 0 1 POOR CONDITION 2 Not Identified Thermal Fault in Cellulose 9 Thermal in Cellulose Fault Fault, overheat-high temperature 18 30 MVA 150/20 kV 15 689,093 201,78 30,81 89,921 89,921 0 1 POOR CONDITION 2 Thermal Fault in Cellulose Not Identified 9 Thermal in Cellulose Fault Fault, High Temperature in Winding 19 30 MVA 150/20 kV 12 869,921 177,47 10,945 146,379 146,379 0 1 POOR CONDITION 2 Not Identified Not Identified 1,002611 Discharge and Thermal Fault Fault, Partial Discharge and High Temperature in Winding 20 50 MVA 150/20 kV 12 673,88 183,54 184,39 34,09 382,27 0 1 POOR CONDITION 2 Arcing Thermal Fault in Cellulose 1,001527 Discharge and Thermal Fault Fault, arcing and overheat 21 50 MVA 150/20 kV 23 556,64 478,42 14,74 2,56 35,39 21,29 1 POOR CONDITION 2 Not Identified Partial Discharge 4,998462 Partial Discharge Fault Fault, Partial Discharge 22 60 MVA 150/20 kV 22 2354,91 20 21,42 2,27 6,28 0 6 FAIR CONDITION 3 Not Identified Thermal Fault in Cellulose 9 Thermal in Cellulose Fault Fault, overheat-high temperature 23 50 MVA 150/20 kV 23 0 2230,9 10,78 6,89 4,61 0 6 FAIR CONDITION 3 Not Identified Not Identified 1,001538 Discharge and Thermal Fault Fault, Partial Discharge and overheat 24 60 MVA 150/20 kV 22 0 36,98 2943,7 34,99 2,97 0 6 FAIR CONDITION 3 Thermal Fault in Oil Thermal Fault in Oil 7,001538 Thermal in Oil Fault Fault, overheat in oil 25 15 MVA 150/20 kV 32 3300 112 73 64 26 0 1 POOR CONDITION 3 Thermal Fault in Cellulose Not Identified 9 Thermal in Cellulose Fault Fault, overheat-high temperature 26 30 MVA 150/20 kV 20 1589,71 522,08 35,88 32,85 18,61 187,19 1 POOR CONDITION 3 Not Identified Not Identified 1,001538 Discharge and Thermal Fault Fault, Partial Discharge and High Temperature in Oil 27 30 MVA 150/20 kV 8 749,56 1322,2 627,77 1032,55 158,48 1139,65 1 POOR CONDITION 4 Arcing Arcing 3 Arcing Fault Fault, Arcing in Oil 28 50 MVA 150/20 kV 12 0 1454 1179,9 440,45 642,27 1028,94 1 POOR CONDITION 4 Not Identified Thermal Fault in Cellulose 9 Thermal in Cellulose Fault Fault, High Temperature in Winding 29 60 MVA 150/20 kV 18 473,41 2591,1 907,02 971,03 639,85 2058,19 1 POOR CONDITION 4 Not Identified Partial Discharge 4,998462 Partial Discharge Fault Fault, Partial Discharge 30 20 MVA 70/20 kV 26 0 247,34 216,59 3547,55 1036,51 0 1 POOR CONDITION 4 Not Identified Not Identified 1,001538 Discharge and Thermal Fault Fault, arcing and high temperature 31 60 MVA 150/20 kV 13 3255,27 3324,9 378,33 6,39 80,92 0 1 POOR CONDITION 4 Normal Thermal Fault in Cellulose 9 Thermal in Cellulose Fault Fault, overheat-high temperature 66 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3. VALIDATION OF PROPOSED FUZZY LOGIC MODEL To assess the accuracy of the proposed model in identifying and quantifying various faults based on the three combined DGA interpretation techniques, 31 dissolved gas transformer oil data were collected from Indonesia National Electrical Utility Company, PT. PLN (Persero) Jawa, Bali, and Sumatera. To design the fuzzy logic model, 469 transformer oil samples that were collected from different voltage rating: 500/150 kV, 150/20 kV, and 70/20 kV; different capacity: 500 MVA, 60 MVA, 50 MVA, 30 MVA, 20 MVA, 15 MVA, and 10 MVA; and different life span from 7 years to 35 years operational ages, are used to test and establish the fuzzy logic model. Normal and pre-known fault types transformers are tested using the proposed fuzzy logic model as shown in Table 9. From 31 transformer oil samples, there are 5 transformers in good condition: sample number 1 to 5; and 26 transformers in faulty condition: sample number 6 to 31. The normal transformers that were chosen in table 9 had different life span and operational ages: 2 transformer with long operational ages, 1 transformer with medium operational ages and 2 transformer with early operational ages. The data samples of faulty transformers that were chosen also reflect that 3 condition of operational ages: long operational ages, normal operational ages, and early operational ages. Each samples also have unique combination of each dissolved gas analysis interpretation method so it can check the result of proposed fuzzy logic model with the real fault of transformers. Results in Table 9 reveal a high agreement of the model output with the real fault. It proves that proposed fuzzy logic approach which combine several methods: Individual Dissolved Combustible Gas concentration called Health Index, TDCG, Roger Ratio Methods, and IEC Ratio Method, can be used and can be applied to different voltage rating, different power rating, and different lifespan operational ages of transformer with satisfying result. As such, it can be concluded that the proposed model can be used to identify the type of internal faults within a transformer based on dissolved gas analysis data with high degree of accuracy. IV. CONCLUSION This paper presents a fuzzy logic-based technique that combines the feature of three dissolved gas analysis interpretation techniques; IEEE C57.104-2008, Roger and IEC ratio methods. The proposed technique is aiming at overcoming the limitations involved in these methods. This includes the conservative nature of the IEEE method and the inability of the ratio methods to diagnose faults when gases of low concentrations exist within the transformer oil samples. Accuracy of the proposed technique to identify various faults has been assessed by comparing the output of the developed model with the pre-known faults of 31 samples collected from PT. PLN (Persero) Jawa, Bali, and Sumatera with different voltage rating, different power rating, and different lifespan operational ages. Results reveal the high agreement between the model’s output and the real fault types. The proposed expert system can be used by non- experienced persons to help them gaining an experience in dissolved gas analysis. REFERENCES [1] A. Abu-Siada and S. Islam ―A Novel online technique to detect Power Transformer Winding Fault.‖ IEEE Trans Power Delivery, Vol. 27 No 2, pp. 849-857, 2012. [2] T. K. Saha. ―Review of Modern diagnostic techniques for assessing insulation condition in aged transformer Dielectric and Electrical Insulation, IEEE Transaction on Vol. 10, pp 903-917, 2013. [3] N. A. Bakar, A. Abu Siada, and S. Islam. ―A review of dissolved gas analysis measurement and interpretation techniques.‖ IEEE Electrical Insulation Magazine, Vol. 30, No. 3, pp 39-49, 2014. [4] ―IEEE Guide for the interpretation of Gases Generated in Oil Immersed Transformers—Redline.‖ IEEE Std. C57.104-2008, pp, 1-45, 2009. [5] ―IEEE Guide for the Detection and Determination of Generated Gas in Oil-Immersed Transformer and the relation to Serviceability of the Equipment.‖ ANSI/IEEE C57.104-1978 p. 0-1, 1978. [6] ―IEEE Guide for the Interpretation on Gases Generated in Oil-Immersed Transformer.‖ IEEE Std. C57.104-1991, p 0-1, 1992. [7] M. Duval and A. dePabla,‖Interpretation of gas in oil analysis using new IEC Publication 60599 and IEC TC 10 databases.‖ IEEE Electrical Insulation Magazine, Vol. 17, pp. 31-41, 2001. [8] H-C Sun, Y-C Huang, and C-M. Huang. ―A Review of Dissolved Gas Analysis in Power Transformer.‖ Energy Procedia, Vol. 14, pp. 1220-1225, 2012. 67 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [9] A. Abu-Siada and S. Islam.‖ A new approach to identify power transformer and asset management decision based on dissolved gas in oil analysis.‖ IEEE Transaction on Dielectric and Electrical Insulation, vol 19, pp. 1007-1012, 2012. [10] H. Li and M. Gupta, Fuzzy Logic and Intelligent Systems, International Series in Intelligent Technologies, Kluwer Academic Publisher, 1995. [11] A. Abu-Siada, S. Hmood, and S. Islam. ―A New Fuzzy Logic Approach for Consistent Interpretation of Dissolved Gas-in-Oil Analysis.‖ IEEE Transactions on Dielectrics and Electrical Insulation, Vol. 20, No. 6, pp. 2348, December 2013. [12] ―IEEE Guide for the Interpretation on Gases Generated in Oil-Immersed Transformer.‖ IEEE Std. C57.104-2008, p 9-10, 2008. [13] M. Arshad and S. Islam,‖A Novel Fuzzy Logic Technique for Power Transformer Asset Management,‖ in Industry Application Conference, 2006, 41st IAS Annual Meeting Conference Record of the 2006 IEEE, 2006, pp 276-286. [14] M. Arshad, S. Islam, and A. Khaliq,‖ Fuzzy logic approach in power transformer management and decision making,‖IEEE Transactions on Dielectric and Electrical Insulation,vol 20, pp. 2343-2354, 2014. [15] A. Abu-Siada, M. Arshad, and S. Islam, ‖Fuzzy logic approach to identify transformer critically using dissolve gas analysis,‖ in Power and Energy Society General Meeting, 2010, IEEE, pp. 1-5, 2010. [16] S. Mofizul Islam, T. Wu, and G. Ledwich,‖ A novel fuzzy logic approach to transformer fault diagnosis,‖ IEEE Transaction on Dielectric and Electrical Insulation, vol.7, pp.177-186, 2000. [17] S. Hmood, A. Abu Siada, Mohammad A.S. Masoum amd Syed M. Islam. ―Standardization of DGA Interpretation Techniques using Fuzzy Logic Approach.‖ 2012 IEEE Int’l Conf. on Condition Monitoring and Diagnosis, 23-27 September 2012, Bali, Indonesia. Pp. 929-932, 2012. [18] A. Abu-Siada and S. Hmood, ‖Fuzzy Logic Approach for Power Transformer Asset Management Based on Dissolved Gas-in-Oil Analysis‖, Chemical Engineering Transactions, Vol 33, No 2, pp. 997-1002, September 2013. [19] N. A. Muhammad, B.T. Phung, and T. R. Blackburn, ‖Comparative study and analysis of DGA methods for mineral oil using fuzzy logic‖, Internasional Conf. Power Engineering (IPEC), pp. 1301-1306, 2007. [20] M. Hongzhong, L. Zheng, P. Ju, H. Jingdong, and Z. Limin, ―Diagnosis of power transformer faults on fuzzy three-ratio method‖, 7th International Conference Power Engineering (IPEC), pp. 1-4, 2005. [21] S. M. Islam, T. Wu, and G. Ledwich,‖ A novel fuzzy logic approach to transformer fault diagnosis,‖ IEEE Transaction Dielectric Electrical Insulation, Vol. 7, pp. 177-186, 2000. [22] ―International Standard Mineral Oil-Filled Equipment in service – Guidance on the interpretation of dissolved and free gases analysis.‖ IEC 60599 Edition 3.0, p 14-15, 2015. 68 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Condition monitoring thermal properties of a 20A hydraulic-magnetic MCB R. Kleynhans1, P.A. van Vuuren1, R. A. Thomas2 1 School of Electrical, Electronic and Computer Engineering, North-West University, South Africa 2 School of Management, Swansea University, Bay Campus, Swansea, Wales, UK ABSTRACT Heat is one of the major causes of degradation which can cause a reduction in performance, reliability and life span of miniature circuit breakers (MCBs) [1]. In order to diagnose the condition of electrical equipment, the thermal stress of that equipment should be known [2]. Due to the compact design of MCB’s it is important to minimize or dissipate the heat that builds up inside MCBs during normal use. With time electrical components and contact surfaces heats up and begin to deteriorate, this increase in heat is often a product of increased electrical resistance which is a result of degradation [3]. Infrared thermography is a condition monitoring technique that can be applied to electrical distribution boards but because of design limitations MCBs are stacked next to each other on a rail in a panel allowing a direct view only from the front. Is the surface temperature of the MCB a reflection of the internal degradation or are there other factors that need to be considered to accurately diagnose the MCB? The following methodology for the exploratory study was followed to determine the heat profile after degradation, a 20A hydraulic-magnetic MCB was switched 9000 times at rated voltage and current. The volt drop across the conduction path as well as thermal images of the front and sides were taken after every 500 switching operations, after the MCB was allowed to heat up for 1 hour at rated current. When comparing the surface temperature rise above ambient of the three sides the results shows that the number of operations does not influence the surface temperature of the front and the left (hydraulic magnetic unit) as much as the right (conduction path). The aim of this paper is to use a lumped model and compare with empirical infrared thermographic results relating to the condition of MCBs and to: Define the MCB surface heat distribution after a number of switching operations at rated current. Determine whether the frontal surface temperature of the MCB is an accurate indicator of internal degradation. Determine a field methodology to determine degradation and to anticipate failure of MCBs? Due to the construction of the MCB the front, left and right sides give different heat patterns that complicate matter with regards to determining the correct internal health state of the MCB. Keywords: condition monitoring, thermal imaging, miniature circuit breaker, degradation, hydraulic magnetic 1. INTRODUCTION An electrical installation protection system is the most important component affecting the performance and safety of personnel under abnormal conditions. Protection devices such as miniature circuit breaker (MCB) are designed to detect electrical faults and to promptly disconnect the fault component, in order to reduce the effect of the fault on the rest of the installation. The speed at which the MCB disconnect the fault is dependent on the type of MCB and can be affected by temperature, for example, fatigue failure. The importance of protection devices such as circuit breakers in a LV (low voltage) network are often overlooked and may lead to unnecessary damage to expensive equipment [3]. A small fraction of downtime during a system failure is spent to maintain or repair the faulty components; however, during this downtime 80% is spent trying to locate the source of the fault [4]. In terms of electrical failure the same natural laws i.e. Stefan-Boltzmann law, Ohm’s law, Lenz’s law, Newton’s, Faraday’s law etc. are applied in normal operation of equipment. Electrical component failure is thus obeying natural laws when it fails. Electrical degradation processes may include electrical heating, thermal degradation and contact resistance [5]. The uncontrolled flow of electrical energy is generally the result of misapplication, misuse or accident and manifests itself in the form of electrical fault currents either in intended or unintended electrical circuits or paths [6]. Thermal imaging has proven itself as a useful condition monitoring tool as part of an intelligent asset management program to prevent unplanned breakdowns and to increase plant availability [2]. 69 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Due to the internal mechanical connections of MCB’s which can radiate heat under load, it is not always straightforward to pin point the root cause of the excessive heat due to resistance. Early detection and analysis will lead to better prediction of remaining useful life estimates. Prognostics and health management for MCB’s will detect, isolate and predict the onset and source of degradation as well as time to failure [7]. Misdiagnosed thermal anomalies can result in wasted time and money [8]. As heat is a product of normal energy associated with normal operations, heat inside an electrical system can be seen as wasted energy and also contributes to the wear of components whether it is insulation material or mechanical parts. The thermal performance of MCB’s is dependent on the geometry of their construction, materials used, dynamics of operation and the aerodynamics in the cabinet. A hydraulic magnetic trip unit uses the induced magnetic flux to separate the contacts and not heat due to current flow. Hydraulic magnetic thermal MCB’s are usually temperature stable and aren’t affected by ambient changes as the case is with bimetallic trip units. A lumped model is useful to determine the thermal distribution inside a MCB often in a topographic way which could be compared to the thermographic results. The difficulty in this is that the heat generation mechanisms are often tightly coupled with the electrical operation of the circuit. In order to create a lumped parameter model of the MCB the individual segment resistance of the current path should be determined. Due to these segments being arranged in series a simple network model of the whole circuit can be built. The resistance across the terminals will then give an indication of the combined resistance of the conduction path. The electrical notations for “current”, “voltage” and “electrical resistance” can be replaced by “heat flux (power)”, “temperature” and “ thermal resistance” in a lumped parameter model [9]. Modeling a thermal equivalent circuit consist of three steps:  Identification of heat transfer mechanism.  Definition of thermal parameters of each part of the MCB.  Validation of the model [10]. The rest of this paper is structured as follows: Section 2 provides an overview of related work. Background information on the construction and operation of MCBs is given in Section 3. Section 4 describes the methodology that was followed to prepare the exploratory practical experiments. Section 5 summarizes the results and Section 6 discusses those results. Lastly, a conclusion is drawn in Section 7. 2. RELATED WORK Siegel et al. concluded that contact points inside a MCB influences the electrical and thermal resistance. The electrical resistance is responsible for terminal temperature rise and the thermal resistance the temperature of the internal hot spot [11]. The majority of MCB simulations found to validate experimental results were completed to determine the forces and arc behavior inside a MCB. Arulanantha, 2015, validated that an inverse problem can be used to model the complex arcing phenomenon, which is governed by coupled physics of electromagnetism and thermal dynamics [12]. Stammberger et al. discussed lumped network models for calculating short circuit current, sequential coupling of finite element codes and tight coupling of magnetic field and fluid dynamics that run simultaneously. Proving that calculating the thermal behavior is complex due to the combination of separate finite element and finite volume calculations where boundary conditions and source terms are exchanged manually [13]. Barcikowki, 2002, performed comprehensive analysis on heat transportation in switchgear and came to the conclusion that convection and radiation inside a device has a negligible effect on the resulting temperature distribution. Gregory B McIntosh, 2013, discussed various considerations for assessing the condition of electrical connections using infrared thermography revealing a thermal anomaly but limited his work to electrical connections. 3. BACKGROUND ON THE CONSTRUCTION AND PRINCIPLE OF OPERATION OF A HYDRAULIC MAGNETIC MCB MCB construction is simple but very precise. With no replacement parts the unit is replaced when it fails to operate as per manufacturer’s chart [14]. LV MCBs must be able to carry rated current, interrupt an overload current, successfully interrupt short circuits and enable manual on/off operations within prescribed temperature and humidity ranges [15]. 70 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management In order for a MCB to carry out the four functions mentioned above it needs to consist of five sub-systems either independent or as a combination: I. Conduction path – Carrying the rated current and consist of conductors and contact tips. Since the conduction path is the only heat source it is primarily responsible for the temperature rise of the MCB [15]. The conduction path starts with the line terminal, through the moving contacts into the top and bottom frame, through the sensing unit and out the load terminal. II. Sensing unit – Detect the overload and initiate opening of the moving contacts. A hydraulic magnetic MCB operates by using the magnetic force produced by the current flowing through a coil where the coil and the load are in series. The coil is wound around a hermetically sealed tube containg a mild steel core, a spring and silicon oil as a hydraulic damping fluid.The silicon oil slows the movement of the core’s velocity to create a controlled time delay. The magnetic flux created by the coil during an overcurrent produces sufficient pull on the core to start movement towards the pole piece. With a persistant overload the core reaches the pole piece and by doing so the reluctance of the magnetic circuit drops considerably. The armature is attracted to the pole piece with sufficient force to collapse the latching mechanism [15]. During a short circuit the magnetic force produced by the coil is sufficient to attract the armature to the pole piece and trip the MCB. During this operation the core has not moved [15]. III. Arc Chamber – This is where the arc is formed during contact opening and is controled and extinguished to interrupt the current. When the contacts open a magnetic force blows the arc along the arc runners and into the arc grids. Erosion is reduced if the arc is removed from the contact area.The purpose of the arc grids is to split the arc into a number of shorter arcs and to cool the arc. This will raise the impedance and hence increase the arc voltage drop. The rapid rise in arc voltage limits the current. Current ceases to flow and the arc is extinguished when the current is forced to zero. The damage to the MCB is determined by the amount of energy let-through and is determined by the opening time of the moving contacts and efficiency of the magnetic force which blows the arc off the contact tips and into the arc grids [15]. IV. Mechanism – Open automatically when activated by the sensing unit or open and close the moving contacts manually [15]. The speed is dependent on the type of MCB. V. Molded case – The housing of the MCB which is made out of [15]. Figure 1 shows a photograph of the components and configuration inside an MCB. Figure1: Photograph showing the typical components of a miniature circuit breaker. On/off handle Source terminal Sensing unit Load terminalArc chamber Moving contact 71 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 4. METHODOLOGY An electrical lumped parameter model of the main heat generating components in a MCB is as follows: Figure 2: Circuit diagram of the electrical domain of an MCB Where:  Isource represents the external DC power supply  RTS represents the resistance at the terminal contact between the source and the MCB  RCS represents the resistance at the contact point between the switch and the source  Rcu represents the copper resistance of all conductors in the system (lumped together)  RCL represents the resistance of the connection point between the switching mechanism and the load  RTL represents the resistance of the terminal contact between the MCB and the load The electrical resistance of a length of conductor is calculated from:= ,……………….………………………………….(1) With:  ρ = resistivity [Ω.m]  l = length of conductor [m]  A = cross section area [m2] The resistivity in turn is temperature dependent: − = ( − ) , ………………………………………..(2) With:  0 = resistivity at reference temperature T0 [Ω.m]  T = current temperature [⁰C]  = resistivity at the current temperature [Ω.m]  α = temperature coefficient of resistance [1/⁰C] For the practical experiment a commercially available 20A curve 2 single pole MCB was used to perform an accelerated endurance test at ambient temperature which are aligned with the IEC 60947-2 international standard. The aim of an endurance test is to check the electrical and mechanical performance after a number of switching operations at rated voltage and current [16]. The following steps were followed: Step 1: A never operated MCB was placed on a static test bench where it was allowed to heat up at rated current, 20A, for 1 hour at ambient temperature, i.e. until steady state has been reached. The load simulation was carried out with a DC power source to maintain a constant current, to limit the additional heat generation, due to the influence of alternating current, like eddy currents and core losses [11]. Infrared thermal images were taken of the front, left and right side of the MCB with a FLUKE Ti400 radiometer (7.5μm to 14μm, ≤0.05⁰C at 30⁰C target temp). It was 72 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management assumed that the emissivity of the MCB housing was 0.96. The ambient temperature was recorded with a Kestrel 3000 pocket weather meter to determine the temperature rise above ambient. The volt drop over the terminals was measured to determine the combined resistance of the conduction path. The total resistance of the conduction path is the sum of the inherent resistances of the components as well as the sum of the connection resistances. The thermal images as well as the resistance values obtained were used as reference values. Step 2: The MCB was moved to the endurance test bench and positioned in the off position for 9 seconds, then switched to the on position for 1 second. This test was performed at rated current, 20A, and voltage, 250V. The test frequency was 6 switching cycles per minute. After 500 switching cycles the MCB was moved from the endurance test bench back to the static test bench. Step 3: The MCB was placed on the static test bench where it was allowed to heat up at rated current, 20A, for 1 hour. After steady state has been reached thermal images of the front and sides of the MCB were taken to evaluate the temperature rise after some degradation has set in and to compare the temperature rise to the reference values obtained in step 1. The ambient temperature was recorded to determine the temperature rise above ambient. The volt drop over the terminals was also measured to determine the combined resistance of the conduction path to record the change in resistance after multiple switching operations at rated current and voltage. Step 2 and step 3 was repeated until the MCB has reached 9000 operations. 5. RESULTS A graphical representation in Figure 3 illustrates the surface temperature rise above ambient of the highest temperatures recorded on the front, left and right sides of the MCB starting with the reference values up to 9000 switching operations. The conducted and/or convected temperatures on the outer shell of the MCB of interest on the right is that of the top frame and on the left that of the hydraulic magnetic unit. During the experiment the ambient temperature fluctuated by 3.9⁰C. The ΔT for the rise above ambient on the front, left and right sides were 1.58⁰C, 3.91⁰C and 11.13⁰C between 0 and 9000 operations. Figure 3 Temperature rise above ambient of a 20A MCB after multiple switching cycles at full load The change in the measured resistance (black line) in Figure 4 shows the conduction path after multiple operations at rated current and voltage. The blue line represents the calculated resistance value of the conduction path over the switching period. The fact that it is much lower than the measured resistance can be contributed to the fact that the exact transfer mechanism as well as the exact parameters of each part of the MCB was not known. The temperature 0 5 10 15 20 25 30 35 40 Te m pe ra tu re ri se a bo v e a m bi en t i n ⁰C Number of operations Front Left Right 73 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management rise on the right side (red line) of the MCB is also plotted on the same graph fitted with a quartic to make the trend clear. The electrical resistance of the components in the current path is dependent on its geometry and the specific electrical resistance of the material used. The specific electrical resistance is temperature dependent [11]. Oxidation and area of contact of the moving contacts is a significant contributor to the overall measured resistance of the conduction path. Figure 4 Measured and calculated resistance of conduction path after multiple operations and temperature rise on right side Figure 5 shows the reference front (a), right (b) and left(c) thermal view of the MCB before any operations but after it was heated for 1 hour at rated current, 20A, as described in step 1. (a) (b) (c) Figure 5 Thermal images of the front (a), right (b) and left(c) of the MCB before any operations 74 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 6 shows the front (a), right (b) and left(c) thermal view of the MCB after 9000 operations and heated for 1 hour at rated current, 20A, as described in step 2 and step 3. (a) (b) (c) Figure 6 Thermal images of the front (a), right (b) and left(c) of the MCB after 9000 operations. 6. DISCUSSION OF THE RESULTS Due to the construction of the MCB used for this paper the front face shows a lower temperature rise above ambient as the sides with the right side being the highest. The higher temperature of the right is due to the electrical current that flows down the conduction path in the bottom frame which is in contact with the right shell of the MCB and thus releasing heat to the housing by conduction and convention. This heat on the sides of the MCB housing will result in conduction of the heat to adjacent MCBs when stacked in a distribution board (DB). This is also known as the proximity effect. The effect of this heat conduction due to the proximity effect is outside the scope for this paper but could be further investigated using infrared thermography due to the high emissivity of MCB casing and with an infrared window could take into consideration the internal temperatures. The change in surface temperature after multiple switching operations is due to the changing resistance of the conduction path as can be seen from Figure 4. The increase in voltage drop of figure 4 is according to Ohm’s law caused by the increase of the electrical resistance of mainly the switching contacts. The electrical resistance is determined by the contact area and is among other things influenced by the contact force, the moduli of elasticity and the Poisson’s ratio. During switching, the contact surface changes due to oxidation. [17]. There is an initial rise in temperature up to 1000 operations where it stabilizes. This can be due to the erosion of the silver plated contact tips and a thin layer of oxidation that is formed on the contact tips. The small deviations in temperature thereafter can be contributed to how the contacts make contact in the conduction path, the so called “a-spots” [17]. The drop in resistance over the course of the experiment may be due to wiping of the oxidation each time the contacts are opened and closed. Thereafter, increases are a result of the oxide build up on the contacts. The heat flow through the current path is affected by the additional or reduced thermal resistance and electrical resistance. The thermal resistance is assumed from the electrical resistance and is subject to the average thermal conductivity of the two different materials [9]. The surface temperature observed from the thermal imager is a result of heat convection externally around the MCB, metallic conduction and radiation from the emissive outer surface. The first law of thermodynamics states that the total energy of an isolated system is constant where the internal energy is equal to the amount of heat supplied to the system minus the amount of work done by the system on its surroundings. Energy can be transformed from one to another but cannot be created nor destroyed. Considering the experiment the first law of thermodynamics can be represented as:= + ℎ , ……………………………………………..(3) Where: Pin = I²Rcb = power dissipated in the total circuit breaker resistance, Rcb, by the current, I = 20A Tr = temperature of the right shell Cth = thermal capacitance of the circuit breaker and cables Rth = thermal resistance between the heat source in the breaker and the right shell t = time Hot air escaping 75 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Cth and Rth are the two critical components in the equation to accurately calculate the internal heat of the MCB. Figure 7 Temperature rise versus power dissipated Figure 7 shows the graph of ΔT versus I²R where the slopes of the fitted lines are the thermal resistances between the heat source in the MCB and outer shell on the right and front respectively. The almost linearity of the lines are dependent on the total resistance of the circuit breaker. This verifies that the surface temperature of both the front and right sides of the MCB correlate and there is a possibility that a prediction of the inside condition and hence the state of health of the MCB can be developed. The front face shows a flatter slope compared to the right shell suggesting a less accurate predictor of the overall MCB resistance. The calculated resistance of the MCB is much lower than the actual measured value as can be seen in Figure 4. This can be contributed to the fact that the exact transfer mechanism as well as the exact parameters of each part of the MCB was not known. The internal temperature of the MCB due to current flow and degradation of internal components is much higher than the measured surface temperature. With the internal temperature of the individual components known a more accurate resistivity of each of the components can be calculated. The true effect of contact contamination due to oxidation is also unknown. When examining the thermal images of Figure 5 it is seen that after heating the MCB at rated current for one hour without it performing a single operation, the most heat is concentrated around the hydraulic magnetic unit due to the coil in the sensing unit. This change as the unit is switched at rated current and the contacts start to degrade due to the erosion of the silver-plated contact tips, formation of oxidation and subsequent resistance increases as can be seen from Figure 4. With every switching operation under load an arc is formed. The high temperature of the arc will vaporize a percentage of the contact material which will lead to contact erosion. 7. CONCLUSION Different MCBs will have different thermal and electrical resistances due to manufacturing diversities that result in unique signatures. Thus, comparing the surface temperature rise above ambient after various operating cycles is not a true reflection of the temperature profile for all MCB types but can be used as a comparative reference. From the results it is seen that after 9000 operations at rated current and voltage degradation has not really set in due to the small change in temperature and measured resistance. International standard IEC 60947-2 does only make provision for temperature rise limits for specified parts at 1500 operations at rated current. More testing post 9000 operations is required in order to see the thermal properties after degradation has set in and how the heat generated due to the rise in resistance is conducted and confected to the housing. These results will assist to determine a field methodology to anticipate failure of MCBs. 76 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management When comparing the surface temperature rise above ambient of the 3 sides it is clear from Figure 3 that the number of operations does not influence the front as much as the conduction path on the right side/shell. This can be contributed to the airgap between the components and the front shell. The heat inside rises due to buoyancy and is dissipated through the gap between the housing and the on/off handle. Resulting in less heat transferred to the front face due to convection. This is illustrated in Figure 6(a). Due to changes in resistance the temperature rise above ambient is more for the moving contacts. The temperature difference as well as the slopes in Figure 7 between the front and right sides suggests that the front surface temperature only is not a true reflection of the internal temperature due to degradation. A certified thermographer will only have access to the front view of a MCB when inspecting distribution boards. The surface temperature rise above ambient from the results of the experiment obtained of the front, left and right sides, clearly show that the rise on the left side is on average 8.15⁰C and the right side 12.53⁰C higher than the front. This value may change as degradation sets in and the product near failure. Although the front face is an indicator as indicated on Figure 4, it is displaying a lower surface temperature due to the construction of the MCB and requires a correction factor to accurately determine the state of health of the MCB to avoid misinterpreting the state of health of the MCB. It is proposed that MCB infrared thermography comparative severity criteria be investigated/developed as regions of external interest both normal and abnormal to determine the internal condition of the MCB. 8. REFERENCES [1] N. Gorjian, L. Ma, M. Mittinty, P. Yarlagadda and Y. Sun, "A Review On Degradation Models In Reliability Analysis," in Proceedings of the 4th world congress on engineering asset management, Athens, Greece, 2009. [2] S. Bagavathiappan, B. Lahiri, T. Saravanan, T. Jayakumar and J. Philip, "Infrared Thermography for Condition Monitoring - A Review," Infrared physics & Technology, vol. 60, pp. 35-55, 2013. [3] V. Cohen, Application Guide for the Protection of L.V. Distribution Systems, Second ed., Johannesburg: Creda Communications, 1998. [4] R. L. Kegg, "On-line machine and process diagnostics," Annals of the CIRP, vol. 32, no. 2, pp. 469-573, 1984. [5] A. Hattangadi, Failure Prevention of Plant Machinery, New York: McGraw Hill Education Private Limited, 2004. [6] V. Cohen, "Overcurrent Protection in Low Voltage Electrical Circuits - The Perils of Adjustment," CBI, Johannesburg, 2012. [7] C. S. Kulkarni, J. R. Celaya, G. Biswas and K. Goebel, "Prognostics of Power Electronics, Methods and Validation Experiments," in 48th Systems Readiness Technology Conference, Anaheim, 2012. [8] W. C. Garber, "Resistance in Circuit Breakers," Industrial Electric Testing, Jacksonville, 2010. [9] P. U. Frei and H. O. Weichert, "Advanced Thermal Simulation of a Circuit Breaker," in 22nd International Conference on Electrical Contacts, Seattle, 2004. [10] M. Rasid, A. Ospina, K. El Kadri Benkara and V. Lanfranchi, "A Thermal Study on Small Synchronous Reluctance Machine in Automotive Cycle," in 25th IEEE International Symposium on Industrial Electronics, Santa Clara, 2016. [11] D. Siegel and M. Anheuser, "Significance of resistances of switching contacts for the temperature rise of LV circuit breakers," in 27th International Conference on Electrical Contacts, Dressden, Germany, 2014. [12] A. Prabu, "An Inverse Problem Approach to Modeling of Circuit Breaker Arc Voltage," in 2014 International Symposium on Fundamentals of Electrcal Engineering, Bucharest, Romania, 2015. [13] H. Stammberger, C. Rumpler and A. Zacharias, "Interaction in low-voltage switchgear - a staggered simulation approach," 2001. [14] D. Pop, L. Neamt, R. Tirnovan and D. Sabou, "3D Finite Element Analysis of a Miniature Circuit Breaker," in The 8th international symposium on advances topics in electrical engineering, Bucharest, Romania, 2013. 77 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [15] E. G. van Niekerk, "Fundamentals of low voltage circuit breaker operation," Private, Johannesburg, 2002. [16] E. Larsen, M. Valdes, G. H. Fox, K. Rempe and C. G. Walker, "IEEE3004.5 Recommended Practice for the Application of Low-Voltage Circuit Breakers in Industrial and Commercial Power Systems," 2016, 2016. [17] T. E. Browne, Circuit interruption:Theory and techniques, Taylor & Francis, 1984. 78 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management A Comparative Study of Partial Discharge Measurement using Electrical and Inductive Coupling Sensors IK Kyere1, TR Becker2, JJ Walker3 Vaal University of Technology, Vanderbijlpark, South Africa 1isaack@vut.ac.za, 2209034092@edu.vut.ac.za, 3jerrywalker@walmet.co.za ABSTRACT Since the acceptance of partial discharge measurements as a tool used for diagnostic condition assessment of high voltage insulation systems, various equipment and methodologies are employed to acquire and analyse the data obtained, be it in the factory or field environment. Some of the equipment and methods can only detect the presence of partial discharges and cannot be used to evaluate the true condition of the insulation by determining the apparent charge magnitude. Advancement in signal processing technology allowed manufacturers to design digital acquisition equipment with much better diagnostic applications. Consideration must therefore be given to the type of sensor used and its application in the measuring circuit. The aim of this study is to compare the results obtained by using two different sensing techniques on two different defects, namely, an artificial defect in the form of a void as well as corona discharge in point-to-plane configuration. A method published and recommended in IEC 60270 makes use of a coupling capacitor with associated accessories to determine the apparent charge (in coulomb/pico coulomb) which includes a measuring impedance and preamplifier to measure within a specific frequency spectrum. The results are then compared to a different decoupling method by means of high frequency current transformers, having different frequency spectrums and a preamplifier with the same frequency response as the coupling capacitor technique. Sensitivity assessments and merits of applying each different coupling techniques are presented and discussed. Keywords: Partial discharge, Electrical, Diagnostic, Inductive, Sensor. 1. INTRODUCTION A partial discharge (PD) is generally a highly localised or bound electrical discharge in an insulation medium between two conductors, and at times PD activity is the forerunner to an entire electrical breakdown. The occurrence of PD can lead to electrically-induced ageing of certain insulating materials, for example, by formation of corrosive gaseous by products, surface erosion, and tree development [1]. Internal discharges occur within the dielectric medium and can include various cavities of different sizes and geometries and the discharge occurs when the critical inception field of the void is reached, granted that a free electron is available. Surface discharge ensues remotely along the surface of the insulation, while a corona discharge occurs externally in non-uniform gaps and at sharp conductive edges [2]. The impact that partial discharge activity has on different insulation materials and the ability of the insulation to sustain this activity for prolonged period is directly influenced by the nature of the insulation material. Epoxy mica insulation used in rotating machines has the capability of withstanding the presence of partial discharge activity for a longer period compared to the very sensitive polymeric insulation systems used in cables [3]. Coupling sensors are fundamental components in the partial discharge measurement circuit. They are required to capture high frequency PD signals originating from the object under test. Field testing or routine tests in a factory 79 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management environment determines the choice and application of coupling sensors. IEC 60270 recommends standard test circuits and the use of different sensing techniques to capture the current impulses. The arrival of the current pulse at the terminals of the chosen sensor is attenuated due to a change in impedance and/or reflections within the object and the location of the defect. Therefore, the sensitivity and the sensors impact on the overall test circuit is crucial to reduce any signal distortion [4]. Various techniques have been used for partial discharge measurement with most of them focusing on wideband measurements with various kinds of sensors [5, 6, 7]. High Frequency Current Transformer (HFCT) partial discharge detection systems cannot verify apparent charge by calibration procedure, however, there is the need for sensitivity checks and their performance to ensure validity of non-conventional partial discharge measurement [8]. In this paper, the partial discharge (PD) measurements using two different sensing techniques are presented. Inductive sensing using two high frequency current transformers (HFCT) with different frequency responses and a capacitive coupling device with measuring impedance as published in IEC 60270, was used in the laboratory, to investigate the difference in sensitivity in the discharge pattern and PD inception voltage in defects. The partial discharge data was presented in the form of Φ-q-n pattern where Φ is the phase angle of partial discharge occurrence, q the PD magnitude and n is the PD pulse number. 2. THE MEASUREMENT SYSTEM Figure 1 shows the experimental setup of the measuring circuit; the coupling capacitor is in series with the measuring impedance and the HFCT’s are placed in the earth leg of the test object. During the first experiment the partial discharge phase-resolved measurement method has been applied for pattern acquisition, in accordance with IEC 60270 standard [9] consisting of 50 Hz a.c. power supply, the test sample (Ck), the Coupling Capacitor (Cc), and the measuring impedance (Zm). In the following experiments the outputs of the HFCT sensors was connected to the digital measuring PD detector. Variable Test Voltage HV filter Ca Test Object CT 1 HFCT CT 100 HFCT Ck Coupling Capacitor Zmi Measuring Impedance Aquisition Unit 50 Ohm, connecting cable **HFCT's, swopped out during each measurement Figure 1. Partial discharge measuring circuit 80 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 2(a) shows the defect (cavity) that was artificially created by drilling a hole of 1,5 mm diameter 1 mm deep into the XLPE insulation of a cable. In addition to that, as shown in Figure 2(b), corona is obtained by using a point- to-plane electrode configuration in air. The high voltage electrode is a steel needle and the earth electrode is the flat plane. The distance between them is 60 mm. The measurement circuit was calibrated according to the procedure described in IEC 60287 before voltage application. A 5 pC calibration pulse was applied across the coupling capacitor leg, as close as possible to the test object. 3. COUPLING SENSORS Choosing an efficient way to sense the pulses depends on a number of aspects ranging from sensitivity, type of test object and field environment. Consideration as to calibration efficacy must also be taken into account. Other methods of detecting include electromagnetic (antennas), acoustic and optical detection. Since sensors should be adapted to the test object and the test environment it is important to state that the detecting method has an impact on the signal transmission. 3.1. Capacitive coupling a) PD measurement using coupling capacitor and measuring impedance A coupling capacitor is a capacitive sensor, in combination with a measuring impedance and should be discharge free at the test voltage as well as providing a path that captures PD pulses. 3.2. Inductive coupling Using a ring type of ferrite core, the basic structure of HFCT consists of several turns of copper wire over the ring core. Ferrites being ferromagnetic Ceramics with very high resistivity and permeability are the most attractive Conductor Inner semiconductive layer XLPE Insulation Artificial void External stress control tape Figure 2. (a) Artificial void Earth Electrode HV Electrode (b) Point-to-plane configuration 81 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management materials for high frequency applications [11]. The sensitivity of HFCT’s is increased due to the ferrite material being used as core material [12] and can detect the PD signals in a wide frequency band [13]. b) PD measurement using CT1 in earth lead CT1 is a high frequency current transformer with a window of 15 mm, and for the experiments it was placed in the earth leg of the test object. It has a -3dB bandwidth of 0.5 - 80 MHz and a -6dB bandwidth of 0.3 - 100 MHz. c) PD measurement using CT100 in earth lead CT100 is a clamp–on high frequency current transformer with a window of 100 mm which is also placed in the earth leg of the test object. CT100 can therefore be applied without any disconnection of earth leads or interruption of the supply. It has a -3dB bandwidth of 2 - 25 MHz and a -6dB bandwidth of 1.2 - 40 MHz. 4. COMPARISON OF RESULTS AND EXPLANATION The performance of the sensors used in this paper are validated by the measurement using different partial discharge sources. PD activity in the test samples are detected to obtain representative patterns for each individual sensor. Initial tests were performed using a point plane PD source. Figures 3 – 5, including table 1 include the results obtained for PD inception voltage condition. A further increase beyond inception is depicted in figures 6 – 8 followed by the corresponding results in table 2. Figure 3 CC, gain setting 3.50 pC/DIV Figure 4 CT 1, gain setting 0.70 pC/DIV Figure 5 CT 100, gain setting 0.09 pC/DIV Table 1 Summary of results with different coupling sensors for a point plane discharge at inception voltage. Coupling capacitor CT 1 CT 100 Q Magnitude (pC) 138.6 0.84 0.21 Voltage (kV) 6.2 6.2 6.2 Total counts 2 549 22 208 59 1640 82 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 6 CC, gain setting 350 pC/DIV Figure 7 CT 1, gain setting 1.50 pC/DIV Figure 8 CT 100, gain setting 0.35 pC/DIV Table 2: Summary of results with different coupling sensors for a point plane discharge above inception (streamer discharge) Coupling capacitor CT 1 CT 100 Q Magnitude (pC) 1130 6.25 1.25 Voltage (kV) 9.4 9.4 9.4 Total counts 23 364 121 273 542 116 Below are the patterns obtained for an artificial void using different coupling sensors. Figure 9 indicates the pattern using the coupling capacitor followed by figure 10 – 11 using HFCT’s. Table 3: Summary of results with different coupling sensors for an artificial void defect at inception. Coupling capacitor CT 1 CT 100 Q Magnitude(pC) 13.23 0.23 0.22 Voltage (kV) 14.5 14.5 14.5 Total counts 6 312 57 3424 59 5258 Referring to Figures 3 – 5 and Table 1(Point to Plane), the PD pattern measured using coupling capacitor are greater in magnitude than the pulses measured with the HFCTs. The same can be explained for pulses measured with CT1, also have greater magnitude compared to the results with CT100. Although the signature of the pattern acquired for each sensor is indicative of a corona discharge on the high voltage electrode the pulse amplitude and height distribution are different for each sensor. The amplification factor was also adjusted for each measurement, the gain setting was adjusted to 3.50 pC/DIV for the coupling capacitor however the CT 100 amplification factor was set to 0.09 pC/DIV. A further increase in voltage beyond inception from 6.2 kV to 9.4 kV showed similar characteristics in gain settings and pulse amplitude – phase height distribution as with patterns acquired at inception. Figure 9. CC, gain setting 7pC/DIV Figure 10. CT 1, gain setting 0.35 pC/DIV Figure 11. CT 100, gain setting 0.09 pC/DIV 83 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Aside from point plane, an artificial void was used in subsequent tests as a PD source. Only the patterns and results at inception voltage were reported as illustrated in Figures 9 – 11 as well as Table 3. The issue of sensitivity can be highlighted by a visible pattern seen in figure 9 which does not exist in figures 10 – 11. From the pattern seen in figure 9 a diagnosis of the defect can be explained as a combination of a surface and void discharge which cannot be said for the remaining results as only the noise flow is visible. 5. CONCLUSION The measurement of partial discharge activities with two detection techniques has been undertaken. The phase – resolved partial discharge pattern in different cases obtained from HFCT measurement are compared with that from the conventional IEC method. From the measurement results obtained it’s clear that the performance of the sensors employed in this study can be distinguished. The coupling capacitor/measuring impedance sensor has a better sensitivity than HFCTs. However, CT1 with a smaller window and different bandwidth is more sensitive than CT100 with the larger window. The results show that the HFCT can detect the discharge pattern but from a diagnostic point of view analysis could not be performed. Comparing to the conventional method, the commercial HFCT cannot detect the partial discharge at the inception voltage due to its poor sensitivity. The HFCT’s do not detect some PD pulses with some amount of energy as shown in Figures 4, 5, 10 and 11. The differences in the measurement results can be attributed to the size of the air gap between the conductor and the CT core and the difference in frequency bandwidth of the current transformers. The capacitance of the transformer winding also play a significant role in the measurement with the high frequency current transformers. This can occur when the frequency bandwidth of the PD pulses is low [13]. The practical implications of the measurements show that it is imperative to determine the correct coupling method based on the application of the partial discharge measurements and finally how the results will be assessed. The sensitivity can be determined based on whether the application is for monitoring, diagnostic or on site testing applications and keeping in mind the nature of the test object. REFERENCES [1] Fruth,. B.A, Gross,. D.W., (1994) Phase resolving partial discharge pattern acquisition and frequency spectrum analysis. Proceedings of the 4th international conference on properties and application of dielectric materials. [2] Fruth,. B.A, Gross,. D.W., (1996) Partial discharge signal conditioning techniques for on line noise rejection and improvement of calibration. Conference record of IEEE international symposium on electrical insulation. Montreal, Quebec, Canada. [3] Gross, D., (2016) Acquisition and location of partial discharge – esp. in transformers. [4] Bergius, O. (2011) Implementation of On-Line Partial Discharge Measurements in Medium Voltage Cable network. Master thesis, Tampere University of Technology, pp.33-34. [5] Tian Y., Lewin P., Davies A. (2002) Comparison of On-line Partial Discharge Detection Methods for HV Cable Joints. IEEE Trans. Dielectr. Electr. Insul. ;9:604–615. doi: 10.1109/TDEI.2002.1024439. [6] Rodrigo A., Llovera P., Fuster V., Quijano A. (2011) Influence of High Frequency Current Transformers Bandwidth on Charge Evaluation in Partial Discharge Measurements. IEEE Trans. Dielectr. Electr. Insul. ;18:1798–1802. doi: 10.1109/TDEI.2011.6032852 [7] Rodrigo A., Llovera P., Fuster V., Quijano A. (2012) Study of Partial Discharge Charge Evaluation and the Associated Uncertainty by Means of High Frequency Current Transformers. IEEE Trans. Dielectr. Electr. Insul. ;19:434–442. doi: 10.1109/TDEI.2012.6180236. [8] Cho, S.I. (2011) On – Line PD (Partial Discharge) Monitoring of Power System Components. Msc Thesis. Aalto University. [9] IEC60270 (2000) Standard. High – Voltage Test Techniques – Partial Discharge Measurements. [10] Mallikarjunappa, K.; Ratra, M.C. (1990) Detection of partial discharges in power capacitors using high frequency current transformers,‖ Electrical Insulation and Dielectric Phenomena, 1990. Annual Report. Conference on, Page(s): 379 – 384. [11] Bhosale, N., Kothoke, P., Deshpande, A and Cheeran, A. N. (2013) Analysis of Partial Discharge Using Phase-Resolved (Φ-Q) and (Φ-N) Statistical Techniques. International Journal of Engineering Research & Technology (IJERT), Vol. 2 Issue 5, Pp 1960-1965. 84 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [12] Luo, G and Zhang, D. (2010) Study on Performance of HFCT and UHF Sensors in Partial Discharge Detection. In proceedings of IPEC 2010 conference, 27-29 Oct., Singapore, pp. 630-635. 85 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Review of Transmission Line Fault Location using Travelling Wave Method Gunawan Wijaya12, Suwarno1, A. Abu-Siada3 1 PT PLN, Indonesian Electricity Company 2 School of Electrical and Informatics, Institut Teknologi Bandung, Indonesia 3 Electrical and Computer Engineering Department, Curtin University, Australia ABSTRACT This paper presents a state of the art review for travelling wave-based fault locating techniques. The application of these techniques to estimate the fault location in various configurations of transmission lines is also presented. The presented configurations include series capacitor compensated transmission line, tapped transmission line, and hybrid transmission line. Both one-ended and two-ended synchronized methods are explained and compared. Keywords: Fault location, Transmission lines, Travelling-waves Corresponding author: Gunawan Wijaya (awan8019@yahoo.com) 1. INTRODUCTION Several studies on traveling-wave-based fault location in long transmission lines (TLs) can be found in the literatures [1]–[9]. Faults generate voltage signals that propagate through the TL and reflect back at the two ends of the line. By identifying these waves and measuring its arrival time toward the ends, the fault location can be identified. A B F dA l IA IB Figure 1. Fault on a TL fed from both ends l tA tB dA F A t tA1 tA2 tA3 tA4 dB Figure 2. Bewley lattice diagram for the TL in Figure 1 It was reported that this technique is independent on power swing, current transformer saturation, fault type, fault resistance, fault-inception angle and system parameters [10]. The application of the travelling waves can be extended to identify fault location within under-ground cables. Figs. 1 and 2 show schematics for a fault in a TL and the corresponding Bewley lattice diagram. Once the arrival time for the fault travelling waves is recorded, it can be used to identify the fault location. 86 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Data can be collected from one end of the line or at both ends. Fault location based on data collected from one end is categorized as one-ended algorithm, while using data from both ends is referred to two-ended algorithms. The later employs time-synchronization technique between both ends of the line. Although basic calculation of travelling wave-based fault location is similar for all fault types, a fault occurs on a series capacitor compensated line, hybrid lines (mix of overhead lines and underground cables) and tapped line differ in characteristics. For example, a series capacitor in compensated lines introduces non-linearity to the system. In the long TL and underground cables, capacitive current can be considerably large in comparison to fault current [11]– [13]. Other techniques based on impedance measurements were presented in the literature to overcome the limitation of impedance-based methods in identifying faults in a compensated line. Sadeh et. al. presented an impedance based algorithm to identify fault locations in a series capacitor-compensated TL [14]. Huang et. al. modelled series capacitor in the middle of a long TL and measurements were captured at one-end of TL [15]. Their wavelet analysis showed that the current travelling wave is more accurate than the voltage travelling wave in locating faults in such configuration. The main difficulty is on distinguishing the source of the reflected waves of the same polarity as it may be due to a reflection from the series capacitor or a remote bus. Spoor et. al. used the polarity of the arrival waves to distinguish reflected signals from the fault point and the opposing terminal [7]. Abedini et. al. combined the two methods reported in [15] and [7] with a fault locator installed in the middle of a series capacitor compensated line [8]. The fault location on tapped lines or on three terminal lines can be figured using Bewley lattice diagram as in Fig. 2. Analyzing measured data using wavelet was introduced by Magnago and Abur [16] that was applied for a tapped line as reported in [3]. Hamidi et. al. used wavelet to analyse the travelling wave signals on multi hybrid terminals [4], [5]. Though this algorithm is able to estimate fault location, its accuracy decreases as faults occurred close to the joint points and end-terminals. For this kind of TLs, identification of first and consecutive traveling wave is the major challenge of wavelet-based fault location. Wavelet has been widely used to process traveling waves when a fault occurs on a TL [2]–[8], [15], [16] and in TL protection [15], [17], [18]. Wavelet-based digital signal processing has been compared with Artificial Neural Networks (ANN) and Fourier Transformation (FT) [1]. Results show the higher accuracy of the wavelet in identifying the location of phase to ground faults when compared to ANN and FT. On another research, Hilbert-Huang Transform has been utilized [19]. However, this approach calls for additional Rogowski coil to measure the voltage at both terminals of the line and also in the middle of TL. 2. TRAVELLING WAVE THEORY When a fault occurs on TL, current will be drawn at the fault location from both terminals. Voltage waves occurred at the fault point travel toward the ends of the line. The fault will divide the TL into two sections at the location of fault as shown in Fig. 2. The wave arrival times to both ends of the line are tA1, tA2, tB1, tB2, as shown in the Bewley lattice diagram of Fig. 2. These waves propagate along the line until they find a discontinuity point such as joints, buses, or another fault point. Then these waves undergo reflections and refractions. This process continues until the wave energy is completely dissipated along the line [10]. The wave arrival time on an end-terminal depend on fault point and propagation velocity of the line medium. Although several wavelet-based algorithms are proposed in the literatures, the basic principle of fault location has the same major steps [2]–[4], [6]–[8], [16], [17]. The first step is transforming the voltage or current signal to modal mode. For this purpose, Clarke Transformation Matrix is used. The second step is applying Discrete Wavelet Transform (DWT) to the modal voltages or current. Squaring the wavelet transform coefficients (WTC2) will determine the maximum value of the wave due to the signal energy reaches its maximum value. Many researches utilize Daubechies family as the mother wavelet due to its simplicity that reduces the computation process and the fewer resource it requires when implemented [8]. The final step is to observe WTC2 to determine the fault location. This step varies to specific cases and will be explained below. 87 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 2.1. Modal Analysis Clarke’s Transformation Matrix is used to transform a signal into several independent modes of signal propagation. The matrix will transform a three-phase system into a three-mode signals known as mode 0, 1, and 2. For fault location purpose, these mode have different function. The zero-mode, depends on the frequency. At this mode, wave travels at slower velocity due to the line high attenuation. The other modes, modes 1 and 2, are known as independent aerial modes. At these modes, wave travels at a speed close to the speed of light [7]. All these modes can be calculated from signal measurement following a general equation. 𝑆𝑆𝑚𝑚𝑚𝑚𝑚𝑚𝑚𝑚 = 𝑇𝑇𝑆𝑆𝑝𝑝ℎ𝑎𝑎𝑎𝑎𝑚𝑚 ,…......………………………………. (1) where Smode is modal-transformed voltage or current signals, Sphase is the signal phase and T is Clarke’s Transformation Matrix for fully transposed line that is defined as [6], [7], [15], [16] 𝑇𝑇 = 1 3 � 1 1 12 −1 −10 √3 −√3� ,…...………………………….…… (2) 2.2. Discrete Wavelet Transform DWT analysis is widely used in power systems because of its capability to capture both frequency and time simultaneously. Wavelet analysis can improve the analysis of signals with localized-impulses and also oscillations. DWT for a signal f(k) with respect to a “mother” wavelet ψ(k) is given by [7], [16]: 𝐷𝐷𝐷𝐷𝑇𝑇(𝑚𝑚, 𝑛𝑛) = 1 �𝑎𝑎0 𝑚𝑚 ∑ 𝑓𝑓(𝑘𝑘)𝑘𝑘 𝜓𝜓 �𝑛𝑛−𝑘𝑘𝑎𝑎0𝑚𝑚𝑎𝑎0𝑚𝑚 � ,...………………………… (3) Where 𝑎𝑎0𝑚𝑚 is a scaling factor and 𝑘𝑘𝑎𝑎0𝑚𝑚 is a shifting factor, k is an integer variable and m is the level of decomposition. Wavelet analysis is dividing the signal into two-separate blocks. The signal will be shifted and scaled based-on a power of 2 as seen to Fig. 3. Unlike Continuous Wavelet Transform (CWT), DWT determines the coefficients on the two blocks. The higher scale applied, the lower frequency will be obtained. It means that scale factor has an inverse relationship to the frequency, and the resulting series of wavelet coefficients is referred to as the wavelet series decomposition [7]. h1(z) g0(z) h1(z2) g0(z2) h1(z4) g0(z4) h1(z8) g0(z8) f1(k) d1(k) (scale 1) f(k)=f0(k) d2(k) (scale 2) f2(k) d3(k) (scale 3) f3(k) d4(k) (scale 4) f4(k) Figure 3. Structure of the algorithm for signal wavelet at 4th level of decomposition [15] The pseudo-frequency (Fig. 4), corresponding to a particular level of decomposition or scale, can be determined from [7], [18] 𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑛𝑛𝑓𝑓𝑓𝑓 = 𝑇𝑇𝑎𝑎𝐹𝐹𝑐𝑐/𝑎𝑎0𝑚𝑚 ,……………………………………... (4) where 𝑇𝑇𝑎𝑎 is the sampling period of input signal, 𝐹𝐹𝑐𝑐 is the wavelet function, and 𝑎𝑎0𝑚𝑚 is scaling factor. 50 kHz Scale 20 0-25 kHz Scale 21 0-12.5 kHz Scale 22 0-6.25 kHz Scale 23 0-3.125 kHz Scale 24 25-50 kHz 12.5-25 kHz 6.25- 12.5 kHz 3.125- 6.25 kHz Low Pass Filter High Pass Filter Figure 4. Pseudo-frequency of 4th scale of wavelet transform [18] 88 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3. FAULT LOCATION AND ERROR CALCULATION Once a fault is detected and recorded, its fault point can be estimated. After calculating the fault location, the error in the location can be calculated based on [13]. The error is represented as a percentage of TL length. Assuming FLcalc, FLexact, and l respectively as calculated fault location, actual fault location and TL length, the error is then calculated as : 𝐸𝐸𝑓𝑓𝑓𝑓𝐸𝐸𝑓𝑓 (%) = (𝐹𝐹𝐹𝐹𝑐𝑐𝑎𝑎𝑙𝑙−𝐹𝐹𝐹𝐹𝑒𝑒𝑒𝑒𝑎𝑎𝑐𝑐𝑒𝑒) 𝑙𝑙 𝑥𝑥 100% ,………………………………. (5) 3.1. One-ended Measurements In this technique, the recording unit is installed at one or two terminals without mutual communication. Based on Fig. 2, when conventional approach is considered [2], [3], [6]–[8], [16], which uses propagation velocity (v), the fault location on the first section of the line from bus A (dA) is calculated by (6). If the fault occurs on the second section of the line then dA is calculated by (7). As seen from (6) and (7), speed of wave propagation, as calculated in (8), is required to calculate the fault location. 𝑑𝑑𝐴𝐴 = 𝑣𝑣.(𝑡𝑡𝐴𝐴2−𝑡𝑡𝐴𝐴1)2 ,………………………………………… (6) 𝑑𝑑𝐴𝐴 = 𝑙𝑙 − 𝑣𝑣.(𝑡𝑡𝐴𝐴2−𝑡𝑡𝐴𝐴1)2 ,..…………………………………… (7) 𝑣𝑣 = � 1 𝐹𝐹′𝐶𝐶′ ,………………………………………………. (8) where L' and C' are respectively the line inductance and capacitance per unit length. Another fault locating technique is presented in [2] without using of the velocity of wave propagation. This technique uses three consecutive traveling waves appearance instead of two traveling waves as per the equations below. Fault location can be calculated at first section of the line using (10) if (9) is valid. Otherwise, (11) is used to calculate fault location. It is shown on [2] that traveling wave at tREF has higher magnitude compared to the reflected one from the fault point. Fault location within the second section of the line can be calculated using (12) if (9) is valid. Otherwise, (13) is used. 𝑡𝑡𝐴𝐴2 − 𝑡𝑡𝐴𝐴1 = 𝑡𝑡𝐴𝐴3 − 𝑡𝑡𝐴𝐴2 ,……..…………………………… (9) For fault occurred within the first section : 𝑑𝑑𝐴𝐴 = (𝑡𝑡𝐴𝐴2−𝑡𝑡𝐴𝐴1).𝑙𝑙(𝑡𝑡𝐴𝐴2−2𝑡𝑡𝐴𝐴1+𝑡𝑡𝑅𝑅𝑅𝑅𝑅𝑅) ,…………………...……………… (10) 𝑑𝑑𝐴𝐴 = (𝑡𝑡𝐴𝐴2−𝑡𝑡𝐴𝐴1).𝑙𝑙(𝑡𝑡𝐴𝐴2−2𝑡𝑡𝐴𝐴1+𝑡𝑡𝐴𝐴3) ,…………………...………………. (11) For fault occurred within the second section : 𝑑𝑑𝐴𝐴 = (𝑡𝑡𝑅𝑅𝑅𝑅𝑅𝑅−𝑡𝑡𝐴𝐴1).𝑙𝑙(𝑡𝑡𝐴𝐴2−2𝑡𝑡𝐴𝐴1+𝑡𝑡𝑅𝑅𝑅𝑅𝑅𝑅) ,…………………...……………… (12) 𝑑𝑑𝐴𝐴 = (𝑡𝑡𝐴𝐴3−𝑡𝑡𝐴𝐴1).𝑙𝑙(𝑡𝑡𝐴𝐴2−2𝑡𝑡𝐴𝐴1+𝑡𝑡𝐴𝐴3) ,…………………...……………… (13) 3.2. Two-ended Measurements In this technique, a recording unit should be installed at both ends and should be synchronized through global positioning system (GPS). In this method, fault signals are captured and recorded at both terminals by the two recording units using the same time-synchronized reference. In synchronized measurements, the delay time td between the fault detection time at both ends can be determined [16] and the distance to the fault point measured 89 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management from bus A can be calculated by: 𝑑𝑑𝐴𝐴 = 𝑙𝑙−(𝑣𝑣𝑚𝑚.𝑡𝑡𝑑𝑑)2 ,…………………...…………………. (14) where 𝑡𝑡𝑚𝑚 = 𝑡𝑡𝐵𝐵1 − 𝑡𝑡𝐴𝐴1, and 𝑣𝑣𝑚𝑚 is velocity of traveling wave for mode m. Another method, reported in [2], utilizes the first signal received from the opposite terminal without using the signal velocity in calculating the fault location. Compared to one-ended algorithm, information from both terminals can calculate the fault location using the local measurement as primary data as in (15). Equations (16) and (17) could be used to calculate the fault location dA, while (18) and (19) for the other section measured from the opposite terminal (1-dA) or dB. 𝑡𝑡𝐴𝐴1 < 𝑡𝑡𝐵𝐵1 , 𝑓𝑓𝑎𝑎𝑓𝑓𝑙𝑙𝑡𝑡 𝐸𝐸𝑛𝑛 1𝑎𝑎𝑡𝑡𝑠𝑠𝑓𝑓𝑓𝑓𝑡𝑡𝑠𝑠𝐸𝐸𝑛𝑛 𝑡𝑡𝐴𝐴1 = 𝑡𝑡𝐵𝐵1 , 𝑓𝑓𝑎𝑎𝑓𝑓𝑙𝑙𝑡𝑡 𝐸𝐸𝑛𝑛 𝑓𝑓𝑓𝑓𝑛𝑛𝑡𝑡𝑓𝑓𝑓𝑓 ,…………………...……..… (15) 𝑡𝑡𝐴𝐴1 > 𝑡𝑡𝐵𝐵1 , 𝑓𝑓𝑎𝑎𝑓𝑓𝑙𝑙𝑡𝑡 𝐸𝐸𝑛𝑛 2𝑛𝑛𝑚𝑚𝑠𝑠𝑓𝑓𝑓𝑓𝑡𝑡𝑠𝑠𝐸𝐸𝑛𝑛 Fault location dA for fault occurred at first section: 𝑑𝑑𝐴𝐴 = (𝑡𝑡𝐴𝐴2−𝑡𝑡𝐴𝐴1).𝑙𝑙2(𝑡𝑡𝐴𝐴2−2𝑡𝑡𝐴𝐴1+𝑡𝑡𝐵𝐵1) ,…………………...…………..…… (16) For fault occurred at second section: 𝑑𝑑𝐴𝐴 = (𝑡𝑡𝐵𝐵1+𝑡𝑡𝐵𝐵2−2𝑡𝑡𝐴𝐴1).𝑙𝑙2(𝑡𝑡𝐵𝐵2−𝑡𝑡𝐴𝐴1) ,…………………...…………....….. (17) Fault location (1-dA) or dB for fault occurred at first section: 𝑑𝑑𝐵𝐵 = (𝑡𝑡𝐴𝐴2−3𝑡𝑡𝐴𝐴1+2𝑡𝑡𝐵𝐵1).𝑙𝑙2(𝑡𝑡𝐴𝐴2−2𝑡𝑡𝐴𝐴1+𝑡𝑡𝐵𝐵1) ,…………………...…………..….. (18) Fault location (1-dA) or dB for fault occurred at second section: 𝑑𝑑𝐵𝐵 = (𝑡𝑡𝐵𝐵2−𝑡𝑡𝐵𝐵1).𝑙𝑙2(𝑡𝑡𝐵𝐵2−𝑡𝑡𝐴𝐴1) ,………………………...…………....….. (19) All these equations can be applied for fault location calculation on the opposite terminal. 3.3. Series compensated line Series capacitors compensation on a TL affects the line impedance. Therefore, fault location using impedance-based calculation needs an information about the degree of series compensation and location of the capacitor [11], [14], [20]. By using traveling wave based methods, fault location on compensated TL is not affected by the presence of the series capacitors since the impedance modification at high frequencies due to the series capacitor will be negligible [16]. Fault location series capacitor compensated line can be calculated using one or two of the above equations of (6) – (19). 3.4. Hybrid line Fault location that uses wave propagation velocity on hybrid type line (Bewley lattice diagram of such line is shown in Fig. 5) is more prone to large errors introduced by changing parameter over its age such as relative permittivity [9]. Therefore, in order to use fault location technique accurately, it is suggested the analysis should not take into account the wave propagation velocity [5], [9]. The use of modal transformation analysis on hybrid type TLs mitigates the effect of cross-bonding. Therefore, travelling wave-based algorithms is preferred due to its less sensitivity to fault type, non-linear fault impedance, and transmission loading conditions [4]. Fault location on this line can be calculated using one or two of equations of (14) – (19). 90 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management B tA tB d F t tA2 1-d tA1 tA3 tB2tB1 tB3 Figure 5. Bewley lattice diagram of a fault on hybrid line 3.5. Tapped line On a tapped line, fault detection has to be done before fault location calculation [3]–[6]. Obtaining high accuracy is done by synchronizing measurement from all terminal. The measurement or protection device at joint point will increase the error of fault location. B tA tB m F t tA C tC 1-m tB tC Figure 6. Bewley lattice diagram of a fault on tapped line Fig. 6 shows the Bewley diagram for a tapped line where first-wave arrival time at all terminals are defined as tA, tB, and tC. Travelling times for segments AJ, BJ, and CJ can be defined as τAJ, τBJ, and τCJ. The relationships between these variables are defined by the following equations [4]: 𝑡𝑡𝐴𝐴 = 𝑚𝑚. τAJ + 𝑡𝑡𝐹𝐹 ,…………………..................... (20) 𝑡𝑡𝐵𝐵 = (1 −𝑚𝑚). τAJ + τBJ + 𝑡𝑡𝐹𝐹 ,…………………..................... (21) 𝑡𝑡𝐶𝐶 = (1 −𝑚𝑚). τAJ + τCJ + 𝑡𝑡𝐹𝐹 ,…………………..................... (22) where 𝑡𝑡𝐹𝐹 is the time when fault occurred and m is the ratio of fault location to faulty segment length (dA/lAJ). Subtracting arrival times one another will eliminate 𝑡𝑡𝐹𝐹. For fault on segment AJ, eliminating equations (20) – (22) each other will result in first-wave arrival time differences at three terminals [4] 𝑡𝑡𝐴𝐴𝐵𝐵 = |𝑡𝑡𝐵𝐵 − 𝑡𝑡𝐴𝐴| = |(1 − 2𝑚𝑚). τAJ + τBJ| ,………………….............. (23) 𝑡𝑡𝐴𝐴𝐶𝐶 = |𝑡𝑡𝐶𝐶 − 𝑡𝑡𝐴𝐴| = |(1 − 2𝑚𝑚). τAJ + τCJ| ,………………….............. (24) 𝑡𝑡𝐵𝐵𝐶𝐶 = |𝑡𝑡𝐶𝐶 − 𝑡𝑡𝐵𝐵| = |τCJ − τBJ| ,………………….............. (25) Similarly, for fault occurred on segment BJ and CJ, (25) can be represented as [4] 𝑡𝑡𝐴𝐴𝐶𝐶 = |𝑡𝑡𝐶𝐶 − 𝑡𝑡𝐴𝐴| = |τCJ − τAJ| ,…………………....................... (26) 𝑡𝑡𝐴𝐴𝐵𝐵 = |𝑡𝑡𝐵𝐵 − 𝑡𝑡𝐴𝐴| = |τBJ − τAJ| ,…………………....................... (27) Equations (25) – (27) will result in a unique difference for each segment if it is subtracted to propagation time between two terminals. This unique value can be used to recognize the faulty segment [4]. ∆𝑖𝑖𝑖𝑖 = �𝑡𝑡𝑖𝑖𝑖𝑖 − τij�, ∀ 𝑠𝑠∗𝑗𝑗∗: ∆𝑖𝑖∗𝑖𝑖∗ = 0 ,…………………………. (28) 91 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management where ∆𝑖𝑖𝑖𝑖’s are differences between measured first-wave arrival times 𝑡𝑡𝑖𝑖𝑖𝑖 and pre-calculated wave travelling time along two segments τ𝑖𝑖𝑖𝑖. The zero or approaching zero value of ∆𝑖𝑖𝑖𝑖 identify the faulty segment. The approaching zero value could be happen due to low sampling rate of measurement [4]. Having calculating it, a statistical mean and standard deviation can be used to find the outlier value which represent the faulty segment. The general form of faulty segment is defined as [4] 𝑂𝑂∆ = {𝑥𝑥 | 𝑥𝑥 ∈ ∆𝑖𝑖𝑖𝑖 ′𝑠𝑠, 𝑥𝑥 ∉ [µ∆ − σ∆ , µ∆ − σ∆]} ,………………………. (29) where 𝑂𝑂∆ represents the outlier showing the faulty segment, while µ∆ and σ∆ respectively are population mean and standard deviation of ∆𝑖𝑖𝑖𝑖 ′𝑠𝑠. Fault location from corresponding terminal is done by using one of the following equations [4]. 𝑂𝑂∆ = ∆𝐵𝐵𝐶𝐶 , 𝑥𝑥 = �12 − �𝑡𝑡𝐴𝐴𝐵𝐵−τBJ�2τAJ � 𝐿𝐿𝐴𝐴𝐴𝐴 , 𝑠𝑠𝑓𝑓 𝐴𝐴𝐴𝐴 𝑠𝑠𝑠𝑠 𝑓𝑓𝑎𝑎𝑓𝑓𝑙𝑙𝑡𝑡𝑓𝑓 ,…………………... (30) 𝑂𝑂∆ = ∆𝐴𝐴𝐶𝐶 , 𝑥𝑥 = �12 − �𝑡𝑡𝐴𝐴𝐵𝐵−τAJ�2τBJ � 𝐿𝐿𝐵𝐵𝐴𝐴, 𝑠𝑠𝑓𝑓 𝐵𝐵𝐴𝐴 𝑠𝑠𝑠𝑠 𝑓𝑓𝑎𝑎𝑓𝑓𝑙𝑙𝑡𝑡𝑓𝑓 ,…………………... (31) 𝑂𝑂∆ = ∆𝐴𝐴𝐵𝐵 , 𝑥𝑥 = �12 − �𝑡𝑡𝐴𝐴𝐴𝐴−τAJ�2τCJ � 𝐿𝐿𝐶𝐶𝐴𝐴, 𝑠𝑠𝑓𝑓 𝐶𝐶𝐴𝐴 𝑠𝑠𝑠𝑠 𝑓𝑓𝑎𝑎𝑓𝑓𝑙𝑙𝑡𝑡𝑓𝑓 ,…………………... (32) Where 𝐿𝐿𝐴𝐴𝐴𝐴 , 𝐿𝐿𝐵𝐵𝐴𝐴 , and 𝐿𝐿𝐶𝐶𝐴𝐴 are length of segment AJ, BJ, and CJ, respectively. Summary of travelling wave-based fault locating technique is provided in Table I. 4. CONCLUSION This paper presented a literature review for fault locating techniques based on travelling waves. The technique can be implemented as one-ended or two-ended measurements. Travelling wave techniques are insusceptible to the fault type. Traveling wave fault location is not affected by the line parameters variation such as line impedance, cable permittivity, as well as line configuration modification. On the other hand, traveling wave-based fault location techniques have to be used in pair with wide range band of frequency measurement, including high sampling frequency measurement unit, optical cable for measurement unit, and also higher processing recording unit. Absence of one of these requirements may lead to less accuracy of such techniques. The technique is relatively expensive and of low accuracy for faults close to the terminals of the line. Table 1 : Travelling wave based algorithms characteristic Characteristics on several TLs One-ended measurement Two-ended measurement (sync'd) Velocity based Time based Velocity based Time based Complexity1 Simple low fair low fair CC low fair low fair Hybrid fair high low high Tapped fair high low high Key features2 Simple Fast calculation Fair accuracy and/or calculation Fast calculation High accuracy CC Fast calculation Fair accuracy and/or calculation Fast calculation High accuracy Hybrid Fast calculation and/or difficult FSD Fair calculation and/or hard FSD Fast calculation Fair accuracy Tapped Fast calculation Fair calculation Fast calculation High accuracy and/or slow calculation 92 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Limitation Simple • Higher error on faults close to terminals • Higher error on short transmission CC • Higher error on faults close to terminals • Higher error on short transmission Hybrid • Higher error on faults close to terminals and joints • Higher error on short transmission • Difficulties on distinguishing reflected wave from fault point and joint • Difficulties on fault close to terminals and joints Tapped • Higher error on faults close to terminals and joints • Higher error on short transmission • Difficulties on distinguishing reflected wave from fault point and joint • Difficulties on fault close to terminals and joints Improvement • Higher frequency sampling to accuracy • Higher processing unit (conventional substation) • Wider frequency-band measurement sensor (optical sensor) • Lower measurement-cable impedance (optical cable) Cost3 Simple low - medium low - medium medium medium CC low - medium low - medium medium medium Hybrid low - medium low - medium medium medium Tapped low - high low - high high high These characteristics are qualitatively measured compared to wavelet-based algorithm on simple TLs. 1Based on its filter processing, Fault Section Detection (FSD), FL formula 2Based on FL calculation processing, accuracy, and FSD 3Estimation costs on new installation compared to simple TLs one-ended algorithm and limited data from [21], including sensors, GPS, communication module, number of Input/Output (I/O), and Central Processing Unit (CPU). 5. REFERENCES [1] A. Abdollahi and S. Seyedtabaii, “Transmission line fault location estimation by Fourier & wavelet transforms using ANN,” Power Eng. …, no. June, pp. 23–24, 2010. [2] O. Altay, E. Gursoy, and O. Kalenderli, “Single end travelling wave fault location on transmission systems using wavelet analysis,” in 2014 ICHVE International Conference on High Voltage Engineering and Application, 2014, pp. 1–4. [3] C. Y. Evrenosog̃lu and A. Abur, “Travelling wave based fault location for teed circuits,” IEEE Trans. Power Deliv., vol. 20, no. 2 I, pp. 1115–1121, 2005. [4] R. J. Hamidi and H. Livani, “A travelling wave-based fault location method for hybrid three-terminal circuits,” IEEE Power Energy Soc. Gen. Meet., vol. 2015–Septe, pp. 1–5, 2015. [5] R. J. Hamidi and H. Livani, “Traveling-Wave-Based Fault-Location Algorithm for Hybrid Multiterminal Circuits,” IEEE Trans. Power Deliv., vol. 32, no. 1, pp. 135–144, 2017. [6] K. Andanapalli and B. R. K. Varma, “Travelling wave based fault location for teed circuits using unsynchronised measurements,” 2013 Int. Conf. Power, Energy Control, pp. 227–232, 2013. [7] D. Spoor and J. G. Zhu, “Improved single-ended traveling-wave fault- location algorithm based on experience with conventional substation transducers,” IEEE Trans. Power Deliv., vol. 21, no. 3, pp. 1714–1720, 2006. [8] M. Abedini, A. Hasani, A. H. Hajbabaie, and V. Khaligh, “A new traveling wave fault location algorithm in series compensated transmission line,” 2013 21st Iran. Conf. Electr. Eng., pp. 1–6, 2013. [9] O. Altay, E. Gursoy, A. Font, and O. Kalenderli, “Travelling Wave Fault Location on Hybrid Power Lines,” pp. 1–4, 2016. [10] M. Saha, J. Izykowski, and E. Rosolowski, Fault Location on Power Networks. Springer, 2010. [11] J. Sadeh et al., “Accurate Fault Location Algorithm for Series Compensated Transmission Lines,” vol. 0, no. c, pp. 46–53, 2013. [12] S. Kapuduwage, “Fault Location on the High Voltage Series Compensated Power Transmission Networks,” 2006. [13] P. System, R. Committee, I. Power, and E. Society, IEEE Guide for Determining Fault Location on AC Transmission and Distribution Lines IEEE Power and Energy Society, vol. 2014. 2014. [14] J. Sadeh, N. Hadjsaid, A. M. Ranjbar, and R. Feuillet, “Line Parameter-Free Fault Location Algorithm for Series 93 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Compensated Transmission Lines,” IEEE Trans. Power Deliv., vol. 15, no. 3, 2000. [15] Z. Huang, Y. Chen, and Q. Gong, “A protection and fault location scheme for EHV line with series capacitor based on travelling waves and wavelet analysis,” PowerCon 2002 - 2002 Int. Conf. Power Syst. Technol. Proc., vol. 1, pp. 290–294, 2002. [16] F. H. H. Magnago and A. Abur, “Fault location Using Wavelets,” IEEE Trans. Power Deliv., vol. 13, no. 4, pp. 1475–1480, 1998. [17] B. Bhalja, R. P. Maheshwari, E. Engineering, and I. I. T. Roorkee, “Wavelet Transform Based Differential Protection Scheme for Tapped Transmission Line Crossleyiand,” 2006 IEEE Int. Conf. Ind. Technol., pp. 1004–1008, 2006. [18] W. Chen, O. P. Malik, X. Yin, D. Chen, and Z. Zhang, “Study of Wavelet-Based Ultra-High-Speed Directional Transmission Line Protection,” IEEE Power Eng. Rev., vol. 22, no. 11, p. 54, 2002. [19] D. Wang, H. L. Gao, S. B. Luo, and G. B. Zou, “Travelling wave fault location principle based on Rogowski coil ’ s differential output and Hilbert-Huang transform,” pp. 1–6, 2016. [20] S. Kapuduwage and M. Al-Dabbagh, “Development of Efficient Algorithm for Fault Location on Series Compensated Parallel Transmission Lines,” TENCON 2005 - 2005 IEEE Reg. 10 Conf., pp. 1–5, 2005. [21] PLN, Fault Recorder Contract. 2014. 94 31st Conference on Condition Monitoring COMADEM 2018, North-West University, and Diagnostic Engineering Management South Africa. Off Line Partial Discharge as an essential Condition Based Monitoring (CBM) Tool for busbar insulation monitoring on Air Insulated Switchgear (AIS) J.V. Bissett1, P.A. van Vuuren2 1Erongo Regional Electricity Distributor, Namibia 2 School of Electrical, Electronic and Computer Engineering, North-West University, South Africa ABSTRACT This paper investigates the effectiveness of current test methods and procedures associated with insulation testing on the busbar of Air Insulated Switchgear (AIS) during scheduled planned maintenance. An improved test method proposes combining traditional AC voltage withstand test with Partial Discharge (PD) measurement. Three PD sensors were considered, namely Airborne Acoustic (AA), Transient Earth Voltage (TEV) and High Frequency Current Transformer (HFCT). The results confirm that HFCT is most sensitive and revealed significant difference at higher voltages between the six AIS samples. The TEV results produced consistent readings with smaller differences at higher voltages. The Ultra-sound instrument was found to be extremely sensitive but lack the capability to produce reliable measurement readings. Peak TEV measurements indicate a noticeable difference between older and newer samples. The paper is supported by a practical case study. Keywords: Air Insulated Switchgear (AIS), Remaining Useful Life (RUL), Condition Based Monitoring (CBM), Partial Discharge (PD), On Line Partial Discharge (OLPD) Corresponding author: Vermaas Bissett (vbissett@erongored.com.na) 1. INTRODUCTION The field of Partial Discharge (PD) is enormous and extensive research during the last few decades produced excellent results and value to the industry. The methods and techniques of partial discharge assessment using non-intrusive test equipment appears to be more comprehensive in HV and EHV switchgear, with fewer studies on MV switchgear [1]. In Electricity Distribution Networks, AIS forms a critical node in the electricity network to distribute energy from transmission levels to the end customers. The electricity utility must operate and maintain the AIS in the most effective and efficient manner to ensure compliance to statutory requirements. Whilst many of the AIS are approaching the end of lifetime phase, most utilities are not in a position to replace the obsolete AIS substations due to financial and technical constraints. Failure of AIS have enormous impact on business integrity and should be avoided by the utility at all cost. AIS consists of various components that are assembled in a compartmentalized manner and includes the circuit breaker-, busbar-, current transformer- and cable compartments. The most under-maintained component in electrical distribution systems is the main busbar insulation of medium voltage switchgear [2]. Busbar failure is catastrophic and results in switchgear failure and unavailability of the substation. The busbar is not easily accessible and requires a power outage for inspection and maintenance. During preventative maintenance, a very elementary Alternating Current (AC) voltage withstand test is recommended by Original Equipment Manufacturers (OEM) in accordance with the International Electrotechnical Commission (IEC62271-200) [3][4]. The final result of this prescribed test reveals either a pass or fail outcome and does not necessarily provide additional information on the condition of the insulation. 95 31st Conference on Condition Monitoring COMADEM 2018, North-West University, and Diagnostic Engineering Management South Africa. PD phenomena has many macroscopic-physical effects as illustrated in Figure 1. The measurement of PD in electrical apparatus can be divided into the conventional (IEC60270) [5] and unconventional (IEC62478) [6] measurement methods. This paper will only consider unconventional measurement by electromagnetic and acoustic methods. Unconventional methods are the preferred type for on-site assessment because switchgear can remain in service while conducting the assessment with portable hand held devices. The investment cost is relatively low and most utilities can easily accommodate this budget requirement. Figure 1: Macroscopic-physical effects of PD with detection methods The measurement of PD is a suitable means of detecting certain defects in the equipment under test and is a useful complement to dielectric tests. Experience shows that PD may lead in particular arrangements to a progressive degradation in the dielectric strength of the insulation, especially of solid insulation. It is not yet possible to establish a reliable relationship between the results of PD measurement and the life expectancy of the equipment owing to the complexity of the insulation systems used in metal enclosed switchgear [7]. This paper proposes a new test method by obtaining PD measurements during over voltage withstand test across a range of voltages up to the maximum rating of the equipment. The test can be used for on-site testing before and after major maintenance has been completed, but can also be considered for Factory- and Site Acceptance Tests. The method provides more insight into leakage current discharge when performing normal AC voltage withstand testing. The results are used for further interpretation of insulation condition and RUL. AIS suffer from several insulation failure processes, namely electrical tracking, inadequate clearance between phases and earth potential, as well as electrical treeing on voltage- and current transformers, all of which produce undesirable PD. Poor workmanship, inferior insulation material and sub-standard manufacturing processes, as well as insulation degradation due to normal aging, all contributes to this phenomena. PD assessments is currently one of the best methods that are used for asset management on AIS. Both On Line and Off Line PD monitoring systems have been installed in the electricity industry with great success [8]. Considerable work was done to determine the most appropriate sensors applicable to electrical assets, as well as PD test techniques and the applications thereof [9]. No literature could be found that made reference to the proposed new test method. 96 31st Conference on Condition Monitoring COMADEM 2018, North-West University, and Diagnostic Engineering Management South Africa. Research has been done to simulate accelerated aging of switchgear by changing the environmental conditions [10][11]. Other research focused on the extension of the lifetime of the switchgear by applying certain maintenance techniques to eliminate discharge in MV AIS [12]. Degradation analysis of epoxy resin surfaces exposed to partial discharge was also popular research and some new work was recently completed by Tokyo City University [13]. Good research was also done on deterioration of insulating materials by internal discharge where life versus voltage characteristic were measured [14]. Asset management remains a core responsibility of electricity utilities. Risk and health indices are highlighted as important components of effective condition assessment of MV switchgear [15][16]. The utilisation of Off Line Partial Discharge measurements on AIS as a CBM tool is furthermore recommended as an essential tool for distribution utilities. Monitoring of the busbar and other compartments with various PD sensors are presented. Detailed results provide insight into the condition of insulation with opportunities to determine RUL and to extend the lifetime of the equipment. No research could however be found to quantify Remaining Useful Life of AIS, either by extrapolating information or trending data, or life expectancy formula or health indices, or by interpretation of PD results. 2. QUESTION Is the power frequency withstand voltage test adequate for insulation testing and what additional information on the condition of the insulation could be obtained by taking a variety of PD measurements during the over voltage withstand test across a range of voltages up to the maximum rating of the equipment? 3. CURRENT INSULATION TEST METHODS AND PROCEDURES The current test method recommended by IEC (International Electrotechnical Commission) for metalclad switchgear prescribes applying a 50Hz power frequency test voltage to prove integrity of the insulation. The test is typically done during Factory Acceptance Test (FAT) and repeated on site (Site Acceptance Test) at a lower test voltage [17]. Milli Ampere reading and in some cases also Ohm readings are displayed on the test equipment. The interpretation of the test results could be debated as high milli Ampere reading does not necessarily indicate a breakdown in the insulation. Likewise, high Ohm readings cannot reliably indicate good insulation. Traditionally, 2.5 mA per panel was considered acceptable and used as a bench mark [18]. More recently, conventional PD testing during FAT has been conducted provided that the manufacturer’s facility can accommodate this test. Maximum permissible PD quantity is the apparent charge that is usually expressed in picocoulombs (pC) [4]. The standard requires that at the correctly rated test voltage permissible discharge quantity shall be less than 100pC. This is a conventional test method with directly connected circuit to the test object. Due to many constraints this test cannot easily be repeated on-site. After completion of on-site maintenance the switchgear will be tested at a reduced test voltage, typically 75% - 80% of the original test voltage, to safeguard equipment against unwanted stresses caused by over voltage. The equipment is considered to have passed the test if the test voltage is maintained for 1 minute without flashover. This test can be seen as formality and will seldom fail the voltage withstand test. The 97 31st Conference on Condition Monitoring COMADEM 2018, North-West University, and Diagnostic Engineering Management South Africa. main purpose of this test as part of normal maintenance practices is to prove that it is safe to energise the switchgear after completion of all work and that no major defects are present on the switchgear. The shortcomings of the existing test methods and general industry practice could be summarized as follows: (a) Limited guidelines on the interpretation of ohm (Mῼ) and leakage current (mA) readings. (c) The voltage withstand test lacks information on the condition of the insulation. (d) Inconsistency on test voltage compensation based on environmental conditions (both temperature and relative humidity) to avoid discrepancies when comparing results of previous tests. (e) The voltage withstand test provides limited information on how effective the maintenance activities were executed. 4. IDENTIFICATION OF ALL AIS ELEMENTS TO BE MONITORED WITH EMPHASIS ON INSULATION Equipment breakdown on the primary parts or components of AIS is depicted in Figure 2. Advice and recommendation from OEM was considered to identify all the critical areas of inspection. Location of PD sensors and positioning thereof is absolutely critical for the successful recording of PD pulses [9]. As metal enclosed switchgear is compartmentalized it is important to understand the coverage of the sensors on the entire board. Figure 2: Typical component breakdown of AIS with compartments 5. PD APPLICATION ON AIS AND ASSET MANAGEMENT Already from the early years of PD study, researchers discovered the potential of PD assessment on insulation material to determine the health condition of the apparatus. In recent years more emphasis was on relating the assessment with health indices as well as condition risk assessment [19]. The outcome of assessment values could be used as input drivers for assessment management. There is already On Line 98 31st Conference on Condition Monitoring COMADEM 2018, North-West University, and Diagnostic Engineering Management South Africa. monitoring systems available that provide continuous figures on health statuses of the entire plant in real time. During Factory Acceptance Test (FAT) it has lately become customary for clients to specify PD measurement on newly manufactured switchgear in accordance with IEC standards [4]. It is typically a single measurement value that should be below 100pC. Although FAT is assumed to represent the actual and final site configuration, it should be noted that the main busbar assembly has not been completed and this PD value is only reflecting on the combination of all the individual bays including the circuit breakers. Repeatability of the same FAT test for PD measurement on site after final commissioning and during major maintenance are therefore not possible, unless it is specifically prescribed by the engineer. Three PD sensor technologies are currently available for AIS assessment and will furthermore be evaluated in this paper, namely Airborne Acoustic (AA), Transient Earth Voltage (TEV) and High Frequency Current Transformer (HFCT). 6. PD ASSESSMENT METHODS CURRENTLY DEPLOYED WITIN THE INDUSTRY Table 1 provides comparison criteria between OLPD and Off Line PD solution for in-service AIS. PD Maintenance Testing or Site Acceptance Test (SAT) is introduced as a new proposed method for consideration. Table 1: Comparison of PD measurement methods There is currently not an acceptable standard for interpretation of PD levels for unconventional measurements. Manufacturers and industry users have over the years developed own benchmark levels as guidelines for assessment of AIS. Based on field experience many utilities have developed their own PD level guidelines. The guideline from HVPD which is based on similar switchgear within the UK industry and was found that this guideline with four level indication can successfully be used and trusted with high confidence [20]. 99 31st Conference on Condition Monitoring COMADEM 2018, North-West University, and Diagnostic Engineering Management South Africa. 7. PROPOSED NEW TEST METHOD BY COMBINING PD MEASUREMENT DURING OVER VOLTAGE WITHSTAND TEST Unconventional test methods are sometimes not preferred by industry users because the interpretation of the physical discharge quantity cannot be determined with high accuracy and confidence. The conventional test methods were originally developed to determine true discharge quantities and connects directly to the test object with suitable capacitor. This preferred test is normally performed in controlled laboratory condition where noise levels can be controlled and reduced to minimum levels. This test procedure is difficult to perform on-site due to the uncontrolled environment and availability of test equipment. The unconventional (IEC62478) measurement method has lately become the preferred method and all PD equipment manufacturers have released several new Hand Held Devices as well as OLPD systems. A test method that has not yet been considered by the industry proposes to combine the functionality and measuring capabilities of PD instrumentation while conducting off line the power frequency voltage withstand test over a range of voltages. A considerable amount of additional information can be captured during this test because measuring at higher level frequency components are possible. 8. PRACTICAL TESTS AND ASSESSMENT ON AIS SAMPLE PANELS AND THE INTERPRETATION OF RESULTS 8.1. Selection of most suitable sensor and measurement instrument The sensor type, instrument capabilities and responsiveness to increasing AC voltages were used as selection criteria to identify the most suitable instruments. The PDS Insight from HVPD (with AA, TEV, HFCT) and Ultra TEV from eaTechnology (with contact probe AA, TEV) were selected as most appropriate and the preferred instruments. Capturing measurements at the higher test voltages were problematic as the continuous display of readings on the ultrasonic instrument changes rapidly. A 10 second measuring time was allowed for the ultrasonic reading. The 5 second recording capability of the HVPD Insight was an advantage. Repeatability was proven by conducting the same test more than once at the same voltage. The results were acceptable and within a 10% discrepancy between readings. It should be noted that measurements were taken at the same voltage with each separate instrument and sensor, but not at the exact same instance of time. Although there is good correlation between the various measurements, the values because it is not taken at the exact same instance of time. 8.2. Selection of panel samples The selection of AIS panel samples consisted of 6 individual Reyrolle/ABB panel types, namely: LMT, LMS and LMR [21]. The LMT type was followed by two improved designs, namely LMS and LMR, all being fully compatible with one another. 100 31st Conference on Condition Monitoring COMADEM 2018, North-West University, and Diagnostic Engineering Management South Africa. Equipment from a redundant substation became available for research and tests and a sample from the LMT, LMS and LMR were taken to focus on equipment that has been in service for a long duration, respectively 52, 24 and 14 years. The selected panels were labelled Sample #1, #2 and #3 respectively. It was observed that all the compartments were covered with some dust deposits and that the insulation was not fully clean and free from dirt. The condition basically resembles typical substation environment. It was decided not to immediately clean the switchgear (Sample #1, #2 and #3) before testing, but to rather look at the impact of this undesirable condition on the measurement results, and correlate the test results with visual inspection. The same test procedure will be followed once the switchgear has been properly cleaned and restored to satisfactorily condition. This test will be conducted at later stage and identified as future work. Table 3: Detail on panel samples used during the tests. A total of 6 samples were therefore selected that were used during the practical test and assessment. Table 3 provides for summary of information on the different panel samples. The samples can be divided into two distinct groups, namely an older Group A (Samples #1, #2 and #3) and a newer Group B (Samples #4, #5 and #6). 8.3. Test procedure and set-up The approach on how the assessment and tests were conducted can be summarized as follows:- a) Same test method and approach on LMT, LMS and LMR panel types. b) Test only the primary busbar compartment, with all phases connected. c) Take HFCT, TEV and Ultrasonic measurements during the test. d) Increase the applied test voltage from 12kV to 21kV in 3kV steps. e) Cleaning of insulation was only done on the Group B samples. 8.4. Conducting Practical Tests A systematic approach was followed by conducting the practical tests to arrive at three distinctive objectives:- Objective A: Voltage and current, as well as PD measurements will be taken during all tests. This is to confirm the correlation between leakage current and PD to prove in general that the test set-up and procedure is valid and acceptable. Objective B: A variety of PD sensors are available for AIS assessments. It is important to identify the most suitable sensors applicable to the type of asset assessment, in this case AIS. 101 31st Conference on Condition Monitoring COMADEM 2018, North-West University, and Diagnostic Engineering Management South Africa. Objective C: Determine the sensitivity of sensor and measurement equipment by conducting before and after tests on specific samples. Group B (Samples #4, #5 and #6) was tested before and after cleaning of insulation. 8.5. Observation, interpretation and comparison of the results a) There is excellent correlation between the leakage current (mA) at different voltages, as well as between the different panel types. The overall increase in leakage current between lowest and highest voltage readings are 3.3, 6 and 3.6 times respectively for LMT (Sample #1), LMS (Sample #2) and LMR (Sample #3) panels. This is typical and acceptable results for AIS. b) It is noticeable that the leakage current for the newer Group B (Samples #4, #5 and #6) are higher than Group A. It was expected to achieve lower readings because the insulation is new. c) There is excellent correlation between peak HFCT PD measurements at different voltages, for each individual panel, as well as between the different panel types. The overall increase in peak HFCT PD between lowest and highest voltage readings are 398, 178 and 153 times respectively for LMT (Sample #1), LMS (Sample #2) and LMR (Sample #3) panels. d) Graph 1: Combined HFCT Peak PD and Leakage Current measurements for all 6 samples “Withstand Voltage (X - axis) versus Leakage Current (Y1 axis) & HFCT Peak PD (Y2 axis)” e) There is good correlation between peak TEV PD measurements at different voltages, for each individual panel, as well as between the different panel types. The overall increase in peak TEV PD between lowest and highest voltage readings are around 10 times for all 3 panels. The measurements appear to be less sensitive compared to HFCT. 102 31st Conference on Condition Monitoring COMADEM 2018, North-West University, and Diagnostic Engineering Management South Africa. Graph 2: Combined TEV Peak PD and Leakage Current for all 6 samples “Withstand Voltage (X1) versus Leakage Current (Y1) & TEV Peak PD (Y2)” f) Sensitivity of measurement and test set-up was proven by first obtaining TEV (Graph 3) and HFCT (Graph 4) measurements on samples 4, 5 and 6, for Group B. The test was repeated after the insulation was thoroughly cleaned. Results of all three tests indicate that both TEV and HFCT measure significantly lower levels of partial discharge. For illustration purposes, only results of Sample #6 is shown which is a good representation of the entire sample group. Measurements are consistently lower for the cleaned insulation, except for the first two TEV readings which are approximately the same value. Graph 3: Withstand Voltage versus TEV Peak PD for dust and clean conditions for sample 6 only Graph 4: Withstand Voltage versus HFCT Peak PD for dust and clean conditions for sample 6 only g) Graph 5 and 6 indicate HFCT Peak PD and TEV Peak PD for all 6 samples. There is a clear distinction between the two sample groups, namely the older Group A and newer Group B. This is depicted on the graph by the blue and green ovals, respectively for the older Group A and newer Group B. It is more predominant on the TEV readings where the newer sample group measures 103 31st Conference on Condition Monitoring COMADEM 2018, North-West University, and Diagnostic Engineering Management South Africa. consistently lower levels at three of the four data points. It is only at the 12kV data point where no distinction is visible. For the HFCT measurements, this phenomena is less prevailing although also clearly visible at the higher voltages at 18kV and 21kV. Graph 5: Withstand Voltage versus HFCT Peak PD for all 6 samples Graph 6: Withstand Voltage versus TEV Peak PD for all 6 samples 9. CONCLUSIONS Off Line PD measurement combined with power frequency withstand voltage test during planned maintenance as Condition Based Monitoring (CBM) tool can provide additional information for asset management systems. Peak PD measurements were found to be a good indicator for analysis purposes. It was confirmed that there is excellent correlation between leakage current and both HFCT and TEV PD measurements. Both HFCT and TEV sensors and measurement capabilities proved to be sensitive enough to distinguish and differentiate between deliberate surface discharge and cleaned insulation. The detection method is therefore ideal to differentiate between good insulation and deteriorated insulation. 104 31st Conference on Condition Monitoring COMADEM 2018, North-West University, and Diagnostic Engineering Management South Africa. Although HFCT, TEV and ultra-sonic methods were found to produce repeatable results, HFCT appears to be the most sensitive. TEV produced consistent results although not as sensitive compared to HFCT. HFCT measurement results is most sensitive and revealed significant increases in PD peak levels at higher voltages. Ultrasonic instrument could not reliable display and record the measured values and was therefore not included in this paper. It appears that the busbar insulation design, development and manufacturing from the Reyrolle switchgear range improved, as expected, from the older LMT to the newest LMR type as reflected by significantly lower PD measurements. The proposed new method could therefore be considered for the developing of health indices for AIS. 10. FUTURE PRACTICAL WORK AND TESTS This paper has not considered the full value of count and activity measurements which is typically available on some HHD instruments. It is believed that these parameters could also contribute towards assessment methods. Conduct conventional PD test on Sample #1, #2 and #6 to compare conventional measurement results with unconventional results. The test can be conducted at the Rotek facility at Eskom in Johannesburg, South Africa. The improved test method can provide information on the condition of the insulation in order to determine Remaining Useful Life (RUL) of the insulation by making use of TEV measurements. Develop a proposal to determine or calculate RUL for AIS insulation by theoretic and practical model. Both HVPD and eaTechnology recently released advanced Phase Resolved Partial Discharge HHD with full recording and sophisticated noise filtering capability. This will improve integrity of field assessments and provide more credible measurements. 11. REFERENCES [1] G. Dennis, “Non-intrusive techniques for detecting partial discharge in HV and EHV switchgear,” University of Bath, Oct 2008. [2] G. Paoletti, M. Stephens, G. Herman, M. Whitehead, “The most ignored maintenance electrical item in the plant electrical power distribution system and practical solutions.” [3] IEC 60060, “High Voltage test techniques, Part 1, General definitions and test requirements,” 1989. [4] IEC62271-200, “International Electrotechnical Commission, High Voltage Switchgear and controlgear,” 2011. [5] IEC60270, “International Standard, High Voltage Test Techniques: Partial Discharge Measurement,” Third Edition, 2000 [6] IEC62478, “International Standard Technical Specification, High voltage test techniques - Measurement of partial discharges by electromagnetic and acoustic methods,” Edition 1.0, 2016 [7] J.E. Smith, G. Paoletti, I. Blokhintsev, “Experience with On Line Partial Discharge Analysis as a tool for predictive maintenance for medium voltage (MV) Switchgear Systems,” IEEE Petroleum and Chemical Industry Committee, 2002. 105 31st Conference on Condition Monitoring COMADEM 2018, North-West University, and Diagnostic Engineering Management South Africa. [8] L. Renforth, R. Giussani, M. Seltzer-Grant, M. Marcarelli, M. Foxall, “On Line and Off Line PD monitoring: The experience in testing MV and HV apparatus from an industrial point of view” [9] M. Muhr, T. Strehl, E. Gulski, K. Feser, E. Gockenbach, W, Hauschild, E. Lemke, “Sensor and sensing used for non-conventional PD detection,” Cigré, 2006 [10] D. Koenig, P. Roesch, “Contribution on the specification of an aging test for the insulation of enclosed, air insulated switchgear,” IEEE Transactions on Electrical Insulation, 1987 [11] S. Keim, D. Koenig, “Standardised and non-standardised aging tests for indoor switchgear and switchgear components under severe climatic conditions,” IEEE International Symposium on Electrical Insulation , 2002 [12] X. Liu, X. Xiaohui, Z, Suo, L. Luo, “Maintenance techniques to eliminate partial discharge in MV air insulated switchgear” [13] Y. Ehara, K. Aono, S. Koshio, “Degradation analysis of epoxy resin surfaces exposed to partial discharge,” IEEE, 2016 [14] H. Okamoto, M. Kanazashi, T. Tanaka, “Deterioration of insulating materials by internal discharge,” IEEE Transactions on Power Apparatus and Systems, 1977. [15] C. Lowsley, N. Davies, D. Miller, “Effective condition assessment of MV switchgear,” eaTechnology. [16] ABB, “Air insulated switchgear Life Cycle Services,” 2011. [17] IEEE Standard C37, “Standard for Metal-Clad Switchgear,” 2015. [18] R.H. Goodwin, “A handy guide to the safe on-site testing of electrical equipment,” H.V. Test (Pty) Ltd. [19] G. Pudlo, “Advanced PD monitoring of power transformers supporting asset management,” Doble, Germany, 2009. [20] HVPD, “Guidelines for TEV PD Measurements for MV Switchgear,” High Voltage Partial Discharge Ltd, May 2009. [21] ABB, “Retrofitting of Reyrolle type LMT switchgear,” Technical Data Sheet, http://www.abb.com/heritage. 106 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management A Condition Based Reliability Simulator Framework based on a Heuristic Fault Model Dr. H.F. Swanepoel 1* & Prof. J.H. Wichers 2 1 Department of Mechanical and Nuclear Engineering North West University, South Africa swanephf@eskom.co.za 2 Department of Mechanical and Nuclear Engineering North West University, South Africa Harry.Wichers@nwu.co.za ABSTRACT There are significant concerns as well as remedial efforts by the SA Power Utility to improve the generation plant performance that showed a significant decline over the past number of years. There is general consensus on the significant opportunity for the power utility to leverage Condition Monitoring (CM) and Advanced Analytics (AA) technologies to assist in the turn–around of technical performance of the power station fleet. Over the years, extensive online-monitoring technology and predictive condition monitoring capability have been introduced on high-value assets like the generator, turbine, transformer and boiler; with much less focus on Balance of Plant (BoP) equipment and machine trains. The research study developed a Condition Based Reliability Simulator (CBRS) Framework that uses a heuristic fault and failure analytical model based on the Design Basis of plant systems and equipment. By developing integrated advanced analytical and predictive fault models based on a thorough understanding of critical Design Basis parameters and typical failure behaviors, it is possible to better identify impending failures and apply the most appropriate CM technology at the right time. The research considered operating philosophy and maintenance strategy impacts and demonstrated how the CBRS Framework and its associated heuristic fault model approach can significantly enhance predictive capability and assist with asset management activities to achieve the lowest total asset cost (Figure 1). Figure 1: Optimising Assets towards the point of lowest total cost [2] 1 The author is enrolled for a Ph.D (Mechanical) degree in the Department of Mechanical and Nuclear Engineering, North West University. 2 The author is Director of the Department of Mechanical and Nuclear Engineering, North West University 107 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Due to the generic nature of BoP equipment and machine trains, it would have application to the wider proc ess plant industry that uses similar equipment. The successful and practical application of this CBRS Analytical Framework and Fault Model development and implementation methods are discussed in this paper. Keywords: Condition Monitoring; Pattern recognition; Fault detection and localization; Diagnosis; Industrial asset management. 1. INTRODUCTION Modern design practices for plant components use the “Design for Reliability (DfR)” principle creating a highly reliable asset base where high levels of availability and reliability should be the norm - and if proactive steps are taken to monitor, evaluate and optimise operations, plant asset life can be maximized to achieve and even exceed design life. The current performance of South African power generating plants is no longer on par with average to top quartile global performers, while in the 1990's the utility consistently performed in the top quartile [3]. Most power stations in the Utility have a formal Condition Monitoring program in place, but still experience unacceptably high levels of UCLF/PCLF (which measures unplanned outages and extended maintenance outage durations – see Figure 2). Figure 2: Increased Unavailability (UCLF) in the Power Station Fleet [3] The research study confirmed that balance of plant (BoP) equipment failures significantly contributes to partial load losses and overall plant reliability [10]. As such, a BoP machine train became the focus of this experimental research study and basis for the heuristic fault model development aspects related to the Condition-based Reliability Simulator (CBRS) Analytical Framework. The BoP areas are generally serviced with hand-held condition monitoring devices and a condition monitoring program that is frequency based. Fault condition analysis and prediction is left to the vibration analyst, assuming a high level of skill, competency and in-depth plant Design Basis knowledge (which is many times not the case). Further evaluation of the current CM program in the Utility shows a blanket implementation of the ISO standards related to condition and vibration monitoring practices, with limited consideration of the Design Base of the systems and equipment involved. A significant challenge of CM program efficiency is the amount of “standards” and “guidelines” provided for health assessments, many times contradicting each other. Even the International Standards Organisation [4] acknowledges that they can at best provide guidelines. 108 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management A further challenge is that research and practical experience show that age–related failures is the minority and that “Random Failures” makes up 89% of typical failures [5]. Generic failure modes also do not consider cases where specific plant design or operational conditions imposes very specific stresses on general machine condition and operational capabilities. This has been pointed out in a number of research studies, for example Sondalini that describes the risk when acting on CM results without considering operational and production delivery stresses . He discusses how incorrect life prediction and failure curves can add to the challenges of ensuring an effective CM Program and points out how a Predictive Maintenance strategy without economic considerations is often the root cause for early abandonment of a CM Program. Incorrect CM program frequencies and overly specified monitoring requirements also result in some CM programs becoming cost inhibitive and eventually abandoned [6]. An Emerson study states overwhelmed operators and complex plant operations as one of the biggest challenges in process plants that demand a new approach to managing plant more efficiently and predictively [7]. Their view is that abnormal situation prevention is one of the biggest potential productivity gains in the process plant operating space. By developing integrated advanced analytical and predictive fault models based on a thorough understanding of critical Design Basis parameters and typical failure behaviors, it is possible to better identify impending failures and also better utilise the most appropriate CM technology at the right time. The researcher postulated the theory that effective and efficient decisions regarding plant condition can only be made taking into consideration all aspects, including Plant Design Basis, equipment des ign, plant age, process design and influences; and doing this in a consistent manner using a standardised framework to guide decisions. The primary aim and outcome of the experimental research study were to establish a Condition Based Reliability Simulator (CBRS) for power stations based on a standardised Advanced Analytics Framework, and which includes a Fault Model development and implementation methodology. 2. THE CBRS ANALYTICAL FRAMEWORK AND FAULT MODEL Since 2013, there has been significant movement and maturity of technology elements considered to be fundamental building blocks for advanced analytics capability development (i.e. Big Data, Content Analytics, Prescriptive analytics, predictive analytics, augmented reality, etc.) [8]. 2.1. The Advanced Analytics Framework The research study developed the CBRS Analytical Framework based on the MIMOSA OSA–CBM® Specification [9], as well as analytical framework recommendations by Angeli [1]. 109 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 3: The CBRS Analytical Framework 2.2. The Heuristic Fault Model Framework and Implementation Methodology The CBRS Analytical Framework is supported by a standardised implementation methodology, as well as a successfully verified Fault Model development and implementation methodology (which was validated using research study use cases). The various elements making up the Analytical Framework were built in a phased approach, indicated in Figure 4. The colours used in Figure 4 shows alignment with the relevant MIMOSA OSA-EAITM Specification elements of Data Acquisition (Green), Data Manipulation (Aqua), State Detection (Red), Health Assessment (Grey), Prognostic Assessment (Orange), Advisory Generation (Maroon) and linking to External Systems (Purple). 110 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 4: CBRS Framework and Fault Model Implementation Methodology The need to integrate advanced analytics in a well-managed framework is confirmed by Emmannouilidis in their research [10]. In their research, Lei confirm the vital importance of understanding fault behaviour and characteristics in line with equipment Design Basis. This information is crucial to develop fault trees, determine failure mechanisms and failure progression patterns, and they strongly advise against using and adopting “common models” or assumptions that all equipment types will exhibit the save behaviour and fault characteristics [11]. The extent of the MIMOSA specification adoption in the CBRS Analytical Framework makes this a unique contribution to the field of specifically condition monitoring and vibration analysis , as most research in these two technology areas would focus only on elements of the specification, and none of the literature consulted in the research survey provided a holistic framework that fully encompass the MIMOSA specification in its entirety. Researchers tend to focus their energy on only the advanced analytics algorithm aspects and very few, if any, would describe a standardised methodology with which to achieve a consistent outcome across a wide variety of equipment types (their research tend to focus on specific equipment types only). At best, they would describe some of the elements depicted in Figure 4 [12, 13, 14, 15, 16]. 2.3. The CBRS Analytical Framework Fault Model A significant challenge with advanced vibration analytics is the significant amount of data captured with each data point captured in the plant. This is in line with the findings of Rozados that states that machine-generated data far exceed conventional ERP and CRM system data in volume, velocity and variety of data generated [17]. Biehn shares this view and states that as little as 5% of “big data” gathered results in 95% of the value contribution of the data [18]. Marwala’s extensive research suggests that modal properties, frequency response functions (FRF) and wavelet transform (WT) data is required for meaningful and correct advanced vibration data analytics and the use of AI Phase 1 Phase 2 Phase 3 111 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management algorithms, but points out that the use of FRF and WT data is significantly more challenging than modal property analysis due to the volumes of data collected during the data capturing process [19]. The CBRS research study developed a detailed Fault Model methodology and validated it by means of research use cases. A unique mechanism (ARMA Statistical modelling) was implemented in the methodology to address the “big data” challenges described [17, 18, 19]. Using ARMA it was possible to identify Specific Fault Frequencies of Interest (SFFoI’s) in the time waveform data and these SFFoI amplitudes and trend patterns were then used to determine whether a specified fault condition exist, as well as the severity of the fault condition. The outcome from the Fault Model prediction was validated by means of detailed vibration spectrum and time waveform analysis (internal validation), as well as physical inspections on equipment identified to present with the fault conditions (external validation). The CBRS Fault Model proposes the use of five levels of degradation as indicated in Figure 5. Degradation thresholds were required to prove the postulated theory that there is a point in the asset life (when a failure condition is present in the asset), where the cost of repair will approximate the cost of failure. At this cost intersection point (repair cost vs. failure cost), the opportunity for “cost avoidance” will be lost. Ideally the equipment should be removed in sufficient time BEFORE reaching this point to justify the condition monitoring effort; as well as to allow for effective and timeous maintenance planning and spares sourcing; and to minimise production loss costs. Figure 5: CBRS Fault Model Degradation Levels And this is the primary intent of the fault models developed as part of the CBRS Analytical Framework – identify the optimal point for removing equipment from service for repairs or replacement. It was concluded that equipment should be scheduled for inspection or remedial action when “Notable Degradation” is identified, and removed for repair when Extensive Degradation is noted. Equipment can be kept in service for a limited period of time (depending on the failure degradation curve behaviour) when there is “Fault Match”, but the repair cost will approximate failure cost with the resultant loss in “cost avoidance” opportunity. 112 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3. PROVING THE ANALYTICAL FRAMEWORK AND FAULT MODELS Three use cases were used to build and prove the required heuristic fault models. They covered mechanical, electrical and complex fault conditions. A complex fault condition in the research study is defined as a fault imposed by an external machine train component on another piece of equipment in the machine train. This paper discusses Use Case 2 which covers an electrical fault condition. 3.1. Use Case 2: Electrical Supply Phase Imbalance Use Case 2 covered the fault condition of an electrical supply fault condition, i.e. Supply Source Phase Unbalance. Phase imbalance is typically caused by loose phase connections, although supply breaker fault conditions can generate similar fault indications. The typical fault frequencies of interest (FFoI’s) are indicated in Table 1. During initial FFoI identification, acceptable levels of vibration specified by the equipment manufacturer were used as a baseline to define degradation thresholds. ARMA statistical analysis methods were then employed to determine the most statistically significant FFoI, termed the “Significant Fault Frequency of Interest (SFFoI)”, which in this case is the 2 x Line Frequency (100 Hz) FFoI. Table 1: Electrical Phase Imbalance Fault Frequencies of Interest (FFoI’s) Fault Frequency Frequency Unit Motor Running Speed 993 cpm Electrical Line Frequency (50 Hz) 3,000 cpm 2 x Line Frequency (100 Hz) 6,000 cpm Rotor Bar Pass 69,510 cpm Stator Slot Pass 89,370 cpm Once the SFFoI was identified, a baseline case was identified that most typically present the failure behaviour and degradation curve to be expected. Figure 6 depicts the performance degradation curve (in red) of the motor considered to form the baseline for this fault condition, i.e. Mill 2D Motor. This is a polynomial 2nd order curve fit based on the actual vibration behaviour data taken over the time period in question. Figure 6: Phase Imbalance Baseline Degradation Curve Once the trend curve and SFFoI for the baseline case was confirmed, additional historical cases of the fault condition were sourced as an internal validation mechanism to confirm that their SFFoI values display the same failure progression behaviour and that the identified SFFoI is indeed most indicative of the fault condition. The trend curves exhibited by these additional cases, as well as evidence from the external physical inspections performed at the time of removing the equipment from service, were used to define the failure threshold levels for the specific fault condition (in line with Figure 5). 113 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3.2. Validating the Fault Model Use Case Internal validation involved detailed evaluation of the vibration spectrum of Mill 2D motor (see Figure 7). Figure 7: Motor Vibration Spectrum with electrical fault indications Stator Slot Pass (SSP) fault frequency and its harmonics are clearly indicated in Figure 7 (marked “D”), and significant 2 x Line Frequency (100 Hz) sideband harmonic frequencies are visible around the 2nd SSP harmonic frequency. The 2nd SSP harmonic is also significantly higher in amplitude when compared to the fundamental SSP amplitude level. Amplitudes of the 2 x Line Frequency and its harmonics in the lower frequency range are also very notable (Fundamental, 2nd and 3rd harmonic). External validation of the fault condition was achieved by evaluating the entire dataset for the 36 motors deployed in the power station’s milling plant, and the SFFoI amplitude distribution is indicated in Figure 8. Figure 8: Phase Imbalance Baseline Degradation Curve 114 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management This distribution curve was then used to identify more motors that present with SFFoI amplitudes above the “Extensive Degradation” threshold value. As mentioned, this threshold was deemed by the research study to be the optimal point to leverage cost avoidance opportunity. Figure 9: Phase Imbalance Baseline Degradation Curve These motors SFFoI failure curves were mapped against the baseline case (see Figure 9) and based on severity of the failure trend curve in Figure 9; remedial action recommendations were made (summarized in Table 2). Table 2: Phase Imbalance Fault Condition Severit y Allocation Motor Condition Motor Candidates Recommended Action Fault Match Mill 3A (03NM10) Remove motor from service and inspect all electrical supply breakers and connections for faults and looseness, based on high SFFoI amplitudes (> 600 microns) as well as failure progression curve behaviour. Urgent priority for remedial action. Extensive Degradation Mill 1C (01NM30) Schedule outage to inspect electrical supply components for looseness and damage. Not critical and can be part of normal scheduled outage. Notable Degradation Mill 6C (06NM30) Mill 2D (02NM40) Increased monitoring and schedule electrical supply inspection to coincide with normal scheduled maintenance. Early Degradation Mill 2B (02NM20) Mill 5E (05NM50) Maintain monitoring program frequency and act if degradation curve behaviour changes. Normal * Normal vibration monitoring program frequency to be applied. *Note: Due to the amount of motors falling within the “Normal Category”, this was not indicated in this table. The relevant motors was then scheduled for a physical inspection and in all cases revealed significant issues with electrical phase connections, either at the supply junction box or at the motor phase connections. Figure 10 reflects the infrared thermography (thermal) images taken of supply cables on the Mill motors. On the left hand side is an infrared image of the normal cable temperature expected on a mill motor with healthy connections (± 36°C), and the right hand side shows increased cable temperature (45°C) on the motor with the fault condition (Mill 2D). 115 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 10: Infrared Thermography Image of Cable overheating due to hot connections As part of Analytical Framework continuous improvement, more data was added to the Fault Model database for optimisation purposes. Mill 1F motor was subsequently identified as another case meeting the SFFoI fault and threshold criteria for this condition. This motor was removed from service and the phase connections inspected during a 3000hr Mill Service outage. Two of the phase connections were found to be severely damaged with one of the connections nearly destroyed by electrical arcing (Figure 11). Figure 11: Severe Phase Connection Damage 4. CONCLUSION The outcome of the research study proved that a combination of condition monitoring assessment, Design Base generated analytical fault models and understanding of the impact of operating and maintenance philosophies was vital to provide the most accurate assessment of plant health and failure prediction. The research study also successfully designed, implemented and validated the CBRS Analytical Framework and Fault Model methodology and demonstrated it with operational research and use cases as a viable tool with which failure identification and failure degradation predictions can be made using vibration data analysis on rotating machinery. The research study outcome also demonstrated that although early warning failure detection capability is valuable, the benefit of such early detection must be balanced with the most optimal asset maintenance cost (repair versus overall failure cost). This is crucial in the drive of ensuring that the lowest total asset cost (depicted in Figure 2) can be achieved on power plant assets. 116 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management REFERENCES [1] Angeli, C. and Chatzinikolaou, A. (2004). On–Line Fault Detection Techniques for Technical Systems: A Survey. Department of Mathematics and Computer Science, Greece. Technomathematics Research Foundation. International Journal of Computer Science & Applications, Volume 1, No. 1. 2004. [2] Covino, L. and Hanifan, M. (2011). Proactive Approach to Managing Production Assets. Bently Nevada. Orbit Magazine, Volume 31, Number 2, 2011. [3] Ngubane, B. (2015). Eskom Holdings Corporate Plan 2015/6 – 2019/20, Rev 3. Eskom Holdings (Pty) Ltd. 24 April 2015. [4] ISO Standards Organisation. (1996). ISO 7919-2: Mechanical vibration of non-reciprocating machines - Measurements on rotating shafts and evaluation criteria - Part 2: Large land-based steam turbine generator sets. [5] Mobius Institute , (2010). Vibrat ion Training – Category I, II and III Training Modules. Developed between 2005 and 2012. Published as Revision 3.0 published in 2010. [6] Sondalini, M. Don’t Waste your T ime and Money with Condition Monitoring. Article by Lifetime Reliability Solutions. D rawn from www.lifetime-reliability.com on 10 April 2017. [7] Emerson. (2003). White Paper: Reducing Operations and Maintenance Costs. Emerson Process Management, Austin Texas. www.EmersonProcess.com. September 2003. Retrieved January 2012. [8] Rivera, J. and Van Der Meulen, R. (2013). Gartner’s 2013 Hype Cycle for Emerging Technologies Maps out Evolving Relationship Between Humans and Machines. Gartner, Stamford, Connecticut. 19 August 2013. [9] ISO Standards O rganisation. (2007). ISO 13374-2: Condition monitoring and diagnostics of machines – Data processing, communication and presentation – Part 2: Data Processing. [10] Potgieter, C. (2017). Station Statistical Overview of Partial Load Loss Contributions to UCLF. Report compiled and issued on 13 March 2017. Eskom Holdings. [11] Emmanouilidis, C., Fumagalli , L., Jantunen, E., Pistofidis, P., Macchi, M. and Garetti, M. (2010). Condition monitoring based on incremental learning and domain ontology for condition-based maintenance. APMS 2010 International Conference on Advances in Production Management Systems, Italy. October 2010. [12] Lee, Jl, Ni, J, Djurdjanovic, D and Liao ,H. (2006). Intelligent Prognostics Tools and E-Maintenance. Computers in Industry, Volume 57. August 2006. [13] Shaheryar, A., Yin, X.C and Ramay, W.Y. (2017). Robust Feature Extraction on Vibration Data under Deep-Learning Framework: An Application for Fault Identification in Rotary Machines. International Journal of Computer Applications, Volume 167, No 4. June 2017. [14] Sheng, S. (2012). Wind Turbine Gearbox Condition Monitoring Round Robin Study – Vibration Analysis. Technical Report by the National Renewable Energy Laboratory, US Department of Energy. July 2012. [15] Lei, Y., Lin, J., Zuo, M.J. and He, Z. (2014). Condition Monitoring and fault diagnosis of planetary gearboxes: A Review. Measurement, Volume 48. February 2014. [16] Yang, S., Xiang, D., Bryant, A., Mawby, P., Ran, L. and Tavener, P . (2010). Condition Monitoring for Device Reliability in Power Electronic Converters: A Review. IEEE Transactions on Power Electronics, Vol. 25, No. 11. November 2010. [17] Wang, W. (2003). Modelling condition monitoring intervals: A hybrid of simulation and analytical approaches. Journal of the Operational Research Society, Volume 54, Issue 3, pp. 273-282. March 2003. [18] Rozados, I.V. and Tjahjono, B. (2014). Big Data Analytics in Supply Chain Management: Trends and Related Research. Supply Chain Research Center, Cranfield University. 2014. [19] Biehn, N. (2013). The Missing V’s in Big Data: Viability and Value. PROS. www.wired.com/insights. May 2013. [20] Marwala, T. (2012). Condition Monitoring Using Computational Intelligence Methods: Application s in Mechanical Systems. University of Johannesburg. Springer. Published in 2012. 117 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Cloud based Real-time Condition Monitoring Model for Effective Maintenance of Machines Ganga Dhandapani 1, Ramachandran Veilumuthu2 1 Department of Electrical and Electronics Engineering, NIT Nagaland, Dimapur 797103, India 2 Department of Information Science and Technology, CEG, Anna University, Chennai 25, India ABSTRACT This research aims at transforming the conventional condition monitoring model by including cloud services at the group and process control levels through virtualization of Process Control System (PCS) for effective maintenance to achieve maximum performance of machines. Application specific condition monitoring algorithms are represented as cloud services. The parameters of the machines which are operating in an industrial network are communicated to the virtualized PCS and distributed to the cloud services. An experimental setup consisting of a servomotor and an induction motor operated by S120 and V20 AC drives respectively is considered to validate the proposed cloud-based condition monitoring model in real-time. The vibration signals acquired from the servomotor and induction motor at different operating conditions have been communicated to virtual PCS. The non-stationary vibration data of the machines are processed using the cloud services to estimate the respective vibration threshold levels adaptive to the operating conditions and have been archived as contextual repository of thresholds in the virtual machine. These vibration threshold levels corresponding to specific operating conditions are received by the PCS as responses for alarm fixation, which facilitates reliable monitoring of the machine conditions together with prediction of machine behavior pattern and coordinates the processes of multiple machines in real-time. Immediate decision making, flexible analysis under real-time operating conditions and scalability are the inherent features of the proposed cloud-based condition monitoring model. The convergence of cloud applications with conventional condition monitoring models is the root cause for innovative research that provides accurate solution for Data Logging, Control and effective Monitoring in all industrial applications and leads to enhancement in coordinated machinery diagnostics and process control decisions in industrial automation. Keywords: Condition Monitoring, Cloud, Virtual Process Control System, Vibration Alarms, Machine Maintenance Corresponding author: D. Ganga (gangaadhan@gmail.com) 1. INTRODUCTION Conventional technologies and their convergence with the existing technologies lead to profound changes in the field of machine condition monitoring and enable better decision making in automation which consecutively improve the performance, reduce the downtime and frequency of failures. The ever growing advancements in the technologies of smart sensors, actuators and embedded devices in association with cloud ensure efficient and upgraded method of control and monitoring of electrical machines which are located at geographically distributed industrial environments. The new paradigm of Internet of Things (IoT) leads to lot of innovation in Industrial applications due to ease in gathering of information and communication with other devices. IoT provides a common and conducive platform to sense data and communicate with heterogeneous devices, and monitor applications which are executing or functioning in a distributed environment. Automation is the process that incorporates multiple technologies in interdisciplinary manner to make any system to operate without or much less human interaction, but leading to very high performance. Enhancement in the communication technologies, advancement in the sensing and monitoring systems and in turn enrichment in gaining access to information and interaction between heterogeneous devices via Internet along with the concept of virtualization overlay a strong and innovative research direction in Industrial Automation applications. The performance of the machines which 118 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management are operating under dynamically varying conditions depends on their automated maintenance through effective condition monitoring with precise data acquisition, appropriate data analysis, communication of results of the analysis to all the relevant systems for carrying out necessary fault detection, control and recovery actions. Hence, efficient condition monitoring is a highly significant component in the architecture of Industrial Automation and has been critical for mitigation of failures through prediction techniques and formulation of subsequent solutions. Accurate calibration of measurement systems, reliable data logging and exchange, real-time and collaborative analysis in platform independent manner are the various key aspects of effective condition monitoring and automation of industrial machines. In this research, the convergence of cloud and IoT technologies for analysis of real-time systems has been brought into the implementation of condition monitoring for electrical machines and enhanced with virtualized PCS to support scalable and interoperable data exchange between machines from group control level to process control level with features of flexible and collaborative analytics, fixation of adaptive alarms with contextual thresholds and control of multiple machines in real-time operating environment. The proposed cloud model for condition monitoring is implemented on an industrial prototype set up having machines and drives, IoT devices, virtualized PCS and VMs executing vibration analytics and decision making algorithms for effective maintenance. The model gives collaborative access to machine data for analysis and decision making from any geographical location. 2. RELATED WORKS The application of cloud for industrial applications is currently being proposed and discussed by various authors. Omid Givehchi et.al [1] has presented an overview on the application of cloud computing technology for industrial automation. It has been viewed at three levels viz. Enterprise Management and Manufacturing Execution level, Process Control Level and Field level. The authors have envisaged the importance of cloud solution for lower levels of automation i.e., at control and field levels. The physical devices at the lower levels have been integrated in a single cloud by choosing service bus as information model. The integration is proposed by encapsulation of services and functions inside delivery standards of cloud. At control level, migration of soft controllers as services has been proposed. Cloud computing is seen as a solution to provide platform for integration of growing information technologies such as Internet of Things, Service Oriented Architectures and mobile computing. Similarly, the automation architecture for the optimal control of devices in process plants through Internet has been elucidated by R. Kirubashankar et.al [2]. The web based architecture proposed for the control of devices remotely consists of Programmable Logic Controller (PLC) connected to Gateway, Supervisory Control and Data Acquisition (SCADA) and Virtual Private Network (VPN). Effective point-to-point network communication architecture has been said to decrease the problems associated with Internet latency and system stability in the design of web-based remote supervisory control and information system. Larry combs [3] has stated that the functionality and reliability of conventional SCADA gets enhanced with the application of cloud for SCADA and has provided an insight on possible utilization of cloud. The SCADA can be from the cloud by itself or the data of the onsite SCADA shall be transferred to the cloud. This facilitates data availability to multiple resources. The cloud can be of public, private or hybrid nature based on the industrial requirements. Omid Givehchi et.al [4] cited about the development of Virtual PLCs on the Virtual Machines of the private cloud created using VMware’s vCloud suite. The hypervisor host is connected with more network adapters to interface with multiple physical devices. The Virtual switch manages the network policies for VMs to communicate with clients. The VM communication Interface with network adapter driver for VMs supports communication of Physical systems (here sensors and actuators) using industrial protocols such as PROFINET. The practical implementation of simple virtual PLC illustrates the possibility of having an industrial application as a Service. Sandy Dunn [5] has reviewed the reasons that drive the industries to make changes in the condition monitoring techniques and the implications they will create. The condition monitoring system adopted by industries is expected to reduce Mean Time Between Failures (MTBF). The industries consider asset effectiveness as significant because it leads to higher profit from minimum investment and the author has substantiated that condition monitoring has a significant role in bringing asset effectiveness. Condition based maintenance is centered on the process of condition monitoring which diagnoses the abnormalities by evaluation and comparison of the current condition with a fixed failure limit. The reliability of the decision that has been made is associated with the predefined 119 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management failure limits of the parameters monitored and the data interpretation made through analysis. The issues pertaining to the definition of failure limit and appropriate monitoring parameters to indicate the system deterioration have been pointed out as the challenges for future research. Willetts R et al. [6] insisted on the need for adaptive alarms in condition monitoring to avoid identification of failures due to false alarms. The techniques of classical statistics, multi-variant statistics and intelligent systems namely neural networks, classifiers and genetic algorithms have been reviewed for determination of adaptive alarms through prediction methods using historical data. Condition monitoring governs the diagnostics and prognostics adopted by the industries in bringing out efficient process and asset maintenance. Rosmaini Ahmad et al. [7] has given an overview of asset maintenance techniques, which are being performed in the industries. The practice of preventive maintenance initially carried out by OEMs could not meet the practical requirements of the industries as the machines function in different environmental and operating conditions, which demanded the incorporation of application specific scientific approaches for real-time data analysis for condition assessment. Mallikarjun Kande et al. [8] have extensively reviewed and discussed about existing machine condition monitoring techniques and industrial automation for plant-wide condition monitoring of rotating electrical machines. They pointed out the importance for on-equipment, on-premise and on- cloud integration of condition monitoring and the Distributed Control System to provide continuous monitoring of the equipment with high update rates from the sensors, to collect and send sensor data to diagnostics running as part of plant operations and to offer the elasticity required for the data and computational resources. While discussing about on-equipment and on-premise integration methods, the need for on-cloud monitoring using IoT gateway has been substantiated to meet the requirements of advanced diagnostics and data platforms for enhanced computation. The integration of remote services gives the benefit of making intensive data analysis even with the application basic data acquisition devices for condition monitoring. Steve Lacey [9] has stated that condition monitoring carried out with the incorporation of cloud facilitates comparative analysis of the conditions of similar machines or related machines. The adoption of cloud allows data sharing and enables implementation of new analysis techniques when unknown signal patterns are observed at the user end. The cloud environment provides an added value of being able to share and compare the local machine condition data with other similar machines across the plant, or with other machines at multiple plants wherever they are located. The cloud based condition monitoring system can infer the data from the distributed databases for effective decision making in vibration analysis. The vibration data is further processed in the cloud with the combination of data of one machine and other similar machines’ data with extensive analysis options. This increases the reliability of the diagnosis information for appropriate decision making. 3. CLOUD-BASED CONDITION MONITORING MODEL - IMPLEMENTATION The condition monitoring set up shown in Figure 1 encompasses a networked hardware architecture composed of motors, drives, cloud based Process Control System 7 (PCS), Remote Input Output module and IoT device connected over Industrial Ethernet. In this architecture, the data from all the networked components are collectively stored in cloud and control actions are initiated after implementing the data analysis with the condition monitoring and predictive maintenance services deployed in separate Virtual Machines. The SIMATIC PCS 7 software hosted on one of the virtual machines is capable of receiving the data acquired by different hardware components from multiple machines. In the proposed set up shown in Figure 1, S120 AC drive operating the servo motor reads the physical data of position and speed measured by proximity sensor and encoder. These data variables in S120 AC drive are then communicated to PCS where the actual values of the motor variables are compared with commanded or processed values. The control variables generated by the PCS are sent as operational commands to S120 AC drive which modulates the power input to the Servo motor as per the control variable. Alike to this, the vibration and speed data of three phase induction motor are acquired by IoT 2040 gateway through piezo electric and proximity sensors respectively when operated at different operating conditions of starting to no load speed and loading. The acquired values of vibration and speed are further communicated by IoT2040 gateway to the PCS over Industrial Ethernet through S7 communication. The vibration measurement made with piezo electric sensor using SPI protocol is unsupported by both S120 and V20 AC drives. This incompatibility of piezo electric sensor interface to either S120 or V20 drive has been overcome by S7 120 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management communication feature of IoT2040 gateway which communicates the vibration values to PCS without the said limitation. Figure 1. Cloud based condition monitoring model The motor data acquired by IoT 2040 gateway and S120 AC drive are transferred to the data tags of Continuous Function Chart executed in PCS 7 software. The tagged data of position, speed and vibration are archived in the WinCC Explorer of PCS. The virtual machine settings of the windows VMware station are configured to share the archived data to the other VMs for the execution of vibration analysis. The bridged connection setting of the network adapter has been configured so as to enable the VMs running in the host to network among each other and also to the physical network. In order to share the data files between the VMs and host of the bridged network, the option of shared folder has always been enabled and mapped as a network drive in Windows guests. The data of induction motor and servo motor that need to be archived for any required time interval are selected from the settings of ‘value column’ and ‘time column’ of WinCC online table properties. The vibration data of the induction motor at different operating conditions namely starting to no load speed with and without disturbance and loading at standalone condition are logged and archived from WinCC online table control. Such archived vibration data specific to the operating conditions are fed as input to the Condition Monitoring and Predictive Maintenance services for the determination of the vibration threshold values. The implementation of the condition monitoring service on the shaft vibration data of the induction motor has yielded the values of upper and lower threshold class clusters for different operating conditions as furnished in Table 1. This analysis reveals that the vibration threshold class clusters identified for standalone condition cannot be maintained as alarm for monitoring of machines at all conditions in machine maintenance. The integrated vibration threshold analysis carried out with two different motors namely induction motor and servo motor, operating in the close vicinity reveals the impact of the motors running in the same environment on each other’s shaft vibration pattern. The effects of external disturbance and loading on a machine’s vibration are also realized to be equivalent from the closely matching values of upper and lower threshold class clusters. The fact that the neighboring machine running at constant Vibration of Servo and Induction Motors Speed &Vibration Speed & Position Speed of Servo and Induction Motors Vibration Industrial Ethernet IoT 2040 S120 AC Drive Servo Motor V20 AC Drive Virtualized Process Controller (PCS) WinCC Explorer Running in VM 3 Induction Motor Contextual Threshold References for various Machine Conditions Condition Monitoring and Predictive Maintenance Services Running in VMs Data Sharing Remote IO 121 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management speed of 1500 rpm creates a vibration effect equivalent to that of loading in other machines substantiate the fixation of the operational limits for mechanical and electrical loading, dynamic load duty cycle, speed and torque capacity etc., during machine maintenance. Table 1: Vibration Alarm Fixation for Three Phase Induction Motor at Dynamic Operating Conditions Upper Threshold Class Cluster in g Lower Threshold Class Cluster in g Starting to No Load Speed Condition (Standalone Mode) Starting to No Load Speed Condition (Disturbance From Servo Motor) Loaded Condition Starting to No Load Speed Condition (Standalone Mode) Starting to No Load Speed Condition (Disturbance From Servo Motor) Loaded Condition 0.054 -0.007 -0.015 -0.585 -0.634 -0.630 0.267 0.203 0.190 -0.372 -0.425 -0.425 0.481 0.412 0.395 -0.159 -0.216 -0.220 0.694 0.621 0.600 0.000 0.000 0.000 0.907 0.831 0.805 0.000 0.000 0.000 Cluster Mean 0.481 0.412 0.395 -0.223 -0.255 -0.255 The values of thresholds determined are stored and used as references for the fixation of vibration alarms adaptive to operating conditions of induction motor and servo motor. The vibration alarms fixed are in turn written to the tags of PCS through SCADA layout of the WinCC (Figure 2) for carrying out precise operational decisions for motors and thereby to enhance the quality of machine maintenance. Thus, the threshold values computed from instantaneous values of the induction motor’s vibration are continuously compared with that of the alarms corresponding to the operational conditions. Figure 2. SCADA layout for machine condition monitoring The operational decisions made for the induction motor based on the measured values of vibration during starting to no load speed of 1500 rpm when the servo motor is also in operation has been illustrated in Figure 2. The actual 122 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management vibration thresholds of the induction motor running in the presence of servo motor are compared with the threshold class clusters meant for the operating condition of ‘Starting to No Load speed with disturbance’. The analysis reveals that the motor while reaching the speed of 1300 rpm at no load possess the value of 0.4026g as upper threshold class cluster mean. This on comparison with the threshold of 0.412g fixed as vibration alarm for the said condition, brings out an insight that the value of the speed set point should be fixed between 1300 rpm and 1400 rpm during this condition instead of maintaining 1500 rpm as constant speed set point. This operational decision avoids induction motor overloading under high speeds during disturbance condition and hence will improve the durability of the motor. The new speed set point has been fed as input to the tag of Continuous Function Chart of the PCS. The PCS compares the new speed set point with actual speed of the induction motor and generates control output in the range of (0-10) V. The remote IO module ET200SP connected to PCS generates corresponding analog voltage output of (0-10) V which acts as speed reference for V20 AC drive as shown in Figure 3. Figure 3. Data Transfer from Machines to PCS The analog voltage generated by ET200SP modulates the power output of V20AC drive which has been fed as input to the induction motor. In this work, the operational decision of motor speed adjustment of induction motor has been done by comparing the instantaneous vibration pattern and the thresholds corresponding to the operating condition. 4. CONCLUSION The condition monitoring of electrical machines in Industries are carried out widely with vibration signals. These signals are of non-stationary nature under dynamic conditions and hence numerous mathematical techniques are applied to extract information. Thus for effective machine maintenance, precise processing of vibration signals has been most sought and progressed. In this work, the need for a versatile condition monitoring system that possess flexible and precise analysis techniques for fixation of adaptive vibration thresholds as adaptive alarms has been elucidated. Such analysis brings outs the benefits that appropriate operational variations could bring for improved performance and life time of the machines in the working environment. The partially automated maintenance of the motors implemented by making maintenance decisions with the help of virtualized PCS, cloud services running in VMs and remote IO aid the system designers and enterprise resource managers to access the machine data and services in a collaborative and secured manner that lead to better maintenance strategies with ease. The IoT 2040 gateway also possess the feature of data exchange with the public cloud which opens up varied analysis domains for multiple and concurrent information extraction and condition monitoring decisions. Although the proposed cloud based architecture requires high end servers for mitigation of implementation overheads, the advantages of scalability, redundancy, manageability, interoperability and cost effectiveness of cloud contributes for improved performance of industrial applications. Overall, the proposed condition monitoring architecture based on cloud will address the challenges of latency in data processing, data storage and requirement of wide analytic features for predictive maintenance of machines in industries. 123 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management REFERENCES [1] Omid Givehchi, Henning Trsek, Juergen Jasperneite (2013) Cloud Computing for Industrial Automation Systems - A Comprehensive Overview.18th IEEE Conference on Emerging Technology and Factory Automation, pp. 1- 4. [2] Kirubashankar, R., Krishnamurthy, K., Indra, J., Vignesh, B. (2011) Design and Implementation of Web Based Remote Supervisory Control and Information System. International Journal of Soft Computing and Engineering (IJSCE), Vol.1, No.4, ISSN: 2231-2307 [3] Larry Combs (2011) Cloud Computing for SCADA.Control Engineering Magazine. [4] Omid Givehchi, Jahanzaib Imtiaz, Henning Trsek, Juergen Jasperneite (2014) Control-as-a-service from the cloud: A case study for using virtualized PLCs.10th IEEE Workshop on Factory Communication Systems (WFCS), pp. 1- 4. [5] Sandy Dunn (2015) Condition Monitoring in the 21st Century. Plant Maintenance Resource Center. [6] Willetts, R., Starr, AG., Doyle, A., Barnes, J. (2005) Generating adaptive alarms for condition monitoring data. International Journal of COMADEM Vol. 8, No.3, pp. 26-36. [7] Rosmaini Ahmad, Shahrul Kamaruddin (2012) An overview of time-based and condition-based maintenance in industrial application. Computers & Industrial Engineering Vol. 63, No.1, pp.135-149. [8] Mallikarjun Kande, Alf J. Isaksson, Rajeev Thottappillil, Nathaniel Taylor (2017). Rotating Electrical Machine Condition Monitoring Automation—A Review. Machines Vol.5, No.4. [Online]. Available: http://dx.doi.org/10.3390/machines5040024. [9] Dr. Steve Lacey (2017). From local to cloud-based condition monitoring. [Online]. Available: www.engineerlive.com/content/local- cloud-based-condition-monitoring. 124 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Arc Tangent Failure Rate Distribution Method Yerong Hu,Peng Yang,Yong Zhang,Jing Qiu,Guanjun Liu,Hukang Li Science and Technology on Integrated Logistics Support Laboratory National University of Defense Technology, NUDT Changsha, China yeronghu@foxmail.com, nudtyp7894@163.com, 95363020@qq.com, qiujing16@sina.com, gjliu342@qq.com, lihuakang856@163.com ABSTRACT Aiming at solving the problem that the distributed index may exceed 1 or is too small in the current fault rate distribution method, we propose an arc tangent fault rate distribution method based on arc tangent function. The distribution function model is determined by the parameter that is calculated out based on the specific object under distribution. The steps of the method are simple and easy to operate. Finally, a case is given to analyze the contrast between the conventional fault rate distribution method and arc tangent fault rate distribution method. The analysis results show that the distribution method can overcome the problem that the distributed index in the conventional fault rate distribution method is too large or too small, making the allocated indexes more accordant with the engineering practice. Keywords: testability; fault rate; index distribution; integrated maintenance management; conceptual design Corresponding author: Yerong Hu (yeronghu@foxmail.com) 1. INTRODUCTION Testability distribution refers to assigning the quantitative requirement of equipment to the components of the equipment from top to bottom level according to the given principles and methods in the demonstration stage and preliminary design stage. Testability distribution is an important part of testability design stage. It has guiding significance for determining the testability index of system [1]. It lays a solid foundation for the integrated maintenance management. Currently, the classical testability index distribution methods include experience distribution method, fault rate distribution method, weight distribution method, old-product-based distribution [2], etc. In recent years, new research has been carried out on the test distribution technology in the domestic academia. Shen Qinmu has proposed testability distribution method based on block diagram [3], Wang Baolong has put forward testability optimization distribution method based on the genetic algorithm [4], Shen Qinmu [5] and Li Jinlong [6] respectively have studied the distribution method based on analytic hierarchy process (AHP).In particular, the fault rate, which is objective and reliable to better reflect the distribution needs of the actual equipment, is the data source of the fault rate distribution method. However there are some problems existing in the classical fault rate distribution method right now. The basic principle of the fault rate distribution method is to allocate the test indexes according to the size of the unit fault rate. This method is applicable in the system that the fault rate of the component is known. We only consider the factor of fault rate in the fault rate distribution method. The distribution function of the classical fault rate method is linear. So the distributed index may varies greatly when the unit fault rate varies greatly. The distributed index of a low-fault-rate unit is low even close to 0, while the index of high-fault-rate unit is high even more than 1. This phenomenon is obviously not in line with the actual situation. 2. PRINCIPLE The derivative of a fault rate distribution function can be used to indicate the difficulty degree of index promotion. When the derivative is large, index upgrading is relatively easy and when the derivative of curve is relatively small, it is difficult to improve the index. In a real production environment, even if the fault rate of a product’s component is low, we still hope that the component can have a suitable index when the promotion of the index will 125 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management not cause great technical difficulties. Though the fault rate is relatively small, the derivative of the fault rate distribution function should be larger. It’s difficult to improve the index when it has reached a high situation. So when the fault rate of a component is high, the derivative of the function should be small. Therefore, the fault rate distribution function that satisfies the actual situation is not linear, and different fault rates should be corresponding to different derivatives. So the function should be a convex function. In summary, we can assume that the distribution function of the fault rate is a convex function. Given two constants 1 and 2 ( 1 2 > ), the distribution ( )f  should satisfy the following conditions:  1 , 1'( )f  > ( 1 < ),  2 , 2'( )f  < ( 2 > ),  lim '( ) 0f   ,  lim ( ) 1f   , where  represents the fault rate. In allusion to the condition,and mentioned above, we construct a function '( )= a f b c    (1) where a, b and c are constants greater than 0.  represents the power of  .Given =2 ,then 2 ( )= '( ) arctan( ) a a c f f d d b bb c           (2) From condition, we can calculate out 2a b   . Given = c b  , the function can be written as 2 ( ) arctan( )f     (3) 2.1. Distribution of fault detection rate Based on (3), we can adopt the fault detection rate distribution function 2 arctan( ), (1 )FDk FD k k N       (4) where N represents the number of system’s component, FDk represents the distributed fault detection rate of component k, k denotes the fault rate of component k, and FD stands for the coefficient of fault detection rate distribution. The coefficient is determined by the following steps:  In order to make the fault detection rates assigned to each component meet the requirement of the system index, the following equation should be satisfied sr 1 1 2 arctan( ) N N FD i i FD k i k          (5) where srFD represents the required index of system. The solution 1=FD FD  is the minimum value that FD can be obtained to satisfy the requirement of the system index. 126 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management  If the system needs to meet the specific requirement that the fault detection rate should reach FDq in the case that the failure rate of a unit is FDq , the following equation should be satisfied: 2 arctan( )FDq FD FDq     . (6) The solution 2=FD FD  is the minimum value that FD can be obtained to satisfy the specific requirement.  In the case 2 1FD FD  , we have 2=FD FD  ; in the case 2 1FD FD  , we have 1=FD FD  ; if the system does not give the specific requirement, we directly have 1=FD FD  . 2.2. Distribution of fault isolation rate Based on (3), we can adopt the fault isolation rate distribution function 2 arctan( ), (1 )FIk FI FDk k N       (7) where FDk represents the distributed fault isolation rate of component k, FDk represents the failure rate of unit k which can be detected and calculated by k=FDk FDk   , FD  represents the coefficient of fault isolation rate distribution. The coefficient is determined by the following steps.  In order to make the fault isolation rates assigned to each component meet the requirement of the system index, the following equation should be satisfied: sr 1 1 2 arctan( ) N N FI FDi FDi FI FDk i k          (8) where srFI represents the required index of the system. The solution 1=FI FI  is the minimum value that FI can be obtained to satisfy the requirement of the system index.  If the system needs to meet the specific requirement that the fault isolation rate should reach to FIq in the case that the failure rate of a unit is FIq , the following equation should be satisfied: 2 arctan( )FIq FI FIq     (9) The solution 2=FI FI  is the minimum value that FI can be obtained to satisfy the specific requirement.  In the case 2 1FI FI  , we have 2=FI FI  ;in the case 2 1FI FI  ,we have 1=FI FI  ;if the system does not give the specific requirement we directly have 1=FI FI  . 2.3. ALGORITHM Based on the principle mentioned above, a generalized algorithm for the proposed arc tangent fault rate distribution method is given as follows. Step.1 Collect relevant data, including: The required system index to be allocated, including the fault detection rate FDsr and the isolation rate FIsr ;  The number N of units that can be found from the product structure tree, the number of redundant modules of each unit 1, , Nn n , and the failure rate of each unit 1, , N  ; 127 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management  If it is required by the designer that the fault detection rate should reach to FDq  in the case of FDq  ,we have 1a  ; otherwise, we have 0a  ;  If it is required by the designer that the fault isolation rate should reach to FDq  in the case of FDq  ,we have 1b  ; otherwise, we have 0b  ; Step.2 Calculate FD in equation (4).  Obtain 1=FD FD  through equation (5);  In case 1a  ,obtain 2=FD FD  though equation (6), if 2 1FD FD  , we have 2=FD FD  , if not, we have 1=FD FD  ;in case 0a  ,we directly have 1=FD FD  ; Step.3 Allocate the fault detection rate through equation (4). Step.4 Calculate out each unit’s fault rate that can be detected through equation k= (1 )FDk FDk k N     . Step.5 Calculate FI in equation (7).  Obtain 1=FI FI  through equation (8);  In case 1b  , obtain 2=FI FI  though equation (6), if 2 1FI FI  , we have 2=FI FI  , if not, we have 2 1FI FI  ; in case 0b  , we directly have 1=FI FI  ; Step.6 Allocate the fault isolation rate through equation (7). 3. CASE AND CONFRONTATION 3.1. Case date and the result of the classical method The case in figure 1 is used to allocate the system index. The system consists of 5 LRUs with requirement 0.95FDsr  and 0.90FIsr  . It is appointed that the fault detection rate should reach 0.85 when the fault rate reaches 30(×10-6), the fault detection rate is 0.85.It doesn’t have specific requirement for the failure isolation rate. system LRU1 LRU2 LRU3 LRU4 LRU5 Figure 1 Allocation case The related data of the system is filled in the distribution work sheet (see table 1), and allocation is carried on by using the classical failure rate distribution method given in the GJB1770.2-1993 Testability and Maintainability Distribution and Prediction Method for Air Intelligence Radar [7]. The distribution result and correction values are showed in table 1. 128 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 1 Distribution work sheet of classical distribution method name number i ×10-6 FDi FIi result correction result correction LRU1 1 30 0.2788 0.90 0.2595 0.90 LRU2 1 30 0.2788 0.90 0.2595 0.90 LRU3 1 100 0.9293 0.98 0.8650 0.94 LRU4 1 150 1.3940 0.99 1.2976 0.95 LRU5 1 50 0.4647 0.90 0.4325 0.90 total 5 360 0.9677 0.9597 0.8999 0.9328 3.2. The result of arc tangent fault rate distribution method Combined with the case in figure 1, we can calculate the failure detection rate allocation coefficient and the fault isolation rate allocation coefficient of arc tangent fault rate distribution method through the steps in the second section. The result is as follow: 0.8075FD  , (10) 0.0936FI  . (11) Substitute the result (10) into equation (1), the distribution function of the fault detection rate can be obtained as follows: 2 arctan(0.8075 ) (1 )FDk k k N     , (12) where the unit of k is (×10-6). Substitute the result (11) into equation (4), the distribution function of the fault detection rate can be obtained as follows: 2 arctan(0.0936 ) (1 )FIk FDk k N     , . (13) Substitute the fault rate data of the case into the equation (12) and (13) to obtain the distribution result showed in the table 2 129 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 2 Distribution work sheet of arc tangent fault rate distribution method name number i ×10-6 FDi FD ×10-6 FIi LRU1 1 30 0.8807 26.421 0.7554 LRU2 1 30 0.8807 26.421 0.7554 LRU3 1 100 0.9638 96.38 0.9297 LRU4 1 150 0.9579 143.64 0.9527 LRU5 1 50 0.9279 46.395 0.8559 total 5 360 0.95 342 0.9 4. ANALYSIS It shows in table 2 that there will be instances when the distributed index is as low as 0.2778, or the instances when the distribute index exceed 1,even up to 1.3094.in the classical distribution method, such as the LRU1 and the LRU4. It is not consistent with the actual situation and doesn’t meet the mathematics category. The result of the classical fault rate distribution method usually needs to be corrected by people with experience, which is random and unreliable. After the correction, the allocated indexes need to be checked whether they can satisfy the system requirement. If the indexes can’t satisfy the total requirement of the system, the correction should be carried on again. It is hard to converge to an appropriate solution by using the classical method. As is showed in table 2, arc tangent fault rate distribution method uses the characteristic of the tangent function to solve the problem that the allocated index is too low or too high after determining the appropriate distribution coefficient. And after completing the distribution, it’s no need to have the checking of the unit index. Based on the above case, the distribution function of arc tangent fault rate distribution method is as follows: 2 arctan(0.8075 )FDk k    . (14) The distribution function of classical fault rate distribution method is as follows: 0.0092935FDk k  . (15) The curves of equation (14) and (15) are shown in figure 2. In actual engineering project, the index may have the characteristic of improving easily when it’s low and improving difficultly when it’s high. The curve of the classical failure rate distribution method is a linear function, which obviously does not satisfy this characteristic. However, the arc fault tangent failure rate distribution method, with a large derivative at low index and a small derivative at high index in convex function curve, satisfies this characteristic well. 130 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 2 Comparison of two methods 5. CONCLUSION In order to solve the problems in the method of classical failure rate assignment, we propose a method named the arc tangent failure rate assignment, which solves the problem that the distribution index exceeds 1 and the distribution index is too small, meanwhile, the distribution index meets the actual changing trend in the project. This method improves the guiding significance of the test index, and it is more in line with the engineering practice. REFERENCES [1] Department of Defense, MIL-STD-2165A, Military Standard Testability Program for Systems and Equipments[S]. 1993. [2] Shi Junyou. System Testing Design Analysis and Verification [M]. Beijing: National Defense Industry Press, 2010. [3] Shen Qinmu, Qiu Jing, Liu Guanjun, Yang Peng. Test study of estability allocation method based on block diagram of testing technology [J], 21, 2007.6 (S). [4] WANG Baolong, HUANG Quli, SU Lin, WEI Zhonglin. Test-based Optimal Test Allocation for Complex Electronic Equipment Based on Genetic Algorithm [J] .Computer Measurement and Control, 15 (7), 2007. [5] Shen Qin Mu. Equipment-level test-based distribution of technology research and application [M]. Changsha: National University of Defense Technology, 2004.7. [6] Li Jinlong, Tao Fenghe, Jia Changzhi, Liang Guangjian.A Test Assignment Method Based on AHP [J]. China Testing, 36 (2), 2010.3. [7] Zheng Xiaofu, Yang Tianliang, Fu Quanhua et al. GJB1770.2-93 Airsoft radar maintenance of the sexuality of the distribution and prediction [S] .1993: 27-28,40-41. 131 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Deep online analysis of dielectric parameters for lubricants and insulation oils with an innovative oil sensor system: Identification of critical operation conditions of industrial gearboxes and high voltage transformers for reduction of failure rates and live time enhancement Prof. Dr. Manfred R. Mauntz1, Dr. Jörn Peuser2 1cmc Instruments GmbH, Rudolf Diesel Strasse 12 A, D-65760 Eschborn, Germany, CEO, Head of R&D mrm@cmc-instruments.de 2cmc Instruments GmbH, Rudolf Diesel Strasse 12 A, D-65760 Eschborn, Germany, Applications jp@cmc-instruments.de ABSTRACT The requirements in the renewable energy for high voltage transformers and large industrial gearboxes as installed in wind turbines on-/offshore rise. Ever more flexibility at a maximum operational reliability and a long life time are required of them at the same time, so the requirements for the oil and the oil condition monitoring grow correspondingly. This presentation provides information about a novel online oil condition monitoring system to give a solution to the mentioned priorities for both energy sectors. The focus is set to industrial gearboxes but the possibilities in monitoring applications of insulation oils are also addressed. The online oil sensor system measures the components conductivity kappa, the relative permittivity epsilon r and the temperature T independently from each other. Based on a very sensitive measurement method with high accuracy even small changes in the conductivity and dielectric constant of the oil composition can be detected reliably. The new sensor system effectively controls the proper operation conditions of industrial gearboxes, high voltage transformers and in test rigs for electrical vehicles, where lubrication and isolation of the oil has a double function. The system enables damage prevention of the gearbox by an advanced warning time of critical operation conditions and an enhanced oil exchange interval realized by a precise measurement of the electrical conductivity, the relative permittivity and the oil temperature. A new parameter, the WearSens® Index (WSi) is introduced. The mathematical model of the WSi combines all measured values and its gradients in one single parameter for a comprehensive monitoring to prevent wind turbines from damage. Furthermore, the WSi enables a long-term prognosis on the next oil change by 24/7 server data logging. Corrective procedures and/or maintenance can be carried out before actual damage occurs. Raw data and WSi results of wind turbine installations with different lubrication oils are shown. Short-term and long-term analysis of the data show significant trends and events, which are discussed more in detail. Once the oil condition monitoring sensor systems are installed on the high voltage transformer or a wind turbine’s gearbox, the measured data can be displayed and evaluated elsewhere in sense of a full online condition monitoring system. 24/7 monitoring of the system (gearbox or HV transformer) during operation enables specific preventive and condition based maintenance independent of rigid inspection intervals. Keywords: Oil sensor system, oil aging, bearing wear, online condition monitoring, oil additive degradation. 1. INTRODUCTION In general, the field of maintenance can be divided into three sectors: preventive (time-based), intelligent (condition- based) and reactive maintenance (run to failure), which show different dependencies between costs and number of failures. Figure 1 shows the costs associated with the different strategies [1]. From this graph, the optimal point in terms of costs and number of failures can be identified within the center of the intelligent maintenance sector; intelligent maintenance can be realized with an online condition monitoring solution. Different kinds of online monitoring systems have been established over the past years: temperature [2, 3, 3], vibration [4] and particle counting [1, 5]. A vibration monitoring system analyses changes in the frequency spectrum of the observed bearing/gearbox component by Fast Fourier Transformation by converting a time-domain signal into a frequency- domain signal [6]. 132 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 1. Costs associated with traditional maintenance strategies [1]. An optical particle-counter can detect particles larger than 4µm due to the optical resolution of the laser light source. All of these systems need a significant change in the topological contact sur-faces, which means damaged surfaces, particles and pitting. The presented oil sensor system in contrast is already sensible at the very beginning of the damage formation stage, when the tribological layer gets depleted due to overload conditions. At this early stage, an increase of electrical charge carriers can be identified by the oil sensor system WearSens®: the electrical conductivity, relative permittivity and temperature are measured with a high precision and low noise over a broad range to enable the detection of small changes in the oil induced by variations in the tribology of the device under test [9]. Inorganic compounds occur at contact surfaces from the wear of parts, broken oil molecules, acids or oil soaps. These all lead to an increase in the electrical conductivity, which correlates directly with the wear. In oils containing additives, changes in dielectric constant infer the chemical breakdown of additives. A reduction in the lubricating ability of the oils, the determination of impurities, the continuous evaluation of the wear of bearings and gears and the oil aging all together follow the holistic approach of real-time monitoring of changes in the oil-machine system [10, 11]. Abrasive (metallic) wear, ions, broken oil molecules, acids, oil soaps, etc., cause an increase of the oil electrical conductivity kappa. It rises with increasing ion concentration and mobility. The electrical conductivity of almost all impurities is high compared to the extremely low corresponding property of original pure oils. Oils are principally electrical non-conductors. The electrical residual conductivity of pure oils lies in the range below one pS/m. A direct connection between the degree of contamination of oils and the electrical conductivity is found. An increase of the electrical conductivity of the oil in operation can thus be interpreted as increasing wear or contamination of the lubricant. The aging of the oil is also evident in the degradation of additives, which are reflected in the relative permittivity epsilon r [12, 13]. To measure the electrical conductivity and the dielectric constant the oil is passed through an electrode array, which determines the electrical resistance and the capacitance of the sensor assembly using the base oil as a resistive material and dielectric. Figure 2 shows a detail picture of the sensor electrode array with the triple plate de-sign and the schematic electronic circuit. By the high sensitivity of a time measurement method the sensor system detects critical operation conditions much earlier than existing technologies such as vibration measurement or particle counting. 133 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 2. Detail of the triple plate design of the WearSens® base sensor and the corresponding simplified drawing of the electronic circuit. To determine the conductivity and permittivity with a direct measurement of AC observables in a RC circuit is quite inaccurate, because oil has a very high resistance R (several GOhm) and a very low capacity C, which leads to high uncertainties, errors and a low resolution, which is necessary to follow the effects in oil. In the presented system, the electrical conductivity kappa and relative permittivity epsilon r are determined by a precise time measurement with a very high accuracy and repeatability based on an integrating measurement technique with a high time / bandwidth product: the measurement range for the conductivity starts from 0.1 pS/m up to 1,000,000 pS/m with a resolution of 0.01 pS/m; the relative permittivity is measured between 1 and 5 with a resolution of 1*10-6. 2. TEMPERATURE COMPENSATION Ion mobility and thus, electrical conductivity kappa are dependent on the internal friction of the oil and therefore, also on its temperature. The conductivity epsilon r of the oil increases with temperature. The type of contamination and its temperature dependence cannot be assumed to be known. To improve the comparability of measurements, a self-learning adaptive temperature compensation algorithm is necessary. A change of the oil quality can then be assessed by the temperature compensated conductivity value, even though the specific contamination is not determinable [13]. Calculating the electrical conductivity and the dielectric constant at the reference temperature of 40° Celsius is realized by approximating the polynomial form of the temperature dependence. κ𝑇0 = κ𝑇0𝑎 + (𝑎∆𝑇𝑖 + 𝑏∆T𝑖 2 + 𝑐∆T𝑖 3 ) ∙ κ𝑚 T0 is the approximate electrical conductivity of the oil at the reference temperature T0, T0a is the previously calculated (old) electrical conductivity at the reference temperature T0, m is the non-temperature compensated measured value of the electrical conductivity, a, b and c are the coefficients of the approximating polynomial to be adaptively determined during the runtime of the sensor system. ∆𝑇𝑖 = 𝑇0 − 𝑇𝑖 is the temperature difference. 134 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 3. Electrical conductivity versus temperature with second and third order fit curves. Figure 3 shows the dependency of the electrical conductivity with the temperature of a gearbox oil. Two different trend lines with second order fit and third order fit are plot-ted on the measured data. The approximation by a polynomial of third degree guarantees a good approximation at a reasonably low computational effort for the used microcomputer with an optimal coefficient of determination R². For the adaptive determination of the coefficients a, b and c of the polynomial a risk function is defined on the basis of the Gaussian method of least squares from the N measured values pairs and the approximating polynomial, whose minimization enables determination of the desired coefficients. Figure 4 shows the effect of the adaptive temperature compensation of the electrical conductivity using the example of field data from on onshore wind turbine: the wind fluctuations are changing the oil temperature and with it the electrical conductivity permanently. Figure 4. Graph of the measured and compensated electrical conductivity from an onshore wind turbine at a reference temperature of 40 °C. 135 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management While the measured conductivity changes significantly with temperature, the temperature compensated conductivity 40 stays nearly constant. The implemented adaptive algorithm is working in the background of the measurement procedure autonomously; it has to be reset only after an oil exchange to adapt to the new lubricant. Without the adaptive temperature compensation, it is not possible to identify a critical operation condition in the monitored system due to the high influence of the temperature on the conductivity and the relative permittivity [17]. 3. MATHEMATICAL MODEL OF THE WEARSENS® INDEX – WSI The WearSens® Index (WSi) has originally been developed for the lubricant analysis of a wind turbine gearbox; however, it can be adapted to any other lubricated system and different oil types with individual modifications. The following description is based on the wind turbine application. The WSi model considers short, mid and long-term changes in the lubricant by continuous monitoring of the conductivity, relative permittivity and temperature over a time period of several years with a high time resolution of < 45 seconds. Because of the measurement sensitivity and the high time resolution critical operation conditions can be identified much earlier and a damage can be evaded in short term analysis. The stress of the lubricant and the turbine itself is based on the actual wind condition, wind fluctuation and wind turbine settings (e.g. pitch control, torque control) resulting in instantaneous changes of the conductivity and relative permittivity and their gradients. Critical operation conditions result in an increased charge carrier generation and will change the conductivity and its gradient significantly. A big change in a short time period in the measured values leads to a high WSi Signal; for example, a significant increase in the electrical conductivity in a short time period is an indication of an abrupt high load or depending on the increase in the conductivity a critical operation condition. Frequent critical operation conditions lead to faster degradation of the oil additive complex. The hypothesis is that the consumption of the additives is directly correlated with the reduction of the relative permittivity of the oil: the relative permittivity r is directly affected by the presence of polar elements; there is a high content of polar additives in gearbox oils [7]. The polar additives combine together with other polar elements (e.g. wear products, water contamination), so from this point of view the consumed additives are not polar anymore, which results in the reduction of the relative permittivity. The gradient, i.e. the time derivative, of the conductivity or the dielectric constant progression respectively represents a measure of the additive degradation and consumption. After identifying initial base κi40, εir40, Δεir40, Δκi40, Ti can be feed into the simplified WSi model below: 𝑊𝑆𝑖 = ∫ [f(κ40, κ𝑖40) + f(εr40, εir40) 𝑡2 𝑡1 + f ( Δεr40 Δt , Δεir40) + f ( Δκ40 Δt , Δκi40) + f(𝑇, 𝑇𝑖)] 𝑑𝑡 The next pages show results from an offshore installation of the WearSens® sensor system. 4. SENSOR INSTALLATION OFFSHORE WIND TURBINE This section demonstrates first results of the 24/7 oil condition monitoring of an offshore wind turbine installation with WearSens® in the cooling bypass of a Siemens SWT 2.3MW. Figure 5a below shows the offshore wind turbine, the wind turbine gearbox in the nacelle and the installed base sensor in 5b. The communication unit in 5c transfers the data highly encrypted to the onshore located data server. 136 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 5. a) picture of the offshore wind turbine, b) WearSens® base sensor installation in the existing cooling bypass and c) sensor communication unit. The WSi of two identically constructed offshore wind turbines with the same oil running time are compared in figure 6. From this graph a clear difference between wind turbine WTG #01 and WTG #02 can be identified: this increased WSi has been reported to the wind farm operator; together was other CMS data a forming damage at the fast drive shaft was identified early enough to prevent the wind turbine for a long downtime and maintenance period due to a total breakdown. The high dynamics and the high peaks of the WSi due to the varying load conditions (normal for WTG #01 and overload for WTG #02) and the forming damage in TGW #02 also resulted in the faster degradation of the oil and consumption of the additives, this effect is also visible in the comparative data of the temperature dependency of the electrical conductivity shown in figure 7. The oil has been changed at the two offshore wind turbines: the data before (red colour) and after the oil exchange (green colour) show a completely different temperature dependency. From this information the active additive level and the oil quality can be determined together with the trending of the WearSens® index WSi to estimate the next oil exchange condition based by the continuous measurement with the presented oil sensor system. In figure 7 you see the significant difference between the wind turbine #01 with a normal behaviour and the wind turbine #02 with a forming damage in the grey curve. The additive components are getting consumed much faster due to higher stress and the increased generation of wear products, which results in a decrease of the dependency between the electrical conductivity  and temperature T. b) a) c) 137 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 6. WearSens® Index WSi profile from two offshore wind turbines with semaphore loading indication. Based on an empirical approach the hypothesis is, that fluctuations from normal to critical operation conditions can be identified earlier with the presented online oil sensor system by the precise measurement of the electrical conductivity and relative permittivity and the computation of the WearSens® index WSi, because changes in the tribological layer, an increased charge carrier generation occur at the very beginning of an forming damage at a gearbox components due to material fatigue. Therefore, it is possible to react much faster on events of critical conditions to prevent the gearbox from damage to enhance the overall life time. By the long-term analysis over several month and years, it is possible to perform condition-based oil change on demand to preserve the environment, to protect the oil resources and to reduce costs. From this point of view, the benefits of an online oil condition monitoring are clearly eminent: the short-term analysis can avoid critical operation conditions and prevent the wind turbine from damage; a long term analysis and trending can be used to estimate the time for the next oil change – condition based. Figure 7. Time course (24 hours) of WSi with low time resolution (green), WSi high time resolution (blue) and the vibration signal (red). 138 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 5. CONCLUSION The online diagnostics system measures components of the specific complex impedance of oils. The indication of forming stage of damage and wear is measured as an integral factor of, e.g., the degree of pollution, oil aging and acidification, water content and the decomposition state of additives or abrasion of the bearings, which is correlated to the changes in the electrical conductivity and relative permittivity. By the adaptive temperature compensation of the measured values, it is possible to identify even small variation in the actual charge carrier generation and additive consumption. For an efficient machine utilization and targeted damage prevention, the WearSens® online condition monitoring system and the WearSens® index offers the prospect to carry out timely preventative maintenance on demand rather than in rigid inspection intervals. The benefits of an extended oil change interval are reduced costs, preservation of the environment and resource protection. The oil sensor system has been installed into an offshore wind turbine performing short and mid-term analysis of the lubricant quality. The direct advantages of this online oil condition monitoring are detection of critical operation conditions and damage prevention to increase the lifetime of gearbox. The high time resolution, fast response time and accuracy of WearSens® allows earlier intervention control / optimization of the current operation in comparison to sole vibration analysis. REFERENCES [1] 1 P. Tchakoua, R. Wamkeue, M. Ouhrouche, F.S. Hasnaoui, T.A. Tameghe, G. Ekemb: Wind Turbine Condition Monitoring: State-of- the-Art Review, New Trends, and Future Challenges, Energies 2014,7, pp. 2595-2630, (2014) [2] 2 C.J. Hellier: Handbook of Non-destructive Evaluation, McGraw-Hill Professional Publishing, New York, USA, (2003) [3] 3 J.-Y. Park, J.-K. Lee, K.-Y. Oh, J.-S. Lee, B.-J. Kim: Design for 3MW Wind Turbine and its Condition Monitoring System, Proceedings of the International Multi Conference of Engineers and Computer Scientists, IMECS 2010, Kowloon, Hong Kong, 17–19 March 2010, Volume II, pp. 930–933,(2010) [4] 4 B.N Madsen: Condition Monitoring of Wind Turbines by Electric Signature Analysis, Master’s Thesis, Technical University of Denmark, Copenhagen, Denmark, October 2011, (2011) [5] 5 A.C. Goncalves, J.B. Campos: Predictive maintenance of a reducer with contaminated oil under an excentrical load through vibration and oil analysis, J. Braz. Soc. Mech. Sci. Eng. 2011, 33, 1–7, (2011) [6] 6 M.M. Khan, M.T. Iqbal, F. Khan: Reliability and Condition Monitoring of a Wind Turbine, Proceedings of the 2005 Canadian Conference on Electrical and Computer Engineering, Saskatoon, SK, Canada, 1–4 May 2005, pp. 1978–1981, (2005) [7] 7 Noria Corporation: The critical role of additives in lubrication, Machinery Lubrication June 2012, (2012) [8] 8 A. Jablonsky, T. Barszcz, M. Bielecka: Automatic validation of vibration signals in wind farm distributed monitoring systems, Measurement 2011, 44, 1954–1967, (2011) [9] 9 M. Mauntz and U. Kuipers: Ölsensorsystem- Sensorsystem zur Messung von Komponenten der komplexen Impedanz elektrisch gering leitender und nichtleitender Fluide, dessen Realisierung und Anwendung, patent application no. 10 2008 047 366.9, German Patent Office, Munich, (2008) [10] 10 M. Mauntz and U. Kuipers: Verfahren, Schaltungsanordnung, Sensor zur Messung physikalischer Größen in Fluiden sowie deren Verwendung, European patent application no. EP 09000244, European Patent Office, Munich, (2009) [11] 11 M. Mauntz, J. Gegner and U. Kuipers: Ölsensorsystem zur Echtzeit-Zustandsüberwachung von technischen Anlagen und Maschinen, Technisches Messen 77, pp. 283-292, (2010) [12] 12 M. Mauntz, U. Kuipers and J. Gegner: New Electric Online Oil Condition Monitoring Sensor – an Innovation in Early Failure Detection of Industrial Gears, The 4th International Multi-Conference on Engineering and Technological Innovation July 19th – July 22nd, 2011, Orlando, Florida, USA 2011, Proceedings Volume I, International Institute of Informatics and Systemics, Winter Garten, FL, USA, 2011, pp. 238-242, (2011) [13] 13 M. Mauntz, U. Kuipers and J. Gegner: High-precision online sensor condition monitoring of industrial oils in service for the early detection of contamination and chemical aging, Sensor + Test Conf., 7.-9.6.2011, Nürnberg, AMA Service GmbH, Wunstorf, pp. 702-709, (2011) [14] 14 M.Mauntz, J. Gegner, S. Klingauf and U. Kuipers: Continu-ous Wear Measurement in Tribological Systems to Control Operational Wear Damage with a new Online Oil Sensor Sys-tem, TAE Technische Akademie Esslingen, 19th International Colloquium Tribology, Esslingen, January 21-23, 2014, (2014) [15] 15 A. Saeed: Online Condition Monitoring System for Wind Tur-bine, Master’s Thesis, Blekinge Institute of Technology, Karlskrona, Sweden, (2008) [16] 16 M. Mauntz, U. Kuipers and J. Peuser: Continuous, online detection of critical operation conditions and wear damage with a new oil condition monitoring system, WearSens®, 14th In-ternational Conference on Tribology - SERIATRIB ’15 Proceedings, Belgrad, Serbian Tribology Society Kragujevac, University of Belgrade, Faculty of Mechanical Engineering, Belgrade, ISBN: 978-86-7083-857-4, S. 283-288, (2015) [17] 17 M. Mauntz, U. Kuipers and J. Peuser: New oil condition monitoring system, WearSens® enables continuous, online detection of critical operating conditions and wear damage, Malaysian International Tribology Conference 2015 - MITC2015, Penang, Malaysia on November 16- 17, 2015, Conference Proceedings, ISBN: 978-967-13625-0-1, S. 179-180, (2015) 139 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Real-Time Monitoring of Temperature Distribution in a Lithium-Ion Battery Pack A.M. Aucamp1, A. Janse van Rensburg1 1 School of Electrical, Electronic and Computer Engineering, North-West University, South Africa ABSTRACT Intelligent battery management systems aim to continuously diagnose the state-of-health (SOH) of cells in a battery pack and predict performance, reliability and safety issues. Cells with a higher internal resistance component will generate more heat and can be identified by the battery management system (BMS) from this thermal behavior. Placing a temperature sensor on each cell in a battery pack containing thousands of individual units might not be feasible due to practical limitations or budgetary constraints. As such, this paper describes an approach to real-time monitoring of a pack’s temperature distribution using a small, configurable number of temperature sensors with the aid of interpolation and extrapolation algorithms. Of the five algorithms initially considered, only the nearest neighbor and inverse distance weighting algorithms were implemented and compared in terms of accuracy and aesthetic practicality. The temperature distributions are displayed using a 3D visualization framework inside an option-rich user interface. The effect the number of temperature sensors and sample resolution had on data accuracy and latency were also investigated. With accurate and timeous data describing the behavior of the cells in a LIB pack, the BMS can better achieve its purpose in terms of monitoring, diagnosis and control. Keywords: Practical low-cost condition monitoring; Energy storage systems; Lithium-ion battery pack; Temperature distribution Corresponding author: Dr. Angelique Janse van Rensburg (angeliquejvr86@gmail.com) 1. INTRODUCTION Lithium-ion battery (LIB) technology is used in a wide variety of applications ranging from consumer electronics to electric vehicles and, more recently, for large-scale stationary energy storage [1, 2]. The majority of efforts aimed at improving the safety of LIB technology focus on effective thermal management strategies and the conditions that lead to thermal runaway [1, 3]. Internal exothermic reactions generate heat and an elevated ambient temperature also accelerates these reactions. A lithium-ion cell’s performance is improved by these accelerated reactions because the energy conversion rate increases. At the same time, lifetime might be sacrificed because detrimental material changes can occur at elevated temperatures and result in capacity fade. In rare cases, thermal runaway can propagate throughout a battery pack and cause cells to ignite and/or explode [3]. Lithium-ion battery packs consist of many cells that all have slightly different individual characteristics due to manufacturing imperfections and impurities in the materials used. These characteristics will cause cells to generate differing amounts of heat. This leads to hotspots forming inside the battery pack. These hotspots can lead to thermal runaway and will damage not only the affected cell, but also all the cells surrounding it. Hotspots can therefore lead to catastrophic failures if they are not identified and mitigated [1, 3]. This paper describes an approach to real-time monitoring of a pack’s temperature distribution using a small, configurable number of temperature sensors with the aid of interpolation and extrapolation algorithms. The distributions generated by the algorithms are then used to create a 3D visualization of the battery pack’s temperature distribution. The low-cost approach is in response to the traditionally more expensive proprietary data acquisition systems and their accompanying software that may not comply with the needs of the user. The data gathered can then be used by a BMS system to calculate certain SOH statistics and identify and mitigate anomalies in the temperature distribution. The visualizations generated can also be recorded, reviewed and used to identify anomalies. 140 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The small number of configurable sensors used in conjunction with a custom data acquisition system (DAQ) means that the solution can be more cost effective than placing a sensor on every cell and using proprietary data gathering hardware and software. The software also creates an interactive visualization that is subjectively more informative than a static set of numbers or 2D graphs. The temperature distribution of single cell batteries and battery packs have been well studied and simulated [4]–[6]. These simulations, however, are not real-time displays of a multi- dimensional battery pack in operation. Maxim Integrated™ have designed a system to "Measure Multiple Temperatures in Battery- Management Systems, and Save Power Too" [4]. Current data-acquisition integrated circuits for battery packs are capable of measuring multiple cell voltages but they only scan and measure two temperatures at most [4]. This arrangement is inadequate since an accurate representation of the temperature distribution within a battery pack cannot be made with only two measurements. Maxim’s solution also does not make the information available to the user since a battery management system is supposed to run in the background and manage the battery temperature and voltages according to its preconfigured set-up. Johns Hopkins Applied Physics Laboratory have developed a system they call the Battery Internal Temperature Sensor or BITS [7]. The system is reportedly capable of accurately measuring the instantaneous internal temperature of a lithium-ion battery in a non-intrusive manner. The technology is not yet commercially available. They have not tested it on all battery form factors. The 18650 is one of the form factors they have not tested yet at the time of writing. The technology is, however, largely untested and there is little documentation. In a recent work, temperature data of a single 18650 cell in operation are collected using 4 thermocouples for the purposes of model validation. Post-processing is applied and the data are presented offline in simple 2D graphs with no option of intervening during operation [8]. An attempt to reconstruct the 3D temperature distribution of a series- connected LIB pack in real time using Kalman filtering has also been reported. The results seem to indicate visibility of only the outside surface temperature and not the distribution inside the pack itself [9]. Onboard monitoring of a cell’s internal temperature distribution in 2D using impedance-based temperature detection has also yielded promising results because the method was experimentally validated [10]. The following section starts with a brief discussion on the interpolation algorithms used to obtain the resulting temperature distributions. Section 2 continues with an explanation of the process behind the hardware and software design as well as the approach to data verification. The resulting visualizations of the temperature distributions and the investigation into the number of sensors and sample resolution are presented in Section 3. The paper concludes with the final section summarizing the key findings and providing suggestions on how to improve the current work. 2. METHOD 2.1. Interpolation algorithms Interpolating the temperature distribution requires that there be a set of known coordinates with known values to begin with. These coordinates and values correspond with the sensor’s coordinates and temperature values. It is reasonable to assume that coordinates located closer together have a higher probability of having temperature values that are more similar than those of coordinates that are farther from each other. This estimation intuitively follows from the general heat equation [11] shown in Equation (1), which is based on the diffusion equation [12]. 𝜕𝑢 𝜕𝑡 − 𝛼∇2𝑢 = 0, ......................................................................... (1) However, this formula assumes conservation of energy as well as a single heat source within a uniform material. The general principal of a divergent vector field is useful in that it allows for comparisons with approximation algorithms that will produce similar results with less computational complexity and cost. Using Big-O notation, the performance of the interpolation algorithms can be can be estimated [13]–[15]. We take 𝑛 as the set of coordinates that needs interpolation and 𝑚 as the number of sensors. The time needed to interpolate the values would therefore be as Table 1 describes. Since the data need to be interpolated in real-time, the faster methods will deliver more timeous results. It is for this reason that the nearest neighbor (NN) and inverse distance weighted average (IDWA) interpolation algorithms were identified as relatively simple and computationally inexpensive. 141 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 1: Computational complexity of interpolation algorithms Interpolation Algorithm Big-O Nearest Neighbor 𝑂(𝑛𝑚) Inverse Distance Weighted Average 𝑂(𝑛4𝑚) Linear 𝑂(𝑛(6𝑚 + 9)) Spline 𝑂(𝑛𝑚2) Nearest neighbor interpolation is the simplest implemented algorithm. The temperature of the discrete and evenly placed coordinates in the array used by the software are calculated by determining the closest sensor to each individual coordinate and setting that coordinate’s temperature equal to that of the sensor [13]. Equations (2) and (3) show this relationship where d is the minimum distance calculated and 𝑢(𝑥), the temperature being calculated is set equal to 𝑥𝑖, which is the temperature of the nearest sensor. 𝑑 = min {d(x, 𝑥𝑖)}, ..................................................................... (2) 𝑢(𝑥) = 𝑥𝑖, ................................................................................. (3) The IDWA of each discrete coordinate is calculated by first calculating the distance of the discrete coordinate to each of the sensors. The inverse of each distance is calculated and the sum thereof is normalized. Each normalized inverted distance is then multiplied by its sensor’s temperature and these temperatures are summed again to calculate the temperature value at the specific coordinate [14]. Equation (4) represents the mathematical formula that is used. If Wi is taken as the inverse distance a value 𝑢 at a given coordinate x can be estimated based on the samples 𝑢 = 𝑢(𝑥𝑖) for 𝑖 = 1, 2, . . . 𝑛 by applying the following function: 2.2. Design process A systems engineering approach was followed to achieve the final design that was implemented [16]–[18]. The system consists of hardware and software that work in tandem to affect the desired outcome. This section details the different components of the system as depicted in Figure 1. Figure 1. Proposed system architecture of the temperature distribution monitoring system 𝑢(𝑥) = ∑ Wi(x)ui n i=1 ∑ Wi(x) n i=1 , .................................................................. (4) 142 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 2.2.1. Hardware The hardware system developed for the proposed system consists of a variable number of DS18B20 sensors connected to an STMF303 development-board via auxiliary cables. The auxiliary cables connect power, ground and data pins of the sensors to the STMF303. The STMF303 in turn is connected to a PC with a USB connection that carries serial data and power. The DS18B20 temperature sensors come in a TO-92 package. The sensors have a programmable resolution (accurate from 0.5-0.0625°C), onboard analogue to digital (A/D) converters and each sensor is etched with a 64-bit serial number to differentiate them when they are all connected via the same data pin. The sensors are soldered to stereo auxiliary cables to enable easy configurability. The DAQ consists of the STMF303 dev-board housed within a 3D printed casing that also encloses the debugging LED circuits as well as the stereo jacks. These components are shown in Figure 2. Figure 2. Components required to construct the custom DAQ 2.2.2. Software and firmware The STMF303 is programmed using the online Mbed IDE with C. The firmware loop is represented in Figure 3. The main objective of the firmware is to detect the sensors and send the temperature data from the sensors to the PC in a serialized string each time a temperature reading is taken. Figure 3. DAQ firmware loop Unity 3D, a game engine based on the cross platform Mono framework, is used to generate the interactive real-time graphics. The framework itself is based on a subset of the .NET framework and allows for the use of .NET’s multithreading and serial communication libraries. The software can be compiled to run on both Windows and Linux whereas the aim of achieving a low-cost design excluded a macOS compatible version. The programming is done in C#. The main thread of the software is responsible for handling the user input and generating the graphics. Separate threads are spawned that handle the serial data communication and the interpolation of the data. The interpolation algorithms generate a set of temperature values at discrete intervals using the data from the temperature sensors and the sensors locations. This generated array is then used to generate the interactive real- time 3D graphics. 143 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 2.2.3. Interfaces The system has two important physical interfaces. These are the interface between the sensors and the STMF303, and the interface between the DAQ and the PC. The proposed system also has an important software user interface on the PC. The sensors are connected to the STMF303 via auxiliary cables and utilize the MAXIM 1-wire interface. This enables all the sensors to connect to the DAQ while utilizing only a single digital pin for data transfers. This feature also allows the sensors to be configurable and dynamic without having to rewrite the firmware of the DAQ. The STMF303 is connected to the PC via USB and acts as a serial connection. This makes it possible to use serial communication libraries and protocols in both the DAQ and the visualization software to send and receive data. The USB connection also enables the PC to act as a convenient power supply to the DAQ and, by proxy, the sensors as well. 2.3. Verification Each individual temperature sensor is tested and corroborated with 2 other temperature sensors to ensure accuracy. The temperatures are also verified within the visualization software to ensure that the temperature at the discrete location of the sensors correlate with the discrete locations within the visualization. The visualization was tested by deliberately heating up and cooling down certain areas of the battery pack, by using hot and cold air, and checking if the visualization emulated the temperature changes on the specific locations. 3. RESULTS & DISCUSSION The figures and graphs in this section show that the graphics generated can be changed and interacted with to a degree by the user. This means that a better and more intuitive understanding of the temperature distribution of the battery pack can be reached in contrast to the standard 2D line graph displaying a single temperature average of the whole battery pack. The custom DAQ with sensors that was constructed is shown in Figure 4. Figure 4. Battery pack setup with sensors and DAQ The data gathered is displayed to the user instead of propagating silently in the background. It is saved as a text file with a custom extension but can be easily converted to a csv file by removing the header. The software has a built- in function for this purpose. This also means that, by proxy, any other analytic software or procedure that the user wishes to subject the data to can be done with relative ease. The data can further be analyzed with Matlab, Python, R etc. by analysts wishing to better understand the thermal behavior of battery packs. Figure 5 depicts the 2D graph on the bottom panel of the data review mode where the user reviews data that were previously recorded. It shows the maximum, minimum and average temperatures throughout the session and the locations of these maximum and minimum temperatures. Figure 5. Timeline graph in 2D for data review with statistics 144 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management In contrast with Figure 5, Figure 6 displays the software operating in real time with the middle layer isolated to enable a better view of the inside of the array. The software is also capable of recording and reviewing data. The control panel on the right-hand side is responsible for configuring the graphics to the user’s satisfaction, including setting the sensitivity of the colors, the maximum and minimum temperatures and isolating layers or rows of the battery pack to more clearly view the “inside”. The bottom panel shows a graph of a single discrete location’s (cell) recent temperature. Right clicking on a location or entering the location’s coordinates in the text boxes changes the graph shown and the relevant max/min statistics under the graph to the location that has been selected. The software also has a warning feature that flashes either yellow or red in the right-hand panel to warn the user if the temperatures are approaching or exceeding the preset limits. Figure 6. Software operating in real-time operation with middle layer isolated The software lets the user pick a 3D model that they want to use to depict the temperature distribution. The choices are depicted in Figure 7. These models will scale and change color depending on the calculated temperature at the discrete coordinates. Figure 7. Visualization options (cylinders, cubes, spheres) The system gathers data in real time with the DAQ, depicted in Figure 4, but the A/D conversion and the amount of sensor data that need to be formatted result in a delay being present when the final interactive graphics are being rendered. This delay, illustrated in Figure 8 sows that the resolution of the A/D conversion has a much larger initial impact than the number of sensors present in the system. This latency that the system experiences between each data sample proved to be the largest delay and, since the serial communication was running in a separate thread, it would not be increased by any other part of the software. This did mean that to facilitate a coherent user experience the interpolation algorithms needed to complete faster than the delay created by the DAQ. Since the interpolation algorithms were also running in their own thread, their execution time was not affected by other parts of the software. The execution time of the interpolation algorithms was mainly affected by the battery pack size and the number of sensors. The larger the battery pack, the longer it takes to interpolate the data. The same effect is witnessed with the number of sensors connected. 145 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 8. Average data stream latency with regards to the number of sensors and the sensor resolution The software limits the creation of battery packs to sizes smaller than 50x50x50. Battery packs of this size and larger tended to cause a significant drop in frame rates of the software, from 60 fps to <12 fps, which is unacceptable from a usability and aesthetics perspective. This was mainly because of the graphics framework not being able to update all the dynamic objects fast enough due to the implementation inefficiencies. There is thus a tradeoff that the final user of the system can make when using the system. Increasing the resolution and limiting the number of sensors or increasing the number of sensors (which generally result in a more accurate 3D temperature distribution generated). They can also have a high resolution and number of sensors if a longer update delay is acceptable to them. Figure 8 shows the graph generated of the average data stream latency with regards to the number of sensors and the sensor resolution. Even though only 7 sensors were tested, connecting more is possible but more expensive. The inverse distance weighted average algorithm generated a smooth gradient in the resulting graphics, while the nearest neighbor algorithm resulted in a certain set of regions that all seemed to have the same temperature. The IDWA therefore produced more natural and accurate results, while the NN produced clearly sectioned patterns. The system as it was developed needs to be physically connected via wires to the battery pack and the PC. This might induce difficulties in certain scenarios where the battery pack’s temperature data needs to be viewed while in operation, for example, a running electric vehicle. Developing a wireless DAQ could solve this issue as well as make the system more usable. The system (hardware and software) cost less than R 1,600 to implement. This is more cost effective than using traditional DAQ hardware and the proprietary software that they may need to function. The sensors themselves are the most expensive components (except for the STMF303), and since they are configurable, the cost of the system can be kept very low should a budget be limited. 4. CONCLUSIONS The low-cost system for monitoring of the temperature distribution throughout a battery pack proved to be effective and can relay information in real-time with delays of less than a second. The information gathered and made available can be used by a BMS to improve the thermal management of a battery pack. By knowing the temperature distribution of the battery pack, hotspots can be identified and mitigated before they cause any harmful side effects. The BMS can apply measures to dissipate the excess heat as quickly as possible preventing thermal runaway. Large temperature variations throughout the pack can also be minimized to ensure balanced cells. 150 158 170 183 245 260 277 302 430 446 458 472 805 822 838 860 0 100 200 300 400 500 600 700 800 900 1000 4 5 6 7 La te n cy in m s Number of sensors 9 Bits (±0.5°C) 10 Bits (±0.25°C) 11 Bits (±0.125°C) 12 Bits (±0.0625°C) 146 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The small number of configurable sensors allows the solution to be scalable and applicable in many scenarios. The interactive nature of the software also allows for a more intuitive and user-friendly experience when manually determining the SOH of a battery pack with regards to the temperature distribution. While the system is functional and operating correctly for the purposes of its original design, it can be improved to be of much greater use and functionality. At the time of writing, it is still only a generalized real-time temperature distribution monitor of 3D structures. The system can further be improved upon by incorporating it into other battery pack projects. This may enhance its functionality and applicability. As part of a BMS, it could potentially increase the lifetime and safety of battery packs as well as allowing for a better understanding of thermal behavior within battery packs and battery pack design. In terms of the hardware, a wireless DAQ solution can be developed. It can also be given a smaller physical footprint so that it can be used in scenarios where space is an issue. The software can be improved in terms of user friendliness and functionality. However, as new technology is being developed, the software can be rewritten with another framework that is has a more permissive license and may run certain algorithms faster and more efficiently. It can also be improved by adding more variants of interpolation algorithms that may increase efficiency and accuracy. To further improve the accuracy of the temperature distribution, the generated graphics can be enhanced and compared with an infrared camera view of the battery pack. This might increase costs but it will serve as an extra validation method that closely resembles the generated graphics. 5. ACKNOWLEDGEMENTS This work is based on the research supported, wholly or in part, by the National Research Foundation (NRF) of South Africa (Grant numbers 91093, 103392). Opinions expressed and conclusions arrived at are those of the authors and are not necessarily to be attributed to the NRF. REFERENCES [1] Abada, S. (2016) Safety focused modeling of lithium-ion batteries: A review. Journal of Power Sources. vol. 306. pp. 178–192. [2] Rezvanizaniani S. M. , Liu Z., Chen Y., L. J. (2014) Review and recent advances in battery health monitoring and prognostics technologies for electric vehicle (EV) safety and mobility. Journal of Power Sources. vol. 256. pp. 110–124. [3] Saw, L. H., Ye, Y., and Tay, A. A. O. (2016) Integration issues of lithium-ion battery into electric vehicles battery pack. Journal of Cleaner Production. vol. 113. pp. 1032–1045. [4] Racherla, K. Measure Multiple Temperatures in Battery Management Systems, and Save Power Too. [Online]. Available: https://www.maximintegrated.com/en/app-notes/index.mvp/id/5070. [5] Robinson, J. B., Darr, J. A., Eastwood, D. S., Hinds, G., et al. (2014) Non-uniform temperature distribution in Li-ion batteries during discharge – A combined thermal imaging, X-ray micro-tomography and electrochemical impedance approach. Journal of Power Sources. vol. 252. pp. 51–57. [6] Saw, L. H., Ye, Y., and Tay, A. A. O. (2013) Electrochemical–thermal analysis of 18650 Lithium Iron Phosphate cell. Energy Conversion and Management. vol. 75. pp. 162–174. [7] Srinivasan, R., Carkhuff, B. G., Butler, M. H., and Baisden, A. C. (Feb. 2011) Instantaneous measurement of the internal temperature in lithium-ion rechargeable cells. Electrochimica Acta. vol. 56. no. 17. pp. 6198–6204. [8] Panchal, S., Mathew, M., Fraser, R., and Fowler, M. (2018) Electrochemical thermal modeling and experimental measurements of 18650 cylindrical lithium-ion battery during discharge cycle for an EV. Applied Thermal Engineering. vol. 135. pp. 123–132. [9] Tian, N., Fang, H., and Wang, Y. (2017) 3-D Temperature Field Reconstruction for a Lithium-Ion Battery Pack: A Distributed Kalman Filtering Approach. IEEE Transactions on Control Systems Technology. pp. 1–8. [10] Richardson, R. R., Zhao, S., and Howey, D. A. (2016) On-board monitoring of 2-D spatially-resolved temperatures in cylindrical lithium-ion batteries: Part II. State estimation via impedance-based temperature sensing. Journal of Power Sources. vol. 327. pp. 726– 735. [11] Cannon, J. R. (1984) The One–Dimensional Heat Equation 1st ed. Cambridge University Press. Addison-Wesley Publishing Company. [12] Thambynayagam, R. K. M. (2011) The Diffusion Handbook: Applied Solutions for Engineers. McGraw-Hill. [13] Lu, G. Y. and Wong, D. W. (2008) An adaptive inverse-distance weighting spatial interpolation technique. Computers & Geosciences. vol. 34. no. 9. pp. 1044–1055. [14] Bartier, P. M. and Keller, C. P. (1996) Multivariate interpolation to incorporate thematic surface data using inverse distance weighting (IDW). Computers & Geosciences. vol. 22. no. 7. pp. 795–799. [15] Toraichi, K., Katagishi, K., Sekita, I., and Mori, R. (1987) Computational complexity of spline interpolation. International Journal of Systems Science. vol. 18. no. 5. pp. 945–954. [16] Burge, S. (2011) The Systems Engineering Tool Box. [17] US Army (2001) Systems Engineering Fundamentals. Fort Belvoir, Virginia. The Defense Acquisition University Press. [18] Heath, S. (2002) Embedded Systems Design. Elsevier Science. 147 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management A perspective on rotating equipment technology trends and maintenance in mining Joe Amadi-Echendu & Masiye Matheta Department of Engineering and Technology Management, University of Pretoria, South Africa ABSTRACT Advances in technology suggest marked improvements in the reliability of machinery deployed in mining operations. Rotating equipment is extensively deployed in mining and as such, there is increasing expectation that such equipment should operate at higher levels of reliability and reduced total life cost. The question thus arises as to the extent to which actual maintenance practice tracks changes in the technology of the rotating equipment. This paper describes a study that compares the trends in rotating equipment technologies against the history of actual maintenance interventions on such equipment in a real mining environment. Empirical data obtained during the study provides intriguing evidence about the relationship between technology trends and actual maintenance interventions. Keywords: Rotating Equipment Maintenance; Equipment Technology Trends; Maintenance in Mining; Integrated Maintenance Management. 1. INTRODUCTION Rapid evolutions in technology pervade every area of human endeavor, and in particular, advances in automation, information and communication technologies, new materials, et cetera, are increasingly incorporated in machinery deployed in mining. Rotating equipment make up a large portion of machinery used in mining operations, and the maintenance of such equipment is a significant activity in minerals extraction and processing. The technology trends affect both the original equipment manufacturers (OEMs) and the mining sector businesses, and increasingly influence the relationships between OEMs, vendors, suppliers, and users of rotating equipment in mining operations. With increasing levels of automation at the component, subsystem, equipment and machinery levels, the mining industry is advancing towards full scale mechanized automation and autonomous operations. There are claims that such full scale mechanization and automation should result in improved performance in terms of reduced costs of operations, increased production outputs, improved safety performance, increased profits, and so on. There are also claims that advanced technologies embedded in modern day rotating equipment should increase reliability and availability of such equipment during mining operations. Granted that changes in equipment technology will continue to happen at an even greater pace, the question arises as to how actual maintenance interventions in mining operations keep track with the technology trends. This paper briefly describes a review of the technology trends in rotating equipment and examines whether actual maintenance interventions in a case study mining operation has tracked the technology trends. Section 2 of this paper includes a summarized review of the rotating equipment technology trends. Section 3 also includes a summarized review of the history of actual maintenance interventions performed on rotating equipment deployed in a mining operation. A comparison of the technology trends against the maintenance interventions is discussed in section 4, and some conclusions are drawn based on the results. 148 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 2. TRENDS IN ROTATING EQUIPMENT TECHNOLOGY The picture in Figure 1 provides an overview of the range of rotating machinery, equipment types and components (cf: Forsthoffer, 2005:4), classified in terms of i) driven equipment, ii) prime movers, iii) transmission devices, and iv) auxiliary systems. Figure 1. Rotating equipment types and components (Source: Forsthoffer, 2005:4) In terms of the scale and scope, references [1] and [2] noted that most technology changes focused on automation, embedded diagnostics and operational ergonomics. References [3], [4], [5] indicate that technology trajectory mostly involved increased modularization so as to create highly reliable components, subsystems and equipment with the aim to extend life of machinery. The approach subsumes the following hierarchy; components → subsystems → equipment → machine. In terms of life extension, the suggestion is that the components have the shortest life so as to maintain pace with technology changes. Some authors (see, for example, [6], [7], [8], [9], [10], and [11]) surmise that the introduction of technological superior equipment complicated operation and maintenance interventions. It is not trivial to effect traditional maintenance intervention on highly reliable and automated machinery that also has embedded diagnostics capabilities. Whereas new technologies improved energy efficiency and operational ergonomics of machinery, however, in some instances, operators and users of technologically superior rotating equipment did not achieve desired availability, cost and reliability performance. Some of the technology changes in rotating equipment deployed in the case study mine are summarized in Table 1. For example, between 1970 and 1999, the mine utilized a combination of direct-current (DC) and alternating- current (AC) drives in machinery such as trammel screens, but from 2000 to date, only AC drives are deployed in vibrating screens, albeit that the drives are obtained from various OEMs. Some of the rotating equipment deployed for mining operations are depicted in Figure 2. 149 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 1: Summary of equipment technology changes Technology changes affecting Description of the type of technology changes observed in the mine rotating equipment Mostly affected Scale of operation physical size of the equipment to increase the equipment output capacities Dozers, loaders, locos, cranes Equipment Life design in terms of strength, durability and reliability to provide for increased of extended equipment life Pumps, compressors, disintegrators, dozers Information processing more accurate information (in terms of the internal and external processes parameters). Drives, clarifier plant automation Level of control rransfer of control from human to machinery (Automation) Turbines, pumps Functional range equipment capabilities (e.g., increased power output, speed, etc.) Vibrating screens Scalability and modularity physical shape of the equipment to provide for a level of scalability and modularity in the design Drives, Screens, hydraulic pumps Amplification of human execution of work by shifting the energy to do work from humans to machines. Screens Ergonomics reduce the physical stresses imposed on the operator by the work environment None Recorded Figure 2. Some rotating equipment installed in the case study mining operation. 150 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3. CASE STUDY OF MAINTENANCE INTERVENTIONS ON ROTATING EQUIPMENT Data regarding maintenance interventions were obtained from the computerized maintenance management system of the case study mining operation. Remarkably, the records showed that only about 20% of the planned maintenance interventions were associated with rotating equipment, even though the mine utilizes a substantial number of such equipment. Between 2003 and 2017, it was surprising to note that out of the 83,815 maintenance interventions recorded against rotating equipment, only 827 were based on detailed lists of tasks that were performed, thus creating the impression that most maintenance interventions were adhoc. Further investigation revealed that detailed lists of tasks were only transferred from one platform to another during software upgrades, i.e., the tasks were often not updated to reflect the fact that a new version of rotating equipment had been installed to replace an earlier model. For brevity, Table 2 provides a summarized view of maintenance interventions on some electric drives deployed in the case study mining operation between 2003 and 2017. The boxes marked with “X” represent planned maintenance interventions with detailed lists of tasks. The column of shaded boxes indicate that the exact same maintenance interventions were carried out each year, i.e., for electric drive with code A/516151/1, the detailed list of tasks performed were unchanged from 2005 to 2012, while for electric drive A516154/1, the same maintenance interventions have been carried out since 2005 unchanged. What is intriguing is that the row of contiguous shaded boxes indicates that the same maintenance interventions were carried out for electric drives in different functional locations of the mine. In 2005 for example, the same set of tasks were performed on 9 respective electric drives in different functional locations. A similar curious observation applies to other rotating equipment like pumps, trammel and vibrating screens. Table 2. Maintenance interventions on some electric drives between 2003 and 2017 The pattern for the lists of tasks carried out during maintenance interventions is illustrated in Table 3. For the blank spaces, there were no recorded lists of tasks for maintenance interventions found. At least three of the columns from the sample show that the same maintenance intervention tasks have been carried out from 2003 to 2016. It seems unreasonable that the list tasks have not been changed or updated for nearly 13 years! 151 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 3. List of tasks for maintenance interventions on some rotating equipment between 2003 and 2017 A very remarkable observation is the difference in availability of trammel versus vibrating screens despite increased maintenance interventions on vibrating screens. Trammel screens were deployed between 2003 and 2009 before most were replaced with vibrating screens by 2013. The bar charts in Figure 3 indicate that, on average, the downtime for vibrating screens were higher (at 12.5 days/year) than for the trammel screens (at 2 days/year). Based on the preceding Tables 2 and 3, it is reasonable to suggest that the lack of updated list of tasks meant that the maintenance interventions were less effective such that the apparently better reliability of vibrating screens could not be achieved. Figure 3 A comparison of downtime hours in changing from trammel to vibrating screens. 152 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 4. DISCUSSION AND CONCLUSION The following general observations are based on a review of the recorded history of actual maintenance interventions :-  There was little or no change in the detailed maintenance intervention tasks even after DC drives were replaced byAC drives from 2000 onwards  Increased inherent reliability of the AC motors led to the maintenance effort being reduced and maintenance work made easier  Maintenance effort had increased as evident in the increased number of planned and corrective maintenance work undertaken, including the additional operations support work, which was indicative of the higher level of effort required to keep the equipment running  Even with significantly higher maintenance effort and maintenance expenditure on the vibrating screen, the availability and reliability remained much lower than in the previous case of trammel screens  Changes in the mode of operation and equipment used introduce increased complexity. This is mostly made up of changes in speed (whilst hydraulic drive was rotational speed was low, speed on the electric motors was high), mode of operation from controlled rotation to high frequency vibrations (this inherently lead to increased cyclic loading on the structural components), power source (change from hydraulic system to electrical driven unit effectively requiring a change in the type of maintenance work performed on the equipment), nature of design (whilst defects and condition of equipment could be assessed or corrected through use of visual inspections and corrective work was easily taken with standard tools, on vibrating screens, the faults and defect are usually hidden and not easily detected by visual inspection and as such, it required a use of advanced monitoring and diagnostic tools (predictive – condition monitoring) as well as advanced repair toolkits (i.e. torque wrench instead of spanners, laser alignment on motors, highly technical skills). This scenario highlights that, with introduction of vibrating screen technology, the maintenance needed to change. The adoption of planned maintenance PMs previously used on the screen, implies that either the end user had no knowledge and technical capability to develop and execute appropriate maintenance work suitable for vibrating screen OR that the results observed on the vibrating screen was due to the inherent poor reliability and maintainability characteristics of the vibrating screen as opposed to the trammel screen. Based on this case study, the concerning observation is that in practice, it appears that maintenance interventions are not updated with respect to changes in the technology of rotating equipment deployed in some mining operations. With increasingly automated and autonomous mining operations, there is also increased requirement to redefine the relationships between original equipment manufacturers, suppliers, vendors and the users so as to ensure that maintenance interventions track and keep pace with technology changes in rotating equipment. Acknowledgments The authors wish to thank the mining company for benevolently supporting the study reported here REFERENCES [1] Goodrum, P., Zhai, D. & Yasin, M., 2009. Relationship between changes in material technology and construction productivity. Journal of construction engineering and management, 135(4), pp. 278-287. [2] Lynas, D. & Horberry, T., 2011. Human factor issues with automated mining equipment. The Ergonomics Open Journal, Volume 4, pp. 74-80. [3] Bartos, P. J., 2007. Is mining a high-tech industry?:Investigation into innovation and productivity advance. Resource Policy, Volume 32, pp. 149-158. [4] Brown, G. M., Ebacher, B. J. & Koellner, W. G., 2000. Increased productivity with AC drives for mining excavators and haul trucks. Alpharetta, IEEE Industry Applications Society, pp. 28-37. [5] Di Febo, M. & Paganini, P., 2015. Centrifugal pump technology in oil and gas refinment. World Pumps, November, pp. 36-40. [6] Swanson, L., 1997. An empirical study of the relationship between production technology and maintenance management. International journal of production economics, Volume 53, pp. 191-207. [7] Sheridan, T. B. & Parasuraman, R., 2000. Human Vs Automation: An expected analysis. Journal of human factors and ergonomics society, 42(3), pp. 403-407. 153 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [8] Wang, K. & Zhou, M., 2006. An analysis of technological progress contribution to economic growth in construction industry of china. Construction and design for project, Volume 1, pp. 77-78. [9] Bellamy, D. & Pravica, L., 2011. Assesing the impact of driverless haul trucks in australian surface mining. Resources Policy, Volume 36, pp. 149-158. [10] Ahmed, A. et al., 2012. Using design of experiment and genetic algorithm to obtain the otimum gear shifting strategy for a real driving cycle. Applied mechanics and materials, Volume 224, pp. 497-503. [11] Boudreau-Trudel, B., Zaras, K., Nadeau, S. & Deschamps, I., 2014. Introduction of innovative equipment in mining: Impact on productivity. American Journal of industrial and Business Management, Volume 4, pp. 31-39. 154 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management A Policy Framework for Integrating Smart Asset Management within Operating Theatres in a Private Healthcare Group to Mitigate Critical System Failure C.B.H. Nel 1 , J.L. Jooste 1 1 Department of Industrial Engineering, Stellenbosch University, South Africa ABSTRACT Within the hospital environment health technologies include medical equipment, consisting of physical assets which directly affect human lives [1]. Apart from being responsible for patient well-being, the private healthcare industry is also a for-profit business, where assets are considered an investment and where utilisation, performance, downtime and maintenance need to be considered for determining the return on investment. It is therefore important, to have a well-integrated physical asset management programme for ensuring that the medical equipment is reliable, safe and available for use when it is needed. This paper proposes a policy framework for the implementation of smart asset management to assist with the strategic execution of physical asset management in private healthcare businesses. The focus of the study is on operating theatres, which is regarded as critical assets, both in terms of patient safety and operational importance. A case study approach is used where data is collected with structured questionnaires and further analyses of archival records is considered. Results from the data analysis is used to develop a policy framework for implementing smart asset management at the case organisation. The results cover smart asset management applications, requirements and benefits of implementation. The integration of performance management and condition monitoring within a smart asset management approach are discussed and it is concluded that this synergetic approach can support strategic execution of physical asset management to predict and mitigate critical system failure within the private healthcare industry. Validation of the policy framework is done through face validation, which highlights the business potential of smart asset management for assisting in strategic decision- making. Keywords: Physical Asset Management, Smart Asset Management (SAM), Integrated maintenance management, Integrated Asset Management, Industrial Asset Management, Condition Monitoring, Engineering 4.0, Internet of Things (IoT). Corresponding author: Charles B.H. Nel (cbhnel@gmail.com) 1. INTRODUCTION According to the World Health Organisation (WHO), health technologies are an essential basis for the correct functionality of an effective healthcare system [1]. It is within this hospital environment that medical equipment is considered assets that can directly affect human lives. However, hospitals within the private healthcare industry are also considered businesses which apart from being responsible for human lives, are also accountable to management to be profitable, where assets are also considerable investments and maintenance costs need to be factored into the return of asset investment as well as income generated from asset utilisation and performance. It is therefore important to have a well-planned and managed asset management (AM) programme that is able to keep the medical equipment in a private hospital reliable, safe and available for use when it is needed. The emphasis of this paper is towards the strategic execution of AM within an operating theatre, which is regarded by the health industry as a critical asset, both in terms of human safety and operational importance. Smart asset management (SAM) is a concept where the term “smart”, is increasingly used within the commercial environment, which relates to a perception of technological intelligence. The concept of the Internet of Things (IoT) has also become a reality, which necessitates the need for a different approach to managing physical assets 155 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [2]. With this emergence of technological intelligence, comes the possibility of vast quantities of asset data and the analysis thereof, which has proven to add value to asset management [3]. To capitalise and expand on this value realisation, SAM was established by incorporating various proven methodologies and applying these within real- time management environments. This research aims to create industry awareness about the business potential of incorporating SAM within automated asset environments for assisting in strategic management decisions. Considering the qualitative intricacies of this the research, it is established that management at various levels need to be engaged as well as informed about the context in which SAM can be utilised to address potential medical asset-related risks. The paper presents the result of case study research about the common understanding of technical managers operating within various private hospitals across Southern Africa. Based on the results a policy framework for implementing SAM is established from the feedback and experiences from stakeholders within the industry as well as archival records withdrawn from specific private hospitals to verify results. To validate the policy framework, face validation is performed. The policy framework serves to assist senior management to steer their focus and efforts towards practises that contribute to strategic execution for risk mitigation planning associated with critical asset failure. 2. LITERATURE REVIEW The SAM concept is associated with various keywords and concept variations as the term SAM itself is not fully explicated in current literature [2]. To define SAM within its own field of study, a suitable definition should represent both a practical implementation as well as an abstract or hypothetical optimum to strive towards. To do so the integral aspects of AM, is primarily the foundation of SAM. A brief overview of the current literature is provided. AM is becoming a well-established field of research with a variety of definitions, academic sources and alternative variants which further branch out into separate specialist studies. AM in the true sense of the term inherently relates to the simple concept of managing an asset. The term, asset, is defined by Oxford dictionary as “a useful or valuable thing or person” [3], essentially highlighting an asset as an attribute of value contribution. Therefore, the term AM relates to a concept of managing value contribution. It is evident that the term, asset, is therefore dependent on perceived value creation and contribution. Hence, the term, asset, can therefore also be defined with respect to the value contribution from physical equipment, where AM relates to the management of these physical items to control or propagate their value contribution [4,5,6,7]. Considering this value which AM can contribute, other sources [8,9,10,11,12,13,14,15,16,18] have also highlighted the positive contribution of real-time asset information being available within an organisation to make improved AM decisions. The Aberdeen Group highlights key enablers for successful implementation of “Real Time Enterprise Asset Management” which includes, but is not limited to; real-time monitoring of asset performance, real-time data exchange between systems that reside at the plant floor, Enterprise Resource Planning (ERP), workflow automation across plant floor and enterprise applications, and open architected, interoperable applications. It is therefore evident that within these established AM practices, the requirement for a real-time asset-monitoring environment can further add value to asset-owning organisations [17]. Within this AM value contribution opportunity, the impact of SAM becomes evident as a means to establish AM principles in a “smart” technologically aware manner with respect to real time asset monitoring. By doing so, real-time information throughout the organisation is possible, where this availability of information can facilitate more informed management decision-making [5]. Quality information is another concept closely associated with AM, where [18] states: “The quality of Asset Data & Knowledge should be assessed, understood and managed in order to ensure that it provides effective support to business decision making and processes”. AM literature refers to asset knowledge enablers, which consist of: asset information strategy, asset knowledge standards, asset information systems, and asset data and knowledge. These four asset knowledge enablers illustrate a flow from asset information, which leads to organisational knowledge through standards and systems. Furthermore, [19] concludes that “rational decision-making models establish a weighing mechanism between choice and value. Rational methodologies lead to the optimization of the outcomes by emphasizing the process of choosing rather than on what is chosen. A certain alternative is always selected 156 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management whenever its expected value is greater than that of other potential choices”. Therefore, rational decision-making is defined by the process of choosing the structures and standards which govern the choices we make. Quality information and specifically knowledge, therefore form the basis of quality decisions which may lead to quality results, when applied within a premeditated systematic, standardised operational process. From these theories, it is evident that AM standards are defined by an individual or a group who have considered which decisions may offer the best outcome, taking the probability of failure into consideration, while also acknowledging the potential risks associated with these decisions. This notion of rational decision-making illustrates how AM leaders need to make rationalised strategic decisions concerning assets. Their decision-making processes should facilitate the growth and development of structures, which focus on information becoming knowledge, in order to enable quality decisions, which allow for AM performance improvement. It is within this decision-making process that SAM can play a vital supporting role in accumulating asset information, used to enhance the quality and consistency of these strategic decisions. The Oxford Dictionary defines smart as “having or showing a quick-witted intelligence” [3], where smart when used as an adjective to describe a device, can be defined as “programmed so as to be capable of some independent action”. Smart appliances are also considered a growing consumer trend where it is a marketing-related term which is used to differentiate between ordinary household appliances with those having an ability of above average intelligence [2]. Various researchers have contributed to the term, smart object, by describing an object characterised by a degree of technological intelligence which is able to make informed decisions for itself or within a greater network of devices [15,19,20]. Smart fields, defined by de Best and van den Berg, is described as an integration of technology, processes and resources [21]. Furthermore, these researchers claim that asset managers should strive for continuous measurement to allow for improved decision-making [21]. In doing so, an association between the implementation of strategy and the risk management of assets can take place, where accurate operational feedback allows for managing the opportunity to make more informed strategic decisions. The word, smart, therefore relates to being able to interpret information into workable knowledge, which is useful to the end user. Currently available literature about SAM does not provide an industry standard definition or structure for SAM implementation, although similar concept variations exist. Concepts such as, Internet of Things (IoT), Industry 4.0 and Big Data, is associated with creating greater interdependence between technology, with specific focus on information feedback and asset management. Within a literature review, Nel and Jooste [2] presents how these concepts are used to derive a definition for SAM, while also highlighting how SAM facilitates asset communication within organisational management structures to better facilitate the control of operational activities. SAM can therefore be considered as taking two concepts – asset management and smart equipment (physical assets) – and integrating these for technological advantage. Smart assets can communicate within a given structure to assist management and make predefined decisions, or use artificial intelligence (machine learning) to improve productivity. SAM also serve to advance AM through the use of digital capabilities to enhance the control and performance of organisational assets. A “techno-organisational” environment is created which allows technology to assist with the implementation of organisational strategy via the intercommunication between assets with organisational infrastructure [15]. This creates the impression of artificial intelligence, where assets can think for themselves and for humans. 3. METHODOLOGY The research approach followed for this research is a case study investigation, where various sources of evidence are utilised for establishing the policy framework regarding SAM implementation. These sources are used for triangulation where the initial investigation is to establish current operational issues experienced with AM within the hospital maintenance environment. Initial data is collected from a homogenous sample of participants involving various hospitals within the Mediclinic Southern Africa (MCSA) group. Their feedback is analysed through a thematic analytic approach, which utilises coding techniques to group feedback from participants within general themes and concepts. 157 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Furthermore, quantitative data from the organisation’s Computerised Management Maintenance System (CMMS), and hospital archival records are utilised for further data verification and motivating the concerns experienced with the current maintenance methodology. Interpretation of the results is performed by comparing the various sources of evidence to form a basis from where a policy framework can be developed from industry best practises. The validation of this policy framework is achieved by means of face validation with a heterogenous sample of participants, representing the corporate management of MCSA who governs the implementation of MCSA standards through various policy frameworks. Within the MCSA group there are 52 hospitals each with a head of the technical department. Depending on the size of the hospital a technical supervisor or a technical manager holds this position. Assisting these hospitals on a regional level are five regional managers, four technical specialists and two senior managers in the corporate division of the organisation. The participants selection for the homogenous sample is done through purposive sampling by selecting participants based on their role and associated decision-making involvement in MCSA. For this study, the hospital technical managers were included in the homogenous sample, while for the heterogeneous sample regional or senior managers who are involved in corporate decision-making were included. The data collection is conducted in the form of a structured questionnaire, completed online though a Google forms platform. During the process, participants are required to answer a series of predefined questions. This allows for the analysis and interpretation of the collected evidence in a standard manner where the format of the interview is the same for all participants in the sample. The questionnaire provided for both closed and open-ended answers where the analysis of this information is related to the case organisation. 4. RESULTS AND DISCUSSION The questionnaires are analysed according to a six-phase process for conducting thematic analysis, where the researcher undergoes a familiarisation process to comprehend the data collected. This is followed by the generation of codes and theme categorisation. Association and pattern recognition is then performed, followed by interpretation and representation. Lastly, explanation and abstraction conclude the process. The outcome of the thematic analysis involves selecting vivid, compelling extract examples which demonstrate the feedback toward the questions posed in the questionnaire. The feedback is summarised in seven focus areas, namely; the life cycle approach to managing assets, asset risk management, criticality of life-support equipment, reactive notifications for asset malfunction, important asset infrastructure (or asset systems) in operating theatres, theatre unavailability and electronic data acquisition. Each of these focus areas are briefly discussed. From a life cycle approach to managing assets the questionnaire results are indicative that effective use of the CMMS is critical for managing assets. Technical managers use the information recorded in the CMMS to make decisions concerning the procurement, maintenance and replacement interval of certain assets. Regarding the maintenance of critical assets, the CMMS is used to produce both preventive maintenance, based on a time-based maintenance strategy, as well as breakdown notifications created from end users within the hospital. Preventive maintenance, which is controlled by the CMMS, is a reliable source of information, but depends on the feedback quality collected after completion of work orders. The quality of information received from breakdown maintenance work orders is frequently questionable due to the end user creating the work request. The end user is typically from other non-technical departments, resulting in incorrectly recorded information and in many cases request with missing information. Asset risk management commonly forms part of the life cycle approach to managing assets, where the managing of risk is covered within the asset management strategy. However, due to the nature of assets within the hospital environment, where above-average risks are involved, the importance of risk management should be highlighted. From the questionnaire results, it is evident that risks are managed from breakdown information recorded in the CMMS, which are monitored on a monthly basis. Furthermore, instantaneous alarms connected to certain critical assets, alert specific users who report faults via telephone to the technical department for corrective action. 158 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Critical life-support equipment are assets, which are critical to the operation of the operating theatre. Within the case organisation equipment is classified in three categories with the following definitions; life support, strategic, and general. Life support is considered the most critical as it affects the patient’s life directly. Strategic assets are those which can have an adverse effect on the operation of the business. Based on the questionnaire results, high priority items are those facilitating life-support equipment and include central assets systems such as the electrical reticulation, water reticulation (including emergency water), gas reticulation as well as climate control and sterilisation. These systems are central to the entire hospital where certain departments utilise more of these assets due to their support of patients’ lives. Reactive notification for asset malfunction relates to the delayed time for notifying the technical department about critical asset malfunction. The majority of respondents confirmed that these notifications are directly received from the end user. In most cases an alarm notifies the end user, where the end user then utilises telephonic contact as the fastest means of reporting system malfunction to the technical department. Although proactively, planned maintenance inspections are performed according to a time-based schedule and daily visual inspections are performed during the week, the questionnaire results are indicative that immediate notifications are lacking. An opportunity exists for proactively notifying about malfunction through the use of a real-time monitoring system. The results from the questionnaires highlights that not all life-support assets are considered relevant to the study. This study is only concerned with assets affecting the support of infrastructure within specific departments of a hospital. The highest priority maintenance departments are indicated in Figure 1 based on questionnaire responses which highlight each departments’ critical priority as well as economic revenue priority. It is indicative from the results that equipment associated with operating theatres is considered the most important to the case organisation. Figure 1. Interview Response for Departments with Critical Asset Infrastructures Theatre unavailability relates to the establishment of the operating theatre as the most important department in terms of maintenance priority and strategic importance, where the infrastructure of assets within this department are considered as having the greatest priority. Further to establishing the operating theatre as the most important department, Figure 2 depicts the ranking of the most important assets within the theatre environment. If any of these assets fail, the patient’s life will be in danger and the theatre would be deemed unavailable for use. The current practise within the case organisation for data acquisition is a strategic visual monitoring inspection of assets. Strategic assets are identified by each hospital, where qualified personnel (mostly artisans) conduct these checks and information is manually recorded on a checklist. However, the possibility arises that during a visual inspection, certain aspects can be overlooked. The information, which is instantaneous when manually recorded, is 159 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management also cumbersome to capture electronically. From the questionnaire results, it is evident that by utilising electronic monitoring devices, value can be derived from more timeous information to make improved decisions. This applies specifically within critical areas such as the operating theatres where centrally located asset systems support the hospital infrastructure both in terms of strategic business importance but also in terms of patient life support. Figure 2. Interview Response for Critical Infrastructure Assets within an Operating Theatre The analysis of archival records are used in support of the results from the questionnaires and outcomes of the thematic analysis. Information from the CMMS is extracted to highlight asset failures, whilst also looking at cases of asset failure and considering Mean Time Between Failure (MTBF). MTBF measures the average operational time between failures. It is considered an indication of reliability where historic data is used to highlight failure trends [22]. A low MTBF value represents a negative operational history; where the asset has experienced more failures over a total time, or has a low operational time representing availability. It is important to note that the time between failures is measured from the time of operation after the previous failure of the asset, to the time of the next failure. Only breakdowns are considered based on work orders created to repair damaged equipment. Table 1 presents an example of the calculations concerning MTBF data for the period June 2016 to July 2017 for a critical asset identified in Figure 2, namely an autoclave steriliser. Table 1: MTBF calculation for Autoclave steriliser 3 at Hospital 1 It is evident that various risks highlighted due to low MTBFs support responses from the questionnaire results. From the correlation between the MTBF results and qualitative evidence from the questionnaires, it is evident that CMMS as a reporting tool is effective when good quality information is recorded in the system. However, there is 160 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management proof from past cases, where critical failures occurred without data correctly monitored or recorded. Therefore, although the failure was dealt with and a solution was created, from an analysis point of view, poor quality records are on recorded in the CMMS. This data quality issue is mainly related to human intervention which is required to correctly record the information. To eliminate human intervention with respect to reporting faults, a system with characteristics identified in SAM could be utilised to correctly report faults from assets where a true indication of the number of faults and a true reflection of MTBF can be recorded. 5. POLICY FRAMEWORK FOR SAM IMPLEMENTATION From the data collection and analysis, it is evident that the utilisation of electronic monitoring devices, such as those identified within a SAM implementation plan is required, especially within critical areas such as the operating theatres. In response, a SAM policy solution is developed to assist technical managers within the private healthcare sector to implement a SAM plan to monitor critical assets for mitigating potential asset failure. To address the concerns identified during the analysis of the questionnaire and the analysis conducted during the verification process, where a breakdown explanation and abstraction is conducted, the proposed implementation of a policy framework for SAM is considered a viable solution to electronically monitor critical infrastructure assets within case organisation. The basic premise of the SAM implementation is characterised from an information loop as presented by Holdowsky et al. [23]. Within this research, a cyclical value-adding model is presented where conversion is established within the loop from machine intervention to real-world decisions affecting overall value contribution. Considering the external actions of the loop namely; act, create, communicate, aggregate and analyse; these actions are affected by the basic technological building blocks, which can be referred to as technological enablers, which assist in the information transfer between actions. From the basic sensory collection of raw data, converted to a digital language within a network, this information is then sorted and stored on a central platform enforcing certain criteria and standards to be implemented. Once this organised information is accessible, it is applied to preconceived knowledge criteria which are used as a reference to obtain an augmented intelligence which ultimately leads to a value-contributing decision or to augment behaviour. The steps outlined within Figure 3 are closely related to the theory presented by Holdowsky. Utilising these established procedures presents a valuable tool for AM decision-making, implementing of the collection of asset information within an SAM strategic plan. Building from this research and further research regarding the correct implementation of policy frameworks presented by Ostrom [24], a derived concept for a policy framework is created specifically for application within the case organisation. This policy framework is presented as an implementation philosophy of SAM where the basic flow of information as well as structure of the document is captured within the Figure 3. 161 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 3. Proposed SAM Continuous Improvement Cycle The premise of the policy framework resides in the amalgamation of existing fundamental MCSA standards and principles captured in established MCSA documents namely the Maintenance Pre-amble and Technical Operations Management System incorporating Planned Maintenance documents. The cyclical procedure presented in Figure 3 is further incorporated in the management of remote monitoring software applications classified within the SAM principles. Within the document fundamental principles are established outlining the intended purpose of the document, its intended implementation scope, original referenced documents as well as the responsibility of various key role players accountable for implementing the document. Key role players include the technical manager as well as the corporate technical operations managers who are required to assist with respect to implementation of current and newly established policies captured within the proposed document. Further to execution of the document, the expected procedure for implementation briefly discusses the premise of SAM, the existing standards and quality maintenance standards and best practise applications applicable for background knowledge. The scope of the intended SAM implementation is also clearly established as means to record information to further utilise in strategic decision-making, where clear guidelines are set for specific assets critical in nature, which need to be targeted. Furthermore, the implementation principle of achieving an actionable response is also clearly established highlighting the progression of information as in Figure 4. Figure 4. Progression of information flow Building on Figure 3, each process within the cycle is further expanded, where applications within a hospital 162 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management environment are established and discussed with examples for further practical implementation. Upon establishing the basis of implementation, further standards are established regarding the operation control within the case organisation, where concepts such; as training, awareness and competence, security of the system, operational checks and balances, critical equipment to be monitored, record keeping and the backup of information are aspects further deliberated highlighting the best practises for SAM implementation. For validation of the policy framework a face validation process is followed with a heterogeneous sample representing corporate technical operation management of the case organisation who governs the implementation of standards through various policy frameworks. Their feedback was supportive toward implementation of such a policy framework, with acknowledgement that such a concept did not exist currently. 6. CONCLUSIONS The paper proposes a policy framework for SAM implementation. It is evident from the results that current applications utilised within this specific case study are lacking with respect to monitoring critical assets as well as further mitigation of critical system failure. Therefore, the presented policy framework has potential to add value not only within the case organisation but also has potential value to industry to support the implementation of SAM. The limitations of the policy framework include the boundaries set by case study research investigation where a specific application within the private healthcare environment is considered. Furthermore, implementation of the document could not be established which leads to recommendations for future research within this field and other areas within industry. REFERENCES [1] WHO (2011) Medical equipment maintenance programme overview. WHO Medical device technical series, vol. 1, no. ISBN 978 92 4 150153 8, p. all. [2] Nel, C.B.H. and Jooste, J.L. (2016). A technologically-driven asset management approach to managing physical assets - a literature review and research agenda for smart asset management. South African Journal of Industrial Engineering, vol. 27(4), pp. 50-65. [3] Oxford (2016 May). Oxford dictionary [4] ISO55000 (2014). Management, ISO 55000 international standard for assets. Online. Available at: http://www.assetmanagementstandards.com [5] Schneider, J., Armin, J.G., Neumann, C., Hografer, J., Wellbow, W., Schwan, M. and Schnettler, A. (2006 March). Asset management techniques. Electrical Power and Energy Systems, vol. 28, pp. 643-654 [6] Campbell, J.D., Jardine, A.K.S. and McGlynn, J. (2011). ASSET MANAGEMENT EXCELLENCE: Optimizing Equipment Life- Cycle Decisions. Second edition ed. CRC Press Taylor and Francis Group. International Standard Book Number-13: 978-0-8493- 0324-1. [7] Hastings, N.A.J. (2009). Physical Asset Management. Springer Science & Business Media. [8] Salonen, A. (2011 March). Strategic maintenance development in manufacturing. Mälardalen University Press Dissertations, vol. 99, no. 99. Printed by Mälardalen University, Västerås, Sweden. [9] Berger, D. (2010 Feb). The growing value of a cmms. Available at: https://www.plantservices.com/articles/2010/02AssetManager/ [10] de Best, L. and van den Berg, F. (2006). Smart fields: Making the most of our assets. In: SPE Russian Oil and Gas Technical Conference and Exhibition, 3-6 October, Moscow, Russia. Society of Petroleum Engineers, SPE Russian Oil and Gas Technical Conference and Exhibition (3-6 October). Available at: https://www.onepetro.org/conference-paper/SPE-103575-RU [11] Bouleau, C., Gutierrez, F., Gehin, H., Landgren, K. and Miller, G. (2007/2008). Integrated asset managment. Oil eld Review, pp. 34- 48. [12] Halgamuge, M., Liyanage, J.P. and Mendis, P. (2010). Wireless technologies for integrated e-operations in offshore environments. EJSE Special Issue: Wireless Sensor Networks and Practical Applications, vol. Special Issue, pp. 100-112. [13] Dioguardi, R.M. and Smith, P. (2010). Use of r d asset management systems for monitoring analytical instrumentation. PerkinElmer Inc., vol. 22, no. 9, pp. 66-70. [14] Glova, J., Sabol, T. and Vajda, V. (2014). Business models for the internet of things environment. Elsevier B.V., vol. 15, pp. 1122- 1129. [15] Liyanage, J.P. and Langeland, T. (2009). Smart assets through digital capabilities. IGI Global, pp. 3480 3485. [16] Lopez, T.S., Ranasinghe, D.C., Patkai, B. and McFarlane, D. (2011). Taxonomy, technology and applications of smart objects. Inf Syst Front, vol. 13, p. 281 to 300. [17] Aberdeen (2008 June) Enterprise asset management: Maximizing return on assets and emerging trends. Aberdeen group Inc. [18] IAM, T. (2015). Asset management maturity scale and guidance. In: The Institute of Asset Management Annual Conference 2015, vol. 1 [19] Oliveira, A. (2007). A discussion of rational and psychological decision-making theories and models. Electronic Journal of Business Ethics Vaishnavi, V.K., Buchanan, G.C. and Jr., W.L.K. (1997). A data/knowledge paradigm for the modelling. IEEE 163 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, vol. 9, no. 2, pp. 275-291. [20] Gangyan, X., George, H.Q. and Fang, J. (2015). Cloud asset for urban flood control. Advanced Engineering Informatics, vol. 29, p. 355 to 365. and Organization Studies, vol. 12, no. 2, pp. 12 17. [21] de Best, L. and van den Berg, F. (2006). Smart elds: Making the most of our assets. In: SPE Russian Oil and Gas Technical Conference and Exhibition, 3-6 October, Moscow, Russia. Society of Petroleum Engineers, SPE Russian Oil and Gas Technical Conference and Exhibition (3-6 October). Available at: https://www.onepetro.org/conference-paper/SPE-103575-RU [22] Campbell, J.D., Jardine, A.K.S. and McGlynn, J. (2011). ASSET MANAGEMENT EXCELLENCE: Optimizing Equipment Life- Cycle Decisions. Second edition ed. CRC Press Taylor and Francis Group. International Standard Book Number-13:978-0-8493-0324- 1. [23] Holdowsky, J., Mahto, M., Raynor, M.E. and Cotteleer, M. (2015). Inside the internet of things (IoT). Deloitte University Press | DUPress.com. [24] Ostrom, E. (2011). Background on the institutional analysis and development framework. The Policy Studies Journal, vol. Vol. 39, No. 1, pp. 11 - 27. 164 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Identifying current challenges of data-based maintenance management: a case study S. Marttonen-Arola1, D. Baglee1 1 Faculty of Engineering and Advanced Manufacturing, University of Sunderland, UK ABSTRACT Exabytes of data from various sources are available for maintenance decision makers. The yearly increase in data is exponential due to technological developments such as the rapid increase in the amount of interconnected systems and assets, which utilize smart sensors, cloud-based computing and eMaintenance. All of these are supported by the rapid developments in the Internet. The data provide vast possibilities for smart, autonomous assets and predictive maintenance. However, in practice, there are technical, managerial, and organizational challenges, which impede the maintenance decision makers from exploiting the information retrieved from the data analyses. The existing literature has discussed the data required in different maintenance decision making situations extensively, although there is a limited number of academic publications which explore general-level frameworks or tools to support the management of maintenance data. This paper builds upon a review of the current literature on the value of maintenance data management. The data needed to support a number of different maintenance management situations are discussed, and an approach to analyze and increase the value and resource efficiency of the maintenance data management process is suggested. The paper presents a case study example conducted in collaboration with a UK manufacturing industry. The objective of the paper is to map the current state of maintenance data exploitation paths. This makes the different value-based development needs in the data management process visible. The results of this paper will contribute to future empirical research including modelling and optimizing the use of data in maintenance decision making through adopting lean management principles. The majority of previous lean management research has focused on the optimal management of production processes and the maintenance processes. In this research, the principles of lean management will be taken to the level of optimizing the maintenance data management process. Keywords: Maintenance data; Decision making; Case study; Value; Industrial asset management. Corresponding author: Salla Marttonen-Arola (salla.marttonen-arola@sunderland.ac.uk) 1. INTRODUCTION The recent technological developments regarding for instance eMaintenance, sensors, and the Internet of Things have exponentially increased the amount of data available to maintenance decision makers [1-2], providing vast possibilities for novel approaches in smart, predictive maintenance [3-4]. However, in practice many companies are struggling with the process of gathering, integrating, analyzing, and exploiting their maintenance data [5], and so far they have not been able to harvest the value of the data. There is no universally agreed definition for maintenance data. In this paper it is defined to include asset data (technical, operational, financial, environmental, and performance of the assets), data on the planned and implemented maintenance actions, and supportive data (on e.g. strategy, processes, risks, and contracts) [6-10]. The optimal amount of maintenance data to be gathered and analyzed depends on the size of the company, the type of business, the complexity of processes, the competence of people, and the complexity of assets [11]. The value of data is an ambiguous term, and systematical approaches to assessing or even defining it are still scarce both in industry and in academia. The objective of this paper is to introduce a value-based approach to studying maintenance data management processes. The current state of maintenance data exploitation paths are then mapped through a case study. The paper contributes to the understanding of the value of data in maintenance 165 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management management, and helps to build the foundation for further research including value-based modelling of maintenance data and methods for lean maintenance data management. In section two of the paper, the research design and methods are discussed. Section three addresses previous literature on maintenance data and its value. Section four presents the results and observations from the case study, and the paper finishes with conclusions in section five. 2. RESEARCH DESIGN In this research a literature-based approach to evaluating the value of maintenance data is created and evaluated using a case study example. The selected case company operates in the food and drink industry in the UK. The company is interested in increasing the value of their maintenance through better exploitation of their maintenance data. However currently, the state of maintenance data management in the company is limited; they are operating with manual data collection, the maintenance engineers are still learning how and why they should document their work, and data is mostly used to monitor corrective maintenance tasks which constitute a majority of the company’s maintenance. The case company studied which maintenance tasks take up the most of their maintenance engineers’ time, and based on the results they wanted to analyze the data management processes of three specific maintenance tasks in more detail: 1) Changing a retort probe when it has been damaged, 2) Installing or removing a specific conveyor belt used when making cannelloni, and 3) Cleaning the printer head of videojets after the quality of the print has been deemed suboptimal. To analyse the data management processes in the selected maintenance tasks, the case company’s maintenance work requests from July 2017 until mid-January 2018 were studied. Two maintenance managers from the company were also interviewed to gain understanding on the current processes and practices in the company. The maintenance work requests from the studied period included 284 retort probe changes, 92 cannelloni belt installations/removals, and 117 videojet head cleanings. In total, these 493 work requests equaled 12% of all the maintenance work requests from the assembly department during the studied period, and 5.3% of all the maintenance work requests in the case company during the period. All three studied maintenance tasks are simple and quite straightforward to conduct: the time used in completing them equals only 5% of the time used in completing all the work requests from the assembly department, and 1.8% of the time used to complete all the work requests in the case company. 3. LITERATURE REVIEW The recent technological developments have taken maintenance decision making towards real-time decisions. The time available for analyzing the data to support maintenance decisions is decreasing, and so is the time required for data collection [5]. Sun et al. [12] have recognized four different time scales in asset management decisions: 1) Strategic decisions ranging from one to five years (e.g. developing strategic asset management plans), 2) Technical decisions ranging from a month to a year (e.g. creating preventive maintenance plans for major assets), 3) Implementation decisions ranging from days to a month (e.g. scheduling maintenance actions), and 4) Reactive decisions ranging from minutes to a day (e.g. selecting maintenance actions in case of unexpected asset failure). To form a more comprehensive view of the data needed to support maintenance decisions, the authors reviewed the literature on various decision-making situations in maintenance management. In figure 1 below, various decisions are presented according to the categorization of the four different time scales in decision making. In the 166 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management empirical part of this paper, the focus is on reactive decisions related to designing and implementing corrective maintenance actions. The main elements of a corrective maintenance plan can be seen as detecting failures, and deciding on whether to repair or replace the asset, immediately or later. To optimize these decisions maintenance managers would need data related to e.g. asset condition and life cycle, costs, safety and production. [13-16] STRATEGIC DECISIONS TECHNICAL DECISIONS IMPLEMENTATION DECISIONS Setting maintenance objectives and optimization criteria [11-12, 14, 17] Selecting and updating a maintenance strategy [11-12, 14, 18] Identifying suitable maintenance policies (e.g. corrective maintenance, time-based maintenance, condition-based maintenance [19-20] Maintenance resources management and development (skills, personnel, budget, new technologies etc.) [14, 21-22] Deciding on maintenance contracts and outsourcing [21] Interaction with other units, companies and systems [21] Deciding how risks and opportunities should be addressed [11] Decisions to invest in new assets or in non- asset alternatives [5, 10] Asset replacement and end-of-life strategies [10, 16-17] Maintenance performance measurement and continuous improvement [14] Developing maintenance plans and programmes [11-12, 14, 21] Corrective maintenance: - Methods to detect failures, - When to repair an asset [19], - When to replace an asset [19, 23] Time-based maintenance: - Maintenance frequency Condition-based maintenance: - Parameters to be measured [24], - Measurement techniques [24], - Measurement locations [24], - Inspection frequency [17, 20], - Action limits [17, 24], - Required maintenance actions [24] Spare parts management and control [17, 20-21] Evaluating the performance of the maintenance plans [11] Scheduling maintenance activities [11-12, 17] Maintenance workforce allocation [11-12, 20] Material delivery timetables [11-12] REACTIVE DECISIONS Triggered by unplanned events, creating a need for maintenance decisions on a short notice. Feedback FeedbackObjectives Objectives Figure 1. Classification of maintenance decision making situations. When collecting and analyzing maintenance data it should be noted that, the objective is to create more value for the maintenance decision makers. The value of data or information still has not received much attention in scientific literature, and practical applications on the topic are mostly missing. The value of asset management data in fleet contexts has previously been discussed by e.g. [25], who consider data management to be an investment to be paid off by the benefits achieved through improved decision-making. The costs of data management can be categorized into hardware, software, and work hour based costs. The benefits, on the other hand, include saving in maintenance costs, savings in quality costs, as well as other savings and benefits. [25] To define what constitutes value in this study, the view on value proposition in information services presented in [26] is adopted. According to this view, the value of information depends on not only the information items provided, but also on the amount, quality, format, time, place, and price or cost of the information as follows: 167 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Value of information = Right information, in the right level of detail, in the right condition, at the right time, at the right place, and for an appropriate price [26]. The optimal extent of maintenance data to be gathered and analyzed to support decision making is highly case specific. Data collection and analysis must not be too complex or strenuous, however collecting data only on the level of e.g. large systems might invalidate the whole process if the data do not provide root causes for failures and thus do not enable improvements of maintenance plans [27]. Thus the optimal data depend on the maturity of the organization in question. Data can be collected from both internal and external sources. When considering which data to collect, [8] suggests all organizations to at least record the details of their corrective maintenance actions. The standards recommend coding data using mutually exclusive codes, and using additional free text to provide further details [8]. It is necessary to acknowledge the potential problems in data quality to evaluate and manage the impact on maintenance decisions. According to the categorization presented by [28], data uncertainty can be caused by: 1) Input data (e.g. material properties, sampling rates), 2) Measurements (e.g. sensor noise), 3) Operating environment (e.g. high variability in conditions), and 4) Modelling errors (e.g. lack of understanding about the process to be modelled). In practice, there are often a large number of quality problems in maintenance data. According to the categorization of levels of data completeness in asset record databases presented by [29]:  Approximately 10-20% of the records can be considered “perfect data” which do not require extrapolation or expert judgement before exploitation,  35-40% of the records are “imperfect data”, requiring additional efforts to ensure the quality,  25% of the records are “verbal/inspection data”, which require e.g. expert judgement for verification, and  10-15% of the records are “soft data” relying on human perceptions and/or memory, which require a lot of time to summarize and can be considered inconsistent quality-wise. The concept of value of data offers an approach to analyzing data management processes and identifying their current value-destroying features. The case study in the next section presents an example of mapping the current maintenance data flows in an industrial context through analyzing the various dimensions of value of data. 4. CASE STUDY 4.1. The current maintenance process Figure 2 depicts the current maintenance data management process in the case company. The corrective maintenance process is triggered by an asset failure, after which the asset operators use a specific manual form to file a maintenance work request. All the three studied maintenance tasks are production critical (the failures cause the production line to stop), so the maintenance engineer(s) react to the work requests as quickly as possible, executing the requested maintenance task and restoring the asset into normal operating condition. The engineers report any possible needs for follow-up maintenance work, and they use the manual forms to document the details of the conducted maintenance work. The forms are then manually transferred to the maintenance management team. The maintenance managers insert the data gathered with the manual work request forms into electronic spreadsheets once a week. Every week the maintenance managers use the spreadsheet to construct a report to describe the performance of the maintenance department to the company management. In addition to performance 168 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management reporting, the maintenance management team regularly uses the spreadsheet data to identify bottlenecks and development needs in their maintenance processes. Asset failure Maintenance work request Maintenance work execution Potential follow-up maintenance Reporting with manual forms Data into electronic formatData analysis Maintenance planning Asset operating normally Asset operating normally Reporting to management Figure 2. The current corrective maintenance data management process in the case company. 4.2. Identifying value-destroying challenges in the process The six dimensions of value of information presented in section two have been used as the basis of the analysis addressed here. Table 1 below summarizes the identified value-destroying challenges in the current maintenance data management process of the case company. Eight challenges were identified, out of which three were deemed more significant (at the moment): poor quality data, manual data, and unnecessary data processing. The poor quality of maintenance data is a significant challenge because it can void the entire data management process if maintenance decisions are based on incorrect data. In the future, the case company will contribute to this challenge by promoting instructions and awareness on how and why maintenance data is collected. The condition the data is processed in (documented in manual work request forms) also causes quality problems and makes data analysis slower. The case company has considered implementing technological solutions to support transferring the manual data into electronic format, or adopting a computer-based maintenance management system to make the whole data management process electronic and to increase the quality of data. However, so far the managers of the company have averted from investing in software and/or hardware before the basic process and motives of maintenance data management have been adopted and accepted by the personnel. This decision could be costly, as mentioned in table 1 the amount of time currently used in unnecessary data processing is significant and must be a cause for lots of wasted personnel costs. It would be important for the case company to conduct a quantitative investment appraisal on comparing the costs of the current process, including working hours, with the costs of implementing a computerized system. 169 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 1. Identified value-destroying challenges in the current maintenance data management process of the case company. DIMENSION OF VALUE IDENTIFIED CHALLENGE IMPACT ON MAINTENANCE Right information 1) Missing information: Details of the used spare parts are not documented consistently. For retort probe changes, only 11% of the work request forms documented the used spare parts at all, and most of them were missing details such as part numbers or specific location. The costs of spare parts cannot be allocated properly, the performance of spare part management cannot be evaluated, and proactive spare part management cannot be supported by data analysis. 2) Unnecessary information: The engineers use time documenting some information items which are constant for each of the three tasks (e.g. impact and nature of work). The unnecessary data get carried through the process and cause both the maintenance engineers and maintenance managers to use additional time in processing it. Right level of detail 3) Poor quality data: 12% of the studied work requests had visible quality problems that made them unreliable regarding data analysis (e.g. several tasks bundled in the same work request, several work requests created on the same task, or maintenance start time later than the reported finish). The maintenance engineers and managers use their time in inserting the poor quality data into the manual forms and spreadsheets. In addition, the incorrect data might lead to incorrect conclusions in maintenance decision making. 4) Confusion over key terminology: There are no explicit instructions on how to document the data and what kind of terminology should be used, so the engineers use a number of different terms when writing about the same issues/assets/tasks. The reliability of the data can be questioned because the maintenance management team has to interpret the data before inserting it into the spreadsheet. Data analysis takes more manual work. Right condition 5) Manual data: The manual forms are strenuous to fill and to interpret. The poor legibility of the forms causes additional work for the maintenance managers. Sometimes the maintenance management team struggles to read the hand-written forms, which slows the data processing and again reduces the reliability of the data. At the right time 6) Slow process: The manual forms cause severe delays to the data management process. It can occasionally take weeks before the data of a maintenance task is inserted into the spreadsheet. The data is not timely enough to support any kind of proactive maintenance decision making. Currently this is not a very significant problem since the data is mostly used to monitor corrective maintenance on a weekly or monthly basis, but in the future it should be taken into account. At the right place 7) Poor traceability: The traceability of the manual forms is not good and occasionally they get lost in the process. The data management process is slower, and using data to support proactive decision making in maintenance is not possible. For an appropriate price 8) Unnecessary data processing: The arduous process causes additional work, for instance the maintenance managers examine the forms again when inserting the data into the spreadsheet. If inserting the data from one form takes on average 5 minutes, there is 62 hours/year of unnecessary work related to the three studied maintenance tasks, and 1165 hours/year considering all work requests. 170 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 5. CONCLUSIONS This paper has suggested an approach to analysing and increasing the value of the maintenance data management process. An industrial case study example has been presented to demonstrate how the current state and main challenges of the data management process can be mapped. The paper contributes to theory through showing how the data management process can be analysed and improved by adapting the concept of value of information, which has been presented in previous literature. Industry and academia agree on the importance of data in maintenance management, but in practice many organizations are struggling with developing their data management processes. The main managerial contribution of this paper is related to focusing on value-centred thinking, which enables achieving business feasibility in data management. It is critical to remember that data do not create value unless exploited in decision making. It is easy to forget this and implement various technological solutions to create more and more data without stopping to think whether it is actually needed. Due to the limited extent of this paper, developing the data management process systematically to maximise the value, as well as measuring the value of information quantitatively were not included. These topics will be included in the later phases of the research project. The value-based analysis presented in this paper will be integrated into a method of adopting lean principles in the maintenance data management process to maximise the value while minimizing the waste. The majority of existing lean management research has focused on optimizing production processes, with a few exceptions of studying the maintenance process. In this research project the principles of lean management will be adapted to the data management process in maintenance. Further research will also include creating a value-based cost model for measuring and improving maintenance data management to better communicate the value of information to company managers. ACKNOWLEDGEMENTS This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 751622. REFERENCES [1] Baglee, D., Marttonen, S., Galar, D. (2015) The need for big data collection and analyses to support the development of an advanced maintenance strategy. Proceedings of the 11th International Conference on Data Mining. Las Vegas. July 27-30. pp. 3-9. [2] Candell, O., Karim, R., Söderholm, P. (2009) eMaintenance – Information logistics for maintenance support. Robotics and Computer- Integrated Manufacturing. 25 (6). pp. 937–944. [3] Bumblauskas, D., Gemmill, D., Igou, A., Anzengruber, J. (2017) Smart Maintenance Decision Support Systems (SMDSS) based on corporate big data analytics. Expert Systems With Applications. 90 (C). pp. 303–317. [4] Crespo Márquez, A. (2007) The maintenance management framework. Models and methods for complex systems maintenance. Springer series in reliability engineering. ISBN 978-1-84628-820-3. [5] Kinnunen, S.K., Marttonen-Arola, S., Ylä-Kujala, A., Kärri, T., Ahonen, T., Valkokari, P., Baglee, D. (2016) Decision making situations define data requirements in fleet asset management. Proceedings of the 10th World Congress in Engineering Asset Management (WCEAM 2015). Lecture Notes in Mechanical Engineering (2195-4356). Springer. pp. 357-364. ISBN 978-3-319- 27062-3. [6] Marttonen-Arola, S., Baglee, D., Kinnunen, S.-K., Holgado, M. (2018) Introducing lean into maintenance data management: a decision making approach. Submitted to the 13th World Congress on Engineering Asset Management (WCEAM). Stavanger, Norway. September 24-26. [7] BS EN 13306 Std. (2010) Maintenance. Maintenance terminology. BSI Standards Ltd. ISBN 978-0-580-64184-8. [8] BS EN ISO 14224 Std. (2006) Petroleum, petrochemical and natural gas industries – Collection and exchange of reliability and maintenance data for equipment. BSI Standards Ltd. ISBN 978-0-580-50138-8. [9] Wang, L., Qian, Y., Li, Y., Liu, Y. (2017) Research on CBM information system architecture based on multi-dimensional operation and maintenance data. IEEE International Conference on Prognostics and Health Management. Allen, TX, USA. June 19-21. ISBN 978-1-5090-0382-2. [10] BS ISO 55002 Std. (2014) Asset management. Management systems – Guidelines for the application of ISO 55001. BSI Standards Ltd. ISBN 978-0-580-86468-1. [11] BS ISO 55001 Std. (2014) Asset management. Management systems – Requirements. BSI Standards Ltd. ISBN 978-0-580-75128-8. [12] Sun, Y., Fidge, C., Ma, L. (2008) A generic split process model for asset management decision-making. Proceedings of the 3rd World Congress on Engineering Asset Management and Intelligent Maintenance Systems. Beijing, China. 171 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [13] Ahonen, T. (2005) Eri tietolähteiden käyttö kunnossapidon tukena. In: Helle, A. (Ed.) Kunnossapito ja prognostiikka. Prognos- vuosiseminaari 2005. VTT Symposium 239. Otamedia Oy. Espoo. ISBN 951-38-6302-6. [14] BS EN 16646 Std. (2014) Maintenance – Maintenance within physical asset management. BSI Standards Ltd. ISBN 978-0-580- 83321-2. [15] Karim, R., Westerberg, J., Galar, D., Kumar, U. (2016) Maintenance analytics – the new know in maintenance. IFAC-PapersOnLine. 49 (28). pp. 214–219. [16] Shafiee, M., Animah, I. (2017) Life extension decision making of safety critical systems: an overview. Journal of Loss Prevention in the Process Industries. 47. pp. 174–188. [17] Jardine, A.K.S., Montgomery, N. (2009) Optimal maintenance decisions for asset managers. Industrial Engineer. 41 (6). pp. 44–49. [18] Dong, Y.-L., Gu, Y.-J., Yang, K. (2004) Research on the condition based maintenance decision of equipment in power plant. Proceedings of the Third International Conference on Machine Learning and Cybernetics. Shanghai. 26-29 August. IEEE. pp. 3468– 3473. ISBN 0-7803-8403-2. [19] Kareem, B., Jewo, A.O. (2015) Development of a model for failure prediction on critical equipment in the petrochemical industry. Engineering Failure Analysis. 56. pp. 338–347. [20] Sharma, A., Yadava, G.S., Deshmukh, S.G. (2011) A literature review and future perspectives on maintenance optimization. Journal of Quality in Maintenance Engineering. 17 (1). pp. 5–25. [21] Marttonen-Arola, S., Ali-Marttila, M., Ylä-Kujala, A., Saunila, M., Pekkola, S., Sinkkonen, T., Kärri, T., Pekkarinen, O., Rantala, T., Ukko, J. (2016) Towards comprehensive value management in inter-organizational industrial maintenance. International Journal of Condition Monitoring and Diagnostic Engineering Management. 19 (3). pp. 27–32. [22] BS ISO 55000 Std. (2014) Asset management. Overview, principles and terminology. BSI Standards Ltd. ISBN 978-0-580-86467-4. [23] Ahmad, R., Kamaruddin, S. (2011) Maintenance decision making method for repairable system by using output-based maintenance technique: a case study at pulp manufacturing industry. IEEE. pp. 15–20. ISBN 978-1-61284-486-2. [24] BS ISO 17359 Std. (2011) Condition monitoring and diagnostics of machines – General guidelines. BSI Standards Ltd. ISBN 978-0- 580-67132-6. [25] Kinnunen, S.-K., Marttonen-Arola, S., Kärri, T. (2016) Value of fleet information in asset management. In: Galar, D., Seneviratne, D. (Eds.) (2016) Proceedings of the 6th International Conference on Maintenance Performance Measurement and Management (MPMM 2016). Luleå, Sweden. November 28. pp. 76–80. ISBN 978-91-7583-841-0. [26] Bucherer, E., Uckelmann, D. (2011) Business models for the Internet of Things. In: Uckelmann, D., Harrison, M., Michahelles, F. (Eds.) Architecting the Internet of Things. Springer. 352 p. e-ISBN 978-3-642-19157-2. [27] Kunttu, S., Kortelainen, H. (2004) Supporting maintenance decisions with expert and event data. Proceedings of the annual Reliability and Maintainability Symposium. Los Angeles, CA, USA. January 26-29. IEEE. pp. 593–599. [28] Javed, K., Gouriveau, R., Zerhouni, N. (2017) State of the art and taxonomy of prognostics approaches, trends of prognostics applications and open issues towards maturity at different technology readiness levels. Mechanical Systems and Signal Processing. 94. pp. 214–236. [29] Hale, P.S. Jr., Arno, R.G. (2009) Operational and maintenance data collection for determining site reliability or availability. IEEE Industrial Applications Magazine. Sept/Oct. pp. 21–24. 172 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Investigations on Augmented Reality based maintenance practices within SMEs M. Müller1,2, D. Stegelmeyer2, R. Mishra1 1 School of Computing and Engineering, University of Huddersfield, United Kingdom 2 Institut für Interdisziplinäre Technik, Frankfurt University of Applied Sciences, Germany ABSTRACT Maintenance services provided by Original Equipment Manufacturers (OEMs) play a fundamental role in industrial asset management. The vast majority of OEMs provide maintenance services to their customers, as most customers now outsource complex maintenance tasks. For many OEMs the economic value of operations has even shifted from manufacturing traditional products, to providing services required to operate and maintain their installed base. In general downtime is expensive and maintenance activities are often conducted under time pressure. However, maintenance processes are knowledge intensive and field service technicians have to cope with increasingly complex and diverse systems. To stand out from competitors and to satisfy customer needs, productivity improvement in maintenance processes is urgently required. Augmented Reality (AR) technologies, e.g. head mounted displays (HMD) can assist field service technicians in improving efficiency of maintenance operations. Systems for mobile collaborative AR supported maintenance are commercially available, and the benefits of AR supported maintenance work are generally evident. Still, on a wider scale in the industrial maintenance service delivery, the technology is far from established. From the perspective of small and medium-sized enterprises (SME), with a limited number of field service technicians, remote support for customers and service partners might be an opportunity to enhance resources availability. This study investigates ten business cases of mobile collaborative AR systems using HMDs for services delivery. The research strategy follows an exploratory approach. Based on focus group discussions with representatives of SMEs and semi-structured interviews with industry experts, enablers and barriers for the use of mobile collaborative AR systems are determined. Keywords: augmented reality, head mounted displays, industrial asset management, maintenance, servitization Corresponding author: Maike Müller (maike.mueller@hud.ac.uk) 1. INTRODUCTION In recent decades, traditional manufacturing companies of different kinds of capital goods have moved up the value chain and become service or solution based businesses in order to distinguish themselves from their competitors, better meet customer’s needs, and thus survive in increasingly competitive product markets [1–5]. For many OEMs, regardless of their position in the supply chain, the economic value has shifted from manufacturing traditional products, to providing services required to operate and maintain their installed base [3][6]. The vast majority of capital goods manufacturers provides at least basic services, such as spare parts supply or repair and maintenance- on-demand [6]. In general downtime is expensive and maintenance activities are often conducted under time pressure. Besides, customers often expect availability commitments for specialised service technicians for recurring maintenance activities as well as in cases of unscheduled downtime. Maintenance processes, however, are knowledge intensive and maintenance personnel have to cope with increasingly complex and diverse systems [7]. To stand out from competitors and satisfy customer needs, productivity improvement in maintenance processes is required. With respect to their limited resources and difficulties to implement service-oriented strategies, SMEs are challenged in particular [8][9]. Nevertheless, SMEs are an important part of most economies. In Germany, for instance, 85 % of machinery and equipment industries are SMEs. They account for nearly 30 % of the total turnover generated and 37 % of the people employed by the sector as a whole [10]. 173 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management One concept, to address aforementioned challenges and increasingly complex maintenance operations, is the real- time collaboration between experts and on-site field service technicians via mobile collaborative AR systems using HMDs [11][12]. Generally, remote expert assistance produces quicker and more accurate troubleshooting and repairs [13] and is one of the most popular application areas for HMDs [14]. For SMEs, considering their limited resources, mobile collaborative AR might be an opportunity to increase customer satisfaction, withstand competitive pressure, and broaden their sales network [15]. But even though AR and HMDs have been a research interest for more than 50 years [16], there are very few concrete examples of implementation in industry and if implemented, most applications are on a field test level in large enterprises (LE) [17][18]. In the past, researchers developed prototypical systems for mobile collaborative AR and tested those systems in experiments or case studies [11][12][15][19][20]. Those systems usually include a wearable computing system (e.g. HMD) worn by a technician and a user interface to enable collaborative communication between remote experts and technicians in the field. The main function of the HMD is to capture a live audio and video stream [11]. The user interface displays the live stream captured by the HMD’s camera and features communication and documentation functionality, e.g. video recording, screenshots, document sharing, chat, language translation and laser pointing into the live stream. HMDs used can be either see-through HMDs or video display HMDs (non-see-through). See-through HMDs are based on waveguides, semi-transparent monocular or binocular mirrors that reflect computer generated images to augment the real world (e.g. Epson BT- 350, Google Glass, Microsoft HoloLens). Non-see-through HMDs capture the real world, overlay the computer- generated information and display the composited image in a small monitor placed in front of one of the user’s eyes (e.g. RealWear HMT-1, Vuzix M300) [18]. Mobile collaborative AR systems, both hardware and user interfaces (e.g. web-based communication platforms), are commercially available. However, on a broad scale in industrial maintenance service delivery, the technology is far from established. Only 3.4 % of companies currently use HMDs in the manufacturing context [14]. Previous work on mobile collaborative AR has mostly been focused on the development of systems and aspects of usability of features [21][22], whereas now that the technology seems to be capable of handling complex tasks, our focus is on service business cases enabled through the technology. In particular, we would like to explore barriers and enablers for the use of mobile collaborative AR systems supporting services provided by small and medium-sized machinery manufacturers (OEMs) on their installed base. Even though we investigate OEMs that offer divers products, the service delivery is characterised by some similarities: Those similarities are: a multi-variant technically complex installed base, customers that typically outsource complex maintenance tasks and often expect availability commitments. Especially, availability commitments for specialised service technicians are a major concern for SMEs. In this paper we solely take the perspective of the OEM providing paid services for their installed base. Therefore, the term maintenance describes service offerings, the OEM charges for. We do not take the perspective of the customer and/or their maintenance function. 2. METHODOLOGY AND DATA COLLECTION Given the comparative lack of existing literature on the business perspective of mobile collaborative AR, especially on barriers and enablers, we aim to explore business cases, barriers and enablers for the use of the technology in service delivery. In order to address the aforementioned research aim, a qualitative research approach with a strong exploratory notion has been used. This approach was selected on conceptual grounds rather than for its representativeness [23]. Data was collected in two focus group discussions and individual semi-structured interviews with 20 representatives from 17 German companies (table 1). The companies used in our sample were classified following the definition of the Institut für Mittelstandsforschung [24]. According to this definition SMEs are the companies that employ less than 500 people. The sample consists of 14 machinery manufacturers, referred to as OEMs, of which ten are SMEs and four are LEs. Focus group participants and interviewees all held positions of strategic responsibility for the service business to ensure validity of results. In Addition to the OEM perspective, representatives of two HMD manufacturers and a system integrator, referred to as industry experts, are also part of the sample to broaden the perspective and provided additional insights. The industry experts provide insight from their experience of many AR implementation projects in capital goods industries. Additionally, 24 use cases of AR supported service delivery found in non-academic publications on the internet were analysed and compared to the findings derived from the empirical data (table 2). This aimed to fill the above mentioned lack of academic literature. 174 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 1: Sample of the interview study The focus group discussions took place in December 2017 and January 2018. We selected companies that intended to use HMDs for their service delivery in the near future and do not compete in the same markets. The absence of completion was important to ensure that participants share their views openly. However, the experience of focus group participants with the technology varied. Some companies tested HMDs in the past, while other participants had no experience with HMDs. This limited experience was addressed within a half-day workshop on features of the technology prior to the discussion. In addition to the focus group discussions, five semi-structured interviews with industry experts and companies that already tested HMDs within their service operation were conducted. These interviews aimed to gain a deeper understanding of business cases, enablers and barriers previously discussed in the focus groups (table 1). The interviews were conducted between February and March 2018 via telephone and lasted between 60 and 120 minutes. Data collected from focus groups and interviews were recorded and transcribed for analysis. The empirical data was analysed by applying thematic coding according to Miles et al. (2014) using NVivo 11 [25]. Table 2: Analysed internet use cases of AR systems Position of Interviewees Company / Industry Company Size Company Type Interview Method General Manager 1 General Manager 2 Air Cleaning Plant Technology SME OEM Focus Group and Individual Interview Head of Service Sales & Repair Coating Equipment LE OEM Focus Group Manager After Sales 1 Manager After Sales 2 Control and Regulation Systems LE OEM Focus Group Head of Customer Support Food Processing Plant Engineering LE OEM Focus Group CEO HMD Hardware/Software System Integrator SME HMD provider / industry expert Focus Group and Individual Interview Business Development Manager HMD Manufacturer SME HMD provider / industry expert Focus Group Individual Interview Business Development Director HMD Manufacturer & Software Provider SME HMD provider / industry expert Focus Group and Individual Interview Innovation Manager Machine Tools LE OEM Focus Group CEO Machine Tools SME OEM Focus Group Head of Customer Support Service Engineer Machine Tools SME OEM Focus Group Quality and Process Management Engineer Machine Tools SME OEM Focus Group Service Manager Machine Tools SME OEM Focus Group Director of Service and Quality Management Machine Tools SME OEM Focus Group Head of Mechanical Engineering Plastics Processing Machinery SME OEM Focus Group Head of Customer Support Power Plant Engineering SME OEM Focus Group CEO Steam Generator Technology SME OEM Individual Interview Sales Director Wastewater Technology SME OEM Focus Group No. Use Case Main features of AR system No. Use Case Main features of AR system # 1 Bosch Automotive Unidirectional AR assembly instruction, contextual data provision, access to knowledge data base # 13 MA micro automation MCAR-system # 2 Bosch Rexroth MCAR-system # 14 MAN Truck & Bus Unidirectional AR aassemly/repair instructions, 3D- tutorial training, contextual data provision # 3 Caterpillar MCAR-system # 15 Mekorot Unidirectional AR assembly instruction, MCAR-system # 4 Daikin Netherlands MCAR-system, access to knowledge data base and error codes # 16 NORDEN MCAR-system # 5 GE Aviation AR aassemly/repair instructions with feedback on performance of task, video tutorials, MCAR-system # 17 Porsche MCAR-system # 6 Gebhardt Fördertechnik MCAR-system # 18 Schmidt & Heinzmann MCAR-system, overlay of process data # 7 Getinge Unidirectional AR training # 19 Schulz Systemtechnik Unidirectional AR assembly instruction, overlay of process data # 8 Koenig & Bauer MCAR-system, access to knowledge data base (projected) # 20 Systronik AR-based measure monitoring # 9 Kieback & Peter MCAR-system # 21 TGW Lifetime MCAR-system # 10 Kurtz MCAR-system # 22 Thyssenkrupp Unidirectional AR projection of 3D-models of products (pre-sales), MCAR-system # 11 Lee Company MCAR-system # 23 VMT - Tunnelbau MCAR-system # 12 Leybold Unidirectional AR assembly instruction, contextual data provision # 24 Wärtsilä MCAR-system, overlay of process data MCAR-system = mobile callaboration AR-system 175 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3. RESULTS AND DISCUSSION The following section presents the results of our investigation and compares the findings of present work to earlier works. First, ten business cases for the technology are presented. Thereafter, enablers and barriers for the use of the technology, which we derived from the two focus group discussions, the five interviews and the supporting 24 use cases, are discussed. Our research identified ten business cases listed in table 3. Besides supporting their own service technicians with mobile collaboration AR systems, SMEs also intend to equip service partners, sub-contractors and customer’s personnel. Inspection, diagnosis and repair, assembly/installation, and training are the main service operations companies propose to support with the use of HMDs. Our findings correspond with Palmarini et al. (2018) who analysed 30 academic articles on the state of the art of AR in maintenance and found that dis/assembly (33%), repair (26%), inspection/diagnosis (26%) and training (15%) were the maintenance operations described in those articles [18]. In addition, we learned that application support to improve customer production processes was identified as a valuable business case for mobile collaborative AR systems. Application issues usually occur sporadically. Thus, the remote approach enables the application engineer to support customers in the moment of occurrence. In correspondence with Quint et al. (2017), the use cases analysed (table 2) are confirmed to be demonstrators or test level systems of predominately LEs [17]. Therefore, those use cases provided little information on organizational barriers of the implementation of mobile collaborative AR systems from the perspective of SMEs. Nonetheless, the use cases analysed did confirm the interest of companies in mobile collaborative AR systems as 18 out of 24 feature this functionality. Other functionalities found in the use cases were unidirectional AR application supporting step- by-step instructions and training scenarios that aim to reduce errors and minimize training effort. Table 3: Business cases and value of mobile collaborative AR systems for service delivery 3.1. Enablers and Opportunities Improvement of OEM service delivery (Business cases #1, #4, #7, #8, #10, table 3) Companies expect that hands-free collaboration and “seeing through the users’ eye” will lead to productivity improvement in service delivery. If the problem can't be solved immediately, at least errors occurred and spare parts needed can be identified more quickly and more precisely. The mobile collaborative AR approach is superior compared to today’s hotline support in combination with emailed photos or videos. The findings can be compared Business Case No. Description Value for SMEs Discussed in focus group # 1 OEM's service technicians are equipped with HMDs and are remotely supported by a specialist. Time saving for diagnosis/repairs/identification of needed spare parts, spontaneous involvement of further specialists, new technicians are faster able to work independently, automatic video/screenshot documentation in session protocols both # 2 Service technicians of service partners and sub-contractors are equipped with HMD and are remotely supported by an OEM's specialist. Reduction of traveling time and expenses, increase of partners service quality though remote OEM back up, consistent service portfolio also for distant customers both # 3 Customer's maintenance personnel is equipped with HMD and is remotely supported by a OME's specialist. Reduction of traveling time and expenses, avoidance of unnecessary technician deployment, briefer respond time to service inquiries (increase of service quality through ad hoc support if no spare parts are needed), precise spare part identification both # 4 Customer's operators are equipped with HMDs and are remotely instructed by a OEM's trainer, e.g. in case of frequent fluctuation. Resource saving for operator on-site trainings, better training results compared to showroom trainings (actions have been really executed by operator and not only simulated by trainer, session video can be recalled by customer operators) both # 5 Service technicians of service partners and sub-contractors are equipped with HMDs and are remotely instructed by a OEM's trainer. Increase of service quality (better skilled technicians), reduction of training cost could lead to more frequent trainings, increase of customer loyalty through international quality service, less "firefighting" through headquarter one # 6 Service personnel of subsidiaries are equipped with HMDs and are remotely instructed by a OEM's trainer. Time and cost savings for trainer travels, more frequent training through decreased time and costs, increase of international service quality, less "firefighting" through headquarter both System installation # 7 OEM's service technician is equipped with HMD and is supported by OEM's or component supplier's specialist, e.g. software engineers or design engineers. Relaxation of human resources situation (saving specialist travels), spontaneous involvement of further specialists, installation condition documented immediately (as is versus as engineered) one # 8 OEM's service technicians is equipped with HMD and remotely supported in the process of system acceptance. Saving specialist travels, spontaneous involvement of further specialists, video documentation of performance approval according to sales contract one # 9 Pre-delivery acceptance test of sub-supplier's components. OEMs technicians is equipped with the HMD and components are remotely accepted by the sub-supplier. Faster delivery of system to end customer, decrease of cost for acceptance tests (sub- suppliers traveling expenses) one Application support # 10 OEMs remote application specialist collaboratively optimises customer's production process, while customer's operator is equipped with HMD and executes instructions on-site. Specialist support in the moment of occurrence (application issues often occur sporadic), saving specialist travels one System acceptance tests Inspection, diagnosis and repair Training 176 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management to Fussell et al. (2003), who found that technology enabled shared visual spaces are superior compared to audio only collaboration [26]. The latest generation of HMDs seem to be mature enough to provide those shared visual spaces. Equipping technicians with HMDs also enables companies to deploy them to customers with less training, as they are covered by remote specialists. „With the glasses [mobile collaborative AR system] new field service technicians don‘t have to be deployed alongside an experienced technician as a ‚second man’ for such a long time to become familiar with our diverse systems.” Predictably initial training is one of the most popular application areas for HMDs [14]. Reduction of service costs for OEM (Business cases #1, #2, #7, #8, #9) Cost reduction, unsurprisingly, is an enabler of mobile collaborative AR. Cost reduction primarily derives from the reduction of traveling time and expenses for deploying field service technicians, trainers, and specialists. In SMEs the company’s specialised know-how is often concentrated in few people. A general manager of an SME stated: “[…] we can multiply our experience without going on an airplane. This is really valuable to us.” Mourtzis et al. (2017) presents a case of a robotics SME that could lower costs from 1.370 € to 150 € for a battery replacement, a problem which can also be solved by instructing the customer’s workers remotely, instead of deploying an OEM’s technician to the customer’s production site [15]. Especially during the warranty period of products, field service technician deployment is aimed to be avoided, as cost can’t be charged. If the first error detection process can be conducted remotely, the value obtained is twofold. First, unnecessary technician deployment can be diminished, which reduces costs. A statement of a focus group participant illustrates the problem: “[…] It happens that someone needs to go to Poland [from Germany] and then he takes two wires, reconnects them and drives back home. For this purpose the glasses are perfect. […]”. Second, service enquiries can be responded to faster, which may avoid contractual penalties. “Particularly the immediate [response is valuable to us]. I really can’t tell the customer, ‘it is a warranty case, so I will have time for you only next week’.” Ability to respond to service enquiry (Business cases #2, #4, #10) Specialised field service technicians are hardly available on labour markets. Equipping customers with HMDs enables ad hoc problem solving and minimises the effects of the human resource shortage. Deploying technicians to customer’s sites for minor problem solving leads to serious problems for SMEs. With their limited number of field service technicians, the use of mobile collaborative AR systems can ensure the ability to respond to customer errors in case of simultaneous service enquiries. As one manager puts it “We are a relatively small company and we have a major problem with capacity. We can't recruit 200 percent technicians, because there is no one available. […] For me it is not so much about money, but about being able to respond at all.” International consistent quality of service (Business cases #2, #3, #4, #5, #6, #10) During the discussion the participants stressed that generally, domestic service is of higher quality than service to distant customers. The use of mobile collaborative AR systems also enables SMEs to provide their customers with an international service network. Equipping subcontractors, service partners and customers with HMDs supports quality standards. “On the one hand, I train the employees [of the service partner], on the other hand we enable them to provide services to the customers which they can’t provide today. Because they are just not capable to provide these services. Besides, the training can be performed while he [the service partner’s technician] is having the problem at the end customer’s site.” The qualification of service partners is important because SMEs are generally more reliant on a broader partner network to perform their field services [27] than LEs with subsidiaries in major markets. Above that, the reduction of training costs is expected to lead to more frequent training of subsidiaries, service partners and customer’s operators. This increases service and product quality through improved operation of machinery and superior service of international service networks. Increase of customer satisfaction (Business cases #2, #3, #4, #5, #6, #10) Training, application service, error diagnosis and the identification of spare parts are major value propositions for OEMs to their customers. More efficient and effective management of the technicians leads to an increase of customer satisfaction. With the help of mobile collaborative AR systems, previously neglected distant customers may enjoy similar service levels as domestic customers. “[…] When customers buy new systems, they always ask how we can support them. And if I have to say, ‘of course you can call the hotline support in Germany […]’, the Vietnamese certainly says ‘nice offer, but you are not available during my operating hours’. Therefore, we certainly benefit form qualifying service partners. Because it increases customer loyalty, if our systems can be serviced properly.” Thus, customer satisfaction will increase. 177 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Remarkably, we found only one OEM (use case #2, table 2) promoting lower total cost of ownership (TCO) to the customer. Neither in the focus group discussions, nor in the interviews did we observed this enabler. 3.2. Barriers Connectivity (Business cases #1, #2, #7, #8, table 3) Maintenance service usually takes place in the field or at the customer’s production site. However, customers are often reluctant to open their Wi-Fi networks for external devices. Moreover, production sites may be situated in rural regions with poor bandwidth of wireless data connectivity. “A problem that we definitely do have with the glasses is mobile connectivity, since our machinery is often located in some factory in the back of beyond. […] Mobile reception is already bad outside at the car park and when you enter the building, then it becomes even worse.” Bottecchina et al. (2010) found that under experimental conditions experts preferred video resolution of at least 640x480 [12], which might not always be possible. Thus, connectivity is a major barrier to mobile collaborative AR field service. Privacy protection (All ten business cases) A major advantage of mobile collaborative AR systems is the possibility of storing screenshots and videos of maintenance or training sessions. However, the documentation function of the system is a barrier with respect to privacy concerns. Data security representatives and trade unions often insist that consent of all recorded participants is required. Although privacy protection is more likely to be an issue of LEs, rather than for SMEs, privacy concerns can also be a barrier from the personal perspective of technicians. The feeling of being monitored could become an issue. Protection of intellectual property (Business cases #1, #2, #5, #6, #7, #8, #9) Many companies within our sample serve customers that are rigorous in protecting their intellectual property. Interviewees assume, that technicians wearing HMDs might be banned from working at customer’s production sites. As a participant puts it: “We are forced to hand over our smartphone when we enter […] and now we should convince them to walk in wearing glasses that feature a camera system with a video store function.”. However, it was also stressed that when it comes to machine breakdowns, customers usually leave all concern behind since unavailability of production facilities is a much larger concern. “Even the automotive industry gets this idea, […] either your production is down for four hours or a day until my technician is there, or there is someone wearing glasses and your production continues […].” Resistance of technicians (All ten business cases) As with every new technology getting people to use HMDs might be a challenge. The reason for the resistance can be diverse, ranging from fashionable appearance to privacy concerns [28]. Besides, even if the benefits of the system are apparent from the business perspective, the personal perspective of technicians might be different. “[…] experienced technicians often diagnose problems in a hands-on process. Some just don’t see the advantages [of the technology] and therefore are reluctant. These older service guys are used to their tool kit and some even have problems in using a smartphone.” Loss of service orders (Business case #3, #4, #5, #9, #10) Deploying service technicians is an important source of revenue for many companies [29]. Equipping customers with HMDs poses the risk of losing field service orders, by training customer’s personnel with every remote maintenance session. Mourtzis et al. (2017) outline a maintenance case of a robotics SME that illustrates this risk. In their case, a battery replacement is described to be crucial for securing the robots function, and usually requires the OEM to deploy a technician to the customer’s site, although the task only requires simple assembly actions and could easily be explained remotely [15]. With this procedure the OEM qualifies the personnel of the customer. This qualification could lead to decreased service revenues, when customers execute simple tasks on their own. Obviously, this barrier is contrary to the aforementioned ensure ability to respond enabler. Business model innovation (All ten business cases) With respect to innovation enabled by the industrial internet of things, SMEs are typically challenged by their lack of resources, management know-how, selection of suitable solutions and the lack of transparency of the value of technologies [30]. Against those findings, the “transparency of value” does not seem to be a barrier concerning the use of mobile collaborative AR systems. All companies within our sample formulated the value of the technology 178 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management quite clearly. Consequently, a consensus was found in one focus group discussion that companies must be prepared with a smart service business model for the after-warranty period to monetize the improvements in service quality. However, as Kowalkowski et al. (2013) indicate, SMEs have more difficulties than larger companies to charge for service at all, as their revenue models are often based on product unit sales and large customers expect services for free [9]. Thus, using mobile collaborative AR systems might be an opportunity to innovate the business model and apply a more service-orientated strategy. Managers do not seem to have worked out such business models. 4. CONCLUSION With the description of concrete business cases, enablers and barriers for AR supported service delivery, we contribute to the scarce research of servitization from the perspective of SMEs. Our investigation indicates that use of mobile collaborative AR systems is an opportunity for SMEs to improve their service quality and extend their service businesses. New business opportunities arise, when providing mobile collaborative AR systems to service partners, sub-contractors and customers. The reduction of time that adds value (e.g. traveling time, lengthy error diagnostics, unnecessary technician deployment) and frees capacity to perform additional service orders subsequently might increase service revenues. The technology also facilitates international consistent quality of service to distant customers, which is a major challenge for SMEs. For SMEs who are generally more dependent on external service partners to provide customer support, mobile collaborative AR systems are an opportunity to expand their service business. Notably, no interviewee took the customers’ perspective and considered the decreased total cost of ownership. Decreasing total cost of ownership might lead to higher sales of new equipment. However, we were able to identify barriers to be addressed. Connectivity problems might be removed in the near future with the introduction of 5G networks. Barriers regarding protection of individual’s privacy and company’s intellectual property might be solved with further technology developments and/or contractual negotiation. Reluctance of human beings might disappear once the technology becomes more widespread. As of now, the loss of service orders and with it the lack of new business models to monetize the innovation is an unsolved issue. Unlike the study by BMWi (2015) [30] suggests, we could not confirm the lack of transparency of the value barrier for mobile collaborative AR systems. 5. LIMITATIONS AND FURTHER RESEARCH The research strategy and data collection method are not without limitations. Especially external validity and transferability are issues when using a qualitative research approach. All results and conclusions are derived from information that relies on views from a limited number of individuals, stemming from a limited number of companies. The companies and interviewees were selected to ensure the validity of this study. However, to describe the investigated phenomena more precisely and confirm validity of the results, a larger number of companies should be included in our study. As more and more companies are implementing mobile collaborative AR systems, the outcome of this investigation should be tested in a quantitative large sample study. Furthermore, the economic value of the ten identified business cases should be assessed. A better understanding of barriers and enablers of mobile collaborative AR systems in field service is indispensable for developing robust business models for SMEs. From the perspective of servitization research it would be interesting to investigate whether or not AR is an enabler for SME servitization. REFERENCES [1] VanDerMerwe S, Rada JF (1988) Servitization of business: Adding value by adding services. European Management Journal 6(4). [2] Martin CR, Horne DA (1992) Restructuring towards a Service Orientation: The Strategic Challenges. International Journal of Service Industry Management 3(1):25–38. [3] Wise R, Baumgartner P (September/Oktober Issue 1999) Go downstream: The new profit imperative in manufacturing. Harvard Business Review 77(5):133–41. [4] Mathieu V (2001) Product services: From a service supporting the product to a service supporting the client. Journal of Business & Industrial Marketing 16(1):39–61. [5] Davies A, Brady T, Hobday M (2006) Charting a path toward integrated solutions. MIT Sloan Management Review 47(3). 179 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [6] Lay G, Copani G, Jäger A, Biege S (2010) The relevance of service in European manufacturing industries. Journal of Service Management 21(5):715–26. [7] Aromaa S, Väätänen A, Aaltonen I, Heimonen T (2015) A model for gathering and sharing knowledge in maintenance work. Proceedings of the 33rd Annual Conference of the European Association of Cognitive Ergonomics, pp. 1–8. [8] Storey DJ, Greene FJ (2010) Small business and entrepreneurship. Financial Times Prentice Hall, Harlow. [9] Kowalkowski C, Witell L, Gustafsson A (2013) Any way goes: Identifying value constellations for service infusion in SMEs. Industrial Marketing Management 42(1):18–30. [10] The Federal Statistic Office. Destatis Datenbank. Beschäftigte und Umsatz der Betriebe im verarbeitenden Gewerbe: Deutschland, Jahre, Beschäftigtengrößenklassen (Wirtschaftszweige WZ2008 2-4-Steller Hierarchie) [Employees and Sales of German manufacturing companies of industry sector WZ2008]. [11] Alem L, Tecchia F, Huang W (2011) Remote Tele-assistance System for Maintenance Operators in Mines. 11th Underground Coal Operators' Conference, University of Wollongong & the Australasian Institute of Mining and Metallurgy:171–7. [12] Bottecchia S, Cieutat J-M, Jessel J-P (2010) T.A.C: Augmented Reality System for Collaborative Tele-Assistance in the Field of Maintenance through Internet. Proceedings of the 1st Augmented Human International Conference, Article No. 14. [13] Kraut RE, Miller MD, Siegel J (1996) Collaboration in Performance of Physical Tasks: Effects on Outcomes and Communication. Proceedings of the 1996 ACM conference on Computer supported cooperative work:57–66. [14] Plutz M, Große Böckmann M, Siebenkotten P, Schmitt R (2016) Smart Glasses in der Produktion [smart glasses in manufacturing]: Studienbericht des Fraunhofer-Instituts für Produktionstechnologie IPT [Survey report of the Fraunhofer Institute for Production Technology]. Fraunhofer IPT, Aachen. [15] Mourtzis D, Zogopoulos V, Vlachou E (2017) Augmented Reality Application to Support Remote Maintenance as a Service in the Robotics Industry. Procedia CIRP 63:46–51. [16] Sutherland IE (1965) The Ultimate Display. Proceedings of IFIP Congress:506–8. [17] Quint F, Loch F, Bertram P (2017) The Challenge of Introducing AR in Industry - Results of a Participative Process Involving Maintenance Engineers. Procedia Manufacturing 11:1319–23. [18] Palmarini R, Erkoyuncu JA, Roy R, Torabmostaedi H (2018) A systematic review of augmented reality applications in maintenance. Robotics and Computer-Integrated Manufacturing 49:215–28. [19] Kurata T, Sakata N, Kourogi M, Kuzuoka H, Billinghurst M (2004) Remote Collaboration using a Shoulder-Worn Active Camera/Laser. ISWC 2004: Eighth International Symposium on Wearable Computers proceedings 31 October-3 November, 2004, Arlington, Virginia. IEEE Computer Society. Los Alamitos, Calif, pp. 62–69. [20] Zhong XW, Boulanger P, Georganas ND (2002) Collaborative Augmented Reality: A Prototype for Industrial Training. 21th Biennial Symposium on Communication, Canada. [21] Alem L, Huang W (2011) Developing Mobile Remote Collaboration Systems for Industrial Use: Some Design Challenges. in Campos P, Graham N, Jorge J, Nunes N, Palanque P, Winckler M, (Eds.). Human-computer interaction - INTERACT 2011: 13th IFIP TC 13 international conference, Lisbon, Portugal, September 5 - 9, 2011 ; proceedings, part IV. Springer. Berlin, pp. 442–445. [22] Huck-Fries V, Wiegand F, Klinker K, Wiesche M, Krcmar H (2017) Datenbrillen in der Wartung [smart glasses in maintenance]. in Eibl M, Gaedke M, (Eds.). Informatik 2017 - Bände I-III: Tagung vom 25.-29. September 2017 in Chemnitz. Gesellschaft für Informatik. Bonn. [23] Miles MB, Huberman AM (1994) Qualitative data analysis: An expanded sourcebook. 2nd ed. Sage, Thousand Oaks, Calif. [24] Günterberg B (2012) Unternehmensgrößenstatistik [company size statistics]: Unternehmen, Umsatz und sozialversicherungspflichtige Beschäftigte 2004 bis 2009 in Deutschland. Ergebnisse des Unternehmensregisters (URS 95), Bonn. [25] Miles MB, Huberman AM, Saldaña J (2014) Qualitative data analysis: A methods sourcebook. 3rd ed. Sage, Los Angeles, London, New Delhi, Singapore, Washington DC. [26] Fussell, Susan R., Setlock, Leslie D., Kraut, Robert E. (2003) Effects of Head-Mounted and Scene-Oriented Video Systems on Remote Collaboration on Physical Tasks. New horizons conference proceedings, Conference on Human Factors in Computing Systems 5(1):513–20. [27] Gebauer H, Paiola M, Edvardsson B (2010) Service business development in small and medium capital goods manufacturing companies. Managing Service Quality 20(2):123–39. [28] van Krevelen DWF, Poelman R (2010) Survey of Augmented Reality Technologies, Applications and Limitations. The International Journal of Virtual Reality 9(2):1–20. [29] VDMA (2016) VDMA-Benchmarks Kundendienst 2016 [customer service benchmarking survey], Frankfurt. [30] BMWi (2015) Industrie 4.0 - Volks- und betriebswirtschaftliche Faktoren für den Standort Deutschland [Study on macroeconomic and microeconomic factors in the context of Industry 4.0 in Germany]: Eine Studie im Rahmen der Begleitforschung zum Technologieprogramm AUTONOMIK für Industrie 4.0, Berlin. 180 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Discrepancy analysis for gearbox condition monitoring: A comparison of different healthy data models S. Schmidt1, P.S. Heyns1, K.C. Gryllias2,3 1 Centre for Asset Integrity Management, Department of Mechanical and Aeronautical Engineering, University of Pretoria, South Africa 2 Department of Mechanical Engineering, PMA division, KU Leuven, Belgium 3Dynamics of Mechanical and Mechatronic Systems, Flanders Make, Belgium ABSTRACT Discrepancy analysis is a novelty detection technique that uses diagnostic information, extracted from vibration signals that were acquired from a rotating machine, and a model of the healthy historical machine data to infer the condition of the gearbox under varying operating conditions. In discrepancy analysis, various healthy data models such as Gaussian mixture models can potentially be used to model the healthy features, however, the performance of these models have not been compared in a systematic way for discrepancy analysis. In this work, a principal component reconstruction-based model is considered for discrepancy analysis. Moreover the ability of Gaussian models, Gaussian mixture models, hidden Markov models, and principal component-based reconstruction models to detect bearing damage in gearboxes operating under varying speed conditions is evaluated and compared. The gearbox data are generated using a phenomenological numerical gearbox model and the performance of the different models is compared by investigating the sensitivity of the models to detect changes in the magnitude of impulses induced by bearing damage and to compare the models’ robustness to changes in operating conditions. The results provide further insight into the properties of the different models and can assist with model selection when investigating new datasets with discrepancy analysis. Keywords: Condition monitoring; Diagnosis; Discrepancy analysis; Fault detection and localisation Corresponding author: Stephan Schmidt (stephanschmidt@zoho.com) 1. INTRODUCTION Condition monitoring is performed on critical assets such as gearboxes to ensure that they operate as intended. Under varying operating conditions, it is difficult to discern between changes in machine condition and changes in operating condition [1] and therefore diagnostic techniques, which allow the condition of the machine to be inferred under varying operating conditions, are actively investigated. Discrepancy analysis is a novelty detection approach that uses the measured condition monitoring data and a model of the historical condition monitoring data of a healthy machine to generate a discrepancy signal. This discrepancy signal quantifies the deviation of the machine’s response from the response of a healthy machine for each time step and can be processed to infer the condition of gears and bearings of gearboxes. The healthy data model forms a central part of discrepancy analysis, with different models such as Gaussian [2], Gaussian mixture [3,4] and hidden Markov models [5] that can be used. A comparison of the aforementioned models are required to assist with model selection in future investigations. In this paper, different healthy data models are investigated and compared on numerical gearbox data simulating bearing damage under varying speed conditions, with principal component analysis also investigated to obtain a healthy data model for discrepancy analysis. 181 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 2. DISCREPANCY ANALYSIS Discrepancy analysis consist of (1) data acquisition and processing, (2) feature extraction, (3) feature modelling and discrepancy signal generation, and (4) discrepancy signal processing. A very brief overview of each step is given in this section, with a more detailed description can be found in the paper by Schmidt et al. [2]. 2.1. Data acquisition and processing Vibration measurements are used as condition monitoring data, because the data are rich with diagnostic information and can easily be acquired. The damage components are generated from physical rotating components which result in the signals to be angle periodic [6], hence the discrepancy signal is analysed in the angle domain by extracting features from the angle domain. Therefore, the rotational speed or phase information of a shaft has to be measured or estimated from the vibration data as well. 2.2. Feature extraction The purpose of the feature extraction process is to extract diagnostic information from a signal that is rich with information pertaining to the condition, the operating conditions and the operating environment of the machine. The bearing features used in this investigation are based on the features used by Schmidt et al. [2]. The wavelet packet transform is used with a Daubechies db1 wavelet function to decompose the time domain vibration signal into a set of wavelet coefficients. The Daubechies db1 basis function is used because it is able to detect singularities induced by bearing damage. A four level wavelet packet decomposition is performed which results in 16 wavelet coefficients, associated with different frequency bands, to be obtained. This allows the frequency bands to have a relatively fine resolution to detect damage manifesting in specific frequency bands and the features to have an acceptable dimensionality to avoid potential problems associated with the curse of dimensionality. The wavelet coefficients of each frequency band are separately order tracked, whereafter the localised RMS is extracted from a moving average window with a length of 2π/50 radians and a 50% overlap between adjacent windows. The short window length makes it possible to detect localised changes in the vibration signal, however, a method to select the optimal window length for bearing diagnostics still needs to be determined. Hence, instead of obtaining a single 16 dimensional feature for a single vibration measurement i.e. by calculating the RMS of the wavelet coefficients for all shaft angles, many localised 16 dimensional features are extracted from a single vibration measurement by extracting features from the moving windows. A detailed description of the windowing and feature extraction procedures can be found in Ref. [2]. Order tracking is performed on the wavelet packet coefficients instead of the vibration signal so that the angle-time cyclostationary properties of the bearing damage impulses are preserved. 2.3. Feature modelling and discrepancy signal generation The features of a healthy gearbox, extracted from the previous section, is subsequently modelled by the models investigated in this section, where new features extracted from a single window, denoted by b , are evaluated with the discrepancy function )(bS to obtain a discrepancy measure. The discrepancy measures that correspond to the features extracted from each window are combined to form a discrepancy signal which is subsequently analysed for damage [2]. The dimensionality of the feature space is 16, which means that the dimensionality of the features b are 16 x 1. A brief overview of each model and its associated discrepancy measure are given in this section. 2.3.1. Gaussian Model (GM) A multivariate Gaussian Model (GM) is one of the simplest models that can be used for discrepancy analysis and has been successful for bearing diagnostics under varying speed conditions [2]. The Mahalanobis distance for the features b )()()( 1 μbΣμbb  TS , (1) 182 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management uses the mean μ and covariance Σ , estimated from the healthy data, and is used as a discrepancy measure. If the features have a dimensionality of D, then the mean μ is a 1D vector while the covariance Σ is a DD matrix. The features investigated in this work result in 16D and the mean and covariances have the same size throughout this work, unless stated otherwise. The GM has the benefit that it is easy (i.e. it has closed form maximum likelihood solutions) and computationally efficient to implement, however, it is incapable of describing multi- modal and non-Gaussian distributions. 2.3.2. Gaussian Mixture Model (GMM) The Gaussian Mixture Model (GMM) is a generalisation of the GM, where a weighted sum of N-Gaussian densities are used to approximate the density of the features under investigation. The GMM has the benefit that it can approximate any density to an arbitrary accuracy if a sufficient number of mixture components is used [7]. The GMM has been successfully used for discrepancy analysis-based gear diagnostics [3,4], with the NLL of the discrepancy signal      N i iiiGS 1 ),|(log)( Σμbb  , (2) used as a discrepancy measure for the GMM. In Equation (2), i is the weight of the ith mixture component and ),|( iiG Σμb denotes the Gaussian probability density function associated with the ith mixture component. Even though the GMM is very flexible, it requires the estimation of many parameters (i.e. N-means, N-covariances and N-weight parameters) with an optimisation procedure, and the appropriate number of mixture components needs to be estimated from the data as well. GMMs are also susceptible to overfitting and care needs to be taken when optimising the models [7]. In this investigation, the Expectation-Maximisation (EM) algorithm is used to obtain the model parameters using a maximum likelihood framework. 2.3.3. Hidden Markov Model (HMM) The aforementioned models are optimised based on the assumption that each data point is independent and identically distributed (i.i.d.), which may be limiting. HMMs overcome the i.i.d. limitation by assuming that the features are generated by a latent state that follows a Markov process and the features are related to the latent states by an observation density. HMMs have been successful in the discrepancy analysis field [5] and in this investigation, a first-order Markov process is used with a Gaussian observation distribution for each latent state. The discrepancy measure for the HMM is calculated with  ),|(log)( 11 bbbb  iii pS , (3) for the feature at window i. The HMM is even more computationally expensive than the GMM, because the transition probability matrix of the Markov process needs to be estimated as well. The additional temporal dimension gives the HMM additional discriminatory power for detecting novelties over the GMM and GM. The mean and covariance of the Gaussian observation density of each state, the initial state, and the transition matrix are obtained using the Baum-Welch algorithm using a maximum likelihood framework in this investigation. 2.3.4. Principal Component Analysis-based Reconstruction (PCA-R) Principal Component Analysis (PCA) is a popular dimensionality reduction technique, where the eigenvectors V of the covariance matrix of the features are used to transform the features to a lower subspace by removing redundant information. The dimensionality reduction is performed with )(:1 μbVz  Td , (4) 183 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management where d:1V represents the eigenvectors associated with the d-largest eigenvalues and has a dimensionality of dD , μ denotes the mean of the healthy features and z denotes the principal components with a dimensionality of 1d . It is possible to obtain a one dimensional signal from the original signal with PCA (i.e. 1d ), however, this is not a discrepancy signal and very poor results are obtained when using it in a discrepancy analysis context. In this paper, a new discrepancy analysis approach based on PCA is investigated. The mean and the eigenvectors of the healthy features are used to map the lower dimensional principal component features with a dimensionality of 1d back to the original feature space with μμbVVb  )(:1:1 Tddrec , (5) where the reconstructed features are denoted by recb which is a 1D vector. The root-mean-square of the reconstruction error recbb  is used as a discrepancy measure    bμμbVVbμμbVVb  )()(1)( :1:1:1:1 TddTTddDS . (6) If d < D, then the reconstruction error will be non-zero and if the machine condition changes, the reconstruction error will increase. The dimensionality d is estimated by selecting the minimum PCA dimension that has an accumulative contribution rate of 95%. 2.4. Discrepancy signal post-processing It is possible to calculate a statistic of the discrepancy signal and use it to infer changes in the condition of the machine, however, further processing is required to infer the condition of a specific machine component. The spectrum of the discrepancy signal can be used with the characteristic frequencies of the rotational components (e.g. ball-pass frequency) to determine which components generate the damage. As a result, similarly to Ref.[2], the spectrum of the discrepancy signal is used to infer the condition of the bearings. 3. NUMERICAL VALIDATION A phenomenological gearbox model proposed by Abboud et al. [8] is used to compare the different healthy data models. The motivation for the phenomenological gearbox model is that it is able to simulate many of the complex phenomena seen in actual gearboxes (e.g. dependence of the signal’s amplitude on speed) and the exact operating conditions and the exact condition of the gearbox are known. 3.1. Overview of model The casing vibration signal of the gearbox comprises of the contributions of the gear mesh components gx , the damaged bearing bx , and broadband noise nx in the following form )()()()( txtxFStxtx nbgc  , (7) where the fault severity FS is a factor that linearly increases the bearing components’ magnitude. The different components can be decomposed in terms of the signal at the source, convoluted with the impulse response function of the system            t k dgteeth N k k dggg dkNAtMthtx dg 01 )(cos))(()()(  , (8) 184 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management        impN i ibb TttMthtx 1 )())(()()(  , (9) ))(()()( tMttxn  , (10) where )(thi denotes the impulse response function of characteristic i, the magnitude and the phase of the kth harmonic of the gear component are denoted by kgA and k g . The rotational speed of the shaft is denoted by  , with the magnitude of the vibration component increasing with rotational speed and is modelled with baM  )( . During the measurement, impN bearing impulses occur, with the ith bearing impulse occurring at time step iT . The raw noise is sampled from a zero mean Gaussian distribution and is denoted by )(t at time t. Outer race bearing damage is simulated with the model, with the characteristic frequency of the damage being 4.12 shaft orders. The number of teeth on the gear that is connected to the shaft under consideration is 20teethN . Four different rotational speed profiles are investigated for the phenomenological gearbox model and presented in Figure 1. The four operating conditions help to compare the robustness of the discrepancy signals, obtained from the different models, to varying speed conditions. Figure 1: The Operating Condition (OC) profiles investigated for the phenomenological gearbox model. Five different FS, namely 0.0, 0.02, 0.04, 0.06 and 0.08 are investigated here as well. The FS merely quantifies the relative magnitude of the bearing impulses of the different cases e.g. a FS of 0.02 contains impulses that are on average four times smaller than a FS of 0.08, while FS = 0.0 simulates a healthy bearing. The three vibration components and the casing vibration signal are presented in Figure 2 for a FS = 0.02 with operating condition profile 2 (i.e. OC: 2) being used during the simulation. Figure 2: The casing vibration signal of operating condition 2 and its components for the case where the bearing damage FS is 0.02. 185 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The casing vibration signal and the corresponding rotational speed are used in the discrepancy analysis process investigated in the next section. 3.2. Results The casing vibration signal and the rotational speed of the healthy vibration data are used with the wavelet packet transform to extract 16-dimensional features as described in Section 2.3. These features are modelled separately with the models under consideration. The different discrepancy measures, denoted by S(b), are subsequently used with the features extracted from the windowed wavelet coefficients of the new data to generate a discrepancy signal associated with each vibration measurement. In this section, three distinct investigations are performed (1) the spectra of the discrepancy signals are investigated for diagnostic information, (2) the sensitivity of the discrepancy signal to damage is investigated, and (3) the sensitivity of the discrepancy signal to changes in operating conditions is investigated as well. 3.2.1. Spectrum of the discrepancy signals The frequency content of the discrepancy signals, where the bearing has a Fault Severity (FS) of 0.02, is shown in Figure 3 for the four investigated models. This is obtained by calculating the spectrum of the discrepancy signal of each measurement in the dataset. The model complexity (e.g. number of mixture components) with the best cross- validation results are presented in the figure where appropriate. GM GMM HMM PCA-R Figure 3: The spectrum of the discrepancy signals obtained from the different models for a bearing with a FS of 0.02 are shown. The damage has a fundamental component at 4.12 shaft orders, with the fundamental component and a few harmonics being seen in the spectra of all the models. However, the spectra are definitely sensitive to changes in operating conditions as seen by the variation of the amplitudes between different operating condition profiles. The spectra of the discrepancy signals generated with the GM, GMM and the HMM contain mostly diagnostic information, whereas the PCA-R spectrum contains a lot of noise which impedes the condition inference process. It is difficult to quantify the robustness of the different models to varying operating conditions and the sensitivity of the models to damage from the spectra and therefore two separate investigations are performed in the next sections to quantify the performance of the models. 3.2.2. Sensitivity to damage The sensitivity of the discrepancy signals obtained from the different models to damage is investigated in this section. The amplitude of the fundamental component of the characteristic frequency (i.e. 4.12 shaft orders) in the spectrum is used to quantify the magnitude of the damage for the different FS values and the different models in this section. 186 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The amplitude of the fundamental components in Figure 4(a) are averaged over the four operating condition profiles, and presented over bearing fault severity. The HMM has the largest discrepancy values with respect to damage, where in contrast the PCA-R and the GM have the smallest discrepancy values. However, it is not desired to quantify the absolute values, but instead to quantify the relative increase in the values as the damage progresses. Therefore, the amplitudes in Figure 4(a) are normalised by the amplitude of the component from a healthy machine, with the results presented in Figure 4(b). It can be observed that the GMM is the most sensitive to damage, while the GM is second most sensitive to damage, performing even better than the HMM. The PCA-R model does not perform very well relative to the other models, however, it successfully detects changes in the bearing’s condition. The GM is very simple relative to the GMM and HMM and may not represent the exact density of the healthy features, however, it is clearly capable of detecting small changes in the spectrum of the machine and is therefore well suited for discrepancy analysis. (a) (b) Figure 4: The sensitivity of the models to bearing damage are compared. The amplitude (a) and normalised amplitude (b) of the fundamental ball-pass outer race component are shown over fault severity. 3.2.3. Sensitivity to operating conditions The discrepancy signal of a healthy gearbox is presented over rotational speed in Figure 5 for the different models. The conditional mean and variance of the discrepancy signal, given the rotational speed, varies over rotational speed. The GM, GMM and the PCA-R models contain very similar characteristics, however, the HMM’s statistics vary significantly over rotational speed. GM GMM HMM PCA-R Figure 5: The discrepancy signal of a healthy gearbox versus the rotational speed is presented for the different models. 187 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management It is desired to quantify the sensitivity of the discrepancy signal to rotational speed and therefore the conditional mean of the discrepancy signal given the rotational speed is calculated with the procedure used by Ref. [2]. The raw conditional mean results are presented in Figure 6(a). It can be observed that the HMM contains the largest changes in discrepancy value, while the GM and the PCA-R models contain the smallest changes. Similar to the investigation in Section 3.2.2., the normalised discrepancy, obtained from dividing the discrepancy value by the discrepancy value that correspond to the lowest rotational speed, is investigated as well. The results in Figure 6(b) indicate the PCA-R and the GM are the most sensitive to changes in rotational speed, while the GMM is the most robust. Even though the GM results are sensitive to rotational speed variations, it can easily be compensated for by the method proposed in Ref. [2]. (a) (b) Figure 6: The sensitivity of the discrepancy signal to rotational speed is compared for the different models by comparing the conditional mean (a) and the normalised conditional mean (b) of the results in Figure 5. 4. CONCLUSION In this investigation, different models for discrepancy analysis were investigated and compared. The Gaussian mixture model is the most sensitive to damage and the most robust to varying speed conditions and is therefore best suited for discrepancy analysis according to this investigation. The hidden Markov model is quite sensitive to operating conditions which may be ascribed to the Markov process of the model which struggles to distinguish between changes in machine conditions and operating conditions. The Gaussian model performed very well with the bearing diagnostics task which has the advantage that it can be easily implemented without considering convergence and hyperparameter optimisation issues. Lastly, even though the principal component reconstruction- based model did not perform as well as the other models, the investigation indicated how the reconstruction model framework can be used for discrepancy analysis. More sophisticated reconstruction-based models can be investigated in future work. Future investigations can also compare the different models on experimental data and this investigation can be performed for gear diagnostics as well. REFERENCES [1] Stander, C.J., Heyns, P.S. and Schoombie, W.: Using vibration monitoring for local fault detection on gears operating under fluctuating load conditions. Mechanical Systems and Signal Processing, 16(6), pp.1005-1024 (2002). [2] Schmidt, S., Heyns, P.S. and Gryllias, K.C.: A discrepancy analysis methodology for rolling element bearing diagnostics under variable speed conditions. Submitted to Mechanical Systems and Signal Processing (2017). [3] Heyns, T., Heyns, P.S. and De Villiers, J.P.: Combining synchronous averaging with a Gaussian mixture model novelty detection scheme for vibration- based condition monitoring of a gearbox. Mechanical Systems and Signal Processing, 32, pp.200-215 (2012). [4] Heyns, P.S., Vinson, R. and Heyns, T.: Rotating machine diagnosis using smart feature selection under non-stationary operating conditions. Insight-Non- Destructive Testing and Condition Monitoring, 58(8), pp.417-422 (2016). [5] Schmidt, S., Heyns, P.S. and De Villiers, J.P.: A novelty detection diagnostic methodology for gearboxes operating under fluctuating operating conditions using probabilistic techniques. Mechanical Systems and Signal Processing, 100, pp.152-166 (2018). [6] Antoni, J., Bonnardot, F., Raad, A. and El Badaoui, M.: Cyclostationary modelling of rotating machine vibration signals. Mechanical systems and signal processing, 18(6), pp.1285-1314 (2004). [7] Bishop, C.M., 2006. Pattern recognition and machine learning. Springer (2006). [8] Abboud, D., Antoni, J., Sieg-Zieba, S. and Eltabach, M.: Envelope analysis of rotating machine vibrations in variable speed conditions: As comprehensive treatment. Mechanical Systems and Signal Processing, 84, pp.200-226 (2017). 188 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management A Generalized Synchroextracting Transform for Fast and Strong Frequency Modulated Signal Analysis Peng Chen1, Kesheng Wang 1, Ming J. Zuo1,2 1 School of Mechanical and Electrical Engineering, University of Electronic Science and Technology of China, China 2 Department of Mechanical Engineering, University of Alberta, Canada ABSTRACT High-quality time-frequency representation (TFR) is vital for signal analysis and condition monitoring. The blurred time frequency (TF) energy distribution in TF plane may lead to uncertainty in TFR and inevitably result in ambiguous results. Synchroextracting, with an adaptive and reconstructive natured transform, is used to improve the readability of STFT-based TFR. However, the standard synchroextracting method is based on the assumption that the analyzed signal is purely harmonic, as a result, it is more applicable to deal with signals with small amplitude and slow frequency modulations, whereas, signals with fast and strong nonlinear and non-stationary nature are of not suitable for the current synchroextracting method. To tackle this problem, a generalized synchroextracted transform (GSET) is proposed to cope with fast and strong nonlinear and nonstationary signals. In the proposed method, the generalized Fourier transform is first used to map the original frequency time-varied signal to a constant frequency signal. The constant frequency signal is further analyzed through synchroextracting operation so that the energy concentrated TF ridge can be obtained for a fast and strong nonlinear and nonstationary signal. Numerical simulated case study is provided for the demonstration of the efficiency of the proposed GSET method. Keywords: Time frequency representation (TFR); Instantaneous frequency; Generalized Fourier transform; synchroextracting; short time Fourier transform Corresponding author: Kesheng Wang (keshengwang@uestc.edu.cn) 1. INTRODUCTION Rotating machinery vibration signals [1–3] in industry are commonly analyzed through time frequency analysis (TFA), and such signals may suffer from unstable operational conditions. In this case, these operating conditions may give rise to frequency and amplitude modulated vibrations or nonlinear and nonstationary vibrations, and these vibration signals are often featured with fast and strong frequency time-varying nature. Recently, many TFA methods have been introduced to extend traditional Fourier analysis to deal with nonlinear and nonstationary signals, thus resulting in a body of literature that is referred to [4–8]. The bottleneck of traditional time frequency analysis methods such as short time Fourier transform (STFT), continuous wavelet transforms (CWT) and Wigner- Ville distribution (WVD), to deal with fast and strong frequency modulation natured signal, is to obtain a high- quality time and frequency representation. In this regard, recently, several advanced time frequency analysis methods such as reassignment method (RM) [10], synchrosqueezing transform (SST) [6,11] and synchroextracting transform (SET) [8] have been introduced. RM can provide TFA result with high readability, but it needs to reassign the time frequency spectrogram into an IF ridge along two-dimensional TF directions. It, therefore, loses ability to reconstruct the component of interest, it is a non-reconstruction method. To cope with this shortcoming, Daubechies et al. [6] introduced the SST method, it squeezes the TF coefficients into the IF ridge only in frequency direction. Consequently, SST not only can provide a TFR with a better readability but also enable signal reconstruction. By comparing the traditional methods such as STFT,WT and advanced SST, Iatsenko et al. [12] pointed out that SST also has its drawbacks for signal representation and reconstruction, i.e., it squeezes all the TF coefficients no matter the concentrated TF coefficients are related to signal time-varying features or not. In other words, TF coefficients due to the noise are possible to be squeezed. Based on this work, Yu et al. provide a novel 189 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management post-processing strategy SET [8]. This SEST technique aims to retain the closet TF information which is related to the intrinsic time-varying characteristics of the analyzed signal. It provides a high resolution TFR and a better robustness to noise pollution with signal reconstruction ability. Though SET features so many advantages, it should also be noted that the standard synchroextracting transform method assumes that the analyzed signal is purely harmonic. Therefore, it is more applicable to deal with signals with small amplitude and slow frequency modulations, whereas, signals with fast and strong nonlinear and non-stationary nature may not be handled properly by the current synchroextracting transform method. To overcome this problem, a generalized sysnchroextracting transform (GSET) is proposed in this paper to cope with “strong” nonlinear and nonstationary signals. In the proposed method, the generalized Fourier transform is used to map the original time-varying frequency signal to a constant frequency signal. The constant frequency signal is further analyzed through synchroextracting operation so that the energy concentrated TF ridge can be obtained for a fast and strong nonlinear and nonstationary signal. Numerical simulated case study is provided for the demonstration of the efficiency of the proposed GSET method. 2. THE THEORY OF GENERALIZED SYNCHROEXTRACTING TRANSFORM An enhanced generalized synchroextracting transform (GSET) method is illustrated in this section in detail. In the method, the generalized Fourier transform (GFT) is first used to map the original time-varying frequency signal to an analytic signal with a constant frequency. It can be used to remove smeared TFR by the fast and strong time- varying effect. The constant frequency signal is further analyzed through synchroextracting procedure so that the energy concentrated TF ridge can be obtained for a fast and strong nonlinear and nonstationary signal. As a result, it may result in an energy concentrated TFR, and avoid the ambiguous time frequency analysis results. 2.1. Revisit the generalized Fourier Transform The generalized Fourier transform (GFT) proposed was used in [13]. GFT can be employed to map a signal with time-varying instantaneous frequency to an analytic signal with a constant frequency. For a signal ( )x t , the GFT is defined as follows, 02 ( s ( ))x ( ) ( ( )) ( )  + − + − = =  i ft t G Gf F x t x t e dt (1) where 0 ( )s t is a real-valued function depending on time, and it satisfies the evolutionary phase behavior of original signal ( )x t . It is noted that equation (1) can be viewed as the standard Fourier transform to 0 2 s ( ) ( ) −i t x t e . About the inverse GFT, it can be computed from the inverse Fourier transform to x ( )G f , followed by multiplication of the result by 0 2 s ( )i t e i.e., 0 0 2 ( s ( ))1 2 s ( ) 2 ( ) (x ( )) x ( ) = x ( )    + +− − + − = =    i ft t G G G i t i ft G x t F f f e df e f e df (2) If 0x ( ) ( ) −G f f f , then 0 02 ( ( ))( )  += i f t s tx t e . That means that signal ( )x t with the instantaneous frequency (IF) ( )f t will be projected to the desired horizontal GFT frequency ridge ' 0 0= ( ) ( )−f f t s t , where ' 0 0( ) ( ( )) /=s t d s t dt . The above equations (1) and (2) show that the signal ( )x t with time-varying frequency IF ( )f t can be transformed to a constant frequency signal 0 - 2 ( ) ( )= ( ) i s t d t y t e  , where ( ) ( ) ( ( ))y t x t H x t= + , via mapping function 0- 2 ( )i s te  . Because the signal ( )d t is not an analytic signal and in order to remove the effect of negative frequency 190 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management components during the frequency mapping, Hilbert transform is again exploited to generate a new analytic signal ( )= ( )+ ( ( ))z t d t i H d t . Then the maximal overlap discrete wavelet packet transform (MODWPT) is applied to decompose ( )z t into 1 ( )= ( ) J i i z t c t =  . Hence the interested signal component ( )ic t can be obtained by the GFT method. 2.2. Generalized synchroextracting transform (GSET) In this section, the current synchroextracting transform [8] will be reviewed. Assuming that the signal is a purely harmonic signal 2 ( ) 0( ) i tx t A e = i.e., a constant frequency ' 0( )t f = and with invariant amplitude 0( )A t A= , 02 0( ) i f t x t A e = (3) The Fourier transform (FT) of ( )x t can be calculated as, 0( ) ( )x f A f f  =  − (4) To obtain the result of STFT, ( , )gxS t f designates STFT of the original signal ( )x t using a Gaussian analyzed window g , and it can be calculated as [8], 0 + 2 ( ) 0 2 0 0 ( , )= ( ) ( ) = ( ) = ( ) ( ) g i f u t x i f t S t f x u g u t e du A g f f e x t g f f    − −   −  −  −  (5) where g  is the FT of g . To obtain the estimation of local instantaneous frequency, based on equation (5) and it can be measured by the derivative of ( , )gxS t  with respect to t as, '( , ) ( ) ( , ) 1 = arg ( , ) 2 ( , )1 = 2 ( , ) g ins t x g t x g t x g x t f t t f S t f S t f i S t f      = =    (6) where ( , )ins t f is the estimate of instantaneous frequency of ( )x t . To reassign the complex time frequency coefficients based on STFT result, the details of mapping ( , ) ( , ( , ))inst f t t f→ may be referred to [8]. ( , ) ( , ) ( ( , ))x ge x insT t f S t f t f  =  − (7) where ( , )xeT t f is SET result of ( )x t . The original time domain signal ( )x t can be reconstructed by, 191 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management ^ 0 0 ^ 1 ( ) ( , ) , ( , ) ( ) ( ( , )) 1 ( , ) (0) x e ins ins x e x t T t f t f g t f T t f g         =  = = −  − =  (8) For SET method, the local instantaneous frequency (IF) is estimated by equation (6). This equation clearly interprets that IF is calculated by the results of STFT and one order derivation of ( , )gxS t  regarding to time t . As a result, an analyzed signal with slow frequency modulated characteristics can be distinguished by SET method. It also means that, in case of strong frequency modulated signal, these characteristics cannot be identified by the current SET method. For adapting to this strong frequency modulated signal and motivated by the advantage of GFT which can map a time-varying signal to an analytic signal with a constant frequency, the approach of GSET is proposed. To realize GSET, three necessary steps are implemented. Firstly, the GFT is applied to extract the interested signal component and it can be projected into a constant frequency ridge. Secondly, the constant frequency component, then, can be used for SET extraction. The extraction results can be calculated via equations (5) and (6). Lastly, the interested component ( )ic t can be reconstructed by inverse GFT. The proposed GSET procedures are illustrated in Figure 1. Targeted frequency Signal ( )x t Initial phase function ( )= ( )+ ( ( ))y t x t i H x t 0f 0- 2 ( )( ) ( ) i s t d t y t e =  0( )s t ( )= ( )+ ( ( ))z t d t i H d t Obtained the interested component MODWPT filter Obtained reconstructed signal by (6) 1 ( )= ( ) J i i z t c t =  Implement SET to obtain TF ridge ( )ic t ^ ( )ic t0 ( , )ins t f Implement Inverse GFT to obtain Extract TF content from 0 ^ 2 ( ) ( ) i s t iir c t e = ir Figure 1. Flowchart of the proposed GSET method for signal TFA 3. NUMERICAL SIMULATED CASE STUDY To validate the application procedures and effectiveness of GSET method for the fast and strong frequency modulated signal analysis, an indicator to measure the rate of change of the frequency (RCF) per unit time is defined as follows. For a general multi-component signal (MCS) reads 2 ( ) 1 1 ( ) ( )= ( ) i N N i t i i i i y t y t A t e  = = =  (9) where the mono-component ( )iy t satisfy ' ' ' 1( ) ... ( ) ... ( ) 0N it t t      . For all 0t  and a MCS from the equation (9), instantaneous amplitude ( )iA t and the instantaneous frequency ' ( )i t satisfy ( ) 0iA t  , ' ( ) 0i t  , and the RCF can be calculated as, 2 ''( )max( ) max( ( ) )i i d t RCF t dt  = = (10) 192 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management In this section, a numerical simulation signal with two components of RCF=118.43 and 37.89 are used for analysis. The signal is defined as 1 2( )= ( )+ ( )y t y t y t 1 2 ( ) sin(2 (30 1.5cos(2 0.8 ))) ( ) ( ) sin(2 (65 3.0cos(2 ))) y t t t y t y t t t     = +   =  = +  (11) In this study, the sampling frequency and time are set as 4000Hz and 6 seconds, respectively. The time wave and Fourier spectrum are presented in Figure 2 (a) and (b). The ideal TF trajectories are also plotted in Figure 2 (c). (b) (c) (a) 𝑦1(𝑡) 𝑦2(𝑡) Figure 2. (a) Time waveform of multicomponent signal (b) Fourier spectrum (c) Ideal time frequency trajectories 0 1 2 3 4 5 6 Time (s) 0 20 40 60 80 100 F re q ( H z) 0 1 2 3 4 5 6 Time (s) 0 20 40 60 80 100 F re q ( H z) 0 1 2 3 4 5 6 Time (s) 0 20 40 60 80 100 F re q ( H z) 0 1 2 3 4 5 6 Time (s) 0 20 40 60 80 100 F re q ( H z) (a) (b) (c) (d) 0 1 2 3 4 5 6 Time (s) 0 20 40 60 80 100 Fr e q ( H z) (e) Figure 3. Time frequency representations by (a) STFT (b) SST (c) RM (d) SET (e) GSET From the time frequency analysis results of Figure 3, it can be found that the TF nature of component 1( )y t can be precisely described by RM and SET. However, due to the severely smeared TFRs in Figure 3 (a) and (b), TF 193 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management information of 1( )y t and 2 ( )y t cannot be characterized properly. A careful inspection of the component 2 ( )y t with a fast and strong frequency modulation, these four TFAs such as STFT, SST, RM and SET cannot accurately distinguish this TF signature. In addition, because the RCF of component 2 ( )y t at zoned version circled with squared in Figure 3 (d) to the maximum, thus the four aforementioned methods can not deal with this signal component. To overcome these limitations and problems, GSET is proposed. According to the proposed algorithm flowchart illustrated in Figure 1, the first step is to implement the GFT to the interested component and, in this study, it is 2 ( )y t . The GFT result for the interested component 2 ( )y t is shown in Figure 4 (b) in which the frequency modulated 2 ( )y t is transferred into a constant frequency component. Next, because the mapped signal component from 2 ( )y t and 1( )y t are distributed in different frequency region, thus, the interested signal component 2 ( )y t can easily filtered by MODWPT and the result can be seen in Figure 4 (c). Then, the component 2 ( )y t can be restored by inverse GFT (see in Figure 4 (d)). Consequently, the constant frequency signal in Figure 4 (c) is further analyzed through synchroextracting operation so that the energy concentrated TF ridge can be obtained, and then the synechroextracted result is used for the procedure of inverse GFT. At last, the interested component signal 2 ( )y t can be restored by completed algorithm, and the corresponding result is displayed in Figure 5 (a). To evaluate the accuracy of GSET, the results of extracted IF trajectory and phase function 0 ( )s t (referring to flow chat of Figure 1) are superimposed in Figure 5 (a) and Figure 5 (b) for comparisons. It can be found that the extracted TF trajectory is almost the same with the ideal TF ridge which is shown in Figure 2 (c). Utilizing the IF ridge measured by GSET, the extracted IF ridges of three components in equation (11) are depicted in Figure 3 (e). And it is observed that the results of GSET almost perfectly match the ideal TF features shown in Figure 2 (c). Compared with STFT, SST, RM and SET (see in Figure 3 (a-d)), the results of GSET (Figure 3 (e)) feature advantages in TFR with a satisfactory result for characterizing fast and strong frequency modulated signals. 0 1 2 3 4 5 6 Time (s) 0 20 40 60 80 100 Fr e q ( H z) 0 1 2 3 4 5 6 Time (s) 0 500 1000 1500 2000 2500 In st a nt p h a se (r a d ) Ideal phase function Estimated phase function (a) (b) (c) (d) Figure 4 (a) Estimation of instantaneous phase function (b) Extracted the y2(t) component by generalized transform (c) The y2(t) component is separated by MODWPT (d) Restoring the y2(t) component by inverse GFT 194 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management (c) (a) 0 2 4 Time (s) 40 60 80 100 120 F r e q ( H z) Ideal IF trajectory Extracted IF trajectory (b) 0 1 2 3 4 5 6 Time (s) 0 20 40 60 80 100 Fr eq (H z) 0 2 4 Time (s) 20 40 60 80 100 F re q ( H z) Figure 5. (a) Restoring the y2(t) component by GSET (b) Instantaneous frequency estimation result (d) Extracted TF trajectories by GSET 4. CONCLUSIONS In this paper, the method of GSET is introduced. GSET is an extension of the recently proposed synchroextracting transform. The shortcoming of SET in dealing with fast and strong frequency modulated signal is overcome by introducing GFT in the proposed GSET. The simulation studies demonstrate that the proposed GSET exhibits advantages in TFR over traditional methods, STFT, SST, RM and SET. Future experimental studies and practical applications will be further investigated. ACKNOWLEDGEMENTS This work is supported by the National Key Research and Development Program of China (Project No. 2017YFC0108401), Fundamental Research Funds for the Centered University (ZYGX2016J111), National Natural Science Foundation of China (51305067 and 51537010), and the Natural Sciences and Engineering Research Council of Canada (Grant #RGPIN-2015-04897). REFERENCES [1] K.S. Wang, P.S. Heyns, Application of computed order tracking, Vold–Kalman filtering and EMD in rotating machine vibration, Mechanical Systems and Signal Processing. 25 (2011) 416–430. [2] K.S. Wang, P.S. Heyns, The combined use of order tracking techniques for enhanced Fourier analysis of order components, Mechanical Systems and Signal Processing. 25 (2011) 803–811. [3] P. Chen, K. Wang, K. Feng, Application of order-tracking holospectrum to cracked rotor fault diagnostics under nonstationary conditions, in: Prognostics and System Health Management Conference, 2017: pp. 1–6. [4] F. Auger, P. Flandrin, Improving the readability of time-frequency and time-scale representations by the reassignment method, IEEE Transactions on Signal Processing. 43 (1995) 1068–1089. 195 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [5] F. Hlawatsch, G.F. Boudreauxbartels, Linear and quadratic time-frequency signal representations, IEEE Signal Processing Magazine. 9 (1992) 21–67. [6] I. Daubechies, J. Lu, H.-T. Wu, Synchrosqueezed wavelet transforms: An empirical mode decomposition-like tool, Applied and Computational Harmonic Analysis. 30 (2011) 243–261. [7] I. Daubechies, Y. (Grace) Wang, H. Wu, ConceFT: concentration of frequency and time via a multitapered synchrosqueezed transform, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences. 374 (2016) 20150193. [8] G. Yu, M. Yu, C. Xu, Synchroextracting Transform, IEEE Transactions on Industrial Electronics. 64 (2017) 8042–8054. [9] F. Auger, P. Flandrin, Y.-T. Lin, S. McLaughlin, S. Meignen, T. Oberlin, H.-T. Wu, Time-Frequency Reassignment and Synchrosqueezing: An Overview, IEEE Signal Processing Magazine. 30 (2013) 32–41. [10] K. Kodera, R. Gendrin, C. Villedary, Analysis of time-varying signals with small BT values, IEEE Transactions on Acoustics Speech & Signal Processing. 26 (2003) 64–76. [11] G. Thakur, E. Brevdo, N.S. Fučkar, H.T. Wu, The Synchrosqueezing algorithm for time-varying spectral analysis: Robustness properties and new paleoclimate applications, Signal Processing. 93 (2013) 1079–1094. [12] D. Iatsenko, P.V.E. McClintock, A. Stefanovska, Nonlinear mode decomposition: A noise-robust, adaptive decomposition method, Physical Review E. 92 (2015). [13] S. Olhede, A.T. Walden, A generalized demodulation approach to time-frequency projections for multicomponent signals, Proceedings Mathematical Physical & Engineering Sciences. 461 (2005) 2159–2179. 196 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management A Deep Statistical Feature Learning Method Based on Stacked Auto- Encoder for Intelligent Diagnosis of Rolling Bearing Te Han1,2, Quan Long2, Chao Liu1,3, Dongxiang Jiang1,2 1 State Key Laboratory of Power System and Generation Equipment, Tsinghua University, Beijing 2 Department of Energy and Power Engineering, Tsinghua University, Beijing 3 Key Laboratory for Thermal Science and Power Engineering of Ministry of Education, Tsinghua University, Beijing ABSTRACT To avoid redundancy and interferential information, feature fusion and dimension reduction is a critical step to improve the recognition accuracy in the fault diagnosis of mechanical system. The paper presents a novel feature fusion method based on one kind of deep neural networks (DNN), namely stacked auto-encoder (SAE). First, 29 popular statistical features in time domain and frequency domain are extracted from the raw vibration signals to form a statistical feature set. Then, the SAE is utilized for deep statistical feature learning to reduce redundancy. With the aid of supervised fine-tuning, the feature learning ability can be further promoted, and the diagnostic results can be given by the top-layer classifier of SAE. An experiment analysis using rolling bearing fault dataset indicates that SAE-based method possesses superior feature learning ability in comparison with conventional dimension reduction algorithm. Meanwhile, contrast results demonstrate that the proposed method can achieve higher recognition accuracy. Keywords: Deep Learning, Stacked Auto-Encoder, Feature Learning, Intelligent Fault diagnosis, Rolling Bearing. 1. INTRODUCTION Rolling bearing is a critical component in rotating machinery and its health conditions have a significant influence on the performance and vibrating noise of mechanical equipment. Due to the harsh working condition, rolling bearing is prone to failure, which may lead to serious accidents. Given this, developing the effective fault diagnosis techniques is beneficial to enhance the security and reliability of rotating machinery (Yuan and Lu, 2017; Han and Jiang, 2016; Han et al., 2016). In the intelligent diagnosis approaches, the procedure mainly include the three key stages: (1) signal pre- processing; (2) feature extraction and (3) fault classification (Han et al., 2017). In the stage of feature extraction, most of the extraction algorithms have been explored in the time domain, frequency domain, and time-frequency domain. Different feature may have different sensibility to bearing faults. Moreover, the extracted features inevitably contain useless and redundant information, which brings difficulties to fault recognition and classification. Therefore, the feature selection based optimization has a great significance (Zhang et al., 2016). The most popular feature selection methods can be mainly classified into: filter approach, wrapper approach, hybrid scheme and dimension reduction strategy (Liu et al., 2014). Shen et al. (2012) established a distance evaluation technique (DET)-based filter approach method to select salient features from 116 time- and frequency domain features. Liu et al. (2016) combined the filter and wrapper approaches to search optimal feature space for diagnosing the engine operating states. Qin et al. (2016) applied the principal component analysis (PCA) to reduce the dimension of 16-dimensional primitive inputs in the diagnosis of aileron actuators. Yang and Jiang (2017) utilized the nonlinear local linear embedding (LLE) for dimension reduction from the feature parameters in time- and frequency domain. All these techniques achieve several successful cases, while they all suffer from their own defects. For instance, the evaluation metrics in filter approach is difficult to be determined; the wrapper approach 197 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management will cause high computational burden; nonlinear dimension reduction algorithm always need extra optimization of structural parameters. In recent years, deep neural networks (DNN) has received considerable attention in the felid of diagnostics (Lu et al., 2017; Lu et al., 2017). Deep convolutional neural networks (DCNN) was utilized for multi-sensor data fusion in the fault diagnosis of planetary gearbox (Jing et al., 2017). An optimized DCNN was applied for extracting features from raw vibration signal of bearing faults (Zhang et al., 2017). A stacked auto-encoder (SAE)-based deep fault recognizer model was established to automatically mine the essential feature from frequency spectra of fault signal (Guo et al., 2016). In essence, DNN can be treated as one kind of nonlinear transformation, converting high- dimensional feature to low dimension (Guo et al., 2016; Shao et al., 2015). And further, the signal pattern can be legibly reflected by the learned feature in low dimension. In this paper, we will introduce the novel SAE to deep statistical feature learning from a 29 statistical features in time domain and frequency domain, and diagnose the bearing faults with the fused features. The remaining part is organized as follows. In section 2, the principles of SAE and the proposed framework are introduced. In section 3, a case study using bearing fault datasets is conducted. A comparative analysis will be given to demonstrate the effectiveness and superiority of proposed method. Finally, the conclusions can be drawn in section 4. 2. INTELLIGENT DIAGNOSIS METHOD USING PROPOSED METHOD 2.1.Brief Introductions of stacked auto-encoder Proposed in 1986, the auto-encoder is a special neural network, whose structure is shown as the left part in figure 1. The auto-encoder contains 3 layers: input layer, hidden layer and output layer. It should be noted that the neuron number of output layer is same as that of input layer. The training process of auto-encoder is similar to a guiding neural network learning process, with the purpose of getting the transformation function y = h_wb (x) using the data as a guide. The training process enables that output is equal to the original input signal, i.e., y=x. The detailed procedure can be generalized as following steps: (1) at the beginning of the training, use the input x and the initialized weights of encoding W^e to get the output value of the hidden layer, namely L2 after encoding process from L1; (2) use the output value of the hidden layer and the initialized weights of decoding W^d to get the estimated value y for the input x, namely L3 after decoding process from L2. (3) according to the deviation of estimated value y and actual value x, train the W^e and W^d by back-propagation (BP). The training algorithm is the classical gradient descent algorithm. Substantially, the auto-encoder is .a kind of neural network, whose output is approximately equal to the input. It is intuitional to find that the output of encoder may not make much sense. On the contrary, the output of hidden layer neurons is valuable, since it can be regarded as another kind of expression of the data. A series of fused features can be learned from original input with the coding process. Furthermore, on the basis of multilayer restricted boltzmann network, that is deep boltzmann machine (DBM), the stacked auto-encoder (SAE) was also proposed in 2006, whose structure is presented in the right part of figure 1. In each layer, only the encoder is retained. At the end of the network, a BP classifier is add for pattern recognition. After the unsupervised pretraining, the fine-tuning is performed as a supervised learning method to promote the feature learning ability. 198 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 1. Illustration of the auto-encoder network and the stacked auto-encoder. 2.2.Proposed framework Table 1. The statistical characteristic parameters in time-domain Number Feature expression Number Feature expression Number Feature expression tp1 tp7 tp13 tp2 tp8 tp14 tp3 tp9 tp15 tp4 tp10 tp16 tp5 tp11 tp6 tp12 Where is a sampling series of raw signal with N points. Table 2. The statistical characteristic parameters in frequency-domain Number Feature expression Number Feature expression Number Feature expression fp1 fp6 fp11 fp2 fp7 fp12 fp3 fp8 fp13 fp4 fp9 fp5 fp10 Where is the frequency spectrum of the discrete signal and means the sequence number of spectrum line; represents the frequency value of the spectrum line. 199 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management In the intelligent diagnosis of rolling bearing, the first step is signal acquisition and pre-processing. In this work, vibration signal is adopted as analysis object because it contains rich information about the bearing condition (Nembhard et al., 2015; Yunusa-Kaltungo et al., 2015). A variety of vibration samples can be collected by the acceleration sensor. Then, the sensitive features need to be extracted from the signal samples so as to diagnose the bearing condition. As listed in Table 1 and 2, 29 popular statistical features are employed, in which 16 features are from time-domain and the other 13 features are from frequency-domain (Lei et al., 2007). These features can describe healthy degree of mechanical components from different aspects. For example, the root mean square (tp2) can reflect the amplitude and energy of vibration signal in time-domain, the kurtosis feature (tp6) is sensitive to the impact characteristics in the signal and the parameter fp 6 may reflect the position change of main frequencies. However, the dimensionality of the feature space is high and these features inevitably contain useless and redundant information, making the signal pattern unrecognizably. Consequently, the SAE-based deep neural network (DNN) is applied to statistical feature learning. The proposed method is a nonlinear dimensionality reduction and feature fusion method. With the learned features, the decision about the bearing condition can be made by the classifier in the end of DNN. The execution steps of proposed method is presented in figure 2. The comparative analysis and the visualization of the feature distribution are conducted in the further case study. Figure 2. The architecture of this work for intelligent diagnosis of rolling bearing. 3. EXPERIMENT AND DISCUSSION 3.1.Experimental setup Table 3. Description of bearing fault dataset Bearing condition Fault diameters (mil) Motor Speed (RPM) Label of classification Sample size Health \ 1797, 1772, 1750, 1730 1 2048 Inner race fault 7, 14, 21 1797, 1772, 1750, 1730 2 Outer race fault 7, 14, 21 1797, 1772, 1750, 1730 3 Ball fault 7, 14, 21 1797, 1772, 1750, 1730 4 200 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management To demonstrate the effectiveness of the DNN-based diagnosis method, a case study using the bearing fault dataset from Case Western Reserve University bearing data center is performed. The test stand consists of a motor, a torque transducer, a dynamometer. The analysis focus on the drive end bearings, which support the motor shaft. Single point faults were introduced to the test bearings using electro-discharge machining with three fault diameters of 7 mils, 14 mils, 21 mils. Four bearing conditions, i.e., health, inner race fault, outer race fault and ball fault, are classified with the corresponding label. In each bearing status, the experiments are carried out under four load conditions, namely 1797rpm, 1772rpm, 1750rpm and 1730rpm. Totally, 40 operating conditions are taken into consideration and 100 samples with the length of 2048 points are collected in each operating condition. In real practical applications, it is difficult to acquire all the labelled training sets under different load conditions. To conform this condition, the samples in 1797 rpm and 1730rpm are used for training, and the others in 1772 rpm and 1750 rpm are used for testing in this study. The waveforms of one sample in the four bearing conditions are presented in figure 3. (a) (b) (c) (d) Figure 3. Vibration waveform of bearing with four states: (a) health; (b) inner race fault; (c) outer race fault; (d) ball fault. 3.2.Results and discussion of rolling bearing fault diagnosis The architecture of the SAE is 29-30-25-20-10-6, that is, the input of the first layer is the 29 statistical features and the hidden neurons in the next few layers are 30, 25, 20, 10 and 6, respectively. In this work, to explore the inner operation conditions, we present the networks visualization for each layer in figure 4. The simple and effective PCA is adopted to generate the principal components of the output features in each layer for training samples and the first two principal components are plotted. Clearly, there are still some overlap areas in the raw 2-D principal components space for the 29 statistical features, although PCA can reduce dimensions according to the sensitive directions of maximal variance. By using SAE-based deep statistical feature learning, it is obvious that the separability of different fault patterns becomes better and better with the increase of layer number. The visualization results demonstrate that DNN not only can efficiently mine the essential features through the nonlinear transformation layer-by-layer, but can avoid the effect of speed fluctuation and crack depth change, presenting a valuable prospect for practical applications. 201 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 4. Visualization of the first two principal components of training samples from each layer. Furthermore, the Table 4 gives the diagnosis results of testing samples using the proposed deep statistical feature learning method. For comparison, two traditional methods are executed, whose results are shown in Table 5 and Table 6. The principal components of raw statistical features by PCA are served as input features and the support vector machine (SVM) is utilized for classification and diagnosis. The contribution rate of adopted principal components is above 90%. Another method uses the raw statistical features as the input features for SVM directly, without prior feature fusion and dimension reduction. The results verify the learned features can remarkably reflect the nature of information hidden in raw statistical features, and further improve the diagnosis results. 202 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 4. Diagnosis accuracy using deep statistical feature learning method Test classes Classification results(%) Accuracy (%) Average (%) Normal OR fault IR fault B fault Normal 100 0 0 0 100 98.07 OR fault 0 96.17 3.83 0 96.17 IR fault 0 0 100 0 100 Ball fault 0 2.33 1.83 95.84 95.84 Table 5. Diagnosis accuracy using statistical features, PCA and SVM Test classes Classification results(%) Accuracy (%) Average (%) Normal OR fault IR fault B fault Normal 100 0 0 0 100 96.37 OR fault 0 87.17 12.83 0 87.17 IR fault 0 0 100 0 100 Ball fault 0 2.33 0.67 97 97 Table 6. Diagnosis accuracy using statistical features and SVM Test classes Classification results(%) Accuracy (%) Average (%) Normal OR fault IR fault B fault Normal 100 0 0 0 100 95.75 OR fault 0 84.33 15.67 0 84.33 IR fault 0 0 100 0 100 Ball fault 0 1.50 2.33 96.17 96.17 4. CONCLUSIONS With the purpose of avoiding redundancy and interferential information, a novel deep neural network (DNN) whose structure is stacked by five hidden layers of auto-encoder is proposed for feature learning from 29 raw statistical features. The proposed method is a nonlinear dimensionality reduction and feature fusion method. With the aid of the fine-tuning after the unsupervised pretraining, this method is also performed as a supervised learning process. A case study using a bearing fault dataset is conducted to demonstrate the effectiveness of this method. The networks visualization in each layer illustrates the excellent ability for capturing essential features. To make a quantitative comparative analysis, the diagnosis result of deep statistical feature learning method and another two conventional diagnosis methods are presented. The diagnosis accuracy further verify the superior fault diagnosis ability of proposed method. REFERENCES [1] Yuan H and Lu C. (2017) Rolling bearing fault diagnosis under fluctuant conditions based on compressed sensing. Structural Control and Health Monitoring 24(5): e1918. [2] Han T and Jiang D. (2016) Rolling Bearing Fault Diagnostic Method Based on VMD-AR Model and Random Forest Classifier. Shock and Vibration 2016: 1-11. [3] Han T, Jiang D and Wang N. (2016) The Fault Feature Extraction of Rolling Bearing Based on EMD and Difference Spectrum of Singular Value. Shock and Vibration 2016: 1-14. 203 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [4] Han T, Jiang D, Zhang X, et al. (2017) Intelligent Diagnosis Method for Rotating Machinery Using Dictionary Learning and Singular Value Decomposition. Sensors 17(4). [5] Liu C, Jiang D and Yang W. (2014) Global geometric similarity scheme for feature selection in fault diagnosis. Expert Systems with Applications 41(8): 3585-3595. [6] Zhang X, Jiang D, Han T, et al. (2016) Feature Dimension Reduction Method of Rolling Bearing Based on Quantum Genetic Algorithm. In: Zuo MJ, Xing L, Li Z, et al. 2016 Prognostics and System Health Management Conference. [7] Shen Z, Chen X, Zhang X, et al. (2012) A novel intelligent gear fault diagnosis model based on EMD and multi-class TSVM. Measurement 45(1): 30-40. [8] Liu P-y, Li B, Han C-e, et al. (2016) Feature Extraction and Selection Scheme for Intelligent Engine Fault Diagnosis Based on 2DNMF, Mutual Information, and NSGA-II. Shock and Vibration 2016: 1-13. [9] Qin W-L, Zhang W-J and Lu C. (2016) A Method for Aileron Actuator Fault Diagnosis Based on PCA and PGC-SVM. Shock and Vibration 2016: 1-12. [10] Yang Y and Jiang D. (2017) Casing Vibration Fault Diagnosis Based on Variational Mode Decomposition, Local Linear Embedding, and Support Vector Machine. Shock and Vibration 2017: 1-14. [11] Lu C, Wang Z-Y, Qin W-L, et al. (2017) Fault diagnosis of rotary machinery components using a stacked denoising autoencoder-based health state identification. Signal Processing 130: 377-388. [12] Lu C, Wang Z and Zhou B. (2017) Intelligent fault diagnosis of rolling bearing using hierarchical convolutional network based health state classification. Advanced Engineering Informatics 32: 139-151. [13] Jing L, Wang T, Zhao M, et al. (2017) An Adaptive Multi-Sensor Data Fusion Method Based on Deep Convolutional Neural Networks for Fault Diagnosis of Planetary Gearbox. Sensors 17(2). [14] Zhang W, Peng G, Li C, et al. (2017) A New Deep Learning Model for Fault Diagnosis with Good Anti-Noise and Domain Adaptation Ability on Raw Vibration Signals. Sensors 17(2). [15] Guo X, Shen C and Chen L. (2016) Deep Fault Recognizer: An Integrated Model to Denoise and Extract Features for Fault Diagnosis in Rotating Machinery. Applied Sciences 7(1): 41. [16] Guo L, Gao H, Huang H, et al. (2016) Multifeatures Fusion and Nonlinear Dimension Reduction for Intelligent Bearing Condition Monitoring. Shock and Vibration 2016: 1-10. [17] Shao H, Jiang H, Zhang X, et al. (2015) Rolling bearing fault diagnosis using an optimization deep belief network. Measurement Science and Technology 26(11): 115002. [18] Nembhard AD, Sinha JK and Yunusa-Kaltungo A. (2015) Development of a generic rotating machinery fault diagnosis approach insensitive to machine speed and support type. Journal of Sound and Vibration 337: 321-341. [19] Yunusa-Kaltungo A, Sinha JK and Nembhard AD. (2015) A novel fault diagnosis technique for enhancing maintenance and reliability of rotating machines. Structural Health Monitoring 14(6): 604-621. [20] Lei Y, He Z, Zi Y, et al. (2007) Fault diagnosis of rotating machinery based on multiple ANFIS combination with GAs. Mechanical Systems and Signal Processing 21(5): 2280-2294. 204 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Research and Application of Weak Fault Diagnosis Method Based on Asymmetric Potential Stochastic Resonance in Strong Noise Background Zhixing Li 1, Jianguo Wang 2 , Songjiu Han 3 , Jian Kang 4, Zedong Li 5, Furui Shi 6, Wenjing Liu 7 1 School of Mechanical Engineering , Inner Mongolia University of Science and Technology 2 School of Mechanical Engineering , Inner Mongolia University of Science and Technology 3 School of Mechanical Engineering , Inner Mongolia University of Science and Technology 4 School of Mechanical Engineering , Inner Mongolia University of Science and Technology 5 School of Mechanical Engineering , Inner Mongolia University of Science and Technology 6 China University of Petroleum 7 School of Mechanical Engineering , Inner Mongolia University of Science and Technology ABSTRACT When machinery is used for fault diagnosis and status detection, mechanical equipment is often in a strong noise environment, The fault signal is often weak and can easily be submerged in other noise. Therefore, For this reason, this paper studies a method of stochastic resonance based on noise enhancing weak fault signals to detect weak fault signals. In this paper, a stochastic resonance phenomenon based on an asymmetrical potential system is studied. For the symmetry of the traditional stochastic resonance potential model, The shape of the potential model cannot achieve structural optimization problems. Study the effect of asymmetric potential stochastic resonance model on weak signal extraction, The effect of the asymmetric potential stochastic resonance model on weak signal extraction was studied. A non-symmetrical stochastic resonance model is proposed to achieve optimal structure. Comparing this model with traditional classical stochastic resonance. Finally, combined with experimental data of bearing weak fault diagnosis and computer simulations. Numerical simulation is consistent with the approximation theory. Keywords: Strong noise, Stochastic resonance, Weak fault diagnosis, Asymmetric potential function 1. INTRODUCTION With the development of modern industry, the fault diagnosis technology of mechanical equipment has received more and more attention, and it has become an important means and key technology to ensure the stable operation of production systems and improve product quality. Once a device fails to continue to work, it will often affect the normal operation of production, and serious failures may even lead to machine crashes [1-4]. In an industrial system that is more complicated and multiple devices are operating together, the faint fault signals of mechanical equipment will be submerged in a strong noise background and difficult to diagnose [5-7]. Scholars have developed many detection methods and techniques for weak signal research. such as, Wavelet analysis [8-10], singular value decomposition [11-13], empirical mode decomposition [14-16], local mean decomposition [17-19]. These methods can achieve the detection of weak signals, but there are also problems with the noise signal is filtered out together, resulting in weak signal detection result is not ideal. In 1981, Benzi [20] and other scholars first discovered and proposed the phenomenon of stochastic resonance in the interpretation of the periodic recursion of the ice age. The phenomenon of stochastic resonance shows: In a nonlinear system, noise and periodic 205 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management signals are added at the same time, noise is enhanced within a certain range, and the system output signal-to-noise ratio is not reduced but will be enhanced. This has great significance for the detection of weak signals. In the early days, studies on stochastic resonances were based on common bistable studies. The classic SR requires very small parameters, and the actual fault caused a frequency greater than 1 Hz. Therefore, early stochastic resonances are not suitable for fault diagnosis[21,22]. To solve this technical problem. T Wang et al. proposed a scale transformation stochastic resonance method. The SR phenomenon can be achieved under large-parameter conditions[23-25]. In recent years, scholars have further studied stochastic resonance in the field of fault diagnosis. In terms of over-damped SR, Lu et al. [26] proposed an overdamped monostable SR method with Woods-Saxon potential to diagnose early bearing failures. Qiao et al. [27] discussed the influence of multiplicative and additive noises on the potential asymmetry of overdamped SRs. The results show, the asymmetric structure of the potential contributes to enhancing the enhancement of SR in weak feature extraction. Qiao et al. [28] established a new segmented overdamped SR model. To overcome the output saturation of the traditional bistable SR, and by using the proposed method to diagnose further planetary gearbox faults. From the above, it can be seen that over-damped SR is widely used in the field of fault diagnosis. In addition, Lu et al. [29] improved the enhanced performance of the OBSR through the full-wave signal construction and further achieved bearing fault diagnosis. Shi et al. [30] proposed a method based on multi-scale noise-based stochastic resonance tuned weak signal detection for multi- frequency signal detection in colored noise background. In addition, some scholars combine traditional detection methods with stochastic resonance to better achieve the purpose of detection. Such as, Shi et al. [31] proposed a novel weak signal detection method based on wavelet transform and parameter compensation bandpass multi- stability stochastic resonance for the detection of multi-frequency signals under heavy background noise. In summary, Most scholars' researches are based on the classical symmetry potential bistable model, or other symmetric models such as multi-steady state and tri-stable state. However, there are few studies on the characteristics of the asymmetric potential function stochastic resonance. For this reason, this paper proposes a weak fault diagnosis method based on ASR. Compared with existing methods, ASR not only has the property of stochastic resonance with symmetrical potential, but also can independently adjust the position of the barrier, the slope of the potential wall and the potential well, and can obtain a more abundant shape of the potential structure. Therefore, this method solves the problem of local adjustment of the potential function of the existing method from the asymmetry of the potential structure, the optimization of the potential structure is achieved so as to better extract weak fault features in a strong noise background. Section II of this paper model potential characteristics were analyzed to explore the potential effects of various parameters on the model potential model; In the third section, the method of extracting the weak signal of asymmetry potential stochastic resonance is studied. Section IV simulation verification; Section V were bearing experimental verification; Section VI concludes summarized. 2. ASYMMETRIC POTENTIAL STOCHASTIC RESONANCE MODEL The basic principle of stochastic resonance is the synergistic effect of input signals and noise signals in nonlinear systems. As mentioned earlier, The process of detecting the weak signal by stochastic resonance can be described by the movement of particles in the potential well. The main factor that hinders the particle's transition is the shape of the potential. Adjusting parameters of traditional stochastic resonance a , b can achieve the purpose of changing the potential shape. When a is fixed, adjust the size of b , The bigger b is, the smaller the barrier height is, and the smaller the width of the potential wall is. When b is fixed, adjust the size of a , The larger a is, the larger the barrier height is, and the width of the potential wall is correspondingly increased. That is, adjusting the size of a and b , the barrier height changes, and the corresponding potential wall also changes. Similarly, the potential function expression for asymmetric bistable stochastic resonance is as follows: 206 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management               2 0 0 0 1 exp 1 1 exp ( 1) 1 exp ( 1) 1 exp 1 a x b x V x V a x b x                   (1) The smallest position of potential is at 1x . For the sake of convenience, parameters 0x and have been introduced to illustrate the position and height of the potential, respectively. In addition to 0x , the tunable parameters a and b control the slope of the bottom of the well. In order to compare the similarities and differences between the bistable system and the asymmetrical bistable system, the curve of the adjustment function of the different parameters is shown in Figure 1. -2 -1.5 -1 -0.5 0 0.5 1 1.5 2 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 x V (x ) V 0 =0.1 V 0 =0.3 V 0 =0.5 V 0 =1 a=b=1 x 0 =0 (a) -2 -1.5 -1 -0.5 0 0.5 1 1.5 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 x V (x ) x 0 =0 x 0 =0.3 x 0 =0.5 x 0 =0.7 a=0.5 b=3 V 0 =0.5 (b) -2 -1.5 -1 -0.5 0 0.5 1 1.5 2 0 0.5 1 1.5 2 2.5 3 3.5 4 x V (x ) a=0.5 a=1 a=2 a=4 b=V 0 =x 0 =0.5 (c) -2 -1.5 -1 -0.5 0 0.5 1 1.5 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 x V (x ) b=0.5 b=1 b=2 b=4 a=V 0 =x 0 =0.5 (d) Fig. 1 Curves of different parameters to adjust the potential function: (a) The potential model when , , changes; (b) When , the potential model for changes; (c) When , the potential model for changes; (d) When , the potential model for changes; As mentioned above, the asymmetric potential stochastic resonance system also has two potential wells, a potential barrier, fixed a , b and 0x , adjusting the size of the parameter , which can also achieve the adjustment of the barrier height, with the characteristics of classical bistable stochastic resonance, the difference is that the adjustment parameters can also realize the asymmetry of the potential shape. When a , b and are fixed, and the size of 0x is adjusted individually, the barrier height and inclination change, that is, the position of the barrier changes, and the slope of the right potential wall remains basically the same; Fixed b , 0x and , adjust the size of a alone, the barrier height changes, not only the slope of the barrier changes, and the slope of the barrier side remains unchanged. In addition, when a , and 0x are fixed, and the size of b is individually adjusted, not only the barrier height is changed, but also the potential barrier is tilted to the right side and the gradient of the barrier is changed. From the above analysis, it can be seen that the classical bistable stochastic resonance can only adjust the height of the barrier and the width of the potential wall. The asymmetric stochastic resonance can not only regulate the height and position of the barrier. It can also ensure that the slope of one side of the potential barrier and the potential wall remains unchanged, with a richer potential shape and more adjustment characteristics. 207 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3. ASYMMETRY POTENTIAL RANDOM WEAK SIGNAL EXTRACTION METHOD The essence of stochastic resonance is that the particle oscillates periodically between two potential wells under the influence of noise, periodic signal and potential model, so as to achieve the purpose of improving the output of the system. In the actual project, the periodic signal and noise are fixed, and only the shape of the adjustment potential can make the random resonance effect the best. In general, when the barrier height is too high and the distance between the two sides of the wall is too far, the particles consume too much energy and cannot return to their original position from one potential well and can only oscillate within a potential well. “Under resonance”; When the inclinations of the two sides of the potential wall are too large and the barrier height is too low, the particles get too much energy to cause excessive acceleration, which causes the particles to jump back and forth between two potential wells to form an “over resonance”. Therefore, only the best potential shape can enable the particles to periodically oscillate between the two potential wells with the help of weak signal and noise, thus forming the best random resonance effect. As we all know, classical bistable stochastic resonance achieves the best random resonance effect by adjusting system parameters a and b . When the height of the classical stochastic resonance barrier changes, the width of the potential wall also changes. Assuming that the width of the potential wall has reached its optimal state, the height of the barrier may not be the best. Therefore, the final state may not be the best potential shape. However, asymmetric stochastic resonance can not only independently adjust the height of the barrier, guarantee the same characteristics as the traditional stochastic resonance, but also can adjust the slope and the position of the barrier, the slope of the potential wall and the barrier. For example, when the noise is too small, in order to achieve a smooth transition of particles, classical stochastic resonance achieves its goal by reducing the height of the barrier. The asymmetry stochastic resonance can reduce the height of the barrier and at the same time, adjust the slope of the potential barrier and the slope of the potential wall, so that the asymmetric potential resonance has a better effect. Therefore, in order to extract the weak fault feature frequency in the strong noise background, The asymmetric stochastic resonance optimizes the system parameters a , b , 0x , and using the chaotic ant colony algorithm to obtain the best potential shape, and the output signal-to-noise ratio is used to evaluate the output effect. The specific steps are as follows: (1)Signal preprocessing. The original vibration signal simulated or acquired is frequency shifted and scaled to meet the requirements of small parameters. Calculate the time domain waveform, spectrum, and envelope spectrum of the original signal. (2)Parameter optimization. Envelope signal is input to asymmetry potential stochastic resonance system. The chaotic ant colony algorithm is used to optimize the system parameters a , b , 0x , and . The parameter optimization range is set to [0,10]. (3)Calculate the signal-to-noise ratio. All the optimization parameter combination sequences are input to an asymmetric potential random resonance system and the output signal-to-noise ratio is calculated. (4)The parameters are determined. Finding the maximum output signal-to-noise ratio results in the best combination of parameters a , b , 0x , and . (5)Signal processing. The optimal combination parameters are input to the non-symmetric potential stochastic resonance system, and finally the time domain waveform and spectrum of the asymmetric potential stochastic resonance are obtained. 208 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 4. SIMULATION VERIFICATION In order to illustrate the effectiveness of the above ASR method, Firstly, by simulating the bearing signal to verify and construct a simulation signal, the time waveform obtained is shown in Figure 2(a). The frequency spectrum obtained by making FFT changes is shown in Figure 2(b). Due to strong noise interference, it is difficult to obtain characteristic frequencies through spectrum analysis. Then, as shown in Figure 2(c), through envelope analysis, the fault feature frequency cannot be found in the figure. In order to further extract weak fault characteristics, an asymmetric model is established. Using chaos ant colony algorithm to optimize the parameters were , , Then the signal is subjected to a stochastic resonance process through an asymmetric potential model. Figure 3(a) shows the output signal after being processed by random resonance. The final Fourier transform is shown in Figure 3(b). It can be clearly seen from the figure that the movement of the particles between the two potential wells reaches resonance. The Fourier-transformed frequency domain signal is more easily identified at the characteristic frequency, and the characteristic frequency amplitude is 0.04377 higher than the surrounding noise. Therefore, the proposed asymptotic potential stochastic resonance model is effective and feasible. It can enhance the eigenfrequency and make the feature frequency easier to identify. So it can be used for weak feature fault detection. Comparing the classical symmetry stochastic resonance weak feature frequency extraction method, the same simulation signal is processed by the classical symmetric stochastic resonance method. Chaos ant colony algorithm optimizes the potential system parameters a and b , the optimization result is , The time domain signal and frequency spectrum after the stochastic resonance processing are shown in Figure 3(c) and (d). As can be seen in Figure 3(d), the characteristic frequency is also higher than the ambient noise, but the amplitude of the characteristic frequency is only 0.01517 higher than the ambient noise. Therefore, it can be determined that the classical symmetry potential stochastic resonance restricts the change of the potential shape due to the symmetry of the potential and forms the local optimum of the potential shape. Comparing Fig. 3(b) and Fig. 3(d), it is found that the peak value of the characteristic frequency output by the proposed method is significantly enhanced compared with the conventional method, indicating that the phenomenon of stochastic resonance is more obvious. Therefore, the proposed asymmetry potential stochastic resonance method is better than traditional methods for detecting weak features. 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 -5 0 5 t/s A /m s -2 0 1000 2000 3000 4000 5000 6000 0 0.05 0.1 f/Hz A /m s -2 0 1000 2000 3000 4000 5000 6000 0 0.02 0.04 0.06 f/Hz A /m s -2 (c) (b) (a) Fig. 2 Simulation signal: (a) Time domain waveform (b) Spectrum (c) Envelope spectrum 209 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 -0.5 0 0.5 t/s A /m s -2 0 1000 2000 3000 4000 5000 6000 0 0.02 0.04 0.06 0.08 0.1 f/Hz A /m s -2 0.04377 (a) (b) 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 -0.4 -0.2 0 0.2 0.4 t/s A /m s -2 0 1000 2000 3000 4000 5000 6000 0 0.01 0.02 0.03 0.04 0.05 f/Hz A /m s -2 0.01517 (c) (d) Fig. 3 Symmetrical potential stochastic resonance method for simulated signals: (a) Time domain waveform (b) Spectrum; Asymmetry potential stochastic resonance method for simulated signal: (c) Time domain waveform (d) Spectrum 5. BEARING TEST VERIFICATION In order to demonstrate the effectiveness of ASR, bearing tests have been used. First of all, In order to meet the requirements of small parameters, the use of frequency-shifting scales to preprocess bearing fault data, the envelope signal is input to the asymmetrical system. The parameters of the asymmetric potential stochastic resonance system obtained by using the chaotic ant colony algorithm are , , The time domain plot and frequency domain plot of the output signal are shown in Figure 4. From the spectrum diagram in Figure 4(b), it can be seen that the frequency of the fault signature is clearly demonstrated, being 0.03857 higher than the ambient noise. Therefore, It can be concluded that the proposed method can be applied to fault diagnosis of bearings. and achieved remarkable results. 0 0.5 1 1.5 2 2.5 3 -0.4 -0.2 0 0.2 0.4 t/s A /m s -2 0 200 400 600 800 1000 1200 0 0.02 0.04 0.06 0.08 0.1 f/Hz A /m s -2 0.03857 (b) (a) Figure 4 Asymmetric potential resonance method for bearing signals: (a) Time domain waveform (b) Spectrum The non-symmetric potential stochastic resonance proposed in this paper not only has the symmetry potential shape adjustment characteristics, but also through multi-parameter optimization. The inclination of the barrier, the multi-angle adjustment of the slope of the potential wall and the barrier can be realized, and a more abundant potential shape can be obtained, and the movement of the particles between the potential wells on both sides can be realized more easily. The proposed method for adaptive non-symmetry stochastic resonance obtains better results in the diagnosis of weak faults in rolling bearings. 210 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 6. CONCLUSION Aiming at the symmetry of the classical classical stochastic resonance potential shape, this paper presents a new method of asymmetry potential stochastic resonance weak fault diagnosis. And applied to weak feature signal extraction under strong noise background, draw the following conclusions: (1) The structural properties of the potential function play a key role in the effect of stochastic resonance. The multi-parameter optimization of asymmetric potential stochastic resonance can make the shape of the stochastic resonance system diversify. Not only can the potential height and potential width be adjusted individually, but also the slope of the barrier, the slope of the potential wall and the barrier, can be independently adjusted. Therefore, the best potential shape can be obtained. (2) Simulation and bearing pitting corrosion experiments show that the proposed method is effective in weak fault diagnosis. REFERENCES [1] [1] N. Sawalhi, R.B. Randall, Vibration response of spalled rolling element bearings: Observations, simulations and signal processing techniques to track the spall size,[J] . Mechanical Systems and Signal Processing, 2011 , 25 (3) :846-870 [2] [2] J.J. Zakrajsek , R.F. Handschuh , H.J. Decker, Application of fault detection techniques to spiral bevel gear fatigue data,[J]Advanced Materials and Process Technology for Mechanical Failure Prevention , 1994 , 14 :93-104 [3] [3] J. Zheng , H. Pan , J. Cheng, Rolling bearing fault detection and diagnosis based on composite multiscale fuzzy entropy and ensemble support vector machines,[J]Mechanical Systems & Signal Processing , 2017 , 85 :746-759 [4] [4] J. Lin , Q .Chen, Fault diagnosis of rolling bearings based on multifractal detrended fluctuation analysis and Mahalanobis distance criterion,[J]Mechanical Systems and Signal Processing , 2013 , 38 (2) :515-533 [5] [5]Z. Li , B. Shi, An adaptive stochastic resonance method for weak fault characteristic extraction in planetary gearbox,[J]Journal of Vibroengineering , 2017 , 19 (3) :1782-1792 [6] [6]S. Lu , Q. He , H. Zhang , F. Kong, Rotating machine fault diagnosis through enhanced stochastic resonance by full-wave signal construction,[J]Mechanical Systems and Signal Processing, 2017 , 85 :82-97 [7] [7]P. Zhou , S. Lu , F. Liu , Y. Liu , G. Li, Novel synthetic index-based adaptive stochastic resonance method and its application in bearing fault diagnosis,[J]Journal of Sound and Vibration , 2017 , 391 :194-210 [8] [8]F. Hemmati , W. Orfali , M.S. GadalaRoller, bearing acoustic signature extraction by wavelet packet transform, applications in fault detection and size estimation,[J]Applied Acoustics , 2016 , 104 :101-118 [9] [9]Z. Gao , J. Lin , X. Wang , X. Xu, Bearing Fault Detection Based on Empirical Wavelet Transform and Correlated Kurtosis by Acoustic Emission,[J]Materials , 2017 , 10 (6) :571 [10] [10]KCD. Kompella , MVG. Rao , R.S. Rao, Bearing fault detection in a 3 phase induction motor using stator current frequency spectral subtraction with various wavelet decomposition techniques,[J]Ain Shams Engineering Journal , 2017 [11] [11]Z. Li , B. Shi, Extracting weak fault characteristics with adaptive singular value decomposition and stochastic resonance,[J]Transactions of the Chinese Society of Agricultural Engineering , 2017 , 33 (11) :60-67 [12] [12]Q. Lu , T. Wang , Z. Li , C. Wang, Detection Method of Series Arcing Fault Based on Wavelet Transform and Singular Value Decomposition,[J]Transactions of China Electrotechnical Society , 2017 , 32 (17) :208-217 [13] [13]R. Golafshan , K.Y. Sanliturk, SVD and Hankel matrix based de-noising approach for ball bearing fault detection and its assessment using artificial faults,[J]Mechanical Systems and Signal Processing , 2016 , s 70–71 :36-50 [14] [14]Z. Li , B. Shi, Research of Fault Diagnosis Based on Sensitive Intrinsic Mode Function Selection of EEMD and Adaptive Stochastic Resonance,[J]Shock and Vibration,2016,(2016-12-4) , 2016 , 2016 (11) :1-12 [15] [15]V.T. Do , C.N. Le, Adaptive Empirical Mode Decomposition for Bearing Fault Detection,[J]Journal of Mechanical Engineering 62(2016)5, 281-290 [16] [16] J. Xiang , Y. Zhong, A new fault detection strategy using the enhancement ensemble empirical mode decomposition,[J]Journal of Physics Conference Series , 2017 , 842 (1) :012002 [17] [17]C. Zhang , Z. Li , C. Hu , S. Chen , J. Wang ,An optimized ensemble local mean decomposition method for fault detection of mechanical components,[J]Measurement Science & Technology , 2017 , 28 (3) :035102 [18] [18]Z. Wu , S.P. Yang , J.C. Zhang, Bearing fault feature extraction method based on LMD adaptive multiscale morphology and energy operator demodulating,[J]Journal of Vibration and Shock , 2016 [19] [19]G. Cheng , H. Li , X. Hu , X. Chen , H. Liu, Fault diagnosis of gearbox based on local mean decomposition and discrete hidden Markov models,[J] ARCHIVE Proceedings of the Institution of Mechanical Engineers,2016 , 231 (14) [20] [20] R. Benzi , G. Parisi , A. Sutera , A. Vulpiani , A Theory of Stochastic Resonance in Climatic Change,[J]. Siam Journal on Applied Mathematics , 1983 , 43 (3) :565-578 [21] [21] Q. He , J. Wang , Y. Liu , D. Dai , F. Kong , Bearing defect diagnosis by stochastic resonance with parameter tuning,[J]Prognostics and System Health Management Conference , 2011 :1-5 [22] [22]J. Hao , D.U. Taihang , C. Jiang , S. Sun , F.U. Chao, Application of parameter-tuning stochastic resonance for detecting weak signal with ultrahigh frequency,[J]Journal of Computer Application》 , 2016 211 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [23] [23] F. Shengbo , T. Wang , Y. Leng , Wang. Wenjin, Detection of Weak Periodic Impact Signals Based on Scale Transformation Stochastic Resonance,[J]China Mechanical Engineering , 2006 , 17 (4) :387-390 [24] [24]Y.G. Leng , T.Y. Wang , L.I. Rui-Xin , Y.S. Peng , X.X. Deng, SCALE TRANSFORMATION STOCHASTIC RESONANCE FOR THE MONITORING AND DIAGNOSIS OF ELECTROMOTOR [25] FAULTS,[J]Proceedings of the Csee , 2003 [26] [25]Y .Zhou , H. Wang , Y. Zuo , M. Yao, Bearing fault diagnosis based on scale transformation stochastic resonance,[J]Journal of Beijing Information Science & Technology Unversity, 2015 , 8916 (1) :36 [27] [26]S. L. Lu, Q.B. He, F.R. Kong. Stochastic resonance with Woods-Saxon potential for rolling element bearing fault diagnosis[J]. Mechanical Systems and Signal Processing, 2014 , 45 (2) :488-503. 
 [28] [27]Z. J. Qiao, Y. G. Lei, J. Lin, et al. Stochastic resonance subject to multiplicative and additive noise: the influence of potential asymmetries[J]. Physical Review E, 2016 , 94 (5-1) :052214. [29] [28]Z. J. Qiao, Y. G. Lei, J. Lin, et al. An adaptive unsaturated bistable stochastic resonance method and its application in mechanical fault diagnosis[J]. Mechanical Systems and Signal Processing, 2017 , 84 :731-746. [30] [29]S. L. Lu, Q. B. He, H. B. Zhang, et al. Rotating machine fault diagnosis through enhanced stochastic resonance by full-wave signal construction[J]. Mechanical Systems and Signal Processing, 2017 , 85 :82-97. [31] [30]P.Shi , X. Ding , D. Han, Study on multi-frequency weak signal detection method based on stochastic resonance tuning by multi-scale noise,[J] Measurement , 2014 , 47 (1) :540-546 [32] [31]D .Han , P. Li , S .An , P. Shi, Multi-frequency weak signal detection based on wavelet transform and parameter compensation band-pass multi-stable stochastic resonance,[J] Mechanical Systems and Signal Processing , 2016 , s 70–71 :995-1010 212 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Fault diagnosis method of rolling element bearing based on relative wavelet packet energy and Hilbert envelope analysis Junchao Guo1, Haiyang Li2, Dong Zhen1,*, Hao Zhang1, Zhanqun Shi1, Fengshou Gu2 1. School of Mechanical Engineering, Hebei University of Technology, Tianjin, 300140, P.R.China 2. Centre for Efficiency and Performance Engineering, University of Huddersfield, HD1 3DH, UK ABSTRACT Rolling element bearings are widely used in rotating machinery, faults occurring in bearings may lead to fatal breakdowns in rotating machinery and such failure can be catastrophic, resulting in costly downtime. Therefore, it is significant to accurately diagnose the existence of faults at an early stage. Vibration signals collected from bearings contain rich information on machine health condition . Here, it is possible to obtain vital characteristic information from vibration signals through the use of advanced signal processing techniques due to their intrinsic advantage of revealing bearing failure. This paper proposes a new study to take advantage of the relative wavelet packet energy (RWPE) and Hilbert envelope analysis. Relative wavelet packet energy (RWPE) can reflect the energy distribution information of the signal in each wavelet packet transform frequency band. Then apply the Hilbert envelope analysis separate the modulation components to extract fault frequency. The results show that the outer ring fault features of the rolling bearing can be clearly identified through simulation and experimental analysis. It verify the proposed method is effective and feasible in the condition monitoring and fault diagnosis for rolling bearing. Keywords: relative wavelet packet energy; Hilbert envelope analysis; rolling element bearings; fault frequency;health condition Corresponding author: d.zhen@hebut.edu.cn 1. INTRODUCTION Rolling element bearings are one of the most important and common components in rotary machines, and their failures can cause both personal damage and economic loss, if the fault cannot be detected and diagnosed well in advance. Therefore they have received much attention in the field of vibration analysis as they represent an area where much can be gained from the early detection of faults. According to the statistics, about 90% of the faults occur in the surface of the inner race, outer race or rolling elements of the rolling element bearing, in particular most of them are local faults [1]. Therefore, an effective signal processing method is desired to provide more evident information for fault diagnosis of rolling element bearings. Signal processing has gone through the development of time domain, frequency domain and time-frequency analysis. Time and frequency domain analysis only can be applied to stationary and linear signals. To deal with non-stationary signals, the time-frequency analysis method that can extract local features in both time and frequency domains simultaneously has been used widely for machinery fault diagnosis. The regular time-frequency analysis methods include short time Fourier transform (STFT), Wigner-Vill distribution (WVD), wavelet pack transform (WPT), empirical mode decomposition (EMD), etc. WPT can be used for multi-scale analysis of a signal through dilation and translation, so it can extract signal features from both time domain and frequency domain effectively. Therefore, WPT has been successfully applied for the condition monitoring and fault diagnosis of electromechanical equipment [2-3]. In recent years, WPT has been used as a popular method in the field of condition monitoring and fault diagnosis [4–7]. Relative wavelet packet energy (RWPE) can reflect the energy distribution information of the signal in each wavelet packet transform frequency band. Therefore, the RWPE of the rolling bearing signal can reflect the bearing in the event of a fault, the energy distribution of the vibration signal can be as a bearing fault diagnosis eigenvector. In this paper, the proposed method integrate the benefits of RWPE with the strengths of the Hilbert envelope analysis to extraction the fault characteristic frequency. Firstly, the wavelet packet is used to decompose the 213 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management vibration signal of rolling element bearing. Then, calculated the RWPE of each frequency band, select the frequency band with high RWPE for signal reconstruction. Finally, the fault feature frequency is extracted by Hilbert envelope analysis. The diagnosis result show that the proposed method can extract the fault characteristic frequency. Which is shown that the efficiency of the proposed method in fault diagnosis for rolling element bearings. 1. RELARIVE WAVELET PACKET ENERGY The WPT has well-known properties of being orthogonal, complete, and local. The basic operation of WPT is to filter the signal being analyzed through a series of low-pass and high-pass filters recursively. A signal can be decomposed into a set of WP nodes with the form of a full binary tree by the WPT. Let 0,0V be a vector space of nR , corresponding to the node 0 of the parent tree. Then at each level the vector space is split into two mutually orthogonal subspaces given by the following equation: 12,12,1, +++ ⊕= kjkjkj UVV (1) where j indicates the level of the tree, and k represents the node index in level j, given by 12,,0 −= jk  . This process is repeated until level J, giving rise to2J mutually orthogonal subspaces. The principle of WPT can be mathematically described as follows. First, the WP function )(, tW kj n can be mathematically expressed as below [8]: )2(2)( 2/ , ktWtW jnjn kj −= (2) where the integers j and k are the scale and translation parameters, respectively; ,1,0=n is the oscillation parameter. The first two wavelet packet functions with 0== kj are the scaling function Φ(t)and mother wavelet function Ψ(t) as below: )()( 0 0,0 tΦtW = (3) )()( 1 0,0 tΨtW = (4) The remaining WP functions for ,3,2=n are defined through the following recursive relationships: )2()(2)( ,1 2 ktWkhtW nk k n −= ∑ (5) )2()(2)( ,1 12 ktWkgtW k n k n −= ∑+ (6) Where )2(),(2/1)( kttkh −= ϕϕ and )2(),(2/1)( kttkg −= ψψ are the filter coefficients of low-pass and high-pass filters respectively, and they are orthogonal with the relationship )1()1()( khkg k −−= . Here, ⋅⋅, represents the inner product operator. The wavelet packet coefficients n kjQ , are obtained by the inner product between the signal )(tx and the wavelet packet functions n kjW , , as below: dttWtxtWxQ nkj n kj n kj )()()(, ,,, ∫ +∞ ∞− == (7) 214 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management where n kjQ , denotes the nth set of WP coefficients at the jth scale parameter and k is the translation parameter. Mathematically, for a discrete signal )(tx with N data points, assume 02nN = , the wavelet coefficients of node ),( nj can be denoted by{ } jnn j kkQ −= 02,,2,1),(  .The signal energy contained in the WP node ),( nj is calcualted as: [ ] 22 1 0 )(∑ − = = jn k n j n j kQE (8) Following the energy, define the average power at each WP node as: [ ] 22 1 0 0 )( 2 1 ∑ − = −= jn k n jjn n j kQAE (9) With this assumation, the signal energy vector for the j th level can be written as below : [ ] [ ] 0000 21 2,,1 12 2 1212 1 n j jnjnjn RAEAEAEAE n j j n jjj × = ××     × ∈           = −−−  ,    (10) where [ ] 12 0 ×− jn n jAE is a 12 0 ×− jn constant matrix with the value of n jAE . For diffenrent layer, the total energy, that is the sum of average power in vector n jAE , will be the same. Put the average power vectors at layer 0 to J together, then deliver the average power matrix as follows: [ ] 02)1(10 ,,, nJTTJTT RAEAEAEAE ×+∈=  (11) As there will be redundant average power values inside a same WP node whithin the average power vector jAE , the size of the matrix AE, ( ) 021 nJ ×+ , can be reduced to be ( ) JJ 21 ×+ by reorganizing the average power vector jAE as [ ] [ ] J j jJjJjJ RAEAEAERAE n j j n jj Jn j 21 2,,1 12 2 1212 102 × = ××     × − ∈           ⋅= −−−  ,    (12) The above reorganization is just like changing the container area of node ),( nj from jn −02 to jJ −2 ,so the average power value is increased. At last, the reorganized average power matrix, also called WPE matrix here, becomes a new one with the size of ( ) JJ 21 ×+ as follows: [ ] JJTT J TT RRAERAERAEWPE 2)1(10 ,,, ×+∈=  (13) where specifically, the final row of the matrix, jRAE , is called the WPE vector that corresponds to the WP nodes energy at the final level J of the WPT. 2. HILBERT ENVELOPE ANALYSIS 215 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The vibration signals of interest have repetitive high frequency impulses as a consequence of interaction between rolling element components and defects. These burst sounds generated by the bearing defects are modulated in amplitude by the sequence of repetitive impacts and by the damping effect. The direct FFT analysis of the signals does not provide much information, since noise and high frequency components caused by movement of rolling elements against each other will be mixed with the characteristic frequencies of bearing faults. These repeating fault frequencies are, however, easily measured in the signal envelope. The envelope detection method provides an important and effective approximation to analyze fault signals in high frequency vibrations. The signal envelope can be calculated by the Hilbert transform [9] which acts as a linear operator on the signal to produce a function with the same domain. Given a signal x(t) in the time domain, the Hilbert transform is the convolution of x(t) with the signal tπ 1 which produces a new signal in the time domain. Hilbert transform can be defined as: x�(t) = 1 𝜋𝜋 ∗ 𝑥(𝑡) = 1 𝜋 ∫ 𝑥(𝜏) 𝜋−𝜏 +∞ −∞ 𝑑𝑑 (14) By coupling the x(t) and )(ˆ tx , analytical signal h(t) can be calculated as, )()()(ˆ)()( tjetatxjtxth ϕ=+= (15) where a(t) is the envelope of the signal )(ˆ)()( 22 txtxta += , (16) 3. SIMULATION ANALYSIS The vibration signature of a rolling element bearing with local defects can be typified by an amplitude modulation process. Taking into account the existence of random sliding phenomenon, rolling element bearing vibration signal can be expressed by the following formula: � 𝑥1(𝑡) = ∑ 𝐴ℎ1(𝑡 − 𝑖𝑓𝑖 − 𝑑𝑖)𝑀𝑚=1 ℎ1(𝑡) = exp (−𝐶1𝑡)cos (2 × 8000𝜋𝑡) )17( Where A is the amplitude of signal, equal to 1, Ci is attenuation coefficient equal to 900, fi is the fault characteristic frequency (equal to 100 Hz), iτ is used to simulate the randomness caused by slippage, which is subject to a discrete uniform distribution . The simulated data was acquired with a sample rate of 20 kHz and the data length is 8192. The time domain waveform of the bearing vibration signal in figure 1. It is difficult to identify the fault of bearing from the time waveform , indicating that a further analysis is necessary. Figure 1 The waveform of the outer ring fault signal The RWPE algorithm is used for the characteristic extraction from the separated fault signals. The wavelet function db 3 is applied, decomposition level is 3, 8 decomposition bands are obtained. The RWPE are obtained, and proportional relationships of the various energy bands are made into a series of histograms. The sum of the height of the histogram is 1, therefore, the height of each histogram represents the ratio of the frequency band energy to the total energy. figure 2 show the energy distribution proportional histogram of the outer race fault signals. 216 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 2 Frequency energy of the outer ring fault signal As is shown in figure 2, the changes in the ratios of energy distribution in the node [3,3]. (for which the frequency band is 7500–10000 Hz). The energy distribution ratio coefficients of the one band is therefore used as the eigenvalue of the signals of the outer-race fault. The reconstruction signal as shown in figure 3. To the nonlinear modulation components in the vibration signal for the rolling element bearing, it is proposed that the Hilbert envelope analysis to the decompose modulation components. figure 4 illustrate the results of the proposed method. Four dominant frequency spectrum lines located at the harmonics of the outer race bearing rotation frequency (i.e. fo, 2fo, 3fo, 4fo) can be recognized in figure 4. This strongly demonstrates that the fault occurs in the outer race bearing rolling. Therefore, it is reasonable to conclude that the proposed method is more effective in bearing fault diagnosis. Figure 3 The waveform of the reconstructed signal Figure 4 The result of the proposed method 4. EXPERIMENTS VALIDATION To validate the effectiveness of the proposed method in terms of rotating machinery fault diagnosis, rolling element bearings are analyzed. The vibration signal was collected from the bearing test rig is shown in figure 5. It is comprised of a motor, coupling, intermediate shaft, supporting bearings and electrical brake. The vibration signals were acquired by piezoelectric accelerometer, which was mounted in the vertical direction on the motor drive end bearing housing and the sensitivity was 1.04 mV/ms2. Data were sample at 20 kHz. The structural parameters and kinematical parameters of the experiment bearings are listed in Tab.3. 217 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 5. The motor bearing test rig Table1: Structural parameters and kinematical parameters of the experiment bearings Bearing designation Ball numbers d Pitch Diameter 𝐷𝑚 Ball Number z Contact Angle β 9 46.4 9.53 0° 6206ZZ 𝑓𝑜 𝑓𝑖 𝑓𝑏 𝑓𝑐 89.33 130.99 62.42 9.93 The bearing characteristic frequencies of a rolling element bearing can be calculated with Eqs. (18)-(21). The outer race fault frequency fo, the inner race fault frequency fi, the rolling element fault frequency fb, the fundamental cage frequency fc were formulated as follows respectively [10]: fo = z2 (1 − dDm cosβ)fr (18) fi = z2 (1 + dDm cosβ)fr (19) fb = Dmd [1 − ( dDm cosβ)2]fr (20) fc = 12 (1 − dDm cosβ)fr (21) Where z is the number of balls, d is the diameter of the rolling element, Dm is the groove section size, β is the contact angle, fr is the shaft frequency. Figure 6 presents the time domain waveform. It is difficult to identify the fault of gearbox bearing from figure 6. Therefore, indicating that a further analysis is necessary. Figure 6 The waveform of the outer ring fault experimental signal The RWPE algorithm is used for the characteristic extraction from the separated fault signals. The wavelet function db 3 is applied, decomposition level is 3, 8 decomposition bands are obtained. Figure 7 show the energy distribution proportional histogram of the outer race fault signals. 218 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 7 Frequency energy of the outer ring fault experimental signal Figure 8 The waveform of the reconstructed experimental signal As is shown in figure.9, the changes in the ratios of energy distribution in the node [3,1] (for which the frequency band is 2500–5000 Hz), node [3,3] (for which the frequency band is 7500–10000 Hz) the node [3,5] (for which the frequency band is 15000–17500 Hz) and the node [3,7] (for which the frequency band is 17500–20000 Hz). The energy distribution ratio coefficients of the four bands are therefore used as the eigenvalue of the signals of the outer-race fault. The reconstruction signal as shown in Figure. 8. To the nonlinear modulation components in the vibration signal for the rolling element bearing, it is proposed that the Hilbert envelope analysis to the decompose modulation components. Figure 9 illustrate the results of the proposed method. Five dominant frequency spectrum lines located at the harmonics of the outer race bearing rotation frequency (i.e. fo , 2fo , 3fo , 4fo ,5fo ) can be recognized in figure 9. It can be seen that there are numerous frequency peaks associated with the outer race fault and their harmonics. In conclusion, the proposed method is more effective method in rolling element bearing fault diagnosis. Figure 9 The result of the proposed method 6. CONCLUSION This paper presents a new method based on relative wavelet packet energy (RWPE) and Hilbert envelope analysis method for fault diagnosis of rolling element bearing. In the simulation and experimental verification, the proposed method have the ability to extract the bearing fault signatures. It demonstrates that the proposed method can be considered as an effective method for rolling element bearing faults diagnosis. ACKNOWLEDGMENT This research was supported by the National Natural Science Foundation of China (Grant no. 51605133; 51705127) and Hebei Science and Technology Projects of China (Grant no. 17394303D). REFERENCES [1] Dong, Y., Liao, M., Zhang, X., et al. Faults diagnosis of rolling element bearings based on modified morphological method. Mechanical Systems and Signal Processing, 25(4) (2011) 1276-1286. [2] Peng, Z.K., Chu, F.L.. Application of the wavelet transform in machine condition monitoring and fault diagnostics: a review with bibliography. Mechanical Systems and Signal Processing, 18 (2) (2004) 199-221. [3] Xing, Y.F., Wang, Y.S., Shi, L., et al. Sound quality recognition using optimal wavelet-packet transform and artificial neural network methods. Mechanical Systems and Signal Processing, 66 (2015). 219 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [4] Mavaddaty, S., Ahadi, S.M., Seyedin, S. Speech enhancement using sparse dictionary learning in wavelet packet transform domain. Computer Speech and Language, 2017, 44:22-47. [5] Bianchi, D., Mayrhofer, E., Gröschl, M., et al. Wavelet packet transform for detection of single events in acoustic emission signals. Mechanical Systems and Signal Processing, 64-65 (2015) 441-451. [6] He, Q. Vibration signal classification by wavelet packet energy flow manifold learning. Journal of Sound and Vibration, 332 (2013) 1881-1894. [7] Han, L., Li, C.W., Guo, S.L., et al. Feature extraction method of bearing AE signal based on improved FAST-ICA and wavelet packet energy. Mechanical Systems and Signal Processing, 62–63 (2015) 91-99. [8] Wang, Y., Xu, G., Lin. L., et al. Detection of weak transient signals based on wavelet packet transform and manifold learning for rolling element bearing fault diagnosis. Mechanical Systems and Signal Processing, 54-55 (2015) 259-276. [9] Cizek, V. Discrete hilbert transform. Audio Electroacoust, IEEE Trans 1970;18:340–3. [10] Randall, R.B., Antoni, J. Rolling element bearing diagnostics—A tutorial. Mech. Syst. Signal Process. 25 (2011) 485-520. 220 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Application of Adaptive Variable Scale Stochastic Resonance in Bearing Fault Diagnosis Jian-guo WANG 1 ,Chao ZHANG 1 , Yuan-yuan HE 1,2 ,and Teng-fei ZHU 1 1 School of Mechanical Engineering, Inner Mongolia University of Science and Technology, 014010, China 2 Inner Mongolia Tongwei,014010,China ABSTRACT An algorithm of bearing fault diagnosis is proposed in this paper, which is based on human cognitive self-regulating particle swarm optimization and variable-scale stochastic resonance. In the algorithm, an objective function is first determined with the stochastic resonance structural parameters a and b considered. Then the objective function is optimized with self-regulating particle swarm optimization algorithm. Thus the optimal values of a and b corresponding to the maximum of signal-noise ratio are obtained. Bring optimal a and b into variable-size stochastic resonance system and filter out noise signal in bearing vibration signal. Finally, spectrum analysis is applied find the bearing fault characteristic frequency. Experiment results show that this method is effective in bearing fault diagnosis. Key words: particle swarm optimization; stochastic resonance; fault diagnosis Corresponding author: Chao ZHANG (zhanghero123@163.com) 1. INTRODUCTION For the fault diagnosis of rotating machinery, the feature extraction of fault signals is still a hard problem in current fault diagnosis area[1]. Particularly, the effect of traditional signal processing methods in practical applications is not ideal [2]. In 1995, Particle Swarm Optimization (PSO) algorithm was proposed by Kendad et al. [3], which is a self-adaptive evolutionary computing technology based on population search. The algorithm is easy to implement, has fewer parameters and is capable of solving complex optimization tasks effectively[4]. As a new type of stochastic search algorithm, PSO algorithm suffers from a series of problems, such as falling into local extreme points, slow convergence and poor accuracy in the late evolution. In literature [5], an adaptive escape particle swarm optimization algorithm is proposed. The algorithm changes the flying speed of particles in the search space by changing velocity self-adaptively. In literature [6], the cognitive self-regulating particle swarm optimization(SRPSO) algorithm is proposed. The algorithm develops self-regulating and self-recognizing PSO algorithms on the basis of PSO algorithm. Stochastic resonance (SR), which was proposed by Benzi et al. [7], is only applicable to small parameters signals. To expand stochastic resonance to large-parameter signals, Leng Yong-gang [8] proposed a variable-scale stochastic resonance method. By setting an appropriate compression ratio R, stochastic resonance can be used under large-parameter conditions. However, the structural parameters a and b of system are usually fixed to 1 and the system does not have self-adaptation ability. In order to visualize the weak impact signal in the fault signal, stochastic resonance is widely used by scholars to extract weak fault feature signal. Literature [9] proposed an adaptive stochastic resonance method and used SNR as performance index to evaluate the output signal of the stochastic resonance system. This paper presents a stochastic resonance method of SRPSO. First, signal-to-noise ratio of output signal is defined to be the objective function. Then SRPSO is used to optimize the stochastic resonance structural parameters a and b. Finally, the parameters a and b corresponding to the fitness value, which maximize the signal-to-noise ratio, are brought into a scale-scale stochastic resonance to filter noise out of input fault signal. Then the time-frequency diagram is obtained and the fault characteristic frequency of the bearing is searched in the frequency spectrum. 2. SRPSO THEORY SRPSO was built on the basis of PSO algorithm, the speed update equation in SRPSO is as follows: 221 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management )()( 2211 1 t id t gd so id t id t id se id t idi t id XPprcXPprcVwV   (1) where Vid and Xid are the velocity and position of the i-th particle in the d-dimensional space, respectively. t and t1 represent the current and next iteration, respectively. is the optimal position of the i-th particle in d-dimensional space. represents the optimal position found by all the particles in d-dimensional space. c1 and c2 represent the acceleration coefficients. r1 and r2 represent two random numbers distributed uniformly between [0,1]. wi denotes the inertia weight of the i-th particle, which is self-regulated by the optimal particle. se idP is the particle's individual self-awareness, os idP is the particle's global perception. They are defined as follows:         particlesother particles optimal0 particlesother 1 particles optimal0 o , , ; , ,  s id se id PP (2) whereγ is binary (i.e., 0 or 1) depending on the threshold value for defining the confidence. The self-adjusting inertia weight strategy is defined as follows:       particlesOther )( particles Optimal)( , , wtw wtw w i i i  (3) where wi is the current inertia weight, and IterIter FI NN ww w 55.0    where NIter is the number of iterations. wI and wF are the initial and final inertia weights. η is a constant that controls the rate of acceleration (usually set to 1). ɑ obeys the uniform distribution on [0,1], which is the direction of all global optimal positions. Threshold (λ) is set to 0.5 to achieve any choice of directions, the social awareness of particles as follows: mi aif Psoid ,,2,1otherwise0 1        (4) Self-awareness apparently affects the search direction of the particle. But for the optimal particle, it has the best orientation and it will discard social and self-perception. Therefore, the velocity update equation of the optimal particle is as follows: ),( positionbest theis 1 iVwV tidi t id   (5) Other particles: )()( 2211 1 t id t gd so id t id t id t idi t id XPprcXPrcVwV   (6) 3. STOCHASTIC RESONANCE (SR) In the study of stochastic resonance, the expression of potential function is: 42 4 1 2 1 )( bxaxxU  (7) where, a and b are the structural parameters of bistable system. The deterministic dynamic system equation for one-dimensional nonlinear bistable systems without considering noise is: v ) (x U  ba / ba/ ) (x U x Figure1. Potential functions of nonlinear bistable system 222 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management As can be seen in Figure 1, the structural parameters a and b can both change the level of the barrier of the bistable system and also change the distance between the two barriers. If the input signal is a mixed signal of Asin(2πf0t) and gaussian white noise n(t), the langevin equation corresponding to the expression is: )()2sin( 0 3 tntfAbxax dt dx   (8) where n(t) is Gaussian white noise (mean 0, noise intensity D). If we choose a, b and D in the bistable system properly, the output response of the system will satisfy the stochastic resonance condition, that is, the jumping frequency of the particles between the two potential wells reaches the signal frequency and enhances the weak periodic signal. For the random noise reduction effect is not obvious and can only detect the small parameters of the signal ( adiabatic approximation small parameter conditions A<< 1, D<< 1, f0<< 1). The basic idea is as follows: (1) Set a frequency compression ratio R; (2) Define a compressed sampling frequency fsr=fs/R; (3) Set the Runge-Kutta method to calculate the step size h=1/fsr. 4. ADAPTIVE STOCHASTIC RESONANCE METHOD If traditional adaptive optimization methods are chosen to optimize the structural parameters in stochastic resonance, there are some limitations. Therefore, this paper proposes an algorithm of human cognitive self-regulating particle swarm optimization to optimize the structural parameters a and b of stochastic resonance, aiming to make the selection of a and b adaptive. The specific steps of the algorithm proposed in this paper are as follows: (1) Initialization: Set the size of the population and randomly select individuals to form an initial population. (2) Set the objective function: the objective function of the genetic algorithm is defined as the output signal to noise ratio of the system, F(a,b)=SNRout(sr(a,b)). (3) Initialize the position and velocity of the particles in the particle group, and set the particle position as the SR parameter [a,b]. (4) Stochastic resonance denoising: The parameters a and b corresponding to the SNR-maximizing fitness value are brought into variable-scale bi-stable stochastic resonance to perform denoising on the input fault signal. (5) Calculate the frequency spectrum: Calculate the frequency spectrum of fault signal denoised by stochastic resonance. (6) Make a decision: Calculate the fault frequency according to the bearing parameters to determine the bearing fault type. 5. SIGNAL SIMULATION Set the simulation signal as follows: x1(t)=0.6sin(2π30t)+n(t), n(t)= D2 g(t), white noise intensity D is 9.1, the mean is 0, the variance is 1, the sampling frequency fs is 2000 Hz, the number of sampling points N is 2000. And the time-frequency spectrum of the simulation signal is shown in Fig. 2. Figure 2. simulation signal time-domain diagram 0 0.2 0.4 0.6 0.8 1 -15 -10 -5 0 5 10 15 time t/s A m p l i t u d e A / V 223 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management In accordance with the traditional random resonance simulation signal denoising (a = 1, b = 1), the signal output results shown in Figure 3. The output signal of the SR optimized by the SRPSO is shown in Fig. 4. It can be seen from the time-frequency spectrum that the noise reduction effect is improved. The amplitude of the characteristic frequency of the simulated signal is more obvious than in Fig. 3. 0 0.2 0.4 0.6 0.8 1 -4 -2 0 2 4 time t/s A m p li tu d e A / V 0 200 400 600 800 1000 0 0.5 1 Frequencyf/Hz A m p li tu d e A / V 30Hz 30Hz 0 0.2 0.4 0.6 0.8 1 -6 -4 -2 0 2 4 6 time t/s A m p lit u d e A / V 0 200 400 600 800 1000 0 0.5 1 1.5 2 Frequency f/Hz A m p lit u d e A / V 30Hz Figure 3. SR signal SR simulation results Figure 4. SRPSO SR signal output optimization results 6. EXPERIMENTAL ANALYSIS Experimental data comes from the Rotary Machinery Diagnostics Laboratory, bearing model ER-10K deep groove ball bearings, rolling element diameter of 7.9mm, rolling body diameter of 30.3mm, the number of 8, the contact angle is 0°. The test set consists of a 1/3 horsepower 3-phase motor, a coupling, a tachometer and corresponding electrical controls. Bearing fault simulation test rig shown in Figure 5, the motor speed is set to 2400r/min. The accelerometers were mounted on the vertical and radial measuring points of the bearing housing. The vibration signals were collected by an 8-channel ZonicBook/618E data logger with sampling frequency of 2560 Hz. According to the parameters of the bearing, it is calculated that the characteristic frequency of the bearing inner ring and the rolling element fault is as shown in Table 1. Figure 5. bearing failure simulation equipment Table 1. MB ER-10K-type rolling element bearing the characteristic frequency Shaft rotation frequency Inner ring Unit 40 197.73 Hz 6.1 . Bearing inner ring failure analysis Select 5120 data points in the inner ring for analysis. Inner ring fault bearing signal time domain waveform is shown in Fig. 6. Figure 6. inner ring bearing fault signal waveform Traditional random resonance (a=1, b=1) results of noise cancellation is shown in Fig.7,Fig. 8 shows the denoising results of the bearing signals optimized by SRPSO. The optimal parameters are a = 0.001 and b = 0.05. It can be 0 0.5 1 1.5 2 -1 -0.5 0 0.5 1 Time t/s A m p li tu d e / A 224 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management seen from the frequency domain in the figure 8, relative to Figure 7. The frequency of bearing fault signatures is more pronounced and the amplitude is greater. 0 0.5 1 1.5 2 -1 -0.5 0 0.5 1 time t/s A m p li tu d e A /V 0 500 1000 1500 0 0.005 0.01 0.015 0.02 frequency f/Hz A m p li tu d e A /V 0 50 100 150 200 250 0 0.005 0.01 0.015 frequency f/Hz A m p li tu d e A /V 197Hz Y:0.0035 0 0.5 1 1.5 2 -40 -20 0 20 time t/s A m p li tu d e A /V 0 500 1000 1500 0 0.5 1 1.5 frequency f/Hz A m p li tu d e A /V 0 50 100 150 200 250 300 0 0.5 1 1.5 frequency f/Hz A m p li tu d e A /V 197Hz Y:0.35 Figure 7. traditional random noise reduction results Figure 8. SRPSO-optimized stochastic resonance denoising results 7. CONCLUSION This paper studies a fault diagnosis method based on human cognitive self-tuning particle swarm optimization and stochastic resonance algorithm. Conclusions are listed below: (1) SRPSO overcomes artificially selected random resonance parameters a and b. (2) Scaled bistable stochastic resonance algorithm based on SRPSO optimization compared with the traditional stochastic resonance algorithm, denoising effect is better. REFERENCES [1] Dai G P. Mechanical Fault Intelligent Diagnosis Based on EMD-Approximate Entropy And LS-SVM[J]. Journal of Mechanical Strength,2011,33(2):165-169. [2] Zhang C, Chen J S. Diagnosing Faults of Bearings with LMD Approximate Entropy and SVM[J].Mechanical Science andTechnology for Aerospace Engineering, 2012,31(9):1539-1548. [3] KENNEDY J, EBERHARTR C. Particleswarm Optimization[C]//Proc of IEEE International Conference on Neural Networks. Piscataway, NJ:IEEE Press,1995:1942-1948. [4] PARSOPOULOS K E,VRAHATIM N. On the computation of all global minimizers through particle swarm optimization[J]. IEEE Transon Evolutionary Computation,2004, 8(3):211-224. [5] HE Ran,WANG Yong-Ji, WANG Qing,et al. An Improved Particle Swarm Optimization Based on Self-Adaptive Escape Velocity[J].Journal of Software,2005, 16(12):2036-2044. [6] M.R. Tanweer ,S. Suresh, N. Sundararajan. Self Regulating particle swarm optimization algorithm[J].Information Sciences 294 (2015)182-202. [7] Benzi R, Parisi G, Vulpiani A. Theory of stochastic resonance in climatic chaner[J]. SIAM Journal on Applied Mathematics,1983, 43(3):565-578. [8] Leng Y G , Wang T Y. Numerical research of twice sampling stochastic resonance for the detection of a weak signal submerged in a heavy Noise[J]. Acta Physica Sinica,2003,52(10): 2432-2437. [9] ZHANG Z H,WANG D,WANG T Y. Self adaptive step-changed stochastic resonance using partical swarm optimization[J].Journal of Vibration And Shock,2013,32(19):125-130. 225 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Characterization and modelling of a customs operation A.J. Hoffman1 1 School of Electrical, Electronic and Computer Engineering, North-West University, South Africa ABSTRACT Effective risk management is a prerequisite to find an acceptable balance between the objectives of a customs operation and the streamlined flow of goods. This requires the use of well-designed customs risk management models that scrutinize all cargo consignments in cyber space based on the analysis of rich data sets that can be used to accurately determine the risk represented by a cargo consignment without physically stopping it. The use of such models results in much reduced physical inspections without increasing the risk to customs of either losing income or allowing the influx of illegal contraband. It therefore represents a much more optimal compromise between the interests of customs and those of trade, reducing the economic cost to the region and making the region more attractive to global economic partners. In this paper we utilize different classification techniques to recognize patterns in electronic data transacted between customs and trade that characterize the risk attributes of cargo consignments. We subsequently extract models that can be applied in real time to minimize disruption of trade flows while reducing customs risks to below set thresholds. We quantify the impact of a variety in input factors and demonstrate how an optimal set of inputs can be selected to arrive at an effective risk management model. The diagnostic abilities of linear regression, neural network and classification tree techniques to predict both customs stops and infractions before they occur are compared. Keywords: Diagnosis, pattern recognition, customs operations, root cause analysis, input-output modeling. Corresponding author: Alwyn Hoffman (alwyn.hoffman@nwu.ac.za) 1. INTRODUCTION There is very little empirical evidence that customs authorities are using well-designed statistical systems to identify possible high-risk or illegal transactions. The need therefore exists for the development of diagnostic models that can be used by customs authorities to more efficiently target declarations that should be inspected, in the process reducing the cost of unnecessary stops or inspections of compliant shipments. Komarov [1] described the use of risk management systems by Ukrainian customs to create risk profiles to enable the automated selection of high risk transactions by dividing transactions into risk categories: high risk or ‘red’ (representing 4.34% of transactions) are subjected to physical inspections, ‘yellow’ (15.60%) to documentation checks, ‘light green’ (6.48%) to ‘information massage’ while the remaining ‘green’ transactions (73.58%) are not subjected to further checks. Komarov however provided little details about the specific data being used and how the relationships between inputs and risk outcomes are established. In a study applied to Senegalese customs data Laporte [2] applied regression models to calculate a risk outcome that reflects the probability of an infraction on a per transaction basis, using six input variables to predict risk: importer, freight agent, HS classification, origin, provenance and customs regime (Laporte 2011). Laporte claimed that by using this model it was possible to filter high risk cargo consignments to such a level of accuracy that more than 96% of all infractions could be found by inspecting only 20% of consignments. In a subsequent study Davaa and Namsrai [3] extracted a similar model from Mongolian customs data to predict infraction probability, using the following list of input variables: HS classification, importer, country of origin, customs terminal code, customs broker and type of transportation means. The level of risk prediction accuracy that they achieved was not quite as impressive as those obtained by Laporte; their model could classify consignments in such a way that the incidence of infractions increased from 0.05% in the lowest risk category to 0.22% in the highest risk category. 226 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The work of Laporte and others provides evidence of the value of a non-intrusive data analytics based approach to customs risk management. No systematic method or procedure is however described of how to decide which inputs factors to consider for inclusion into a customs risk management model. The cited references furthermore provided no quantified indication of the relationships between the various input factors (e.g. customs office) and the operational customs performance in terms of time delays experienced by commercial trade. In a previous paper we characterized the effectiveness of an existing customs operation in terms of the contribution of specific inputs factors on outcomes as well as the efficiency of current procedures used to select cargo consignments for inspections [4]. This paper will continue our previous work by utilizing input-outcome relationships for customs processes currently applied in South Africa to extract risk models using a variety of empirical modelling techniques, including linear regression, neural networks and decision trees. The specific research questions to be addressed include the quantification of the capability of each identified input factor to predict customs outcomes before these have occurred and a comparison of empirical modelling technique to capture input-output relationships the most effectively. To the best of our knowledge no previous published research have produced accurate answers to the above list of questions in the southern African context. The rest of the paper is structured as follows: In section 2 we describe the data used in the study and in section 3 the process to select suitable explanatory variables to model the outcomes. Section 4 describes the methodology used to extract input-output models while section 5 discusses results and findings. In section 6 we conclude with recommendations for customs operations and suggest future research work. 2. DESCRIPTION OF THE DATA SET The data that was used in this study represents EDI transactions exchanged between SARS Customs and consignors of goods imported into South Africa from September 2014 to September 2016. The available data set includes approximately 3.5 million transactions submitted to SARS Customs over the given time period. For each transaction information was obtained about the nature of the customs declaration as described in Table I below. In addition the data included the customs response codes representing the customs outcomes that we would like to be able to predict (e.g. a decision to stop and inspect a consignment, or finding an infraction), as described in Table II below. From the customs response codes we derived customs outcomes as described in Table III below. All cases that required an amendment were regarded as Infractions. Table I: Input factors explaining nature of declaration Input Factor Number of Categories Examples Customs Office 36 Durban, Cape Town, etc. CPC Code 31 10, 11, 12, etc. Previous CPC Code 23 00, 14, 20, etc. Country of Origin 237 GB, CN, GE, etc. Country of Export 222 GB, CN, GE, etc. Country of Import 197 ZA, ZM, ZW, etc. Transport Code 9 Ocean, Road, Rail, etc. Consignors 310 #0, #1, #7, etc. HS Code 18 Animal, Chemical, etc. Table II: Customs response codes Customs Response Code Description 1 Release 2 Stop for physical inspection at unpack depot or X-ray scanning 4 Refer to Other Governmental Agency (OGA) 6 Reject Declaration 13 Supporting Documents required 26 Request adjustment to declaration 31 Request additional supporting documents 33 Supporting documents received 36 Booked for physical inspection 227 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table III: Description of customs outcomes Outcome Condition Not Stopped no code 2, 36, 13, 31, 4 Stopped code 2 Not Inspected no code 36 Not Amended no code 6 or 26 Amended code 6 or 26 Inspected code 36 Not Amended no code 6 or 26 Amended code 6 or 26 Request Additional Docs code 13 or 33 Not Amended no code 6 or 26 Amended code 6 or 26 Refer Other Government Agency code 4 Not Amended no code 6 or 26 Amended code 6 or 26 3. SELECTION OF EXPLANATORY VARIABLES AND OUTCOMES In the development of empirical classifiers it is important to reduce the set of input factors, and thus the number of model degrees of freedom, as far as possible before the model is trained to help prevent overtraining and subsequent bad generalization capability [5]. We used Analysis of Variation as well as linear correlation analysis to evaluate the respective explanatory variables before we proceeded to extract empirical models. In Table IV we show the results of an ANOVA analysis applied to determine the ability of the explanatory variables in the first column to explain the customs outcomes listed in the top row. The F-statistic measures the ratio of between-class- variations and within-class-variations. A high F-value indicates on average larger differences between categories than within categories, implying that the respective category variable does have the ability to help explain the respective outcome values. In the table it can be seen that for all of the outcomes at least some of the explanatory variables have significance. Duration and Request for Additional Documents produce on average larger F- statistics, indicating that they may be easier to predict using the available explanatory variables. For all of the outcomes the F-values are sufficiently large to justify proceeding with the development of prediction models based on the available set of explanatory variables. Table IV: F-statistics extracted through one way ANOVA Input Factor Duration Req_Add_Docs Stopped Stopped_Inspected Infraction CustomsOffice 1150.8 1356.2 43.6 168.8 82.2 HSCode 892.1 8934.3 32.6 114.1 105.8 TransportCode 2105.4 4533.6 204.1 731.9 499.1 SPSource 206.7 809.5 84.2 234.2 126.5 CpcCode 423.4 352.7 2.6 10.6 93.2 PreviousCpc 118.4 834.1 4.2 22.6 75.8 CountryOfOrigin 50.1 329.2 4.9 13.3 17.8 CountryExport 59.9 352.0 4.8 18.1 17.7 CountryImport 25.4 29.7 1.8 2.0 3.7 In order to perform a direct comparison between the impacts of different input factors on the various outcomes we implemented a linear correlation analysis. As all of the input factors are categorical in nature rather than continuous, this was done in the following manner: i. For each category of each input factor (e.g. Durban or Cape Town for category Customs Office) the accumulated average value of each outcome (e.g. fraction of infractions) was calculated as function of time as 228 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management from the start of the observation period up to the end of each specific month. These averages represent the behavioural characteristics of the respective categories up to that point in time. ii. For each new observation falling in a subsequent month its category memberships were determined (e.g. Customs Office: Durban; HS Code: textiles; Transport Mode: maritime). That observation was then allocated the accumulated averages for the categories to which it belongs as determined at the end of the previous month. Each observation thus inherits the attributes of the categories that it belongs to; as these attributes are continuous variables, it can be used as basis for a correlation analysis with the observed outcomes. Figure 1 displays the correlations of all outcomes with respect to all the input factors. The Consignor identity is the Input Factor with strongest correlation for Requests for Additional Documents, Stop&Inspections and Infractions. The correlations with Infractions are in general however lower compared to the correlations with outcomes that represent customs decisions (Requests Additional Documents, Stop&Inspections and Refer OGA). Figure 1 Correlation between explanatory variables and customs outcomes (a) (b) Figure 2 Aggregate fraction of (a) stopped consignments and (b) infractions predicted vs aggregate fraction of observations selected based on Consignors The ability of a single input factor to correctly classify outcomes is demonstrated in Figure 2 for the case of predicting Stops and Infractions using Consignor as input. These graphs were constructed by predicting outcomes to be true, starting with consignments from those categories that historically displayed the largest true outcome 229 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management levels, and adding more categories until eventually all have been selected. It can be seen that a much higher selection accuracy for stopped consignments will be achieved when selecting a given fraction of total observations, compared to the selection accuracy for infractions. This indicates that when customs select consignments for stops, the identity of the consignor seems to play a very important role, but that the success of this strategy to actually find infractions is not quite as high. 4. EXTRACTING CLASSIFICATION MODELS This classification problem is characterized by the fact that the target data is highly unbalanced – for our data set only about 0.8% on cases contained infractions and only a few percent were selected for some form of scrutiny. If such an unbalanced data set is directly used for training a regression or neural model, the model parameter optimization process will tend to be dominated by to the large number of samples with false outcomes. In order to overcome this problem we used weighted training sets, by duplicating each of the true outcomes to arrive at a balanced training set that still contains all of the original data. We firstly implement both linear regression and logistic regression models, similar to the approach of Laporte [2]. We then proceed to implement a neural network based model as well as a decision tree model to verify if these more sophisticated techniques can improve upon the performance of simpler techniques. 4.1. Linear regression models The same explanatory variables that were used for the correlation analysis in section 3 above were used as inputs for both linear and logistic regression models. We experimented with different numbers of input factors, selecting inputs based on their linear correlations with the outcome being modelled. To convert the continuous output a threshold value is set; model outputs that exceed this threshold are classified as true. Theoretically we would expect logistic regression models to outperform linear regression as the outcomes are categorical in nature. We found that model performance did not improve significantly beyond the use of three input factors. 4.2. Neural network models Neural network models will have benefits over linear regression models should there be significant nonlinear aspects in the input-output relationships [5]. We extracted feed-forward multilayer perception networks that use the same sets of inputs as the regression models; the size of the single hidden layer is adjusted based on the number of inputs, with sigmoidal transfer functions in the hidden layer and linear transfer functions in the output layer. We used a validation set that made up 10% of the training set values; training was terminated at the point where the modelling error on the validation set started to increase. As both the input and hidden layers were restricted to not more than ten nodes each; the overall number of degrees of freedom in the models would be less than 120. Given the size of the population it was not necessary to apply specific regularization techniques as the large training set would help prevent the model from being over fitted to specific training samples. 4.3. Classification tree models For our selected problem the available input data is inherently categorical in nature. For the previous set of modelling techniques we deliberately translated these categorical values into continuous ones as these types of model perform best when fed with continuous input data. It is however also possible to directly use the categorical input values using decision trees. A classification tree consist of many branches: at each branch point one or more categorical variables are used to make a decision regarding which way to branch. Once the probability of a specific outcome is high enough the branch terminates in a true or false outcome. In the training of decision trees care must be taken not to over train, as a sufficiently large number of branching points, each containing an if-then-else rule, will allow any number of unique inputs observations to be correctly classified. Over training is prevented by limiting the maximum number of levels in the tree: a tree with only one level will classify all observations as either ones or zeros; with two levels at least one rule will be applied to separate the two classes; as the number of levels are increased more conditions can be defined to use as basis for 230 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management more refined classification. To practically determine at which level the tree must be terminated, we used a validation set defined similarly as in the case of neural network training sets. An initial tree was extracted using no limit on tree level, and the performance of the tree was tested on the validation set. The number of tree levels was then gradually reduced until the classification performance for the validation set reached its maximum value. All of these trees were then also applied to the test set to verify if the selected tree also performed best on the test set. 5. RESULTS AND DISCUSSION Each of modelling techniques as described above were applied to the available data to predict both the probability of specific customs decisions (e.g. to stop and inspect) and the probability of actually finding an infraction. It must be appreciated that the process that generated the available data set was not the true process that generated the incidence of infractions, but only the process as applied by the respective customs authority. It can be assumed that customs applied a set of rules and procedures to generate decisions to stop, inspect, etc.; once such a decision was implemented an infraction that was potentially present in the respective consignment may or may not have been found. Any deficiency in this process, both to correctly stop or scrutinize risky consignments and to find infractions that are present, will be directly reflected in the data. The models that were extracted mimic the customs process rather than the true incidence of infractions, and should therefore be evaluated primarily on that basis. This is supported by the results in section 3 where it could be seen that there are stronger correlations between input factors and the incidence of specific customs decisions than between the same input factors and the incidence of infractions. Figure 3 Fraction Stopped Consignments correctly identified by logistic regression model as function of fraction selected Outcome accuracies are calculated for both the training and test sets and for both positive (true) and negative (false) outcomes. The impact of gradually changing the threshold level to separate positive and negative outcomes is illustrated in Figure 3; separate graphs display the results for training and test sets, and a separate graph is provide for Positive, Negative and All outcomes. Starting off with a zero threshold level all observations are classified as Positive, with resulting 50% accuracy: all positive target values are classified correctly and all negative target values are classified incorrectly. As this threshold is increased more observations will fall into the Negative category; if the model possesses any ability to correctly classify observations then the classification accuracy for All should increase, reaching a maximum value before decreasing again as the threshold value approaches 1. 231 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The results based on regression and neural network models are shown in Table V below. The coefficients of determination (or R2) are somewhat lower that the figures reported by Laporte; this indicates that for our data set the input-output relationships were somewhat weaker. This coefficient is also lower for the test set compared to the training set, indicating that the models extracted from the training data do not fit the test data quite as well, also in line with expectations. It can be seen that the neural networks perform slightly superior compared to the regression models, indicating that their nonlinear modelling capability is of some benefit in capturing the input- output relationships. In Table VI we show similar results to predict the incidence of customs stops. It can be clearly seen that all techniques perform much better to predict customs stops compared to the case of predicting infractions. This confirms that the modelling techniques do have the ability to capture with reasonable accuracy the underlying behaviour present in the customs decision making process. Table V Classification results for predicting infraction using different model types Model Type Linear Regression Logistic Regression Neural Network Train Test Train Test Train Test R2 0.18 0.15 0.20 0.16 0.20 0.16 Fraction Infractions Found Fraction of Observations Inspected 50 0.32 0.48 0.32 0.48 0.32 0.46 80 0.64 0.74 0.63 0.73 0.65 0.66 90 0.81 0.85 0.81 0.83 0.83 0.75 95 0.90 0.90 0.89 0.90 0.91 0.81 Table VI Classification results for predicting stops using different model types Model Linear Regression Logistic Regression Neural Network Train Test Train Test Train Test R2 0.16 0.26 0.39 0.61 0.43 0.66 Fraction Infractions Found Fraction of Observations Inspected 50 0.27 0.22 0.30 0.21 0.26 0.20 80 0.49 0.34 0.65 0.30 0.49 0.29 90 0.68 0.42 0.81 0.83 0.68 0.39 95 0.74 0.65 0.99 0.53 0.77 0.57 A different approach has to be followed in the case of classification trees, as this technique does not allow a threshold to be adjusted in order to achieve a desired selection accuracy; instead the model performance can be moderated by limiting the allowed tree level. The case with tree level equal to 1 represents the trivial case where all observations are classified as true. As the tree level increases fewer observations from the training set are classified as true; at the same time the fraction of events that are correctly selected also goes up as more branches are added to the tree. The more realistic test for classification trees is the results achieved when applied to the prediction of customs decisions, rather than finding infractions, as the available data reflects the customs processes more accurately than the true presence of infractions. As shown in Table VII below the maximum tree level when extracting the model from the training set was 456. The fraction correctly selected infractions from the test set reaches a maximum value for an optimal tree level of about 20. By selecting 16-17% of consignments 93-94% of customs decisions to stop can be predicted out-of-training-sample. The technique therefore work well on data sets that contains the required level of consistency in input-output relationships. For infractions the results are less accurate: 23% of test set observations are selected to produce a selection accuracy of 58% of infractions. This is slightly superior compared to the best results for the regression and neural network models. We can also compare our results with those of Davaa and Namsrai [3]: their risk engine increased the incidence of infractions from 0.05% in the lowest risk categories to 0.22% (i.e. by approximately a factor of 4) in the highest risk categories. Our classification tree increase the average incidence of infractions from 0.6% (when all observations are selected) to 1.7% in the test set when the optimal tree level is used, i.e. by approximately a factor of 3. We can therefore state that for our data set the performance is comparable to the results achieved by [3]. 232 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table VII Classification results for Stops using Classification Trees Tree Levels 1 2 3 15 20 104 456 Training Fraction Selected 0.00 0.25 0.23 0.17 0.16 0.10 0.08 Fraction Correct 0.51 0.86 0.87 0.89 0.90 0.94 0.96 Fraction Correct No Event 1.00 0.75 0.77 0.83 0.84 0.90 0.92 Fraction Correct Event 0.00 0.97 0.97 0.96 0.96 0.99 0.99 Testing Fraction Selected 0.00 0.17 0.16 0.14 0.13 0.09 0.06 Fraction Correct 0.49 0.89 0.88 0.86 0.86 0.77 0.63 Fraction Correct No Event 1.00 0.83 0.84 0.87 0.87 0.91 0.94 Fraction Correct Event 0.00 0.94 0.93 0.86 0.86 0.63 0.34 6. CONCLUSIONS AND RECOMMENDATIONS We summarize our conclusions and recommendation by addressing the stated research questions. The capability of each input factor to predict customs outcomes before these have occurred are reflected by the results in 3. Consignor identify was clearly the individual explanatory variable that contains the most prediction ability; it is in all likelihood used by the customs authority to select consignments for inspections. Apart from Consignor the other inputs that made a significant contribution includes Country of Origin, Transport Mode (which in the case of our data set is closely related to Customs Office as the two largest customs offices are served by primarily one transport mode each) and CPC Code. Neural networks slightly outperformed regression models, using 3 or 4 inputs. Increasing the number of explanatory variables to a maximum of 9 did not appreciably improve prediction ability. Classification trees provided the best classification accuracies within the set of techniques that were investigated. For all of these techniques the ability to correctly predict the outcome of the customs process was much superior to the ability to predict the incidence of infractions; this was to be expected as the data set was effectively created by the customs decision making process rather than by the true incidence of infractions. From the results in section 5 it is clear that the risk models can improve current customs processes, as they all produced outcome incidence rates in the highest risk categories that are significantly higher than the incidence rates in the lowest risk categories. The level of improvement will depend on the minimum accuracy that is required for finding infractions. Based on our results it is however suspected that the primary limitation to improve the current risk engine is not the quality of the modelling techniques but the quality of the infraction incidence data that was generated by historical customs processes. In terms of the choice of modelling technique, we conclude that if a technique is required that allows accurate control over the fraction consignments selected for infractions, then neural networks combined with setting an optimal risk score threshold value will be the most suitable; if a technique is required that allows the selection of the smallest fraction of consignments for a reasonably high selection accuracy then classification trees may be the best choice. Future work will focus on specific vertical markets that are severely impacted by customs operations. This work will extend the set of data fields extracted and will measure time delays in the process across the entire value chain, rather than focusing on the customs process only. This should indicate in which part of the value chain the most improvements can be achieved based on the ability to predict and detect eventualities at an early stage. REFERENCES [1] Komarov, O. V. (2016) Risk management systems in Customs: the Ukrainian context. World Customs Journal, 10(1), pp. 35 – 44. [2] Laporte, B. (2011) Risk management systems: Using data mining in developing countries' customs administration. World Customs Journal, 5(1), pp. 17-28. [3] Davaa, T. Namsrai, (2015) B. Ways to modernise customs risk management in Mongolia. World Customs Journal, 9(2), pp. 24-37. [4] Hoffman, A.J., Grater, S., Schaap, A., Maree, J. & Bhero, E. 2016. A simulation approach to reconciling customs and trade risk associated with cross-border freight movements. South African Journal of Industrial Engineering, 27(3), pp. 251-264. [5] Bishop, C. (1995) Neural networks for pattern recognition. 233 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The Improvement of Instantaneous Angular Speed Estimation using Signals from a Dual Read Head for Monitoring Planetary Gearboxes Qiang Zeng, Ibrahim Alqatawneh,,Yuandong Xu, Yimin Shao, Fengshou Gu, Andrew Ball ABSTRACT The Instantaneous Angular Speed (IAS) signals obtained from the shaft encoders are influenced by inevitable manufacture and installation errors of encoder wheels. These errors can lead to low accuracy or unsatisfied estimation of IAS and fault diagnostics. To improve the accuracy of IAS estimation, this paper presents a novel IAS extraction method based on a dual read head sensing system. The analysis of common encoder errors shows that the amplitude of error changes with angular positions on the encoder whereas the IAS to be monitored is independent of the angular position. Therefore, a dual read head IAS estimation method is proposed to estimate the erroneous IAS online, and subsequently is taken from overall IAS measurement. In this way, the estimate of rotor IAS is significantly improved for more accurate monitoring and diagnostics. Experimental evaluation shows that the new method allows the small planet fault in a planetary gearbox to be detected over a wide range of operating conditions. Keywords: Instantaneous angular speed; Planetary gearbox; Encoder error modeling; Dual read head; Diagnosis Corresponding author: Qiang Zeng (q.zeng12357@gmail.com) 1. INTRODUCTION Planetary gearboxes (PGs) play a vital role in power transmission systems based on their major advantages including compact structure, high transmission ratio, error tolerance etc., and they are widely used in many important industries such as aerospace vehicles, energy productions, power stations, mining equipment etc. PG failures can result in significantly economic losses and casualties. According to the previous studies in wind turbine failures [1], the PG failure leads to a second long down time. A malfunction PG with crack was founded in a UH-60A helicopter transmission system [2]. Therefore, monitoring the PG health conditions and diagnosing their faults are crucial to maintain production activities and avoid such severe tragedies. Vibration based fault diagnosis methods are widely used in the fixed-axis gearboxes and usually have a satisfied results. However, due to the unique transmission structures of PGs, Amplitude Modulation (AM), Frequency Modulation (FM) and vibration neutralization, conventional vibration based methods presents rich of signal components, making it difficult to identify the most fault correlated one. Many approaches are needed to extract the weak and distorted fault features. A number of parameters such as Four-Order figure of merit (FM4) and NA4, which is developed by calculate the kurtosis of the residual signal, are presented [3] to indicate the PG faults. Autocorrelation based time synchronous average [4], map strategy [5] and data fusion methods [6] are presented to extract the individual vibration signal from planets or sun gears for minimising the influence of the transmission path induced modulations. The attenuation of the transmission through multi-interfaces is suppressed by calculating the ratio before and after the potential damages [7]. Although those post processing method improves the detection performances, it is still presents less satisfied results. IAS signals are directly related to the rotation dynamics and can be obtained by a very cheap encoder. It has been long interested in the field of condition monitoring. In addition, it also has received an extensive attention in fields such as body sensing, navigation, servo control, position control and so on [8-11]. Specifically, IAS signal from a PG has no influences from the transmission paths and the nonlinear attenuation. However, the IAS signal obtained from encoder also has considerable disadvantages. Its accuracy is highly depended to the encoder resolution, the shaft rotating speed and the sampling rate of data acquisition [12]. 234 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Especially, IAS signal can also be seriously deteriorated by the erroneous IAS components caused by the encoder imperfections or errors due to manufacturing, installation and operational deviations. The encoder imperfections can be in the form of slot width variation at the point of encoder read-heads, and can generate jitters at the pulse and erroneous IAS signal in the estimation process. In general, erroneous IAS signals can be correlated to the low frequency errors caused by eccentricity, swash and distortion [13]; medium frequency error induced by graduation errors (pith error profile error) [11]; and high frequency errors from the measurement phase error and amplitude error [11]. A number of encoder calibration methods and error compensation methods were studied in the fields of manufacture fields. In [13, 14], a higher accurate device was used to obtain a compensation data sheet. In [11, 15, 16] a multiple read-head scheme was adopted to eliminate the low frequency errors. It is this multiple read-head scheme is of particular interesting in this study as it is can be potentially implemented online at accepted cost. Therefore, to obtain an accurate IAS estimation, this paper focuses on suppressing the imprecisions induced by encoder errors through a dual read head (DRH) measurement method. It firstly obtains the imprecisions and then taken away from overall IAS estimation. The paper is organized into 5 sections. After this introduction, Section 2 outlines different types of encoder errors among which the periodic ones can cause more effect and should be removed. Based on general characteristics of the error, Section 3 develops the theoretical basis of DRH method and evaluates the DRH performances based on simulated signals. Section 4 verifies the effectiveness of using this IAS based for PG diagnosis. Finally, Section 5 draw main conclusion attained by this study. 2. ENCODER ERROR MODELING For cost-effective and robust operation, IAS based monitoring usually employs a rotatory encoder which is a round disk having a number of teeth or slots uniformly distributed around the circumference of the disk. It can be off the shell products or available in a mechanical system such as engine starting ring. Whichever encoder is used there are always different types of errors: typically in the form of eccentricity and slot or tooth variations, that degrades IAS estimation. As aforementioned, the different types of errors of an encoder can induce imprecisions in different frequency bands. The high frequency content can be suppressed by converting the pulses to a series of rectangle waves and applying a low pass filter for the estimated IAS signal. The low and medium frequency encoder errors are mainly contributed by the encoder manufacture errors and installation errors, which are inevitable especially in the in- house made encoders. These errors can lead to the slot width variation with the encoder. Consequently, these errors can be phase modulated (PM) into the encoder pulse train signal and generate an erroneous IAS signal within the shaft rotating harmonics that is often based for fault diagnosis. 2.1. Eccentric error analysis Figure 1 shows the eccentric and swash errors between an ideal encoder wheel and an abnormal encoder wheel in a geometric manner. The blue dotted line is the ideally position of the encoder wheel, and the black line represents the encoder wheel position deviation due to errors. The geometric centre of the encoder wheel is point O , which is also the ideal rotating centre. The point R is the location of read-head, which pick up pulses when the slots pass over it. r, from O to R , is the radius of the encoder.  is the measured angle pick up by the read-head and the   is the real rotated angle. n  is the normal vector of the ideal encoder wheel position and n  is the normal vector of the swashed encoder wheel position with a swash angle  . In Figure 1 (a), due to the eccentric error, the encoder is slightly deviated from the ideal position and rotating around the centre O . The angle     is the angle error between measured angle  and the real angle   . According to the geometric relation, angle   can be found as: 235 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management / , 1 tan( ) tan tan tan sin 1 tan tan cos / sin 1 cos e e e r r r r                             ........................................................ (1) In addition, the IAS caused by the eccentricity ef can be given as: 2 2 (cos 1) (cos 1) ( ) (1 cos ) (1 cos ) c e e ef c c d d d dt d dt                               , .............................. (2) where c is the constant angular speed,  is the angular speed oscillation. R O    n  n  a O R r  a b R O O   r r  R (a) (b) (c) Figure 1. Encoder errors: (a) wheel eccentric error (b) wheel swash error and (c) swashed encoder’s projection in n  direction 2.2. Swash error analysis The tilt axis of the encoder wheel results in swash errors as shown in Figure 1 (b), in which the blue dotted line is the ideal position of the wheel and the black line is the swashed encoder wheel. The swash angle between the ideal wheel normal direction n  and swashed wheel normal direction n  is  . O is the rotating centre and Oa is the intersection of the two wheels.  and   are measured angle and real angle respectively. R and R are measured encoder reading point and real reading point respectively. In Figure 1 (c), the swashed wheel was projected onto the surface of ideal position, or n  direction, and the projected wheel is an ellipse, where Oa is the semi-major axis and Ob is the projected semi-minor axis. In the ellipse, Oa r and cosOb r  . Therefore, for a measured  , the ( )OR  can be given as: 2 2 2 cos ( ) ( cos cos ) ( sin ) r OR r r        , ....................................................... (3) In OaR , OR Oa r   ; in ORR , OR r  ; in aRR , RR aR  , therefore: 2 2 2 2[ ( )] 2 cos [ ( )]aR r r OR      , ....................................................... (4) In OaR , 2 2 2[ ( )] [ ( )] 2 ( )cosaR r OR rOR      , .................................................... (5) 236 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management For any measured (0,2 ]  , 2 2(0,2 ] [ ( )] [ ( )]aR aR        , therefore: 2 2 ( )cos cos cos cos (cos cos ) (sin ) OR r             , ........................................... (6) The erroneous angular error caused by the swash can be given as: 2 2 cos arccos(cos ) (cos cos ) (sin ) s               ,.......................................... (7) In addition, the erroneous IAS sf due to the swash error can be given as: ( ) (1 cos ) (1 cos ) cos(2 ) ( ,( 1) ) , 0,1,2,3, 2 2 sf c d d k k k d dt                            , .............. (8) M ea su re d I A S e rr or ( ra d /s ) M ea su re d I A S e rr or ( ra d /s ) (a) (b) Figure 2. IAS measured error caused by (a) eccentric error ( 150 / c rad s  , 0.01  ) (b) swash error ( 150 / c rad s  , 1   ) The IAS measured errors are illustrated in Figure 2 (a) and (b). The measured IAS error caused by the wheel eccentric error is related to the rotating speed c and the eccentric ratio  . The frequency of the erroneous IAS signal is the rotating frequency. The measured IAS error caused by the swash is related to rotating speed c and swash angle  . The frequency of the erroneous IAS signal is at twice as much as the shaft rotating frequency. Nevertheless, both types of errors induce additional IAS components at the integer of rotational frequency, which needs to be suppressed to ensure the performance of detection when faults such as misalignment, unbalances etc. appear at such frequencies. 3. DUAL READ-HEAD BASED IAS ESTIMATION 3.1. Dual read-head IAS estimation method To eliminate the measured IAS error caused by the encoder errors, a dual read-head error estimation method is proposed to achieve the encoder error self-estimation without any other device. The schematic diagram of the dual read-heads using optical sensors and the flowchart of the new proposed method are shown in Figure 3. 237 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management start y2(t)y1(t) IAS demodulation IAA estimation [y1(t)-y2(t-△t)]/△t △t=φ0/ωc FFT feature IAS x(t) (a) (b) Figure 3. Dual read-head IAS estimation (a) schematic diagram of the dual read-heads (b) flowchart of the IAS estimation Read-heads #1 and #2 are mounted on the encoder with an angular interval 0 , the smaller the 0 the better estimated result. To eliminate the highest order of the erroneous IAS components caused by encoder errors, the 0 should be smaller than 2 / N , where N is the encoder resolution. The flowchart of new IAS estimation method is illustrated in Figure 3 (b). The shaft IAS ( )x t includes constant part c and oscillation part  . Theoretically, despite the angle interval 0 between the two read-head, the measured shaft rotating IAS ( )x t from read heads are the same, and the measured erroneous IAS caused by encoder errors are ( )e t and ( )e t t  ,which the erroneous IAS of second read-head is delayed t caused by the interval 0 . This method can also applied to the speed varying condition, where the t determining is more complicated. Firstly, the IAS signals 1( )y t and 2 ( )y t are demodulated from the encoder pulse train signals of the dual read-heads. The demodulated IAS signals are given as: 1 2 ( ) ( ) ( ) ( ) ( ) ( ) y t x t e t y t x t e t t        , ................................................................. (9) Secondly, the error IAS delay time t between two read-heads is determined by 0 / ct    . Then the instantaneous angular acceleration (IAA) ( / 2)x t t   is approximately expressed as: 2 1( ) ( )( ) ( )( ) 2 y t t y tt x t t x t x t t t             , .............................................. (10) Finally, the IAS signal is converted from the IAA signal by using the FFT. Therefore, the IAS signal can be estimated as: 2 1 2( ( ))( ) ( )t t j F x t x t F j e       , .............................................................. (11) where F is the Fourier transform, 1F  is the inverse Fourier transform. 3.2. IAS estimation evaluation To evaluate the proposed method, a simple IAS signal model in Eq. (9) is used. The real shaft rotating IAS component includes the speed variation term and the fault IAS term. ( ) sin(2 ) sin(2 )r fx t a f b f   , ......................................................... (12) In addition, consider the encoder wheel eccentric and swash error, the measured error IAS signal can be given as: 238 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3 rad/s, ............................................................................ (13) where a is the amplitude of IAS variation, b is the amplitude of fault IAS variation, c is the erroneous IAS caused by the eccentric and d is the erroneous IAS caused by swash. rf and ff are the rotating frequency and fault frequency respectively. IA S ( ra d /s ) A m p li tu d e (r ad /s ) (a) (b) Figure 4. IAS estimation method evaluation (a) waveform (b) frequency spectra ( 0.03a  , 0.02b  , 0.1c  , 0.15d  , 6.33rf  , 19ff  ) The waveform of the estimated IAS signal is smaller than the original IAS signal as illustrated in Figure 4 (a). Some IAS components were eliminated from the original IAS signal. The frequency spectra in Figure 4 (b) show that the estimated IAS signal retain the speed variation component and the fault component, which are influence free from the encoder errors. As illustrated in the estimated IAS spectrum, the low frequency component, <0.5 Hz, has been contaminated by the IAA integration process. The estimated IAS frequency spectrum has the same frequency components as the simulated IAS, which speed variation amplitude and fault amplitude are 0.03 and 0.02 individually. Therefore, the proposed method can also be used to identify the misalignment or run out errors. 4. EXPERIMENTAL STUDY 4.1. Experiment setup (a) (b) Figure 5. Test rig (a) schematic (b) planet tooth damage A test rig was developed to evaluate the proposed IAS estimation method based on monitoring PG health conditions. A schematic diagram of the test rig is shown in Figure 5 (a). It consists of a DC generator (D), an AC motor (A), the tested PG (PG1), an accompany PG (PG2), three couplings (C) and a 60-tooth encoder (E) made 239 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management from 3-D printing. The DC generator applies loads to the test rig and the maximum load is 360 Nm. PG1 with a transmission ratio of 7.2 and PG2 with a transmission ratio of 5.77 are connected to the carrier shafts by a back-to- back layout. The test rig is driven by the AC motor with a full speed up to 1500 rpm (100%) which can be adjusted by a variable speed controller for different operating conditions. Two cases were studied in this paper. One is the planet gear spalling damage fault as illustrated in Figure 5 (b), which is denoted as PF for the brevity of discussion. The other is the healthy case considered as the baseline, denoted as BL. To verify the performance of IAS based monitoring, the tests were carried out under at 60% of AC motor full speed and four loads (25% 50% 75 % and 90% of AC motor maximum load). Two pulse trains reflecting the effect of encoder tooth passing the two optical readers were acquired at 96 kHz for IAS calculation. The characteristic frequencies of PG1 are fault frequencies, mesh frequency and rotating frequencies. According to the tooth number in PG1: the sun gear tooth number 10sz  , the planet tooth number 26pz  , and the ring gear tooth number 62rz  , the transmission ration is 1 / 7.2r sr z z   and the characteristic frequencies are calculated as shown Table1, based on which the fault diagnostics can be implemented. Table 1: Characteristic frequencies of PG1 Events Expression 60% speed Sun gear rotating frequency srf 18.8Hz Carrier frequency /cr srf f r =0.1389 srf 2.61Hz Mesh frequency ( ) 8.6111m s sr cr srf z f f f   161.85Hz Planet rotating frequency / 0.1923pr m p cr srf f z f f   3.62Hz Planet fault frequency 2( ) 2 0.3312pf pr cr srf f f f    2×6.22=12.44Hz Sun fault frequency 3( ) 3 0.8611sf sr cr srf f f f    3×16.18=48.55Hz 4.2. IAS data analysis 4.2.1. Error reduction evaluation A m pl it ud e (r ad /s ) 0. 06 8 0. 15 5 0. 03 4 0. 13 5 f m Figure 6. IAS spectrum comparison for BL case at 40% speed 90% and load The raw IAS and improved IAS spectra obtained from BL case are illustrated in Figure 6. According to encoder error model in Section 2, the eccentric and swash error will lead to the erroneous IAS, illustrated in Figure 2, which makes the measured IAS amplitude smaller than the real shaft IAS signal. In the experiment, the proposed method compensate this IAS reduction caused by encoder errors. Moreover, the amplitudes at higher orders of rerating frequency n*18.82 Hz have been suppressed significantly. In the meantime, other non-rotating harmonics 240 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management remains unaffected. Therefore, the proposed DRH IAS estimation method efficiently canceled the encoder errors, and pave the way to fault diagnosis. 4.2.2. IAS based detection and diagnosis 18.65 37.3 55.95 74.6 93.25 111.9 130.6 149.2 167.9 0 0.1 0.2 25% load BL PF 18.7 37.4 56.1 74.8 93.5 112.2 130.9 149.6 168.3 0 0.1 0.2 50% load 18.77 37.55 56.32 75.1 93.87 112.6 131.4 150.2 169 0 0.1 0.2 75% load 18.82 37.64 56.47 75.29 94.11 112.9 131.8 150.6 169.4 Frequency (Hz) 0 0.1 0.2 90% load 6.177 12.35 0 5 10-3 25% load BL PF 6.193 12.39 0 5 10-3 50% load 6.218 12.44 0 5 10-3 75% load 6.234 12.47 Frequency (Hz) 0 5 10-3 90% load (a) (b) Figure 7. 60% speed IAS spectra (a) IAS spectra (b) IAS spectra zoomed in 0-14Hz The IAS spectra at the 60% speed are illustrated in Figure 7. It can be seen that the spectra are dominant by sun gear shaft rotating harmonics srnf , mesh frequency mf and sidebands of mesh frequency like m crf f , as shown in Figure 7 (a). In the magnified spectra of Figure 7 (b), the planet fault related frequency component 2 pff , generated by fault planet tooth mesh with sun gear and ring gear separately, has higher value in PF case. This indicates the planet fault occurs in the PG. In addition, other components such as crnf and pff also show visible changes due to the effect of refitting the faulty gear. Nevertheless, these amplitudes are of lower amplitudes, compared with BL, indicating less vibration and improved condition associating these gears. 5. CONCLUSIONS The proposed dual read head IAS estimation method has been investigated in theory and experiment. The encoder error model is presented and concluded that the eccentric and swash error will decrease the real IAS by generate a minus erroneous IAS oscillation at first two rotating frequency harmonics. Both simulation and experiment studies show the great capability in supressing the erroneous IAS caused by encoder imperfections, which exhibit mainly at rotating frequency harmonics, and thereby show the effectiveness and efficiency in monitoring the PG faults where the non-integer rotating harmonics are less influenced. Moreover, the fault signatures associated with the planet gear spalling damages can be identified successfully by the increased amplitudes at characteristic frequencies in the improved IAS signals. ACKNOWLEDGMENT The authors are grateful for the financial support provided by the National Natural Science Foundation of China under Contract No. 51475053 and China Scholarship Council No. 201606050039. 6. REFERENCES 1. Hahn, B., M. Durstewitz, and K. Rohrig, Reliability of Wind Turbine–Experiences of 15 years with 1,500 WTs. Institut für Solare Energieversorgungstechnik (ISET), Verein an der Universität Kassel eV, 2005. 34119. 2. Wu, B., et al. An approach to fault diagnosis of helicopter planetary gears. in AUTOTESTCON 2004. Proceedings. 2004. IEEE. 3. Večeř, P., M. Kreidl, and R. Šmíd, Condition indicators for gearbox condition monitoring systems. Acta Polytechnica, 2005. 45(6). 241 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 4. Ha, J.M., et al., Autocorrelation-based time synchronous averaging for condition monitoring of planetary gearboxes in wind turbines. Mechanical Systems and Signal Processing, 2016. 70: p. 161-175. 5. Liang, X., M.J. Zuo, and L. Liu, A windowing and mapping strategy for gear tooth fault detection of a planetary gearbox. Mechanical Systems and Signal Processing, 2016. 80: p. 445-459. 6. Samuel, P.D. and D.J. Pines. Vibration separation methodology for planetary gear health monitoring. in Smart Structures and Materials 2000: Smart Structures and Integrated Systems. 2000. International Society for Optics and Photonics. 7. Mark, W.D., et al., A simple frequency-domain algorithm for early detection of damaged gear teeth. Mechanical Systems and Signal Processing, 2010. 24(8): p. 2807-2823. 8. Nguyen, K.D., et al., A wearable sensing system for tracking and monitoring of functional arm movement. IEEE/ASME Transactions on mechatronics, 2011. 16(2): p. 213-220. 9. Liu, F., et al., Error analyses and calibration methods with accelerometers for optical angle encoders in rotational inertial navigation systems. Applied optics, 2013. 52(32): p. 7724-7731. 10. Ovaska, S.J. and S. Valiviita, Angular acceleration measurement: A review. IEEE transactions on Instrumentation and Measurement, 1998. 47(5): p. 1211-1217. 11. Mancini, D., et al. Encoder system design: strategies for error compensation. in Telescope Control Systems III. 1998. International Society for Optics and Photonics. 12. Li, Y., et al., The measurement of instantaneous angular speed. Mechanical Systems and Signal Processing, 2005. 19(4): p. 786- 805. 13. Qin, S., Z. Huang, and X. Wang, Optical angular encoder installation error measurement and calibration by ring laser gyroscope. IEEE Transactions on Instrumentation and Measurement, 2010. 59(3): p. 506-511. 14. Filatov, Y.V., et al. Laser goniometer systems for dynamic calibration of optical encoders. in Optical Measurement Systems for Industrial Inspection III. 2003. International Society for Optics and Photonics. 15. Watanabe, T., et al. Automatic high-precision calibration system for angle encoder (II). in Recent Developments in Traceable Dimensional Measurements II. 2003. International Society for Optics and Photonics. 16. Watanabe, T., et al., An angle encoder for super-high resolution and super-high accuracy using SelfA. Measurement Science and Technology, 2014. 25(6): p. 065002. 242 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Novel Bearing Fault Detection using Generative Adversarial Networks S. Baggeröhr1, W. Booyse1, P.S. Heyns 1, D.N. Wilke1 1 Centre of Asset Integrity Management, Department of Mechanical and Aeronautical Engineering, University of Pretoria, South Africa ABSTRACT Bearing fault detection and diagnosis (FDD) are important if one is to avoid the more catastrophic failure consequences of large rotating machinery. Faults usually manifest as marginal defects that intensify over time, allowing for well-informed preventative actions with early FDD. Detection of the fault begins with capturing signals from a machine in various forms, such as vibration response signals. Numerous methods exist to extract either time, frequency or time-frequency based diagnostic features from these signals. Traditionally, handpicked descriptive statistical features (mean, RMS, skewness, Kurtosis, etc.) or spectral diagrams are used for FDD. However, machine signals are usually generated under non-stationary operating conditions of varying loads and speeds, which makes FDD with these methods more challenging. A wealth of research has been invested towards machine learning based signal processing techniques to circumvent the problems associated with non-stationary signals. Many of these methods require vast amounts of historical data to train. Since a machine spends most of its life operating in a healthy condition, most historical data is occupied with data that comes from a healthy machine condition. Training these methods are difficult, due to the shortage of data from a machine running in an unhealthy condition. Generative Adversarial Networks (GANs) are proposed as a novel method to perform condition-based diagnostics on assets. This method has the advantage that it can be trained exclusively with healthy data. The GAN, when trained, learns a statistical manifold on which the healthy data lies. A deviation from this statistical manifold can indicate a fault within an asset. This paper presents a preliminary study on the use of GANs to detect and diagnose faults in bearings. It was found that GANs can detect faults in bearings. Also, by tracking the manifold describing the healthy data, one can locate and assess the type of fault. This method has the added benefit of training on the entire data set, as opposed to supervised learning methods, which require the data to be divided into training, validation and test sets. Keywords: Condition monitoring, Fault detection and localization, Diagnosis, Bearings, Deep learning Corresponding author: Stephan Baggeröhr (u12017095@tuks.co.za) 1. INTRODUCTION Rotating machinery have become a staple in almost all areas of modern technology, for example in manufacturing or power generation, where high reliability has always been a big concern. Rotating machinery are generally supported by bearings, whose failure can lead to entire system shutdown. Consequently, bearings are considered one of an asset’s most critical components. As a result, bearing fault detection and diagnosis (FDD) are important if one is to avoid the more catastrophic failure consequences of large rotating machinery. Faults usually manifest as marginal defects that intensify over time, allowing for well-informed preventative actions with early FDD. As such, FDD has become more popular amongst maintenance engineers to increase machine reliability and availability while reducing the costs associated with maintenance and unnecessary shutdowns. FDD is traditionally broken down into a feature extraction phase and a fault classification phase. The feature extraction phase begins with acquiring signal data from a machine of interest. Signals are captured in various forms such as vibration or acoustic emissions for instance. These signals contain rich information about the condition of the machine, and numerous signal processing techniques are used to extract this information in the form of fault features. Features are generally extracted from the time or frequency domain or a combination of both. A good feature is one that is only sensitive to faults and performs well in signals with a low signal-of-interest (fault) to noise ratio. Statistical features such as Root Mean Square (RMS) magnitude, spectral skewness and Kurtosis are a popular choice amongst engineers [1]. Alternatively, in the frequency domain, spectral diagrams are used to manually track 243 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management fundamental fault frequencies that can be calculated from the bearing geometry. Machines are, however, frequently operated under non-stationary conditions, such as varying speed or load. This makes extracting features with these methods more challenging, as they assume stationarity of a signal [2,3]. As a result, more advanced signal processing techniques are used in conjunction with order tracking to extract features in non-stationary conditions. The second step of FDD involves classifying the fault based on the extracted features. Classification is done by comparing the values of the extracted features for healthy machines with those of unhealthy machines. The fault is identified by comparing the characteristics of the features to those of known fault modes. Once the fault is identified, a degradation metric or fault severity indicator (FSI) is selected and used to trend the fault over a predetermined monitoring interval. The resulting trend is then used to make a prediction about the remaining useful life (RUL) of the machine. The diagnosis is heavily dependent on the types of features extracted during the first phase. Features that are also sensitive to operating conditions may result in a misdiagnosis. The entire FDD process is not static, as feedback from each phase is used to update the process. For example, the signal processing method may be adjusted depending on the fault that is identified. Knowing what types of fault features to use and what fault modes to expect requires expert knowledge or careful physics-based modelling of the system. Both of which can be costly or time-consuming to implement. As a result, data-driven approaches have gained increasing popularity over the years. These models have attempted to replace the second step of FDD by directly mapping the extracted features to fault modes. Because the second step of FDD is essentially a pattern recognition type problem, machine learning approaches have predominantly been used [4]. Majority of these methods require vast amounts of historical data to train. As a machine spends most of its life operating in a healthy condition, most historical data is occupied by data from a healthy machine condition. Training these methods is difficult because of this class imbalance. Machine learning algorithms also require good discriminative features to be extracted, and hence the careful selection of features and signal processing techniques are necessary to create a successful machine learning diagnostic algorithm, especially in non-stationary cases. A proposed framework can become very specific to the machines and faults it was developed for. This makes implementing the proposed framework more challenging when a diverse range of similar assets needs monitoring. More recently, advancements in deep learning allow algorithms more flexibility to find complex hierarchical representative features by exploiting deep network architecture. These algorithms can represent complex functions and as a result, they can extract more discriminative features over a broader range of signals. Generative adversarial networks (GANs) are a type of generative model that exploits deep learning architecture to build a representation of a target distribution, such as the probability density of a vibration waveform from a healthy machine. In this paper, Generative Adversarial Networks have been proposed as a novel method to perform condition-based diagnostics on bearings. This method has the advantage that it can be trained solely on healthy data, requiring no preprocessing or advanced signal processing to extract features. The GAN, when trained, learns the probability density of the healthy data. This probability density can be considered as a low dimensional statistical manifold embedded in the high dimensional space of the problem. A deviation from this statistical manifold can indicate a fault within an asset. Unlike conventional statistical learning methods, such as Expectation Maximization or Variational Autoencoders, which require several assumptions regarding the shape of the data distribution (such as factorized gaussian distribution), a GAN can learn an implicit representation of the data which requires no prior assumptions of the structure of the data. As a result, no labelling of data is required. This paper presents a preliminary study on the use of GANs to detect and diagnose faults in bearings. This method has the added benefit of training on the entire data set, as opposed to supervised learning methods, which require the data to be divided into training, validation and test sets. The rest of the document is laid out as follows. The second section covers the theoretical framework for the proposed model. The third section summarizes the results obtained from preliminary studies on the application of the model. 2. GAN MODEL FOR BEARING FAULT DETECTION 2.1. GAN Framework A generative adversarial network [5] is comprised of two subnetworks, namely a Generator network, 𝑮𝜃(𝒛) and a Discriminator network, 𝑫𝜙(𝒙), where 𝜃 and 𝜙 are trainable network parameters (such as weights and biases) and 244 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 𝒙 is a query sample. Training of the networks is achieved by setting the generator network against the discriminator network in a two player non-cooperative game expressed as the min max optimization problem 𝑉, shown in Eq. 1. The generator network passes random noise, 𝒛~𝑈[0,1] or 𝒛~𝒩[0,1] through the network and produces a sample from a parametrized distribution, 𝑸 that is dependent on 𝜃. The discriminator network tries to estimate the probability that the query sample was either produced by the generator or sampled from a target distribution, 𝑷𝒅𝒂𝒕𝒂. Each sub network updates its own parameters with gradients derived from the cost function defined in Eq. 1 and back propagation of the error. The generator will improve the samples it produces based on the feedback it obtains from the discriminator. Theoretically, training is completed, when Nash-equilibrium is reached. In game theory, a Nash- equilibrium where a player has reached a point that leaves him/her in no better or no worse situation, no matter what his/her opponent decides to do. At this stage, the discriminator can no longer distinguish between samples produced by the generator and samples drawn from the target data distribution, and thus 𝑸 ≈ 𝑷𝒅𝒂𝒕𝒂. The discriminator, now unsure of whether the sample is real or generated, outputs a probability of 𝑫𝜙(𝒙) ≈ 0.5. min 𝜃 max 𝜙 𝑉 (𝑫𝜙(𝒙), 𝑮𝜃(𝒛)) = 𝔼𝒙~𝑝𝑑𝑎𝑡𝑎(𝒙)[log 𝑫(𝒙)] + 𝔼𝒛~𝑝𝑧(𝒛)[log(𝟏 − 𝑫(𝑮(𝒙)))] ……… (1) For bearing diagnostics, a GAN is trained on purely healthy data. During the inference stage, a previously unclassified query sample is then passed to the discriminator. If the query matches the distribution of the training data (𝑸 ≈ 𝑷𝒅𝒂𝒕𝒂), the discriminator will output 𝑫𝜙(𝒙) ≈ 0.5, and the sample is classified as healthy. Instead, if the query sample results in 𝑫𝜙(𝒙) ≠ 0.5, the sample is believed to be anomalous from the healthy data and further investigation is performed to identify the cause of the anomaly. The loss functions for generator and discriminator are derived from Eq. 1. The loss function for the discriminator, shown in Eq. 2, when minimized ensures the discriminator correctly classifies samples as real or generated. The loss for the generator, shown in Eq. 3, is taken as the Kullback-Leibler (KL) divergence loss from [6] and ensures the generator produces realistic samples, whilst allowing both the discriminator and generator to reach an optimum. Optimization is done by RMSProp with a learning rate of 10−4 which is exponentially decayed at a rate of 0.98 every 4000 iterations. Where one iteration corresponds to one update of the trainable parameters using a batch of 64 random samples per iteration. ℒ𝐷(𝜃) = − 1 2 log(𝑫(𝒙)) − 1 2 log(1 − 𝑫(𝑮(𝒛))) …..…………….……… (2) ℒ𝐺(𝜙) = − log( 𝑫(𝑮(𝒛)) 1 − 𝑫(𝑮(𝒛)) ) …………...……………………. (3) 2.2. Architecture For the diagnosis of faults in bearings, the architecture proposed by [7], known as a Deep Convolutional GAN (DCGAN), was used. The architecture was originally proposed to learn reusable feature representations in large unlabelled datasets, specifically in the context of computer vision. This makes it ideal for learning features from raw vibration signals. As vibration signals tend to be cyclo-stationary, certain features tend to repeat themselves, albeit at irregular intervals in the case of non-stationary operating conditions. The network is comprised of multiple layers of convolutions and thus the features that are learnt are somewhat invariant to scaling and transformations. This allows the network the flexibility to learn features even in non-stationary operating conditions. Some adjustments are done, to allow for the training on acceleration signals. These adjustments were: • Batch Normalisation was not used in between layers. • An additional fully connected layer was added after the last convolution layer in the discriminator. • Exponential Linear Units (ELU) activation functions [8] were used in the discriminator in place of leaky Rectified Linear Units (RELU). A visual representation of this architecture used in this study is presented in Fig. 1. The discriminator consists of 4 convolution layers with increasing number of filters on each layer. The stride is kept low on the first layer and then increased on the rest of the layers. The second last and last layers of the network are fully connected with an output 245 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management size of 256 and 1 respectively. ELU activation is used on all layers except the output. In the generator network, random noise is passed through a fully connected layer, with an output size chosen and reshaped to ensure the generated samples match the dimension of the real samples. This is then passed through four deconvolution layers as in [7]. The final layer is passed through a tanh activation function. Therefore, training data is normalised between [−1,1], before being presented to the discriminator. The GAN architecture is summarised as follows: Generator architecture: Discriminator architecture: Input: 𝑧 = 𝑏𝑎𝑡𝑐ℎ × 100 → Input: 𝑥 = 𝑏𝑎𝑡𝑐ℎ × 4096 × 1 → Fully Connected: [4096], Reshape: [256 × 16], Relu → Convolution 1: [32 × 25 × 1], Stride: [2 × 1], Elu → Deconvolution 1: [128 × 25 × 1], Stride: [4 × 1], Relu → Convolution 2: [64 × 25 × 1], Stride: [5 × 1], Elu → Deconvolution 2: [64 × 25 × 1], Stride: [4 × 1], Relu → Convolution 3: [128 × 5 × 1], Stride: [5 × 1], Elu → Deconvolution 3: [32 × 25 × 1], Stride: [4 × 1], Relu → Convolution 4: [256 × 5 × 1], Stride: [5 × 1], Elu → Deconvolution 4: [1 × 25 × 1], Stride: [4 × 1], Tanh → Fully Connected: [256], Elu → Output: Reshape: [4096 × 1] Output: Fully Connected: [1], Sigmoid Figure 1: GAN Architecture for Discriminator and Generator networks, used for all investigations in this study. 2.3. Regularization for improved stability The training of a GAN is a particularly difficult optimisation problem, since it involves finding the stationary points on a vector field defined by the gradients of the two loss functions (Eqs. 2 and 3). As opposed to conventional Artificial Neural Network training which only involves finding the stationary points of a vector field defined by only one loss function. When a vector field is defined by more than one function it becomes non-conservative. A non- conservative vector field is significantly more difficult to optimise as the optimisation process becomes path and starting point dependent. A vector field 𝑭 is only conservative when there exists a function 𝑓 such that ∇𝑓 = 𝑭 [9]. To improve the behaviour of the optimisation process researchers have recently proposed regularisation techniques to improve the behaviour of GAN training [10]. Two of these proposed methods were implemented in this paper, namely consensus optimization and instance noise. 2.3.1. Consensus optimization The non-conservative vector field results in the optimization path that enters an orbit around a minimum instead of progressing towards it. Consensus optimization was introduced by [11] as a means of preventing this by modifying 246 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management the loss functions of the generator and the discriminator. This is done by the addition of the term 𝐿(𝜃, 𝜙) = 1 2⁄ ‖𝒗(𝜃, 𝜙)‖ 2, to both loss functions, ℒ̃𝐷(𝜙) = ℒ𝐷(𝜙) + 𝛾𝐿(𝜙, 𝜃) and ℒ̃𝐺(𝜃) = ℒ𝐺(𝜃) + 𝛾𝐿(𝜙, 𝜃), ………………… (4) where 𝒗(𝜃, 𝜙) = ( ∇𝜙ℒ𝐷(𝜙) ∇𝜃ℒ𝐺(𝜃) ), …………………..………………… (5) is a vector containing the gradients of the two loss functions with respect to their trainable parameters and 𝛾 is a consensus factor. The initial 𝛾 was set to 3.5 for all experiments and then exponentially decayed at a rate of 0.5 every 8000 iterations. 2.3.2. Instance noise At the start of training, the generator is essentially outputting noise. Therefore, the discriminator has an easy job of distinguishing between the real sample and the generated ones. It is highly unlikely for the parametric distribution of the generator to overlap with the distribution of the data distribution during early stages. As such [6] proposed the use of instance noise to overcome the problem. Instance noise ensures the parametric distribution and the data distribution initially overlap by adding Gaussian noise to both the generated and real samples. The noise has a mean of zero and a variance 𝜎2 of 0.2. As training progresses the variance of instance noise is linearly reduced to zero by iteration 10000. 3. EXPERIMENTAL VALIDATION 3.1. IMS Dataset This dataset, developed by [12], contains the acceleration response of four Rexnord ZA-2115 double row bearings installed on a single shaft. The shaft was run by an AC motor kept at a constant speed of 2000𝑅𝑃𝑀 with a constant radial load of 6000𝐼𝑏𝑠. The dataset contains the results of three run-to-failure tests. Upon completion of test set No. 1, an inner race fault was discovered on bearing 3 and roller element fault was discovered on bearing 4. A GAN was trained using this test set No. 1. Only the acceleration signals of bearing 3 and 4 in the y direction were considered for this investigation and used for training. Two independent GANs were then trained for 30000 iterations (±2 hours), separately on each of the bearing signals. The first 8% of data was taken as the reference condition (treated as healthy) and used to train the GAN. The training data corresponds to the blue highlighted portion of Fig. 2. In Fig 2a, the output of the GAN trained on the signal from bearing 3 is shown over a period of 35 days. Similarly, in Fig 2b, the output for the GAN trained on bearing 4 is shown. Also shown in both plots are the RMS and Kurtosis of each signal over the 35 days. (a) Bearing 3 (b) Bearing 4 Figure 2: GAN metric of IMS dataset trended over 35 days. Blue highlighted portion represents first 8% of data used for training of the GAN. Also shown are traditional statistical features, RMS and Kurtosis of the same signal. 247 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management In Fig. 2a, the RMS and Kurtosis signals remain relatively unchanged during the first 30 days of operation, after which the RMS starts to increase, and the Kurtosis levels rise. This is in line with the observations of [13] and is typical behaviour of bearings with an inner race fault, where crack propagation and development occur over a short period of time. Looking at the GAN metric, the trend is relatively constant up until day 25, after which, a small jump is noticed. The GAN metric then remains constant until day 30, where it is seen to increase until the last day of observation. In Fig 2b, the GAN metric increases initially until day 10, after which it decreases and remains constant until day 25, where the rate increase until the end of sampling. A small increase in the Kurtosis is also noticed during the first 10 days. This initial increase and decrease are explained by a healing phenomenon whereby small surface cracks are smoothed by continuous rolling action. At a later stage, after day 25 when damage increases, the vibration levels increase once more. The jump that was noticed on day 25 of bearing 3 is explained by the increase in damage of bearing 4 around the same time. This shows the GAN’s sensitivity to any deviation from conditions other than those that were used during training, treating any deviation as anomalous. Furthermore, the magnitude of the GAN metric during the first 10 days of bearing 4 indicate that it is more sensitive to the initialization of faults than the traditional statistical metrics of RMS and Kurtosis. This makes detecting faults during early fault development, far more accurate. Figure 3: Scatter plot of samples with various fault modes presented to a trained GAN. Clustering of similar samples can be seen on the manifold 3.2. MFPT Dataset The second dataset that was tested was a fault dataset released by the Society for Machinery Failure Prevention Technology (MFPT) [14] with an accompanying tutorial in bearing envelope analysis [15]. The dataset was released with the aim to facilitate research into bearing analysis. The dataset is comprised of several sets of test cases with data collected from a bearing test rig that was equipped with a NICE bearing that has 8 rolling elements. Also included were a few real-world cases, but these were not considered. The first dataset is a baseline condition that was sampled from a healthy bearing at a shaft speed of 25 𝐻𝑧 and a load of 270 𝐼𝑏𝑠. The second set, contains data from a bearing with an outer race fault, operated at the same operating conditions as the baseline condition. The last of the datasets considered was that of a bearing with an inner race fault, sampled at varying loads between 0 and 300 𝐼𝑏𝑠 at a constant shaft speed of 25 𝐻𝑧. Consequently, the dataset represents both stationary and non-stationary conditions in terms of load. A GAN was trained for 70000 iterations (±4 hours), solely on data from the one of the 248 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management healthy baseline conditions. Once trained, samples from the remaining sets of data where shown to the discriminator for inference. The second last layer of the discriminator was used to visualise the manifold on which the data was mapped. ISOMAP was used to further reduce the dimension of the second last layer from 256 to 2 features for plotting as shown in Fig. 3. It can be seen in Fig. 3, that healthy samples are clustered together towards the top left corner of the plot. Even healthy samples that were not used for training, are clustered together with samples used for training. Similarly, samples with an outer race fault and the inner race fault are clustered towards the centre and right of the plot respectively. This shows that the features learnt by the discriminator are representative enough to separate samples from various fault modes, even when those samples have not been observed before. It also shows that samples of similar features are mapped to certain regions within the statistical manifold. Demonstrating that the approach may be viable in a semi-supervised diagnostic framework with sparse labels. 3.3. HSBD Dataset High-speed bearing data (HSBD) was collected from a 2𝑀𝑊 commercial wind turbine [16] and was used as the final dataset for this investigation. This real-world data set is inherently non-stationary, due to the varying wind speeds and loads found in nature. The measurements in this dataset were captured over a period of 50 consecutive days. The bearing from the wind turbine had an inner race fault that progressively deteriorated across the 50 days. The data was captured at a sampling rate of 97656 𝑠𝑝𝑠 for 6𝑠. The first 4 days (8%) of raw data was taken as the reference set and used to train a GAN for 70000 iterations (±4 hours). Again, the output of the discriminator was used as a fault metric and trended over the period of 50 days and can be seen in Fig. 4. Also shown in Fig. 4, is the inner race envelope energy (measured in Gs as per [16]) of the raw and resampled data over the 50 days. The GAN metric is constant with small fluctuations until day 25, where it starts to increase until the last day of observations. Small fluctuations can be described by varying weather conditions. No resampling of the raw signal was required when training a GAN. If it is assumed that the training data contains samples from all possible operating conditions, the GAN learns how the healthy data looks at these varying operating conditions and any deviation can be explained by faults. Even when this assumption is weak, as was the case here, the overall trend of the fault can still be recovered. The small fluctuations present during the early stages of observations can be attributed to operating conditions not seen by the GAN during training. The duration of the peaks that are present in the early stages of observation are relatively short when compared to the duration over which the GAN metric increases. This indicates that the condition that caused the increase in the GAN metric is permanent, like a fault condition, and the condition that caused the small fluctuation is temporary, like an unseen operating condition. Figure 4: GAN metric of HSBD dataset trended over the 50 days of observation. Blue highlighted section represents the first 4 days of data used for training of the GAN. Also shown is the inner race fault energy of the raw and resampled signal measured in Gs. 249 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 4. CONCLUSION This paper presents a novel technique that uses Generative Adversarial Networks (GANs) to perform bearing diagnostics. The issue of class imbalance, that is prevalent with diagnostics and typical machine learning algorithms, is solved by leveraging the GAN’s ability to learn a statistical manifold on purely healthy data. Once the manifold is learnt, the discriminator of the network can be used to perform inference on unclassified samples. Faults can be trended by measuring how much the sample deviates from the statistical manifold. Furthermore, it is shown that the network automatically clusters samples that contain similar features on this learnt manifold. This clustering can be used for unsupervised classification of multiple fault modes. The framework is robust to non-stationary signals and requires no advanced signal processing or order tracking to perform. REFERENCES [1] N. Tandon, “A comparison of some vibration parameters for the condition monitoring of rolling element bearings,” Measurement, vol. 12, no. 3, pp. 285–289, 1994. [2] P. Boskoski and D. Juricic, “Fault detection of mechanical drives under variable operating conditions based on wavelet packet Renyi entropy signatures,” Mech. Syst. Signal Process., vol. 31, pp. 369–381, 2012. [3] R. Zimroz, W. Bartelmus, T. Barszcz, and J. Urbanek, “Diagnostics of bearings in presence of strong operating conditions non- stationarity - A procedure of load-dependent features processing with application to wind turbine bearings,” Mech. Syst. Signal Process., vol. 46, no. 1, pp. 16–27, 2014. [4] R. Liu, B. Yang, E. Zio, and X. Chen, “Artificial intelligence for fault diagnosis of rotating machinery: A review,” Mech. Syst. Signal Process., vol. 108, pp. 33–47, 2018. [5] I. Goodfellow et al., “Generative Adversarial Nets,” Adv. Neural Inf. Process. Syst. 27, pp. 2672–2680, 2014. [6] C. K. Sønderby, J. Caballero, L. Theis, W. Shi, and F. Huszár, “Amortised MAP Inference for Image Super-resolution,” Mach. Vis. Appl., vol. 25, no. 6, pp. 1423–1468, Oct. 2016. [7] A. Radford, L. Metz, and S. Chintala, “Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks,” pp. 1–16, 2015. [8] D.-A. Clevert, T. Unterthiner, and S. Hochreiter, “Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs),” pp. 1–14, 2015. [9] J. Stewart, Calculus: Early Transcendentals, 7th ed. Cengage Learning, 2010. [10] L. Mescheder, A. Geiger, and S. Nowozin, “Which Training Methods for GANs do actually Converge?,” 2018. [11] L. Mescheder, S. Nowozin, and A. Geiger, “The Numerics of GANs,” 2017. [12] H. Qiu, J. Lee, J. Lin, and G. Yu, “Wavelet filter-based weak signature detection method and its application on rolling element bearing prognostics,” J. Sound Vib., vol. 289, no. 4–5, pp. 1066–1090, 2006. [13] T. Williams, X. Ribadeneira, S. Billington, and T. Kurfess, “Rolling element bearing diagnostics in run-to-failure lifetime testing,” Mech. Syst. Signal Process., vol. 15, no. 5, pp. 979–993, 2001. [14] E. Bechhoefer, “Fault Data Sets – Society For Machinery Failure Prevention Technology.” [Online]. Available: http://mfpt.org/fault- data-sets/. [15] E. Bechhoefer, “A quick introduction to bearing envelope analysis.” [16] E. Bechhoefer, B. Van Hecke, and D. He, “Processing for Improved Spectral Analysis,” Annu. Conf. Progn. Heal. Manag. Soc., pp. 1–6, 2013. 250 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Condition Monitoring of a Fan using Neural Networks K. Passi1, B. Zhang1, M. Timusk2 1Department of Mathematics and Computer Science, Laurentian University, Sudbury, Ontario, Canada 2Bharti School of Engineering, Laurentian University, Sudbury, Ontario, Canada ABSTRACT Fans are widely used in various industrial fields and it plays a key role in cooling the machinery. For the machinery to work properly, the fan system should remain in stable and error-free condition. Condition monitoring is introduced as a maintenance tool for the failure diagnosis of a fan system. Some methods used in condition monitoring are vibration monitoring and thermal monitoring. Vibration monitoring method was used in this experiment. A fan system based on Machinery Fault SimulatorTM (MFS) was used to simulate different conditions of a fan in the laboratory. An accelerometer was installed on top of the bearing housing. It was used to detect the vibration signal of a running fan. A data acquisition program designed in LabVIEW was used to record and preprocess the raw vibration signal. The collected data was used to detect the condition of the fan system. Neural Network was used for the fault diagnosis. The raw vibration signal is a one-dimensional time domain series data, while the neural network requires multidimensional features as input data. Therefore, it is important to preprocess the raw vibration signal data. Two different preprocessing methods, time-domain features and Auto Regressive (AR) model features were used to preprocess separately. The neural network model was trained by these two methods respectively. The results show that the AR model gave better features than the time domain features method. The condition monitoring system consisted of the following parts: data acquisition, data storage, data preprocessing and the display of results. Some methods were programmed in Matlab, which were called by Matlab scripts in the LabVIEW software. The hybrid programming method helped to generate an efficient program which provided high accuracy of fault diagnosis. Keywords: Fan, Failure diagnosis, Vibration analysis, Neural Network, LabVIEW, Matlab Corresponding author: Kalpdrum Passi (kpassi@cs.laurentian.ca) 1. INTRODUCTION Fans are widely used in various industrial fields. In factory premises, the operation of equipment generates enormous amounts of heat and its operation requires the maintenance of certain temperature conditions. High temperature causes overheating, which may break the machines. Good heat dissipation ability helps the machines and equipment to be in good working condition for a long time. Air-cooled radiator, which has the fan as a key component, exudes a tremendous amount of heat. It is used to maintain machine’s temperature. Wind energy is a renewable, widely distributed and non-polluting source of energy [11] and fan blades are the key components of wind turbines. Fans working at high-speed for a long time in harsh working conditions with inappropriate maintenance and excessive demands may cause faults. Therefore, monitoring the fan’s condition is critical. Fault Diagnosis is an emerging discipline that is based on reliability theory, information theory, cybernetics theory and systems theory [7]. It uses modern test instruments and computer technology as technical means to study a variety of diagnostic objects’ features. Neural network is a highly complex large nonlinear adaptive system, which is made up of a large number of simple processing units [3]. Artificial neural networks can perform massively parallel processing. In this paper, we apply neural network for condition monitoring of a fan. 1.1. Fan’s failure types A rotating fan is basically combined of three parts: the drive device, such as the motor, transmission device, such as the shaft and the blades. Usually in order to control the rotating speed of the blades, electronic control unit or gearbox is added. Fan’s parameters include the following features: wind velocity, air volume, wind pressure, lifetime and 251 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management noise. These parameters impact the performance of the fan. Based on these parameters, researchers are able to make condition monitoring plans. Typical failures of rotating machinery include unbalance, misalignment, static and dynamic friction. Fan blades unbalance, rotor unbalance and bearing failure are the main reasons of fans’ failures, which causes abnormal vibration in the equipment. These abnormal vibrations shorten the life of the equipment and cause some other unpredictable damages. 2. EXPERIMENT SET-UP In order to test and validate a prototype condition monitoring system, the ideal situation would be to collect data and perform experiments on a fan in a plant exhibiting all of the fault conditions of interest. However, there are some limitations that make it impractical to do this. This test required various data sets collected from different fault conditions of a fan. For this reason, the experiments required were conducted in the laboratory using a machinery diagnostics simulator with a fan attachment. The apparatus employed for the experiments had three capabilities. The first capability was that it was able to simulate the fan’s normal operation. The second capability was that it was possible to seed faults on the fan and to operate it under these faulty conditions. The last capability was it was able to collect data from an accelerometer installed on the fan’s bearing housing. The laboratory platform that was used is a commercially available Machinery Fault SimulatorTM (MFS) manufactured by Spectra Quest of Richmond, Virginia. This MFS is designed to reproduce multiple types of machine faults such as shaft or coupling misalignment, mechanical looseness, motor faults, rolling element bearing faults, rotor unbalance and pump faults. In the case of the present work, the test bench used in fan’s fault simulation and its schematic is shown in Figure 1. AC Motor Controller Digital Techometer Shaft Bearing Housings Fan Coupling Accelerometer Figure 1. Photo of test bench and Schematic of the test apparatus The equipment used to control the rotating speed of the motor was a Lenze AC Tech Controller. The test bench was fitted with a digital tachometer to measure and display the rotating speed of the motor. There are six buttons on the keypad, which are used to start or stop the drive, change the speed, and change the rotation direction. In the case of this work the motor was operated in three states: low speed, medium speed and high speed corresponding to 10Hz, 20Hz and 30Hz respectively. The unbalance condition is the most common type of fan fault. In order to simulate the blade unbalance condition, a small mass was added to one of the three blades. In order to simulate the rotor unbalance, a cocked bearing housing was used. The lower surface of the cocked bearing housing that contacted with the rotor base was designed to achieve about a 0.5-degree tilt. The last fault type examined was a rolling element bearing fault. Due to the limited condition in the laboratory, in this case the damped bearing is selected as the failure bearing. Platinum Low-cost Industrial ICP ® Accelerometer, which is made by ICP ® Sensor, was used for measuring the vibration signal in the test [10]. It is now widely used for general industrial purpose all over the world. A piezoelectric element is used by the accelerometer to convert the mechanical motion into an electrical signal. The value of the electrical signal reflects the amplitude of the vibration of the object that the accelerometer is attached to. A data 252 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management acquisition module manufactured by National Instruments (model number PXIe-4492) was used to collect the accelerometer data. This data acquisition module is specifically designed for sound and vibration applications [8]. National Instruments LabVIEW (Laboratory Virtual Instrument Engineering Workbench) is a graphical programming language, similar to other traditional programming languages. The LabVIEW program written for this work performed the following functions: acquisition of the vibration signal, display of the vibration signal, display of the preliminary analysis of the vibration data, and storing the vibration signal data for further analysis and training of fault detection programs. In this experiment, the task was to collect fan’s vibration signals in different states of health. The NI PXIe-4492 module would pick out the samples by the sample rate that was set up at 2000 Hz in the DAQ Assistant function. The samples were saved, processed and analyzed in LabVIEW for fan’s test conditions of healthy, blade unbalance, rotor unbalance and bearing fault at rotating speeds 10 Hz, 20 Hz and 30 Hz, respectively. 3. NEURAL NETWORK DESIGN FOR CONDITION MONITORING A neural network consists of an input layer, one or more hidden layers and an output layer. The input layer of neural network model needs multidimensional data, but the raw vibration signal is a one-dimensional time series data. So, the fan’s vibration signal should be preprocessed to fit the input parameters of the neural network. The raw vibration signal needs to be converted to multiple features as input nodes representing healthy and faulty machine operations. Two different data preprocessing methods, time domain features and auto regressive (AR) model, were compared in the training process. Using the raw time domain vibration signal for analysis is a popular, simple and effective method [9]. The original vibration signal can directly reflect fan’s working condition. Thus, feature extraction is needed for processing the raw vibration data that can reduce the size of the training set while preserving features that correspond to the condition of the fan. Six statistical features were used to describe the raw vibration data: Root Mean Square Value, Peak to Peak Value, Kurtosis, Crest factor, Impulse factor and Energy in time domain [9]. These six features are selected as the input features of the neural network. In each data set, the sampling rate is 2000 Hz, which means there are 10000 samples in each segment of time series data. Based on the sample rate and the rotating speed of the fan system, 400 samples were selected as a period to calculate their features. The reason for choosing 400 samples as a segment of time series data is that they represent two complete revolutions of the rotating fan (10Hz rotating frequency). Matlab was used to calculate the features. The feature set values are the input data for the Back Propagation (BP) neural network module as shown in Figure 2. Fan Vibration signal RMS PeaktoPeak Kurtosis Crest Impluse Energy Fan Vibration signal Class 1 Class 4... ... ... X1 X2 X3 X6 Figure 2. Neural network with time domain features as input nodes There are 6 input nodes based on the fan’s vibration signal feature vector and 4 output nodes based on the fan’s conditions. The number of hidden layers can be calculated by the inequalities 𝑙 < 𝑚 + 𝑛 + a and ℓ > 𝑙𝑜𝑔+𝑛 where n is the number of input nodes; m is the number of output nodes; a is a constant number between 1 and 10. Thus the number of nodes in the hidden layer range from 4 to13. Using Matlab modeling of the BP Neural Network, different number of nodes in the hidden layer were tried and it was found that 12 nodes gave the best accuracy. The fan vibration signal features representing four different conditions are labeled as 1, 2, 3, 4. Based on the labels, the target output values were set as 1000, 0100, 0010, 0001 representing healthy fan, blades unbalance, rotor unbalance and bearing fault, respectively. Neural Network Pattern Recognition Tool in Matlab was used to train the neural network. In the toolbox, 70% sample set (210 samples) was used as a training set, 15% sample set was used for validation, and the remaining 15% sample set was used for testing. The network adjusts the weights according to the 253 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management classification error based on the training samples. Figure 3 shows the training results for four conditions with different speeds. Figure 3. Training results for four conditions with different speeds The first three matrices show the results of the training set, validation set and testing set and the fourth matrix shows the results of the combined data set. The columns in the matrices show the target classes, while the rows show the output classes. For example, in the Training Confusion Matrix, 30 samples (14.3% sample of training set) of output class 1 were correctly classified as the target class 1 (green background). Meanwhile, 2 samples (1% samples of training set) of target class 1 were misclassified as output class 2 (red background). The percentages shown in green color in the last row (grey background), columns 1 – 4 show the number of samples in the trained results that are correctly classified to the target class, and the percentages shown in red color show the number of samples in the trained results that are not classified to the target class. For example, in the Training Confusion Matrix, 55.6% of the samples were correctly classified as target class 1, and 44.4% of the samples were misclassified as other classes. The last column (grey background) are the results of the output classes. The percentages in green show in each output class how many samples were classified correctly, and the percentages in red show how many samples were classified incorrectly. For example, 52.6% samples of the output class 1 were classified correctly, the rest 47.4% samples in output class 1 were classified incorrectly. The last row-column (blue background) are the test results. For example, in the Training Confusion Matrix 70% of the samples in training set were classified correctly and 30% of the samples in the training set were classified incorrectly. The last matrix shows the test results of all the features. It is clear that 68% of the samples were classified correctly, which means for all of the 300 feature vector samples, 96 samples were classified to the wrong class. From this test, it was obvious that using only one accelerometer to collect data, enough information was not available for training the BP neural network module. One accelerometer just gave us the information from the top of the bearing house. If we want to obtain more vibration signal information, we need more accelerometers which should be installed in different positions in order to get data from other dimensions. Auto Regressive (AR) model [6] is a time series analysis method and its model parameters contain all the important features about the machinery’s condition. Accurate AR model reflects the objective rules of dynamic system. AR model’s coefficients are very sensitive to state changes in a dynamic system. Therefore, the actual AR coefficients were used as the feature vector to analyze the state of the dynamic system. 254 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management AR model is a widely used in practical work, and the function is a set of linear equations. Based on the knowledge of digital signal processing, a P-order AR model is equivalent to a P-order linear predictor. Based on the output of the AR model, the prediction parameters are calculated, which is the linear prediction coefficient. This linear prediction coefficient was used in voice coding at first, so it is named Linear Prediction Coding (LPC) [2]. The AR model’s coefficients can be calculated by linear prediction analysis. Many algorithms are used to calculate AR model’s parameters, such as Burg’s lattice-based method, Forward- backward approach, Geometric lattice approach, Least-squares approach and Yule-Walker approach. In practical work, Forward-backward approach is often used to calculate AR model’s parameters [5]. Once the signal has been analyzed by AR model into discrete parameters, these parameters can be used for classification. Every set of parameters is able to show the state of fan in a time interval. This process is able to simplify the data and convert one-dimensional data into multidimensional data, which can be the input to the neural network model. When using AR model to process vibration signal, users choose the number of AR model parameters. However, it is important to find the right number. If AR model is used to fit the time series, there is eventually a number that fits best, and this number is the optimal choice for the AR model. Two optimal value selection methods were used to find the number of parameters: Final Prediction Error (FPE) [1] and An Information Criterion (AIC) [4]. The FPE and AIC curves were plotted and the number of parameters were determined to be 20. Minimizing the prediction error in the least squares sense, Linear Prediction Coding (LPC) is used to determine the coefficients of the forward linear predictor. It is used in filter design, eigenvalue extraction and speech coding. As the number of the parameters has been determined to be 20, fan’s vibration signal is used as the AR (20) model’s output. The parameters calculated from LPC by every 20 fan vibration signal samples can be used as the fan vibration features. In each original fan vibration signal data file, there are 10,000 acceleration samples, and as the number of parameters is 20, every 400 samples are used to calculate their eigenvalues. The feature vector sample is the concentrated expression of the selected 400 vibration signal samples. It includes the features of the 400 samples, which can be used in classification algorithm. Feed-forward Neural Network with input parameters selected as eigenvectors from AR modeling is shown in Figure 4. Figure 4. Neural network with AR model parameters as input nodes There are 20 input nodes based on the fan vibration signal feature vector. According to the inequalitties discussed earlier, the number of nodes in the hidden layer range from 5 to19. Using Matlab modeling of the BP Neural Network, we try these different numbers as the number of nodes in the hidden layer and select 18 nodes in the hidden layer for the test. The structure of the BP neural network is 20-18-4, which means the input layer has 20 nodes, the hidden layer has 18 nodes and the output layer has 4 nodes. After all of the input vectors, output vectors, number of hidden layers are known, the Neural Network Pattern Recognition Tool is used to train the network. This step is similar to the training process used the input features Root Mean Square Value, Peak to Peak Value, Kurtosis, Crest factor, Impulse factor and Energy in time domain. Figure 5 shows the test results. The correct classification rate of the trained neural network model was 100%, which means for the entire 300 feature vector samples all of them were classified to the correct class. This shows that preprocessing by the AR model gives accurate results for classification by the neural network. The 100% result is achieved in the ideal conditions in the lab. It may not give 100% results in an industrial setting. Fan vibration signal Feature parameters from AR model Class 1 Class 4 x1 x2 xn ... ... ... State recognition results 255 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 5. Training results for the four conditions Two different methods were used for processing the original data and the processed data was used as input to train the neural network. The results show that the features that were calculated by time domain features do not train the neural network well. Enough information cannot be obtained for the classification by a traditional method with limited equipment available in the experiments. The comparison of the results is shown in Figure 6. Time domain feature AR model Time domain feature AR model Time domain feature AR model Time domain feature AR model Time domain feature AR model Class 1 Class 2 Class 3 Class 4 All 100% Classification Corrcet Classification Incorrcet Figure 6. Comparison of Time Domain Features and AR Model feature vector Based on the above results, AR model was selected to process the original fan vibration signal. 256 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3.1. Classifying the test data on the trained neural network The raw vibration signal samples were used to do the test the neural network. The data in one-dimensional waveform signal was converted into a matrix form to use on Matlab. The raw vibration signal was converted to AR model feature vector consisting of 20 features. AR model feature vector set was used as the input to the trained BP neural network. The output of the BP neural network classification function is a one-dimensional array which is close to the class label. The class label is normalized by rounding the number to the nearest integer, which is 0 and 1. The output of the BP neural network model is the class label. As mentioned before, class label 1000, 0100, 0010 and 0001 represents healthy fan, fan blades unbalance, rotor unbalance and bearing fault, respectively. Since the class labels are listed by columns, number of 1’s is calculated in each column in order to get the number of samples classified to their target class. The result count is shown in a graphical display window as shown in Figure 7. The white bar shows the number of samples belonging to class 1. The test data used in this process was healthy fan’s vibration signal. The classification result was correct. Figure 7. Results display in LabVIEW 4. CONCLUSION AND FUTURE WORK This paper explored the application of neural networks for failure diagnosis of a fan. LabVIEW program was used to collect and process data and call Matlab functions for the classification. The results from the experiment show that the AR model is the appropriate preprocessing method for calculating the input data for the neural network. The powerful computing ability of Matlab and the data acquisition ability of LabVIEW can be perfectly combined together. The system designed in LabVIEW works well for fault diagnosis. After the fan vibration signal is collected, the system shows the class results in a short time. The results show that the system is a good fault diagnosis tool for fan condition monitoring. Since the equipment used for the experiments was limited, there are still some other problems that have not been discussed in this paper. In a future study, other types of faults such as gearing box faults and motor faults may be considered. All of the faults tested in the project were in extreme cases; however, faults normally get worse gradually. Less extreme faults will be tested in the future. Extra accelerometers can be used to get more vibration details and database can be used in the program for data storage. Other classification techniques such as rough sets can be tested and compared with the neural network for fault detection. It would be useful to detect faults in the industrial setting. REFERENCES [1] Akaike, H. Fitting autoregressive models for prediction. (1969) Annals of the Institute of Statistical Mathematics. 243-247. [2] Atal, B. S., (2006). The history of linear prediction. IEEE Signal Processing Magazine, 23(2). 154-161. [3] Demuth, H., and Beale, M., (1993). Neural network toolbox for use with MATLAB. [4] Fogel, D. B., (1991). An information criterion for optimal neural network selection, IEEE Transactions on Neural Networks, 2(5). 490- 497. [5] Hyndman, R. J., (1993). Yule-Walker estimates for continuous time autoregressive models, Journal of Time Series Analysis, 14(3): 281-296. [6] Junsheng, C., Dejie, Y., and Yu, Y., (2006). A fault diagnosis approach for roller bearings based on EMD method and AR model, Mechanical Systems and Signal Processing, 20(2): 350-362. [7] Mobley, R., K., (2012). An introduction to predictive maintenance,” 2nd edition, Toronto, Ontario, Canada, Elsevier Science. [8] National Instruments, (2012). NI PXIe-4492 Introduction, USA, http:// sine.ni.com/nips/cds/view/p/lang/en/nid/209541. [9] Patel, J., Patel, V, and Patel, A., (2013). Fault diagnostics of rolling bearing based on improve time and frequency domain features using artificial neural networks, International Journal for Scientific Reaearch & Development, Vol. 1, Issue 4. 257 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [10] IMI Sensors, (2012). Platinum Low-cost Industrial ICP ® accelerometer introduction. [11] https://www.imi-sensors.com/Industrial_Accelerometers/Low-Cost _for_Permanent_Installation/Low-cost_2- pin_MIL_Connector/603C01.aspx. [12] Smith, J.L., (2009). World oil: market or mayhem?. Journal of Economic Perspectives, 3(23):145-164. 258 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Time-frequency domain analysis of varying speed vibration response of dual-rotor system Yizhou Yang, Chao Liu, Dongxiang Jiang and Wenguang Yang State Key Lab of Control and Simulation of Power Systems and Generation Equipment, Department of Energy and Power Engineering, Tsinghua University, Beijing, China ABSTRACT Dual-rotor is a common form of rotor system where two rotors are coupled via an inter-shaft bearing and the speeds of the rotors are usually different and variable. When the vibration signal of a dual-rotor system is non-stationary, traditional frequency-domain signal processing methods are not suitable. In this paper, variable speed vibration responses of a dual-rotor system dynamic simulation model is acquired and analyzed by a time-frequency domain signal analysis approach called frequency slice wavelet transform (FSWT). The dual-rotor system model is established under an inertia coordinate using finite element method. Then the rotor system model is solved by the explicit Newmark-β method to acquire the unbalance response under a fixed rotor speed or a varying speed, which could be movement of uniform angular acceleration or randomly fluctuating speed. By choosing the suitable scale coefficient, FSWT could achieve a good balance between time and frequency resolution, thus reflecting the detailed features in the vibration response. Meanwhile frequency components of the complex signal could be reconstructed by a FSWT representation. Keywords: dual-rotor system, varying speed, time-frequency domain analysis, advanced signal processing, frequency slice wavelet transform. Corresponding author: Dongxiang Jiang (jiangdx@tsinghua.edu.cn) 1. INTRODUCTION Many rotating machinery often work under varying speed conditions, resulting in aperiodic characteristics in their vibration. Thus traditional methods of signal processing based on Fourier transform are not suitable for such non- stationary vibration signals [1]. The frequency components in the signal are variable, which could be analyzed by time-frequency domain signal methods [2]. Dual-rotor is a common form of rotor system in rotating machinery like aero-engines. The high-pressure rotor and the low-pressure rotor are coupled via an inter-shaft bearing, and the speed of the rotors are different and variable, making the vibration signals more complex than general rotating machinery. Time-frequency signal processing methods are implemented on non-stationary signals in order to obtain the variation of the frequency components of a signal over time. The short-time Fourier transform (STFT) is one of the most basic forms of time-frequency analysis [3]. STFT divides a signal into shorter segments of equal length and compute the Fourier transform separately on each shorter segment then the changing spectra as a function of time is plotted. But once the window of STFT is fixed the time-frequency resolution is fixed so the ideal time-frequency resolution is hard to get. The wavelet transform obtains a scalable window function by selecting the scale factor and the shift factor, thus has a good time-frequency localization characteristic [4]. The ratio between the bandwidth and the analysis frequency remains unchanged, so that the wavelet function can automatically adapt to changes in the frequency components of the signal. The short-time window is used for high-frequency components and the long- time window is used for low-frequency components. Frequency slice wavelet transform (FSWT) is a kind of time- frequency signal analysis method by means of extension of STFT defined directly in frequency domain [5, 6]. The adjustable scale parameter makes the time-frequency resolution of FSWT decomposition controllable. The inverse transform of FSWT no longer rely on wavelet functions like wavelet transforms, so time-frequency domain segmentation could be performed flexibly in time-frequency space, and the desired signal components could be separated by signal reconstruction. Besides of the measurement of an actual rotor system or an experimental rotor system, modelling and simulation is 259 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management also an important way to acquire different kinds of rotor vibration signals. Researchers have built different types of models when dealing with dynamic problems of rotor systems [7-9]. The finite element model turn the continuous flexible structure into finite discrete elements, obtains rich internal information, and are also convenient to add nonlinear forces, which is suitable for numerical calculation [10]. Processing the flexible parts as finite elements and the parts of large stiffness as discrete lumped masses could reduce the complexity of the system model and ensure the accuracy of simulation results at the same time, making it a more efficient modeling approach. In this paper, a finite element dual-rotor system model is established under an inertia coordinate. The rotors are modelled by beam elements and the bearings are using a nonlinear multi-body dynamic model. Unbalance responses under a fixed rotor speed or a varying speed are acquired by solving the system model using the explicit Newmark- β method. The varying rotor speed is realized by solving the rotation equations coupled with the system dynamic equations. Vibration displacement signals acquired from the dual-rotor system are then analyzed by frequency slice wavelet transform to obtain the variation of the main frequency components over time. Time-frequency contours of the transient signals are plotted on which the time-frequency domain segmentation could be based. The desired signal components are reconstructed using the inverse transform of FSWT. 2. DYNAMIC MODEL OF THE DUAL-ROTOR SYSTEM The dual-rotor system consists of two rotors connected by an inter-shaft bearing. Each rotor is composed of a flexible shaft, rigid discs and bearings. The schematic of the dual-rotor system is shown in Figure 1. The shafts are modelled with finite element method using the 3-D elastic beam element. The beam element has 2 nodes and each node has 6 degree of freedoms (DOFs), which are translations in the X, Y and Z directions and rotations around the X, Y and Z directions. The DOFs and the motion equation of the beam element are: 𝑞e = [𝑢𝑥1, 𝑢𝑦1, 𝑢𝑧1, 𝜃𝑥1, 𝜃𝑦1, 𝜃𝑧1, 𝑢𝑥2, 𝑢𝑦2, 𝑢𝑧2, 𝜃𝑥2, 𝜃𝑦2, 𝜃𝑧2] 𝑇 (1) 𝑀𝑒?̈?𝑒 + (−𝜔𝐺𝑒)?̇?𝑒 + 𝐾𝑒𝑞𝑒 = 0 (2) 𝑀𝑒 is the consistent mass matrix, 𝐺𝑒 is the gyroscopic matrix, 𝐾𝑒 is the stiffness matrix and 𝜔 is the rotational speed. The rigid discs is modelled as discrete lumped masses using the 3-D mass element, which has one node with 6 DOFs. The DOFs and the motion equation of the mass element are: 𝑞e = [𝑢𝑥, 𝑢𝑦, 𝑢𝑧, 𝜃𝑥, 𝜃𝑦, 𝜃𝑧] 𝑇 (3) 𝑀𝑒?̈?𝑒 + (−𝜔𝐺𝑒)?̇?𝑒 = 𝑄𝑒 (4) 𝑄𝑒 is the external force vector and could be unbalance forces or other types of excitation. Use the element assembly method, the motion equations of the rotors could be derived as 𝑀𝑖 𝑟?̈?𝑖 𝑟 + (𝐶𝑖 𝑟 − 𝜔𝑖𝐺𝑖 𝑟)?̇?𝑖 𝑟 + 𝐾𝑖 𝑟𝑞𝑖 𝑟 = 𝑄𝑖 𝑟, 𝑖 = 1,2 (5) 𝐶𝑟 is the damping matrix, here using Rayleigh damping which is in the form of proportional damping. Figure 1. Dual-rotor system and nonlinear multi-body dynamic model of rolling element bearing. 260 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The rolling element bearings supporting the rotors are modelled using a kind of nonlinear multi-body dynamic model, which is assumed as nonlinear springs with no mass and evenly arranged along the rolling element raceway [11, 12] (Figure 1). When the rotation position of the rotor is 𝜃(𝑡), the cage position is 𝜃𝑐(𝑡) and the angular position of the 𝑖th rolling element is 𝜃𝑖(𝑡). 𝜃𝑐(𝑡) = 𝜃(𝑡) 2 (1 − 𝑑 𝐷 cos 𝛼) (6) 𝜃𝑖(𝑡) = 2𝜋(𝑖−1) 𝑍 + 𝜃𝑐(𝑡) + 𝜃0 + (0.5 − 𝑟𝑎𝑛𝑑) × 𝜃𝑠𝑙𝑖𝑝, 𝑖 = 1,2, … , 𝑍 (7) 𝜃0 is the initial position of the first rolling element. (0.5 − 𝑟𝑎𝑛𝑑) × 𝜃𝑠𝑙𝑖𝑝 is the phase deviation caused by slippage. If the displacement of the shaft (𝑥𝑠, 𝑦𝑠) and the outer race (𝑥𝑜, 𝑦𝑜) is known, the contact deformation for the 𝑖th rolling element is 𝛾𝑖 = (𝑥𝑠 − 𝑥𝑜) cos 𝜃𝑖 + (𝑦𝑠 − 𝑦𝑜) sin 𝜃𝑖 − 𝛿 𝑖 = 1, 2, … 𝑍 (8) 𝛿 is the initial clearance between the rolling elements and the bearing races. The total bearing force could be derived in the form of external forces by summing the contact force of each rolling element. 𝐹𝐵𝑥 = −𝑘𝑏 ∑ (𝜆𝑖𝛾𝑖) 𝑛 cos 𝜃𝑖 𝑍 𝑖=1 , 𝐹𝐵𝑦 = −𝑘𝑏 ∑ (𝜆𝑖𝛾𝑖) 𝑛 sin 𝜃𝑖 𝑍 𝑖=1 (9) 𝑘𝑏 is the contact stiffness factor. 𝜆𝑖 is a coefficient for judging whether the rolling element deformation is positive or not. 𝑛 is the exponent of Hertz contact, taking 3/2 for ball bearings and 10/9 for roller bearings. The rotor system model is established by assembling all the finite elements and adding the external forces. The generalized external force vector of the system contains the unbalance forces, the weight forces and the nonlinear forces of the rolling element bearings. 𝑀𝑠?̈?𝑠 + (𝐶𝑠 − 𝜔1𝐺1 𝑟 − 𝜔2𝐺2 𝑟)?̇?𝑠 + 𝐾𝑠𝑞𝑠 = 𝑄𝑠 (10) 𝑄𝑠 = 𝐹𝑢 + 𝑊 + 𝐹𝐵 (11) The unbalance force on the disc is the main excitation of this dynamic model. When the rotation position of the rotor is 𝜃(𝑡), the rotation speed is 𝜔(𝑡) = ?̇?(𝑡), then unbalance forces will be 𝐹𝑢𝑖𝑦 = 𝑚𝑑𝑒[?̇?(𝑡)] 2 cos[𝜃(𝑡)] , 𝐹𝑢𝑖𝑧 = 𝑚𝑑𝑒[?̇?(𝑡)] 2 sin[𝜃(𝑡)] (12) 𝑚𝑑 is the disc mass and 𝑒 is the disc eccentricity. If the rotor speed is constant, 𝜃(𝑡) = 𝜔𝑡. But if the rotating speed changes over time, the angular velocity and displacement of the rotors could be obtained by solving the rotation equations. 𝐼𝑖 𝑟𝜃?̈? + 𝑐𝑖 𝑟𝜃?̇? + 𝑘𝑖 𝑟𝜃𝑖 = 𝑇𝑖 𝑟, 𝑖 = 1, 2 (13) 𝐼𝑖 𝑟 is the total moment of inertia of a rotor, 𝑐𝑖 𝑟 the rotational damping and 𝑘𝑖 𝑟 the rotational stiffness. 𝑇𝑖 𝑟 is the load torque, which could be controlled to control the movement of the rotor. 3. FREQUENCY SLICE WAVELET TRANSFORM For a signal 𝑓(𝑡), ?̂?(𝜔) is its frequency slice function (FSF) and the Fourier transformation of 𝑝(𝑡). The frequency slice wavelet transform (FSWT) of 𝑓(𝑡) is 𝑊(𝑡, 𝜔, 𝜆, 𝜎) = 1 2𝜋 𝜆 ∫ 𝑓(𝑢)?̂?∗ ( 𝑢−𝜔 𝜎 ) 𝑒𝑖𝑢𝑡𝑑𝑢 +∞ −∞ (14) where the scale 𝜎 ≠ 0 and energy coefficient 𝜆 ≠ 0 are constants or functions of 𝜔 and 𝑡. ?̂?∗(𝜔) is the conjugate of ?̂?(𝜔). Define another scale κ = 𝜔 𝜎⁄ , which is not directly related to 𝜔 and 𝑢, and can be adjusted to tune the transform to be more sensitive to frequency or more sensitive to time. Then FSWT is in the form as 𝑊(𝑡, 𝜔, 𝜆, 𝜅) = 1 2𝜋 𝜆 ∫ 𝑓(𝑢)?̂?∗ (𝜅 𝑢−𝜔 𝜔 ) 𝑒𝑖𝑢𝑡𝑑𝑢 +∞ −∞ (15) It is impossible to have high level of resolution in time and frequency domain at the same time due to the Heisenberg 261 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management uncertainty principle. So two eclectic parameters of a signal are defined to estimate the scale parameter κ and have a compromise between time and frequency resolution. One is the frequency resolution ratio 𝜂 = 𝛥𝜔 𝜔⁄ (16) The other is the expected response ratio of amplitude 0 < 𝜐 ≤ 1 (17) It is recommended to use the inequality below to estimate the scale parameter 𝜅: 𝜅 ≥ √(1 𝜐⁄ )2 − 1 𝜂⁄ (18) Since the time and frequency behaviors of a signal are not independent, the information of FSWT is redundant. There are various forms of inverse transforms to rebuild the original signal, and one of the simplest forms is often used and shown below. If the frequency slice function ?̂?(𝜔) satisfies ?̂?(0) = 1, the original signal 𝑓(𝑡) could be reconstructed as 𝑓(𝑡) = 1 2𝜋𝜆 ∫ ∫ 𝑊(𝑡, 𝜔, 𝜆, 𝜎)𝑒𝑖𝜔(𝑡−𝜏)𝑑𝜏 ∞ −∞ 𝑑𝜔 ∞ −∞ (19) It can be seen that the inverse transform has no relation with the frequency slice function 𝑝(𝑡) or ?̂?∗(𝜔). In any specific time-frequency region (𝑡1, 𝑡2, 𝜔1, 𝜔2), the signal component could be reconstructed [13] as 𝑓𝑥(𝑡) = 1 2𝜋𝜆 ∫ ∫ 𝑊(𝑡, 𝜔, 𝜆, 𝜎)𝑒𝑖𝜔(𝑡−𝜏)𝑑𝜏 𝑡2 𝑡1 𝑑𝜔 𝜔2 𝜔1 (20) For FSWT is used for processing discrete signals here, the discrete FSWT is 𝑊(∙, 𝑘) = 𝜆𝐹−1{𝐹{𝑓}.∗ 𝑃𝑘}, 𝑘 = 0,1, … , 𝑁 − 1 (21) 𝑃𝑘 is the discrete FSF and could be seen as a series of filters. 𝐹 is the Fourier transform and 𝐹 −1 the inverse Fourier transform, which could be calculated by FFT and IFFT. Correspondingly, the discrete inverse transform of FSWT is 𝑓(∙) = 1 𝜆𝑁 𝐹−1{∑ 𝑊(𝑙, 𝑘)𝑒−𝑖(2𝜋 𝑁⁄ )𝑙𝑘, 𝑘 = 0,1, … , 𝑁 − 1 𝑁𝑟−1 𝑙=0 } (22) 4. SIMULATION RESULT The nonlinear equation of the dual-rotor system model is solved by the explicit Newmark-β method. This numerical integration approach is suitable for obtaining the dynamic response of a nonlinear system. Then the displacement, velocity and acceleration at each node of the finite element model can be obtained. In a dual-rotor system, the high-pressure rotor and the low-pressure rotor both have unbalanced excitation forces. Usually the rotating speeds of the two rotors are different, and there is a certain relationship between them. Thus the excitation frequencies of the two rotors are also different. Here the speed relationship is taken as a fixed speed ratio of 1.5, either at simulations of constant rotor speed or variable rotor speed. Calculate the steady-state unbalance response of the dual-rotor system at a fixed speed. The rotating speed is (60, 90) Hz, and the unbalanced excitation of the high-pressure rotor and the low-pressure rotor exist at the same time. The vibration displacement waveform of the low-pressure rotor’s second disc in the horizontal direction and its frequency spectrum are shown in Figure 2. No clear nonlinear phenomenon appears in the shaft displacement. Both frequencies of the excitations are reflected on the same node due to the coupling effect of the inter-shaft bearing. If the load torque of the rotor fluctuates, the rotor speed will fluctuate accordingly. Figure 3 shows the situation when the rotor load torques occur random fluctuations, which leads to changes in the amplitude and frequency of the vibration signal. The peaks of the rotational frequencies in the spectrum are wider than ones in the fixed-speed vibration spectrum. Figure 4 shows an experimental signal acquired from a dual-rotor test rig [14], whose rotating speed is about (40.5, 54.0) Hz. Due to the mechanical dimensional errors and the speed fluctuate of the test rig, the experimental signal is not as smooth as the fixed-speed simulated signal. But the two main frequencies are clear, and have relatively wide peaks in the spectrum, which is similar to the simulated signal of fluctuating speed. 262 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 5 Figure 2. Steady-state unbalance response of the dual-rotor system. Figure 3. Unbalance response of the dual-rotor system with fluctuating rotor speed. Figure 4. Experimental signal from a dual-rotor test rig. The accelerating rotation process with a fixed angular acceleration is simulated, with the angular acceleration of (20, 30) Hz/s. Figure 5 is the simulated displacement signal of the dual-rotor system with only the unbalance excitation of the low-pressure rotor, and the one with only the unbalance excitation of the high-pressure rotor. It can be seen that different natural frequencies of the system are excited by the unbalance force of the low-pressure rotor or the high-pressure rotor. When the rotor is passing a natural frequency, known as a critical speed of the rotor system [15], the vibration amplitude first increase gradually, then decrease quickly after the peak amplitude. After the rotor passes the critical speed, the vibration at the natural frequency is in the decaying stage, and is synthesized with the higher frequency vibration, which results in the occurrence of the beat vibration. Time-frequency contours of the speed-up signals in the region of (0s, 5s, 0Hz, 150Hz) drawn using FSWT are displayed in Figure 6. Take 𝜂 = 0.1 and 𝜐 = 0.5, and it is recommended that 𝜅 ≥ 10√3. Since there is much 263 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management uncertainty in the selection of the scale parameter 𝜅, it should be adjusted based on the FSWT result to achieve a balance between the time and frequency resolution. Take 𝜅 = 30, and the FSWT results show the relationship between the signal frequency components and time clearly. The white line in the image shows the increasing of the main vibration frequency during the acceleration process. The red peaks show the rotating speed passing a critical speed. Figure 5. Simulated displacement signals with the unbalance excitation of the low-pressure rotor or the high-pressure rotor. Figure 6. Time-frequency contours of the speed-up signals with the unbalance excitation of one rotor. Figure 7 is the simulated displacement signal of the dual-rotor system with the unbalance excitation of both the low- pressure rotor and the high-pressure rotor. So natural frequencies of the both rotors could be excited. The FSWT contour of the signal is shown in Figure 8, where the time-frequency and the scale parameter remain the same. Two white lines show the two main frequencies changing at the same time, while the three red peaks show the first three orders of critical speeds of the dual-rotor system. Short-time Fourier transform (STFT) is used for time-frequency analysis and its contour is also shown in Figure 8. Comparing with FSWT, although the frequency trend can be seen, the time-frequency resolution of the STFT output is much lower. This shortcoming might become a bigger problem when the frequency components become more complex. Based on the FSWT result, the vibration signal could be decomposed and reconstructed. Split the time-frequency contour into two parts and each part contains the main frequency of one rotor like in Figure 6. By setting the FSWT values of one part all to zero, the other part could be rebuilt by the inverse transform of FSWT. The two reconstructed sub-signals are shown in Figure 9, which are very close to the signals excited by one rotor unbalance above. Adding the two reconstructed sub-signal as one signal, which is shown in Figure 9 and is very close to the original signal. It is noticed that although the reconstruction requires a square time-frequency region, any shape of time-frequency part could rebuild a signal by setting the unwanted part to zero. So the decomposition and reconstruction in the time- 264 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management frequency domain of a signal using FSWT is quite flexible. Figure 7. Simulated displacement signal with the unbalance excitation of two rotors. Figure 8. Time-frequency contours of the speed-up signal with the unbalance excitation of two rotors using FSWT and STFT. Figure 9. Signal decomposition and reconstruction using FSWT. (1) Origin signal; (2) rebuilt sub-signal of low-pressure rotor; (3) rebuilt sub-signal of high-pressure rotor; (4) rebuilt signal. 5. CONCLUSION This paper presents a dual-rotor system model built by finite element method and a time-frequency method to analyze 265 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management the varying speed vibration response of the system. In the system model, two rotors are supported on rolling element bearings and coupled by an inter-shaft bearing, where the shafts are modelled by beam elements and the bearings are using a nonlinear multi-body dynamic model. Unbalance responses under a fixed rotor speed or a varying speed are simulated and the varying speed is realized by solving the rotation equations of the rotors. The acceleration process will pass through the first few critical speeds, some of which are excited by the low-pressure unbalance and the others are excited by the high-pressure unbalance. A time-frequency analysis approach called frequency slice wavelet transform is utilized to obtain the variation of the main frequency components over time in a transient signal. The time-frequency contours plotted by FSWT have a higher time-frequency resolution than the traditional short-time Fourier transform. Upon the FSWT coefficients time- frequency domain segmentation could be performed flexibly and desired signal components could be reconstructed by the inverse transform of FSWT. For the speed-up signal of the dual-rotor system, the vibrations of natural frequencies excited by different rotors are separated and rebuilt with a higher degree of reduction. REFERENCES [1] Z. M. Lin J, "Dynamic signal analysis for speed-varying machinery: A review (in Chinese)," Sci Sin Tech, vol. 45, pp. 669–686, 2015. [2] Z. Feng, M. Liang and F. Chu, "Recent advances in time–frequency analysis methods for machinery fault diagnosis: A review with application examples," Mechanical Systems and Signal Processing, vol. 38, pp. 165-205, 2013. [3] A. K. S. Jardine, D. Lin and D. Banjevic, "A review on machinery diagnostics and prognostics implementing condition-based maintenance," Mechanical Systems and Signal Processing, vol. 20, pp. 1483-1510, 2006. [4] C. Torrence and G. P. Compo, "A Practical Guide to Wavelet Analysis," vol. 79, pp. 61 - 78, 1998. [5] Z. Yan, A. Miyamoto and Z. Jiang, "Frequency slice wavelet transform for transient vibration response analysis," Mechanical Systems and Signal Processing, vol. 23, pp. 1474-1489, 2009. [6] Z. Yan, A. Miyamoto, Z. Jiang, and X. Liu, "An overall theoretical description of frequency slice wavelet transform," Mechanical Systems and Signal Processing, vol. 24, pp. 491-507, 2010. [7] G. Chen, "Vibration modelling and verifications for whole aero-engine," Journal of Sound and Vibration, vol. 349, pp. 163-176, 2015. [8] P. M. Hai and P. Bonello, "An impulsive receptance technique for the time domain computation of the vibration of a whole aero- engine model with nonlinear bearings," Journal of Sound and Vibration, vol. 318, pp. 592-605, 2008. [9] J. J. Sinou, "Non-linear dynamics and contacts of an unbalanced flexible rotor supported on ball bearings," Mechanism and Machine Theory, vol. 44, pp. 1713-1732, 2009. [10] H. D. Chiang, C. Hsu, W. Jeng, S. Tu, W. Li, A. S. O. M. Engineers, and Null, "Turbomachinery Dual Rotor-Bearing System Analysis," in American Society of Mechanical Engineers(ASME) Turbo Expo 2002 v.4 pt.B:Ceramics Industrial and Cogeneration Structures and Dynamics Amsterdam,The Netherlands, 2002, p. 8. [11] N. Sawalhi and R. B. Randall, "Simulating gear and bearing interactions in the presence of faults," Mechanical Systems and Signal Processing, vol. 22, pp. 1924-1951, 2008. [12] K. Kappaganthu and C. Nataraj, "Nonlinear modeling and analysis of a rolling element bearing with a clearance," Communications in Nonlinear Science and Numerical Simulation, vol. 16, pp. 4134-4145, 2011. [13] D. Chendong, G. Qiang and X. Xianfeng, "Generator Unit Fault Diagnosis Using the Frequency Slice Wavelet Transform Time- frequency Analysis Method," Proceedings of the CSEE, vol. 33, p. I0014, 2013. [14] N. Wang, D. Jiang and K. Behdinan, "Vibration response analysis of rubbing faults on a dual-rotor bearing system," Archive of Applied Mechanics, vol. 87, pp. 1891-1907, 2017. [15] M. Guskov, J. J. Sinou, F. Thouverez, and O. S. Naraikin, "Experimental and Numerical Investigations of a Dual-Shaft Test Rig with Intershaft Bearing," International Journal of Rotating Machinery, vol. 2007, pp. 1-12, 2007. 266 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Remaining Useful Life Prediction and Uncertainty Modelling with Bayesian Deep Learning C.J. Louw, P.S. Heyns University of Pretoria, South Africa ABSTRACT The linearly decreasing remaining useful life modeling strategy was successfully implemented on the FD001 NASA turbofan engine degradation simulation data set time series regression problem, with 4 different Bayesian Deep Learning network architectures. This included a Dense Network, Simple Recurrent Network, Gated Recurrent Unit Network and Long Short-Term Memory Network, with Monte Carlo Dropout for remaining useful life prediction and uncertainty modeling. The Long Short-Term Memory Network was the most accurate at predicting the remaining useful life from the condition monitoring information for the turbofan engine examples in the testing set, due to its advanced sequence modeling architecture. The complex distributions and the model uncertainty associated with remaining useful life predictions were successfully and efficiently modeled with Monte Carlo Dropout. This made it possible to better understand the model uncertainty associated with remaining useful life predictions, which was previously given extremely limited attention in prognostics and health management research with Deep Learning. The Long Short-Term Memory Network model uncertainty drastically decreased, and model prediction accuracy drastically increased close to failure for all the turbofan engine examples in the testing set. The Long Short-Term Memory Network could on average very accurately predict the remaining useful life for all the turbofan engine examples 50 cycles before failure in the testing set, with a mean absolute error of only 1.954 cycles. Keywords: Bayesian Deep Learning, Prognostics, Remaining Useful Life Modelling, Condition Monitoring, Pattern Recognition Corresponding author: Carel Johannes Louw (u12032876@tuks.co.za) 1. INTRODUCTION The accurate prediction and uncertainty modeling of remaining useful life (time to failure) for complex physical assets is extremely important for minimizing downtime and maintenance costs, and maximizing production output, availability, reliability and profitability [1]. The linearly decreasing remaining useful life modeling strategy [1] [2] can generally be used for the convenient remaining useful life modeling of complex physical assets, with numerous machine learning modeling techniques as a supervised time series regression problem. For this strategy the condition monitoring information time series inputs are modeled (mapped) directly on to the linearly decreasing remaining useful life time series output, for multiple examples that the complex physical asset was run to failure. This strategy does however assume that historic run to failure condition monitoring information is available and that degradation is trendable with respect to the measured condition monitoring information. The definition of failure (threshold) with respect to the measured condition monitoring information must also be consistent between examples. There are however extremely limited machine learning techniques that can accurately model long-term sequence information in condition monitoring information time series. Deep Learning with Simple Recurrent, Gated Recurrent Unit, Long Short-Term Memory Networks has achieved state-of-the-art performance on the majority of supervised machine learning sequence modeling problems [3]. There has however been extremely limited previous research done on applying Simple Recurrent, Gated Recurrent Unit, Long Short-Term Memory Networks for remaining useful life prediction and uncertainty modeling, which is crucial for asset management decision making. 267 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 2. DEEP LEARNING BACKGROUND 2.1. Deep Learning Models Supervised Deep Learning regression models consist of numerous stacked layers, where each layer has trainable parameters and a nonlinear activation function. These layers can include Dense, Simple Recurrent, Gated Recurrent Unit, Long Short-Term Memory and Regression Layers. The layers in Deep Learning regression models are stacked. This means that the output vector of a previous layer is used as the input vector for the next layer, with the objective of modeling complex input to output mappings [4]. For this paper the inputs were condition monitoring information time series and the output was a remaining useful life time series of multiple turbofan engine examples that were run to failure. 2.2. Dense Layer The Dense Layer [4] is very simple and can model general input tot output mappings, but cannot model sequence information from previous time steps. The Dense Layer output vector ℎ௜ሺ௧ሻ is calculated as shown in equation (1). Where 𝑊௜,௝ is the weight matrix, 𝑥௝ሺ௧ሻ is the input vector and 𝑏௜ is the bias vector. ℎ௜ሺ௧ሻ ൌ ReLU ቌ෍ 𝑊௜,௝𝑥௝ሺ௧ሻ ௝ ൅ 𝑏௜ቍ (1) The rectified linear unit activation function ReLUሺ𝑧௜ሻ for any vector 𝑧௜ is calculated as shown in equation (2). ReLUሺ𝑧௜ሻ ൌ ൜𝑧௜ for 𝑧௜ ൒ 00 for 𝑧௜ ൏ 0 (2) 2.3. Simple Recurrent Layer The Simple Recurrent Layer [4] can model sequence information from previous time steps, but suffers from the vanishing and exploding gradient problem [5] during training which limits its capacity to model long-term sequence information. The Simple Recurrent Layer output vector ℎ௜ሺ௧ሻ is calculated as shown in equation (3). Where 𝑊௜,௝ is the weight matrix, 𝑥௝ሺ௧ሻ is the input vector, 𝑅௜,௝ is the recurrent weight matrix, ℎ௝ሺ௧ିଵሻ is the output vector from the previous time step and 𝑏௜ is the bias vector. ℎ௜ሺ௧ሻ ൌ tanh ቌ෍ 𝑊௜,௝𝑥௝ሺ௧ሻ ௝ ൅ ෍ 𝑅௜,௝ℎ௝ሺ௧ିଵሻ ௝ ൅ 𝑏௜ቍ (3) The hyperbolic tangent activation function tanhሺ𝑧௜ሻ for any vector 𝑧௜ is calculated as shown in equation (4). tanhሺ𝑧௜ሻ ൌ ௘ ೥೔ି௘ష೥೔ ௘೥೔ା௘ష೥೔ (4) 2.4. Gated Recurrent Unit Layer The Gated Recurrent Unit Layer [6] is significantly more complex, but manages the vanishing and exploding gradient problem [5] during training with numerous gating operations that drastically increases its capacity to model relevant long-term sequence information. The Gated Recurrent Unit Layer candidate cell state vector 𝑠௜ሺ௧ሻ is calculated as shown in equation (5). Where 𝑊௜,௝௦ is the candidate cell state weight matrix, 𝑥௝ሺ௧ሻ is the input vector, 𝑅௜,௝௦ is the candidate cell state recurrent weight matrix, 𝑟௝ሺ௧ሻ is the reset gate vector, ℎ௝ሺ௧ିଵሻ is the output vector from the previous time step and 𝑏௜௦ is the candidate cell state bias vector. 268 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 𝑠௜ሺ௧ሻ ൌ tanh ቌ෍ 𝑊௜,௝௦ 𝑥௝ሺ௧ሻ ௝ ൅ ෍ 𝑅௜,௝௦ 𝑟௝ሺ௧ሻℎ௝ሺ௧ିଵሻ ൅ 𝑏௜௦ ௝ ቍ (5) The Gated Recurrent Unit Layer update gate vector 𝑢௜ሺ௧ሻ is calculated as shown in equation (6). Where 𝑊௜,௝௨ is the update gate weight matrix, 𝑥௝ሺ௧ሻ is the input vector, 𝑅௜,௝௨ is the update gate recurrent weight matrix, ℎ௝ሺ௧ିଵሻ is the output vector from the previous time step and 𝑏௜௨ is the update gate bias vector. 𝑢௜ሺ௧ሻ ൌ 𝜎 ቌ෍ 𝑊௜,௝௨ 𝑥௝ሺ௧ሻ ௝ ൅ ෍ 𝑅௜,௝௨ ℎ௝ሺ௧ିଵሻ ൅ 𝑏௜௨ ௝ ቍ (6) The sigmoid activation function 𝜎ሺ𝑧௜ሻ for any vector 𝑧௜ is calculated as shown in equation (7). 𝜎ሺ𝑧௜ሻ ൌ 11 ൅ 𝑒ି௭೔ (7) The Gated Recurrent Unit Layer reset gate vector 𝑟௜ሺ௧ሻ is calculated as shown in equation (8). Where 𝑊௜,௝௥ is the reset gate weight matrix, 𝑥௝ሺ௧ሻ is the input vector, 𝑅௜,௝௥ is the reset gate recurrent weight matrix, ℎ௝ሺ௧ିଵሻ is the output vector from the previous time step and 𝑏௜௥ is the reset gate bias vector. 𝑟௜ሺ௧ሻ ൌ 𝜎 ቌ෍ 𝑊௜,௝௥ 𝑥௝ሺ௧ሻ ௝ ൅ ෍ 𝑅௜,௝௥ ℎ௝ሺ௧ିଵሻ ൅ 𝑏௜௥ ௝ ቍ (8) The Gated Recurrent Unit Layer cell state vector 𝑐௜ሺ௧ሻ is calculated (updated) as shown in equation (9). Where 𝑢௜ሺ௧ሻ is the update gate vector, 𝑠௜ሺ௧ሻ is the candidate cell state vector and 𝑐௜ሺ௧ିଵሻ is the cell state vector from the previous timestep. 𝑐௜ሺ௧ሻ ൌ 𝑢௜ሺ௧ሻ𝑠௜ሺ௧ሻ ൅ ቀ1 െ 𝑢௜ሺ௧ሻቁ 𝑐௜ሺ௧ିଵሻ (9) The Gated Recurrent Unit Layer output vector ℎ௜ሺ௧ሻis then finally calculated as shown in equation (10). Where 𝑐௜ሺ௧ሻ isthe cell state vector. ℎ௜ሺ௧ሻ ൌ 𝑐௜ሺ௧ሻ (10) 2.5. Long Short-Term Memory Layer The Long Short-Term Memory Layer [5] is even more complex, but also manages the vanishing and exploding gradient problem [5] during training with numerous gating operations that drastically increases its capacity to model relevant long-term sequence information. The Long Short-Term Memory Layer candidate cell state vector 𝑠௜ሺ௧ሻ is calculated as shown in equation (11). Where 𝑊௜,௝௦ is the candidate cell state weight matrix, 𝑥௝ሺ௧ሻ is the input vector, 𝑅௜,௝௦ is the candidate cell state recurrent weight matrix, ℎ௝ሺ௧ିଵሻ is the output vector from the previous time step and 𝑏௜௦ is the candidate cell state bias vector. 𝑠௜ሺ௧ሻ ൌ tanh ቌ෍ 𝑊௜,௝௦ 𝑥௝ሺ௧ሻ ௝ ൅ ෍ 𝑅௜,௝௦ ℎ௝ሺ௧ିଵሻ ൅ 𝑏௜௦ ௝ ቍ (11) 269 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The Long Short-Term Memory Layer update gate vector 𝑢௜ሺ௧ሻ is calculated as shown in equation (12). Where 𝑊௜,௝௨ is the update gate weight matrix, 𝑥௝ሺ௧ሻ is the input vector, 𝑅௜,௝௨ is the update gate recurrent weight matrix, ℎ௝ሺ௧ିଵሻ is the output vector from the previous time step and 𝑏௜௨ is the update gate bias vector. 𝑢௜ሺ௧ሻ ൌ 𝜎 ቌ෍ 𝑊௜,௝௨ 𝑥௝ሺ௧ሻ ௝ ൅ ෍ 𝑅௜,௝௨ ℎ௝ሺ௧ିଵሻ ൅ 𝑏௜௨ ௝ ቍ (12) The Long Short-Term Memory Layer forget gate vector 𝑓௜ሺ௧ሻ is calculated as shown in equation (13). Where 𝑊௜,௝௙ is the forget gate weight matrix, 𝑥௝ሺ௧ሻ is the input vector, 𝑅௜,௝௙ is the forget gate recurrent weight matrix, ℎ௝ሺ௧ିଵሻ is the output vector from the previous time step and 𝑏௜௙ is the forget gate bias vector. 𝑓௜ሺ௧ሻ ൌ 𝜎 ቌ෍ 𝑊௜,௝௙ 𝑥௝ሺ௧ሻ ௝ ൅ ෍ 𝑅௜,௝௙ ℎ௝ሺ௧ିଵሻ ൅ 𝑏௜௙ ௝ ቍ (13) The Long Short-Term Memory Layer output gate vector 𝑜௜ሺ௧ሻ is calculated as shown in equation (14). Where 𝑊௜,௝௢ is the output gate weight matrix, 𝑥௝ሺ௧ሻ is the input vector, 𝑅௜,௝௢ is the output gate recurrent weight matrix, ℎ௝ሺ௧ିଵሻ is the output vector from the previous time step and 𝑏௜௢ is the output gate bias vector. 𝑜௜ሺ௧ሻ ൌ 𝜎 ቌ෍ 𝑊௜,௝௢ 𝑥௝ሺ௧ሻ ௝ ൅ ෍ 𝑅௜,௝௢ ℎ௝ሺ௧ିଵሻ ൅ 𝑏௜௢ ௝ ቍ (14) The Long Short-Term Memory Layer cell state vector 𝑐௜ሺ௧ሻ is calculated as shown in equation (15). Where 𝑢௜ሺ௧ሻ is the update gate vector, 𝑠௜ሺ௧ሻ is the candidate cell state vector, 𝑓௜ሺ௧ሻ is the forget gate vector and 𝑐௜ሺ௧ିଵሻ is the cell statevector from the previous time step. 𝑐௜ሺ௧ሻ ൌ 𝑢௜ሺ௧ሻ𝑠௜ሺ௧ሻ ൅ 𝑓௜ሺ௧ሻ𝑐௜ሺ௧ିଵሻ (15) The Long Short-Term Memory Layer output vector ℎ௜ሺ௧ሻis then finally calculated as shown in equation (16). Where 𝑜௜ሺ௧ሻ is the output gate vector and 𝑐௜ሺ௧ሻ is the cell state vector. ℎ௜ሺ௧ሻ ൌ 𝑜௜ሺ௧ሻ tanh ቀ𝑐௜ሺ௧ሻቁ (16) 2.6. Regression Layer The Regression Layer [4] is the last layer in a supervised Deep Learning regression model and is just a Dense Layer with a linear activation function. The Regression Layer output vector 𝑦௜ሺ௧ሻ is calculated as shown in equation (17). Where 𝑊௜,௝ is the weight matrix, 𝑥௝ሺ௧ሻ is the input vector and 𝑏௜ is the bias vector. 𝑦௜ሺ௧ሻ ൌ ෍ 𝑊௜,௝𝑥௝ሺ௧ሻ ௝ ൅ 𝑏௜ (17) 270 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 2.7. Deep Learning Optimization Algorithms There are numerous optimization algorithms that can be used to train the weight matrices, recurrent weight matrices and bias vectors of Deep Learning models on training set examples. The most popular optimization algorithms in Deep Learning were investigated, which included Stochastic Gradient Descent with Momentum, AdaGrad, RMSProp and Adam [4]. The Adam optimization algorithm generally produced the best results for the investigated FD001 NASA turbofan engine degradation simulation data set time series regression problem. 2.8. Dropout Regularization Deep Learning models are unfortunately prone to overfitting on training set examples. This means that trained Deep Learning models can generally very accurately model previously seen training set examples, but can generally not generalize for previously unseen testing set examples. Dropout [7] is a stochastic regularization technique which reduces overfitting and increases the generalization ability of the Deep Learning models for more accurate predictions on previously unseen testing set examples. With Dropout regularization a fraction of elements in numerous output vectors in different layers of a Deep Learning model are randomly deactivated or dropped out (by zero multiplication) during a training iteration. This greatly reduces overfitting and increases the generalization ability of the Deep Learning model. Dropout regularization can also be interpreted as an efficient form of ensemble learning, since the Deep Learning model changes with each training iteration. This greatly improves modeling performance and generalization on the training and testing set. 2.9. Bayesian Deep Learning Deep Learning models can generally not model the complex distributions and model uncertainty associated with predictions. Monte Carlo Dropout [8] however is a new and efficient Bayesian Deep Learning technique that approximates the complex distributions and model uncertainty associated with predictions. With Monte Carlo Dropout a Deep Learning model is trained with the stochastic Dropout regularization technique in order to reduce overfitting. Dropout is generally only used during model training and turned off during model testing. With Monte Carlo Dropout however, Dropout is kept on during model testing and multiple stochastic predictions are made for the same input, which results in a complex output distribution. The average of the complex distribution can then be calculated in order to make a prediction. More importantly, the complex distribution can then be used in order to understand the model uncertainty associated with a prediction by calculating confidence intervals. 3. DATA SET DESCRIPTION The FD001 NASA turbofan engine degradation simulation data set time series regression problem [9] was investigated for this paper, because of its high popularity in prognostics and health management research [1], high complexity and general comparability to other engineering systems. The objective of the NASA turbofan engine degradation simulation data set time series regression problem, was to model the remaining useful life time series output from 24 condition monitoring information time series inputs, for multiple turbofan engine examples that were run to failure in the training set. The training set therefore represents previously seen historic run to failure condition monitoring information for multiple turbofan engine examples. The model then had to predict the remaining useful life time series output from 24 unseen condition monitoring information time series inputs for multiple turbofan engine examples that were run to failure in the testing set. The testing set therefore represents previously unseen real world operational run to failure condition monitoring information for multiple turbofan engine examples. The 24 condition monitoring information time series inputs that were measured for each of the turbofan engines that was run to failure in the training and testing set, included numerous temperatures, pressures, fan speeds, efficiencies (ratios), flow rates and other measurements. The condition monitoring information time series inputs were averaged (by the data set authors) after each operational cycle for each of the turbofan engines that was run to failure in the training and testing set. The training and testing set included 80 and 20 turbofan engine examples that were run to failure respectively. The high complexity of the problem is mainly due to unknown amounts of initial wear and the relatively high variance in failure times of the turbofan engine examples that were run to failure. The mean, standard deviation, minimum and maximum failure times of the turbofan engine examples were 206.31, 46.34, 128 and 362 cycles respectively. The condition monitoring information measurements were also extremely noisy which made modeling the data set even more challenging. 271 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 4. MODELING STRATEGY The linearly decreasing remaining useful life time series modeling strategy [1] [2] was used for the FD001 NASA turbofan engine degradation simulation data set time series regression problem. The condition monitoring information time series inputs were therefore trained (mapped) directly on the linearly decreasing remaining useful life time series output with numerous Bayesian Deep Learning models, for each turbofan engine example that was run to failure in the training set. The linearly decreasing remaining useful life time series modeling strategy does however make the important and often overlooked assumption that the definition of failure (threshold) with respect to the measured condition monitoring information is consistent for all the training and testing set turbofan engine examples. The linearly decreasing remaining useful life time series modeling strategy was implemented with 4 different Bayesian Deep Learning models. This included a Feedforward Network, Simple Recurrent Network, Gated Recurrent Unit Network and Long Short-Term Memory Network. The Feedforward Network (FN) architecture consisted of the following layers from input to output: 4 Dense Layers and 1 Regression layer. The Simple Recurrent Network (SRN) architecture consisted of the following layers from input to output: 1 Dense Layer, 2 Simple Recurrent layers, 1 Dense Layer and 1 Regression layer. The Gated Recurrent Unit Network (GRUN) architecture consisted of the following layers from input to output: 1 Dense Layer, 2 Gated Recurrent Unit layers, 1 Dense Layer and 1 Regression layer. The Long Short-Term Memory Network (LSTMN) architecture consisted of the following layers from input to output: 1 Dense Layer, 2 Long Short-Term Memory layers, 1 Dense Layer and 1 Regression layer. The models were all trained on the training set with the Adam optimization algorithm in TensorFlow and regularized with Dropout in each layer, except the regression layer. The remaining useful life distribution at each time step for each turbofan engine in the testing set, was obtained from 10,000 Monte Carlo Dropout predictions. The remaining useful life distribution at each time step for each turbofan engine in the testing set, was then averaged in order to make a remaining useful life (RUL) prediction, and the 5% and 95% percentiles calculated for lower and upper confidence intervals (CI) respectively. 5. RESULTS The remaining useful life prediction performance of the 4 different Bayesian Deep Learning models were compared by calculating the mean squared error and mean absolute error, between the actual and predicted remaining useful life for all the turbofan engine examples in the testing set. Table 1 compares the testing set mean squared error performance in cycles2 and the testing set mean absolute error performance in cycles for the 4 different Bayesian Deep Learning models. Table 1: The testing set mean squared error performance in cycles2 and the testing set mean absolute error performance in cycles for the 4 different Bayesian Deep Learning models Deep Learning Model Testing Set Mean Squared Error [Cycles2] Testing Set Mean Absolute Error [Cycles] FN 2303.315 35.704 SRN 1229.455 26.495 GRUN 955.236 22.123 LSTMN 716.467 17.871 The remaining useful life prediction performance of the 4 different Bayesian Deep Learning models were visualized by plotting the predicted remaining useful life versus time in cycles for 4 turbofan engine examples in the testing set. Figure 1 shows the predicted remaining useful life versus time in cycles of the 4 different Bayesian Deep Learning models for 4 turbofan engine examples in the testing set. 272 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 1: The predicted remaining useful life versus time in cycles of the 4 different Bayesian Deep Learning models for 4 turbofan examples in the testing set The remaining useful life prediction with confidence intervals of the 4 different Bayesian Deep Learning models were also visualized by plotting the remaining useful life with confidence intervals versus time in cycles for another turbofan engine example in the testing set. Figure 2 shows the predicted remaining useful life with confidence intervals versus time in cycles of the 4 different Bayesian Deep Learning models for a turbofan engine example in the testing set. 0 50 100 150 200 250 300 0 50 100 150 200 250 300 Re ma ini ng Us efu l L ife [C ycl es] Time [Cycles] Actual RUL FN Predicted RUL SRN Predicted RUL GRUN Predicted RUL LSTMN Predicted RUL 0 50 100 150 200 250 0 50 100 150 200 Re ma ini ng Us efu l L ife [C ycl es] Time [Cycles] Actual RUL FN Predicted RUL SRN Predicted RUL GRUN Predicted RUL LSTMN Predicted RUL 0 50 100 150 200 250 0 50 100 150 200 250 Re ma ini ng Us efu l L ife [C ycl es] Time [Cycles] Actual RUL FN Predicted RUL SRN Predicted RUL GRUN Predicted RUL LSTMN Predicted RUL 0 50 100 150 200 250 300 0 50 100 150 200 250 300 Re ma ini ng Us efu l L ife [C ycl es] Time [Cycles] Actual RUL FN Predicted RUL SRN Predicted RUL GRUN Predicted RUL LSTMN Predicted RUL 273 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 2: The predicted remaining useful life with confidence intervals versus time in cycles of the 4 different Bayesian Deep Learning models for a turbofan engine example in the testing set The turbofan engine degradation with respect to the condition monitoring information generally had significantly increased trendability 50 cycles before failure and significantly increased remaining useful life prediction performance. The remaining useful life prediction performance of the 4 different Bayesian Deep Learning models were therefore also compared by calculating the mean squared error and mean absolute error, between the actual and predicted remaining useful life for all the turbofan engine examples in the testing set 50 cycles before failure. Table 2 compares the testing set mean squared error performance in cycles2 and the testing set mean absolute error performance in cycles for the 4 different Bayesian Deep Learning models 50 cycles before failure. 0 50 100 150 200 250 300 0 50 100 150 200 250 Re ma ini ng Us efu l L ife [C ycl es] Time [Cycles] Actual RUL FN Predicted RUL FN Predicted RUL CI 0 50 100 150 200 250 0 50 100 150 200 250 Re ma ini ng Us efu l L ife [C ycl es] Time [Cycles] Actual RUL SRN Predicted RUL SRN Predicted RUL CI 0 50 100 150 200 250 300 0 50 100 150 200 250 Re ma ini ng Us efu l L ife [C ycl es] Time [Cycles] Actual RUL GRUN Predicted RUL GRUN Predicted RUL CI 0 50 100 150 200 250 300 0 50 100 150 200 250 Re ma ini ng Us efu l L ife [C ycl es] Time [Cycles] Actual RUL LSTMN Predicted RUL LSTMN Predicted RUL CI 274 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 2: The testing set mean squared error performance in cycles2 and the testing set mean absolute error performance in cycles for the 4 different Bayesian Deep Learning models 50 cycles before failure Deep Learning Model Testing Set Mean Squared Error [Cycles2] Testing Set Mean Absolute Error [Cycles] FN 47.061 6.561 SRN 18.082 3.597 GRUN 11.474 2.253 LSTMN 9.577 1.954 6. CONCLUSION The linearly decreasing remaining useful life modeling strategy was successfully implemented on the FD001 NASA turbofan engine degradation simulation data set time series regression problem, with 4 different Bayesian Deep Learning network architectures. This included a Dense Network, Simple Recurrent Network, Gated Recurrent Unit Network and Long Short-Term Memory Network, with Monte Carlo Dropout for remaining useful life prediction and uncertainty modeling. The Long Short-Term Memory Network was the most accurate at predicting the remaining useful life from the condition monitoring information for the turbofan engine examples in the testing set, due to its advanced sequence modeling architecture. The complex distributions and the model uncertainty associated with remaining useful life predictions were successfully and efficiently modeled with Monte Carlo Dropout. This made it possible to better understand the model uncertainty associated with remaining useful life predictions, which was previously given extremely limited attention in prognostics and health management research with Deep Learning. The Long Short-Term Memory Network model uncertainty drastically decreased, and model prediction accuracy drastically increased close to failure for all the turbofan engine examples in the testing set. The Long Short-Term Memory Network could on average very accurately predict the remaining useful life for all the turbofan engine examples 50 cycles before failure in the testing set, with a mean absolute error of only 1.954 cycles. 7. BIBLIOGRAPHY [1] Y. Lei, N. Li, L. Guo, N. Li, T. Yan and J. Lin, "Machinery health prognostics: A systematic review from data acquisition to RUL prediction," Mechanical Systems and Signal Processing, no. 104, p. 799–834, 2018. [2] F. O. Heimes, "Recurrent Neural Networks for Remaining Useful Life Estimation," International Conference On Prognostics And Health Management, pp. 1-6, 2008. [3] F. Chollet, Deep Learning with Python, 2017. [4] I. Goodfellow, Y. Bengio and A. Courville, Deep Learning, MIT Press, 2016. [5] S. Hochreiter and J. Schmidhuber, "Long Short-Term Memory," Neural Computation, vol. 9, no. 8, pp. 1735- 1780, 1997. [6] K. Cho, "Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation," Universite de Montreal, 2014. [7] N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever and R. Salakhutdinov, "Dropout: A Simple Way to Prevent Neural Networks from Overfitting," Journal of Machine Learning Research, no. 15, pp. 1929-1958, 2014. [8] Y. Gal and Z. Ghahramani, "Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning," University of Cambridge, 2015. [9] A. Saxena and K. Goebel, "Turbofan Engine Degradation Simulation Data Set, NASA Ames Prognostics Data Repository," 2008. [Online]. Available: https://ti.arc.nasa.gov/tech/dash/groups/pcoe/prognostic-data- repository/. [Accessed 3 March 2018]. 275 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Rolling Element Bearings Prognostics Using High-Frequency Spectrum of Offline Vibration Condition Monitoring Data Mehdi Behzad1, Hesam Addin Arghand2 and Abbas Rohani Bastami3 1 School of Mechanical Engineering, Sharif University of Technology, Tehran, Iran 2 School of Mechanical Engineering, Sharif University of Technology, Tehran, Iran 3 Mechanical and Energy Engineering Department, Shahid Beheshti University, Tehran, Iran ABSTRACT Remaining useful life (RUL) prediction of rolling element bearings (REBs) with offline condition monitoring (CM) data is the purpose of this paper. A data driven algorithm based on feedforward neural network (FFNN) is proposed for this aim. Since, usually the number of offline measurements are not enough, the generalized Weibull failure rated function is used for producing the auxiliary points that are employed for training. Considering the physics of the bearing degradation, vibration level in the high-frequency bandwidth of the spectrum is used as a feature and its performance in REB prognostic problem is compared with that of using popular recommended features in the diagnostic standards. Bearing accelerated life test data as well as data from two industrial bearings are used to investigate the purpose of this study. The results show that using the high-frequency vibration level features rather than the proposed frequency bandwidth in guidelines and standards for recording the vibration of rotating machines produce more accurate prediction of RUL. Keywords: Prognostics, Offline Measurement, Neural Network, High-Frequency Vibration Level, Remaining Useful Life. Corresponding author: Mehdi Behzad (m_behzad@sharif.edu) 1. INTRODUCTION Rolling element bearings (REB) are highly used components in the rotating machines. Therefore, the accurate remaining useful life (RUL) prediction of these components leads to considerable enhancement of reliability in this kind of equipment. Condition based maintenance (CBM) is the advanced method which has been developed in recent decades. Therefore, many researchers have focused on the methods for RUL prediction of REBs. In this regard, developing the data-driven approaches have been the goal of many studies in last two decades. An et al. [1] reviewed data-driven approaches as well as physics based methods. Then, they proposed some recommendations for choosing the appropriate method. Generally, the data-driven approaches are classified to two main categories: statistical methods and artificial intelligence (AI) methods [2]. The base of all the AI methods includes two main steps. At the first step, the condition monitoring (CM) data of one or more run-to-failure tests are used for training an AI model. Next, in the second step the trained AI model is used for estimating the RUL by using the CM history data [3]. Two main challenges in developing the data-driven approaches includes introducing new features and introducing more powerful algorithms. Both challenges have been of interest for researchers in this field of study. Qiu et al. [4] proposed a robust method that uses the wavelet filter for feature extraction combining with self-organized map in order to do prognostics of REBs. Gebraeel and Lawley [5] employed the vibration level in the defective frequency and its six harmonics as the features for neural network (NN) modelling. Mahammad et al. [6] used feedforward neural network (FFNN) algorithm for RUL prediction problem. They employed the Weibull hazard rate function for fitting the features to overcome their fluctuations. Tian [7] employed generalized Weibull hazard rate function for the same purpose. This fitting function improved the performance and the accuracy of the FFNN in prognostic problem. Ben Ali et al. [8] developed an algorithm based on NN classification combining with statistical features for RUL prediction of REBs. In another research, Van and Kang [9] proposed a two-stage method in order to extracting rich and more accurate features for REBs degradation. Chouri et al. [10] developed an algorithm based on supporting vector machine (SVM). They 276 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management employed eight features in the time domain and two features in the frequency domain as the inputs of their proposing algorithm. Benkedjouh et al. [11] used the energy of signal in different levels as the features. Additionally, they combined the supporting vector regression (SVR) and isometric feature mapping reduction in order to make a data-driven model. In another research, Liu et al. [12] used a data driven framework with statistical features in the time domain. Zhao et al. [13] employed the time-frequency domain features and linear regression model for RUL prediction of the REBs. Behzad et al. [14] employed vibration level in high-frequency bandwidth for tracking the degradation and RUL estimation with an AI method. Some guidelines and standards recommend to measure and record the level of the vibration in the frequency bandwidth [10,1000] Hz for monitoring in rotating machine. Therefore, this feature is used for many industrial applications. However, selecting high-frequency bandwidth for vibration condition monitoring (VCM) of REBs is more reliable. In this article, the comparison between vibration level in the mentioned frequency interval and the vibration level in the high frequency bandwidth for prognostic of REBs is studied. An appropriate algorithm for offline VCM records is developed and the proposing features capability for RUL prediction are investigated. The represented results verify the better performance of high-frequency bandwidth vibration level for REBs prognostics. 2. FEATURE SELECTION A significant challenge in data-driven prognostic methods is finding appropriate features. Since the degradation of REB is a monotonic process, a proper feature is expected to have a monotonic trend during the life of REB. In addition, it should be an indicator for degradation level of the REB. Many researchers have employed different type of features for bearing RUL prediction. However, it is still a challenge to find features with the best accuracy for the prognostics purpose. In this article, an idea based on physics of bearing degradation is used for tracking the fault growth. Behzad et al. [15] studied the vibration of defective bearing with a physics-based model. They explained that when the defect starts in the contacting metal surfaces of the REB components, the roughness of surfaces increases in some regions. Therefore, the micro- geometric characteristics of the surface change gradually. Then, the impacts between two contacting surfaces increase. Employing the impulse train model, they showed the degradation of the surfaces causes the increase of vibration level in the high-frequency bandwidth. Based on this idea, tracking the vibration in the high-frequency bandwidth is expected to indicate the defect earlier rather than vibration level in the lower frequencies. Therefore, the RMS of the spectrum in the high-frequency bandwidth [a,b] is considered as a feature and it is shown with HFRMS(a,b). The frequency interval [a,b] is selected based on spectrum study of REB. The natural frequency regions are also considered for selecting this interval. This feature is previously employed for online VCM data in [14]. In this article, the similar approach is developed for offline VCM data. 3. AI MODEL Artificial neural network (ANN) is one of the commonly used methods in data-driven approaches. In this article, a FFNN is employed for modelling the prognostic problem. A very popular FFNN structure with one hidden layer and one output layer has been used by many researchers for modelling engineering nonlinear problems successfully. This structure which has been depicted in figure 1, is used in this study too. Since the degradation of REBs is a dynamic problem, the features in two steps (the current step and a step in previous measurement point) are considered for inputs of this structure. In the online monitoring system, usually the distance between measurement points is constant and relatively short. Therefore, there are plenty of measurement values in online condition monitoring system. However, in the offline system, the gap between measurement values are not necessarily same and the measurement values number is much less than the online data. The data-driven algorithm that is used in this study, has been developed for offline CM data. To this aim, at the first step an appropriate function is fitted to the trend of selected feature in the 277 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management training process. The generalized Weibull hazard rate function is selected for fitting at this step. This function which has been represented in equation (1), includes three parameters that are tuned in fitting step. Figure 1. FFNN structure for modelling the prognostic of REBs C fV A Bt  (1) In the equation (1) t is the time, A is the fitted value at t = 0, B is the scale parameter, C is the shape parameter, and Vf is the fitted function. Since the number of measurements during the offline condition monitoring of a machine may be limit, it is proposed to use some auxiliary points. To this aim, a set of auxiliary equidistant points in [0,tfinal] is considered where tfinal is the whole life of an REB. The values of the features in these points are calculated by fitted function. The number of 2 to 5 times of original measurement points number is proper for auxiliary points number. Then, in the training process, the auxiliary points are used for training and the original points are used for validation. This is due to preventing the overfitting in training stage. Next, the trained FFNN is used for prediction. The target in training stage and the output in prediction stage are life ratio and estimation of life ratio respectively. The life ratio (Ti) is defined as the life of the REB in the current time (ti) per the final life (tfinal) of the REB: i i final t T t  (2) At the prediction stage, first the features are calculated from the measurement data. Then, applying the calculated features to the trained NN, the life ratio estimation is calculated. Finally, the estimation of RUL at point i is calculated as: i finali i i i t RUL t t t T     (3) where, iRUL is the estimation of RUL at point i, finalt is estimation of final life, and iT is the estimation of life ratio at point i. The discussed algorithm is shown in a flowchart in Figure 2. In the next section, more details of using this algorithm with actual CM data will be discovered. 278 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 2. Flowchart of the proposed prognostic algorithm for offline CM data 4. RESULTS An important challenge of using data-driven approaches is providing enough data for training stage. Usually in these methods, it is required to assume that the operating condition of machine in training data and prediction data have been the same. Therefore, in actual industrial cases, it may be difficult to provide enough data for training in many applications. However, implementing the accelerated run-to-failure tests of REBs in the laboratory is an alternative for checking the performance of algorithms. In this article, the proposed method is used for prediction of both experimental accelerated life test data and actual industrial data. The published data of PRONOSTIA experiment [16] is used as laboratory data. In addition, data of two industrial case-studies provided by Behravesh Vibration Engineering Company have been analyzed with the proposed algorithm. 4.1. PRONOSTIA Experiment data PRONOSTIA experiments data which published in PHM 2012 conference [16] has been used by many researchers for prognostic study of REBs. Figure 3 shows the details of this experimental test rig. Figure 3. overview of the Prognostia test rig (16) 279 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management This set of data includes 17 full run-to-failure online vibration measurement data which are in three different operating conditions. The data of the first operating condition (shaft speed 1800 rpm and radial load 4000 N) is employed for the purpose of this article. Totally, there are seven series of data in this operation condition. Based on the initial study of these data, the kurtosis of the vibration records of the second bearing is fluctuating in a very large range. Even in the healthy condition of this REB, the kurtosis touches very high thresholds (over 100) [14]. Therefore, it is assumed that there existed a problem in the measurement or installation of the second bearing. Due to this reason, the second test is excluded from the analysis and the other six tests in the first operating condition are considered for the analysis in this part. The inner diameter and the outer diameter of the test bearing in these experiments are 20 mm and 32 mm respectively. Additionally, the dynamic load capacity has been reported 4000 N. Vibration signals are recorded every 10 seconds with the 25.6 kHz sampling frequency during whole life of the bearing and the length of each signal is 0.1 second. Since, the proposed method in this article is going to be tested with offline data, one measurement in every 25 minutes (1500 sec) is selected for analysis. The selected signals are not equidistant in time and a random selection algorithm is used for choosing the offline points. More details about the using data is represented in table 1. Comparing the average of the mean error in the numerical tests, shows the better performance of the selected features in numerical test 4. Beside the frequency range [8,10] kHz which has produced the most accurate results, the results of numerical tests 1, 3 and 5 are very close to each other. Here, sample prediction result plots for bearing 1_7 in all numerical tests are depicted in Figure 4. In each prediction, the vibration condition monitoring data of five other bearings are used for training. These results verify a better prediction accuracy of the features employed in numerical test 4. Applied features: time, RMS Applied features: time, RMS, K Applied features: time, HFRMS(1,10) Applied features: time, HFRMS(8,10) Applied features: time, RMS(0,1) Figure 4. Prediction results on Bearing 1_7 280 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 3. Numerical test results for laboratory data error B 1_1 B1_3 B1_4 B1_5 B1_6 B1_7 Average error Test 1 e 0.115 0.137 0.286 0.045 0.076 0.114 0.129 E 0.203 0.391 0.375 0.130 0.213 0.184 0.249 Test 2 e 0.072 0.092 0.245 0.096 0.179 0.117 0.133 E 0.198 0.189 0.455 0.345 0.594 0.257 0.340 Test 3 e 0.170 0.075 0.312 0.064 0.058 0.099 0.130 E 0.609 0.292 0.408 0.183 0.111 0.167 0.295 Test 4 e 0.093 0.043 0.218 0.068 0.044 0.041 0.085 E 0.307 0.094 0.374 0.104 0.082 0.080 0.174 Test 5 e 0.096 0.111 0.350 0.064 0.075 0.076 0.128 E 0.176 0.315 0.467 0.167 0.161 0.228 0.252 4.2. Industrial data The results of proposed algorithm in this article on the bearing accelerated life test VCM data analyzed in previous part. On the other hand, the two industrial cases are going to be studied in this part for the same purpose. The first case-study is for a bearing on the drive-end of a fan in a car manufacturing company. The bearing has been monitored regularly over its full life. From August 2012 to August 2013, sixteen vibration measurements have been recorded for this bearing. The second case-study is for a bearing of a sanding machine in a wood-based products manufacturer company. The vibration measurement of this bearing has been recorded from Feb 2015 to Feb 2017 during the whole life of the bearing. Totally, twenty-three measurements have been recorded during this period of time and the bearing has gone to failure at the end of this period. The frequency bandwidth [10,1000] Hz is selected for vibration measurement of these machines based on recommendations of guidelines and standards for vibration condition monitoring of rotating machines. Besides that, the vibration in a wide frequency bandwidth [10,8000] Hz is also recorded for all measurements. The offline records in both sets of data are not equidistant in time. Four numerical tests with different feature groups is used for this part of study. Table 4 presents the features corresponding to each numerical test. Table 4. numerical tests along with applied features for industrial case study Numerical Test No. Applied features Test 1 ti-1 , ti , RMSi-1 , RMSi Test 2 ti-1 , ti , RMSi-1 , RMSi , Ki-1 , Ki Test 3 ti-1 , ti , HFRMS(1,8)i-1 , RMS(1,8)i Test 4 ti-1 , ti , HFRMS(0,1)i-1 , RMS(0,1)i Each numerical test has been repeated three times. In each repetition, the best result between five consecutive attempts is considered as the final result. The corresponding prediction plots of all the numerical tests for the both industrial bearings are depicted in Figure 5 and Figure 6. Finally, the average values of three final results of each numerical test has been calculated and reported in Table 5. 281 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Comparing the results in Table 5, shows the better performance of the high-frequency bandwidth features. This comparison verifies the importance of the high-frequency vibration investigation in bearing degradation analysis. Although the frequency bandwidth [10,1000] Hz commonly is used for condition monitoring and diagnostics of the machines, the results depicted in this study represents how the analysis of high-frequency bandwidth of the spectrum concludes more accurate estimation in REBs prognostics. Table 5. Numerical test results comparison error Bearing 1 (Car Manufacturing Co.) Bearing 2 (Wood-based products Manufacturing Co.) Test 1 e 0.037 0.122 E 0.143 0.449 Test 2 e 0.277 0.415 E 1.010 1.940 Test 3 e 0.045 0.070 E 0.170 0.419 Test 4 e 0.209 0.161 E 0.753 0.610 Applied features: time, RMS Applied features: time, RMS, K Applied features: time, HFRMS(1,8) Applied features: time, RMS(0,1) Figure 5. Prediction results on the industrial bearing 1 regarding to the Fan in car manufacturing company. 282 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Applied features: time, RMS Applied features: time, RMS, K Applied features: time, HFRMS(1,8) Applied features: time, RMS(0,1) Figure 6. Prediction results on the industrial bearing 1 regarding to the sanding machine in wood-based product manufacturing Co. 5. CONCLUSION In this article, an algorithm based on FFNN approach has been proposed for prognostics of REBs by using offline VCM data. Since the number of offline measurements during the whole life of the equipment are not enough, this algorithm produces some auxiliary points for training. Features in two different time steps are used as inputs of the NN model due to dynamic behavior of degradation. RMS of vibration spectrum in various frequency bandwidths was considered as a feature. In addition, time and kurtosis were two other features that have been combined with RMS in different numerical tests. The proposed algorithm has been applied to a set of bearing accelerated life run-to-failure tests data as well as two industrial cases data. According to the results of this study, the RMS of high-frequency bandwidth in vibration spectrum is a better feature for tracking the degradation. Therefore, using these features for RUL estimation has produced more accurate predictions. ACKNOWLEDGEMENT The financial support of the ministry of science, research and technology is gratefully acknowledged for this study. In addition, the authors greatly appreciate the help of Behravesh Vibration Engineering Co. for providing the industrial machine vibration data. 283 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management REFERENCES [1] An, D., Kim, N. H., & Choi, J. H. (2015). Practical options for selecting data-driven or physics-based prognostics algorithms with reviews. Reliability Engineering & System Safety, 133, 223-236. [2] Peng, Y., Dong, M., & Zuo, M. J. (2010). Current status of machine prognostics in condition-based maintenance: a review. The International Journal of Advanced Manufacturing Technology, 50(1-4), 297-313. [3] Huang, H. Z., Wang, H. K., Li, Y. F., Zhang, L., & Liu, Z. (2015). Support vector machine based estimation of remaining useful life: current research status and future trends. Journal of Mechanical Science and Technology, 29(1), 151-163. [4] Qiu, H., Lee, J., Lin, J., & Yu, G. (2003). Robust performance degradation assessment methods for enhanced rolling element bearing prognostics. Advanced Engineering Informatics, 17(3-4), 127-140. [5] Gebraeel, N., Lawley, M., Liu, R., & Parmeshwaran, V. (2004). Residual life predictions from vibration-based degradation signals: a neural network approach. IEEE Transactions on industrial electronics, 51(3), 694-700. [6] Mahamad, A. K., Saon, S., & Hiyama, T. (2010). Predicting remaining useful life of rotating machinery based artificial neural network. Computers & Mathematics with Applications, 60(4), 1078-1087. [7] Tian, Z. (2012). An artificial neural network method for remaining useful life prediction of equipment subject to condition monitoring. Journal of Intelligent Manufacturing, 23(2), 227-237. [8] Ali, J. B., Chebel-Morello, B., Saidi, L., Malinowski, S., & Fnaiech, F. (2015). Accurate bearing remaining useful life prediction based on Weibull distribution and artificial neural network. Mechanical Systems and Signal Processing, 56, 150-172. [9] Van, M., & Kang, H. J. (2016). Two-stage feature selection for bearing fault diagnosis based on dual-tree complex wavelet transform and empirical mode decomposition. Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science, 230(2), 291-302. [10] Chouri, B., Monteiro, F., Tabaa, Mohamed, Dandache, A., Komarasamy, G., Wahi, A., ... & Huerta, L. (2013). Residual useful life estimation based on stable distribution feature extraction and SVM classifier. Journal of Theoretical and Applied Information Technology, 55(3), 299-306. [11] Benkedjouh, T., Medjaher, K., Zerhouni, N., & Rechak, S. (2013). Remaining useful life estimation based on nonlinear feature reduction and support vector regression. Engineering Applications of Artificial Intelligence, 26(7), 1751-1760. [12] Liu, Z., Zuo, M. J., & Qin, Y. (2016). Remaining useful life prediction of rolling element bearings based on health state assessment. Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science, 230(2), 314- 330. [13] Zhao, M., Tang, B., & Tan, Q. (2016). Bearing remaining useful life estimation based on time–frequency representation and supervised dimensionality reduction. Measurement, 86, 41-55. [14] Behzad, M., Arghand, H. A., & Rohani Bastami, A. (2017). Remaining useful life prediction of ball-bearings based on high-frequency vibration features. Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science, 0954406217734885. [15] Behzad, M., Bastami, A. R., & Mba, D. (2011). A new model for estimating vibrations generated in the defective rolling element bearings. Journal of Vibration and Acoustics, 133(4), 041011. [16] Nectoux, P., Gouriveau, R., Medjaher, K., Ramasso, E., Chebel-Morello, B., Zerhouni, N., & Varnier, C. (2012, June). PRONOSTIA: An experimental platform for bearings accelerated degradation tests. In IEEE International Conference on Prognostics and Health Management, PHM'12. (pp. 1-8). IEEE Catalog Number: CPF12PHM-CDR. 284 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Investigation of Infrared Thermography as a Dual Online Diagnostic Tool for Dynamic Structural Health Monitoring D.A. Desai1, S.M Talai1, P.S. Heyns2 1 Department of Mechanical Engineering, Mechatronics & Industrial Design, Tshwane University of Technology, Pretoria 0001, South Africa 2 Department of Mechanical & Aeronautical Engineering, University of Pretoria, Hatfield 0028, South Africa ABSTRACT It is well known that infrared thermography (IRT) has fully matured as a tool for temperature condition monitoring. However in this paper, an experimental investigation of IRT as a dual online diagnostic tool in terms of temperature and vibration frequency characterization of dynamic structures is carried out, as such studies have not as yet been adequately investigated. Tests were conducted on both healthy and defect-induced cantilever beam- like structures coupled with a lacing wire subjected to forced excitations. Thermal images were acquired at the frictional interfaces, employing two infrared cameras. The analyzed frequencies obtained from the frictional temperature evolution time domain waveform using a Matlab FFT algorithm compared well with alternate accelerometer data acquired with the maximum relative error being 0.30 %. Moreover, the beam with the induced defect showed a temperature increase of 8.1 oC compared to the healthy one and exhibited multiple spectral peaks around the dominant frequency, hence revealing successful discrimination between them. This study in essence proves that IRT is capable of reliably describing structural vibration behavior in terms of frequency. These findings are particularly useful in overcoming many limitations inherent in some of the current contact vibration measuring techniques operating in turbulent, unclean environments. Keywords: Frictional heat generation, structural health indicators, condition monitoring, structural defect, non-destructive testing and evaluation (NDT&E) Corresponding author: Dawood Desai (desaida@tut.ac.za) 1. INTRODUCTION In today’s mechanical and aerospace engineering communities, the need for enhanced ability to monitor dynamic structures and detect potential damages at the earliest possible stage for effective structural health monitoring (SHM) is ever increasing [1]. The online temperature measurement performed in a non-contact way using Infrared thermography (IRT) facilitates early diagnosis of probable faults and proper preventive measures adopted to avoid major shut downs. Furthermore, the non-destructive testing and evaluation (NDT&E) techniques based on IRT are the most advanced techniques available today and has attracted much research attention [2]. Titman [3] explained that they represent the most promising method of structural defect detection using the principle that all bodies emit infrared radiation with the absolute temperature greater than 0 K. Bagavathiappan et al. [4] reported that since the abnormal temperature pattern is an indication of the unhealthy state; it’s widely used in temperature condition monitoring. Vibrothermography which is a NDT&E technique is used to find surface and near surface defects such as cracks and delaminations through observations of vibration induced heat generation [5]. It involves exposing the specimen to a sonic or ultrasonic vibration, then observing the induced frictional heating with an infrared camera. Montanini and Freni [6] investigated the correlation between the vibrational mode shapes and viscoelastic heat generation in vibrothermography. The authors found that there exists a minimum threshold of vibration amplitude that allows reliable flaw detections for a specific geometry. The results showed that exciting the specimen at lower modes is sufficient to generate heat flux for flaw identification. Gao et al. [7] described a statistical method for conducting a good vibrothermography inspection test for crack detection in aircraft engine fan blades. They showed that use of the Bayesian method enables the development of confidence intervals for the probability of detecting cracks. Furthermore, Dimarogonas and Syrimbeis [8] successfully predicted the vibration signature from 285 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management heat generation due material damping by considering a vibrating rectangular plate. Most recently, Talai et al. [9] developed a technique for structural vibration characteristics measurement using an IRT approach by considering healthy cantilever beam-like structures coupled with a lacing wire. For decades, however, vibration monitoring has been utilized to assess SHM for predicting potential failures that, in turn, enhances reliability and availability [10-14] where the time domain waveform signals are transformed into frequency domain using Fourier’s transformation. Nevertheless, the available measurement techniques such as strain gauges have a shorter sensor life span due to fatigue, turbulent and unclean environments coupled with centrifugal forces at high temperatures and erosion as reported by Al-Bedoor [15] in power plant applications. Also, there are times, when one can’t have easy access to a machine because of its location, or desires the additional confidence of having another diagnostic tool available. For this reason, a non-contact diagnostic tool with ability of monitoring dual indicators of structural health is of great benefit. Therefore, this paper presents a practical application of IRT as a dual online diagnostic tool for dynamic SHM in terms of temperature monitoring and spectral vibration frequency. To achieve this, both healthy and defect induced cantilever beam-like structures coupled with a lacing wire subjected to mechanical excitation in the laboratory environment are employed with infrared cameras focused on the frictional interfaces at the lacing wire zone for the thermal imaging. Results using this technique were successfully validated against miniature accelerometer acquired measurements. Hence, this study has great significance for the effective SHM of dynamic structures, thus, reducing down-time and maintenance costs, leading to increased plant efficiency. 2. MATERIALS AND EXPERIMENTAL METHOD 2.1. Materials Both the beams and lacing wires were manufactured of AISI 304 stainless steel due their low thermal conductivity, hence, generate heat with slight frictional effect [16]. The geometric and material properties are given in table 1 and table 2, respectively. Table 1. Beam and lacing wire geometric dimensions. Table 2. AISI 304 steel material properties [18]. Material properties Parameters Density, ρ 7740 kg/m3 Young modulus, E 200 GPa Poisson ratio, ν 0.33 Static friction coefficient, µs (µk = 0.75 µs) [19] 0.15 Thermal properties Thermal conductivity, k 16.5 W/mK Specific heat capacity, c 500 J/kgK Description Dimension Beam mass 0.10 kg Length 300 mm Width 25 mm Thickness 2 mm Lacing wire diameter 5 mm Beam hole diameter [17] [tolerance grade: F8/js7] mm Beam-lacing wire hole location from fixed end 250 mm Location of the exciters 290 mm 286 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 2.2. Experimental procedure The experimental setup used in the work of Talai et al. [9] was employed in this study as depicted in figure 1. A pre-load force of 15 N applied at the lacing wire throughout the experiments. Three excitation patterns of the beams were considered. Firstly; both beams at the same forcing frequency of 20 Hz , secondly; Beam 1 at 20 Hz and Beam 2 at 25 Hz , finally; both beams at 25 Hz but with Beam 1 induced with structural defect. These excitation frequencies were considered based on Shannon’s sampling rate theorem of 0.4 times the rated resolution of the IR cameras used. The thermal images were recorded continuously for 150 s. The analyzed frequencies from interface temperature time domain waveform were subsequently validated against the corresponding frequencies acquired concurrently using miniature Deltatron accelerometers attached to the respective beams. 3. RESULTS AND DISCUSSIONS In case of both beams excited at the same frequency value of 20 Hz , it was expected that the thermal imaging will attain the same maximum frictional temperature. However, this was not the case practically. The thermal imaging exhibited temperature difference of 0.9 oC (figure 2) that was attributed partly to the emanating beam’s dynamics leading to the non-uniform interface boundary motion and the different thermal sensitivities of the IR cameras used in this study. Accordingly, the frequencies evaluated from frictional temperature evolution for Beam 1 and 2 were 19.9885 Hz (figure 3a) and 19.9973 Hz (Figure 3b), respectively. These were in good agreement to the acquired accelerometer frequencies (figure 3a and 3b) with relative errors being 0.26 % and 0.21 % of the former and latter, respectively. Montanini and Freni [6] explained that the amount of frictional heat generated depends on several parameters, which includes coefficient of friction, normal applied force and the sliding relative speed. However, this study kept other factors constant while the forcing frequencies varied. As expected, the structure under higher excitation frequency allowed the frictional interface to slide against each other more often (as opposed to lower frequencies) leading to greater frictional heat generation resulting in a temperature difference of 2.2 oC depicted in figure 4 in the case of Beam 1 and Beam 2 excited at 20 Hz and 25 Hz, respectively. Moreover, Mabrouki et al. [16] observed similar findings while investigating frictional heating for efficient use of vibrothermography. Hence, the conclusion was that frictional heat increases with the interface frequency. On the other hand, the analyzed frequencies from thermal imaging were 19.9832 Hz (figure 5a) and 25.0025 Hz (figure 5b) for Beam 1 and Beam 2, respectively. Likewise, good agreement was exhibited when compared to the accelerometer measured frequencies, the relative errors being 0.28 % (figure 5a) for the former and 0.19 % (figure 5b) for the latter. 287 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 1. Laboratory experimental setup. (a) (b) Figure 2. Thermal images for both beams excited at 20 Hz : 150 s (a) Beam 1 and (b) Beam 2. It is known that the presence of structural defects reduce stiffness, hence, larger displacement variations occur resulting in increased frictional heat generation due to interfacial metallurgical boundary local deformations. This was evident by the abnormal temperature difference of 8.1 oC among the beams investigated with a forcing frequency of 25 Hz with Beam 1 induced with a structural defect (figure 6) compared to the healthy beam. The analyzed frequencies from frictional temperature generation for Beam 1 around the forcing frequency were 24.9748 Hz, 24.2328 Hz and 25.7605 Hz (figure 7a), while the frequencies for Beam 2 had a clear spectral peak at 25.0013 Hz (figure 7b). Based on the dominant frequencies, the exhibited relative errors were 0.30 % and 0.19 % of Beams 1 and 2, respectively; compared to the accelerometer measured values. It was interesting to observe the 288 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management unhealthy beam being clearly distinguished by its abnormal temperature generation and vibration frequency spectral peaks, both of which were monitored online using IRT. (a) (b) Figure 3. FFT of temperature evolution and displacement: both beams at 20 Hz (a) Beam 1 and (b) Beam 2. (a) (b) Figure 4. Thermal images for excitation: 150 s (a) Beam 1 at 20 Hz and (b) Beam 2 at 25 Hz. 289 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management (a) (b) Figure 5. FFT of temperature evolution and displacement (a) Beam 1 at 20 Hz and (b) Beam 2 at 25 Hz. (a) (b) Figure 6. Thermal images for both beams excitated at 25 Hz : 150 s (a) defect induced and (b) healthy. 290 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management (a) (b) Figure 7. FFT of temperature evolution and displacement: both beams at 25 Hz (a) Beam 1 induced defect & (b) healthy Beam 2. 4. CONCLUSION All the frequencies predicted through the IRT approach agreed well with those acquired by miniature accelerometer sensors, with the maximum relative error being 0.30 % thus, validating experimental measurements using IRT. Also, the unhealthy beam was clearly discriminated based on the abnormal temperature generation and multiple spectral frequency peaks around the forcing frequency. Therefore, it can be concluded that IRT is reliable as a dual online diagnostic tool for SHM as evident by this research work. 5. ACKNOWLEDGEMENT The authors greatly appreciate the support of Tshwane University of Technology, University of Pretoria and Eskom Power Plant Institute (South Africa) in funding this research. 291 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management REFERENCES [1] Le, T. T. H., Point, N., Argoul, P., Cumunel, G. (2016) Structural changes assessment in axial stressed beams through frequencies variation, International Journal of Mechanical Sciences 110: 41-52. [2] Meola, C., Carlomagno, G. M. (2004) Recent advances in the use of infrared thermography, Measurement Science and Technology 15:(9) 27-58. [3] Titman, D. J. (2001) Applications of thermography in non-destructive testing of structures, NDT&E International 34: 149-154. [4] Bagavathiappan, S., Lahiri, B. B., Saravanan, T., Philip, J., Jayakumar, T. (2013) Infrared thermography for condition monitoring – A review, Infrared Physics & Technology 60: 35-55. [5] Renshaw, J., Chen, J. C., Holland, S. D., Bruce, T. R. (2011) The sources of heat generation in vibrothermography, NDT&E International 44:(8) 736-739. [6] Montanini, R., Freni, F. (2013) Correlation between vibrational mode shapes and viscoelastic heat generation in vibrothermography, NDT&E International 58: 43-48. [7] Gao, C., Meeker, W. Q., Mayton, D. (2014) Detecting cracks in aircraft engine fan blades using vibrothermography nondestructive evaluation, Reliability Engineering and System Safety 131: 229-235. [8] Dimarogonas, A. D., Syrimbeis, N. B. (1992) Thermal signatures of vibrating rectangular plates, Journal of Sound and Vibration 157:(3) 467-476. [9] Talai, S. M., Desai, D. A., Heyns, P. S. (2016) Vibration characteristics measurement of beam-like structures using infrared thermography, Infrared Physics & Technology 79: 17-24. [10] Mukhopadhyay , N. K., Chowdhury, S. G., Das, G., Chattoraj, I., Das, S. K., Bhattacharya, D. K. (1998) An investigation of the failure of low pressure steam turbine blades, Engineering failure analysis 5:(3) 181-193. [11] Wei, F., Pizhong, Q. (2010) Vibration-based Damage Identification Methods: A Review and Comparative Study, Structural Health Monitoring 10:(1) 83-111. [12] Rao, S. J. (1991) Turbo-machine blade vibration (first edition), New Delhi: New age international publishers 7-20. [13] Jaiswal, B. L., Bhave, S. K. (1994) Experimental evaluation of damping in a bladed disk model, Journal of sound and vibration 177:(1) 111-120. [14] Torshizi, M. S. E., Nikravesh, Y. S. M., Jahangiri, A. (2009) Failure analysis of gas turbine generator cooling fan blades, Engineering Failure Analysis 16:(5) 1686-1695. [15] Al-Bedoor, B. O. (2002) Discussion of the available methods for blade vibration measurement, In: ASME 2002 Pressure Vessels and Piping Conference. Vancouver, BC: Canada. 1561: 53-61. [16] Mabrouki, F., Thomas, M., Genest, M., Fahr, A. (2009) Frictional heating model for efficient use of vibrothermography, NDT&E International 42:(5) 345-352. [17] Sanders, W. P. (1996) Turbine steam path engineering for operations and maintenance staff (first edition), Richmond Hill, Ontario: Canada 236-239. [18] Madhusudana, C. V. (1999) Thermal conductance of cylindrical joints, International Journal of Heat and Mass Transfer 42: 1273-1287. [19] Oden, J. T., Martins, J. A. (1985) Models and computational methods for dynamic friction phenomena, Computer methods in applied mechanics and engineering 52: 527-634. 292 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Experimentally validated numerical simulation of prediction of structural vibration frequencies from interfacial frictional temperature signature S.M. Talai1, D.A. Desai1, P.S. Heyns2 1 Department of Mechanical Engineering, Tshwane University of Technology, South Africa 2 Department of Mechanical & Aeronautical Engineering, University of Pretoria, South Africa ABSTRACT Friction studies have shown that the contact interface of dynamic structural components generally results in a temperature increase due to periodic motion of the boundary. Furthermore, the well-known heat conduction equation yields the temperature distribution. Subsequently, the aim of this study was to investigate the prediction of vibration frequency from the interfacial frictional temperature signatures of structures under excitation. In this work, an experimentally validated finite element (FE) model was developed using the commercial software ABAQUS. Tests were done on AISI 304 steel of healthy cantilever beam-like structures coupled with a friction interface and subjected to forced excitations. A coupled temperature-displacement transient methodology was utilized to predict the frictional heat generation and subsequently the vibrational frequencies extracted from the temperature wave form, which in turn were validated by comparing them to the experimental results employing infrared thermography. The results confirmed the feasibility of using the interface frictional temperature signature to describe the vibration frequency with good accuracy and may be used to overcome some limitations encountered with conventional instrumentation such as strain gauge failures due to fatigue. Keywords: Frictional heat generation, structural health indicator, condition monitoring, vibration frequency, infrared thermography. Corresponding author: Stephen M. Talai (stevetalai@gmail.com ) 1. INTRODUCTION Temperature monitoring of dynamic components aids early diagnosis of the potential damages and proper preventive measures adopted to avoid major breakdowns [1-3]. Titman [4] explained that it represent the most promising method of structural defect detection using the principle that all bodies emit infrared radiation with the absolute temperature greater than 0 . For decades, however, vibration monitoring has been utilized to assess SHM for predicting potential failures that, in turn, enhance the reliability and availability [5-9]. Several techniques are applied for the online structural vibration frequency monitoring. Strain gages bonded to the structure, experience the same motion (strain) as the structure and hence its change in resistance gives the strain applied to the structure. Even though this method has the advantage of performing measurement of individual structure, it has several disadvantages such as shorter sensor life span due to fatigue. Laser Doppler Vibrometry (LDV) with an Eulerian approach allows overcoming most of the limitations mentioned in the use of strain gauges [10]. Nonetheless, Castellini et al. [11] reported that its main limitations are the speckle effects and poor signal-to-noise ratio when measuring vibration on the low diffusive surface. Also, interferometry has the ability to provide traceability of vibration displacement measurement. However, its accuracy is limited by the environment, thus it is often not practical to perform vibration measurements in turbulent environments like in the case of power plants [12]. There are times, however, when one cannot have access to a machine because of its location, or desires the additional confidence of having another diagnostic tool available. For this reason, a diagnostic tool with the ability of monitoring dual indicators of structural health is of benefit. This paper presents an experimentally validated numerical simulation of predicting vibration frequency from the temperature wave form. To achieve this, both healthy and defect induced cantilever beam-like structures were coupled with a friction interface subjected to mechanical excitation in a laboratory environment, while infrared cameras focused on the frictional interfaces for the thermal imaging. This study is significant for the effective SHM of dynamic structures, thus, reducing the down-time and maintenance cost, leading to increased plant efficiency. 293 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 2. FE MODELLING AND SIMULATION FE modelling and simulations were carried out using the commercial software ABAQUS/CAE version 6.13-1. This software is well suited for performing non-linear simulations [13]. The models were developed as follows: 2.1. FE geometric model The geometric model composed of 3D deformable solid parts consisting of two straight rectangle cantilever beam coupled with lacing wire. Similarly, Petreski [14] used a group of cantilever beams to numerically study the natural frequencies and mode shapes due to changes of lacing wire such as diameter, elasticity and location on the LP steam turbine blades. The geometric dimensions and material properties are as used in the work of Talai, Desai and Heyns [15] on vibration characteristics measurement of beam-like structures using infrared thermography. 2.1.1. Assembly, loading, boundary condition and meshing The model assembly is depicted in figure 1. Figure 1. FEM model (a) assembly, boundary conditions and loading (b) Beam 1 interface mesh with reference node (c) Beam 2 interface mesh with reference node. The sinusoidal loads were defined on the beams’ surface in order to simulate the mechanical excitations while a lateral force to the lacing wire to mimic the centrifugal force. The forced excitations of beams were achieved through the formulation of a sinusoidal load, using a varying force function expressed as    tSinFtF n  , ,………………………………………………………. (1) where Fn is the excitation force which varies sinusoidally with time, t at a frequency of ω cycle/second. The assumptions made when defining the loads and boundary conditions were [16]: i. Beams and lacing wire initially isothermal at 22 C. ii. The heat losses due to convection effect are of convection coefficient of at ambient temperature. iii. The radiation heat losses negligible compared to convection. iv. The frictional heat transfer obeys the heat partition model of the ratio of thermal conductivity. The Bauschinger effect which is the changing material properties under cyclic loading is generally involved in frictional contacts [17, 18]. For instance, loading of a specimen beyond its yield limit in a given direction lowers 294 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management the yield strength in the opposite direction. This leads to non-linear structural material properties over the entire period and for the case of numerical simulation, it usually results in higher computational cost [19, 20]. Accordingly, higher loading of the FEM at the expense of greater computational efforts leads to more frictional heat generation. However, since this study entailed an investigation, it considered small loads to shorten the computational time. Therefore, the transverse sinusoidal force of amplitude of 0.10 and lateral force of 0.001 defined to idealise the beams harmonic loading and lacing wire lateral loading, respectively. The simulations were carried as follows: first; both beams at 20 , secondly; both beams at 40 , thirdly; beam 1 at 20 while beam 2 at 40 and finally; both beams at 25 , but with beam 1 induced with a defect. These frequencies were considered based on Shannon’s sampling rate theorem on rated optical resolution of the IR cameras used in this study. An encastre boundary condition defined for FE beams end, while asymmetric boundary condition at the centre of the lacing wire i.e. (allows the free lateral movement). Also, a reference node (beam 1: Ref. node1 and beam 2: Ref. node 2) created at each beam-lacing wire interface as a set in the modelling facilitated a precise data acquisition for the frictional temperature evolution and displacement spectrum, hence, analysis. The model was meshed using element type C3D8RT. However, the beam interface hole and the lacing wire contact region meshed with highly dense elements since it was the main focus of the study, for the obvious reason of increasing the accuracy of the results. The mesh seed size around the beam interface was 0.157 , while the rest of the model 2.5 . On the other hand, lacing wire composed of 0.302 seed size. The total elements generated for the healthy FEM used in the simulation was 100 320. 2.1.2. Friction interaction properties The interaction properties define the contact properties between the master and slave surfaces. ABAQUS considers the master surface as the surface that undergoes larger motion and vice versa for the slave. In this regard, the beam was assigned as master and the lacing wire surface as the slave. The following interaction properties were defined: i. The friction formulation of static-kinetic exponential decay as tangential behaviour. ii. Heat distribution according to the partition ratio given in equation. iii. Thermal conductance using clearance dependent data based on material conductivity (table 1). Table 1. Interface clearance dependent data. Conductance Clearance 2.1.3. Analysis procedure The static analysis type simulated the lateral force acting on the lacing wire. The contact frictional heat, however, is strongly dependent on the relative motion [16, 20]. The ABAQUS/CAE solver uses coupled temperature- displacement analysis of simulations involving thermo-mechanical analysis [13]. It takes into account the initial ambient temperature and the relative interaction properties for calculating the transient temperature distribution. Wenya et al. [20], carried out a numerical modelling and simulation of friction welding processes based on ABAQUS using coupled temperature displacement transient. The temperature results were in good agreement with experimentally measured using thermocouples. Correspondingly, this study utilised similar analysis type in simulating the interface frictional thermal generation. 2.2. Experimental Setup The laboratory experimental setup is presented in figure 2. The infrared cameras and all sensors were as in the work of Talai, Desai and Heyns [15]. During the forced excitation, first both beams were excited at 20 , secondly, one beam was excited at and the second one at , to facilitate the validation of the FEA results. During the entire excitation period, the thermal images were recorded continuously for 150 . 295 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 2. Laboratory experimental setup. 3. RESULTS AND DISCUSSION A PC with the 64-bit operating system was used for the simulation. It consisted of four processors each with 3.20 and RAM of 32 . The total simulation wall clock time was 272 437 . Since the forcing frequencies were periodic, whether the beams were excited in phase or out of phase, the analysed frequencies will be of the same value regardless of the phase. Based on this knowledge, the study considered out of phase excitations. Primarily, it is known that frictional heating concentrates within the real area of contact between two bodies in relative motion [21]. In addition, according to the Coulomb law of friction, frictional heat generation occurs only if there exists a relative motion between the interacting structures [22]. This means that the frictional temperature signature is consistent with the displacement pattern. Thus, the structure under higher excitation frequency allows the frictional interface to slide against each other more times; unlike at lower frequencies, hence, leading to greater frictional heat evolution as seen with temperature increase from 0.11 (figure 3a) to 0.20 (figure 4a ) for 20 Hz and 40 , respectively. Moreover, Mabrouki et al. [22] observed similar findings on investigation of the frictional heating model for efficient use of vibrothermography. Hence, the conclusion was that frictional heat increases with the interface frequency. The analyzed frequencies using a MATLAB FFT algorithm are presented in figure 3b and figure 4b. The observed increasing temperature peak and vice versa for displacement with increasing excitation frequency was attributed to the increasing temperature as prior explained and shortening periodic time considering the same amplitude of displacement, respectively. Successfully, the frequencies analyzed from the frictional temperature evolution were in good agreement to that acquired from the corresponding displacement as summarized in table 2. Table 2. Frequencies for both beams excited at same frequency values. Forcing frequency for both beams (Hz) Results Frequency predicted Temp. signature Disp. curve 20 Figure 3a 19.99 19.99 40 Figure 4b 39.99 39.99 296 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management In the case of beams excitation at different forcing frequencies of beam 1 at 20 and beam 2 at 40 ; the results revealed a frictional temperature increase of 0.24 and 0.32 in the former and latter, respectively (figure 5a). Interestingly, the analyzed frictional temperature evolution from incredibly predicted the vibration frequency (figure 5b) just as displacement curves as shown in table 3. Also, the values agreed well with those of corresponding excitations prior obtained for same excitation frequencies (table 2). This implies that the irrespective of the structural excitations, the analysis of the interface frictional heat evolution is capable enough of predicting its vibration frequency as evidenced by these findings. Table 3. FEA frequencies with different excitation frequencies. FEM simulation results (figure 3a and figure 4a) showed that exciting beams at the same value of frequencies and time period, yields equal maximum frictional temperature. However, this was not the case experimentally. The thermal imaging exhibited a temperature difference of 0.9 for both beams excited at 20 Hz (figure 6) at the ambient temperature being 22.4 . The frequencies evaluated from the thermal imaging (figure 6) post analysis of beam 1 and 2 were 19.9885 (figure 7a) and 19.9973 (figure 7b), respectively. These were in good agreement with the corresponding FEA predicted frequencies (table 2) with relative errors being 0.0075 and 0.0365 for the former and latter, respectively, thus, validating the FEM simulation results. Furthermore, beam 1 possessed several smaller frequencies that were associated with the beam multi-dynamics due to the periodic loading. It is known that the existence of a crack reduces the stiffness of a structural member due to local flexibility [23]. Yang et al. [24] analytically examined the forced vibration of a cracked cantilever beam. The authors found that dynamic deflection increases due to the crack, thus increasing frictional heat evolution as evidenced by the similar contour interface temperature for healthy FE model (figure 8a) and vice versa for a model with an induced defect (figure 8b). This finding implied that a beam induced with a defect can be positively identified from the observation of contact interface thermal variation. Importantly, it is worth pointing out that this method is currently being used for dynamic structural condition monitoring as reported by Bagavathiappan et al. [1]. Even although the frequencies analysed from the frictional temperature evolution and displacement were in agreement for both models (25.00 ) as seen in figure 9, significant variations in peaks existed. The frequency peak evaluated from frictional temperature signature increased in the case of beam induced with a defect and vice versa for the healthy one. This may be attributed to the increased frictional temperature due to increased displacement in respect to an induced structural defect, while there was less frictional heat in the case of the healthy beam due to less resulting displacement. Beam 1 Beam 2 (a) (b) Figure 3. Both beams excited at 20 (a) interface temperature contour after 0.1 and (b) FFT of temperature evolution and displacement. Forcing frequency Results Frequency predicted Temp. signature Disp. curve Beam 1: 20 Figure 5b 19.99 19.99 Beam 2: 40 39.97 39.97 297 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Beam 1 Beam 2 (a) (b) Figure 4. Both beams excited at 40 (a) interface temperature contours after 0.05 and (b) FFT of temperature evolution and displacement. Beam 1 Beam 2 (a) (b) Figure 5. Beam 1 excited at 20 and beam 2 at 40 (a) interface temperature contours after 0.1 and (b) FFT of temperature evolution and displacement. (a) (b) Figure 6. Thermal images for both beams excited at 20 . 298 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management (a) (b) Figure 7. FFT of temperature evolution for both beams excited at 20 (a) beam 1 and (b) beam 2. Beam 1 Beam 2 Beam 1 Beam 2 (a) (b) Figure 8. Interface temperature contour for both beams excited at 25 Hz (a) Both beams are healthy (b) Beam 1 induced with defect while Beam 2 is healthy. Figure 9. FFT of temperature evolution and displacement for both healthy and defected FE beam models. 4. CONCLUSION In conclusion, the analysed frictional temperature time-domain wave form obtained from the simulation of 3D FEM formulated confirmed the ability of predicting the structural vibration frequency of mechanical components. Furthermore, the approach developed forms the basis for online monitoring of vibration frequency, hence, effective SHM employing infrared thermography as a dual diagnostic tool. 299 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 5. ACKNOWLEDGEMENT The authors greatly appreciate the support of Eskom Power Plant Engineering Institute (Republic of South Africa), University of Pretoria and Tshwane University of Technology for funding this research. REFERENCES [1] Bagavathiappan, S., Lahiri, B. B., Saravanan, T., Philip, J., Jayakumar, T. (2013) Infrared thermography for condition monitoring – A review, Infrared Physics & Technology 60: 35-55. [2] Lahiri, B. B., Bagavathiappan, S., Soumya, C., Mahendran, V., Pillai, V. P. M., Philip, J., Jayakumar, T. (2014) Infrared thermography based defect detection in ferromagnetic specimens using a low frequency alternating magnetic field, Infrared Physics & Technology 64: 125-133. [3] Meola, C., Carlomagno, G. M. (2004) Recent advances in the use of infrared thermography, Measurement Science and Technology 15:(9) 27-58. [4] Titman, D. J. (2001) Applications of thermography in non-destructive testing of structures, NDT&E International 34: 149-154. [5] Mukhopadhyay , N. K., Chowdhury, S. G., Das, G., Chattoraj, I., Das, S. K., Bhattacharya, D. K. (1998) An investigation of the failure of low pressure steam turbine blades, Engineering failure analysis 5:(3) 181-193. [6] Wei, F., Pizhong, Q. (2010) Vibration-based Damage Identification Methods: A Review and Comparative Study, Structural Health Monitoring 10:(1) 83-111. [7] Rao, S. J. (1991) Turbo-machine blade vibration (first edition), New Delhi: New age international publishers 7-20. [8] Jaiswal, B. L., Bhave, S. K. (1994) Experimental evaluation of damping in a bladed disk model, Journal of sound and vibration 177:(1) 111-120. [9] Torshizi, M. S. E., Nikravesh, Y. S. M., Jahangiri, A. (2009) Failure analysis of gas turbine generator cooling fan blades, Engineering Failure Analysis 16:(5) 1686-1695. [10] Oberholster, A. J., Heyns, P. S. (2011) Eulerian laser Doppler vibrometry: Online blade damage identification on a multi-blade test rotor, Mechanical Systems and Signal Processing 25:(1) 344-359. [11] Castellini, P., Martarelli, M., Tomasini, E. P. (2006) Laser Doppler Vibrometry: Development of advanced solutions answering to technology’s needs, Mechanical systems and signal processing 20: 1265-1285. [12] Brock, N., Hayes, J., Kimbrough, B., Millerd, J., North-Morris, M., Novak, M., Wyan, C. J. (2005) Dynamic Interferometry, In: Proceedings of SPIE 5875. [13] ABAQUS/CAE (2013) Analysis User’s Manuel, Dassault Systèmes. [14] Petreski, Z. (2009) Natural frequencies of a blade group with a lacing wire, Mechanical scientific engineeering 28:(1) 1-5. [15] Talai, S. M., Desai, D. A., Heyns, P. S. (2016) Vibration characteristics measurement of beam-like structures using infrared thermography, Infrared Physics & Technology 79: 17-24. [16] Miller, S. F., Shih, A. J. (2007) Thermo-Mechanical Finite Element Modeling of the Friction Drilling Process, Journal of manufacturing science and engineering 129: 531-538. [17] Papuga, J., Růžička, M., Meggiolaro, M. A., Castro, J. T. P. d., Wu, H. (2015) Computationally-efficient Non-linear kinematic Models to Predict Multiaxial Stress-strain Behavior under Variable Amplitude Loading, In: 3rd International Conference on Material and Component Performance under Variable Amplitude Loading, VAL 2015: Procedia Engineering 101: 285-292. [18] Claeys, M., Sinou, J. J., Lambelin, J. P., Todeschini, R. (2016) Modal interactions due to friction in the nonlinear vibration response of the “Harmony” test structure: Experiments and simulations, Journal of Sound and Vibration 376: 131-148. [19] Arrazola, P. J., Özel, T. r. (2010) Investigations on the effects of friction modeling in finite element simulation of machining, International Journal of Mechanical Sciences 52:(1) 31-42. [20] Wenya, L., Shanxiang, S., Feifan, W., Zhihan, Z., Tiejun, M., Jinglong, L. (2012) Numerical simulation of friction welding processes based on ABAQUS environment., Journal of engineering science and technology review 5:(3) 10-19. [21] Mabrouki, F., Thomas, M., Genest, M., Fahr, A. (2009) Frictional heating model for efficient use of vibrothermography, NDT&E International 42:(5) 345-352. [22] Simar, A., Bréchet, Y., de Meester, B., Denquin, A., Gallais, A., Pardoen, T. (2012) Integrated modeling of friction stir welding of 6xxx series Al alloys Process, microstructure and properties, Progress in Materials Science 57: 95-187. [23] Dimarogonas, A. D. (1996) Vibration of cracked structures: A state of the art review, Engineering Fracture Mechanics 55:(5) 831- 857. [24] Yang, J., Chen, Y., Xiang, Y., Jia, X. L. (2008) Free and forced vibration of cracked inhomogeneous beams under an axial force and a moving load, Journal of Sound and Vibration 312:(1-2) 166-181. 300 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Corrosion monitoring with Acoustic Emission of steel embedded in concrete that is subjected to different environmental conditions T. Sigoba1, M. Howse2, Q.I. Sikakana3 1, 3 Department of Non-Destructive Testing and Physics, Faculty of Applied and Computer Sciences, Vaal University of Technology, South Africa 2 SSA Acoustic & Specialised Inspections, Vereeniging, South Africa ABSTRACT The use of steel reinforcement in concrete structures is an established building code requirement for structural integrity. The Acoustic Emission (AE) technique has been used in industry to evaluate corrosion in reinforced concrete since the late 1970’s and early 1980’s [1,2]. The AE technique is well suited for limited access applications as the waves travel along entire structures. This study utilized five samples of mild steel imbedded in concrete subjected to different environmental conditions and the AE activity and intensities were recorded and compared. The environmental conditions were to simulate actual locations where concrete structure are built. The five samples consisted of uncoated mild steel bars embedded in concrete, which were then submerged in, drinking water; salt water; salt water (insulated steel); hydrochloric acid; salt water and copper forming a galvanic cell. The samples were kept at ambient temperature and monitored over a seven week period utilizing AE sensors with a resonant frequency of 30 kHz. The hydrochloric acid sample exhibited the highest cumulative AE energy, followed by the galvanic cell, salt water, drinking water and lastly salt water (insulated steel). The cumulative number of AE activity followed the same trend except for the saltwater and galvanic cell samples obtaining the fewer hits than the drinking water. These findings were confirmed, with the exception of the galvanic cell sample, by visual examination of the surface corrosion present on the samples post concrete removal. Keywords: accelerated corrosion of steel; acoustic emission; unloaded reinforced concrete; environmental conditions; structural health monitoring 1. INTRODUCTION 1.1 Corrosion of Steel Reinforcement in Concrete Corrosion effects of steel reinforcement in concrete structures must be considered in the design phase. It is a fact that corrosion of steel in concrete is accelerated when the structures are located in environments with high salt levels or where the temperatures are constantly high with humidity. Active corrosion in steel in concrete is for the most part, not externally visible, and it may take years after initiation of cracks that it surfaces. Concrete acts, as an electrolyte and the reinforcement will develop a potential. Corrosion is an electrochemical process in which electric current flows between the cathodic and anodic (steel) sites through the concrete. In new concrete, steel is protected from corrosion due to the formation of a protective, passive film on the surface of the embedded steel in the highly alkaline surroundings of hydrated cement (>pH 12.5). Eventually a pH reduction caused by carbonation or by ingress of chloride causes the passive film to degrade, allowing the reinforcement to corrode in the presence of oxygen and moisture. To minimise the 301 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management risk of corrosion in steel reinforced concrete; the concrete must be of appropriate quality for the intended application and that the depth of cover to the steel reinforcement must meet the ACI 318 code [3]. Proper compaction and curing of the fresh concrete is essential, so as to have dense concrete with low permeability. Porous concrete on the other hand, has high permeability increasing the risk of corrosion. The cement mixing ratios are therefore crucial, particularly the water-to-cement ratio. It is well documented that corrosion of steel reinforcement in concrete on existing structures can be assessed by different methods such as mentioned in the review by Ha-Wong Song et al. [4]. 1.2 Acoustic Emission Monitoring of Reinforced Steel Bars in Concrete Acoustic Emission (AE) is the elastic energy released from materials which are undergoing deformation. The rapid release of elastic energy, the AE event, propagates through the structure to arrive at where a transducer is mounted. By analysis of the resultant waveform in terms of feature data such as amplitude, energy and time of arrival, the severity and location of the AE source can be assessed. One study of the applicability of AE technique for detecting corrosion of reinforced-steel bar in concrete was performed by Zongjin et al. [5]. Their results demonstrated that there is a clear relationship between the AE events and reinforced-steel bar corrosion in concrete. A comparative study of the rate of corrosion of five samples of mild steel bars embedded in concrete were done. The analysis of the characteristics of the AE waveforms emanating from the steel-concrete interface where the chemical reactions for corrosion takes place are presented. The purpose of this research project is thus to advance the knowledge of AE as an evaluation tool for static structures subjected to different environmental conditions in the absence of dynamic stresses (unloaded). This study also contributes to determining steel reinforced concrete structural durability and its deterioration [6-8]. Figure 1, is a standard three stage graph depicting the deterioration process due to salt attack [9]. Figure 1: Deterioration / Durability process of steel reinforced concrete due to corrosion Application of AE in concrete has its limitation, for the following reasons [10]: a) Each application has a unique set of parameters, resulting in different results influenced by sample size, type of transducer, path of signal, etc. b) Strength of the signal is dependent on the energy content of the source. c) Susceptibility to noise, that is, external noise sources may skew results. Nonetheless, AE from unloaded steel reinforced concrete has its own ‘signature’ as compared to those from other materials reinforced concrete, like pre-stress and post-tension reinforced concrete, stainless steel in concrete, epoxy-coated steel in concrete, galvanizes steel reinforcement , fibre, etc. One such case are the amplitudes of the events from steel reinforced concrete tend to be larger than those without steel. Thus, 302 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management this study of inspection for corrosion in steel reinforced concrete subjected to different environmental conditions using AE aims to optimise the procedures. The source location of the acoustic signal was not the objective of this study, since only one sensor per sample was used. 1.3 The operational basics of an AE system The typical operational setup for an AE system consists of one to several AE sensors, computer-based data acquisition devices and a couplant. The role of the AE sensor is acting as a receiver to convert detected dynamic displacements or sound waves into electric signals. There are two categories of sensors usually applied in the AE testing: resonant and broadband sensors. Resonance type sensors, which are usually applied in the material with high attenuation like concrete, gain higher sensitivity but lower frequency range than the broadband sensors. The choice of couplant is crucial to the test sensitivity. The couplant layer should be even and it is essential to get rid of bubbles and make couplant layer as thin as possible to guarantee good acoustic transmission. 1.4 The AE signal feature descriptions In the AE testing, count, amplitude, duration, rise time, and energy are the five most frequently used parameters. Other parameters like threshold, RA value, frequency, to name a few are also widely employed in crack classification [11,12]. These parameters are shown in figure 2. Figure 2. Acoustic Emission parameters for a hit Although in this experiment the samples were not subjected to loading, it is noteworthy that the RA value; defined to be the ratio of rise time and amplitude, is now a standard value used to identify the classification of fractures [13,14]. For example, a decreasing RA value suggests the tendency of tensile fracture [15]. The features utilised as defined by the AE acquisition system’s user manual [16] were:  Hit: the detection and measurement of an AE Signal on a Channel  ASL (Average Signal Level): measure of the average amplitude signal received on a channel  Absolute Energy: this is derived from the integral of the squared voltage signal divided by the reference resistance (10 kΩ) over the duration of the AE waveform packet. Measured in Attojoules. Its time driven dataset is calculated as follows; 𝐴𝑏𝑠𝑜𝑙𝑢𝑡𝑒 𝐸𝑛𝑒𝑟𝑔𝑦 [𝑎𝐽] = ∑ ቀ(ௌ௘௡௦௢௥ ௏௢௟௧௔௚௘) మ×ଵ଴షల௦௘௖ ଵ଴ ௞ఆ × ଵ ௃௢௨௟௘ ௐ௔௧௧ௌ௘௖ ቁ (1) : AE Counts Rise Time Duration Energy Maximum Amplitude Threshold level 303 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 2. THE PENCIL LEAD BREAK TEST VERIFICATION The purpose of verification is twofold. First, it ensures that the transducers are in good acoustic contact with the part being monitored. Generally, the lead breaks should register amplitudes of at least 90 dB for a reference voltage of 1 mV and a total system gain of 90 dB. Second it checks the accuracy of the source location setup. This last purpose involves indirectly determining the actual value of the acoustic wave speed for the object being monitored. In this experiment, source location determination could not be used due to only one sensor per sample utilised. Thus, the pencil lead breaks were conducted by breaking 0.5 mm 2H lead approximately 25 mm on the steel bars from each sensor; exclusively to verify coupling sensitivity. Pencil / lead breaks conducted confirmed an amplitude response of 90 dB ± 2 dB. 3. SAMPLE PREPARATION Ratios of concrete mix used (in kilograms) were, Cement: aggregate: sand = 1 : 3 : 3. A proportionate amount of water was added to this concrete mix which is medium to high strength concrete for civil structures. When cured after 28-days it has a compressive strength of about 3000 psi. (21 MPa). Tins were used for setting the concrete mix blocks. The 200 mm long mild steel bars were placed vertically in the middle of each concrete block. All five set-ups samples were exposed to the sun for three days, to allow the concrete to partially dry. Holes were drilled in each set-up sample to reach the mild steel bar so as to accelerate the corrosion rate and to simulate water ingress into the concrete structure. The partially dry concrete blocks were removed from the tins, and allowed to completely dry. The concrete blocks were then each placed in plastic containers and numbered according to the different conditions they were to be subjected to as described in table 1. 22 days was allowed for further curing of the concrete, as this produces noise because of material friction during shrinkage. Data acquisition was thereafter initiated. TABLE 1: Sample Description Sample Number Sample Type Composition Description 1 Salt water mixture (control) 35g salt in 1000 ml of water Insulated steel embedded in concrete and submerged in salt water; concentration in 500 ml 2 Tap (drinking) water 500 ml of tap water Non insulated steel embedded in concrete and submerged in water 3 Salt water mixture (brine) 35g salt in 1000 ml of water Non insulated steel embedded in concrete and submerged in salt water 4 Hydrochloric Acid (HCl) 500 ml of commercial hydrochloric acid Container is placed in a well-ventilated area. Non insulated steel embedded in concrete and submerged in acid (hydrochloric acid) 5 Galvanic cell 500 ml of salt saturated tap water with a copper wire inserted vertically Non insulated steel embedded in concrete and submerged in salt water And copper wire (galvanic cell) 304 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 3 shows a cross-sectional view of a steel bar embedded in the concrete block with a transducer attached. Transducer / Sensor Steel bar Drilled hole Concrete block Figure 3: Schematic diagram of a steel bar in concrete with transducer attached Table 2 summarises the chemical element composition intensities of the mild steel bars. Three runs for intensity counts per second (cps) were done and the average relative standard deviation (RSD) calculated. The RSD showed big variances for some elements, implying that dilution for the analysed sample was not consistent. Nonetheless, the elements identified were carbon, zinc, iron, cobalt, nickel, copper, aluminium, chromium, vanadium, manganese and lead. These chemical elements are consistent with the family of mild structural steels. TABLE 2: Chemical element composition of mild steel bars 305 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 4. EXPERIMENTAL SET_UP Figure 4 is a picture of the five samples already connected to the AE system. Figure 4: Set-up of the Five Test Samples The system set-up – including the five samples – has the acquisition system (Mistras Group micro express 8 system) connected in-line with the 20 dB pre-amplifiers, which are connected to the Mistras R3I (30kHz resonant frequency) sensors. Couplant was applied to each of the sensors and attached to the samples. The 20 dB pre amplifier were then switched on and data acquired on all five samples for between 45 minutes to 24 hours and then normalised to 1 hour for comparison and trend analysis. Table 3 shows the Software Acquisition Settings for the experiment. Sample 4’s threshold was increased to 44 dB due to data rate saturation during acquisition. TABLE 3: AEwin for Express 8 v5.70 Software Acquisition Setup Sample Number Threshold [dB] Analogue Frequency Filter [kHz] Sampling Rate [k Samples per second] Peak Definition Time [µs] Hit Definition Time [µs] Hit Lockout Time [µs] 1,2,3,5 40 20 - 100 10000 2500 5000 5000 4 44 20 - 100 10000 2500 5000 5000 306 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 5. RESULTS AND DISCUSSIONS 5.1 Acoustic Emission Data The corrosion process occurs at a steady rate and hence, extraneous noise - such as mechanical bangs from nearby activities – was identifiable and the associated time removed before analysis of the data. The normalised data was trended in figures 5a and 5b. It was observed that, for all five samples, the absolute energy remained relatively constant during the course of the experiment (Figure 5a), while the number of hits decreased with time (Figure 5b). Figure 5a: Normalised Absolute Energy vs Time The decreasing rate of hit with time was attributed to the build-up of by-products on the mild steel, and thereby reducing the effective surface area available for the reactions, hence impeding the reaction rate. Figure 5b: Normalised Hits vs Time 100 1000 10000 100000 1000000 10000000 100000000 1 2 3 4 5 6 7 8 Ab so lu te E ne rg y [a J] Time - Data Acquisition Data Set Normalised Absolute Energy vs Time Series1 Series2 Series3 Series4 Series5 10 100 1000 10000 100000 1 2 3 4 5 6 7 8 Hi ts Time - Data Acquisition Data Set Normalised Hits vs Time Series1 Series2 Series3 Series4 Series5 307 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The cumulative hit and absolute energy per sample set-up are captured in the graph of figure 6. Figure 6: Cumulative Hit & Energy Data per Sample The graph extracted numerical cumulative hits and absolute energies for each sample set-up are tabulated in descending order in tables 4a & 4b, respectively. TABLE 4a: Sample Cumulative Hits TABLE 4b: Sample Cumulative Energy Sample Hits Sample Absolute Energy 4 (HCL) 2.00E+06 4 (HCL) 1.49E+09 2 (Water) 8.50E+03 5 (Galvanic Cell) 3.69E+08 3 (Brine) 3.01E+03 3 (Brine) 1.68E+08 5 (Galvanic Cell) 2.51E+03 2 (Water) 7.68E+07 1 (Control) 2.06E+03 1 (Control) 3.16E+07 The cumulative energy (Table 4b) behaved as anticipated for the five samples and correlated with visual observations conducted at the end of the experiment. The hydrochloric acid sample had the greatest energy release, followed by the galvanic cell sample, then the brine sample, then the drinking water sample and finally the insulated control sample. The cumulative number of hits, however, revealed an alternative order of decreasing activity as follows: hydrochloric acid, drinking water, brine, galvanic cell, insulated control. The drinking water sample and galvanic cell sample exchanged positions. To explain this behaviour, it is postulated that the brine sample and galvanic cell sample produced a residue build-up which slowed down the hit rate but not the overall energy released. 3.16E+07 7.68E+07 1.68E+08 1.49E+09 3.69E+08 2.06E+03 8.50E+03 3.01E+03 2.00E+06 2.51E+03 1 2 3 4 5 1 10 100 1000 10000 100000 1000000 10000000 100000000 1E+09 SAMPLE NUMBER Cumulative Data per Sample Absolute Energy [aJ] Total Hits 308 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 5.2 Visual Observations The samples steel bars were removed from the concrete 6 weeks post-termination of the data acquisition phase. Each sample steel bar was cleaned via a nail brush to reveal any surface damage. The visual observations are described in table 5 below, with the photographs of the corresponding steel bars shown in figure 7. TABLE 5: Visual Findings Summary Sample Condition of Material Embedded in Concrete Condition of Material at Concrete to Air Interface & Above 1 (Control) Minor corrosion in areas where paint failure occurred Minor corrosion in areas where paint failure occurred 2 (Water) Localised pits near drilled hole locations Light general pitting at concrete to air interface 3 (Brine) Deep pit (+-40% loss) at drilled hole location (bottom right of photograph) Highly localised damage in areas Minor corrosion at concrete interface Localised damage in a number of areas 4 (HCL) Light general scaling present Concentrated pitting and scaling present from interface to top of sample 5 (Galvanic Cell) Minor localised pits near drilled hole locations Light general pitting at concrete to air interface below residue layer Figure 7: Varied corrosion of sample steel bars post-termination of data acquisition Comparison between the AE data and visual observation results of the condition of the sample steel bars is summarised and ordered as follows: Area of general corrosion (greatest to least) : Sample 4, Sample 2, Sample 3, Sample 5, Sample 1 Corrosion pits (deepest to shallow) : Sample 3, Sample 4, Sample 2, Sample 5, Sample 1 Overall Condition (worst to best) : Sample 4, Sample 3, Sample 2, Sample 5, Sample 1 309 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 6. CONCLUSION The AE system was utilised to detect the rate of corrosion / material loss on five samples of mild steel bars embedded in concrete. The hydrochloric acid sample (Sample 4) had the most extensive corrosion damage, followed by the brine sample (Sample 3), then the drinking water sample (Sample 2), then the galvanic cell (Sample 5) with the insulated steel (Sample 1) only having minor damage where paint failure occurred. There was a correlation between the visually observed areas of corroded surface with the total number of AE hits from the sample. This correlates with the fact that the larger the surface area, the higher the chemical reaction rate which is then observed as an increase in the number of AE hits / activity. Thus, the rate of corrosion activity is reflected in the amount of AE activity (number of hits) emanating from a sample. The cumulative absolute AE energy released from the samples coincides with observations on the number of deep pits on the samples. The only exception being sample 5 (galvanic cell) which did not exhibit particularly deep pits but possessed the second highest energy released. This could be due to the energy release during the formation of salt crystals on the outer surface of the sample during the experiment. The correlation between energy detected and severity of pitting on the samples may be explained by the greater energy released during the breaking of bonds deeper into the lattice structure. The decrease in AE hit rate during the experiment coupled with the absolute AE energy remaining relatively constant points towards each source event (spalling of corrosion) releasing more energy as time passes. This phenomenon is often observed in the industry where “older” corrosion with deeper pits produce higher amplitude hits (and higher energy) than “fresh” corrosion which usually only exhibit minor surface pitting. Noting that the areas of corrosion in this study were within 200 mm of the sensors; future experimentation to determine the ‘effective’ AE monitoring distance for optimal corrosion detection of reinforcement steel in concrete is envisaged. Acknowledgement The authors hereby acknowledge SSA Acoustic & Specialised Inspections for the use of their equipment and premises for the duration of this project, particularly, Gavin McCabe (MD) and the assistance given by Bradley Fagan (AE level 1). REFERENCES [1] McCabe, R., Koerner, R.M. and Lord, A.E., "Acoustic Emission Behavior of Concrete Laboratory Specimens." AC1 J. (July 1976). [2] Li, L. and Poorooshasb, H.B., "Characteristics of Acoustic Emission from Reinforced Concrete,” Progress in Acoustic Emission III, The Jap. Soc. of NDI, 522-528 (1986). [3] ACI Committee 318, Building Code Requirements for Structural Concrete, AC 318-02, American Concrete Institute, Farming Hills, Michigan, 2002. [4] Song H.W., and Saraswathy V., “Corrosion Monitoring of Reinforced Concrete Structures - A Review,” Int. J. Electrochem. Sci. 2, 1-28 (2007). [5] Zongjin Li., Faming Li., Zdunek A., Landis E. and Shah S., “Application of acoustic emission technique to detection of rebar corrosion in concrete,” ACI Mater J. 95, 68–76 (1998). [6] Goszczyńska, F.B., “Analysis of the process of crack initiation and concrete with acoustic emission testing,” Archives of Civil and Mechanical Engineering 14, 134-143 (2014). [7] Goszczyńska, B., Świt, G., Trąmpczyński, W., Krampikowska, A., Tworzewska, J. and Tworzewski, P., “Experimental verification of the acoustic emission (AE) method for the cracking process determination and location in concrete elements,” in: IABSE Rotterdam Conference Report 99, 150–151 (2013). 310 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [8] Goszczyńska, B., Świt, G., Trąmpczyński, W., Bacharz, K., Godowska, M. and Krampikowska, A., “Identification of acoustic emission signals in unloaded concrete,” in: Proceedings 58th Annual Conference on Scientific Problems of Civil Engineering, Krynica – Rzeszów, 202–203 (2012). [9] JSCE (2001) Standard Specification for concrete and concrete structures on maintenance, JSCE, Tokyo. [10] Maji, A.K., and Sahu, R., “Acoustic Emissions from Reinforced Concrete,” Experimental Mechanics 34, 379 – 388 (December 1994). [11] Kawasaki, Y., Wakuda, T., Kobarai, T. and Ohtsu, M., “Corrosion mechanisms in reinforced concrete by acoustic emission," Construction and Building Materials 48, 1240–1247 (2013). [12] Pollock, A.A., "Acoustic Emission Inspection." Metals Handbook 17, 278- 294 (1989). [13] Berkovits, A. and Fang, D., "Study of fatigue crack characteristics by acoustic emission." Engineering Fracture Mechanics 51(3), 401-409 (1995). [14] Grosse, C., Reinhardt, H. and Dahm, T., "Localization and classification of fracture types in concrete with quantitative acoustic emission measurement techniques," NDT & E International 30(4), 223-230 (1997). [15] Soulioti, D., Barkoula, N.M., Paipetis, A., Matikas, T.E., Shiotani, T. and Aggelis, D.G., "Acoustic emission behavior of steel fibre reinforced concrete under bending," Construction and Building Materials 23(12), 3532-3536 (2009). [16] Express-8 AE System User manual Rev 0, Mistras Group Incorporated (2014). 311 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Variations in vibration responses of an ice-going vessel during wave slamming C.M van Zijl1, A. Bekker1 1Sound and Vibration Research Group, Department of Mechanical and Mechatronic Engineering, Stellenbosch University, South Africa ABSTRACT A polar supply and research vessel was instrumented with an accelerometer network to determine full-scale dynamic responses of the operational ship structure. The vessel in question is pre-disposed to problematic stern slamming. In a recent voyage to the Marginal Ice Zone, the vessel encountered extremely rough weather, including storm in a field of brash ice. During this voyage wave slamming occurred and ice debris was flung onto deck spaces. This work investigates the variation in vibration and modal responses, which result from harsh operational conditions. The aim is to examine the feasibility of modal tracking towards potential long-term structural health monitoring and inverse force estimation. Three case studies are identified from operational data. The vibration responses are subjected to conventional time history analyses and newly developed slamming response analyses. LMS PolyMax automatic parameter selection is used to identify the changes in modal frequencies under stormy conditions. These values are reflected against results from a finite element analysis and earlier measurements in a harbor environment. In the investigated cases, the fundamental modal frequencies varied by 0.1 Hz in different operational conditions. It is found that rotational speed of the propulsion shaft corresponds to the modal frequency of the first bending mode. As such, this mode is not ideal for modal tracking because of harmonic contamination. The excitation of higher order bending modes could be an indicator of wave slamming, as these modes are not significantly excited in moderate seas. Keywords: Operational modal analysis, ice-going vessel, modal tracking 1. INTRODUCTION Antarctic research institutes and their scientists have relied on polar supply and research vessels to supply bases and serve as floating laboratories. The need to predict tipping points in climatology is adding momentum to polar and oceanographic research in attempts to increase our understanding of the sparsely explored oceans (Moedas, Pandor and Kassab, 2017). South Africa’s polar supply and research vessel, the SA Agulhas II, is a state-of-the-art ship, which has been designed to meet the harsh operational requirements of the stormy Southern Ocean and icy conditions of Antarctica. The full- scale ship responses from this first-in-class vessel are highly relevant to influence the design of future research vessels. As a result a consortium was established to conduct full scale measurements on-board the SAA II to create a legacy of data to enhance the scientific basis for ice-going ships (Bekker et al., 2018). The particular focus of the present work includes an investigation into the potential use of accelerometer network to track variable vessel responses as a result of varying environmental conditions or long term (20 years) modal tracing to assess decaying structural integrity. Modal tracking is enabled through operational modal analysis (OMA), which is an output only modal analysis technique. In OMA, structural responses to ambient excitation (by ice and water) during the normal operating conditions are used to achieve dynamic deformation of the structure (Brincker and Ventura, 2015). This deformation is influenced by engineering design parameters (such as mass, stiffness and damping properties) and boundary conditions that constrain and load the structure (ice, water, vessel draft, temperature). 312 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management A further application of modal tracking includes the potential of inverse force estimation, which could indicate novel methods of estimating ship ice resistance or wave slamming forces (Soal, 2018). Outside the ambit of engineering, vibration signatures could potentially be indicative of specific environmental conditions. In the context of research vessels such phenomena could enable the automated detection and sampling of ice thickness or wave height (Sharkh et al., 2014), which would add localized environmental observations towards the increased fidelity and interpretation of satellite images. 2. METHODS 2.1. Vessel and voyage The SAA II can carry up to 100 passengers in 46 cabins and is operated by a crew of 45 members. She is equipped with two shaft-lines with four-bladed variable pitch propellers, which are powered electric motors. The diesel generators comprise 6-cylinders, which are run at a constant speed of 750 rpm. The general specifications of the SAA II are reported in Table 1. Table 1. Specifications of the SA Agulhas II. Gross tonnage 12 897 tons Diesel engine type 6L32 Length 143 m Electric motor type N3 HXC 1120 LL8 Breadth 22 m Speed at MCR* 140 rpm Classification DNV Power at MCR* 4500 kW Class notation 1A1 PC-5/ICE-10 Nominal torque 307 k(Sharkh et al., 2014)Nm Yard / Year STX Finland /2012 Propeller maker Rolls Royce Main engine maker Wärtsilä No. of blades / Diameter 4 / 4.3 m * MCR – Mean Continuous Rating Figure 1 A map showing the voyage route of the SAA II on her Winter Cruise in July 2017. The selected case studies are indicated: Case 1 (Blue), Case 2 (Red) and Case 3 (Green). Figure 2. The violent wave action during a storm in brash ice, caused ice debris to be flung onto the deck. 313 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The SAA II is prone to stern slamming, which has led to passenger complaints and equipment damage on board (Omer and Bekker, 2018). Slamming is the exposure of a vessel structure to wave impacts, which give rise to large forces for a short duration of time (Kapsenberg, 2011) caused by water entry. The lightly damped nature of many vessel structures result in a global vibratory response, known as whipping, which follows a slamming event (Dessi, 2014). Concurrent with the findings of Mouton (1991) and Kapsenberg et al. (2011), the slamming-prone dynamics of the SAA II are attributed to the flat, extended transom design which originated from the desire to create additional deck space at the stern. Owing to the sparse availability of full-scale wave slamming data, the long-term effects of excessive slamming on the structure of the SAA II are yet unknown.Full-scale measurements were performed on a designated scientific Winter Cruise of SAA II in July 2017 as shown in the voyage track in Figure 1. The focus of scientific operations was set on investigations of the ice and waves in the Marginal Ice Zone (MIZ). This voyage is of particular interest as wave slamming was encountered in open water as well as the MIZ. The SAA II departed from Cape Town harbour on 28 June 2017 with an ambitious scientific agenda, which suffered owing to the encounter of several storms and motion sickness of the newly sea-faring scientists. In order to regain lost time the vessel speed was increased to 17 knots by switching the engine into “ice mode” on 1 July. The drop keel was raised to reduce resistance. During the next two days, the operations on board mostly consisted in underway sampling and navigation in stormy waters. Weather forecasts indicated a substantial frontal system approaching the vessel in the area where sea ice was expected on 4 July. At 4:45am GMT on 4 July, the first brash sea ice was encountered. The vessel entered a large field of pancake ice with floes smaller than 3 m, and an average thickness of 40 to 60 cm. Long waves with estimated periods of more than 10 s were persistently high and were experienced by the ship which was proceeding at a sustained speed of 16 to 17 knots. The progress of scientific stations was hindered by several large waves and slams that interrupted the work. The ship continued to head NW heaving into the swell in wind conditions exceeding 20 m/s and finally reached stormy open water. Snow was falling heavily. Wind speed reached 30 m/s with stronger gusts. In search of sea ice, the vessel course was changed in a SE direction. After the change in heading, the pancake floes increased in size. The ice surface detected by satellite images indicated 100%, although the encountered ice was not consolidated, with estimated significant wave height of up to 7 m. The vessel voyaged North and ice conditions changed rapidly to a more mobile ice surfaces. The stormy conditions decreased in the morning of 5 July, which enabled work and inspection of the deck. Several large floes were washed on deck as shown in Figure 2. An ice impact had damaged a mast on the ship’s bow, which was taken down. Calmer weather enabled scientific work to resume. Another storm was encountered on 7 July. The vessel speed was reduced and scientific stations were again interrupted by rough weather conditions, which persisted until 8 July. The weather improved on 9 July enabling the voyage to proceed as planned. The vessel finally returned to Cape Town harbour on 13 July. Figure 3. A diagram showing the measurement locations of the accelerometer network on the SAA II. 314 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 2.2. Full-scale measurements Table 2. Sensor layout and location, types of sensors, cable routing, visual observations. Point reference Channel Axis Deck Side Description Type Sensitivity [mV/m.s-2] 1 9 +Z 4 Port Bow chain locker DC 20.00 2 10 +Y 4 Middle Bow chain locker DC 20.39 3 11 +Z 4 Starboard Bow chain locker DC 20.36 4 14 -Z 3 Port Cargo hold fore ICP 9.96 5 15 -Z 3 Starboard Cargo hold fore ICP 10.05 6 16 -Z 3 Port Cargo hold aft ICP 10.25 7 17 -Z 3 Starboard Cargo hold aft ICP 10.31 8 18 -Z 2 Port Fresh water room ICP 10.44 9 19 -Z 2 Starboard Engine store ICP 10.36 10 20 +Z 2 Port Stern thruster ICP 10.43 11 21 +Z 2 Starboard Stern thruster ICP 10.72 12 26 +X 2 Starboard Steering gear aft DC 20.19 13 27 +Y 2 Starboard Steering gear aft DC 20.22 14 28 +Z 2 Starboard Steering gear aft DC 20.13 15 29/25 +Z 2 Port Steering gear aft DC 20.25 16 5 +Y 4 Starboard Aft stairwell ICP 10.60 17 6 +Z 4 Starboard Aft stairwell ICP 10.18 18 7 +Y 7 Starboard Passenger accommodation ICP 10.01 19 8 +Z 7 Starboard Passenger accommodation ICP 10.56 20 12 +Y 8 Starboard Chef’s quarters ICP 10.03 21 13 +Z 8 Starboard Chef’s quarters ICP 10.20 22 3 +Y 9 Starboard Bridge DC 20.29 23 4 +Z 9 Starboard Bridge DC 20.46 24 22 +Z 9 Port Bridge ICP 10.10 Measurements from 24 accelerometers were utilized in the present analysis. The hull instrumentation comprised six pairs of accelerometers on starboard and portside locations for vertical acceleration measurements and two lateral sensors, one on the starboard side at the stern and one on the vessel centre line in the bow chain locker (see Figure 3). The remaining sensors monitored accelerations in the super-structure, closer to accommodation areas, recreational spaces and the bridge. The sensor network includes both DC and ICP accelerometers as summarized in Table 2. Data was acquired using an LMS SCADAS master-slave system and LMS.TurbineTesting acquisition software. Acceleration was sampled at 2048 Hz. 315 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 2.3. Case studies Three case studies were selected for analysis from the operational measurements as detailed in Table 3. Each case comprised 30 minutes of data. With vibration monitoring and modal tracking in mind, the cases were selected to highlight the differences in operational conditions that may be experienced from calmer to extreme conditions. Case 1 and Case 2 reflect instances where slamming was reported in the voyage notes in water close to 0 ˚C. Case 1 represents data from an open water storm, whereas Case 2 reports on the rare occurrence of a storm with slamming in brash ice. Case 3 represents ship responses as experienced in some of the calmest conditions experienced on the voyage in warmer water. Operational conditions were recorded from the ship logbook. The Beaufort scale is an empirical measure, which relates wind speed to observed conditions at sea on a scale of zero (Calm) to 12 (Hurricane force). Table 3 Operational conditions during the three selected case studies. Case 1 Case 2 Case 3 Date 7 July 2017 5 July 2017 12 July 2017 Time (UTC) 03:00 – 3:30 AM 01:00 – 1:30 AM 08:00 – 8:30 AM Conditions Storm (Open water) Storm (MIZ) Calm water Ship true heading [˚] 0˚ 322 ˚ 307 ˚ Swell height [m] & direction 3.5 WNW 6.0 WNW 2.0 SW Beaufort scale 8 7 5 Ice conditions [-] None (open water) Brash ice (floes < 3m) None (open water) Hull temperature [C] 0.6 ˚C -1.8 ˚C 18 ˚C Ship speed 27.8 – 29.0 11.4 – 15.0 13.9 – 25.6 3. RESULTS 3.1. Acceleration time histories (a) Lateral acceleration. (b) Vertical acceleration. Figure 4. Acceleration time histories as measured at the bow and stern for three case studies. Figure 4 presents the de-trended and high-pass filtered time-series of acceleration measurements in the lateral (Y) and vertical (Z) directions from sensors at the bow and stern during the three respective cases. Levels of vibration are significantly higher in the vertical direction than the lateral direction for the two storm cases while vibration 316 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management levels in the two directions are similar for the calm water case. The time-histories of the two storm cases include several impulsive events followed by decaying vibration amplitudes. Impulsive events followed by decaying vibrations are characteristic of wave slamming, followed by whipping of the global vessel structure. The frequency and magnitude of impulsive events more frequently appear in the MIZ storm case (Case 2). The extreme peak in the lateral acceleration measurements (Figure 4a) is believed to be attributed to impact of the hull with floating ice debris. The maximum peak vibration occurs during the open water storm (Case 1). The time history for the calm water case appears stationary with small impulsive events, which are not prominent compared to the general vibration floor. Some focus is afforded to the global distribution of peak metrics for the fourteen hull sensors to shed light on the possible effects of wave slamming (see Figure 5). Peak acceleration values are lower in the mid-ship owing to the stiffening presence of the superstructure as seen in Figure 5b. In Figure 5c there is a clear trend when considering the maximum crest factor, which shows that the most impulsive acceleration was excited during the open water storm. (b) Peak vertical acceleration (d) CF – Vertical acceleration Figure 5 A diagram showing the global distribution of vibration peak values and crest factors throughout the hull structure ( Case 1, Case 2, Case 3). (a) Peak lateral acceleration (c) CF - Lateral acceleration 317 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3.2. Slamming analysis An in-house slamming detection algorithm (Bekker, Zijl and Saunders, 2018) was used to extract impulsive peaks caused by unpredictable, non-linear wave slamming. The slamming location was decided based on the sensor on which the peak occurred first (Omer and Bekker, 2016). The number of detected slams as well as significant slams (defined to have CF > 6 (Bekker, Zijl and Saunders, 2018) ) and their impact sites are shown in Figure 6. For Case 1 the vessel is cruising at full speed into 3.5 m waves. Structural excitation is dominated by bow slams, with more slams recorded at the starboard bow shoulder, although waves are approaching from the port side. Slamming for Case 2 is most frequent in the stern when the vessel is cruising at medium speed, almost directly into 6 m swell. Small slams frequently occur all around the vessel in oblique moderate seas, with no significant slams recorded. (a) Case 1 (b) Case 2 (c) Case 3 Figure 6 A summary of wave slamming impacts sites as recorded for the different case studies. The ship outline (top view) is indicated as well as the incident wave angle (blue arrow). The detected slamming counts are indicated by on the starboard side and on the port side. Larger slams with a crest factor > 6 are indicated by on the starboard side and on the port side. 3.3. Power spectral densities (a) Lateral acceleration. (b) Vertical acceleration. Figure 7 Auto-power spectral densities of the acceleration signals in the bow (solid line) and stern (dashed line) for the case studies. Auto-power spectral densities (ASPDs) of the un-filtered DC acceleration measurements at the bow and stern are presented for the three cases in Figure 7. Low frequency peaks in the ASPDs of all three cases occur in the lateral direction and vertical direction at 0.094 Hz and 0.125 Hz, respectively. These are thought to represent the rigid body motion of the vessel, which is also the most significant for Case 2, with the greatest significant wave height. Sharp, narrow frequency peaks are discernible in the ASPDs of the calm water case at 2.02 Hz and 8.05 Hz. These peaks are likely harmonic excitation from the first and fourth blade pass frequency of the four-bladed propeller. The propeller mostly operates at a speed of 140 rpm (2.33 Hz) and the pitch is adjusted to achieve different levels of thrust. Higher levels of ambient vibration during the storm cases mask propulsion harmonics in the respective APSDs. Two possible modes occurring at 3.50 Hz and 5.20 Hz may be identified from the ASPDs of the lateral direction measurements. The peak at 3.50 Hz is present in measurements from both the stern and bow and may represent a lateral bending mode. The ASPDs of measurements in the vertical direction present four peaks of interest: 2.10 Hz, 318 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3.70 Hz, 5.20 Hz and 5.90 Hz. The peak at 5.90 Hz is present only in measurements recorded at the stern of the vessel, while the other three peaks of interest are visible in the APSDs of bow and stern measurements. 3.4. Operational modal analysis LMS Test.Lab 11A Operational PolyMax was used to perform an operational modal analysis on decimated data (128 Hz) from all 24 acceleration measurements on the SAA II. Cross-powers were calculated with 32,768 NFFT points and a 50% overlap exponential window was applied, resulting in a frequency resolution of 0.0390 Hz. The vertical direction accelerometer in the engine store room (Point 19) was selected as the reference sensor since this resulted in the clearest stabilization diagrams. The automatic pole selection add-in of Operational PolyMax was used to select poles from the stabilization diagrams. Selected modes were validated using the MAC criteria (Pastor, Binda and Harčarik, 2012) as shown in Figure 8. The identified modal frequencies for the case studies are shown in Table 4. (a) Case 1 – Storm (Open water). (b) Case 2 – Storm (MIZ). (c) Case 3 – Calm water. Figure 8 A comparison of the MAC matrices for the three case studies according to LMS PolyMAX automatic parameter selection. Table 4. A comparison of the modal frequencies identified through operational modal analysis compared to earlier investigations. Mode description STX Finland (2010) Harbour (K. Soal, Bienert and Bekker, 2015) Case 1 Case 2 Case 3 First vertical bending 2.09 1.94 2.10 2.15 2.14 First lateral bending 3.19 3.57 - 3.50 Second vertical bending 3.76 3.37 3.69 3.67 3.76 Second torsional 4.93 - - - Third vertical bending 5.37 4.72 5.20 5.01 - Combined torsion and bending 6.52 - - - Second lateral bending 6.93 - - - 4. DISCUSSION The SAA II encountered stormy conditions in open water as well as the MIZ, which has resulted in the recording of unique acceleration measurements of wave slamming on a ship structure in open water and in ice. Wave slamming in open water and in ice causes increased excitation of the vessel modes, but also raises the general vibration level. This raised vibration floor challenges the detection of wave slamming occurrences. The first bending mode dominates the flexural response of the vessel. The use of this mode for modal tracking is challenged as the modal frequency 319 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management coincides with the harmonic excitation from shaft rotation. In general the identified modal frequencies in the present investigation are higher, than those determined by Soal et al. (2015) in earlier harbor measurements. The determined frequencies do however correlate well with the finite element analysis by STX Finland (2010). Modal parameter variations could be attributed to multiple factors including vessel speed, water temperature, and ship draft, to name a few. At 5.2 Hz, wave slamming excites significant responses in lateral acceleration measurements at the stern. The automated modal identification by LMS Polymax does not succeed in the identification of the first lateral bending mode for Case 2 nor the third vertical bending mode for Case 3. 5. CONCLUSIONS & RECOMMENDATION The rotational speed of the propulsion shaft corresponds to the modal frequency of the first bending mode. This mode is not ideal for modal tracking as a result of harmonic contamination. The monitoring of lateral motion in the stern could be an indicator of wave slamming, especially at frequencies around 5.2 Hz. The meaningful interpretation of modal tracking on a polar supply and research vessel requires high quality data of environmental conditions (wave conditions, ice conditions, temperature) as well as the ship state (draft, ship speed, shaft speed). Automated modal tracking will require algorithms that ensure robust modal identification. The use of the MACXP criterion (Vacher, 2010) has been successfully used by Soal (Soal, 2018) to clear up spurious modes for modal tracking of a polar vessel. ACKNOWLEDGEMENTS Funding through the National Research Foundation (NRF) SANAP programme is thankfully recognized. REFERENCES [1] Bekker, A. et al. (2018) ‘From data to insight for a polar supply and research vessel’, Ship Technology Research, pp. 1–17. doi: 10.1080/09377255.2018.1464241. [2] Bekker, A., Zijl, C. M. Van and Saunders, C. F. W. (2018) ‘The detection of wave slamming from vibration measurements on a polar supply and research vessel’, in Submitted to the 14th International Conference on Vibration Engineering and Technology of Machinery. Lisbon, Portugal. [3] Brincker, R. and Ventura, C. (2015) Introduction to Operational Modal Analysis. First Edit. John Wiley & Sons Ltd. doi: 10.1002/9781118535141. [4] Dessi, D. (2014) ‘Whipping-based criterion for the identification of slamming events’, International Journal of Naval Architecture and Ocean Engineering, 6(4), pp. 1082–1095. doi: 10.2478/ijnaoe-2013-0232. [5] Finland), (STX (2010) FINNSAP finite element analysis of the PSRV NB1369. Rauma, Finland. [6] Kapsenberg, G. K. et al. (2002) ‘Whipping loads due to aft body slamming’, in 24th Symposium on Naval Hydrodynamics. Fukuoka, Japan. [7] Kapsenberg, G. K. (2011) ‘Slamming of ships: where are we now?’, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 369(1947), pp. 2892–2919. doi: 10.1098/rsta.2011.0118. [8] Moedas, C., Pandor, N. and Kassab, G. (2017) Belem Statement on Atlantic Research Innovation Cooperation, 13 July. Lisbon: Atlantic Ocean Research Alliance. Available at: https://ec.europa.eu/research/iscp/pdf/belem_statement_2017_en.pdf (Accessed: 21 July 2017). [9] Moton, C. J. (1991) Open-water resistance and seakeeping characteristics of ships with ice-breaking bows. Annapolis, Maryland. Available at: http://www.dtic.mil/dtic/tr/fulltext/u2/a245643.pdf (Accessed: 18 March 2018). [10] Omer, H. and Bekker, A. (2016) ‘Detection of wave slamming sites from ship deflections’, Research and Development Journal of South Africa, 32, pp. 50–57. [11] Omer, H. and Bekker, A. (2018) ‘Human responses to wave slamming vibration on a polar supply and research vessel’, Applied Ergonomics. Elsevier Ltd, 67, pp. 71–82. doi: 10.1016/j.apergo.2017.09.008. [12] Pastor, M., Binda, M. and Harčarik, T. (2012) ‘Modal Assurance Criterion’, Procedia Engineering, 48, pp. 543–548. doi: 10.1016/j.proeng.2012.09.551. [13] Sharkh, S. M. et al. (2014) ‘A Novel Kalman Filter Based Technique for Calculating the Time History of Vertical Displacement of a Boat from Measured Acceleration’, Marine Engineering Frontiers, 2(August). Available at: www.seipub.org/mef. [14] Soal, K. (2018) System identification and modal tracking of ship structures. Stellenbosch University. [15] Soal, K., Bienert, J. and Bekker, A. (2015) ‘Operational modal analysis on the polar supply and research vessel the S.A. Agulhas II’, in 6th International Operational Modal Analysis Conference, IOMAC 2015. [16] Vacher, P. (2010) ‘Extensions of the MAC Criterion to Complex Modes.’, in International Conference on Noise 330 and Vibration Engineering (ISMA) . 320 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Remote Monitoring of Wind Turbine Blades based on High-speed Photogrammetry Miaoshuo Li1, Xianghong Wang2, Fulong Liu1, Xiaoli Tang1, Fengshou Gu1 & Andrew D. Ball1 1School of Computing and Engineering, University of Huddersfield, Queens gate, Huddersfield, HD1 3DH, UK 2Schoole of Automobile and Mechanical Engineering, Changsha University of Science and Technology, 410004, China ABSTRACT The wind turbine energy industry has been growing rapidly in recent years. Due to extreme operating environments, condition monitoring of wind turbine blades is a challenging task and received extensive studies. The conventional methods based on contact transducers such as ultrasonic or fibre-optical strain gauge are expensive and complex to install and the measurement range is also limited to discrete points. To overcome this issue, this paper presents a new way of condition monitoring for the wind turbine blades based on a high-speed photogrammetry in combination with a scheme a non-contact excitation. The research progress is reviewed regarding to the condition monitoring techniques of wind turbine blades and development on photogrammetry. A photogrammetry system is built to identify the vibration modes of wind turbine blades without mounting any sensors on blades. An air impulse excitation system that consists of a solenoid valve and high-pressure air from air compressor is used to produce the artificial impulse excitation. A high-speed industrial camera captures the image and measures the dynamic displacement of the blades. The results obtained by photogrammetry is accurate and reliable, compared with that of conventional accelerometer-based measurements. Moreover, the faults caused by snow or ice covering are simulated by changing mass of blades, likewise fault of loosening the screws was also tested. The modal parameters extracted from baseline and faulty conditions show significant distinctions, which has affirmed that this vision method is effective and sensitive to blade faults. Keywords: condition monitoring, wind turbine blades, photogrammetry, modal analysis, high-speed camera. Corresponding author: Fengshou Gu(F.Gu@hud.ac.uk) 1. INTRODUCTION Due to the harsh service environment and complex working conditions, faults of wind turbines are inevitable and frequent, which will lead to long downtimes and high maintenance costs [1]. As the core component of the wind turbine to capture wind energy, the blades are often operating under a high stress and the blade faults always cause secondary accident [2]. Therefore, detecting faults of wind turbine blades timely are significant for improving operational reliability and reducing maintenance costs [3]. At present, the testing methods for the condition monitoring of wind turbine blades [4-6] mainly include stress method, acoustic emission method, impedance method, ultrasonic method, vibration method, and infrared detection method. Apart from the infrared detection method, the other methods require to install many sensors that is necessary to provide information at limited discrete points. However, the complex dynamic response of complex blades cannot be easily measured using a limited set of sensors. Placing sensors on a utility-scale wind turbine blade is labor intensive, affects the normal operation of the blades, and can also introduce electrical noise to the measured signal due to the extensive wiring. Infrared detection is a type of non-contact detection, with features such as a large area and remote detection. However, its detection results are easily affected by environments such as external temperature, and the identification and qualitative analysis of defects are still difficult to be achieved. 321 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Due to the limitations of these methods, the researchers had used photogrammetry technology [7, 8] to detect the blade state. The technology combines the advantages of the vibration method and the visual method to measure the vibration of the object in a non-contact method. Corten et al. [9] used photogrammetry technology for the first time to measure the vibration of a wind turbine with a diameter of 10m. The measurement error was 0.043% of the visual field range. Johnson et al. [10] developed a point projection stereography system consisting of a digital projector and a binocular camera. The system was used to collect surface vibrations during the static and fatigue tests of the blades and captured submillimeter-level deformation and torsion. Baqersad et al. [11] and Lundstrom et al. [12] conducted dynamic experimental studies on small wind turbine blades and identified their modal shapes. Ozbek et al. [13] applied the three-dimensional point tracking to a 2.5 MW wind turbine and the modal parameters of the rotating utility-scale wind turbine blade were identified using stereo-photogrammetric measurement. The above studies are static, or fatigue results conducted in a laboratory environment. Moreover, these studies mainly focused on the measurement of blade deformation and vibration, and did not further analyze the correspondence between deformation, vibration and faults such as blade cracks and unbalanced failures. To make more advancement of this promising technology, the research of condition monitoring for a 2 kW wind turbine blades was carried out based on the high-speed photogrammetry in this paper. Firstly, a marked point tracking method is put forward to detect the motion of markers on the blade. Secondly, the effectiveness of photogrammetry inspection on vibration is verified by comparing with the results of the acceleration detection. Thirdly, the experiment and results of faults identification is acquired. Finally, the conclusions are given. This current paper paves the way for developing an optical condition monitoring system that can be used to perform full-sized structural evaluation of a real-world wind turbine blades. 2. METHODOLOGY The measurement of vibration displacement of wind turbine blades using photogrammetry technology includes the following key steps: 1) preprocessing, including strobe filter, which removes the interference noise generated by white-bright lamps, and image preprocessing, which is mainly used to enhance the brightness and contrast of the image and realize image inversion; 2) measurement of vibration displacement, mainly recognizing the feature points and obtaining the actual vibration displacement; and 3) modal identification using a stochastic subspace method. The specific theories are detailed in this section in order to provide a basis for experimental studies. 1) Preprocessing The traditional fluorescent lamp is directly used for 50 Hz alternating current, and its strobe frequency is 100 Hz. When the digital camera collecting, the water ripple will appear due to strobe frequency of current of lamp. To get the stable lightning condition in the laboratory, a filter of the video is implemented to cut out most fluorescent ambient light. At the same time, to highlight the feature of point on the blades, the invert and adjustment of brightness and contrast of images is implemented. In this study, the brightness and contrast are adjusted by (1) where ( , )f i j is the pixel of source image point (i, j), and ( , )g i j is the adjusted pixel I(i, j) indicate that the pixel is in row i and column j. and  controls the contrast and brightness of image, respectively. 2) Measurement of vibration displacement g( , ) ( , )i j f i j =  +  322 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The first step of measuring vibration is to track the feature points on the wind turbine blades. Because Harris corner detector has the features of strong invariance to rotation, scale, illumination variation and image noise [14], the Harris corner detector is adopted to recognize the feature points in the wind turbine blades and get a good performance. This algorithm measures the local gray level changes of an image with patches shifted by a small amount in different directions based on a local auto-correlation function of the image. Given the pixel gray level as ( , )f i j , the gray level intensity changes , ( , )x yE u v of each pixel (u, v) of the image moves a very small (x, y) can be described as follows: 2 , , , 2 2 2 , , 2 , 2 , E ( , ) ( ( , ) ( , )) [ ( , )] [ , ]( ) x y u v x y u v x y x y x x y u v x y x y y u v W f u x v y f u v W f u f y o u v f f f x x y W f f f y =  + + − =  + +                   (2) where W is the low pass filtering by a Gaussian function, and ,u vW is the coefficient that Gauss filter in location (u, v). x f f x  =  , y f f y  =  is the respective horizontal and vertical derivative of image ( , )f x y , respectively.  denotes convolution. Denoting 2 , 2 , x x y u v x y x y y f f f M W f f f   =        , so Eq. (2) can be described as  , ( , )x y x E u v x y M y   =     (3) The eigenvalues of M can describe the extreme curvature of the image gray level autocorrelation function. And a corner point can be judged according to the size of eigenvalues. Therefore, a corner response function (CRF). R is defined to determine whether the pixel (u, v) is a corner point by the size of R. R is described as: 2R( , ) det[ ] (trace )x y M k M= − (4) where 1 2 1 2det , traceM M   = = + , 1 and 2 are the eigenvalues of M. 0.04 ~ 0.06k = is an empirical constant. Therefore, the pixel displacement of feature points on the blades can be obtained. Then the homographic transformation [15] is used to obtain the vibration displacement of each feature point. 3) Modal identification Since the input of air impulse excitations is hardly measured precisely. A type of output-only modal identification is adopted. Specifically, the covariance-driven stochastic subspace identification (Cov-SSI) algorithm [16], which can be implemented on line due to less computational cost, was applied to the multi-point vibration responses and then the modal parameters, including natural frequencies, damping ratio and mode shapes will be identified, which is crucial to distinguish healthy and faulty conditions. 323 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3. EXPERIMENT 3.1. Experimental Rig In this research, an experimental rig consists of a set of blades from a 2kW wind turbine and an air excitation system. The model of wind turbine is iSTA Breeze I-2000, it has 3 composite blades with length of 107 cm and weight 650 gr. To obtain vibration modal properties (frequency, damping ratio and shape) of these lightweight blades in laboratory, the blades is fixed on a frame shown in Figure 1 and excited by air pulses. In this way, the measurement was carried out which are more similar to the real operating conditions. In addition, 7 black markers are made on the trailing edge as the objective to be tracked by a remote high-speed camera for obtaining the flap-wise vibration as illustrated in Figure 1. Distance=2.47m Air excitation Piezoelectric sensors Markers Figure 1. Schematic diagram of experimental rig and instrumentation of the high-speed camera and the accelerometers The air excitation system is made up of a compressor to generate high pressure air, an air pressure regulator gauge, and a solenoid valve. This system can produce an air impulse to excite blades, being like natural excitation due to a strong wind turbulence. In combination of the high-speed camera set at 2.47 metres away from the blades, this test system can measure the vibration modes in a remote way and expected to be more accurate compared conventional contact measurements. 3.2. Instrumentation The video measurement system consists of a high-speed industrial camera with a telescopic lens. The high-speed camera uses a CMOS sensor, the resolution is 640×480 pixel and the maximum fps is at 600. In this experiment, frames per second was set to 200 fps for low computational cost and found is sufficient to identify the vibration properties. The captured videos then are processed in Python & OpenCV and MATLAB using method mentioned in previous sections. Meanwhile, for the verification of the photogrammetric results obtained, acceleration responses were also collected by 4 piezoelectric accelerometers. These sensors are mounted at points near location of markers. The signals are acquired by a four-channel dynamic measurement system which is hosted by laptop (Figure 1). 324 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3.3. Experimental Procedure The first step, to verify the accuracy of optical measurement. While the camera is capturing the video, piezoelectric accelerometer is collecting acceleration at the same time. Because the traditional contact sensor is precise and frequency range is wide, the vibration signals collected from accelerometers can be regarded as a good reference to verify the error of the photogrammetry method. The second step, the baseline condition is obtained. All used piezoelectric accelerometers are removed from the blades. since mass of the sensors inevitably has impact on the dynamic characteristics of the blades. So that using camera we can obtain the more real dynamic characteristics of healthy condition, The third step, according to other researchers’ previous research [17], the trailing edge deformation and real load distribution at root end are two important factors leading to failure. Thus, at the non-destructive condition, loosening the screws at the end root and adding mass near the trailing edge are implemented to simulate the typical fault on the wind turbine blades. The aim is clear that judge the fault using photogrammetry. 4. RESULT ANALYSIS 4.1. Verification in Time Domain and Frequency Domain Unlike accelerometers measuring the acceleration straightforwardly, photogrammetry need to detect and track markers in the video, get the trajectory of markers’ motion, and then convert trajectory of pixel in successive images into physical displacement in space, finally solve the accelerations through differential calculation. Figure 2 shows 5 acceleration responses on 5 monitored markers. In this figure, periodicity of signals is clear, and the amplitudes increase with respect to the distance of blade root. Figure 2. Acceleration of 5 markers acquired from photogrammetry 325 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The comparison between photogrammetry signals and accelerometer signals in the time domain and frequency domain is shown in Figure 3. Sampling rate of piezoelectric sensor is 1.5k Hz, that is significantly higher than photogrammetry (200 Hz). Apparently more high frequency components can be acquired by the accelerometers. Meanwhile, the resolution of CMOS sensor also has the influence on the result. Despite these, the amplitude and frequency values are well agreeable. Given that large-scale wind turbine blades have lower natural frequency, in this case the photogrammetry should be very effective. Figure 3. Comparison of comparison between photogrammetry signals and accelerometer signals in Time domain and frequency domain 4.2. Verification of Modal Identification Having confirmed that the vibration response signals is reliable, the stochastic subspace identification is implemented to identify the natural frequency, damping ratio, and the modal shape in order to implement a condition monitoring method without the needs of baseline data. To calibrate the results, the stabilization diagram and the results are provided in Figure 4 and Figure 5 respectively for the case of normal blade conditions. Although, there are different spurious modes in these two diagrams, both they successfully identify the same modes. In Figure 5, the differences for first and second natural frequencies is 0.31% and 0.1% respectively. However, the error of damping ration is more obvious, which is common problem in using output-only methods. Moreover, two mode shapes are consistent with each other for the two methods. These then confirms that the photogrammetry system shows acceptable performance. 326 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 4. The stabilization diagram of the method of accelerometer and photogrammetry Figure 5. The modal parameters for the first and second modes 327 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 4.3. Condition Monitoring based on Modal Parameters Because the modal parameters of wind turbine blades can be accurately extracted from photogrammetry. The condition monitoring test based on the photogrammetry can be implemented logically. The artificial faults are set to check out if this method is stable and sensitive to some common fault of the wind turbine blades. All in all, the change of natural frequency can indicate the faults, regulation of damping ratio is not very clear. Modal shapes keep feature of first and second order mode but cannot see a significant change. At the beginning, all sensors are removed from the blade to test the parameters of baseline condition, so that the faulty cases could have a standard. Since part mass are subtracted, the natural frequency increase naturally, comparing with the above verification test. After that, the sensors as 3 extra mass (11g per sensor) are gradually added on the blade, the change of frequency are discovered clearly, especially when the third mass block is added near the end of the blade, where often accumulates most snow or ice and cause most enormous imbalance and lead to fault easily. Finally, the screw which fix blades on the shaft was loosened, this part always bears huge stress. Therefore, this fault is chosen to simulate early crack or loosening. From table 1 it can be seen that the change in natural frequency is distinctive, which allows for an assessment of blade conditions. Table 1: change of modal parameters due to faults First order Second order Natural frequency Change rate f Damping ratio Change rate  Natural frequency Change rate f Damping ratio Change rate  Baseline case 8.79Hz 0% 1.4% 0% 29.73Hz 0% 0.9% 0% Snow or frozen 1 mass 8.63Hz -1.82% 1.0% -28.57% 28.87Hz -2.89% 1.3% 44.44% 2 mass 8.34Hz -5.12% 1.6% 14.29% 28.65Hz -3.63% 1.2% 33.33% 3 mass 7.50Hz -14.68% 1.4% 0% 26.83Hz -9.75% 1.0% 11.11% Screw Loosen 8.61Hz -2.05% 1.2% -14.29% 29.26Hz -1.58% 1.4% 55.55% 5. CONCLUSION In this research, modal parameters identified using a photogrammetry method are validated by the result of the traditional piezoelectric sensor based method. The difference of the natural frequency is less than 0.31%. Furthermore, the healthy case and the faulty cases are differentiated by the image signals distinctly. The mass change of blades as small as 11g can be detected, which is shown more significantly when the added mass is more distant from the root. In conclusion, as a novel non-contact method, photogrammetry combines with the air impulse excitation is an effective way to implement the remote condition monitoring of wind turbine blades. REFERENCES [1] Yang W, Tavner P J, Crabtree C J, et al. Wind turbine condition monitoring: technical and commercial challenges[J]. Wind Energy, 2014, 17(5): 673-693. 328 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management [2] C A Walford. Wind turbine reliability: Understanding and minimizing wind turbine operation and maintenance costs [R]. USA: Sandia National Laboratories,2006. [3] M H Hansen. Aeroelastic instability problems for wind turbines[J]. Wind Energy, 2007, 10(6):551-577. [4] W Yang, Z Peng, K Wei, et al. Structural health monitoring of composite wind turbine blades: challenges, issues and potential solutions[J]. Iet Renewable Power Generation, 2017, 11(4):411-416. [5] Li D, Ho S C M, Song G, et al. A review of damage detection methods for wind turbine blades[J]. Smart Materials and Structures, 2015, 24(3): 033001. [6] Ye G, Neal B, Boot A, et al. Development of an ultrasonic NDT system for automated in-situ inspection of wind turbine blades[C]//EWSHM-7th European Workshop on Structural Health Monitoring. 2014. [7] E Mikhail, J Bethel, J C McGlone, Introduction to Modern Photogrammetry, New York:Wiley, 2001 [8] Poozesh P, Baqersad J, Niezrecki C, et al. Large-area photogrammetry based testing of wind turbine blades[J]. Mechanical Systems and Signal Processing, 2017, 86: 98-115. [9] G P Corten, J C Sabel. Optical motion analysis of wind turbines [C]. In: Proceedings of European union wind energy conference. Goteborg, Sweden, 1996, 5: 20-24. [10] J T Johnson, S Hughes, J V Dam. A stereo video grammetry system for monitoring wind turbine blade surfaces during structural testing[J]. ASME Early Career Tech J, 2009, 8:1-10. [11] J Baqersad, J Carr, T Lundstrom. Dynamic characteristics of a wind turbine blade using 3D digital image correlation[C]. In: health monitoring of structural and biological systems, Proceedings of SPIE, San Diego, USA, 2012, 83:12-15. [12] T Lundstrom, J Baqersad, C Niezrecki, et al.Using high speed stereo photogrammetry techniques to extract shape information from wind turbine/rotor operating data. In: Topics in modal analysis II[C]. Proceedings of the 30th IMAC, a conference on structural dynamics. New York: Springer, 2012, 6: 269-275. [13] Ozbek M, Rixen D J. Operational modal analysis of a 2.5 MW wind turbine using optical measurement techniques and strain gauges[J]. Wind Energy, 2013, 16(3): 367-381. [14] Lowe D G. Distinctive Image Features from Scale-Invariant Keypoints[C]. International Journal of Computer Vision. 2004:91-110. [15] Hartley, Richard, Zisserman, Andrew. Multiple view geometry in computer vision[J]. Kybernetes, 2004, 30(9/10):1865 - 1872. [16] Reynders E, Roeck G D. Reference-based combined deterministic-stochastic subspace identification for experimental and operational modal analysis[J]. Mechanical Systems & Signal Processing, 2008, 22(3):617-637. [17] Zhou, H.F., et al., A review of full-scale structural testing of wind turbine blades[J]. Renewable and Sustainable Energy Reviews, 2014. 33: p. 177-187. 329 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management An improved resonance demodulation technique based on spectral kurtosis and fault characteristic harmonic-to-noise ratio Chang Yan1, Jing Lin 1,2, Ming Zhao1, liang Zeng1 1 School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, China 2 School of Reliability and Systems Engineering, Beihang University, Xueyuan Road No. 37, Haidian District, Beijing, China ABSTRACT Rolling bearings are widely used in energy, construction machinery and high-speed railway because of its strong load-bearing capacity and small start-up torque. The faults of rolling bearings as the core component of the machine can lead to huge economic losses and even fatal accidents. The health monitoring and fault diagnosis of rolling bearings have attracted increasing attention over the past decades. Traditional spectral kurtosis (SK) method is commonly implemented by kurtogram to determine the resonance frequency band for demodulation. However, there exist problems when SK is applied to signals containing random strong interferences. As a result, envelope harmonic-to-noise ratio (EHNR) is proposed to reduce the influence of random strong interferences and strengthen the periodic components in the signal. Unfortunately, EHNR is not sensitive to fault impact when the periodic components in the signal are not related to the failure such as rotational frequency interferences and power frequency interferences. To overcome these limitations, this paper proposes an improved resonance demodulation technique (RDT) based on SK and fault characteristic harmonic-to-noise ratio (FCHNR). Compared with SK, the improved RDT has two distinct advantages. Firstly, the proposed method can be applied to the rolling bearings under the working condition with variable speed through integrating with order tracking technique. Secondly, the resonance frequency band is no longer solely determined by the maximum value of kurtosis or EHNR. Instead, the periodicity and impulsivity of rolling bearing fault can be taken into account simultaneously, thus the improved RDT has higher robustness. Finally, the improved RDT is validated by real fault data of railway bearings. The results show that the improved RDT can detect the fault of rolling bearing under variable speed with higher efficiency and superiority. Keywords: spectral kurtosis; fault characteristic harmonic-to-noise ratio; resonance frequency band; order tracking 1. INTRODUCTION With the advancement of modern industry, rolling element bearings are widely used in various rotating machinery as core components, such as high-speed railway, mine excavator, aircraft engine, etc. The faults of rolling bearings may lead to huge economic losses and even fatal accidents [1,2]. Hence, increasing attention has been paid on the health monitoring and fault diagnosis of rolling bearings over the past decades. Since the vibration signals contain rich information on health conditions of rotary machinery, vibration analysis is an effective approach for fault diagnosis of rolling bearings [3-6]. Once the localized faults occur on a rolling bearing, the periodic impacts caused by faults can excite the system resonant and generate amplitude modulation phenomenon. Therefore, how to accurately determine the resonance frequency band of vibration signals and realize faults feature extraction in the background of strong noise is crucial. Various signal processing methods have been developed for the diagnosis of rolling bearings. Antoni J [7] proposed spectral kurtosis (SK) methodology to find the optimal filter for fault diagnosis. Y. Wang and M. Liang [8] proposed an adaptive SK technique to determine the bandwidth and center frequency for the fault detection of rolling element bearings. Y. Lei et al. [3] proposed an improved Kurtogram based on wavelet packet transform. Xu X et al. [9] introduced a new index, EHNR, to improve Kurtogram for periodic impulses detection. 330 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Among all kinds of signal processing methods, SK and EHNR have been demonstrated as effective and efficient indexes for impact detection of rolling bearings. However, the kurtosis index is only sensitive to the impact of fault, but not the periodicity of fault. When the fault is weak, the impact of the fault is not obvious and easily submerged by random strong interference in the environment, resulting in the error of the resonance frequency band determined by the kurtosis index. On the other hand, EHNR is proposed to weaken the influence of random strong interference and enhance the periodicity component. However, the sensitivity of now available EHNR to impact caused by faults is insufficient, thus it is easy to be misled by the periodic component caused by non-fault factors when determining the resonance frequency band. To overcome these drawbacks of SK or EHNR and take the impact and periodicity of the fault into account, an improved RDT is explored in this paper. The proposed method filters kurtosis value by calculating FCHNR, which is different from the existing EHNR, of the same frequency band. FCHNR has directivity to the fault impact. Finally, the effectiveness and robustness of the proposed improved RDT is validated by the data from a railway bearing test rig. 2. THEORETICAL BACKGROUND 2.1.SK and kurtogram SK was first proposed by Dwyer to locate the resonance frequency to estimate the impulsivity. The theoretical basis of SK was further developed by J. Antoni [10,11] for real cases. Given time variable t; ),( ftX is the complex envelope of signal )(tx at frequency f . The 2n order spectral instantaneous moments of the signal )(tx is defined as follows:  nn ftXftS 2 2 |),(|),( , ............................................................. (1) where < > is the mean operation over time. Thus, the SK of ),( ftX can be described as follows: 2 ),( ),( ),( 2 4  ftS ftS ftK , ............................................................... (2) 2.2. Fault characteristic harmonic-to-noise ratio For rolling bearings, assuming that the theoretical ball pass order of outer race (BPOO), ball pass order of inner race (BPOI), fundamental train order (FTO) and roller spin order (RSO) are of , if , cf and rf , respectively. For input signal )(tx , the equal angle resampling signal can be denoted ( )y  , of which the Fourier transform is )(fO . Due to the discrete error, the actual value of )(fO at of , if , cf and rf are )ˆ( ofO , )ˆ( ifO , )ˆ( cfO and )ˆ( rfO , respectively, and their high order harmonics are )(  onfO , )(  infO , )(  cnfO and )( r  nfO , where n =2,3,4… . It's worth noting that )(  nfO is not strictly equal to ˆ( )O n f . Let  be the standard deviation of the )(fO . For any j , if ( ) 6jO f   , remove )( jfO from the array of )(fO . As a result, a new array )~(fN can be obtained, whose length is less than the length of )(fO . The noise energy level can be estimated by )~(fN , of which  is the standard deviation. When two and three times the fault characteristic frequency occur on the spectrum clearly, it is generally believed that the bearing has a early failure. If there is only a clear fault characteristic frequency, we can't come to the same conclusion. 331 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management As a result, the FCHNR of outer race is defined as follows: 2 (2 ) (3 )o o o O f O f r      , .................................................................. (3) The FCHNR of inner race is defined as follows: 2 (2 ) (3 )i i i O f O f r      , ................................................................... (4) The FCHNR of fundamental train is defined as follows: 2 (2 ) (3 )c c c O f O f r      , .................................................................. (5) The FCHNR of roller is defined as follows: 2 (2 ) (3 )r r r O f O f r      , .................................................................. (6) 2.3. An improved RDT Due to the limited impact energy caused by early faults of rolling bearings, it is difficult to determine the resonant frequency band by the SK or the FCHNR alone. An improved RDT is proposed to integrate SK and FCHNR to improve the resonance demodulation technique as shown in figure 1. First, the SK is applied to find the impact components in the measured signal. And then FCHNR is employed to determine whether the impact components is related to the failure of outer race, inner race or the fundamental train of rolling bearings. The threshold can be the average of the FCHNRs of all bands or the median of the FCHNR array. Filtered Signal by Filter Bank(1/3 Binary Tree) Envelope of Signal by Hilbert Transform Vibration Signal Acquisition FCHNR>Threshold Kurtosis = 0 Calculate Kurtosis and FCHNR Find the Filter Band with Maximum Kurtosis Envelope Order Spectrum and Further Analysis Equal Angle Interpolation of Envelope Signal Figure 1. The flow chart of the proposed hybrid method. 332 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3. EXPERIMENT The experimental data of a railway rolling bearing test rig is utilized to illustrate the validity of the proposed method. The outer race of the bearings is fixed with the carriage and the inner race rotates with the wheel shaft, whose rotating speed is monitored by a tachometer. A PCB accelerometer is mounted on the top of axle box to pick up the vibration signal of the bearing. The geometric parameters of the bearing are listed in table 1, and the sampling frequency is 25600Hz. The original vibration signal with 0.5 seconds is shown in figure 2 and the rotating speed signal is shown in figure 3. Figure 4(a) (b) and (c) show the results of the original kurtogram, EHNR-based kurtogram and the proposed hybrid kurtogram, respectively. Obviously, they find different optimal resonance frequency bands. The filtered signals at those optimal resonance frequency bands are demodulated with Hilbert transform, and combined order tracking method, order envelope spectrum (OES) was obtained. Figure 5(a) shows the result of OES of the original vibration signal, from which we can’t find obvious fault feature. There is also no fault harmonic in the results of OES based on SK or EHNR, which are shown in figure 5(b) and figure 5(c), respectively. Figure 5(d) shows the result of OES based on the proposed method, and we can see the obvious outer race fault characteristic harmonics. The diagnosis result is consistent with the actual fault. Table 1. Geometric parameters of the Rolling bearings. Number of rollers Contact angle (degree) Roller diameter (mm) Pitch diameter (mm) 19 10 26 180 Figure 2. The original vibration signal Figure 3. The rotating speed signal (a) 333 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management (b) (c) Figure 4. The result of kurtogram: (a) The original kurtogram;(b) EHNR-based kurtogram;(c) HM-based kurtogram (a) (b) 334 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management (c) (d) Figure 5. The order envelope spectrum: (a) OES of the original vibration signal; (b) OES based on SK; (c) OES based on EHNR; (d) OES based on proposed method 4. CONCLUSION Through the comparisons among the SK-based RDT, EHNR-based RDT and the improved RDT, it is appropriate to draw the following conclusions that the proposed hybrid method combined SK and FCHNR has higher robustness. It improved the traditional resonance demodulation technique and is proved effective for fault diagnosis of rolling bearing. SK is used to find the frequency band which is related to the impact. And the FCHNR is applied to determine whether the impact is related to failure of rolling bearing. The proposed method is demonstrated by data collected from a railway rolling bearing test rig. And it can detect the fault of rolling bearing under variable speed with higher efficiency and superiority. 335 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 5. ACKNOWLEDGMENT This project is supported by the National Natural Science Foundation of China (Grant No. 51421004), which are highly appreciated by the authors. REFERENCES [1]. Randall R B. Vibration-based condition monitoring : industrial, aerospace and automotive applications[J]. Mechanisms & Machine Science, 2011, 3(4):431-477. [2]. Wang D, Zhao Y, Yi C, et al. Sparsity guided empirical wavelet transform for fault diagnosis of rolling element bearings[J]. Mechanical Systems & Signal Processing, 2018, 101:292-308. [3]. Lei Y, Lin J, He Z, et al. Application of an improved kurtogram method for fault diagnosis of rolling element bearings[J]. Mechanical Systems & Signal Processing, 2011, 25(5):1738-1749. [4]. Ou L, Yu D, Yang H. A new rolling bearing fault diagnosis method based on GFT impulse component extraction[J]. Mechanical Systems & Signal Processing, 2016, 81:162-182. [5]. Li Y, Xu M, Wang R, et al. A fault diagnosis scheme for rolling bearing based on local mean decomposition and improved multiscale fuzzy entropy[J]. Journal of Sound & Vibration, 2016, 360:277-299. [6]. Janjarasjitt S, Ocak H, Loparo K A. Bearing condition diagnosis and prognosis using applied nonlinear dynamical analysis of machine vibration signal[J]. Journal of Sound & Vibration, 2008, 317(1):112-126. [7]. Antoni J. The spectral kurtosis: a useful tool for characterising non-stationary signals[J]. Mechanical Systems & Signal Processing, 2006, 20(2):282-307. [8]. Wang Y, Liang M. An adaptive SK technique and its application for fault detection of rolling element bearings[J]. Mechanical Systems & Signal Processing, 2011, 25(5):1750-1764. [9]. Xu X, Zhao M, Lin J, et al. Envelope harmonic-to-noise ratio for periodic impulses detection and its application to bearing diagnosis[J]. Measurement, 2016, 91:385-397. [10]. Antoni J, Randall R B. The spectral kurtosis: application to the vibratory surveillance and diagnostics of rotating machines[J]. Mechanical Systems & Signal Processing, 2006, 20(2):308-331. [11]. Antoni J. Fast computation of the kurtogram for the detection of transient faults[J]. Mechanical Systems & Signal Processing, 2007, 21(1):108-124. 336 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management A vision of energy-based visualisation of large scale industrial systems for the purposes of condition monitoring G. van Schoor1, K.R. Uren2 1 Unit for Energy and Technology Systems, Faculty of Engineering, North-West University, South Africa 2 School of Electrical, Electronic and Computer Engineering, North-West University, South Africa ABSTRACT Most large scale industrial systems can be viewed as processes that convert energy from one form to another. Energy is a universal quality of which the distribution holds meaning across physical domains i.e. thermal-fluid, mechanical, electrical and chemical. This paper presents a vision of using energy information of a process to visualise or characterise what is happening in the process with regards to energy. This energy information is then represented in a form that retains the structural information, making it possible to relate patterns in the representation to specific locations in the process. The energy attributes considered include energy flow rates between components of the system and the change in exergy flow rates across components. An energy representation of a process can form the basis for fault detection and diagnosis (FDD) or optimal control through the definition of a reference energy representation. The use of energy information linked to specific locations in the system, classifies the approach to FDD in the hybrid class which represents a hybrid between a pure data-driven and model-based approach. Attributed graphs are considered in this work as a structure that can describe the system’s energy attributes while retaining structural information. The usefulness of the energy representations for the purpose of condition monitoring is illustrated through a few case studies. The case studies include a heated two tank system, a gas to liquids process and a Brayton cycle. The results illustrate that energy-visualisation as a means to condition monitoring shows promise in terms of detecting and diagnosing typical faults in large scale industrial processes where energy conversion is the main concern. Keywords: energy visualisation, condition monitoring, large scale industrial systems, attributed graphs 1. INTRODUCTION Modern industrial plants are becoming more reliant on condition monitoring and fault tolerant control schemes to be more efficient and profitable. Also, strict safety and environmental regulations further motivate the need for advanced process monitoring schemes. For future monitoring schemes to be successful, aspects such as robustness, uncertainties and big data need to be addressed [1], [2]. Accordingly, new approaches need to consider a hybrid approach that combines various data-driven, analytical and knowledge-based techniques. This paper shares the vision that energy as a unifying attribute can be used to characterise large scale industrial systems. The synthesis of any complex system into a geometric representative structure, with functional nodes inter-connected by process streams and mechanical, electrical or heat links and the subsequent energy characterisation of the structure, results in a data structure that poses very useful for the purposes of FDD and control. The concept of energy visualisation as a means to characterise and assess the health of a system initiated from the study of [3], [4] on the health monitoring of a Brayton cycle using the enthalpy and entropy attributes of the system. In this work the enthalpy and entropy distribution across the different locations in the system are used as a basis for health assessment. This idea was further explored in a paper by van Schoor et al. presenting a survey on energy as basis for modelling, supervision and control [5]. This paper initiated the exploration of ways to visualize energy characteristics of components in industrial plants such as heat exchangers and compressors [6], [7]. Considering literature related to the analysis and characterisation of industrial systems from an energy perspective, it became evident that both the first and second laws of thermodynamics are important [8]. This implies that knowledge of both the quantity and quality of energy is necessary for a complete characterisation of a system. This involves the inclusion of exergy as a quality parameter for energy-based visualisation of systems. Exergy-based 337 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management methods have their roots in exergetic and thermoeconomic analysis and their use for diagnosis and optimisation have been widely discussed in literature [9]–[11]. Since many industrial plants can be classified as complex and large-scale due to the large number of component interactions, a mathematical framework is needed to represent these component interactions and attributes. Also, this mathematical framework should be integrable in a fault detection and diagnosis scheme. Graph theory is considered a natural fit for this purpose. Graphical approaches toward fault detection and diagnosis are readily found in literature [12]–[14]. Smaili et al. [12], developed a fault monitoring framework for multi-energy systems using signed directive graphs. Directed graphs are used to capture the process topology and connectivity to show the causal relationships between variables. Graphs are then used for qualitative fault indicators in the form of fault signature matrices. Another study conducted by van Graan et al. considered a linear graph representation of a heat exchanger [15]. Exergy and energy flowrates were assigned as attributes to the nodes and links of the linear graph. This graph was then used for fault detection using a pattern recognition technique called graph matching [16]. Graph matching is mainly concerned with finding a correspondence between the node and link attributes of different graphs in an optimal way [17]. In this paper the goal is to discuss a novel approach and vision of representing industrial processes as attributed graphs containing energy information. These graphs will be called energy signatures which represent an energy visualisation of a process. The term visualisation can mean a visual picture of the condition of the plant, but may also take an abstract form such as a matrix. It will be shown how these energy signatures can be utilised for fault detection and diagnosis by means of attributed graph matching. The paper is organized as follows. Section 2 shares the idea of energy visualisation as a means to characterise large scale industrial systems. In section 3 the concept of energy visualisation for fault detection and diagnosis is illustrated through a few case studies. The paper is concluded in section 4. 2. ENERGY VISUALISATION In this section the nature of multi-domain systems is firstly discussed after which the idea of using graphs to visualise system energy attributes in a structured way is presented. Lastly the concept of using graph matching as a means to represent system fault information in a structured way is presented. Throughout this section the example of a simplified coal-fired power station is used to illustrate the applicable concepts. 2.1. Multi-domain systems Energy is a universal quality of which the distribution holds meaning across physical domains i.e. thermal-fluid, mechanical, electrical and chemical [5]. In a coal-fired power station energy is transformed through a sequence of processes across physical domains. The source of energy would in this case be coal of a certain calorific value. The coal is transformed from chemical potential energy to thermal potential energy through a chemical process where chemical kinetics describe the transformation of energy. The thermal potential energy is transformed into mechanical energy through a thermal-fluid process that is described by thermodynamics. The mechanical energy is transformed to electrical energy through an electromechanical process described by electromagnetics. The product of the process in this case is electrical power. The idea emphasised in this paper is that the energy distribution across the process with specific reference to structure reveals important information about the health of the system. Energy information in the form of energy stored, energy converted, and energy dissipated at the different points in the system will be of specific interest. Due to process irreversibility in the thermal-fluid and chemical domains also characterising these processes in terms of exergy lost or exergy change is of specific interest. The energy interaction through a general multi-domain system component can be illustrated as shown in Fig.1. The component experiences thermal fluid or chemical domain interaction in the form of a fluid stream that enters and exits the component at energy rates of 𝑄ሶ௜௡ and 𝑄ሶ௢௨௧ respectively. This interaction stands in direct relation to the other energy interactions indicated i.e. electrical power in ሺ𝑃௜௡ሻ and out ሺ𝑃௢௨௧ሻ, mechanical power in or out ሺ𝑃ሻ and heat flow in or out ሺ𝑄ሻሶ . The change in exergy of a fluid stream through a component is denoted by ∆𝐵 and 338 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management represents the change in the ability of the fluid to do work as it passes through the component. The general energy interaction portrayed can be made applicable to any system component by omitting the interactions that do not apply. In the case of a turbine for example the main interaction will be a fluid stream entering and exiting the component with mechanical power as the main output from the component. In the case of a heat exchanger the primary side can be represented by a fluid stream entering and exiting the component with heat flow in the direction of the secondary side of the heat exchanger. Similarly, the general multi-domain energy interaction can be applied to other process components e.g. a reactor, a pump, and an electrical transformer to name but a few. Component Fluid flow in Fluid flow out Mechanical power in / out Heat flow in / out P B Q inQ outQ Exergy change Electrical flow in Electrical flow out inP outP Figure 1. Illustration of energy interaction through a general multi-domain system component The change in exergy of a fluid stream through a component constitutes components of kinetic, potential, physical and chemical exergy. For the case of a thermal fluid component, considering only the change in physical ሺ∆𝐵௣ሻ and chemical exergy ሺ∆𝐵௖ሻ: ∆𝐵 ൌ ∆𝐵௣ ൅ ∆𝐵௖ ൌ േ𝑊ሶ ൅ 𝐵ௗ௘௦௧௥௢௬௘ௗ, ......................................................... (1) with 𝑊ሶ the nett energy flow rate in or out of the component. 𝑊ሶ could either be equal to 𝑃, the mechanical power in or out of the component or 𝑄ሶ , the nett heat flow in or out of the component. 𝐵ௗ௘௦௧௥௢௬௘ௗ represents the exergy destroyed across the component. The chemical exergy is important where the chemical composition of the stream entering the component differs from the one leaving. Furthermore, from the energy balance requirement: 𝑊ሶ ൌ 𝑄ሶ௜௡ െ 𝑄ሶ௢௨௧, ............................................................................ (2) where 𝑄ሶ௜௡ and 𝑄ሶ௢௨௧ are the heat flow in and out of the component due to the fluid stream. The change in physical exergy of a fluid stream through a component at flow rate 𝑚ሶ , flowing in at enthalpy and entropy values of ℎ௜௡ and 𝑠௜௡ and out at enthalpy and entropy values of ℎ௢௨௧ and 𝑠௢௨௧ respectively with a reference temperature of 𝑇଴, is given by: ∆𝐵௣ ൌ 𝑚ሶ ሺℎ௜௡ െ ℎ௢௨௧ሻ െ 𝑚ሶ 𝑇଴ሺ𝑠௜௡ െ 𝑠௢௨௧ሻ. ....................................................... (3) The change in chemical exergy of a fluid stream through a component at flow rate 𝑚ሶ is given by [18]: Δ𝐵௖ ൌ 𝑚ሶ ൫∑ 𝑥௜𝑏ത௖,௜௜ ൯௜௡ െ 𝑚ሶ ൫∑ 𝑥௜𝑏ത௖,௜௜ ൯௢௨௧, ........................................................... (4) with 𝑥௜ the fraction of the component in the substance and 𝑏ത௖,௜ the standard chemical exergy of the corresponding component. From the similarity between (1) and the sum of (3) and (4) and the requirement for energy balance, 𝑊ሶ ൌ 𝑚ሶ ሺℎ௜௡ െ ℎ௢௨௧ሻ ൅ m ൫∑ 𝑥௜𝑏ത௖,௜௜ ൯௜௡ െ m ൫∑ 𝑥௜𝑏ത௖,௜௜ ൯௢௨௧ and ....................................... (5)𝐵ௗ௘௦௧௥௢௬௘ௗ ൌ െ𝑚ሶ 𝑇଴ሺ𝑠௜௡ െ 𝑠௢௨௧ሻ. ............................................................... (6) From (2) and (5) it follows that 𝑄ሶ௜௡ ൌ 𝑚ሶ ℎ௜௡ ൅ m ൫∑ 𝑥௜𝑏ത௖,௜௜ ൯𝑖𝑛and .............................................................. (7) 339 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 𝑄ሶ௢௨௧ ൌ 𝑚ሶ ℎ௢௨௧ ൅ m ൫∑ 𝑥௜𝑏ത௖,௜௜ ൯௢௨௧ . .............................................................. (8) 2.2. Attributed graph as visualisation instrument Graphs are very useful to represent complex system interactions in a structured way [19]. The use of a specific type of graph called an attributed graph gives the opportunity to capture both structural information and attributes associated with specific locations in the system structure [15], [20]. A graph representation of a system firstly requires the system to be represented by links and nodes. To make it an attributed graph attributes are associated with the links and the nodes. An attribute associated with a link is called a link attribute and an attribute associated with a node is called a node attribute. Different approaches to configuring energy attributed graphs could be taken. Firstly, the choice of what constitutes a link and what constitutes a node needs to be decided. Then what constitutes a link energy attribute and what constitutes a node energy attribute need to be decided. In this work a node is defined as a system component e.g. a compressor or turbine or reactor and a link is defined as a connection between two system components e.g. the connection between a compressor and a turbine. Node energy attributes are seen as energy changes that occur in a component e.g. energy losses in the component or in the case of a thermal-fluid component the exergy change or exergy lost in the component. In the case of a component where chemical energy transformation takes place both physical and chemical exergy changes would be node energy attributes of interest. In cases where energy is stored in a component, the stored energy would also be a node attribute of interest. Link energy attributes are seen as energy flow attributes. Here the power flow between components would normally be of interest as well as the direction of the power flow. With these definitions in mind, it will now be illustrated how the simplified coal-fired power station as portrayed in Figure 2(a) can be represented as an energy attributed graph. (a) 1B 2B 3B 4B 5B 6B7B 81Q 18Q 12Q 23Q 34Q 35Q 48Q 56Q 86Q 57Q   87Q   72Q   68Q (b) Figure 2. (a) Simplified representation of a coal-fired power station. (b) Attributed graph of a coal-fired power station The simplified system constitutes a boiler, a power turbine, a generator (G), a condenser and a water pump as the main components. The boiler is represented as a heat exchanger with the pulverised coal and air mixture entering the primary side and the flu gas exiting the primary side. Heat is exchanged to the secondary side where water enters and super-heated steam exits. The super-heated pressurised steam then expands in the turbine that drives a generator that delivers electrical power as the system product. The expanded steam is cooled in the condenser which also acts as a heat exchanger. The hot steam enters the condenser on the primary side and water exits. Heat is exchanged to the secondary side that is typically a cooling tower with cool air entering and hot air exiting. To complete the water cycle, the water that exits the condenser is pumped to the boiler. The next step to illustrate the concept of energy visualisation by means of a graph is to set up a graph from Figure 2(a) where each component represents a node and connections to and between components are represented by links. A heat exchanger is however assigned two nodes, one for the primary side and one for the secondary side 340 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management with a link between the primary side node and the secondary side node that represents the heat exchanged from the one side to the other. Figure 2(b) shows the graph set up from Figure 2(a) with the node allocations as well as the attributes associated with each node indicated. In Figure 2(b) nodes 1 and 2 represent the boiler primary and secondary sides, nodes 3 and 4 represent the Turbine and Generator, nodes 5 and 6 represent the condenser primary and secondary sides and node 7 represents the pump. Node 8 is chosen as the environment and represents a node where all links to outside the system connect to. The attribute 𝑄ሶ଼ଵrepresents the hydraulic and chemical power transfer from the environment (node 8) to the boiler primary side (node 1). All the energy flow rate attributes in Figure 2(b) should be similarly interpreted with the subscript designating the nodes between which the energy flow applies. ∆𝐵ଵ represents the change in chemical and physical exergy across the boiler primary side. The attributed graph is now mathematically described by the node signature based formulation given in [16]:               NNNNN N N QQQB QQQB QQQB     21 222212 112111 sN . ................................................... (9) Each row of 𝐍௦ describes the node in the graph corresponding to the row number. In this way row 𝑛 describes all the attributes associated with node 𝑛. The node attributes are ordered in such a way that the structural information is retained. The first entry in every row represents the exergy change across the node corresponding to that row number. The attributes that follow for a particular row are the energy flows to and from the particular node. In the case of row 𝑛 the attributes ሾ𝑄ሶ௡ଵ 𝑄ሶ௡ଶ ⋯ 𝑄ሶ௡ேሿ would represent all the energy flows from node 𝑛 and in the order of the flow to node 1 first, followed by the flow to node 2 next. 2.3. Graph matching for condition monitoring With the attributed structured graph 𝐍௦ set up as described in section 2.2 it is now possible to use graph matching as a technique for condition monitoring. By defining a normal conditions graph 𝐍௦,௡ and comparing an operational conditions graph 𝐍௦,௢ with 𝐍௦,௡ it becomes possible, based on the difference between the two, to detect and diagnose faults. A first step in graph matching can be used to describe the difference between 𝐍௦,௡ and 𝐍௦,௢. The method as defined by [16] determines the so-called cost matrix to describe the difference between two graphs. It is based on determining the distance between every node of the two graphs being compared. The distance between two nodes is determined based on the difference between their node signatures. The Heterogeneous Euclidean Overlap Metric (HEOM) for the distance between node 𝑖 of 𝐍௦,௡ and node 𝑗 of 𝐍௦,௙ is given by 𝐻𝐸𝑂𝑀൫𝐍௦,௡௜, 𝐍௦,௙௝൯ ൌ ට∑ 𝛿ሺ𝐍௦,௡௜ሺ𝑎ሻ, 𝐍௦,௙௝ሺ𝑎ሻሻଶ௞௔ୀଵ , ............................................ (10) where 𝐍௦,௡௜ሺ𝑎ሻ refers to the ሺ𝑖, 𝑎ሻ entry in the normal signature matrix, and 𝐍௦,௙௝ሺ𝑎ሻ refers to the ሺ𝑗, 𝑎ሻ entry in the faulty signature matrix. 𝑎 refers to the 𝑎-th column entry of the rows considered, 𝑘 is the length of the row and the function 𝛿 for the numeric attributes only case is given by δሺ𝐍௦,௡௜ሺ𝑎ሻ, 𝐍௦,௙௝ሺ𝑎ሻሻ ൌ ห𝐍ೞ,೙೔ሺ௔ሻି𝐍ೞ,೑ೕሺ௔ሻห௥௔௡௚௘ೌ , ........................................................ (11) where 𝑟𝑎𝑛𝑔𝑒௔ is used to normalise the attributes, and is defined as 𝑟𝑎𝑛𝑔𝑒௔ ൌ max 𝑎 െ min 𝑎 , ................................................................. (12) where max 𝑎 and min 𝑎 are the maximum and minimum values respectively observed in the 𝑎-th column entry of 𝐍௦,௡. Next, a cost matrix, 𝐂 can be compiled from the distances between the nodes of graphs 𝐍௦,௡ and 𝐍௦,௙. The cost matrix is a square matrix with dimension equal to the number of nodes. 341 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Since the cost matrix represents the difference between two system conditions based on the change in the energy distribution in the system, analysing the characteristics of the cost matrix 𝐂 to detect and diagnose faults is investigated. To this end the eigenvalues and eigenvectors of 𝐂 are analysed. For a system with 𝑁 nodes, the cost matrix 𝐂 will be an 𝑁 ൈ 𝑁 matrix with 𝑁 eigenvalues, ሾ𝜆ଵ 𝜆ଶ ⋯ 𝜆ேሿ and 𝑁 eigenvectors, ሾ𝐱ଵ 𝐱ଶ ⋯ 𝐱ேሿ . For the normal case the graph 𝐍௦,௡ is compared with itself to obtain the normal set of eigenvalues and eigenvectors 𝝀௡ and 𝐱௡ respectively. For a fault case the corresponding set of eigenvalues and eigenvectors 𝝀௙ and 𝐱௙ are obtained. By applying a simple direction threshold to only the eigenvalues obtained, a simple 𝑁 ൈ 𝑟 qualitative fault signature matrix 𝐅௤ for 𝑟 faults is obtained as follows: 𝐅௤ ൌ ⎣ ⎢⎢ ⎢⎢ ⎡𝐹ଵ 𝐹ଶ 𝐹ଷ ⋯ 𝐹௥െ ൅ െ ⋯ െ ൅ െ െ ⋯ െ ൅ െ െ ⋯ െ ⋮ ⋮ ⋮ ⋯ ⋮ ൅ െ ൅ ⋯ ൅⎦ ⎥⎥ ⎥⎥ ⎤ row1 2 3 ⋮ 𝑁 ........................................................ (13) where columns denoted 𝐹ଵ, 𝐹ଶ, ⋯ , 𝐹௥ represent the eigenvalues for the respective 𝑟 fault conditions. The size of the fault signature matrix can be extended ሺ𝑁 ൅ 1ሻ fold by also applying the direction threshold to the eigenvectors. The extended fault signature for fault 1 can be given by 𝐅௤ଵ ൌ eigenvalues ⎣ ⎢⎢ ⎢⎢ ⎡൅ െ ൅ ⋯ ൅െ ൅ െ ⋯ െ ൅ െ െ ⋯ െ ൅ െ െ ⋯ െ ⋮ ⋮ ⋮ ⋯ ⋮ ൅ െ ൅ ⋯ ൅⎦ ⎥⎥ ⎥⎥ ⎤ row 1 1 ൅ 1 1 ൅ 2 1 ൅ 3 ⋮ 1 ൅ 𝑁 ................................................... (14) 𝐱ଵ 𝐱ଶ 𝐱ଷ ⋯ 𝐱ே The ൅ and – signs in (13) and (14) are fictitious to illustrate the idea. For 𝑘 faults the extended 𝐅௤ will take the form of 𝐅௤ ൌ ሾ𝐅௤ଵ 𝐅௤ଶ ⋯ 𝐅௤௞ሿ. 3. CASE STUDIES The concept of energy visualisation is now illustrated through a few case studies, applying the method outlined in section 2. The first case study is the heated two tank system representing an open system with only the thermal- fluid domain of interest. Next a gas to liquids plant where the thermal-fluid and chemical domains are important is considered. The third case study is a Brayton cycle where a closed cycle with the mechanical and thermal-fluid domains of interest. Due to space limitations only a short summary of each case study is provided. 3.1. Heated two-tank system A diagram of the system is shown in Fig. 3(a). The outlet flow from both tanks is proportional to the square root of the level in each tank. The outlet from the first tank flows into the second tank, and each tank has its own supply of cold water with a control valve to control the level of each tank. Each tank also exchanges heat with a hot water line. The temperature in the tanks is controlled using control valves on the hot water lines. An energy-attributed graph for the two-tank system is depicted in Fig. 3(b). The change in exergy flow rate, Δ𝐵, and the energy flow rate, Δ𝑄, are considered as the node and link attributes respectively. From Fig. 4(b) five nodes of interest are identified namely: (1) The hot water line transferring heat to tank1; (2) Tank 1; (3) The hot water line transferring heat to tank 2; (4) Tank 2; (5) The environment. In this case study 5 possible faults were considered: two sensor faults, namely the tank level sensor, and the temperature sensor; two actuator faults, one related to the heat transfer from the hot water side to the tank, 342 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management generally called fouling, and the other valve failure, and finally, a process fault in the form of tank leakage. These faults are induced one at a time, first for tank 1 and then for tank 2. FT3 FC3 TC1 TT1 FT1 FC1 LT1 LC1 FT4 FC4 TC2 TT2 FT2 FC2 LT2 LC2 Hot water line Environment Hot water line Valve 3 Valve 4 Valve 1 Valve 2 𝐿1, 𝑇1 𝐿2, 𝑇2 1 2 Tank 1 Tank 2 5 3 4  (a) Δ𝐵1 Δ𝐵2 Δ𝐵4 Δ𝐵3 5 5 5 5 5 5 3 1 4 2 𝑄ሶ51 𝑄ሶ15 𝑄ሶ45 𝑄ሶ54 𝑄ሶ43 𝑄ሶ53𝑄ሶ23𝑄ሶ52 𝑄ሶ12 (b) Figure 3. (a) Heated two-tank system. (b) Energy-based attributed graph By applying a direction threshold on the fault eigenvalues with respect to the normal case eigenvalues, qualitative fault signature matrices for tank 1 and tank 2 are obtained as follows: 𝐅𝒒𝒕𝒕𝟏 ൌ ⎣ ⎢⎢ ⎢⎢ ⎡𝐹ଵ 𝐹ଶ 𝐹ଷ 𝐹ସ 𝐹ହ൅ ൅ ൅ ൅ ൅ ൅ ൅ ൅ െ െ ൅ ൅ ൅ െ ൅ െ െ െ െ ൅ െ െ െ െ െ⎦ ⎥⎥ ⎥⎥ ⎤ , and 𝐅𝒒𝒕𝒕𝟐 ൌ ⎣ ⎢⎢ ⎢⎢ ⎡𝐹ଵ 𝐹ଶ 𝐹ଷ 𝐹ସ 𝐹ହ൅ ൅ ൅ ൅ ൅ ൅ ൅ ൅ ൅ ൅ ൅ ൅ ൅ െ ൅ െ െ െ െ െ െ െ െ െ െ⎦ ⎥⎥ ⎥⎥ ⎤ . .......................... (15) Fault columns having the same pattern are considered not diagnosable using the current residual definition. From (15) it is clear that all faults are detectable. In terms of diagnosability fault types 4 and 5 of 𝐅𝒒𝒕𝒕𝟏 are diagnosable, but uniquely linked to a specific tank. For 𝐅𝒒𝒕𝒕𝟐 only fault type 4 is uniquely diagnosable. 3.2. Gas to liquids process In this section, a gas to liquids process is considered. The modelled process, shown in Fig. 4, firstly comprises an autothermal reformer (ATR) which produces syngas when fed with specific ratios of feedstocks. The produced syngas is then fed into a Fischer-Tropsch reactor (FTR) in order to produce a variety of liquid products. The energy-attributed graph developed for this model is shown in Fig. 5. The nodes correspond to the units within the process as indicated on both diagrams. The attributes of the nodes being the change in exergy, Δ𝐵, seen over the units. The links indicate how the units are connected to one another. The link attributes are described by the energy flow, 𝑄ሶ , of the material streams. The three faults that were considered are: 1) A +10% deviation of the molar flow of the methane feed stream, which is representative of an actuator fault. 2) A temperature deviation of the reactor feed stream of -10%. This would be the result of a faulty heater (node 5). 3) The third fault was induced within the recycle stream; a faulty compressor leading to a pressure drop of 300 kPa. 343 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 4. GTL plant layout Figure 5. An energy-based attributed graph of the GTL plant The qualitative fault matrix based on only the eigenvalues is given by 𝐅𝒒𝑮𝑻𝑳T ൌ ൥ 𝐹ଵ ൅ െ െ െ െ െ െ െ ൅ ൅ െ െ െ𝐹ଶ ൅ െ െ െ െ െ െ ൅ ൅ ൅ െ െ െ𝐹ଷ ൅ െ െ െ െ െ െ ൅ ൅ ൅ െ െ െ ൩……….………. (16) As (16) reveals all three faults were detectable. Only F1 was found to be diagnosable though. One of the reasons would be that the recycle stream inevitably propagates the faults throughout the entire process. 3.3. Brayton cycle The Brayton cycle considered in this section is portrayed in Fig. 6(a). In this closed-loop Brayton cycle the working fluid is compressed through the compressor (comp) before heat energy is add through a heat addition component. The working fluid is then expanded through a compressor turbine (CT) and then through a second turbine (PT) producing a nett work output. The working fluid is then cooled through a heat rejection component, before being re-introduced into the compressor. The compressor inlet temperature is controlled to be constant and the pressure is varied to produce the desired nett work output. The compressor turbine inlet is also controlled at a constant temperature. 344 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management CT PTComp 5 3 2 1 8 6 7Environment Heat addition Heat rejection (a) 5 4 3 . 45Q 3B 4B 8 2 68 8 . 23Q . 86Q . 68Q . 62Q 2B 6B 8 5B 78 . 87Q . 78Q . 75Q 7B 1 . 12Q . 51Q . 81Q 1B . 34Q . 38Q . 48Q (b) Figure 6. (a) A Brayton cycle power conversion unit. (b) An energy-based attributed graph of the Brayton cycle By applying the method described in section 2.2, the attributed graph of Fig. 6(b) is obtained. The attributed graph is constructed such that the work input to the compressor and the work output of the compressor turbine is not equal and therefore connected but rather connected to the environment node. The 8 nodes of interest are (1) the compressor, (2) the heat addition component secondary, (3) the compressor turbine, (4) the power turbine, (5) the heat rejection component primary, (6) the heat addition component primary, (7) the heat rejection component secondary and (8) the environment node. The three fault conditions emulated are: 1) a change in the pressure ratio of the compressor, 2) a change in the pressure ratio of the power turbine, 3) a leak between the compressor and the heat addition component. By comparing the eigenvalues and –vectors under fault conditions to that of the normal condition and applying a direction threshold, the following fault signatures are obtained. Fault1  Eigenvalues  +  +  +  +  +  +  Eigenvectors  ‐  ‐  ‐  ‐  ‐  +  ‐  ‐  ‐  +  +  ‐  +  ‐  +  ‐  ‐  ‐  ‐  +  +  ‐  ‐  +  +  +  ‐  +  ‐  ‐  ‐  +  ‐  +  +  +  (a) Fault2  Eigenvalues  +  +  +  +  +  +  Eigenvectors  +  +  ‐  ‐  ‐  ‐  +  ‐  ‐  ‐  +  ‐  +  +  +  +  ‐  +  +  +  +  +  ‐  ‐  +  ‐  ‐  +  +  +  +  +  ‐  ‐  +  ‐  (b) Fault3  Eigenvalues  ‐  +  +  +  +  +  Eigenvectors  +  ‐  ‐  +  +  +  +  ‐  ‐  ‐  ‐  ‐  +  +  +  +  +  +  +  ‐  +  ‐  ‐  ‐  +  ‐  ‐  +  +  +  +  +  ‐  ‐  ‐  +  (c) Figure 7. Fault signatures for: (a) Fault 1. (b) Fault 2. (c) Fault 3. From the fault signature matrices it is clear that when only considering the eigenvalues, fault diagnosis is not possible. By including the deviation in the eigenvectors fault diagnosis becomes possible. 4. CONCLUSION The case studies discussed illustrate that the use of energy visualisation as a means of characterizing the health of industrial plants is viable. All faults emulated could be detected but not necessarily isolated. Extending the 345 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management characterization to include the eigenvectors as part of detecting the deviation from the normal conditions, improves the diagnosis capability. Further work will include exploring different methods of compiling the node signature matrices to capture more node attributes per node while maintaining structural information. The simplicity of the mathematical formulation however presents a challenge. An important next step will be confirming the reliability of the energy visualisation techniques through appropriate sensitivity analyses. The vision of energy visualisation will be extended to include optimisation and control of large scale industrial systems. 5. ACKNOWLEDGEMENTS This work is based on the research supported wholly or in part by the National Research Foundation (NRF) of South Africa (Grant numbers 91093, 103392). Opinions expressed, and conclusions arrived at are those of the authors and are not necessarily to be attributed to the NRF. REFERENCES [1] K. Severson, P. Chaiwatanodom, and R. D. Braatz, “Perspectives on Process Monitoring of Industrial Systems,” IFAC- PapersOnLine, vol. 48, no. 21, pp. 931–939, 2015. [2] M. S. Reis and G. Gins, “Industrial Process Monitoring in the Big Data/Industry 4.0 Era: From Detection, to Diagnosis, to Prognosis,” Processes, vol. 5, no. 3, p. 35, 2017. [3] C. P. du Rand and G. van Schoor, “Fault diagnosis of generation IV nuclear HTGR components – Part I: The error enthalpy–entropy graph approach,” Ann. Nucl. Energy, vol. 40, no. 1, pp. 14–24, 2012. [4] C. P. du Rand and G. Van Schoor, “Fault diagnosis of generation IV nuclear HTGR components - Part II: The area error enthalpy-entropy graph approach,” Ann. Nucl. Energy, vol. 41, pp. 79–86, 2012. [5] G. van Schoor, K. R. Uren, M. A. van Wyk, P. A. van Vuuren, and C. P. du Rand, “An energy perspective on modelling, supervision, and control of large-scale industrial systems: Survey and framework,” in IFAC World Congress, 2014. [6] L. B. Fouché, K. R. Uren, and G. van Schoor, “Energy-based visualisation of an axial-flow compressor system for the purposes of Fault Detection and Diagnosis,” IFAC-PapersOnLine, vol. 49, no. 7, 2016. [7] K. R. Uren and G. V. Schoor, “Energy-based visualisation of a counter-flow heat exchanger for the purpose of fault identification,” IFAC-PapersOnLine, vol. 49, no. 7, 2016. [8] V. Nikulshin, C. Wu, and V. Nikulshina, “Exergy efficiency calculation of energy intensive systems by graphs,” Int. J. Appl. Thermodyn., vol. 5, no. 2, pp. 67–74, 2002. [9] H. Marais, G. van Schoor, and K. R. Uren, “The merits of exergy-based fault detection in petrochemical processes,” J. Process Control, 2017. [10] V. Verda, L. M. Serra, and A. Valero, “Zooming procedure for the thermoeconomic diagnosis of highly complex energy systems,” Int. J. Thermodyn., vol. 5, no. 2, pp. 75–83, 2002. [11] V. Verda and R. Borchiellini, “Exergy method for the diagnosis of energy systems using measured data,” Energy, vol. 32, no. 4, pp. 490–498, 2007. [12] R. Smaili, R. El Harabi, and M. N. Abdelkrim, “Design of fault monitoring framework for multi-energy systems using Signed Directed Graph,” IFAC-PapersOnLine, vol. 50, no. 1, pp. 15734–15739, 2017. [13] M. R. Maurya, R. Rengaswamy, and V. Venkatasubramanian, “A systematic framework for the development and analysis of signed digraphs for chemical processes. 1. Algorithms and analysis,” Ind. Eng. Chem. Res., vol. 42, no. 20, pp. 4789–4810, 2003. [14] G. Wu, L. Zhang, and J. Tong, “Online Fault Diagnosis of Nuclear Power Plants Using Signed Directed Graph and Fuzzy Theory,” in 2017 25th International Conference on Nuclear Engineering, 2017, p. V004T06A011-- V004T06A011. [15] S. van Graan, G. van Schoor, and K. R. Uren, “Graph matching as a means to energy-visualisation of a counter-flow heat exchanger for the purpose of fault diagnosis,” IFAC-PapersOnLine, vol. 50, no. 1, pp. 2842–2847, 2017. [16] S. Jouili, I. Mili, and S. Tabbone, “Attributed graph matching using local descriptions,” in International Conference on Advanced Concepts for Intelligent Vision Systems, 2009, pp. 89–99. [17] T. S. Caetano, J. J. McAuley, L. Cheng, Q. V Le, and A. J. Smola, “Learning graph matching,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 31, no. 6, pp. 1048–1058, 2009. [18] T. J. Kotas, The exergy method of thermal plant analysis. Elsevier, 2013. [19] C. Godsil and G. F. Royle, Algebraic Graph Theory. Springer New York, 2013. [20] M. Dehmer, F. Emmert-Streib, and Y. Shi, “Quantitative Graph Theory: A new branch of graph theory and network science,” Inf. Sci. (Ny)., vol. 418–419, pp. 575–580, 2017. 346 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management A low-cost condition monitoring solution for industrial bakery equipment H. Marais1, J. Black1 1 School of Electrical, Electronic and Computer Engineering, North-West University, South Africa ABSTRACT Artisanal bakeries perform a critical function in many societies all over the world. However, artisanal bakers struggle to compete with commercial bakeries’ efficiencies due to the lack of technology deployment. One aspect that could lead to significant improvements of up to 36% are improved maintenance processes. Unfortunately the prohibitive costs associated with commercial condition monitoring equipment and a lack of proven efficiency improvements in the artisanal bakery hamper the adoption of such technologies. In this work a low-cost, remote, condition monitoring and measurement system is developed that makes use of commercial off-the-shelf technology. The prototype system provides immediate return on investment to the baker and maintenance staff while at the same time gathering high-resolution data for the development of more advanced diagnostic algorithms. The latter is especially of importance due to the lack of diagnostic algorithms capable of handling the variance introduced by the viscosity characteristics of dough and other mixer products. Keywords: Practical, Low-Cost, Condition Monitoring 1. INTRODUCTION The consumption of confectionary and other baked goods continues to rise leading to the impression that bakeries are lucrative high-profit industries. However, without careful management of bakery operations profit margins of up to 45% are quickly eroded [1]. Considering bakeries world-wide, a trend of increasing the degree of automation appears to be on the increase. Even in South Africa the use of technology such as robotic arms is not unheard of [2]. However, artisan bread (hand prepared) still dominate the African and Middle-East markets [1] and as such large commercial bakery operations compete with several smaller operations. Due to the technology dichotomy between commercial operators and smaller artisanal operators it is not atypical to find commercial operators employing the latest automation and process information to eke out even the smallest profits whilst artisans rarely employ technology beyond the machinery required to produce the relevant baked goods. This can largely be ascribed to the relative cost of commercial automation and monitoring technology. As such, the artisanal vendor is excluded from the advantages that such technologies could provide. In this work the development of a low-cost, scalable condition monitoring device for an artisanal bakery takes centre stage. In Section 2 some background on maintenance processes in artisanal bakeries are discussed, as well as how the changing technological landscape enables low-cost condition monitoring solutions to be developed. In Section 3, System Development, the architecture of the proposed solution is discussed in detail, and aspects such as scalability and data security receive special attention. Section 4 deals with the development of the monitoring algorithm. Section 5 provides initial results with a brief discussion of issues encountered during the prototype development. Finally, in Section 6 conclusions and future work is presented. 347 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 2. BACKGROUND 2.1 Bakery Equipment Considering typical operations involved in the baking process the oven might spring to mind as being the most critical. However, the mixer plays an equally important role and arguably performs under harsher environmental conditions [3]. The typical components of an industrial 10 litre mixer is shown in Figure 1. Fundamentally a mixer is an electrical motor connected via a gearbox to a type of mixing attachment. The entire drivetrain is housed in a cast iron frame. Figure 1: Typical components of an industrial mixer 2.2 Maintenance models In a typical artisanal bakery environment the dominant maintenance strategy is run to failure (RTF) even though RTF has been shown to be one of the most costly strategies (mainly due to loss of operational time). Introduction of preventative maintenance (+18-25% overall financial improvement) and predictive maintenance strategies (+20-25% overall financial improvement) may be of significant interest to artisanal bakeries [4]. However, the operational environment in which mixers operate is harsh with high levels of atmospheric dust, long operational hours, and high temperatures. Although some advances have been made such as the use of sealed gearboxes the ingress of flour dust into drivetrain components remain a challenge. As such any monitoring or diagnostic system shall be required to be non-invasive. Non-invasive methods will allow minimal foreign matter ingress whilst also preserving any manufacturer provided guarantees. 2.3 Non-invasive diagnostic methods The combination of the simplicity of a mixer in conjunction with typical operating procedures reduces the problem space to identification of motor and gearbox conditions. Methods such as vibrations monitoring [5] [6], temperature monitoring [7], and the monitoring of the electrical characteristics of the drive motor [6] [8], are commonly employed. Amongst the methods employed, the analysis of motor current signature analysis (MCSA) is one of the most popular methods, and would be attractive from both a non-invasive perspective and the limited remote deployability the technique affords. Recent trends in literature [8] indicate that the signal processing aspect is receiving increasing attention [9]. Amongst techniques commonly deployed is the Fast Fourier Transform, the instantaneous power spectrum [10], and wavelet decompositions [11]. However, regardless of the specific signal processing the objective is to develop a unique representation for each of the faults of interest. This process is referred to as residual generation by Gertler or more commonly as signature generation. It has been noted that large variance exist in the current signature of bearings presenting failure modes. Since there are several bearings present in the drivetrain and gearbox of the mixer it is realistic to expect similar results for the mixer. Additionally, and perhaps more importantly, the viscosity of dough is highly dependent on the properties of the mixture constituting the dough. Further complicating the matter is the 348 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management variance in product for which the mixer is used. It is not uncommon for mixing operations to range from beating eggs through cake mixtures to kneading of bread dough. This situation further complicates matters of condition monitoring or fault detection in these machines. As such, a theoretical analysis of techniques is of limited value. In practice the correlation of operational data with maintenance data, and sensor data would lead to an optimal solution. Unfortunately, this presents a conundrum with regards to the deployment of condition monitoring technology in artisanal bakeries. In this work the problem is addressed by the development of low-cost, flexible monitoring hardware as a driver for the adoption of condition monitoring and fault detection techniques in artisanal bakeries. 2.4 Remote measurement Although there are several remote diagnostic architectures proposed in literature [12] [13], the rise of the internet of things (IoT) and the proliferation of low-cost computing hardware and sensors has seen the development of customised remote measurement solutions that are repurposed for remote diagnostics. Primarily the difference lies in where the bulk of the computational load is carried. Simply retrofitting existing diagnostic systems with remote terminals leads to excessive data overhead being incurred while providing limited additional value. Traditionally, these diagnostic systems made use of high sampling frequencies with the advantages associated with that. On the other hand systems driven primarily from the connectivity side is typically characterised by the means in which data is exchanged between the sensing node and the remote terminal. Unfortunately, naïve approaches yield systems with limited data usage and limited diagnostic usage. This scenario is presented schematically in Figure 2. Figure 2: Comparison between classical and web-centric remote diagnostic system architecture Neither of the architectures in Figure 2 make optimal use of resources. The classical approach leads to excessive data communication overhead, and the web-centric approach ignores the computational power that is located in the sensor platform; effectively centralising a decentralised problem. By leveraging the computing power in the edge of the network cascade failures introduced by the centralised nature of the web-centric approach can be avoided. This approach is referred to as fog computing by Cisco [14] and aims to optimally utilise computing power regardless of the location thereof. 2.5 Data security One advantage afforded by the classic approach is that data security was less of a concern. However, the web-centric approach as well as an approach making use of fog computing creates a significant risk in terms of data security. It is reported that up to 45% of all IoT devices are vulnerable to data security risks. For this reason any data protocol that is used between the sensor platform (so-called edge nodes) and the central database should be designed for maximum security. 349 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3. SYSTEM DEVELOPMENT For the proposed remote diagnostic system the fog computing paradigm provides for the most flexibility in terms of deployment. Due to the large number of unknown factors a dedicated platform cannot be developed as diagnostic techniques have differing computational complexity. Secondly, due to the cost sensitive nature of the platform commercial off-the-shelf (COTS) hardware will be preferred where possible. Over the last two decades the cost of computing power has significantly decreased with microcontrollers becoming commonplace. The proliferation of single board computers (SBCs) has placed industrial computing power [15] within the means of even small business. Additionally, the rise of the Internet-of-Things (IoT) has seen significant advances with regards to the availability and affordability of sensors. Considering all of the above the following architecture is arrived at (see Figure 3). Such an architecture provides for various diagnostic algorithms to be implemented in an agile fashion whilst retaining the ability to derive a purpose made solution. Figure 3: Remote diagnostic system with edge node computing The system architecture in Figure 3 can be realised by making use of the Raspberry Pi SBC together with an analogue to digital conversion module. The Raspberry Pi Model 3+ already provides the capability to connect to network infrastructure via WiFi® and Ethernet (IEEE 802.03) thereby simplifying the deployment of the hardware. Additionally the Raspberry Pi provides more than adequate computational power (1GHz, 64-bit Quad-core architecture) thereby allowing for various diagnostic processing algorithms to be deployed on the edge node. As mentioned in Section 2.5 data security is of primary concern but frugal use of data communication bandwidth remains important. Although there are several data protocols available that could satisfy either the security or overhead requirements the MQTT protocol was selected as the protocol of choice. This is not only due to the superior data security provided by means of transport layer security (TLS) but the low protocol overhead the MQTT protocol presents (having been designed for the IoT) [16]. Although the MQTT protocol is available as a free and open-source standard several commercial providers offer implementations. One such implementation is provided by Amazon Web Services (AWS) thereby removing the need for the system developer to implement this aspect of the system by hand. The system architecture in Figure 3 can be expanded as shown in Figure 4, in which the directions of the arrows indicate the expected data flows. It should be noted that although provision has been made for multiple types of sensor input only the current measurement and voltage measurement functions were implemented. In the case of the current measurement a low-cost, clamp-on, current transformer (CT) was selected due to cost considerations. The selected CT (YHDC SCT013) features a peak current rating of 15Arms with a ratio of 30:1 although similar CTs can easily be accommodated. An example of the CT in question is depicted in Figure 5a. At this point the system’s main components can either be procured from commercial vendors or from cloud computing platforms 350 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 4: Physical System Architecture Figure 5: Depiction of the selected current transformer Combining the Raspberry Pi 3, analogue to digital conversion (ADC) expansion board (HAT in Raspberry Pi parlance), an enclosure, and some CTs results in the prototype system depicted in Figure 5b. The prototype system can be developed for approximately US$80 (including a single CT) which is in line with the affordability constraints of the artisanal baker. 4. DIAGNOSTIC ALGORITHM As mentioned in Section 2 MCSA algorithms are the most common form of non-invasive condition monitoring for induction machines and the systems to which these machines are connected. However, although the diagnostic environment in artisanal bakeries lends itself well to the application of condition monitoring techniques the identified challenges preclude the use of commercial solutions. Additionally, the lack of published literature complicates the business case presented to bakery owners. As such a two- pronged approach is followed in terms of the diagnostic algorithm. An FFT-based approach has been identified as the most suitable and will provide sufficient information for future development endeavours. However, due to the large variance the mixer is subjected to, detailed diagnostic information is difficult to obtain reliably. In order to provide almost immediate commercial value to the bakery owner the existing maintenance process needs to be improved. Although the current maintenance process is largely RTF, economic improvements can be had if some operational information of the mixer were input to the maintenance scheduling activity. For this reason, the main diagnostic information gathered (and presented to the baker) concerns the running time of the mixer (operational hours) as well as the average and peak current draws of the mixer. The latter can be used by existing maintenance staff to aid in the diagnostic process. From literature the start-up time of an induction machine can be indicative of incipient faults or simply wear and tear. As such, the algorithm should make provision for the identification of the following discrete events: • Start-up; • Peak starting current; • Transition to operational mode; • Machine shutdown. A naïve approach would be to simply make use of threshold values which could be derived from the plate specifications of the machine. However, such an approach would not allow for natural variation between 351 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management machines and, more importantly, would not allow for the commissioning of the system on used (or currently in-service) machines. In order to accommodate machine variation derivatives are used in lieu of direct threshold values. With reference to Figure 6, the observed rms current clearly features the start-up, transient, running, and shutdown phases. However, in order to simplify matters a windowed rms function is deployed. Naturally the window length affects the transient detection resolution and thus the window should be kept as small as possible. In a South African power system the fundamental frequency of 50 Hz naturally lends itself to a window size of 20 ms. Although a crude approximation, this is similar to fundamental demodulation from [11]. Figure 6: Up-time detection algorithm output From the windowed rms current the inflection points can easily be identified by means of inspection. However, since inspection would be impractical for large-scale diagnostic deployments the use of current gradients and associated thresholds proved to be successful. A comment on the use of a threshold value is in order. Analytically speaking, setting the derivative of the current equal to 0 would provide the inflection points. However, since the differentiation is accomplished numerically the inflection points can not be guaranteed to be exactly 0. Rather than making use of tuned sensitivity parameters, setting gradient threshold values on the order of 750 A/s allows for natural machine variation while also accommodating the numerical variations expected. From Figure 6 it is clear that the proposed algorithm correctly identifies the various phases of machine start-up. Although the raw data is stored locally on the Raspberry Pi’s SD card, only the timestamps of the inflection points are transmitted via the MQTT protocol to the server. When the machine is running changes in operating current is handled in similar fashion, and should no changes be detected (steady-state operation), the average of the previous period is reported to the server. 5. INITIAL RESULTS The application of the diagnostic algorithm in Figure 6 within the framework provided in Figure 4 results in the generation of a web-page that contains a visual representation of the mixer in question, some historic data including the total operational up-time of the machine, as well as the historic average current drawn. The visual representation of the mixer is automatically updated to reflect the state of the machine, and the live diagnostic data is continually displayed as new field data is collected. A representation hereof is shown in Figure 8. The use of a responsive website rather than a dedicated application is that the website automatically allows for scalability across multiple platforms and devices. 352 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 7: Example of web front-end Figure 8: Representation of live diagnostic data processing The current implementation updates diagnostic data once per second at a total monthly cost of approximately US$8 (considering that 3G/LTE data is used). Due to the limitations imposed by only equipping a single cake mixer in a small artisanal bakery further exploration of the motor current data (or power spectra) can only be accomplished once sufficient maintenance data has been collected in addition to raw sensor data. 6. CONCLUSIONS AND RECOMMENDATAIONS In this work the development of a prototype remote diagnostic and measurement system for artisanal bakeries were presented. It was demonstrated that a prototype system could be developed for approximately US$80 and feature affordable monthly operating costs. Although the current development is more focussed on the development of the hardware a return on investment is provided for by means of some operational use data that can be provided in real-time to the baker or maintenance staff. During the initial phase of the development detail diagnostic algorithms cannot be deployed due to the lack of operational and maintenance data. However, the system makes provision for the storage of up to 3 months’ worth of raw data which would be suitable for the development of more sophisticated diagnostic algorithms. Future work would include the development of more sophisticated diagnostic algorithms, specifically taking into account the variability in operational environment attributed to the characteristics of dough viscosity. ACKNOWLEDGEMENTS This work is based on the research supported, in part, by the National Research Foundation (NRF) of South Africa (grant nos. 91093 and 103392). Any opinion, finding and conclusion, or recommendation expressed in this material is that of the authors and not necessarily attributable to the NRF. 353 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 7. REFERENCES [1] H. Zourides and T. Bolton, “Irresistibly better baking!,” Supermarket & Retailer, pp. 12-15, January 2016. [2] Vector Magazine, “Robotics in action in commercial bakeries,” EE Publishers, 27 September 2016. [Online]. Available: http://www.ee.co.za/article/abb-robotics-proactive-connected-services-in- action-at-south-african-pioneer-foods-bakeries.html. [Accessed 15 February 2018]. [3] V. Bamford, “Smooth operators: maintain your mixers,” British Baker, pp. 31-32, February 2018. [4] G. P. Sullivan, H. Pugh, A. P. Melendez and W. D. Hunt, Operations & Maintenance Best Practices: A Guide to Achieving Operational Efficiency, Federal Energy Management Program, 2010. [5] S. Mitra and C. Koley , “Vibration signal analysis of induction motors used in process control operation,” in 2013 IEEE 1st International Conference on Condition Assessment Techniques in Electrical Systems, Kolkata, India, 2013. [6] S. Mitra and C. Koley, “Different measurement techniques for detection of bearing faults in industrial actuators — comparative study,” in 2017 IEEE Calcutta Conference, Kolkata, India, 2017. [7] J. U. Porep, D. R. Kammerer and R. Carle, “On-line application of near infrared (NIR) spectroscopy in food production,” Trends in Food Science & Technology, vol. 46(2) Part A, pp. 211-230, 2015. [8] S. Karmakar, Induction motor fault diagnosis : approach through current signature analysis, Singapore: Springer, 2016. [9] L. T. Ran, E. Kim and A. C. Tan, “A practical signal processing approach for condition monitoring of low speed machinery using Peak-Hold-Down-Sample algorithm,” Mechanical Systems and Signal Processing, vol. 36, no. 2, pp. 256-270, 2013. [10] M. Irfan, N. Saad, R. Ibrahim and V. Asirvadam, “Condition monitoring of induction motors via instantaneous power analysis,” Journal of Intelligent Manufacturing, vol. 28, no. 6, pp. 1259-1267, 2017. [11] D. Miljković,, “Brief review of motor current signature analysis,” CrSNDT Journal, vol. 2, pp. 14-26, 2015. [12] F. Campos, W. Mills and M. Graves, “A reference architecture for remote diagnostics and prognostics applications,” in AUTOTESTCON Proceedings, Huntsville, AL, USA, 2002. [13] W. Chen, Z. Chai and Q. Zhang, “A Real-Time Remote Diagnostic System Architecture Based on OSGi Platform,” Future Wireless Networks and Information Systems. Lecture Notes in Electrical Engineering, 2012. [14] Cisco, “Fog Computing and the Internet of Things: Extend the Cloud to Where the Things Are,” Cisco Networks, USA, 2015. [15] T. Ahmad, H. Studiawan and T. Ramadhan, “Developing a Raspberry Pi-based Monitoring System for Detecting and Securing an Object,” International Electronics Symposium (IES) 2014, 2014. [16] “MQTT,” [Online]. Available: www.mqtt.org. 354 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Online Performance Monitoring of Discrete Legs in a Convective Heat Exchanger of a Coal Fired Power Plant Boiler G.T. Prinsloo, P.G. Rousseau, P. Gosai Department of Mechanical Engineering, University of Cape Town, South Africa ABSTRACT The convective pass of a coal fired power plant boiler consists of several heat exchanger stages. Knowledge about the real-time performance of these heat exchangers is valuable for informing operational practices, preparing maintenance strategies and performing root cause analysis. For the specific boiler being studied, the steam in each stage is split into four legs that run in parallel across the boiler width. These legs alternate between the sides and across the centre of the boiler to disperse the effects of an uneven heat profile along the cross section of the flue gas flow path. Previous work has shown that the typical measurement setup in these boilers allows the performance parameters of each complete heat exchanger stage to be determined online using one-dimensional mass and energy balance equations [1][2]. However, flow and temperature maldistribution occurring in steam headers and the flue gas flow path leads to a non-uniform distribution of the performance of individual legs. We present and compare two methods of determining the performance of the individual heat exchanger legs online from process measurements. The first method employs soft sensors based on a thermo-fluid process model that not only accounts for the mass and energy balance, but also solves for momentum conservation. This allows the prediction of the flow distribution in common headers and ducts. The Flownex one-dimensional thermo-fluid network software is used for the modelling effort. The second method is based on the reconstruction-based contribution method which compares measured data to plant performance signatures that allows identification of deterioration of individual heat exchanger legs. These signatures are originally generated with the aid of the process model developed above for different fault scenarios. A case study of a final superheater stage is presented to assess the benefits and limitations of each method. These methods enable the performance of individual legs to be monitored online from process measurements and therefore provide higher resolution results within a heat exchanger stage. Keywords: heat exchanger modelling, model-based performance monitoring, fault signatures, data reconstruction, diagnosis Corresponding author: Gert Thomas Prinsloo (prnger003@myuct.ac.za) 1. INTRODUCTION Coal fired power plants (CFPP) are the single largest generator of electrical energy worldwide [3]. In these plants, boilers transfer energy contained in coal to water/steam in the form of thermal energy. Optimal management of the steam generation process directly impacts on boiler reliability and the overall power plant thermal efficiency. The convective pass of coal fired boilers, where heat from the flue gas is transferred to tube surfaces mainly by convection, consists of several heat exchanger stages. A typical arrangement of these heat exchanger stages is shown in Figure 1. Knowledge about the real-time thermal performance of these heat exchangers is valuable for informing operational practices, preparing maintenance strategies and performing root cause analysis. Online performance monitoring is made possible by online measurements from which performance parameters can be deduced. In CFPP boilers, the flue gas temperature, abrasiveness and flow geometry inhibit representative online measurements on the flue gas side. Therefore, the conventional approach to online performance monitoring utilize soft sensors based on mass and energy balance (MEB) models to obtain flue gas parameters required to estimate heat transfer rates and overall heat transfer coefficients of the entire convective pass [4] or heat exchanger stages [1][2], from steam side process measurements. Other techniques such as heat flux measurements or strain gauged sling tubes are usually applied together with MEB models to corroborate results [5]. 355 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management In this study we consider a specific case study boiler where each of the heat exchanger stages in turn consist of four legs running in parallel across the boiler width. These legs alternate between the centre and the sides of the boiler as the steam travels from one heat exchanger stage to the next. This is done to disperse the effects of an uneven flue gas heat profile along the cross section of the flue gas flow path. This configuration is the norm in modern boiler design [6]. Since the resolution to which the outcomes of online monitoring can be optimized is directly linked to the resolution of monitoring, we aim to monitor the performance of individual legs. We propose two methods for creating such a monitoring system. The first method, being the most intuitive approach to online monitoring, is the design of soft sensors to estimate unmeasured parameters of interest. We base the soft sensor design on a steady state thermo- fluid process model. The second method is similar to data-based monitoring, which classes patterns in measurement data according to the fault status. However, we obtain our labelled training data from process model simulation results rather than from historical process data. We compare the benefits and limitations of the two methods at the hand of actual plant data while assuming healthy sensors and actuators. 2. CASE STUDY BOILER DESCRIPTION The case study boiler is a 618 MW two pass drum boiler with reheat. For this paper we focus on monitoring the final superheater (SH3). Figure 1 shows a diagram of this boiler with the location of SH3. Figure 1: Side view of the case study boiler convective pass heat exchanger stage layout. SH3 has a cross-parallel flow configuration and consists of 28 elements, which can be further broken down into four legs running in parallel, each containing seven elements. Throughout this paper, each of these legs will be represented as a single lumped heat exchanger as represented in Figure 2. Figure 2: Final superheater steam flow layout. 356 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management On the steam side, each leg has a separate inlet connected to the separate outlets of SH2 by the final superheater spray water attemperator stations. The outlets of legs A and D are connected by a common outlet header. The steam flow can therefore be distributed between these outlets. This implies that the steam flow and temperature measurements on the left-hand side of the final outlet header, shown in Figure 3, do not necessarily only represent the average values of steam coming out of leg A. It can also include the effect of steam coming out of leg D, depending on how much mixing takes place within the header, since the flow will rarely be distributed symmetrically. This configuration and reasoning also holds for legs B and C. Figure 3: Final superheater measurement configuration. Note that the configuration shown for legs A & D also holds for legs B & C, which is not included in this figure for clarity. Because of the abrasive, high-temperature combustion products it is difficult to obtain representative measurements on the flue gas side. Process measurements are therefore predominantly located on the steam side, taken on interconnecting piping where no significant heat is being transferred. Flue gas temperature measurements are located up- and downstream of the economiser heat exchanger, which is far downstream of the final superheater. Measurements of overall coal and air flow rate into the boiler are also available. 3. METHOD 1 – SOFT SENSOR DESIGN In this method, a process model is developed using a first principles one dimensional approach to predict steam flow through each leg of the final superheater. The resultant soft sensors predicts values of unknown parameters of interest based on the limited measured values indicated in Figure 3. 3.1. Steam side model The steam side is modelled according to the flow layout shown in Figure 2. Each heat exchanger leg is modelled as a pipe with uniform heat input along the length. The dimensions and other physical characteristics of the pipe and remaining flow connections are taken from plant design data to account for flow resistance as a function of flow characteristics. Using the measurements shown in Figure 3 as input boundary conditions, the model solves for the conservation of mass, energy and momentum. The results show the thermodynamic state of the steam at each location throughout the final superheater system with accompanying flow distributions. Of specific interest is the mass flow rate through each heat exchanger leg and the corresponding outlet steam temperatures, from which the heat transferred to the steam flowing through each leg is determined. 357 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3.2. Flue gas side model The flue gas path is modelled as four streams linked to each leg in the final superheater as shown in Figure 4. Each stream is modelled as a pipe with the assumption that mixing between streams within the heat exchanger stage is negligible. The heat extracted at each heat exchanger leg is assumed to be equal to that gained by the steam, thereby neglecting directed radiation and losses. The remaining inputs required by the model is the flue gas temperature downstream of SH3 and the overall mass flow rate. These inputs are obtained from indirect measurement as described in [1]. The model solves the remaining thermodynamic states and flow distributions of the flue gas. Figure 4: Flue gas model layout. 3.3. Thermal performance evaluation The thermal performance of a heat exchanger is represented by its total thermal resistance 𝑅𝑡𝑜𝑡𝑎𝑙. Using the average properties obtained with the soft sensors, the actual value of 𝑅𝑡𝑜𝑡𝑎𝑙 can be determined for each heat exchanger leg using the heat transfer rate, ?̇?, and the log mean temperature difference, ∆𝑇𝐿𝑀𝑇𝐷 [7]. If the actual resistance is baselined against that expected at the corresponding load according to design, conclusions can be drawn about the magnitude of deviations from the design values. However, this approach does not allow one to determine whether this deviation is caused by changes in the physical heat exchanger characteristics or by deviation of process properties from design values. Changes in heat exchanger characteristics relate to increased conduction thermal resistance caused by external ash and internal oxide layer build-up on the tubes. Deviations in process properties lead to changes in the convection and radiation thermal resistance, as well as changes in temperature differences driving heat transfer. These causes are respectively classified as uncontrollable- and controllable losses as discussed in [8], where uncontrollable losses are related to aging and degradation of plant equipment. Using the actual process properties obtained with the soft sensors, the expected convection and radiation thermal resistance can be calculated using suitable correlations. The difference between these quantities and the total thermal resistance yields the conduction thermal resistance. In this study, the Dittus and Boelter correlation [9] and the Hillpert correlation [10] is used to respectively estimate the steam and flue gas convection thermal resistance, while radiation thermal resistance is neglected. 4. METHOD 2 – DATA-BASED APPROACH The data-based method proposed here is analogous to the reconstruction-based contribution (RBC) method described in [11]. In the RBC method, the normal operating region (NOR) is defined with principal components analysis (PCA) of process history data obtained during normal operation. When a new measurement or sample vector is outside the NOR boundaries, a fault is detected. Isolation is then achieved by removing the effects of possible faults from the sample vector. The fault vector of which the removal reconstructs the sample vector back into the NOR is then the assumed fault. This method requires the fault library, which contains all fault vectors or matrices, to be known [12]. However, the proposed method does not construct a NOR from process history data, but rather defines the normal operating point (NOP) using design data corresponding to the load. Instead of placing the fault library at the sample 358 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management vector and evaluating which fault vector penetrates the NOR, the proposed method centres the data around the NOP and performs a change of basis from the measurement space using the fault library as the transformation matrix. This produces the contribution of each possible fault to a measurement sample if the fault library is invertible, allowing diagnosis of multiple simultaneous faults. This method also requires the fault library to be known. The fault library can be constructed using either labelled process history data or model simulation. Labelled data refers to data collected while the fault status of equipment was known. This poses limitations for application to boilers because of, amongst other reasons, the following: (1) Obtaining data corresponding to a known online fault status implies the existence of a means to determine its fault status online. This is difficult for power plant boilers because the harsh environment in which they operate impedes measurements for verifying faults. (2) Imposing faults in an actual plant for training purposes are not feasible as the relevant fault parameters are neither measured nor controlled. (3) Training is further complicated by boilers being prone to several faults being active simultaneously. (4) Service intervals are quite extended, reducing the opportunity for correlating data of the previous operating cycle to the fault status established from offline inspection. (5) Diagnosis is limited to encountered faults. Because of these limitations, we use a thermo-fluid process model to construct the fault library. The process model is simulated with imposed faults to obtain fault signatures. The accuracy of the fault signatures therefore depends directly on the accuracy of modelling. Nonlinearities in process parameter relations can however cause changes in fault signatures depending on the operating conditions and the magnitude of deviations. It is posited that, for a given load, the state of parameters will not deviate drastically from design values. We therefore generate the fault library as a function of the load only. 4.1. Model configuration The model is developed in Flownex for steady state operation with the same layout as that of the soft sensor model described above. The input-output configuration differs from that used in the soft sensor model. The soft sensor model receives measurements as inputs to calculate unmeasured parameters of interest. For constructing the fault library, we use unmeasured parameters of interest as inputs to calculate the expected measurements. This allows one to impose changes to the unmeasured parameters and observe the measurement response. Some boundary conditions required to fully determine the model are also measurements, but their response is a function of unmodeled processes. These parameters are therefore also included as inputs and treated as unmeasured parameters. Heat exchanger legs are modelled to account for convection heat transfer on the flue gas and steam sides with the same correlations used for the soft sensor model. The model also accounts for the effects of fouling on heat transfer resistance by incorporating a cleanliness factor. The attemperator spray water mass flow rates are calculated to achieve the steady state outlet steam temperature setpoint. The model is adjusted to align results with design data. 4.2. Training methodology The different faults are imposed on the process model to construct the fault library. A fault is any parameter that deviates from the design conditions at the corresponding load. Without assigning structure to the changes in the flue gas side properties (biasing or deviations from the mean), we cannot distinguish if decreased heat transfer is caused by lower flue gas temperature, lower flue gas mass flow rate, or increased fouling. Similarly, the cause of increased heat transfer being either increased flue gas temperature or flow rate cannot be isolated. This is because these faults ultimately affect the measurement space in the same way, therefore yielding linearly dependent fault vectors for any of these faults. All these parameters are therefore lumped into one fault called the heat transfer index (HI), which represents a deviation from the design heat transfer rate. A HI of less than zero represents either less heat provided by the flue gas, or the increased heat transfer resistance of fouling. A HI of more than zero indicates that the greater than design heat provided by the flue gas has a greater influence on the heat transfer than the resistance caused by fouling. 359 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The table below shows the parameters that are changed respectively for each leg or the whole stage during training, along with the specific magnitude of the deviation from design conditions. Table 1: Faults imposed as deviations from design values for training. Parameter Size of deviation Steam inlet temperature per leg +5 ℃ Heat transfer index (HI) +0.4 MW Total steam mass flow rate for stage +5 kg/s The changes imposed on these parameters during the training phase do not need to be both in the positive and negative directions, but only in one direction. This is because the fault signature corresponding to a decrease in a specific parameter is the negative of the fault signature for an increase in that same parameter. Further, only one simulation is needed per fault case. The simulation above is repeated for various loads to construct fault libraries as a function of load. During monitoring, the fault library is interpolated between the simulation cases to match the actual plant load. 5. RESULTS A soot blowing event is used to compare the two methods described in this paper. The reason for this choice is that the results produced by the two monitoring frameworks are expected to respond in a predictable way during such an event. The expected nett effect is a decrease in the conduction thermal resistance of the heat exchanger in the region where soot blowing is taking place, by the removal of ash from the tubes. Therefore, it should result in an increase in the heat transfer rate and in the HI after soot blowing. However, it is possible for the heat transfer rate to first decrease during the event, if the steam used for soot blowing is at a lower temperature than the flue gas, only to increase afterwards. Figure 5: Heat transfer indicators obtained by the soft sensor (top) and data-based (bottom) methods for all legs Actual plant data was recorded at five-minute intervals over a period of 250 minutes during which soot blowing took place at a constant load of 618MW. The darkened backgrounds in Figure 5 to Figure 8 indicate the activation of tube leak detectors near SH3, which is an indication of active soot blowing. The heat transfer index being monitored by the data-based method, HI, as well as the heat transfer rate calculated with the soft sensors, 𝑄𝑠𝑠̇ , are plotted for all 𝑄 𝑠𝑠 [ M W ] H I [- ] 360 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management legs in Figure 5. The HI indicates a deviation from the design heat transfer rate with a value of one corresponding to the trained deviation of +0.4 MW in Table 1. The results are quite similar for all legs and for brevity only that for leg D are shown in more detail below. The HI and 𝑄𝑠𝑠̇ are overlaid in Figure 6. Figure 6: Heat transfer indicators overlaid for leg D The total thermal resistance obtained with the soft sensors along with its components of conduction and convection are shown in Figure 7. Figure 7: Thermal resistance constituents of leg D obtained with soft sensors The log mean temperature difference, ∆𝑇𝐿𝑀𝑇𝐷, of the two fluids across the heat exchanger leg, obtained with the soft sensors is plotted in Figure 8. Figure 8: Log mean temperature difference across leg D 𝑄 𝑠𝑠 [ M W ] H I [- ] 361 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 6. DISCUSSION AND CONCLUSIONS The results shown in Figure 5 correspond to what is expected. Legs A and D, which are located at the opposite sidewalls of the boiler, consistently have lower heat transfer rates than the two centre legs as determined with the soft sensor method. This corresponds with the fact that heat is lost through the boiler walls. The indicators of heat transfer rate determined with the separate methods are in excellent agreement, providing confidence in the validity of both methods. Figure 7 shows that the conduction thermal resistance undergoes much more changes during the soot blowing event than the convection thermal resistance, since the ash layer forms part of the conduction resistance. The response in the temperature difference across the heat exchanger shown in Figure 8 can be caused by some combination of changes in process parameters because of soot blowing and controls responding to these changes. While the data-based method is limited to monitoring deviations from the design heat transfer rate, the soft sensors can further identify the contribution of conduction thermal resistance, convection thermal resistance and temperature differences to this deviation. This allows one to classify losses as being either controllable or uncontrollable. improving root cause analysis and allowing corrective action to be taken more effectively. The training methodology employed for the data-based method only used one simulation for each type of deviation, but specialised software was required for these simulations. However, its application to online monitoring only requires relatively small sized matrix multiplication, which makes this an attractive method for real-time diagnosis. The soft sensor method required simulation with specialised thermo-fluid software for each monitored sample. These simulations are more computationally expensive and requires manual intervention. 7. ACKNOWLEDGEMENTS The authors would like to acknowledge the support provided by Eskom through the Eskom Power Plant Engineering Institute (EPPEI) and the University of Cape Town, as well as M-Tech Industrial (Pty) Ltd for allowing the use of Flownex®. REFERENCES [1] A. Valero and C. Cortbs, “Ash Fouling in Coal-Fired Utility Boilers. Monitoring and Optimization of On-Load Cleaning,” vol. 22, no. 96, pp. 189–200, 1996. [2] C. Cantrell and S. Idem, “On-Line Performance Model of the Convection Passes of a Pulverized Coal Boiler,” Heat Transf. Eng., vol. 31, no. 14, pp. 1173–1183, 2010. [3] IEA - International Energy Agency, “Key World Energy Statistics 2017,” 2017. [4] J. Taler, M. Trojan, and D. Taler, “Computer System for On-Line Monitoring of Slagging and Fouling and Optimization of Sootblowing in Steam Boilers,” pp. 1–12, 2010. [5] S. A. Idem, “An Integrated Approach to Online Monitoring of Fouling in Coal-Fired Power Plants,” Heat Transf. Eng., vol. 28, no. 2, pp. 73–75, 2007. [6] M. . Nielson, F.S., Danesi, P. and Radhakrishnan, “Modern Boiler Design,” BWE Energy India, no. January, 2012. [7] F. P. Incropera, D. P. DeWitt, T. L. Bergman, and A. S. Lavine, Fundamentals of Heat and Mass Transfer. 2007. [8] H. Kim, M. G. Na, and G. Heo, “Application of monitoring, diagnosis, and prognosis in thermal performance analysis for nuclear power plants,” Nucl. Eng. Technol., vol. 46, no. 6, pp. 737–752, 2014. [9] R. H. S. Winterton, “Where did the Dittus and Boelter equation come from?,” Int. J. Heat Mass Transf., vol. 41, no. 4–5, pp. 809– 810, 1998. [10] R. Hilpert, “Wärmeabgabe von geheizten Drähten und Rohren im Luftstrom,” Forsch. auf dem Gebiet des Ingenieurwesens A, vol. 4, no. 5, pp. 215–224, Sep. 1933. [11] C. F. Alcala and S. J. Qin, “Reconstruction-based contribution for process monitoring,” Automatica, vol. 45, no. 7, pp. 1593–1600, 2009. [12] C. F. Alcala and S. J. Qin, “Reconstruction-based contribution for process monitoring with kernel principal component analysis,” Am. Control Conf. (ACC), 2010, pp. 7022–7027, 2010. 362 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Simulation Based LOX/Methane Rocket Engine Fault Features Analysis Jingyu xiong, Jianjun Wu, Yuqiang Cheng College of Aerospace Science and Engineering, National University of Defense Technology, China ABSTRACT LOX/Methane Rocket Engine has attracted widely attention due to its advantages, such as economy, nontoxicity and good comprehensive performance. In order to solve the problem of parameter selection and optimization for LOX/methane rocket engine health monitoring, the fault features of reusable LOX/methane rocket engine are investigated and the key parameters are selected for engine health monitoring. Due to the lack of fault sample data, it is difficult to obtain the features of fault modes. So the modular model library was built on the basis of lumped parameter approximation, and the dynamic simulation of the work process for the engine was carried out on the basis of MATLAB Simulink system. To establish fault simulation model, fault factors are added into the balance equations of components. Comparison results show that the simulation results are in good agreement with the real test data of the engine. It can complement important fault sample data in the research of fault detection and diagnosis methods for the engine. Using these data we can analyze the change of parameters in the dynamic transition process from a normal state to a fault state. Finally, the key parameters for typical fault detection and diagnosis are obtained. Keywords: LOX/Methane Rocket Engine; Modeling and Simulation; Fault Features Analysis; Parameter Selection; Fault Simulation Corresponding author: Jianjun Wu (jjwu@nudt. edu. cn) 1. INTRODUCTION LOX/Methane rocket engine is a new type of liquid rocket engine. Its propellant has significant advantages such as low viscosity, low density, high heat capacity, high specific impulse, low carbon deposition during combustion, low price, non-toxicity, and non-pollution. LOX/Methane is called 21st-century rocket propellant, and LOX/Methane rocket engine has broad application prospects[1]. The fault test of rock engine is dangerous and the cost is expensive. So it is difficult to get fault sample data. System modeling and simulation for liquid propulsion systems provides a way to research the fault of rock engine. Since 1970s, lots of contributions has been made for modeling and simulation of Liquid Propellant Rocket Engine. In 1973, Rocketdyne published a report regarding SSME’s dynamic characteristics related to Pogo phenomenon, engine components were mathematically modeled using linear differential[2]. In 1995, C. Goertz proposed a modular method for different cycles of engine. Through this method, users can select the basic components to simulate the static performances of engines such as SSME and RD-12[3]. In 2000, K. Liu, Y. Zhang used Microsoft Visual C++ 6.0 to develop a method for modular dynamic simulation of liquid propellant rocket engines[4]. In 2017, M. Naderi, et al. developed a modular simulation software for steady state analysis of Stage Combustion Cycle Liquid Propellant Rocket Engine using MATLAB Simulink. In their work, SSME is studied as a case study and the required input and output data for each module has been described and the algorithm is presented but the coefficients required for the equations are not described and the programming algorithm which also has not been explained in detail[5]. In the current paper, the second chapter gives a brief structure introduction to the LOX/Methane rocket engine. In the third chapter, the fault model of each engine component is established using the lumped parameter approximation modeling theory and modular modeling idea. Then connect each components and form the engine integrated system model in MATLAB Simulink. In the fourth chapter, simulate and analyze the typical faults. Finally the features of fault and the key parameters for typical fault detection and diagnosis can be extracted from the simulation data. 363 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 2. STRUCTURE OF ENGINE LOX/Methane rocket engine adopts liquid oxygen and liquid methane cryogenic propellant. This gas generator cycle for pump-fed liquid rocke engine was chosen. The engine consists of thrust chambers, gas generator, turbopumps, propellant supply systems, valves and regulating components. The two propellants come out of the tank and are pumped into the thrust chamber and the gas generator. The gas produced by gas generator goes into the methane turbine and the oxygen turbine respectively and then excreted. The advantages of using this cycle are: The system is relatively simple, the turbine power is small and the quality of the engine is lighter, so the requirements for production and test are relatively low, and the cost of the engine itself is reduced. However the drawback of this design is that the specific impulse of the engine is lower because of the turbine exhaust loss. The system diagram is shown in figure 1. 1 2 3 4 5 6 7 8 9 10 11 12 1 - methane pump 2 - oxygen pump 3 - methane turbine 4 - oxygen turbine 5 - gas generator6 - thrust chamber 7 - methane pump front valve 8 - methane main control valve9 - methane secondary control valve 10 - oxygen pump front valve 11 - oxygen main control valve12 - oxygen pair control valve Figure 1. The system diagram 3. MODEL OF COMPONENTS The main components of the turbo-pump liquid rocket engine system are fixed, including turbine, pump, thermal assembly (combustion chamber, gas generator, gas conduit), regulator (throttling device) and automatic device. For different types of liquid rocket engines, the dynamic mathematical model has a certain generality in form. Using this feature, build each component separately. After the component model is created, it can be debugged separately, and then combined together to form the entire system model according to its physical connection. In this way, the modular modeling and simulation of the engine system can be performed. Similarly, we use this method to model and analyze LOX/Methane rocket engine[6]. Literature [7] summed up the main fault of pump pressure liquid rocket engine. Turbo pump, combustion chamber and pipeline fault sum more than 70% including pump cavitation, combustion chamber ablation, blocked and leakage. 1. Turbine model The engine turbine includes an oxidizer turbine and a fuel turbine. The relationship between various parameters in the turbine working process can be described by the following equation. Turbine power equation: 2 / 2eN v q η= ⋅ ⋅ , (1) Turbine efficiency equation: 2 1 2 3 e e n nb b b v v η     = + +        , (2) Turbine flow equation: 364 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management ( ) ( ) 1 1 1 0 2 1 1 0 0 0 2 2 1 1 2 2 1 1 k k k k i ii k k k k k i i i ii AP Pk k P kRT q AP P P Pk k P P P kRT µ µ + − − + −      ≤    + +    =           − >      − +        , (3) Turbine gas theory jet velocity: ( ) 1 2 1 1 k k e e i i Pkv RT k P −    = −   −     , (4) Turbine power balance equation: ( )230 pN NdnJ dt / nπ − = ⋅ ∑ , (5) Gas pressure between turbine stator and rotor: ( ) 1 1 0 1 k k k k e i i PP P P θ θ − −    = + −       , (6) Where k represents gas adiabatic index, R represents gas constant, T represents gas temperature, b represents coefficient of experience, µ represents flow coefficient of nozzle, A represents turbine nozzle area, N represents turbo power, pN∑ represents sum of power driven by turbine. θ represents turbine reaction force. 2. Pump model When simulating the working process of the engine, the pump's static characteristics are used to describe the operation of the pump. Its static characteristics are completely determined based on experience and can be described as follows. Pump head: ( )2 21 2 3 1e i p p p p p p p pP P P n n q q Fµ µ µ∆ = − = + + ⋅ , (7) Pump power: 3 2 2 1 2 3p n p n p p n p pN n n q n qυ υ υ= + + , (8) Were P∆ represents pump head, 1pµ , 2pµ , 3pµ represents the empirical coefficient of the pump head, 1nυ , 2nυ , 3nυ represents the empirical coefficient of the pump power, 1pF represents failure factor, when 1 1pF = , represents pump operating in normal state ;when 10 1pF≤ ≤ indicates pump cavitation failure. 3. Thermal Component Model The engine thrust chamber mathematical model can be described by the following equations: Mass conservation equation: ( )21g ig lo lf g eg dm q q q F q dt = + + − + , (9) Variation of gas density in thermal components: 365 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 1 gdmd dt V dt ρ = ⋅ , (10) Change rate of gas mixture ratio in thermal components: ( )( )1 o fdr RTr q r qdt PV= + − × ⋅ , (11) Gas heat value: ( )RT RT r= , (12) Both sides derivative of Ideal gas equation: ( )gdm d RTdP RT P P dV dt V dt RT dt V dt = ⋅ + ⋅ − ⋅ , (13) Outlet flow equation: ( ) 1 1 11 2 1 11 2 2 1 1 2 2 1 1 k k k kg e i e k k k k kg e e e F AP Pk k P kRT q F AP P P Pk k P P P kRT ζ ζ + − − + −      ≤    + +    =           − >      − +        , (14) Where gm represents the quality of high temperature gas in thermal components, ρ represents density of gas, V represents volume of gas, P represents pressure of gas, k represents mixing ratio of gas. giq 、 loq and lfq represent the gas mass flow, the liquid oxidant mass flow and the liquid fuel mass flow into the heat module respectively; geq represents Outlet flow of thermal components; ζ represents the flow coefficient of thermal component throat; A represents the throat area of thermal components, 1gF represents throat ablation factor, normal state: 1 1gF = , ablation state, 1 1gF > ; 2gF represents gas leakage, normal state 2 0gF = , failure state, 2 0gF > . 4. Liquid pipe model Liquid pipeline plays the role of propellant transmission to components in the engine. Its input is pressure and propellant mass flow, and at the same time, the output pressure and propellant mass flow rate of the next component. In modeling and simulation, it is necessary to consider the inertia, viscosity and compressibility of the liquid. Considering the inertia and viscosity of fluids, according to the principle of pressure superposition, the equations of motion of the propellant elements in the pipeline can be obtained: 2 1 2[( ) ]l i e l dqL F P P F q dt α= − − , (15) Considering the compressibility of the fluid, according to the mass conservation equation, the continuous equation of the propellant component in the pipeline can be obtained by derivation: 2( ) /e i e l dP q q F dt ξ = − , (16) Where ξ represents liquid pipe flow capacity coefficient, it reflects the compressibility of the liquid in the pipeline; L represents the inertial resistance coefficient of the liquid;α represents the resistance coefficient of the pipeline; iq and eq represent the inlet and outlet mass flow of the pipeline separately; iP and eP represent the 366 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management pressure at the inlet and outlet of the pipeline separately; V represents line volume, a represents sound speed in liquid, 1lF represents factor of leakage, leakage state: 11 0lF> > , normal state: 1 1lF = ; 2lF represents factor of blocked, blocked state: 2 1lF > , normal state 2 1lF = . Based on the modular model of the engine system, fault factors are added into the balance equations of components the engine, fault simulation system is designed. We use MATLAB Simulink to model each module. Finally, connect each module according to engine structure, the fault simulation software for LOX/methane rocket engine was obtained, as shown in figure 2. According to the input conditions of the simulated working process, by calling the components coefficients of the engine database and the input coefficients of each component can be set and call the four order Runge-Kutta method to solve. Figure 2. LOX/methane rock engine fault simulation system 4. RESULT ANALYSIS In steady state the maximum error of actual data and simulation data is less than 12%, so the simulation system is considered to be accurate. In this paper, rock engine typical failures in steady state (leakage, cavitation, ablation, blocked) are simulated. There are 27 monitoring parameters set in the system. Taking methane pipeline fault as example, the changes of oxygen parameters are much smaller than the methane's, so the figures of the oxygen parameters are ignored Use the dimensionless parameter D to characterize the change of each parameter after the failure. Annotate: Fault value Rated value= Rated value D − 1 Methane main pipe blocked The blocked factor of the methane main pipe in the simulation model is set to 2.0, the parameters change with time as shown in figure 3. 0.0 0.1 0.2 0.3 0.4 -0.040 -0.035 -0.030 -0.025 -0.020 -0.015 -0.010 -0.005 D time flow rate of methane pump/main pipe flow rate of methane valve/thrust chamber nozzle/cooling jacket Figure 3. Change of methane parameters with methane main pipe blockage factor 2 0.0 0.1 0.2 0.3 0.4 -0.020 -0.015 -0.010 -0.005 0.000 0.005 0.010 0.015 0.020 D time pressure of methane nozzle inlet/cooling jacket outlet pressure of gas generator mathane nozzle pressure of thrust chamer/nozzle pressure of methane pump outlet/main pipe inlet pressure of cooling jacket intlet/thrust chamer valva outlet 0.0 0.1 0.2 0.3 0.4 0.000 0.005 D time rotate speed of methane turbine rotate speed of oxygen turbine 367 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management When a blocked fault occurs, the equivalent flow resistance coefficient increases, and the value of the blockage failure factor is equivalent to the increase in the flow resistance through the component. When the flow resistance of the methane main pipe is doubled, the methane main flow rate is the same as the methane pump flow rate change. The methane valve flow rate, the methane nozzle flow rate, and the cooling jacket flow rate in the thrust chamber are the same and are reduced by approximately 1%. The inlet pressure of the main methane pipe increased by about 1.5%. The pressure of the remaining components of the methane line decreased in different degrees. After the stable combustion chamber, the inlet pressure of the methane nozzle was restored to the vicinity of the rated value. By comparison, the outlet pressure of the methane main pipe drops by a maximum of 0.45%. The increased rotating speed of the methane turbine is significantly faster than that of the oxygen turbine. 2. Methane main pipe leakage fault The methane main pipe leakage factor of the simulation model is set to 0.2, the parameters change with time as shown in figure 4. 0.0 0.1 0.2 0.3 0.4 -0.2 -0.1 0.0 0.1 0.2 0.3 D time flow rate of methane pump/main pipe inlet flow rate of methane valve/thrust chamber nozzle/cooling jacket flow rate of main pipe outlet Figure 4. Change of methane parameters with methane main pipe leakage factor 0.2 When the leakage factor of the methane main pipe is 0.2, the flow of the main pipe inlet is the same as that of the methane pump, and the flow rate increases by 5% when reaching the steady state. The flow rate of the methane valve in the thrust chamber, the flow rate of the methane nozzle, and the cooling jacket flow rate are the same, which are reduced by about 20%. The pressure of each components of the methane pipeline decreased in different degrees. Among them, the methane pump outlet, the methane main pipe outlet, and the pressure in the thrust room valve outlet were more similar, their reduction is nearly 12.6%. The speed of the methane turbine and the oxygen turbine is reduced. 3. Ablative fault of thrust chamber Setting the thrust chamber ablation factor of the simulation model to 2.0, the parameters change with time as shown in figure 5. 0.0 0.1 0.2 0.3 0.4 0.5 -0.1 0.0 0.1 0.2 0.3 0.4 0.5 D time flow rate of methane pump/main pipe flow rate of methane valve/thrust chamber nozzle/cooling jacket Figure 5. Change of methane parameters with thrust chamber ablation factor 2 When a failure with a thrust chamber ablation coefficient of 2.0 occurs, the flow of each components of the methane and oxygen pipeline will increase suddenly and then decrease with time, finally, after reaching steady state, the flow rate is lower than the rated 3.5%. The pressures of various parts of the methane and oxygen pipelines gradually decreased with time and then rose slightly to reach a steady state, in which the pressure of the thrust chamber changed most significantly. Changes in two turbines are similar to changes in pressure. It is worth mentioning that when the thrust chamber ablation occurs, the pressure changes much more than the flow rate changes. 0.0 0.1 0.2 0.3 0.4 -0.14 -0.12 -0.10 -0.08 -0.06 -0.04 -0.02 0.00 0.02 D time pressure of methane nozzle inlet/cooling jacket outlet pressure of gas generator mathane nozzle pressure of thrust chamer/nozzle pressure of methane pump/main pipe 0.0 0.1 0.2 0.3 0.4 -0.07 -0.06 -0.05 -0.04 -0.03 -0.02 -0.01 0.00 0.01 D time rotate speed of methane turbine rotate speed of oxygen turbine 0.0 0.1 0.2 0.3 0.4 0.5 -0.7 -0.6 -0.5 -0.4 -0.3 -0.2 -0.1 0.0 0.1 D time pressure of methane nozzle inlet/cooling jacket outlet pressure of gas generator mathane nozzle pressure of thrust chamer/nozzle pressure of methane pump outlet/main pipe inlet pressure of cooling jacket intlet/main pipe outlet 0.0 0.1 0.2 0.3 0.4 0.5 -0.30 -0.25 -0.20 -0.15 -0.10 -0.05 0.00 D time rotate speed of methane turbine rotate speed of oxygen turbine 368 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management From the results, we do not look at the specific values of the parameters, but compare their relative sizes to judge the sensitive parameter to fault. Take into account of the length of the article to omit the other results. According to the comparison of each parameters’ value between the failure and rated condition, the different types of faults can be diagnosed, as shown in table 1: Table 1: the change of parameters for different fault Fault Flow rate of methane pipeline Pressure of methane pipeline Rotating speed qpif qmif qmof qcvif qcvof qjof qcof Ppof Pmof Pcvof Pjof Pc nf Leakage of methane pump + - - - - - - - - - - - - Methane main pipe blocked - - - - - - - + - - - - + Methane pump leakage + + - - - - - - - - - - - Cavitation of methane pump + + + + + + + + + + + + + Thrust chamber ablated - - - - - - - - - - - - - Annotate: + represents value after the fault is greater than the rated value,- represents value after the fault is less than the rated value. Here only some typical faults of methane pipeline are listed, the same approach can be used for oxygen'. 5. CONCLUSION In this study, The modular modeling of fault simulation for LOX engine is made by using MATLAB Simulink, some typical faults occurred in steady state are simulated, and the change of each parameter after releasing fault is analyzed. For blocked and leakage faults, the variation of flow rate parameters is more sensitive to other parameters. In the leakage failure, the upstream components’ flow rate will increase, while downstream components’ flow rate will decrease. Other parameters will decrease for blocked failure, but the entry pressure will increase. For ablative fault, all parameters will decrease, and the change of pressure parameters is more significant. The extraction of these fault features is helpful for studying LOX/methane rocket engine diagnosis. And other faults of LOX/methane rocket engine can also be further studied on this platform. REFERENCES [1] Kai Yang, Manrui Cai. The Latest Development of LOX/Methane Rocket Engine. Aerospace China. 2017. 10:14-19. [2] Wilhelm W. F. SSME Model Dynamic Characteristics Related to POGO, Rocketdyne division Rockwel International, 1973. Report No. : N76-15246. [3] Goertz C. , A modular method for the analysis of liquid rocket engine cycles. 1995. [4] Liu K. , Zhang Y. , A study on versatile simulation of liquid propellant rocket engine systems transients. 2000. [5] Naderi M. , Liang G. , Karimi H. , Modeling and simulation of staged combustion cycle LPRE, in 8th International conference on mechanical and aerospace engineering (ICMAE). 22-25 July, 2017, IEEE Xplore: Prague, Czech Republic. [6] Yanjun Li, Jianjun Wu. Study on Key Techniques of Fault Detection and Diagnosis for New Generation Large-scale Liquid-propellant Rocket Engines. April 2014. [7] Yang Erfu, Zhang Zhenpeng. Study on Fault Monitoring and Diagnosis Techniques for Thrust Chamber and Turbo-pump System of Liquid Rock Engines . Journal of Beijing University of Aeronautics and Astronautics. 1999. 10:619-622. 369 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Pipe network leak detection: Sensor placement optimization using Support Vector Machines and a model-based leak detection technique J.C. van der Walt1, P.S Heyns1, D.N Wilke1 1 Centre of Asset Integrity Management, Department of Mechanical and Aeronautical Engineering, University of Pretoria, South Africa ABSTRACT This paper investigates a model-based method to detect leaks in water distribution networks. The model is used to generate data on which a Support Vector Machine is trained to locate and estimate the size of leaks on an experimental network. In this paper the leak location and size for an experimental network are shown as well as the effect of pressure sensor placement. The experimental investigation includes model calibration and the estimation of two simultaneous leaks in the network. The estimated solution for the experimental network showed the leaking pipe to be detectable with the leak sizes estimated within 0.7mm. The sensor placement optimization was completed with an exhaustive search algorithm and by removing sensors with the smallest impact on the detection accuracy, where it was found that training time of the SVM can be reduced significantly by using less sensors. Additionally, using less sensors for the estimation of the leaks was found to increase the probability up to 38% in finding the leak. Keywords: Condition Monitoring, Fault Detection and Localization, Practical Low-Cost Condition Monitoring, Leak Detection 1. INTRODUCTION In the 2011-2012 National Non-Revenue Water assessment (1) in South Africa the average national non-revenue water consumption was found to be 37%. While the world average in that period was 36.6%. Currently, to reduce non-revenue water consumption in South Africa, pressure management systems are installed, minimum night flows are logged, and water balances are completed. These methods only reduce the amount of leaked water or raise water waste awareness. The primary method of locating leaks comes from consumers. Contractors use listening sticks, geophones, ground penetrating radar, and noise loggers to locate potential leaks. More advanced techniques such as the Smartball, the Saharah system, and JD7 systems are also used (2) to detect potential leaks. In recent years research to find leaks using on-line machine learning techniques with simulated time data has intensified with studies by Mashford et al. (3), Mandal et al. (4), Kayaalp et al. (5), ect. That showed the use of support vector machines (SVM) to be effective in finding leaks in a variety of networks. Mashford et al. (3) optimized the cost and kernel function parameters of the SVM on simple networks to find the best classification of the leaks. They could achieve a classification accuracy of 76.8%, with 77.5% of the leak cases within 100m of the actual leak. They went on to apply this technique on a water network in South Eastern Melbourne (6) where they investigated the accuracy of the SVM when classifying the emitter coefficients for each leak. They also investigated the application of this method for low leak rates, ranging between 50-100l/h. They found that all the leaks could be classified within a 500m range from the actual leak location with the network having a size of 1000m ×1100m. Mandal et al. (4) investigated different training algorithms for SVMs on the classification of leaks in water networks. They compared Particle Swarm Optimization (PSO) and Enhanced Particle Swarm (EPSO) Optimization with Artificial Bee Colony (ABC) optimization and found that the SVM trained by ABC could classify the leaks nearly 10% more accurately than PSO or EPSO. Kayaalp et al. (5) used an experimental network to compare three algorithms for leak identification. The three algorithms were the RAkELd (Random k-Labelsets), BRkNN (Binary Relevance k-Nearest Neighbors) and a SVM. They found that RAkELd performed the best between the three algorithms, with the SVM performing better than BRkNN. Salam et al. (7) investigated the use of SVMs for leak identification on a network in Makassar, Indonesia. They found that the leak size could be correctly predicted with an accuracy of 86.68%, while the leak location could be predicted with an accuracy of 76.14% on the dataset. Rosich et al. (8) investigated optimum sensor placement on a district metered area (DMA) in the Barcelona water distribution network. Their methodology is based on a depth-first search by choosing the nodes with the lowest cost 370 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management and back-tracking to other un-explored nodes. Sanz and Pèrez (9) investigated the optimal placement between pressure and flow sensors on the Nova Icaria DMA in Spain. Their method estimates pressure and flow sensitivity using Singular Value Decomposition. They tested different sensor configurations with the use of three to six sensors and found that the demand calibration error decreases with increased number of sensors. They also found that pressure sensors are more beneficial in the calibration of demand components. Meseguer et al. (10) used a Genetic Algorithm (GA) for the optimization of the sensor positions on a DMA in Barcelona. They compared an optimal and non-optimal pressure sensor distribution and showed that the sum of the five largest non-isolable leaks could be reduced from 2976 nodes to 1357 nodes. Steffelbauer et al. (11) considered uncertainties with optimum sensor placement. They applied their methodology to a real-word DMA and found that the number of sensors to the cost followed an extended power law equation. Their methodology included a GA to optimize the sensor placement. Jung et al. (12) investigated a multi-objective optimal meter placement model which maximized the detection probability and robustness of a metered network, while minimizing the amount of false alarms. They tested this methodology on the Austin network and found that a linear relationship exists between the detection probability and the amount of false alarms. Cugueró-Escofet et al. (13) completed sensor placement optimization by considering topological issues on the network which lead to mislabeling of leaks. Their methodology was applied on two DMA within the Barcelona water distribution network and they could suggest optimum number of sensors and their placement from their results. Fuchs-Hanusch et al. (14) compared six optimal sensor placement algorithms by applying them on real-world networks. They evaluated these algorithms on their robustness and their ability to localize small leaks. They found that the localization results could find leaks up to 0.5l/s if sensors were placed close to them. Soldevila et al. (15) proposed a sensor placement approach for classifier-based leak localization. The proposed method is based on a hybrid feature selection algorithm that combines a filter based on relevancy and redundancy with a wrapper based on genetic algorithms. They applied the proposed methodology on the Hanoi and Limassol water networks. In this paper a linear kernel function SVM is trained on a calibrated experimental network to localize and classify leaking pipes. The calibration of the EPANET model is completed by optimizing pipe roughness, pump efficiency and minor loss coefficients. Two leak cases are classified by the SVM to find a baseline accuracy of the model. The experimental measurements included 12 pressure sensors allowing for optimum sensor selection for the training of the SVM. The optimization is completed with an exhaustive search algorithm and an algorithm removing sensors with the lowest cost to the detection probability. 2. METHODOLOGY This methodology uses SVMs to calculate probabilities for the classification of leaks, as well as predictions for the size and location of the leaks. The SVM uses vectors, referred to as the kernel function, to generate decision boundaries between different data sets. The trained kernel functions can then be used to give probabilities and regressive estimations for the leaking pipe, the leak size and leak location. The SVMs are generated using scikit-learn (16), which is a Python package used for machine learning. The kernel functions were left as default, which were linear kernel functions. The methodology is model-based to monitor pressure-flow deviations and require a dataset of possible leaks to train the SVMs on. The dataset contained 1000 data samples randomly chosen depending on three variables. The leak size was randomly selected between 1-4mm with the leak location between 0-3m. These two variables were then fixed to a specific pipe which were randomly chosen. Three SVMs were created, each solving a single output. These three SVMs consisted of one for classification of the leaking pipe, one for regression on the leak size and, one for regression on the leak location. From the randomly chosen data an EPANET model could be simulated to find pressure values. These pressure values are used as inputs for the SVMs. No artificial noise was added to the model measurements, and the classification of the leaks depended on the error between the EPANET model and the experimental measurements. This was done to disentangle error contributions for solving the inverse problem. 371 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 3. EXPERIMENTAL INVESTIGATION Figure 1 shows the experimental network under investigation in this study. This network consists of six pipes serving as a distribution network. Each pipe in this network is installed with two pressure sensors, one at the start and one at the end of each pipe. These pressure sensors are marked as 𝑃1 − 𝑃12 in Figure 1. Additional flow sensors are installed to measure the demands or outflows for each pipe, marked as 𝐹1 − 𝐹6. The flow measurements are used to help with the calibration of the network. The network is fed by pumping water from a reservoir, with the demands and leaks feeding back into the reservoir. The pipes in this network have a diameter of 10mm and have lengths of 3m each with 10cm of pipe before the pressure is measured. This is to reduce the turbulence after the T-pieces and L-bows before measuring the pressure. The 3m pipe sections are spaced 30cm apart. The demands in this network are created by drilling 3mm holes into the end of the pipes. The experimental network contains three leaks, which can be switched on and off individually. The leaks can be found on pipe 1, 3, and 5. The first leak have a diameter of 3mm while the other two have a diameter of 2mm. Figure 2 shows an example of the application of a 2mm leak on this network. In this paper leaks on pipe 3 and 5 are tested and can be found 1m and 1.5m from the start of their respective pipes. The leaks and demands in this network were created following the approach illustrated in Figure 2. An EPANET model of this network was created and calibrated to simulate the experimental network. The calibration process comprised of the optimization of 16 variables. They include one pump efficiency coefficient, minor loss coefficients, and roughness coefficients to minimize the error between the measurements. In Figure 1 red squares indicate the location of 12 minor loss coefficients added to the EPANET model to simulate the losses due to the connections. Two additional minor loss coefficients were added: one to the demand return pipe and one to the leak return pipe. Two additional calibration factors were added to the optimization algorithm, the pump efficiency and the roughness coefficient for all the pipes in the model. The optimization was completed by minimizing a mean squared error between the 12 pressures and 6 flow measurements for the experimental measurements and the simulated model measurements. An error was calculated over all the leaking and non-leaking cases for this model calibration. Table 1 shows the results found for the 16 optimization parameters. It is evident that the minor loss coefficient depends on the flow rate within a pipe. From this table the roughness coefficients of the pipes can be seen to be 245.7, while the pump efficiency was found to be 80.1%. As expected, the minor loss coefficients for L-bows and T-pieces differ. This can be seen where minor loss 1 is much smaller than minor losses 2-6. Minor loss coefficients 7-12 can be seen to be much smaller, this is because of an entering flow condition at these T-Pieces. The minor losses in the demand return pipe can be seen to be much higher than that of the minor losses in the leak return pipe. This is due to the flow being much higher in the demand return pipe. The errors after optimizing these coefficients were calculated, and for the case where no leak exists, a mean squared error of 2.52% between the model and experimental measurements. For the first leak case an error of 44.7% was found and for the second leak case an error of 28.8% was found. To find the leaks for these two leak cases, a dataset for the SVM were created using the calibrated model. The training methodology described in Section 2 was used, and the results achieved is given in Section 4. The two leak cases estimated on the experimental had sizes of 2mm and was located 1m and 1.5m from the start of each pipe. The leaking pipes were pipe 3 and 5 respectively. All 12 pressure sensors were used for the results shown in the next section. 372 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 1: Photo of the Experimental Network Figure 2: Photo of the 2mm leak applied to the Experimental Network 373 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 1: Optimized Parameters for Experimental Network Optimized Parameter Value Roughness Coefficient 245.7 Pump Efficiency 80.1% Minor Loss 1 11.0 Minor Loss 2-6 28.6-40.8 Minor Loss 7-12 0.1-2.4 Minor Loss Demands 287.0 Minor Loss Leaks 14.2 4. RESULTS The results found for the first leak case are depicted in Figure 3. The estimated leak using the simulated experimental measurement (ideal result) and actual experimental (measured) results for each pipe section are shown. The ideal results show the results if there was no model error, the leaking pipe could be estimated with a 100% probability. Additionally, it can be seen that the leak size and location could be estimated correctly with a size of 2mm and at a location of 1m. The results from the experimental measurements, estimated from the SVM trained on a model with a 44.7% error, could estimate the leaking pipe with a probability of 60.5%. The leak size was estimated to be 1.34mm and the leak location was found at 2.22m. This results in a leak size estimation error of 0.66mm and a location estimation error of 1.22m. For the second leak case the results found by the SVM can be seen in Figure 4. For the ideal model, the leak could be located with a probability of 100%, with the leak size and location estimated perfectly as 2mm and 1.5m. With the experimental measurements, having a model error of 28.8%, the leaking pipe could be located with a probability of 93.6%. The leak size using the experimental measurements was found to be 1.29mm at a location of 1.28m. This resulted in a leak size estimation error of 0.71mm and a leak location estimation error of 0.22m. 5. SENSOR PLACEMENT OPTIMIZATION To find the optimum sensors to be used in the experimental network, an exhaustive search algorithm is used with the SVM to find the highest probability of each leak for every sensor combination. Due to the amount of possible combinations, the search algorithm is limited to a depth of four sensor combinations. This allows two test scenarios: the first test scenario up to the four most important sensors while the other test scenario up to the four least important sensors for each leak case. Table 2 shows the results found by this search algorithm for the first leak case on the experimental network. It can be seen that if a single sensor is used, where the ninth pressure sensor gave the best results, the leak could only be found with a probability of 33.45%. The exhaustive optimization is repeated by adding up to three more sensors. The highest probability of the leak being found increased to 99.87% with the addition of two sensors for a total of three sensors. Initially, using all twelve pressure sensors, the leak could be located with a probability of 60.5%. If only eleven pressure sensors were used, removing the fifth pressure sensor, the leak could be located with a probability of 82.59%. Removing an additional three sensors results in the leak being located with a probability of 98%. 374 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 3: Results for the first leak on the experimental network Figure 4: Results for the second leak on the experimental network 375 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Table 2: Pressure sensor usage optimization for the first leak in the experimental network Sensor Order Sensors Used Sensors Not Used One Sensor Two Sensors Three Sensors Four Sensors Eight Sensors Nine Sensors Ten Sensors Eleven Sensors Twelve Sensors 1st P9 P9 P9 P9 P5 P5 P5 P5 2nd P2 P2 P2 P3 P3 P3 3rd P11 P11 P2 P2 4th P10 P4 Probability 33.45 92.73 99.87 99.34 98.0 97.11 95.40 82.59 60.50 Table 3 shows the results found by the search algorithm for the second leak case tested on the experimental network. For this leak case it can be seen that if a single pressure sensor was used, the fifth sensor, the leak could only be located with a probability of 12.25%. Adding an additional pressure sensor, pressure sensor 10, the leak could be located with a probability of 99.97% which was found to be the highest for the second leak case. For this leak case the leak could be located with a probability of 93.6% using all twelve pressure sensors. Removing pressure sensors for this leak case bears smaller effect on the leak locating probability compared to the previous leak case, since this leak could already be located with a high probability. Removing the eleventh pressure sensor from the experimental measurements increases the leak locating probability with only 3.34%. Table 3: Pressure sensor usage optimization for the second leak in the experimental network Sensor Order Sensors Used Sensors Not Used One Sensor Two Sensors Three Sensors Four Sensors Eight Sensors Nine Sensors Ten Sensors Eleven Sensors Twelve Sensors 1st P5 P5 P5 P5 P11 P11 P11 P11 2nd P10 P10 P10 P6 P6 P6 3rd P2 P4 P12 P12 4th P7 P1 Probability 12.25 99.97 99.86 99.82 99.53 99.22 98.54 96.94 93.6 The leak probability can be plotted over the number of sensors used and can be seen in Figure 5. For these two leak cases, using only 5, 6, and 7 pressure sensors, the results were unavailable and were linearly interpolated between the calculated results. This is shown with dashed lines in this figure. From the figure there can be seen that a peak leak locating probability was found for both leak cases when only 2-3 pressure sensors were used, with a gradual decline in detection probability as more sensors are added. This gradual decline in probability increases significantly after 10 sensors being used. This behavior of the optimum sensor placement when removing sensors is due to uncertainty and noise from the additional sensors, with little additional information. This increases the detection probability up to 8 sensors to be used. When removing more sensors, the ratio between information and noise decreases and the problem becomes ill posed with too little information. To evaluate the optimum sensor placements for both the leak cases, an additional approach for the optimization of the sensor placements were followed. The approach started with all the sensors and removed the sensor with the smallest cost on the detection probability. This could be completed until a single sensor was left. The results for this technique can be seen in Figure 6. 376 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 5: Summary of the leak estimation probability for the number of sensors used Figure 6 Optimal Sensors used for the two leak cases. Number of sensor removed indicated in square bracket 377 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management In this figure there can be seen that the leak detection probability was plotted against the number of sensors used. Three cases were plotted, the first leak case, the second leak case and the overall accuracy over the training dataset. From the results there can be seen that removing up to 5 sensors gradually increases the overall average for detecting the two leaks. Labeled in the square brackets are the sensor number that was removed from the SVM prediction. From the figure there can be seen that sensor 8 has the largest contribution for the detection of the leaks, which is similar to the results found in Table 2. Using only three sensors, sensor 8, 11 and 9, results in leak 1 and 2 being detected with a probability of 51.3% and 79.2% respectively. Overall the detection accuracy for the complete dataset is shown to be 87.3%. 6. DISCUSSION OF RESUTLS In the experimental investigation, model calibration was completed on the EPANET model to fit it to the experimental measurements. Sixteen calibration factors were used and it could clearly be seen that the amount of flow within a pipe effects the minor loss coefficient. Additionally, it was shown that L-bows and T-pieces have different minor loss coefficients and that flow into these fittings generate nearly no loss coefficients. The SVM combined with this model-based method could find the two leaks in the actual experimental leaking pipes with 60.5% and 93.6% probabilities respectively. This was found with calibration model errors for the two leak cases of 44.7% and 28.8%. For the first leak case the leak size and location were calculated with an error of 0.66mm and 1.22m. This result is impressive considering the model error. For the second leak case the leak size and location were calculated with an error of 0.71mm and 0.22m. For this network it was shown that this method of leak detection can solve the problem using all 12 pressure sensor measurements and a zero-model error with 100% accuracy. With the exhaustive search algorithm, the sensor placement optimization was completed. It was seen that using between 2-3 sensors for each leak case can increase the leak detection probability to 99.86%. This increases the leak detection probability to ideal model conditions. The sensors to be used was found as the two sensors connected to the pipe under consideration for possible leaks. Furthermore, it was found that using more than 10 pressure sensors starts reducing the leak detection probability drastically for the experimental network. This is due to the noise to pressure information ratio from the experimental measurements. 7. CONCLUSION This paper investigated the usage of a SVM with a model-based methodology. An experimental network was under consideration where after model calibration was completed, the model errors were calculated to be 44.7% and 28.8% for the two leak cases. Using the SVM the leaks could be located accurately with the leak sizes within 0.7mm and the leak location within 1.22m Performing sensor placement optimization, it was found that the SVM can find the leaks with higher probabilities using less pressure sensors. The pressure sensors with the best locating probability was found to be the sensors connected to the pipe under consideration. An increase of 32.23% probability in leak location confidence was found by using only two sensors instead of twelve. This model-based method shows promise in leak detection although certain limitations with this method exist. This paper shows the importance of selecting sensors when using the pressure flow deviation method. 378 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 8. REFERENCES 1. R. Mckenzie, ZN. Siqalaba, WA. Wegelin. The state of non-revenue water in South Africa. s.l. : Water Research Commission, 2012. 2. Mckenzie, RS. Guidelines for reducing water losses in South African Municipalities. s.l. : Water Research Commission, 2014. 3. An approach to leak detection in pipe networks using analysis of monitored pressure values by support vector machines. J. Mashford, D. De Silva, D. Marney, S. Burn. s.l. : International Conference on Network and System Security, 2009, Vol. 3. 4. Leak detection of pipeline: An integrated approach of rough set theory and artificial bee colony trained SVM. S.K Madal, F.T.S Chan, M.K Tiwari. s.l. : Expert Systems with Applications, 2012, Vol. 39. 5. Leak detection and localization on water trasportation pipelines: a multi-label classification approach. F. Kayaalp, A. Zengin, R. Kara, S. Zavrak. 10, s.l. : Neural Computing Applications Forum, 2017, Vol. 28. 6. Leak detection in simulated water pipe networks using SVM. J. Mashford, D. De Silva, S. Burn, D. Marney. 26, s.l. : Applied Artificial Intelegence, 2012, Vol. 5. 7. A leakage detection system on the Water Pipe Network through Support Vector Machine method. A.E.U Salam, M. Tola, M. Selintung, F. Maricar. s.l. : Makassar International Conference on Electrical Engineering and Informatics, 2014. 8. Optimal sensor placement for leakage detection and isolation in water distribution networks. A. Rosich, R. Sarrate, F. Nejjari. s.l. : IFAC Symposium on Fault Detection, 2012, Vol. 8. 9. Comparison of demand calibration in water distribution networks using pressure and flow sensors. G. Sanz, R. Pèrez. s.l. : Computer Control for Water Industry Conference, 2015, Vol. 13. 10. Model-based monitoring techniques for leakage localisaiton in distribution water networks. J. Meseguer, JM. Mirats- Tur, G. Cembrano, V. Puig. s.l. : Computer Control for Water Industry Conference, 2015, Vol. 13. 11. Effiecient sensor placement for leak localization considering uncertainties. DB. SteffelBauer, D. Fuchs-Hasusch. s.l. : Water Resource Management, 2016, Vol. 30. 12. Robust meter network for water distribution pipe burst detection. D. Jung, JH. Kim. s.l. : Water, 2017, Vol. 9. 13. Optimal pressure sensor placement and assessment for leak location using a relaxed isolation index: Application to the Barcelona water network. MA. Cugueró-Escofet, V. Puig, J. Quevedo. s.l. : Control Engineering Practice, 2017, Vol. 63. 14. Real-world comparison of sensor placement algorithms for leakage localization. D. Fuchs-Hanusch, D. Steffelbauer. s.l. : International Conference on Water Distribuion System Analysis, 2017, Vol. 18. 15. Sensor placement for classifier-based leak localization in water distribuion networks using hybrid feature selection. A. Soldevila, J. Blesa, S. Tornil-Sin, RM. Fernandez-Canti, V. Puig. s.l. : Computers and Chemical Engineering, 2018, Vol. 108. 16. Scikit-learn: Machine Learning in Python. F. Varoquaux, G Gramfort, A. Michel, V. Thirion, B. Grisel, O. Blondel, M. Prettenhofer, P. Weiss, R. Dubourg, V. Vanderplas, J. Passos, A. Cournapeau, D. Burcher, M. Perrot, M. Duchesnay. 2011, Journal of Machine Learning Research, pp. 2825-2830. 379 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Performance visualisation of a transcritical CO2 heat pump under fault conditions J.J.A. de Bruin1, K.R. Uren1, G. van Schoor2, M. van Eldik3 1 School of Electrical, Electronic and Computer Engineering, North-West University, South Africa 2 Unit for Energy and Technology Systems, Faculty of Engineering, North-West University, South Africa 3 School of Mechanical and Nuclear Engineering, North-West University, South Africa ABSTRACT Heat pump systems have gained acceptance and appeal as an energy efficient alternative to electrical geysers for the purpose of water heating. This paper investigates a next-generation transcritical heat pump system using carbon dioxide as its working fluid. The water-to-water heat pump system used in this study simultaneously produces cooled and heated water. The heat pump system consists of four components. An evaporator is used to extract heat from a stream of water in order to evaporate and then superheat the working fluid. Chronologically after the evaporator comes the reciprocating compressor that raises the pressure and temperature of the working fluid to enable its circulation through the system. The gas cooler follows the compressor and is responsible for transferring heat from the working fluid to the second stream of interacting water. The expansion valve causes the working fluid, that leaves the gas cooler component, to undergo a large pressure drop and phase change. The working fluid that leaves the expansion valve enters the evaporator component for evaporation and superheating. The continuous conversion between different forms of energy, as enabled by the heat pump’s components, make heat pumps susceptible to many different types of fault conditions. In this paper, the identification of faults that can occur during the operation of a heat pump and the degrading effects thereof on system performance were investigated. Fault detection and monitoring of the heat pump system via various visual representations are proposed. Specifically; fouling, working fluid leakage and coinciding water pump failure as system faults were investigated. The visual representations could uniquely identify and distinguish between the investigated fault conditions. The graphs were also able to monitor the severity and progression of system faults and their contribution to performance degradation. Keywords: Transcritical heat pump, carbon dioxide, diagnosis, system performance, performance visualisation 1. INTRODUCTION In the residential and domestic sectors, electrical geysers are traditionally used to provide warm water on demand. Carré [1] states that space and water heating accounts for approximately 70% of the IEA19 country group domestic sector energy consumption. Heat pump systems have gained notoriety in recent times due to their high efficiency when applied as alternative space heating and fluid heating solutions. Heat pumps contain different system components in which energy is converted back and forth between different forms. Energy can be present in the form of thermal-, mechanical- and electrical energy in heat pump systems. The relative complexity of heat pump systems and the continuous conversion between different forms of energy by components that are present in heat pump systems make such a system susceptible to many different types of fault conditions [2]. Li et al. [3] did a theoretical and numerical study on an air-source heat pump system in the high-altitude region of Tibet, China. The study was performed to assess the performance of a specific air-source heat pump (ASHP) system operating at various ambient conditions. The study benchmarked and represented the ASHP system’s performance using various derived two-dimensional relationships. Parameters like system power consumption, heat pump heating capacity and the coefficient of performance (COP) were related to ambient air temperature at various produced hot water temperatures. Li et al. focused only on the influence of ambient factors on system performance. Fault conditions were not considered in the study. Another study by Casteleiro-Roca et al. [4] considered a geothermal 380 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management heat pump system used to heat a bioclimatic house in Galicia, Spain. The study used a novel intelligent approach to detect possible fault conditions in the system’s geothermal heat exchanger. The fault detection approach implemented classification techniques and temperature probe date from two temperature sensors, located on the system’s geothermal heat exchanger, to identify the presence of fault conditions and their effect on the system. The approach proved to have high accuracy, but only for the identification of fault conditions which stemmed from problems associated with the system’s geothermal heat exchanger. Mehrabi and Yuill [5] did a study on the generalised effects of faults on the normalised performance of air conditioners and heat pumps. Fault intensity (FI) factors were used to model the drift of various defined parameters, which occur due to faults, from their nominal values. Fault conditions which incorporated the effects of change in refrigerant system mass, COP, input power, working fluid mass flow and pressure drop due to liquid line restrictions were considered in the study. Different operational scenarios were presented by depicting the heat pump cycle on temperature-entropy (T-s) and pressure-enthalpy (P-h) diagrams. Each depicted cycle diagram corresponded to various unique FI factor values. In this paper, the use of unique two-dimensional and three-dimensional visual representations which correlate system performance parameters to the progressive effects of faults are investigated. The approach uses the interpretation of the proposed graphs as a means to identify specific faults and quantify their magnitude and progression in the considered heat pump system. In Section 2 a system description is given of the investigated system and its component arrangement. Section 2 also discusses the holistic system level modelling approach used to solve for the system parameters in the selected simulation environment. After that, Section 3 presents and discusses the indices used to benchmark the overall system performance of the investigated heat pump. In Section 4 the unique visual representation to be used to visualise the degradation impacts of faults on the heat pump system’s performance is presented. The paper concludes with Section 5 where the summary of key findings of the study and recommendations for future work on the topic are discussed. 2. SYSTEM DESCRIPTION AND MODEL 2.1. Overview The heat pump system considered in this paper is a thermal-fluid device that uses carbon dioxide as an environmentally friendly working fluid. The secondary interacting fluid in the system is water. Two different streams of water are circulated through the system. The first stream that flows through the evaporator component serves as an energy source and is therefore cooled during system operation. The second stream flowing through the gas cooler serves as an energy sink and is therefore heated during system operation. Additionally, the heat pump system consists of a compressor which circulates the carbon dioxide and an expansion valve which regulates pressure drop and initiates the evaporation process in the evaporator. The heat pump system thus transfers heat energy from the one water stream to the other via the circulated carbon dioxide gas. The heat pump system’s component layout is illustrated in Figure 1. Figure 1. Diagrammatic representation of heat pump system layout 381 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management The first feed pump ensures circulation of water through the evaporator in which heat is transferred from the water to the carbon dioxide. The water is thus cooled, and the carbon dioxide absorbs heat to evaporate to the saturated gas state and heat even further into the superheated gas state. The second feed pump circulates the water to be heated through the gas cooler component. The water absorbs heat from the supercritical carbon dioxide. The supercritical gas is thus cooled, and heated water is delivered at the gas cooler outlet during system operation. 2.2. Refrigerant cycle representation The thermodynamic behaviour of the system’s refrigerant may be illustrated via a graphical representation that uses critical cycle points. Cycle points are graphed points that are representative of refrigerant thermophysical properties at specific locations in the system. Cycle point (1) to (2) represents heat absorption by the refrigerant in the evaporator component. The compressor does work on the refrigerant to circulate it through the system. The action of the compressor is depicted by cycle point (2) to (3). After compression, the refrigerant transfers heat to water in the gas cooler component. The heat rejection from the refrigerant is represented by cycle point (3) to (4). After heat rejection, the expansion valve component decreases the refrigerant pressure and enables the process of evaporation. The expansion valve isenthalpic pressure drop process occurs from point (4) to (1). The refrigerant behaviour may be visualised on a temperature-entropy diagram as shown in Figure 2. Figure 2. Heat pump refrigerant loop depicted on a T-s diagram [6] 2.3. System model The system model was developed in the Engineering Equation Solver [7] (EES®) environment. The simulation methodology used a system level approach. Semi-theoretical models were used with the implementation of conservation laws to solve for the required parameters which quantified the behavioural properties of each component in the system. The laws of conservation of mass, momentum and energy (for steady incompressible flow) were each applied to every component in the system. The use of the incompressible form of the conservation of momentum law is valid due to the Mach number of the refrigerant never exceeding 0.1 throughout the system within the range of simulated system conditions. At flow Mach numbers below 0.3, the compressible and incompressible form of the law of conservation of momentum both yield the same thermo-physical property results. The law of conservation of mass for steady incompressible flow through an individual component’s control volume region (CV) is expressed mathematically as [8]: 382 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management ∑ ?̇?𝑖𝑛 𝑛1 𝑖=1 │𝑖 = ∑ ?̇?𝑜𝑢𝑡 𝑛2 𝑗=1 , (1) where the first term is the summation of all the mass flow streams entering the CV and the second term is the summation of all the mass flow streams flowing out of the CV. The refrigerant and interacting water all flow through pipes with no other fluid flowing in or out of the piping. The conservation of mass for every system component may thus be simplified to: ?̇?𝑖𝑛 = ?̇?𝑜𝑢𝑡 . (2) Equation (2) holds true for each component and for the refrigerant side of the heat exchanger as well as the water side. The law of conservation of momentum for steady (incompressible low Mach number) flow through an individual component’s CR is expressed as [8]: ∑(𝑃𝑡𝑜𝑡)𝑖𝑛 𝑛1 𝑖=1 │𝑖 + ∑(𝜌𝑔𝑧)𝑖𝑛 𝑛2 𝑗=1 │𝑗 = ∑(𝑃𝑡𝑜𝑡)𝑜𝑢𝑡 𝑛3 𝑘=1 │𝑘 + ∑(𝜌𝑔𝑧)𝑜𝑢𝑡 𝑛4 𝑙=1 │𝑙 + ∆𝑃𝑡𝑜𝑡 , (3) where the first term is the total pressure of the fluid(s) entering the CV, the second term is the hydrostatic pressure of fluid(s) entering the CV, the third and fourth term represents the total- and hydrostatic pressure of fluid(s) leaving the CV and the fifth term represents the pressure drop which occurs as the various interacting fluids flow through the CV. Equation (3) may be simplified by only considering the interacting fluids for each component. For the refrigerant side of the evaporator, gas cooler and expansion valve components are: (𝑃𝑡𝑜𝑡|𝑟)𝑖𝑛 = (𝑃𝑡𝑜𝑡|𝑟)𝑜𝑢𝑡 + ∆𝑃𝑡𝑜𝑡|𝑟 . (4) The conservation of momentum for the water side of the evaporator and gas cooler components is: (𝑃𝑡𝑜𝑡|𝑤)𝑖𝑛 = (𝑃𝑡𝑜𝑡|𝑤)𝑜𝑢𝑡 + ∆𝑃𝑡𝑜𝑡|𝑤 . (5) The pressure ratio of the compressor is used to find the discharge pressure of the refrigerant leaving the compressor. The ratio of pressure for a compressor is defined as [9]: 𝑟𝑃 = 𝑃𝑡𝑜𝑡|𝑜𝑢𝑡𝑙𝑒𝑡 𝑃𝑡𝑜𝑡|𝑖𝑛𝑙𝑒𝑡 . (6) Finally, the law of conservation of energy (First Law of Thermodynamics) for a steady flow process of an individual component’s CV is given by [8]: ?̇?𝑛𝑒𝑡 + ?̇?𝑛𝑒𝑡 + ∑(?̇?ℎ𝑡𝑜𝑡)𝑖𝑛 𝑛1 𝑖=1 │𝑖 + ∑(?̇?𝑔𝑧)𝑖𝑛 𝑛2 𝑗=1 │𝑗 = ∑(?̇?ℎ𝑡𝑜𝑡)𝑜𝑢𝑡 𝑛3 𝑘=1 │𝑘 + ∑(?̇?𝑔𝑧)𝑜𝑢𝑡 𝑛4 𝑙=1 │𝑙 , (7) where the first term is the net heat transfer into the CV, the second term is the net work absorbed by the CV, the third term is the total flow energy entering the CV, the fourth term is the gravitational potential energy of the flow(s) entering the system, the fifth term represents the flow energy leaving the CV, and the sixth term represents the gravitational potential energy of the flow(s) departing the CV. The conservation of energy for each component was compiled under the assumption that the elevation head difference throughout the system is negligible. The conservation of energy for the evaporator and gas cooler component is (?̇?ℎ𝑡𝑜𝑡|𝑟)𝑖𝑛 + (?̇?ℎ𝑡𝑜𝑡|𝑤)𝑖𝑛 = (?̇?ℎ𝑡𝑜𝑡|𝑟)𝑜𝑢𝑡 + (?̇?ℎ𝑡𝑜𝑡|𝑤)𝑜𝑢𝑡 . (8) The conservation of energy for the compressor component is (?̇?ℎ𝑡𝑜𝑡|𝑟)𝑖𝑛 + ?̇?𝐶 = (?̇?ℎ𝑡𝑜𝑡|𝑟)𝑜𝑢𝑡 . (9) The expansion valve component’s throttling process is assumed to be isenthalpic. Under the isenthalpic assumption, the conservation of energy for the expansion valve simplifies to: (?̇?ℎ𝑡𝑜𝑡|𝑟)𝑖𝑛 = (?̇?ℎ𝑡𝑜𝑡|𝑟)𝑜𝑢𝑡 . (10) 383 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management In addition to the traditional conservation laws, an exergy balance was also applied to each component to solve for its corresponding internal irreversibility rate within each system component. An exergy balance for an individual component’s CV has the form [10]: ?̇?𝐶𝑅 ?̇? + ?̇?𝑛𝑒𝑡 + ∑(?̇?)𝑖𝑛 𝑛1 𝑖=1 │𝑖 = ∑(?̇?)𝑜𝑢𝑡 𝑛2 𝑗=1 │𝑗 + 𝐼?̇? , (11) where the first term is the exergy transfer to the CV due to heat transfer, the second term is the net work absorbed by the CV, the third term is the exergy entering the CV, the fourth term is exergy leaving the CV and the fifth term is the rate at which exergy is irreversibility destroyed inside of the CV. The evaporator and gas cooler component do not have associated work terms in their respective exergy balance equations. The exergy balance for the evaporator is: (?̇?𝑔)𝑖𝑛 + (?̇?𝑤)𝑖𝑛 = (?̇?𝑔)𝑜𝑢𝑡 + (?̇?𝑤)𝑜𝑢𝑡 + 𝐼?̇? . (12) The exergy balance for the gas cooler component is identical to the exergy balance for the evaporator component, as in (12), with the only difference being that the irreversibility rate’s subscript change to GC. The exergy balance for the compressor component is: (?̇?𝑔)𝑖𝑛 + ?̇?𝐶 = (?̇?𝑔)𝑜𝑢𝑡 + 𝐼?̇? . (13) The exergy balance for the expansion valve component is: (?̇?𝑔)𝑖𝑛 = (?̇?𝑔)𝑜𝑢𝑡 + 𝐼?̇?𝑉 . (14) 2.4. Validation of the system model The EES® system simulation model from which results were generated for this paper has been validated by de Bruin et al. [11] using the comparison of simulation data to experimental results overlaid on a cycle T-s diagram. This paper generates results from an updated and improved version of the previously validated simulation model. 3. SYSTEM PERFORMANCE MEASURES Thermal-fluid systems can be modelled and evaluated by usage of their energy as well as exergy characteristics. The application of system energy indices is considered first. The most well-known benchmark of heat pump performance is the coefficient of performance [9]. The COP relates how much useful energy leaves the refrigerant loop compared to the energy invested in driving the heat pump’s compressor. The formula for the coefficient of performance of a heat pump assigned to a heating objective is: 𝐶𝑂𝑃 = ?̇?ℎ𝑒𝑎𝑡 ?̇?𝑒𝑙𝑒𝑐 , (15) where ?̇?ℎ𝑒𝑎𝑡 is the heat transferred to the water being heated and ?̇?𝑒𝑙𝑒𝑐 is the electrical energy required by the system’s compressor. The COP does not give an indication of how well the system is performing compared to ideal energy transfer system conditions. Stene [12] recommends the Lorentz efficiency index in order to benchmark a heat pump system’s COP to a maximum COP which could be achieved under theoretical ideal circumstances. The Lorentz efficiency for a transcritical heat pump has the form: 𝜂𝐿𝑍 = 𝐶𝑂𝑃 𝐶𝑂𝑃𝑚𝑎𝑥 100, (16) where 𝐶𝑂𝑃𝑚𝑎𝑥 is expressed as 𝐶𝑂𝑃𝑚𝑎𝑥 = 𝑇𝐺𝐶̅̅ ̅̅ 𝑇𝐺𝐶̅̅ ̅̅ − 𝑇𝑒𝑣𝑎𝑝 . (17) 𝑇𝐺𝐶̅̅ ̅̅ is the logarithmic mean heat rejection temperature in the gas cooler component and 𝑇𝑒𝑣𝑎𝑝 is the evaporation temperature respectively found in the evaporator component. Kotas [10] argues that it is more rational to use exergy analysis to evaluate the performance of thermal-fluid systems. Kotas purposes the formulation of rational exergy 384 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management domain efficiency indices which compare process operation to that of a process which is completely reversible. The rational efficiency of a process can be defined in two different manners. The first way to define rational efficiency is by relating how much irreversibility is deducted from the driving exergy of a system due to effects of intrinsic and avoidable process inefficiencies. Mathematically the first form of rational efficiency for a heat pump is expressed as: 𝑅𝑎𝑡1 = ?̇?𝑒𝑙𝑒𝑐 − ∑ 𝐼?̇? 𝑛 𝑖=1 ?̇?𝑒𝑙𝑒𝑐 100, (18) where 𝐼?̇? is irreversibility rate inside a sub-system component which must always have a positive and greater than zero value. The second rational efficiency has equivalent logic to the COP because it is defined as the ratio of useful desired exergy delivered by the system to the exergy expenditure used to drive the system. The second form of rational efficiency is expressed as 𝑅𝑎𝑡2 = ∆?̇?𝐺𝐶 ?̇?𝑒𝑙𝑒𝑐 100, (19) where ∆?̇?𝐺𝐶 is the exergy absorbed by the water that is heated by the heat transfer process in the gas cooler component. 4. VISUALISATION OF FAULT CONDITIONS 4.1. Heat pump system faults There are multiple faults which can occur in the considered heat pump system. A fault with a fast-enough onset could be considered so severe that it may cause the system to cut out completely and to stop working instantly. A severe system fault could, for example, be a loss of electrical power to the system’s compressor. If the heat pump system’s compressor cuts out during operation, then the heat pump system will stop transferring heat between the two water streams almost instantaneously. Other system faults manifest and progress slowly with time. These faults can slowly reduce the performance of the system over time without an operator noticing due to the slow and incremental, steady decrease in system performance. This paper considers three faults that may develop slowly over time in a carbon dioxide heat pump system namely: the build-up of unwanted material inside the heat exchanger tubes, gas leakage from the refrigerant loop and finally external water pump failure. The build-up of unwanted material inside heat exchangers is formally called fouling [13]. Fouling build-up occurs when water with impurities is circulated through heat exchangers. The impurities precipitate onto the inside of the tube walls with time. The refrigerant may leak from the primary loop in a heat pump system due to the culmination of erosive effects of particulate matter that is carried with the flow of a fluid through system tubing. The erosion of the tubing may ultimately lead to a hole in the piping from which refrigerant may leak. Pump failure may occur due to problems with the pump’s electric motor or due to pump impeller damage that has occurred over time due to insufficient pump net positive suction head (NPSH) at the pump’s inlet [14]. Fouling may also build-up with time inside water pumps just like inside the tubing of components. 4.2. Visualisation of gas cooler fouling build-up Fouling in the system was simulated by varying the gas cooler’s inner tube’s outer fouling factor (𝐹𝐹𝑖𝑜), which effects the gas cooler component’s thermal resistance levels, from 0.0 to 2.0 [m2-K/kW]. Fouling has the effect of introducing extra thermal resistance in the path of heat transfer inside a heat exchanger. The effect of fouling is the same as with the introduction of extra electrical resistance in an electrical circuit. The analogy between thermal and electrical resistance is depicted in Figure 3. 385 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 3. Circuit analogy that shows the similarities between thermal and electrical resistance An increasing amount of fouling presence in the transcritical heat pump system’s gas cooler was found to have the effect on the magnitude of the four performance indices used to benchmark the refrigerant cycle as summarised in Table 1: Table 1: The effect of the fouling fault condition on the magnitude of the heat pump’s performance indices FAULT 𝑪𝑶𝑷 𝜼𝑳𝒁 𝑹𝒂𝒕𝟏 𝑹𝒂𝒕𝟐 Gas cooler fouling Decreases Decreases Decreases Increases A decrease in the heat pump system’s performance may be due to the progression of any number of faults in the system. To specifically visualise the impact and the progression of fouling a logarithmic pressure – flow energy (log. P-h) diagram is suitable. The log P-h diagram that illustrates the heat pump’s working fluid under various gas cooler fouling conditions is depicted in Figure 4. Figure 4. Fouling fault condition visualised on a log. P-h diagram 386 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management Figure 4 represents the thermo-physical condition of the heat pump’s working fluid at various physical locations throughout the heat pump system’s constituent components. Visual inspection of Figure 4 shows that in order to maintain the required heat transfer rate in the gas cooler component – the compressor’s discharge pressure and temperature trends to increase with an increase in the fouling build-up in the gas cooler. 4.3. Visualisation of working fluid leakage Working fluid leakage was simulated using a parametric study in which the mass flow rate value for the reference condition was used as the initial entry and with subsequent decreasing values for the mass flow rate onwards in the parametric table. Table 2 summarises the effect that an increasing amount of working fluid leakage has on the four performance indices. Table 2: The effect of the working fluid fault condition on the magnitude of the heat pump’s performance indices FAULT 𝑪𝑶𝑷 𝜼𝑳𝒁 𝑹𝒂𝒕𝟏 𝑹𝒂𝒕𝟐 Refrigerant leakage Decreases Decreases Increases Decreases A discretised temperature-position (T-pos) diagram has some interesting properties suited for the visual investigation of working fluid leakage. Figure 5 illustrates the lengthwise thermal interaction between the cooled working fluid stream and the heated water stream, that respectively interact in the gas cooler component, before and after working fluid leakage has occurred. Figure 5. Working fluid leakage fault condition visualised on a T-pos diagram From Figure 5 it should be noted that a pinching zone may be identified on the diagram. It can be seen that in the interval between 0 [m] and 5 [m] that the working fluid undergoes a rapid temperature decrease when compared to the reference condition in the same length interval. The pinching zone occurs due to the decreased ability of the working fluid to carry and store thermal energy as it flows through the gas cooler. The pinching zone occurs between the interacting fluid streams after working fluid leakage has occurred and the region is indicated by the black ellipse. Due to working fluid leakage the temperature difference between the interaction fluid streams in the pinching zone, as represented by the symbol – 𝜹, trends increasingly to zero. 387 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management 4.4. Visualisation of simultaneous water pump failure Simultaneous water pump failure was simulated by varying the delivered volumetric flow rate of each pump of the two water pumps simultaneously between 16 and 6.5 [l/min] in a parametric table. Simultaneous failure was simulated to investigate a system fault condition that may stem from improper pump installation/ maintenance of both system heat exchangers. A decreased flow rate delivery from a pump can be thought of as a forerunner effect that may be a warning sign in the lead up to complete pump failure. The Lorentz efficiency was selected as the z- axis parameters for the compilation of a three-dimensional graph. The x-axis represents the volumetric flow rate of feed pump one and the y-axis represents the volumetric flow rate of feed pump two. Figure 6 depicts the relationship between heat pump Lorentz efficiency and the volumetric flow rates delivered by the heat pump’s feed pumps. Figure 6. Lorentz efficiency three-dimensional representation as a function of feed pump volumetric flow rates 5. CONCLUSION Figure 4 shows a pattern where the representative expansion valve line moves from the left to the right on the graph. The line that is representative of the heat transfer in the gas cooler shows a pattern of moving from a lower pressure to a higher pressure. A transcritical heat pump of which the associated refrigerant shows the shift in cyclic representation as illustrated in Figure 4 may be diagnosed as having fouling present as a fault condition in it. The presence of a pinching zone on a T-pos diagram may be used as a means to identify and isolate a working fluid leakage fault condition. The comparison between representative fluid lines on T-pos diagrams may be used as a means to track the progression of working fluid leakage within a heat pump system. The results presented in Table 1 are distinguishable from the results present in Table 2. Gas cooler fouling and refrigerant leakage are thus distinguishable fault conditions when only using the behaviour of the performance indices as a fault detection procedure. From Figure 6 it can be seen that when the volumetric flow rate of the evaporator’s water pump, 𝑄𝐸, and the volumetric flow rate of the gas cooler’s water pump, 𝑄𝐺𝐶, are decreased then the Lorentz efficiency, 𝜂𝐿𝑍, of the associated heat pump trends to a minimum value. The Lorentz efficiency is at its lowest value in the blue ‘cold spot’ region in Figure 6. The cold spot region coincides with the lowest values of the simulated volumetric flow delivery rates by the heat pump system’s two water pumps. Future research should include an investigation of whether the same patterns and visual representations, as suggested in this paper, could be used to identify faults in larger simulated data ranges. Various combinations of fault conditions also need to be investigated and such an investigation may result in the development of improved residual definitions. 388 31st Conference on Condition Monitoring COMADEM 2018, North-West University, South Africa. and Diagnostic Engineering Management ACKNOWLEDGEMENTS The authors would like to thank the National Research Foundation (NRF) of South Africa (Grant numbers 91093 and 103392). The opinions expressed in this paper are those of the authors and are not necessarily to be attributed to the NRF. REFERENCES [1] J.-B. Carré, "Experimental investigation of electrical domestic heat pumps equipped with a twin-stage oil-free radial compressor," Ecole Polytechnique Fédérale de Lausanne, Lausanne, 2015. [2] H. Madani and E. Roccatello, "A comprehensive study on the important faults in heat pump system during the warranty period," International Journal of Refrigeration, no. 48, pp. 19-25, 2014. [3] Y. Li, W. Li, J. Liu, J. Lu, L. Zeng, L. Yang and L. Xie, "Theoretical and numerical study on performance of the air-source heat pump system in Tibet," Renewable Energy, no. 144, pp. 489-501, 2017. [4] J. Casteleiro-Roca, H. Quintián, J. Calvo-Rolle, E. Corchado, M. d. C. Meizoso-López and A. Piñón-Pazos, "An intelligent fault detection system for a heat pump installation based on a geothermal heat exchanger," Journal of Applied Logic, no. 17, pp. 36-47, 2016. [5] M. Mehrabi and D. Yuill, "International Journal of Refrigeration," 11 October 2017. [Online]. Available: https://doi.org/doi:10.1016/j.ijrefrig.2017.10.017.. [Accessed March 2018]. [6] Y. A. Cengel and M. A. Boles, "Thermodynamics an engineering approach. Eighth ed.," Mc Graw Hill Education, New York, 2015. [7] F-Chart Software, "EES Engineering Equation Solver," F-Chart Software, 2018. [Online]. Available: http://www.fchart.com/ees/. [Accessed March 2018]. [8] Y. A. Çengel and R. H. Turner, Fundamentals of thermal-fluid sciences, 2nd ed., McGraw-Hill Higher Education, 2004. [9] Borgnakke and Sonntag, Fundamentals of thermodynamics, 8th ed., Don Fowley, 2013. [10] T. Kotas, in The exergy method of thermal plant analysis, Malabar, Krieger Publishing Company, 1995, pp. 99-103. [11] J. de Bruin, K. Uren, G. van Schoor and M. van Eldik, "A thermodynamic cycle model of a transcritical CO2 heat pump for energy-visualisation," Stellenbosch, 2017. [12] J. Stene, "Residential CO2 heat pump system for combined space heating and hot water heating," Norwegian University of Technology and Science, Trondheim, 2004. [13] Y. A. Çengel and A. J. Ghajar, in Heat and mass transfer, New York, McGraw-Hill Education, 2015, p. 656. [14] A. Sayers, Hydraulic and compressible flow turbomachines, London: McGraw-Hill, 1990. 389