<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Intelligent Data Mining for Turbo-Generator Predictive Maintenance: An Approach in Real-World</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Alexandre</forename><surname>Pellicel</surname></persName>
							<email>alexandre.pellicel@termonorte.com.br</email>
							<affiliation key="aff0">
								<orgName type="institution">TermoNorte Energy Thermal Power Plant Co</orgName>
								<address>
									<settlement>Porto Velho</settlement>
									<country key="BR">Brazil</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Gonçalo</forename><surname>Cássio</surname></persName>
							<email>goncalo.cassio@termonorte.com.br</email>
							<affiliation key="aff0">
								<orgName type="institution">TermoNorte Energy Thermal Power Plant Co</orgName>
								<address>
									<settlement>Porto Velho</settlement>
									<country key="BR">Brazil</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Marco</forename><surname>Aurélio</surname></persName>
						</author>
						<author>
							<persName><forename type="first">A</forename><surname>Lopes</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">TermoNorte Energy Thermal Power Plant Co</orgName>
								<address>
									<settlement>Porto Velho</settlement>
									<country key="BR">Brazil</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Luiz</forename><forename type="middle">Eduardo</forename><surname>Borges Da Silva</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Institute Gnarus</orgName>
								<address>
									<settlement>Itajuba</settlement>
									<country key="BR">Brazil</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Erik</forename><forename type="middle">Leandro</forename><surname>Bonaldi</surname></persName>
							<email>erik.bonaldi@gmail.com</email>
							<affiliation key="aff1">
								<orgName type="department">Institute Gnarus</orgName>
								<address>
									<settlement>Itajuba</settlement>
									<country key="BR">Brazil</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Levy</forename><forename type="middle">Ely</forename><surname>De Lacerda De Oliveira</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Institute Gnarus</orgName>
								<address>
									<settlement>Itajuba</settlement>
									<country key="BR">Brazil</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Jonas</forename><surname>Guedes Borges Da Silva</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Institute Gnarus</orgName>
								<address>
									<settlement>Itajuba</settlement>
									<country key="BR">Brazil</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Germano</forename><surname>Lambert-Torres</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Institute Gnarus</orgName>
								<address>
									<settlement>Itajuba</settlement>
									<country key="BR">Brazil</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Pierre</forename><surname>Rodrigues</surname></persName>
							<email>pierre.rodrigues@jordaoengenharia.com.br</email>
							<affiliation key="aff2">
								<orgName type="institution">Jordão Engineering Co</orgName>
								<address>
									<settlement>Rio de Janeiro</settlement>
									<country key="BR">Brasil</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Intelligent Data Mining for Turbo-Generator Predictive Maintenance: An Approach in Real-World</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">825AAEF759877FC9243816E82577E30C</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T16:27+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Electrical measurements</term>
					<term>signal processing</term>
					<term>rough sets</term>
					<term>data mining</term>
					<term>intelligent systems</term>
					<term>turbo-generators</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper presents the development of a supervision system for predictive maintenance and diagnosis of turbo-generators. The aim of the developed system is to verify the degradation conditions of TermoNorte generators. Initially, a system for extracting features of the turbo-generator operational database has been developed to detect possible problems that cause premature fails. The system has been divided in two parts. The first one is a data acquisition system directly connected to the generator in order to sample some operational variables. The second part concerns an intelligent data mining, based on Rough Sets Theory, into the database involving the supervision system variables, to use the existing historic data to perform analysis of the problems and possible causes.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>The generators are the most important equipments in the energy generation process. The power system reliability, power system supply and power system stability are indexes directly affects for the generator operational conditions. For this reason, protection and monitoring equipment are increasingly employed in order to prevent fails <ref type="bibr" target="#b0">[1]</ref>.</p><p>One of the technologies that can be employed within the purpose of predicting failures is the electric signature analysis (ESA) <ref type="bibr" target="#b1">[2]</ref>, which consists of a set of methods and techniques that monitor the condition of electric machines by identifying patterns and deviations. It is detected by processing and analysis of voltage and current signals acquired machinery under monitoring.</p><p>These techniques based on electrical signatures can be applied from the generator and primary source until the motor and load coupled. They may be based on: (a) invasive methods, such as the electric circuit analysis (with static analysis and nonenergized machine, also referred to as offline analysis and therefore invasive), or (b) non-invasive methods, such as ESA (dynamic analysis, i.e. with the machine in operation, also referred to as online analysis) <ref type="bibr" target="#b1">[2]</ref>.</p><p>For a more comprehensive monitoring of the generator, it is important to the application of invasive and non-invasive methods, based not only on the signature electric as well as other monitoring techniques such as vibration analysis. It is recommended the application of invasive techniques in shutdowns, while noninvasive techniques should be applied periodically during the operating cycle of the machine.</p><p>This project aims to develop a methodology for the detection and dynamic analyses of online monitoring of the condition of turbo-generators based on acquisition, processing and analysis of voltage and current signals. The main fails such as shortcircuit in stator and rotor windings, fails in excitement system, misalignment and eccentricity of spinning field have been studied by electric signature analysis.</p><p>The paper presents the developed system and some practical results in a TermoNorte Thermal Power-Plant, located in Porto Velho, northwest part of Brazil.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Electric Signature Analysis</head><p>Electric Signature Analysis (ESA) is the term used for all evaluations of voltage and current signals of electric machines. The most common analysis transforms the voltage and current signal to the frequency domain where they are analyzed. The analysis is based on two fundamental assumptions: (a) the signature of a machine with failure is different from the signature of a machine in perfect state of operation and (b) the failures are repeated with regular patterns, causing failure patterns, which can be identified and related parts of the machine.</p><p>These techniques can be applied in electric motors and generators. It is important to note that the Voltage Signature Analysis (VSA) is related to an upstream analysis, i.e. toward the generator; and the Current Signature Analysis (CSA) is related to a downstream, i.e. toward the motor. In this project, CSA and the Extended Park Vector Approach (EPVA) are the methods used in this development because they have more features applicable to electric generators. Also these methods have been applied in electric motors, but not to generators yet <ref type="bibr" target="#b1">[2]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Overview of Current and Voltage Signature Analysis</head><p>CSA or VSA techniques are used to generate analyses and trend of electric machines. They aim to detect predictive failures in a plant, such as: problems in the stator winding, rotor problems, problems on coupling load, efficiency and loading of the system; bearing problems, among others.</p><p>It might be of surprise, but electrical signals (voltage and\or current) can carry additional information about electrical and mechanical problems of generating equipment, but the machine works as a transducer for mechanical failures, allowing the electrical signals (voltage and/or current) can carry information of electrical and mechanical problems. The signals of current and/or voltage of one (or three) phases of the machine produce, after examination, the signature of machine, i.e., its operating pattern. This signature is composed of frequency magnitudes of each individual component extracted from their signals of current or voltage. This fact allows the monitoring of the evolution of the frequency magnitudes, which can denote some sort of evolution of the operational conditions of the machinery.</p><p>The response that the user wants is to know whether your machine is "healthy" or not, and what part of machine is in failure.</p><p>This analysis (diagnosis) is not easily done because it involves a set of comparisons with previously stored patterns and own "history" of the machine under analysis. At this moment, usually an expert is called to produce the final diagnosis, generating command when stopping the machine. Thus, the system developed in this project for automatic diagnosis combines the history of turbo-generator, expert knowledge and failures patterns and it can be very useful for a power company.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Extended Park Vector Approach</head><p>The EPVA technique should be used to verify electric stator imbalances. However, it can only be used if the signals of voltage and/or current have been demodulated <ref type="bibr" target="#b2">[3]</ref>. The central idea of this technique is checking failures by the distortion of Park´s circle, i.e., more distortion in the Park´s circle more is the unbalance of the machine. The current components of the Park vector are described by i D and i Q :</p><formula xml:id="formula_0">C B A D i i i i                        6 1 6 1 3 2 (1) C B Q i i i               2 1 2 1 (<label>2</label></formula><formula xml:id="formula_1">)</formula><p>Where the currents i A , i B and i C are the three phases. In ideal conditions:</p><formula xml:id="formula_2">) cos( 2 6             t i i M D (3) ) ( 2 6             t sin i i M Q (4)</formula><p>For normal conditions, Park circle is centered at the origin of coordinates.</p><p>The Park circle has distortions when there are abnormal conditions of operation or when mechanical or electric failures occur. However, these distortions in the Park circle are not easy to be seen or measured, hence the proposition of the Extended PVA (EPVA), observing the spectrum module of Park vector.</p><p>The EPVA technique combines the robustness analysis of Park circle and the flexibility of spectral analysis <ref type="bibr" target="#b3">[4]</ref>. An important feature of the Park transformation process is the fundamental component of analyzed signals is erased <ref type="bibr" target="#b4">[5]</ref>. This fact allows the component characteristics of failure to appear with greater prominence. And more, to be a method that covers the three-phase simultaneously electric stator imbalances are also covered by this method <ref type="bibr" target="#b5">[6]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Electrical Signal Processing</head><p>For the characteristic extraction of digital signals, there is a pre-conditioning process and then some paths to compute the values of each variable. Different parameters are obtained in the time domain and frequency domain. In Fig. <ref type="figure" target="#fig_0">1</ref>, a flowchart of the used techniques is shown. The grayish background blocks represent a processed signal that can be viewed or used for the characteristic extraction. The blank blocks represent an algorithm or a processing applied to digital signal. Below a brief commentary on each of the processing blocks is presented:</p><p>• Signal Composition: it converts the data from the data acquisition system to digital signals whose amplitudes represent actual values for current and voltage.</p><p>• Pre-conditioning: process that eliminates the initial part of the signal to avoid samples obtained during the transients of the filters. Then the average value of each signal whose nature is alternating is deleted.</p><p>• Park Transformation: when there is a three-phase electrical system composed by three currents (I A , I B and I C ) and three voltages (V AB , V BC and V CA ), the Park transformation is applied to obtain the Park vector, composed by components I Q , I D and I 0 . In some cases, it is also used the spectrum of this vector module for electric system imbalance.</p><p>• Hilbert transformation: when applied to a signal, it returns the magnitude (envelope) and instantaneous phase.</p><p>• RMS Filter: knowing the fundamental frequency of the signal, the RMS filter returns the instantaneous RMS value of the signal during the sampling period, resulting in the so-called RMS Curve.</p><p>• Windowing: filter applied to a signal in time to reduce the effect of "leakage" in the frequency spectrum. There are several types of windowing (Blackman, Hamming, Hanning, etc.), the Blackmann windowing has been used. This window allows the identification of peaks as lobes slightly wider and less "leakage" on their side bands than other Windows.</p><p>• Fourier Transform: used to transform a time domain signal into the frequency domain, the discrete Fourier transform (DFT) returns a vector with the spectrum amplitudes and their phases. To accelerate the achievement of the DFT, we used an algorithm called FFT (Fast Fourier Transform).</p><p>For each acquired electrical signals, various parameters are computed. These parameters are used for the evaluation process and for the extraction of new features, and they are listed below.</p><p>• Average amplitude: the average value of the signal in the period under review;</p><p>• RMS amplitude: also called effective value or mean square;</p><p>• Minimum and maximum amplitude values: maximum and minimum values of amplitude in the period under review;</p><p>• Amplitude, phase and fundamental frequency: value of amplitude and frequency in Hz of the fundamental component of signal (electrical system to signal fundamental frequency is 60 Hz);</p><p>• Fundamental harmonics: multiples of the fundamental component.</p><p>• Harmonic distortion index (HDI): it indicates the significance of harmonic content when compared to the fundamental component of the signal.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Description of the Data Mining Algorithm</head><p>This section introduces expeditiously the data mining algorithm used to perform comparisons between the processed signals, the database signals and failures patterns. The used technique was based on the Rough Set Theory <ref type="bibr" target="#b6">[7]</ref>. This technique aims to extract a set of rules (or conditions) from a database through two hyper-sets, called upper approximation set and lower approximation set. The set of rules contains the lower approximation set and is contained by the upper approximation set. The central idea of the algorithm is to reduce the number of elements in the upper approximation set and to increase the number of elements in the lower approximation set. In an ideal condition, these two sets would become only one set that would be the required set. This set is represented by the set of production rules.</p><p>The used algorithm <ref type="bibr" target="#b7">[8]</ref>  The next step is to check equal attributes or unnecessary classification (dispensable attributes). This is done in the first case by mere inspection and, in the second case, by the removal of each of them and subsequent verification of the inclusion of issues of classification.</p><p>Then, with only the essential attributes, the core of each rule set is computed. The core is formed by those attributes indispensable for that rule, and those sets of examples. Next step of forming core set consists of computing reduce set, which contain only of core attributes augmented by attributes qualifying exactly a rule. Finally, the rules are similar and the set of production rules for the classification of input signals.</p><p>In the context of the developed software for turbo-generator predictive maintenance, the algorithm and mathematical structures described were implemented and serve to extract knowledge of diagnostic data. Thus, the system is able to diagnose new cases on the basis of the knowledge extracted from previous cases. It is important to note that the procedure is transparent to the user, that is, it occurs within the computational package developed, activated by a button command in the program window itself and providing the user with the proper classification.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Illustrative Example of the Data Mining Process for Turbo-Generator Feature Extractions</head><p>The current development has been applied in the TermoNorte Thermal Power Plant, located in the Brazilian north region close to Amazon jungle. This power plant is composed by two plant in the same area. The TermoNorte I has a total generation capacity equal to 64 MW, from 4 Diesel Wärtsilä motor-generators, each one with 16</p><p>MW. The TermoNorte II has a total generation capacity equal to 340 MW, from 3 GE gas turbines.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1">Some Features about the Data Acquisition</head><p>The installed data acquisition system is composed by current and voltage transducers, a pre-processing acquisition module and a data acquisition module, shown in Fig. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2">Computational Package</head><p>The developed computational package is composed to two main parts: (a) data acquisition control and (b) feature extraction and signal processing. The first part contains parameters for data acquisition process. The main signal acquisitions are usually made through three acquisitions:</p><p>• Acquisition 1: it aims to collect both the signs of current and voltage (phase) of the turbine, so the EPVA techniques and energy quality are applied;</p><p>• Acquisition 2: acquisition of voltages, with the goal of applying the technique VSA (Voltage Signature Analysis);</p><p>• Acquisition 3: it does the acquisition of one of the stages for the application of the technique CSA (Current Signature Analysis).  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.3">Feature Extraction -Data Mining Process</head><p>The described algorithm in Section 4 has been also implemented in the computational package (second part of the package in our description -for users this division of the package doesn´t exist). The signals shown in Fig. <ref type="figure" target="#fig_2">3</ref> are expressed by their main features, such as frequencies, amplitudes, phases, and merge to turbo-generator parameters itself. This set of data is the input data, and must be related to a type of previous operation condition: normal, abnormal, failure #1, failure #2, and so on. An example of the input signal database is shown in Table <ref type="table" target="#tab_1">1</ref>.</p><p>With the database the data mining process starts with the definition of labels (classes or ranks) for each attribute (input variable). The program contains a pre-set of labels for each attribute. This pre-set has been adjusted during the test phase of the prototype in the power plant. However, if the user would change the interval of these labels it is possible. However, in the daily operation, this pre-set of labels remains constant. Internally, the program merges the equal examples, verifies dispensable attributes, computes the core and the reduce sets, and finally produces the final set of rules. And academic example of this process is presented for a small database (part of the real database). Table <ref type="table" target="#tab_2">2</ref> shows the data after the application of labels. Ten examples are shown with the following input attributes: frequency, amplitude, TDH (harmonic distortion level), and distortion (from Park Vector circle). The possible outputs are "normal", "warning", and "danger". After the transformation from numbers in labels of the attribute values, the second step of the algorithm can be performed -to remove equal examples. In this case, examples 2 and 7 are equal, and 4 and 8 also. Then one of them can be removed without any type of information lack, resulting in Table <ref type="table">3</ref>.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Block diagram of the algorithms of signal conditioning and processing.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>2 .Fig. 2</head><label>22</label><figDesc>Fig. 2 Data acquisition system: (a) data acquisition pre-processing acquisition modules, and (b) voltage transducers.</figDesc><graphic coords="7,169.20,300.56,163.68,122.88" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Fig. 3</head><label>3</label><figDesc>Fig. 3 Acquired Signals by: (a) EPVA to current and voltage, respectively, (b) CSA signal, and (c) VSA signal. Special window interfaces have been developed to transfer all system control to the operator. Examples of this interface are shown in Fig. 4. These interfaces are in Portuguese language. The first figure shows an example of the supervision interface with the data acquisition control information; and the second figure is one of the analysis procedures.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Fig. 4 .</head><label>4</label><figDesc>Fig. 4. Examples of user interface of the computational package.</figDesc><graphic coords="8,127.44,441.68,168.96,117.84" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head></head><label></label><figDesc>In the first step (initialization), ranks (classes) of each attribute (input or output variables) are defined, and each interval receives an identification label. This division creates a cross-linked sample space and the next step may apply (remove equal examples). All examples within a same hyper-cube are grouped into only one.</figDesc><table><row><cell>has six main steps, they are:</cell></row><row><cell>1. initialization;</cell></row><row><cell>2. remove equal examples;</cell></row><row><cell>3. remove of dispensable attributes;</cell></row><row><cell>4. compute the core set;</cell></row><row><cell>5. compute the reduce set; and</cell></row><row><cell>6. merge rules.</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 1 .</head><label>1</label><figDesc>Partial example of the signal acquisition database.</figDesc><table><row><cell>Acquisition</cell><cell>Sample Frequency (Hz)</cell><cell>Number of Samples</cell><cell>Time of Sample (s)</cell><cell>Spectral Definition (Hz)</cell><cell>Total Time (s)</cell></row><row><cell>1</cell><cell>8193 1638</cell><cell>21845</cell><cell>2,7 13,3</cell><cell>0,3704 0,0752</cell><cell>27 37</cell></row><row><cell>2</cell><cell>8193 1638</cell><cell>131072</cell><cell>16 80</cell><cell>0,0625 0,0125</cell><cell>40 104</cell></row><row><cell>3</cell><cell>8193 1638</cell><cell>131072</cell><cell>16 80</cell><cell>0,0625 0,0125</cell><cell>40 104</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 2 .</head><label>2</label><figDesc>Partial example of the signal acquisition database.</figDesc><table><row><cell cols="3">Example Frequency Amplitude</cell><cell>TDH</cell><cell>Distortion</cell><cell>Output</cell></row><row><cell>1</cell><cell>Low</cell><cell>Normal</cell><cell>Normal</cell><cell>Normal</cell><cell>Normal</cell></row><row><cell>2</cell><cell>Low</cell><cell>Medium</cell><cell>Medium</cell><cell>Normal</cell><cell>Normal</cell></row><row><cell>3</cell><cell>Low</cell><cell>Medium</cell><cell>Normal</cell><cell>High</cell><cell>Normal</cell></row><row><cell>4</cell><cell>Medium</cell><cell>Medium</cell><cell>Normal</cell><cell>Medium</cell><cell>Warning</cell></row><row><cell>5</cell><cell>Medium</cell><cell>Medium</cell><cell>Normal</cell><cell>High</cell><cell>Warning</cell></row><row><cell>6</cell><cell>Medium</cell><cell>High</cell><cell>Normal</cell><cell>High</cell><cell>Danger</cell></row><row><cell>7</cell><cell>Low</cell><cell>Medium</cell><cell>Medium</cell><cell>Normal</cell><cell>Normal</cell></row><row><cell>8</cell><cell>Medium</cell><cell>Medium</cell><cell>Normal</cell><cell>Medium</cell><cell>Warning</cell></row><row><cell>9</cell><cell>High</cell><cell>High</cell><cell>Medium</cell><cell>Medium</cell><cell>Danger</cell></row><row><cell>10</cell><cell>Medium</cell><cell>High</cell><cell>Normal</cell><cell>Medium</cell><cell>Danger</cell></row></table></figure>
		</body>
		<back>
			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In order to verify possible dispensable attributes, each attribute is removed and a verification of possible mistake classification is performed. In this case, for instance, the attribute "frequency" is not dispensable because without it examples 2 and 5 present two different outputs for the same input. The same occurs with the attributes "amplitude" and "distortion". However, the attribute "TDH" is dispensable in this case because without this attribute the table remains consistent (Table <ref type="table">4</ref>). At this moment, the database is ready to compute the core set. Removing each value of each example and verifying the mistake in the classification, it is possible to computer each element of the core set. If the lack of the element creates a mistake, this element takes part of the core set, otherwise not. Table <ref type="table">5</ref> presents the core set of the illustrative database example. Then the reduce set can be computed. It is made including the minimum number of attributes with the core to represent the example. In this case, it results in 11 examples (rules). Finally, the final set of rules is composed by 7 different rules, shown in Table <ref type="table">6</ref>. An example of the produced rule of the developed system is: If I PV ≥ -26db (0.05) then output = "warning" and failure = "stator current unbalance".</p><p>In English language: If the current Park Vector component is equal to or bigger than -26 db (0.05) then the operational condition is "warning" and the possible failure is "stator current unbalance".</p><p>Rule-extraction algorithm is usually run once a quarter. The most important part of the process to the users is the analysis of the current signals, it means, the operational condition of the machine at this moment. For acquired current signals pass by the rule set and a condition of the generator is presented to the operator. The major part of the time the answer of the program is "Normal"; however, when a abnormal situation is detected a failure pattern is shown to the operator. An example of this is presented in Fig. <ref type="figure">5</ref>. This figure shows this abnormal situation with two pre-set lines. The yellow line expresses the warning level and the red line express a danger level. In this cases, -26 db and -20 db, respectively.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Conclusions</head><p>This paper shows a complete development of a supervision system with an intelligent data signal processing based on feature extraction using Rough Set Theory. The feature extraction relates processed current and voltage signals from the turbogenerator by VSA, CSA and EPVA techniques, turbo-generator electrical and mechanical parameters, and typical types of failures existing in this kind of machine.</p><p>Hardware and software have been developed to acquire and treat the electrical signals in a non-invasive process. It means, the operational condition of the generator is verified without any type of disturbance in the machine or in its control. The electrical signals are taken out of the machine, more specifically in the secondary of instrument transformers (CT and PT) in the panel control.</p><p>This system is currently in full operation at TermoNorte Thermal Power Plant, in Brazil.</p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Proposing a Procedure for the Application of Motor Current Signature Analysis on Predictive Maintenance of Induction Motors</title>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">L</forename><surname>Bonaldi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">E L</forename><surname>Oliveira</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Lambert-Torres</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">E</forename><surname>Borges Da Silva</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">20th Int. Cong. Exh. Condition Monitoring and Diagnosis Engineering Management, COMADEM 2007</title>
				<meeting><address><addrLine>Faro, Portugal</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Failure Predictive Diagnostic in Three-Phase Induction Motors with MCSA and Rough Set Theory</title>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">L</forename><surname>Bonaldi</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2006">2006</date>
		</imprint>
		<respStmt>
			<orgName>Itajuba Federal School of Engineering, Itajuba -Brazil</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Ph.D. Thesis</note>
	<note>in Portuguese</note>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">Failure Diagnostic in Three-Phase Induction Motors</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J M</forename><surname>Cardoso</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1991">1991</date>
			<publisher>Coimbra Editora</publisher>
			<pubPlace>Coimbra -Portugal</pubPlace>
		</imprint>
	</monogr>
	<note>in Portuguese</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">A Review of Induction Motors Signature Analysis as a Medium for Faults Detection</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">H</forename><surname>Benbouzid</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Trans. Industrial Eletronics</title>
		<imprint>
			<biblScope unit="volume">47</biblScope>
			<biblScope unit="page" from="984" to="993" />
			<date type="published" when="2000">2000</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Diagnosis of the Multiple Induction Motor Faults Using Extended Park&apos;s vector Approach</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">M A</forename><surname>Cruz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J M</forename><surname>Cardoso</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Int. J. Comadem</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="19" to="25" />
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Diagnosis of Stator Inter-Turn Short Circuits in DTC Induction Motor Drives</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">M A</forename><surname>Cruz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J M</forename><surname>Cardoso</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Trans. Industry Applications</title>
		<imprint>
			<biblScope unit="volume">40</biblScope>
			<biblScope unit="page" from="1349" to="1360" />
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Rough Sets</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Pawlak</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Int. J. Information and Computer Sciences</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page" from="341" to="356" />
			<date type="published" when="1982">1982</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Rough Set Theory -Fundamental Concepts, Principals, Data Extraction, and Applications</title>
		<author>
			<persName><forename type="first">S</forename><surname>Rissino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Lambert-Torres</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Data Mining and Knowledge in Real Life Applications</title>
				<editor>
			<persName><forename type="first">J</forename><surname>Ponce</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Karahoca</surname></persName>
		</editor>
		<imprint>
			<publisher>-Tech Press</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="35" to="58" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
