<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Trend template: mining trends with a semi-formal trend model</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Olga</forename><surname>Streibel</surname></persName>
							<email>streibel@inf.fu-berlin.de</email>
						</author>
						<author>
							<persName><forename type="first">Lars</forename><surname>Wißler</surname></persName>
							<email>lars.wissler@googlemail.com</email>
						</author>
						<author>
							<persName><forename type="first">Robert</forename><surname>Tolksdorf</surname></persName>
						</author>
						<author>
							<persName><forename type="first">Danilo</forename><surname>Montesi</surname></persName>
							<email>montesi@cs.unibo.it</email>
						</author>
						<author>
							<affiliation key="aff0">
								<orgName type="laboratory">Networked Information Systems Group</orgName>
								<orgName type="institution">Freie Universität Berlin</orgName>
								<address>
									<settlement>Berlin</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff1">
								<orgName type="institution">University of Bologna</orgName>
								<address>
									<settlement>Bologna</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Trend template: mining trends with a semi-formal trend model</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">8F2504492391FF12BCE325F58411826C</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T07:17+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Predictions of uprising or falling trends are helpful in different scenarios in which users have to deal with huge amount of information in a timely manner,such as during financial analysis. This temporal aspect in various cases of data analysis requires novel data mining techniques. Assuming that a given set of data, e.g. web news, contains information about a potential trend, e.g. financial crisis, it is possible to apply statistical or probabilistic methods in order to find out more information about this trend. However, we argue that in order to understand the context, the structure, and explanation of a trend, it is necessary to take a knowledge-based approach. In our study we define trend mining and propose the application of an ontology-based trend model for mining trends from textual data. We introduce the preliminary definition of trend mining as well as two components of our trend model: the trend template and the trend ontology. Furthermore, we discuss the results of our experiments with trend ontology on the test corpus of German web news. We show that our trend mining approach is relevant for different scenarios in ubiquitous data mining.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>When discussing trends some of us may think about the ups and downs of NASDAQ 1 , or DAX 2 curves, or changes in public opinion on politics before elections. Likewise, one can think about web trends, life style trends or daily trends, i.e. hot topics, in the news or on social networks. Changes in a mobile data stream also fall within the definition of a trend. Understanding a trend as a hot topic is related to the research in Emerging Topic Detection (EDT) and Topic Detection and Tracking (TDT), the subfields of information retrieval <ref type="bibr" target="#b3">[Allan, 2002</ref><ref type="bibr" target="#b3">][Kontostathis et al., 2003]</ref>. A trend is defined there as a topic that emerges in interest and utility over time. Accordingly, common examples of trends may be the "Arab Spring" 1 http://www.nasdaq.com/ online accessed 04-17-2013 2 http://dax-indices.com/EN/index.aspx?pageID=4 online accessed <ref type="bibr">04-17-2013</ref> which emerged in political news worldwide in the beginning of 2011, as well as the financial and real estate crisis which started to emerge on business news worldwide in 2008. A graphical representation of a trend, based on GoogleTrends<ref type="foot" target="#foot_0">3</ref> , is shown in Fig. <ref type="figure">1</ref>.</p><p>Figure <ref type="figure">1</ref>: This graph shows a search volume index for the terms "financial crisis" (blue curve) and "insolvent" (red curve) in Germany from 2006 to 2011. Source: GoogleTrends Several methods have been proposed for detecting trends in texts or discovering trends in the web news (see Section 3). Other works provide approaches from statistics and time series analysis that can be applied for analyzing trends in nontextual data. Our work contributes to the general understanding of trend mining that we see as highly relevant to ubiquitous data mining. In this paper, we explain our abstract concept of a trend template and go on to describe a trend ontology which is an instance of the trend template.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Ubiquitous data mining and trend mining</head><p>The Ubiquitous Data Mining (UDM) is defined as the essential part of the ubiquitous computing <ref type="bibr">[Witten and Eibe, 2005]</ref>. The UDM techniques help in extracting useful knowledge from data that describes the world in movement, including the aspects of space and time. Time is the necessary dimension for trend mining-there is no trend without time. And a trend is one of the aspects of a world in movement. Before we discuss general trend characteristics, we want to mention the sociological and statistical perspectives on the trend, as well as define trend mining. This helps in understanding the trend characteristics that create the basis for the definition of our trend template later in this paper.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Trend from different perspectives</head><p>Detecting trends from the sociological point of view is an analytical method for observing changes in peoples behavior over time with regard to "six attitudes towards trends" <ref type="bibr">[Vejlgaard, 2008]</ref>. The definition of these six attitudes is based on eight different personality profiles of groups who participate in the trend process: trend creators, trend setters, trend followers, early mainstreamers, mainstreamers, late mainstreamers, conservatives and anti-innovators. Detecting trends from the statistics perspective is based on trend analysis of time-series data with two goals in mind: "modeling time series (i.e. to gain insight into the mechanisms or underlying forces that generate the time series) and forecasting time series (i.e. to predict the future values of the time-series variables)" <ref type="bibr" target="#b3">[Han and Kamber, 2006]</ref>. The trend analysis process consists of four major components: trend or long-term movements, cyclic movements or cyclic variations, seasonal movements or seasonal variations, and irregular or random movements <ref type="bibr" target="#b3">[Han and Kamber, 2006]</ref>. A trend, in this context, is an indicator for a change in the data mean <ref type="bibr" target="#b5">[Mitsa, 2010]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Trend mining</head><p>Since data mining can be described as "the extraction of implicit, previously unknown, and potentially useful information from data" <ref type="bibr">[Witten and Eibe, 2005]</ref>, we propose the use of the term trend mining as defined below: DEF 2.1 Trend mining is the extraction of implicit, previously unknown and potentially useful knowledge from time-ordered text or data. The trend mining techniques can be used for capturing trend in order to support user in providing previously unknown information and knowledge about the general development in users field of interests.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Related Research</head><p>In general, when mining trends from textual data, at least the following three research areas should be mentioned: emergent trend detection, topic detection and tracking, and temporal data mining.</p><p>In <ref type="bibr" target="#b3">[Kontostathis et al., 2003]</ref> several systems that detect emerging trends in textual data are presented. These ETD systems are classified into two main categories: semiautomatic and fully-automatic. For each system there is a characterization based on the following aspects: input data and attributes, learning algorithms and visualization. This comparison includes an overview over the research published in <ref type="bibr" target="#b2">[Allan et al., 1998</ref><ref type="bibr" target="#b3">][Lent et al., 1997</ref><ref type="bibr" target="#b1">][Agrawal et al., 1995</ref><ref type="bibr" target="#b5">][Swan and Jensen, 2000</ref><ref type="bibr" target="#b5">][Swan and Allan, 1999</ref><ref type="bibr" target="#b5">][Watts et al., 1997]</ref>. TDT research <ref type="bibr" target="#b3">[Allan, 2002]</ref> is predominantly related to the event-based approaches. Event-based approaches for trend mining underlie the assumption that trends are always triggered by an event, which is often defined as "something happening" or "something taking place" [Lita <ref type="bibr" target="#b4">Lundquist, 2000]</ref> in the literature. Considering a trend from the event research perspective means that trend detection has to be understood as a monitoring task. This is mostly the case for so-called short-term trends that are indeed triggered by some events and in order to detect them we have to monitor the stream in which they occur, e.g. the occurrence of "Eyjafjallajkull eruption"<ref type="foot" target="#foot_1">4</ref> which was reported in social networks and on the news in March 2010. However, so-called long-term trends, e.g. "financial crisis", that started to be on-topic in 2008 are not necessarily conjoined with one specific event. It is more a chain of events or even the "soft" indicators as public opinion or news. No sharp distinction has been made between the TDT and ETD research fields, which means that some research such as [Swan and <ref type="bibr" target="#b5">Allan, 1999]</ref> or <ref type="bibr" target="#b3">[Lavrenko et al., 2000]</ref> can be in fact classified into both fields. Temporal data mining research <ref type="bibr" target="#b5">[Mitsa, 2010]</ref> offers methods for clustering, classification, dimension reduction and processing of time-series data <ref type="bibr" target="#b5">[Wang et al., 2005]</ref>. It addresses in general the temporal data and the techniques of time series analysis on these data. One definition of temporal data is "time series data which consist of real valued sampled at regular time intervals" <ref type="bibr" target="#b5">[Mitsa, 2010]</ref>. Temporal data mining applies the data mining methodology and deals with the same approaches for classification or clustering, that are relevant for mining trends in textual data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Trend template</head><p>Based on our experiments and considerations, we outline the following assumptions about trends in the general context of this work; A trend can be described by the following characteristics: trigger, context, amplitude, direction, time interval, and relation. Fig. <ref type="figure" target="#fig_0">2</ref> illustrates the trend template. In 4.1, we more precisely define each characteristic.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">Definitions</head><p>Trigger is a thing. They can be: an event, a person, or a topic anything that triggers the trend. A trigger can but does not have to cause a trend. A trigger makes the trend visible. An example of a trigger is Lehman Brothers 5 insolvency that can be classified as both a topic and an event.</p><p>Context is the area of the trigger. If the trigger is a topic then the context is this topic's area, e.g. Lehman Brothers insolvency is mentioned in the context of real estate market. Amplitude is the strength of a given trend. It can be expressed by a number, the higher the number the more impact the trend has or by a qualitative value that describes the trend phase, e.g. beginning (setter), emerging (follower), mainstream, fading (conservative). Time is necessary while spotting trend, since there can be no trend without time. It is the interval in which the trend is appearing, independent from the amplitude, e.g. the real estate crisis appeared between the years 2008-2011. Relation expresses the dependency between the trigger and the context, it puts the given trigger, e.g. Lehman Brothers insolvency within the given context of the real estate crisis in a relation, e.g. Lehman Brothers insolvency is part of the </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">Formal description</head><p>The trend template is an abstract model that describes the main concepts that are important and necessary for knowledge-based trend mining. In following, we more explicitly define the trend template: DEF. 4.1: Trend template (TT) is a quintuple:</p><formula xml:id="formula_0">T T := T, C, R, T W, A</formula><p>where: T is trigger, C is context, R is relation, T W is time window, and A is amplitude.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>DEF. 4.2: T-Trigger is set of concepts:</head><p>T := {t 0 , . . . , t n }, n ∈ N ∧ t ∈ T so that if E, P , T o are the sets defining: events: E := {e 0 , . . . , e n }, n ∈ N ∧ e ∈ E persons:</p><formula xml:id="formula_1">P := {p 0 , . . . , p n }, n ∈ N ∧ p ∈ P locations: L := {l 0 , . . . , l n }, n ∈ N ∧ l ∈ L topics: T o := {t o0 , . . . , t on }, n ∈ N ∧ t o ∈ T o then: T := E ∪ P ∪ T o ∪ L DEF. 4.3: C-Context</formula><p>is a union set consisting of a set of concepts and a set of relations between them where c is a context element:</p><formula xml:id="formula_2">C := C co ∪ R co , c ∈ C</formula><p>with C co the set of concepts</p><formula xml:id="formula_3">C co := {c co0 , . . . , c con }, n ∈ N ∧ c co ∈ C co</formula><p>and R co the set of relations:</p><formula xml:id="formula_4">R co := {r co0 , . . . , rco n }, n ∈ N ∧r co ∈ R co ∧R co ⊆ C co × C co</formula><p>whereas r co defines a binary relation:</p><formula xml:id="formula_5">r co : c cox , c coy −→ r co (c cox , c coy ) ∧ c cox = c coy</formula><p>and the context element is defined by:</p><formula xml:id="formula_6">c = c co ∪ (c coi , c coj ) C = C co ∪ C co × C co DEF.</formula><p>4.4: R-Relational is a set of relations:</p><formula xml:id="formula_7">R := {r 0 , . . . , r n }, n ∈ N ∧ r ∈ R ∧ R := {T × C} with r i : t i , c i −→ r i (t i , c i )</formula><p>DEF. 4.5: TW-Time window is a function that assigns time slice to the time points:</p><formula xml:id="formula_8">T P := {t point |t point = = ms ∨ second ∨ minute ∨ hour ∨ day ∨ month ∨ year} T S := t point0 . . . t pointn T W : T P −→ T S</formula><p>DEF. 4.6: A-Amplitude is a function that assigns a value to the quadruple of T, C, R, T W</p><formula xml:id="formula_9">A : T × C × R × T W −→ N ∪ V where N is the set of natural numbers and V is the set of categorical values a : (t, c, r, tw) −→ n ∨ v</formula><p>One way of implementing the trend template is the realization of this model in the form of an ontology. We can understand the ontology as an instance of the trend template.</p><p>Based on the trend template described above, we created an applicable model, using SKOS 6 and RDFS/OWL 7 concepts and properties. Our model serves as a general model that can be extended regarding the particular application domain and applied for annotating a text corpus in order to retrieve the trend structure. The trend ontology is divided into levels meta, middle and low which correspond to three abstract layers of the model. Whereas the low level and the middle level relate to the corresponding application domain (in our case it is the German Stock Exchange, DAX), the meta level is the most interesting one. Meta ontology incorporates the general trend characteristics and can be applied to any application domain. The central concepts of the ontology are Trigger, Trig-gerCollection, Indication, Relational and ValuePartition and have been modeled as subconcepts of skos:Concept, skos:Collection and time:TemporalEntity, with different semantic construction, e.g. skos:related, skos:member. The concepts mirror the composition of the trend template. Trigger consists of three subconcepts: event, person, location. The main goal of the meta ontology is to offer all necessary concepts and relations in order to span the trend template as a structure over a text corpus. To actually translate a specific document corpus into such a structure, meta ontology needs to be combined with a domain specific trend ontology which defines domain specific concepts, their keywords and possibly also their relations. This can either be done manually by extracting common terms as keywords and linking them to their respective concepts, or automatically by entity recognition. The pseudocode 6.1 describes the algorithm that we applied to build up the trend description on the test corpus.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Experiments</head><p>The text corpus which we call German finance data 8 that served as our test corpus consists of about 40,500 news articles related to the fields of business and finance, provided as XML files. The corpus is available in German and provides news articles from January 2007 to May 2008. The text was parsed in cooperation with neofonie 9 from the following sources: comdirect 10 , derivatecheck 11 , Handelsblatt 12 , God-modeTrader 13 , Yahoo 14 , Financial Times Deutschland 15 , and finanzen.net 16 . In general, the content of the corpus is focused on finance and business information concerning German companies and stocks. It focuses on the situation at DAX, as well as on reviews and ratings of German companies and shares. For evaluation purposes regarding usefulness and practicability, the trend ontology has been filled with two different parts of the test corpus: stock market specific documents in Part 1 and the general business news in Part 2 (subsequently first and second part). They contain over 5,000 and 16,000 documents respectively. We specified several basic questions and respective queries as relevant for trends in general and specifically for stock market trends. Querying the ontology for the total occurrence of concepts yields the following output (shortened to some of the most relevant concepts): Germany (9,137), USA (4,808), Deutsche Telekom (442), Allianz (433), Switzerland (382), Starbucks (104). The output corresponds directly to the corpus of German stock news with a clear focus on German companies followed by the still dominant US market. A similar query for often mentioned lines of business in the context of Germany in contrast to the USA yields a major focus on the industry for Germany. 4.5% to 7.1% of the total occurrences of Germany appear in the context of different lines of industry. The USA is strong in the context of IT (9%) and services (6.9%). Moreover, we checked so-called topic structure by using our ontology. Here a general example for the concept Germany:  In Fig. <ref type="figure" target="#fig_2">3</ref> we show the comparison of the performance values for the stock markets as ranked by ontology (test based on time window: July 2007 to April 2008) and reported in real (time window July 2007 to July 2011). Applying the trend ontology to the test set enables to find out specific information about the certain trend that is described in the documents of the test set. Our preliminary experiments results that we partially present in this paper show that our idea of a trend template could help in harvesting knowledge from the given test data in a timely manner.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7">Conclusions and future work</head><p>This paper presents our research on knowledge-based trend mining, wherein the main contribution is our semi-formal model of a trend template. We showed that the implementation of the trend template in the form of a trend ontology allows for capturing the trend structure out of a test document set. Our experiments confirm that a knowledge-based approach for mining trends out of data allows for extended trend explanations. Currently we are comparing the trend ontology experiment results with the results from adapted K-Means clustering and LDA-based topic modeling algorithms applied on our test set.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Trend template-an abstract conceptualization</figDesc><graphic coords="3,149.80,54.00,312.40,240.90" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head></head><label></label><figDesc>trendonto:#Industry : 510 trendonto:#Investment : 468 trendonto:#Supplier : 422 trendonto:#AutomobilIndustry : 414</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Performance of shares in the first corpus (5,000 documents) by ontology based ranking and comparison with share indices in the time window July 2007 to July 2011.</figDesc><graphic coords="5,54.00,54.00,217.35,130.95" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_0">http://www.google.com/trends/ online accessed04-17-2013   </note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_1">The eruption of an Icelandic volcano in March 2010 that caused air travel chaos in Europe and revenue lost for the airlines http://www.volcanodiscovery.com/iceland/ eyjafjallajoekull.html online accessed04-17-2013  </note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_2">http://www.lehman.com/ online accessed04-17-2013   </note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This work has been partially supported by the "InnoProfile-Corporate Semantic Web" project funded by the German Federal Ministry of Education and Research (BMBF) and the BMBF Innovation Initiative for the New German Länder -Entrepreneurial Regions.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><surname>Currently</surname></persName>
		</author>
		<ptr target="http://www.finanzen.net" />
		<title level="m">the publishing process at Linguistic Data Consortium</title>
				<imprint>
			<date type="published" when="2012">May 2013. 04-25-2012 10. 04-25-2012 11. 04-25-2012 12. 04-25-2012 13. 04-25-2012 14. 04-25-2012 15. 04-25-2012 16. 04-30-2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Querying shapes of histories</title>
		<author>
			<persName><surname>Agrawal</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 21st VLDB</title>
				<meeting>the 21st VLDB</meeting>
		<imprint>
			<publisher>Morgan Kaufmann Publishers Inc</publisher>
			<date type="published" when="1995">1995. 1995</date>
			<biblScope unit="page" from="502" to="514" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">On-line new event detection and tracking</title>
		<author>
			<persName><forename type="first">Allan</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">SIGIR&apos;98: Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval</title>
				<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="1998">1998. 1998</date>
			<biblScope unit="page" from="37" to="45" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Topic Detection and Tracking. Event-based Information Organization</title>
		<author>
			<persName><forename type="first">;</forename><surname>Allan</surname></persName>
		</author>
		<author>
			<persName><surname>Han</surname></persName>
		</author>
		<author>
			<persName><forename type="first">;</forename><forename type="middle">J</forename><surname>Kamber</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Han</surname></persName>
		</author>
		<author>
			<persName><surname>Kamber</surname></persName>
		</author>
		<author>
			<persName><surname>Kontostathis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 6 th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Workshop on Text Mining</title>
				<editor>
			<persName><forename type="first">James</forename><surname>Allan</surname></persName>
		</editor>
		<meeting>the 6 th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Workshop on Text Mining</meeting>
		<imprint>
			<publisher>AAAI Press</publisher>
			<date type="published" when="1997">2002. 2002. 2006. 2006. 2003. 2003. 2000. 2000. 1997. 1997</date>
			<biblScope unit="page" from="227" to="230" />
		</imprint>
	</monogr>
	<note>Proceedings of the KDD&apos;97</note>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Language, Text, and Knowledge. Mental Models of Expert Communication</title>
		<author>
			<persName><forename type="first">Lita</forename><surname>Lundquist</surname></persName>
		</author>
		<editor>Robert J. Jarvella Lita Lundquist</editor>
		<imprint>
			<date type="published" when="2000">2000. 2000</date>
			<publisher>De Gruyter</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Toas intelligence mining; analysis of natural language processing and computational linguistics</title>
		<author>
			<persName><forename type="first">;</forename><surname>Mitsa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Allan</forename><forename type="middle">;</forename><surname>Swan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Russell</forename><surname>Swan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">James</forename><surname>Allan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">;</forename><surname>Swan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Russel</forename><surname>Swan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">David</forename><surname>Jensen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">;</forename><surname>Wang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">PKDD &apos;97: Proceedings of the First European Symposium on Principles of Data Mining and Knowledge Discovery</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<editor>
			<persName><forename type="first">Ian</forename><forename type="middle">H</forename><surname>Witten</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Eibe</surname></persName>
		</editor>
		<meeting><address><addrLine>Berlin</addrLine></address></meeting>
		<imprint>
			<publisher>Morgan Kaufmann Publishers Inc</publisher>
			<date type="published" when="1997">2010. 2010. 1999. 1999. 2000. 2000. 2008. 2005. 2005. 1997. 1997. 2005. 2005</date>
			<biblScope unit="volume">3516</biblScope>
			<biblScope unit="page" from="323" to="334" />
		</imprint>
	</monogr>
	<note>Data Mining Concepts and Techniques</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
