<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Determining Patient Similarity in Medical Social Networks</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Sebastian</forename><surname>Klenk</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Stuttgart University Intelligent Systems Group</orgName>
								<address>
									<addrLine>Universitätsstrasse 38</addrLine>
									<postCode>70569</postCode>
									<settlement>Stuttgart</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Jürgen</forename><surname>Dippon</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Stuttgart University Intelligent Systems Group</orgName>
								<address>
									<addrLine>Universitätsstrasse 38</addrLine>
									<postCode>70569</postCode>
									<settlement>Stuttgart</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Peter</forename><surname>Fritz</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Stuttgart University Intelligent Systems Group</orgName>
								<address>
									<addrLine>Universitätsstrasse 38</addrLine>
									<postCode>70569</postCode>
									<settlement>Stuttgart</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Gunther</forename><surname>Heidemann</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Stuttgart University Intelligent Systems Group</orgName>
								<address>
									<addrLine>Universitätsstrasse 38</addrLine>
									<postCode>70569</postCode>
									<settlement>Stuttgart</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Determining Patient Similarity in Medical Social Networks</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">225A64CFC7F7373EA5160D067B05F125</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-19T15:35+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In social networks the primary concern of people is to find others who share similar interests. For medical systems this means finding people who have similar symptoms or comparable diseases. Here a simple matching of variables would lead to a very small number of identical cases and determining similarity would usually fail due to the categorical nature of most factors. In particular, such problems arise for cancer patients. We have developed a system that is capable of determining similarity in terms of the survival time distribution. By a similarity based search our approach allows to determine related patients. Thus recommendations for contacts of interest become possible. We will present the theoretical foundation as well as a use case scenario with an existing data mining software.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Finding "patients like me" is a big issue for people suffering from severe illness. Today, this problem is addressed by the medical social network with identical name <ref type="foot" target="#foot_0">1</ref> , and by organizations such as the german ACHSE<ref type="foot" target="#foot_1">2</ref> or the european Eurordis<ref type="foot" target="#foot_2">3</ref> , which represent the common interests of patients and have brought together people with similar diseases successfully for several years now.</p><p>The goal of most medical social web sites is to provide a forum and a more direct way for patients to exchange thoughts, feelings, and experiences. Therefore the search for other people with a similar disease history and similar symptoms is crucial. For this purpose, patient profiles are presented which share a large number of similarities, just like in other social networks. However, defining such similarities for patient profiles is significantly more difficult than for other types of social networks. Different aspects of a disease have to be weighted differently, so a simple matching of factors is insufficient.</p><p>We have developed a similarity measure for cancer patients which calculates influence values for factor levels and thereby facilitates a soft matching. This means that different aspects are also weighted differently. For example, the fact that two cancer patients have developed metastasis is weighted much higher than similar age. This leads to a domain specific matching and provides better recommendations on who might have had similar experiences or who might have knowledge a user can benefit from. Finding relationships of this kind is the very basis of social media.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Related work</head><p>An important part of social networking research <ref type="bibr" target="#b16">[17]</ref> is on recommender systems <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b9">10,</ref><ref type="bibr" target="#b7">8,</ref><ref type="bibr" target="#b15">16]</ref>. These are systems that recommend certain items to the user, usually products, but also people one might want to know. As this is particularly interesting for e-commerce applications, most research is on suggesting new products.</p><p>For recommending people, there are two common approaches: (i) Content based recommendation which uses the information the user enters into the social network application, whereas (ii) relationship based recommendation traces who are the friends of the users friends, which the user might want to meet. Chen et. al. <ref type="bibr" target="#b1">[2]</ref> provide an overview on both fields and perform a comparative study. Their results are mostly in favor of the relationship based approach, whereas they argue that similarity in content is so far calculated by keyword matching, which is just not sufficient. An example for a relationship based method is the work of Lin et. al. <ref type="bibr" target="#b11">[12,</ref><ref type="bibr" target="#b6">7]</ref>, who deal with the problem of matching people in the context of searching for experts. They combine a graph based approach with a matching of search terms with profile terms which yields good results. But in the case of medical data an approach of this kind would lead to insignificant results because term matching does not reflect the true difference of the underling objects. Here more detailed domain knowledge is be required to determine term weightings. As stated by both Felfering et. al. <ref type="bibr" target="#b7">[8]</ref> and Volinsky <ref type="bibr" target="#b15">[16]</ref>, deep domain knowledge is so far not used excessively in recommender systems.</p><p>It is obvious that content based recommendation is, at least in principle, superior to relationship based recommendation, as it would allow to explore the entire network rather than just the subset a user is connected to. We therefore aim at improving content based recommendation by making an interpretation of the given content feasible.</p><p>Another important aspect of a weighted content based approach is security. Such a system is less likely to be subject to fraud or spamming as described by Mobasher et. al <ref type="bibr" target="#b12">[13]</ref>.</p><p>Apart from recommender systems, distance learning has a long history in the area of case based reasoning <ref type="bibr" target="#b13">[14]</ref>. Learning distance measures facilitates a context sensitive estimation of similar cases. Arshadi and Jurisica <ref type="bibr" target="#b0">[1]</ref> employ logistic regression to estimate a distance measure which gives relevance to certain aspects of the data. The method we describe here differs from the one proposed by Arshadi and Jurisica, as it allows for continuous dependent data which can even be censored, a feature that is crucial for medical data.</p><p>The distance measure we are using here is based on an idea proposed in <ref type="bibr" target="#b5">[6]</ref>, which has been extended and implemented in the medical data mining system OCDM <ref type="bibr" target="#b10">[11]</ref>. In the present paper we present a new application of this idea in the context of medical social networks.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Similarity for patient data</head><p>Measuring the similarity of natural continuous data items is very much straight forward. Every data dimension has the same weight and differences between dimensions can be interpreted in a very intuitive way. For categorical and artificial data, as is the case for patient data, differences in variables are anything else but intuitive and the weighting varies with each dimension. Formally speaking for two data items x and y a distance looks as follows:</p><formula xml:id="formula_0">d(x, y) = n k=1 α k d k (x k , y k ).<label>(1)</label></formula><p>Here α = (α i ) i=1...n is a weighting term that is assigned to each dimension and corresponds to its influence on the similarity. When working with lung cancer patient data for example it makes a huge difference whether the patient smokes or not but the area he or she lives in is of minor importance. Therefore similarities for smoker (yes or no) should have higher α values than for similarity in zip code. Besides the weighting factor there is also the functions d k which could be the absolute, the squared or the binary distance</p><formula xml:id="formula_1">d k (x, y) = 1 if x = y else d(x, y) = 0 depending on the dimension k.</formula><p>Determining a suitable weighting is essential to finding a good similarity measure. Therefore it is necessary to have a method at hands to calculate such a weighting. The central idea to the similarity measure learning approach we have taken, is to have a linear relation between a number of independent and one dependent variable that can be estimated and used as a weighting scheme. An ideal candidate to estimate such a scheme is the logistic regression <ref type="bibr" target="#b8">[9]</ref>. This is a supervised learning scheme that, based on training data, estimates the influence a given set of independent variables has on a dependent variable.</p><p>Formally it calculates the probability of a variable G having a certain value g given the information contained in all the other variables X = x</p><formula xml:id="formula_2">P (G = g|X = x) = exp(β T g x) 1 + g ′ ∈G exp(β T g ′ x)</formula><p>.</p><p>(</p><formula xml:id="formula_3">)<label>2</label></formula><p>This formula gives us the influence each element x i of x has on the outcome g of G.</p><p>Here the weight vector β represents this information. Equation ( <ref type="formula" target="#formula_3">2</ref>) can be used to model this influence for discrete data, for continuous and censored dependent variables, Cox has developed a method to calculate β <ref type="bibr" target="#b2">[3]</ref>. The central thought of his work is that the function h(t|x) can be described as</p><formula xml:id="formula_4">h(t|x) = h 0 (t) • P (h = h 0 |X = x),<label>(3)</label></formula><p>where h 0 (t) is unknown. This leads to</p><formula xml:id="formula_5">h 0 (t) • exp(β T x). (<label>4</label></formula><formula xml:id="formula_6">)</formula><p>What is actually estimated in ( <ref type="formula" target="#formula_4">3</ref>) and ( <ref type="formula" target="#formula_5">4</ref>) is the distribution function of the survival times. It is based on an unknown baseline hazard function that determines the risk of a patient at a certain moment. The formula in ( <ref type="formula" target="#formula_4">3</ref>) is known as Cox proportional hazard regression or just Cox regression and is mostly used in survival analysis <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b14">15]</ref>. The actual estimation of these parameters takes place with a Newton-Raphson based method. Therefore the partial log likelihood function (for the parameter β over a training set) is maximized.</p><p>Given the influence information β out of (3), it is easy to develop a distance measure that is sensitive to the relevant aspects of the data concerning the variable for which the estimator was trained. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Patient recommendations by regression estimation</head><p>Recommendations of other people in a social network is a central theme of social applications (see also Figure <ref type="figure">3</ref> for examples). In the above section we have described how regression estimation can lead to a weighting of variables and thereby allow for the calculation of specific distance measures. Here we will describe how such a measure can be used to determine other people in the social network that one might want to know.</p><p>A social network application consist of a large database containing information on the people belonging to it. The information was entered by the people themselfs and may therefor contain only certain aspects of their profile. To determine other people with similar views it is necessary to calculate a distance measure as described above. Given a database with sample cases (the training data) one is able to estimate the weighting parameter β and apply it to a distance measure of the form:</p><formula xml:id="formula_7">d(x, y) = n k=1 α k d k (x k , y k ).</formula><p>Here α = (α i ) i=1...n with α i = exp σ•β and σ being a scaling factor to match the influence of the weighting on the distance measure. The measure itself could be the squared distance d k (x, x) = ||x − x|| 2 or simply the absolute distance. The scaling factor it self can be chosen to suite the needs of the recommendation, should the influence of the independent variables on the survival be weighted more heavily a value of σ &gt;&gt; 1 should be selected, in any other case σ ≤ 1 is a good choice. Now if this measure is applied to all people in the database one obtains a partially ordered list where the first few profiles can be used as recommendations.</p><p>To reduced computational load one could restrict the number of computations by only considering profiles that share a least amount of common fields. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Implementation</head><p>We have implemented the similarity distance measure in our data mining software OCDM <ref type="bibr" target="#b10">[11]</ref>, where similar patients are found for a given patient profile. This system, although intended for physicians, recommends similar profiles for a further study. For a patient an identical approach could lead to a recommender system as described above. In this section we are describing technical details about the similarity search. We will thereby concentrate on rather generic technical aspects, further details about the actual implementation of the similarity search can be found in <ref type="bibr" target="#b10">[11]</ref>. As basis for the developed system serves a Post-greSQL Database Server and a Java-Tomcat Servlet-Engine. As performance is a critical aspect of the software and much calculation has to be done during the estimation of the distance measure (on one hand the calculation of the weights and on the other the similarity calculation when recommending other profiles) we didn't follow a strict layer separation. Some tasks that involved extensive data processing were developed as stored procedures that run inside the database process. Most of the heavy-load calculation was thereby separated from the middleware and the GUI. As some of the calculation procedures are needed in the stored procedures and in the business logic we implemented these as Java classes such that they could be used in PL/Java code in the database as well as plain Java objects in the application server. We did some experiments with the similarity based distance we have developed and thereby achieved results comparable to that of common SQL queries. We measured the time it took for the database server to return results. For a data set of roughly 15.000 cases the database returned the select data on average after 10 milliseconds whereas the similarity based search took 35 milliseconds. These results can be placed in context when looking at the time it takes to process a simple SELECT statement with a function term (adding a constant to a column value) or a SELECT statement with an aggregate (calculating the average of a column value). The results are summarized in Table <ref type="table" target="#tab_0">1</ref>. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Discussion</head><p>We have presented a domain specific distance measure for medical social networks. It is not intended to be generally applicable to the broad audience of medical social networks, rather, it allows certain groups of patients to obtain better recommendations. If it is known that a user suffers, e.g., from a certain cancer type, search for other network members is focused and directed by criteria specific to this disease. The weighting in the actually calculated distance measure (1) can be easily adapted to a particular user group. Another important aspect of the above described distance measure is that it is solely focused on the survival time and does not include other possibly relevant aspects such as regional proximity or corresponding interests. In our experience, this restriction led to the best results. However, the restriction can be easily removed to include combinations of different weighting schemes. For two given weighting vectors α 1 and α 2 it is easy to combine them to a new weighting scheme α * by just summing up corresponding normalized elements</p><formula xml:id="formula_8">α * i = 1 2 • ||α 1 || α 1 i + 1 2 • ||α 2 || α 2 i .</formula><p>Data coding and treatment of missing values are important issues, because not every user will conform to standardized nomenclature to describe his or her disease, and likewise, many users will not present all their information in a social network. Both data coding and missing values have significant influence on distance estimation. Missing values can already be handled by the parameter estimation procedure and the distance measure itself as well. So the remaining problem is the lack of a formal notation. This, of course, could dramatically decrease the efficiency of the training process (if it is based on the data in the network). However, social networks have grown at such pace in the recent years that it is still highly likely to find a sufficient number of "good" training samples, even if data with unclear values have to be omitted. When it comes to proximity calculation, informal and varying notation could be handled in such a way that only those variable values that match certain criteria are considered for calculation, while all others are treated as missing values.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Conclusion</head><p>We have presented a method to calculate similarities of patient profiles for recommending people to other members in a social network. As connecting to other people is the central aspect of medical social networks, a subject specific similarity search can increase the performance of recommendations and thereby increase the usefulness of the social network application dramatically. In addition to presenting the theoretical foundation we also have given insight into some implementation details as well as performance measures. These show comparable results to more complex SQL queries and can serve as a guideline when implementing a similar approach in a real world application. As the method we have presented is highly subject specific, i.e., dependent on the estimation of survival time data, it might be interesting to see further research on other medical data that might be less dependent on a time to event. Further the incorporation of social graph information seems to be promising.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. The recommendation of people in other social networks (on the left side Xing and on the right side Facebook)</figDesc><graphic coords="4,147.71,385.39,134.78,95.42" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. The presentation of similar patients in the OCDM system</figDesc><graphic coords="5,146.27,387.31,302.66,249.98" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Time until results are returned in milliseconds</figDesc><table><row><cell>Query Type</cell><cell>mean</cell><cell>std-err.</cell></row><row><cell>Simple SELECT</cell><cell>9.29</cell><cell>6.28</cell></row><row><cell>SELECT with function term</cell><cell>24.81</cell><cell>8.29</cell></row><row><cell>SELECT with aggregate</cell><cell>95.60</cell><cell>11.31</cell></row><row><cell>Similarity Search</cell><cell>35.09</cell><cell>10.50</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">PatientsLikeMe is a social networking health site with over 40,000 Members http://www.patientslikeme.com</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">The German Alliance for Rare Chronic Diseases http://www.achse-online.de</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">Eurordis -Rare Diseases Europe http://www.eurordis.org</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Data mining for case-based reasoning in highdimensional biological domains. Knowledge and Data Engineering</title>
		<author>
			<persName><forename type="first">N</forename><surname>Arshadi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Jurisica</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Transactions on</title>
		<imprint>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="issue">8</biblScope>
			<biblScope unit="page" from="1127" to="1137" />
			<date type="published" when="2005-08">Aug. 2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Make new friends, but keep the old: recommending people on social networking sites</title>
		<author>
			<persName><forename type="first">Jilin</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Werner</forename><surname>Geyer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Casey</forename><surname>Dugan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><surname>Muller</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ido</forename><surname>Guy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">CHI &apos;09: Proceedings of the 27th international conference on Human factors in computing systems</title>
				<meeting><address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="201" to="210" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Regression models and life-tables</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">R</forename><surname>Cox</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of the Royal Statistical Society Series B (Methodological)</title>
		<imprint>
			<biblScope unit="volume">34</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="187" to="220" />
			<date type="published" when="1972">1972</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Analysis of binary data</title>
		<author>
			<persName><forename type="first">David</forename><forename type="middle">R</forename><surname>Cox</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">J</forename><surname>Snell</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Monographs on statistics and applied probability</title>
				<meeting><address><addrLine>London</addrLine></address></meeting>
		<imprint>
			<publisher>Chapman and Hall</publisher>
			<date type="published" when="1989">1989</date>
			<biblScope unit="volume">32</biblScope>
		</imprint>
	</monogr>
	<note>2. edition</note>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Item-Based Top-N Recommendation Algorithms</title>
		<author>
			<persName><forename type="first">M</forename><surname>Deshpande</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Karypis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Transactions on Information Systems</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="143" to="177" />
			<date type="published" when="2004-01">January 2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">A statistical approach to case based reasoning, with application to breast cancer data</title>
		<author>
			<persName><forename type="first">J</forename><surname>Dippon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Fritz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kohler</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Comput. Stat. Data Anal</title>
		<imprint>
			<biblScope unit="volume">40</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="579" to="602" />
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Searching for experts in the enterprise: combining text and social network analysis</title>
		<author>
			<persName><forename type="first">Kate</forename><surname>Ehrlich</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ching-Yung</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Vicky</forename><surname>Griffiths-Fisher</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2007 international ACM conference on Supporting group work</title>
				<meeting>the 2007 international ACM conference on Supporting group work<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2007">2007</date>
			<biblScope unit="page" from="117" to="126" />
		</imprint>
	</monogr>
	<note>GROUP &apos;07</note>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Guest editors&apos; introduction: Recommender systems</title>
		<author>
			<persName><forename type="first">Alexander</forename><surname>Felfernig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gerhard</forename><surname>Friedrich</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lars</forename><surname>Schmidt-Thieme</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Intelligent Systems</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="page" from="18" to="21" />
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">The elements of statistical learning</title>
		<author>
			<persName><forename type="first">Trevor</forename><forename type="middle">J</forename><surname>Hastie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Robert</forename><forename type="middle">J</forename><surname>Tibshirani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jerome</forename><forename type="middle">H</forename><surname>Friedman</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2002">2002</date>
			<publisher>Springer</publisher>
		</imprint>
	</monogr>
	<note>corrected print. edition</note>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Recommendation framework for online social networks</title>
		<author>
			<persName><forename type="first">Przemys</forename><surname>Law</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kazienko</forename></persName>
		</author>
		<author>
			<persName><forename type="first">Katarzyna</forename><surname>Musia L</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Web Intelligence and Data Mining, Studies in Computational Intelligence</title>
				<imprint>
			<date type="published" when="2006">2006</date>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="page" from="111" to="120" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Interactive survival analysis with the ocdm system: From development to application</title>
		<author>
			<persName><forename type="first">S</forename><surname>Klenk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Dippon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Fritz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Heidemann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information Systems Frontiers</title>
		<imprint>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Smallblue: People mining for expertise search</title>
		<author>
			<persName><forename type="first">Ching-Yung</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kate</forename><surname>Ehrlich</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Vicky</forename><surname>Griffiths-Fisher</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Christopher</forename><surname>Desforges</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE MultiMedia</title>
		<imprint>
			<biblScope unit="volume">15</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="78" to="84" />
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Toward trustworthy recommender systems: An analysis of attack models and algorithm robustness</title>
		<author>
			<persName><forename type="first">Bamshad</forename><surname>Mobasher</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Robin</forename><surname>Burke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Runa</forename><surname>Bhaumik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chad</forename><surname>Williams</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Trans. Internet Technol</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page">23</biblScope>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Case-based reasoning on images and signals : with 30 tables</title>
		<author>
			<persName><forename type="first">Petra</forename><surname>Perner</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="s">Studies in computational intelligence</title>
		<imprint>
			<biblScope unit="volume">73</biblScope>
			<date type="published" when="2008">2008</date>
			<publisher>Springer</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Modern applied biostatistical methods using S-Plus</title>
		<author>
			<persName><forename type="first">Steve</forename><surname>Selvin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="s">Monographs in epidemiology and biostatistics</title>
		<imprint>
			<biblScope unit="volume">28</biblScope>
			<date type="published" when="1998">1998</date>
			<publisher>Oxford University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">Matrix factorization techniques for recommender systems</title>
		<author>
			<persName><forename type="first">Chris</forename><surname>Volinsky</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2009">2009</date>
			<biblScope unit="volume">42</biblScope>
			<biblScope unit="page" from="30" to="37" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Social networking</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">C</forename><surname>Weaver</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">B</forename><surname>Morrison</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computer</title>
		<imprint>
			<biblScope unit="volume">41</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="97" to="100" />
			<date type="published" when="2008-02">Feb. 2008</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
