<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Analyzing Semantic Interoperability in Bioinformatic Database Networks</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Philippe</forename><surname>Cudré-Mauroux</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">School of Computer and Communication Sciences</orgName>
								<orgName type="institution">EPFL</orgName>
								<address>
									<country key="CH">Switzerland</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Julien</forename><surname>Gaugaz</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">School of Computer and Communication Sciences</orgName>
								<orgName type="institution">EPFL</orgName>
								<address>
									<country key="CH">Switzerland</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Adriana</forename><surname>Budura</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">School of Computer and Communication Sciences</orgName>
								<orgName type="institution">EPFL</orgName>
								<address>
									<country key="CH">Switzerland</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Karl</forename><surname>Aberer</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">School of Computer and Communication Sciences</orgName>
								<orgName type="institution">EPFL</orgName>
								<address>
									<country key="CH">Switzerland</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Analyzing Semantic Interoperability in Bioinformatic Database Networks</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">86EE02CD3B3511403ECFFEDB596EEB6A</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T04:00+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>We consider the problem of analytically evaluating semantic interoperability in large-scale networks of schemas interconnected through pairwise schema mappings. Our heuristics are based on a graphtheoretic framework capturing important statistical properties of the graphs. We validate our heuristics on a real collection of interconnected bioinformatic databases registered with the Sequence Retrieval System (SRS). Furthermore, we derive and provide experimental evaluations of query propagation on weighted semantic networks, where weights model the quality of the various schema mappings in the network.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Even if Semantic Web technologies have recently gained momentum, their deployment on the wide-scale Internet is still in its infancy. Only a very small portion of websites have so far been enriched with machine-processable data encoded in RDF or OWL. Thus, the difficulty to analyze semantic networks due to the very lack of realistic data one can gather about them. In <ref type="bibr" target="#b4">[5]</ref>, we introduced a graph-theoretic model to analyze interoperability of semantic networks and tested our heuristics on large-scale, random topologies. In this paper, we extend these heuristics and test them on a real semantic network, namely on a collection of schemas registered with the Sequence Retrieval System (SRS).</p><p>We start below by giving a short introduction to SRS. We then present our approach, which boils down to an analysis of the component sizes in a graph of schemas interconnected through schema mappings. We report on the statistical properties of the SRS network we consider and on the performance of our heuristics applied on this network. Finally, we report on the performance of our approach on larger and weighted networks mimicking the statistical properties of the SRS network.</p><p>The work presented in this paper was supported (in part) by the National Competence Center in Research on Mobile Information and Communication Systems (NCCR-MICS), a center supported by the Swiss National Science Foundation under grant number 5005-67322. <ref type="bibr" target="#b1">2</ref> The Sequence Retrieval System (SRS) SRS, short for "Sequence Retrieval System", is a commercial information indexing and retrieval system designed for bioinformatic libraries such as the EMBL nucleotide sequence databank, the SwissProt protein sequence databank or the Prosite library of protein subsequence consensus patterns. It is a distributed, interoperable data management system which was initially developed at the European Molecular Biology Laboratory in Heidelberg, and which allows the querying of one or several databases simultaneously, regardless of their format or schemas.</p><p>Administrators wishing to register new databases with SRS first have to define the schema they have adopted to store data, using a custom language called Icarus. Once their schemas have been defined, administrators can import schema instances (i.e., text files) whose data will be correctly parsed, indexed and processed thanks to the corresponding schema definitions. Additionally, administrators can manually define relationships between their database schema and similar schemas. In SRS, these relationships are represented as links relating one entry of a database schema to one entry of another schema. Thanks to this structure of links between databases, users can propagate queries they pose locally against one specific schema to other schemas available in the system (for technical details, we refer the interested reader to http://www.lionbioscience.com/ )</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Graph analysis of an SRS repository</head><p>Conceptually, the model described above is very close to what one could expect from a subgraph of the semantic web itself, i.e., a collection of related schemas (or ontologies) linked one to another through pairwise mappings. The graph which can be extracted from a SRS repository has two main advantages over those which can be built from current RDFS / OWL repositories: i) it is based on a real-world collection of schemas which are being used on a daily basis by numerous independent parties and ii) it is of a reasonable size (several hundreds of semantically related complex schemas). Thus, after having been rather unsuccessful at finding reasonable semantic networks from the Semantic Web itself, we decided to build a specialized crawler to analyze the semantic graph of an SRS repository and to test our heuristics on the resulting network.</p><p>We chose to analyze the semantic network from the European Bioinformatics Institute SRS repository, publicly available at http://srs.ebi.ac.uk/. We built a custom crawler which traverses the entire network of databases and extracts schema mapping links stored in the schema definition files. The discussion below is based on the state of the SRS repository as of May 2005.</p><p>The graph resulting from our crawling process is an undirected graph of 388 nodes (database schemas) and 518 edges (pairwise schema mappings). We chose to represent links as undirected edges since they are used in both directions by SRS (they basically represent cross-references between two entries of two database schemas). We identified all connected components in the graph (two nodes are in the same connected component if there is a path from one to the other following edges). The analysis revealed one giant connected component (i.e., a relatively large set of interconnected schemas) of 187 nodes, which represent roughly half of the nodes and 498 edges. Besides the giant connected component, the graph also has two smaller components, each consisting of two vertices. The rest of the nodes are isolated, representing mostly result databases or databases for which no link to other databases was defined.</p><p>The average degree of the nodes is 2.2 for the whole graph and 4.6 for the giant component. Real networks differ from random graphs in that often their degree distribution follows a power law, or has a power law tail, while random graphs have a Poisson distribution of degrees <ref type="bibr" target="#b1">[2]</ref>. Unsurprisingly, our semantic network is no exception as can be seen in Figure <ref type="figure" target="#fig_1">1</ref> below, which depicts an accurate approximation of the degree distribution of our network by a powerlaw distribution y(x) = αx −γ with α = 0.21 and γ = 1.51.  Another interesting property which we explored was the tendency of the schemas to form clusters, quantified by the average clustering coefficient. Intuitively, the clustering coefficient of a vertex measures the degree to which its neighbors are neighbors of each other. More precisely, the clustering coefficient indicates the ratio of existing edges connecting a node's neighbors to each other to the maximum possible number of such edges. The network we considered has a high average clustering coefficient of 0.32 for the whole graph and of 0.54 for the giant component. The diameter (maximum shortest paths between any two vertices) of the giant connected component is 9. These data indicate that our network can be characterized as scale-free (power-law distribution of degrees) or small-world (small diameter, high clustering coefficient).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Analyzing semantic interoperability in the large</head><p>In <ref type="bibr" target="#b4">[5]</ref>, we introduced a graph-theoretic framework for analyzing semantic interoperability in large-scale networks. As described above, we model database schemas (or ontologies) as nodes, interconnected by edges (schema mappings). Schema mappings are used to iteratively propagate queries posed against a local schema to other related schemas (see <ref type="bibr" target="#b0">[1]</ref> for how this can be implemented in practice). Note that links can be directed or undirected, weighted or non-weighted depending on the schema mappings being used.</p><p>In such a network, the density of mappings is important in order to propagate a local query from one database (schema) to the other databases. A query can only be propagated to all databases if the semantic network is connected, that is if there exists a path from one schema to any other schema following schema mapping links. If some schemas are isolated, queries cannot be propagated to/from the rest of the graph, thus making it impossible to have a semantically interoperable network of databases. This observation motivated us to take advantage of percolation theory to determine when a semantic network could be connected or not. Our framework for analyzing semantic interoperability takes advantage of generatingfunctionologic <ref type="bibr" target="#b6">[7]</ref> functions for the degree distribution of the semantic graph:</p><formula xml:id="formula_0">G 0 (x) = ∞ k=0 p k x k<label>(1)</label></formula><p>where p k represents the probability that a randomly chosen vertex has degree k.</p><p>We showed (by extending results from <ref type="bibr" target="#b5">[6]</ref>) that a network cannot be semantically interoperable in the large unless the connectivity indicator ci = k k(k−2−cc)p k is greater than zero, with cc representing the clustering coefficient. Also, we provided heuristics for estimating the relative size S of the biggest semantically interoperable cluster of schemas by solving</p><formula xml:id="formula_1">S = 1 − G 0 (u), (<label>2</label></formula><formula xml:id="formula_2">)</formula><p>where u is the smallest non-negative real solution of</p><formula xml:id="formula_3">u = G 1 (u)<label>(3)</label></formula><p>and G 1 (u), the distribution of outgoing edges from first to second-order neighbors, is</p><formula xml:id="formula_4">G 1 (x) = 1 x cc G 0 (x) G 0 (1) = 1 z 1 1 x cc G 0 (x).<label>(4)</label></formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Applying our heuristics to the SRS graph</head><p>We applied our heuristics to the SRS graph we obtained from the crawling process. The results are as follows: we get a connectivity indicator ci of 25.4, indicating that the semantic network is clearly in a super-critical state and that a giant component interlinking most of the databases has appeared. The size of this giant component as estimated by our heuristics (see above) is of 0.47, meaning that 47% of the schemas are part of the giant connected component. This is surprisingly close to the real value of 0.48 as observed in the graph.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Generating a Graph with a given Power-Law Degree Distribution</head><p>Going slightly further, we want to analyze the dynamics of semantic graphs with varying numbers of edges. Our aim is to generate graphs with the same statistical properties as the SRS graphs, that is, graphs following a power-law degree distribution:</p><formula xml:id="formula_5">P (k) = αk −γ<label>(5)</label></formula><p>but with a varying number of edges. We take from <ref type="bibr" target="#b2">[3]</ref> a graph-building algorithm yielding a power-law degree distribution with a given exponent γ. It goes as following: (1) create a (large) number N of vertices. (2) to each vertex i, assign an "importance" x i , which is a real number taken from a distribution ρ(x). (3) for each pair of vertices, draw a link with probability f (x i , x j ), which is function of the importance of both vertices. Now if f (x i , x j ) = (x i x j /x 2 M ) (where x M is the largest value of x in the graph), we know from <ref type="bibr" target="#b2">[3]</ref> that the degree distribution of a graph will be</p><formula xml:id="formula_6">P (k) = x 2 M N x ρ x 2 M N x k<label>(6)</label></formula><p>where x is the expected value of the importance x, such that P (k) follows a power-law if ρ(x) does so.</p><p>We then choose a power-law distribution</p><formula xml:id="formula_7">ρ(x) = γ − 1 (m 1−γ − Q 1−γ ) x −γ<label>(7)</label></formula><p>defined over the interval [m, Q]. However, we still have to find values for m and Q such that the scale of the resulting degree distribution equals α. Using equation 7, we find the expected importance value as</p><formula xml:id="formula_8">x = (γ − 1)(m 2 Q γ − m γ Q 2 ) (γ − 2)(mQ γ − m γ Q) .<label>(8)</label></formula><p>Replacing ρ(x) in equation 6, the degree distribution of the resulting graph becomes</p><formula xml:id="formula_9">P (k) = x 2 M N x γ − 1 (m 1−γ − Q 1−γ ) x 2 M N x k −γ<label>(9)</label></formula><p>such that, equating with equation 5, we get</p><formula xml:id="formula_10">α = x 2 M N x γ − 1 (m 1−γ − Q 1−γ ) x 2 M N x −γ . (<label>10</label></formula><formula xml:id="formula_11">)</formula><p>We can then arbitrarily choose m &gt; 0 and find Q by numerical approximation, since the right-hand side of equation 10 is defined and continue for values of Q &gt; m.</p><p>Figures <ref type="figure">2 and 3</ref> show the results of our heuristics on networks of respectively 388 (i.e., mimicking the original SRS graph) and 3880 edges (i.e., 10 times bigger than the original SRS graph but with the same statistical properties) constructed in the way presented above with a varying number of edges. The curves are averaged over 50 consecutive runs. As for the original SRS network, we see that we can accurately predict the size of the giant semantic component, even for very dense graphs. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Connectivity Indicator and Giant Component Size in Weighted Graphs</head><p>So far, we only analyzed the presence and size of a giant connected component in order to determine which portion of a semantic network could potentially be semantically interoperable. In large-scale decentralized networks, however, one should not only look into the giant semantic component itself, but also analyze the quality of the mappings used to propagate queries from one schema to the other (see <ref type="bibr" target="#b0">[1]</ref> for a discussion on that topic). Indeed, in any large, decentralized network, it is very unlikely that all schema mappings could correctly map queries from one schema to the other, because of the lack of expressivity of the mapping languages, and of the fact that some (most?) of the mappings might be generated automatically. Thus, as considered by more and more semantic query routing algorithms, we introduce weights for the schema mappings to capture the quality of a given mapping. Weights range from zero (indicating a really poor mapping unable to semantically keep any information while translating the query) to one (for perfect mappings, keeping the semantics of the query intact from one schema to the other). We then iteratively forward a query posed against a specific schema to other schemas through schema mappings if and only if a given mapping has a weight (i.e., quality) greater than a predefined threshold τ . τ = 0 corresponds to sending the query through any schema mapping, irrespective of its quality. On the contrary, when we set τ to one, the query gets only propagated to semantically perfect mappings, while even slightly faulty mappings are ignored. Previous works in statistical physics and graph theory have looked into percolation for weighted graphs. We present hereafter an extension of our heuristics for weighted semantic networks inspired by <ref type="bibr" target="#b3">[4]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">Connectivity Indicator</head><p>As before, we consider a generating function for the degree distribution</p><formula xml:id="formula_12">G 0 (x) = ∞ k=0 p k x k (11)</formula><p>where p k is the probability that a randomly chosen vertex has degree k in the network. We then define t jk as the probability that an edge has a weight above τ given that it binds vertices of degree j and k. Thus, w k = ∞ j=0 t jk is the probability that an edge transmits, given that it is attached to a vertex of degree k. The generating function for the probability that a vertex we arrive at by following a randomly chosen edge is of degree k and transmits is</p><formula xml:id="formula_13">G 1 (x) = ∞ k=0 w k kx k−1 x cc ∞ k=0 kp k (<label>12</label></formula><formula xml:id="formula_14">)</formula><p>where cc is the clustering coefficient. Now, from <ref type="bibr" target="#b3">[4]</ref>, we know that the generating function for the probability that one end of a randomly chosen edge on the graph leads to a percolation cluster of a given number of vertices is</p><formula xml:id="formula_15">H 1 (x) = 1 − G 1 (1) + xG 1 [H 1 (x)] .<label>(13)</label></formula><p>Similarly, the generating function for the probability that a randomly chosen vertex belongs to a percolation cluster of a given number of vertices is</p><formula xml:id="formula_16">H 0 (x) = 1 − G 0 + xG 0 [H 1 (x)]<label>(14)</label></formula><p>such that the mean component size corresponding to a randomly chosen vertex is</p><formula xml:id="formula_17">s = H 0 (1) = G 0 (1) + G 0 (1)G 1 (1) 1 − G 1 (1)<label>(15)</label></formula><p>which diverges for G 1 (1) ≥ 1. However,</p><formula xml:id="formula_18">G 1 (1) = ∞ k=0 w k p k k(k − 1 − cc) ∞ k=0 kp k (16) such that a giant connected component appears if ci = ∞ k=0 kp k [w k (k − 1 − cc) − 1] ≥ 0 (17)</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">Giant Component Size</head><p>As seen above, H 0 (x) represents the distribution for the cluster size which a randomly chosen vertex belongs to, excluding the giant component. Thus, according to <ref type="bibr" target="#b3">[4]</ref>, H 0 (1) is equal to the fraction of the nodes which are not in the giant component. The fraction of the nodes which are in the giant component is hence S = 1 − H 0 (1). Using equation 14 we can write</p><formula xml:id="formula_19">S = 1 − H 0 (1) = G 0 (1) − G 0 [H 1 (1)] .<label>(18)</label></formula><p>with <ref type="bibr" target="#b0">1)</ref>]. Thus H 1 (1) = u where u is the smallest non-negative solution of</p><formula xml:id="formula_20">H 1 (1) = 1 − G 1 (1) + xG 1 [H 1<label>(</label></formula><formula xml:id="formula_21">u = 1 − G 1 (1) + G 1 (u).<label>(19)</label></formula><p>The relative size of the giant component reached by the query in a weighted semantic graph follows as</p><formula xml:id="formula_22">S = G 0 (1) − G 0 (u). (<label>20</label></formula><formula xml:id="formula_23">)</formula><p>Figures <ref type="figure" target="#fig_4">4 and 5</ref> show the results of our heuristics on weighted networks of respectively 388 and 3880 nodes, for a varying number of edges and various values of τ . The curves are averaged over 50 consecutive runs, and the weights of individual schema mappings are randomly generated using a uniform distribution ranging from zero to one. We see that our heuristics can quite adequately predict the relative size of the graph to which a given query will be forwarded. Also, as for the unweighted analysis, we observe similar behaviors for the two graphs; This is rather unsurprising as we are dealing with scale-free networks whose properties are basically independent of their size.  In this paper, we tested graph-theoretic heuristics to evaluate semantic interoperability on a real semantic network. The results confirm the validity of our heuristics beyond our initial hopes as we could predict quite accurately the size of the giant semantic component in the network. Also, we extended our analysis to apply our heuristics on larger networks enjoying similar statistical properties and on weighted semantic networks. It was for us quite important to test our heuristics using real-world data, as semantic network analyses mostly consider artificial networks today, due to the current lack of semantically enriched websites or deployed semantic infrastructures. In the future, we plan to extend our analyses to take into account the dynamicity (churn) of the network of schema mappings, and to consider more accurate query forwarding schemes based on transitive closures of mapping operations.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: An approximation of the degree distribution of our semantic network by a power-law distribution y(x) = αx −γ with α = 0.21 and γ = 1.51</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 2 :Figure 3 :</head><label>23</label><figDesc>Figure 2: Estimating the giant component size of a scale-free semantic network of 388 nodes with a varying number of edges</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: Fraction of the graph a local query will be forwarded to, for a weighted network of 388 nodes with a varying number of edges and various forwarding thresholds τ</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 5 :</head><label>5</label><figDesc>Figure 5: Fraction of the graph a local query will be forwarded to, for a weighted network of 3880 nodes with a varying number of edges and various forwarding thresholds τ</figDesc></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">The Chatty Web: Emergent Semantics Through Gossiping</title>
		<author>
			<persName><forename type="first">K</forename><surname>Aberer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Cudré-Mauroux</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hauswirth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International World Wide Web Conference (WWW)</title>
				<imprint>
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Statistical mechanics of complex networks</title>
		<author>
			<persName><forename type="first">R</forename><surname>Albert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A.-L</forename><surname>Barabási</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Reviews of Modern Physics</title>
		<imprint>
			<biblScope unit="volume">74</biblScope>
			<biblScope unit="issue">47</biblScope>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Scale-free networks from varying vertex intrinsic fitness</title>
		<author>
			<persName><forename type="first">G</forename><surname>Caldarelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Capocci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">D L</forename><surname>Rios</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Muoz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Phys. Rev. Lett</title>
		<imprint>
			<biblScope unit="volume">89</biblScope>
			<biblScope unit="page">258702</biblScope>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Network robustness and fragility: Percolation on random graphs</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">S</forename><surname>Callaway</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Newmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">H</forename><surname>Strogatz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">J</forename><surname>Watts</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Phys. Rev. Lett</title>
		<imprint>
			<biblScope unit="volume">85</biblScope>
			<biblScope unit="page">54685471</biblScope>
			<date type="published" when="2000">2000</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">A Necessary Condition For Semantic Interoperability In The Large</title>
		<author>
			<persName><forename type="first">P</forename><surname>Cudré-Mauroux</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Aberer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference Ontologies, DataBases, and Applications of Semantics (ODBASE)</title>
				<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Random graphs with arbitrary degree distributions and their applications</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">E J</forename><surname>Newman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">H</forename><surname>Strogatz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">J</forename><surname>Watts</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Phys. Rev</title>
		<imprint>
			<biblScope unit="volume">64</biblScope>
			<biblScope unit="page">26118</biblScope>
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">S</forename><surname>Wilf</surname></persName>
		</author>
		<title level="m">Generatingfunctionology</title>
				<meeting><address><addrLine>London</addrLine></address></meeting>
		<imprint>
			<publisher>Academic Press</publisher>
			<date type="published" when="1994">1994</date>
		</imprint>
	</monogr>
	<note>2nd Edition</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
