<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Visualizing reconciliations in co-phylogeny (Extended Abstract)</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Tiziana</forename><surname>Calamoneri</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Computer Science Department</orgName>
								<orgName type="institution">University of Rome &quot;Sapienza&quot;</orgName>
								<address>
									<settlement>Rome</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Valentino</forename><forename type="middle">Di</forename><surname>Donato</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Engineering Department</orgName>
								<orgName type="institution">Roma Tre University</orgName>
								<address>
									<settlement>Rome</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Diego</forename><surname>Mariottini</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Engineering Department</orgName>
								<orgName type="institution">Roma Tre University</orgName>
								<address>
									<settlement>Rome</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Maurizio</forename><surname>Patrignani</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Engineering Department</orgName>
								<orgName type="institution">Roma Tre University</orgName>
								<address>
									<settlement>Rome</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Visualizing reconciliations in co-phylogeny (Extended Abstract)</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">8664971FFF702FA42CEDCC0AC518FC0D</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T07:54+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract/>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>An urgent need in the biological research field consists of unambiguously and effectively representing co-phylogenetic trees, that are pairs of phylogenetic trees that have a mapping among their nodes. The typical application is the investigation of the co-evolution of families of organisms such as hosts and parasites.</p><p>A phylogenetic tree is a full binary rooted tree representing the inferred evolutionary relationships among relative species. In order to be meaningful from a biological point of view, a phylogenetic tree is required to be a full binary tree (i.e., each node has zero or two children). The biologists who study the co-evolution of hosts and parasites start with a host phylogenetic tree H, a parasite phylogenetic tree P , and a mapping function ϕ from the leaves of P to the leaves of H and represent them as a tanglegram (i.e. a pair of plane trees whose leaves are connected by straight-line edges <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b3">4]</ref>). Triple (H, P, ϕ) is called co-phylogenetic tree and only represents the input of a more complicated analysis process that aims at computing a mapping γ, called reconciliation, that extends ϕ and maps all the parasite nodes onto the host nodes with constraints. Intuitively, a reconciliation is a reasonable reconstruction of the co-evolution events that produced the co-phylogenetic tree <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b5">6]</ref>.</p><p>Given a co-phylogenetic tree, a great number of different reconciliations are possible. Some of them can be easily discarded, since they are not consistent with time (i.e. they induce contradictory constraints on the periods of existence of the species associated to internal nodes). The remaining reconciliations are generally ranked based on some quality measure. Nevertheless, the number of plausible reconciliations is still so high and the difference between two equallyranked reconciliations may be so big that biologists have to perform a painstaking manual inspection to select the reconciliations that are more compatible with the two phylogenetic trees and with their understanding of the evolution phenomena.</p><p>In this context, the contributions of this paper are the following:</p><p>1. We propose a new and unambiguous metaphor to represent reconciliations of co-phylogenetic trees. The main idea is that of representing H in a suitable space-filling style and of using a traditional node-link style to represent P . 2. In order to pursue readability we study the number of crossings that are introduced in the drawing of tree P in order to respect the represented reconciliation; on the one hand, we characterize some "planar instances", i.e., triples H, P, ϕ , in which every reconciliation can be represented without crossings; on the other hand, we show that, in the general case and for all kinds of representations that comply with a restricted set of aesthetic criteria, reducing the number of crossings in the representation of the reconciliations is NP-hard.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">A new model for the visualization of reconciliations</head><p>While a decade-old literature is devoded to the visualization of co-phylogenetic trees as tanglegrams (see, for example, <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b3">4]</ref>), little can be said about the representation of the reconciliations of co-phylogenetic trees, except some visualization strategies adopted by the available tools for their computation.</p><p>In particular, few strategies can be highlighted. The simplest one, schematically represented in Fig. <ref type="figure" target="#fig_0">1</ref>(a), is that of representing the two trees by adopting the traditional node-link metaphor, where the nodes of P are drawn close to the nodes of H they are associated to. The advantage of this strategy lays in its simplicity. However, when several parasite nodes are associated to the same host node, the drawing becomes cluttered and the attribution of parasite nodes to host nodes becomes unclear. Further, even if tree P was drawn without crossings (tree H always is), the overlapping of the two trees produces a high number of crossings. An alternative strategy (Fig. <ref type="figure" target="#fig_0">1(b)</ref>) consists in representing H as a background shape, such that its nodes are light-colored circles and its edges are thick pipes, while P is contained into H and drawn in the traditional node-link style. This strategy is particularly effective, as it is unambiguous and crossings between the two trees are strongly reduced, but it is still cluttered when a parasite subtree has to be squeezed inside the reduced area of a host node.</p><p>To solve this cluttering problem some visualization tools adopt the strategy of keeping the containment metaphor while only drawing thick edges of H and omitting host nodes (Fig. <ref type="figure" target="#fig_0">1(c)</ref>). This produces a node-link drawing of the parasite tree drawn inside the pipes representing the host tree. Also this strategy is sometimes ambiguous, since it is unclear how to attribute parasite nodes to host nodes.</p><p>With the aim of overcoming the limitations of existing visualization strategies, we propose a new metaphor for the representation of reconciliations. Our strategy is complementary to the third strategy just described: we omit altogether the arcs of H and represent its nodes as regions of the plane. Arcs are implicitly represented by the contact of the regions corresponding to the nodes. Tree P , instead, maintains the traditional node-link representation, where parasites are placed into the regions associated with the hosts they are mapped to. This representation is not ambiguous, since the containment relationship between host and parasites unambiguously represents the mapping γ. Indeed, the representation of tree H is a variant of a representation known in the literature with the name of icicle <ref type="bibr" target="#b4">[5]</ref>. An icicle (Fig. <ref type="figure" target="#fig_1">2(a)</ref>) is a spacefilling representation of hierarchical information in which nodes are represented by rectangles and arcs are represented by the contact of rectangles, such that the bottom side of the rectangle of a node touches the top sides of its children.</p><p>With respect to the standard icicle representation, we draw H with the following variants (Fig. <ref type="figure" target="#fig_1">2(b)):</ref> we impose that the bottom sides of the rectangles representing the leaves of H touch the bottom border of the drawing area. This forces all contemporaneous hosts to share the same bottom line that represents current time, for each generation of the host tree, we modify the height of the rectangles in such a way that all the parasite subtrees are contained into their hosts rectangles,</p><p>we specialize the icicle to the representation of binary tree, rounding the corners of the rectangles in order to ease the visual recognition of the two children of each node.</p><p>3 Planar instances and reconciliations with the minimum number of crossings</p><p>In this section we characterize the reconciliations that can be planarily drawn, showing that there exist some "planar instances" not depending on the represented reconciliation. Then we naturally wonder what happens when the given instance is not planar and show an NP-hardness result. We omit the proof for the sake of brevity. Let be given a co-phylogenetic tree (H, P, ϕ) and a reconciliation γ.</p><p>Definition 1. A drawing D(γ) of γ is a visual representation of γ according to any metaphor such that:</p><p>the leaves of H are drawn along a horizontal line (i.e. they have the same y-coordinate, w.l.o.g. let it be equal to 0), tree H is embedded (i.e. drawn in a plane way) in the half-plane with nonnegative y-coordinates, each node p ∈ P is drawn inside the representation of node h ∈ H such that γ(p) = h, each edge (s, t) of P (except the ones performing a host-transfer) is drawn inside the representation of the nodes and the edges of H belonging to path from γ(s) and γ(t), the sub-forest induced by all nodes of P mapped on the same node of H follows a strictly downward convention.</p><p>We call V (H), A(H), V (P ) and A(P ) the set of nodes and of arcs of H and P , respectively. V L (H)) and V L (P )) are the set of leaves of the two trees.</p><p>Let G(H, P, ϕ) be the graph whose arc set is A(H) ∪ A(P ) and node set is V (H) ∪ (V (P ) \ V L (P )) and every leaf l of P is identified with leaf ϕ(l) of H. G(H, P, ϕ) is cyclic and, in general, not planar. It can be represented on the plane so that:</p><p>all the leaves of H (and hence also those of P ) lie on a horizontal line l, internal nodes and edges of H are embedded (i.e. drawn in a plane way) in the half-plane above l in a strictly downward fashion, internal nodes and edges of P are represented (not necessarily without crossings) in the half-plane under l in a strictly upward fashion.</p><p>In the following, we will speak about drawing of G(H, P, ϕ) without other specifications to mean a representation on the plane satisfying the above constraints.</p><p>Theorem 1. The following claims are equivalent:</p><p>1. G(H, P, ϕ) is planar and a plane drawing of it is known, 2. for each time-consistent reconciliation γ, there exists a plane drawing D(γ).</p><p>Theorem 1 states that we will have a plane drawing of a reconciliation γ if and only if graph G(H, P, ϕ) is planar. It is hence natural to wonder what happens otherwise. We focus on the problem of the crossing minimization in the drawing of a given reconciliation and prove that computing a drawing of a reconciliation with the minimum number of crossings is NP-hard.</p><p>Given γ and a constant k, we consider the problem Reconciliation Layout (RL) as the problem asking whether there exist a permutation of the leaves of H and a mapping of the nodes of H and of P on the plane such the drawing of γ has at most k crossings.</p><p>Theorem 2. Problem RL is NP-complete.</p><p>We prove it by reducing to RL the NP-complete problem Two-Trees Crossing Minimization (TTCM), whose input consists of two binary trees T 1 and T 2 , whose leaf sets are in one-to-one correspondence, and a constant k; the question is whether T 1 and T 2 admit a tanglegram drawing with at most k crossings among the tangles <ref type="bibr" target="#b2">[3]</ref>.</p><p>We observe that, in the reduction, a primary role is played by host-transfer edges, and this is reasonable, since they connect nodes in completely different parts of the tree, possibly very far. Nevertheless, host-transfers are very special edges and we believe that biologists have no problems in understanding the drawing even if these edges introduce some crossings, so we wonder whether crossings involve only host-transfers or not. We answer in a negative way to this question by showing that some crossings can occur, even if host-transfer edges are eliminated from the representation.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 :</head><label>1</label><figDesc>Fig. 1: Three visualization strategies for representing co-phylogenetic trees.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2 :</head><label>2</label><figDesc>Fig. 2: (a) An icicle. (b) The representation adopted for trees H and P , where dotted lines represent host-transfers.</figDesc></figure>
		</body>
		<back>

			<div type="funding">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This research was partially supported by MIUR project "MODE -MOrphing graph Drawings Efficiently", prot. 20157EFM5C 001 and by Sapienza University of Rome projects "Graph Algorithms for Phylogeny: a promising approach" and "Combinatorial structures and algorithms for problems in co-phylogeny".</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Drawing (complete) binary tanglegrams</title>
		<author>
			<persName><forename type="first">K</forename><surname>Buchin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Buchin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Byrka</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Nöllenburg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Okamoto</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">I</forename><surname>Silveira</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Wolff</surname></persName>
		</author>
		<idno type="DOI">10.1007/s00453-010-9456-3</idno>
		<ptr target="http://dx.doi.org/10.1007/s00453-010-9456-3" />
	</analytic>
	<monogr>
		<title level="j">Algorithmica</title>
		<imprint>
			<biblScope unit="volume">62</biblScope>
			<biblScope unit="issue">1-2</biblScope>
			<biblScope unit="page" from="309" to="332" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Jungles: a new solution to the host/parasite phylogeny reconciliation problem</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Charleston</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Math Biosci</title>
		<imprint>
			<biblScope unit="volume">149</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="191" to="223" />
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Comparing trees via crossing minimization</title>
		<author>
			<persName><forename type="first">H</forename><surname>Fernau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kaufmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Poths</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J. Comput. Syst. Sci</title>
		<imprint>
			<biblScope unit="volume">76</biblScope>
			<biblScope unit="issue">7</biblScope>
			<biblScope unit="page" from="593" to="608" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Comparing trees via crossing minimization</title>
		<author>
			<persName><forename type="first">H</forename><surname>Fernau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kaufmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Poths</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J. Comput. Syst. Sci</title>
		<imprint>
			<biblScope unit="volume">76</biblScope>
			<biblScope unit="issue">7</biblScope>
			<biblScope unit="page" from="593" to="608" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Icicle plots: Better displays for hierarchical clustering</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">B</forename><surname>Kruskal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">M</forename><surname>Landwehr</surname></persName>
		</author>
		<ptr target="http://www.jstor.org/stable/2685881" />
	</analytic>
	<monogr>
		<title level="j">The American Statistician</title>
		<imprint>
			<biblScope unit="volume">37</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="162" to="168" />
			<date type="published" when="1983">1983</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Trees within trees: phylogeny and historical associations</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">D</forename><surname>Page</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Charleston</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Trends Ecol Evol</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="issue">9</biblScope>
			<biblScope unit="page" from="356" to="359" />
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
