<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Real-world DAG datasets
Name Vertices Edges Average Degree (max.) Median Degree
arxiv</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Optimization Techniques for 2-hop Labeling of Dynamic Directed Acyclic Graphs</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jinhyun Ahn</string-name>
          <email>jhahncs@snu.ac.kr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Biomedical Knowledge Engineering Laboratory and Dental Research Institute Seoul National University</institution>
        </aff>
      </contrib-group>
      <volume>6</volume>
      <issue>000</issue>
      <abstract>
        <p>Graph databases are increasingly used for RDF(Resource Description Framework) data management. Mining reachability relationships between resources is one of important building blocks for graph databases. Reachability relationships in a graph and its corresponding directed acyclic graph(DAG) are equivalent when we focus on reachability alone, which allows to focus on DAGs in this thesis. Diverse labeling schemes have been proposed to e ciently determine the reachability of DAGs. We focus on a state-of-the-art 2-hop labeling scheme that is based on a permutation of vertices to achieve a linear index size and reduce on-line searches that are required when the reachability cannot be answered by 2-hop labels only. We observed that the approach can be improved in three di erent ways; 1) space-e ciency guarantee the minimized index size without randomness 2) update-e ciency update labels e ciently when graphs changes 3) parallelization labeling should be cluster-based, and solved in a distributed fashion. In these regards, this PhD thesis proposes optimization techniques that address these issues. In this paper in particular, a way of reducing the 2-hop label size is proposed with preliminary results on real-world DAG datasets. In addition, we will discuss the feasibilities of the other issues based on our on-going works.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Problem statement</title>
      <p>
        A 2-hop labeling scheme of a DAG is to label each vertex v with a pair (Lout(v),
Lin(v)), where Lout(v) is a set of vertices that v can reach and Lin(v) is a set
of vertices reachable from v [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In this thesis, we focus on a state-of-the-art
variation of 2-hop labeling [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], where a permutation of vertices is used to allow
Lout(v) and Lin(v) to keep at most k reachable vertices, which probabilistically
guarantees reduction in on-line Depth-First-Search(DFS), a condition required
when the reachability cannot be answered by these labels only. We brie y review
the labeling scheme as follows.
      </p>
      <p>2-hop Labeling based on Independent Permutation (IP) Let G(V; E)
be a DAG with V as a set of vertices and E as a set of edges pairing two vertices.
u ↝ v is used to denote that u can reach v, and u   v, otherwise. A permutation
of V is a bijection denoted as ∶ V Ð→ V . Given G and a positive integer k, IP
࢜ ߪ ૚ ሺ࢜ሻ ࡸ ࢕࢛࢚ ࢜ሻሺ ࡸ ࢏࢔ ሺ࢜ሻ</p>
      <p>࢜ ߪ ૛ ሺ࢜ሻ ࡸ ࢕࢛࢚ ሺ࢜ሻ ࡸ ࢏࢔ ሺ࢜ሻ
randomly generates and then outputs the 2-hop index Ik ∶ V Ð→ H, where H
is a set of pairs (Lout(v); Lin(v)) de ned as:</p>
      <p>Lout(v) = mink{ (u)Sv ↝ u}</p>
      <p>Lin(v) = mink{ (u)Su ↝ v}
, where mink{A} is a sub-set of A, such that Ai &lt; Aj if i &lt; j and Smink{A}S ≤ k.
In other words, mink{A} contains the k smallest integers in A.</p>
      <p>In the original version of IP, is randomly generated using the Knuth shu e
algorithm. We argue that the randomized way of generating a permutation is
unable to ensure that the minimized size of index is obtained. Therefore, we
de ne the rst problem that generates a space-e cient permutation de ned in
De nition 1.</p>
      <p>De nition 1. Space-e cient Permutation Given G(V; E) and a positive
integer k, output such that</p>
      <p>1
argmin size(Ik(V )) ∶= SV S v∈V</p>
      <p>Q space(Lout(v)) + space(Lin(v))
, where space(L) is the space requirements for a set L of integers, which can be
de ned according to the application requirements.</p>
      <p>An illustrative example that shows the e ect of on the index size is depicted
in Fig. 1, where we assume space(L) = ∑w∈L w to see the index size di erence
for the very small DAG.</p>
      <p>
        Linked Open Data1(LOD) evolves over time [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. When a DAG Gt changes
into Gt+1, the straightforward way is to construct the 2-hop index against Gt+1
from scratch. As this is not feasible for massive DAGs in LOD, we de ne a second
problem that generates an update-e cient permutation by which 1) a smaller
number of existing labels are modi ed and 2) a smaller index size is maintained,
as de ned in De nition 2. Since there is a trade-o between these criteria, weights
1 http://linkeddata.org
are used according to requirements. The rst factor of the equation in De nition
2 represents the number of vertices in the current version Vv+1 whose labels are
not the same as in the previous version in Ik(v).
      </p>
      <p>De nition 2. Update-e cient Permutation Assume we already have Gt(Vt; Et)
and its 2-hop index Ik(Vt). Given Gt+1(Vt+1; Et+1), generate a new permutation
t+1 such that
× S{v ∈ Vt</p>
      <p>Vt+1SIkt (v) ≠ I t+1 (v)}S +
k
× size(Ikt+1 (Vt+1))</p>
      <p>Meanwhile, existing 2-hop labeling algorithms running on a single machine
are not e cient enough to address the above problems for massive DAGs. Our
third problem is thus to devise a cluster-based 2-hop labeling algorithm that
supports De nition 1 and 2.</p>
      <p>argmin</p>
      <p>
        t+1
, where
≥ 0,
Reachability relationships in a graph and its corresponding DAG are equivalent,
since a graph can be transformed into a DAG by condensing every strongly
connected component(SCC) into a single vertex and retaining edges between
vertices in SCC and the other vertices that are connected to SCC [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. In terms
of reachability relationships, LOD can be viewed as a real-world DAG dataset.
A vast amount of knowledge is available in LOD, including social networks,
biological networks, tra c networks, and software version management.
Proteinprotein interaction networks, for example, can be represented in DAGs. One may
be interested in mining interactions (i.e., reachability relationships) between two
proteins (i.e., vertices) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Regarding RDF triple stores, reachability relationship
identi cation helps process SPARQL queries e ciently [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
3
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        There are extensive studies on labeling of DAGs for reachability queries
processing [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. The straightforward way is to pre-compute the edge transitive
closures(TC) [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Even if this method provides an answer to a reachability query in
O(1) time, the index could be huge for even small dense graphs. On-line
traversal, such as DFS and Bread-First Search(BFS), do not require any index but
lead to slow query processing. Various approaches have been made to address
trade-o s between the index size and the speed of query processing. Some
authors have adapted the prime number labeling scheme to DAGs [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. However,
prime numbers quickly grow to large values. Tree Cover [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] labels a vertex v
with a compressed TC of the subtree rooted at v. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] present GRIPP(GRaph
Indexing based on Pre- and Postorder numbering) that utilizes spanning trees
based on an interval label scheme. GRAIL [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] is based on randomized
multiple interval labeling. FERRARI [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] labels each vertex with a mixture of exact
and approximate reachability intervals, where subsets of intervals are merged
into approximate intervals. 2-hop labeling [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] labels each vertex v with a pair
(Lout(v); Lin(v)).
      </p>
      <p>
        Recently, a state-of-the-art variation of 2-hop labeling has been proposed
in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] that reports outstanding performance compared to existing approaches.
A permutation of vertices is rst generated randomly. MinHash is adapted to
construct 2-hop labels; at most k reachable vertices are only maintained in
2hop labels. For pairs that cannot be determined by k reachable vertices only, an
on-line search is performed. Some heuristics are also presented to minimize the
case that requires exhaustive on-line DFS searches.
4
      </p>
    </sec>
    <sec id="sec-3">
      <title>Research questions</title>
      <p>
        The research objective is to optimize the IP labeling [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] in terms of index size,
updating cost, and scalability. We want to answer the following research
questions (where di culties regarding the questions are also discussed).
      </p>
      <p>1) Is it possible to deterministically choose a permutation of V that best
minimizes the index size? Because the number of candidate permutations is n!, a
brute-force algorithm is not feasible. One may choose a permutation by sorting
V using a topological sort. Smaller numbers are assigned to vertices having
more reachable vertices. It can be expected that this method allows Lin(v) to
have smaller numbers. However, conversely, Lout(v) would have large numbers,
o setting smaller size of Lin(v). A more sophisticated technique is required.</p>
      <p>2) When a new vertex vnew is added to a DAG Gt(Vt; Et) that has already
been labeled by t, which number should be assigned to t+1(vnew)? The simplest
way is to make t+1(vnew) = max( t(Vt)) + 1. However, with this approach, the
minimized index size is not always ensured.</p>
      <p>3) In order to do 2-hop labeling in a cluster, when vertices are distributed,
how can we know reachable vertices from a vertex? Given a set M of machines,
we may distribute a subset Vi of V to a machine Mi and then perform labeling
independently. The challenge is to obtain Lin(v) and Lout(v) for a vertex v ∈ Vi
residing in Mi whereas some o ∈ Lin(v) ⋃ Lout(v) have been distributed to the
other machines Mj , such that i ≠ j.
5</p>
    </sec>
    <sec id="sec-4">
      <title>Hypotheses</title>
      <p>Several hypotheses regarding research questions are stated as follows.
{ If smaller numbers are assigned to
2-hop index size is reduced.</p>
      <p>(v) because v has more edges, then the
{ The reachability query processing performance increases as index size is
reduced.
{ If 2-hop labeling is carried out in a cluster, then the index construction and
update would be faster than existing single machine based algorithms are.
6</p>
    </sec>
    <sec id="sec-5">
      <title>Approach</title>
      <p>To prove the rst hypotheses, we rst compute the degree of each of the vertices
in V and then sort V in decreasing order by degree. The order is regarded as
the desired permutation (V ) as follows.</p>
      <p>(v) = i
, where v is the i-th element in V degree such that V degree is a sorted set of V ,
sorted by decreasing degree of v.</p>
      <p>In other words, the more degrees v has, the smaller the number assigned to
v . This is derived by an expectation that if v has many edges, v is likely
( )
to become a reachable vertex from another vertex. The overall 2-hop index size
could be reduced by making (v) smaller. Note that the current approach does
not take into account the parameter k. We plan to introduce the parameter k
to guarantee a reduced index size for arbitrary k.</p>
      <p>
        Regarding a cluster-based algorithm, we are motivated by the approach in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]
that proposed cluster based labeling algorithms for trees. In that work, Mapper
performs incomplete labeling and then Reducer completes the labels by referring
to the o set table shared by all machine. The o set table is constructed based
on information collected from each Mapper, which contains information needed
to complete the labels in Reducers. We plan to adapt the idea in a manner that
e ectively deals with large DAGs in a cluster.
7
      </p>
    </sec>
    <sec id="sec-6">
      <title>Preliminary results</title>
      <p>We implemented the rst proposed approach based on the source codes provided
by the authors of IP2. The proposed approach (denoted as IP adv) is evaluated
compared with the original approach (IP) and baseline (IP x). IP generates a
2 http://www1.se.cuhk.edu.hk/˜hwei/source/IP.zip
1 2 3 4 5 6 7 8 9 10
random permutation, which means that a di erent index is constructed for each
run. IP x is based on the identity permutation such that (v) = j, where v is the
j-th element in V . All the experiments were conducted on a machine with a 2.4
GHz CPU and 40 GB of RAM. We downloaded the real-world DAG datasets3.
Statistics of the datasets are listed in Table 1.</p>
      <p>The index sizes for each dataset are plotted in Fig. 2. The index size by
IP adv and IP x are compared with the index sizes generated by 10 runs of IP.
For each run, only IP showed the di erent index size due to its random nature.
IP adv showed the best performance for all datasets. For small datasets with
large degrees such as arxiv and pubmed, relatively small improvements are
observed. For large datasets, we observed relatively large improvements, except
3 https://code.google.com/archive/p/ferrari-index/downloads
for citeseer and yago. It can be seen in Table 1 that these datasets have
relatively small median degrees. Since IP adv is based on degrees, too small or
too large degrees would not signi cantly contribute to a reduction in the index
size. In future works, we plan to utilize more graph metrics to work better against
such datasets.
8</p>
    </sec>
    <sec id="sec-7">
      <title>Evaluation plan</title>
      <p>
        We plan to collect LOD datasets and then transform them into DAGs by
condensing SCC. Synthetic DAG datasets will also be considered, varying graph
metrics such as in/out-degree, diameter, and vertices with no ancestors or
descendants. In particular, several releases of DBpedia datasets4 will be collected
in order to evaluate the performance of updating the index. To examine the pros
and cons of our approach in greater details, we will compare it to a number
of notable existing approaches such as [
        <xref ref-type="bibr" rid="ref10 ref16 ref18">10, 16, 18</xref>
        ]. Regarding our cluster-based
algorithm, as there exist few works on cluster-based 2-hop labeling, we will
modify existing cluster-based tree labeling algorithms [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and consider these as the
baseline.
9
      </p>
    </sec>
    <sec id="sec-8">
      <title>Re ections</title>
      <p>
        It remains a challenge to apply graph reachability indexing techniques to very
large sets of LOD. In this paper, we showed that simple graph metrics can
be exploited to reduce the index size. By improving the proposed approach
further, it will become more applicable to LOD. Another characteristic of LOD
is that it changes over time. We have some experience on scalable RDF change
detection [
        <xref ref-type="bibr" rid="ref2 ref7 ref8">2, 8, 7</xref>
        ] that outputs added and removed triples for two given RDF
dataset versions. By utilizing this system, it would be possible to update only a
portion of an existing 2-hop index while avoiding re-labeling. To perform these
algorithms e ciently in a scalable manner, we can utilize distributed computing
frameworks such as MapReduce and Spark.
      </p>
    </sec>
    <sec id="sec-9">
      <title>Acknowledgement</title>
      <p>I would like to acknowledge my advisors, Hong-Gee Kim (Seoul National
University) and Dong-Hyuk Im (Hoseo University). This work was supported by
Institute for Information &amp; communications Technology Promotion(IITP) grant
funded by the Korea government(MSIP) (No. R0101-16-0054, WiseKB: Big data
based self-evolving knowledge base and reasoning platform).
4 http://wiki.dbpedia.org/services-resources/datasets/
previous-releases</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Agrawal</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Borgida</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jagadish</surname>
          </string-name>
          , H.V.:
          <article-title>E cient management of transitive relationships in large data and knowledge bases</article-title>
          , vol.
          <volume>18</volume>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>1989</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Ahn</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Im</surname>
            ,
            <given-names>D.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eom</surname>
            ,
            <given-names>J.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zong</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
          </string-name>
          , H.G.:
          <article-title>G-Di : A Grouping Algorithm for RDF Change Detection on MapReduce</article-title>
          . In: Semantic Technology, pp.
          <volume>230</volume>
          {
          <fpage>235</fpage>
          . Springer (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Choi</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>K.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>Y.J.:</given-names>
          </string-name>
          <article-title>Parallel labeling of massive xml data with mapreduce</article-title>
          .
          <source>The Journal of Supercomputing</source>
          <volume>67</volume>
          (
          <issue>2</issue>
          ),
          <volume>408</volume>
          {
          <fpage>437</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Halperin</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaplan</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zwick</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Reachability and distance queries via 2-hop labels</article-title>
          .
          <source>SIAM Journal on Computing</source>
          <volume>32</volume>
          (
          <issue>5</issue>
          ),
          <volume>1338</volume>
          {
          <fpage>1355</fpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Gabr</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kahveci</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Signal reachability facilitates characterization of probabilistic signaling networks</article-title>
          .
          <source>BMC Bioinformatics</source>
          <volume>16</volume>
          (
          <issue>Suppl 17</issue>
          ),
          <source>S6</source>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Gubichev</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bedathur</surname>
            ,
            <given-names>S.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Seufert</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Sparqling kleene: fast property paths in rdf-3x</article-title>
          .
          <source>In: First International Workshop on Graph Data Management Experiences and Systems</source>
          . p.
          <fpage>14</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Im</surname>
            ,
            <given-names>D.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>S.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>H.J.:</given-names>
          </string-name>
          <article-title>Backward inference and pruning for RDF change detection using RDBMS</article-title>
          .
          <source>Journal of Information</source>
          Science pp.
          <volume>238</volume>
          {
          <issue>255</issue>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Im</surname>
            ,
            <given-names>D.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>S.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>H.J.:</given-names>
          </string-name>
          <article-title>A version management framework for RDF triple stores</article-title>
          .
          <source>International Journal of Software Engineering and Knowledge Engineering</source>
          <volume>22</volume>
          (
          <issue>01</issue>
          ),
          <volume>85</volume>
          {
          <fpage>106</fpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9. Kafer, T.,
          <string-name>
            <surname>Umbrich</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hogan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Polleres</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Towards a dynamic linked data observatory</article-title>
          .
          <source>LDOW at WWW</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Seufert</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Anand</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bedathur</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weikum</surname>
          </string-name>
          , G.:
          <article-title>Ferrari: Flexible and e cient reachability range assignment for graph indexing</article-title>
          .
          <source>In: Data Engineering (ICDE)</source>
          ,
          <year>2013</year>
          IEEE 29th International Conference on. pp.
          <volume>1009</volume>
          {
          <fpage>1020</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Simon</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>An improved algorithm for transitive closure on acyclic digraphs</article-title>
          .
          <source>Theoretical Computer Science</source>
          <volume>58</volume>
          (
          <issue>1</issue>
          ),
          <volume>325</volume>
          {
          <fpage>346</fpage>
          (
          <year>1988</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. Tri l, S.,
          <string-name>
            <surname>Leser</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Fast and practical indexing and querying of very large graphs</article-title>
          .
          <source>In: Proceedings of the 2007 ACM SIGMOD international conference on Management of data</source>
          . pp.
          <volume>845</volume>
          {
          <fpage>856</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Veloso</surname>
            ,
            <given-names>R.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cerf</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meira</surname>
            <given-names>Jr</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            ,
            <surname>Zaki</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.J.:</surname>
          </string-name>
          <article-title>Reachability queries in very large graphs: A fast re ned online search approach</article-title>
          . In: EDBT. pp.
          <volume>511</volume>
          {
          <fpage>522</fpage>
          .
          <string-name>
            <surname>Citeseer</surname>
          </string-name>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Wei</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>J.X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jin</surname>
          </string-name>
          , R.:
          <article-title>Reachability querying: An independent permutation labeling approach</article-title>
          .
          <source>Proceedings of the VLDB Endowment</source>
          <volume>7</volume>
          (
          <issue>12</issue>
          ),
          <volume>1191</volume>
          {
          <fpage>1202</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , Zhang,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          :
          <article-title>Adapting prime number labeling scheme for directed acyclic graphs</article-title>
          .
          <source>In: Database Systems for Advanced Applications</source>
          . pp.
          <volume>787</volume>
          {
          <fpage>796</fpage>
          . Springer (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16. Y ld r m, H.,
          <string-name>
            <surname>Chaoji</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zaki</surname>
            ,
            <given-names>M.J.:</given-names>
          </string-name>
          <article-title>GRAIL: a scalable index for reachability queries in very large graphs</article-title>
          .
          <source>The VLDB JournalThe International Journal on Very Large Data Bases</source>
          <volume>21</volume>
          (
          <issue>4</issue>
          ),
          <volume>509</volume>
          {
          <fpage>534</fpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>J.X.</given-names>
          </string-name>
          , Cheng, J.:
          <article-title>Graph reachability queries: A survey</article-title>
          .
          <source>In: Managing and Mining Graph Data</source>
          , pp.
          <volume>181</volume>
          {
          <fpage>215</fpage>
          . Springer (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>A.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xiao</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          :
          <article-title>Reachability queries on large dynamic graphs: a total order approach</article-title>
          .
          <source>In: Proceedings of the 2014 ACM SIGMOD international conference on Management of data</source>
          . pp.
          <volume>1323</volume>
          {
          <fpage>1334</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>