<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">A Directed Hypergraph Model for RDF</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Amadís</forename><surname>Antonio</surname></persName>
						</author>
						<author>
							<persName><forename type="first">Martínez</forename><surname>Morales</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Universidad de Carabobo</orgName>
								<address>
									<country key="VE">Venezuela</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">María</forename><surname>Esther</surname></persName>
						</author>
						<author>
							<persName><forename type="first">Vidal</forename><surname>Serodio</surname></persName>
							<affiliation key="aff1">
								<orgName type="institution">Universidad Simón Bolívar</orgName>
								<address>
									<country key="VE">Venezuela</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">A Directed Hypergraph Model for RDF</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">6B5EBF383DE2E6B1ACFAFB23DBE2CBE1</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T09:36+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract/>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>RDF is a proposal of the W3C to express metadata about resources in the Web. The RDF data model allows several representations, each one with its own limitations at expressive power and support for the tasks of query answering and semantic reasoning. In this paper, we present a directed hypergraph model for RDF to represent RDF documents efficiently, overcoming those limitations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Related Work</head><p>An RDF document can be viewed as a graph: nodes are resources and arcs are properties. Formally <ref type="bibr" target="#b2">[4]</ref>, suppose there are three infinite sets: U (URIs), B = {b j : j ∈ N} (blank nodes), and L (literals). (s, p, o) ∈ (U ∪B)×U ×(U ∪B∪L) is an RDF triple, s is called the subject, p the predicate, and o the object. An RDF graph T is a set of RDF triples. The universe of T , univ(T ), is the set of elements of U ∪B∪L that occur in the triples of T . sub(T ) (resp. pred(T ), obj(T )) is the set of all elements in univ(T ) that appear as subjects (resp. predicates, objects) in T . RDF graphs allow several representations: labeled directed graphs (LDG) <ref type="bibr" target="#b2">[4,</ref><ref type="bibr" target="#b5">7]</ref>, undirected hypergraphs (UH) <ref type="bibr" target="#b3">[5]</ref>, and bipartite graphs (BG) <ref type="bibr" target="#b3">[5,</ref><ref type="bibr" target="#b4">6]</ref>.</p><p>In the LDG model, given an RDF graph T , nodes in V are elements of sub(T ) ∪ obj(T ), and arcs in E are elements of pred(T ) <ref type="bibr" target="#b2">[4,</ref><ref type="bibr" target="#b5">7]</ref>. Each (s, p, o) ∈ T is represented by a labeled arc, s . This approach may violate some of the graph theory constraints. Thus, while LDG model is the most widely used representation, it can not be considered a formal model for RDF <ref type="bibr" target="#b0">[1]</ref>.</p><p>In the UH model, given an RDF graph T , each t = (s, p, o) ∈ T is a hyperedge in E and each element of t (subject s, predicate p, and object o) is a node in V . The number of nodes and hyperedges for UH representation is |V | = |univ(T )| and |E| = |T | <ref type="bibr" target="#b3">[5]</ref>. However, UH represent RDF documents as a generalization of undirected graphs, losing the notion of direction in RDF graphs, which impacts the task of semantic reasoning. Besides, it can be hard to graphically represent large RDF graphs, like the museum example <ref type="bibr">[2]</ref>.</p><p>In the BG model, given an RDF graph T , there are two types of nodes in V : statement nodes St (one for each (s, p, o) ∈ T ) and value nodes V al (one for each x ∈ univ(T )). Arcs in E relate statement and value nodes as follows: Each t ∈ St has three outcoming arcs that point to the corresponding node for the subject, predicate, or object of the triple represented by t. The number of nodes and arcs for BG representation is |St| = |T |, |V al| = |univ(T )|, and |E| = 3|T | <ref type="bibr" target="#b3">[5,</ref><ref type="bibr" target="#b4">6]</ref>. While BG satisfy the requirement of a formal model for RDF, issues such as reification, entailment and reasoning have not been addressed yet <ref type="bibr" target="#b0">[1]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Proposed Solution</head><p>Directed hypergraphs (DH) have been used as a modeling tool to represent concepts and structures in many application areas: formal languages, relational databases, production and manufacturing systems, public transportation systems, between others <ref type="bibr" target="#b1">[3]</ref>. RDF DH are formally defined as follows: Definition 1. Let T be an RDF graph. The RDF directed hypergraph representing T is a tuple H(T ) = (W, E, ρ) such that: W = {w : w ∈ univ(T )} is the set of nodes, E = {e i : 1 ≤ i ≤ |T |} is the set of hyperarcs, and ρ : W ×E → {s, p, o} is the role function of nodes w.r.t. hyperarcs. Let t ∈ T be an RDF triple, e ∈ E an hyperarc, and w ∈ W a node such that w ∈ head(e) ∪ tail(e). Then the following must hold: (i) (ρ(w, e) = s) ⇔ (w ∈ tail(e)) ∧ (w ∈ sub({t})), (ii) (ρ(w, e) = p) ⇔ (w ∈ tail(e)) ∧ (w ∈ pred({t})), and (iii)</p><formula xml:id="formula_0">(ρ(w, e) = o) ⇔ (w ∈ head(e)) ∧ (w ∈ obj({t}))</formula><p>The information is only stored in the nodes, while hyperarcs preserve the role of nodes and the notion of direction in RDF graphs. Thus, the space complexity of our approach must be smaller than the complexity of representations in section 1. The number of nodes and hyperarcs for DH representation is |W | = |univ(T )| and |E| = |T |. Besides, concepts and algorithms of hypergraph theory could be used to manipulate RDF graphs under this representation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Preliminary Experimental Results</head><p>Labeled directed graph (LDG) and directed hypergraph (DH) representations were studied over twenty synthetic simple RDF documents, randomly generated using an uniform distribution. Around 33% of the resources simultaneously played the role of subject, predicate, or object. Document sizes were increased, ranging from 1000 to 100000 triples. We used two metrics in this study: (a) the space in memory required to store information, measured as the number of nodes and arcs and (b) the number of comparisons required to answer an elemental query. In both cases, the LDG approach showed a trend of linear dependence on the size of the document, while DH exhibited a more independent behavior.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusions and Future Plan</head><p>We proposed a directed hypergraph model for RDF. Initial results make us believe that our approach scales better than existing representations. In the future, we propose to: (1) extend this representation for RDFS graphs, (2) develop query evaluation algorithms for conjunctive and SPARQL queries, (3) study the impact of this model on the tasks of query answering and semantic reasoning, and (4) conduct empirical studies to analyze the goodness of our approach.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>p−</head><label></label><figDesc>→ o. The number of nodes and arcs for LDG representation is |V | ≤ 2|T | and |E| = |T | [5]</figDesc></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">RDF as Graph-Based, Diagrammatic Logic</title>
		<author>
			<persName><forename type="first">F</forename><surname>Dau</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of ISMIS&apos;06</title>
				<meeting>ISMIS&apos;06</meeting>
		<imprint>
			<date type="published" when="2006">2006</date>
			<biblScope unit="page" from="332" to="337" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Directed Hypergraphs and Applications</title>
		<author>
			<persName><forename type="first">G</forename><surname>Gallo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Longo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Pallottino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Nguyen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Discrete Applied Mathematics</title>
		<imprint>
			<biblScope unit="volume">42</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="177" to="201" />
			<date type="published" when="1993">1993</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Foundations of Semantic Web Databases</title>
		<author>
			<persName><forename type="first">C</forename><surname>Gutierrez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">A</forename><surname>Hurtado</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">O</forename><surname>Mendelzon</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings ACM Symposium on PODS</title>
				<meeting>ACM Symposium on PODS</meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="95" to="106" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">A Graph Model for RDF</title>
		<author>
			<persName><forename type="first">J</forename><surname>Hayes</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
		<respStmt>
			<orgName>Technische Universitt Darmstadt / Universidad de Chile</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Diploma Thesis</note>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Bipartite Graphs as Intermediate Model for RDF</title>
		<author>
			<persName><forename type="first">J</forename><surname>Hayes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Gutierrez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the ISWC&apos;04</title>
				<meeting>the ISWC&apos;04</meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="47" to="61" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Resource Description Framework (RDF): Concepts and Abstract Syntax</title>
		<author>
			<persName><forename type="first">G</forename><surname>Klyne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">J</forename><surname>Carroll</surname></persName>
		</author>
		<ptr target="http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/" />
	</analytic>
	<monogr>
		<title level="m">W3C Recommendation</title>
				<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
