<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Neologism: Easy Vocabulary Publishing</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Cosmin</forename><surname>Basca</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Digital Enterprise Research Institute</orgName>
								<orgName type="institution">National University of Ireland</orgName>
								<address>
									<settlement>Galway</settlement>
									<country key="IE">Ireland</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Stéphane</forename><surname>Corlosquet</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Digital Enterprise Research Institute</orgName>
								<orgName type="institution">National University of Ireland</orgName>
								<address>
									<settlement>Galway</settlement>
									<country key="IE">Ireland</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Richard</forename><surname>Cyganiak</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Digital Enterprise Research Institute</orgName>
								<orgName type="institution">National University of Ireland</orgName>
								<address>
									<settlement>Galway</settlement>
									<country key="IE">Ireland</country>
								</address>
							</affiliation>
						</author>
						<author role="corresp">
							<persName><forename type="first">Sergio</forename><surname>Fernández</surname></persName>
							<email>sergio.fernandez@fundacionctic.org</email>
							<affiliation key="aff1">
								<orgName type="institution">Fundación CTIC Gijón</orgName>
								<address>
									<settlement>Asturias</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Thomas</forename><surname>Schandl</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Digital Enterprise Research Institute</orgName>
								<orgName type="institution">National University of Ireland</orgName>
								<address>
									<settlement>Galway</settlement>
									<country key="IE">Ireland</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Neologism: Easy Vocabulary Publishing</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">B24322930078FBC7BA9FFC2CB8B12FED</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T22:52+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Creating, documenting, publishing and maintaining an RDF Schema vocabulary is a complex, time-consuming task. This makes vocabulary maintainers reluctant to evolve their creations quickly in response to user feedback; it prevents use of RDF for casual, ad-hoc data publication about niche topics; it leads to poorly documented vocabularies; and it contributes to poor compliance of vocabularies with best-practice recommendations. Neologism is a web-based vocabulary editor and publishing system that dramatically reduces the time required to create, publish and modify vocabularies. By removing much of the pain from this process, Neologism will contribute to a generally more interesting, relevant and standards-compliant Semantic Web.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Anyone who wants to publish information as RDF on the Semantic Web first faces the choice of which RDF Schema vocabulary or OWL ontology to use. Some areas, such as social networks (FOAF), online communities (SIOC) or general document metadata (DC), are covered by established vocabularies. Outside of these domains, registries like SchemaWeb<ref type="foot" target="#foot_0">3</ref> and search services like Falcons Concept Search<ref type="foot" target="#foot_1">4</ref> assist in the task of finding vocabularies for niche topics, but what they find might be of insufficient quality or might not cover all required terms, and at present many areas of interest are not covered by any vocabulary at all.</p><p>In summary, most efforts to publish information on the Semantic Web first require an effort to create, extend or modify an RDF Schema vocabulary or OWL ontology. But this is a complex and time-consuming task in itself. It involves:</p><p>creating the formal specification of the vocabulary in RDFS or OWL; writing documentation that is clear and helpful for users of the ontology; keeping both documents in sync as the vocabulary evolves; archiving older versions of the documents; defining and maintaining mappings to related vocabularies; and configuring the web server in accordance with W3C best practices <ref type="bibr" target="#b2">[3]</ref>.</p><p>In this paper we present an online vocabulary editor and publishing system based on Drupal<ref type="foot" target="#foot_2">5</ref>, implemented in PHP and ActionScript, which will support vocabulary authors in the tasks above and thereby dramatically reduce the time required to create, publish and modify vocabularies. The work presented in this paper is in progress.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">The value of vocabularies</head><p>We define vocabularies as simple, "lightweight" ontologies, such as FOAF, DC, SIOC and SKOS. They usually comprise fewer than 50 terms. Expressivity is limited to RDF Schema plus selected OWL features, e.g. inverse functional properties and class disjointness. Their value is in providing common terminology for exchanging information between programs. The actual information is in the RDF instance data that is expressed with the vocabulary's terms, while in more complex ontologies, the actual information lies in the definitions of the classes and properties. A vocabulary is created by publishing a description of its terms in natural language (using HTML) or in a formal language (using RDFS/OWL). Since classes and properties are identified by URIs, it is considered good practice to make these URIs resolvable <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b2">3]</ref>. This enables clients to look up definitions of the vocabulary terms, with the following benefits:</p><p>- Information publishers can refer to a specification. This is important to create interoperability around a vocabulary. The top ten most popular vocabularies of 2006<ref type="foot" target="#foot_3">6</ref> all have such a specification.</p><p>- RDF-aware tools such as data browsers (e.g. Tabulator <ref type="bibr" target="#b1">[2]</ref>), SPARQL query builders and RDF instance editors can use the formal specification to improve the user experience, e.g. by showing friendlier labels and comments, listing available terms and providing widgets appropriate to a property's data type.</p><p>- Inference can be performed to increase recall when performing queries or lookups against RDF data, which is especially useful when terms are mapped to other vocabularies. Systems that use such techniques are the Tabulator data browser <ref type="bibr" target="#b1">[2]</ref> and the Sindice semantic lookup index <ref type="bibr" target="#b5">[6]</ref>.</p></div>
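<div xmlns="http://www.tei-c.org/ns/1.0"><p>The recall benefit of inference described in Section 2 can be illustrated with a small Python sketch. It is only an illustration under stated assumptions: the ex: terms, the sample triples and the helper names superclasses and instances_of are hypothetical, and the sketch does not claim to reflect how Tabulator or Sindice are implemented. Triples are modelled as plain tuples, and the only rule applied is the transitive closure of rdfs:subClassOf.</p>
<p>
```python
RDF_TYPE = "rdf:type"
SUBCLASS_OF = "rdfs:subClassOf"

# Hypothetical vocabulary definition: ex:Student is mapped to foaf:Person.
SCHEMA = [("ex:Student", SUBCLASS_OF, "foaf:Person")]

# Hypothetical instance data expressed with the vocabulary's terms.
DATA = [
    ("ex:alice", RDF_TYPE, "ex:Student"),
    ("ex:bob", RDF_TYPE, "foaf:Person"),
]


def superclasses(cls, schema):
    """Return cls together with all of its transitive superclasses."""
    seen = {cls}
    stack = [cls]
    while stack:
        current = stack.pop()
        for s, p, o in schema:
            if s == current and p == SUBCLASS_OF and o not in seen:
                seen.add(o)
                stack.append(o)
    return seen


def instances_of(cls, data, schema):
    """Return subjects typed as cls, either directly or through a subclass."""
    return sorted(
        s for s, p, o in data
        if p == RDF_TYPE and cls in superclasses(o, schema)
    )


# Without the vocabulary definition only the direct match is found.
print(instances_of("foaf:Person", DATA, []))      # ['ex:bob']
# With the definition, the subclass instance is found as well.
print(instances_of("foaf:Person", DATA, SCHEMA))  # ['ex:alice', 'ex:bob']
```
</p>
<p>The second query returns more results only because the client could look up the term definition, which is the reason resolvable term URIs matter for this use case.</p></div>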
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Current approaches to vocabulary publishing</head><p>Vocabulary maintenance with text editors and custom scripts. Many popular vocabularies such as FOAF and SIOC are maintained by a process involving hand-authoring of RDF and HTML files and custom scripts, e.g. SpecGen<ref type="foot" target="#foot_4">7</ref>. Often, complex custom Apache configurations are employed to follow best practices regarding content negotiation, MIME types and resolvable URIs <ref type="bibr" target="#b2">[3]</ref>.</p><p>Offline ontology editors. OWL ontology editors such as Protégé <ref type="bibr" target="#b4">[5]</ref>, TopBraid Composer<ref type="foot" target="#foot_5">8</ref> and SWOOP<ref type="foot" target="#foot_6">9</ref> can be used to create the formal specification of a vocabulary. While they are great tools for knowledge engineering professionals, these applications have a steep learning curve and intimidate casual users. They use a file-based, offline model, where ontology files are stored on the local user's computer. Remote publishing, if supported at all, is an afterthought.</p><p>Web-based systems. OntoWiki <ref type="bibr" target="#b0">[1]</ref> provides basic ontology editing, but its main focus is the display and editing of RDF instance data. MyOntology <ref type="bibr" target="#b6">[7]</ref> focuses on collaborative editing in a larger community, in the hope of creating rich knowledge bases, while creation of simple vocabularies typically does not involve many collaborating users. Knoodl<ref type="foot" target="#foot_7">10</ref> is a hosted service with strong community features and an easy-to-use vocabulary editor, but it does not publish created vocabularies with resolvable URIs or according to best-practice guidelines.</p><p>Areas for improvement. We identify four points where we can simplify the process: (i) instant web-based publishing instead of file-based offline editing; (ii) focus on a limited subset of RDFS and OWL; (iii) no instance editing or browsing; (iv) handling of HTTP details like URI management, content negotiation and redirects within the web-based application.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Easier vocabulary publishing with Neologism</head><p>Neologism<ref type="foot" target="#foot_8">11</ref> is a web-based vocabulary editor and publishing platform designed to address these issues. It is currently being implemented and will soon be released as an open-source project. This section presents Neologism's current state.</p><p>Public interface. To non-authenticated users on the Web, Neologism presents a very simple interface: a homepage that lists one or more vocabularies, and for each of them a vocabulary page containing some general information about the vocabulary (Figure <ref type="figure" target="#fig_0">1</ref>), followed by the descriptions of all its classes and properties.</p><p>Editor. After a vocabulary maintainer logs in, additional links become visible on the vocabulary page and allow adding new terms, as well as editing of existing terms. Terms are created and edited through a web form (Figure <ref type="figure" target="#fig_1">2</ref>). The form allows entry of an ID (to become part of the term's URI), label, comment, subclasses, subproperties, domain, range, disjoint classes, inverse properties, and marking a property as inverse functional. Authenticated users can also create new vocabularies and modify the vocabulary metadata.</p><p>Overview diagram. The vocabulary page provides access to a diagram that shows the vocabulary's classes and their relationships (Figure <ref type="figure" target="#fig_2">3</ref>). The vocabulary maintainer can arrange the diagram into a sensible layout and then save its current state, which will from then on be shown to all users.</p><p>RDFS output, URIs and content negotiation. The URIs identifying classes and properties are always generated by appending the hash character and the term's ID to the URI of the vocabulary page. This ensures that the vocabulary page is returned when these URIs are resolved. HTTP requests to the vocabulary page are subject to content negotiation. Web browsers will see the HTML variant shown in Figure <ref type="figure" target="#fig_0">1</ref>. RDF-aware clients will receive the RDFS/OWL specification, either in RDF/XML or N3 syntax. In a nutshell, Neologism publishes standards-compliant vocabularies on the Web without requiring any additional effort on the part of vocabulary maintainers.</p><p>Implementation. Neologism is implemented in PHP as a Drupal module. Drupal reduces development time by providing many features for free, such as account management. It also makes integration with a larger Drupal-based site very easy, for example to provide a news blog and discussion forum for each vocabulary. All data is stored in a MySQL database. RAP<ref type="foot" target="#foot_9">12</ref> is used to serialize RDF/XML and N3. The PHP Content Negotiation library<ref type="foot" target="#foot_10">13</ref> is used instead of the usual Apache rules to implement content negotiation, and Vapour<ref type="foot" target="#foot_11">14</ref> was used to validate its correctness. The overview diagram is implemented using Adobe Flex and coded in ActionScript; the ObjectHandles and Tweener libraries are used for animation and object handling.</p></div>
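<div xmlns="http://www.tei-c.org/ns/1.0"><p>The URI scheme and content negotiation described in Section 4 can be sketched in a few lines of Python. This is not Neologism's PHP code and does not use the PHP Content Negotiation library; it is a minimal sketch under stated assumptions. The function names, the example.org URI and the media types (in particular text/n3 for N3) are hypothetical choices made for illustration, and the Accept header parsing is deliberately simplified.</p>
<p>
```python
from urllib.parse import urldefrag

# Assumed media types for the three variants named in the text
# (HTML, RDF/XML, N3); HTML comes first and acts as the default.
VARIANTS = ["text/html", "application/rdf+xml", "text/n3"]


def term_uri(vocabulary_uri, term_id):
    """Build a term URI by appending '#' and the term ID to the page URI."""
    return vocabulary_uri + "#" + term_id


def document_uri(uri):
    """Return the URI a client requests over HTTP: the fragment is dropped."""
    return urldefrag(uri).url


def parse_accept(header):
    """Parse an Accept header into (media_type, q) pairs (simplified)."""
    entries = []
    for part in header.split(","):
        fields = [f.strip() for f in part.split(";")]
        media_type = fields[0].lower()
        if not media_type:
            continue
        q = 1.0
        for param in fields[1:]:
            name, _, value = param.partition("=")
            if name.strip() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0
        entries.append((media_type, q))
    return entries


def negotiate(accept_header, variants=VARIANTS):
    """Pick the acceptable variant with the highest q; default to the first."""
    best, best_q = variants[0], 0.0
    for media_type, q in parse_accept(accept_header):
        for variant in variants:
            wildcard = variant.split("/")[0] + "/*"
            if media_type in ("*/*", wildcard, variant) and q > best_q:
                best, best_q = variant, q
    return best


person = term_uri("http://example.org/vocab", "Person")
print(person)                 # http://example.org/vocab#Person
print(document_uri(person))   # http://example.org/vocab
print(negotiate("text/html,application/xhtml+xml,*/*;q=0.8"))  # text/html
print(negotiate("application/rdf+xml"))  # application/rdf+xml
```
</p>
<p>Because the fragment is never sent to the server, every term URI resolves to the single vocabulary page, so one negotiated resource is enough to serve both the HTML documentation and the RDF variants.</p></div>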
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Future Work</head><p>Hosted Neologism service. Currently, vocabulary maintainers must install Neologism on their own webspace. A central hosted service, which could be easily built on the Drupal platform, would remove this barrier.</p><p>Branching and revision tracking. Neologism does not yet offer revision control. Some desirable features for vocabulary revision control are: archival of all prior versions; grouping of several small edits into a single version to avoid putting the vocabulary into an inconsistent intermediate state; publishing changes as a draft before accepting them as a new version.</p><p>Plugin system. We intentionally kept the set of supported class and property annotations small to simplify the user experience, and do not support many possible further annotations, such as OWL cardinality constraints, plural and inverse labels<ref type="foot" target="#foot_12">15</ref>, multilingual labels or associating Fresnel lenses <ref type="bibr" target="#b3">[4]</ref> with classes and properties. Such additional annotations could be supported through plugins that are installed by vocabulary maintainers.</p><p>Consistency checking. Neologism does not check the created vocabulary for consistency. This can become an issue when a vocabulary is integrated with several external vocabularies. A solution could be the integration of an external reasoning service that performs consistency checks and is invoked through an API over the Web.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Conclusion</head><p>We have shown a web-based vocabulary publishing system that simplifies the process of creating, publishing and maintaining RDF vocabularies by (i) publishing instantly on the Web, (ii) focusing on a limited subset of RDFS and OWL, (iii) avoiding instance editing or browsing, and (iv) handling URI management and HTTP content negotiation. We hope that the presented system will encourage the creation of new vocabularies and thereby contribute to a generally more interesting, relevant and standards-compliant Semantic Web.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1.</head><label>1</label><figDesc>Fig. 1. A vocabulary page in Neologism, as it appears to an authenticated user.</figDesc><graphic coords="3,165.95,115.84,283.46,92.20" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2.</head><label>2</label><figDesc>Fig. 2. A form for editing a class.</figDesc><graphic coords="4,165.95,115.84,283.46,173.78" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Fig. 3.</head><label>3</label><figDesc>Fig. 3. The vocabulary overview diagram.</figDesc><graphic coords="5,165.95,115.84,283.47,212.60" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_0">http://www.schemaweb.info/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_1">http://iws.seu.edu.cn/services/falcons/conceptsearch/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_2">http://drupal.org/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_3">http://ebiquity.umbc.edu/resource/html/id/196/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_4">http://sioc-project.org/specgen</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_5">http://www.topbraidcomposer.com/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="9" xml:id="foot_6">http://www.mindswap.org/2004/SWOOP/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="10" xml:id="foot_7">http://knoodl.com/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="11" xml:id="foot_8">http://neologism.deri.ie/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="12" xml:id="foot_9">http://www4.wiwiss.fu-berlin.de/bizer/rdfapi/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="13" xml:id="foot_10">http://ptlis.net/source/php-content-negotiation/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="14" xml:id="foot_11">http://vapour.sourceforge.net/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="15" xml:id="foot_12">http://www.wasab.dk/morten/2004/03/label</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">OntoWiki, a tool for social, semantic collaboration</title>
		<author>
			<persName><forename type="first">S</forename><surname>Auer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Dietzold</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Riechert</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The Semantic Web - ISWC 2006</title>
				<imprint>
			<date type="published" when="2006">2006</date>
			<biblScope unit="volume">4273</biblScope>
			<biblScope unit="page" from="736" to="749" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Tabulator: Exploring and Analyzing Linked Data on the Semantic Web</title>
		<author>
			<persName><forename type="first">T</forename><surname>Berners-Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Chilton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Connolly</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Dhanaraj</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Hollenbach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lerer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Sheets</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The 3rd International Semantic Web User Interaction Workshop (SWUI06)</title>
				<imprint>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">Best Practice Recipes for Publishing RDF Vocabularies</title>
		<author>
			<persName><forename type="first">D</forename><surname>Berrueta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Phipps</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2008">2008</date>
			<pubPlace>W3C</pubPlace>
		</imprint>
	</monogr>
	<note type="report_type">Working Draft</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Fresnel, a Browser-Independent Presentation Vocabulary for RDF</title>
		<author>
			<persName><forename type="first">C</forename><surname>Bizer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Pietriga</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Semantic Web Conference</title>
				<imprint>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">The Protégé OWL Plugin: An open development environment for semantic web applications</title>
		<author>
			<persName><forename type="first">H</forename><surname>Knublauch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">W</forename><surname>Fergerson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">F</forename><surname>Noy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Musen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The Semantic Web - ISWC 2004</title>
				<imprint>
			<date type="published" when="2004">2004</date>
			<biblScope unit="volume">3298</biblScope>
			<biblScope unit="page" from="229" to="243" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Sindice.com: A document-oriented lookup index for open linked data</title>
		<author>
			<persName><forename type="first">E</forename><surname>Oren</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Delbru</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Catasta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Cyganiak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Stenzhorn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Tummarello</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Journal of Metadata, Semantics and Ontologies</title>
				<imprint>
			<date type="published" when="2008">2008</date>
			<biblScope unit="volume">3</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">myOntology: The marriage of ontology engineering and collective intelligence</title>
		<author>
			<persName><forename type="first">K</forename><surname>Siorpaes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hepp</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ESWC 2007 Workshop Bridging the Gap between Semantic Web and Web 2.0</title>
				<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
