<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main"></title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">30601AFE94FC24589400C4A54B1D0616</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T00:41+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>There has been considerable debate as to the merits and the applicability of probabilistic or statistical reasoning to Semantic Web. Much of this debate seems to have centered on the applicability of statistical methods in a supposedly deterministic setting. In this paper, we argue that statistical reasoning ("reasoning with uncertainty") need not be a substitute for traditional Description Logic (DL) / First-Order Logic (FOL) reasoning, instead statistical methods can serve as a complement to logicbased reasoning systems in two ways: (i) Offer a meta-reasoning (or audit) mechanism to validate logical reasoning, and (ii) Act as a "filler" where Ontological information either does not exist, or is insufficient to reason conclusively.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Much of the Semantic Web effort has focused on the design and development of Ontologies and related technologies. This approach presupposes that a critical mass of Ontologies will exist that can sufficiently and accurately respond to reasoning queries. As Sir Tim Berners-Lee puts it [Berners- <ref type="bibr" target="#b0">Lee, 1998</ref>]: "The choice of classical logic for the Semantic web is not an arbitrary choice among equals. Classical logic is the only way that inference can scale across the web."</p><p>However, a pure logic-based approach looks increasingly implausible given the paucity of Ontologies and the difficulty in constructing and maintaining Ontologies. Just like the World Wide Web (WWW) had a ready and mature platform to run on i.e. the Internet -which had been in existence for a long time prior to the emergence of the WWW, we feel that the Semantic Web needs an underlying platform, upon which Ontologies can function and interoperate.</p><p>We argue that this platform should be a web of statistical "metadata" -which expresses semantic relations in probabilistic terms. Such systems (e.g. Bayesian Networks, Probabilistic Relational Models) have also been in existence for a while and are used in various Machine Learning and AI applications such as Machine Vision, Speech Recogni-tion, and Robotics etc. The Semantic Web would do well to re-use some of these efforts in building this underlying framework.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Ontologies and Probabilistic Models</head><p>We introduce the notion that Probabilistic Graph Models (PGM) or Bayesian Networks can be viewed as fuzzy Ontologies; conversely an Ontology can be viewed as a crisper Bayesian Networks. In our proposed architecture, there may not be a clear dividing line between them. A good way of visualizing this relation would be to view Ontologies and Bayesian Networks as ships floating in a sea of statistical "metadata". We use this metaphor to describe the notion that the sea of statistical metadata fills-in the gaps between the islands of Ontologies. Lately there have been some efforts to develop Probabilistic Ontologies by annotating OWL or RDF Ontologies with probabilistic information e.g. BayesOWL <ref type="bibr" target="#b3">[Ding,Peng 2004]</ref>. We argue against this approach, and suggest that probabilistic and logic-based reasoning approaches should be viewed as orthogonal to each other. It makes most sense to keep the Ontological information separate from the statistical data, along the lines of how the WWW operates -in which an HTML page links to a "FTP" site or a "mailto" to an email hyperlink and the necessary protocols invoked only when clicked. We suggest, that probabilistic (or statistical) information be encoded using any of the widely accepted Bayesian Interchange Formats such as XML-BIF <ref type="bibr" target="#b1">[Cozman, 1998]</ref>, or Microsoft Research's XBN <ref type="bibr">[Microsoft, 1998]</ref> or Hugin.net <ref type="bibr" target="#b2">[Jensen, 2004]</ref> format. We propose that the Ontological model encapsulate what it is designed for -expressing logical relations between resources, and the probabilistic model express the statistical relation between them. We do not see a need to mix-and-match as they offer very different views on the same information-set and are perceptually orthogonal.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">A Hybrid Reasoning Model</head><p>Reasoning using Ontologies is based on predicate logic and belongs in the classical tradition of monotonic deductive reasoning i.e. propositions are either true or false. But this proposed framework provides a mechanism for handling fuzzier, incomplete and inaccurate inputs. In this model, reasoning can be performed using a "bottom-up" approach where a query unanswered by a pure Ontological match is extended further up the hierarchy (Fig 1 <ref type="figure">.</ref>) until all required information is found. An adjunct application might be to validate traditional reasoning with a mathematical confidence level (meta-reasoning).</p><p>Some examples of the reasoning activities possible using this system are:</p><p>1. Deductive Reasoning: Deductive reasoning allows a system to deduce information given a set of (possibly incomplete and erroneous) information. For example, it can deduce that the best course to learn "Machine Vision", "Genomics" and "Political Science" at MIT is most probably "6.804J Computational Cognitive Science" even though the course does not directly teach Political Science. It is making a best-guess fit for the requirements <ref type="bibr">[OCW, 2005]</ref>.</p><p>2. Abductive Reasoning: Abductive reasoning allows a system to infer the possible causes for a certain effect. For example, the possible courses for learning Artificial Intelligence at MIT are 6.803, 6.825 etc. This is the equivalent of diagnostic reasoning in Bayesian Networks <ref type="bibr">[OCW, 2005]</ref>. 3. Monotonic reasoning, non-monotonic reasoning and default values: Traditional DL-based Ontologies can represent information for monotonic reasoning. For example, one might declare that Universities in the US have a GPA scale of 4.0, but MIT uses a 5.0 GPA scale -so the system monotonically cannot reason with that information unless it has been explicitly encoded.</p><p>This kind of non-monotonic reasoning is possible with the proposed approach.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusion</head><p>"Reasoning with Uncertainty" is probably a misnomer to describe the efforts required in this area -a more appropriate phraseology would be "reasoning without certainty". While the difference may seem pedantic, the underlying notion is that "uncertainty" is not a state unto itself, but merely the absence of certainty. In a Semantic Web sense, it is a state where Ontological information is non-existent, incomplete or inconclusive. Statistical reasoning could therefore be the bedrock upon which DL/FOL based querying and reasoning can be performed.</p><p>This means that the semantic web can operate in areas currently out-of-bounds because of a lack of Ontological information. We therefore hypothesize that statistical "metadata" could be the building-block of the Semantic Web leading to better and more accurate reasoning mechanisms.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1</head><label>1</label><figDesc>Figure1illustrates a hierarchical mechanism of aligned Ontologies and Bayesian Networks. At the very top are the toplevel Ontologies on which there is general agreement and acceptance, at the bottom are the fuzzier, grayer-scale Bayesian Networks which represent relations between resources using probabilistic mechanisms.</figDesc></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Acknowledgments</head><p>We would like to acknowledge the gracious help and support of the members of the World Wide Web Consortium (W3C) and faculty of MIT-CSAIL for their help and support. We would especially like to thank Ralph Swick and Sir Tim Berners-Lee for their critique and feedback.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">Berners-Lee ;</forename><surname>Lee</surname></persName>
		</author>
		<ptr target="http://www.w3.org/DesignIssues/Rules.html" />
		<title level="m">Axioms of Web Architecture</title>
				<imprint>
			<date type="published" when="1998-06-20">1998. 1998. June 20, 2005</date>
		</imprint>
	</monogr>
	<note>Tim Berners-</note>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">The Interchange Format for Bayesian Networks</title>
		<author>
			<persName><forename type="first">Fabio</forename><surname>Cozman</surname></persName>
		</author>
		<author>
			<persName><surname>Gagliardi Cozman</surname></persName>
		</author>
		<ptr target="http://research.microsoft.com/dtas/bnformat/" />
	</analytic>
	<monogr>
		<title level="m">Microsoft Research, XML Belief Network File Format: Main Page</title>
				<imprint>
			<date type="published" when="1998-06-23">1998. 1998. June 23, 2005. , 1998. 1998. June 23, 2005</date>
		</imprint>
		<respStmt>
			<orgName>Microsoft</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">A Brief Overview of the Three Main Paradigms of Expert Systems</title>
		<author>
			<persName><forename type="first">V</forename><surname>Jensen ; Finn</surname></persName>
		</author>
		<author>
			<persName><surname>Jensen</surname></persName>
		</author>
		<ptr target="http://developer.hugin.com/Getting_Started/Paradigms/" />
		<imprint>
			<date type="published" when="2004-06-23">2004. June 23, 2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">A Probabilistic Extension to Ontology Language OWL</title>
		<author>
			<persName><forename type="first">Peng</forename><surname>Ding</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zhongli</forename><surname>Ding</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yun</forename><surname>Peng</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 37th Hawaii International Conference on System Sciences -2004</title>
				<meeting>the 37th Hawaii International Conference on System Sciences -2004</meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
