<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Sentiment Analysis of Digital Currency Discussions: A Machine Learning and Ontology Approaches</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Atmane</forename><surname>Hadji</surname></persName>
							<email>a.hadji@centre-univ-mila.dz</email>
							<affiliation key="aff0">
								<orgName type="department">Computer Science Department</orgName>
								<orgName type="laboratory">LISI Laboratory</orgName>
								<orgName type="institution">University Center A. Boussouf Mila</orgName>
								<address>
									<postCode>43000</postCode>
									<settlement>Mila</settlement>
									<country key="DZ">Algeria</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Farid</forename><surname>Boumaza</surname></persName>
							<affiliation key="aff1">
								<orgName type="department">Computer Science Department</orgName>
								<orgName type="institution">University of Mohamed El Bachir El Ibrahimi</orgName>
								<address>
									<addrLine>Bordj Bou Arreridj</addrLine>
									<postCode>34030</postCode>
									<country key="DZ">Algeria</country>
								</address>
							</affiliation>
							<affiliation key="aff2">
								<orgName type="laboratory">LAPECI Laboratory</orgName>
								<orgName type="institution">University of Oran1</orgName>
								<address>
									<postCode>31000</postCode>
									<settlement>Oran</settlement>
									<country key="DZ">Algeria</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Dina</forename><surname>Sirine Bali</surname></persName>
							<affiliation key="aff3">
								<orgName type="department">Department of Computer Science</orgName>
								<orgName type="institution">University Center A . Boussouf Mila</orgName>
								<address>
									<postCode>43000</postCode>
									<settlement>Mila</settlement>
									<country key="DZ">Algeria</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Sentiment Analysis of Digital Currency Discussions: A Machine Learning and Ontology Approaches</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">71DB9716CD236B1A20450B5D092E21A3</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T17:11+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Sentiment Analysis</term>
					<term>Social Networks</term>
					<term>Ontology</term>
					<term>Bitcoin</term>
					<term>Machine learning</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>A Sentiment analysis on social networks has become an increasingly important research field in recent years, driven by the rapid growth of social media and the vast amount of user-generated data. Understanding online opinions and sentiments is crucial for gaining insights into public attitudes and trends. In this study, we compare two approaches for sentiment detection: the first relies on ontologies, and the second utilizes machine learning techniques. Ontologies provide a structured framework to represent domain-specific knowledge, thus enhancing the accuracy of sentiment analysis. In the machine learning approach, we employed four algorithms: Support Vector Machines (SVM), K-Nearest Neighbors (K-NN), Decision Tree, and Random Forest. SVM demonstrated superior performance compared to other algorithms such as K-NN. Our approach was applied to sentiment analysis of Facebook discussions about Bitcoin, demonstrating the practical application of both ontology-based and machine learning techniques in the financial domain. The results highlight the effectiveness of both approaches in economic sentiment analysis, offering valuable insights into trends and sentiments that could be extended to other fields such as finance and commerce.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>In recent years, social media has become a crucial platform where users share their opinions, sentiments, and experiences, creating an abundance of exploitable textual data. This surge in information has driven the need for sentiment analysis, a field dedicated to interpreting and categorizing the emotions and opinions expressed online. Sentiment analysis has applications in diverse areas such as marketing, finance, economics, and politics, where it enables the classification of opinions as positive, negative, or neutral. In the economic context, for instance, sentiment analysis helps to understand consumer and investor perceptions and to anticipate market trends.</p><p>However, accurately extracting opinions from vast quantities of textual data remains challenging. Traditional static indexing methods often fall short in their ability to capture the nuances and context in which sentiments are expressed. To address this, two approaches stand out in the literature: the ontology-based approach and the machine learning-based approach. The former utilizes a structured representation of domain knowledge, enabling each opinion to be associated with a specific semantic meaning, enhancing interpretability. The latter approach, on the other hand, relies on machine learning models that can automatically recognize the contexts in which opinions are expressed, offering improved precision through learning algorithms such as decision trees.</p><p>In this study, we present and compare these two methods for opinion extraction from online text, focusing on economic topics such as Bitcoin. On one hand, the ontological approach is examined for its ability to provide precise semantic analysis. On the other, the machine learning approach is assessed for its capacity to recognize varied contexts automatically. This research aims to demonstrate the strengths and limitations of each method, offering insights into their applications for understanding economic trends and public perceptions in various domains.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Background and Related works 2.1. Rule-Based NLP</head><p>The extraction of Rule-based opinion extraction uses predefined patterns or guidelines to identify and extract subjective information, sentiments, or attitudes from text data. This approach is widely used in natural language processing (NLP) and sentiment analysis tasks. This approach relies on a set of predefined linguistic patterns, grammatical rules, or heuristics to process and analyze text data. These rules, designed by linguists or NLP experts, capture specific linguistic structures, sentiments, or entities within the text.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.1.">Subjectivity and Sentiment Analysis</head><p>Opinion extraction is a subtask of sentiment analysis, aiming to identify the sentiment or emotion expressed in a piece of text. Subjectivity refers to the extent to which a statement is influenced by personal feelings, opinions, or beliefs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.2.">Key Components</head><p>The "Key Components" refer to the fundamental elements or essential techniques employed in the processes of opinion extraction and sentiment analysis. These components enable the detection, structuring, and interpretation of opinions expressed in texts ,they include:</p><p>• Linguistic Patterns: Rules are typically defined based on linguistic patterns, syntactic structures, or semantic cues, including specific keywords, parts of speech, or syntactic relationships that are indicative of opinions or sentiments. • Gazetteers: A gazetteer is a list of words or phrases associated with specific categories or entities, used alongside rules to identify named entities or specific terms related to opinions. • Regular Expressions: Regular expressions are powerful tools for defining complex patterns in text and can capture various linguistic features that indicate opinions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Ontology-Based Approach</head><p>Ontology-based opinion extraction uses a structured, formal framework to represent domain knowledge, allowing for a more precise interpretation of opinions by linking opinion concepts and their relationships within an ontology. This method enhances the semantic understanding of text, enabling more contextual analysis of sentiments.</p><p>• Semantic Representation: The ontology provides a structure of concepts and relationships specific to the study domain, allowing each opinion to be linked to its semantic meaning. The concepts and relationships defined in the ontology help capture the implicit aspects of the expressed sentiments. • Knowledge Structure: Unlike static rules, ontology represents a dynamic knowledge framework, allowing adaptation to context and language variations within opinions. • Opinion Modeling: Opinions are integrated within the ontology structure, allowing them to be contextualized based on their relationships with other domain concepts, offering a more robust interpretation of the emotions and attitudes expressed.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Machine Learning-Based Approach</head><p>Machine learning-based opinion extraction uses trained models on large datasets to automatically identify sentiments and opinions in varied contexts. This approach adapts to language nuances without requiring predefined rules. This study explores and compares these two distinct methods ontology-based and machine learningbased to assess their effectiveness in opinion extraction, particularly in analyzing economic or social opinions expressed on social media. Each approach has unique strengths in terms of accuracy, semantic interpretation, and adaptability.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">Related works</head><p>This section presents the state of the art in ontology-based and machine learning-based information extraction (IE) methods. Ontology-based IE methods leverage structured knowledge representations to capture complex relationships within specific domains. These approaches were initially inspired by semantic web technologies, using ontologies to represent hierarchical and interconnected knowledge structures. Ontology-based methods are widely applied in areas such as information retrieval and natural language processing, offering advantages in precise information categorization and supporting interoperability across systems. By defining specific entities and the relationships among them, ontologybased methods enable robust and contextually relevant information extraction that improves data consistency across applications.</p><p>Several studies illustrate the utility of ontology-based approaches for IE. For instance, an ontologydriven framework <ref type="bibr" target="#b0">[1]</ref> leverages human expert knowledge to extract domain-specific information from unstructured text, adding structured information to a dedicated ontology. The system in <ref type="bibr" target="#b1">[2]</ref> integrates AI with ontology creation to facilitate clinical data extraction, enabling medical practitioners to visualize patient information effectively. Another work, OntoHuman <ref type="bibr" target="#b2">[3]</ref>, introduces an automated ontologybased method to extract key-value pairs in the field of spatial engineering, allowing user feedback to refine ontologies and improve data extraction. Additionally, OBIESOF <ref type="bibr" target="#b3">[4]</ref> is an ontology-based retrieval system for organic agriculture, structured to store and share agricultural knowledge, thus supporting future application development in this sector. A related study <ref type="bibr" target="#b4">[5]</ref> applies an ontology-based system for land use analysis, integrating relevant geographical and legal criteria to enhance decision-making capabilities.</p><p>On the other hand, machine learning (ML)-based IE methods demonstrate significant flexibility and adaptability in processing unstructured data across various domains. Unlike rule-based systems, ML algorithms-such as Support Vector Machines, Random Forest, and deep learning models-identify patterns and extract relevant information by learning from large datasets, making them highly suitable for dynamic and diverse data sources. ML models have shown exceptional results in extracting structured information from complex data sources, including text, images, and documents.</p><p>Several studies highlight the efficacy of ML-based methods. A study on clinical data <ref type="bibr" target="#b5">[6]</ref> used ML and NLP techniques to identify fracture types in radiology reports, showcasing the potential of ML for structured medical data extraction. Additionally, an information extraction system for clinical applications <ref type="bibr" target="#b6">[7]</ref> demonstrates how ML can accurately capture contextual information from radiology reports, enhancing abnormality tracking. Another research <ref type="bibr" target="#b7">[8]</ref> focused on ML-driven invoice processing, where the LayoutLM model outperformed traditional methods in handling layout variations across unstructured invoices. In the domain of misinformation detection, <ref type="bibr" target="#b8">[9]</ref> presented an ML-based approach for identifying COVID-19-related "fake news," leveraging medical features for enhanced detection accuracy. Moreover, recent works <ref type="bibr">[10][11]</ref> demonstrated the effectiveness of transformer-based models in handling handwritten digital documents and complex resume data, illustrating how advanced ML models can transform unstructured data into usable knowledge. In summary, ontology-based and machine learning-based methods provide complementary strengths in information extraction. Ontologies offer structured, contextually relevant knowledge representation, while machine learning provides scalability and adaptability, especially in dynamic data environments. Together, these methods push the boundaries of information extraction, each bringing unique advantages to various applications and contributing to a richer understanding of domain-specific data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Proposed Approach</head><p>The following architecture (Figure <ref type="figure" target="#fig_0">1</ref>) depicts the detailed design of our opinion analysis system. The proposed system consists of several stages:</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Data Collection</head><p>We get information from social network (Facebook) online. We processed comments related to fan opinions semi-automatically. We leverage the GATE platform (General Architecture for Text Engineering) to proficiently extract relevant comments from popular social media platforms such as Facebook and Twitter.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Pretreatment</head><p>In this step, we identified the comments related to the Champions League, then processed them in the next step. The filtering techniques applied to the corpus include more than one baseband. We filter the data by bypassing extra spaces and formatting elements to obtain plain text. Consequently, typos are corrected using automated and manual tools, and text normalization is followed, including the removal of special characters, spaces and punctuation.</p><p>Currently, social media worldwide is considered the most visited source for information on modern technologies like Bitcoin. Bitcoin is the most prominent cryptocurrency with the largest market capitalization. Additionally, it is a digital currency that users can only access online. Thus, online platforms play a crucial role in disseminating information to individuals about Bitcoin and how it is used. People mainly turn to social media when making purchase decisions, including buying or investing in Bitcoin, which is why we chose social media-specifically Facebook, as it gathers all segments of society.</p><p>In our study, we classified the factors influencing Bitcoin into three distinct categories: positive factors, negative factors, and neutral factors <ref type="bibr" target="#b11">[12]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.1.">Positive Factors</head><p>We identified several positive factors impacting Bitcoin's increase in value, including but not limited to rising demand, institutional adoption, inflation and economic instability, heightened media coverage, and other elements.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.2.">Negative Factors</head><p>The depreciation of Bitcoin is influenced by multiple factors, some of which include high volatility, economic crises such as wars, high-interest rates, competition from other crypt ocurrencies, difficulty in using it as currency, and additional factors. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.3.">Neutral Factors</head><p>There are also neutral elements, some of which are mentioned below: competition assessment, stability, and media updates.</p><p>The goal of extracting these factors that influence Bitcoin's value is to better understand the market and predict future trends, to enhance individuals' confidence in Bitcoin, encourage its usage, expand its application across different fields, improve the performance of exchanges and other platforms, and help more people understand this currency. Additionally, it aims to provide insight into the risks associated with investing in Bitcoin, protecting consumers from fraud.</p><p>We also focus on analyzing opinions about Bitcoin through posts and comments on Facebook regarding Bitcoin's price, satisfaction levels, and associated risks. Through this feedback, it is possible to:</p><p>• Determine the extent of Bitcoin's popularity; • Assess whether people are optimistic or pessimistic about its future and better understand their needs;</p><p>• Measure public confidence in Bitcoin, their satisfaction level, and future expectations;</p><p>• Enable developers to design new technologies to improve market efficiency;</p><p>• Facilitate transactions and raise awareness of the risks associated with investing in Bitcoin, as well as provide insight into its influence on the economy and society.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">Method 1 based Ontology</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.1.">Ontology Creation Step</head><p>The flexibility of Ontology construction is a key aspect of this study. For this process, we adopted a top-down approach: starting with identifying high-level concepts, then refining them into more specific ones within our ontology, referred to as the "Bitcoin Ontology, " which encapsulates the core knowledge of our work. This ontology was manually developed and then implemented in OWL format using the Protégé tool .</p><p>As outlined, the manual ontology development process involves the following steps <ref type="bibr" target="#b12">[13]</ref>:</p><p>• Defining the domain and scope of the ontology;</p><p>• Considering the reuse of existing ontologies;</p><p>• Listing essential terms for the ontology;</p><p>• Defining classes and establishing the class hierarchy;</p><p>• Defining properties (slots) for the classes; • Defining slot facets; • Creating instances.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.2.">Tokenization</head><p>The Tokenizer divides text into simple words such as numbers, punctuation marks and many different types. For example, we have different words in Majestic and Minuscule, and among certain types of punctuation, etc. There is a "Token" annotation in the box, it should not be changed for different applications or text types.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.3.">Sentence Splitter</head><p>The sentence splitter is a cascade of finite-state transducers that segments text into sentences. This module is required for the tagger. The separator uses a list of gazetteer abbreviations to help distinguish phrase marking points from other types.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.4.">Part Of Speech Tagger</head><p>The tagger used is a modified version of the Brill tag, which assigns a part-of-speech tag to each word or symbol in the text. It is based on a lexicon and a set of default rules, which were learned from a large corpus from the Wall Street Journal. These elements can be adjusted manually if necessary. Two additional lexicons are available: one for texts entirely in uppercase and the other for texts entirely in lowercase. To use them, simply load the appropriate lexicon, replacing the default one. In any case, the default rule set should always be used.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4.">Metode 02 Machine Learning</head><p>Machine learning is a field of artificial intelligence that enables computer systems to learn and improve automatically from experience. By using algorithms and mathematical models, it analyzes data to recognize patterns and make decisions without being explicitly programmed. Machine learning applications are diverse, ranging from speech recognition and online product recommendations to fraud detection and autonomous driving. This field is rapidly advancing due to technological progress and the increasing availability of massive datasets, opening new possibilities across many industrial and scientific sectors <ref type="bibr" target="#b13">[14]</ref>.</p><p>In this study, we investigate the application of machine learning techniques for opinion and sentiment extraction, leveraging four distinct algorithms: Support Vector Machines (SVM), K-Nearest Neighbors (K-NN), Random Forest Classifier, and Decision Tree Classifier. Each of these algorithms possesses unique characteristics and advantages, which significantly impact their effectiveness in identifying and extracting relevant information:</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4.1.">Support Vector Machines (SVM)</head><p>The Support Vector Machine (SVM) algorithm excels at classifying data by identifying the optimal hyperplane that maximally separates classes. In the realm of opinion and sentiment analysis, SVM is particularly effective for categorizing diverse types of information within complex textual data, ensuring precise and reliable classification.</p><p>For linearly separable data, the separation hyperplane can be determined by:</p><formula xml:id="formula_0">𝑊 𝑇 𝑥 + 𝑏 = 0<label>(1)</label></formula><p>• w is the weight vector (or normal) of the hyperplane.</p><p>• x is the feature vector of a data point.</p><p>• 𝑏 is the bias (offset) of the hyperplane.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4.2.">Random Forest Classifier</head><p>The Random Forest algorithm improves classification performance by leveraging an ensemble of decision trees. By combining the outputs of multiple trees, it enhances generalization and reduces the risk of overfitting, making it particularly effective for managing diverse and noisy text data. A Random Forest Classifier is an ensemble learning technique that merges the predictions of several decision trees to boost classification accuracy and mitigate overfitting. Each tree is trained on randomly selected subsets of data and features.</p><formula xml:id="formula_1">𝑦 ˆ= mode ({𝑇 𝑖 (x) | 𝑖 = 1, 2, . . . , 𝑁 })<label>(2)</label></formula><p>where:</p><formula xml:id="formula_2">• 𝑝(𝑦 = 1 | 𝑥) = 𝑇 (x)</formula><p>is the prediction of the 𝑖-th decision tree.</p><p>• 𝑁 is the number of trees,</p><p>• The mode function returns the most common class label among all trees' predictions</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4.3.">K-Nearest Neighbors (K-NN)</head><p>The K-Nearest Neighbors (K-NN) algorithm is a straightforward yet powerful technique for classification and regression tasks. It classifies a data point by analyzing the majority class among its k-nearest neighbors in the feature space. This approach is especially advantageous for addressing multi-class problems and performs effectively when the data distribution is localized, making it a practical choice for various applications. The K-Nearest Neighbors (K-NN) algorithm classifies a data point by measuring its distance to all other points in the dataset, selecting the k-closest neighbors, and assigning the class label most common among those neighbors. For a given data point x, the distance to each neighbor is computed using a metric like Euclidean distance:</p><formula xml:id="formula_3">𝑑(x, x 𝑖 ) = ⎯ ⎸ ⎸ ⎷ 𝑛 ∑︁ 𝑗=1 (𝑥 𝑗 − 𝑥 𝑖,𝑗 ) 2<label>(3)</label></formula><p>where:</p><p>• x is the input feature vector.</p><p>• x 𝑖 is the feature vector of the 𝑖-th neighbor.</p><p>• 𝑛 is the number of features.</p><p>• The class of x is determined by the majority vote among the 𝑘-nearest neighbors.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4.4.">Decision Tree Classifier</head><p>Decision trees are highly interpretable models that operate by making a series of binary decisions. They are well-suited for extracting straightforward rules from textual data and provide clarity in understanding the criteria used for classification.</p><p>A Decision Tree Classifier divides data into subsets based on specific feature values, constructing a tree-like structure where each node corresponds to a decision guided by an attribute.</p><formula xml:id="formula_4">𝐺𝑖𝑛𝑖(𝐷) = 1 − 𝑘 ∑︁ 𝑖=1 𝑝 2 𝑖 (4)</formula><p>where:</p><p>• 𝑘 is the number of classes.</p><p>• 𝑝 𝑖 is the proportion of instances belonging to class 𝑖.</p><p>The tree continues to split until it reaches a stopping criterion, such as a maximum depth or minimum number of samples per leaf.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Results and Evaluation</head><p>After running the corpus with the use of JAPE and Gazetteer rules (figure <ref type="figure" target="#fig_2">3</ref>), the system is now able to detect the entities named "Opinion Positive", "Opinion Negative" and "Opinion Neutral" corresponding to opinions on a Cryptocurrency ''Bitcoin". Following the application of the Betcoin Opinion ontology to the corpus ( Figure <ref type="figure" target="#fig_1">2</ref>), the system can now identify named entities related to opinion of Betcoin.</p><p>The data used in the dataset for the first ontology-based method is the same as that used in the machine learning approach. This dataset is annotated with a range of attributes to support effective information extraction and sentiment analysis, including the classification of sentiments into Positive Opinions, Neutral Opinions, and Negative Opinions.</p><p>These annotations aim to evaluate machine learning models designed to extract relevant opinions related to Bitcoin. Figures <ref type="figure" target="#fig_3">3 and 4</ref> illustrate the results obtained for each algorithm used in our study: Support Vector Machines (SVM), K-Nearest Neighbors (K-NN), Decision Tree, and Random Forest.</p><p>To evaluate and compare the methods we studied, we will use metrics: Precision, Recall, and F-scale. Precision refers to the correctness of the retrieval, while recall refers to the completeness of the retrieval. The F-measure provides the harmonic mean between precision and recall <ref type="bibr" target="#b14">[15]</ref>.</p><p>According to <ref type="bibr" target="#b15">[16]</ref> :</p><p>• Precision is the percentage of correctly recognized named entities (NE) among the recognized results: Precision = Number of correctly recognized NE Total number of recognized NE</p><p>• Recall is the percentage of correctly recognized named entities among the total entities that should have been recognized. It is a widely used measure in NLP evaluations:</p><formula xml:id="formula_6">Recall = Number of correctly recognized NE Total number of NE in the corpus<label>(6)</label></formula><p>• F-measure is the harmonic mean of precision and recall, providing a balanced evaluation:  </p><formula xml:id="formula_7">𝐹 -measure = 2 • (Precision × Recall) Precision + Recall<label>(7)</label></formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Analysis and Discussion</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">Analysis and Discussion Machine learning</head><p>The results obtained in this study are highly satisfactory, as demonstrated by the Precision, Recall and F-mesure (see to Figure <ref type="figure" target="#fig_2">3</ref>, Figure <ref type="figure" target="#fig_3">4</ref> and Table <ref type="table" target="#tab_1">1</ref>). This section provides an in-depth analysis of the performance of the four algorithms (SVM, K-NN, Random Forest, and Decision Tree) used for opinion detection and sentiment analysis related to Bitcoin, based on data extracted from Facebook. The performance is compared in terms of precision, recall, and F-measure for three categories of opinions: negative, neutral, and positive.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.1.">Results Analysis</head><p>• SVM: The SVM classifier achieves the best overall performance, with an average precision of 0.90, a recall of 0.86, and an F-measure of 0.86, demonstrating its robustness in sentiment classification tasks. For negative opinions, the model exhibits strong detection capabilities, as evidenced by an F-measure of 0.87. Its performance is particularly remarkable for neutral opinions, achieving an exceptional F-measure of 0.95 and a perfect recall of 1.00, highlighting its ability to accurately identify and classify neutral sentiments. However, in the case of positive opinions, while precision reaches a flawless 1.00, the relatively low recall of 0.57 reduces the overall effectiveness in this category, resulting in an F-measure of 0.73. • K-Nearest Neighbors (K-NN): The K-NN algorithm demonstrates the least effectiveness among the evaluated classifiers, with an average precision of 0.78, recall of 0.75, and F-measure of 0.76. Despite this, it performs reasonably well in detecting negative opinions, achieving an F-measure of 0.87, comparable to that of the SVM classifier. However, its performance declines notably for neutral opinions, where an F-measure of 0.78 is observed, primarily due to limited recall (0.70).</p><p>The algorithm faces significant challenges in classifying positive opinions, as reflected in its particularly low F-measure of 0.57, highlighting difficulties in accurately capturing this sentiment category. • Random Forest: The Random Forest algorithm delivers strong overall performance, achieving a precision of 0.88, recall of 0.86, and an F-measure of 0.85, underscoring its reliability in sentiment classification tasks. For negative opinions, it attains an F-measure of 0.83, which, although effective, is slightly lower compared to SVM and K-NN. Its performance in identifying neutral opinions is excellent, with an F-measure of 0.95, aligning closely with the results achieved by SVM. For positive opinions, the algorithm mirrors SVM's performance, achieving perfect precision (1.00) but exhibiting limited recall (0.57), leading to an overall F-measure of 0.73 in this category. • Decision Tree: The Decision Tree algorithm demonstrates performance comparable to Random Forest, achieving an average precision of 0.88, recall of 0.86, and an F-measure of 0.85. For negative opinions, it performs on par with SVM, achieving an F-measure of 0.87, indicating strong detection capabilities. Its classification of neutral opinions is solid, with an F-measure of 0.91, although slightly below the performance of Random Forest and SVM. For positive opinions, similar to other algorithms, the Decision Tree achieves perfect precision (1.00), but its low recall (0.57) reduces the F-measure to 0.73, highlighting challenges in effectively capturing this sentiment category.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.2.">Comparative Discussion</head><p>• Overall Performance: SVM emerges as the top-performing algorithm, excelling in handling complex data and maximizing class separation, particularly for neutral and negative opinions. K-NN, despite its intuitive design, delivers the lowest overall performance, struggling notably with positive opinions due to its sensitivity to noise and limitations in capturing complex decision boundaries. Random Forest and Decision Tree display comparable performances, effectively capturing intricate patterns through their decision-tree-based methodologies. For neutral opinions, all algorithms, except K-NN, perform admirably. SVM and Random Forest stand out, achieving perfect recall (1.00), showcasing their precision in this category. However, detecting positive opinions poses a significant challenge across all models, with consistently low recall values (0.57). This difficulty may stem from data imbalance or the inherent ambiguity in distinguishing positive </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.">Analysis and Discussion of the Tow Methods</head><p>This section presents a comparative analysis of two approaches used for sentiment analysis on Bitcoinrelated posts from Facebook: Method 1 (Ontology-based) and Method 2 (Machine Learning-based). The results (See Table <ref type="table" target="#tab_2">2</ref>) are assessed based on three sentiment categories (Positive, Neutral, and Negative) and performance metrics: precision, recall, and F-measure.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.1.">Results Analysis</head><p>• Positive Opinions: Method 1: The F-measure of 0.63 reflects moderate performance in identifying positive sentiments, limited by lower precision (0.56). Method 2: With an F-measure of 0.83, Method 2 significantly outperforms Method 1, driven by high precision (0.86) and balanced recall (0.832).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>• Neutral Opinions:</head><p>Method 1: Achieves an F-measure of 0.66, with good recall (0.80) but relatively low precision (0.57). Method 2: Excels in detecting neutral opinions, achieving an F-measure of 0.897, the highest among all categories. This is due to strong precision (0.882) and near-perfect recall (0.925).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>• Negative Opinions:</head><p>Method 1: Demonstrates acceptable performance with an F-measure of 0.71, supported by recall (0.85) and moderate precision (0.62). Method 2: Outperforms Method 1 with an F-measure of 0.86, indicating better reliability in detecting negative sentiments, with precision (0.815) and recall (0.91) both being strong.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.2.">Comparative Discussion</head><p>• Overall Performance: Method 1: while demonstrating moderate performance, relies heavily on predefined rules and domain knowledge, limiting its flexibility and adaptability to nuanced language variations in social media posts. Method 2: (Machine Learning-based) consistently outperforms Method 1 (Ontology-based) across all sentiment categories. This is largely due to its ability to learn complex patterns in data and generalize well to unseen examples. • Neutral Opinions:</p><p>Method 1: exhibits higher recall values across all categories compared to its precision, suggesting a tendency to detect more instances (including false positives). Method 2: in contrast, achieves a better balance between precision and recall, reducing false positives while maintaining strong detection rates.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Conclusion</head><p>This study has provided an in-depth evaluation and comparison of ontology-based and machine learningbased approaches for sentiment analysis of Bitcoin-related discussions on social media, specifically Facebook. The results indicate that machine learning algorithms, particularly SVM, outperform both other algorithms (such as K-NN) and the ontology-based method in terms of precision, recall, and F-measure. While the ontology-based approach offers value through domain-specific knowledge representation, it falls short in flexibility and overall performance.</p><p>The strength of machine learning lies in its adaptability to complex and heterogeneous data, whereas ontologies provide a structured framework for capturing semantic relationships. These complementary attributes highlight the potential of hybrid approaches that combine the strengths of both methodologies.</p><p>Future research could explore hybrid methods to enhance both accuracy and interpretability. Incorporating additional datasets from diverse social media platforms and employing techniques such as data rebalancing may help address biases in certain sentiment categories, particularly positive opinions. Additionally, advanced deep learning models like BERT or GPT could further improve sentiment analysis by capturing the nuanced linguistic contexts of social media discussions. Expanding these methodologies to other domains, such as economics or healthcare, could open up new avenues for sentiment analysis applications.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Declaration on Generative AI</head><p>The author(s) have not employed any Generative AI tools.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: General architecture of the proposed system</figDesc><graphic coords="5,72.00,65.61,451.29,441.26" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Results of Opinion Extraction in SVM and KNN Algorithms</figDesc><graphic coords="9,72.00,65.60,451.29,327.86" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Results of Opinion Extraction in SVM and KNN Algorithms</figDesc><graphic coords="10,72.00,65.61,451.31,370.89" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: Results of Opinion Extraction in Decision Tree and Random Forest Algorithms</figDesc><graphic coords="11,72.00,65.61,451.28,394.62" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 1</head><label>1</label><figDesc>Results of Machine Learning (Average of four Algorithms)</figDesc><table><row><cell cols="4">Machine Learning Precision Recall F-Measure</cell></row><row><cell>Negative Opinion</cell><cell>0.815</cell><cell>0.910</cell><cell>0.860</cell></row><row><cell>Neutral Opinion</cell><cell>0.882</cell><cell>0.9255</cell><cell>0.897</cell></row><row><cell>Positive Opinion</cell><cell>0.860</cell><cell>0.832</cell><cell>0.830</cell></row><row><cell>Total</cell><cell>0.880</cell><cell>0.860</cell><cell>0.850</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 2</head><label>2</label><figDesc>Results of Opinion Extraction (Ontology and ML Methods)</figDesc><table><row><cell></cell><cell></cell><cell cols="3">Precision Recall F-measure</cell></row><row><cell>Positive</cell><cell>Method 1</cell><cell>0.560</cell><cell>0.740</cell><cell>0.630</cell></row><row><cell>Positive</cell><cell>Method 2</cell><cell>0.860</cell><cell>0.832</cell><cell>0.830</cell></row><row><cell>Neutral</cell><cell>Method 1</cell><cell>0.570</cell><cell>0.800</cell><cell>0.660</cell></row><row><cell>Neutral</cell><cell>Method 2</cell><cell>0.882</cell><cell>0.925</cell><cell>0.897</cell></row><row><cell cols="2">Negative Method 1</cell><cell>0.620</cell><cell>0.850</cell><cell>0.710</cell></row><row><cell cols="2">Negative Method 2</cell><cell>0.815</cell><cell>0.910</cell><cell>0.860</cell></row><row><cell>sentiments.</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell cols="5">In terms of robustness and generalization, tree-based algorithms (Random Forest and Decision</cell></row><row><cell cols="5">Tree) demonstrate strong resilience by mitigating overfitting risks. Despite this, they slightly trail</cell></row><row><cell cols="5">behind SVM, which maintains the best overall performance in sentiment classification tasks.</cell></row></table></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">R</forename><surname>Anantharangachar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ramani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Rajagopalan</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1302.1335</idno>
		<title level="m">Ontology guided information extraction from unstructured text</title>
				<imprint>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">The use of ontology in clinical information extraction</title>
		<author>
			<persName><forename type="first">S</forename><surname>Jusoh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Awajan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Obeid</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Physics: Conference Series</title>
		<imprint>
			<biblScope unit="volume">1529</biblScope>
			<biblScope unit="page">52083</biblScope>
			<date type="published" when="2020">2020</date>
			<publisher>IOP Publishing</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Ontohuman: ontology-based information extraction tools with human-in-the-loop interaction</title>
		<author>
			<persName><forename type="first">K</forename><surname>Opasjumruskit</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Böning</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Schindler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Peters</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Cooperative Design, Visualization and Engineering</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="68" to="74" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">An ontology-based information extraction system for organic farming</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">A</forename><surname>Abayomi-Alli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Misra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">O</forename><surname>Akala</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Ikotun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">A</forename><surname>Ojokoh</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal on Semantic Web and Information Systems (IJSWIS)</title>
		<imprint>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="page" from="79" to="99" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">An ontology-based information extraction system for residential landuse suitability analysis</title>
		<author>
			<persName><forename type="first">M</forename><surname>Al-Ageili</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Mouhoub</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of Software Engineering and Knowledge Engineering</title>
		<imprint>
			<biblScope unit="volume">32</biblScope>
			<biblScope unit="page" from="1019" to="1042" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Leaving no stone unturned: using machine learning based approaches for information extraction from full texts of a research data warehouse</title>
		<author>
			<persName><forename type="first">J</forename><surname>Fiebeck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Laser</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">B</forename><surname>Winther</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Gerbel</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Data Integration in the Life Sciences</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="50" to="58" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Toward complete structured information extraction from radiology reports using machine learning</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">M</forename><surname>Steinkamp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Chambers</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lalevic</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">M</forename><surname>Zafar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">S</forename><surname>Cook</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of digital imaging</title>
		<imprint>
			<biblScope unit="volume">32</biblScope>
			<biblScope unit="page" from="554" to="564" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Automated invoice processing: Machine learning-based information extraction for long tail suppliers</title>
		<author>
			<persName><forename type="first">F</forename><surname>Krieger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Drews</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Funk</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Intelligent Systems with Applications</title>
		<imprint>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="page">200285</biblScope>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Machine learning-based identifications of covid-19 fake news using biomedical information extraction</title>
		<author>
			<persName><forename type="first">F</forename><surname>Fifita</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">B</forename><surname>Hanzsek-Brill</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zhou</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Big Data and Cognitive Computing</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page">46</biblScope>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Structured information extraction from scientific text with large language models</title>
		<author>
			<persName><forename type="first">J</forename><surname>Dagdelen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Dunn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Walker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">S</forename><surname>Rosen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Ceder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">A</forename><surname>Persson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jain</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nature Communications</title>
		<imprint>
			<biblScope unit="volume">15</biblScope>
			<biblScope unit="page">1418</biblScope>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Esgnet: A multimodal network model incorporating entity semantic graphs for information extraction from chinese resumes</title>
		<author>
			<persName><forename type="first">S</forename><surname>Luo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Yu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information Processing &amp; Management</title>
		<imprint>
			<biblScope unit="volume">61</biblScope>
			<biblScope unit="page">103524</biblScope>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Automatic opinion extraction from football-related social media: A gazetteer and rule-based approach</title>
		<author>
			<persName><forename type="first">A</forename><surname>Hadji</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-K</forename><surname>Kholladi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">NCAIA</title>
		<imprint>
			<biblScope unit="page">61</biblScope>
			<date type="published" when="2023">2023. 2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Enhancing spatial information extraction from arabic text: A hybrid approach with ontology and rule-based</title>
		<author>
			<persName><forename type="first">A</forename><surname>Hadji</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-K</forename><surname>Kholladi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Borisova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Ingenierie des Systemes d&apos;Information</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="page">1261</biblScope>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Advanced nlp methods for disaster information extraction: Analyzing jape rules, ontologies, and machine learning approaches</title>
		<author>
			<persName><forename type="first">A</forename><surname>Hadji</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">K</forename><surname>Kholladi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 3rd International Conference on Computer Science&apos;s Complex System and their Application (CCSA&apos;2024)</title>
		<title level="s">Computer Science Book Series</title>
		<meeting>the 3rd International Conference on Computer Science&apos;s Complex System and their Application (CCSA&apos;2024)</meeting>
		<imprint>
			<publisher>Springer Nature</publisher>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
	<note>In press</note>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">A hybrid ontology-based information extraction system</title>
		<author>
			<persName><forename type="first">F</forename><surname>Gutierrez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Dou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Fickas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Wimalasuriya</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Zong</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Information Science</title>
		<imprint>
			<biblScope unit="volume">42</biblScope>
			<biblScope unit="page" from="798" to="820" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">Metrics for evaluation of ontology-based information extraction</title>
		<author>
			<persName><forename type="first">D</forename><surname>Maynard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Peters</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Li</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2006">2006</date>
			<publisher>EON@ WWW</publisher>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
