<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">A Hybrid Question Answering Model with Ontological Integration for Environmental Information ⋆</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Tianda</forename><surname>Sun</surname></persName>
							<email>tianda.sun@york.ac.uk</email>
							<affiliation key="aff0">
								<orgName type="institution">University of York</orgName>
								<address>
									<postCode>YO10 5DD</postCode>
									<settlement>Heslington</settlement>
									<region>York</region>
									<country key="GB">United Kingdom</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Jamie</forename><surname>Carr</surname></persName>
							<email>jamie.carr@york.ac.uk</email>
							<affiliation key="aff0">
								<orgName type="institution">University of York</orgName>
								<address>
									<postCode>YO10 5DD</postCode>
									<settlement>Heslington</settlement>
									<region>York</region>
									<country key="GB">United Kingdom</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Dimitar</forename><surname>Kazakov</surname></persName>
							<email>dimitar.kazakov@york.ac.uk</email>
							<affiliation key="aff0">
								<orgName type="institution">University of York</orgName>
								<address>
									<postCode>YO10 5DD</postCode>
									<settlement>Heslington</settlement>
									<region>York</region>
									<country key="GB">United Kingdom</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">A Hybrid Question Answering Model with Ontological Integration for Environmental Information ⋆</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">2D39584A6D35D985654EAB0C8509A53C</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T16:35+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Relation Extraction</term>
					<term>Knowledge Graph Construction</term>
					<term>Retrieval Augmentation Generation</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper presents a novel approach to constructing a Question Answering model for analysing Nationally Determined Contributions (NDC) reports within the environmental sector. The approach is based on Large Language Models (LLMs) equipped with Retrieval Augmented Generation (RAG) and enhanced by ontology integration. Acknowledging the challenges inherent in directly applying RAG, our approach begins with the development of a specialised ontology framework for NDC reports. This framework supports the construction of a knowledge graph that provides essential, verifiable information for a Question Answering (QA) model. In the next step, the model combines RAG embeddings with ontology-based queries, aiming to enhance the reliability of answers across various NDC reports. We evaluate the performance of our hybrid model through testing with a set of questions and human/AI evaluation across different LLMs. While the results indicate improvements in the efficiency of climate change-related QA models, they also underscore the complexity of achieving significant enhancements in this domain. Our findings contribute to ongoing discussions about the potential and limitations of integrating ontological methods with LLM for environmental information retrieval.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Ontologies and the knowledge graphs derived from them are valuable knowledge representation approaches in many application areas. Designing such ontologies is a laborious process requiring domain expertise and knowledge engineering skills. The process can be assisted by tapping into existing sources of knowledge containing a structured component. One example of this is using the labelled information shown in the little grey rectangle at the top of each Wikipedia page to produce a vast ontology known as DBpedia <ref type="bibr" target="#b0">[1]</ref>. Domain-specific work in the biomedical, finance <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b2">3]</ref> or clinical. Most textual information is available in a non-structured form though, and extracting concepts and relations is a valuable but challenging task <ref type="bibr" target="#b3">[4]</ref>.</p><p>Retrieval Augmented Generation (RAG) offers a transformative approach to enhancing Question Answering systems by dynamically retrieving and incorporating external knowledge during the generation process. In the context of ontologies, RAG can be particularly powerful. By integrating RAG with a robust ontology, the system can access a structured repository of domain-specific knowledge, such as those concerning environmental policies and practices outlined in Nationally Determined Contributions (NDC) reports. This integration allows the QA system to fetch relevant information and contextually adapt its responses based on ontological relationships and entities. The methodology leverages the precision of ontology-based data retrieval and the flexibility of generative models to produce more accurate and contextually relevant answers. Studies such as those by <ref type="bibr" target="#b4">Li et al. (2024)</ref> have demonstrated the efficacy of RAG in various domains by showing significant improvements in the accuracy and relevance of generated answers <ref type="bibr" target="#b4">[5]</ref>. Applying these principles, our model aims to tackle environmental data's complex terminology and inter-relation characteristics, thereby enhancing the decision-making process in climate change mitigation efforts.</p><p>In this paper, we start with these ideas to investigate how LLM can construct domain-specific knowledge graphs with limited participation of domain experts. Then, we involve our constructed ontology into a retrieval augmentation generation (RAG) system for the question and answering(QA) system to answer a set of NDC-related questions and evaluate the answer. Our contributions are as follows:</p><p>• We implement a novel environmental knowledge graph framework manually and apply it to extract information from environmental reports on climate change. This approach aims to capture important information and organise it into an ontology.</p><p>• Based on the ontology we constructed, we developed a hybrid RAG system to retrieve information from both our ontology and NDC report. Our evaluation shows that our method performs better than the baseline method.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Background</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Ontologies and Knowledge Graphs</head><p>Large knowledge graph such as Freebase <ref type="bibr" target="#b5">[6]</ref> and DBpedia <ref type="bibr" target="#b0">[1]</ref> have shown a remarkable ability to provide a well-structured data source in a wide range of NLP tasks, including question-answering <ref type="bibr" target="#b6">[7]</ref>, information retrieval <ref type="bibr" target="#b7">[8]</ref>, chatbots <ref type="bibr" target="#b8">[9]</ref>, recommendation systems <ref type="bibr" target="#b9">[10]</ref> and machine translation <ref type="bibr" target="#b10">[11]</ref>.</p><p>With structured vocabulary and predefined relationships, knowledge graphs and ontologies facilitate automated associations between data and knowledge, enabling nuanced analyses and hypothesis generation in various scientific domains such as finance <ref type="bibr" target="#b2">[3]</ref> and health care <ref type="bibr" target="#b11">[12]</ref>. However, constructing domain-specific ontology is a multifaceted, challenging task requiring the deep participation of domain experts. Researchers usually divided knowledge construction into separate NLP tasks, including named entity recognition(NER) and relation extraction(RE). Since models and NLP tools based on the general domain cannot satisfy the form of scientific corpus or data, all steps require independent fine-tuning or adjustment for different scientific requirements, as well as human annotation with domain experts <ref type="bibr" target="#b3">[4]</ref>. Some joint relation extraction methods, such as NovelTagging <ref type="bibr" target="#b12">[13]</ref> or TPLinker <ref type="bibr" target="#b13">[14]</ref>, demonstrate the concept of unifying NER and RE steps, which can mitigate the error propagation between different models and reduce the dataset annotation cost and also improve the performance for tackling overlapping or complex relation scenarios. These also push the work on scientific knowledge graph construction.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">LLM-based Relation Extraction and knowledge construction</head><p>With the rapid growth of LLMs, the few/zero-shot learning abilities via prompt learning on LLMs are widely investigated by NLP researchers. Recent progress in large language models (LLMs) like GPT series(GPT-3 <ref type="bibr" target="#b14">[15]</ref>, ChatGPT, GPT-4 <ref type="bibr" target="#b15">[16]</ref>), LLama-series(LLama <ref type="bibr" target="#b16">[17]</ref>, Alpaca <ref type="bibr" target="#b17">[18]</ref>, Vicuna <ref type="bibr" target="#b18">[19]</ref>), PaLM <ref type="bibr" target="#b19">[20]</ref> and GLM <ref type="bibr" target="#b20">[21]</ref> has showcased remarkable performance across a spectrum of natural language processing (NLP) tasks. The ongoing expansion in the number of model parameters and the size of training datasets has endowed LLMs with emergent capabilities, facilitating them to partake in in-context learning (ICL). In ICL, these models can derive insights from a limited set of demonstrative examples presented within the input context and exceed the previous baselines in multiple NLP tasks under few-shot or zero-shot settings, as well as relation extraction and knowledge construction <ref type="bibr" target="#b21">[22]</ref>.</p><p>Based on the ICL paradigm, Wadhwa et al. <ref type="bibr" target="#b22">[23]</ref> propose comparing few-shot LLM-based relation extraction and fine-tuned baselines. At the same time, the result indicates that Few-shot prompting with GPT-3 achieves near SOTA performance and can exceed the previous supervised SOTA performance when using the Chain-of-Thought (COT) <ref type="bibr" target="#b23">[24]</ref> prompt for enhancement. In addition, in the domainspecific area, Agrawal et al. <ref type="bibr" target="#b24">[25]</ref> point out that LLMs perform well at zero-and few-shot clinical relation extraction despite not being explicitly trained in clinical records. Rajpoot et al. <ref type="bibr" target="#b25">[26]</ref> developed the GPT-FinRE model using ICL and achieved a 0.718 F1 score on their financial dataset, REFinD. Moreover, some works such as LLM2KB <ref type="bibr" target="#b26">[27]</ref> attempt to connect LLM to the knowledge construction task, which aims to construct a knowledge base from input relation triples, and link to the external knowledge base such as Wikidata to capture the related information. They achieved an average F1 score of 0.6185 across 21 relations in the LM-KBC challenge held at the ISWC 2023 conference.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Retrieval Augmentation Generation(RAG)</head><p>Retrieval-augmented generation (RAG) is a method that integrates traditional large language models (LLMs) with external knowledge sources to enhance their output quality and relevance. The primary goal of RAG systems is to overcome the limitations inherent in LLMs, such as content hallucination, outdated information, and lack of traceable reasoning processes by dynamically incorporating up-todate information from external databases or other structured knowledge forms during the generation process <ref type="bibr" target="#b27">[28]</ref>. This methodology significantly improves the models' accuracy and reliability by ensuring that the generated content is contextually relevant and factually accurate.</p><p>The operational framework of an RAG system typically involves querying an external database using a retrieval mechanism that can pull relevant information based on the input query or context. This retrieved content is then fed into the LLM, enabling the model to produce outputs enhanced by the externally sourced data. The integration of retrieval capabilities allows RAG systems to remain dynamic, adapting to new data and evolving user requirements.</p><p>Using ontology in RAG systems introduces a structured way of organizing information that can significantly enhance the retrieval process. Ontologies provide a framework for representing knowledge as a set of concepts within a domain and the relationships between those concepts. This structured approach allows RAG systems to perform more precise and context-aware retrievals, essential for generating accurate and relevant outputs.</p><p>A notable example is presented in the study by Sabrina Toro et al., where they explore the "Dynamic Retrieval Augmented Generation of Ontologies using Artificial Intelligence (DRAGON-AI)" system. <ref type="bibr" target="#b28">[29]</ref> This system utilizes ontologies to dynamically generate and update knowledge, illustrating a profound enhancement in managing and utilizing structured domain knowledge effectively in real-time applications. This methodology is particularly beneficial in fields requiring precise and up-to-date information, such as biomedical and environmental sciences, where it helps mitigate the substantial collaborative efforts typically required from domain experts.</p><p>Further, Julien Delile et al. introduce a graph-based retriever that captures the long-tailed data of biomedical knowledge. Their work emphasizes the utility of integrating graph-based knowledge representations with RAG systems to handle rare or less frequently seen information better. <ref type="bibr" target="#b29">[30]</ref> By incorporating these knowledge graphs, the system achieves a more comprehensive retrieval coverage, significantly enhancing the generation capabilities of LLMs in the biomedical research domain. This approach enhances the retrieval of relevant data. It improves the overall accuracy and utility of the generated content by ensuring it reflects rare yet critical information that might otherwise be overlooked.</p><p>These studies demonstrate how ontologies and knowledge graphs can significantly enhance RAG systems by providing a more structured, nuanced, and comprehensive approach to information retrieval and generation. The structured nature of ontological frameworks allows for a deeper semantic understanding of queries and the relationships between different pieces of information, leading to more accurate and contextually appropriate outputs. However, there is still a lack of research which directly links to the ontology-enhanced RAG system for extracting information in the climate change area, which is the motivation for our research.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Case Study: Processing Climate Change Policies</head><p>To demonstrate our framework, we considered the Nationally Determined Contributions (NDCs) to the United Nations Framework Convention on Climate Change (UNFCCC). These documents, which are updated every five years, outline countries' national strategies to mitigate and adapt to climate change. Specifically, we focused on elements of NDCs pertaining to Nature-based Solutions (NBS), which, broadly speaking, is the sustainable use or use of natural features and processes to tackle socio-environmental issues (in this case, climate change).</p><p>As the urgency to mitigate and adapt to climate change becomes increasingly pressing, consideration of climate strategies and policies by analysts is likely to become a more common process. As such, the development of ontologies on relevant issues is likely to be of great utility to those involved, as they will essentially allow rapid and automated comparisons to be made between countries, institutions and other entities. Moreover, given that climate change issues typically cut across multiple themes and sectors (e.g. energy, transport, or, in the case of NBS, biodiversity conservation and natural resource management), efforts to assess (and ultimately maximise) levels of cross-sectoral alignment between policies and intended activities may also be expected to increase, and can also be facilitated through the use of topic-specific ontologies. In this regard, we hope that our efforts to develop an ontology specific to NBS will have further utility beyond this demonstration.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Method Design</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Dataset</head><p>We collected the environmental reports we needed from the Convention on Biological Diversity(CBD) and the United Nations Climate Change(UNFCCC) website. We chose 10 reports from all publications by different countries, including Albania, Angola, Antigua and Barbuda, Armenia, Australia, Azerbaijan, Bahrain, Bangladesh, Nigeria, and Papua New Guinea (PNG).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Relation Triple Extraction Process</head><p>As figure <ref type="figure" target="#fig_0">1</ref> shows, our model mainly includes two main processes: Relation Triple Extraction aims to extract important features from the given NDC reports and output them as a fixed JSON format, while Relation Triple Alignment receive the structural result combining with a pre-designed ontology framework to assign entities to the specified class.</p><p>In the Relation Triple Extraction process, we follow the general RAG structure to load different NDC reports and split them into pages. Through one LLM as a paging encoder to generate different embeddings and store them in a vector database. Then, we design a structural prompt to let LLM extract featured information with JSON-style output by computing the similarity between the page's vectors. Our prompt and example result for the structural extraction are shown in Table <ref type="table" target="#tab_1">1</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3.">NDC Ontology Framework Design</head><p>Then, to mostly capture the relation between different initiatives from different countries, using Stanford's WebProtégé <ref type="bibr" target="#b30">[31]</ref>, we create our knowledge base framework with one domain expert. The main structure(components) of the ontology and some related examples are shown as follows:</p><p>• Object properties: Object properties link pairs of instances (individuals). They describe the relationship between two individuals, e.g. country A benefits_from project B. {{ "Initiative 1": {{ "Nature-Based Solution": "", "Actor": "", "Challenge": "", "Start Date": "", "End Date": "", "Location": "", "Budget": "" }}, and start with Initiative 1, Initiative 2, etc. Please gives out as many as initiatives that mentioned in the NDC document as you can find.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Example result:</head><p>Here are some initiatives related to nature-based solutions in Papua New Guinea's NDC, based on the provided document: { "Initiative 1": { "Nature-Based Solution": "Reducing Emissions from Deforestation and Forest Degradation and Enhancement of Carbon Stocks (REDD+)", "Actor": "Papua New Guinea government, with support from development partners", "Challenge": "Forest degradation and deforestation, particularly due to logging and land use changes for agriculture", "Start Date": "2017", "End Date": "2027", "Location": "Nationwide", "Budget": "N/A" } ...</p><p>to represent specific objects or entities in the domain of interest. E.g. Indonesian is a named individual and belongs to the class Country.</p><p>Figure <ref type="figure" target="#fig_1">2</ref> shows the most general classes, as well as some of the class hierarchies in our ontology (known as Aris). We defined seven main class categories that conclude different kinds of national policy attributes in 'Action', 'Actor', 'Budget', 'Challenge', 'Document', 'Location' and 'Outcome'. Similarly, in figure <ref type="figure" target="#fig_2">3</ref>, we show the basic design for object and data properties, which are defined to reflect the relationship between classes and individuals. Figure <ref type="figure" target="#fig_3">4</ref> indicates an example ontology individual called 'FoodSecurityInNigeria' that belongs to the 'FoodSecurity' subclass, which links through several different subclass under the main class 'Challenge' and connected to the other three different main classes: 'Outcome', 'Actions' and 'Actor'. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.4.">Relation Triple Alignment Process</head><p>For the relation triple alignment process, based on the output of the last step, we develop another prompt combined with the hierarchy structure of our Aris ontology framework to let LLM assign an entity to a leaf subclass such as the 'FoodShortages' in figure <ref type="figure" target="#fig_1">2</ref>. Specifically, given a set of tree-structure class-subclass names as {JSON}, the LLM will classify the entity to a certain subclass and give the thinking process. Our prompts and example result are shown in Table <ref type="table">2</ref>, with content in parentheses used as a placeholder:</p><p>Following the progress above, we constructed the fulfilled Aris ontology with 10 countries' NDC reports. The complete version of Aris ontology contains 5923 relation triples with 401 classes and subclasses and 19 different relations. We extracted 155 environmental initiatives across 10 countries that contain nature-based solutions, challenges, actors and time period information.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.5.">RAG-based Question&amp;Answer model Construction</head><p>In the next step, to verify the performance of our ontology straightforwardly, we implement an RAGbased Q&amp;A model that combines the retrieval information from both vector database and structure ontology. As the figure <ref type="figure" target="#fig_4">5</ref> shows, the model will retrieve a set of select page content ranking by the similarity score with cosine similarity function, as well as generate related SPARQL query to achieve information from the Aris ontology to generate answers. Specifically, our method will extract entities related to the country in the question and then generate SPARQL query to extract related information   from ARIS ontology to generate the answer. To compare our result, we set the baseline as the general RAG framework with the retrieval vector from the page-split database.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Result Discussion</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">Question Set and Evaluation Setting</head><p>To evaluate our result, we propose a set of questions across the climate-change-based information we are interested in. To perform a clear structure for evaluating the result, we design the prompt prefix for the questions and let the LLM answer it by listing its answer, reason and reference. Some examples are shown in Table <ref type="table" target="#tab_2">3</ref>.</p><p>Based on these questions, we formulate our dataset across 10 countries and their combinations for single and double-country queries, then generate 40 questions for a single query and 45 questions for a double-country query. The RAG-based model will answer the questions based on the prompt to list </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Table 2 Prompt and example result for Relation Triple Alignment</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Prompt:</head><p>Here is a list string that represents some classes in an ontology: {JSON}. Now, I want to give you a named individual:{Ind}, which should belong to one of these subclasses. If we consider the JSON I give to you as a tree structure, please find the most relevant class that the individual belongs to by the meaning. If you think no leaf subclass is relevant please answer "No match found". Please only give one answer for the suitable subclass, but not two or more. Please answer this question in this format precisely without any additional explanation or information: Thinking: {{The steps you are thinking}} Answer: {{The subclass name}} Example Answer from LLM: Thinking: The individual "Papua New Guinea government, with support from development partners" is a political entity at the national level. I will search for a matching subclass in the provided ontology that represents this concept. Answer: NationalGovernment points in enumerates. We manually verified our result based on the facts in the original NDC report against the following criteria: Precision: How many answers match the facts in the NDC report? Recall: How many answers from the model cover the points in the NDC reports?</p><p>We choose Mixtral 8*7B <ref type="bibr" target="#b31">[32]</ref> as the core LLM for both embedding and ontology/RAG model. Compared to the other open-source model, the Mixtral 8*7B model has demonstrated superior performance across a variety of benchmarks, including code generation, math problem-solving, and general language understanding.</p><p>It matches or exceeds the capabilities of more extensive models like Llama 2 70B and GPT-3.5 with the efficient memory requirement and longer context length, up to 32000 tokens. We locally run the Mixtral 8*7B model with 1 H100 GPU for all the tasks we proposed above.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.">Result Analysis</head><p>The result in table <ref type="table" target="#tab_3">4</ref> shows the performance of our model. We separately test our model on the baseline model and the Hybrid RAG model with human evaluation to check the result. Compared to the baseline model, our model achieved 0.836 precision and 0.588 recall in the single-country query questions and 0.676 precision and 0.582 recall in the cross-country query questions, which indicates the success of the enhancement based on the ontology data during the RAG progress. Ontology data: {'solution': 'IntegratedWaterResourcesManagementAndWatershedProtection', 'challenge': 'WaterScarcity_Flooding_AndClimateChangeImpactsOnWaterResources', 'Actor': 'Ministry-OfEnergyAndWater', 'budget': 'N/A', 'start_date': '2020', 'end_date': '2030'}, ...</p><p>Here we can see that our model successfully answer the question with the ontology data even if the baseline hasn't retrieve the related information from the text. Our model suffer from the manual evaluation efficiency which led to a limited question dataset to expand the question type to other type of environmental problem. However, our result and example demonstrate that to integrate with the ontology we design, our model can exceed the raw RAG system and sufficient in answer the NDC-based environment questions.</p><p>While our study introduces a novel integration of ontological frameworks with retrieval-augmented generation models, it also encounters specific limitations that pave the way for future research opportunities. Firstly, the current ontology is restricted to extracting relational triples from Nationally Determined Contributions (NDC) reports across ten countries. This limitation confines the breadth of our ontological database and potentially impacts the model's applicability to global environmental data. Future efforts will aim to expand our method to include a wider range of countries, thereby enriching the ontology's diversity and representativeness.</p><p>Secondly, the scope of our evaluation is constrained by the manual effort required, limiting us to a smaller dataset of questions and answers. This has potentially restricted our ability to thoroughly assess the model's performance across more complex environmental queries and diverse data scenarios. Moving forward, we plan to develop more robust evaluation methodologies that can handle larger and more intricate datasets of environmental-related questions. This will not only improve the accuracy and reliability of our model but also enhance its utility for more comprehensive environmental information retrieval tasks.</p><p>By addressing these limitations, future research can significantly enhance the model's functionality and applicability, providing more detailed and extensive support for environmental decision-making processes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Conclusion</head><p>This study presents a novel integration of Retrieval Augmented Generation (RAG) with a specialized ontology tailored for environmental policy documents, particularly focusing on Nationally Determined Contributions (NDC) reports. The development of this hybrid model showcases an innovative approach to improving the accuracy and relevance of question-answering systems in the environmental domain. Our primary contribution lies in the formulation of a domain-specific knowledge graph that significantly augments the question-answering capabilities of large language models (LLMs) through ontology-based queries. The combination of RAG embeddings with these queries allows our model to deliver responses that are not only accurate but also deeply contextualized, drawing on the structured knowledge embedded in environmental policies. Testing across different LLMs has confirmed that our model offers a clear improvement over traditional RAG systems in handling complex environmental data. The ability to integrate and manipulate domain-specific ontologies enables the model to address intricate queries about climate change mitigation strategies and policies with a higher degree of precision. Our future work will focus on expanding the ontology's scope, refining the integration mechanisms, and exploring the application of this model to other domains requiring high accuracy and context-specific responses. This endeavour not only advances the field of environmental informatics but also contributes to the broader discussion on the potential and limitations of applying advanced AI techniques in highly specialized and impactful areas.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Model structure on LLM-based for knowledge graph completion</figDesc><graphic coords="6,132.64,65.61,330.00,366.00" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: The main structure of Aris ontology, here we are showing the design of the Actions classes and subclasses.</figDesc><graphic coords="7,200.58,94.15,194.12,612.72" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Defined Object and Data relation in Aris ontology.</figDesc><graphic coords="8,186.60,65.61,222.08,156.45" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: An example ontology individual defined in Aris ontology. It belongs to the subclass 'FoodSecurity' and is linked with the other top-level classes ('PublicWellBeing', 'ClimateVulnerability', 'ClimateRelated'), which belongs to the main class 'Challenge'</figDesc><graphic coords="8,152.74,261.34,289.80,244.50" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 5 :</head><label>5</label><figDesc>Figure 5: Structure of the hybrid RAG that mixed the knowledge graph data and retrieval content from the NDC report</figDesc><graphic coords="9,171.39,65.61,252.50,225.00" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>•</head><label></label><figDesc>Data properties: Data properties link instances (individuals) to data values. They describe the attributes of instances by assigning them specific data values, such as numbers or strings. E.g. Data property: Project A has_start_date 01/01/99.</figDesc><table><row><cell>• Class/Subclass: A class is a category or type representing a set of entities with common proper-</cell></row><row><cell>ties. Classes can inherit properties from other classes, which means they can be arranged in a</cell></row><row><cell>hierarchical structure.</cell></row></table><note>E.g. Class: Country • Named Individuals: Named Individuals are specific instances of classes. They are concrete examples or entities that belong to one or more classes. In WebProtégé, individuals are created</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 1</head><label>1</label><figDesc>Prompt for Relation Triple ExtractionPrompt: NDC, or Nationally Determined Contribution, is a climate action plan to cut emissions and adapt to climate impacts. I am interested in Country NDCs. In particular, I want to know some of the nature-based solutions proposed in those NDCs, who the actors are involved in, and what the challenges these solutions address are. I also want to know what specific initiatives are implementing these solutions, over what period, where, and with what budget. Please structure your reply as a list of labelled items, where each item is one of the following: nature-based solution, actor, challenge, start date, end date, location, and budget. If any of the information is not available, please leave it N/A. Please list as many initiatives as you can find in the document. Please output it as a JSON object as follows:</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3</head><label>3</label><figDesc>Prompt on RAG-based QA System and 2 example questions Here is the retrieval context from the NDC report, please answer the question based only on the following context: {context} Question: {question} Please think of the question step by step and provide a detailed answer, It also gives the reason and references that describe which page comes from the context of this format:</figDesc><table><row><cell>Prompt:</cell></row><row><cell>Answer:</cell></row><row><cell>Point 1:</cell></row><row><cell>Reason:</cell></row><row><cell>Point 2:</cell></row><row><cell>Reason:</cell></row><row><cell>Point 3:</cell></row><row><cell>...</cell></row><row><cell>Please provide as many answers as you can find.</cell></row><row><cell>Example Single Country Query Question:</cell></row><row><cell>"What specific climate-related challenges is {country} hoping to address through the use of nature-based</cell></row><row><cell>solutions?"</cell></row><row><cell>Example Double Country Query Question:</cell></row><row><cell>Are {country X} and {country Y} sharing any same challenges or natural-based solutions in the climate</cell></row><row><cell>change aspect?</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4</head><label>4</label><figDesc>Performance of methods across different strategies. "No, Angola is not setting measurable, time-bound targets specifically for nature-based solutions in the provided context. The document outlines Angola's commitment to reduce greenhouse gas emissions and adapt to climate change but does not explicitly mention numerical targets for nature-based solutions. " Hybrid: "Yes, Angola is setting measurable and time-bound targets with regards to nature-based solutions. The nature-based solutions mentioned in the ontology data are linked to specific challenges and have a start date of 2020 and an end date of 2030. This indicates that the country has committed to implementing these solutions over a 10-year period. The solutions being measured include Coastal Protection and Mangrove Restoration, Community-based Natural Resource Management and Conservation, Conservation and Restoration of Wetlands, Peatlands, and Other Blue Carbon Ecosystems, Disaster Risk Reduction and Early Warning Systems for Climate-related Hazards, Integrated Water Resources Management and Watershed Protection... "</figDesc><table><row><cell>Method</cell><cell>Question type</cell><cell cols="2">Precision Recall</cell></row><row><cell cols="2">Baseline (Raw RAG) Single-country</cell><cell>0.686</cell><cell>0.482</cell></row><row><cell></cell><cell>Cross-country</cell><cell>0.522</cell><cell>0.474</cell></row><row><cell>Hybrid</cell><cell>Single-country</cell><cell>0.836</cell><cell>0.588</cell></row><row><cell></cell><cell>Cross-country</cell><cell>0.676</cell><cell>0.582</cell></row><row><cell cols="4">Below we show an example from both our ontology-enhanced RAG model and the baseline:</cell></row><row><cell cols="4">Question: "With regards to nature-based solutions, is country Albania setting measurable, time-</cell></row><row><cell cols="4">bound targets? If so, what is being measured, and over what time period? "</cell></row><row><cell>Baseline:</cell><cell></cell><cell></cell><cell></cell></row></table></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">DBpedia -A large-scale, multilingual knowledge base extracted from Wikipedia</title>
		<author>
			<persName><forename type="first">J</forename><surname>Lehmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Isele</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Jakob</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jentzsch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Kontokostas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">N</forename><surname>Mendes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Hellmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Morsey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Van Kleef</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Auer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Bizer</surname></persName>
		</author>
		<idno type="DOI">10.3233/SW-140134</idno>
		<ptr target="https://content.iospress.com/articles/semantic-web/sw134.doi:10.3233/SW-140134" />
	</analytic>
	<monogr>
		<title level="j">Semantic Web</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="167" to="195" />
			<date type="published" when="2015">2015</date>
			<publisher>IOS Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Pair Trading with an Ontology of SEC Financial Reports</title>
		<author>
			<persName><forename type="first">C</forename><surname>Erten</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Chotai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">L</forename><surname>Kazakov</surname></persName>
		</author>
		<ptr target="https://eprints.whiterose.ac.uk/169191/" />
	</analytic>
	<monogr>
		<title level="m">The 2020 IEEE Symposium Series on Computational Intelligence:IEEE Symposium on Computational Intelligence for Financial Engineeting and Economics</title>
				<meeting><address><addrLine>CIFEr</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Ontology Graph Embeddings and ILP for Financial Forecasting</title>
		<author>
			<persName><forename type="first">C</forename><surname>Erten</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">L</forename><surname>Kazakov</surname></persName>
		</author>
		<ptr target="https://eprints.whiterose.ac.uk/178510/" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 30th International Conference on Inductive Logic Programming</title>
				<meeting>the 30th International Conference on Inductive Logic Programming</meeting>
		<imprint>
			<publisher>GRC</publisher>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Construction of Knowledge Graphs: State and Challenges</title>
		<author>
			<persName><forename type="first">M</forename><surname>Hofer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Obraczka</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Saeedi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Köpcke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Rahm</surname></persName>
		</author>
		<ptr target="https://arxiv.org/abs/2302.11509v1" />
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<author>
			<persName><forename type="first">J</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Yuan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Zhang</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2403.10446</idno>
		<title level="m">Enhancing llm factual accuracy with rag to counter hallucinations: A case study on domain-specific queries in private knowledge-bases</title>
				<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Freebase: a collaboratively created graph database for structuring human knowledge</title>
		<author>
			<persName><forename type="first">K</forename><surname>Bollacker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Evans</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Paritosh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Sturge</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Taylor</surname></persName>
		</author>
		<idno type="DOI">10.1145/1376616.1376746</idno>
		<idno>doi:10.1145/1376616.1376746</idno>
		<ptr target="https://dl.acm.org/doi/10.1145/1376616.1376746" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2008 ACM SIGMOD international conference on Management of data, SIGMOD &apos;08</title>
				<meeting>the 2008 ACM SIGMOD international conference on Management of data, SIGMOD &apos;08<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computing Machinery</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="1247" to="1250" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Complex Sequential Question Answering: Towards Learning to Converse Over Linked Question Answer Pairs with a Knowledge Graph</title>
		<author>
			<persName><forename type="first">A</forename><surname>Saha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Pahuja</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Khapra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Sankaranarayanan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Chandar</surname></persName>
		</author>
		<idno type="DOI">10.1609/aaai.v32i1.11332</idno>
		<ptr target="https://ojs.aaai.org/index.php/AAAI/article/view/11332.doi:10.1609/aaai.v32i1.11332" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the AAAI Conference on Artificial Intelligence</title>
				<meeting>the AAAI Conference on Artificial Intelligence</meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="volume">32</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Entity-Duet Neural Ranking: Understanding the Role of Knowledge Graph Semantics in Neural Information Retrieval</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Xiong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Sun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Liu</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/P18-1223</idno>
		<ptr target="https://aclanthology.org/P18-1223.doi:10.18653/v1/P18-1223" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics</title>
		<title level="s">Long Papers</title>
		<meeting>the 56th Annual Meeting of the Association for Computational Linguistics<address><addrLine>Melbourne, Australia</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="2395" to="2405" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">End-to-End Knowledge-Routed Relational Dialogue System for Automatic Diagnosis</title>
		<author>
			<persName><forename type="first">L</forename><surname>Xu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Gong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Liang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Tang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Lin</surname></persName>
		</author>
		<idno type="DOI">10.1609/aaai.v33i01.33017346</idno>
		<ptr target="https://ojs.aaai.org/index.php/AAAI/article/view/4722.doi:10.1609/aaai.v33i01.33017346" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the AAAI Conference on Artificial Intelligence</title>
				<meeting>the AAAI Conference on Artificial Intelligence</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="volume">33</biblScope>
			<biblScope unit="page">1</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">RippleNet: Propagating User Preferences on the Knowledge Graph for Recommender Systems</title>
		<author>
			<persName><forename type="first">H</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Xie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Guo</surname></persName>
		</author>
		<idno type="DOI">10.1145/3269206.3271739</idno>
		<idno>doi:10.1145/3269206.3271739</idno>
		<ptr target="https://doi.org/10.1145/3269206.3271739" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 27th ACM International Conference on Information and Knowledge Management, CIKM &apos;18</title>
				<meeting>the 27th ACM International Conference on Information and Knowledge Management, CIKM &apos;18<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computing Machinery</publisher>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="417" to="426" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">An Unsupervised Joint System for Text Generation from Knowledge Graphs and Semantic Parsing</title>
		<author>
			<persName><forename type="first">M</forename><surname>Schmitt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Sharifzadeh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Tresp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Schütze</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2020.emnlp-main.577</idno>
		<ptr target="https://aclanthology.org/2020.emnlp-main.577.doi:10.18653/v1/2020.emnlp-main.577" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics</title>
				<meeting>the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="7117" to="7130" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Ontologizing health systems data at scale: making translational discovery a reality</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">J</forename><surname>Callahan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">L</forename><surname>Stefanski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">M</forename><surname>Wyrwa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Zeng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ostropolets</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">M</forename><surname>Banda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">A</forename><surname>Baumgartner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">D</forename><surname>Boyce</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Casiraghi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">D</forename><surname>Coleman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">H</forename><surname>Collins</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">J</forename><surname>Davies</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Feinstein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">Y</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Martin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">A</forename><surname>Matentzoglu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Meeker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Reese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Sinclair</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">B</forename><surname>Taneja</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">E</forename><surname>Trinkley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">A</forename><surname>Vasilevsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">E</forename><surname>Williams</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><forename type="middle">A</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">C</forename><surname>Denny</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">B</forename><surname>Ryan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Hripcsak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">D</forename><surname>Bennett</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Haendel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">N</forename><surname>Robinson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">E</forename><surname>Hunter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">G</forename><surname>Kahn</surname></persName>
		</author>
		<idno type="DOI">10.1038/s41746-023-00830-x</idno>
		<ptr target="https://www.nature.com/articles/s41746-023-00830-x.doi:10.1038/s41746-023-00830-x" />
	</analytic>
	<monogr>
		<title level="j">npj Digital Medicine</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="1" to="18" />
			<date type="published" when="2023">2023</date>
			<publisher>Nature Publishing Group</publisher>
		</imprint>
	</monogr>
	<note>1 Publisher</note>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Joint Extraction of Entities and Relations Based on a Novel Tagging Scheme</title>
		<author>
			<persName><forename type="first">S</forename><surname>Zheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Bao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Hao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Xu</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/P17-1113</idno>
		<ptr target="https://aclanthology.org/P17-1113.doi:10.18653/v1/P17-1113" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics</title>
				<meeting>the 55th Annual Meeting of the Association for Computational Linguistics<address><addrLine>Vancouver, Canada</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="1227" to="1236" />
		</imprint>
	</monogr>
	<note>: Long Papers), Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">TPLinker: Single-stage Joint Extraction of Entities and Relations Through Token Pair Linking</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Zhu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Sun</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2020.coling-main.138</idno>
		<ptr target="https://aclanthology.org/2020.coling-main.138.doi:10.18653/v1/2020.coling-main.138" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 28th International Conference on Computational Linguistics, International Committee on Computational Linguistics</title>
				<meeting>the 28th International Conference on Computational Linguistics, International Committee on Computational Linguistics<address><addrLine>Barcelona, Spain (Online</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="1572" to="1582" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Language Models are Few-Shot Learners</title>
		<author>
			<persName><forename type="first">T</forename><surname>Brown</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Mann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Ryder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Subbiah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">D</forename><surname>Kaplan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Dhariwal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Neelakantan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Shyam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Sastry</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Askell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Agarwal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Herbert-Voss</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Krueger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Henighan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Child</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ramesh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Ziegler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Winter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Hesse</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Sigler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Litwin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Gray</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Chess</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Clark</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Berner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mccandlish</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Radford</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Sutskever</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Amodei</surname></persName>
		</author>
		<ptr target="https://papers.nips.cc/paper/2020/hash/1457c0d6bfcb4967418bfb8ac142f64a-Abstract.html" />
	</analytic>
	<monogr>
		<title level="m">Advances in Neural Information Processing Systems</title>
				<imprint>
			<publisher>Curran Associates, Inc</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="volume">33</biblScope>
			<biblScope unit="page" from="1877" to="1901" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<idno type="DOI">10.48550/arXiv.2303.08774</idno>
		<idno type="arXiv">arXiv:2303.08774</idno>
		<ptr target="http://arxiv.org/abs/2303.08774.doi:10.48550/arXiv.2303.08774" />
		<title level="m">GPT-4</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
		<respStmt>
			<orgName>OpenAI</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Technical Report</note>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<title level="m" type="main">LLaMA: Open and Efficient Foundation Language Models</title>
		<author>
			<persName><forename type="first">H</forename><surname>Touvron</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Lavril</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Izacard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Martinet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-A</forename><surname>Lachaux</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Lacroix</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Rozière</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Goyal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Hambro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Azhar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Rodriguez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Joulin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Grave</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Lample</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.2302.13971</idno>
		<idno type="arXiv">arXiv:2302.13971</idno>
		<ptr target="http://arxiv.org/abs/2302.13971.doi:10.48550/arXiv.2302.13971" />
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<title level="m" type="main">Stanford Alpaca: An Instruction-following LLaMA model</title>
		<author>
			<persName><forename type="first">R</forename><surname>Taori</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Gulrajani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Dubois</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Guestrin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Liang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">B</forename><surname>Hashimoto</surname></persName>
		</author>
		<ptr target="https://github.com/tatsu-lab/stanford_alpaca,publicationTitle:GitHubrepository" />
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<author>
			<persName><forename type="first">W.-L</forename><surname>Chiang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Sheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Zheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Zhuang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhuang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">E</forename><surname>Gonzalez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Stoica</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">P</forename><surname>Xing</surname></persName>
		</author>
		<ptr target="https://lmsys.org/blog/2023-03-30-vicuna/" />
		<title level="m">Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title level="m" type="main">PaLM: Scaling Language Modeling with Pathways</title>
		<author>
			<persName><forename type="first">A</forename><surname>Chowdhery</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Narang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Devlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bosma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Mishra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Roberts</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Barham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">W</forename><surname>Chung</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Sutton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Gehrmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Schuh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Shi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Tsvyashchenko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Maynez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Rao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Barnes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Tay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Shazeer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Prabhakaran</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Reif</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Du</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Hutchinson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Pope</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bradbury</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Austin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Isard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Gur-Ari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Yin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Duke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Levskaya</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ghemawat</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Dev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Michalewski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Garcia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Misra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Robinson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Fedus</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Ippolito</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Luan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Lim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Zoph</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Spiridonov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Sepassi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Dohan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Agrawal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Omernick</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Dai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">S</forename><surname>Pillai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Pellat</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lewkowycz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Moreira</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Child</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Polozov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Saeta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Diaz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Firat</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Catasta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Meier-Hellstern</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Eck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Dean</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Petrov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Fiedel</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.2204.02311</idno>
		<idno type="arXiv">arXiv:2204.02311</idno>
		<ptr target="http://arxiv.org/abs/2204.02311.doi:10.48550/arXiv.2204.02311" />
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<title level="m" type="main">GLM: General Language Model Pretraining with Autoregressive Blank Infilling</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Du</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Qian</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Ding</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Qiu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Tang</surname></persName>
		</author>
		<idno>arXiv:</idno>
		<ptr target="2103.10360" />
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<title level="m" type="main">A Comprehensive Survey on Deep Learning for Relation Extraction: Recent Advances and New Frontiers</title>
		<author>
			<persName><forename type="first">X</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Deng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Cheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Lam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Shen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Xu</surname></persName>
		</author>
		<idno>arXiv:</idno>
		<ptr target="2306.02051" />
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Revisiting Relation Extraction in the era of Large Language Models</title>
		<author>
			<persName><forename type="first">S</forename><surname>Wadhwa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Amir</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Wallace</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2023.acl-long.868</idno>
		<ptr target="https://aclanthology.org/2023.acl-long.868.doi:10.18653/v1/2023.acl-long.868" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics</title>
				<meeting>the 61st Annual Meeting of the Association for Computational Linguistics<address><addrLine>Toronto, Canada</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="15566" to="15589" />
		</imprint>
	</monogr>
	<note>: Long Papers), Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<title level="m" type="main">Chain-of-Thought Prompting Elicits Reasoning in Large Language Models</title>
		<author>
			<persName><forename type="first">J</forename><surname>Wei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Schuurmans</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bosma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Ichter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Xia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Chi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Le</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Zhou</surname></persName>
		</author>
		<idno>arXiv:</idno>
		<ptr target="2201.11903[cs" />
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Large language models are few-shot clinical information extractors</title>
		<author>
			<persName><forename type="first">M</forename><surname>Agrawal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Hegselmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Lang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Sontag</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2022.emnlp-main.130</idno>
		<ptr target="https://aclanthology.org/2022.emnlp-main.130.doi:10.18653/v1/2022.emnlp-main.130" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics</title>
				<meeting>the 2022 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics<address><addrLine>Abu Dhabi, United Arab Emirates</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="1998" to="2022" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<monogr>
		<title level="m" type="main">GPT-FinRE: In-context Learning for Financial Relation Extraction using Large Language Models</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">K</forename><surname>Rajpoot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Parikh</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.2306.17519</idno>
		<idno type="arXiv">arXiv:2306.17519</idno>
		<ptr target="http://arxiv.org/abs/2306.17519.doi:10.48550/arXiv.2306.17519" />
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<monogr>
		<title level="m" type="main">LLM2KB: Constructing Knowledge Bases using instruction tuned context aware Large Language Models</title>
		<author>
			<persName><forename type="first">A</forename><surname>Nayak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">P</forename><surname>Timmapathini</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2308.13207</idno>
		<ptr target="http://arxiv.org/abs/2308.13207" />
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<monogr>
		<author>
			<persName><forename type="first">Y</forename><surname>Gao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Xiong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Gao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Jia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Pan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Bi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Dai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Sun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Guo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Wang</surname></persName>
		</author>
		<idno>ArXiv abs/2312.10997</idno>
		<ptr target="https://api.semanticscholar.org/CorpusID:266359151" />
		<title level="m">Retrievalaugmented generation for large language models: A survey</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<monogr>
		<author>
			<persName><forename type="first">S</forename><surname>Toro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">V</forename><surname>Anagnostopoulos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Bello</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Blumberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Cameron</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Carmody</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">D</forename><surname>Diehl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Dooley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Duncan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Fey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Gaudet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">L</forename><surname>Harris</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Joachimiak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Kiani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">C</forename><surname>Lubiana</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">T</forename><surname>Munoz-Torres</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>O'neil</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Osumi-Sutherland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">P</forename><surname>Puig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Reese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">M C</forename><surname>Reiser</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Robb</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ruemping</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Seager</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Sid</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Stefancsik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Weber</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Wood</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">J</forename><surname>Haendel</surname></persName>
		</author>
		<author>
			<persName><surname>Mungall</surname></persName>
		</author>
		<idno>ArXiv abs/2312.10904</idno>
		<ptr target="https://api.semanticscholar.org/CorpusID:266359619" />
		<title level="m">Dynamic retrieval augmented generation of ontologies using artificial intelligence (dragon-ai)</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<monogr>
		<author>
			<persName><forename type="first">J</forename><surname>Delile</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mukherjee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">V</forename><surname>Pamel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Zhukov</surname></persName>
		</author>
		<idno>ArXiv abs/2402.12352</idno>
		<ptr target="https://api.semanticscholar.org/CorpusID:267751251" />
		<title level="m">Graph-based retriever captures the long tail of biomedical knowledge</title>
				<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">WebProtégé: A collaborative ontology editor and knowledge acquisition tool for the Web</title>
		<author>
			<persName><forename type="first">T</forename><surname>Tudorache</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Nyulas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">F</forename><surname>Noy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Musen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Semantic Web</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="page" from="89" to="99" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">Q</forename><surname>Jiang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Sablayrolles</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Roux</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Mensch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Savary</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Bamford</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">S</forename><surname>Chaplot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>De Las Casas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">B</forename><surname>Hanna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Bressand</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Lengyel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Bour</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Lample</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">R</forename><surname>Lavaud</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Saulnier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M.-A</forename><surname>Lachaux</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Stock</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Subramanian</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Antoniak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">L</forename><surname>Scao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Gervet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Lavril</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Lacroix</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">E</forename><surname>Sayed</surname></persName>
		</author>
		<idno>ArXiv abs/2401.04088</idno>
		<ptr target="https://api.semanticscholar.org/CorpusID:266844877" />
		<title level="m">Mixtral of experts</title>
				<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
