<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Autopopulated ontologies for materials science. Journal of Chemical Information and Modeling</journal-title>
      </journal-title-group>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Properties in Text- A Knowledge Graph Generation Approach</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Deperias Kerre</string-name>
          <email>deperias.kerre@lirmm.fr</email>
          <email>dkerre@strathmore.edu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anne Laurent</string-name>
          <email>anne.laurent@umontpellier.fr</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kenneth Maussang</string-name>
          <email>Kenneth.Maussang@umontpellier.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dickson Owuor</string-name>
          <email>dowuor@strathmore.edu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Cascade Lasers, Semantic Web</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IES, Univ Montpellier</institution>
          ,
          <addr-line>CNRS, Montpellier</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Information Extraction</institution>
          ,
          <addr-line>Knowledge Graphs, Linked Data, Ontologies, Retrieval Augmented Generation, Quantum</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>LIRMM, Univ Montpellier</institution>
          ,
          <addr-line>CNRS, Montpellier</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>SCES, Strathmore University</institution>
          ,
          <addr-line>Nairobi</addr-line>
          ,
          <country country="KE">Kenya</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <volume>61</volume>
      <issue>9</issue>
      <fpage>1</fpage>
      <lpage>12</lpage>
      <abstract>
        <p>A well structured collection of the various Quantum Cascade Laser (QCL) design and working properties data provides a platform to analyze and understand the relationships between these properties. This analysis can result in insights into how diferent design features impact laser performance properties. Most of these QCL properties are captured in scientific text. This poses challenges in generating structured QCL properties data for exploration properties due to the specific nature of this domain. There is therefore a need for eficient methodologies that can be utilized to extract QCL properties from text and generate a semantically enriched and interlinked platform where the properties can be analyzed. There is also the need to maintain provenance and reference information on which these properties are based. Semantic Web technologies such as Ontologies and Knowledge Graphs (KGs) have proven capability in providing interlinked data platforms for knowledge representation in various domains. In this paper, we propose an approach for generating a QCL properties Knowledge Graph (KG) from text. The approach is based on the QCL ontology and a Retrieval Augmented Generation (RAG) enabled information extraction pipeline based on GPT 4-Turbo language model. The properties of interest include: working temperature, laser design type, lasing frequency, laser optical power and the heterostructure. The experimental results demonstrate the feasibility and efectiveness of this approach for eficiently extracting QCL properties from unstructured text and generating a QCL properties Knowledge Graph, which has potential applications in semantic enrichment and analysis of QCL data.</p>
      </abstract>
      <kwd-group>
        <kwd>Generation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Quantum Cascade Lasers (QCL) are semiconductor laser devices which consist of a nanometric stack
of diferent semiconductor materials and whose spectral emission is restricted to the frequency range
from about 100 GHz to 10 THz. The stacks of diferent materials are referred to as heterostructures.
Interactions between the various layers of materials result in various emission behaviours of the QCL
laser device [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The nature of the radiations emitted by these laser devices have enabled various
applications ranging from analysis of chemicals, high-resolution spectroscopy in astronomy, detection
of organic compounds in drugs etc [
        <xref ref-type="bibr" rid="ref2">2, 3, 4, 5, 6, 7</xref>
        ].
      </p>
      <p>QCL properties can be broadly classified into two categories: Design properties and the
Optoelectronic/ Working properties. The design features consists of the laser design characteristics such as
the laser design types, the material combinations used and the layer sequencing. The Optoelectronic
properties on the other hand refers to the performance behaviour of a QCL device with particular
design characteristics. Examples of working properties include Power, Temperature, Frequency etc.</p>
      <p>CEUR</p>
      <p>ceur-ws.org</p>
      <p>The working properties of a QCL device are dependent on the design features. This implies that there
is a relationship between the laser design and working properties. Understanding the relationships
between the QCL design and working properties plays an important role in the fabrication of a QCL
semiconductor device with target properties, for instance, the Working Temperature.</p>
      <p>The QCL properties are described in text where several device designs and their corresponding
working properties are proposed. Some of the laser properties mentioned in the text are based on
references in other text. Some of the QCL properties such as temperature may have several values in text
hence its important to extract the correct value of interest. A sample text description of QCL properties
is given: “GaAs/AlGaAs quantum cascade lasers based on four quantum well structures operating at 4.7
THz are reported. A large current density dynamic range is observed, leading to a maximum operation
temperature of 150 K for the double metal waveguide device and a high peak output power more than 200
mW for the single surface plasmon waveguide device” [8]. This text description contains several QCL
properties such as lasing frequency (4.7 THz), power (200 mW), working temperature (200 K) and the
heterostructure materials (GaAs/AlGaAs). The data on QCL properties therefore exist in heterogeneous
text sources in an unstructured format and therefore require a lot of efort to collect data, structure, and
explore the relationships between the various QCL properties.</p>
      <p>Methods for generation of KGs from text that involve extraction of triples from text have been
proposed. These methods are not suitable for the QCL domain, as the relations between entities are not
explicitly mentioned in the text but deduced from expert knowledge. Generation of KGs for the domain
can therefore be achieved by population of structured QCL data onto an expert-defined KG schema.
This requires generation of structured QCL data from text and a foundational ontology to provide the
KG schema semantics. There have been attempts to extract QCL properties from text using rule-based
approaches [9], indicating a possibility to generate QCL data from text. A formal representation of the
QCL properties in form of an ontology model has also been proposed [10].</p>
      <p>The existing Knowledge KGs in the materials science domain don’t capture properties for the QCL
domain and cannot therefore be readily utilized to answer queries on QCL design features and the
corresponding performance characteristics. There is therefore the need for a semantically enriched
platform that captures the QCL properties in text, their provenance information together with links
to the references for the properties. This will enable exploration of the relationships between the
various properties in the form of queries. The insights derived from these information can be used in
the fabrication of laser devices with target properties. This will also provide an interlinked platform
where both machines and humans can explore and query the QCL properties data in a FAIR (Findable,
Accessible, Interoperable, and Reusable) manner [11].</p>
      <p>In this paper, we present an approach for generating a QCL properties KG from text. The main
contributions of this paper are therefore as follows:
• We propose a Retrieval Augmented Generation (RAG) based approach for QCL property extraction
from scientific text to generate structured QCL properties data.
• We present an experimental analysis of the RAG-based approach on various Large Language</p>
      <p>Models.
• Based on the QCL Ontology and other vocabularies in the materials science domain, we implement
a mapping process to generate a KG for the QCL properties.
• We evaluate the ability of the generated Knowledge Graph in capturing domain knowledge using
sample test cases.</p>
      <p>The remaining sections of this paper is organized as follows: we give an overview of related works in
information extraction and knowledge representation in the materials science domain in section 2, the
Knowledge Graph generation workflow in section 3, the experimental results in section 4 and lastly
conclude in section 5.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <sec id="sec-2-1">
        <title>2.1. Information Extraction in Materials Science Domain</title>
        <p>Extraction of materials properties from text has gained a lot of interest recently as occasioned by the
need for accelerated materials discovery. The methodologies developed can be broadly classified in
to rule-based methods and machine learning based methods. Rule-based methods constitutes rules,
grammars and other expert-defined structures for identification of properties in specific domains. The
machine learning based approaches entails training of learning algorithms on labelled data to enable
them to learn how to identify specific materials properties in text.</p>
        <p>Examples of rule based toolkits adopted for properties extraction in materials science include
chemDataExtractor [12], LeadMine [13], ChemicalTagger[14], tmChem [15] and ChemSpot [16]. The
chemDataExtractor toolkit has also been widely adopted for materials properties extraction in other specific
use cases which includes: thermo-electric materials [17], semiconductor bandgaps [18], refractive
indices and dielectric constants [19], an auto-populated ontology of materials science [20], battery
materials [21], transition temperatures of magnetic materials [22] and quantum cascade laser properties
[9].</p>
        <p>Machine learning methods have also been adopted in the extraction of materials science properties.
Examples include generation of datasets of gold nano-particle synthesis procedures, morphologies and
size entities [23], and materials synthesis recipes [24]. Another work is on the use of the combination
of deep convolutional and recurrent neural networks for named entity recognition [25]. BERT
(Bidirectional Encoder Representations from Transformers) models have also been proposed for the analysis of
optical materials [26] and extraction of battery materials from scientific text [ 27].</p>
        <p>The emergence of generative large language models have also opened opportunities in the extraction
of materials properties from text. The models have been harnessed for text parsing in solid-state
synthesis internary chalcogenides [28] .The in-context learning method is also applied to assess the
ability of LLMs in processing materials data [29] and extracting materials data from research papers
[30]. Lastly, LLMs have also been utilised in the construction of functional materials Knowledge Graph
in multidisciplinary materials science [31].</p>
        <p>Despite the great achievements in the adoption of Information Extraction methods for extracting
materials science data from scientific text, there still exist open challenges that need to be addressed in
order to develop eficient methodologies in generating structured data for QCL properties from scientific
text. The rule-based approaches for materials properties extraction from text are domain specific and
are limited in cases where there is slight change in the text structure. The machine learning models need
retraining in order to be utilized for QCL properties extraction from text. The LLM based methodologies
present a promising direction in the extraction of materials science data from text. The models however,
require quality training data and massive computing resources in order to be fine-tuned for specific
domain properties extraction from text. For the in-context learning approaches for LLMs, there is need
to eficiently generate the best quality training examples to be used for prompting during the materials
properties data extraction.</p>
        <p>There is therefore, the need for eficient methodologies for extracting QCL properties from text to
generate structured QCL properties data. This will provide a foundation for the generation of knowledge
representation platforms for analyzing the relationship between the various QCL properties captured
in various heterogeneous textual data sources.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Knowledge Graphs in Materials Science Domain</title>
        <p>Knowledge Graphs have been proposed in representing knowledge for properties in the materials
science domain. The KGs are based on several foundational and specific domain ontologies developed
for this domain. The motivation behind these KGs is to provide an accelerated analysis of materials
science properties for various reasons, including materials discovery.</p>
        <p>Nanomine, a KG for nanocomposite materials is proposed in [32]. This KG provides a unified
platform for generating visualizations and analyzing the relationship in nanocomposite materials. The
KG generation pipeline for Nanomine involves manual extraction of data from papers, structured it
into files such as excel and uploading it to the ontology schema. Propnet is a KG proposed for a wide
range of materials science properties [33]. The KG provides a computational framework that helps
scientists to automatically calculate additional information from their datasets such as the Materials
Project database.</p>
        <p>Other KGs are also proposed for the general materials science data [34, 35, 36]. The generation
pipelines entail use of advanced NLP techniques such as Named Entity Recognition and Relationship
Extraction [34, 35] and deep-learning approaches [36]. Lastly, a materials terminology KG has been
proposed in [37] and the materials experiment KG proposed in [38]. The materials terminology KG
comprises of 8660 materials terms and their explanations automatically generated from text corpus
via NLP techniques. The materials experiment KG captures provenance information of each material
sample together with associated data and metadata.</p>
        <p>Despite adoption of KGs in the representation and exploration of materials science data, the existing
KGs cannot be readily used to represent the knowledge in QCL properties data. Nanomine KG
captures knowledge on domain-specific nanocompoiste particles hence not suitable for the QCL domain.
Propnet and the other general materials science KGs captures wide materials science concepts which
do not capture knowledge on the QCL heterostructure design properties. The relationships to other
working properties and working modes are also not captured. The materials terminology and materials
experiment KGs do not capture the relationships suitable for the QCL properties. None of the KGs can
therefore be adopted as a unified platform for representing and analyzing QCL properties data together
with its provenance information.</p>
        <p>The KG generation techniques from text need to improved in order to be adopted to the task of
generating the QCL properties KG from text. For instance, for the QCL domain, the relationships
between properties are not directly captured in text and can only be inferred from expert knowledge.
This implies that relation extraction techniques are limited in QCL KG generation from text.</p>
        <p>There is therefore need for eficient methodologies for extracting QCL properties from text to generate
structured data and utilize this data to generate a Knowledge Graph for representing QCL properties,
relationships among them and the provenance information. To the best of our knowledge, this work
presents the initial steps for implementing the task of QCL properties extraction from text and KG
generation for property exploration.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>The Knowledge Graph generation approach is composed of the following parts: Information Extraction
pipeline and the KG modeling and data enrichment part.</p>
      <sec id="sec-3-1">
        <title>3.1. Information Extraction Pipeline</title>
        <p>In this module, we explore the use of large language models in developing the pipeline. Large language
models such as GPT models are trained on general knowledge data and cannot be eficiently used on
specific domain tasks without adaptation. Fine-tuning and in-context learning have been proposed as
methods to adapt LLMs to domain specific tasks. Fine-tuning requires a lot of quality training data and
requires a large amount of computational resources. For in-context learning, the quality of the output
is dependent on the quality of the prompt and the examples it contains. In most cases especially for few
shot learning, the examples have to be manually added in the prompt.</p>
        <p>In this paper, we hypothesize that exposing the model to a labeled instruction data consisting of
sample text describing QCL properties and the corresponding extracted properties improves the model’s
performance on this task of property extraction from text. This also aligns the model’s output to the
expected format and minimizes irrelevant responses. This also eliminates the need for fine-tuning and
static prompt generation.</p>
        <p>We propose a hybrid few shot learning strategy where the few shot examples consist of the best
examples automatically generated from the instruction dataset by a RAG pipeline as opposed to normal
few shot learning where static examples have to be specified in every prompt. We adopt GPT-4 Turbo
model in our approach. This is owed to GPT-4 Turbo improved eficiency in generating responses and
the larger context window [39]. Our method is based on the Naive RAG approach [40]. As illustrated
in Figure 1, the module has three sections i.e Retrieval, Data Augmentation and Data Generation. The
rest of this subsection describes the pipeline modules.</p>
        <sec id="sec-3-1-1">
          <title>3.1.1. Retrieval</title>
          <p>An input sentence (query) containing QCL property of interest and an instruction to the large language
model is submitted by the user to the retriever. This is then forwarded by the retriever to the data
augmentation module for retrieval of similar responses based on the query.</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>3.1.2. Data Augmentation</title>
          <p>This module consists of train data embeddings stored in an embedding database (DB). This provides the
context to the model during information extraction. In our case, the context is provided by an original
QCL properties instruction dataset [41]. The dataset description is given elsewhere[42]. We sample
80 % of this dataset for training and 10 % for testing. The dataset comprises of 1040 sample sentences
containing QCL properties, an instruction to the model for information extraction together with the
corresponding properties extracted.</p>
          <p>Sentence embeddings are computed using a based pre-trained sentence transformer model[43]. We
adopt the all-mpnet-base-v21 version of the sentence transformer model in our approach. The query
embeddings are also computed by the sentence transformer model in order to allow for comparison
with the embeddings in the training data.</p>
          <p>Similarity scores between the query embeddings and the train sentence embeddings in the embedding
DB are computed using the cosine similarity metric. The examples in the embedding DB that are more
similar are retrieved. The examples and the user query are both passed to a prompt generator which
prepares a prompt based on a defined prompt template. We define a prompt template that allows parsing
a user query together with the relevant examples showing sample instructions with text containing
1https://huggingface.co/sentence-transformers/all-mpnet-base-v2
properties and the extracted properties. The generated prompt is then passed to the Generation phase.
Figure 2 shows the prompt template and Figure 6 shows a sample regenerated prompt.</p>
        </sec>
        <sec id="sec-3-1-3">
          <title>3.1.3. Generation</title>
          <p>In the generation module, a response is generated by the language model in a zero-shot manner. This
stage is flexible as any language model can be used for response generation. The model responses are
then processed to remove any incomplete records or any irrelevant responses.</p>
          <p>
            Formally, the general process of extraction of QCL properties from text based on the detailed RAG
approach is carried out as follows: we frame the QCL property extraction from text task in the form of
a conversation as follows: Given a set of input sentences  , where  = { 1,  2,  3...  }, we design a prompt
 that contains an instruction  , the Sentences  and contextual examples  = { 1,  2,  3...,   }, where
K depends on the number of examples desired for the context. This prompt is passed to the model
in order to extract a particular QCL property record R, where  = { 1,  2,  3...,   } and n implies the
number of properties in the record. It is worthwhile to note that each P is passed to M for each property
record. Algorithm 1 shows the steps followed to extract the data.
For proof of concept, we use the pipeline to generate sample structured QCL properties data for 36
QCL devices from 36 abstracts documenting QCL devices and their properties [
            <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref14 ref15 ref16 ref17 ref18 ref19 ref20 ref21 ref22 ref23 ref24 ref3 ref4 ref5 ref6 ref7 ref8 ref9">8, 44, 45, 46, 47, 48, 49, 50,
51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78</xref>
            ]. The data
is post-processed to have a clean file of all the properties extracted. We also include the metadata (DOI
and URL) for provenance information and references for referencing the various mentioned properties
during the post-processing phase. The missing values are also included in the post-processing stage.
The final data constitutes a well-structured csv file containing QCL properties data for every device
together with the associated provenance information and the references.
          </p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Knowledge Graph Modeling and Data Enrichment</title>
        <p>In this section, there are two processes that we carry out: first we define the KG model to organize the
data and secondly we map the data to enrich it. We detail them in the following subsections:</p>
        <sec id="sec-3-2-1">
          <title>3.2.1. Ontologies and Knowledge Graph Modeling</title>
          <p>A Knowledge Graph represents a semantic network of interlinked entities. Entities refer to real-world
concepts or ideas that can be identified by a unique identifier on the web. A Knowledge Graph is defined
in the form of triples, that consist of two entities and a relation (predicate) linking them. Formally, a
Knowledge graph   can be defined as   = { 1,  2,  3...,   } and  refers to the various triples in the KG
and n the number of triples in the KG. A triple  = {, , } where  is the subject,  the predicate„ and 
the object.  ,  and  are denoted by Resource Identifiers in the form of Uniform Resource Identifiers
(URIs) or Literals (e.g., strings, numbers, dates). The semantics of a KG are provided by ontologies or
standard vocabularies.</p>
          <p>In our case, the entities are the various QCL properties, the various relationships among them, and the
provenance information for these properties. In order to define a KG schema to represent the knowledge
for QCL properties, we adopt concepts from several ontologies that capture the terms of interest for the
QCL properties and the relationships between them. The semantics for the QCL properties and the
relationships between them are provided by the QCL ontology2 [10]. This is an ontology for properties
in the QCL domain hence suitable for capturing knowledge on QCL terms.</p>
          <p>
            The other vocabularies adopted in the schema include concepts from BIBO3 and schema.org4. Formal
definitions for working properties are adopted from the Materials Design Ontology (MDO) [
            <xref ref-type="bibr" rid="ref25">79</xref>
            ]. The
provenance ontology is used to model the provenance information of the QCL properties. Figure 3
shows the KG schema used to organize the data (with the instances in shaded boxes) and Table 1 shows
the prefixes and URIs for the namespaces used in the KG schema.
          </p>
          <p>The KG schema covers the following categories of information that we enrich: laser heterostructure
(heterostructure materials), laser working properties (power, temperature and the lasing frequency), laser
design types, provenance information and citation tracks. The heterostructure captures the materials
stacking properties of the laser and the related design. It is defined by the concept
QpOnto:LaserHeterostructure. A heterostructure contains heterostructure materials
(QpOnto:HeterostructureMaterials) and the material combination has a formula (QpOnto:matFormula) which indicates the materials
used and the ratio of combination in a string. A heterostructure also has a design type. Examples of
design types includes the resonant phonon and the LO phonon design types.</p>
          <p>The optoelectronic properties captures the QCL performance behaviour as a result of injection of
current. These includes the working temperature, power and frequency. These properties are related to
particular design information i.e design types and heterostructure materials. The working temperature
also depends on the laser working mode i.e whether the emission is in continuous or pulse mode.
2https://github.com/DeperiasKerre/qcl_Onto/blob/main/qclontology/version-1.0/qclonto.owl
3https://dcmi.github.io/bibo/
4https://schema.org/</p>
          <p>The QCL properties provenance information is also captured in the schema. This is implemented by
capturing the metadata (DOI and URL) of the articles documenting the various laser properties. We
also provide links to references in those articles as some of the properties proposed in a given design
are based on another design in a referenced article. This is done using the concept BIBO:cite from
the BIBO ontology. The semantics of the units, quantity kinds, numerical values and their associated
relationships for the QCL working properties are modeled by re-using the terms in the QUDT ontology.</p>
        </sec>
        <sec id="sec-3-2-2">
          <title>3.2.2. Data Enrichment</title>
          <p>In this phase, several steps are carried out in order to populate the KG schema with the structured
QCL properties data generated in section 3.1 in order to generate the KG. The mapping process entail
mapping the data literal values to the respective entities via data properties and linking them through
the object properties as defined by the KG schema. We also specify the data types for all the data
instances. This is implemented using the rdflib library 5. The units, design types and the relevant
working modes descriptions are also enriched with the relevant URIs. The redundant triples are also
examined and eliminated from the generated Knowledge Graph.</p>
          <p>
            The final RDF file is then serialized into the Turtle and the RDF/XML formats in order to generate
the Knowledge Graph for querying and exploration. The generated Knowledge Graph contains a
total of 2979 triples containing the QCL properties and their associated provenance information. A
visualization of a sample instance of a QCL heterostructure (capturing the design type and material
combination information) and its provenance information is shown in Figure 4 with the data values
and the provenance information adopted from [
            <xref ref-type="bibr" rid="ref17">70</xref>
            ] .
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Experiments and Results</title>
      <p>In this section, we carry out experiments to evaluate the KG generation approach. This is done is
based on three strategies: the performance of the approach in QCL property extraction from text, the
consistency and the correctness of the generated KG in terms conformance to the QCL properties
domain requirements i.e the ability to capture the intended knowledge correctly. We also provide an
analysis of the completeness of the proposed KG.</p>
      <sec id="sec-4-1">
        <title>4.1. Property Extraction From Text</title>
        <sec id="sec-4-1-1">
          <title>4.1.1. Evaluation Metrics</title>
          <p>
            For the evaluation of the information extraction module, we adopt the expert validation approach.
This entails comparison of the model’s output with the expert annotated ground truth label in the
evaluation dataset. We use the precision and recall in order to evaluate the performance of the approach
on QCL property extraction from text[
            <xref ref-type="bibr" rid="ref26">80</xref>
            ]. Precision is the fraction of correct (relevant) records among
all extracted records and the recall is the fraction of successfully extracted records among all correct
(relevant) records in the dataset. The metrics are determined as follows:
   =
          </p>
          <p>+  
(1)
 =   (2)</p>
          <p>+  
TP refers to the true positive count (the number of correct records extracted), FP is the false positive
count (the number incorrect records extracted), and FN corresponds to the false negative count (the
number of correct records that are not extracted). We define the terms correct and incorrect in our
context as follows:
Definition 1:</p>
          <p>The word “correct” in this context implies that the property value extracted
can be validated by a human expert when reading the corresponding sentence containing the property.
It should also match with the ground truth label value in the evaluation dataset. Property values with
units are only considered correct if they are extracted together with the units.</p>
          <p>Definition 2: An “incorrect” (false) record suggests that the value extracted does not
correspond to the correct/expected value as compared to the ground truth value in the evaluation
dataset.</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>4.1.2. Evaluation Dataset and Baselines</title>
          <p>We evaluate the information extraction pipeline on a test dataset obtained from the instruction dataset
described in section 3.1. This contains a total of 130 sentences containing the diferent QCL properties.
Table 2 gives the summary statistics of the test data per QCL property.</p>
          <p>For the baseline, we compare the hybrid few-shot learning approach (RAG-enhanced approach) with
direct prompting where no contextual information is provided. We evaluate our approach on GPT
4-Turbo and the Mistral-7b-Instruct model. We don’t perform evaluations on GPT-4 due to the context
window limitations. We analyze the performance per QCL property as diferent properties have varying
levels of dificulty during the extraction process.</p>
        </sec>
        <sec id="sec-4-1-3">
          <title>4.1.3. Evaluation Results</title>
          <p>We present the results for QCL property extraction from text analyzed per QCL property as the properties
have a varying level of dificulty during the extraction process. The results are as shown in Table 3 for
the laser power, Table 4 for working temperature, Table 5 for laser heterostructure materials, Table 6
for lasing frequency, Table 7 for the laser design type and Table 8 for the average performance of the
models for all the properties.</p>
          <p>The laser power, working temperature and frequency exhibit higher precision in the extraction
process in both direct prompting and the proposed approach. This is attributed to the fact that these
properties are generally available in the general knowledge of the models. However, there is lower
recall for the simple queries for these properties. This is in cases where more than one value of these
properties are mentioned and the models struggle to identify the required value hence failing to extract
these properties. This is also the case when the values are given in ranges, for instance, the lasing
frequency. Provision of examples capturing such scenarios improves the model performance on these
properties. This is indicated by the best performance exhibited by the GPT 4-Turbo RAG based approach
as shown in Tables 3, 4 and 6. It is also noted that there is a significant improvement in performance for
both the precision and recall for the lasing frequency and the working temperature with the RAG based
approach(Tables 4 and 6). This is due to the ability of the model to learn how to identify the properties
of interest based on the provided examples.</p>
          <p>The laser design type property entails a QCL domain specific property. The models exhibits higher
precision in cases where the keyword “design type” is explicitly used in the text description but fails
totally to recognize and extract the property in cases where the key word is not used. For both models,
there is an improvement in generalization (in terms of precision) as the models utilize the labeled
examples to recognize this domain specific terms. There is however a decrease in recall for the
GPTTurbo model. This is attributed to cases where the model does not completely extract terms with more
variations form the context documents. In this case, the Mistral-instruct-7b model exhibits the best
performance with the proposed RAG approach as shown in Table 7.</p>
          <p>The QCL heterostructure materials property is also a QCL domain specific property. With this
property, there is no significant improvement with the proposed approach for all the models. Despite the
heterostructure being a domain specific concept, its description is characterized by the terms“‘structure”,
“heterostructure” or “materials” which enables the models to identify the properties at relatively higher
precision even with direct prompting. The best performance is exhibited by the mistral 7b-instruct
model in a direct prompting format (Table 5).</p>
          <p>In summary, exposure of large language models to quality labeled data improves their ability in
recognizing and extracting the relevant QCL properties. Averagely, an improvement of performance is
achieved as follows: precision +3.59 %, recall +3.08 % for GPT-4 Turbo and precision +12.67 %, recall
+11.32 % for Mistral-7b-instruct model. The proposed approach enables the update of the models context
without full fine-tuning that is computationally expensive. With the proposed approach, it’s even
possible to train the model to adopt a certain output format for the extracted properties to avoid any
unnecessary responses or undesired output formats. This approach can be extended for the other
QCL properties as it enables the model to learn how to identify the domain specific properties with
the provided examples with less resources. With this approach, the model’s performance is however
dependent on data quality and the properties covered. The model’s performance therefore increases
with more diverse examples in the model’s context.</p>
          <p>The performance of the model (in terms of the recall) is still low for some properties such as the
design type due to the domain specific nature of this property and the varying styles in which it is
expressed in text as compared to other properties. More training strategies can be explored to improve
the model’s performance on such properties. This can vary from training methods to developing diverse
datasets for the task of extracting such properties.</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Knowledge Graph Evaluation</title>
        <p>The generated Knowledge Graph is evaluated based on two metrics: the consistency in the KG triples,
the correctness of the KG in capturing the intended knowledge in the QCL domain properties and the
completeness of the KG. The consistency ensures the logical soundness of the defined triples in the
generated KG. The correctness of the KG in terms of the domain requirements aims to evaluate the
generated KG‘s ability in capturing the domain knowledge of interest and being able to provide answers
to questions regarding the various QCL properties.</p>
        <sec id="sec-4-2-1">
          <title>4.2.1. Logical Consistency</title>
          <p>
            The Knowledge Graph consistency is validated by lack of inconsistencies/contradictions in the generated
triples. This is implemented using logical reasoners. We the validate the KG consistency using the
pellet reasoner [
            <xref ref-type="bibr" rid="ref27">81</xref>
            ].
          </p>
        </sec>
        <sec id="sec-4-2-2">
          <title>4.2.2. Knowledge Graph Correctness</title>
          <p>Knowledge Graph correctness refers to the accuracy/relevance of the facts stored in the KG. Under this
we evaluation metric, we adopt the expert validation approach to assess the ability of the proposed KG
in capturing the domain requirements correctly. With the help of QCL domain experts, we define a set
of test cases in form of competency questions (CQs). The CQs comprise of set of inquiries that can be
utilized by experts to query and explore the various QCL properties and the relationship between them.
The responses obtained from the queries can be used for insights regarding the design of optimal QCL
devices with target properties. This enables a quicker exploration of QCL properties from heterogeneous
data sources.</p>
          <p>The competency questions capture the whole QCL properties of interest i.e the design, working
properties, the laser working modes and the provenance information. Table 9 shows the classes of the
CQs. The X,Y and Z are place holders for any particular properties of interest to be used in the queries.
For every class of queries in Table 9, we design and run several possibilities of the queries. We compare
the queries output with the expected output in order to determine the precision in question answering
by the Knowledge Graph.</p>
          <p>We run a total of 20 queries in order to validate the suitability of the generated KG in capturing the
QCL properties, the relationships between them and their provenance information. Table 10 shows
What are the possible material compositions of a QCL laser heterostructure with a
design type X ?
What is the working property X of a QCL laser working in mode Y ?
What is the performance property X of a QCL laser having a heterostructure with
material composition Y?
For a particular performance property X, what are the corresponding laser
heterostructure designs?
For a particular performance property X, what are the corresponding heterostructure
material compositions?
What are the the DOIs and/ the URLs of the scientific articles documenting a laser
with performance property W or with heterostructure materials X or working mode Y
or design type Z.</p>
          <p>What are the DOIs and URLs of the articles being referenced by a QCL device with
property W or with heterostructure materials X or working mode Y or design type Z?
the specific queries run for each class of queries specified in Table 9. The queries range from simple to
complex queries regarding the QCL properties and their provenance information.</p>
          <p>We present an example of a scenario where an expert requires specific information regarding
a QCL property and its relation to another property in order to make decisions regarding an
optimal QCL design. We illustrate how this information can be retrieved via the KG using queries in Table 10.
Example 4.1: Consider a scenario where a QCL expert is interested in the possible
heterostructure materials composition of a QCL device with a certain working property, for instance, a lasing frequency
value greater than 1.5 THz.</p>
          <p>The question in example 4.1 can be captured by a query for CQ 5.2 in Table 10. A corresponding
SPARQL query and the retrieved results for query 5.2 are shown in Figure 5. All the queries are
successfully answered by the generated KG and the complete results are available in the GitHub
repository 6.</p>
          <p>The ability of the generated KG in successfully answering the competency questions indicates its
capability in capturing QCL properties information from the various textual sources. This provides
a unified platform that allows exploration of QCL properties in a structured manner to be able to
derive insights on the relationship between the properties as opposed to manually exploring the textual
documents for these properties. This is useful in scenarios where there is need for a quicker comparison
of the various QCL properties for instance, the working properties for a particular laser design.</p>
          <p>The ability to capture the provenance information for the QCL properties also makes it possible to
track the sources of this information via permanent identifiers such as the DOI and the URL. With the
generated KG, it is also possible to have a linked access to the references for the various QCL properties
as some of the properties are based on other properties mentioned in the references. This therefore
provides an eficient way of accessing this information for quicker analysis.</p>
        </sec>
        <sec id="sec-4-2-3">
          <title>4.2.3. Knowledge Graph Completeness</title>
          <p>
            Knowledge Graph completeness estimates the proportion of information contained in the KG in
relation to the required information [
            <xref ref-type="bibr" rid="ref28">82</xref>
            ]. This metric is important in checking the possibility crucial
information being left out in the KG. In this work, we assess the completeness of the KG in two
dimensions: Schema completeness and Property Completeness. Schema completeness refers to the
degree to which classes and properties are presented in a schema. On the other hand, property
completeness refers to the extent of the missing property values of a specific kind of property [
            <xref ref-type="bibr" rid="ref29">83</xref>
            ].
Schema Completeness: In order to assess the schema completeness, we establish the mandatory
properties for a class in order to determine the missing facts in a class [
            <xref ref-type="bibr" rid="ref30">84</xref>
            ]. A mandatory property for
a given class instance refers to a relation that every instance of the class should be involved in, for
instance in our case, every QCL optolectronic property should have a unit.
          </p>
          <p>In order to determine the mandatory properties, we consider the following conditions: (i) QCL
properties class instances (design/working properties) should be linked to the relevant provenance
information (should contain links to papers describing them), (ii) The working temperatures should
have a corresponding working mode, (iii) The optoelectronic properties should be linked to units, have
quantity kind, quantity value and lastly (iv) The optoelectronic characteristics should be related to
corresponding design features. The mandatory relations are therefore wasAttributedTo,
correspondsToWorkingMode, hasQuantityValue, hasQuantityKind, hasUnit and relatesToHeterostructure. For each of
these mandatory attributes, we determine the the ratio of instances that actually have the properties in
the data captured by the KG. The results are presented in Table 11.</p>
          <p>Mandatory Rela- Number of Instances in the KG
tion (Expected to have Relation)
wasAttributedTo 100
correspondsToWork- 36
ingMode
hasQuantityValue 79
hasQuantityKind 79
hasUnit 69</p>
          <p>As illustrated in Table 11, all the mandatory relations are adequately captured in the KG schema
with no information missing for the KG classes. This implies that all the class instances are adequately
associated with other instances, hence validating the KG schema quality.</p>
          <p>Property Completeness: In order to assess the property completeness, we consider the data
properties and check whether there is any missing information for these properties. We assert that
all articles in the KG should have DOIs, all heterostructure materials should have a material formula,
and all quantity values for the optoelectronic properties should have numeric values. We determine
the ratio of the values as compared to the data properties for the instances in the KG. We present the
results in Table 12.</p>
          <p>The results in Table 12 indicate that all data values for property instances are captured in the KG.
This implies that all articles in the KG have DOIs, all heterostructure materials have a corresponding
material formula and all the quantity values in the KG have a corresponding numerical values. This
validates the property completeness of the generated KG.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion and Future Work</title>
      <p>In this paper, we address the issue of semantic enrichment of QCL properties in text by presenting an
approach for generating a Knowledge Graph for QCL properties from text. The approach is composed of
an information extraction pipeline for extracting QCL properties from text based on an LLM enabled RAG
approach and the data enrichment part where all the data is mapped and the relationships interlinked.
We evaluate the performance of the approach in QCL property extraction from text and the correctness
of the generated Knowledge Graph in modeling the knowledge in the QCL properties domain.</p>
      <p>The proposed information extraction approach presents competitive results indicating the model’s
ability to learn how to identify domain specific properties with the help of curated examples. The
generated Knowledge Graph indicates its ability in modeling the knowledge in the QCL properties and
their provenance information hence providing a semantically enriched, unified platform for quicker
analysis and insights regarding the fabrication of QCL laser devices with target properties. Our work
represents an important step towards the development of automated methods for extracting and
representing complex scientific knowledge regarding QCL properties from text. This can be extended to
other domains in order to develop methods for KG generation from text especially for domain specific
KG generation from text. We believe that this approach has the potential to transform the way that
researchers interact with scientific literature, and open up new avenues for discovery of facts in scientific
literature.</p>
      <p>The generated Knowledge Graph is however based on a limited number of articles and properties.
This can be extended with more articles and other QCL properties such as the layer sequences, thickness,
the current density among others in an incremental way using the same pipeline in an incremental way.
There is also the need for more analysis of the proposed approach on other QCL properties with more
diverse datasets. The concepts in the KG as represented by the QCL ontology can be extended using
various AI techniques for instance, the use of LLMs in learning and extracting the new entities/concepts
in the scientific literature for ontology population. This will enable timely update of the concepts in
the ontology and the KG in general. Future works may include extending the KG with more data,
concepts and proposing learning methods for the QCL laser working properties prediction based on
design features. An example of learning methods for understanding the relationship between the design
features that can be explored entails the use of KG embeddings for predicting the relationship between
the QCL properties.</p>
    </sec>
    <sec id="sec-6">
      <title>Availability of Materials</title>
      <p>The source code and the materials used for the production of this work are publicly available at our
GitHub repository: https://github.com/DeperiasKerre/qKG.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgement</title>
      <p>This work was funded by the French Embassy in Kenya (Scientific and Academic Cooperation
Department) and the CNRS (under the framework “Dispositif de Soutien aux Collaborations avec l’Afrique
sub-saharienne”). The authors would also like to thank Strathmore University, School of Computing
and Engineering Sciences, and the Doctoral Academy for creating an opportunity for this work to be
produced.</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <p>The author(s) have not employed any Generative AI tools.
[3] Federici, J. F., Schulkin, B., Huang, F., Gary, D., Barat, R., Oliveira, F., &amp; Zimdars, D. (2005). THz
imaging and sensing for security applications—explosives, weapons and drugs. Semiconductor
science and technology, 20(7), S266. https://doi.org/10.1088/0268-1242/20/7/018
[4] Jepsen, P. U., Cooke, D. G., &amp; Koch, M. (2011). Terahertz spectroscopy and imaging–Modern
techniques and applications. Laser &amp; Photonics Reviews, 5(1), 124-166. https://doi.org/10.1002/lpor.
201000011
[5] Wubs, J.R., Macherius, U., Weltmann, K.D., Lü, X., Röben, B., Biermann, K., Schrottke, L., Grahn,
H.T. &amp; Van Helden, J.H.(2023). Terahertz absorption spectroscopy for measuring atomic oxygen
densities in plasmas. Plasma Sources Science and Technology, 32(2), 025006. https://doi.org/10.1088/
1361-6595/acb815
[6] Richter, H., Wienold, M., Schrottke, L., Biermann, K., Grahn, H. T., &amp; Hübers, H. W. (2015). 4.7-THz
local oscillator for the GREAT heterodyne spectrometer on SOFIA. IEEE Transactions on Terahertz
Science and Technology, 5(4), 539-545. https://doi.org/10.1109/TTHZ.2015.2442155
[7] Shur, M., &amp; Liu, X. (2022, March). Biomedical applications of terahertz technology. In Advances in
Terahertz Biomedical Imaging and Spectroscopy (Vol. 11975, p. 1197502). SPIE. https://doi.org/10.
1117/12.2604800
[8] Ohtani, K., Turčinková, D., Bonzon, C., Benea-Chelmus, I.C., Beck, M., Faist, J., Justen, M., Graf, U.U.,
Mertens, M. &amp; Stutzki, J. (2016). High performance 4.7 THz GaAs quantum cascade lasers based on
four quantum wells. New Journal of Physics, 18(12), 123004. https://doi.org/10.1088/1367-2630/18/
12/123004
[9] Kerre, D., Laurent, A., Maussang, K., &amp; Owuor, D. (2023, August). A text mining pipeline for mining
the quantum cascade laser properties. In European Conference on Advances in Databases and
Information Systems (pp. 393-406). Cham: Springer Nature Switzerland. https://doi.org/10.1007/
978-3-031-42941-5_34
[10] Kerre, D., Laurent, A., Maussang, K., &amp; Owuor, D.(2025). A Concise Ontological Model of the
Design and Optoelectronic Properties in the Quantum Cascade Laser Domain. Semantic Web, 16
(4), 1-17. https://doi.org/10.1177/22104968251359870
[11] Wilkinson, M.D., Dumontier, M., Aalbersberg, I.J., Appleton, G., Axton, M., Baak, A., Blomberg, N.,
Boiten, J.W., da Silva Santos, L.B., Bourne, P.E. &amp; Bouwman, J. (2016). The FAIR Guiding Principles
for scientific data management and stewardship. Scientific data, 3(1), 1-9. https://doi.org/10.1038/
sdata.2016.18
[12] Swain, M. C., Cole, J. M. (2016). ChemDataExtractor: a toolkit for automated extraction of chemical
information from the scientific literature. Journal of chemical information and modeling, 56(10),
1894-1904.https://doi.org/10.1021/acs.jcim.6b00207
[13] Lowe, D. M., Sayle, R. A. (2015). LeadMine: a grammar and dictionary driven approach to entity
recognition. Journal of cheminformatics, 7(1), 1-9.https://doi.org/10.1186/1758-2946-7-S1-S5
[14] Hawizy, L., Jessop, D. M., Adams, N., Murray-Rust, P. (2011). ChemicalTagger: A tool for
semantic text-mining in chemistry. Journal of cheminformatics, 3, 1-13. https://doi.org/10.1186/
1758-2946-3-17
[15] Leaman, R., Wei, C. H., Lu, Z. (2015). tmChem: a high performance approach for chemical named
entity recognition and normalization. Journal of cheminformatics, 7(1), 1-10. https://doi.org/10.
1186/1758-2946-7-S1-S3
[16] Rockt¨aschel, T., Weidlich, M., Leser, U. (2012). ChemSpot: a hybrid system for chemical named
entity recognition. Bioinformatics, 28(12), 1633-1640. https://doi.org/10.1093/bioinformatics/bts183
[17] Sierepeklis, O., Cole, J. M. (2022). A thermoelectric materials database auto-generated from the
scientific literature using ChemDataExtractor. Scientific Data, 9(1), 648. https://doi.org/10.1038/
s41597-022-01752-1
[18] Dong, Q., Cole, J. M. (2022). Auto-generated database of semiconductor band gaps using
chemdataextractor. Scientific Data, 9(1), 193. https://doi.org/10.1038/s41597-022-01294-6
[19] Zhao, J., Cole, J. M. (2022). A database of refractive indices and dielectric constants auto-generated
using chemdataextractor. Scientific data, 9(1), 192. https://doi.org/10.1038/s41597-022-01295-5
[20] Mavracic, J., Court, C. J., Isazawa, T., Elliott, S. R., Cole, J. M. (2021). ChemDataExtractor 2.0:
knowledge graph. Digital Discovery, 2(4), 909-914. https://doi.org/10.1039/D3DD00067B
[39] OpenAI. 2023. New Models and Developer Products Announced at DevDay. https://openai.com/
index/new-models-and-developer-products-announced-at-devday/. Accessed: 2024-09-28.
[40] Gao, Y., Xiong, Y., Gao, X., Jia, K., Pan, J., Bi, Y., Dai, Y., Sun, J., Wang, H. &amp; Wang, H. (2023).
Retrievalaugmented generation for large language models: A survey. arXiv preprint arXiv:2312.10997.
https://doi.org/10.48550/arXiv.2312.10997
[41] Kerre, D., Laurent, A., Maussang, K., &amp; Owuor, D., (2024). An Instruction Dataset for
Extracting Quantum Cascade Laser Properties from Scientific Text. Recherche Data Gouv: DOI:
10.57745/U3U7XR. https://doi.org/10.57745/U3U7XR
[42] Kerre, D., Laurent, A., Maussang, K., &amp; Owuor, D. (2025). An instruction dataset for extracting
quantum cascade laser properties from scientific text. Data in Brief, 58, 111255. https://doi.org/10.
1016/j.dib.2024.111255
[43] Song, K., Tan, X., Qin, T., Lu, J., &amp; Liu, T. Y. (2020). Mpnet: Masked and permuted pretraining
for language understanding. Advances in neural information processing systems, 33, 16857-16867.
https://doi.org/10.5555/3495724.3497138
[44] Kumar, S., Williams, B. S., Hu, Q., &amp; Reno, J. L. (2006). 1.9 THz quantum-cascade lasers with
one-well injector. Applied Physics Letters, 88(12). https://doi.org/10.1063/1.2189671
[45] Williams, B. S., Callebaut, H., Kumar, S., Hu, Q., &amp; Reno, J. L. (2003). 3.4-THz quantum cascade
laser based on longitudinal-optical-phonon scattering for depopulation. Applied Physics Letters,
82(7), 1015-1017. https://doi.org/10.1063/1.1554479
[46] Kumar, S., Hu, Q., &amp; Reno, J. L. (2009). 186 K operation of terahertz quantum-cascade lasers based
on a diagonal design. Applied Physics Letters, 94(13).https://doi.org/10.1063/1.3114418
[47] Razavipour, S.G., Dupont, E., Chan, C.W.I., Xu, C., Wasilewski, Z.R., Laframboise, S.R., Hu, Q. &amp;
Ban, D. (2014). A high carrier injection terahertz quantum cascade laser based on indirectly pumped
scheme. Applied Physics Letters, 104(4). https://doi.org/10.1063/1.4862177
[48] Dupont, E., Fathololoumi, S., Wasilewski, Z.R., Aers, G., Laframboise, S.R., Lindskog, M., Razavipour,
S.G., Wacker, A., Ban, D. &amp; Liu, H.C. (2012). A phonon scattering assisted injection and extraction
based terahertz quantum cascade laser. Journal of Applied Physics, 111(7).https://doi.org/10.1063/1.
3702571
[49] Hempel, M., Röben, B., Niehle, M., Schrottke, L., Trampert, A., &amp; Grahn, H. T. (2017). Continuous
tuning of two-section, single-mode terahertz quantum-cascade lasers by fiber-coupled, near-infrared
illumination. AIP Advances, 7(5). https://doi.org/10.1063/1.4983030
[50] Wang, T., Liu, J. Q., Chen, J. Y., Liu, Y. H., Liu, F. Q., Wang, L. J., &amp; Wang, Z. G. (2013).
Continuouswave operation of terahertz quantum cascade lasers at 3.2 THz. Chinese Physics Letters, 30(6),
064201. https://doi.org/10.1088/0256-307X/30/6/064201
[51] Lü, X., Röben, B., Schrottke, L., Biermann, K., &amp; Grahn, H. T. (2021). Correlation between
frequency and location on the wafer for terahertz quantum-cascade lasers. Semiconductor Science
and Technology, 36(3), 035012. https://doi.org/10.1088/1361-6641/abdd4b
[52] Khabibullin, R.A., Shchavruk, N.V., Pavlov, A.Y., Klochkov, A.N., Glinskiy, I.A., Tomosh, K.N.,
Ponomarev, D.S., Cirlin, G.E. &amp; Zhukov, A.E. (2019). Design and fabrication of terahertz quantum
cascade laser with double metal waveguide based on multilayer GaAs/AlGaAs heterostructures.
In IOP Conference Series: Materials Science and Engineering (Vol. 475, No. 1, p. 012020). IOP
Publishing.https://doi.org/10.1088/1757-899X/475/1/012020
[53] Ushakov, D. V., Afonenko, A. A., Afonenko, A. A., Khabibullin, R. A., Fadeev, M. A., Gavrilenko, V.</p>
      <p>I., &amp; Dubinov, A. A. (2024). Feasibility of GaAs/AlGaAs quantum cascade laser operating above 6
THz. Journal of Applied Physics, 135(13). https://doi.org/10.1063/5.0198236
[54] Deutsch, C., Krall, M., Brandstetter, M., Detz, H., Andrews, A.M., Klang, P., Schrenk, W., Strasser,
G. &amp; Unterrainer, K. (2012). High performance InGaAs/GaAsSb terahertz quantum cascade lasers
operating up to 142 K. Applied Physics Letters, 101(21). https://doi.org/10.1063/1.4766915
[55] Brandstetter, M., Deutsch, C., Krall, M., Detz, H., MacFarland, D.C., Zederbauer, T., Andrews, A.M.,
Schrenk, W., Strasser, G. &amp; Unterrainer, K. (2013). High power terahertz quantum cascade lasers
with symmetric wafer bonded active regions. Applied Physics Letters, 103(17).https://doi.org/10.</p>
    </sec>
    <sec id="sec-9">
      <title>A. Appendix: Sample Generated Prompt</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Faist</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Capasso</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sivco</surname>
            ,
            <given-names>D. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hutchinson</surname>
            ,
            <given-names>A. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sirtori</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Cho</surname>
            ,
            <given-names>A. Y.</given-names>
          </string-name>
          (
          <year>1995</year>
          ).
          <article-title>Quantum cascade laser: a new optical source in the mid-infrared</article-title>
          .
          <source>Infrared Physics &amp; Technology</source>
          ,
          <volume>36</volume>
          (
          <issue>1</issue>
          ),
          <fpage>99</fpage>
          -
          <lpage>103</lpage>
          . https://doi.org/10.1016/
          <fpage>1350</fpage>
          -
          <lpage>4495</lpage>
          (
          <issue>94</issue>
          )
          <fpage>00058</fpage>
          -S
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Mittleman</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jacobsen</surname>
            ,
            <given-names>R. H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neelamani</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baraniuk</surname>
            ,
            <given-names>R. G.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Nuss</surname>
            ,
            <given-names>M. C.</given-names>
          </string-name>
          (
          <year>1998</year>
          ).
          <article-title>Gas sensing using terahertz time-domain spectroscopy</article-title>
          .
          <source>Applied Physics B: Lasers &amp; Optics</source>
          ,
          <volume>67</volume>
          (
          <issue>3</issue>
          ). https://doi.org/10.1007/s003400050520 1063/1.4826943
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [56]
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Y.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>J.Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>F.Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhai</surname>
            ,
            <given-names>S.Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhuo</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>L.J.</given-names>
          </string-name>
          , Liu,
          <string-name>
            <given-names>S.M.</given-names>
            &amp;
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <surname>Z.G.</surname>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>High power-eficiency terahertz quantum cascade laser</article-title>
          .
          <source>Chinese Physics B</source>
          ,
          <volume>25</volume>
          (
          <issue>8</issue>
          ), 084206. https://doi.org/10.1088/
          <fpage>1674</fpage>
          -1056/25/8/084206
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [57]
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jiang</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhan</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deng</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chu</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Duan</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>High-power terahertz quantum cascade lasers with ∼ 0.23 in continuous wave mode</article-title>
          .
          <source>Aip Advances</source>
          ,
          <volume>6</volume>
          (
          <issue>7</issue>
          ).https://doi.org/10.1063/1.4959195
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [58]
          <string-name>
            <surname>Brandstetter</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kainz</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zederbauer</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krall</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schönhuber</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Detz</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schrenk</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Andrews</surname>
            ,
            <given-names>A.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Strasser</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Unterrainer</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>InAs based terahertz quantum cascade lasers</article-title>
          .
          <source>Applied Physics Letters</source>
          ,
          <volume>108</volume>
          (
          <issue>1</issue>
          ).https://doi.org/10.1063/1.4939551
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [59]
          <string-name>
            <surname>Valmorra</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scalari</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ohtani</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beck</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Faist</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2015</year>
          ).
          <article-title>InGaAs/AlInGaAs THz quantum cascade lasers operating up to 195 K in strong magnetic field</article-title>
          .
          <source>New Journal of Physics</source>
          ,
          <volume>17</volume>
          (
          <issue>2</issue>
          ), 023050.https://doi.org/10.1088/
          <fpage>1367</fpage>
          -2630/17/2/023050
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [60]
          <string-name>
            <surname>Walther</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scalari</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Faist</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beere</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Ritchie</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>Low frequency terahertz quantum cascade laser operating from 1.6 to1</article-title>
          . 8THz.
          <source>Applied Physics Letters</source>
          ,
          <volume>89</volume>
          (
          <issue>23</issue>
          ). https://doi.org/10.1063/ 1.2404598
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [61]
          <string-name>
            <surname>Walther</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fischer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scalari</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Terazzi</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoyler</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Faist</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <source>Quantum cascade lasers operating from 1.2 to1. 6THz. Applied Physics Letters</source>
          ,
          <volume>91</volume>
          (
          <issue>13</issue>
          ).https://doi.org/10.1063/1.2793177
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [62]
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>B. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kumar</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Callebaut</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kohen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Reno</surname>
            ,
            <given-names>J. L.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>Resonant-phononassisted THz quantum-cascade lasers with metal-metal waveguides</article-title>
          .
          <source>Semiconductor science and technology</source>
          ,
          <volume>20</volume>
          (
          <issue>7</issue>
          ), S228.https://doi.org/10.1088/
          <fpage>0268</fpage>
          -1242/20/7/013
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [63]
          <string-name>
            <surname>Olariu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Senica</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Faist</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2024</year>
          ).
          <article-title>Single-mode, surface-emitting quantum cascade laser at</article-title>
          26 m.
          <source>Applied Physics Letters</source>
          ,
          <volume>124</volume>
          (
          <issue>4</issue>
          ).https://doi.org/10.1063/5.0176281
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [64]
          <string-name>
            <surname>Scalari</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Amanti</surname>
            ,
            <given-names>M. I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fischer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Terazzi</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Walther</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beck</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Faist</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2009</year>
          ).
          <article-title>Step well quantum cascade laser emitting at 3 THz</article-title>
          . Applied physics letters,
          <volume>94</volume>
          (
          <issue>4</issue>
          ).https://doi.org/10.1063/1. 3068496
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [65]
          <string-name>
            <surname>Schrottke</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lü</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rozas</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Biermann</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Grahn</surname>
            ,
            <given-names>H. T.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Terahertz GaAs/AlAs quantumcascade lasers</article-title>
          .
          <source>Applied Physics Letters</source>
          ,
          <volume>108</volume>
          (
          <issue>10</issue>
          ).https://doi.org/10.1063/1.4943657
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [66]
          <string-name>
            <surname>Fasching</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Benz</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Unterrainer</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zobl</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Andrews</surname>
            ,
            <given-names>A.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roch</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schrenk</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Strasser</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          (
          <year>2005</year>
          ).
          <article-title>Terahertz microcavity quantum-cascade lasers</article-title>
          .
          <source>Applied Physics Letters</source>
          ,
          <volume>87</volume>
          (
          <issue>21</issue>
          ).https: //doi.org/10.1063/1.2136222
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [67]
          <string-name>
            <surname>Ohtani</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beck</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scalari</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Faist</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Terahertz quantum cascade lasers based on quaternary AlInGaAs barriers</article-title>
          .
          <source>Applied Physics Letters</source>
          ,
          <volume>103</volume>
          (
          <issue>4</issue>
          ).https://doi.org/10.1063/1.4816352
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [68]
          <string-name>
            <surname>Deutsch</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Benz</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Detz</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Klang</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nobile</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Andrews</surname>
            ,
            <given-names>A.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schrenk</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kubis</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vogl</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Strasser</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Unterrainer</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <article-title>Terahertz quantum cascade lasers based on type II InGaAs</article-title>
          /GaAsSb/InP. Applied Physics Letters,
          <volume>97</volume>
          (
          <issue>26</issue>
          ).https://doi.org/10.1063/1.3532106
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [69]
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Freeman</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valavanis</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davies</surname>
            ,
            <given-names>A.G.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Linfield</surname>
            ,
            <given-names>E.H.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>Terahertz quantum cascade lasers with &gt;1 W output powers</article-title>
          .
          <source>Electronics letters</source>
          ,
          <volume>50</volume>
          (
          <issue>4</issue>
          ),
          <fpage>309</fpage>
          -
          <lpage>311</lpage>
          .https: //doi.org/10.1049/el.
          <year>2013</year>
          .4035
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [70]
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>B. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kumar</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qin</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Reno</surname>
            ,
            <given-names>J. L.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>Terahertz quantum cascade lasers with double-resonant-phonon depopulation</article-title>
          .
          <source>Applied physics letters</source>
          ,
          <volume>88</volume>
          (
          <issue>26</issue>
          ).https://doi.org/10.1063/ 1.2216112
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [71]
          <string-name>
            <surname>Adams</surname>
            ,
            <given-names>R.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vijayraghavan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Q.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fan</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Capasso</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khanna</surname>
            ,
            <given-names>S.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davies</surname>
            ,
            <given-names>A.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Linfield</surname>
            ,
            <given-names>E.H.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Belkin</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          (
          <year>2010</year>
          ).
          <source>GaAs/Al0</source>
          . 15Ga0.
          <article-title>85As terahertz quantum cascade lasers with double-phonon resonant depopulation operating up to 172</article-title>
          K. Applied Physics Letters,
          <volume>97</volume>
          (
          <issue>13</issue>
          ).https: //doi.org/10.1063/1.3496035
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [72]
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>B. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kumar</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Callebaut</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Reno</surname>
            ,
            <given-names>J. L.</given-names>
          </string-name>
          (
          <year>2003</year>
          ).
          <article-title>Terahertz quantum-cascade laser at  ≈ 100 m using metal waveguide for mode confinement</article-title>
          .
          <source>Applied Physics Letters</source>
          ,
          <volume>83</volume>
          (
          <issue>11</issue>
          ),
          <fpage>2124</fpage>
          -
          <lpage>2126</lpage>
          .https://doi.org/10.1063/1.1611642
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [73]
          <string-name>
            <surname>Luo</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laframboise</surname>
            ,
            <given-names>S. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wasilewski</surname>
            ,
            <given-names>Z. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aers</surname>
            ,
            <given-names>G. C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>H. C.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Cao</surname>
            ,
            <given-names>J. C.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>Terahertz quantum-cascade lasers based on a three-well active module</article-title>
          .
          <source>Applied physics letters</source>
          ,
          <volume>90</volume>
          (
          <issue>4</issue>
          ). https: //doi.org/10.1063/1.2437071
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [74]
          <string-name>
            <surname>Adams</surname>
            ,
            <given-names>R.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vizbaras</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jang</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grasse</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Katz</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boehm</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Amann</surname>
            ,
            <given-names>M.C.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Belkin</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Terahertz sources based on intracavity frequency mixing in mid-infrared quantum cascade lasers with passive nonlinear sections</article-title>
          .
          <source>Applied Physics Letters</source>
          ,
          <volume>98</volume>
          (
          <issue>15</issue>
          ).https://doi.org/10.1063/1. 3579260
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [75]
          <string-name>
            <surname>Bosco</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Franckié</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scalari</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beck</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wacker</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Faist</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2019</year>
          ).
          <article-title>Thermoelectrically cooled THz quantum cascade laser operating up to 210</article-title>
          K. Applied Physics Letters,
          <volume>115</volume>
          (
          <issue>1</issue>
          ).https: //doi.org/10.1063/1.5110305
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [77]
          <string-name>
            <given-names>Lander</given-names>
            <surname>Gower</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>Levy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Piperno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Addamane</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. J.</given-names>
            ,
            <surname>Reno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            , &amp;
            <surname>Albo</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          (
          <year>2023</year>
          ).
          <article-title>Twowell injector direct-phonon terahertz quantum cascade lasers</article-title>
          .
          <source>Applied Physics Letters</source>
          ,
          <volume>123</volume>
          (
          <issue>6</issue>
          ). https://doi.org/10.1063/5.0155250
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [78]
          <string-name>
            <surname>Vitiello</surname>
            ,
            <given-names>M. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scamarcio</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spagnolo</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dhillon</surname>
            ,
            <given-names>S. S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Sirtori</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>Terahertz quantum cascade lasers with large wall-plug eficiency</article-title>
          .
          <source>Applied Physics Letters</source>
          ,
          <volume>90</volume>
          (
          <issue>19</issue>
          ). https://doi.org/10. 1063/1.2737129
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [79]
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Armiento</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Lambrix</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2020</year>
          ,
          <article-title>November)</article-title>
          .
          <article-title>An ontology for the materials design domain</article-title>
          .
          <source>In International Semantic Web Conference</source>
          (pp.
          <fpage>212</fpage>
          -
          <lpage>227</lpage>
          ). Cham: Springer International Publishing.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [80]
          <string-name>
            <surname>Schilling-Wilhelmi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ríos-García</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shabih</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gil</surname>
            ,
            <given-names>M.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miret</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koch</surname>
            ,
            <given-names>C.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Márquez</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Jablonka</surname>
            ,
            <given-names>K.M.</given-names>
          </string-name>
          (
          <year>2025</year>
          ).
          <article-title>From Text to Insight: Large Language Models for Materials Science Data Extraction</article-title>
          .
          <source>Chem. Soc. Rev</source>
          .
          <volume>54</volume>
          ,
          <fpage>1125</fpage>
          -
          <lpage>1150</lpage>
          . https://doi.org/10.1039/D4CS00913D
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [81]
          <string-name>
            <surname>Sirin</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grau</surname>
            ,
            <given-names>B. C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kalyanpur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Katz</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          (
          <year>2007</year>
          ).
          <article-title>Pellet: A practical owl-dl reasoner</article-title>
          .
          <source>Journal of Web Semantics</source>
          ,
          <volume>5</volume>
          (
          <issue>2</issue>
          ),
          <fpage>51</fpage>
          -
          <lpage>53</lpage>
          . https://doi.org/10.1016/j.websem.
          <year>2007</year>
          .
          <volume>03</volume>
          .004
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [82]
          <string-name>
            <surname>Issa</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Adekunle</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hamdi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cherfi</surname>
            ,
            <given-names>S. S. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumontier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Zaveri</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2021</year>
          ).
          <article-title>Knowledge graph completeness: A systematic literature review</article-title>
          .
          <source>IEEE Access</source>
          ,
          <volume>9</volume>
          ,
          <fpage>31322</fpage>
          -
          <lpage>31339</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [83]
          <string-name>
            <surname>Pipino</surname>
            ,
            <given-names>L. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>Y. W.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>R. Y.</given-names>
          </string-name>
          (
          <year>2002</year>
          ).
          <article-title>Data quality assessment</article-title>
          .
          <source>Communications of the ACM</source>
          ,
          <volume>45</volume>
          (
          <issue>4</issue>
          ),
          <fpage>211</fpage>
          -
          <lpage>218</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [84]
          <string-name>
            <surname>Lajus</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Suchanek</surname>
            ,
            <given-names>F. M.</given-names>
          </string-name>
          (
          <year>2018</year>
          , April).
          <article-title>Are all people married? Determining obligatory attributes in knowledge bases</article-title>
          .
          <source>In Proceedings of the 2018 World Wide Web Conference</source>
          (pp.
          <fpage>1115</fpage>
          -
          <lpage>1124</lpage>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>