<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Scientific Data</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>LLM-retrieval based scientific knowledge grounding</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Gabriel K. Reder</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Carl Collins</string-name>
          <email>c.collins@gold.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Abbi Abdel Rehim</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Larisa Soldatova</string-name>
          <email>l.soldatova@gold.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ross D. King</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Chalmers University of Technology</institution>
          ,
          <addr-line>Gothenburg 412 96</addr-line>
          ,
          <country country="SE">Sweden</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Goldsmiths, University of London</institution>
          ,
          <addr-line>London SE14 6AD</addr-line>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>The Alan Turing Institute</institution>
          ,
          <addr-line>London NW1 2DB</addr-line>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University of Cambridge</institution>
          ,
          <addr-line>Cambridge, CB3 0AS</addr-line>
          ,
          <country country="UK">United Kingdom</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <volume>7</volume>
      <issue>2020</issue>
      <abstract>
        <p>The automated high-throughput laboratory ofers unprecedented potential for scientific discovery, yet efectively linking studies to existing knowledge remains a significant challenge. As the general body of scientific knowledge grows, so too does the burden of contextualizing a new experiment. While ontologies and databases serve as structured common repositories, their rigid schemas are often incompatible with the unstructured or semistructured formats of most laboratories. In this study we investigate the integration of large language models (LLMs) with ontology-based vector databases to anchor semi-structured scientific experiments into knowledge bases via automated retrieval. Our approach extracts scientific entities from unstructured experimental texts, and grounds them to relevant ontology terms. We automate knowledge grounding, which enhances the integration of unstructured experimental data into established formal scientific languages. We have tested our method on a diverse selection of experimental yeast biology papers focused on Saccharomyces cerevisiae, a foundational model system that has driven major discoveries in molecular and cellular biology, and observed strong pipeline performance. We argue that such a knowledge grounding approach is a critical component for the new wave of eficient artificial intelligence (AI) driven automated laboratories that integrate LLMs with high-throughput experimentation and data-driven discovery.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Artificial Intelligence</kwd>
        <kwd>Large Language Models</kwd>
        <kwd>Information Extraction for RKGs/SKGs</kwd>
        <kwd>Ontologies</kwd>
        <kwd>Knowledge Engineering</kwd>
        <kwd>Saccharomyces cerevisiae</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The rise of automation in the laboratory presents a unique opportunity to generate scientific findings
at an unprecedented rate; however, a key issue remains overlooked. Empirical science relies on the
integration of experiments, data, and discoveries with existing knowledge. Linking to background
knowledge is a major challenge for the experimentalist, one that grows larger as the scientific corpus
expands. As laboratory automation accelerates this expansion, the linking problem grows, risking the
creation of vast quantities of isolated observations stranded on knowledge islands.</p>
      <p>
        Take, for example, the most fundamental kernel of the experimental process: the hypothesis. In the
classical scientific method, an experiment is conducted to test a specific hypothesis, and knowledge is
generated by its subsequent confirmation or rejection [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. For knowledge to efectively spread within
the scientific community, hypotheses must be clearly and universally understood. Achieving this
requires the use of consistent vocabulary and terminology shared by other scientists. This consistency
enables the scientific community to fully comprehend, replicate, and validate experimental findings,
strengthening the reliability of the conclusions. If knowledge linking is successful, a single experiment
may contribute to the general body of scientific knowledge. This process becomes even more important
when laboratory automation is introduced. Unlike the human scientist, a laboratory robot has no
understanding of the material it handles in the service of a hypothesis. As such, the hypothesis, and all
sample metadata, must be suficiently descriptive and exact to understand the results from an automated
laboratory.
      </p>
      <p>Typically, hypotheses are written in natural language with constraints around key scientific concepts.
These constraints are implicit and can lead to confusion. For example, a scientist may contextualize their
hypothesis by using the word “yeast” when in fact they mean “Saccharomyces cerevisiae”. This choice
encodes the scope of the experiment’s findings and its implications for knowledge acquisition. These
choices, shaped by experiential understanding of terminology, can vary considerably among scientists,
types of publications, and research venues. Robust and systematic nomenclatures for scientific concepts,
materials, and actions are needed to avoid confusion. This codification of shared nomenclature takes
the form of knowledge bases.</p>
      <p>
        Knowledge bases, specifically ontologies and databases, have proven to be highly efective as terminal
repositories of community knowledge and observations, respectively. Public databases have been
quintessential in advancing the life sciences since their adoption across the research community [
        <xref ref-type="bibr" rid="ref2 ref3 ref4">2, 3, 4</xref>
        ].
The Protein Data Bank (PDB) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] is a prominent example of a successful knowledge repository. By
establishing a communal repository for standardized data, the PDB and its associated standards have
been invaluable to advancements in protein biology, from benchtop research to clinical applications,
notably by enabling the development of computational models for protein design [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Databases serve
as centralized repositories for findings from various laboratories, enforcing a standardized structure
across all entries. In this sense, scientific database structures are syntactical templates for how findings
should be communicated. Ontologies take this codification further by defining controlled vocabulary
terms (entities) and the formal relations between them. Ontologies and knowledge graphs provide
highly robust shared languages for information sharing [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ]. The knowledge structures (schemas) of
databases and ontologies are key to their communal utility. Such schemas embody a robust, widely
accepted community standard that facilitates efective communication of knowledge, enabling others to
understand and replicate findings.
      </p>
      <p>Formalization requires efort, and efective knowledge base utilization requires strict adherence
to their structures. Yet scientists mostly work with unstructured or unlinked formats such as text
documents, spreadsheets, notebooks, and raw data. Conforming these to knowledge base structures is
a laborious process, especially if performed manually. As such, experimenters usually communicate
results in a semi-structured format, the scientific paper, leaving some burden of linking to the reader.
The significant increase in experiment execution and data generation rates from automated laboratories
makes manual knowledge linking infeasible necessitating computational approaches.</p>
      <p>
        Generative artificial intelligence (AI) models, specifically large language models (LLMs), represent an
exciting opportunity to speed the adoption and implementation of automation for discovery science,
and their capabilities hint at promising abilities to aid in the knowledge linking process. They excel
at working with unstructured inputs and outputs; however, they do not natively interface well with
predefined knowledge schemas [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. They have, however, proven adept at interfacing with structured
databases and application programming interfaces (APIs) when these structures are included in LLM
prompts [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Further, LLMs have been shown to excel in question answering applications when the
scope of answer is explicitly specified in input prompts. An exciting avenue of LLM usage combining
database access with prompt scoping is retrieval augmented generation (RAG) [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. In this formulation,
the results of a database query-potentially initiated by an upstream LLM-are fed into a downstream
LLM prompt. Since the LLM is prompted to answer from the query results rather than its internal
knowledge, such an approach grounds answers to specific databases and limits hallucination.
      </p>
      <p>In this work, we investigate the use of a combination of LLMs, database retrieval, and ontologies to
ground unstructured scientific inputs to formal knowledge bases. Specifically, we follow a retrieval
augmented generation approach utilizing LLMs coupled to vector database stores of ontologies to
ground terms in hypotheses to ontology identifiers. LLMs were used to extract hypotheses from a
pool of carefully selected yeast research papers and extract scientific entities before grounding them
to ontology terms. We tested our pipeline on a diverse selection of papers spanning a wide range of
research domains within the model organism Saccharomyces cerevisiae and found that this approach
efectively automates the grounding of knowledge from unstructured scientific experimental data.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Methods</title>
      <sec id="sec-2-1">
        <title>2.1. Pipeline overview</title>
        <p>The automated pipeline developed and tested consists of three modules: (1) hypothesis extraction from
the full text of a paper (summarization), (2) stratified entity extraction from the summarized hypothesis
(extraction), and (3) grounding of entities to ontological knowledge bases (grounding). Summarization
consists of presentation of the entire paper text to a LLM model which is prompted to consider the text
and produce the main hypothesis tested by the paper. The extraction module takes an input text and
extracts single entities/concepts of interest according to user-defined schema. The grounding module
takes these entities and links them to the best found term in ontologies specified by the user to fall under
a given schema category. Crucially, the entity grounding utilizes the LLMs fuzzy reasoning capabilities
to make a decision based on scientific context.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Manual selection of yeast biology research papers and extraction of hypotheses</title>
        <p>
          A total of 15 recent research publications were manually selected to eclectically cover the field of yeast
(Saccharomyces cerevisiae) biology in an unbiased manner. To provide a framework for research paper
selection, we used the Gene Ontology (GO) [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] as a guide for selecting papers representative of a
broad range of scientific concepts. Papers were manually selected to partially or fully embody concepts
represented by direct children nodes of the root ‘Biological Process’ GO term. These ranged
from high level cellular processes such as cell division, reproduction, and homeostasis, to somewhat
more restricted phenomena including metabolism, gene regulation, and localized processes such as
chromatin reorganization, cellular component biogenesis and signaling. We excluded several GO terms
deemed to have no relevance to S. cerevisiae. Papers were also carefully selected to represent traditional
biological research domains including genomics, molecular biology and biochemistry, cellular biology,
systems biology, evolutionary biology and biotechnology. The 15 papers and their respective manually
annotated GO terms are shown in Table 1.
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Manual hypothesis summarization</title>
        <p>Three expert reviewers were assembled to read the papers and decide on consensus human extracted
hypotheses for comparison to the automated pipeline results. The 15 papers were divided into groups of
5 and each reviewer was assigned two of these groups for a total of 10 papers each. Each reviewer read
the assigned papers and extracted hypotheses for each of them independently of the other reviewers.
Reviewers were prompted to extract hypotheses that are clear, logically-sound, actionable, and reflective
of the paper’s aims. The three reviewers then compared individual results to produce a consensus
human hypothesis for each of the papers. The consensus hypotheses are shown in Table 2.</p>
      </sec>
      <sec id="sec-2-4">
        <title>2.4. Automated hypothesis summarization</title>
        <p>
          Papers were fed to the pipeline as PDF files and tokenized into LLM-compatible text inputs. Together
with the tokenized paper, the LLM was prompted to extract only the main hypothesis tested by the paper.
To provide a degree of consistency across extraction, the LLM was prompted to phrase the hypothesis
as a single sentence. Paper contents were tokenized and concatenated into a single annotated string
before injection into the LLM summarization prompt. Original page breaks and counts in the PDF
were maintained as annotation strings to give the LLM a sense of the paper’s physical structure. The
summary module flow was implemented using LangChain [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. PDF files were processed using the
PyMuPDF library [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. Output model specification and verification were implemented using Pydantic
[
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. The resulting extracted hypotheses are shown in Table 3.
        </p>
      </sec>
      <sec id="sec-2-5">
        <title>2.5. Entity extraction and ontology selection</title>
        <p>Input hypotheses were fed into the extraction module together with a user-defined knowledge extraction
schema of desired entity categories and annotated descriptions of each category. The module flow
was implemented using LangChain, and schemas were defined using Pydantic. Field descriptions in
the extraction model’s Pydantic class are fed to the LLM during the entity extraction task. Together
with the class-level docstring, these define the entity extraction task for the LLM. For example, the
class-level docstring “The entities extracted from a scientific hypothesis. The entities are divided into
diferent categories, these field lists MUST be mutually exclusive. An entity cannot be in more than
one list.” together with the category-level description “Any specific genes or proteins mentioned in the
hypothesis” were used to extract gene/protein entities. In principle, the module accepts any user-defined
Pydantic schema. The example schema and linked ontologies used in our testing is shown in Figure 1
including the prompt annotations used to define each category.</p>
      </sec>
      <sec id="sec-2-6">
        <title>2.6. Retrieval Augmented Generation (RAG) grounding of hypotheses</title>
        <p>
          Extracted entities from the hypotheses were grounded to ontologies in database vector store formats
using similarity search to term names from intermediate LLM-generated search terms. OWL [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] files
were downloaded for our chosen ontologies of interest and compiled to Faiss vector store databases
[
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. The following ontologies and versions were chosen for each schema category:
        </p>
        <sec id="sec-2-6-1">
          <title>Biological components:</title>
        </sec>
        <sec id="sec-2-6-2">
          <title>Genes/proteins: • Gene ontology (GO-plus, version 2024-09-08) [11] • Ascomycete phenotype ontology (APO, version 2024-09-18) [17] Taxa:</title>
        </sec>
        <sec id="sec-2-6-3">
          <title>Small molecules:</title>
          <p>
            • NCBI Taxon (NCBITaxon, version 2024-07-01) [
            <xref ref-type="bibr" rid="ref19">19</xref>
            ]
• Chemical Entities of Biological Interest (ChEBI, version 2024-07-27) [
            <xref ref-type="bibr" rid="ref20">20</xref>
            ]
          </p>
          <p>
            The langchain-rdf [
            <xref ref-type="bibr" rid="ref21">21</xref>
            ] library was used to parse OWL files into a Faiss-compatible format. Input
user YAML files were used to link extraction schema categories with specific vector databases. We note
that a single vector database associated with an extraction category may contain one or more stored
ontologies according to the user’s preferences. In our case, the ‘Biological components‘ category was
linked to a vector store database containing terms from the GO and APO ontologies. For each entity,
the following procedure was performed. The LLM was first prompted to look at the hypothesis and the
individual entity to decide on appropriate search terms. This term could be the same as the extracted
entity or a modified entity term given the context of the hypothesis. Both cases were observed as output
in our pipeline runs. For example, during grounding of paper 1 hypothesis terms, the LLM decided to
use the search term “protein synthesis rate” when grounding the term “protein production rate”. Finally,
the generated search term was used as input for a similarity search against the vector database linked to
the entity’s schema category. Searches were based on ontology term names and the top 5 results were
presented to the LLM in a subsequent prompt. Given the top 5 search hits, the input hypothesis, and
the original entity, the LLM was prompted to select the best search term. The entire body of extraction
and grounding results is shown in the supplementary information.
          </p>
        </sec>
      </sec>
      <sec id="sec-2-7">
        <title>2.7. Evaluation</title>
        <p>The panel of three scientists compared the LLM-extracted hypothesis to the consensus human
hypothesis for each of the 15 papers. Each reviewer individually assigned an evaluation score between
1 and 3 for the LLM extraction performance based on the following criteria:
3 = The LLM hypothesis accurately summarizes the paper’s aims and agrees with the human
consensus hypothesis
2 = The LLM hypothesis difers slightly from the human consensus hypothesis but captures some or all
of the paper’s aims
1 = The LLM hypothesis does not correctly summarize the paper’s aims at all</p>
        <p>For each paper, the individual scores were averaged across the three reviewers’ scores to produce an
average hypothesis evaluation score.</p>
        <p>The three scientists also produced an individual entity score and a grounding score for each entity
extracted from each of the hypotheses. The entity score evaluated the quality of the entity extraction,
specifically how reasonable the extracted entity is. Reasonable entities were sought to be self-contained,
understandable, coherent scientific concepts/units that should map to ontology terms. The grading
criteria followed a 1-3 scale where:
3 = the entity is reasonable in the right category (e.g. is extracted as a “Taxa” when it should
be).
2 = reasonable entity in the wrong category or semi-reasonable entity in the right category.
1 = nonsensical/overly complex entity.</p>
        <p>The grounding score was designed to evaluate the efectiveness of grounding based on the extracted
entity. In other words, in the context of the hypothesis, does the grounding do a good job of finding the
appropriate term in the ontology? The following 1-3 scale was used:
3 = The grounding term fits the entity very well in its context.
2 = The grounding term is related to the entity but not an exact fit.
1 = The grounding term is unrelated to the entity.</p>
        <p>Evaluation scores and summaries are shown in Figure 2. The individual reviewer scores can be found
in the supplementary information.</p>
      </sec>
      <sec id="sec-2-8">
        <title>2.8. Environment, models, and code availability</title>
        <p>
          Gpt-4o (version 2024-08-06) was used as the LLM model for all pipeline tasks. The pipeline can currently
use any OpenAI model or model available through Ollama [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]. All text embedding was performed
with the FlagEmbedding (bge-small-en-v1.5) embedding model [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ].
        </p>
        <p>Vector store creation was performed on an Amazon Web Services (AWS) EC2 r5.4xlarge instance
with 16 vCPUs and 128 GB memory. Currently, the pipeline requires substantial memory to create
vector stores involving large ontology files (hundreds of MB or more). Once created, vector stores can
be reused without rebuilding. The prebuilt vector stores used in this work’s tests are available in this
work’s Zenodo repository https://doi.org/10.5281/zenodo.14014577.</p>
        <p>The complete pipeline is available in the ragnosis GitHub repository https://github.com/gkreder/
ragnosis.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Results and Discussion</title>
      <p>In this work, we aimed to assess the capability of current LLM-retrieval systems to automatically
link unstructured scientific content to structured knowledge bases. We tested our pipeline on 15
representative research papers in Saccharomyces cerevisiae biology, evaluating its ability to extract
hypotheses, identify key entities, and accurately ground these entities within selected ontologies. Our
ifndings demonstrate that this approach shows immense promise, closely matching human performance
on critical tasks.</p>
      <sec id="sec-3-1">
        <title>3.1. Results</title>
        <p>The automated hypothesis extraction module generated concise and accurate single-sentence hypotheses
for each paper, aligning closely with the human consensus hypotheses (Table 2 and Table 3). Human
evaluation of the results on a basic scale of 1-3 are shown in Figure 2A. Further details on the automated
hypothesis extraction, human consensus extraction, and evaluation can be found in the Methods section.</p>
        <p>The entity extraction module was run on the hypotheses extracted using the automated pipeline. For
this test, entities were extracted into four schema categories: (1) biological components, (2)
genes/proteins, (3) taxa, and (4) small molecules. A total of 123 entities were extracted across all papers. On a
per-category basis the entity counts were the following: biological components: 73, genes/proteins: 27,
taxa: 14, small molecules: 9. Extracted entities were grounded using the pipeline with the compiled
ontology vector stores. The prompts used to guide extraction for each category, along with the ontologies
associated with them, can be found in Figure 1C. Human evaluation scores, ranging from 1 to 3 on basic
scales, were assigned by a three-person review panel for both entity extraction and grounding. These
average scores across reviewers are shown in Figure 2B, while per-reviewer distributions for each score
are presented in Figure 2C. Full extraction and grounding results can be found in the supplementary
information.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Discussion</title>
        <p>
          The hypothesis extraction demonstrated strong consistency between the automated outputs and the
consensus derived from human evaluators, mirroring broader findings that LLMs excel in summarization
tasks [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ]. Indeed, this task required no retrieval or grounding from the LLMs. Rather, it served as
a means of producing condensed unstructured scientific input to the downstream grounding task.
Nevertheless, the performance of the automated hypothesis extraction was impressive. Our primary
goal was to test entity extraction and knowledge grounding on unstructured input, and hypothesis
extraction efectively served this purpose.
        </p>
        <p>Analysis of the hypothesis extraction process highlighted several key issues. The most significant
is defining what constitutes a hypothesis. Our tests focused on extracting hypotheses from papers
describing completed scientific studies. Both in human extraction and LLM prompting, eforts were
made to avoid incorporating the study’s results into the hypotheses. However, this was somewhat
unavoidable, as a paper’s narrative is naturally influenced by the study’s findings. Papers describe studies
with a priori aims that range from entirely speculative to purely confirmational, and the mechanistic
granularity of hypotheses tested can vary dramatically. Our human and automated results reflect this
heterogeneity. At times, the LLM incorporated more of the study’s results into its hypotheses (as
seen in paper 7), while in other cases, the human evaluators included more mechanistic detail (as in
paper 4). A related issue concerns the question of novelty. Hypotheses inherently rely on existing
knowledge, with their novelty stemming from the ability to shed new light on that knowledge, rather
than simply rephrasing it. This subtlety may be more challenging for LLMs to capture. For example,
the LLM-extracted hypothesis from paper 9 appears to suggest that the novelty of the work centers
on the characterization of the Spt6 protein, when in reality, the authors aimed to shed light on the
Cdc73 subunit of Paf1. Both human and LLM hypotheses describe the proposed relationship between
the proteins, but the LLM hypothesis frames Spt6 as the focus of discovery. In brief, modern scientific
studies do not uniformly adhere to the classical scientific method. They include engineering applications
and untargeted screens in addition to classical experiments. Such heterogeneity must be taken into
consideration in follow up automation development.</p>
        <p>
          We found entity performance to be efective, regardless of the type of hypothesis used as input. Terms
tended to fit the categories and were generally well-scoped. Notably, the extraction process is flexible,
allowing the user to specify their desired input schema categories to guide identification of relevant
entities. This user-defined input schema for entity extraction has been successfully demonstrated in
previous work, such as in [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ], and our results further validate its efectiveness and viability. A key
consideration is that the flexibility of knowledge schema choice leads to variations in performance
depending on the input knowledge model. This is not a question of computational extraction performance,
but rather schema design.
        </p>
        <p>
          As shown in Figure 2, entity extraction performance varies significantly across categories. The
‘Biological components’ category exhibits the weakest performance, primarily due to its inherent
vagueness compared to more specific categories like ‘Taxa’, particularly in the context of our paper set
focused solely on yeast biology. This touches on the larger question of knowledge modeling in the life
sciences and beyond. The design and optimal use of descriptive and appropriate knowledge schema for
scientific domains is a fruitful field in itself [
          <xref ref-type="bibr" rid="ref26 ref27 ref28">26, 27, 28</xref>
          ]. This work does not aim to rigorously compare
schemas; rather, we emphasize that our retrieval approach facilitates the linking of unstructured scientific
texts to any knowledge schema. The efectiveness of this linking is inherently constrained by the quality
of the chosen schema. Entity extraction also sufers from the vague definition of an entity. A few times,
entire phrases were extracted as an entity, for example “spectrum of beneficial mutations”
from paper 3 or “cell wall protein mannosylation” from paper 12. Such compound entities
could likely be further decomposed into more modular knowledge units. Conversely, the pipeline
extracted overly specific entities at times, such as “serine 15” from paper 10. In some instances, the
grounding scheme successfully overcame these vague or overly specific extractions, as described below.
A straightforward approach would be to base the knowledge extraction schema on the knowledge
base itself. Ontologies efectively capture the hierarchical structure of encoded knowledge, and basing
extracted categories on ontology levels would undoubtedly enhance grounding performance. We believe
this approach will be especially well-suited for grounding applications on static knowledge bases. Often,
experiments will involve unexplored areas of knowledge and existing knowledge bases will provide an
incomplete structure of the scientist’s domain. Such cases will require user-defined knowledge schema
and perhaps iterated cycles of knowledge base modification from experimental results. We envision
automated closed-loop cycles of experimental grounding and knowledge base improvement involving
intermediate test schema.
        </p>
        <p>
          The most significant neurosymbolic performance enhancement in our pipeline comes from retrieval
augmented generation (RAG) grounding of extracted terms, yielding highly promising results. Previous
approaches have utilized LLMs for schema-aligned entity extraction, however they typically rely on
traditional deterministic methods for grounding within ontologies or knowledge bases. In contrast, our
method and concurrent new approaches [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ] integrate LLMs directly in the grounding process, marking
a shift toward more adaptable, context-sensitive knowledge grounding. This is achieved through RAG,
presenting the LLM with a choice of possible grounding terms and leaving the final choice to the LLM
given the input context. The grounding evaluation scores, shown in Figure 2, largely reflect that the
automated LLM-retrieval approach performs well on this task. The LLM’s fuzzy logic is crucial in
selecting among the database hits. A clear example of this occurred across multiple papers, where
the pipeline accurately grounded protein entities to their correct species designation within the PRO
ontology. For instance, when grounding the entity “Cln3”, it correctly selected “S000000038 CLN3
(yeast)” instead of alternative terms such like “protein CLN3 (mouse)”, “G1/S-specific
cyclin CLN3 (Candida albicans SC5314)”, or “protein CLN3 (human)”, all of which are
also present in the ontology. Such logic would be challenging to implement deterministically across
diferent use cases, but it leverages the strengths of LLMs, where they excel in handling complex,
context-dependent reasoning. As noted earlier, the grounding scheme efectively compensated for
poorly extracted entities by utilizing the hypothesis context. For instance, it grounded “cell wall
protein mannosylation” from paper 12 to “GO_0035268 protein mannosylation,” and
“serine 15” from paper 10 to “MOD_00696 phosphorylated residue.” Full details of the
extraction and grounding results are provided in the supplementary information.
        </p>
        <p>
          Further development will be undertaken to improve the performance of the extraction and grounding
modules. One notable improvement will come from basing candidate ontology term presentation to
the LLM on semantic search to more than the term name in the ontology. Knowledge base terms often
contain annotations, synonyms, and relations to other terms. Basing the vector store search on these
rather than purely term names will likely further improve the quality of the candidates for the LLM to
choose from. We also note that the choice of ontologies to search was somewhat arbitrary and based on
our test entity extraction schema. Some choices were straightforward, such as grounding entities in the
"Taxa" category to the NCBITaxon ontology. Choosing GO-Plus and APO for the “biological components”
category likely omitted other ontologies with potentially relevant terms. More comprehensive study of
the design of these elements warrants further investigation. Such development will build on prior and
ongoing eforts to robustly systematize and encode scientific knowledge, hypotheses, protocols, data,
and findings [
          <xref ref-type="bibr" rid="ref30 ref31 ref32 ref33">30, 31, 32, 33</xref>
          ]. Further expansion beyond hypotheses must also be explored. An especially
promising direction will be in grounding raw and processed data from experiments to knowledge base
terms. For example, sample identifiers in experimental data may be automatically contextualized to
other experiments through iterated rounds of context presentation and grounding. Such a scheme
would increase the insight generated from a single experiment and allow for richer meta-analysis.
        </p>
        <p>
          Based on our human evaluations, we found that the extraction and grounding pipeline performed
well for knowledge grounding and represents a promising direction for future research. Reflective of the
scientific process itself, the evaluators varied in their assessments but were in general agreement (Figure
2). Our tests were conducted on a small, representative set of papers spanning a broad range of topics
within yeast biology. However, there is no reason to believe that our approach would not be suitable
for larger-scale work that encompasses a broader range of scientific knowledge, inputs and outputs,
and applications - given appropriate resources. Beyond its use in mining public data, we envision
such a pipeline as a crucial component of a fully automated self-driving laboratory where AI agents
generate hypotheses, conduct experiments, and analyze data in iterative cycles of closed-loop discovery.
Approaches that integrate laboratory robotics with symbolic systems and knowledge bases have been
and continue to be developed [
          <xref ref-type="bibr" rid="ref34 ref35 ref36">34, 35, 36</xref>
          ]. However, experimental knowledge is often inherently
unstructured, making such neurosymbolic systems crucial for the future of automated laboratories,
as they leverage the strengths of both logical and subsymbolic generative approaches. By leveraging
these advanced systems, we are approaching a new phase in scientific research, where automation and
intelligent knowledge grounding can open valuable opportunities for discovery and innovation.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgments</title>
      <p>This work was supported by the Engineering and Physical Sciences Research Council [grant numbers
EP/X032418/1, EP/X033740/1] and the Wallenberg AI, Autonomous Systems and Software Program
(WASP) funded by the Alice Wallenberg Foundation.</p>
    </sec>
    <sec id="sec-5">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the author(s) used GPT-4o in order to: Grammar and spelling
check, Paraphrase and reword. After using these tool(s)/service(s), the author(s) reviewed and edited
the content as needed and take(s) full responsibility for the publication’s content.</p>
    </sec>
    <sec id="sec-6">
      <title>A. Tables</title>
      <p>Progression through the START Checkpoint of the cell cycle is dependent on an
increased production rate of Cln3 which does not scale relative to cell size increase.
In Saccharomyces cerevisiae, the Hsl1 protein kinase inhibits the activity of the
Pah1encoded phosphatidate phos-phatase (PAP) by phosphorylation at sites Ser-748 and
Ser-773 of the PAP protein, leading to a reduced conversion of phosphatidate (PA)
into diacyl-glycerol (DAG) and increase in the synthesis of membrane phospholipids.
Whole genome duplication (WGS) in Saccharomyces cerevisiae provides an
immediate fitness advantages via accumulation of structural variants enabling adaptive
strategies to environment, however this comes at a cost of slowing long-term
adaptability, for example via recessive mutations, aneuploidies, and copy-number variants.
Increased production of pentacyclic triterpenes in Saccharomyces cerevisiae can be
engineered through overexpression of mevalonate (MVA) pathway genes such as
ERG13 (HMGS) and HMG1 (HMGR), overexpression of lupeol synthase, deletion
of the negative regulator of the MVA pathway, ROX1, and repression of the sterol
synthesis pathway gene ERG7.</p>
      <p>Under hydrogen peroxide induced stress in Saccharomyces cerevisiae, the Opi1p
protein translocates to the nucleus to repress transcription of inositol upstream
activating sequence (UAS-INO)-containing genes for the cell to adapt its cell membrane
to reduce permeability to exogenous hydrogen peroxide.</p>
      <p>SPO73 acts in a pathway involving SPO71, VPS13 and SPO1 to regulate proper
prospore membrane elongation in Saccharomyces cerevisiae.</p>
      <p>The peroxisomal import membrane protein, Pex14p, is regulated by phosphorylation
in saccharomyces cerevisiae.</p>
      <p>The Saccharomyces cerevisiae transcription factor, Crf1p, activates the transcription
of the ribosomal biogenesis genes, UTP22 and HMO1, as a mechanism to fine-tune
responses to mTORC1 signaling.</p>
      <p>The Cdc73 subunit of the Paf1 complex in Saccharomyces cerevisiae interacts with
subunits of the Pol II elongation complex in an unknown manner.</p>
      <p>One or all of the kinases, Ymr291w/Tda1, PKA, Sch9, and Snf1 phosphorylate
hexokinase ScHxk2 at phosphorylation site serine 15 to activate transcription of
glucoserepressible genes under low external glucose levels in Saccharomyces cerevisiae
The number of Golgi cisternae produced and maintained in a Saccharomyces
cerevisiae is tightly regulated by the GPI-anchored protein sorting process in the
endoplasmic reticulum, particularly to the endoplasmic reticulum exit sites (ERESs).
In Saccharomyces cerevisiae, genes related to cell wall biosynthesis and maintenance,
are direct determinants of the budding lifespan of the cell, as measured by its
reproductive longevity or budding cycle capacity.</p>
      <p>In Saccharomyces cerevisiae, visible light afects the Yeast Respiratory Oscillator
(YRO) through impacting oxidative state, leading to negative efects on metabolism.
The heat shock response program in Saccharomyces cerevisiae involves changes to
the expression of genes encoding proteins with a wide range of functions in addition
to molecular chaperones.</p>
      <p>Under glucose starvation and carbon scarcity conditions in Saccharomyces cereivisae,
Acetyl-CoA driven histone acetylation is shifted to focus on up-regulation of
transcriptional programs focused on starvation survival and metabolism regulation, namely
gluconeogenis and fat metabolism.
Paper</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>K.</given-names>
            <surname>Popper</surname>
          </string-name>
          , The Logic of Scientific Discovery, Routledge Classics, 2nd ed ed., Taylor and Francis, Abingdon, Oxon,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>H. J.</given-names>
            <surname>Imker</surname>
          </string-name>
          , 25 Years of Molecular Biology Databases:
          <article-title>A Study of Proliferation, Impact, and</article-title>
          <string-name>
            <surname>Maintenance</surname>
          </string-name>
          ,
          <source>Frontiers in Research Metrics and Analytics</source>
          <volume>3</volume>
          (
          <year>2018</year>
          ). doi:
          <volume>10</volume>
          .3389/frma.
          <year>2018</year>
          .
          <volume>00018</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D. J.</given-names>
            <surname>Rigden</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X. M.</given-names>
            <surname>Fernández</surname>
          </string-name>
          ,
          <article-title>The 2024 Nucleic Acids Research database issue and the online molecular biology database collection</article-title>
          ,
          <source>Nucleic Acids Research</source>
          <volume>52</volume>
          (
          <year>2024</year>
          )
          <fpage>D1</fpage>
          -
          <lpage>D9</lpage>
          . doi:
          <volume>10</volume>
          .1093/ nar/gkad1173.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A. D.</given-names>
            <surname>Baxevanis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bateman</surname>
          </string-name>
          ,
          <source>The Importance of Biological Databases in Biological Discovery, Current Protocols in Bioinformatics</source>
          <volume>50</volume>
          (
          <year>2015</year>
          ). doi:
          <volume>10</volume>
          .1002/0471250953.bi0101s50.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S. K.</given-names>
            <surname>Burley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. M.</given-names>
            <surname>Berman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Duarte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Feng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. W.</given-names>
            <surname>Flatt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. P.</given-names>
            <surname>Hudson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Lowe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Peisach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. W.</given-names>
            <surname>Piehl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Rose</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sekharan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Shao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Vallat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Voigt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. D.</given-names>
            <surname>Westbrook</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Y.</given-names>
            <surname>Young</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Zardecki</surname>
          </string-name>
          ,
          <article-title>Protein Data Bank: A Comprehensive Review of 3D Structure Holdings and Worldwide Utilization by Researchers, Educators, and</article-title>
          <string-name>
            <surname>Students</surname>
          </string-name>
          ,
          <source>Biomolecules</source>
          <volume>12</volume>
          (
          <year>2022</year>
          )
          <article-title>1425</article-title>
          . doi:
          <volume>10</volume>
          .3390/biom12101425.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S. K.</given-names>
            <surname>Burley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bhikadiya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bittrich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Chao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. A.</given-names>
            <surname>Craig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. V.</given-names>
            <surname>Crichlow</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Dalenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Duarte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dutta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Fayazi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Feng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. W.</given-names>
            <surname>Flatt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ganesan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ghosh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. S.</given-names>
            <surname>Goodsell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. K.</given-names>
            <surname>Green</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Guranovic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Henry</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. P.</given-names>
            <surname>Hudson</surname>
          </string-name>
          , I. Khokhriakov,
          <string-name>
            <given-names>C. L.</given-names>
            <surname>Lawson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Lowe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Peisach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Persikova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. W.</given-names>
            <surname>Piehl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Rose</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Segura</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sekharan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Shao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Vallat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Voigt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Webb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. D.</given-names>
            <surname>Westbrook</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Whetstone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Y.</given-names>
            <surname>Young</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Zalevsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Zardecki</surname>
          </string-name>
          ,
          <article-title>RCSB Protein Data Bank (RCSB.org): Delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning</article-title>
          ,
          <source>Nucleic Acids Research</source>
          <volume>51</volume>
          (
          <year>2023</year>
          )
          <fpage>D488</fpage>
          -
          <lpage>D508</lpage>
          . doi:
          <volume>10</volume>
          .1093/nar/gkac1077.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R.</given-names>
            <surname>Hoehndorf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. N.</given-names>
            <surname>Schofield</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. V.</given-names>
            <surname>Gkoutos</surname>
          </string-name>
          ,
          <article-title>The role of ontologies in biological and biomedical research: A functional perspective</article-title>
          ,
          <source>Briefings in Bioinformatics</source>
          <volume>16</volume>
          (
          <year>2015</year>
          )
          <fpage>1069</fpage>
          -
          <lpage>1080</lpage>
          . doi:
          <volume>10</volume>
          .1093/ bib/bbv011.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>R.</given-names>
            <surname>Stevens</surname>
          </string-name>
          ,
          <article-title>Ontology-based knowledge representation for bioinformatics, Briefings in Bioinformatics 1 (</article-title>
          <year>2000</year>
          )
          <fpage>398</fpage>
          -
          <lpage>414</lpage>
          . doi:
          <volume>10</volume>
          .1093/bib/1.4.398.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T.</given-names>
            <surname>Schick</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. D.-Y. R. Dessì</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Raileanu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Lomeli</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Hambro</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Zettlemoyer</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Cancedda</surname>
          </string-name>
          , T. Scialom, Toolformer: Language Models Can Teach Themselves to Use Tools (????).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>S.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Xiong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Cui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yuan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Liu</surname>
          </string-name>
          , T.-W. Kuo,
          <string-name>
            <given-names>N.</given-names>
            <surname>Guan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Xue</surname>
          </string-name>
          , Retrieval-Augmented
          <source>Generation for Natural Language Processing: A Survey</source>
          ,
          <year>2024</year>
          . arXiv:
          <volume>2407</volume>
          .
          <fpage>13193</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <article-title>The Gene Ontology Consortium, The Gene Ontology Resource: 20 years and still GOing strong</article-title>
          ,
          <source>Nucleic Acids Research</source>
          <volume>47</volume>
          (
          <year>2019</year>
          )
          <fpage>D330</fpage>
          -
          <lpage>D338</lpage>
          . doi:
          <volume>10</volume>
          .1093/nar/gky1055.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>H.</given-names>
            <surname>Chase</surname>
          </string-name>
          , LangChain,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13] Pymupdf/PyMuPDF, PyMuPDF,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>S.</given-names>
            <surname>Colvin</surname>
          </string-name>
          , E. Jolibois,
          <string-name>
            <given-names>H.</given-names>
            <surname>Ramezani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Garcia</given-names>
            <surname>Badaracco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Dorsey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Montague</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Matveenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Trylesinski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Runkle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Hewitt</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . Hall, Pydantic,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>G.</given-names>
            <surname>Antoniou</surname>
          </string-name>
          ,
          <string-name>
            <surname>F. van Harmelen</surname>
          </string-name>
          ,
          <source>Web Ontology Language: OWL</source>
          , in: S. Staab, R. Studer (Eds.),
          <source>Handbook on Ontologies</source>
          , Springer, Berlin, Heidelberg,
          <year>2009</year>
          , pp.
          <fpage>91</fpage>
          -
          <lpage>110</lpage>
          . doi:
          <volume>10</volume>
          .1007/ 978-3-
          <fpage>540</fpage>
          -92673-
          <issue>3</issue>
          _
          <fpage>4</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>M.</given-names>
            <surname>Douze</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Guzhva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Deng</surname>
          </string-name>
          , J. Johnson, G. Szilvasy, P.-E. Mazaré,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lomeli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Hosseini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Jégou</surname>
          </string-name>
          , The Faiss library,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2401.08281. arXiv:
          <volume>2401</volume>
          .
          <fpage>08281</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>S. R.</given-names>
            <surname>Engel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Balakrishnan</surname>
          </string-name>
          , G. Binkley,
          <string-name>
            <given-names>K. R.</given-names>
            <surname>Christie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Costanzo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. S.</given-names>
            <surname>Dwight</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. G.</given-names>
            <surname>Fisk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. E.</given-names>
            <surname>Hirschman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. C.</given-names>
            <surname>Hitz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. L.</given-names>
            <surname>Hong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Krieger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Livstone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. R.</given-names>
            <surname>Miyasato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Nash</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Oughtred</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Park</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Skrzypek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Weng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. D.</given-names>
            <surname>Wong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Dolinski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Botstein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Cherry</surname>
          </string-name>
          ,
          <article-title>Saccharomyces Genome Database provides mutant phenotype data</article-title>
          ,
          <source>Nucleic Acids Research</source>
          <volume>38</volume>
          (
          <year>2010</year>
          )
          <fpage>D433</fpage>
          -
          <lpage>D436</lpage>
          . doi:
          <volume>10</volume>
          .1093/nar/gkp917.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>C.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. E.</given-names>
            <surname>Ross</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. E.</given-names>
            <surname>Cowart</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. N.</given-names>
            <surname>Arighi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. H.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. A.</given-names>
            <surname>Natale</surname>
          </string-name>
          ,
          <string-name>
            <surname>Protein</surname>
          </string-name>
          s41597-
          <fpage>020</fpage>
          -00679-9.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>E. W.</given-names>
            <surname>Sayers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. E.</given-names>
            <surname>Bolton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. R.</given-names>
            <surname>Brister</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Canese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Chan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. C.</given-names>
            <surname>Comeau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Farrell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Feldgarden</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. M.</given-names>
            <surname>Fine</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Funk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Hatcher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kannan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Kelly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Klimke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Landrum</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lathrop</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. L.</given-names>
            <surname>Madden</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Malheiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Marchler-Bauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. D.</given-names>
            <surname>Murphy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Phan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Pujar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. H.</given-names>
            <surname>Rangwala</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. A.</given-names>
            <surname>Schneider</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Tse</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ye</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. W.</given-names>
            <surname>Trawick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. D.</given-names>
            <surname>Pruitt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. T.</given-names>
            <surname>Sherry</surname>
          </string-name>
          ,
          <article-title>Database resources of the National Center for Biotechnology Information in</article-title>
          <year>2023</year>
          (????).
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>K.</given-names>
            <surname>Degtyarenko</surname>
          </string-name>
          , P. de Matos,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ennis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hastings</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zbinden</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>McNaught</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Alcántara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Darsow</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Guedj</surname>
          </string-name>
          , M. Ashburner,
          <article-title>ChEBI: A database and ontology for chemical entities of biological interest</article-title>
          ,
          <source>Nucleic Acids Research</source>
          <volume>36</volume>
          (
          <year>2008</year>
          )
          <fpage>D344</fpage>
          -
          <lpage>D350</lpage>
          . doi:
          <volume>10</volume>
          .1093/nar/gkm791.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>V.</given-names>
            <surname>Emonet</surname>
          </string-name>
          ,
          <string-name>
            <surname>LangChain</surname>
            <given-names>RDF</given-names>
          </string-name>
          ,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22] Ollama/ollama, Ollama,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>S.</given-names>
            <surname>Xiao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Muennighof</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-Y.</given-names>
            <surname>Nie</surname>
          </string-name>
          , C-Pack:
          <article-title>Packed Resources For General Chinese Embeddings</article-title>
          ,
          <source>in: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          , SIGIR '24,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2024</year>
          , pp.
          <fpage>641</fpage>
          -
          <lpage>649</lpage>
          . doi:
          <volume>10</volume>
          .1145/3626772.3657878.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Yi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          , et al.,
          <article-title>A survey on evaluation of large language models</article-title>
          ,
          <source>ACM transactions on intelligent systems and technology 15</source>
          (
          <year>2024</year>
          )
          <fpage>1</fpage>
          -
          <lpage>45</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>J. H.</given-names>
            <surname>Caufield</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Hegde</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Emonet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. L.</given-names>
            <surname>Harris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. P.</given-names>
            <surname>Joachimiak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Matentzoglu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Moxon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. T.</given-names>
            <surname>Reese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Haendel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. N.</given-names>
            <surname>Robinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Mungall</surname>
          </string-name>
          ,
          <article-title>Structured Prompt Interrogation and Recursive Extraction of Semantics (SPIRES): A method for populating knowledge bases using zero-shot learning</article-title>
          ,
          <source>Bioinformatics</source>
          <volume>40</volume>
          (
          <year>2024</year>
          )
          <article-title>btae104</article-title>
          . doi:
          <volume>10</volume>
          .1093/bioinformatics/btae104.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>M. E.</given-names>
            <surname>Aranguren</surname>
          </string-name>
          , E. Antezana,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kuiper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Stevens</surname>
          </string-name>
          ,
          <article-title>Ontology Design Patterns for bioontologies: A case study on the Cell Cycle Ontology, BMC Bioinformatics 9 (</article-title>
          <year>2008</year>
          )
          <article-title>S1</article-title>
          . doi:
          <volume>10</volume>
          .1186/
          <fpage>1471</fpage>
          -2105-9-S5-S1.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>S.</given-names>
            <surname>Tartir</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. B.</given-names>
            <surname>Arpinar</surname>
          </string-name>
          ,
          <article-title>Ontology Evaluation and Ranking using OntoQA</article-title>
          ,
          <source>in: International Conference on Semantic Computing (ICSC</source>
          <year>2007</year>
          ),
          <year>2007</year>
          , pp.
          <fpage>185</fpage>
          -
          <lpage>192</lpage>
          . doi:
          <volume>10</volume>
          .1109/ICSC.
          <year>2007</year>
          .
          <volume>19</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>R.</given-names>
            <surname>Jackson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Matentzoglu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Overton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Vita</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. P.</given-names>
            <surname>Balhof</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. L.</given-names>
            <surname>Buttigieg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Carbon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Courtot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. D.</given-names>
            <surname>Diehl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. M.</given-names>
            <surname>Dooley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. D.</given-names>
            <surname>Duncan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. L.</given-names>
            <surname>Harris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Haendel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. E.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. A.</given-names>
            <surname>Natale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Osumi-Sutherland</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ruttenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. M.</given-names>
            <surname>Schriml</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J. Stoeckert</given-names>
            <surname>Jr.</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. A.</given-names>
            <surname>Vasilevsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. L.</given-names>
            <surname>Walls</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Mungall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Peters</surname>
          </string-name>
          , OBO Foundry in
          <year>2021</year>
          :
          <article-title>Operationalizing open data principles to evaluate ontologies</article-title>
          ,
          <year>Database 2021</year>
          (
          <year>2021</year>
          )
          <article-title>baab069</article-title>
          . doi:
          <volume>10</volume>
          .1093/database/ baab069.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>D.</given-names>
            <surname>Shlyk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Groza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mesiti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Montanelli</surname>
          </string-name>
          , E. Cavalleri,
          <article-title>REAL: A Retrieval-Augmented Entity Linking Approach for Biomedical Concept Recognition</article-title>
          , in: D.
          <string-name>
            <surname>Demner-Fushman</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Ananiadou</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Miwa</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Roberts</surname>
          </string-name>
          , J. Tsujii (Eds.),
          <source>Proceedings of the 23rd Workshop on Biomedical Natural Language Processing</source>
          , Association for Computational Linguistics, Bangkok, Thailand,
          <year>2024</year>
          , pp.
          <fpage>380</fpage>
          -
          <lpage>389</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <year>2024</year>
          .bionlp-
          <volume>1</volume>
          .
          <fpage>29</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>L. N.</given-names>
            <surname>Soldatova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rzhetsky</surname>
          </string-name>
          , Representation of research hypotheses,
          <source>Journal of Biomedical Semantics</source>
          <volume>2</volume>
          (
          <year>2011</year>
          )
          <article-title>S9</article-title>
          . doi:
          <volume>10</volume>
          .1186/2041-1480-2-S2-S9.
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>L. N.</given-names>
            <surname>Soldatova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. D.</given-names>
            <surname>King</surname>
          </string-name>
          ,
          <article-title>An ontology of scientific experiments</article-title>
          ,
          <source>Journal of The Royal Society Interface</source>
          <volume>3</volume>
          (
          <year>2006</year>
          )
          <fpage>795</fpage>
          -
          <lpage>803</lpage>
          . doi:
          <volume>10</volume>
          .1098/rsif.
          <year>2006</year>
          .
          <volume>0134</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>S. J.</given-names>
            <surname>Chalk</surname>
          </string-name>
          ,
          <article-title>Scidata: a data model and ontology for semantic representation of scientific data</article-title>
          ,
          <source>Journal of cheminformatics 8</source>
          (
          <year>2016</year>
          )
          <fpage>1</fpage>
          -
          <lpage>24</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bandrowski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Brinkman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Brochhausen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. H.</given-names>
            <surname>Brush</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Bug</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. C.</given-names>
            <surname>Chibucos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Clancy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Courtot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Derom</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dumontier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Fan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Fostel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Fragoso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Gibson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gonzalez-Beltran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Haendel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Heiskanen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Hernandez-Boussard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jensen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. L.</given-names>
            <surname>Lister</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Lord</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Malone</surname>
          </string-name>
          , E. Manduchi,
          <string-name>
            <given-names>M.</given-names>
            <surname>McGee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Morrison</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Overton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Parkinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Peters</surname>
          </string-name>
          , P. RoccaSerra,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ruttenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.-A.</given-names>
            <surname>Sansone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. H.</given-names>
            <surname>Scheuermann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Schober</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. N.</given-names>
            <surname>Soldatova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J. S.</given-names>
            <surname>Jr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. F.</given-names>
            <surname>Taylor</surname>
          </string-name>
          , C. Torniai,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Turner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Vita</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. L.</given-names>
            <surname>Whetzel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <article-title>The Ontology for Biomedical Investigations</article-title>
          ,
          <source>PLOS ONE 11</source>
          (
          <year>2016</year>
          )
          <article-title>e0154556</article-title>
          . doi:
          <volume>10</volume>
          .1371/journal.pone.
          <volume>0154556</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>R. D.</given-names>
            <surname>King</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. E.</given-names>
            <surname>Whelan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. M.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. G. K.</given-names>
            <surname>Reiser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. H.</given-names>
            <surname>Bryant</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. H.</given-names>
            <surname>Muggleton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. B.</given-names>
            <surname>Kell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. G.</given-names>
            <surname>Oliver</surname>
          </string-name>
          ,
          <article-title>Functional genomic hypothesis generation and experimentation by a robot scientist</article-title>
          ,
          <source>Nature</source>
          <volume>427</volume>
          (
          <year>2004</year>
          )
          <fpage>247</fpage>
          -
          <lpage>252</lpage>
          . doi:
          <volume>10</volume>
          .1038/nature02236.
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>A.</given-names>
            <surname>Coutant</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Roper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Trejo-Banos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Bouthinon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Carpenter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Grzebyta</surname>
          </string-name>
          , G. Santini,
          <string-name>
            <given-names>H.</given-names>
            <surname>Soldano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Elati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ramon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Rouveirol</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. N.</given-names>
            <surname>Soldatova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. D.</given-names>
            <surname>King</surname>
          </string-name>
          ,
          <article-title>Closed-loop cycles of experiment design, execution, and learning accelerate systems biology model development in yeast</article-title>
          ,
          <source>Proceedings of the National Academy of Sciences</source>
          <volume>116</volume>
          (
          <year>2019</year>
          )
          <fpage>18142</fpage>
          -
          <lpage>18147</lpage>
          . doi:
          <volume>10</volume>
          .1073/pnas.1900548116.
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>G. K.</given-names>
            <surname>Reder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. H.</given-names>
            <surname>Gower</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Kronström</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Halle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Mahamuni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Patel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Hayatnagarkar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. N.</given-names>
            <surname>Soldatova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. D.</given-names>
            <surname>King</surname>
          </string-name>
          ,
          <string-name>
            <surname>Genesis-DB</surname>
          </string-name>
          :
          <article-title>A database for autonomous laboratory systems</article-title>
          ,
          <source>Bioinformatics Advances</source>
          <volume>3</volume>
          (
          <year>2023</year>
          )
          <article-title>vbad102</article-title>
          . doi:
          <volume>10</volume>
          .1093/bioadv/vbad102.
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>A.</given-names>
            <surname>Litsios</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. H. E. W.</given-names>
            <surname>Huberts</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. M.</given-names>
            <surname>Terpstra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Guerra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Buczak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Papagiannakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rovetta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hekelaar</surname>
          </string-name>
          , G. Hubmann,
          <string-name>
            <given-names>M.</given-names>
            <surname>Exterkate</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Milias-Argeitis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Heinemann</surname>
          </string-name>
          ,
          <article-title>Diferential scaling between G1 protein production and cell size dynamics promotes commitment to the cell division cycle in budding yeast</article-title>
          ,
          <source>Nature Cell Biology</source>
          <volume>21</volume>
          (
          <year>2019</year>
          )
          <fpage>1382</fpage>
          -
          <lpage>1392</lpage>
          . doi:
          <volume>10</volume>
          .1038/ s41556-019-0413-3.
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>S.</given-names>
            <surname>Khondker</surname>
          </string-name>
          , G.-S. Han,
          <string-name>
            <given-names>G. M.</given-names>
            <surname>Carman</surname>
          </string-name>
          ,
          <article-title>Protein kinase Hsl1 phosphorylates Pah1 to inhibit phosphatidate phosphatase activity and regulate lipid synthesis in Saccharomyces cerevisiae</article-title>
          ,
          <source>Journal of Biological Chemistry</source>
          <volume>300</volume>
          (
          <year>2024</year>
          )
          <article-title>107572</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.jbc.
          <year>2024</year>
          .
          <volume>107572</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <surname>K. J. Fisher</surname>
            ,
            <given-names>S. W.</given-names>
          </string-name>
          <string-name>
            <surname>Buskirk</surname>
            ,
            <given-names>R. C.</given-names>
          </string-name>
          <string-name>
            <surname>Vignogna</surname>
            ,
            <given-names>D. A.</given-names>
          </string-name>
          <string-name>
            <surname>Marad</surname>
            ,
            <given-names>G. I. Lang</given-names>
          </string-name>
          ,
          <article-title>Adaptive genome duplication afects patterns of molecular evolution in Saccharomyces cerevisiae</article-title>
          ,
          <source>PLOS Genetics 14</source>
          (
          <year>2018</year>
          )
          <article-title>e1007396</article-title>
          . doi:
          <volume>10</volume>
          .1371/journal.pgen.
          <volume>1007396</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>J. N.</given-names>
            <surname>Bröker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Müller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. Van</given-names>
            <surname>Deenen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Prüfer</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          <article-title>Schulze Gronover, Upregulating the mevalonate pathway and repressing sterol synthesis in Saccharomyces cerevisiae enhances the production of triterpenes</article-title>
          ,
          <source>Applied Microbiology and Biotechnology</source>
          <volume>102</volume>
          (
          <year>2018</year>
          )
          <fpage>6923</fpage>
          -
          <lpage>6934</lpage>
          . doi:
          <volume>10</volume>
          .1007/ s00253-018-9154-7.
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>C.</given-names>
            <surname>Camelo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Vilas-Boas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. P.</given-names>
            <surname>Cepeda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Real</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Barros-Martins</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Pinto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Soares</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. S.</given-names>
            <surname>Marinho</surname>
          </string-name>
          , L. Cyrne,
          <article-title>Opi1p translocation to the nucleus is regulated by hydrogen peroxide in Saccharomyces cerevisiae: Opi1p regulation by hydrogen peroxide in Saccharomyces cerevisiae</article-title>
          ,
          <source>Yeast</source>
          <volume>34</volume>
          (
          <year>2017</year>
          )
          <fpage>383</fpage>
          -
          <lpage>395</lpage>
          . doi:
          <volume>10</volume>
          .1002/yea.3240.
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [42]
          <string-name>
            <given-names>E. M.</given-names>
            <surname>Parodi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Roesner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. S.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <article-title>SPO73 and SPO71 Function Cooperatively in Prospore Membrane Elongation During Sporulation in Saccharomyces cerevisiae</article-title>
          ,
          <source>PLOS ONE 10</source>
          (
          <year>2015</year>
          )
          <article-title>e0143571</article-title>
          . doi:
          <volume>10</volume>
          .1371/journal.pone.
          <volume>0143571</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [43]
          <string-name>
            <given-names>A.</given-names>
            <surname>Schummer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Maier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gabay-Maskit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Hansen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. W. D.</given-names>
            <surname>Mühlhäuser</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Suppanz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Fadel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Schuldiner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Girzalsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Oeljeklaus</surname>
          </string-name>
          , E. Zalckvar,
          <string-name>
            <given-names>R.</given-names>
            <surname>Erdmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Warscheid</surname>
          </string-name>
          ,
          <article-title>Pex14p Phosphorylation Modulates Import of Citrate Synthase 2 Into Peroxisomes in Saccharomyces cerevisiae</article-title>
          ,
          <source>Frontiers in Cell and Developmental Biology</source>
          <volume>8</volume>
          (
          <year>2020</year>
          )
          <article-title>549451</article-title>
          . doi:
          <volume>10</volume>
          .3389/fcell.
          <year>2020</year>
          .
          <volume>549451</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>S.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mashkoor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Balamurugan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Grove</surname>
          </string-name>
          ,
          <article-title>Yeast Crf1p is an activator with diferent roles in regulation of target genes</article-title>
          ,
          <source>Yeast</source>
          <volume>41</volume>
          (
          <year>2024</year>
          )
          <fpage>379</fpage>
          -
          <lpage>400</lpage>
          . doi:
          <volume>10</volume>
          .1002/yea.3939.
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          [45]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Ellison</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Namjilsuren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. K.</given-names>
            <surname>Shirra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Blacksmith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Schustef</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. M.</given-names>
            <surname>Kerr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Fang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Xiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Shi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. M.</given-names>
            <surname>Arndt</surname>
          </string-name>
          ,
          <article-title>Spt6 directly interacts with Cdc73 and is required for Paf1 complex occupancy at active genes in Saccharomyces cerevisiae</article-title>
          ,
          <source>Nucleic Acids Research</source>
          <volume>51</volume>
          (
          <year>2023</year>
          )
          <fpage>4814</fpage>
          -
          <lpage>4830</lpage>
          . doi:
          <volume>10</volume>
          .1093/nar/gkad180.
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          [46]
          <string-name>
            <given-names>S.</given-names>
            <surname>Kaps</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Kettner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Migotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Kanashova</surname>
          </string-name>
          , U. Krause, G. Rödel, G. Dittmar,
          <string-name>
            <given-names>T. M.</given-names>
            <surname>Kriegel</surname>
          </string-name>
          ,
          <article-title>Protein Kinase Ymr291w/Tda1 Is Essential for Glucose Signaling in Saccharomyces cerevisiae on the Level of Hexokinase Isoenzyme ScHxk2 Phosphorylation*</article-title>
          ,
          <source>Journal of Biological Chemistry</source>
          <volume>290</volume>
          (
          <year>2015</year>
          )
          <fpage>6243</fpage>
          -
          <lpage>6255</lpage>
          . doi:
          <volume>10</volume>
          .1074/jbc.M114.595074.
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          [47]
          <string-name>
            <given-names>S.</given-names>
            <surname>Sasaki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Schlarmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hanaoka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Nishii</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Moriya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Muñiz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Funato</surname>
          </string-name>
          ,
          <article-title>Protein sorting upon exit from the endoplasmic reticulum dominates Golgi biogenesis in budding yeast</article-title>
          ,
          <source>FEBS Letters 598</source>
          (
          <year>2024</year>
          )
          <fpage>548</fpage>
          -
          <lpage>555</lpage>
          . doi:
          <volume>10</volume>
          .1002/1873-
          <fpage>3468</fpage>
          .
          <fpage>14830</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          [48]
          <string-name>
            <given-names>M.</given-names>
            <surname>Molon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Woznicka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zebrowski</surname>
          </string-name>
          ,
          <article-title>Cell wall biosynthesis impairment afects the budding lifespan of the Saccharomyces cerevisiae yeast</article-title>
          ,
          <source>Biogerontology</source>
          <volume>19</volume>
          (
          <year>2018</year>
          )
          <fpage>67</fpage>
          -
          <lpage>79</lpage>
          . doi:
          <volume>10</volume>
          .1007/ s10522-017-9740-6.
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          [49]
          <string-name>
            <surname>J. B. Robertson</surname>
            ,
            <given-names>C. R.</given-names>
          </string-name>
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>C. H.</given-names>
          </string-name>
          <string-name>
            <surname>Johnson</surname>
          </string-name>
          ,
          <article-title>Visible light alters yeast metabolic rhythms by inhibiting respiration</article-title>
          ,
          <source>Proceedings of the National Academy of Sciences</source>
          <volume>110</volume>
          (
          <year>2013</year>
          )
          <fpage>21130</fpage>
          -
          <lpage>21135</lpage>
          . doi:
          <volume>10</volume>
          .1073/pnas.1313369110.
        </mixed-citation>
      </ref>
      <ref id="ref50">
        <mixed-citation>
          [50]
          <string-name>
            <given-names>M.</given-names>
            <surname>Mühlhofer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Berchtold</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. G.</given-names>
            <surname>Stratil</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Csaba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Kunold</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. C.</given-names>
            <surname>Bach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. A.</given-names>
            <surname>Sieber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Haslbeck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Zimmer</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. Buchner,</surname>
          </string-name>
          <article-title>The Heat Shock Response in Yeast Maintains Protein Homeostasis by Chaperoning and Replenishing Proteins</article-title>
          ,
          <source>Cell Reports</source>
          <volume>29</volume>
          (
          <year>2019</year>
          )
          <fpage>4593</fpage>
          -
          <lpage>4607</lpage>
          .
          <year>e8</year>
          . doi:
          <volume>10</volume>
          .1016/j. celrep.
          <year>2019</year>
          .
          <volume>11</volume>
          .109.
        </mixed-citation>
      </ref>
      <ref id="ref51">
        <mixed-citation>
          [51]
          <string-name>
            <surname>W.-C. Hsieh</surname>
            ,
            <given-names>B. M.</given-names>
          </string-name>
          <string-name>
            <surname>Sutter</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Ruess</surname>
            ,
            <given-names>S. D.</given-names>
          </string-name>
          <string-name>
            <surname>Barnes</surname>
            ,
            <given-names>V. S.</given-names>
          </string-name>
          <string-name>
            <surname>Malladi</surname>
            ,
            <given-names>B. P.</given-names>
          </string-name>
          <string-name>
            <surname>Tu</surname>
          </string-name>
          ,
          <article-title>Glucose starvation induces a switch in the histone acetylome for activation of gluconeogenic and fat metabolism genes</article-title>
          ,
          <source>Molecular Cell</source>
          <volume>82</volume>
          (
          <year>2022</year>
          )
          <fpage>60</fpage>
          -
          <lpage>74</lpage>
          .
          <year>e5</year>
          . doi:
          <volume>10</volume>
          .1016/j.molcel.
          <year>2021</year>
          .
          <volume>12</volume>
          .015.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>