<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Evidence to Insights: GraphRAG as a Dynamic Knowledge Layer for the Collaboration for Environmental Evidence's Database of Evidence Reviews</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Christoph Stade</string-name>
          <email>christoph-stade@outlook.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jodi Schneider</string-name>
          <email>jschneider@pobox.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yuanxi Fu</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Harvard Radclife Institute for Advanced Study</institution>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Ludwig-Maximilians-Universität München</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Illinois Urbana-Champaign</institution>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Environmental evidence synthesis plays a crucial role in policy-making, yet policy-makers often struggle to situate individual insights into a broader context of research findings. The Collaboration for Environmental Evidence's Database for Evidence Reviews (CEEDER) provides a collection of evidence, but it remains dificult to navigate. Large language models help to overcome this challenge, doing well at summarization, but often miss critical relational contexts. We propose basing the language model on a knowledge graph that grows with the evidence through the use of Microsoft's GraphRAG. As a dynamic knowledge layer this approach eficiently connects, retrieves, and summarizes relevant research. Unlike traditional manually curated knowledge graphs, GraphRAG dynamically links entities without predefined schemas, adapting to the evolving landscape of environmental science. By enhancing domain-specific query-focused summarization, GraphRAG has the potential to increase data accessibility in settings such as CEEDER where conventional knowledge graphs are not feasible.</p>
      </abstract>
      <kwd-group>
        <kwd>sustainability</kwd>
        <kwd>environmental science</kwd>
        <kwd>evidence synthesis</kwd>
        <kwd>generative artificial intelligence</kwd>
        <kwd>GraphRAG</kwd>
        <kwd>knowledge graphs</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Environmental policy-making depends on the synthesis of research, yet extracting meaningful insights
from its vast scientific literature is challenging. The sustainability literature has been doubling every
8.3 years and encompasses disciplines including biology, chemistry, engineering, health sciences, and
social sciences [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The Collaboration for Environmental Evidence’s Database for Evidence Reviews1
(CEEDER) curates nearly 2,000 systematically assessed reviews to support decision-makers [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], but
even these structured resources require manual efort to navigate and interpret.
      </p>
      <p>
        Traditional search methods like online search or CEEDER’s own search string function ofer only
keyword-based retrieval. They don’t provide direct answers and don’t connect related findings across
diferent reports. Recent advances in large language models (LLMs) have enabled automated text
processing at scale. However, without structured retrieval, LLMs struggle to map conceptual relationships,
often overlooking the critical connections necessary for efective decision-making [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Knowledge graphs (KGs) ofer a promising approach to addressing these challenges by structuring and
linking data. In conjunction with LLMs they can make complex webs of evidence navigable, improving
discoverability and supporting reasoning over data. However, creating high-quality KGs requires
significant involvement from domain experts, which becomes prohibitively expensive at scale [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        Introduced by Microsoft in 2024, GraphRAG 2 is a novel evolution of the retrieval-augmented
generation (RAG) technique, combining automated knowledge graph construction and query-focused
summarization to support human sensemaking over entire text corpora [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The approach retrieves
CEUR
      </p>
      <p>
        ceur-ws.org
contextually relevant information using relationship-based similarity, consequently generating more
relevant answers than traditional RAG methods that rely solely on vector similarity of embedded text
fragments [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Additionally, while RAG is typically used to generate answers based on general
knowledge, in this work we begin to see that GraphRAG could excel at connecting across domain-constrained
documents with an explicit structure, such as scientific papers and especially evidence reviews. Note
that the underlying entity knowledge graph is diferent from conventional knowledge graphs in that it
does not incorporate structured ontologies, which poses a tradeof we will discuss in section 3.2.
      </p>
      <p>We propose using Microsoft GraphRAG as a dynamic knowledge layer on CEEDER and identify the
following three advantages for policy-makers to gain with this approach. Using GraphRAG:
1. Provides provenance links for generated knowledge claims to original text chunks in the dataset
2. Encourages proactive knowledge discovery by making evidence more accessible
3. Continuously integrates the newest scientific documents</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        Utilizing graph structures for query-focused summarization was previously suggested by Park and
Ko [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Their approach uses a graph attention network and a personalized PageRank algorithm to
strengthen the relationship between query nodes and document content. Our approach using GraphRAG
is advantageous because it can leverage cross-document relationships, integrating multiple pieces of
evidence that collectively represent a domain in one unified graph, rather than summarizing one
document at a time.
      </p>
      <p>
        Barron et al. explored how KG-assisted RAG can enhance language model performance in highly
domain specific settings [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], improving structure and relevance at the cost of significant pre-processing
and manual curation. Their integration of KGs with vector stores emphasizes structured ontologies
which are explicitly built by human experts-in-the-loop using curated corpora. Although the need
for human experts is minimized for scalablility, it can still be a barrier when resources are limited.
In contrast, by using GraphRAG to dynamically link entities, our approach enables retrieval while
completely eliminating the need for predefined schemas.
      </p>
      <p>
        Within the sustainability domain, Gupta et al. designed a KG-assisted RAG approach for
questionanswering about Environmental, Social, and Governance news articles [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Their approach uses Rapid
Automatic Keyword Extraction (RAKE) for unsupervised keyword extraction from sentences in a news
corpus, then creates embeddings of extracted key phrases with phrase-BERT that get saved as entities
in a KG. At query time entities are retrieved based on their cosine similarity to the query string. Their
KG aims to provide an LLM with factual data about companies, events, and dates. Yet it is limited
because the retrieval is based on individual node scoring. However, our envisioned application to
policymaking requires abstraction beyond localized facts to answer questions spanning across sub-domains
within climate research, not attempted by their approach. GraphRAG is more suitable since it supports
high-level, global queries and identifying overarching themes by creating community summaries and
combining partial answers.
      </p>
      <p>
        Our approach integrates a Python package actively maintained by Microsoft 3. This ensures a readily
usable command-line interface deployment, a convenience lacking in similar approaches [
        <xref ref-type="bibr" rid="ref6 ref7 ref8">7, 6, 8</xref>
        ].
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. A Dynamic Knowledge Layer for CEEDER</title>
      <p>The CEEDER database’s rigorous evidence synthesis methodology creates unique opportunities for
graph-based knowledge discovery: Each systematic review’s structured review question provides natural
anchor points for graph construction. Per individual evidence review, CEEDER’s curators assign a
review question following the PECO/PICO (Population, Exposure/Intervention, Comparison, Outcome)
question format. The review questions serve as a top-level representation of evidence documents
3https://github.com/microsoft/graphrag
throughout, describing their contents in the most condensed form, even more so than the review’s
abstract. We make every review question part of the context fragments input to GraphRAG’s indexing.
In practice, review questions can guide query optimization as pre-formulated prompts for policy-makers
to use as an entry point to a document or topic.</p>
      <sec id="sec-3-1">
        <title>3.1. Methods and Results</title>
        <p>
          First, we parsed the database dump CSV file (retrieved from CEEDER on August 2, 2024) to extract
the Title, Review Question and Abstract columns for each review. To limit the amount of data to process,
we restricted ourselves to the 356 reviews in the climate change collection by applying a CEEDER
search filter, which was further reduced after removing two duplicate entries. The reviews were
formatted as individual .txt files with three lines per extracted column entry and moved to GraphRAG’s
input directory. Next we ran the GraphRAG index pipeline to create the entity knowledge graph
following the documentation5. Our configuration difers in pointing to local instances running on
ollama 0.6.06 rather than remote OpenAI models, removing dependency on proprietary APIs, cutting
cost, and ensuring privacy. For the LLM we chose mistral-nemo with 12B parameters7 for its strong
performance-to-size ratio. Preliminary testing with stronger models yield more populated and denser
graphs, suggesting potential for improved performance. The embedding model was nomic-embed-text8
with high general-purpose embedding quality. Table 1 shows an example global query against all
ingested abstracts and a sample review question as user input. Compared to GraphRAG’s local search,
global search is a resource-intensive method, but it often gives accurate responses for questions that
require an understanding of the dataset as a whole. Figure 1 shows the resulting graph visualized using
Gephi network analysis software [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] and the GraphRAG visualization guide9, limited to clusters with
5https://microsoft.github.io/graphrag/get_started/
6https://ollama.com/
7https://ollama.com/library/mistral-nemo
8https://ollama.com/library/nomic-embed-text
9https://microsoft.github.io/graphrag/visualization_guide/
more than one node and rearranged for improved readability. The indexing process extracted 90,230
entities and 70,895 relationships aggregated to 608 nodes and 250 edges through community-based
summarization and pruning (see Figure 1).
        </p>
        <p>Immediate limitations are visible, like the dark green colored cluster to the right with the meaningless
artifact “Review question”, used to structure the input files. Further, the graph is sparse. There are
many clusters with a single concept node (not shown in Figure 1), likely because of the small input size
of 354 abstracts with 13.97 sentences on average or because of the limited complexity of the language
model used to process the abstracts.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. GraphRAG Compared to Conventional Knowledge Graphs</title>
        <p>
          The entity knowledge graph (EKG) produced by GraphRAG’s indexing pipeline does not conform
to any single established model of a conventional knowledge graph [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. Nodes have default types
organization, person, geo, and event - an LLM prompt can be used to generate alternative types
and are a factor for retrieval. Edges have a weight that counts their normalized occurrences. Both
nodes and edges have one or more free-form description strings with arbitrarily rich textual metadata.
Beyond the standard attributes of the EKG’s GraphML-serialization it is missing a schema resembling
an ontology.
        </p>
        <p>In contrast to conventional knowledge graphs, the EKG’s descriptions are not exclusively meant for
human consumption but treated as the single source for all semantic information about the relation
for the LLM to process. For example, the node Nitrogen Fixation is described as “Nitrogen Fixation
refers to the process of converting atmospheric nitrogen into a usable form for plants.” and connected
to the node Plants via a relation with the description “Plants rely on nitrogen fixation for growth
and development.” GraphRAG’s handling of rich semantic descriptions for both nodes and edges is
close to that of a property graph: high complexity in the description enables complex statements to be
expressed with relatively little notation, in strings that the LLM can process.</p>
        <p>
          Conventional knowledge graphs need additional mechanisms like reification or subgraphs to realize
similar contextual information, such as temporal or geographical annotations. And compared to
GraphRAG’s flexible EKG, conventional KGs are expensive to create and thus impractical when resources
are limited. Significant efort in ontology design, data integration, and maintenance is required to
create domain-specific knowledge graphs, with domain experts who define taxonomies and update
schemas [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. GraphRAG’s indexing pipeline, as a dynamic knowledge layer, eliminates this overhead
inherent to knowledge engineering by automatically extracting and structuring relationships as new
data is introduced just by re-indexing.
        </p>
        <p>While GraphRAG lacks ontology-driven reasoning and complex SPARQL queryability, it ofers
compensatory advantages. The flexible EKG enables continuous evidence integration, and scales
eficiently without the need for costly manual curation. This adaptability makes GraphRAG a
costefective alternative in resource-constrained settings, such as the CEEDER database.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Environmental Policy Impact</title>
        <p>
          Policy decisions are inherently tied to the quality and reliability of synthesized evidence. This is true for
both evidence reviews and artificially generated reports, making it essential that the methodologies used
to synthesize information are both accurate and comprehensible. In fact, EU policy-makers highlight
trust, regulatory uncertainty, and risks associated with AI, including its potential harms and the need for
explainability, as key challenges in harnessing AI for policy-driven applications such as climate change
mitigation [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. Ensuring AI transparency is crucial for informed decision-making, as misrepresented
or opaque evidence can lead to misguided policies with real-world consequences [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ].
        </p>
        <p>Unlike traditional AI models that generate responses without clear sourcing, Microsoft’s GraphRAG
provides provenance, explicitly linking generated statements back to their original text chunks in the
9https://github.com/infoqualitylab/CEEDER_study/blob/main/Spring2025_Christoph/mistral-nemo12b/visualized_merged_
graph.pdf</p>
        <sec id="sec-3-3-1">
          <title>Review Question</title>
          <p>What are the efects of nitrogen input on greenhouse gas emissions from wetland ecosystems?</p>
        </sec>
        <sec id="sec-3-3-2">
          <title>Response</title>
          <p>Nitrogen input significantly impacts greenhouse gas emissions from wetlands, primarily afecting
methane (CH4) and nitrous oxide (N2O) emissions.</p>
          <p>Here’s a summary of the key findings:
1. Methane (CH4) Emissions: Excess nitrogen can lead to increased methane production and
emission in wetlands [Data: Reports (2, 7, 64, +more)]. This is due to enhanced microbial activity
under high nitrogen conditions. However, the efects of nitrogen input on methane emissions can
vary depending on other factors such as water table depth and temperature [Data: Reports (4, 6)].
[...]</p>
        </sec>
        <sec id="sec-3-3-3">
          <title>3. Plant Community Composition and Greenhouse Gas Emissions: Nitrogen input may also</title>
          <p>alter plant community composition in wetlands, potentially afecting greenhouse gas emissions [Data:
Reports (3, 5)]. Changes in plant species can influence the amount of carbon sequestered by wetlands,
which indirectly impacts greenhouse gas emissions.
dataset and relationships in the entity KG. For instance, in Table 1, GraphRAG cites “[Data: Reports (2,
7, 64, +more)]”. Consecutive indices are sequential text fragments in the input; larger gaps indicate
that GraphRAG makes connections between multiple reviews’ findings. By linking research questions
and their answers, GraphRAG enables policy-makers to see how findings in diferent contexts and on
diferent populations relate through a common intervention. For example, consider the review question:
“What are the efects of nitrogen input on greenhouse gas emissions from wetland ecosystems?” A
separate review addresses: “What are the efects of nitrogen input on soil nitrous oxide emissions and
soil ammonia and nitrate concentrations in natural ecosystems?” Both study the efects of nitrogen
input but in diferent settings. A policy designed for nutrient management in wetlands could benefit
from lessons learned in natural ecosystems, identifying both shared mechanisms and critical contextual
diferences. As the CEEDER climate change collection grows, these overlaps will multiply. This will
enhance GraphRAG’s ability to find consensus, conflicting viewpoints, and trends across multiple
evidence reviews and their findings, going well beyond isolated summarizations. Ultimately, GraphRAG
could help policy-makers move from narrow, study-specific insights to a comprehensive understanding
of the broader landscape of environmental research.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Limitations and Future Work</title>
      <p>
        The sampled global queries with selected review questions already suggest an improvement over regular
RAG. However, to verify, this we will run an empirical evaluation. Since environmental science does
not have an existing labeled data set that could be used for validation, we will adopt Edge et al.’s
improvement to the LLM-as-a-Judge technique, employing a language model itself as an evaluator for
the output of other language models [
        <xref ref-type="bibr" rid="ref13 ref5">5, 13</xref>
        ]. The goal is to use two diferent models, one for generating
the data, the other for evaluating, to overcome egocentric bias [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. We will test diferent configurations
through a constructive ablation study to determine the highest contributors, including an LLM with
only concatenated abstracts in the context window, an LLM with RAG, an LLM with GraphRAG, and
ifnally, an LLM with GraphRAG with fine-tuned models for sustainability.
      </p>
      <p>
        Moving forward, we aim to extend GraphRAG’s EKG with lightweight ontologies in a hybrid approach
to enable additional dimensions to search, such as vertical exploration through evidence hierarchies,
utilizing CEEDER’s evidence overview collection, horizontal discovery of interdisciplinary connections
(e.g., linking the climate change collection to other review collections) and temporal analysis of evolving
scientific consensus across review versions. Two sustainability-focused enhancements could be made to
the core GraphRAG pipeline. First, we will use ClimateBERT [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], a fine-tuned model trained on 450,000
environmental science abstracts, to improve entity recognition accuracy for sustainability concepts.
Second, we will improve the evidence tracking by updating the cryptic “Data: Reports…” (Table 1)
provided by GraphRAGs out-of-the-box capabilities to enhance them with publication titles and DOIs.
      </p>
      <p>
        In the future, we envision the entity knowledge graph facilitating applications such as detecting
knowledge gaps and conflicting statements. Missing connections between concepts, sparse graph
regions, or dangling nodes may indicate knowledge gaps such as under-researched areas or open
questions. By traversing and inspecting the graph, either automatically, manually or both, it may
be possible to find conflicting or contrasting statements and compute maximum flow to identify the
statement with stronger support [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. This could provide decision-makers with insight into scientific
consensus and uncertainty.
      </p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions</title>
      <p>
        Sustainability research urgently needs automated methods to help policy-makers synthesize research
outcomes because it is one of the fastest growing fields of research, doubling in unique contributions
every 8.3 years, with work from diverse disciplines such as biology, chemistry, engineering, health
sciences, and social sciences [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Our GraphRAG pipeline improves knowledge discovery in environmental
science by structuring relationships between concepts that traditional retrieval models often conflate.
GraphRAG furthers LLMs’ capabilities for processing vast amounts of text to enable meaningful analysis
at scale.
      </p>
      <p>
        However, challenges remain, especially in the graph construction process. The lack of entity
disambiguation and semantic normalization risks missing complex relationships. Further, generative data
might not accurately represent the source data. System use should always be accompanied by clear
disclosures of AI use and the potential for errors in outputs [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Yet, unlike static KGs, GraphRAG
continuously integrates emerging evidence, making it well-suited for domains like sustainability, where
evolving research shapes policy decisions.
      </p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>
        This work was partly supported by NSF Grant #2046454 CAREER: Using network analysis to assess
confidence in research synthesis. This is a continuation of work Christoph Stade did [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] through a
DAAD RISE Worldwide scholarship at University of Illinois Urbana-Champaign while completing his
Bachelor’s degree at Technische Hochschule Nürnberg Georg Simon Ohm; he is now a Master’s student
at Ludwig-Maximilians-Universität München. Jodi Schneider is supported in part as the 2024–2025
Perrin Moorhead Grayson and Bruns Grayson Fellow, Harvard Radclife Institute for Advanced Study.
Thanks to Corinne McCumber, Heng Zheng, and Malik Salami for providing feedback on a draft.
      </p>
    </sec>
    <sec id="sec-7">
      <title>CRediT</title>
      <p>Christoph Stade - Conceptualization, Data curation, Formal analysis, Investigation, Software, Validation,
Visualization, Writing - original draft, Writing - review &amp; editing; Jodi Schneider - Funding acquisition,
Supervision, Writing - review &amp; editing; Yuanxi Fu - Conceptualization, Methodology, Supervision,
Writing - review &amp; editing</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used GPT-4o for: Drafting content, Paraphrase and
reword, and Grammar and spelling check. After using these tools/services, the authors reviewed and
edited the content as needed and take full responsibility for the publication’s content.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>L. M. A.</given-names>
            <surname>Bettencourt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kaur</surname>
          </string-name>
          ,
          <article-title>Evolution and structure of sustainability science</article-title>
          ,
          <source>Proceedings of the National Academy of Sciences</source>
          <volume>108</volume>
          (
          <year>2011</year>
          )
          <fpage>19540</fpage>
          -
          <lpage>19545</lpage>
          . doi:
          <volume>10</volume>
          .1073/pnas.1102712108.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>K.</given-names>
            <surname>Konno</surname>
          </string-name>
          , S. H. Cheng, J. Eales, G. Frampton,
          <string-name>
            <given-names>C.</given-names>
            <surname>Kohl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Livoreil</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Macura</surname>
          </string-name>
          , B.
          <string-name>
            <surname>C. O'Leary</surname>
            ,
            <given-names>N. P.</given-names>
          </string-name>
          <string-name>
            <surname>Randall</surname>
            ,
            <given-names>J. J.</given-names>
          </string-name>
          <string-name>
            <surname>Taylor</surname>
            , P. Woodcock,
            <given-names>A. S.</given-names>
          </string-name>
          <string-name>
            <surname>Pullin</surname>
          </string-name>
          ,
          <article-title>The CEEDER database of evidence reviews: An open-access evidence service for researchers and decision-makers,</article-title>
          <source>Environmental Science Policy</source>
          <volume>114</volume>
          (
          <year>2020</year>
          )
          <fpage>256</fpage>
          -
          <lpage>262</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.envsci.
          <year>2020</year>
          .
          <volume>08</volume>
          .021.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R.</given-names>
            <surname>Logan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. F.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. E.</given-names>
            <surname>Peters</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gardner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <article-title>Barack's wife Hillary: Using knowledge graphs for fact-aware language modeling</article-title>
          ,
          <source>in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>5962</fpage>
          -
          <lpage>5971</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>P19</fpage>
          -1598.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Hofer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Obraczka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Saeedi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Köpcke</surname>
          </string-name>
          , E. Rahm,
          <article-title>Construction of knowledge graphs: State and challenges</article-title>
          ,
          <source>Information</source>
          <volume>15</volume>
          (
          <year>2024</year>
          ). doi:
          <volume>10</volume>
          .3390/info15080509.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.</given-names>
            <surname>Edge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Trinh</surname>
          </string-name>
          , N. Cheng, J.
          <string-name>
            <surname>Bradley</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Chao</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Mody</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Truitt</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Larson</surname>
          </string-name>
          ,
          <article-title>From local to global: A Graph RAG approach to Query-Focused Summarization</article-title>
          ,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2404.16130.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>C.</given-names>
            <surname>Park</surname>
          </string-name>
          , Y. Ko, QSG Transformer:
          <article-title>Transformer with Query-attentive Semantic Graph for QueryFocused Summarization</article-title>
          ,
          <source>in: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '22</source>
          ,
          <year>2022</year>
          , p.
          <fpage>2589</fpage>
          -
          <lpage>2594</lpage>
          . doi:
          <volume>10</volume>
          .1145/ 3477495.3531901.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>R. C.</surname>
          </string-name>
          <article-title>B</article-title>
          . et al.,
          <article-title>Domain-specific retrieval-augmented generation using vector stores, knowledge graphs, and</article-title>
          tensor factorization,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2410.02721.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>T. K.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Goel</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Verma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Dey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bhardwaj</surname>
          </string-name>
          ,
          <article-title>Knowledge graph aided LLM based ESG question-answering from news</article-title>
          ,
          <source>in: Proceedings of the 2nd International Workshop on Knowledge Graphs for Sustainability</source>
          , volume
          <volume>3753</volume>
          ,
          <year>2024</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3753</volume>
          /paper6.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bastian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Heymann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Jacomy</surname>
          </string-name>
          ,
          <string-name>
            <surname>Gephi:</surname>
          </string-name>
          <article-title>An open source software for exploring and manipulating networks</article-title>
          ,
          <source>Proceedings of the International AAAI Conference on Web and Social Media</source>
          <volume>3</volume>
          (
          <year>2009</year>
          )
          <fpage>361</fpage>
          -
          <lpage>362</lpage>
          . doi:
          <volume>10</volume>
          .1609/icwsm.v3i1.
          <fpage>13937</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Hogan</surname>
          </string-name>
          , E. Blomqvist,
          <string-name>
            <given-names>M.</given-names>
            <surname>Cochez</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          <article-title>D'amato</article-title>
          , G. D.
          <string-name>
            <surname>Melo</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Gutierrez</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Kirrane</surname>
            ,
            <given-names>J. E. L.</given-names>
          </string-name>
          <string-name>
            <surname>Gayo</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Navigli</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Neumaier</surname>
            ,
            <given-names>A.-C. N.</given-names>
          </string-name>
          <string-name>
            <surname>Ngomo</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Polleres</surname>
            ,
            <given-names>S. M.</given-names>
          </string-name>
          <string-name>
            <surname>Rashid</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Rula</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Schmelzeisen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Sequeda</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Staab</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Zimmermann</surname>
          </string-name>
          ,
          <article-title>Knowledge graphs</article-title>
          ,
          <source>ACM Comput. Surv</source>
          .
          <volume>54</volume>
          (
          <year>2021</year>
          ). doi:
          <volume>10</volume>
          . 1145/3447772.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>F. af Malmborg</surname>
          </string-name>
          ,
          <article-title>Narrative dynamics policy-Sensemaking, agency construction, and anchoring</article-title>
          ,
          <source>Review of Policy Research</source>
          <volume>40</volume>
          (
          <year>2022</year>
          )
          <fpage>757</fpage>
          -
          <lpage>780</lpage>
          . doi:
          <volume>10</volume>
          .1111/ropr.12529.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>J.</given-names>
            <surname>Gurevitch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Koricheva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Nakagawa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. B.</given-names>
            <surname>Stewart</surname>
          </string-name>
          ,
          <article-title>Meta-analysis and the science of research synthesis</article-title>
          ,
          <source>Nature</source>
          <volume>555</volume>
          (
          <year>2018</year>
          )
          <fpage>175</fpage>
          -
          <lpage>182</lpage>
          . doi:
          <volume>10</volume>
          .1038/nature25753.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>L.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Chiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Zhuang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhuang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. P.</given-names>
            <surname>Xing</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. E.</given-names>
            <surname>Gonzalez</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Stoica</surname>
          </string-name>
          ,
          <article-title>Judging LLM-as-a-judge with MT-Bench and</article-title>
          <string-name>
            <surname>Chatbot Arena</surname>
          </string-name>
          ,
          <year>2023</year>
          . doi:
          <volume>10</volume>
          .48550/arxiv.2306.05685.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>D.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Beigi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Tan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bhattacharjee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Shu</surname>
          </string-name>
          , L. Cheng, H. Liu,
          <article-title>From generation to judgment: Opportunities and challenges of LLM-as-a-</article-title>
          <string-name>
            <surname>judge</surname>
          </string-name>
          ,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2411.16594.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>N.</given-names>
            <surname>Webersinke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kraus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bingler</surname>
          </string-name>
          , M. Leippold,
          <article-title>ClimateBERT: A pretrained language model for climate-related text</article-title>
          ,
          <source>in: Proceedings of AAAI 2022 Fall Symposium: The Role of AI</source>
          in Responding to Climate Challenges,
          <year>2022</year>
          . doi:
          <volume>10</volume>
          .5167/uzh-235046.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>P.</given-names>
            <surname>Shiralkar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Flammini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Menczer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. L.</given-names>
            <surname>Ciampaglia</surname>
          </string-name>
          ,
          <article-title>Finding streams in knowledge graphs to support fact checking</article-title>
          ,
          <source>in: 2017 IEEE International Conference on Data Mining (ICDM)</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>859</fpage>
          -
          <lpage>864</lpage>
          . doi:
          <volume>10</volume>
          .1109/ICDM.
          <year>2017</year>
          .
          <volume>105</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>C.</given-names>
            <surname>Stade</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schneider</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Fu</surname>
          </string-name>
          ,
          <article-title>Making sense of the environmental science landscape: An exploration of the CEEDER database</article-title>
          , in: METSTI 2024: Workshop on Informetric,
          <source>Scientometric, and Scientific and Technical Information Research</source>
          ,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .5281/zenodo.13895517.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>