<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Advancing Cross-Document Relation Extraction with Hybrid Retrieval and Knowledge-Augmented Reasoning</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marco Martinelli</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Information Engineering, University of Padova</institution>
          ,
          <addr-line>Padova</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2026</year>
      </pub-date>
      <abstract>
        <p>Existing Relation Extraction (RE) methods typically focus on extracting relational facts between entity pairs within single sentences or documents. However, in practice, a large amount of relational facts can only be inferred by reasoning across multiple documents. In this work, we introduce the task of Cross-Document Relation Extraction (CDRE), placed in between the domains of Information Retrieval (IR) and Natural Language Processing (NLP). CDRE enables the acquisition of knowledge in the wild, making it better suited for real-world use cases where relevant information is scattered across multiple sources. After formally introducing the task and the components involved in a CDRE system, we present the research directions that we plan to pursue to advance the state of the art. Specifically, we propose to integrate sparse and dense retrieval models with the heuristic-based methods currently employed in CDRE to improve the retrieval efectiveness of relevant passages from multiple documents. To further improve this retrieval, we introduce path-ranking algorithms as re-rankers to filter out less informative passages. Additionally, we explore leveraging graph-based representations to enhance document retrieval. Next, we plan to adapt Knowledge Injection (KI) techniques widely employed in sentenceand document-level RE to the CDRE setting, aiming to improve their robustness against syntactic and semantic variations, hence enhancing extraction efectiveness. Finally, we present an evaluation framework designed to assess the overall performances of CDRE systems and analyze the impact of each individual component.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Natural Language Processing</kwd>
        <kwd>Relation Extraction</kwd>
        <kwd>Cross-Document Relation Extraction</kwd>
        <kwd>Information Retrieval</kwd>
        <kwd>Document Retrieval</kwd>
        <kwd>Passage Retrieval</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        RE is a crucial NLP task that aims to detect the semantic relations between a pair or target entities in a
given text. It is fundamental for natural language understanding, automated Knowledge Bases (KBs)
construction and population, and, more generally, for nearly all knowledge-driven AI tasks [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ].
      </p>
      <p>
        Most of the RE research has been limited to scenarios where the entity pair is within a single sentence
or document [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. However, many use cases require extracting relations across multiple documents,
where the target entity pair may not coexist within the same document. For example, more than 57.6%
of the relational facts in Wikidata are not described in individual Wikipedia documents [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>That led to the introduction of the CDRE task, which breaks through the limitations of document
boundaries to acquire knowledge from multiple text sources. CDRE requires a RE system to infer the
relation holding between two entities by retrieving and reasoning over multiple documents from a
large-scale corpus of documents. Therefore, CDRE places itself in the middle of the IR and NLP domains.</p>
      <p>
        Compared to more traditional RE tasks, CDRE introduces two main challenges. The first relates to
Document Retrieval (DR) and Passage Retrieval (PR), requiring systems to identify relevant documents
and extract from them informative passages for the target entity pair. The second dificulty is associated
with reasoning, since working at the cross-document level requires systems to perform both intra- and
cross-document reasoning across multiple documents and then predict the relations by aggregating
information [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>In this research project, the primary efort will be on improving the retrieval component of CDRE
systems, using dense and sparse retrievers typically employed in IR and path-mining and ranking
approaches used in Open-Domain Question Answering (ODQA). Although these techniques have
shown great potential in their specific domains, their synergy with CDRE is still in its early stages.
Improvements can be made in matching relevant passages scattered across multiple documents,
balancing syntactic and semantic aspects to ensure that relevant information is captured both for individual
entities and for the entity pair as a whole, and in optimizing the ranking specifically for the CDRE task,
prioritizing the most informative passages for relation inference.</p>
      <p>Furthermore, we will model DR in CDRE using knowledge graphs to implement a hybrid approach
that integrates traditional keyword-based methods with knowledge graph-based ones.</p>
      <p>
        The next step will be to employ KI, which is incorporating external information (e.g., KBs and
ontologies) into CDRE models to improve their reasoning and inference capabilities. This technique
has been widely employed in both sentence- and document-level RE, showing great efectiveness
[
        <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
        ]. However, the integration of these techniques at the cross-document level is more dificult since
managing context and ensuring that injected knowledge remains accurate and pertinent throughout
diferent documents presents significant challenges.
      </p>
      <p>The following sections are structured as follows: Section 2 describes the CDRE task in detail and
presents the current methods and resources for CDRE; Section 3 details the proposed research questions
and directions; and Section 4 concludes with some final remarks.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Task Description</title>
      <p>Given a target entity pair (ℎ, ) a (large-scale) corpus of documents , and a set of relations  =
{1, . . . , } ∪ N/A, the CDRE task can be decomposed into four stages:
1. Document Retrieval (DR), which requires the system to find documents {1, . . . , } ∈ 
relevant to the pair (ℎ, ).
2. Passage Retrieval (PR), which extracts the most informative passages {1, . . . ,  } ∈
{1, . . . , } to ensure that the input fits within the token limit of the employed RE model.
3. Input Construction (IC), that is, processing the selected passages into a format suitable for RE.</p>
      <p>This usually involves combining the selected passages into a single text representation, hence
reconducing the CDRE task to a document-level RE problem.
4. Relation Extraction (RE), consisting of reasoning over the provided input to predict the relation
 ∈  between ℎ and .</p>
      <p>In that scope, we define an entity  as a bridge entity if it is in relation with ℎ in a document  and
with  in a document . Furthermore, a pair of documents (, ) containing, respectively, the head
(ℎ ∈ ) and tail ( ∈ ) entities and connected by a bridge entity is called (reasoning) text path.</p>
      <p>In Figure 1, the documents “Pink Floyd: Live at Pompeii” and “Progressive music” represent a text path,
with the bridging entities that connect the passages linked with arrows.</p>
      <sec id="sec-2-1">
        <title>2.1. Related Works</title>
        <p>
          Currently, there is only one dataset specifically designed to test the RE systems’ ability at the
crossdocument level: CodRED [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. It includes over 30’000 positive relations associated with more than
210’000 documents, providing ground truth for both DR and CDRE. Furthermore, CodRED features a
subset of relations annotated with evidence sentences, namely sentences belonging to the head and tail
documents that allow the inference of the annotated relation, providing ground truth for PR.
        </p>
        <p>CodRED includes two benchmark settings to fully evaluate each required capability of a CDRE
system:
1. Closed setting: The model is provided with the entity pair (ℎ, ) and the corresponding text
path (ℎ, ) from which to establish the relation.</p>
        <p>Pink Floyd: Live at Pompeii
Passage 1
Pink Floyd: Live at Pompeii is a 1972 concert
documentary film directed by Adrian Maben and
featuring the English rock group Pink Floyd performing
at the ancient Roman amphitheatre in Pompeii, Italy …
Passage 5
The performances of Echoes, A Saucerful of Secrets,
and One of These Days were filmed from 4 to 7 October
1971. O'Rourke delivered a demo to Maben in order for
him to prepare for the various shots required, which he
finally managed to do the night before filming started …</p>
        <p>Progressive Music
Passage 7
… development of late 1960s progressive rock
exemplified by the Moody Blues, Procol Harum,</p>
        <p>Pink Floyd, and the Beatles …
A Saucerful of Secrets
(Q207661)</p>
        <p>Genre (P136)</p>
        <p>Progressive rock
(Q49451)</p>
        <p>2. Open setting: The model is provided only with the entity pair (ℎ, ) and needs to first retrieve
relevant documents from the corpus of documents before determining the relation.</p>
        <p>Previous works have focused on one or both of these settings, implementing one or more of the
stages required in CDRE, as summarized in Table 1.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Research Directions</title>
      <p>This project aims to integrate techniques from the IR, ODQA, and KI domains into CDRE, addressing
two interlinked yet distinct objectives: enhance the DR, PR, and IC stages to improve extraction
independently of the employed downstream RE model, and advance the cross-document reasoning
capabilities of RE models to advance the state of the art (SOTA) in terms of efectiveness. To achieve
these goals, three Research Questions (RQs) have been outlined:
RQ1: Are dense and sparse retrieval methods, along with path-ranking algorithms, efective for
retrieving relevant passages for CDRE? Is it appropriate to apply knowledge graphs for DR in
CDRE?
RQ2: Can KI enhance the robustness of CDRE models against syntactic and semantic variations and
improve their flexibility to adapt to varying relation descriptions across diferent contexts?
RQ3: How to evaluate the efectiveness of the proposed CDRE systems and the individual impact of
the employed methods on the overall CDRE efectiveness?</p>
      <sec id="sec-3-1">
        <title>3.1. Research Design &amp; Methods</title>
        <p>Based on the defined RQs, the research framework illustrated in Figure 2 has been outlined.</p>
        <p>RQ1: Input Construction with Neural Retrievers and Path-Mining
RQ2: Cross-Document RE with Knowledge Injection
Neural
Retrievers
Path-Mining
Algorithms</p>
        <p>
          Regarding RQ1 (see the blue rectangle), the first step is to reproduce the heuristic-based methods
proposed in [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] and [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] and integrate the scores from diferent PR models, such as SPLADE [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ],
Contriever [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ], DPR [13], and ColBERT [14], into these heuristics. The inputs constructed in this
manner will then be fed into the same downstream RE model to assess their diferent impacts on
extraction efectiveness.
        </p>
        <p>
          Next, we will select the PR models that demonstrate the best performance and replace the
heuristicbased methods with path-ranking algorithms. Specifically, we will first retrieve relevant passages and
then rank them using these algorithms, extending the approach proposed in [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] by constructing a
knowledge graph of passages and scoring their importance based on their centrality within the graph
[15].
        </p>
        <p>
          Finally, we will implement a hybrid DR approach that integrates traditional keyword-based methods,
such as BM25 and TF-IDF, with knowledge-graph based techniques. Given an entity pair (ℎ, ), we will
ifrst construct a knowledge graph where documents are nodes and edges represent bridging entities,
shared entities, and semantic similarity [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. These edges will be weighted, with weights potentially
learnable. We will then estimate the centrality of documents within this graph and use it as a relevance
score [16, 17], which will be combined with the traditional scoring functions mentioned above to select
the most relevant documents.
        </p>
        <p>To answer RQ2 (see the green rectangle), we will enhance the text understanding in CDRE models
by augmenting texts with detailed attributes and relationships from domain-specific ontologies and
KBs.</p>
        <p>We will address imperfections and coverage gaps of KBs by extending techniques applied for automatic
KBs enrichment, such as random walks [18] and rule mining models [19], to infer missing details in
KBs and enhance the accuracy of augmented texts.</p>
        <p>Subsequently, we will address the challenge of adapting to relation descriptions and syntactic and
semantic variations by adapting techniques from RE tasks that employ pre-trained Language Models
(LMs) [20] and Large Language Models (LLMs) [21], known for their efectiveness in capturing
longdistance relations and mapping entities to unique identifiers [ 22]. Furthermore, by incorporating KI, we
aim to enhance the robustness and context awareness of these models, reducing their susceptibility to
hallucinations and insensitivity to negations.</p>
        <p>To address RQ3 (see yellow rectangle), we will first evaluate the impact of the various PR and IC
approaches on the efectiveness of the downstream CDRE models.</p>
        <p>Following the identification of the most efective ones, we will integrate them into a single model
(i.e., RQ1) and conduct an ablation study to assess their individual contributions. These approaches
will then be applied to construct the inputs for CDRE models and LMs with external KI (i.e., RQ2), with
another round of iterative evaluation to refine their application.</p>
        <p>We will then evaluate the efectiveness of the developed DR methods by comparing the RE scores
obtained for the same entity pairs in both the open and closed settings, allowing us to determine
whether the retrieved documents enable the models to predict the same relations they identify when
provided with the exact document pair and, consequently, understand the impact of DR on the overall
performance of CDRE systems.</p>
        <p>For evaluation, DR and PR performances will be measured using standard IR evaluation metrics:
Precision@K, Recall@K, F1-score, Mean Average Precision (MAP), and Normalized Discounted Cumulative
Gain (NDCG). Concerning RE, since it can be considered a multi-label classification problem, Precision,
Recall, and F1-score will be used for evaluation.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>The CDRE task lies in between the IR and NLP domains, presenting a wide range of challenges at
diferent levels of granularity. Despite the modernity of the task, various promising approaches have
been proposed in the literature, implementing one or more of the elements involved in a CDRE system.</p>
      <p>However, there is significant room for improvement in all components of CDRE, along with a need to
better understand their individual impact on overall efectiveness. Therefore, the primary foundational
step of this project involves working on the retrieval and IC layers by reproducing and augmenting the
solutions currently developed and evaluating their impact on a fixed downstream RE model. After that,
the focus of the project will be on improving the reasoning and inference capabilities of CDRE models
by leveraging external KI.</p>
      <p>The final goal of my research is the release of an end-to-end CDRE system that can work in both closed
and open settings, improving the current SOTA and potentially enhancing the integration between
Document Retrieval, Passage Retrieval, Input Construction, and reasoning to achieve more robust and
efective Cross-Document Relation Extraction.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This project has received funding from the HEREDITARY Project, as part of the European Union’s
Horizon Europe research and innovation programme under grant agreement No GA 101137074.</p>
    </sec>
    <sec id="sec-6">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the author used GPT-4o and Grammarly in order to: Grammar
and spelling check. After using these tools, the author reviewed and edited the content as needed and
takes full responsibility for the publication’s content.
information retrieval with contrastive learning, CoRR (2021). URL: https://arxiv.org/abs/2112.09118.
doi:10.48550/ARXIV.2112.09118.
[13] V. Karpukhin, B. Oguz, S. Min, P. Lewis, L. Wu, S. Edunov, D. Chen, W.-t. Yih, Dense passage
retrieval for open-domain question answering, in: Proceedings of the 2020 Conference on Empirical
Methods in Natural Language Processing (EMNLP), Association for Computational Linguistics,
Online, 2020, p. 6769–6781. URL: https://www.aclweb.org/anthology/2020.emnlp-main.550. doi:10.
18653/v1/2020.emnlp-main.550.
[14] K. Santhanam, O. Khattab, J. Saad-Falcon, C. Potts, M. Zaharia, Colbertv2: Efective and eficient
retrieval via lightweight late interaction, in: Proceedings of the 2022 Conference of the North
American Chapter of the Association for Computational Linguistics: Human Language
Technologies, Association for Computational Linguistics, Seattle, United States, 2022, p. 3715–3734. URL:
https://aclanthology.org/2022.naacl-main.272. doi:10.18653/v1/2022.naacl-main.272.
[15] R. Ribeiro, D. M. de Matos, Revisiting centrality-as-relevance: support sets and similarity as
geometric proximity, J. Artif. Int. Res. 42 (2011) 275–308.
[16] W. Kim, H. Jang, H.-J. Kim, D. Kim, A document query search using an extended centrality with the
word2vec, in: Proceedings of the 18th Annual International Conference on Electronic Commerce:
ECommerce in Smart Connected World, ICEC ’16, Association for Computing Machinery, New York,
NY, USA, 2016. URL: https://doi.org/10.1145/2971603.2971617. doi:10.1145/2971603.2971617.
[17] O. Levi, F. Raiber, O. Kurland, I. Guy, Selective cluster-based document retrieval, in: Proceedings
of the 25th ACM International on Conference on Information and Knowledge Management,
CIKM ’16, Association for Computing Machinery, New York, NY, USA, 2016, p. 1473–1482. URL:
https://doi.org/10.1145/2983323.2983737. doi:10.1145/2983323.2983737.
[18] N. Lao, T. Mitchell, W. W. Cohen, Random walk inference and learning in a large scale knowledge
base, in: R. Barzilay, M. Johnson (Eds.), Proceedings of the 2011 Conference on Empirical Methods
in Natural Language Processing, Association for Computational Linguistics, Edinburgh, Scotland,
UK., 2011, pp. 529–539. URL: https://aclanthology.org/D11-1049/.
[19] L. A. Galárraga, C. Teflioudi, K. Hose, F. Suchanek, Amie: association rule mining under incomplete
evidence in ontological knowledge bases, in: Proceedings of the 22nd international conference on
World Wide Web, ACM, Rio de Janeiro Brazil, 2013, p. 413–422. URL: https://dl.acm.org/doi/10.
1145/2488388.2488425. doi:10.1145/2488388.2488425.
[20] A. Romadhony, D. H. Widyantoro, A. Purwarianti, Utilizing structured knowledge bases in
open ie based event template extraction, Applied Intelligence 49 (2019) 206–219. doi:10.1007/
s10489-018-1269-0.
[21] D. Xu, W. Chen, W. Peng, C. Zhang, T. Xu, X. Zhao, X. Wu, Y. Zheng, E. Chen, Large language
models for generative information extraction: A survey, arXiv preprint arXiv:2312.17617 (2023).
[22] J. H. Caufield, H. Hegde, V. Emonet, N. L. Harris, M. P. Joachimiak, N. Matentzoglu, H. Kim,
S. Moxon, J. T. Reese, M. A. Haendel, P. N. Robinson, C. J. Mungall, Structured prompt interrogation
and recursive extraction of semantics (spires): a method for populating knowledge bases using
zero-shot learning, Bioinformatics 40 (2024) btae104. doi:10.1093/bioinformatics/btae104.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Ali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Saleem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Moussallem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Sherif</surname>
          </string-name>
          , A.
          <string-name>
            <surname>-C. Ngonga Ngomo</surname>
          </string-name>
          ,
          <article-title>Reld: A knowledge graph of relation extraction datasets</article-title>
          , in: C.
          <string-name>
            <surname>Pesquita</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Jimenez-Ruiz</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>McCusker</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Faria</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Dragoni</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Dimou</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Troncy</surname>
          </string-name>
          , S. Hertling (Eds.),
          <source>The Semantic Web</source>
          , Springer Nature Switzerland, Cham,
          <year>2023</year>
          , pp.
          <fpage>337</fpage>
          -
          <lpage>353</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>K.</given-names>
            <surname>Lu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.-H.</given-names>
            <surname>Hsu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. D.</given-names>
            <surname>Ma</surname>
          </string-name>
          , M.
          <article-title>Chen, Multi-hop evidence retrieval for cross-document relation extraction, in: Findings of the Association for Computational Linguistics: ACL 2023, Association for Computational Linguistics</article-title>
          , Toronto, Canada,
          <year>2023</year>
          , p.
          <fpage>10336</fpage>
          -
          <lpage>10351</lpage>
          . URL: https: //aclanthology.org/
          <year>2023</year>
          .findings-acl.
          <volume>657</volume>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <year>2023</year>
          .findings-acl.
          <volume>657</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Du</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <article-title>Codred: A cross-document relation extraction dataset for acquiring knowledge in the wild</article-title>
          ,
          <source>in: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing</source>
          , Association for Computational Linguistics, Online and
          <string-name>
            <given-names>Punta</given-names>
            <surname>Cana</surname>
          </string-name>
          , Dominican Republic,
          <year>2021</year>
          , p.
          <fpage>4452</fpage>
          -
          <lpage>4472</lpage>
          . URL: https://aclanthology.org/
          <year>2021</year>
          . emnlp-main.
          <volume>366</volume>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <year>2021</year>
          .emnlp-main.
          <volume>366</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Sun</surname>
          </string-name>
          , W. Hu,
          <article-title>Enhancing document-level relation extraction by entity knowledge injection</article-title>
          ,
          <source>in: The Semantic Web - ISWC</source>
          <year>2022</year>
          : 21st International Semantic Web Conference, Virtual Event,
          <source>October 23-27</source>
          ,
          <year>2022</year>
          , Proceedings, Springer-Verlag, Berlin, Heidelberg,
          <year>2022</year>
          , p.
          <fpage>39</fpage>
          -
          <lpage>56</lpage>
          . URL: https://doi.org/10.1007/978-3-
          <fpage>031</fpage>
          -19433-
          <issue>7</issue>
          _3. doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>031</fpage>
          -19433-
          <issue>7</issue>
          _
          <fpage>3</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>B.</given-names>
            <surname>Chen</surname>
          </string-name>
          , G. Yuan,
          <article-title>Domain knowledge-driven relation extraction methods</article-title>
          ,
          <source>Journal of Data Science and Intelligent Systems</source>
          (
          <year>2024</year>
          ). URL: https://ojs.bonviewpress.com/index.php/jdsis/article/view/ 2524. doi:
          <volume>10</volume>
          .47852/bonviewJDSIS42022524.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>F.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Fei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Su</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Shi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ji</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Cai</surname>
          </string-name>
          ,
          <article-title>Entity-centered cross-document relation extraction</article-title>
          ,
          <source>in: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing</source>
          , Association for Computational Linguistics, Abu Dhabi, United Arab Emirates,
          <year>2022</year>
          , p.
          <fpage>9871</fpage>
          -
          <lpage>9881</lpage>
          . URL: https://aclanthology.org/
          <year>2022</year>
          .emnlp-main.
          <volume>671</volume>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <year>2022</year>
          . emnlp-main.
          <volume>671</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Son</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Jang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lim</surname>
          </string-name>
          ,
          <article-title>Explore the way: Exploring reasoning path by bridging entities for efective cross-document relation extraction, in: Findings of the Association for Computational Linguistics: EMNLP 2023, Association for Computational Linguistics</article-title>
          , Singapore,
          <year>2023</year>
          , p.
          <fpage>6755</fpage>
          -
          <lpage>6761</lpage>
          . URL: https://aclanthology.org/
          <year>2023</year>
          .findings-emnlp.
          <volume>450</volume>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <year>2023</year>
          .findings-emnlp.
          <volume>450</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Jain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mutharaju</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kavuluru</surname>
          </string-name>
          ,
          <article-title>Knowledge-driven cross-document relation extraction</article-title>
          , in: L.
          <string-name>
            <surname>-W. Ku</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Martins</surname>
          </string-name>
          , V. Srikumar (Eds.),
          <source>Findings of the Association for Computational Linguistics: ACL</source>
          <year>2024</year>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , Bangkok, Thailand,
          <year>2024</year>
          , pp.
          <fpage>3787</fpage>
          -
          <lpage>3797</lpage>
          . URL: https://aclanthology.org/
          <year>2024</year>
          .findings-acl.
          <volume>227</volume>
          /. doi:
          <volume>10</volume>
          .18653/v1/
          <year>2024</year>
          . findings-acl.
          <volume>227</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>B.</given-names>
            <surname>Na</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Kim</surname>
          </string-name>
          , I.-c. Moon,
          <article-title>Reward-based input construction for cross-document relation extraction</article-title>
          ,
          <source>in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume</source>
          <volume>1</volume>
          :
          <string-name>
            <surname>Long</surname>
            <given-names>Papers)</given-names>
          </string-name>
          ,
          <source>Association for Computational Linguistics</source>
          , Bangkok, Thailand,
          <year>2024</year>
          , p.
          <fpage>9254</fpage>
          -
          <lpage>9270</lpage>
          . URL: https://aclanthology.org/
          <year>2024</year>
          .
          <article-title>acl-long</article-title>
          .
          <volume>501</volume>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <year>2024</year>
          .
          <article-title>acl-long</article-title>
          .
          <volume>501</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>H.</given-names>
            <surname>Yue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Su</surname>
          </string-name>
          ,
          <article-title>Towards better graph-based cross-document relation extraction via non-bridge entity enhancement and prediction debiasing, in: Findings of the Association for Computational Linguistics ACL 2024</article-title>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , Bangkok, Thailand and virtual meeting,
          <year>2024</year>
          , p.
          <fpage>680</fpage>
          -
          <lpage>691</lpage>
          . URL: https://aclanthology.org/
          <year>2024</year>
          . ifndings-acl.
          <volume>38</volume>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <year>2024</year>
          .findings-acl.
          <volume>38</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>T.</given-names>
            <surname>Formal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Piwowarski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Clinchant</surname>
          </string-name>
          , Splade:
          <article-title>Sparse lexical and expansion model for first stage ranking</article-title>
          ,
          <source>in: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          ,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , Virtual Event Canada,
          <year>2021</year>
          , p.
          <fpage>2288</fpage>
          -
          <lpage>2292</lpage>
          . URL: https://dl.acm.org/doi/10.1145/3404835.3463098. doi:
          <volume>10</volume>
          .1145/3404835.3463098.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>G.</given-names>
            <surname>Izacard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Caron</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Hosseini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Riedel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bojanowski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Joulin</surname>
          </string-name>
          , E. Grave, Unsupervised dense
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>