<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>LIS at SimpleText 2025: Enhancing Scientific Text Accessibility with LLMs and Retrieval-Augmented Generation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Anya Amel Nait djoudi</string-name>
          <email>anya.nait-djoudi@lis-lab.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sarah Nouali</string-name>
          <email>sarah.nouali@lis-lab.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mohsine Aabid</string-name>
          <email>mohsine.aabid@lis-lab.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ismail Badache</string-name>
          <email>ismail.badache@lis-lab.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Adrian-Gabriel Chifu</string-name>
          <email>adrian.chifu@lis-lab.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Patrice Bellot</string-name>
          <email>patrice.bellot@lis-lab.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Aix Marseille Université</institution>
          ,
          <addr-line>CNRS, LIS, Marseille</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <abstract>
        <p>To improve public access to scientific knowledge, this work introduces a scientific text simplification model that combines Large Language Models (LLMs) with a Retrieval-Augmented Generation (RAG) framework. This paper presents the contribution of the R2I1 team from the LIS Laboratory2 to the SimpleText 2025 Lab, specifically Task 1.2: Document-level Scientific Text Simplification. Our main contribution is MedSimplify, a glossary of over 3,000 simplified definitions complied from multiple public medical sources. These definitions are integrated into a prompt-based simplification pipeline. We evaluated several LLM configurations for text simplification. Our results show that Mistral 7B with zero-shot prompting and MedSimplify definitions ( Mistral_DASP_0) was the best-performing system, achieving a SARI score of 43.51 the highest among all our submissions. This result placed our team 5th in the CLEF 2025 SimpleText Task 1.2. Our findings show that grounding simplification in curated domain specific definitions improves readability while maintaining factual accuracy.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Text simplification</kwd>
        <kwd>Scientific text simplification</kwd>
        <kwd>Biomedical text simplification</kwd>
        <kwd>Large Language Models (LLM)</kwd>
        <kwd>Retrieval-Augmented Generation (RAG)</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Scientific publications serve as a primary medium for communicating research findings across a wide
spectrum of disciplines, including but not limited to biomedical science. While scientific publications
such as those in the biomedical domain are essential for advancing knowledge and informing public
understanding, their heavy use of technical terminology and limited background context often makes
them inaccessible to non-expert audiences [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. Understanding language spoken or written requires
constructing a situation model that integrates both the explicit content and inferred meanings drawn
from relevant background knowledge. Without this domain-specific knowledge, even fluent readers may
struggle to comprehend scientific texts, regardless of their ability to decode the words (e.g., when reading
a technical article on astrophysics) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. This helps explain why jargon not only impedes comprehension
but also cognitive processing, leading to increased resistance to persuasion, heightened risk perceptions,
and reduced support for scientific advancements [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. This accessibility gap has sparked growing interest
in the field of scientific text simplification, which seeks to bridge the divide between complex academic
writing and broader public understanding.
      </p>
      <p>
        In response to this challenge, initiatives such as the CLEF 2025 SimpleText Track [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] have emerged
to promote the development and evaluation of automated systems capable of simplifying scientific
literature for non-expert audiences. In particular, Task 1.2 [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] focuses on identifying and explaining
scientific jargon in a simplified and accessible manner.
      </p>
      <p>
        Recent work [
        <xref ref-type="bibr" rid="ref7 ref8 ref9">7, 8, 9</xref>
        ] has demonstrated the potential of large language models (LLMs) in text
simplification. Furthermore, Retrieval-Augmented Generation (RAG) [ 10] has been explored as a way
to enrich outputs with external knowledge. For instance, [11] applied RAG to incorporate definitions
and explanatory content from sources like Wikipedia 1, yielding slight improvements in relevance and
readability.
      </p>
      <p>Building on these insights, our approach in Task 1.2 combines LLMs and RAG to enhance the
simpliifcation of scientific jargon. We employed models such as Mistral [ 12] and LLaMA [13], experimenting
with diverse prompting strategies to generate clear and concise definitions tailored to non-expert users.
To enrich the context and improve definition quality, we integrated retrieval from curated knowledge
sources, enabling the models to ground their responses in relevant background information.</p>
      <p>Our approach aims to improve the interpretability of domain-specific terminology while minimizing
the risk of hallucinations or factual errors, thereby contributing to broader eforts in making scientific
content more transparent and accessible.</p>
      <p>Our main contributions are:
• MedSimplify: a glossary of over 3,000 simplified definitions for medical terms.
• An LLM-based pipeline for automatically extracting domain-specific terminology.
• Integration of simplified definitions into prompting strategies (zero-shot, one-shot, and iterative
refinement) to support efective scientific text simplification.</p>
      <p>The rest of this paper is organized as follows. In Section 2, we present our experimental setup and
describe the specific runs submitted. Section 3 discusses the results obtained from these runs and
includes additional post-competition experiments. Finally, in Section 4, we conclude the paper by
summarizing the key lessons learned and outlining future perspectives.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Experimental Setup</title>
      <sec id="sec-2-1">
        <title>2.1. Dataset</title>
        <p>In this section, we detail the approach we propose for Task 1.2 of the CLEF 2025 SimpleText Track.
For Task 1.2 of the CLEF 2025 SimpleText Track, we relied on datasets provided by the organizers,
primarily the Cochrane-auto corpus [14] which consists of aligned pairs of biomedical abstracts
and corresponding lay summaries, originally sourced from the Cochrane Database of Systematic
Reviews (CDSR). Cochrane-auto provides alignment at the document, paragraph, and sentence levels.
It includes: 1,085 document pairs, 4,171 paragraph pairs, 14,719 sentence pairs. Unlike the training data,
which was limited to Cochrane-auto, the test set as shown in Figure 1 comprised examples from a
broader set of biomedical sources, including: 217 texts from Cochrane, 236 texts from Cochrane-auto
(119 in the validation and 117 in the test splits, respectively), 110 texts from Medline, 103 texts from
SimpleText2024 [15]. This diverse composition introduced variability in writing style, structure, and
vocabulary, increasing the complexity of the task. In particular, Medline and SimpleText 2024 included
content not aligned in the same way as Cochrane-auto, which required our models to generalize beyond
the distribution of the training data.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Terminology Simplification Glossary</title>
        <p>To address lexical complexity in biomedical texts, we developed MedSimplify 2, a specialized
glossary of simplified definitions for complex biomedical terms. As discussed in [ 16], complex words,</p>
        <sec id="sec-2-2-1">
          <title>1https://fr.wikipedia.org/ 2https://github.com/Anyantd/MedSimplify/</title>
          <p>particularly those that are rare, domain specific or used in unusual contexts, can hinder understanding
and disturb reader focus. These challenges are especially pronounced in biomedical communication,
where technical terminology may not align with readers’ background knowledge. MedSimplify was
constructed by aggregating and unifying content from multiple publicly available medical glossaries
that were contructed by researchers and experts in the medical field: the Glossary of Lay Terminology
for Consent Forms3, Plain Language Thesaurus for Health Communications4, CLAD-Thesaurus5, Plain
English Health Dictionary6, Glossary (in Lay Terms)7. They were systematically extracted, cleaned, and
de-duplicated to produce a unified dataset, which was then stored in CSV format. Each entry within the
MedSimplify glossary comprises a medical term and its corresponding layman-friendly definition. In
instances where multiple definitions were available for a given term, the shortest one was consistently
selected to accommodate token limitations inherent in the Retrieval-Augmented Generation (RAG)
prompting pipeline. The final glossary encompasses over 3,000 entries and occupies approximately
178 kB. While this dataset was primarily designed to enhance contextual grounding within our RAG
framework, it is important to acknowledge that each term is associated with only a single definition,
which may not fully capture nuanced, context-specific meanings.</p>
        </sec>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Preprocessing</title>
        <p>Given the domain-specific complexity of biomedical texts, we employed the Mistral 7B language model
to identify potentially challenging terms for non-expert readers. For each test instance, the full input
text was provided to the model, with prompts crafted to elicit a list of domain specific keywords likely to
hinder comprehension. We did not impose a fixed number of keywords, allowing the model to determine
3https://feinstein.northwell.edu/sites/northwell.edu/files/2019-07/Glossary-of-Lay-Terminology-for-Consent-Forms-07-19.
pdf
4https://stacks.cdc.gov/view/cdc/11500
5https://clad.tccld.org/wp-content/uploads/2014/12/CLAD-Thesaurus.pdf
6https://nt.gov.au/__data/assets/pdf_file/0006/1257567/aid-plain-english-health-dictionary-spread.pdf
7https://rcm1.rcm.upr.edu/institutionalreview/wp-content/uploads/sites/16/2020/04/layterms.pdf
term salience dynamically (see Table 1 for prompt details). The output was post-processed to remove
extraneous content such as formatting artifacts, or irrelevant introductory phrases. Extracted terms
were then matched against entries in our terminology simplification glossary (Section 2.2) using exact
string matching. Only terms with valid matches were retained, ensuring that subsequent simplification
steps were grounded in high confidence, human-authored definitions. We choose for this first version
of our work to use exact matching, to ensure high precision by only retrieving intended terms, avoids
false positives from similar-looking but unrelated words and provides clean, reliable inputs for the
model. These term-definition pairs were later embedded in our best performing prompt 1 to provide
explicit contextual support during generation.</p>
      </sec>
      <sec id="sec-2-4">
        <title>2.4. Architecture</title>
        <p>Our pipeline for text simplification follows a Retrieval-Augmented Generation (RAG) paradigm,
comprising the following key stages:
2.4.1. Retrieval via Exact Matching
For each keyword of the input text, we retrieve a corresponding definition from our custom-built
dictionary using an exact string match strategy. This ensures deterministic retrieval of precise,
humanauthored definitions written in plain English (layman’s terms) and avoids ambiguity introduced by
semantic or fuzzy retrieval.
2.4.2. Prompt Design
We designed and evaluated multiple prompts to guide the language model in producing accurate and
accessible simplifications. To explore diferent strategies for leveraging contextual definitions and
examples, we designed four distinct prompt types:
• Keyword-Guided Retrieval Prompt (KGR): Presents only extracted keywords from the input text to
guide targeted keyword extraction;
• Definition-Augmented Simplification (Zero-Shot) (DASP_0): Includes the complex input text and
associated definitions, with no examples provided;
• Definition-Augmented Simplification (One-Shot) (DASP_1): Adds a single example of input–output
simplification alongside definitions to support in-context learning;
• Iterative Refinement Prompt (IRP): Takes an initial simplified version and the original text,
prompting the model to improve clarity and accuracy without distorting the meaning, inspired by [17].
2.4.3. Generation Models
The enriched prompt (original input + definitions) is passed to an LLM model to generate the simplified
text. We utilized diferent models with the prompts DASP-0 and DASP-1 described in Table 1:
• The Mistral 7B model8 was used in Mistral_DASP_0 and Mistral_DASP_1;
• the Gemma 2-9B model9 [18] was used in Gemma2_DASP_1;
• and finally the Med42-v2 model 10[19] in Med42_DASP_0.</p>
        <sec id="sec-2-4-1">
          <title>The diferent solutions are summarized in Table 2.</title>
        </sec>
        <sec id="sec-2-4-2">
          <title>8Mistral 7B: https://huggingface.co/mistralai/Mistral-7B-v0.3 9Gemma 2-9B: https://huggingface.co/google/gemma-2-9b-it 10Med42-v2: https://huggingface.co/m42-health/Llama3-Med42-8B</title>
          <p>I have the following document: {Complex text}
Please give me the keywords that are present in this document and separate them
with commas. Make sure you to only return the keywords and say nothing else. For
example, don’t say: "Here are the keywords present in the document"
Using these definitions, please simplify the following scientific text for a general
audience. Use plain language and explain any complex terms or acronyms. Ensure
that all numbers, results, and facts remain exactly the same. Do not paraphrase
numerical data or alter the meaning of findings.</p>
          <p>DEFINITIONS: {List of definitions}
TEXT: {Complex text}
You are a helpful assistant that simplifies biomedical or scientific texts.</p>
          <p>Task:
Using these definitions, simplify the following scientific text for a general audience.</p>
          <p>Use plain language and explain any complex terms or acronyms. Ensure that all
numbers, results, and facts remain exactly the same. Do not paraphrase numerical
data or alter the meaning of findings.</p>
          <p>Example: (Example of a pair of complex text and its simplified version)
Definitions: {List of definitions}
Text:{example of complex text}
Simplified:{example of simplified text}
Now do the same for the following:
Definitions: {definition}
Text:{text}
Simplified:
Improve the simplified version of the scientific text below to make it clearer and
easier for a general audience.</p>
          <p>Your goal is to maximize the SARI score by simplifying language and structure, while
keeping all facts, numbers, and findings exactly the same. Do this step by step.</p>
          <p>ORIGINAL TEXT: {Complex text}
FIRST SIMPLIFIED VERSION: {Generated simple text}</p>
          <p>REFINED VERSION:
2.4.4. Post Competition Approaches
Following the competition, we conducted a series of additional experiments to further enhance model
performance and evaluate output quality using the SARI score, employing alternative strategies as
outlined bellow :
1. Gemma3_DASP_0: we used in this solution the model11 [20] with the prompt DASP_0.
2. Mistral_IRP: In one of our post-competition experiments, we aimed to improve the performance
of Mistral_DASP_0 through an iterative refinement strategy. Specifically, we reintroduced
the initially generated simplified outputs from Mistral_DASP_0 back into the model as input,
using the prompt Iterative Refinement Prompt (IRP) described in Section 1. The objective was to
encourage the model to produce a second, potentially more refined simplification by leveraging
its own prior output as an intermediate step.
3. Mistral_FKGL: We assessed the readability of the simplified texts generated using the models
from the evaluation phase (see Table 2) by computing their Flesch-Kincaid Grade Level (FKGL)
[21] scores. FKGL scores were calculated using the Python library Textstat12. FKGL is a standard
11Gemma 3: https://huggingface.co/google/gemma-3-4b-it
12Textstat:https://pypi.org/project/textstat/
readability metric that estimates the U.S. school grade level required to understand a given text.
Since we aim to produce simplified texts that are more accessible to the general public, we selected
the version with the lowest FKGL score, indicating higher readability. The post-competition
results are shown in Table 2.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Experimental Results</title>
      <p>
        In this section, we present the results of our experiments, submitted on the platform Codabench13 that
calculated the SARI scores which can be found in Table 2. Our runs are referred to as ASM in the CLEF
overview paper [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>We extracted a total of 848 definitions from our glossary that matches the complex texts. These were
matched to 386 complex texts, with an average of 1.2 definitions per text.</p>
      <p>We observe that augmenting the prompt with plain English definitions of complex keywords, as done
in Mistral_DASP_0 compared to Mistral_baseline, leads to a slight improvement in the SARI
score.</p>
      <p>Among the models using the zero-shot prompt DASP_0 namely Mistral_DASP_0, Med42_DASP_0,
and Gemma3_DASP_0. Mistral consistently delivers the best performance.</p>
      <p>Additionally, the one-shot prompt variant DASP1 does not yield better results than Mistral_DASP_0,
indicating that oneshot prompting did not provide a significant advantage in this context. This result
indicates that the example used in our one-shot prompt might have disturbed the model.</p>
      <p>During the postcompetition phase, we explored further refinements using the Iterative
Refinement Prompt (IRP) with Mistral. While Mistral_IRP showed a slight decrease compared to
Mistral_DASP_0, it still achieved a strong performance. Notably, the FKGL-based approach
(Mistral_FKGL) matched and even slightly surpassed the highest SARI score, demonstrating the
potential of readability-based ranking in producing simplified outputs.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>This work explored the simplification of biomedical scientific texts using a Retrieval-Augmented
Generation (RAG) framework. The main contributions include: (1) the development of a glossary
aggregating over 3,000 simplified definitions from public medical sources, (2) a pipeline for extracting
expert domain terms using an LLM-based keyword extractor, and (3) the integration of these definitions
into diverse prompting strategies including zero-shot, one-shot, and iterative refinement to support
simplified texts generation of scientific texts.</p>
      <p>While the test corpus primarily consisted of biomedical documents, it also included other scientific
texts, requiring models to generalize beyond the biomedical domain.
13Codabench : https://www.codabench.org/</p>
      <p>The best-performing configuration was the Mistral 7B model with zero-shot prompting
Mistral_DASP_0 , which achieved a SARI score of 43.50. A small post-competition enhancement using
Mistral_FKGL further improved the score to 43.51 the highest observed performance. These results
confirm that grounding simplification in curated, domain specific definitions can enhance readability
without compromising factual integrity.</p>
      <p>Despite these promising outcomes, limitations remain. Relying on a single static definition per term
fails to capture contextual nuance, and the glossary though extensive may lack coverage of rare or
emerging terms. We aimed with this first version of our work, to test whether adding definitions to the
context would be beneficial. In future versions of our work, we plan to include synonyms and explore
additional matching approaches. Additionally, one-shot prompting yielded limited gains, indicating
potential for improved prompt design or few-shot learning strategies.</p>
      <p>Future work will focus on expanding the glossary with context-sensitive or multi-definition entries,
leveraging semantic retrieval to better align definitions with context, and fine-tuning LLMs specifically
for biomedical simplification. These improvements aim to further enhance the accessibility, relevance,
and trustworthiness of simplified scientific content.</p>
    </sec>
    <sec id="sec-5">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the author(s) used GPT-4 for grammar and spelling checks, as
well as for paraphrasing and rewording. After using these tools, the author(s) reviewed and edited the
content as needed, and take full responsibility for the publication’s content.
[10] P. Lewis, E. Perez, A. Piktus, F. Petroni, V. Karpukhin, N. Goyal, H. Küttler, M. Lewis, W.-t. Yih,
T. Rocktäschel, et al., Retrieval-augmented generation for knowledge-intensive nlp tasks, Advances
in neural information processing systems 33 (2020) 9459–9474.
[11] Z. You, S. Radhakrishna, S. Ming, H. Kilicoglu, Uiuc_bionlp at biolaysumm: an
extract-thensummarize approach augmented with wikipedia knowledge for biomedical lay summarization, in:
Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, 2024, pp. 132–143.
[12] A. Q. Jiang, A. Sablayrolles, A. Mensch, C. Bamford, D. S. Chaplot, D. de las Casas, F.
Bressand, G. Lengyel, G. Lample, L. Saulnier, L. R. Lavaud, M.-A. Lachaux, P. Stock, T. L. Scao,
T. Lavril, T. Wang, T. Lacroix, W. E. Sayed, Mistral 7b, 2023. URL: https://arxiv.org/abs/2310.06825.
arXiv:2310.06825.
[13] H. Touvron, T. Lavril, G. Izacard, X. Martinet, M.-A. Lachaux, T. Lacroix, B. Rozière, N. Goyal,
E. Hambro, F. Azhar, et al., Llama: Open and eficient foundation language models, arXiv preprint
arXiv:2302.13971 (2023).
[14] J. Bakker, J. Kamps, Cochrane-auto: An aligned dataset for the simplification of biomedical abstracts,
in: Proceedings of the Third Workshop on Text Simplification, Accessibility and Readability (TSAR
2024), 2024, pp. 41–51.
[15] L. Ermakova, V. Laimé, H. McCombie, J. Kamps, Overview of the clef 2024 simpletext task 3:
Simplify scientific text, in: Working Notes of the Conference and Labs of the Evaluation Forum
(CLEF 2024), 2024, pp. 3147–3162.
[16] A. Kelious, M. Constant, C. Coeur, Complex word identification: A comparative study between
chatgpt and a dedicated model for this task, in: Proceedings of the 2024 Joint International
Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING
2024), 2024, pp. 3645–3653.
[17] T. Guidroz, D. Ardila, J. Li, A. Mansour, P. Jhun, N. Gonzalez, X. Ji, M. Sanchez, S. Kakarmath,
M. M. Bellaiche, et al., Llm-based text simplification and its efect on user comprehension and
cognitive load, arXiv preprint arXiv:2505.01980 (2025).
[18] G. Team, M. Riviere, S. Pathak, P. G. Sessa, C. Hardin, S. Bhupatiraju, L. Hussenot, T. Mesnard,
B. Shahriari, A. Ramé, et al., Gemma 2: Improving open language models at a practical size, arXiv
preprint arXiv:2408.00118 (2024).
[19] C. Christophe, P. K. Kanithi, T. Raha, S. Khan, M. A. Pimentel, Med42-v2: A suite of clinical llms,
arXiv preprint arXiv:2408.06142 (2024).
[20] G. Team, A. Kamath, J. Ferret, S. Pathak, N. Vieillard, R. Merhej, S. Perrin, T. Matejovicova, A. Ramé,</p>
      <p>M. Rivière, et al., Gemma 3 technical report, arXiv preprint arXiv:2503.19786 (2025).
[21] J. P. Kincaid, R. P. Fishburne Jr, R. L. Rogers, B. S. Chissom, Derivation of new readability formulas
(automated readability index, fog count and flesch reading ease formula) for navy enlisted personnel,
Technical Report, Institute for Simulation and Training, University of Central Florida, 1975. URL:
https://api.semanticscholar.org/CorpusID:61131325.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Bao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Ru</surname>
          </string-name>
          ,
          <article-title>Ctyun ai at biolaysumm: Enhancing lay summaries of biomedical articles through large language models and data augmentation</article-title>
          ,
          <source>in: Proceedings of the 23rd Workshop on Biomedical Natural Language Processing</source>
          ,
          <year>2024</year>
          , pp.
          <fpage>837</fpage>
          -
          <lpage>844</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Qiu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          , T. Cohen,
          <article-title>Automated lay language summarization of biomedical scientific reviews</article-title>
          ,
          <source>in: Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence</source>
          , AAAI Press,
          <year>2021</year>
          , pp.
          <fpage>160</fpage>
          -
          <lpage>168</lpage>
          . URL: https://ojs.aaai.org/index.php/AAAI/article/view/16089. doi:
          <volume>10</volume>
          . 1609/aaai.v35i1.
          <fpage>16089</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D. T.</given-names>
            <surname>Willingham</surname>
          </string-name>
          ,
          <article-title>How knowledge helps</article-title>
          ,
          <source>American Educator</source>
          <volume>30</volume>
          (
          <year>2006</year>
          )
          <fpage>30</fpage>
          -
          <lpage>37</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>O. M.</given-names>
            <surname>Bullock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. Colón</given-names>
            <surname>Amill</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. C.</given-names>
            <surname>Shulman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. N.</given-names>
            <surname>Dixon</surname>
          </string-name>
          ,
          <article-title>Jargon as a barrier to efective science communication: Evidence from metacognition</article-title>
          ,
          <source>Public Understanding of Science</source>
          <volume>28</volume>
          (
          <year>2019</year>
          )
          <fpage>845</fpage>
          -
          <lpage>853</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>L.</given-names>
            <surname>Ermakova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Azarbonyad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bakker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Vendeville</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kamps</surname>
          </string-name>
          ,
          <article-title>Overview of the CLEF 2025 SimpleText track: Simplify scientific texts (and nothing more)</article-title>
          , in: J. Carrillo de Albornoz,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gonzalo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Plaza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>García Seco de Herrera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mothe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Piroi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Spina</surname>
          </string-name>
          , G. Faggioli, N. Ferro (Eds.),
          <source>European Conference on Information Retrieval, Lecture Notes in Computer Science</source>
          , Springer, Springer,
          <year>2025</year>
          , pp.
          <fpage>425</fpage>
          -
          <lpage>433</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Bakker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Vendeville</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Ermakova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kamps</surname>
          </string-name>
          ,
          <article-title>Overview of the CLEF 2025 SimpleText Task 1: Simplify Scientific Text</article-title>
          , in: G. Faggioli,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          , D. Spina (Eds.), Working Notes of CLEF 2025:
          <article-title>Conference and Labs of the Evaluation Forum</article-title>
          , CEUR Workshop Proceedings, CEUR-WS.org,
          <year>2025</year>
          , pp.
          <fpage>3147</fpage>
          -
          <lpage>3162</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Färber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Aghdam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Im</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tawfelis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Ghoshal</surname>
          </string-name>
          ,
          <string-name>
            <surname>Simplifymytext:</surname>
          </string-name>
          <article-title>An llm-based system for inclusive plain language text simplification</article-title>
          ,
          <source>in: European Conference on Information Retrieval</source>
          ,
          <year>2025</year>
          , pp.
          <fpage>418</fpage>
          -
          <lpage>424</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S.</given-names>
            <surname>Agrawal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Carpuat</surname>
          </string-name>
          ,
          <article-title>Controlling pre-trained language models for grade-specific text simplification</article-title>
          ,
          <source>arXiv preprint arXiv:2305.14993</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>N.</given-names>
            <surname>Colic</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.-D. Kim</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Rinaldi</surname>
          </string-name>
          ,
          <article-title>Pre-gamus: Reducing complexity of scientific literature as a support against misinformation</article-title>
          ,
          <source>in: Proceedings of the Workshop on DeTermIt! Evaluating Text Dificulty in a Multilingual Context@ LREC-COLING</source>
          <year>2024</year>
          ,
          <year>2024</year>
          , pp.
          <fpage>196</fpage>
          -
          <lpage>201</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>