<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Computers in Biology and Medicine</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>A LLMOps-Driven Framework for Clinical Data Harmonization</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Alberto Marfoglia</string-name>
          <email>alberto.marfoglia2@unibo.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Antonio Robustelli</string-name>
          <email>antonio.robustelli2@unibo.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christian D'Errico</string-name>
          <email>christian.derrico2@unibo.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sabato Mellone</string-name>
          <email>sabato.mellone@unibo.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Antonella Carbonaro</string-name>
          <email>antonella.carbonaro@unibo.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science and Engineering, University of Bologna</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Electrical, Electronic and Information Engineering “Guglielmo Marconi”, University of Bologna</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>187</volume>
      <issue>2025</issue>
      <abstract>
        <p>The rapid growth of clinical data, driven by advances in medical research and digital health technologies, presents major challenges in ensuring data interoperability, standardization, and management. Standards such as Fast Healthcare Interoperability Resources (FHIR) are essential for enabling seamless data exchange. However, converting raw data into standardized formats remains a complex and resource-intensive task. Existing solutions often rely on manual processes or rigid rule-based systems, which are time-consuming, error-prone, and dificult to scale. To address these limitations, we propose a modular framework that employs Natural Language Processing (NLP) to streamline the FHIR mapping. Additionally, it integrates Large Language Model Operations (LLMOps) principles to automate and monitor the lifecycle of models involved in the data transformation. The framework consists of three core modules: (1) extraction of relevant clinical variables from heterogeneous data sources, (2) validation for anomaly detection and compliance with healthcare standards, and (3) assisted mapping of variables to FHIR resources. We evaluate the framework by applying it to an existing clinical data harmonization pipeline. Compared to the baseline process, our approach achieves a 59% reduction in time. This result underscores the potential of NLP-assisted frameworks to improve scalability, reliability, and eficiency in clinical data standardization.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Clinical Data Harmonization</kwd>
        <kwd>FHIR (Fast Healthcare Interoperability Resources)</kwd>
        <kwd>Digital Health</kwd>
        <kwd>Natural Language Processing (NLP)</kwd>
        <kwd>Large Language Model Operations (LLMOps)</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The healthcare landscape is rapidly evolving, driven by advancements in medical research and digital
technologies, leading to an unprecedented surge in clinical data. Efectively managing and sharing
this data across heterogeneous healthcare organizations remains a critical challenge impacting patient
care, medical research, and policy development [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Seamless data exchange requires both structural
and semantic interoperability, which involves the adoption of standardized vocabularies, formal data
descriptions, and machine-readable formats. Consequently, tools leveraging contemporary clinical
standards for data mapping have become essential, often enabling semi-automated conversion workflows
that would otherwise require substantial manual efort [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        Interoperability is the foundation of modern digital health initiatives [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ], facilitating the prospective
curation of data, eficient communication, and the secondary use of real-world clinical data. The
European Health Data Space (EHDS) exemplifies its economic and strategic value, aiming to establish a
unified health data ecosystem projected to save €11 billion over ten years [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        Among existing healthcare data standards, the Fast Healthcare Interoperability Resources (FHIR)
framework, developed by Health Level Seven International (HL7), has emerged as a leading approach
for structuring and exchanging clinical information. FHIR ensures flexible and modular data
representation, facilitating interoperability across clinical trials, hospital information systems, and public
health networks [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Its cost-efectiveness, improved data quality, and analytical flexibility make it
particularly advantageous for reusing medical data in the real world [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Increasingly, FHIR is being
adopted in specialized healthcare domains, from chronic disease management and patient-centered
applications [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] to modular platforms that unify heterogeneous data sources and create standardized
Digital Twins [
        <xref ref-type="bibr" rid="ref10 ref9">9, 10</xref>
        ].
      </p>
      <p>
        Despite the growing adoption of FHIR, converting raw clinical data into standardized formats remains
a complex, labor-intensive process. Traditional transformation approaches rely heavily on manual
efort and static rules, which limits scalability and introduces inconsistency [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Natural Language
Processing (NLP), particularly through Large Language Models (LLMs), ofers a promising approach to
automate core tasks such as clinical variable extraction, data validation, and resource mapping [
        <xref ref-type="bibr" rid="ref12 ref13">12, 13</xref>
        ].
      </p>
      <p>
        However, LLM-based systems present substantial barriers when transitioning from research to
realworld deployment. The healthcare domain represents a particular case, where data privacy, regulatory
compliance, and model governance are non-negotiable [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Consequently, many promising prototypes
remain confined to the experimental phases due to the lack of robust operational infrastructure and
interdisciplinary collaboration [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. In this context, recent engineering paradigms, such as Machine
Learning Operations (MLOps), ofer a practical foundation for ensuring the reliability of AI, thereby
addressing these concerns [
        <xref ref-type="bibr" rid="ref16 ref17">16, 17</xref>
        ]. Based on MLOps features, such as automation, continuous monitoring,
and reproducibility, Large Language Model Operations (LLMOps) frameworks represent a
promising solution to satisfy the operational demands of the LLMs lifecycle, including prompt versioning,
input-output management, and real-time risk mitigation of hallucinations and performance drift [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
      </p>
      <p>To address these technical and operational limitations, we present FLEX-LLM-CARE (FHIR Language
Extraction and Transformation with LLMs for Clinical Automation and Reasoning Engine). This modular
LLM-based transformation framework leverages LLMOps techniques to streamline the standardization
of clinical data into FHIR resources. The proposed solution enhances three key harmonization steps:
(1) the extraction of clinical variables from both structured and unstructured data; (2) the validation
through anomaly detection and compliance with clinical guidelines; and (3) the transformation of
validated content into interoperable FHIR resources.</p>
      <p>
        To demonstrate the efectiveness of the proposed framework, we apply it to an existing standardization
pipeline [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] to enhance the automation level of the overall process. Our results show that
FLEX-LLMCARE reduces the total manual efort required for data harmonization by 59%, equivalent to saving
over 200 hours of experts’ labor.
      </p>
      <p>These findings highlight FLEX-LLM-CARE’s potential to make large-scale FHIR adoption more
practical and cost-efective. Beyond reducing manual efort, the integration of LLMOps introduces
essential capabilities that support long-term sustainability and reliability. Looking ahead, tools like
FLEX-LLM-CARE can help healthcare organizations adopt data standards more easily, leading to faster
insights, improved data sharing, and more connected, interoperable health systems.</p>
      <p>The remainder of this paper is structured as follows: Sec. 3 reviews related NLP-based standardization
eforts. Sec. 2 introduces LLMOps and the baseline pipeline. Sec. 4 details our framework modules.
Sec. 5 presents the experimental results and Sec. 6 concludes with a discussion of findings and future
directions.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Background</title>
      <p>This section provides an overview of the concepts underlying FLEX-LLM-CARE. We first summarize the
main characteristics of LLMOps, which represent the set of concepts adopted to improve the automation
of our proposal. Then, we briefly recall the high-level structure of the standardization pipeline on which
the framework is applied.</p>
      <sec id="sec-2-1">
        <title>2.1. Large Language Model Operations</title>
        <p>
          Constructing LLMs is a complex task that involves vast amounts of data, High-Performance Computing
(HPC) architectures, and targeted fine-tuning steps [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. The datasets required to train LLMs typically
contain so many parameters that manual data quality checks become impractical. Consequently, LLMs
can sufer from biases, hallucinations, and outdated knowledge, resulting in inaccurate or misleading
outcomes. For this reason, engineering paradigms based on MLOps are fundamental to face these issues
by ensuring automation, continuous monitoring, and reproducibility [
          <xref ref-type="bibr" rid="ref16 ref17">16, 17</xref>
          ]. In the context of LLMs,
the solution is represented by LLMOps, which, according to the definition provided by Diaz-De-Arcaya
et al. [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ], is an extension of MLOps specifically tailored to manage the LLMs lifecycle efectively.
        </p>
        <p>
          Since the management of LLMs is more complex than ML models, Shan and Shan [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] defined a
conceptual framework named 4D that, as shown in Fig. 1, takes its name from the following four steps:
        </p>
        <p>• Discover: it recognizes the need for an LLM and explores its potential use cases. This involves
examining recent developments in LLM technology, understanding their capabilities, and assessing
how they can solve specific issues or improve existing workflows. Hence, this step identifies
relevant data sources, sets objectives, and defines the application scope.
• Distill: it prepares and refines the data employed for the training process. Since data quality
and variety are crucial for the model’s performance, this step involves data cleaning, structuring,
and augmentation. It also includes the initial model training, in which the system learns how to
generate outputs or predictions based on the distilled data.
• Deploy: it integrates the LLM into the operational environment by making it a part of a broader
system. Hence, this step involves establishing the technical infrastructure, addressing performance
and security requirements, and ensuring smooth interaction with existing tools. It also focuses
on version control, containerization, and API connectivity.
• Deliver: it delivers the LLM in production. This involves monitoring its performance in
realworld scenarios and refining it continuously based on user feedback and new input data. This step
also includes evaluating the LLM’s impact on business results and user satisfaction by making
ongoing adjustments to optimize its efectiveness.</p>
        <p>
          Since the 4D framework ofers a systematic approach for LLM lifecycle management, we adopt it to
automate the data standardization process within our proposal. Note that, due to its conceptual nature,
this framework comprises several substeps. However, to simplify the discussion, we refer to [
          <xref ref-type="bibr" rid="ref16 ref17">16, 17</xref>
          ] for
more details and specifications.
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. The considered pipeline</title>
        <p>
          We evaluated the efectiveness of our proposed framework and its modular components (as described
in Sec. 4) by applying it to optimize the FHIR mapping workflow of a pipeline introduced in a previous
study [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. This workflow follows a classical Extract-Transform-Load (ETL) architecture and comprises
the following five sequential modules:
• Input: gathers input data from a specific source, without applying restrictions on the data model
and format. Additionally, this module addresses data security and regulatory compliance by
ensuring the quality, integrity, and confidentiality of data.
• Refinement: preprocesses the gathered input data by ensuring a uniform output format (e.g.,
JSON). To this end, this module removes missing data and structures the information to enable
seamless conversion operations in subsequent steps.
• Mapping: converts the refined input data to the target data model (e.g., FHIR). To this end, this
module employs a templating strategy;
• Validation: it performs all the procedures to check the conformity of output resources with
the standard target data model. In the event of undesirable outcomes, this module immediately
performs roll-back actions to ensure the system’s state remains intact.
• Publishing: stores the validated data in an accessible repository. This module also provides
essential debugging functionalities like querying, exporting, and logging.
        </p>
        <p>Despite its modularity, the pipeline exhibits several limitations due to its reliance on manual
intervention. Specifically, the Input module requires domain experts to manually define the relevant
tables and annotate each field with appropriate semantic labels and ontologies. The Refinement module
necessitates expert consultations to identify and resolve missing or inconsistent values. The Mapping
module depends on curated templates developed by experts, while the Validation module operates using
preconfigured static FHIR profiles.</p>
        <p>To address these limitations and enhance scalability and consistency, we integrate FLEX-LLM-CARE
into the pipeline. This integration aims to automate critical phases of the standardization process,
thereby minimizing manual efort and reducing the risk of data inconsistencies. A comparative analysis
of processing times between the original pipeline and the enhanced version incorporating our framework
is presented in Sec. 5.2.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Related Work</title>
      <p>
        Eforts to standardize clinical data have led to the development of several pipelines aimed at enhancing
interoperability and transforming heterogeneous data formats into structured models such as HL7
FHIR [
        <xref ref-type="bibr" rid="ref6 ref7 ref8">6, 7, 8</xref>
        ]. For example, Bennett et al. [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] converted the MIMIC-IV dataset into FHIR, creating a
widely used resource for research. Montazeri et al. [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] developed a FHIR-based dataset to support
cardiovascular CPOE integration within Electronic Health Record (EHR) systems.
      </p>
      <p>
        Beyond these datasets, several approaches have targeted the technical aspects of FHIR mapping.
Sinaci et al. [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] proposed a GUI-based method, Simon et al. [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] developed a metadata repository
enabling automated data conversion through a REST API, and Marfoglia et al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] introduced a modular
ETL pipeline using template-based mappings to enhance platform independence and reusability.
      </p>
      <p>While these approaches mark progress toward interoperability, key limitations persist. Most notably,
the lack of full automation necessitates manual data curation and validation, which slows down the
mapping process and increases the risk of errors. Additionally, reliance on custom scripts and opaque
logic reduces portability across healthcare contexts.</p>
      <p>
        Natural Language Processing (NLP) and, more recently, Large Language Models (LLMs), have shown
promise in healthcare for processing unstructured data [
        <xref ref-type="bibr" rid="ref12 ref13 ref24">24, 12, 13</xref>
        ]. For instance, NLP has supported
clinical trial automation by extracting patient data from EHRs and identifying adverse drug events [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ].
Within Clinical Decision Support Systems (CDSS), Klug et al. [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ] applied NLP to extract actionable
insights from clinical notes. Remmer et al. [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] leveraged NLP for the multi-label classification of medical
summaries, combining clinical and linguistic embeddings. Instead, Gulum et al. [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ] combined Deep
Learning (DL) with NLP to enhance cancer decision-making, providing more accurate diagnoses and
personalized treatment recommendations. Despite their success, these NLP solutions primarily address
classification, summarization, or prediction tasks rather than the challenge of converting structured
clinical data into FHIR-compliant formats.
      </p>
      <p>
        Moreover, deploying AI systems in real-world healthcare settings requires strict attention to
governance, reproducibility, and regulatory compliance [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. In response, MLOps has emerged to support
continuous monitoring and operational scalability. LLMOps, an evolution of MLOps for LLMs, extends
these capabilities to the management of large-scale language models [
        <xref ref-type="bibr" rid="ref16 ref17">16, 17</xref>
        ]. In this context, we propose
Ref.
      </p>
      <p>FLEX-LLM-CARE, a modular LLM-based transformation framework that applies LLMOps principles to
streamline the FHIR data standardization process. In contrast to prior work (see Table 1), our approach
minimizes manual intervention, avoids static configurations, and enables more scalable, reusable, and
transparent data standardization pipelines.</p>
    </sec>
    <sec id="sec-4">
      <title>4. The proposed framework</title>
      <p>This section introduces the proposed FLEX-LLM-CARE framework, which leverages recent advances in
LLMs to enable the automated standardization and semantic enrichment of clinical data in compliance
with the FHIR specification. To this end, we define three core modules, which are respectively focused
on: (1) the extraction of clinical variables from both structured and unstructured data; (2) the validation
through anomaly detection and compliance with clinical guidelines; and (3) the mapping of validated
content into interoperable FHIR resources. Subsequently, we describe how LLMOps (i.e., the 4D
framework) is applied in FLEX-LLM-CARE to automate the data mapping. Fig. 3 illustrates the
FLEXLLM-CARE framework, highlighting its modules, employed strategies, and considered input and output
data formats.</p>
      <sec id="sec-4-1">
        <title>4.1. Clinical Data Extraction Module (CDE-M)</title>
        <p>CDE-M extracts clinical concepts from unstructured and semi-structured text using domain-specific
Named Entity Recognition (NER) models. Then, a specialized LLM links the extracted entities to
standardized clinical ontologies. In detail, this association is possible by combining the LLM with
the Retrieval-Augmented Generation (RAG) technique. RAG integrates external facts by retrieving
relevant information from a pre-built knowledge base to improve the accuracy of the output. In
this case, the knowledge corresponds to the embedding representation of clinical ontology concepts
stored in a vectorial database. Therefore, this approach resolves ambiguities and improves contextual
understanding, making the extracted data more usable for seamless conversion into FHIR resources.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Clinical Variable Validation and Anomaly Detection Module (CVV-ADM)</title>
        <p>CVV-ADM ensures that the structured data is clinically consistent and semantically valid within the
related medical context. In detail, its core is represented by an LLM capable of understanding clinical
logic, domain-specific terminology, and guideline-based reasoning. For this reason, CVV-ADM supports
two sequential validation strategies:
• Guideline-Aware: the employed LLM directly evaluates the structured data by referencing
medical best practices and clinical guidelines. Moreover, this strategy validates data by
checking anomalies, such as incompatible medications, implausible lab values, missing context, and
elements falling outside the expected clinical norms;
• Ontology-Driven: it converts validated data into graph-based structures (i.e., semantic nodes and
relationships) linked to medical ontologies, enriching their semantics. This strategy enables
symbolic reasoning and logical inference across the graph, allowing the detection of inconsistencies,
redundancies, or missing connections.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. FHIR Resource Mapping and Transformation Module (FRMT-M)</title>
        <p>FRMT-M streamlines the mapping process of validated and cleaned data to the corresponding FHIR
resources. In detail, this module employs an LLM that can learn the mapping logic through an In-Context
Learning (ICL) approach or a targeted fine-tuning. Moreover, the LLM integrates a RAG mechanism
that accesses a vector database populated with the FHIR documentation, related specifications, and
examples of validated mappings. Therefore, this approach enables the generation of outputs validated
against authoritative sources, thereby enhancing the explainability and reliability of the resulting FHIR
mapping.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4. LLMOps trought the 4D framework</title>
        <p>Since the defined modules introduce new challenges regarding intermediate datasets, additional data
structures, and various LLMs, LLMOps becomes paramount to improve the automation level of the
entire FHIR mapping process. To this end, according to the definition provided in Sec. 2.1, we describe
how the 4D framework enhances the defined modules (i.e., CDE-M, CVV-ADM, and FRMT-M):
• Discover: Since we face diferent specific tasks (i.e., data extraction from unstructured text,
data validation to detect anomalies, and FHIR mapping), it becomes crucial to select the most
appropriate LLM. To this end, referring to existing LLM benchmarkings, such as the rankings
provided by Hugging Face, can help choose the most suitable model for our tasks.
• Distill: Since it is essential to improve the prediction logic and the possible training of the
LLMs chosen, selecting the data cleaning, structuring, and augmentation techniques becomes
pivotal. For example, for the CDE-M and FRMT-M, we can use the RAG and prompt engineering
techniques previously mentioned. In the case of CVV-ADM, since the LLM model is fine-tuned
on medical knowledge and anomaly detection tasks, we can employ PEFT, LoRA, or QLoRA to
adapt models with minimal computational resources eficiently. When training from scratch or at
scale, Composer (by MosaicML) and datasets from LLMDataHub can be used.
• Deploy: Since we use diferent LLMs and techniques, the resulting hyperparameter combinations
and model versions can be numerous. For this reason, we ensure that all experiments are
rigorously tracked throughout the deployment process. It is essential to leverage open-source
platforms such as MLFlow for lifecycle management, and orchestrators like Kubeflow or Apache
Airflow for pipeline automation. Containerization with Docker and orchestration via Kubernetes
further enhances scalability and portability across deployment environments.
• Deliver: Since high-quality clinical data is often limited and not directly accessible, continuous
post-deployment monitoring is essential to assess model robustness and reliability over time. To
this end, we can integrate monitoring tools such as Evidently AI for model evaluation and drift
detection. Prometheus and Grafana can be used for real-time metric collection and visualization.
Furthermore, LangSmith ofers tools for debugging and testing LLM-based applications, while
OpenLLMetry and LangKit enhance observability and governance by extracting key metrics from
LLM input/output behavior.</p>
        <p>
          However, due to the numerous application scenarios, the mentioned technologies represent only a
small subset of the existing ones. For this reason, we refer to [
          <xref ref-type="bibr" rid="ref18 ref29">18, 29</xref>
          ] for a more exhaustive overview.
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <p>
        This section presents the evaluation of FLEX-LLM-CARE in automating clinical data harmonization
using the FHIR standard. We begin by detailing the technical implementation of each module—CDE-M,
CVV-ADM, and FRMT-M—within the data standardization pipeline shown in Fig.2. We then assess the
practical impact of our framework by comparing it to the baseline pipeline introduced by Marfoglia et
al.[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], focusing on the harmonization of a public dataset [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ].
      </p>
      <sec id="sec-5-1">
        <title>5.1. Modules implementations</title>
        <p>According to the limitations discussed in Sec. 2.2, we applied FLEX-LLM-CARE’s modules to the
standardization pipeline depicted in Fig. 2. To this end, as shown in Fig. 4, we made the following
associations: (CDE-M ↦→ Input), (CVV-ADM ↦→ Refinement), and (FRMT-M ↦→ Mapping).</p>
        <p>Fig. 4 allows us to appreciate how the application of FLEX-LLM-CARE enhances the automation level
of the Input, Refinement, and Mapping steps. This is followed by the Validation and Publishing steps,
which, although fully automated, depend on the results of the first three steps. Finally, the Semantic
module is highlighted in orange but is not explored in this study, as it is outside the scope.</p>
        <p>
          In detail, to extract clinical concepts from the unstructured and semi-structured text, we implemented
the CDE-Module by employing BioBERT [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ] to identify clinical entities. Then, we mapped such entities
into standardized ontologies using BioMistral-7B [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ], a biomedical LLM capable of interpreting column
headers and field contents. To support this mapping, we also integrated a RAG technique to perform a
semantic search through the Facebook AI Similarity Search (FAISS) library, which allowed us to retrieve
embeddings of standardized ontology terms (e.g., from SNOMED-CT and LOINC).
        </p>
        <p>Subsequently, we focused on ensuring that the structured data was considered clinically consistent
and semantically valid. To this end, we based the CVV-AD Module on ClinicalBERTand, above all,
by implementing the related validation strategies (i.e., Guideline-Aware and Ontology-Driven). More
precisely, for the Guideline-Aware, we employed ClinicalBERT to flag inconsistent entries, such as
abnormal value ranges, contradictory clinical assertions, and missing dependencies. Instead, for the
Ontology-Driven approach, we leveraged Apache Jena to convert validated data into Resource
Description Framework (RDF) triples and utilized BioPortal to link them to various repositories of biomedical
ontologies.</p>
        <p>
          To streamline the mapping of validated data to the corresponding FHIR resources, we
developed the FRMT module. This module leverages BioMistral-7B [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ], guided by a few-shot learning
approach. By providing illustrative examples—such as (ProstheticKnee, CommercialName) ↦→
(DeviceDefinition)—the model adapts to the mapping logic with minimal supervision. To further
improve accuracy and contextual relevance, we incorporated a RAG technique, similar to that used
in the CDE module. This technique accesses a vector database populated with FHIR documentation,
which supports BioMistral-7B during generation.
        </p>
        <p>Finally, as described in Sec. 4.4, we enhanced the automation level of our modules using the 4D
framework. Therefore, we used MLFlow to track our experiments, from the considered hyperparameters
to the achieved performance metrics. However, all this has been made possible thanks to Hugging
Face, which allowed us to download and employ the LLMs mentioned above. Additionally, we used
LangChain to retrieve the required data (i.e., datasets and documentation) and support their conversion
into corresponding embedding vectors.</p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Impact of FLEX-LLM-CARE Automation on Clinical Data Harmonization</title>
        <p>
          We evaluated the performance of FLEX-LLM-CARE by applying it to the harmonization pipeline
proposed by Marfoglia et al.[
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. Specifically, we assessed the reduction in manual efort by comparing
the original process with the FLEX-LLM-CARE-enhanced pipeline. This evaluation was based on a
mapping task involving a public dataset containing longitudinal clinical data on prosthetic patients
[
          <xref ref-type="bibr" rid="ref30">30</xref>
          ].
        </p>
        <p>Although the dataset being considered was already structured and well-documented, it lacked
consistent semantic descriptions for many fields, making standardization and schema interpretation
non-trivial. Additionally, data integrity checks were carried out using manually written Python scripts.
These factors contributed to residual manual efort despite the dataset’s relative maturity.</p>
        <p>FLEX-LLM-CARE introduced automation in three modules—Input, Refinement, and
Mapping—targeting the most labor-intensive steps: Schema interpretation, Data cleaning, Anomaly detection,
and FHIR resource mapping. A first version of the FRMT module, leveraging BioMistral-7B, achieved
approximately 60% accuracy in recommending appropriate FHIR resources for each source term. While
this level of accuracy does not eliminate the need for manual review, it provides meaningful suggestions
that significantly reduce expert time through partial matches, narrowed search spaces, and reduced
cognitive load. Based on these benefits, we conservatively estimate a 60–65% reduction in mapping
time.</p>
        <p>
          Table 2 summarizes the estimated expert time required for each task in the original versus
FLEX-LLMCARE-assisted workflows. These estimates are based on historical project timelines [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ] and expert
feedback from the dataset curation process, during which four domain experts collectively contributed
approximately 360 hours through a combination of collaborative sessions and individual annotation
eforts. Task-level time allocations reflect this prior experience. While schema interpretation was
relatively eficient due to the dataset’s structured format, FHIR resource mapping remained the most
demanding step, owing to the inherent complexity of aligning clinical terms with appropriate FHIR
constructs.
        </p>
        <sec id="sec-5-2-1">
          <title>FLEX-LLM</title>
        </sec>
        <sec id="sec-5-2-2">
          <title>CARE Module</title>
        </sec>
        <sec id="sec-5-2-3">
          <title>Manual Auto (hrs) (hrs)</title>
          <p>CDE-M</p>
        </sec>
        <sec id="sec-5-2-4">
          <title>Task</title>
        </sec>
        <sec id="sec-5-2-5">
          <title>Description</title>
          <p>Spcrehteamtioaninter- Irdnoeltfeeisnr.peret tasbelmesanantidc</p>
          <p>Normalize formats,
Data cleaning handle missing
values.
tAencotimonaly de-
Ioudreesn.inticfoynismisptelanutsivbalelFHIR resource Select resources,
demapping fine templates.
Syntactic vali- Ensure FHIR
strucdation tural correctness.
Semantic vali- Conformance to
dation FHIR profiles.
Storage FSHtoIrRe reasonudrceesx.pose</p>
        </sec>
        <sec id="sec-5-2-6">
          <title>Total</title>
        </sec>
        <sec id="sec-5-2-7">
          <title>Pipeline Step [2]</title>
          <p>Input
Refinement</p>
          <p>CVV-ADM
Refinement</p>
          <p>CVV-ADM
Mapping
Validation</p>
          <p>Despite the structured nature of the dataset, the automation introduced by FLEX-LLM-CARE led
to significant time savings across all major stages of harmonization. The most substantial gains were
observed in the FHIR resource mapping step, which was reduced by approximately 64%, primarily due
to partial mapping assistance and structured suggestion triage. Overall, the framework reduced the
estimated manual efort by approximately 211 hours, resulting in a 59% reduction in curation time.</p>
          <p>These results underscore the potential of LLM-driven systems in enhancing the eficiency of clinical
data harmonization, thereby making the adoption of th FHIR standard more accessible. While the
reported time savings reflect the most tangible benefit, the integration of LLMOps through the 4D
framework (see Sec. 4.4) also introduces critical operational advantages. These include robust experiment
tracking, model versioning, and post-deployment monitoring, which collectively support reproducibility,
transparency, and long-term maintainability. Although not directly reflected in the labor-hour estimates,
these capabilities further strengthen the scalability and reliability of the FLEX-LLM-CARE approach,
reinforcing its suitability for real-world observational research settings.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusions</title>
      <p>We introduced FLEX-LLM-CARE, a modular framework for clinical data standardization powered
by LLMs and operationalized through the 4D LLMOps lifecycle. The framework includes three key
modules: (1) clinical variable extraction from heterogeneous sources (CDE-M), (2) validation and
anomaly detection (CVV-ADM), and (3) FHIR resource mapping through assisted selection (FRMT-M).
Integrated into an existing pipeline and evaluated on a public dataset, FLEX-LLM-CARE can reduce
manual curation efort by up to 59% while improving automation across key standardization stages.</p>
      <p>A key insight is that coupling LLMs with structured operational support significantly lowers the
manual burden of clinical data mapping while enhancing traceability, reproducibility, and maintainability.
This combination enhances adaptability and reduces data integration overhead in complex environments,
such as healthcare.</p>
      <p>However, challenges remain, particularly in semantic validation, which was not deeply explored due
to the computational cost of fine-tuning. Improving semantic precision remains dificult, especially
when clinical relationships are subtle or implicit.</p>
      <p>Looking forward, future work will explore: (a) enhancing retrieval-augmented generation (RAG)
strategies to incorporate domain-specific knowledge better; (b) applying advanced embedding
techniques to improve contextual understanding and output fidelity; and (c) benchmarking FLEX-LLM-CARE
across diverse LLMs to assess robustness in real-world deployments.</p>
      <p>As LLM ecosystems evolve, managing model versions, hyperparameters, and intermediate data will
grow more complex. In this context, LLMOps will remain foundational, enabling dynamic
experimentation, reproducibility, and safe deployment. Future work will continue to build on this operational
backbone to advance scalable and semantically accurate FHIR harmonization.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>This study was partially supported by the Italian Ministry of University and Research under PNRR-PNC
Project PNC0000002 ”DARE – Digital Lifelong Prevention” (CUP: B53C22006450001).</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used ChatGPT, Grammarly in order to: Grammar and
spelling check, Paraphrase and reword. After using this tool/service, the authors reviewed and edited
the content as needed and take full responsibility for the publication’s content.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Chatterjee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Pahari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Prinz</surname>
          </string-name>
          ,
          <article-title>HL7 FHIR with SNOMED-CT to Achieve Semantic and Structural Interoperability in Personal Health Data: A Proof-of-</article-title>
          <string-name>
            <surname>Concept</surname>
            <given-names>Study</given-names>
          </string-name>
          ,
          <source>Sensors</source>
          <volume>22</volume>
          (
          <year>2022</year>
          )
          <article-title>3756</article-title>
          . doi:
          <volume>10</volume>
          .3390/s22103756.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Marfoglia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Nardini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. A.</given-names>
            <surname>Arcobelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Moscato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mellone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Carbonaro</surname>
          </string-name>
          ,
          <article-title>Towards realworld clinical data standardization: A modular FHIR-driven transformation pipeline to enhance doi:</article-title>
          10.1016/j.compbiomed.
          <year>2025</year>
          .
          <volume>109745</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M.</given-names>
            <surname>Lehne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Sass</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Essenwanger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schepers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Thun</surname>
          </string-name>
          ,
          <article-title>Why digital medicine depends on interoperability</article-title>
          ,
          <source>npj Digital Medicine</source>
          <volume>2</volume>
          (
          <year>2019</year>
          )
          <article-title>79</article-title>
          . doi:
          <volume>10</volume>
          .1038/s41746-019-0158-1.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T.</given-names>
            <surname>Benson</surname>
          </string-name>
          , G. Grieve, Why Interoperability Is Hard, in: T. Benson, G. Grieve (Eds.),
          <source>Principles of Health Interoperability: FHIR, HL7 and SNOMED CT, Health Information Technology Standards</source>
          , Springer International Publisher, Cham,
          <year>2021</year>
          , pp.
          <fpage>21</fpage>
          -
          <lpage>40</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>030</fpage>
          -56883-
          <issue>2</issue>
          _
          <fpage>2</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>European</given-names>
            <surname>Commission</surname>
          </string-name>
          ,
          <source>European Health Data Space Regulation (EHDS)</source>
          , https://health.ec.europa.
          <article-title>eu/ehealth-digital-health-and-care/european-health-data-spaceregulation-ehds_</article-title>
          <string-name>
            <surname>en</surname>
          </string-name>
          ,
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>N.</given-names>
            <surname>Pimenta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Chaves</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Sousa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Abelha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Peixoto</surname>
          </string-name>
          ,
          <article-title>Interoperability of Clinical Data through FHIR: A review, Procedia Computer Science (</article-title>
          <year>2023</year>
          ). doi:
          <volume>10</volume>
          .1016/j.procs.
          <year>2023</year>
          .
          <volume>03</volume>
          .115.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Gehrmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Herczog</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Decker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Beyan</surname>
          </string-name>
          ,
          <article-title>What prevents us from reusing medical real-world data in research</article-title>
          ,
          <source>Scientific Data</source>
          <volume>10</volume>
          (
          <year>2023</year>
          )
          <article-title>459</article-title>
          . doi:
          <volume>10</volume>
          .1038/s41597-023-02361-2.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>R.</given-names>
            <surname>Gazzarata</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Almeida</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lindsköld</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Cangioli</surname>
          </string-name>
          , E. Gaeta, G. Fico,
          <string-name>
            <surname>C. E. Chronaki,</surname>
          </string-name>
          <article-title>HL7 Fast Healthcare Interoperability Resources (HL7 FHIR) in digital healthcare ecosystems for chronic disease management: Scoping review</article-title>
          ,
          <source>International Journal of Medical Informatics</source>
          <volume>189</volume>
          (
          <year>2024</year>
          )
          <article-title>105507</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.ijmedinf.
          <year>2024</year>
          .
          <volume>105507</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Carbonaro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Marfoglia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Nardini</surname>
          </string-name>
          , S. Mellone, CONNECTED:
          <article-title>Leveraging digital twins and personal knowledge graphs in healthcare digitalization</article-title>
          ,
          <source>Frontiers in Digital Health</source>
          <volume>5</volume>
          (
          <year>2023</year>
          ). doi:
          <volume>10</volume>
          .3389/fdgth.
          <year>2023</year>
          .
          <volume>1322428</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Marfoglia</surname>
          </string-name>
          ,
          <string-name>
            <surname>C. D'Errico</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Nardini</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Mellone</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Carbonaro</surname>
          </string-name>
          ,
          <article-title>CONNECTED: A Knowledge Graph-Driven Platform for Clinical Data Harmonization and Personalized Digital Twin-Based Healthcare</article-title>
          , in: 2025 IEEE International Conference (PerCom Workshops),
          <source>IEEE Computer Society</source>
          ,
          <year>2025</year>
          , pp.
          <fpage>116</fpage>
          -
          <lpage>121</lpage>
          . doi:
          <volume>10</volume>
          .1109/PerComWorkshops65533.
          <year>2025</year>
          .
          <volume>00051</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>G.</given-names>
            <surname>Lichtner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. S.</given-names>
            <surname>Alper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Jurth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Spies</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Boeker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Meerpohl</surname>
          </string-name>
          ,
          <string-name>
            <surname>F.</surname>
          </string-name>
          <article-title>von Dincklage, Representation of evidence-based clinical practice guideline recommendations on FHIR</article-title>
          ,
          <source>Journal of Biomedical Informatics</source>
          <volume>139</volume>
          (
          <year>2023</year>
          )
          <article-title>104305</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.jbi.
          <year>2023</year>
          .
          <volume>104305</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>N.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ren</surname>
          </string-name>
          ,
          <article-title>An EHR Data Quality Evaluation Approach Based on Medical Knowledge and Text Matching</article-title>
          , IRBM
          <volume>44</volume>
          (
          <year>2023</year>
          )
          <article-title>100782</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.irbm.
          <year>2023</year>
          .
          <volume>100782</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>N.</given-names>
            <surname>Hong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Wen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sohn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wang</surname>
          </string-name>
          , H. Liu, G. Jiang,
          <article-title>Developing a scalable FHIR-based clinical data normalization pipeline for standardizing and integrating unstructured and structured electronic health record data</article-title>
          ,
          <source>JAMIA Open 2</source>
          (
          <year>2019</year>
          ). doi:
          <volume>10</volume>
          .1093/jamiaopen/ooz056.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>L. E.</given-names>
            <surname>Lwakatare</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Raj</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Crnkovic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bosch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. H.</given-names>
            <surname>Olsson</surname>
          </string-name>
          ,
          <article-title>Large-scale machine learning systems in real-world industrial settings: A review of challenges and solutions</article-title>
          ,
          <source>Information and Software Technology</source>
          <volume>127</volume>
          (
          <year>2020</year>
          )
          <article-title>106368</article-title>
          . doi:
          <volume>10</volume>
          .1016/j.infsof.
          <year>2020</year>
          .
          <volume>106368</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>S.</given-names>
            <surname>Vänskä</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.-K. Kemell</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Mikkonen</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Abrahamsson</surname>
          </string-name>
          ,
          <article-title>Continuous Software Engineering Practices in AI/ML Development Past the Narrow Lens of MLOps: Adoption Challenges</article-title>
          , e-Informatica
          <source>Software Engineering Journal</source>
          <volume>18</volume>
          (
          <year>2024</year>
          )
          <article-title>240102</article-title>
          . doi:
          <volume>10</volume>
          .37190/e-Inf240102.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>V.</given-names>
            <surname>Moskalenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Kharchenko</surname>
          </string-name>
          ,
          <article-title>Resilience-aware MLOps for AI-based medical diagnostic system</article-title>
          ,
          <source>Frontiers in Public Health</source>
          <volume>12</volume>
          (
          <year>2024</year>
          ). doi:
          <volume>10</volume>
          .3389/fpubh.
          <year>2024</year>
          .
          <volume>1342937</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>M.</given-names>
            <surname>Reddy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Dattaprakash</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kammath</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Manokaran</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Be</surname>
          </string-name>
          , Application of MLOps in Prediction of Lifestyle Diseases,
          <source>ECS Transactions 107</source>
          (
          <year>2022</year>
          )
          <article-title>1191</article-title>
          . doi:
          <volume>10</volume>
          .1149/10701. 1191ecst.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>R.</given-names>
            <surname>Shan</surname>
          </string-name>
          , T. Shan,
          <string-name>
            <surname>Enterprise LLMOps: Advancing Large Language Models Operations Practice</surname>
            , in: 2024 IEEE
            <given-names>Cloud</given-names>
          </string-name>
          <string-name>
            <surname>Summit</surname>
          </string-name>
          ,
          <year>2024</year>
          , pp.
          <fpage>143</fpage>
          -
          <lpage>148</lpage>
          . doi:
          <volume>10</volume>
          .1109/Cloud-Summit61220.
          <year>2024</year>
          .
          <volume>00030</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J.</given-names>
            <surname>Diaz-De-Arcaya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>López-De-Armentia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Miñón</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. L.</given-names>
            <surname>Ojanguren</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. I.</given-names>
            <surname>Torre-Bastida</surname>
          </string-name>
          ,
          <article-title>Large Language Model Operations (LLMOps): Definition, Challenges, and Lifecycle Management</article-title>
          ,
          <source>in: 2024 9th International Conference on Smart and Sustainable Technologies (SpliTech)</source>
          ,
          <year>2024</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          . doi:
          <volume>10</volume>
          .23919/SpliTech61897.
          <year>2024</year>
          .
          <volume>10612341</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>A. M. Bennett</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Ulrich</surname>
            ,
            <given-names>P. van Damme</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wiedekopf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. E. W.</given-names>
            <surname>Johnson</surname>
          </string-name>
          ,
          <article-title>MIMIC-IV on FHIR: Converting a decade of in-patient data into an exchangeable, interoperable format</article-title>
          ,
          <source>Journal of the American Medical Informatics Association</source>
          <volume>30</volume>
          (
          <year>2023</year>
          )
          <fpage>718</fpage>
          -
          <lpage>725</lpage>
          . doi:
          <volume>10</volume>
          .1093/jamia/ocad002.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>M.</given-names>
            <surname>Montazeri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Khajouei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Afraz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Ahmadian</surname>
          </string-name>
          ,
          <article-title>A systematic review of data elements of computerized physician order entry (CPOE): Mapping the data to FHIR, Informatics for Health and Social Care 0 (</article-title>
          <year>2023</year>
          )
          <fpage>1</fpage>
          -
          <lpage>18</lpage>
          . doi:
          <volume>10</volume>
          .1080/17538157.
          <year>2023</year>
          .
          <volume>2255285</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>A. A.</given-names>
            <surname>Sinaci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Gencturk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. A.</given-names>
            <surname>Teoman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. B.</given-names>
            <surname>Laleci Erturkmen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Alvarez-Romero</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>MartinezGarcia</surname>
          </string-name>
          , B.
          <string-name>
            <surname>Poblador-Plou</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Carmona-Pírez</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Löbe</surname>
            ,
            <given-names>C. L.</given-names>
          </string-name>
          <string-name>
            <surname>Parra-Calderon</surname>
          </string-name>
          ,
          <article-title>A Data Transformation Methodology to Create Findable, Accessible, Interoperable, and Reusable Health Data: Software Design, Development, and</article-title>
          <string-name>
            <given-names>Evaluation</given-names>
            <surname>Study</surname>
          </string-name>
          ,
          <source>J Med Internet Res</source>
          <volume>25</volume>
          (
          <year>2023</year>
          ). doi:
          <volume>10</volume>
          .2196/42822.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>F.</given-names>
            <surname>Simon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schladetzky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Macke</surname>
          </string-name>
          , T. ABLAßa, J. Ingenerf, K.-S. Ann-Kristin,
          <article-title>Metadata Driven Integration of Clinical Data for Secondary Use in FHIR-A Pilot Study at the UKSH, Studies in health technology</article-title>
          and
          <source>informatics 317</source>
          (
          <year>2024</year>
          )
          <fpage>146</fpage>
          -
          <lpage>151</lpage>
          . doi:
          <volume>10</volume>
          .3233/SHTI240850.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>J. R.</given-names>
            <surname>Jim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A. R.</given-names>
            <surname>Talukder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Malakar</surname>
          </string-name>
          ,
          <string-name>
            <surname>M. M. Kabir</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Nur</surname>
            ,
            <given-names>M. F.</given-names>
          </string-name>
          <string-name>
            <surname>Mridha</surname>
          </string-name>
          ,
          <article-title>Recent advancements and challenges of NLP-based sentiment analysis: A state-of-the-art review</article-title>
          ,
          <source>Natural Language Processing Journal</source>
          <volume>6</volume>
          (
          <year>2024</year>
          ). doi:
          <volume>10</volume>
          .1016/j.nlp.
          <year>2024</year>
          .
          <volume>100059</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>R.</given-names>
            <surname>Garg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <article-title>A Systematic Review of NLP Applications in Clinical Healthcare: Advancement and Challenges</article-title>
          , in: S.
          <string-name>
            <surname>Das</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Saha</surname>
            ,
            <given-names>C. A.</given-names>
          </string-name>
          <string-name>
            <surname>Coello Coello</surname>
            ,
            <given-names>J. C.</given-names>
          </string-name>
          Bansal (Eds.),
          <source>Advances in Data-Driven Computing and Intelligent Systems</source>
          ,
          <year>2024</year>
          , pp.
          <fpage>31</fpage>
          -
          <lpage>44</lpage>
          . doi:
          <volume>10</volume>
          .1007/
          <fpage>978</fpage>
          -981-99-9521-
          <issue>9</issue>
          _
          <fpage>3</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>K.</given-names>
            <surname>Klug</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Beckh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Antweiler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Chakraborty</surname>
          </string-name>
          , G. Baldini,
          <string-name>
            <given-names>K.</given-names>
            <surname>Laue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Hosch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Nensa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Schuler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Giesselbach</surname>
          </string-name>
          ,
          <article-title>From admission to discharge: A systematic review of clinical natural language processing along the patient journey</article-title>
          ,
          <source>BMC Medical Informatics and Decision Making</source>
          <volume>24</volume>
          (
          <year>2024</year>
          )
          <article-title>238</article-title>
          . doi:
          <volume>10</volume>
          .1186/s12911-024-02641-w.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>S.</given-names>
            <surname>Remmer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lamproudis</surname>
          </string-name>
          ,
          <string-name>
            <surname>H.</surname>
          </string-name>
          <article-title>Dalianis, Multi-label Diagnosis Classification of Swedish Discharge Summaries - ICD-10 Code Assignment Using KB-BERT</article-title>
          ,
          <source>in: Proc. of the International Conference on Recent Advances in Natural Language Processing (RANLP</source>
          <year>2021</year>
          ),
          <year>2021</year>
          , pp.
          <fpage>1158</fpage>
          -
          <lpage>1166</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Gulum</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Trombley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kantardzic</surname>
          </string-name>
          ,
          <article-title>A Review of Explainable Deep Learning Cancer Detection Models in Medical Imaging</article-title>
          ,
          <source>Applied Sciences</source>
          <volume>11</volume>
          (
          <year>2021</year>
          ). doi:
          <volume>10</volume>
          .3390/app11104573.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>S.</given-names>
            <surname>Pahune</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Akhtar</surname>
          </string-name>
          ,
          <article-title>Transitioning from MLOps to LLMOps: Navigating the Unique Challenges of Large Language Models</article-title>
          ,
          <source>Information</source>
          <volume>16</volume>
          (
          <year>2025</year>
          ). doi:
          <volume>10</volume>
          .3390/info16020087.
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>V. A.</given-names>
            <surname>Arcobelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Moscato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Palumbo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Marfoglia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Nardini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Randi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Davalli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Carbonaro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chiari</surname>
          </string-name>
          , S. Mellone,
          <article-title>FHIR-standardized data collection on the clinical rehabilitation pathway of trans-femoral amputation patients, Scientific Data (</article-title>
          <year>2024</year>
          )
          <article-title>806</article-title>
          . doi:
          <volume>10</volume>
          .1038/ s41597-024-03593-6.
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Yoon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. H.</given-names>
            <surname>So</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. Kang,</surname>
          </string-name>
          <article-title>BioBERT: A pre-trained biomedical language representation model for biomedical text mining</article-title>
          ,
          <source>Bioinformatics</source>
          <volume>36</volume>
          (
          <year>2019</year>
          )
          <fpage>1234</fpage>
          -
          <lpage>1240</lpage>
          . doi:
          <volume>10</volume>
          .1093/bioinformatics/btz682.
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Labrak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bazoge</surname>
          </string-name>
          , E. Morin,
          <string-name>
            <given-names>P.-A.</given-names>
            <surname>Gourraud</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rouvier</surname>
          </string-name>
          , R. Dufour,
          <article-title>BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains</article-title>
          , in: L.
          <string-name>
            <surname>-W. Ku</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Martins</surname>
          </string-name>
          , V. Srikumar (Eds.),
          <source>Findings of the Association for Computational Linguistics: ACL</source>
          <year>2024</year>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , Bangkok, Thailand,
          <year>2024</year>
          , pp.
          <fpage>5848</fpage>
          -
          <lpage>5864</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <year>2024</year>
          .findings-acl.
          <volume>348</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>V.</given-names>
            <surname>Arcobelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Moscato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Marfoglia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Nardini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Randi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Davalli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Carbonaro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Palumbo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Chiari</surname>
          </string-name>
          ,
          <string-name>
            <surname>S. Mellone,</surname>
          </string-name>
          <article-title>MOTU on FHIR: A preliminary strategy to enable interoperability for retrospective dataset standardization</article-title>
          ,
          <source>in: 2023 IEEE EMBS Special Topic Conference on Data Science and Engineering in Healthcare, Medicine and Biology</source>
          ,
          <year>2023</year>
          , pp.
          <fpage>81</fpage>
          -
          <lpage>82</lpage>
          . doi:
          <volume>10</volume>
          .1109/
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>