<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>DiTraRe: AI on a Spider's Web. Interweaving Disciplines for Digitalisation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Anna M. Jacyszyn</string-name>
          <email>Anna.Jacyszyn@fiz-Karlsruhe.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Harald Sack</string-name>
          <email>Harald.Sack@fiz-Karlsruhe.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>DiTraRe-Study Group</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Matthias Razum</string-name>
          <email>Matthias.Razum@fiz-Karlsruhe.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Felix Bach</string-name>
          <email>Felix.Bach@fiz-Karlsruhe.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>FIZ Karlsruhe - Leibniz Institute for Information Infrastructure</institution>
          ,
          <addr-line>Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Karlsruhe Institute of Technology</institution>
          ,
          <addr-line>Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The recently established Leibniz Science Campus “Digital Transformation of Research” (DiTraRe) investigates the efects of a broadly understood process of digitalisation of research on a multilevel scale. The project concentrates on four research clusters concerning diferent topics and gathering use cases from varying scientific areas. For a multi-scale investigation these research clusters are interwoven with four dimensions, each of which approaches the tasks from a diferent perspective and poses its own research questions. Within this “spider's web” we are not only developing practical solutions for each use case but also seeking to find generalisations valuable to the scientific community as well as society in general. Sophisticated AI technologies, like natural language processing, knowledge extraction, and ontology engineering, are investigated within the DiTraRe project by the dimension Exploration and knowledge organisation. This position paper aims to describe the DiTraRe Science Campus in general as well as concentrate on its aforementioned dimension concerning implementation of AI techniques.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;digitalisation</kwd>
        <kwd>knowledge organisation</kwd>
        <kwd>research data management</kwd>
        <kwd>applied artificial intelligence</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The ongoing process of digitalisation holds great potential to simplifying and assisting not only our
daily lives but also research activities. It enables us to transform our data into machine-readable
formats, to which we can later apply numerous state-of-the-art (SOTA) techniques, such as e.g. machine
learning (ML) models. The recently established Leibniz Science Campus “Digital Transformation of
Research” (DiTraRe) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]1 aims to investigate and analyse processes of digitalisation in research and
their corresponding efects in a multi-levelled and interdisciplinary approach. The foundations for
innovative techniques of knowledge creation will be laid by developing and applying many data-driven
methods.
      </p>
      <p>This paper concentrates on the vision of the DiTraRe dimension Exploration and knowledge
organisation which is developing AI methods to support DiTraRe use cases. Within the project we will
significantly enhance the ability to analyse and interpret fitness data for sports research. Our work with
chemists concentrates on automatising processes in the laboratories, which will result in advancing
the way we can teach AI the laws of chemistry. With biomedical engineers we will use SOTA AI
techniques to develop a novel method of predicting the length of stay at an intensive care unit (ICU) in a
non-invasive and much quicker way. A feature to support the process of creation of a uniform platform
which we will construct with climate researchers will strongly increase the re-use and availability of
earth science data.</p>
    </sec>
    <sec id="sec-2">
      <title>2. DiTraRe project</title>
      <p>DiTraRe is based on the longstanding successful cooperation between FIZ Karlsruhe – Leibniz Institute
for Information Infrastructure (FIZ KA)2 and Karlsruhe Institute of Technology (KIT)3. The project is
organised as a matrix: four research clusters begin with one use case (UC), which has specific research
questions (RQs). Then, each of the UCs is being investigated within four dimensions.</p>
      <sec id="sec-2-1">
        <title>2.1. Research clusters</title>
        <p>Protected data spaces provides UC Sensitive data in sports science in cooperation with KIT Institute of
Sports and Sports Science (KIT-IfSS) which is developing the MO|RE data platform. The challenge within
this task is to investigate how sensitive health (i.e. BMI, blood pressure) and personal (i.e. geolocation,
social status) data can be published in a way that personal and data protection rights are adhered to.</p>
        <p>Smart data acquisition begins with UC Chemotion Electronic Lab Notebook (ELN)4 in cooperation
with KIT Institue of Biological and Chemical Systems (KIT-IBCS). The aim of this UC is to extend and
improve automatisation processes in the chemistry labs. Its challenges include i.e. high complexity of
processes ongoing in a lab as well as missing methods for automated data curation.</p>
        <p>AI-based knowledge realms includes UC AI in biomedical engineering as a collaboration with KIT
Institute for Biomedical Engineering (KIT-IBT), which develops computer models of the human heart.
In order to overcome the problem of data privacy as well as biases in databases, simulated databases are
being used. However, this leads to questioning the trustworthiness and explainability of AI.</p>
        <p>Publication cultures provides UC Publication of large datasets in collaboration with KIT Institute of
Meteorology and Climate Research (KIT-IMK) which generates and analyses very large datasets of
atmospheric data. Due to the size of these data, publication, and reuse of data are limited and very
ineficient. New methods need to be developed to enable exploration and evaluation.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Dimensions</title>
        <p>Nowadays, SOTA methods need to be used to gain necessary access of the ever-growing abundance of
information. This includes e.g. natural language processing (NLP) and ontology engineering, which
enable obtaining well-structured knowledge. These semantic technologies, along with other AI techniques,
are investigated within the DiTraRe by the dimension Exploration and knowledge organisation.</p>
        <p>The dimension Legal and ethical challenges is dealing with data ethics, data protection, copyright,
and data law. Legal and political context are studied in detail. Tools and processes will concentrate
on providing the researchers with software enabling to keep a data continuum, e.g. by extending the
possibilities of existing data repositories. The whole process will be examined from a technical, but
also societal and ethical perspective by Reflection and resonance . The dimension will concentrate on
transparency and comprehensibility of communicating scientific findings outside and within academia.
3. AI on a spider’s web: Exploration and knowledge organisation</p>
      </sec>
      <sec id="sec-2-3">
        <title>3.1. Sensitive data in sports science</title>
        <p>
          KIT-IfSS has developed a platform to collect, publish, and share motor performance data (MO|RE)
to deal with the issues of re-usability and reliability of data originating from numerous projects [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
Even though the sports science is greatly profiting from this repository (e.g. study between green
space availability and youth’s physical activity [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]), many significant research questions still remain
unanswered, i.e. evolution of motor performance over time. The plan is to combine the MO|RE
repository with another database (namely KonsortSWD [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]), which provides broad information on
        </p>
        <sec id="sec-2-3-1">
          <title>2FIZ Karlsruhe web page, https://fiz-karlsruhe.de/</title>
          <p>
            3KIT web page, https://kit.edu/
4Chemotion ELN web page, https://chemotion.net/
social, behavioural, educational, and economic status. We will develop a knowledge graph (KG) of
the extended MO|RE database to enable sports scientists an enhanced data analysis and interpretation.
Preliminary studies have proven that the usage of KGs in health research is promising [
            <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
            ].
          </p>
          <p>
            We will concentrate on representations of datasets combining protected and non-protected data in a
single KG and investigate possibilities of an eficient access management while retaining data privacy.
Our plan is also to adapt the KG to the needs of sports science in general so that researchers working in
other disciplines can easily make use of the motor activity data. Our eforts will clearly advance the
potential KGs have within sports research by enabling knowledge discovery from patients’ personal
databases, thus uncovering new theories and classifying patients’ health [
            <xref ref-type="bibr" rid="ref6 ref7">6, 7</xref>
            ].
          </p>
        </sec>
      </sec>
      <sec id="sec-2-4">
        <title>3.2. Chemotion Electronic Lab Notebook</title>
        <p>
          The complexity of processes ongoing in the chemistry lab leads to the need of novel methods of data
acquisition and management to expand and enhance the data flow [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. In the last years new methods
were developed to make data findable, accessible, interoperable and reusable (FAIR) via the Chemotion
ELN [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. However, the process of digitalisation is far from being completed as many adaptations of the
current workflows need to be done [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. Our goal is to support chemists with further development of
the ELN to accelerate their research by creating new automatisation methods supported with AI.
        </p>
        <p>
          Data in Chemotion repository need to be tested for completeness and consistency in order to conform
to the community standards [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. This process is currently done partially manually by users and
researchers. We will apply multiple AI techniques, i.e. NLP, to automatise the process of the data
curation. We are going to specifically investigate the reaction description module in the Chemotion
repository. Because of its complexity the text entered by the user is troublesome to standardise.
Development of a full automatisation of data curation for the ELN will significantly improve and
accelerate the functioning of the entire system, as it will exclude humans from the data quality check,
thus giving them more time for other more crucial research tasks. And since using the ELN is also a
best practice example, supporting it in the long run will facilitate teaching AI understanding the laws
of chemistry [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], i.e. by development of self-driving labs [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-5">
        <title>3.3. AI in biomedical engineering</title>
        <p>
          Since its first clinical applications, simulations and computational modelling have become an important
and regularly used method in cardiac electrophysiology [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. ML methods are also being utilised in
processes concerning high-risk patients, i.e. those admitted to an intensive care unit (ICU) [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. Recent
studies which concentrated on predicting the length of ICU stay and mortality used a significant number
of variables, e.g. 17 [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] or 20 [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. Acquisition of these measurements upon patient’s admission to an
ICU is time and efort consuming. The researchers at KIT-IBT are now in the process of developing a
prediction tool which will use an electrocardiogram (ECG) together with only few other variables to
predict mortality and length of ICU stay faster.
        </p>
        <p>
          A recent study used only the parameters of an ECG of non-cardiac patients [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. We will collaborate
with KIT-IBT on including data of patients with heart diseases in their novel model. Publicly available
Medical Information Mart for Intensive Care (MIMIC) IV database [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] together with clinical data from
University of Freiburg Heart Center will be used. In our innovative approach we will concentrate on
using a raw ECG as an input and employ a multi-modal large language model (multi-modal LLM) to
feed it with diferent variables in addition to the ECG. We will investigate how far incrementally adding
diferent types of external information (i.e. text, image) influences the outcome. This model will enable
a quick and non-invasive assessment of a length of stay at the ICU which is important because of
significant costs that ICU long stays yield for general health care systems.
        </p>
      </sec>
      <sec id="sec-2-6">
        <title>3.4. Publication of large datasets</title>
        <p>
          KIT-IMK researchers are using multiple internal and external datasets to analyse the composition of
the atmosphere. The available datasets consist of highly non-uniform (e.g. dimensions of the variables
are diferent) and in some cases non-standardised entries where the metadata have slightly diferent
names for the same data (see examples in the Earth Data portal5). This metadata chaos as well as large
sizes of accessible datasets (see i.e. data from IASI satellite [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]) makes it challenging to access and use
the data from within a repository. Within the DiTraRe we are aiming at creating a platform to enable
easy and convenient publication of large heterogeneous datasets.
        </p>
        <p>
          Environmental scientists have been working on a uniform platform to collect datasets from diferent
disciplines, such as V-FOR-WaTer [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. However, it is still under construction. Our plan is to focus
specifically on KIT-IMK data and test whether AI methods such as KGs or LLMs can support functionality
of such repositories. We will explore the utility of AI in structuring datasets, preprocessing data, and
standardising metadata. An intriguing case is non-stationarity of the climate system in general and
its impact on the possibility of an efective usage of AI methods, i.e. an ontology. Our research will
provide significant contributions to creating a uniform data management platform.
        </p>
      </sec>
      <sec id="sec-2-7">
        <title>3.5. Overarching activities and synergies</title>
        <p>
          Along with one of the main goals of DiTraRe, we will study synergies between research clusters. Both
KIT-IfSS and KIT-IBT would profit from using clinical data, and plausibly health data from wearables.
This way they can extend their studies and find connections between numerous health indicators.
Another connection is that KIT-IMK as well as KIT-IfSS and KIT-IBCS are making use of the RADAR
platform, which is a repository developed at FIZ KA for the archival and publication of research data6.
We are investigating possibilities of its further development with the support of AI methods, i.e. by
adding a SPARQL endpoint explorer SHMARQL which enables user friendly exploration of the shape of
data [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ].
        </p>
        <p>
          Additionally, our plan is to formulate and answer RQs in the area of computer science. An important
topic will be the applications of LLMs and its consequences in diferent UCs within varying context.
Thus, the DiTraRe will not only bring forward our understanding of what role the specific aspect of AI
plays within the scope of the digitalisation of research, but also enables new insights about knowledge
representation in LLMs [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] as well as about their eficiency in memorising and reasoning among
structured knowledge [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ].
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>4. Conclusions</title>
      <p>In the DiTraRe we investigate the efects and influences of a broadly understood digitalisation of
research both within academia and society. This paper describes in detail tasks of the DiTraRe dimension
Exploration and knowledge organisation, within which we explore efects of applying AI methods to
specific UCs originating from various disciplines. The developed solutions in the context of scientific
knowledge representation and organisation will cover areas as broad as accessing databases containing
private and sensitive data, improving treatment of high-risk patients, automatising data quality control
for chemistry and facilitating easy re-use of large datasets for climate research. The practices which
will be the outcomes of DiTraRe, will significantly improve current SOTA in the investigated areas of
research as well as advance our understanding of applying AI in the process of knowledge organisation.
The project is currently in its development phase and we would significantly profit from networking
with other researchers looking into similar topics in scientific knowledge representation.</p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgments</title>
      <p>The Leibniz Science Campus “Digital Transformation of Research” (DiTraRe) is funded by the Leibniz
Association. We would like to thank our use case partners for kindly ofering their time to revise the
draft and sharing their insightful suggestions on how to improve this publication.</p>
      <sec id="sec-4-1">
        <title>5Earth Data portal, https://earth-data.de/ 6RADAR web page, https://radar.products.fiz-karlsruhe.de/en</title>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>A. Graphical representation of DiTraRe structure</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Razum</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Bach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Brünger-Weilandt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Scherz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Böhm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Sack</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Volkamer</surname>
          </string-name>
          ,
          <article-title>Proposal for a Leibniz ScienceCampus - Digital Transformation of Research (DiTraRe</article-title>
          ),
          <year>2023</year>
          . doi:
          <volume>10</volume>
          .5281/ zenodo.11109406, project proposal.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>K.</given-names>
            <surname>Klemm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Bös</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Kron</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Eberhardt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Woll</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Niessner</surname>
          </string-name>
          ,
          <article-title>Development and introduction of a disciplinary data repository for sport scientists based on the example mo|re data: eresearch infrastructure for motor research data : Bausteine forschungsdatenmanagementempfehlungen und erfahrungsberichte für die praxis vonforschungsdatenmanagerinnen und -managern, Bausteine Forschungsdatenmanagement 1 (</article-title>
          <year>2024</year>
          )
          <fpage>1</fpage>
          -
          <lpage>14</lpage>
          . doi:
          <volume>10</volume>
          .17192/bfdm.
          <year>2024</year>
          .
          <volume>1</volume>
          .8615.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C.</given-names>
            <surname>Nigg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Fiedler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Burchartz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Reichert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Niessner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Woll</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Schipperijn</surname>
          </string-name>
          ,
          <article-title>Associations between green space availability and youth's physical activity in urban and rural areas across germany</article-title>
          ,
          <source>Landscape and Urban Planning</source>
          <volume>247</volume>
          (
          <year>2024</year>
          ). doi:
          <volume>10</volume>
          .1016/j.landurbplan.
          <year>2024</year>
          .
          <volume>105068</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>R. G. D.</surname>
          </string-name>
          <article-title>Forum], Big data in social, behavioural, and economic sciences: Data access and research data management. ratswd output, German Data Forum (RatSWD) 4 (</article-title>
          <year>2020</year>
          ). doi:
          <volume>10</volume>
          .17620/02671. 52.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>P.</given-names>
            <surname>Ernst</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Meng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Siu</surname>
          </string-name>
          , G. Weikum,
          <article-title>Knowlife: A knowledge graph for health and</article-title>
          life sciences,
          <source>2014 IEEE 30th International Conference on Data Engineering</source>
          (
          <year>2014</year>
          )
          <fpage>1254</fpage>
          -
          <lpage>1257</lpage>
          . doi:
          <volume>10</volume>
          .1109/ ICDE.
          <year>2014</year>
          .
          <volume>6816754</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>X.</given-names>
            <surname>Tao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Pham</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Goh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , Y. Cai,
          <article-title>Mining health knowledge graph for health risk prediction</article-title>
          ,
          <source>World Wide Web</source>
          <volume>23</volume>
          (
          <year>2020</year>
          )
          <fpage>2341</fpage>
          -
          <lpage>2362</lpage>
          . URL: https://doi.org/10.1007/ s11280-020-00810-1. doi:
          <volume>10</volume>
          .1007/s11280-020-00810-1.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Rossanez</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. dos Reis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Torres</surname>
          </string-name>
          , H. de Ribaupierre,
          <article-title>Kgen: a knowledge graph generator from biomedical scientific literature</article-title>
          ,
          <source>BMC Medical Informatics and Decision Making</source>
          <volume>20</volume>
          (
          <year>2020</year>
          ).
          <source>doi:10.1186/s12911-020-01341-5.</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>F.</given-names>
            <surname>Fink</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Hüppe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Jung</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hofmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Herres-Pawlis</surname>
          </string-name>
          ,
          <article-title>Sharing is caring: Guidelines for sharing in the electronic laboratory notebook (eln) chemotion as applied by a synthesis-oriented working group</article-title>
          ,
          <source>Chemistry-Methods</source>
          <volume>2</volume>
          (
          <year>2022</year>
          )
          <article-title>e202200026</article-title>
          . doi:https://doi.org/10.1002/ cmtd.202200026.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Herres-Pawlis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Bach</surname>
          </string-name>
          , I. Bruno,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chalk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Jung</surname>
          </string-name>
          , et al.,
          <article-title>Minimum information standards in chemistry: A call for better research data management practices</article-title>
          ,
          <source>Angewandte Chemie International Edition</source>
          <volume>61</volume>
          (
          <year>2022</year>
          )
          <article-title>e202203038</article-title>
          . doi:https://doi.org/10.1002/anie.202203038.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>F.</given-names>
            <surname>Tristram</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Jung</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hodapp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Schröder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wöll</surname>
          </string-name>
          ,
          <string-name>
            <surname>S. Bräse,</surname>
          </string-name>
          <article-title>The impact of digitalized data management on materials systems workflows</article-title>
          ,
          <source>Advanced Functional Materials</source>
          <volume>34</volume>
          (
          <year>2024</year>
          )
          <article-title>2303615</article-title>
          . doi:https://doi.org/10.1002/adfm.202303615.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>P.</given-names>
            <surname>Mafettone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Friederich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Baird</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Blaiszik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Brown</surname>
          </string-name>
          , et al.,
          <article-title>What is missing in autonomous discovery: open challenges for the community</article-title>
          ,
          <source>Digital Discovery</source>
          <volume>2</volume>
          (
          <year>2023</year>
          )
          <fpage>1644</fpage>
          -
          <lpage>1659</lpage>
          . doi:
          <volume>10</volume>
          . 1039/D3DD00143A.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Peirlinck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Costabal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Guccione</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tripathy</surname>
          </string-name>
          , et al.,
          <article-title>Precision medicine in human heart modeling, Biomechanics and Modeling in Mechanobiology 20 (</article-title>
          <year>2021</year>
          )
          <fpage>803</fpage>
          -
          <lpage>831</lpage>
          . doi:
          <volume>10</volume>
          .1007/ s10237-021-01421-z.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S.</given-names>
            <surname>Iwase</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Nakada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Shimada</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Oami</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Shimazui</surname>
          </string-name>
          , et al.,
          <article-title>Prediction algorithm for icu mortality and length of stay using machine learning</article-title>
          ,
          <source>Scientific Reports</source>
          <volume>12</volume>
          (
          <year>2022</year>
          ).
          <source>doi: 10.1038/ s41598-022-17091-5.</source>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>J. Wu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Zhang</surname>
          </string-name>
          , G. Kong,
          <article-title>Predicting prolonged length of icu stay through machine learning</article-title>
          .,
          <source>Diagnostics</source>
          <volume>11</volume>
          (
          <year>2021</year>
          ). doi:
          <volume>10</volume>
          .3390/diagnostics11122242.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>S.</given-names>
            <surname>Jana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Dasgupta</surname>
          </string-name>
          , L. Dey,
          <article-title>Predicting medical events and icu requirements using a multimodal multiobjective transformer network</article-title>
          ,
          <source>Experimental Biology and Medicine</source>
          <volume>247</volume>
          (
          <year>2022</year>
          )
          <fpage>1988</fpage>
          -
          <lpage>2002</lpage>
          . doi:
          <volume>10</volume>
          .1177/15353702221126559.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>K.</given-names>
            <surname>Erdem</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Duman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ergün</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ergün</surname>
          </string-name>
          ,
          <article-title>The correlation between electrocardiographic parameters and mortality in non-cardiac icu patients</article-title>
          ,
          <source>European Review for Medical and Pharmacological Sciences</source>
          <volume>27</volume>
          (
          <year>2023</year>
          )
          <fpage>6662</fpage>
          -
          <lpage>6670</lpage>
          . doi:
          <volume>10</volume>
          .26355/eurrev_202307_
          <fpage>33136</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>A.</given-names>
            <surname>Johnson</surname>
          </string-name>
          , L. Bulgarelli,
          <string-name>
            <given-names>L.</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gayles</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Shammout</surname>
          </string-name>
          , et al.,
          <article-title>Mimic-iv, a freely accessible electronic health record dataset</article-title>
          ,
          <source>Scientific Data</source>
          <volume>10</volume>
          (
          <year>2023</year>
          ). doi:
          <volume>10</volume>
          .1038/s41597-022-01899-x.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>M.</given-names>
            <surname>Schneider</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Ertl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Diekmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Khosrawi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Weber</surname>
          </string-name>
          , et al.,
          <article-title>Design and description of the musica iasi full retrieval product</article-title>
          ,
          <source>Earth System Science Data</source>
          <volume>14</volume>
          (
          <year>2022</year>
          )
          <fpage>709</fpage>
          -
          <lpage>742</lpage>
          . doi:
          <volume>10</volume>
          .5194/ essd-14-
          <fpage>709</fpage>
          -
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>M.</given-names>
            <surname>Strobl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Azmi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Balazs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bouguezzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Dolich</surname>
          </string-name>
          , et al.,
          <article-title>Streamlining data pre-processing and analysis through the v-for-water web portal</article-title>
          ,
          <source>in: European Geosciences Union General Assembly</source>
          ,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .5194/egusphere-egu24-
          <volume>10364</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>E.</given-names>
            <surname>Posthumus</surname>
          </string-name>
          , Sparql-shmarql,
          <year>2024</year>
          . URL: https://github.com/epoz/shmarql.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>F.</given-names>
            <surname>Petroni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Rocktäschel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Riedel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bahktin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Miller</surname>
          </string-name>
          ,
          <article-title>Language models as knowledge bases?</article-title>
          , in: K. Inui,
          <string-name>
            <given-names>J.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Ng</surname>
          </string-name>
          ,
          <string-name>
            <surname>X.</surname>
          </string-name>
          Wan (Eds.),
          <source>Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)</source>
          ,
          <article-title>Association for Computational Linguistics</article-title>
          , Hong Kong, China,
          <year>2019</year>
          , pp.
          <fpage>2463</fpage>
          -
          <lpage>2473</lpage>
          . URL: https://aclanthology.org/D19-1250. doi:
          <volume>10</volume>
          .18653/ v1/
          <fpage>D19</fpage>
          -1250.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>Q.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Can language models act as knowledge bases at scale</article-title>
          ?,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          . 48550/arXiv.2402.14273, arXiv preprint.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>