<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>September</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.1007/s13222</article-id>
      <title-group>
        <article-title>BabelfishLib (Lightning Talk)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Philipp M. Grulich</string-name>
          <email>grulich@tu-berlin.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Volker Markl</string-name>
          <email>volker.markl@tu-berlin.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>A. Ziehn</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff4">4</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>User-Defined Functions, Data Management</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>A. Chaudhary</institution>
          ,
          <addr-line>B. Del Monte, D. Giouroukis, P. M</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>D. Giouroukis</institution>
          ,
          <addr-line>P. M. Grulich</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>DFKI GmbH, Alt-Moabit 91c</institution>
          ,
          <addr-line>10559 Berlin</addr-line>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Technische Universität Berlin</institution>
          ,
          <addr-line>Straße des 17. Juni 135, 10623 Berlin</addr-line>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>[1] P. M. Grulich</institution>
          ,
          <addr-line>S. Zeuch, V. Markl, Babelfish: Eficient</addr-line>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>[3] S. Zeuch, E. T. Zacharatou</institution>
          ,
          <addr-line>S. Zhang, X. Chatziliadis</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>1</volume>
      <issue>2023</issue>
      <fpage>0009</fpage>
      <lpage>0009</lpage>
      <abstract>
        <p>Today, data scientists, web developers, and application developers build complex data processing pipelines by combining diferent tools and programming languages. To this end, most data processing systems support user-defined functions (UDFs) in common languages like Java, Python, or JavaScript. These UDFs enable users to express arbitrary business logic in their preferred programming language, leverage 3rd-party libraries, and increase the modularity and testability of their data processing pipelines. Although UDFs provide a large degree of freedom, their flexibility comes with a high-performance cost compared to traditional relational queries. As a result, most experts recommend avoiding UDFs whenever possible.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>we propose in this talk BabelfishLib, which provides our Babelfish Engine [
1] as an extensible component for the eficient
and secure execution of UDFs. In an environment where virtually every data management system requires UDF support,
BabelfishLib can centralize these eforts and provide a unified UDF runtime that can be used across diferent systems. In
particular, BabelfishLib targets three major design goals. First, it provides eficient execution strategies for UDFs in diferent
programming languages. Second, it ensures that the execution of untrusted UDF code is isolated from the data processing
system, guaranteeing system security. Third, it analyzes UDFs and provides information for further query optimizations.
As a result, BabelfishLib mitigates the performance overhead of UDFs in state-of-the-art systems while it ensures security
and isolation at the same time. Currently, we leverage BabelfishLib to accelerate UDFs in our data processing platform
NebulaStream [2, 3].</p>
      <p>We believe that BabelfishLib can be a first step towards a unified accelerator for UDFs, which can be integrated across
diferent data processing systems. Furthermore, it provides a playground for further research focusing on specific aspects
of the acceleration of UDF. Finally, through this presentation, we intend to spark a discussion across the community to
consolidate requirements for eficient UDF execution and combine diferent eforts in the same direction.
execution of polyglot queries, Proc. VLDB Endow.
3489496.3489501. doi:10.14778/3489496.3489501.
[2] S.</p>
    </sec>
    <sec id="sec-2">
      <title>Zeuch, X.</title>
    </sec>
    <sec id="sec-3">
      <title>Chatziliadis, D. Giouroukis, P. M. Grulich, A. Ziehn, V.</title>
    </sec>
    <sec id="sec-4">
      <title>Mark, groho,</title>
    </sec>
    <sec id="sec-5">
      <title>Data management for the internet of things,</title>
    </sec>
    <sec id="sec-6">
      <title>Datenbank-Spektrum 22 (2022) 131–141.</title>
      <p>LGOBE
rOcid
nEvelop-O
Joint Workshops at 49th International Conference on Very Large Data
Bases (VLDBW’23) — Second International Workshop on Composable</p>
    </sec>
    <sec id="sec-7">
      <title>Chaudhary,</title>
      <p>D. P. A.
Nu</p>
    </sec>
    <sec id="sec-8">
      <title>Nebulastream:</title>
      <p>ronpub.com/ojiot/OJIOT_2020v6i1n07_Zeuch.html.</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>