<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards a Neural Database Execution Engine</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Christos Tsapelas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Archimedes, Athena Research Center</institution>
          ,
          <country country="GR">Greece</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Informatics and Telecommunications, National and Kapodistrian University of Athens</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Supervised by Georgia Koutrika</institution>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Recent advances in natural language understanding have heightened the interest in AI systems capable of answering queries across multiple data modalities, such as structured database tables and unstructured text. Current approaches typically rely on Large Language Models (LLMs) to facilitate queries between these modalities, which incurs substantial computational costs and often yields suboptimal performance. To this direction, this research introduces a novel query execution engine designed to bridge diverse data modalities, leveraging the high-eficiency querying capabilities of database systems with the advanced reasoning capacities of LLMs. This paper presents a prototype architecture for such a multi-modal database system, detailing its core components and their functionalities to demonstrate how it can achieve efective, scalable query processing across structured and unstructured data.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;database systems</kwd>
        <kwd>large language models</kwd>
        <kwd>memory networks</kwd>
        <kwd>hybrid query execution</kwd>
        <kwd>virtual knowledge bases</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The evolution of modern data warehouses has introduced
unprecedented challenges and opportunities, with data
volumes now encompassing multiple modalities, such as
structured table data, unstructured text, and images. Each data
type possesses a unique structure, necessitating tailored
querying methods that efectively harness the properties
of each modality. However, despite these advances,
existing systems struggle to generalize queries across multiple
modalities, presenting a key limitation in addressing the
needs of diverse, cross-modal data integration tasks.</p>
      <p>
        Database management systems (DBMS) excel in
performing rapid, eficient, and precise queries on extremely large
data volumes at scale. However, their primary focus remains
on exact computation at scale, with limited reasoning
capabilities [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. In contrast, Large Language Models (LLMs)
excel at processing natural language data across massive
textual corpora, ofering logical reasoning over unstructured
data due to the model’s ability to embed knowledge within
its weights [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. This distinction highlights a crucial gap
between traditional DBMS architectures and the reasoning
and flexibility capabilities that LLMs bring to unstructured
data processing.
      </p>
      <p>
        Numerous contemporary applications demand complex
queries that integrate information across multiple
modalities [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Current dominant approaches for multi-modal data
integration include retrieval-augmented generation (RAG),
similarity-based search, and Text2SQL. These techniques,
however, exhibit limitations in both the diversity of query
types they can accommodate and their query execution
performance. Text2SQL methods, for instance, are efective for
natural language queries that have a direct SQL equivalent,
whereas RAG systems are constrained to point lookups
involving only a limited number of records, requiring an LLM
to execute the join operation.
      </p>
      <p>To this direction, my doctoral research seeks to bridge
the gap between the approaches of LLMs and traditional
database systems to enable eficient, flexible hybrid search
queries. The objective is to develop a prototype neural
query execution engine equipped with novel algorithms
for eficient data access and join operations, leveraging the
strengths of both learned models and traditional database
methods, for rapid and precise query execution across
multiple data modalities. The proposed neural execution engine
seeks to empower database systems with the flexibility to
handle diverse data modalities and complex query types,
addressing a critical need in the field of data management and
paving the way for next-generation data retrieval solutions.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        Recent works in natural language understanding require
retrieving and reasoning, like question answering. For such
knowledge-intensive tasks, it is required to assimilate
information from diferent sections of large diferent inputs
such as books and article collections [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. To this direction,
the notions of Virtual Knowledge Bases (VKBs) [
        <xref ref-type="bibr" rid="ref4 ref5 ref6">5, 6, 4</xref>
        ] and
Memory Networks [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ] are proposed, in which entity
mentions in text are transformed into dense representations to
represent properties or relations expressed with text
passages.
      </p>
      <p>
        Moreover, the advanced reasoning capabilities of LLMs
using RAG in question answering, has emerged a new area
of research where the system takes as input both structured
and unstructured data for reasoning over diferent
modalities [
        <xref ref-type="bibr" rid="ref1 ref10 ref11 ref12 ref13 ref14 ref3 ref9">1, 9, 3, 10, 11, 12, 13, 14</xref>
        ], or use the LLMs as a query
engine to pose SQL queries [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        Simultaneously, the database community has introduced
an innovative research direction involving learning-based
techniques to enhance query execution. Advances such as
learned sorting [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] and learned scans and joins algorithms
[
        <xref ref-type="bibr" rid="ref16 ref17">16, 17</xref>
        ] have demonstrated highly promising results in
optimizing traditional query processes. These methods indicate
that machine learning techniques can significantly improve
fundamental database operations, reinforcing the potential
for a hybrid approach that integrates both database and
LLM methodologies.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Research Questions</title>
      <p>The main purpose of this research proposal is the
development of a prototype execution engine able to execute queries
combining diferent data modalities. The implementation
of such a novel system, emerges a set of research questions:
RQ1 How can structured database tables and unstructured
data sources, like text documents, be efectively
associated to form a unified querying framework?
RQ2 How can dense vector representations be constructed
to preserve both the semantic richness of text and the
structural integrity of database entities?
RQ3 What new operators are required and how can
database operators be adapted to process queries
involving structured and unstructured data?
RQ4 How can a cost-based query optimizer be designed
to generate eficient execution plans for hybrid queries
involving structured and unstructured data?
RQ5 What techniques can be employed to ensure the query
engine scales eficiently for large datasets across
multiple modalities?</p>
      <sec id="sec-3-1">
        <title>3.1. Research Opportunities</title>
        <p>Building upon the related work, the proposed query
engine represents a significant advancement in addressing the
previously outlined research questions.</p>
        <p>Virtual Knowledge Bases (VKBs) generate dense
representations of real-world entities, such as those found within
Wikipedia, to enable querying. However, these
representations have not been applied within the context of database
systems. A key component of this research involves
establishing connections between database entities, as defined by
the data model of each database, and external text corpora
or additional modalities, such as images.</p>
        <p>Furthermore, the current state-of-the-art approach for
integrating multiple data modalities relies on Multi-Modal
Large Language Models (MLLMs). This methodology
typically employs large-scale LLMs to process queries, a strategy
that is computationally expensive and constrained by the
input size limitations inherent to LLMs.</p>
        <p>The primary objective of this research is to enable
eficient execution of multi-modal queries capable of
managing large-scale data in a manner aligned with traditional
database systems. To achieve this, the research will extend
conventional database operators, such as scans and joins,
by developing novel implementation algorithms designed
to process diverse data modalities.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. A Prototype of A Neural Database</title>
    </sec>
    <sec id="sec-5">
      <title>Engine</title>
      <p>In this section, an overview of the proposed neural database
execution engine is provided. Suppose the example query:
"Find all customers who purchased ’Product X’ and had
positive experience regarding the quality of the product, within
the past six months.". This query is transformed into a
multimodal SQL query and sent to the engine for execution. The
example query will assist to describe several aspects of the
proposed system, along with the execution flow of a hybrid
query between database tables and a text corpora.</p>
      <p>
        In Figure [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], we present the architecture of our query
engine. The example query is parsed and the system fetches
the Product and Sales database tables and their related
mention tables. Then, the optimizer is invoked to generate an
optimal execution plan for the given query. The optimizer
faces many challenges like selecting the appropriate scan
and join operators within and across modalities, while
predicting their optimal order in the execution plan. In Figure 1,
diferent physical operators are separated to make clear the
diferent processing steps. Finally, the generated execution
plan is submitted to the neural engine for execution.
      </p>
      <p>Before the query engine can execute queries across both
data modalities, a preparatory phase, referred to as mention
tables construction, is required. In the provided example,
mentions of ’Product X’ must be recognized within text
passages. In this phase, the system generates a series of
key-value (KV) tables that bridge the information within
database tables and text documents stored in blob storage.</p>
      <p>The objective of these KV tables is to create dense
vector representations of entities (keys) that encapsulate the
knowledge embedded in the text corpus (values). These
representations are structured to seamlessly integrate with
a Transformer model, enabling eficient and efective
processing by the query engine in subsequent stages.</p>
      <p>Upon initializing these learned tables, the execution
engine is ready to process queries. When a query is posed,
the system parses it and generates an optimal execution
plan. The plan selection process resembles that of
traditional database systems, wherein the optimizer explores the
space of possible execution plans and evaluates candidate
plans based on a cost model.</p>
      <p>Given the hybrid nature of the proposed query engine,
which supports queries across multiple data modalities, it
is necessary to define new hybrid operators capable of
handling data from both structured and unstructured sources,
like scans, projections, joins etc. These operators are
designed to facilitate seamless integration and processing of
data across the diverse modalities enclosed by the system.</p>
      <p>During the next subsections, the main components of the
proposed query engine are describes. Initially, the process
of mention tables is described, a methodology to associate
table data with the available text corpus. Next, the core of
the execution engine is detailed, focusing on the needed
operators and the query optimizer of the system.</p>
      <sec id="sec-5-1">
        <title>4.1. Mention Tables</title>
        <p>
          As previously noted, it is essential to establish associations
between data from database tables and the available text
documents, defining the specific types of information to be
retrieved and assimilated across these data sources [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. The
database schema provides a structured representation of the
entities within the database, with clearly defined properties
for each entity type.
        </p>
        <p>
          Thus, an initial processing step is proposed between the
diferent data sources, where each passage in text documents
is annotated with the main entities (fact tables) from the
database and we highlight entity mentions in the passage
with special tokens. Figure [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] shows the construction of
mention tables. Representations of these tokens are later
used to generate entity encodings.
        </p>
        <p>
          The goal of mention tables is to gather these database
entity encodings into matrices constructing key-value stores
containing the dense vector representations for each entity
in text documents forming a virtual knowledge base of the
available text documents, like [
          <xref ref-type="bibr" rid="ref18 ref4 ref7 ref8">7, 4, 18, 8</xref>
          ]
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>4.2. Neural Query Execution</title>
        <p>To extend the querying capabilities of traditional database
systems across multiple data modalities, it becomes
necessary to adapt and expand conventional database operators
to efectively manage and process data from both structured
and unstructured sources.</p>
        <p>Within a neural database system, operators are
categorized into two distinct types: a) single-modal operators,
which are designed to process a single data modality (e.g.,
structured table data or unstructured text passages), such as
scan operations, and b) multi-modal operators, which are
capable of processing and associating inputs across multiple
data modalities, such as join and aggregation operations,
that integrate information from both structured and
unstructured sources. Thus, there is an emerging need to extend
traditional relational algebra used in database systems, to
describe the new neural operators for diferent modalities.</p>
        <p>In database systems, all operators in relational algebra
take as input a relation and the output is the result of the
operator applied on the input relation, which is again a
relation. In the case of the proposed query engine, we need
to define the neural operators regarding the scan, filter and
project of mention tables, as well the join implementation
between the a database table and a mention table.</p>
        <p>Scan Mention Tables Scan operations over mention
tables enable eficient querying and processing of
entityassociated text passages. These include entity retrieval scans,
which extract passages linked to specific entities, like
’Product X’ in example query, and mention highlight scans, which
identify all occurrences of targeted entities. Contextual
similarity scans rank passages based on semantic relevance to a
query vector, while entity-to-entity relationship scans reveal
co-occurrences within text. Additional methods, such as
aggregated entity statistics scans and temporal or categorical
iflters , allow for deeper insights by analyzing mention
frequency, context diversity, or filtering by specific attributes.
Advanced operations, like neighborhood scans for exploring
entity connections and multi-modal entity scans for linking
to database tables, further enhance the querying capabilities
of mention tables. These methods leverage the dense vector
representations of entities to facilitate robust and flexible
data exploration.</p>
        <p>Join Mention &amp; Database Tables For join operations in
the proposed query engine, two types of joins are possible:
a) joins within mention tables and b) hybrid joins between
database and mention tables.</p>
        <p>Joins within mention tables enable the discovery of
relationships between entities based on shared textual contexts
or semantic relevance. These include entity co-occurrence
joins, which retrieve passages where multiple entities are
mentioned together, and contextual similarity joins, which
link entities based on the similarity of their dense vectors
and passage-level joins, which connect text passages that
reference related entities, enabling richer narratives</p>
        <p>Joins between mention tables and database tables
integrate structured and unstructured data to provide a unified
query interface. Entity-ID joins link entities in mention
tables to their corresponding database records, while
propertybased joins combine entities based on shared attributes, such
as linking customer mentions with their structured profiles.
In the example query, the first join of the execution plan
is an Entity-ID join between Product database and mention
tables. Aggregated knowledge joins enrich database records
with insights from text passages, and hybrid semantic joins
bridge structured relationships in the database with
semantic similarity in mention tables, enabling advanced querying
across diverse data modalities.</p>
        <p>
          While the traditional database operators are well-defined,
the landscape of neural operators for execution is an active
area of research. There are eforts from the database
community enhancing traditional operators [
          <xref ref-type="bibr" rid="ref13 ref15 ref16">15, 16, 13</xref>
          ] with
neural models for faster query processing, while there are
approaches that propose learned operators, e.g learned scans
and joins [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. Further, the proposed query engine can
utilize the reasoning capabilities of LLMs to provide reasoning
or summaries on the results of the aforementioned
operators at the end of query results or on some intermediate
step of query processing. In the provided example, the LLM
is invoked to evaluate all reviews text passages per product.
        </p>
      </sec>
      <sec id="sec-5-3">
        <title>4.3. Query Optimization</title>
        <p>The query optimizer for the proposed neural execution
engine bridges the gap between structured and unstructured
data processing, enabling eficient query execution across
database tables and mention tables. By integrating
traditional database strategies with neural processing, it ensures
scalability and adaptability for hybrid, multi-modal queries.</p>
        <p>A core characteristic of the optimizer is its cost-based
approach, which evaluates potential query execution plans
based on resource consumption, including computation
time, cardinality estimation on both database and mention
tables, memory usage, and I/O overhead. For neural
operators, additional factors such as the cost of vector similarity
computations and embedding generation are built-in the
cost model, ensuring an accurate evaluation of query plans.</p>
        <p>Moreover, two very important aspects are the query
decomposition and the cross-modal data flow. The optimizer
decomposes complex queries into into modality-specific
sub-queries, ensuring eficient processing of structured and
unstructured data by simultaneously selecting the most
appropriate operators, as well as, their optimal order in the
execution plan. In this context, the proposed execution plan
should minimizes redundant computations and
intermediate results, optimizing data transfer between operators, for
eficient cross-modal data flow.</p>
        <p>Finally, the optimizer is designed to adapt to dynamic
query workloads and evolving data characteristics by
supporting runtime re-optimization. It monitors operator
performance during execution and adjusts plans as needed.
Furthermore, it integrates pre-trained or fine-tuned
neural models for unstructured data processing, ensuring their
efective and eficient use in query execution.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>5. Conclusions and Future Work</title>
      <p>This paper presents a prototype query engine designed to
execute queries across multiple data modalities eficiently
and at scale. A central proposition of this work is the initial
association of key entities from the database with their
corresponding references within the text corpus. The system
then constructs mention tables, key-value tables containing
dense vector representations of entities and their related
textual passages. The results of this approach have the
potential to inspire new algorithms that link structured database
information with external unstructured data sources.</p>
      <p>Furthermore, this novel tabular representation of
unstructured text enables the development of specialized operators
that the query engine must support. The design and
implementation of these operators establish the foundational
components of the envisioned query engine. Additionally,
the integration of these operators calls for the development
of a new generation of query optimizers capable of
generating eficient execution plans across both structured and
unstructured data modalities. This research lays the
fundamentals for a novel approach to querying multi-modal data
and opens new avenues for future exploration in hybrid
query optimization and execution strategies.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>This work has been partially supported by DataGEMS,
funded by the European Union’s Horizon Europe Research
and Innovation programme, under grant agreement No
101188416 and by project MIS 5154714 of the National
Recovery and Resilience Plan Greece 2.0 funded by the European
Union under the NextGenerationEU Program.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Biswal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Patel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kamsetty</surname>
          </string-name>
          , S. Liu,
          <string-name>
            <given-names>J. E.</given-names>
            <surname>Gonzalez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Guestrin</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Zaharia, Text2sql is not enough: Unifying ai and databases with tag</article-title>
          ,
          <source>arXiv preprint arXiv:2408.14717</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Saeed</surname>
          </string-name>
          , N. De Cao, P. Papotti,
          <article-title>Querying large language models with sql</article-title>
          ,
          <source>arXiv preprint arXiv:2304.00472</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>L.</given-names>
            <surname>Patel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Guestrin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zaharia</surname>
          </string-name>
          , Lotus:
          <article-title>Enabling semantic queries with llms over tables of unstructured and structured data</article-title>
          ,
          <source>arXiv preprint arXiv:2407.11418</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zemlyanskiy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ainslie</surname>
          </string-name>
          , M. de Jong, P. Pham,
          <string-name>
            <given-names>I.</given-names>
            <surname>Eckstein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Sha</surname>
          </string-name>
          , Readtwice:
          <article-title>Reading very large documents with memories</article-title>
          ,
          <source>in: Proceedings of NAACL</source>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>B.</given-names>
            <surname>AlKhamissi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Celikyilmaz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Diab</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ghazvininejad</surname>
          </string-name>
          ,
          <article-title>A review on language models as knowledge bases, 2022</article-title>
          . URL: https://arxiv.org/abs/ 2204.06031. arXiv:
          <volume>2204</volume>
          .
          <fpage>06031</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6] M. de Jong, Y. Zemlyanskiy,
          <string-name>
            <given-names>N. A.</given-names>
            <surname>FitzGerald</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Sha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. W.</given-names>
            <surname>Cohen</surname>
          </string-name>
          ,
          <article-title>Mention memory: incorporating textual knowledge into transformers through entity mention attention</article-title>
          ,
          <source>in: 10th International Conference on Learning Representations, ICLR</source>
          <year>2022</year>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Sukhbaatar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Weston</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fergus</surname>
          </string-name>
          , et al.,
          <article-title>End-to-end memory networks</article-title>
          ,
          <source>Advances in neural information processing systems</source>
          <volume>28</volume>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Lei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <article-title>Training language models with memory augmentation</article-title>
          ,
          <source>in: Proceedings of the 2022 Conference on EMNLP, Association for Computational Linguistics</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>5657</fpage>
          -
          <lpage>5673</lpage>
          . URL: https://aclanthology.org/
          <year>2022</year>
          .emnlp-main.
          <volume>382</volume>
          /. doi:
          <volume>10</volume>
          .18653/v1/
          <year>2022</year>
          .emnlp-main.
          <volume>382</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>L.</given-names>
            <surname>Patel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Kraft</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Guestrin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Zaharia</surname>
          </string-name>
          ,
          <article-title>Acorn: Performant and predicate-agnostic search over vector embeddings and structured data</article-title>
          ,
          <source>Proceedings of the ACM on Management of Data</source>
          <volume>2</volume>
          (
          <year>2024</year>
          )
          <fpage>1</fpage>
          -
          <lpage>27</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dargahi Nobari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Rafiei</surname>
          </string-name>
          ,
          <string-name>
            <surname>Dtt:</surname>
          </string-name>
          <article-title>An example-driven tabular transformer for joinability by leveraging large language models</article-title>
          ,
          <source>Proceedings of the ACM on Management of Data (SIGMOD) 2</source>
          (
          <year>2024</year>
          ). URL: https://doi. org/10.1145/3639279. doi:
          <volume>10</volume>
          .1145/3639279.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>G.</given-names>
            <surname>Badaro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Saeed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Papotti</surname>
          </string-name>
          ,
          <article-title>Transformers for tabular data representation: A survey of models and applications</article-title>
          ,
          <source>Transactions of the Association for Computational Linguistics</source>
          <volume>11</volume>
          (
          <year>2023</year>
          )
          <fpage>227</fpage>
          -
          <lpage>249</lpage>
          . URL: https://aclanthology.org/
          <year>2023</year>
          .tacl-
          <volume>1</volume>
          .14/. doi:
          <volume>10</volume>
          . 1162/tacl_a_
          <fpage>00544</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>M. J. Cafarella</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Re</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Suciu</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          <string-name>
            <surname>Etzioni</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Banko</surname>
          </string-name>
          ,
          <article-title>Structured querying of web text</article-title>
          ,
          <source>in: 3rd Biennial Conference on Innovative Data Systems Research (CIDR)</source>
          , Asilomar, California, USA,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>C.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Russo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Cafarella</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Cao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. B.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Franklin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Kraska</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Madden</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Vitagliano</surname>
          </string-name>
          ,
          <article-title>A declarative system for optimizing ai workloads</article-title>
          ,
          <source>arXiv preprint arXiv:2405.14696</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hulsebos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ma</surname>
          </string-name>
          , S. Shankar,
          <string-name>
            <given-names>S.</given-names>
            <surname>Zeigham</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. G.</given-names>
            <surname>Parameswaran</surname>
          </string-name>
          , E. Wu,
          <article-title>Towards accurate and eficient document analytics with large language models</article-title>
          ,
          <year>2024</year>
          . URL: https://arxiv.org/abs/2405.04674. arXiv:
          <volume>2405</volume>
          .
          <fpage>04674</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kristo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Vaidya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Çetintemel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Misra</surname>
          </string-name>
          , T. Kraska,
          <article-title>The case for a learned sorting algorithm</article-title>
          ,
          <source>in: Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data, SIGMOD '20</source>
          ,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA,
          <year>2020</year>
          , p.
          <fpage>1001</fpage>
          -
          <lpage>1016</lpage>
          . URL: https://doi.org/10.1145/ 3318464.3389752. doi:
          <volume>10</volume>
          .1145/3318464.3389752.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>I.</given-names>
            <surname>Sabek</surname>
          </string-name>
          , T. Kraska,
          <article-title>The case for learned inmemory joins</article-title>
          ,
          <source>Proc. VLDB Endow</source>
          .
          <volume>16</volume>
          (
          <year>2023</year>
          )
          <fpage>1749</fpage>
          -
          <lpage>1762</lpage>
          . URL: https://doi.org/10.14778/3587136. 3587148. doi:
          <volume>10</volume>
          .14778/3587136.3587148.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>M.</given-names>
            <surname>Urban</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Binnig</surname>
          </string-name>
          , Eleet:
          <article-title>Eficient learned query execution over text and tables</article-title>
          , volume
          <volume>17</volume>
          ,
          <string-name>
            <given-names>VLDB</given-names>
            <surname>Endowment</surname>
          </string-name>
          ,
          <year>2024</year>
          , pp.
          <fpage>4867</fpage>
          -
          <lpage>4880</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>A.</given-names>
            <surname>Miller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Fisch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Dodge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.-H.</given-names>
            <surname>Karimi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bordes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Weston</surname>
          </string-name>
          ,
          <article-title>Key-value memory networks for directly reading documents</article-title>
          ,
          <source>in: Proceedings of the 2016 Conference on EMNLP, Association for Computational Linguistics</source>
          , Austin, Texas,
          <year>2016</year>
          . URL: https://aclanthology.org/D16-1147. doi:
          <volume>10</volume>
          . 18653/v1/
          <fpage>D16</fpage>
          -1147.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>