<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic Caching for OLAP via LLM-Based Query Canonicalization</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Laurent Bindschaedler</string-name>
          <email>bindsch@mpi-sws.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Max Planck Institute for Software Systems</institution>
          ,
          <addr-line>Saarbrücken</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>OLAP</institution>
          ,
          <addr-line>Caching, Canonicalization, Signatures, Semantics, Star-schema, Natural language, LLMs</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2026</year>
      </pub-date>
      <abstract>
        <p>Analytical workloads exhibit substantial semantic repetition, yet most production caches key entries by SQL surface form (text or AST), fragmenting reuse across BI tools, notebooks, and NL interfaces. We introduce a safety-first middleware cache for dashboard-style OLAP over star schemas that canonicalizes both SQL and NL into a unified key space-the OLAP Intent Signature-capturing measures, grouping levels, filters, and time windows. Reuse requires exact intent matches under strict schema validation and confidence-gated NL acceptance; two correctness-preserving derivations (roll-up, filter-down) extend coverage without approximate matching. Across TPC-DS, SSB, and NYC TLC (1,395 queries), we achieve 82% hit rate versus 28% (text) and 56% (AST) with zero false hits; derivations double hit rate on hierarchical queries.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>https://binds.ch (L. Bindschaedler)</p>
      <p>CEUR
Workshop</p>
      <p>
        ISSN1613-0073
on hierarchical workloads. Although NL canonicalization accuracy degrades under ambiguity (44%
adversarial, 51% BIRD [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]), layered safety mechanisms prevent incorrect reuse. At a 0.5 confidence
threshold, precision reaches 76.9% at 36.5% coverage.
      </p>
      <p>This paper makes the following contributions:
• An OLAP Intent Signature that canonicalizes heterogeneous SQL and NL into a unified cache key
space, enabling cross-client reuse without requiring a single semantic API.
• Safety-first reuse with schema validation and confidence gating, with quantified NL failure modes
(44% adversarial accuracy) showing why safety mechanisms are essential.
• Safe derivations (roll-up, filter-down) that extend reuse without approximate matching.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Background and Motivation</title>
      <p>
        Most production query caches key entries by SQL surface form (normalized text or AST-derived
representations) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. This fragments reuse: heterogeneous clients (BI tools, notebooks, NL interfaces)
generate diferent SQL for identical questions. The result is high semantic repetition without cache reuse.
We focus on query-level caching (complete results), not cell-level OLAP caches that store individual
cube cells [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]; our middleware approach works with any SQL-compatible backend. Our key observation
is that dashboard-style OLAP queries over star schemas share stable semantic structure [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]: measures,
grouping levels, filters, and time windows. We key the cache by an OLAP Intent Signature encoding
these components, enabling cross-client and cross-modality reuse.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. System Design</title>
      <p>This section describes a middleware cache that answers OLAP-style aggregation queries. It maps each
request (SQL or NL) to a structured OLAP Intent Signature, the cache key. The design is correctness-first:
reuse requires exact intent equality with strict validation. Extensions beyond exact matches are limited
to correctness-preserving transformations under stated assumptions.</p>
      <sec id="sec-3-1">
        <title>3.1. Scope and Assumptions</title>
        <p>
          We focus on a high-value class of dashboard queries, specifically aggregations over a star or snowflake
schema [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] with a single fact table and joins to dimension tables along schema-defined foreign keys. The
queries may consist of WHERE, GROUP BY, HAVING, and optionally ORDER BY and LIMIT. We deliberately
exclude features that complicate semantic equivalence, such as window functions, set operations,
correlated subqueries, recursive CTEs, and lateral joins.
        </p>
        <p>Scope Coverage. In TPC-DS, only 14% of queries qualify (the rest use window functions, CTEs, or
set operations); SSB and NYC TLC are 100% covered. Queries outside the scope of supported operations
bypass the cache and execute directly on the backend.</p>
        <p>To make equivalence checkable, we assume that join semantics are dictated by the schema: given the
fact table and referenced dimension attributes, there is a unique join path.</p>
        <p>
          Terminology. We adopt standard OLAP terminology [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]: a dimension is a conceptual grouping (e.g.,
Time, Geography), while a level is a specific granularity within a dimension hierarchy (e.g., Year &gt;
Quarter &gt; Month). Roll-up aggregates from finer to coarser levels; drill-down moves from coarser to
ifner levels. Slicing fixes a single dimension value; dicing selects ranges across multiple dimensions.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Architecture</title>
        <p>Figure 1 illustrates the system architecture. The middleware operates as an intermediary between
clients, such as BI tools, notebooks, and NL interfaces, and the backend OLAP engine.</p>
        <p>For each request, the following steps are taken: (1) canonicalize the request into an intent signature,
(2) validate the signature against the schema and safety rules, (3) look up the signature hash in the cache,
(4) on a miss, execute on the backend and store the result under the signature. Because canonicalization
NL</p>
        <p>Canonicalizer</p>
        <p>Validator</p>
        <p>Hash
bypass
miss</p>
        <p>Cache
Backend</p>
        <p>hit
populate</p>
        <p>Result
execute
and validation are distinct steps, reuse decisions are auditable: the signature is an explicit contract that
determines whether a cached result may be returned.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. OLAP Intent Signature</title>
        <p>We define a canonical signature as a JSON object containing all semantics that can afect the numerical
output of the query. The components of the signature include:
• Measures: aggregation function and base expression (for example SUM(f_sales)), including</p>
        <p>DISTINCT flags.
• Grouping levels: list of hierarchy levels in GROUP BY, sorted alphabetically to a canonical order (for
example geo.region, time.month). SQL GROUP BY order is semantically irrelevant; we canonicalize
to ensure equivalent queries produce identical signatures.
• Filters: a set of normalized predicates over non-temporal dimensions and facts, including canonical
literal formats.
• Time window: explicit start and end boundaries on the time dimension, normalized to a canonical
representation. We separate time windows from general filters for two reasons: (i) temporal
predicates require special canonicalization (resolving “last quarter” to concrete dates, handling timezone
normalization), and (ii) time boundaries are critical for cache invalidation—when new data arrives,
entries with open-ended time windows (“last 30 days”) must be refreshed while closed windows (“Q1
2024”) remain valid.
• Post-aggregation operators: HAVING, ORDER BY, LIMIT.
• Metric identity (optional): a metric identifier from a governed layer, if available.
• Scope (optional): for multi-tenant deployments, a tenant or user identifier ensures cache isolation
across security boundaries.</p>
        <p>The optional fields are relevant for governed or multi-tenant deployments; our evaluation uses
ungoverned single-tenant schemas.</p>
        <p>We serialize the signature into a canonical JSON string (with sorted keys and normalized lists) and
compute a hash (SHA-256) to obtain a fixed-length cache key. This ensures diferent surface forms
map to the same signature (Figure 2). Join paths are implicit, determined by the schema’s foreign keys,
so signatures omit them; queries with ambiguous joins (role-playing dimensions, self-joins) bypass
caching.</p>
        <p>NL: “Show total revenue by region for electronics in Q1 2024”
SQL:
SELECT r.region_name,</p>
        <p>SUM(s.amount) AS revenue
FROM sales s</p>
        <p>JOIN regions r ON s.region_id = r.id</p>
        <p>JOIN products p ON s.product_id = p.id
WHERE p.category = 'electronics'</p>
        <p>AND s.sale_date &gt;= '2024-01-01'</p>
        <p>AND s.sale_date &lt; '2024-04-01'
GROUP BY r.region_name
{
}
OLAP Intent Signature:
"measures": [{"agg": "SUM",</p>
        <p>"expr": "sales.amount"}],
"levels": ["regions.region_name"],
"filters": [{"col": "products.category",</p>
        <p>"op": "=", "val": "electronics"}],
"time_window": {
"start": "2024-01-01",
"end": "2024-04-01"}</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Canonicalization</title>
        <p>SQL requests are parsed into an AST and normalized deterministically (identifier resolution, predicate
ordering, literal canonicalization). NL requests are mapped to signatures using an LLM constrained
to produce strict JSON; the LLM also returns a confidence score (0–1) used for safety gating.
Lowconfidence signatures bypass caching to avoid incorrect reuse.</p>
      </sec>
      <sec id="sec-3-5">
        <title>3.5. Validation and Cache Lookup</title>
        <p>Before reuse, we validate that all referenced measures/dimensions exist, time windows resolve to
concrete boundaries, and join paths are unambiguous. Validation failures bypass the cache and execute
on the backend, prioritizing misses over incorrect reuse.</p>
      </sec>
      <sec id="sec-3-6">
        <title>3.6. Correctness-Preserving Reuse Beyond Exact Hits</title>
        <p>Exact intent matching is the default reuse mode. We also support two safe derivations:
Roll-Up from Finer-Grained Cache. If a cached entry has identical measures and filters but finer
grouping levels, we can re-aggregate the cached result. Roll-up is permitted only for composable
aggregations (SUM, COUNT, MIN, MAX); it is rejected for AVG, COUNT DISTINCT, or ratios.
Filter-Down from Cached Supersets. If a cached entry uses a superset filter and contains the
attributes needed to apply a tighter filter, we filter the cached result. Derivations are disabled when
ORDER BY or LIMIT is present.</p>
        <p>Both derivations have explicit preconditions that prevent incorrect reuse. Note that drill-down (finer
← coarser) is not supported: query-level caching lacks the detailed data needed to derive finer-grained
results from coarser aggregates.</p>
      </sec>
      <sec id="sec-3-7">
        <title>3.7. Safety Policy for NL-Driven Reuse</title>
        <p>NL canonicalization can produce schema-valid but semantically incorrect signatures. We control NL
reuse via layered policies: (1) schema validation for structural correctness, (2) confidence-gated reuse
bypassing low-confidence signatures, and (3) heuristic checks rejecting common ambiguity patterns
(unresolved relative time, underspecified spatial terms). These layers prioritize misses over false hits.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Prototype Implementation</title>
      <p>We implement the middleware as a Python prototype with three components. The canonicalizer parses
SQL via sqlglot and transforms it into intent signatures through deterministic AST normalization; NL
requests use an LLM endpoint emitting schema-constrained JSON with confidence scores. The validator
checks that referenced measures/dimensions exist and join paths are unambiguous; failures bypass the
cache. The cache store uses Parquet files indexed by signature hash with a SQLite metadata index for
derivation candidate lookup. NL string-to-signature mappings are memoized to avoid repeat LLM calls.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Evaluation</title>
      <sec id="sec-5-1">
        <title>5.1. Setup and Results</title>
        <p>
          We evaluate on three OLAP workloads: TPC-DS [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] (14 in-scope queries), SSB [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] (13 queries), and
NYC TLC [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] (18 queries). For each of 45 canonical intents, we generate 21 SQL variants (formatting,
alias, predicate order changes) and 10 NL paraphrases, yielding 1,395 queries total. All variants are
verified to produce identical results, establishing ground truth for hit-rate and false-hit measurement.
We compare against TextCache (normalized SQL text), ASTCache (deterministic AST canonicalization),
and NL-to-SQL+AST (NL translated to SQL, then AST-cached). For NL robustness, we also test on 63
adversarial queries and 150 BIRD [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] human-authored questions.
        </p>
        <p>Table 1 reports hit rates and reduction factors. LLMSigCache achieves the highest hit rate on all
three workloads (82% average), outperforming TextCache (28.2%) and ASTCache (55.6%).</p>
        <p>Within LLMSigCache, NL reuse arises from both NL-to-NL hits (paraphrases of the same intent) and
cross-surface hits (NL matching SQL-populated entries or vice versa). Cross-surface sharing accounts
for 3–34% of NL cache hits; most NL benefit comes from unifying paraphrases under intent keys.</p>
        <p>
          All methods produce zero false hits on controlled workloads. NL accuracy drops to 44% on
adversarial queries and 51% on BIRD [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] human-authored questions, making safety essential: a 0.5 confidence
threshold yields 77% precision at 37% coverage. Backend compute drops 85–90%. Safe derivations
(roll-up, filter-down) raise hit rate from 37% to 80% on hierarchical workloads.
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. RQ2: Does Intent-Signature Caching Preserve Correctness?</title>
        <p>
          NL Semantic Accuracy under Ambiguity. NL canonicalization accuracy degrades under realistic
ambiguity. On 63 adversarial queries (Table 2), accuracy is 44.4%; on 150 human-authored BIRD [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]
questions, 51.3%. Time references (“last month”) and dimension terms (“area”) are most error-prone.
        </p>
      </sec>
      <sec id="sec-5-3">
        <title>5.3. RQ3: What Are the Backend Savings and Overheads?</title>
        <p>SQL cache lookup adds 9–16 ms median overhead (Table 4a). NL canonicalization costs ∼1.3 s on first
occurrence but is memoized for repeats. Dashboard-like query orderings (Sequential, Zipf) maintain
high hit rates even at 10–25% cache capacity (Table 4b).</p>
      </sec>
      <sec id="sec-5-4">
        <title>5.4. RQ4: Do Correctness-Preserving Derivations Extend Coverage?</title>
        <p>We construct an SSB hierarchical workload where queries drill through time, geography, and product
hierarchies. TPC-DS and NYC TLC lack systematic hierarchy traversal, so derivations provide limited
benefit on those workloads. Without derivations, hit rate is 37%; enabling roll-up and filter-down raises
it to 80% with zero false hits.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Discussion</title>
      <p>
        Cache Invalidation. Our evaluation uses static snapshots, but production systems must link entries
to data freshness [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Queries with closed time windows (“Q1 2024”) remain valid indefinitely;
openended windows (“last 30 days”) require refresh. Empirical characterization of cache churn under realistic
update patterns is left for future work.
      </p>
      <p>Deployment Configurations. The precision-coverage trade-of is tunable: conservative (threshold
0.7, all heuristics) yields 71.4% precision at 22.2% coverage; balanced (0.5, time+spatial) reaches 72.0% at
39.7%; disabling safety drops to 48.3% precision with 30 wrong results.</p>
      <p>
        Limitations. Key limitations include: (1) queries with window functions, set operations, or CTEs
bypass the cache; in TPC-DS, only 14% of queries qualify, though dashboard-oriented workloads
(SSB, NYC TLC) are fully covered; (2) NL canonicalization accuracy drops on ambiguous inputs (44%
adversarial, 51% BIRD [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]), requiring confidence gating and heuristic checks; (3) roll-up derivations are
limited to additive measures; and (4) synthetic workloads may overstate hit rates versus production trafic.
Cache invalidation under data updates remains the primary systems gap for production deployment.
      </p>
    </sec>
    <sec id="sec-7">
      <title>7. Related Work</title>
      <p>
        Classic semantic caching [
        <xref ref-type="bibr" rid="ref13 ref14">13, 14</xref>
        ] stores results by predicate regions. We use a structured intent signature
derivable from both SQL and NL. Prior work on aggregate views [
        <xref ref-type="bibr" rid="ref4 ref5">5, 4</xref>
        ] and summarizability [
        <xref ref-type="bibr" rid="ref15 ref16">15,
16</xref>
        ] informs our roll-up derivations. Text-to-SQL research [
        <xref ref-type="bibr" rid="ref17 ref18 ref19 ref6">17, 6, 18, 19</xref>
        ] enables NL→SQL→cache
pipelines, but translation variability fragments reuse; we canonicalize NL directly to intent signatures.
BI semantic layers [
        <xref ref-type="bibr" rid="ref20 ref21">20, 21</xref>
        ] cache within platform-specific models; we provide portable middleware
across heterogeneous clients. Embedding-based caches [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] use similarity but cannot ensure correctness.
      </p>
    </sec>
    <sec id="sec-8">
      <title>8. Conclusion</title>
      <p>We introduced a safety-first semantic cache for dashboard-style OLAP that canonicalizes SQL and NL
into a unified key space, the OLAP Intent Signature. Evaluation shows large hit-rate and backend-savings
gains over text and AST caching, while safety mechanisms prevent incorrect reuse despite NL errors.
Key directions for future work include robust cache invalidation under data updates, stronger schema
grounding for NL canonicalization, and extending input modalities to MDX.</p>
    </sec>
    <sec id="sec-9">
      <title>Declaration on Generative AI</title>
      <p>The LLMs evaluated in this work (GPT-4o-mini, Claude-3.5-haiku) are components of the proposed
system and were not used to prepare the manuscript. The author used Claude (Anthropic) and
Grammarly for writing style, grammar, spelling, and formatting, and OpenAI Deep Research for citation
management. The author reviewed and edited all outputs and takes full responsibility for the content.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>B.</given-names>
            <surname>Mohammadi</surname>
          </string-name>
          ,
          <string-name>
            <surname>L. Bindschaedler,</surname>
          </string-name>
          <article-title>The case for instance-optimized LLMs in OLAP databases</article-title>
          ,
          <source>in: Proceedings of DOLAP</source>
          <year>2025</year>
          : 27th International Workshop on Design,
          <article-title>Optimization, Languages and Analytical Processing of Big Data, co-located with EDBT/ICDT 2025</article-title>
          , volume
          <volume>3931</volume>
          <source>of CEUR Workshop Proceedings</source>
          , CEUR-WS.org, Barcelona, Spain,
          <year>2025</year>
          , pp.
          <fpage>59</fpage>
          -
          <lpage>63</lpage>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>3931</volume>
          /short3.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Gray</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chaudhuri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bosworth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Layman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Reichart</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Venkatrao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Pellow</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Pirahesh</surname>
          </string-name>
          ,
          <article-title>Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals, Data Mining and Knowledge Discovery 1 (</article-title>
          <year>1997</year>
          )
          <fpage>29</fpage>
          -
          <lpage>53</lpage>
          . URL: https://doi.org/10.1023/A:1009726021843. doi:
          <volume>10</volume>
          .1023/A:
          <fpage>1009726021843</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3] ClickHouse, Query cache,
          <source>ClickHouse Docs</source>
          ,
          <year>2026</year>
          . URL: https://clickhouse.com/docs/operations/ query-cache,
          <source>last modified 2026-01-15 (Git history)</source>
          .
          <source>Accessed</source>
          <year>2026</year>
          -
          <volume>02</volume>
          -20.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>C.-S.</given-names>
            <surname>Park</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.-H. Kim</surname>
            ,
            <given-names>Y.-J.</given-names>
          </string-name>
          <string-name>
            <surname>Lee</surname>
          </string-name>
          ,
          <article-title>Rewriting OLAP queries using materialized views and dimension hierarchies in data warehouses</article-title>
          ,
          <source>in: Proceedings of the 17th International Conference on Data Engineering (ICDE</source>
          <year>2001</year>
          ), Heidelberg, Germany, April 2-
          <issue>6</issue>
          ,
          <year>2001</year>
          , IEEE Computer Society,
          <year>2001</year>
          , pp.
          <fpage>515</fpage>
          -
          <lpage>523</lpage>
          . URL: https://doi.org/10.1109/ICDE.
          <year>2001</year>
          .
          <volume>914865</volume>
          . doi:
          <volume>10</volume>
          .1109/ICDE.
          <year>2001</year>
          .
          <volume>914865</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. V.</given-names>
            <surname>Jagadish</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. Y. Levy</surname>
          </string-name>
          ,
          <article-title>Answering queries with aggregation using views</article-title>
          ,
          <source>in: Proceedings of the 22nd International Conference on Very Large Data Bases (VLDB '96)</source>
          , Mumbai (Bombay),
          <source>India, September 3-6</source>
          ,
          <year>1996</year>
          , Morgan Kaufmann,
          <year>1996</year>
          , pp.
          <fpage>318</fpage>
          -
          <lpage>329</lpage>
          . URL: https://www.vldb.org/conf/1996/P318.PDF.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Hui</surname>
          </string-name>
          , G. Qu,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Qin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Geng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Huo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhou</surname>
          </string-name>
          , C. Ma, G. Li,
          <string-name>
            <given-names>K.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Huang</surname>
          </string-name>
          , R. Cheng, Y. Li,
          <article-title>Can LLM already serve as A database interface? A BIg bench for large-scale database grounded text-to-</article-title>
          <string-name>
            <surname>SQLs</surname>
          </string-name>
          ,
          <source>in: Advances in Neural Information Processing Systems</source>
          , volume
          <volume>36</volume>
          ,
          <year>2023</year>
          . URL: https://proceedings.neurips.cc/paper_files/paper/2023/ hash/83fc8fab1710363050bbd1d4b8cc0021-Abstract-Datasets_and_Benchmarks.html.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Hyde</surname>
          </string-name>
          , Cache control,
          <source>Pentaho Mondrian Documentation</source>
          ,
          <year>2011</year>
          . URL: https://mondrian.pentaho. com/documentation/cache_control.php, author: Julian Hyde;
          <article-title>last modified by Luc Boudreau</article-title>
          ,
          <year>August 2011</year>
          .
          <source>Accessed</source>
          <year>2026</year>
          -
          <volume>02</volume>
          -20.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Golfarelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Rizzi</surname>
          </string-name>
          , Data Warehouse Design:
          <article-title>Modern Principles and Methodologies</article-title>
          ,
          <source>McGraw Hill Professional</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Transaction</given-names>
            <surname>Processing</surname>
          </string-name>
          <article-title>Performance Council, TPC Benchmark™ DS Standard Specification</article-title>
          ,
          <source>Transaction Processing Performance Council</source>
          ,
          <year>2021</year>
          . URL: https://www.tpc.org/tpc_documents_current_ versions/pdf/tpc-ds
          <source>_v3.2.0.pdf, version 3.2.0.</source>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>P. O'Neil</surname>
            ,
            <given-names>B. O</given-names>
          </string-name>
          <string-name>
            <surname>'Neil</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          <string-name>
            <surname>Chen</surname>
          </string-name>
          , Star Schema Benchmark,
          <source>Technical Report</source>
          , University of Massachusetts Boston,
          <year>2009</year>
          . URL: https://www.cs.umb.edu/~poneil/StarSchemaB.PDF, revision
          <volume>3</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11] New York City Taxi and
          <article-title>Limousine Commission, TLC trip record data</article-title>
          ,
          <source>NYC Taxi &amp; Limousine Commission website (NYC.gov)</source>
          ,
          <year>2026</year>
          . URL: https://www.nyc.gov/site/tlc/about/tlc
          <article-title>-trip-record-data</article-title>
          . page, accessed
          <year>2026</year>
          -
          <volume>02</volume>
          -20.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. S.</given-names>
            <surname>Mumick</surname>
          </string-name>
          ,
          <article-title>Maintenance of materialized views: Problems, techniques, and applications</article-title>
          ,
          <source>IEEE Data Engineering Bulletin</source>
          <volume>18</volume>
          (
          <year>1995</year>
          )
          <fpage>3</fpage>
          -
          <lpage>18</lpage>
          . URL: https://dblp.org/rec/journals/debu/GuptaM95. html.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S.</given-names>
            <surname>Dar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Franklin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. Þ.</given-names>
            <surname>Jónsson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Srivastava</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tan</surname>
          </string-name>
          ,
          <article-title>Semantic data caching and replacement</article-title>
          ,
          <source>in: Proceedings of the 22nd International Conference on Very Large Data Bases (VLDB '96)</source>
          , Mumbai (Bombay),
          <source>India, September 3-6</source>
          ,
          <year>1996</year>
          , Morgan Kaufmann,
          <year>1996</year>
          , pp.
          <fpage>330</fpage>
          -
          <lpage>341</lpage>
          . URL: https://www.vldb.org/conf/1996/P330.PDF.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>P.</given-names>
            <surname>Scheuermann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Shim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Vingralek</surname>
          </string-name>
          ,
          <article-title>WATCHMAN: A data warehouse intelligent cache manager</article-title>
          ,
          <source>in: Proceedings of the 22nd International Conference on Very Large Data Bases (VLDB '96)</source>
          , Mumbai (Bombay),
          <source>India, September 3-6</source>
          ,
          <year>1996</year>
          , Morgan Kaufmann,
          <year>1996</year>
          , pp.
          <fpage>51</fpage>
          -
          <lpage>62</lpage>
          . URL: https: //www.vldb.org/conf/1996/P051.PDF.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>H.-J. Lenz</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Shoshani</surname>
          </string-name>
          ,
          <article-title>Summarizability in OLAP and statistical data bases</article-title>
          ,
          <source>in: Proceedings of the 9th International Conference on Scientific and Statistical Database Management (SSDBM '97)</source>
          , Olympia, Washington, USA,
          <year>August</year>
          11-
          <issue>13</issue>
          ,
          <year>1997</year>
          , IEEE Computer Society,
          <year>1997</year>
          , pp.
          <fpage>132</fpage>
          -
          <lpage>143</lpage>
          . URL: https://doi.org/10.1109/SSDM.
          <year>1997</year>
          .
          <volume>621175</volume>
          . doi:
          <volume>10</volume>
          .1109/SSDM.
          <year>1997</year>
          .
          <volume>621175</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>T. B. Pedersen</surname>
            ,
            <given-names>C. S.</given-names>
          </string-name>
          <string-name>
            <surname>Jensen</surname>
            ,
            <given-names>C. E.</given-names>
          </string-name>
          <string-name>
            <surname>Dyreson</surname>
          </string-name>
          ,
          <article-title>Extending practical pre-aggregation in on-line analytical processing</article-title>
          ,
          <source>in: Proceedings of the 25th International Conference on Very Large Data Bases (VLDB '99)</source>
          , Edinburgh, Scotland, UK, September 7-
          <issue>10</issue>
          ,
          <year>1999</year>
          ,
          <year>1999</year>
          , pp.
          <fpage>663</fpage>
          -
          <lpage>674</lpage>
          . URL: https://www.vldb.org/conf/1999/P62.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>T.</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Yasunaga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Yao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Roman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , D. Radev,
          <article-title>Spider: A large-scale human-labeled dataset for complex and cross-domain semantic parsing and text-to-SQL task</article-title>
          ,
          <source>in: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP</source>
          <year>2018</year>
          ), Brussels, Belgium,
          <source>October 31-November 4</source>
          ,
          <year>2018</year>
          ,
          <year>2018</year>
          , pp.
          <fpage>3911</fpage>
          -
          <lpage>3921</lpage>
          . URL: https://aclanthology.org/D18-1425/. doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>D18</fpage>
          - 1425.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>M.</given-names>
            <surname>Pourreza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Rafiei</surname>
          </string-name>
          , DIN-SQL:
          <article-title>Decomposed in-context learning of text-to-SQL with self-correction</article-title>
          ,
          <source>in: Advances in Neural Information Processing Systems</source>
          , volume
          <volume>36</volume>
          ,
          <year>2023</year>
          . URL: https://proceedings.neurips.cc/paper_files/paper/2023/hash/ 72223cc66f63ca1aa59edaec1b3670e6-Abstract-Conference.html.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>S.</given-names>
            <surname>Chaturvedi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Chadha</surname>
          </string-name>
          , L. Bindschaedler,
          <article-title>SQL-of-thought: Multi-agentic text-to-SQL with guided error correction</article-title>
          ,
          <source>in: Deep Learning for Code in the Agentic Era (NeurIPS 2025 Workshop: DL4C)</source>
          ,
          <year>2025</year>
          . URL: https://arxiv.org/abs/2509.00581.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Cube</surname>
          </string-name>
          , Introduction, Cube Documentation,
          <year>2026</year>
          . URL: https://cube.dev/docs/product/introduction, last modified 2026-
          <volume>02</volume>
          -19 (Git history).
          <source>Accessed</source>
          <year>2026</year>
          -
          <volume>02</volume>
          -20.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>dbt</surname>
            <given-names>Labs</given-names>
          </string-name>
          , Inc., dbt semantic layer,
          <source>dbt Developer Hub</source>
          ,
          <year>2026</year>
          . URL: https://docs.getdbt.com/docs/ use
          <article-title>-dbt-semantic-layer/dbt-sl</article-title>
          ,
          <source>last updated 2026-02-19. Accessed</source>
          <year>2026</year>
          -
          <volume>02</volume>
          -20.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>F.</given-names>
            <surname>Bang</surname>
          </string-name>
          ,
          <string-name>
            <surname>GPTCache:</surname>
          </string-name>
          <article-title>An open-source semantic cache for LLM applications enabling faster answers and cost savings</article-title>
          , in: L.
          <string-name>
            <surname>Tan</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Milajevs</surname>
            , G. Chauhan,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Gwinnup</surname>
          </string-name>
          , E. Rippeth (Eds.),
          <source>Proceedings of the 3rd Workshop for Natural Language Processing Open Source Software (NLP-OSS</source>
          <year>2023</year>
          ),
          <article-title>Association for Computational Linguistics</article-title>
          , Singapore,
          <year>2023</year>
          , pp.
          <fpage>212</fpage>
          -
          <lpage>218</lpage>
          . URL: https://aclanthology. org/
          <year>2023</year>
          .nlposs-
          <volume>1</volume>
          .24/. doi:
          <volume>10</volume>
          .18653/v1/
          <year>2023</year>
          .nlposs-
          <volume>1</volume>
          .
          <fpage>24</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>