<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Metadata-Guided Difusion and LLM-Orchestrated Quality Governance for Time Series Imputation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Imane Hocine</string-name>
          <email>imane.hocine@uni.lu</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Asma Abboura</string-name>
          <email>a.abboura@univ-chlef.dz</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Abdelaziz Kella</string-name>
          <email>abdelaziz.kella@lastingdynamics.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maria Hanini</string-name>
          <email>m.hanini@sheffield.ac.uk</email>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Grégoire Danoy</string-name>
          <email>gregoire.danoy@uni.lu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>FSTM/DCS, University of Luxembourg</institution>
          ,
          <addr-line>Esch-sur-Alzette</addr-line>
          ,
          <country country="LU">Luxembourg</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Lasting Dynamics</institution>
          ,
          <addr-line>Las Palmas de Gran Canaria</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Snt, University of Luxembourg</institution>
          ,
          <addr-line>Esch-sur-Alzette</addr-line>
          ,
          <country country="LU">Luxembourg</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University of Hassiba Benbouali</institution>
          ,
          <addr-line>Chlef</addr-line>
          ,
          <country country="DZ">Algeria</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>University of Shefield</institution>
          ,
          <addr-line>Shefield</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2026</year>
      </pub-date>
      <abstract>
        <p>High-quality time series (TS) data is essential for reliable analytics, forecasting, and knowledge-driven systems. In operational settings, however, TS are frequently degraded by missing values arising from sensor faults, intermittent connectivity, maintenance activities, and extended partial-blackout events. Whilst recent difusionbased models have improved imputation accuracy, they remain largely signal-centric, make limited use of semantic and operational metadata, and provide little support for data quality considerations.</p>
      </abstract>
      <kwd-group>
        <kwd>Time series imputation</kwd>
        <kwd>Knowledge graph</kwd>
        <kwd>Metadata</kwd>
        <kwd>Difusion models</kwd>
        <kwd>LLM orchestration</kwd>
        <kwd>Data quality</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Imagine a city relying on a network of trafic sensors to manage congestion in real time. When multiple
sensors fail due to maintenance or network disruptions, the system must reconstruct missing
measurements to make safe and eficient decisions. Similarly, satellite time series TS used for vegetation
monitoring or climate analysis are frequently incomplete due to cloud cover, sensor outages, or
acquisition constraints. In both cases, missing time series values threaten operational reliability, safety, and
trust in downstream analytics and decision-making pipelines.</p>
      <p>
        Classical imputation methods use interpolation and state-space models [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Neural network
approaches apply recurrent architectures and transformers to learn temporal dependencies [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ].
Graphbased methods exploit spatial and functional relationships between sensors, propagating information
across correlated entities [
        <xref ref-type="bibr" rid="ref4 ref5 ref6">4, 5, 6</xref>
        ]. More recently, difusion models have been applied to TS imputation
[
        <xref ref-type="bibr" rid="ref4 ref6 ref7">7, 6, 8, 9, 4, 10, 11</xref>
        ], generating probabilistic reconstructions through iterative denoising processes.
These approaches face a fundamental limitation in controllable generation [11, 12]. Difusion models
learn from observed correlations in training data. Graph-based methods encode relationships through
Published in the Proceedings of the Workshops of the EDBT/ICDT 2026 Joint Conference (March 24-27, 2026), Tampere, Finland
(M. Hanini); 0000-0001-9419-4210 (G. Danoy)
      </p>
      <p>CEUR
Workshop</p>
      <p>ISSN1613-0073
ifxed topologies derived from distance metrics or learned correlations. Neither can systematically access
operational constraints that determine which conditioning sources are semantically valid.</p>
      <p>Moreover, they operate on raw signal values and remain signal-centric. Relevant dependencies are
either assumed observable in the data or implicitly learnable from correlations. In practice, many
critical relationships are implicit and external to the signal [13, 12]. For instance, sensor networks
contain structured metadata that is not present in raw measurements, including device specifications,
operational logs, and quality assessments. This metadata determines which sensors provide valid
conditioning context. Optical and radar satellites measure diferent physical properties. Using radar to
impute optical observations may produce smooth interpolations that violate spectral consistency. A
recently recalibrated sensor may have shifted its measurement baseline. Conditioning on its historical
values introduces bias. As such, when context is absent or implicit, imputation models may produce
statistically plausible but operationally invalid reconstructions. Specifically, difusion models may
condition on unreliable or invalid sources.</p>
      <p>These failures may well occur in reality. In trafic management, imputing flow measurements from a
highway sensor using an unrelated arterial sensor can overestimate throughput capacity, triggering
unsafe signal timing decisions. In satellite-based crop monitoring, conditioning optical vegetation
indices on radar input produces incoherent values that misclassify crop health, propagating errors
into yield forecasts. When such imputed values are ingested into knowledge graphs for downstream
reasoning, the damage compounds. Invalid upstream reconstructions become trusted facts in the graph,
silently degrading every query and inference built on them. These failure modes cannot be resolved by
improving model capacity alone. They require access to external metadata to determine which sources
are semantically valid for conditioning.</p>
      <p>To address this gap, we envision a metadata- and governance-aware framework for TS imputation.
The system combines a metadata knowledge graph (KG), subgraph-conditioned difusion, and large
language model (LLM)-based orchestration. This vision directly addresses key challenges in hybrid
KG–LLM ecosystems by improving the fidelity of data streams feeding KGs, reducing downstream
reasoning errors caused by poor upstream data quality, and supporting human-in-the-loop governance
[10].</p>
      <sec id="sec-1-1">
        <title>Hypothesis and Vision</title>
        <p>
          Our central hypothesis is that externalising implicit relationships in a KG and conditioning difusion
models on quality-filtered subgraphs improves imputation under structured missingness, heterogeneous
sensors, and evolving operational conditions. Learning-based models capture temporal and
crossvariable patterns from observed data. They cannot infer operational semantics, provenance, or quality
constraints that are not present in raw signals [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. Data-driven imputation degrades when correlations
are absent, when sensors are heterogeneous, or when operational context changes. We therefore
propose treating metadata as a first-class artefact, external to both data and models, and representing it
in a KG that is explicit, queryable, and evolvable [14, 15]. Within this paradigm: (i)Difusion models
perform statistical reconstruction, generating candidate imputations conditioned on observed data and
selected context [
          <xref ref-type="bibr" rid="ref7">7, 9, 16</xref>
          ]. (ii)Knowledge graphs externalise implicit relationships, enforce semantic
and quality constraints, and encode provenance signals [14, 15]. (iii)LLM-based agents orchestrate the
workflow and generate human-readable explanations for audit and governance [ 17, 18]. Separating
statistical reconstruction from semantic and operational reasoning allows the system to adapt to evolving
conditions without retraining core models. The framework does not replace state-of-the-art difusion
methods; it provides a conditioning and governance layer that makes implicit metadata explicit during
generation. This reframes TS imputation as a governed, explainable component of data ecosystems
rather than an isolated preprocessing step. This paper describes the system architecture and identifies
open technical challenges. Implementation is in progress. The design targets scenarios where structured
metadata exists and quality governance is required.
        </p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        TS imputation has been widely studied. Classical imputation methods include interpolation, Kalman
filtering, and state-space models [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. These methods provide interpretability and computational eficiency.
Yet, they fail on high-dimensional data with non-linear dependencies and complex missing patterns.
Neural architectures using recurrent networks and Transformers [16, 19] capture non-linear dynamics
through learnt representations. These models treat each series independently or learn correlations from
data. They do not exploit explicit metadata about sensor properties, operational constraints, or quality
indicators. Difusion models represent a recent advance in generative imputation [
        <xref ref-type="bibr" rid="ref6 ref7">7, 6, 8, 9, 16</xref>
        ]. These
models generate probabilistic reconstructions by modelling temporal and cross-variable dependencies
through iterative denoising. Recent work extends difusion to forecasting and general TS generation
[16, 10, 11]. Conditioning remains limited to observed measurements and patterns learnt from training
data. External metadata specifying sensor modality, operational constraints, or quality indicators is not
incorporated.
      </p>
      <p>
        Graph-based methods propagate information across correlated sensors using GNNs, hypergraphs, or
attention mechanisms [
        <xref ref-type="bibr" rid="ref4 ref5 ref6">4, 5, 6</xref>
        ], capturing spatial, functional, and topological dependencies. Yet, they
often rely on fixed graphs derived from heuristics and rarely exploit rich metadata or quality indicators,
limiting interpretability and adaptability. General-purpose TS architectures such as TimeMixer++
mainly emphasise representation capacity across TS tasks rather than metadata conditioning [10].
      </p>
      <p>A parallel line of work uses knowledge graphs to encode structured metadata, provenance, and
relational constraints [14, 15]. Recent research explores LLMs as orchestrators for KG workflows,
including construction from unstructured text, subgraph selection, and explanation generation [17, 18].
Prior work focuses on KG construction, completion, retrieval-augmented generation, and post-hoc
reasoning, with few approaches guiding TS imputation or producing auditable quality narratives.</p>
      <p>Existing methods address generative modelling, relational reasoning, or workflow orchestration
independently. The proposed paradigm combines difusion-based imputation, metadata-driven KG, and
LLM orchestration in a single framework for governed TS reconstruction.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Framework Architecture</title>
      <p>The framework comprises three-layer hybrid system operating on diferent data representations. The
metadata knowledge graph maintains sensor specifications, quality indicators, and relationships.
Difusion models generate imputations conditioned on observed measurements and graph-derived context.
LLM agents query the graph, configure imputation parameters, and produce audit trails. Figure 1
shows the data flow. When imputation is requested for sensor  at time  , the system would extract
a quality-filtered subgraph  sub containing candidate conditioning sensors. A graph neural network
computes embeddings injected into the difusion model’s denoising process. Generated samples are
validated against metadata graph constraints. An LLM agent documents which sensors were retained,
and why.</p>
      <sec id="sec-3-1">
        <title>3.1. Metadata Knowledge Graph</title>
        <p>The metadata knowledge graph would store sensor properties, quality indicators, and operational
constraints. Raw TS remain in specialised time-series databases. This separation addresses scalability
using a graph database (such as Neo4j) that handles thousands of sensor entities and their relationships,
whilst numerical storage manages high-frequency measurements. Sensors are represented as nodes
with attributes such as measurement type, physical units, operating range, calibration dates, and
reliability indicators. Contextual entities capture geographic regions, infrastructure components, and
asset hierarchies. Edges encode relationships such as spatial proximity, network topology, functional
similarity, and maintenance dependencies, while partitions summarise similar behaviors or dynamics.</p>
        <p>From a technical perspective, the KG uses a lightweight property-graph schema rather than a full
ontology. Core node types include Sensor, MeasurementType, Location, Asset, and QualityEvent.</p>
        <p>Sensor nodes store attributes such as modality, physical units, operating range, calibration date, and
reliability indicators. Edges encode MEASURES, LOCATED_IN, ADJACENT_TO, FUNCTIONALLY_RELATED,
and AFFECTED_BY, enabling traversal over specified relationships. Quality constraints are enforced
using query-level filtering, allowing the schema to evolve with operational needs.</p>
        <p>In doing so, the graph externalises relationships that learning-based models cannot infer from raw
signals. A GNN trained on TS may learn that sensors A and B are correlated. It cannot determine
whether A measures temperature in Celsius whilst B measures humidity as a percentage, rendering them
semantically incompatible for direct conditioning. The graph encodes modality explicitly. Crucially, the
system maintains an explicit, queryable, and evolvable representation of metadata without handling
highvolume numerical data, enforcing a clean separation between statistical reconstruction and semantic
reasoning.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Quality-Aware Subgraph Extraction</title>
        <p>For each imputation task targeting sensor  over time window [ 1,  2], we extract a task-specific subgraph
 sub = ( sub,  sub) that defines admissible conditioning context. Candidate neighbors are first identified
through graph traversal over spatial, topological, and similarity relations, and then filtered based
on quality indicators (e.g., missingness rates, anomaly flags) and semantic compatibility (e.g., sensor
modality or measurement units).</p>
        <p>This approach replaces heuristic neighbor selection (e.g.,  -nearest by distance) with
metadatagoverned decisions. Each edge in  sub has an explicit justification, and operators can inspect, override,
or revise these rules through graph queries without retraining the difusion model. Thresholds and
traversal depth  are configurable at query time, enabling transparent and auditable context selection.
Using metadata for subgraph extraction, the framework ensures that conditioning context is semantically
and operationally valid. This enables flexibility and explainability in the imputation process.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. KG-Conditioned Difusion Imputation</title>
        <p>In standard difusion-based TS imputation,   denotes the noisy latent at difusion step  and  obs the
observed measurements. The model learns the reverse process:</p>
        <p>( −1 ∣   ,  obs).</p>
        <p>Our framework extends this formulation by conditioning the generation process on the metadata
subgraph:</p>
        <p>( −1 ∣   ,  obs,  sub).</p>
        <p>Embeddings of  sub are calculated using a graph neural network that encodes relational
structure, quality indicators, and semantic attributes. These embeddings are injected into the difusion
model’s denoising network. Implementation options include concatenation with intermediate features,
cross-attention mechanisms, or message-passing layers. The choice depends on the specific difusion
architecture and computational constraints.</p>
        <p>Constraint enforcement combines soft penalties during training with hard clipping at sampling time.</p>
        <p>Let ℒdifusion denote the standard difusion objective and ℒconstraint a penalty for violations of
sensor-specific bounds [  min,   max] encoded in the metadata graph. The resulting training objective is
ℒ = ℒdifusion + ℒ constraint.</p>
        <p>This design allows the knowledge graph and difusion model to evolve independently. Adding new
sensor types requires updating graph schema, not retraining the denoising network. Changing quality
thresholds modifies subgraph extraction rules without touching model parameters.</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. LLM-Orchestrated Workflow</title>
        <p>LLMs orchestrate the imputation pipeline through specialised agents that query the knowledge graph
and coordinate workflow steps.</p>
        <p>The KG-construction agent updates metadata from maintenance logs, calibration records, and
operational documentation. It parses unstructured text to extract structured facts and writes them to
the graph whilst tracking provenance.</p>
        <p>The imputation-planning agent receives imputation requests and queries the graph for candidate
conditioning sensors. It adapts subgraph extraction based on missingness patterns and metadata quality.</p>
        <p>The quality-assessment agent evaluates generated reconstructions against metadata-derived
constraints. It checks whether values fall within valid operating ranges, assesses temporal consistency
with adjacent observations, and compares against ground truth when available. Quality metrics are
written back to the graph as provenance records.</p>
        <p>The explanation agent generates human-readable narratives documenting imputation decisions.
For each reconstruction, it describes which sources, constraints, and reliability factors influenced each
imputation. Explanations reference only entities and attributes retrieved from the graph to prevent
hallucinated claims.</p>
        <p>Agent interactions with graph databases use generated Cypher queries. To ground LLM outputs in
factual metadata, agents receive graph query results in their prompts and are constrained to reference
only retrieved entities. If an agent generates ungrounded claims, the system detects this through entity
linking and retries with stricter constraints. Agent interactions follow a retrieve-then-generate pattern.
For each task, the relevant agent first issues a structured query to retrieve candidate entities and their
attributes from the KG. The query results are then injected into the agent’s prompt as the sole factual
context. To prevent hallucination, agent outputs undergo entity-linking verification and every sensor,
attribute, or event referenced in the response is checked against the query result set. If ungrounded
references are detected, the agent is re-prompted with the specific violation. This
retrieve-constrainverify loop applies uniformly across all agents. The orchestration layer provides transparency for
human oversight and structured provenance for downstream reasoning. The modular agent design
allows individual components to be updated or replaced as LLM capabilities evolve.</p>
      </sec>
      <sec id="sec-3-5">
        <title>3.5. Operational interaction between components</title>
        <p>Components operate at distinct abstraction levels. The KG provides a task-specific quality-filtered
subgraph. A lightweight GNN encodes this subgraph into embeddings representing semantic
compatibility, quality indicators, and relational structure. LLM agents orchestrate the workflow and generate
explanations grounded in KG facts but do not participate in numerical generation.</p>
      </sec>
      <sec id="sec-3-6">
        <title>3.6. Illustrative Example</title>
        <p>Consider a satellite-derived TS for a fixed geographic tile observed across five acquisition times. Four
cloud contamination, creating structured corruption as shown in Table 1.
sources provide measurements: a primary optical sensor  1, an auxiliary optical sensor  2, a radar
sensor  , and an adjacent tile  observed by  1. At time  3, the measurement from  1 is invalid due to</p>
        <p>Let  denote the multivariate series over { 1,  2, , }
excluding  1( 3). A task-specific metadata subgraph  sub is extracted from the knowledge graph in
Figure 2 by retaining only semantically compatible and quality-valid sensors. In this example,  2 and
 constitute admissible conditioning signals, while  1 is excluded due to cloud invalidation and 
due to modality incompatibility with optical vegetation indices. Conditioning the difusion model
on ( obs,  sub) yields a posterior distribution over the missing value. The imputed value  ̂ KG
( 3) is
interpreted as a point summary (e.g., conditional mean, MAP) derived under KG-governed context
selection. Raw observations alone do not explain why  1( 3) is missing nor which alternative sources
should be trusted. The metadata KG provides essential context: (i)  1( 3) is flagged as invalid due to
cloud contamination, (ii)  is a radar sensor and therefore not compatible with optical reflectance, and
(iii) the adjacent tile  exhibits historically strong spatial afinity with the target tile. Figure
2 illustrates
how these semantic constraints are encoded and operationalised for context selection. A signal-centric
approach would average all available sources, including semantically incompatible ones such as  ,
leading to biased or incoherent estimates. In contrast, the framework excludes invalid sources and
conditions only on admissible signals. This estimate respects temporal continuity and is accompanied
by explicit, metadata-grounded justification. This example illustrates how
metadata knowledge graphs
encode provenance, quality, and semantic constraints that are absent from raw observations. For illustration,
a simplified weighted combination gives:  ̂ KG
( 3) = 0.7 ⋅  2( 3) + 0.3 ⋅ ( 3) = 0.623.</p>
        <p>and  obs the set of all observed entries</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Discussion</title>
      <p>This framework demonstrates how KGs can govern generative imputation in operational data systems.
Three architectural decisions distinguish this approach from signal-only methods. Difusion models
receive explicit semantic constraints rather than inferring them from correlations. This is expected to
improve robustness when training correlations are absent, misleading, or outdated. The KG externalises
what models cannot learn from measurements alone [13].</p>
      <p>Statistical reconstruction, semantic validation, and workflow coordination operate independently.
New sensors, updated calibration schedules, or revised quality policies modify graph content without
retraining difusion models. Human operators inspect and override subgraph extraction rules through
graph queries. This modularity supports adaptation in evolving deployments. Additionally, LLM agents
document which sensors were included or excluded, which constraints were enforced, and which quality
measures determined subgraph membership. Explanations reference graph entities, enabling systems
to trace provenance.</p>
      <p>The framework addresses a gap in hybrid KG–LLM ecosystems by allowing metadata governance
at the imputation stage. This reduces the risk of poor TS fidelity propagating errors into reasoning
systems. The design pattern generalises beyond TS. Domains where raw measurements require semantic
validation (medical sensors, financial data, environmental monitoring) could apply metadata-conditioned
generation. The specific technologies (Neo4j, difusion models, LLM agents) are instantiation choices.
The principle is separating what models learn from data, from what systems enforce through metadata.</p>
      <p>The proposed approach is designed to minimise training and fine-tuning overhead. Difusion models
are trained or reused independently of the knowledge graph and LLM components. The graph neural
network operates on compact subgraphs rather than the full metadata graph to lower inference costs.
Since metadata evolves more slowly than raw time series, graph embeddings can be cached or
incrementally updated. LLMs are only used at inference time without fine tuning. The approach is feasible
in environments where retraining large models is costly or impractical. It is most valuable in domains
where structured metadata exists alongside time series measurements, sensors are heterogeneous in
their characteristics, and governance is required. For homogeneous univariate time series without
operational metadata, the metadata layer adds limited value and standard imputation methods could
sufice. The framework’s benefit scales with metadata richness. The more operational context is
available in the KG, the greater the improvement over traditional approaches. The specific technology
choices are instantiation decisions, and the core principle of separating statistical reconstruction from
metadata-governed context selection is architecture-agnostic.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion &amp; Open Challenges</title>
      <p>This vision paper opens research directions for difusion-based time series imputation by explicitly
integrating metadata as conditioning context. Existing difusion models learn temporal dependencies
from observed correlations but cannot infer operational semantics, quality constraints, or provenance
from raw measurements. We propose externalising this knowledge in queryable graphs and conditioning
generation on quality-filtered subgraphs. This reframes imputation as a process constrained by semantic
validity and not temporal plausibility alone.</p>
      <p>Implementation is underway integrating KGs with difusion-based imputation models. Evaluation
will assess whether metadata and KG-conditioned imputation improves robustness on real-world
datasets, including both regular and irregular time series. Envisaged baselines include Yun et al. [8]
and Difusion-TS [ 16]. Considered metrics for evaluation will include RMSE/MAE, metadata-aware
criteria covering constraint violations, structured missingness robustness, and fidelity of LLM-generated
explanations.</p>
      <p>Notable challenges remain open. Subgraph extraction and graph embedding add computational
overhead. This calls for latency analysis for high-frequency streams, and caching/incremental updates
need exploration. The framework assumes reasonably accurate metadata, yet operational systems
contain outdated calibration records, incorrect specifications, and stale quality indicators. Degradation
under these errors remains an open problem. Potential approaches include uncertainty-aware graph
queries, metadata validation pipelines, and human-in-the-loop verification for high-stakes imputations.
Similarly, explanation agents must avoid hallucinating facts not present in the knowledge graph.
Structured generation and constrained decoding may improve reliability, but formal verification that
references only retrieved entities is still unsolved.</p>
      <p>Existing imputation benchmarks measure reconstruction error on held-out test data, but do not assess
semantic validity, explainability quality, or governance efectiveness. Evaluation frameworks capturing
these dimensions are needed.</p>
      <p>Additionally, sensor networks lack common vocabularies for quality indicators, calibration
procedures, and operational constraints. Domain-specific ontologies exist, but integrating heterogeneous
deployments requires schema alignment and entity resolution. KG construction from unstructured logs
remains a challenging extraction problem.</p>
      <p>These challenges define a research agenda for trustworthy imputation in operational settings.
Translating this vision into production systems requires addressing computational eficiency, metadata
quality assurance, and evaluation protocols that measure semantic validity and auditability beyond
reconstruction error.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Acknowledgement</title>
      <p>This work was supported by the Luxembourg National Research Fund (FNR) &amp; the National Centre
for Research and Development (NCBR) under the SERENITY Project (ref. C22/IS/17395419;
POLLUXXI/15/Serenity/2023)</p>
    </sec>
    <sec id="sec-7">
      <title>Declaration on Generative AI</title>
      <p>The authors declare that generative AI tools (ChatGPT) were used for language refinement, including
Grammar and spelling check, improve writing style, and peer review simulation. All content was
subsequently reviewed and edited by the authors, who take full responsibility for the accuracy, originality,
and claims presented in this work.
[8] T. Yun, H. Jung, J. Son, Imputation as inpainting: Difusion models for spatiotemporal data
imputation (2023).
[9] M. R. U. Islam, P. Tadepalli, A. Fern, Self-attention-based difusion model for time-series imputation
in partial blackout scenarios, in: Proceedings of the AAAI Conference on Artificial Intelligence,
volume 39, 2025, pp. 17564–17572.
[10] S. Wang, J. Li, X. Shi, Z. Ye, B. Mo, W. Lin, S. Ju, Z. Chu, M. Jin, Timemixer++: A general time
series pattern machine for universal predictive analysis, arXiv preprint (2024). ArXiv:2410.16032.
[11] R. Jiang, G.-C. Zheng, T. Li, T.-R. Yang, J.-D. Wang, X. Li, A survey of multimodal controllable
difusion models, Journal of Computer Science and Technology 39 (2024) 509–541.
[12] J. Hoelscher-Obermaier, E. Stevinson, V. Stauber, I. Zhelev, V. Botev, R. Wu, J. Minton, Leveraging
knowledge graphs to update scientific word embeddings using latent semantic imputation, in:
Proceedings of the first Workshop on Information Extraction from Scientific Publications, 2022,
pp. 43–53.
[13] Y. Liu, G. Shen, N. Liu, X. Han, Z. Xu, J. Zhou, X. Kong, Trafic data imputation via knowledge
graph-enhanced generative adversarial network, PeerJ Computer Science 10 (2024) e2408.
[14] A. Hogan, E. Blomqvist, M. Cochez, C. d’Amato, G. D. Melo, C. Gutierrez, S. Kirrane, J. E. L. Gayo,</p>
      <p>R. Navigli, S. Neumaier, et al., Knowledge graphs, ACM Computing Surveys (Csur) 54 (2021) 1–37.
[15] S. Ji, S. Pan, E. Cambria, P. Marttinen, P. S. Yu, A survey on knowledge graphs: Representation,
acquisition, and applications, IEEE Transactions on Neural Networks and Learning Systems 33
(2022) 494–514.
[16] X. Yuan, Y. Qiao, Difusion-ts: Interpretable difusion for general time series generation, in:</p>
      <p>International Conference on Learning Representations (ICLR), 2024.
[17] Y. Zhu, X. Wang, J. Chen, S. Qiao, Y. Ou, Y. Yao, S. Deng, H. Chen, N. Zhang, Llms for knowledge
graph construction and reasoning: Recent capabilities and future opportunities, arXiv preprint
arXiv:2305.13168 (2024). V4, Dec 2024.
[18] D. Edge, H. Trinh, N. Cheng, J. Bradley, A. Chao, A. Mody, S. Truitt, D. Metropolitansky, R. O. Ness,
J. Larson, From local to global: A graph rag approach to query-focused summarization, arXiv
preprint arXiv:2404.16130 (2024).
[19] H. Wu, J. Xu, J. Wang, M. Long, Autoformer: Decomposition transformers with auto-correlation
for long-term series forecasting, NeurIPS 34 (2021) 22419–22430.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Hagiwara</surname>
          </string-name>
          , et al.,
          <article-title>Time series analysis for the state-space model with</article-title>
          R/Stan, Springer,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>W.</given-names>
            <surname>Cao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Li</surname>
          </string-name>
          , et al.,
          <article-title>Brits: Bidirectional recurrent imputation for time series</article-title>
          ,
          <source>NeurIPS</source>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Che</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Purushotham</surname>
          </string-name>
          , et al.,
          <article-title>Gru-d: Handling missing data with recurrent neural networks, ICML (</article-title>
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>B.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Luo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lv</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Ma</surname>
          </string-name>
          , Hyperimts:
          <article-title>Hypergraph neural network for irregular multivariate time series forecasting</article-title>
          ,
          <source>in: International Conference on Machine Learning (ICML)</source>
          ,
          <year>2025</year>
          . ArXiv:
          <volume>2505</volume>
          .
          <fpage>17431</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Si</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Lyu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tang</surname>
          </string-name>
          , Hgts-former:
          <article-title>Hierarchical hypergraph transformer for multivariate time series analysis</article-title>
          ,
          <source>arXiv preprint</source>
          (
          <year>2025</year>
          ). ArXiv:
          <volume>2508</volume>
          .
          <fpage>02411</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Feng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Du</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Fu</surname>
          </string-name>
          ,
          <article-title>Pristi: A conditional difusion framework for spatiotemporal imputation</article-title>
          ,
          <source>in: 2023 IEEE 39th international conference on data engineering (ICDE)</source>
          , IEEE,
          <year>2023</year>
          , pp.
          <fpage>1927</fpage>
          -
          <lpage>1939</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tashiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Song</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Song</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ermon</surname>
          </string-name>
          , Csdi:
          <article-title>Conditional score-based difusion models for probabilistic time series imputation</article-title>
          ,
          <source>NeurIPS</source>
          <volume>34</volume>
          (
          <year>2021</year>
          )
          <fpage>24804</fpage>
          -
          <lpage>24816</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>