1. Introduction

1613-0073

Compliance: A Multi-Agent Schema-Light Knowledge Graph for Regulatory Compliance QA

Hemant Sunil Jomraj

hjomraj@mastercontrol.com 0 1

Bhavik Agarwal

bagarwal@mastercontrol.com 0 1

Viktoria Rojkova

vrojkova@mastercontrol.com 0 1

Workshop

Regulatory Compliance, Knowledge Graph, Large Language Model, Ontology Free

0 MasterControl AI Research, MasterControl , 6350 South 3000 East, Salt Lake City, Utah , USA 1 Section Overlap @ 0.50 Section Overlap @ 0.60 Section Overlap @ 0.75 Answer Accuracy (avg) Avg. Degree Avg. Shortest Path

2025

2 6

Regulatory QA demands precise, verifiable answers grounded in domain text. We present a multi‑agent framework that fuses a schema light (ontology minimal) knowledge graph of subject-predicate-object (SPO) triplets with retrieval‑augmented generation (RAG). Agents continuously extract, normalize, and deduplicate triplets from regulatory documents; each triplet is embedded and stored, together with linked source segments and metadata, in a unified vector index. At query time, triplet level retrieval aligns user intent with concise “who‑did‑what‑to‑whom” facts and returns both the triplets and their provenance text to an LLM for answer synthesis. In complex regulatory queries, the system improves traceability and supports subgraph visualization, while achieving higher strict‑threshold section overlap and better graph connectivity versus text-only baselines.

Compliance

1. Introduction

Regulated domains (e.g., health and life sciences) demand high precision, verifiability, and domain grounding in QA[ 1, 2, 3 ]. General LLMs, including recent model families [ 4, 5, 6 ], excel in language but risk hallucinations [ 7, 8, 9 ], especially where compliance evidence and provenance are required [ 10 ]. We propose a practical system combining: (i) schema-light triplet extraction and KG maintenance [ 11 ], (ii) a unified vector store with triplets and source text, and (iii) a multi-agent QA pipeline that retrieves at the triplet level and returns answers with verifiable evidence.Our contributions are based on knowledge graph methods [ 12, 13, 14, 15 ] and regulatory KG/RAG applications [ 16, 17, 18, 19, 20, 21 ].

2.1. Units, extraction, and provenance

The regulatory text is segmented into atomic sections ( : C→X={x1, … , }, then an extraction pipeline produces SPO triplets Φ(Ω()) = { = ( , , )}. The provenance is captured by Λ ∶ → 2 , mapping each to one or more source sections for auditability. Open IE and related practices inform the extraction side [ 22, 23 ], with open-world learning and schema emergence supported by previous work [24, 25]. Canonicalization and entity linking address vocabulary fragmentation [26, 27], while ontology-driven precedents [28] and community KGs [29, 30] motivate minimal, reusable meta-relations.

2.2. Embedding and unified index

search [31]. Queries are embedded as . (

Each triplet is rendered as text ( ), embedded via a transformer encoder into ∈ ℝ ; we store , , Λ( )) in a vector index. The density retrieval choices are inspired by DPR and modern similarity

CEUR

ceur-ws.org

2.3. Triplet-first retrieval with text evidence

We compute = TopK(sim( , )) and recover evidence = ⋃ ∈ Λ( ), then pass (, , ) to an LLM to generate the answer . This implements RAG [ 11 ] with structured facts to reduce hallucinations [32], and has shown utility in healthcare / pharmaceutical QA [ 21, 20 ].

2.4. Design notes

We supplement the answers with an interactive subgraph of the retrieved triplets (Figure 1) to expose how the evidence pieces connect. This improves user trust and supports auditability. The completeness / consistency of , retrieval suficiency, and auditable provenance are central. The schema light choice accelerates ingestion while relying on canonicalization to temper emergent vocabularies [26, 27].

3. Multi-Agent Architecture

We deploy specialized agents for ingestion, extraction, normalization/cleaning, indexing, retrieval, storybuilding, and generation (Figure 2). This follows established multi-agent design principles for modularity and scalability [33, 34, 35, 36] and is in line with recent regulatory KG/RAG systems [ 16, 17, 19, 18 ].

Without Triplets

With Triplets

4. Evaluation 4.1. Protocol

We sample target sections ′ ⊂ , build a ground-truth story per section by concatenating related mentions, generate Q/A with an LLM, and compare our system’s retrieval and answers against these references. This mirrors open-domain QA/RAG setups [ 11, 31 ] while focusing on regulatory corpora [ 16, 19 ].

4.2. Metrics

Section-level overlap. For = { } ∪ ( ) and retrieved , , ( , , ) = a similarity threshold for near-matches.

Factual correctness. A secondary judge (LLM or expert) marks ⋆ consistent with the ground truth story; structured facts are expected to reduce hallucination [ 7, 32 ].

Navigation. For sections and ℓ ∈ ( ), let () be extracted triplets. We compute Nav( ′) = 1 ∑=1 ∑∑ ℓℓ ∈∈(( )) || (( ))∩∪ (( ℓℓ ))|| and graph connectivity (avg. degree, shortest path). | , ∩ | , optionally with | , |

5. Discussion and Limitations

Schema-light design. Fast ingestion and adaptability come with vocabulary fragmentation; the emergence of selective schema plus canonicalization mitigates this [26, 27].

Extraction quality. Regulatory jargon and cross references may produce missing / noisy triplets; iterative curation and weak supervision help. Temporal/conditional logic may need rules beyond SPO.

Eficiency. Large, changing corpora benefit from incremental updates and eficient vector/graph indexing. The approach complements the domain-specific RAG work [ 21, 20 ] and the regulatory deployments of KG / RAG [ 16, 19, 17, 18 ].

6. Conclusion

A schema light KG with triplet-first retrieval and textual evidence has been successfully deployed at scale. It supports numerous compliance professionals with precise and auditable QA in regulatory domains, addressing known LLM risks [ 7, 8, 9, 10 ], with user counts rapidly expanding across regulatory teams (https://www.prnewswire.com/news-releases/mastercontrol-launches-ai-powered-regulatorychat-to-simplify-compliance-navigation-for-life-sciences-manufacturers-302533086.html). Future work includes deeper temporal/conditional reasoning and tighter human-in-the-loop curation.

Declaration on Generative AI

During the preparation of this work, the author(s) used Overleaf AI Assist paid context-aware edits to improve grammar, spelling, word choice, and sentence structure, all specifically trained for academic writing. After using these tool(s)/service(s), the author(s) reviewed and edited the content as needed and take(s) full responsibility for the publication’s content. [24] A. Carlson, J. Betteridge, B. Kisiel, B. Settles, E. R. Hruschka Jr., T. M. Mitchell, Toward an architecture for never-ending language learning, in: Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI), 2010, pp. 1306–1313. [25] S. Riedel, L. Yao, A. McCallum, Relation extraction with matrix factorization and universal schemas, in: Proceedings of NAACL-HLT, 2013, pp. 74–84. [26] L. Galárraga, C. Teflioudi, K. Hose, F. M. Suchanek, Canonicalizing open knowledge bases, in: Proceedings of the 23rd ACM International Conference on Information and Knowledge Management (CIKM), 2014, pp. 1679–1688. [27] W. Shen, J. Wang, P. Luo, M. Wang, A survey on entity linking: Methods, techniques, and applications, IEEE Transactions on Knowledge and Data Engineering 27 (2014) 443–460. [28] F. Probst, S. Eck, W. Kuhn, Scalable semantics: A case study on ontology-driven geographic information integration, International Journal of Geographical Information Science 20 (2006) 563–583. [29] J. Lehmann, et al., Dbpedia – a large-scale, multilingual knowledge base extracted from wikipedia,

Semantic Web (2015). [30] F. M. Suchanek, G. Kasneci, G. Weikum, Yago: A core of semantic knowledge unifying wikipedia and wordnet, in: Proceedings of the 16th International Conference on World Wide Web (WWW), 2007, pp. 697–706. [31] V. Karpukhin, et al., Dense passage retrieval for open-domain question answering, 2020. ArXiv preprint. [32] J. Li, et al., Enhancing llm factual accuracy with rag to counter hallucinations: A case study on domain-specific queries in private knowledge-bases, 2024. ArXiv preprint. [33] Y. Shoham, K. Leyton-Brown, Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations, Cambridge University Press, 2008. URL: https://www.eecs.harvard.edu/cs286r/courses/ fall08/files/SLB.pdf. [34] M. Wooldridge, An Introduction to MultiAgent Systems, 2 ed., Wiley, 2009. [35] G. Weiss, Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence, MIT

Press, 2000. URL: https://ieeexplore.ieee.org/book/6267355. [36] A. Zygmunt, et al., Agent-based environment for knowledge integration, 2013. ArXiv preprint.

[1]

U.S.

Food and

Drug

Administration , Fda guidance documents, 2025 .

[2]

J. J.

Cordes ,

S. E.

Dudley , L. Washington, Regulatory Compliance Burden, Technical Report, GW Regulatory Studies Center , 2022 .

[3]

Han , A . Ceross,

Bergmann , More than red tape: Exploring complexity in medical device regulatory afairs , BMJ Innovations ( 2024 ).

[4]

Zhong , et al., Evaluation of openai o1: Opportunities and challenges of agi , 2024 . ArXiv preprint.

[5]

Yang , et al., Qwen2.5 technical report , 2024 . ArXiv preprint.

[6]

Abdin , et al., Phi-4 technical report , 2024 . ArXiv preprint.

[7]

Ji , et al., Survey of hallucination in natural language generation , 2024 . ArXiv preprint.

[8]

Ling , et al., Domain specialization as the key to make large language models disruptive: A comprehensive survey , 2024 . ArXiv preprint.

[9]

Wang , S. Zhang, Large language models in medical and healthcare fields: Applications, advances, and challenges , Artificial Intelligence Review ( 2024 ).

[10] J. B. Hakim , et al., The need for guardrails with large language models in medical safety-critical settings: An artificial intelligence application in the pharmacovigilance ecosystem , 2024 . ArXiv preprint.

[11]

Lewis , et al., Retrieval-augmented generation for knowledge-intensive nlp tasks , 2021 . ArXiv preprint.

[12]

Hogan , E. Blomqvist,

Cochez , et al., Knowledge

graphs

, 2021 . ArXiv preprint.

[13]

Hogan , E. Blomqvist,

Cochez , et al., Knowledge Graphs , volume 12 of Synthesis Lectures on Data, Semantics, and Knowledge, Morgan & Claypool, 2021 .

[14]

Nickel , et al., A review of relational machine learning for knowledge graphs , 2015 . ArXiv preprint.

[15]

Chen , et al., A review: Knowledge reasoning over knowledge graph , Expert Systems with Applications ( 2020 ).

[16]

Ershov , A case study for compliance as code with graphs and language models , 2023 . ArXiv preprint.

[17]

Chattoraj , et al., Semantically Rich Approach to Automating Regulations of Medical Devices, Technical Report , University of Maryland, Baltimore County, 2024 .

[18]

Xiang , et al., Integrating knowledge graph and large language model for safety management regulatory texts , in: Lecture Notes in Computer Science , volume 14250 , 2025 , pp. 976 - 988 .

[19]

Hillebrand , et al., Advancing risk and quality assurance: A rag chatbot for improved regulatory compliance , 2024 . Available on IEEE Xplore.

[20]

Kim ,

Min , From rag to qa-rag: Integrating generative ai for pharmaceutical regulatory compliance process , 2024 . ArXiv preprint.

[21]

Yang , et al., Retrieval-augmented generation for generative artificial intelligence in health care, npj Digital Medicine ( 2025 ).

[22]

Etzioni ,

Fader ,

Christensen ,

Soderland , Mausam, Open information extraction: The second generation , in: Proceedings of the 22nd International Joint Conference on Artificial Intelligence (IJCAI) , 2011 , pp. 3 - 10 .

[23]

Fader ,

Soderland ,

Etzioni , Open information extraction for the web , Communications of the ACM 57 ( 2014 ) 80 - 86 .