Refinement of the COHESIVE Information System towards a Unified Ontology of Food Terms for the Public Health Organizations Iolanda Mangone 1, Nicolas Radomski 1*, Adriano Di Pasquale 1, Andrea Santurbano 2, Paolo Calistri 1, Cesare Cammà 1and Kitty Maassen 3 1 National Reference Centre (NRC) for Whole Genome Sequencing of microbial pathogens: data-base and bioinformatics analysis (GENPAT), Istituto Zooprofilattico Sperimentale dell'Abruzzo e del Molise "Giuseppe Caporale" (IZSAM), via Campo Boario, Teramo, 64100 (TE), Italy (www.izs.it) 2 LARUS Business Automation, via Bruno Maderna 7, Mestre, – 30174 (VE), Italy (www.larus-ba.it) 3 National Institute for Public Health and the Environment (RIVM), P.O. Box 1, Bilthoven, 3720 (BA), The Netherlands (www.rivm.nl) * corresponding author Abstract Background. The task 4.1 of the One Health European joint programme (OHEJP) “One Health Structure In Europe” (COHESIVE) focuses on integrating pathogen information from public health, animal health and food safety surveillance at Member State level. Considered information are metadata associated to each sample (i.e. isolation date, origin, matrix) and whole genome sequencing (WGS) data from official laboratories (e.g. next generation sequencing data and bioinformatics-based analytical outcomes). Methods. A WEB-based platform called the COHESIVE Information System (CIS) has been created with separate instances for three Member States, in order to provide a proof of concept showing the advantages for surveillance and investigation of outbreaks at the genomic scale, considering food as a source of human pathogens. Currently, a CIS Version 2 (CISv2) is under development to integrate a unified food ontology at Member State level, taking into account as a first step organizations from Italy, Norway and The Netherlands: countries involved in the feasibility study foreseen in the project. More precisely, the last developments focused on the harmonization of the foodborn disease biosample contextual data collected over the past few decades (i.e. contextual metadata of foodborne samples sent in by labs for sequencing) based on the rule-based text mining tool LexMapr, and the implementation of the FoodOn ontology into the CIS based on the graph-database Neo4j to allow future records of harmonized food terms in the CISv2. Results. The successful harmonization of the past food terms and implementation of the FoodOn ontology into the CIS were mandatory steps allowing food ontology harmonization between organizations and improvement of queries from the CISv2 based on relational- and graph-databases. Keywords 1 COHESIVE information system, food ontology, relational-database, graph-database, genomics-based surveillance 1 IFOW 2021: 2nd Integrated Food Ontology Workshop, held at JOWO 2021: Episode VII The Bolzano Summer of Knowledge, September 11-18, 2021, Bolzano, Italy EMAIL: i.mangone@izs.it (I. Mangone); n.radomski@izs.it (N. Radomski); a.dipasquale@izs.it (A. Dipasquale); andrea.santurbano@larus- ba.it (A. Santurbano); p.calistri@izs.it (P. Calistri); c.camma@izs.it (C. Camma); kitty.maassen@rivm.nl (K. Maassen) ORCID: https://orcid.org/0000-0002-6716-091X (I. Mangone); https://orcid.org/0000-0002-7480-4197 (N. Radomski); https://orcid.org/0000-0002-9328-3972 (A. Dipasquale); https://orcid.org/0000-0001-5066-8971 (A. Santurbano); https://orcid.org/0000- 0001-6004-9373 (P. Calistri); https://orcid.org/0000-0002-7547-1195 (C. Camma); https://orcid.org/0000-0002-0864-464X (K. Maassen) © 2021 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings (CEUR-WS.org) 1. Introduction The microbiological surveillance and outbreak investigation are today supported by public health organizations through genomics-based information systems integrating multiple metadata related to foodborn disease biosample collected over the past few decades for sequencing by veterinarians, biologists, researchers and medical doctors [1]. Unfortunately, these metadata associated to samples are, neither organized nor harmonized between public health organizations in charge of food, veterinary and environmental sectors [2]. Consequently, several projects of the food [3], veterinary [4] and environmental [5] sectors, aim currently at organizing and harmonizing these metadata based on developments and implementations of ontologies. Both in and between European countries, the European Joint Programme (EJP) “One Health Structure In Europe” (COHESIVE) is of paramount importance for organizations of food production systems, as well as the veterinary and human health domains, in dealing with (re‐)emerging zoonoses, including antimicrobial resistance and food‐borne zoonoses 2. Because of current implementation of omics for outbreak investigation, source attribution and risk assessment of food-borne microorganisms across European Member States [6], the EJP COHESIVE, initially developed to collect data related to the area of risk‐analysis, aims today at integrating also genomics data from human and veterinary domains involved in genomics-based surveillance (Figure 1). Figure 1: Actors and data types of the “COHESIVE information system” (CIS) supported by the European Joint Programme (EJP) “One Health Structure In Europe” (COHESIVE) 2 EJP COHESIVE Home: https://onehealthejp.eu/jip-cohesive/ In the framework of the EJP COHESIVE, the “COHESIVE information system” (CIS) has been developed by IZSAM and three demo versions have been provided to organizations from Italy, Norway and The Netherlands (Task 4.1) to integrate pathogen information from public health, animal health and food safety surveillance at Member State level, integrating metadata related to samples (i.e. isolation date, origin, matrix) and genomics analyses (i.e. genome assembly, mapping of reads, species identification, mutations of interest) [7–9]. The first challenge of the CIS is to harmonize the past food terms which have been accumulated in different languages from different European organizations over the past few decades, though free-systems of recording independently of food term ontology. The second challenge of the CIS is to allow recording by European organizations of future food terms following a common ontology of food terms. A common and unified ontology of food terms into the CIS would allow queries from past and future food terms recorded by different European organizations (e.g. Which samples related to the cheese factory sector were isolated during 2008 in Italy with a clonal complex CC8?). In parallel, the EJP “One health suRveillance Initiative on harmOnization of data collection and interpretatioN” (ORION) (WP3) focused on the development of a “Health Surveillance Ontology” (HSO) at the European level [10], while other international consortia developed “Open Biological and Biomedical Ontology” (OBO) foundries [11], such like the “Genomic Epidemiology Ontology” (GenEpiO) [12] and the “food ontology” (FoodOn) [13]. Among these ongoing projects, the FoodOn ontology fit particularly well requirements of the CIS concerning the need of unified food terms (Figure 2), while HSO, OBO, GenEpiO focus mainly on surveillance system level data (e.g. number of samples collected, cases observed, ...), development of interoperable ontologies for the biological sciences (e.g. chemical entities, human disease, gene ontology, phenotype and trait, …), as well as vocabulary necessary to identify, document and research foodborne pathogens (e.g. genomic laboratory testing, specimen and isolate metadata), respectively. Figure 2: FoodOn: a harmonized food ontology to increase global food traceability, quality control and data integration [13] The specific objectives of the presented CIS Version 2 (CISv2) are (Figure 3): 1 the harmonization of the past food terms into the CISv2 which we hope to achieve using text mining tool LexMapr and FoodOn ontology from the CIS (i.e. Action 1), 2 the recording of the future food terms into the CISv2 which we hope to achieve implementing FoodOn ontology into the CIS through the graph-database Neo4j (i.e. Action 2), 3 the discussion of future actions related to food ontology harmonization between organizations which we hope to achieve implementing the CISv2 in different organizations (i.e. Action 3), 4 and the discussion of future improvements related to queries from the CISv2 which we hope to achieve combining relational- (CIS) and graph-database (CISv2) relationships (i.e. Action 4). CHALLENGES OBJECTIVES S OLUTIONS NEW S YS TEM Action 1 Food On h a rm on iza tion of th e p a s t food te rm s Le xM a p r + Food O n t o lo g y Action 2 CIS CIS v2 re c ordin g of th e fu tu re food te rm s + C O H E S IV E C O H E S IV E In f o r m a tio n In f o r m a tio n S y s te m S y s te m Action 3 Ve r s io n 1 Ve r s io n 1 h a rm on iza tion - in e roga ble c om m on - dif fe re n t la n gu a ge s im p le m e n ta tion in s e v e ra l la n gu a ge s - dis ha rm on iou s food be tw e e n org a n iza tion s - ha rm on iou s food te rm s org a n iza tion s te rm s - a bs e n c e of G e n e - ha rm on iou s G e n e On tology te rm s On tology te rm s Action 3 im p rov e m e n ts c om bin a tion of re la tion a l- a n d re la te d to q u e rie s g ra p h -da ta ba s e re la tion s h ip s Figure 3: Challenges, objectives, solutions and future new version (CISv2) of the “COHESIVE information system” (CIS) supported by the European Joint Programme (EJP) “One Health Structure In Europe” (COHESIVE) 2. Material and Methods The harmonization of the past food terms from the CIS (i.e. Action 1) and implementation of the FoodOn ontology into the CIS (i.e. Action 2) are required before performing any actions related to food ontology harmonization between organizations (i.e. Action 3) and improvement of queries from the CISv2 (i.e. Action 4). 2.1. Action 1: Harmonization of Past Food Terms from the CIS based on the Food Ontology FoodOn The CIS terms come from different providers and were manually curated by each involved organization from Italy, Norway and The Netherlands. The lists of food terms from several organizations written in different languages (i.e. Italy, Norway, Netherlands) were translated by each organization into lists of food terms in English without independent verification of the translation to avoid bias or error (Figure 4). This English translation of multilingual food terms was done with Google translate. Then, these lists of English food terms were mapped against the food ontology FoodOn [13] with the rule-based text mining tool LexMapr [14]. More precisely, LexMapr uses a rule-engine for handling synonyms, prefixes and suffixes to automatically map a matrix of English food terms with FoodOn codes. The resulting harmonized food terms were finally imported in the CISv2 (Figure 4). The accuracy of LexMapr mapping and missing food terms from FoodOn were not assessed in the present study. • New source “FoodOn” C IS Lis t of Lis t of C O H E S IV E Te rm s in E n g lis h • New “FoodOn code” as dif fe re n t In f o r m a t io n la n g u a g e s te rm s matrix term with FoodOn S y s te m Ve r s io n 1 source • Association between the term and the FoodOn code Le xM a p r Food On Lis t of C IS v 2 E n g lis h < te r m , Foo d On te rm s C O H E S IV E code> In f o r m a t io n Food on tolog y O n t o lo g y a s s oc ia tion s S y s te m Ve r s io n 2 Figure 4: Harmonization of food terms from the CIS based on the food ontology FoodOn with the objective to harmonize the past food terms of samples accumulated over the past few decades into the CISv2 2.2. Action 2: Implementation of the FoodOn Ontology into the CIS to Allow Recording of Future Food Terms from the CISv2 The implementation of the FoodOn ontology into the CIS was performed with the graph-database Neo4j [15] (Figure 5). The graph-database Neo4j is able to reveal invisible contexts and hidden relationships, storing and traversing networks of highly connected data. In the present context, Neo4J transforms xml specifications of the FoodOn ontology [13] into a graph-database of food terms. Using open source technologies 3, the FoodOn ontology was imported into Neo4j in order to build the first iteration of the knowledge graph. From the resulting CISv2, the Apache Zeppelin notebook [16] was used to create a data pipeline that covers from the ingestion, the enrichment, to the visualization of queries performed on the FoodOn ontology through Neo4J [15] (Figure 5). The Apache Zeppelin notebook is a Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL and other languages. C IS C O H E S IV E In f o r m a tio n S y s te m Ve r s io n 1 C IS v 2 C O H E S IV E In f o r m a tio n S y s te m Food On Ve r s io n 2 Food O n t o lo g y Figure 5: Implementation of FoodOn ontology into the CIS with the objective to allow future records of food terms into the CISv2 based on the food ontology FoodOn 3 Open source technologies GitHub: https://github.com/neo4j-contrib/neo4j-apoc-procedures and https://github.com/neo4j-labs/neosemantics 3. Results The tools for the harmonization of the past food terms into the CIS (i.e. Action 1) and implementation of the FoodOn ontology into the CIS (i.e. Action 2) were carefully selected according to future other actions related to food ontology harmonization between organizations (i.e. Action 3) and improvement of queries from the CISv2 (i.e. Action 4). 3.1. Action 1: Harmonization of Past Food Terms from the CIS based on the Food Ontology FoodOn Compared to nutritional ontologies designed to annotate and describe intervention trials [17–19], the FoodOn ontology [13] was selected because of its strong representation of food nutrients and processing [20]. Compared to other text mining tools in Literature [21], finance [22] and medicine [23], the selection of the rule-based text mining tool LexMapr to map food terms translated in English against the FoodOn ontology (Table 1) was driven by its interoperability across sectors [14]. Initially developed to fulfil objectives of public health surveillance networks like the US FDA’s GenomeTrakr system and the US National Antimicrobial Resistance Monitoring System (NARMS), LexMapr describes indeed food pathogen source for reporting of transmission dynamics in public health foodborne pathogen surveillance and investigation 4. Without speaking about issues related to English translations, the proposed mockup (Table 1) shows that there is a need to improve FoodOn's curation because FoodOn does not have a term for a generic pizza with or without meat or cheese 5. Table 1 Mockup of food terms from the CIS harmonized through LexMapr-based mapping against the FoodOn ontology with the objective to harmonize the past food terms of samples accumulated over the past few decades into the CISv2 Code from Matrix FoodOn Code* Source# providers Pizz ka a muzzarel e a pummarol Code-1 FOODON_000102 Naples coding system ngopp Code-2.1 Pizza Margherita FOODON_000102 ISO Code-2 Pizza FOODON_000001 ISO * # FoodOn codes are arbitrary examples ISO means that samples from the present example follow the requirements of the International Organization for Standardization 4 Open source text mining tools LexMapr: https://www.cineca-project.eu/blog-all/lexmapr-a-rule-based-text-mining-tool-for-ontology-term-mapping-and-classification 5 FoodOn term "pizza food product" URL: https://urlsand.esvalabs.com/?u=http%3A%2F%2Fpurl.obolibrary.org%2Fobo%2FFOODON_03310775&e=2f9e67d3&h=88f8ff14& f=n&p=y 3.2. Action 2: Implementation of the FoodOn Ontology into the CIS to Allow Recording of Future Food Terms from the CISv2 The implementation of the FoodOn ontology into the CIS was successfully performed through Neo4J [15], as exemplified with the Apache Zeppelin notebook (Figure 6) [16]. The graph-database Neo4j was selected for its capacity to query easily ontologies [15]. Figure 6: Zeppelin browser allowing to navigate trough and query the FoodOn ontology implemented into the CIS through Neo4j with the objective to allow future records of food terms into the CISv2 based on the FoodOn ontology 4. Discussion 4.1. Action 3: Harmonization of Food Terms and Implementation of the FoodOn Ontology in Public Health Organizations In a near future, we plan to harmonize the past food terms from the CIS (i.e. Action 1) and implement the FoodOn ontology into the CIS (i.e. Action 2) instances provided during the feasibility study for the organizations: IZSAM (Italy), NVI (Norway) and RIVM (The Netherlands) at the time, in order to use the same CISv2 implementing an identical food ontology. Indeed, different coding systems can be harmonized looking for different English translated terms that LexMapr [14] maps towards the same FoodOn codes [13] (Table 1). Even if LexMapr mapping of English translated terms against the FoodOn ontology may not be completely efficient, it allows sharing of common FoodOn terms derived from English translated terms between organizations using different languages. 4.2. Improvement of Queries from the CISv2 Today, the CIS without Neo4j implementation (Figure 4) can be interrogated through relational-database queries for data related to isolation date, sampling origin, food matrix and/or genomics data, while the CISv2 implementing Neo4j [15] can be interrogated through queries of graph-database relationships for food sectors organized inside the FoodOn ontology [13]. We plan in a near future to improve queries combining queries of relational- and graph-database relationships in order to interrogate the CISv2 for isolation date, sampling origin, food matrix, genomics data and/or food sectors organized inside the FoodOn ontology. Just using the CIS, we can query samples via standard SQL statements (). --------------- SQL QUERY on sampling and genomics metadata ---------- select samples where sampling_date is 2018 and sampling_place is Italy and CC=CC8 Draft code 1: SQL query using relational-database relationships of the CIS (response: samples isolated during 2008 in Italy with a clonal complex CC8) Neo4J implemented into the CISv2 allows query of FoodOn terms using graph-database relationships (). If the queried FoodOn term do not exist, it would become a new Environment Ontology (ENVO) term useful for FoodOn curation. --------------- Neo4J QUERY on FoodOn Terms -------------------------- select foodoncode with a relationship to foodoncode of "cheese factory" Draft code 2: Neo4J query using graph-database relationships of the CISv2 (response: FoodOn codes related to the FoodOn term "cheese factory") Using the combination of SQL and Neo4J queries, an innovative query could use relational- and graph-database relationships (). ( --------------- SQL QUERY on sampling and genomics metadata ---------- select samples where sampling_date is 2018 and sampling_place is Italy and CC=CC8 ) AND foodoncode in ( --------------- NEO4J QUERY on FoodOn Terms -------------------------- select foodoncode with a relationship to foodoncode of "cheese factory" ) Draft code 3: SQL and Neo4J queries using relational- and graph-database relationships of the CISv2, respectively (response: samples isolated during 2008 in Italy with a clonal complex CC8 and a FoodOn code related to the FoodOn term "cheese factory") Adding additional ontologies, such as the Gene Ontology (GO) dedicated to GO terms [24, 25] describing the metabolic pathways 6, other innovative combinations of SQL and Neo4J queries could use relational- and graph-database relationships filtering samples via classical relational constraints, and adding ontology constraints on food matrix and genome annotations (). This kind of Gene Ontology-based query is typically usefull to list GO-terms from a subset of genomes and a larger collection of genomes in order to perform a Genome Ontology Enrichment Analysis (GOEA) [26] identifying over-represented metabolic pathways across genomes of interest (e.g. genomes involved in an outbreak). ( --------------- SQL QUERY on sampling and genomics metadata ---------- select samples where sampling_date is 2018 and sampling_place is Italy and CC=CC8 ) AND foodoncode in ( --------------- NEO4J QUERY on FoodOn Terms -------------------------- select foodoncode with a relationship to ontology_node of "cheese factory" ) AND GOterm in ( --------------- NEO4J QUERY on GO Terms ------------------------------ select GOterm with a relationship to ontology_node of "ATPase activity" ) Draft code 4: SQL and Neo4J queries using relational- and graph-database relationships of the CISv2, respectively (response: samples isolated during 2008 in Italy with a clonal complex CC8, a FoodOn code related to the FoodOn term "cheese factory" and a Gene Ontology code related to the Gene Ontology term "ATPase activity") 6 Open source ontology GENE ONTOLOGY: http://geneontology.org/docs/download-ontology/ 4.3. Significant Overlapping with Existing Efforts The present development of the CISv2, dedicated to food ontology between organizations, overlaps significantly to existing efforts in the field of human nutrition, especially the food, health, nutrition domain ontologies (FHNDO) [17], the Ontology for Nutritional Studies (ONS) [18], and the Ontology for Nutritional Epidemiology (ONE) [19]. While these nutritional ontologies aim at identifying healthy diets based on interoperability of ontologies related to classifications of diets, diseases and food [17–19], the CISv2 is dedicated to surveillance and investigation of foodborne outbreaks in human at the genomic scale based on ontologies related to genes and food. Instead of identifying healthy diets (i.e. FHNDO, ONS and ONE), the immediate goal of the project is to explain genetically foodborne outbreaks in human (i.e. CISv2). Compared to the relational-databases dedicated to the nutritional ontologies FHNDO [17], ONS [18] and ONE [19], the CISv2 presents the advantage to be able to combine relational- (i.e. CIS) and graph-databases (i.e. Neo4j). 5. Conclusion The harmonization of the past food terms into the CIS (i.e. Action 1) and implementation of the FoodOn ontology into the CIS (i.e. Action 2) will allow harmonization of the food ontology between organizations (i.e. Action 3) and improvement the interrogation of the CISv2 (i.e. Action 4) combining queries from relational- (i.e. CIS) and graph-databases (i.e. Neo4j). Thenceforth, the CISv2 need FoodOn curators from the ontology community to perform better biosample description, text mining and text mashing to ontology terms. In the longer term, we also plan to extend the CISv2 to other ontologies, like Gene Ontology in order to perform GOEA. Based on successful outcomes of actions 1 and 2, the CISv2 presents today harmonious English food terms and can be distributed for easy implementation in different European organizations with standard servers (Action 3) and used to perform combinations of queries from relational- (i.e. past food terms from CIS) and graph-databases (i.e. future food terms from CISv2). 6. Acknowledgements The study was funded by the European Joint Programme (EJP) “One Health Structure In Europe” (COHESIVE). Mention of trade names or commercial products in this article is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the IZSAM. The authors declare that they have no competing interests and thank especially the Italian Ministry of Health for supporting in the acquisition of high-performance computing resources. This manuscript was drafted by Nicolas Radomski based on an available Word template under a Creative Commons License Attribution-ShareAlike 4.0 International (CC BY-SA 4.0). 7. References [1] Y. Jang, T. Choi, J. Kim, J. Park, J. Seo, S. Kim, et al. An integrated clinical and genomic information system for cancer precision medicine. BMC Med Genomics. 2018;11:34. [2] E. Griffiths, D. Dooley, M. Graham, G. Van Domselaar, F.S.L. Brinkman, W.W.L. Hsiao. Context Is Everything: Harmonization of Critical Food Microbiology Descriptors and Metadata for Improved Food Safety and Surveillance. Front Microbiol. 2017;8:1068. [3] T. Eftimov, G. Ispirova, D. Potočnik, et al. ISO-FOOD ontology: A formal representation of the knowledge within the domain of isotopes for food science. Food Chem. 2019;277:382–90. [4] F.C. Dórea, F. Vial, K. Hammar, A. Lindberg, P. Lambrix, et al. Drivers for the development of an Animal Health Surveillance Ontology (AHSO). Prev Vet Med. 2019;166:39–48. [5] M. Masmoudi, S.B. Abdallah Ben Lamine, H.B. Zghal, et al. An ontology-based monitoring system for multi-source environmental observations. Procedia Comput Sci. 2018;126:1865–74. [6] EFSA Panel on Biological Hazards (EFSA BIOHAZ Panel), K. Koutsoumanis, A. Allende, A. Alvarez‐Ordóñez, D. Bolton, S. Bover‐Cid, et al. Whole genome sequencing and metagenomics for outbreak investigation, source attribution and risk assessment of food‐borne microorganisms. EFSA J. 2019;17. doi:10.2903/j.efsa.2019.5898. [7] S. Srinivasan, S.K. Mandal, L. Kumar, C. Menaka, A.A. Menon. A Structured Protective Cohesive Health Care Information System Using Security And Storage Mechanism In Cloud. Int J Eng Trends Technol. 2021;69:29–33. [8] A. Di Pasquale and V. Caldarelli. COHESIVE Information System (CIS). 2019. doi:10.5281/ZENODO.3257358. [9] A. Di Pasquale and V. Caldarelli. COHESIVE Information System (CIS). 2019. doi:10.5281/ZENODO.3342572. [10] M. Filter, T. Buschhardt, F. Dórea, E. Lopez de Abechuco, T. Günther, E.M. Sundermann, et al. One Health Surveillance Codex: promoting the adoption of One Health solutions within and across European countries. One Health Amst Neth. 2021;12:100233. [11] The OBI Consortium, B. Smith, M. Ashburner, et al. The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration. Nat Biotechnol. 2007;25:1251–5. [12] FAIRsharing Team. FAIRsharing record for: GenEpiO. 2018. doi:10.25504/FAIRSHARING.Y1MMBV. [13] D.M. Dooley, E.J. Griffiths, G.S. Gosal, P.L. Buttigieg, R. Hoehndorf, M.C. Lange, et al. FoodOn: a harmonized food ontology to increase global food traceability, quality control and data integration. Npj Sci Food. 2018;2:23. [14] G. Gosal, E. Griffiths, D. Dooley, I. Gill, D. Fornika, H. Tate, et al. LexMapr: a rule-based text mining tool for ontology-driven harmonization of short biomedical specimen descriptions. 2019. doi:10.7490/F1000RESEARCH.1117323.1. [15] F.M. Santos López, E.G. Santos De La Cruz. Literature review about Neo4j graph database as a feasible alternative for replacing RDBMS. Ind Data. 2015;18:135. [16] D. Chrimes, H. Zamani. Using Distributed Data over HBase in Big Data Analytics Platform for Clinical Services. Comput Math Methods Med. 2017;2017:1–16. [17] T. Helmy, A. Al-Nazer, S. Al-Bukhitan, A. Iqbal. Health, Food and User’s Profile Ontologies for Personalized Information Retrieval. Procedia Comput Sci. 2015;52:1071–6. [18] The ENPADASI consortium, F. Vitali, R. Lombardo, D. Rivero, F. Mattivi, P. Franceschi, et al. ONS: an ontology for a standardized description of interventions and observational studies in nutrition. Genes Nutr. 2018;13:12. [19] C. Yang, H. Ambayo, B. De Baets, P. Kolsteren, N. Thanintorn, D. Hawwash, et al. An Ontology to Standardize Research Output of Nutritional Epidemiology: From Paper-Based Standards to Linked Content. Nutrients. 2019;11:1300. [20] L. Chan, N. Vasilevsky, A. Thessen, et al. The landscape of nutri-informatics: a review of current resources and challenges for integrative nutrition research. Database. 2021;2021:baab003. [21] L. Feng, Y.K. Chiam, S.K. Lo. Text-Mining Techniques and Tools for Systematic Literature Reviews: A Systematic Literature Review. In: 2017 24th Asia-Pacific Software Engineering Conference (APSEC). Nanjing: IEEE; 2017. p. 41–50. doi:10.1109/APSEC.2017.10. [22] A. Gupta, V. Dengre, H.A. Kheruwala, M. Shah. Comprehensive review of text-mining applications in finance. Financ Innov. 2020;6:39. [23] C. Luque, J.M. Luna, M. Luque, S. Ventura. An advanced review on text mining in medicine. Wiley Interdiscip Rev Data Min Knowl Discov. 2019;9. doi:10.1002/widm.1302. [24] M. Ashburner, C.A. Ball, J.A. Blake, D. Botstein, H. Butler, J.M. Cherry, et al. Gene Ontology: tool for the unification of biology. Nat Genet. 2000;25:25–9. [25] E.I. Boyle, S. Weng, J. Gollub, H. Jin, D. Botstein, J.M. Cherry, et al. GO::TermFinder--open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes. Bioinformatics. 2004;20:3710–5. [26] M. Vila Nova, K. Durimel, K. La, A. Felten, P. Bessières, M.Y. Mistou, et al. Genetic and metabolic signatures of Salmonella enterica subsp. enterica associated with animal sources at the pangenomic scale. BMC Genomics. 2019;20:814.