=Paper=
{{Paper
|id=Vol-2823/Paper5
|storemode=property
|title=Knowing the Unknown: Unshielding the Mysteries of Semantic Web in Health Care Domain
|pdfUrl=https://ceur-ws.org/Vol-2823/Paper5.pdf
|volume=Vol-2823
|authors=Pallavi Nagpal, Deepika Chaudhary, Jaiteg Singh
}}
==Knowing the Unknown: Unshielding the Mysteries of Semantic Web in Health Care Domain==
Knowing the Unknown: Unshielding the Mysteries of Semantic Web in Health Care Domain Pallavi Nagpal, Deepika Chaudhary and Jaiteg Singh Chitkara University Institute of Engineering &Technology, Punjab, India. Abstract Health care is one of the popular segments wherein a huge amount of data is being generated every second and in a heterogeneous format. It is only the use of semantic technologies that can act as a guard for data integration and enriching it by adding proper semantics. The Semantic Web gives us a sensible and versatile answer to perceive proficiency across the congregation of healthcare data. Eventually the ultimate goal of improving the health care practices depends on how the healthcare data is linked together. The major challenges to chase the goal are not only limited to enable the integration of heterogeneous data sources but also requires the development of certain tools for efficient search, ontology management, and data analytics. The major objective of this study is to understand the technologies behind the Semantic Web and to unshield the mysteries of application of Semantic Web in the healthcare segment. The data for such an extensive survey was gathered from repositories of Pub Med, Scopus, Web of Science, and Google Scholar along with few medical science libraries. Keywords 1 Semantic Web, Healthcare, Ontology, Knowledge base, SNOMED-CT ACI’21: Workshop on Advances in Computational Intelligence at ISIC 2021, February 25-27, 2021, Delhi, India EMAIL: pallavihptu@gmail.com (P.Nagpal); deepika.chaudhary@chitkara.edu.in (D.Chaudhary.); jaiteg.singh@chitkara.edu.in (J.Singh). 2020 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings (CEUR-WS.org) 1. Introduction Layer 1 Uniform Resource Identifier (URI) – This layer uniquely The better definition of Semantic Web identifies the available resources on the (SW) always begins with the word Semantic. Internet. This layer ensures the uniqueness of In simple words semantics signifies meaning. Objects. Meaning allows effective use of data. Meaning is mostly missing in many of the information Layer 2 Extensible Mark-up sources and is the job of the user or Language (XML) – A language that lets it programming instruction to deliver the same. underlying user to create web pages with a For example, if we simply look at thedefined vocabulary. This layer also supports tag it is used to highlight major headings but sharing documents across the web. as technical people, it is understood that the Layer 3 & Layer 4 Resource text which is surrounded by
tag is of more importance to the user when compared to Description Framework (RDF) - RDF other. Basic semantics are attached for the acts like an Entity-Relationship(ER) model search engines with tags; however, which is drafted for writing a set of statements they are just solitary keywords and lacks when to describe objects (resources) available on the linked. Semantics accord keywords with web. RDF Schema on the other hand is used to effective meaning through relationships. The write ontologies to represent relationships Semantic Web in simple terms is a web of data between web objects [26]. For integrating paraphrased and linked together in ways to richness to the interoperability of data among build context that follow a grammar & various domains Ontology Web Language language construct. Although these semantics (OWL) was developed. can also be added with a help of programming Layer 5 Proof & Trust layer - This language in that case also formal standards are layer is all about the representation of logics to missing and also these semantics cannot be deduce one document from another and there shared, aggregated, and validated. One can validations. The trust layer ensures the faith also attach these semantics through through the use of digital signatures. programming languages but in that case, too, To conclude we can state that the Semantic there are no formal standards to be followed, Web can be better understood as an extension and sharing, aggregation, and validation of the WWW, it extends itself through the among statements becomes complex if not usable, systematized semantics that is based on impossible. The Semantic Web all together research in knowledge/facts representation and supports the evolutionary nature of WWW. the logic to approach the goal of ubiquitous The Semantic Web architecture is based on the information sharing. When compared to Layered Approach. These layers have some WWW it's the Semantic Web which primarily strong key dependencies between them. The consists of statements for application Semantic Web layer cake presented in Figure consumption. These statements are not just 1 highlights these dependencies. plain statements for human interpretations but also include logic and act as meaningful links which can directly be interpreted by machines. The outcome of this paper is to provide insights on the recent advancements done in this field. The more specific objectives can be summarized as follows: a) To create a theoretical foundation in the area of data standards and interoperability, ontologies and its visualization, semantic data repositories and Semantic Web user Interfaces. b) To reveal the current state of art of Figure 1 : The Semantic Web Stack application of Semantic Web in Health Care Domain. c) To un-shield the successful use cases of segment. In this phase, a detailed study of new approaches, techniques and Semantic literature has been carried on to improve the Web applications in the field of health care quality and safety of web applications in the data management. health care domain. 3.1 Data Standards and The next section presents the present day challenges which require attention of the Interoperability researcher community. 3.1.1 Theoretical Foundation 2. Healthcare Challenges in Current The automation of various health care services; usage of various medical information Scenario systems and other technological instruments The use of information systems and have put together and still is generating technologically advanced medical devices in volumes of medical data and that too in various health care hospitals is producing and heterogeneous form. The huge volume of data will continue to produce vast amount of data. has to be processed in a meaningful form to This huge collection needs to be explored and produce some valuable knowledge. This transformed in such a way that it can be knowledge can lead to sound health care converted into valuable information which can practices and thus can be of utmost use for thus improve the healthcare processes. Which humans[26]. When the data is coming from is not easy and can be daunting because of multiple sources it's the interoperability various challenges? The following challenges standards that act as a bridge to integrate and needs to be addressed: exchange the data across systems and services. To achieve this requires Schema matching a) Interoperability of health and medical techniques that can transform the data from data. human-understandable to machine-ready b) Personalization of Ontologies and its format. This technique when implemented can visualization. produce good healthcare services thereby c) Explosion of health data and Semantic reducing the cost by eliminating duplicate Data Repositories operations. The authors in their work have d) Development of new user friendly highlighted the meaning of interoperability for interfaces Semantic Web applications which can be defined as the vocabulary, organization, and Semantic Web is attaining approval in structure of data required to integrate the data addressing these challenges and therefore from multiple institutions [28]. Figure 2 gives World Wide Web consortium (WWWC) a glimpse of various standards dealing with established the Health Care and Life Science interoperability and the changes concerning Interest Group in the year 2015 time. (HCLSIG,2015) with the objective to defend and support the use of Semantic Web technologies in health care segment. The rest part of the paper is organized as follows section 3 provides a background and short outline of prevailing literature for the mentioned challenges, section 4 highlights the new techniques and approaches relevant to health care segment, section 5 concludes the study. 3. Literature Review on Semantic Figure 2: Rate of Change/Year (https://yosemiteproject.org/interoperability- Web Technologies in Health Care roadmap/) Domain This section presents a short outline of a The interoperability can be categorized into few of the prevailing literature towards two parts a) functional and b) semantic where Semantic Web technologies in the health care functional deals with the common procedures and semantic deals with the framing of a This standard can also leverage RESTFUL common language which machine can web services. The given section categories the understand in the end to end communication. literature review underwent by various To make applications fully interoperable researchers. requires standards. As per the literature mentioned the standards can be divided into a) 1. Private and public sector organization to Vocabulary or ontology Standards b) Data align their systems with Fast Healthcare interchange and Integration standards c) Interoperability Resources is the upcoming Health record maintenance standards. These standard for data exchange. standards are established through four methods (https://www.dataversity.net/ semantic- a) Adhoc b) Defacto b) Government- interoperability-future-healthcare-data/ ).s mandated) Consensus. [14] 2. Development of various standards for 3.1.2 Present State of Art providing data interoperability; XML, Below we present the literature survey GPS, Web Services, and Security, TCP/IP, done in this direction: 802.11, GPS [12] The authors worked on schema matching 3. Development of Reference Information techniques and presented a method for Model (RIM) (HL7 Version 3 (V3) - a automating the process for matching schema at suite of specifications based on HL7's the field level; they achieved 71.8% when Reference Information Model (RIM)), its mapping the four staged process which data elements, terminology, clinical includes string matching and substring statements, templates, document matching [25]. The authors in their paper have architecture. discussed the standards required in Clinical (http://www.hl7.org/implement/standards/r data management [15]. Clinical data im.cfm). management deals with signs, operations, 4. Development of Collaborative Standard medicines, and lab values for a particular Hubs for quality improvement in the patient. The data here can only be healthcare segment across rural and urban interoperable when in a structured form. This states (Srivastava et al., 2020). transformation is a challenge and requires a lot of research. Further data analysis also requires 3.2 Ontologies & its Visualization the data to be converted in a structured form. 3.2.1 Theoretical Foundation The authors have given the FAIR guiding Ontologies play a very important role in principle as one of the possible solutions to Semantic Web applications. They act as a this problem. The acronym of FAIR is common vocabulary for a specific domain "Findability", "Accessibility", consisting of a terminological collection of "Interoperability", and "Reusability". The terms together with the rules to combine these authors in the study have mentioned Fast terms and form relations. They also act as a Healthcare Interoperability Resources (FHIR) basis for interoperability. as a newbie in the area of biomedical informatics and healthcare [26]. These 3.2.2 Present State of Art guidelines when followed can provide a Here, we first survey the work done in this technological edge over health level seven direction. The authors in this paper did an (HL7). It was also discussed in the Yosemitea extensive survey on the current state of manifesto (https://www.dataversity.net/ ontology development in the area of health semantic-interoperability-future-healthcare- care [20]. They have studied various papers in data/) that for data exchange the Resource the time frame of 2009 and 2018. They Description Framework (RDF) to be used. classified the work done by various Existing data standards to be mapped with researchers and proved that at present also RDF's, Government agencies should mandate there is a dire need for the development of new RDF as a Universal Healthcare exchange ontologies for delivering effective healthcare language. These standards can be adopted in services. The authors in this paper have the area of legacy systems, modern medical highlighted the role of ontology in decision entities, and healthcare information systems. making and how ontology that defines concepts such as disease, location, and evidence-based decision support systems environment and the interrelationships can [11]. influence the process of decision making in the 5. To work upon improving the quality of public sector domain [26]. They also focused health services using semantic web on the various mapping methods to work on technologies. Majorly the privacy and interoperability issues. Further, they have security, trust, risks, and social developed a centralized knowledge base for implications and the quality of information healthcare systems specifically for the Tamil are important and play an important role in Nadu Region of India. The author in this the semantic web areas [15]. paper has integrated the data which is coming 6. HCLS Knowledge Base is designed as a from different sources and thus created an knowledge base where data from multiple ontology [19]. They have followed four-way sources (PubMed, Clinical Trials) have steps which include identification of data been stored. WHOsGHO – From WHO sources, the formalization of concepts, statistics about 3 million of data has been performed audit, and thus formulated converted to RDF [23]. Other projects ontology. In the table given below we similar to the one mentioned above like summarize the work done in the direction of DailyMed (2015) would continue to creating new ontologies into past, Work in generate data from the health domain progress, and Future Scope. The authors based on the principles of linked data [23] stated that today also lot of data which is 7. The data hub is currently indexing published by big organization like WHO is hundred of datasets which are tagged published in proprietary format and not in under the healthcare category. The accordance with Semantic Web standards and requirement of a lot of new projects in this therefore it is still very difficult to integrate direction which can convert the data stored and further process those chunks [28]. in a proprietary format to linked data [23] Although a lot of research work (WHOGO, 2015), (PubMed, 2015), (NIH, 2015) has been 3.3 Semantic Websites user carried in this direction still there is a long way Interface ahead. The table below summarizes the literature. 3.3.1 Theoretical Foundation There are multiple projects such as 1. Development of a Centralized knowledge CardioShare and Bio2RDF which have certain base. Personalization of ontologies for capabilities for navigating and querying the describing the class hierarchies among underlying Semantic Web data. However they chronically ill patients to form a decision lag intuition and can be more improvised. support system for chronically ill patients 3.3.2 Present State of Art (Riaño et al., 2012) The projects which are designed in this 2. Introduction of Electronic Health care direction should have the capabilities for system to improve the various parameters searching and navigating through the Semantic of health care services [19]. Web data [28]. There are certain projects 3. Improvement was done in various which have these capabilities like Bio2RDF organizations dealing in the service and CardioShare, but these projects are limited delivery segment, availability of reliable and are not that intuitive which means a novice health data for healthcare providers, and user will find it very difficult to explore and improves upon in the public health system. visualize the RDF triples. Therefore, there is a A lot of new ontologies were also strong need to develop a good interface while developed. The process of ontology developing the Semantic Web Applications. validation was also improved [27]. The architecture is so simple that even a 4. Development of comprehensive beginner can explore it without much monitoring frameworks in the field of difficulty. The authors in this paper have maternal health informatics that would be designed and Semantic Web Portal (SWP) created with the consensus of people which is a light weight portal to browse and practicing it and also on the ontology- visualize the data generated and that too in based data integration approach. These meaningful and friendly way [9]. This system frameworks will facilitate the research and was deployed in Indiana University Health Care Center to store and visualize the semantic application. A reasoner can therefore be information from one place and was used by defined as a piece of code which is able to multiple users i.e the patients the doctors, the infer logical consequences among a set of practitioners to look all semantic information declared facts or axioms [8]. Correctness, in one place. efficiency, soundness and completeness of the 1. Design and development of few interfaces new inferences drawn are some of the to explore and navigate the Semantic Web important attributes of a good reasoner. data. The portals can only read the JSON FaCT++, Pellet, HermiT, Kaon2, Hoolet are files [18] the few examples of Semantic Web reasoners. 2. To make this process more interactive the The reasoners can be categorized into various researchers are working on the strategies categories as per the various OWL profiles. In which can also read the dynamically the study done by [9] the authors have a generated JSON objects. For novice users, clearly described the various types of the researchers are working to make the reasoners. interface user friendly and easy to explore [18] 3. Semantic Graph mining to identify and 4. Conclusions rank the nodes and relationships. Although the technologies in the underlying Development of user friendly and more domain have contributed a lot in health care intuitive interface and graph visualisation segment still this study reveals that there still methods to navigate through various certain challenges that needs to be resolved. ontologies This survey reveals the present day challenges (https://arxiv.org/abs/2008.03053). which require an ultimate attention of the 4. Representation of Semantic data in researcher .The end users can be benefitted in proprietary formats- A lot of big case we get an optimal solution to the organizations share the data in PDF or challenges mentioned herein. spreadsheet format which makes it difficult to integrate. CardioSHARE 5. Future Work and advancements (Vandervalk, McCarthy& Wilkinson, in Semantic Health Care Segment 2008) represents a decentralized web service framework that provides a Healthcare segment is generating a lot of data SPARQL endpoint that enables querying continuously. Semantic interoperability of data transparently resources in the "deep web" can help the human community a lot. The from distributed and independent source. patients, the doctors, various organizations can 5. Bio2RDF in this project 11 billion of data get the maximum benefits from this. The that comes from various heterogeneous figure below represents the future issues that sources have been ported to RDF formats need to be addressed to make this technology a (https://bio2rdf.org/).Bio2RDF in this huge success. project 11 billion of data that comes from various heterogeneous sources have been ported to RDF formats (https://bio2rdf.org/). 3.4Semantic Web and Reasoning 3.4.1Theoretical Foundation The success behind any Semantic Web application lies in the inference process. Semantic Analysis is the mapping between syntax and the meaning of the sentences. Semantic analysis is used to check whether inserted sentence is accurate or not. The Semantic Web application designers are extremely benefitted if they can select a Figure 3: Advancements in Semantic HealthCare suitable reasoner as per the design of the a) Semantic Data Integration for IoT 1_11 Sensor Data [4] Berners-Lee, T., Hall, W., Hendler, J.A., In present scenario, about 35 billion of IoT O'Hara, K., Shadbolt, N. and Weitzner, devices are connected and it is predicted that D.J., 2006. A framework for web this number would grow around 120 billion in science. Foundations and Trends® in Web 2025, which would be generating around 180 Science, 1(1), pp.1-130. trillion gigabytes of data. This data comes [5] Bergman M.K. (2018) Platforms and from various heterogeneous devices thus Knowledge Management. In: A making the formats incompatible to integrate. Knowledge Representation Practionary. Which creates a significant problem for IoT Springer, Cham. application developers? Semantic Annotations https://doi.org/10.1007/978-3-319-98092- and Clustering can be used as a method to 8_12 integrate this data which is a challenge and can [6] Bernstein, A., Hendler, J., & Noy, N. be considered as a future scope [3] (2016). A new look of the Semantic Web. b) Development of new faceted interfaces [7] Brooks, P., & Avera, H. (2010). Standards for searching and exploring semantic web and interoperability in healthcare health care data information systems: Current status, Irrespective of the complexity and size of problems, and research issues. In Fifth data interfaces act as central linkage between MWAIS Conference. human computer interactions. This becomes [8] Jain, S., Jain, N. K., & Goel, C. K. (2009). more complex when data is coming from Reasoning in EHCPRs system. Int. J. various heterogeneous systems. Therefore, Open Problems Compt. Math, 2(2). there is a need of designing new faceted and [9] Ding, Y., Sun, Y., Chen, B., Borner, K., interactive interfaces [24] Ding, L., Wild, D., & Toma, I. (2010). c) Integration of Machine learning Semantic web portal: a platform for better algorithms into Semantic web reasoners browsing and visualizing semantic data. In Certain new reasoners are required in the area Active Media Technology (pp. 448-460). of the health care domain to generate Springer Berlin Heidelberg inferences that can learn from themselves and [10] El Asikri, M. et al. (2016) are based on ANN or deep learning techniques ‘Contribution to ontologies building using [13]. the semantic web and web mining’, Proceedings - 2016 International References Conference on Engineering and MIS, ICEMIS 2016, (February 2018). doi: 10.1109/ICEMIS.2016.7745329. [1] Andročec, D., Novak, M., & Oreški, D. [11] Henao, J., Quintana, Y. and Safran, C. (2018). Using Semantic Web for Internet (2019) ‘An informatics framework for of Things Interoperability: A Systematic maternal and child health (MCH) Review. International Journal on Semantic monitoring’, Studies in Health Technology Web and Information Systems (IJSWIS), and Informatics, 257(February 2020), pp. 14(4), 147-171. 157–162. doi: 10.3233/978-1-61499-951- [2] Allemang, D. and Hendler, J., 5-157. (2011). ‘Semantic web for the working [12] Kubben, P. (2019) ontologist: effective modeling in RDFS 2019_Book_FundamentalsOfClinicalData and OWL’. Elsevier. Scie. Available at: [3] Balakrishna S., Thirumaran M., Solanki papers3://publication/uuid/5CA1CB66- V.K. (2020) IoT Sensor Data Integration E0F7-4439-866E-935EFEAF8720. in Healthcare using Semantics and [13] Kulmanov, M., Smaili, F. Z., Gao, X., Machine Learning Approaches. In: Balas & Hoehndorf, R. (2020). Semantic V., Solanki V., Kumar R., Ahad M. (eds) similarity and machine learning with A Handbook of Internet of Things in ontologies. Briefings in Bioinformatics. Biomedical and Cyber Physical System. [14] Mandel, J. C. et al. (2016) ‘SMART Intelligent Systems Reference Library, vol on FHIR: A standards-based, interoperable 165. Springer, Cham. apps platform for electronic health https://doi.org/10.1007/978-3-030-23983- records’, Journal of the American Medical Informatics Association, 23(5), pp. 899– [21] Saripalle, R. K. (2019). Fast Health 908. doi: 10.1093/jamia/ocv189. Interoperability Resources (FHIR): [15] Ngo, Q. T. and Tao, N. B. (2020) Current status in the healthcare ‘Determinants of semantic web technology system. International Journal of E-Health adoption from IT professionals’ and Medical Communications perspective: Industry competition, (IJEHMC), 10(1), 76-93. organization innovativeness, and data [22] Schulz, S., Stegwee, R., & Chronaki, management capability’, International C. (2019). Standards in healthcare data. Journal of Data and Network Science, In Fundamentals of Clinical Data 4(3), pp. 271–288. doi: Science (pp. 19-36). Springer, Cham. 10.5267/j.ijdns.2020.6.005. [23] Zenuni, Xhemal & Raufi, Bujar & [16] Liyanage H, Krause P, de Lusignan S Ismaili, Florie & Ajdari, Jaumin. (2015). Using ontologies to improve semantic State of the Art of Semantic Web for interoperability in health data BMJ Health Healthcare. Procedia - Social and & Care Informatics 2015; 22: Behavioral Sciences. 195. 1990-1998. doi: 10.14236/jhi.v22i2.159 10.1016/j.sbspro.2015.06.213. [17] Okikiola, F. M., Ikotun, A. M., [24] Zhang, Guo-Qiang et al. ‘Ontologies Adelokun, A. P., & Ishola, P. E. (2020). A as Nested Facet Systems for Human–data Systematic Review of Health Care Interaction’. 1 Jan. 2020: 79 – Ontology. Asian Journal of Research in 86.Informatics 2015; 22: doi: Computer Science, 5(1), 15-28. 10.14236/jhi.v 22i2.159. [18] Oren, E., Heitmann, B., & Decker, S. [25] Patel, A., & Jain, S. (2018). (2008). ActiveRDF: Embedding Semantic Formalisms of representing Web data into object-oriented knowledge. Procedia Computer languages. Journal of Web Science, 125, 542-549. Semantics, 6(3), 191-202. [26] Jain, S. (2018). Intelligent decision [19] Sachdeva, S., & Bhalla, S. (2012). support for unconventional emergencies. Semantic interoperability in standardized In Exploring Intelligent Decision Support electronic health record databases. Journal Systems (pp. 199-219). Springer, Cham. of Data and Information Quality [27] Jain, S., & Meyer, V. (2018). (JDIQ), 3(1), 1-37. Evaluation and refinement of emergency [20] Satti, F. A., Ali Khan, W., Ali, T., situation ontology. International Journal of Hussain, J., Yu, H. W., Kim, S., & Lee, S. Information and Education (2020). Semantic Bridge for Resolving Technology, 8(10), 713-719. Healthcare Data Interoperability. In 34th [28] Devi, R., Mehrotra, D., & Baazaoui- International Conference on Information Zghal, H. (2020, June). An r2rml-based Networking, ICOIN 2020 (pp. 86-91). Approach to Map Dengue Patient [9016461] (International Conference on Database to Ontology. In 2020 8th Information Networking; Vol. 2020- International Conference on Reliability, January). IEEE Computer Infocom Technologies and Optimization Society. https://doi.org/10.1109/ICOIN486 (Trends and Future 56.2020.9016461 Directions)(ICRITO) (pp. 790-795). IEEE.