=Paper=
{{Paper
|id=Vol-1327/1
|storemode=property
|title=Pain Assessment Terminology in the NCBO BioPortal: Evaluation and Recommendations
|pdfUrl=https://ceur-ws.org/Vol-1327/icbo2014_paper_16.pdf
|volume=Vol-1327
}}
==Pain Assessment Terminology in the NCBO BioPortal: Evaluation and Recommendations==
ICBO 2014 Proceedings Pain Assessment Terminology in the NCBO BioPortal: Evaluation and Recommendations Werner CEUSTERS Department of Biomedical Informatics and Institute for Healthcare Informatics University at Buffalo Buffalo, NY - USA ceusters@buffalo.edu Abstract—The International Association for the Study of Pain TABLE I. PAIN TERMS ANALYZED (IASP) publishes since 1986 a relatively frequently updated list of Allodynia: pain due to a stimulus that does not normally provoke pain. pain terms currently known as the ‘IASP Taxonomy’. It was Note: The stimulus leads to an unexpectedly painful response. examined how nine terms defined in this taxonomy and used by Analgesia: absence of pain in response to stimulation which would pain specialists to describe findings of somatosensory testing and normally be painful. pain assessment are classified in the representational artifacts Dysesthesia: an unpleasant abnormal sensation, whether spontaneous or accepted in the NCBO BioPortal. It was found that the majority evoked. Note: Special cases of dysesthesia include hyperalgesia and of the BioPortal resources cover the terms poorly and that the allodynia. quality of the hierarchies and the mappings are below acceptable Hyperalgesia: increased pain from a stimulus that normally provokes pain. quality standards. It is concluded that without the BioPortal Hyperesthesia: increased sensitivity to stimulation, excluding the special senses. Note: Hyperesthesia includes both allodynia and hyperalgesia, studies of this nature are hard to perform, but also that for the but the more specific terms should be used wherever they are BioPortal to become an instrument which is useful for other applicable. purposes than determining that its content is of poor quality, the Hyperpathia: a painful syndrome characterized by an abnormally painful internal quality assurance principles used for its development reaction to a stimulus. and maintenance need to be improved and documented. Hypoalgesia: diminished pain in response to a normally painful stimulus. Hypoesthesia: decreased sensitivity to stimulation, excluding the special Keywords—pain terminology, NCBO BioPortal, quality senses. assurance Paresthesia: an abnormal sensation, whether spontaneous or evoked. Note: it has been agreed to recommend that paresthesia be used to describe an abnormal sensation that is not unpleasant while dysesthesia be used I. INTRODUCTION preferentially for an abnormal sensation that is considered to be unpleasant. There is a sense in which, since paresthesia refers to Findings based on the various kinds of responses that patients abnormal sensations in general, it might include dysesthesia, may report when subjected to stimuli to test their somatosensory status, are typically described using terms such as ‘allodynia’, ‘hyperesthesia’, and so forth. Standard It is therefore not possible to use these definitions in the definitions for these terms were first proposed in 1979 [1] and Ontology for Pain-Related Mental Health and Quality of Life are since then regularly updated by the International (OPMQoL) which is being developed as part of the NIDCR- Association for the Study of Pain (IASP), in print for the last funded project R01DE021917 with the goal to integrate five time in 1994 [2], with more regular electronic updates on the datasets gathered in four different countries from patients IASP webpage [3], the last one May 2012 (subset in Table 1). suffering from one or other form of orofacial pain [4, 5]. These definitions, together with the IASP definition for It was hypothesized that alternatives could be found in the ‘pain’ as ‘an unpleasant sensory and emotional experience BioPortal of the National Center for Biomedical Ontology associated with actual or potential tissue damage, or described in terms of such damage’, suggest the hierarchy displayed in Fig. 1. IASP pain assessment terminology hierarchy Fig. 1 in which terms displayed in SMALL CAPS are the immediate superordinate terms found in the definitions and the arrows stand for the classical subsumption relation. Although the individual definitions follow the Aristotelian style ‘an A is a B which C’, the defined terms do not lead all together to a complete directed graph with an overarching top, not even if all 29 IASP terms would be included. Furthermore, the terms ‘allodynia’ and ‘hyperalgesia’ have superordinate terms which under their standard meanings should represent disjoined classes: although sensation and sensitivity are certainly related, nothing which is a kind of one can also be a kind of the other. 1 ICBO 2014 Proceedings TABLE II. SUMMARY ASSESSMENT OF TERMINOLOGICAL AND ONTOLOGICAL QUALITY OF THE SEARCH TERM RELATED BIOPORTAL CLASSES RETRIEVED Representational Artifact → SNOMEDCT SOPHARM WHO-ART COSTART RH-MESH ICD10CM MEDDRA NIFSTD GALEN ICPC2P CTCAE PHARE NDFRT OMIM CRISP MESH SYMP Totals DOID SNMI CSSO NCIT ICNP BDO RCD PDQ MP HP Assessment Parameter Norm AP1 IASP search terms covered 9 1 6 2 1 2 1 0 5 1 2 2 8 3 4 4 4 1 1 0 1 5 3 7 9 4 6 2 AP2 Number of direct class matches >8 1 6 2 1 2 1 1 5 1 2 8 9 7 4 5 5 1 1 1 1 5 11 7 14 4 6 2 113 Direct classes with wrong IASP AP3 0 1 2 1 3 2 9 synonymy AP4 Direct classes with definitions =AP2 2 1 2 1 4 7 4 5 3 1 6 36 Number of direct classes with AP5 0 6 6 5 17 inappropriate homonymy Number of additional direct AP6 0 1 3 1 2 3 13 classes through spelling variants AP7 Number of class matches >AP2 7 24 12 2 8 7 9 39 5 16 16 15 72 31 35 40 3 5 3 6 56 60 38 164 39 24 26 762 AP8 Foreign classes in hierarchy 0 8 7 5 1 1 5 7 6 40 Number of hierarchy classes AP9 0 7 4 2 4 1 6 15 11 1 3 4 5 2 49 24 60 with disjointness violations Evaluation Maximum number of norm violations 6 8 8 6 8 6 1 8 6 8 8 8 8 8 8 8 6 6 1 6 8 8 8 8 8 8 8 Number of norm violations (except P8) 4 4 3 2 3 3 1 4 3 3 6 3 5 2 3 5 3 5 1 4 6 5 5 5 3 2 5 (NCBO) [6] which contains to date 370 representational To assess the extent to which the search terms are artifacts with over 5.6 million classes. The objectives of the adequately covered in the individual BioPortal resources, and work reported on here were to assess (1) whether these in the BioPortal as a whole, well-known quality assessment resources offer a more adequate view on pain assessment criteria and recommendations – see results and discussions for terminology, and (2) to what extent the BioPortal is a useful details – for terminologies [10, 11] and ontologies [12] were instrument in determining whether (1) is indeed the case. used. To assess the adequacy of the backbone hierarchy within individual resources 7 disjoint collections of in total 10 high II. METHODOLOGY level groupings, inspired by the various preferred terms that were retrieved, were constructed: [Adverse event], [Body part], The nine terms – henceforth called ‘search terms’ – from Table [Discipline], [Disease, Disorder or Finding; NON-pain 1 were submitted to the on-line version of the BioPortal disorder; Pain / sensation finding], [Pharm. Effect / Endpoint], Annotator [7] thereby using the following annotator options: [Function / Process; Technique / Therapy] and [Meta / Top]. (1) ‘longest match only’ unselected, (2) manual mappings Each class (with disambiguation where required as for instance included, and (3) inclusion of all ancestors. With these options for ‘analgesia’) was classified into one of these groupings on thus set, the annotator returned for each search term ST in this the basis of its preferred term. Examples of classes labelled step one or more records, each such record containing (1) the Meta are classes with preferred terms such as Inactive Concept unique identifier of a class CL in relation to which ST was and Unclassified, whereas the Top labelling include classes found (2) the name of the representational artifact RA to which such as Snomed CT Concept and Topical descriptor [8]. CL belongs, (3) whether CL was retrieved on the basis of what the annotator qualifies as a ‘direct match’ between ST on the The adequacy of the mappings between directly matched one hand and a preferred term, synonym or identifier of CL on classes was assessed semi-automatically. Mapping records in the other hand, or on the basis of being – mostly within RA, but which the semantics of at least one of the classes could not be occasionally also within a representational artifact other than determined, were excluded. Records where only one of the RA – an ancestor of a class which matches directly, and (4) the classes was marked as being Meta, were automatically tagged preferred term PT of CL [8]. as obsolete. Records for which the preferred names of both classes were identical, except in the case of ‘analgesia’ given In a second step, all detailed terminological information its homonymous semantics, were automatically assigned as available for each CL matching directly was retrieved, being correct. All other cases were assessed manually. including a visualization of the subsumption graph and all the mappings – if any at all – of CL to classes in other representational artifacts within the BioPortal. The raw data III. RESULTS and analysis file is available as [8]. Mappings between classes Querying for the 9 search terms in the BioPortal Annotator from different representational artifacts are further qualified by exactly as displayed in Table 1 returned 762 annotation records the BioPortal as being the result of enjoying shared Concept of which 113 were about in total 104 candidate annotation Unique Identifiers (CUIs) from the Unified Medical Language classes labelled by the Annotator as ‘direct’ and which System (UMLS), and/or being automatically generated using originated from 27 different sources [8] out of the 371 total the Lexical OWL Ontology Matcher (LOOM), which generates artifacts at the time this work was performed. 17 annotation mappings based on lexical similarity of the preferred name and records revealed that in the ICPC2, RH-MeSH and SNOMED synonyms between pairs of ontologies [9]. CT some of the search terms matched directly to more than one 2 ICBO 2014 Proceedings class (Table 3, AP5 in Table 2) – thus reflecting homonymy, TABLE III. MAPPING OF SEARCH TERMS TO PREFERRED TERMS IN THE REPRESENTATIONAL ARTIFACTS while 9 records showed that some of the classes were mapped to by distinct search terms (AP3 in Table 2) – thus reflecting Representational Grand Total SNOMEDCT Occurrence SOPHARM synonymy for the terms involved within the context of that WHO-ART RH-MESH ICD10CM COSTART MEDDRA Artifact → NIFSTD ICPC2P NDFRT PHARE GALEN OMIM CTCAE MESH SYMP CRISP SNMI source. Ignoring capitalization, the 104 direct annotation Search Term DOID CSSO ICNP NCIT PDQ BDO RCD MP HP Preferred Term classes exhibited in total 25 distinct preferred terms. In Table 3 Allodynia 1 1 1 1 1 1 1 1 1 9 9 it is displayed how these preferred terms are related to the Allodynia Hyperalgesia 1 1 1 1 1 1 1 1 1 8 1 8 1 original search terms in each resource. Analgesia 1 1 1 1 1 1 1 1 1 1 1 1 2 1 1 1 17 16 Analgesia 1 1 1 1 1 1 2 1 1 10 9 225 additional candidate annotation records [8] were No sensitivity to pain 1 1 2 2 retrieved by querying for three of the spelling variants pain agnosia Analgesia [PE] 1 1 1 1 1 1 suggested by some of the retrieved preferred terms obtained by Hypalgesia 1 1 1 querying for the original search terms (Table 3): 77 for Pain Therapy Dysesthesia 1 1 1 1 1 1 1 1 2 2 8 2 7 hyperaesthesia, 76 for hypesthesia, and 72 for hypoaesthesia. Dysesthesia 1 1 1 1 1 2 7 6 These records reveal that these terms match directly with 14 Paresthesia Hyperalgesia 1 1 1 1 1 1 1 1 1 1 1 1 3 1 3 1 1 1 20 1 16 classes that were not matched with the original search terms, O/E - hyperesthesia present (& [hyperalgesia]) 2 2 1 thereby bringing ICD10 on board as extra representational Hyperalgesia [Disease/Finding] Hyperalgesia 1 1 1 1 1 1 1 1 1 1 3 1 1 1 1 1 16 1 14 artifact. These records are not included in any further analysis. HYPERAESTHESIA 1 1 1 649 annotation records were labelled by the Annotator as Hyperesthesia 1 1 1 4 1 1 1 3 1 1 1 16 11 Hyperesthesia [Disease/Finding] 1 1 1 containing hierarchical ancestors of the classes matched HYPERAESTHESIA 4 4 1 directly, totaling 206 distinct ancestor classes with together 169 Hyperesthesia 1 1 1 1 1 3 1 1 1 11 9 distinct preferred terms [8]. One class, labelled Hyperpathia Hyperalgesia 1 1 1 2 1 1 5 2 4 2 ‘UMLS:OrphanClass’ appeared in 40 records involving the 8 Hyperpathia 1 1 1 3 3 representational artifacts labeled ICPC2, MESH, NDFRT, Hypoalgesia HYPALGESIA 1 1 1 1 1 1 1 1 1 8 1 8 1 OMIM, PDQ, RCD, SNMI, and SNOMEDCT. 1036 mapping HYPOAESTHESIA 1 1 1 records were retrieved for all 104 classes matched directly to Hypoalgesia Hypoesthesia 1 1 1 1 1 1 1 1 1 1 1 1 1 2 1 6 10 6 9 the search terms, of which 71 duplicates, yielding 965 records Hypesthesia [Disease/Finding] 1 1 1 further analyzed [8]. 399 of those records required manual Reduced sensation of skin 1 1 1 Sensory impairment 1 1 1 assessment. Hypesthesia 1 1 1 3 3 Hypoesthesia 1 1 1 1 4 4 Paresthesia 1 1 1 1 1 4 1 1 1 1 1 3 1 1 1 20 15 A. Quality of BioPortal Resources Retrieved Paresthesia [Disease/Finding] 1 1 1 paraesthesia 4 4 1 Table 2 provides – with the exception of assessment parameter Paresthesia 1 1 1 1 1 1 1 1 1 3 1 1 1 15 13 AP8 – a summary assessment of the terminological and Grand Total 1 6 2 1 2 1 1 5 1 2 8 9 7 4 5 5 1 1 1 1 5 11 7 14 4 6 2 113 ontological quality of the classes (and by extension of the Occurrence 1 6 2 1 2 1 1 5 1 2 2 9 7 4 5 5 1 1 1 1 5 4 7 9 4 6 2 95 resources from which they originate) that were retrieved for the B. Adequacy of the NCBO BioPortal 9 search terms. Further details about certain aspects are available in Table 3 and Table 4. 9 APs are considered, and for Out of the 27 representational artifacts which have at least one each AP a norm is determined. Table 2 thus illustrates that: class with a direct match to a search term, 22 have classes which by the BioPortal are mapped to at least one other class x only SNOMED CT covers the 9 search terms in the from another artifact. 618 of these mappings are within these lexical form provided by the IASP (AP1), while 22 sources whereas 347 mappings are towards classes from 18 MeDDRA has complete coverage if lexical variants are target representational artifacts outside these sources. Of these taken into account (AP6), (it was not checked whether 18, MeDDRA and RH-MeSH are the only two that have resources contained atomic terms that through post- classes directly matched with the search terms, thus reflecting coordination would allow to express the terms), the BioPortal documentation that mappings are not always bidirectional. x 5 resources do not make the distinctions in terminology made by the IASP (AP3, details in Table 3), Table 5 quantifies the appropriateness of the mappings on the basis of our methodology. The ‘B’ and ‘T’ following the x 11 resources provide textual definitions for at least resource names in Table 5 indicate whether the resource some of the classes (AP2, AP4), exhibits mappings bi-directionally resp. only incoming. B- x 3 resources exhibit inappropriate homonymy for some mappings are only counted once in the totals. Mappings are of the search terms (AP5), qualified as being excluded (‘Excl.’) from the analysis because of either ambiguity or missing information on the side of the x more than half of the resources exhibit for at least some classes mapped to (‘T?’) or being in the realm of the 22 source of the search terms a hierarchy which on the basis of the classifications (’S?’). ‘Correct’ mappings result from (1) the face value of the preferred terms is composed of automatic assignment of the adequacy assessment for pairs of disjoint classes (AP9, details in Table 4), source and target classes with identical non-ambiguous preferred terms (’SAME’), and the manual verification of (2) x none of the representational artifacts cover the domain classes with synonymous preferred terms, i.e. lexical variants delineated by the IASP search terms adequately when or descriptions (‘VARIANT’) and (3) classes with ambiguous taking all assessment parameters into account. 3 ICBO 2014 Proceedings preferred terms. Erroneous mappings (‘ERROR’) are brought TABLE V. CORRECTNES OF DIRECT CLASS MAPPINGS about by (1) automatic determination of mapping to or from Error Correct Excl. inactive classes (‘OBSO’) and manual verification of (2a) mapping to or from classes with ambiguous meaning HOMONYM DISAMBIG. % WRONG VARIANT Representational (‘HOMONYM’), and (2b) inappropriate mappings between WRONG TOTAL Artifacts SAME classes with unambiguous meanings (‘WRONG’). Table 6 OBSO provides insight in the accuracy of the methods applied in the T? S? BioPortal to create mappings, i.e. whether on the basis of the ACGT-MO T 1 1 100 UMLS Concept Unique Identifiers (‘cui’), the LOOM AI-RHEUM T 1 1 100 BDO B 3 20 5 3 31 10.7 algorithm (‘loom’) or both. COSTART B 31 12 44 40 44 171 34.4 CRISP B 7 4 17 2 19 21 70 36.7 CSSO B 1 2 17 8 3 31 10.7 IV. DISCUSSION CTCAE B 4 19 4 4 31 14.8 During ‘The Consensus Workshop: Convergence on an GALEN B 5 9 1 7 6 1 3 32 50 HIMC-ICD09 T 2 9 11 100 Orofacial Pain Taxonomy’, held March 30 – April 1, 2009, HIMC-LOINC T 3 3 0 Miami, Florida, which was attended by representatives from all HL7 T 4 2 6 0 major pain institutions, it was concluded that an adequate HOM-CLINIC T 3 3 0 treatment of the ontology of pain together with an appropriate HOMERUN-UHC T 3 3 0 HP B 7 2 19 2 4 34 30 terminology, is mandatory to advance the state of the art in ICD10 T 1 2 3 6 16.7 diagnosis, treatment and prevention [13]. ICD10CM B 2 7 6 4 19 13.3 ICPC2P B 1 6 21 48 1 10 87 9.21 As a first step, it was proposed to study the terminology IFAR T 2 2 0 and ontology of pain as currently defined. The ontological LOINC T 6 3 9 0 MEDDRA T 15 122 137 100 aspects have since then been covered in [4], and the underlying MESH B 2 8 5 44 13 4 1 16 93 19.7 principles thereof been applied, for instance, in the definition MP B 6 6 13 29 8 6 1 7 76 36.8 of new pain-related disease entities and classifications [14, 15]. NCIT B 9 9 4 43 23 3 1 23 115 24.2 NCIt-Activity T 2 2 4 50 NDFRT B 5 10 8 60 1 1 28 113 27.4 TABLE IV. GROUPING OF THE SEARCH TERMS IN DISJOINT UPPER NDF-RT T 2 4 7 24 8 1 1 2 49 37 CLASSES IN THE HIERARCHY OF THE REPRESENTATIONAL ARTIFACTS NIFSTD B 2 2 17 4 2 27 16 OMIM B 3 3 18 9 2 35 18.2 Representational PDQ B 3 4 4 8 3 1 11 34 31.8 SNOMEDCT Grand Total SOPHARM WHO-ART RH-MESH ICD10CM B 2 11 8 1 3 25 61.9 COSTART Artifact → MEDDRA PHARE NIFSTD ICPC2P NDFRT PHARE GALEN OMIM CTCAE MESH SYMP CRISP T 2 1 3 0 SNMI Search Term DOID CSSO PMA ICNP NCIT PDQ BDO RCD MP HP Grouping RCD B 9 10 5 36 24 4 1 11 100 27.3 Allodynia 7 8 4 11 8 11 15 10 5 79 Disease or Finding 6 4 4 8 8 9 11 10 5 65 RH-MESH T 12 86 98 100 Function / Process 4 4 RPO T 2 2 4 50 Meta / Top 1 3 2 4 10 Analgesia 4 5 7 9 1 6 7 6 8 3 6 11 5 6 9 9 102 SNOMEDCT B 6 150 3 2 15 4 1 4 185 88.3 Body part 1 1 SOPHARM B 6 8 13 29 4 6 1 10 77 40.9 Discipline 3 3 Disease or Finding 3 2 3 1 1 1 7 2 9 3 3 9 9 53 SYMP B 7 14 55 17 18 111 22.6 NON-pain disorder 4 4 SYN T 2 2 4 50 Technique / Therapy Function / Process 3 6 1 3 3 2 2 9 11 TRAK T 2 1 3 0 Meta / Top 2 3 2 1 2 1 11 WHO-ART B 34 7 10 18 17 86 59.4 Pharm. Eff./Endpoint 1 6 1 2 10 Dysesthesia 4 9 1 11 8 5 22 60 Grand Total 78 148 41 241 177 23 19 238 965 37.7 Adverse event 1 1 % of mappings 27.67 45.70 26.63 Disease or Finding 2 8 1 8 8 2 13 42 Function / Process 2 2 Meta / Top 1 1 3 1 9 15 Hyperalgesia Body part 4 7 1 8 1 11 8 5 8 3 14 19 6 29 10 4 13 150 1 The analysis performed here is another response to the Discipline Disease or Finding 4 3 3 4 1 8 8 5 6 1 12 19 3 15 10 4 1 103 4 workshop’s recommendations with the goal to obtain more NON-pain disorder Function / Process 4 2 12 12 6 insight in how pain assessment terminology is dealt with in Meta / Top Hyperesthesia 4 8 5 3 8 1 11 2 2 8 2 1 14 18 5 14 4 24 86 representational artifacts such as widely used classification Body part 1 1 systems, terminologies, and ontologies. At the same time, it Disease or Finding 3 7 4 4 1 8 6 18 2 10 4 67 Function / Process 2 2 provided an opportunity to assess the usability of the NCBO Meta / Top Hyperpathia 1 1 4 4 3 2 10 1 4 6 23 16 43 BioPortal for a task of this nature, and the appropriateness of Disease or Finding Function / Process 4 8 3 14 2 29 2 the principles and methods applied in the BioPortal to present a Meta / Top Hypoalgesia 4 1 8 2 10 1 9 17 10 4 13 67 12 unified, highly standardized and ontology-like view on Body part Disease or Finding 1 3 1 8 8 13 10 4 1 48 1 resources which are qua structure and underlying design NON-pain disorder 12 12 principles very different. Meta / Top 2 4 6 Hypoesthesia 4 7 1 11 8 8 5 20 4 68 Body part 1 1 Disease or Finding 3 6 1 8 8 6 2 11 4 49 A. Are Resources in the BioPortal intrinsically flawed Function / Process 2 2 Meta / Top Paresthesia 7 4 2 4 1 8 3 8 1 11 2 8 8 5 1 9 18 5 15 3 16 107 As can be inferred from Table 2 and Table 3, all retrieved Adverse event Body part 2 1 2 1 4 resources, with – at first sight – the exception of MeDDRA and Discipline 1 1 SNOMED CT, seem to perform quite poorly in terms of Disease or Finding 6 2 2 2 7 4 1 8 8 6 1 18 2 11 3 81 Function / Process 2 2 coverage of the domain. Of course, some resources might have Meta / Top Grand Total 1 1 1 4 3 2 1 7 24 12 2 8 7 9 39 5 16 16 15 72 31 35 40 3 5 3 1 4 18 6 56 60 38 164 39 24 26 762 been designed with a specific purpose in mind and pain 4 ICBO 2014 Proceedings assessment terminology therefor being out of their scope. It is TABLE VI. MAPPING SOURCES however hard to imagine for what sort of purpose a term such Result cui cui, loom loom Grand Total as paresthesia might be relevant and dysesthesia not: if one is Error 29 20 218 267 present, all should be present. An exception is analgesia in the WRONG 5 4 69 78 sense of a procedure rather than of a symptom: there would OBSO 24 16 108 148 indeed be no place for any of the other terms in procedure HOMONYM 41 41 terminologies. Although there are indeed a few resources Correct 50 23 368 441 retrieved for which analgesia is the only term matched, these SAME 2 9 230 241 VARIANT 48 14 115 177 resources are not restricted to procedures. Some resources turn DISAMBIG. 23 23 out to exhibit a better coverage when spelling variants are used Excluded 22 17 218 257 in the queries, but not to the extent that it can explain the S? 31 31 overall lack of coverage. T? 22 17 187 226 Grand Total 101 60 804 965 Some resources, such as COSTART, MeSH and WHO- % Wrong 36.71 46.51 37.20 37.71 ART, suffer from the lack of discrimination between terms in pairs such as hypoalgesia/hypesthesia, hyperalgesia/hyper- esthesia, dysesthesia/paresthesia and analgesia/hypoalgesia. First there is the observation that through the mappings, 16 This was also found in SNOMED CT but only for classes that additional resources were discovered that contain classes which were labelled ‘inactive’ thus reflecting that these mistakes map directly to classes which were retrieved by means of the made in earlier versions were corrected afterwards. search terms. This can in part be explained by the absence of the search terms in the synonym set of these additional classes, 15 resources exhibit through the eyes of the BioPortal a but upon further inspection, it turns out that in case of in total backbone structure which at least can be frowned upon (Table 255 mappings for RH-MeSH and MeDDRA, as well as for 4). How can analgesia be a kind of nervous system (possible) resources which according to the syntax of the URIs (COSTART), communication disorder (DOID - Human of the classes mapped to might be named ‘HOMERUN-UHC’, Disease Ontology), or pharmacogenomics (PHARE)? How can ‘HOM-CLINIC’, ‘HIMC-LOINC’ and ‘HIMC-ICD09’, the paresthesia be a kind of peripheral nervous system (OMIM), URIs returned by the annotator do not resolve at all [8]. The hyperalgesia a kind of adrenal adenoma (WHO-ART) or former 4 resources are also not listed on the BioPortal webpage neuroscience (CRISP)? One can assume sloppy design on the as being resources it contains, yet classes from them show up side of the authors of these resources, or violation of the in the mapping results. In case of SNOMED CT, mappings are principle that preferred terms should have face validity [10]: primarily involving classes which are marked as ‘inactive’. thus in COSTART ‘nervous system’ might not mean nervous system, but rather symptom related to the nervous system. Or, A second observation is that – after excluding these 255 and this leads to the next section, perhaps the BioPortal mappings as well as two others for which the meaning of the represents the structure of these resources erroneously? source class could not be disambiguated – still almost 38% of the mappings are inaccurate. There is no significant difference B. Is the BioPortal itself, or are some design or quality in accuracy between mappings produced using LOOM or UMLS CUIs alone. However, when both the LOOM and CUI- assurrance principles behind it, intrinsically flawed? methods suggest a mapping, the error rate increases to over That something wasn’t right with the representation of WHO- 46%, thus almost the equivalent of flipping a coin. ART in the BioPortal was noted by Ruttenberg in 2011 and as such acknowledged by BioPortal staff who traced the issue C. Limitations down to be caused by the WHO-ART source codes, but nevertheless decided nothing to do about it at that time [16]. The work reported on here bears certain limitations. Although And apparently never since: the version of WHO-ART that the data demonstrate (1) that the domain of pain assessment showed up in the work reported about in this paper was version terminology is poorly covered in the BioPortal resources, (2) ‘2013AB’ which was uploaded to the BioPortal, according to that the way in which the BioPortal organizes the retrieved the summary page, February 18, 2014, indeed without any classes hierarchically using the subclass relation is debatable, attention to the known issues. The data presented here and (3) that the techniques used to map these classes between demonstrate further that it is not just WHO-ART of which the resources are not quite adequate, no generalizations can be representation in the BioPortal is problematic with respect to made to other domains. A further limitation is that the data the semantics of the subclass relationship, but also 14 other were retrieved using the BioPortal website rather than the resources that were retrieved on the basis of the search terms REST services. Perhaps these services offer better ways to (Table 2, AP9). filter inadequate data, but if that were the case, one could wonder why such filters are not used on the website. Another indication that the BioPortal could benefit from some quality assurance introspection comes from the finding Assessment of the correctness of the suggested hierarchy that for 8 of the 27 resources retrieved the Annotator returned and the mappings was carried out with the quality criteria of ‘UMLS:OrphanClass’ as ancestor for 40 of the classes matched the OBO Foundry and adherence to the principles of directly (Table 2, AP8). Ontological Realism in mind, neither of which are universally accepted [17] yet gaining considerable attraction [18]. Thus it Also the mapping results provide serious evidence in the is quite conceivable that reviewers outside the Foundry would direction that quality improvement is required. report lower error rates, for instance by finding it perfectly 5 ICBO 2014 Proceedings acceptable that the ‘concept’ of analgesia as a pharmaceutical ACKNOWLEDGMENT effect in some drug is considered equivalent to the ‘concept’ of The work described is funded in part by grant 1R01DE021917- analgesia as a procedure performed by an anesthesiologist or as 01A1 from the National Institute of Dental and Craniofacial a state of a patient brought about by such procedure. At the Research (NIDCR). The content of this paper is the other hand, since the review here was based by first flagging responsibility of the author and does not necessarily represent results that for sure require manual evaluation (see the official views of the NIDCR or the NIH. methodology) it might very well be that certain mapping- or ancestor records were erroneously not flagged. In that sense, the error rates presented here could very well be – modulo REFERENCES mistakes made by sloppiness of the reviewer – the best case [1] IASP Subcommittee on Taxonomy. Pain terms: a list with definitions scenario. Another limitation is that this study does point out the and notes on usage. Pain. 1979;6(3):249-52. kind of mistakes and how to find them semi-automatically, but [2] Merskey H, Bogduk N. Classifications of chronic pain: Description of is not conclusive on whether the root cause is in the source chronic pain syndromes and definition of pain terms. Report by the International Association for the Study of Pain Task Force on systems, the BioPortal, or a combination of both. Taxonomy. Seattle: IASP Press; 1994. [3] International Association for the Study of Pain. IASP Pain Taxonomy. V. CONCLUSION AND RECOMMENDATIONS 2012; Available from: http://www.iasp-pain.org/Content/Navigation Menu/ GeneralResourceLinks/PainDefinitions/default.htm. Without doubt, studies such as this one could not be carried out [4] Smith B, Ceusters W, Goldberg LJ, Ohrbach R. Towards an Ontology of without a resource such as the BioPortal, or would require a lot Pain. In: Okada M, editor. Proceedings of the Conference on Logic and more time and effort. Evenly without doubt, the BioPortal Ontology. Tokyo: Keio University Press; 2011. p. 23-32. made it possible to reach the objectives of this study which [5] Ceusters W. An information artifact ontology perspective on data were to find out (1) whether the sources in the BioPortal collections and associated representational artifacts. Stud Health provide a more adequate view on pain assessment terminology Technol Inform. 2012;180:68-72. – the answer being no, and (2) to what extent the BioPortal [6] Noy NF, Shah NH, Whetzel PL, Dai B, Dorf M, Griffith N, et al. BioPortal: ontologies and integrated data resources at the click of a itself is a useful instrument in determining whether (1) is mouse. Nucleic Acids Research. 2009 July;1:37. indeed the case – the answer being yes. As a side effect, this [7] Jonquet C, Shah NH, Musen MA. The open biomedical annotator. study raises serious questions about the quality assurance Summit on translational bioinformatics. 2009;2009:56-60. principles employed in the design and management of the [8] Ceusters W. Supplementary data to Pain Assessment Terminology in the BioPortal, more specifically (1) about the quality of the NCBO BioPortal: Evaluation and Recommendations.2014: Available resources the BioPortal accepts for inclusion – it might seem from: www.referent-tracking.com/SupplementarydatatoICBO2014.docx. unfair to criticize a lack of clear best practice policies in the [9] Whetzel PL, Team N. NCBO Technology: Powering semantically aware investigated resources while not distinguishing their different applications. J Biomed Semantics. 2013 Apr 15;4 Suppl 1:S8. semantic expressivity, the point being however that the [10] Cimino JJ. Desiderata for controlled medical vocabularies in the twenty- BioPortal itself does not allow for such distinctions and first century. Methods of Information in Medicine. 1998;37(4-5):394- 403. ‘promotes’ all resources as ontologies, (2) the suitability of representing the hierarchy of these resources by means of the [11] Cimino JJ. In Defense of the desiderata. Journal of Biomedical Informatics. 2006;39(3):299-306. subclass relation, and (3) about certain house-keeping [12] Schulz S, Jansen L. Formal ontologies in biomedical knowledge operations. Quality seems thus far not to have been much of a representation. Yearbook of medical informatics. 2013;8(1):132-46. concern to the BioPortal scientific community, as witnessed by [13] Ohrbach R, List T, Goulet J, Svensson P. Recommendations from the the presence of only one paper in Pubmed that addresses the International Consensus Workshop: Convergence on an Orofacial Pain topic [19]. Furthermore, although the BioPortal does indeed Taxonomy. Journal of Oral Rehabilitation. 2010. offer a mechanism to users to make notes on the quality of [14] Nixdorf D, Drangsholt M, Ettlin D, Gaul C, de Leeuw R, Svensson P, et BioPortal content [6], it doesn’t seem to be used much: the al. Classifying orofacial pains: a new proposal of taxonomy based on BioPortal homepage displays a list of the 5 last notes ontology. Journal of Oral Rehabilitation. 2012;39(3):161-9. submitted, of which the last three were submitted 7 months [15] Schiffman E, Ohrbach R, Truelove E, Look J, Anderson G, Goulet J-P, et al. Diagnostic Criteria for Temporomandibular Disorders (DC/TMD) prior to writing this paper, all three about a ‘request’ issued by for Clinical and Research Applications: Recommendations of the user rboden – noted in the name of ‘Jesus’ as contact person – International RDC/TMD Consortium Network and Orofacial Pain to add the following new term ‘We need someone with Special Interest Group. Journal of Oral and Facial Pain and Headache. qualifications’. It is a bad sign that spam of this kind, whether 2014;28(1):6-27. unnoticed or noted but not acted upon, is accepted. [16] Ruttenberg A, Ferguson R. Something amiss in translation of WHOART. 2011 [cited 2014 April 15]; Available from: For the BioPortal to become an instrument which is useful https://mailman.stanford.edu/pipermail/bioontology-support/2011- for other purposes than determining that its content is of poor April/003124.html. quality the following suggestions are in order: (1) do not accept [17] Brochhausen M, Burgun-Parenthoine A, Ceusters W, Hasman A, Leong resources that violate standard subsumption principles, (2) TY, Musen M, et al. Discussion of “Biomedical Ontologies: Toward Scientific Debate”. Meth Information in Medicine. 2011;50(3):217-36. display for each resource quality metrics, rather than mere quantity metrics, for instance the extent to which they follow [18] Schulz S, Balkanyi L, Cornet R, Bodenreider O. From concept representations to ontologies: A paradigm shift in health informatics? the principles of ontological realism or the OBO Foundry, and Healthcare Informatics Research. 2013;19(4):235-42. (3) provide better documentation about the methods and [19] He Z, Ochs C, Agrawal A, Perl Y, Zeginis D, Tarabanis K, et al. A algorithms used to present hierarchies and mappings, and about family-based framework for supporting quality assurance of biomedical the internal quality assurance principles. ontologies in BioPortal. AMIA Symp proceedings. 2013;2013:581-90. 6