=Paper=
{{Paper
|id=None
|storemode=property
|title=Comparing pharmacologic classes in NDF-RT and SNOMED CT
|pdfUrl=https://ceur-ws.org/Vol-714/ShortPaper04_Mortensen.pdf
|volume=Vol-714
|dblpUrl=https://dblp.org/rec/conf/smbm/MortensenB10
}}
==Comparing pharmacologic classes in NDF-RT and SNOMED CT==
Comparing Pharmacologic Classes in NDF-RT and
SNOMED CT
Jonathan Mortensen1 , Olivier Bodenreider∗2
1 Department of Biomedical Engineering, Case Western Reserve University, Cleveland, Ohio, USA
2 National Library of Medicine, Bethesda, Maryland, USA
Email: Jonathan Mortensen - Jonathan.Mortensen@case.edu; Olivier Bodenreider∗ - olivier@nlm.nih.gov;
∗ Corresponding author
Abstract
Background: Clinical decision support systems and semantic mining require interoperable representations of
pharmacologic classes across reference terminological systems. We explore two such systems: NDF-RT and
SNOMED CT.
Methods: We evaluate the overlap of pharmacologic classes in NDF-RT (VA Classes) and SNOMED CT. We
compare classes based on the set of their members (drugs) across systems, using the Jaccard coefficient as a
measure of overlap between two classes.
Results: There is a limited overlap among the two systems. The average Jaccard value is 0.293. Only 11.5% of
the VA classes have a Jaccard value of 0.75 or above.
Conclusions: The analysis of discrepancies between pharmacologic classes across systems offers a strategy for
identifying classes in need of critical review. Due to the heterogeneity of the representation of pharmacologic
classes in various terminologies, we recommend that drugs, not classes, be annotated in text for semantic mining
purposes.
Introduction therapeutic properties of some drugs on angina
pectoris. Some classes are also defined in reference to
Pharmacologic classes are typically established in several properties, e.g., nitrate vasodilator, referring
reference to some of the properties of the active to both the chemical structure of nitrates and their
moiety, with respect to chemistry, physiology, relaxing action on the musculature of blood vessels
metabolism and therapeutic intent [1]. For example, (physiologic effect). In other words, pharmacologic
the classes platelet aggregation inhibitors and classes provide an abstract representation of drug
anticoagulants refer to the physiologic effect of drugs properties, useful in the context of clinical decision
decreasing platelet aggregation and coagulation, support and for the annotation of biomedical
respectively. In contrast, the class cardiac glycoside resources, including clinical text and the biomedical
refers to the chemical structure of drugs such as literature.
digoxin, while the class antianginal refers to the
116
While interoperability among terminologies is a a basic hierarchy. A drug generally belongs to
requirement for clinical decision support, in which only one class. Examples of VA classes include
decision support rules are defined in reference to ANTIMALARIALS, of which the clinical drug
concepts in various terminologies (e.g., concepts for QUININE SO4 162.5MG TAB is a member. Its
drug classes), it is also important that annotations to parent class is ANTIPROTOZOALS. In addition,
biomedical entities such as drug classes be consistent there are 425 EPCs (not used in this work). Differing
within and across datasets when such datasets are from the VA Classes, the EPCs have a nearly flat
exchanged and integrated, as these annotations form hierarchy and are defined in reference to various
the basis for knowledge discovery through semantic properties, such as physiologic effect, therapeutic
mining. intent, ingredient and mechanism of action.
The National Drug File - Reference Terminology The July 11th 2010 Version of NDF-RT was used
(NDF-RT) is a drug terminology produced by in the evaluation.
the Department of Veterans Affairs in the United SNOMED CT is currently the largest clinical
States and is recommended as the standard in e- terminology. It is developed and maintained
prescribing systems [2]. Other clinical terminologies by the International Health Terminology Standard
such as SNOMED CT also include pharmacologic Development Organization (IHTSDO) [4]. In
information. SNOMED CT, the drugs are simply related to
The objective of this work is to evaluate the pharmacologic classes through the isa relationship.
degree to which annotations to drug classes in For example, there is an isa relationship between the
various terminological systems are interoperable, drug Quinine and the class Cinchona antimalarial.
with a focus on pharmacologic classes from NDF- The January 31, 2010 Version of SNOMED CT was
RT. More specifically, we evaluate the overlap of used in the study.
VA classes to those in SNOMED CT. The analysis Related Work. Others have examined many
of the classes reveals discrepancies between the two aspects of NDF-RT. [5] investigated the coverage
systems and offers a strategy for identifying classes of the Physiologic Effects hierarchy in NDF-
in need of critical review. RT. It was found that the physiologic effects
category was sufficient for classifying medications.
[6] investigated the addition of pharmacogenomics
into the hierarchy. [7] applied NDF-RT to mapping
Background text from medication lists at the Mayo Clinic
In this section, we give a brief presentation of NDF- using the SmartAccess Vocabulary Server. NDF-
RT and SNOMED CT and present some related RT covered 97.8% of the concepts found in the
work on NDF-RT. medication lists, indicating NDF-RT can be used
The National Drug File - Reference Ter- in a clinical setting for medication purposes. [8]
minology (NDF-RT) is based upon the National compared NDF-RT to the National Drug File,
Drug File, a listing of medications produced by Medicare Part D and a proprietary knowledge base.
the Department of Veteran Affairs [3]. It serves It was determined that 76% of the classes from the
as a reference standard for a variety of medical three original terminologies were contained in NDF-
situations related to drugs and medications. NDF- RT. In recent work, [9] evaluated the correspondence
RT is a description logic-based model available in of NDF-RT drugs and classes to RxNorm drugs and
OWL and XML formats. It includes 9 “Kinds” classes. As of October 2009, approximately 50% of
of information: Cellular or Molecular Interac- the drugs did not correspond between terminologies.
tions, Clinical Kinetics, Diseases Manifestations or [10] mapped medications to diseases, showing a
Physiologic States, Pharmaceutical Preparations, clear example of how NDF-RT can be applied in
Physiological Effects, RxNorm Dose Forms, Ther- clinical decision support situations. As another
apeutic Categories, and VA Drug Interactions. The example of clinical applications [11] integrated NDF-
Pharmaceutical Preparations hierarchy organizes RT into the process of generating structured product
drugs into three categories: Products by Generic labeling. Finally, [12] used the NDF-RT drug classes
Ingredient Combination, Products by VA Class and to determine the anti-coagulation status of patients
External Pharmacologic Classes (EPC). based on their medication list, demonstrating a first
There are 485 VA Drug Classes organized into step in clinical decision support.
117
Our study focuses not on content coverage, but between NDF-RT and SNOMED CT is used to
rather on interoperability among systems of drug summarize the external comparison and determine
classes in various terminologies, including NDF-RT the overall similarity between the two class systems.
and SNOMED CT. These terminologies were chosen To obtain an extension, the clinical drug
as a reference because they contain drug hierarchies, members of a drug class were obtained. As opposed
are mature, and are widely used. More specifically, to VA (where drug classes are linked directly to
we want to assess whether similar sets of drugs are clinical drugs), in SNOMED CT, the ingredients of
linked to the same classes in different systems. a class were first obtained, then the clinical drugs
As part of the evaluation, we use a concept for those ingredients were obtained using relations
alignment technique described by [13]. NDF-RT was in NDF-RT, thus keeping the domain of clinical
loaded into a Virtuoso endpoint [14] for SPARQL drugs limited to only NDF-RT. In addition, the drug
querying, which allowed for evaluation of the drug members of a class included its drugs and all drugs
classes. which were members of any subclasses. For example,
the clinical drug QUININE SO4 260MG TAB is
linked directly to the VA class ANTIMALARIALS,
Methods but is also considered a member of the its
To evaluate the drug classes in NDF-RT, we parent class ANTIPROTOZOALS. Using the drug
developed an extensional method of evaluation, members, the drug member intersection was found,
comparing between VA classes and SNOMED CT comparing the extension of the VA classes to the
drug classes. Instead of comparing pharmacologic SNOMED CT classes.
classes based on lexical resemblance of their names,
we compare the extensions of these classes.
The extension of a pharmacologic class is the set Results
of drugs a class has as members. The degree to which
There are 485 VA and 722 SNOMED CT classes.
any two drug classes are similar was determined by
To reduce comparisons (and noise), classes which
the overlap of their extensions. This is measured by
did not have any drug members were removed.
the Jaccard Coefficient,
There were 95 VA (20%) and 195 SNOMED CT
|A ∩ B| (27%) classes without members. Examples of classes
J(A, B) = , with no members include INVESTIGATIONAL
|A ∪ B|
ANTI-TUBERCULAR DRUGS (VA), ANTIFUN-
where the intersection is the number of drugs which GALS,TOPICAL OTIC (VA), Antineoplastic alka-
are the same between any two classes and the union loid (SNOMED CT) and Corticosteroids used in the
is the total number of drugs between any two drug treatment of asthma (SNOMED CT).
classes [15]. An example of extension is presented in Among the 15,027 clinical drugs in NDF-RT,
Figure 1a. Here, the VA class THROMBOLYTICS 8414 correspond to ingredients also present in
and the SNOMED CT class Thrombolytic share SNOMED CT. Examples of clinical drugs specific
6 drugs, including streptokinase, while the drug to NDF-RT include medicinal products from classes
drotrecogin is specific to the SNOMED CT class. such as HERBS/ALTERNATIVE THERAPIES
The Jaccard value is computed as the cardinality (e.g.,WILD CHERRY BARK PWDR).
of the intersection (6) over that of the union of the The extensional comparison was obtained by
classes (7), i.e., 0.86. (In actuality, the classes are calculating the overlap between the sets of drug
compared, not based on the ingredients, but based members of class pairs and can be summarized by
on the clinical drugs they have as members. The the average Jaccard coefficient for all class pairs
corresponding ingredients are shown in Figure 1a for between NDF-RT and SNOMED CT. Through their
brevity.) average Jaccard value, pairs of pharmacologic class
The extension of each VA class is compared to systems can be compared for their overall similarity.
that of every class in SNOMED CT. For a given VA The average Jaccard value is 0.293, indicating
class, the SNOMED CT class for which the highest limited overlap overall between drug extensions
Jaccard value is found is selected as the best match. across the two class systems.
The average Jaccard of the pairwise comparisons The distribution of the average highest Jaccard
118
(a) (b)
Figure 1: (a) Comparison of the extensions of the VA class THROMBOLYTICS and the SNOMED CT class
Thrombolytic, (b) Distribution of Jaccard (highest per class)
value for the VA classes is shown in Figure hydroxide. Conversely, solutions of glycerol are
1b. Very few classes exhibit complete overlap classified as osmotic laxatives in SNOMED CT, but
(Jaccard = 1.0). Examples include the VA class as LAXATIVES, RECTAL in NDF-RT.
DIRECT RENIN INHIBITOR and the SNOMED
CT class Renin Inhibitor. This particular class
contains the clinical drugs corresponding to only Discussion
one ingredient, aliskiren. The proportion of VA Overlap among Class Systems
classes with a Jaccard value of 0.75 or above is
The similarity among the two pharmacologic class
11.5%. For example, the Jaccard value for the
systems under investigation (VA and SNOMED
overlap between the VA class THROMBOLYTICS
CT) is relatively limited. The average Jaccard
and the SNOMED CT class thrombolytic is 0.86. As
values among classes based on shared drugs is
shown in Figure 1a, the clinical drugs corresponding
0.293. The following reasons can be proposed as
to six ingredients are common to both the VA class
an explanation, in addition to sheer differences
and the SNOMED CT class. These ingredients
in classification and discrepancies illustrated in
are alteplase, anistreplase, reteplase, streptokinase,
the section above. In some cases, there is no
tenecteplase and urokinase. Additionally, SNOMED
equivalent class in SNOMED CT for a given
CT also lists drotrecogin as a member of the class
VA class, especially for high-level aggregation
thrombolytic, although the indications for this drug
classes (e.g., BLOODPRODUCTS / MODIFIERS
seem to be limited to severe sepsis.
/ VOLUME EXPANDERS ), residual classes (e.g.,
Finally, 75.6% of the VA classes have a CARDIOVASCULAR AGENTS, OTHERS ) and
Jaccard value lower than 0.5. For example, classes specific to topical forms (e.g., BETA-
the Jaccard value for the overlap between the BLOCKERS, TOPICAL OPHTALMIC ). Another
VA class HYPEROSMOTIC LAXATIVES and the reason is that partially overlapping classes are
SNOMED CT class osmotic laxatives is only 0.16. defined using different classificatory criteria. For
While clinical drugs corresponding to the ingredients example, ophtalmic forms of beta-blockers such
lactulose and magnesium sulfate are common to as TIMOLOL MALEATE 0.5% GEL, OPH are
both classes, many clinical drugs found in the VA classified as BETA-BLOCKERS, TOPICAL OPH-
class are not in the SNOMED CT class (e.g., other TALMIC in NDF-RT and as anti glaucoma agent
magnesium salts such as magnesium biphosphate in SNOMED CT. While the former class only
and magnesium hydroxide). Interestingly, clinical contains beta-blockers, the latter includes a wider
drugs corresponding to magnesium hydroxide are range of products (e.g., apraclonidine). Another
part of a different SNOMED CT class, saline difference between the two class systems is that
119
the pharmacologic class is a property of the clinical corresponding classes can be added automatically
drug for the VA classes, whereas it is inherited in reference to the most useful pharmacologic
through the ingredient for SNOMED CT classes. class system in a particular context. Annotations
For example, injectable forms of acetylcysteine to another pharmacologic class system can be
are classified as ANTIDOTES/DETERRENTS, recomputed from the ingredients in case of reuse of
OTHERS in NDF-RT, while topical solutions (e.g., these resources for a different purpose.
for inhalation) are classified as MUCOLYTICS. In
contrast, all forms of this drug are classified as
both drugs used in the treatment of paracetamol
poisoning and mucolytic agent in SNOMED CT.
Finally, we also found a limited number of errors,
such as the classification of the antibacterial drug Limitations and Future Work
NORFLOXACIN 0.3% SOLN, OPH as BETA-
BLOCKERS, TOPICAL OPHTALMIC in NDF- There are a few limitations to this work. The
RT. evaluation was only a quantitative evaluation,
comparing the two terminologies. It was not an
evaluation of the clinical quality or the use of NDF-
Classes without Drug Members RT in a clinical situation. In addition, the domain
of drugs used in the comparison was only clinical
One particular difference between the two pharma-
drugs in NDF-RT, as we assumed the clinical drugs
cologic class systems is the number of classes for
to be complete. No comparisons were done at
which there is no drug in NDF-RT. (These classes
the ingredient level. Because of this, we obtained
were omitted from our statistics). These differences
all the NDF-RT clinical drugs of an ingredient
have different causes in different systems. For VA
from the terminology. Some classes that have no
classes, most classes with no drugs correspond to
drugs members may have had ingredients; however,
investigational drugs. In contrast, in SNOMED
these ingredients were either not present in NDF-
CT, such classes correspond essentially to classes for
RT or they did not have clinical drugs associated
which the corresponding medicinal products are out
with them, resulting in the class not having drug
of the scope of NDF-RT, including blood products
members.
(e.g., Red cells - irradiated ), dietary products (e.g.,
Gluten free food product) and various prescribable For a complete evaluation of NDF-RT, the
entities (e.g., Sterile maggots). external pharmacologic classes (EPCs) will be
included in future work. To leverage these classes,
they first must be enriched with drugs. A technique
Consequences for Semantic Mining to do such an operation has been piloted by [12],
As the sets of drugs available in terminological which utilizes a description logics based classifier to
systems vary considerably across systems, with classify drugs into EPCs.
minimal overlap among them, annotation of the
In addition to the extensional approach used in
literature directly with classes from a given system
this study, we would like to explore an intensional
is likely to result in annotated datasets that will
approach to comparing the classes, leveraging
not be interoperable, and whose annotations will be
synonymy relations in the Unified Medical Language
difficult to reconcile. Even if some terminologies
System (UMLS). In practice, classes across systems
such as SNOMED CT and NDF-RT tend to
could be mapped through the UMLS and the
provide good coverage of clinical drugs, their overlap
extensions of equivalent classes could be compared.
with other terminologies in terms of pharmacologic
classes remains limited. Finally, this work may be considered a class
In practice, a better option for semantic mining is centric approach, focused around drugs associated
to annotate drugs rather than pharmacologic classes. with classes. Future work will include a drug centric
Drugs names are relatively standard (at least at approach, which focuses on classes associated with
the ingredient level) and integration resources such drugs. More specifically, we will study the set of
as RxNorm are already available. Once resources pharmacologic classes associated with a given drug
have been annotated at the ingredient level, the in different pharmacologic class systems.
120
Conclusions 3. Lincoln MJ, Brown SH, Nguyen V, Cromwell T, Carter J,
By using an automated method of comparing Erlbaum M, Tuttle M: US Department of Veterans
Affairs enterprise reference terminology strategic
classes using drug class extensions, inconsistencies overview. Medinfo 2004, 11(Pt 1):391–395.
between terminologies were discovered. These
4. SNOMED CT (Systematized Nomenclature of
inconsistencies serve as an indicator for possible Medicine-Clinical Terms)[http://www.ihtsdo.org/
review. The automated method of pairwise class snomed-ct/].
member comparison complements standard lexical 5. Rosenbloom ST, Awad J, Speroff T, Elkin PL, Rothman
matching and can serve as an additional quality R, III AS, Peterson J, Bauer BA, Wahner-Roedler DL,
assurance tool for terminologies. This methodology Lee M, et al.: Adequacy of representation of the
National Drug File Reference Terminology Phys-
sets a framework for pairwise comparison of iologic Effects reference hierarchy for commonly
drug classes between terminological systems using prescribed medications. In AMIA Annu Symp Proc
only their drug members. Finally, due to the 2003:569–78.
heterogeneity of the representation of pharmacologic 6. Chute CG, Carter JS, Tuttle MS, Haber M, Brown
classes in various terminologies, we recommend that SH: Integrating pharmacokinetics knowledge into
a drug ontology as an extension to support
drugs, not classes, be annotated in text for semantic pharmacogenomics. In AMIA Annu Symp Proc
mining purposes. 2003:170–4.
7. Brown SH, Elkin PL, Rosenbloom ST, Husser C, Bauer
BA, Lincoln MJ, Carter J, Erlbaum M, Tuttle MS: VA
Competing interests National Drug File Reference Terminology: a
cross-institutional content coverage study. Medinfo
The authors declare that they have no competing 2004, 11(Pt 1):477–81.
interests.
8. Carter JS, Brown SH, Bauer BA, Elkin PL, Erlbaum
MS, Froehling DA, Lincoln MJ, Rosenbloom ST, Wahner-
Roedler DL, Tuttle MS: Categorical information in
Authors’ contributions pharmaceutical terminologies. In AMIA Annu Symp
Proc 2006:116–20.
Jonathan Mortensen and Olivier Bodenreider con-
9. Pathak J, Chute CG: Analyzing categorical informa-
ceived and designed the study. Jonathan Mortensen tion in two publicly available drug terminologies:
acquired the data and performed the analysis and RxNorm and NDF-RT. Journal of the American
interpretation of the data. Both authors contributed Medical Informatics Association 2010, 17(4):432–439.
to the redaction of the manuscript and approved its 10. Burton MM, Simonaitis L, Schadow G: Medication
final version. and Indication Linkage: A Practical Therapy for
the Problem List? In AMIA Annual Symposium
Proceedings 2008:86–90.
11. Schadow G: Structured product labeling improves
Acknowledgements detection of drug-intolerance issues. Journal of
This research was supported in part by the Intramural the American Medical Informatics Association 2009,
Research Program of the National Institutes of Health 16(2):211–219.
(NIH), National Library of Medicine (NLM) in addition 12. Bodenreider O, Mougin F, Burgun A: Automatic
to the Choose Ohio First Scholarship Program. determination of anticoagulation status with
NDF-RT. In Proceedings of the 13th ISMB’2010 SIG
meeting ”Bio-ontologies” 2010:140–143.
13. Bodenreider O, Burgun A: Aligning knowledge
References sources in the UMLS: methods, quantitative
1. Carter JS, Brown SH, Erlbaum MS, Gregg W, Elkin results, and applications. In Stud Health Technol
PL, Speroff T, Tuttle MS: Initializing the VA Inform, Volume 107(Pt 1) 2004:327–31.
medication reference terminology using UMLS
metathesaurus co-occurrences. In Proceedings of the 14. Virtuoso Universal Server[http://virtuoso.
AMIA Symposium 2002:116–20. openlinksw.com/].
2. Miller RA: Clinical Decision Support and Elec- 15. Jaccard P: Étude comparative de la distribution
tronic Prescribing Systems: A Time for Respon- florale dans une portion des Alpes et des Jura.
sible Thought and Action. Journal of the American Bulletin de la Société Vaudoise des Sciences Naturelles
Medical Informatics Association 2005, 12(4):403–409. 1901, (37):547–579.
121