<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Essential SNOMED: Simplifying SNOMED CT and Supporting Integration with Health Information Models</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Peter MacIsaac</string-name>
          <email>peter@macisaacinformatics.org</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>MB.BS</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>FRACGP</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Don Walker</string-name>
          <email>donald.walker@adelaide.edu.au</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>MB.BS</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rachel Richesson</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Heather Grain FACHI</string-name>
          <email>h.grain@latrobe.edu.au</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter Elkin</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jon Patrick</string-name>
          <email>jonpat@cs.usyd.edu.au</email>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>LaTrobe University</institution>
          ,
          <addr-line>Melbourne, Victoria</addr-line>
          ,
          <country country="AU">Australia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Mayo Clinic</institution>
          ,
          <addr-line>Rochester, MN</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Terminology Central</institution>
          ,
          <addr-line>Canberra</addr-line>
          ,
          <country country="AU">Australia</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University of Adelaide</institution>
          ,
          <addr-line>Adelaide, South</addr-line>
          <country country="AU">Australia</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>University of South Florida College of Medicine</institution>
          ,
          <addr-line>Tampa, FL</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>University of Sydney</institution>
          ,
          <addr-line>NSW</addr-line>
          ,
          <country country="AU">Australia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2008</year>
      </pub-date>
      <fpage>51</fpage>
      <lpage>58</lpage>
      <abstract>
        <p>SNOMED CT (SCT) has been designed and implemented in an era when health computer syst ems generally r equired termi nol ogy representations in the form of singular precoordinated concepts. Consequently, much of SCT content represents pre-coordinated concepts and their relationships. In this conceptual paper the role of pre- and post-coordinated terminology expressions are considered in the context of the current development direction of Electronic Health Records and the use of communications and knowledge repositories. The move from current SCT structures to an implementation form of SCT that focuses on “atomic concepts” will support post-coordination and terminology binding to information models. This core or “essential” SNOMED CT - called SNOMED Essential Terminology (S-ET) - would be smaller in terms of core concept numbers, simpler, easier to maintain and more intuitive for implementers. Our proposed implementation form of SNOMED CT would contain only “atomic concepts” with their attendant hierarchies and relationship data. These would be supported by a strict model for representing current and future pre-coordinated concepts based on the use of an existing specific post- coordination expression, grammar, or representation. The resulting concept expressions would be postcoordinated from a smaller core of atomic components. Using definitional relationships, the proposed implementation form could equate existing pre-coordinated terms with postcoordinated representations, allowing SCT to maintain links with legacy data. A strategy for testing and implementing this approach is discussed and empirical research and feasibility testing is recommended.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        SNOMED CT (SCT) is becoming the international
standard clinical terminology with a new
international licensing and governance process
which makes it widely accessible. The adoption of
SCT by multiple countries was influenced by many
published studies demonstrating its comprehensive
coverage [1-4] and advanced structural features.
SNOMED CT has antecedents in the College of
American Pathologists family of terminologies, the
UK National Health Service Read Codes. As with
any living language, it has absorbed content from a
number of other terminologies and classifications.
SCT contains concepts and terms that describe the
“language of use” as well as concepts which define
the “language of meaning”[
        <xref ref-type="bibr" rid="ref11">5-7</xref>
        ]. Consequently, SCT
contains many pre-coordinated concepts that have
varying levels of semantic complexity alongside the
component or essential concepts which are
themselves the building blocks of these complex
clinical expressions. While there are sound
historical and ongoing pragmatic reasons for this
evolutionary development, the resulting mix of
concept structures makes implementation within
various information models complex and prone to
variation. Currently, SCT is “cluttered” with
precoordinated terms that are incompletely defined by the
internal information model that exists within SCT,
making transformations between existing
precoordinated terms and post- coordinated
representations difficult to achieve. This result
limits opportunities for interoperability across
systems, [8] which is one of the key objectives of a
controlled terminology.
are well known within the health informatics
community and proposes what may, at first glance,
seem to be radical surgery. This proposal is in
reality an extension and combination of existing
features of SCT to create a more tractable solution
to support both SCT development and the art and
science of terminology development. This paper is
not a report of a quantitative analysis of SCT
structures or experimental results of the types of
change proposed. These should come later, if the
fundamental proposition is believed to be sound
and a potential contribution to terminology
development and maintenance methods.
      </p>
      <p>The computational representation of data is a
combination of the use of information models
and terminology. We propose a variation,
restructure and extension o f t h e c u r r e n t
S N O M E D - C T t e r m i n o l o g y t o s u p p o r t
implementation in various information models.
Using a pragmatic approach the SCT would be
altered in that existing pre-coordinated
concepts would be identified, flagged and then
defined through linkage to their atomic
concepts and relationship types. The atomic or
essential concepts would continue to be
placed in logical and definitional hierarchies
and relationship structures and subject to the
use of description logic for definition,
inference, and classification purposes.
Existing pre-coordinated SCT concepts would
retain their identifiers and be linked to the modified
terminology as “pre-defined-post-coordinated
concepts”, and would be logically equivalent to any
post-coordination representing the same meaning.
The retention of pre-coordinated concepts and the
specification of their computational definitions
would allow pre-coordinated terms to be used in
interface applications, as pre-coordinated terms
can be useful in helping data entry to be
more consistent: supporting the language of
use. If users have a retrieval list of
precoordinated concepts that have post-coordinated
equivalents, application developers can encourage
users to use a more consistent post-coordinated
form or to use entry terms that have relationships
to post-coordinated expressions using
fullydefined atomic concepts.</p>
      <p>This approach to the re-organization of SCT with
the formal expression of the canonical form for
pre-coordination is described as “ S N O M E D
E s s e n t i a l Terminology” (S-ET), or
simply “Essential SNOMED”; the name
coined by Dr. Walker when first describing
this approach. This paper describes the case
for change in SCT representation and advantages
of moving to this representation, the
background to the development of this
approach, a representation model for
precoordinated concepts, and an implementation
perspective. Simple examples have been selected,
not to prove the feasibility of this approach, but to
illustrate the principles. The need for a more
technically challenging and quantitative approach
to evaluation of this proposal is recognized and
discussed.</p>
    </sec>
    <sec id="sec-2">
      <title>TERMINOLOGIES AND INFORMATION</title>
    </sec>
    <sec id="sec-3">
      <title>MODELS</title>
      <p>
        It is now widely accepted that health information
storage is achieved through a combination of the
use of controlled terminologies and standardized
data models or architecture, yet the boundaries
between the models used for terminology
construction and health record construction are
blurred. [9-12] The HL7 TermInfo project
attempted to resolve this by providing guidance
on how SNOMED CT could be used in HL7
version 3 messages and data structures. [12-14]
An example of this terminology model
information model interface is the question of
whether concept negation should be managed
within the terminology or within the data model.
Should the negation be expressed as part of the
terminological unit: “no history of breast cancer”,
or as different components within an information
model: “history of breast cancer” + “negative”?.
[15] The semantics can be represented in the
terminology as a pre or post coordinated concept or
in a combination of the data model and
terminology. The machinery to support this latter
approach is contained in standard information
models such as the HL7 Reference Information
Model (RIM).[12] HL7’s TermInfo working group
has recommended that when SNOMED is being
used in HL7 V3 models, negation be managed in
the terminology and that the model based approach
to attaching a negation indicator be deprecated.
This issue points to the need for sufficient
flexibility in the management of post-coordination
to allow for the transformation of concept
structures and modifiers between the various
options. The existence of other data and
information models (e.g., CDISC) – which might
develop and endorse their own guidance for use
of complex terminologies such as SCT
suggest that standardization of SCT terminology
use in HL7 (RIM-based) applications might not
guarantee interoperability with applications
using other information models. [
        <xref ref-type="bibr" rid="ref13">16</xref>
        ]
While the issue of terminology and information
model interaction is somewhat independent of the
way that coordination of complex concepts occurs,
there is a need for both pre and post-coordinated
approaches to co-exist to fully support the spectrum
of information representation. It is also recognized
that equivalence between pre-coordinated and
postcoordinated concepts has to be established to
maintain consistency in interpretation of
terminology and between concept representation
using different combinations of terminology and
information model binding. In the current SCT
infrastructure this is achieved using computation
and testing the equivalence of the canonical form of
the two terminological variations. This requires
that all of the atomic or component concepts
and pre- coordinated concepts are fully defined
- not the case in practice. Having a formal
definition explicitly developed for current and
future pre- coordinated concepts within
SNOMED would support the recognition of
equivalence. [8, 17, 18]
      </p>
    </sec>
    <sec id="sec-4">
      <title>ISSUES WITH SNOMED-CT</title>
    </sec>
    <sec id="sec-5">
      <title>IMPLEMENTATION</title>
      <p>Several studies have shown that inter-rater
reliability of SCT coding is poor, at least in part
due to the complexity of the SCT structure and the
inconsistency of existing content. [3, 19-21] This
paper proposes that a simpler, more consistent
representation of SCT will reduce confusion and
improve the quality of SCT implementation.
This would need to be tested once working
subsets of the S-ET have been developed and so
examine the impact on coding consistency of the
interaction between the information model and
the use of differently coordinated terminology.
SCT size will certainly grow as new countries
adopt it, especially when it becomes the
terminology to support the many uses of coded
clinical data, such as public heath. Trying to
keep up with the need for language of use
through definition of pre-coordinated concept
phrases is a recipe for “combinatorial explosion”
in the size of a terminology. This is bad enough
in a terminology of simple structure, yet in one
of SCTs complexity and richness of function,
the impact is especially significant. A key
technical challenge involves keeping the
terminology to a manageable size and level of
complexity so that it is both maintainable and
supports end users’ applications. A second
challenge for SCT maintenance is to allow
compatibility with historical versions used by
legacy applications while maintaining
relevance as the core terminology resource
for the current and future generations of health
information systems. The model proposed in
this paper will support both of these
objectives.</p>
    </sec>
    <sec id="sec-6">
      <title>BACKGROUND TO S-ET DEVELOPMENT</title>
      <p>In 1999 a combined pre and post-coordinated
model for a medicines terminology was proposed
by two of the authors (DW and PM) for
Australia, based on an architecture designed
earlier by DW for a proprietary drug information
service. Both pre-coordinated concepts and their
contained atomic components and relationship
types were accommodated. The Essential
SNOMED notion, which was initially
canvassed informally within the health
terminology community in 2001, was further
developed following a comparative technical
analysis of several terminology options that were
then being investigated for use in Australian
General Practice [23] and which subsequently
recommended use of SCT leading in time to
Australia becoming an early adopter of a national
SCT license. A review of candidate
terminologies at that time for use in General
Practice examined several options. One terminology,
DOCLE, was constructed of atomic concepts,
joined by operators using a Bachus Naur Form
(BNF), a standard system for representation of
computable expressions using syntax or rules.
[24] What was notable was the extensive use of
pre-coordinated terms that were constructed from
atomic elements. For example “cancer@breast”
was a pre-coordinated concept for “breast cancer”,
yet it is constructed using a post-coordination
model of atomic concepts and the location operator
.“@”. The process of normalization of DOCLE
for inclusion in a terminology service found that
a number of atomic concepts needed to be
created to support existing content. Considering
this approach and drawing on prior experience
with the development of a medicines
terminology requiring a full set of atomic
elements which were combined to create fully
defined pre-coordinated medicines concepts,
it was postulated that the SCT terminology
could be significantly simplified by creating a
separate data structure for the pre-coordinated
concepts where these were parsed and then
described in a post- coordination grammar. [25]</p>
    </sec>
    <sec id="sec-7">
      <title>DESIGN OF ESSENTIAL SNOMED</title>
      <p>Essential SNOMED would contain a complete set
of “atomic concepts” from which all other concepts
could be constructed by post-coordination. These
atomic concepts would be carefully crafted into
their hierarchies and defined by their relationships.
SNOMED CT most likely contains many - if
not most - of these atomic concepts. They
would consist of both primitive and fully
defined concepts. The large number of
precoordinated concepts that are not in the above
group should be “flagged” in the complete
SNOMED CT data structure as
“predefined-postcoordinated equivalent concepts”, and eventually
associated with their post-coordinated defining
atomic concepts using a formal post
coordination or compositional syntax as is
already described. This group of
precoordinated concepts would not rely on their
hierarchical position or SNOMED relationships for
their definition - instead they would be defined by
the compositional expression (or formalism) used
to construct their atomic post-coordinated concepts
(combination of existing atomic concepts), as
described above. All the existing
precoordinated concepts in SCT could remain, along
with their identifier for use where the situation
required this approach. Pre-coordinated concepts
being added to SCT would also follow this pattern.
For example, the pre-coordinated concept
“Cellulitis of the left foot with osteomyelitis of the
third metatarsal without lymphangitis” can be
expressed using atomic concepts and relationship
types, is shown in Table 1. The concepts and
relationship concepts that comprise the definition
would all be considered core Essential SNOMED
content.</p>
      <p>Operator
and</p>
      <p>The current SNOMED CT terminology model
specifies relationships between concepts and terms,
but does not make a distinction between
postcoordinated concepts expressions and
precoordinated concepts. We propose that this
distinction be made explicitly, as a tool to assist in
SNOMED-CT terminology maintenance and
implementation. Figure 1 describes the way
that the new architectural elements could be linked
with existing S- CT structures which are
represented by the three elements placed at the
right hand side of the figure.</p>
    </sec>
    <sec id="sec-8">
      <title>DISCUSSION</title>
      <p>At the outset it is acknowledged that this
proposal is grounded in the excellent overall
design and management features of SCT.</p>
      <p>The advantages of this proposed structure for SCT
are reduced size and complexity for ease of
implementation and maintenance. An inevitable
outcome would be a reduction in the
combinatorial explosion that occurs when rampant
pre-coordination of concepts and phrases occurs,
yet this comes at the cost of introducing a new
element in the post coordination expression that
links the pre-coordinated concepts to their atomic
elements. The core terminology concepts and
hierarchies should be however much simplified..
The core of S-ET would grow some as new
atomic concepts were added. The S-ET structure
would be expressively intuitive as its approach to
concept r e p r e s e n t a t i o n w o u l d s u p p o r t
c o n c e p t constructions. Hierarchical
simplification would result as the definition of
the many pre-coordinated-concepts would be
independent of immediate hierarchies or
relationships – S-ET would use the compositional
expression to link with hierarchies and defining
relationships of the atomic concepts. Existing
approaches to canonical forms would continue and
allow equivalence testing between different
precoordinated concepts and post-coordinated
expressions. Pre-coordinated concepts would still
be able to be represented in a hierarchical
arrangement to support inference and subsumption,
however these could be calculated rather than
explicit expressed as happens currently in SCT. In
this model the hierarchical relationships would be
inferred rather than the canonical form.</p>
      <p>Equating pre- and post-coordination may be easier,
as the computational form is actually specified for
concepts within SCT. It is acknowledged that the
current approach in SNOMED is not
comprehensive due to incomplete set of canonical
representations and possible lack of semantics to
fully describe the meaning of existing
semantically complex pre-coordinated concepts.
Both the pre-coordinated form and the various
representations of the post-coordinated concept
are valid ways of describing the same concept.
The first is more aligned with human
interpretation and the second supports computer
processing of the data. It is clear that both forms of
concept representation are needed and both have to
be supported by clinical terminologies such as
SNOMED CT. The approach recommended in
S N O M E D E s s e n t i a l Terminology is
believed to be consistent with the current SCT
approach to canonical form definition.</p>
      <p>This model is highly dependent upon an
expressive and computable syntax for
postcoordination. The process of moving to an S-ET
distribution format will highlight any
deficiencies in the current post-coordination
methods and constraints as these will become
explicit and subject to development. SNOMED –</p>
      <p>CT authors can continue to develop
precoordinated expressions if required. End users,
particularly those who rely on the use of
precoordinated concepts, will have the capacity to add
locally relevant pre-coordination through a minor
modification of the SCT way of managing local
extensions, and in doing so would not require
frequent change submissions to the core essential
SNOMED terminology. As discussed earlier an
S-ET model, coupled with an improved model
for managing pre-coordination will support the
terminology user interface.</p>
      <p>One of the difficulties faced by SNOMED is the
need to harmonize with widely used terminologies
that are heavily structured on pre-coordination.
LOINC and MEDCIN and most health
classifications would be examples {28]. A
S N O M E D E s s e n t i a l Terminology would
not need to include the pre- coordinated
concepts imported from such
terminologies, and could instead relate
their concepts to SNOMED-CT by
mapping which used the post-coordination
syntax. Alternatively such pre-coordinated
concepts could be placed along with
existing SCT pre-coordinated content. The
end result would provide the flexibility of
incorporating or mapping to external
terminologies, even though they may not
share the same data models as SCT.</p>
      <p>It is recognized that there are situations where
precoordination is more efficient from a computational
perspective, as in the recognition of commonly
used text strings in natural language processing
(NLP) applications. S N O M E D E s s e n t i a l
Terminology will allow the further development
of such concepts without undue concern about the
combinatorial explosion that might otherwise
exist. NLP requires the consistent application of
terminology and parsing of text. If a S N O M E D
E s s e n t i a l Terminology model is not adopted
then it is likely that some equivalent derivative
product will be created by necessity by these key
application areas. Having a standard form will
support consistency of output of different NLP
applications.</p>
      <p>One of the current strategies to simplify
SNOMED is to restructure the relationship between
terminologies and classifications. Removing or
retiring classification concepts from SCT will allow
them to reside in their respective
classifications and have linkage to the clinical
terminology by mapping or other formal
constructs. S N O M E D E s s e n t i a l
Terminology proposes making a similar change
to manage both the historical terminological
clutter resulting from SNOMED’s antecedents and
use in legacy information systems. In addition it
will meet the widely accepted need to continue to
manage post-coordination in a modern
terminology to support the computer-human
interface.</p>
    </sec>
    <sec id="sec-9">
      <title>Making the transformation to S-ET</title>
      <p>The transformation to a S N O M E D E s s e n t i a l
Terminology would require a set of suitable
“relationship-types” and an appropriate
postcoordination representation form or “syntax” that
catered for the “pre-defined-postcoordinated
equivalent concepts”. SNOMED has published a
BNF for this syntax. This syntax describes
the core SCT concepts, and their
relationships. An XML equivalent (in addition
or as an alternative) may be helpful for the current
computer engineering environment. This paper
is not exploring the relative merits of these
approaches; however the process of defining
the post-coordination equivalents of existing
concepts will also provide a validity check on the
completeness of the syntax or post coordination
model, and as such is complementary to activities
of the International Health Terminology Standards
Development Organisation’s (IHTSDO) Concept
Model Special Interest Group.</p>
      <p>As the “pre-defined-post-coordinated concepts”
could be related back to their atomic components
(which are themselves part of the SNOMED
hierarchy and relationship structure) it would no
longer be necessary to separately define the
hierarchies or associations for the pre-coordinated
concepts within the terminology. This does not
preclude such constructs being employed, much
like current indexing activity at run-time. These
hierarchies could be machine classified. For
example, if the phrase “fractured ankle” was
compiled from two concepts as follows:
[problem, action or issue] = “fracture”</p>
      <p>[which has FindingSite] = “ankle”
Consequently , if it was necessary to locate
“injuries of the lower limb”, then the hierarchical
ancestors of “fracture” would include “injuries”
and those of “ankle” would include “lower limb”.
The issue of what is and what isn’t an atomic or
pre-coordinated concept is subject to debate and the
boundaries can be fuzzy. Is headache a single
concept or a post-coordinated ‘pain’ with
‘location’ of ‘head’? Technically, it should
not be part of an S-ET based on atomic
concepts but is it sufficiently common and
semantically ‘simple’ enough to warrant
inclusion? Under our proposal, “atoms refer
to semantic units, not term labels or
compound term labels. While it is clear that
even single concepts can have compound names, it
would be a conceptual error to consider a concept
such as Hodgkin’s lymphoma to be a
precoordinated concept, whereas a “fractured right
femur” is patently so.</p>
      <p>Building essential SNOMED will be
necessarily a pragmatic exercise which can cope
with one or other of these forms or both. The
consequences of the fuzziness in determining
whether an existing concept should be managed as
a pre or post coordinated concepts are not
expected to be significant. All are still included in
SCT.</p>
      <p>New pre-coordinated concepts could be created
(if r e q u i r e d b y i n f o r m a t i o n s y s t e m s o r
u s e r preference), although this temptation may
best be resisted as it is expected that the
requirements for pre-coordination would become
less pressing with the introduction of standard
information models (e.g., HL7 V3, archetypes or
OpenEHR) and the advancement of Natural
Language Processing (NLP) to support data
entry.[26] Complex pre-coordinated concepts can
sometimes be useful in encouraging consistency in
representation where small nuances may be
unintentionally instantiated where no difference in
clinical meaning exists. For example: Colon
cancer can be represented either as: Malignant
Neoplasm - Has finding site – Colon; or as:
Colon – HasSpecimen - Malignant Neoplasm.
This ambiguity is undesirable, and the
availability of pre-coordinated concept
expressions at the interface level can prevent
this type of variation, and set patterns for
good practice in post- coordination within
terminology services.</p>
      <p>The atomic-concepts included in S-ET would
be those that are necessary and appropriate to build
pre-coordinated concepts that currently exist, or
may be added subsequently, as well as the atomic
concepts currently in use. The boundaries around
atomic concept definition are often fuzzy as
discussed above. Editorial rules would be
required to consider inclusion of concepts that are
not “semantically atomic” but are very common.
A pragmatic approach would need to be
developed, and the following may suggest one
strategy:
1. The entries expected to be found as
defined concepts in a large medical
dictionary [27]; this would likely include items
that have a distinct clinical meaning and are
used frequently – e.g. Lung cancer; breast
cancer; direct inguinal hernia; chest pain.
2. Those concepts that cannot be adequately
defined by the composition of their
postcoordinated concepts due perhaps to use of
an uncommon or unsupported semantic type
for the relationship between elements.</p>
      <p>As a result many of the pre-coordinated concepts
found in SCT diagnoses, findings and procedures
would be excluded from the atomic- concept list
and be placed in the pre-coordinated group.
It is clear that any change to the structure or
representation forms of SCT may have an impact
on reference set (subset) development, use within
value-sets, and mapping to classifications and use in
local extensions. These areas need to be further
examined, however S-ET would not have a
significant impact, as the current SCT and S-ET
would contain the same concepts and
relationships. There is a significant advantage for
local extensions as local terminology experts could
map new local concepts to atomic elements within
SCT, hence gaining the benefits of
classification and relationship modelling,
without having to wait for formal inclusion in
later releases. The development of reference sets
based on concept and hierarchy selection would
also include related pre-coordinated concepts.
The feasibility of remodelling large sections of
SNOMED CT, particularly when there are
competing priorities for terminology development,
must be assessed. While a conversion strategy has
not been covered in detail, the re-organization could
consider using current SCT relationships - but with
some care because of their known limitations. The
size of the term string, the number of individual
words, the presence of relationships, and a
comparison with lists of terms extracted from
medical dictionaries might help identify potential
p r e - c o o r d i n a te d c o n c e p t s . I t i s p o s s i b l e
th a t a f u n c ti o n al r e s u l t t o c r e a t e S - E T
co u ld r es ul t f r o m f l ag g in g pr
ec o o r d i n a t e d c o n c e p t s a n d t e r m s , w i t h o u t
s u b s ta n t ia l l y al t er i n g t h e p u b l i ca t i o n
str u c tu r e. As wi t h mos t terminology
development, specific tools to manage the
transition to S-ET would need to be developed,
refined and the end result would need
appropriate checking and quality control
processes and upfront attention to ongoing
maintenance.</p>
      <p>While the first efforts at instantiation of the S-ET
model may involve the restructure of arbitrary
twigs and branches of the SNOMED hierarchical
tree, an approach proposed would be to operate on
concepts identified in large sub-setting exercises
where terminology of use has been identified from
analysis of actual clinical use in a specific domain
such as intensive care [29] or general practice .[23]</p>
    </sec>
    <sec id="sec-10">
      <title>CONCLUSION</title>
      <p>This paper has proposed a modest alteration to the
structure of SNOMED CT so that it supports the
co-existence of pre and post coordination in a form
that advances the basic structure of what might be
regarded as good terminology practice [30].</p>
      <p>The changes do not require any fundamental
changes in SCT methods, but rather a structural
extension and the incorporation of existing
postcoordination methods of expression into the core
terminology.</p>
      <p>This paper outlines a number of issues with the
current SCT architecture and proposes a solution
which is consistent with its current design and
which may have a number of advantages. If the
proposed model creates resonance with the end
users of SNOMED CT, it should be exposed to
empirical testing and considered by the IHTSDO
and their related organizations. The authors hope
this paper stimulates discussion and feedback. We
look forward to formal testing of these ideas for
feasibility and acceptance.</p>
    </sec>
    <sec id="sec-11">
      <title>ACKNOWLEDGMENTS</title>
      <p>This paper was inspired by an original paper authored by
Don Walker with contributions from Peter MacIsaac
arising from the “GP Vocabulary Project” supported by
the Australian Department of Health and Ageing.
http://www.adelaide.edu.au/health/gp/research/current/in
formatics/#vocab).</p>
      <p>The authors would like to acknowledge the
contributions of Tad McKeon, anonymous KRMED
reviewers, and numerous colleagues who have
contributed in some way to this paper and our
education and research on terminology and informatics.
This work has been supported in part by grants from
the Centers for Disease Control and Prevention
(PH00022 and HK00014; PLE).
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.</p>
      <p>in Ontologies (AOW 2006), Tasmania,
Conferences in Research &amp; Practice in</p>
      <p>Information Technology, 72, pp61-68, 2006
Rector, AL, Qamar R, Marley T. Binding</p>
      <p>Ontologies and Coding Systems to
Electronic Health Records and Messages.</p>
      <p>KR-MED 2006 – Biomedical Ontology in
Action. November 8, 2006, Baltimore,</p>
      <p>Maryland, USA
Elkin PL, Brown SH, Lincoln MJ, Hogarth M,
Rector A. A formal representation for
messages containing compositional
expressions. Int J Med Inform 2003
Sep;71(23):89-102.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Wasserman</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>An applied evaluation of SNOMED CT as a clinical vocabulary for the computerized diagnosis and problem list</article-title>
          .
          <source>AMIA Ann Symp Proc</source>
          ,
          <year>2003</year>
          : p.
          <fpage>699</fpage>
          -
          <lpage>703</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Elkin</surname>
            ,
            <given-names>P.L.</given-names>
          </string-name>
          , et al.,
          <article-title>Evaluation of the Content Coverage of SNOMED CT: Ability of SNOMED Clinical Terms to Represent Clinical Problem Lists</article-title>
          .
          <source>Mayo Clinic Proceedings</source>
          ,
          <year>2006</year>
          .
          <volume>81</volume>
          (
          <issue>6</issue>
          ): p.
          <fpage>741</fpage>
          -
          <lpage>748</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Warren</surname>
            ,
            <given-names>J.J.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>R.P.</given-names>
            <surname>Wilson</surname>
          </string-name>
          .
          <article-title>Representing Cardiovascular Concepts in an Electronic Health Record Using SNOMED CT®</article-title>
          .
          <source>in American Medical Informatics Association Annual Symposium</source>
          .
          <year>2006</year>
          . Washington, D.C.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Richesson</surname>
            ,
            <given-names>R.L.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.E.</given-names>
            <surname>Andrews</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.P.</given-names>
            <surname>Krischer</surname>
          </string-name>
          ,
          <article-title>Use of SNOMED CT to Represent Clinical Research Data: A Semantic Characterization of Data</article-title>
          <source>Items on Case Report Forms in Vasculitis Research Journal of the American Medical Informatics Association</source>
          ,
          <year>2006</year>
          .
          <volume>13</volume>
          : p.
          <fpage>536</fpage>
          -
          <lpage>546</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Andrews</surname>
            ,
            <given-names>J.E.</given-names>
          </string-name>
          ,et al.,
          <article-title>Comparing Heterogeneous SNOMED CT Coding of Clinical Research Concepts by Examining Normalized Expressions</article-title>
          .
          <source>Journal of Biomedical Informatics</source>
          ,
          <year>2008</year>
          . in press.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          R e c t o r , A . L .
          <article-title>T he I nt e r f ac e B e t w e e n Information, Terminology, and Inference Models</article-title>
          .
          <source>in Tenth World Conference on Medical and Health Informatics: MedInfo2001</source>
          .
          <year>2001</year>
          . London.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Dampney</surname>
            ,
            <given-names>C.N.G.</given-names>
          </string-name>
          , G. Pegler, and
          <string-name>
            <given-names>M.</given-names>
            <surname>Johnson</surname>
          </string-name>
          .
          <source>Harmonising Health Information Models - A Critical Analysis of Current Practice. in Ninth National Health Informatics Conference</source>
          .
          <year>2001</year>
          .
          <string-name>
            <surname>Canberra</surname>
            <given-names>ACT</given-names>
          </string-name>
          , Australia.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Huff</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Carter</surname>
          </string-name>
          .
          <article-title>A Characterization of Terminology Models, Clinical Templates</article-title>
          , Message Models, and
          <article-title>Other Kinds of Clinical Information Models</article-title>
          .
          <source>in AMIA Symposium</source>
          .
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Chute</surname>
            ,
            <given-names>C.G.</given-names>
          </string-name>
          ,
          <article-title>Medical Concept Representation, i n M e d i c a l I n f o r m a t i c s . K n o w l e d g e M a n a g e m e n t a n d D a t a M i n i n g i n Biomedicine</article-title>
          .,
          <string-name>
            <surname>Chen</surname>
            <given-names>H.</given-names>
          </string-name>
          , et al.,
          <source>Editors</source>
          .
          <year>2005</year>
          , Springer U.S. p.
          <fpage>163</fpage>
          -
          <lpage>182</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Markwell</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meaning</surname>
            <given-names>Well</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Well</surname>
          </string-name>
          meaning - HL7
          <source>TermInfo</source>
          .
          <year>2005</year>
          ,
          <article-title>The Clinical Information Consultancy Ltd</article-title>
          . www.clininfo.co.uk
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>HL7</surname>
          </string-name>
          , Health Level Seven.
          <year>2005</year>
          , Health Level Seven, Inc.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Elkin</surname>
            ,
            <given-names>P.L.</given-names>
          </string-name>
          , et al.,
          <article-title>A controlled trial of automated classification of negation from clinical notes</article-title>
          .
          <source>BMC Med Inform Decis Mak</source>
          ,
          <year>2005</year>
          .
          <volume>5</volume>
          (
          <issue>1</issue>
          ): p.
          <fpage>13</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          16.
          <string-name>
            <surname>Richesson</surname>
            ,
            <given-names>R.L.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>J.P.</given-names>
            <surname>Krischer</surname>
          </string-name>
          , Data Standards in Clinical Research: Gaps, Overlaps, Challenges and
          <string-name>
            <given-names>Future</given-names>
            <surname>Directions</surname>
          </string-name>
          .
          <source>Journal of the American Medical Informatics Association</source>
          ,
          <year>2007</year>
          .
          <volume>14</volume>
          (
          <issue>6</issue>
          ): p.
          <fpage>687</fpage>
          -
          <lpage>696</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Spackman</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <article-title>A. and</article-title>
          K.E. Campbell,
          <article-title>Compositional concept representation using SNOMED: towards further convergence of clinical terminologies</article-title>
          .
          <source>Proc AMIA Symp</source>
          ,
          <year>1998</year>
          : p.
          <fpage>740</fpage>
          -
          <lpage>4</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Rector</surname>
            ,
            <given-names>A.L.</given-names>
          </string-name>
          , et al.,
          <source>The GRAIL Concept Modelling Language for Medical Terminology. Artificial Intelligence in Medicine</source>
          <year>1997</year>
          .
          <volume>9</volume>
          : p.
          <fpage>139</fpage>
          -
          <lpage>171</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Rothschild</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>H.P.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          , and G. Hripcsak,
          <article-title>Inter-rater Agreement in Physiciancoded Problem List</article-title>
          , in American Medical Informatics Association. 2005: Washington, D.C.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Burkhart</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , et al.,
          <article-title>Mapping parish nurse documentation into the nursing interventions classification: a research method</article-title>
          .
          <source>Comput Inform Nurs</source>
          ,
          <year>2005</year>
          .
          <volume>23</volume>
          (
          <issue>4</issue>
          ): p.
          <fpage>220</fpage>
          -
          <lpage>9</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Chiang</surname>
            ,
            <given-names>M.F.</given-names>
          </string-name>
          , et al.
          <article-title>Reliability of SNOMEDCT Coding by Three Physicians using Two Terminology Browsers</article-title>
          . in American Medical Informatics Association Annual Symposium.
          <year>2006</year>
          . Washington, D.C.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Andrews</surname>
            ,
            <given-names>J.E.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>R.L.</given-names>
            <surname>Richesson</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.P.</given-names>
            <surname>Krischer</surname>
          </string-name>
          ,
          <article-title>Variation of SNOMED CT Coding of Clinical Research Concepts among Coding Experts</article-title>
          . JAMIA,
          <year>2007</year>
          .
          <volume>14</volume>
          : p.
          <fpage>497</fpage>
          -
          <lpage>506</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Walker</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          and e. al,
          <source>General Practice Vocabulary Project</source>
          .
          <year>2001</year>
          , Commonwealth Department of Health and Ageing.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Welcome to DOCLE Systems</surname>
          </string-name>
          .
          <year>2008</year>
          , . http://www.docle.com.au/
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Elkin</surname>
            ,
            <given-names>P.E.</given-names>
          </string-name>
          , et al.,
          <article-title>An Evaluation of the Content Coverage of SNOMED CT for Clinical Problem Lists</article-title>
          .
          <source>Mayo Clin Proc</source>
          ,
          <year>2006</year>
          .
          <volume>81</volume>
          (
          <issue>6</issue>
          ): p.
          <fpage>741</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Elkin</surname>
            ,
            <given-names>P.L.</given-names>
          </string-name>
          , et al.,
          <article-title>Guideline and Quality Indicators for Development, Purchase and Use of Controlled Health Vocabularies</article-title>
          .
          <source>International Journal of Medical Informatics</source>
          ,
          <year>2002</year>
          .
          <volume>68</volume>
          (
          <issue>1-3</issue>
          ): p.
          <fpage>175</fpage>
          -
          <lpage>186</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>Patrick</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Herkes</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ryan</surname>
            <given-names>A</given-names>
          </string-name>
          .
          <article-title>Enhancement technologies for clinical information systems</article-title>
          .
          <source>HISA NSW conference proceedings 2008.</source>
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <given-names>Cimino</given-names>
            <surname>JJ</surname>
          </string-name>
          .
          <article-title>Desiderata for controlled medical vocabularies in the twenty-first century</article-title>
          .
          <source>Methods Inf Med</source>
          <year>1998</year>
          ;
          <volume>37</volume>
          (
          <issue>4-5</issue>
          ):
          <fpage>394</fpage>
          -
          <lpage>403</lpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>