Linking African Traditional Medicine Knowledge
Gossa Lô, Victor de Boer, and Stefan Schlobach
Department of Computer Science, the Network Institute,
Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
{a.g.lo, v.de.boer, k.s.schlobach}@vu.nl
Abstract. African Traditional Medicine (ATM) is widely used in Africa
as the first-line of treatment thanks to its accessibility and affordability.
However, the lack of formalization of this knowledge can lead to safety
issues and malpractice. This paper investigates a possible contribution
of the Semantic Web in realizing the formalization and integration of
ATM with data on conventional medicine (CM). As a proof of concept
we convert various ATM datasets and link them to CM data. This results
in a Linked ATM knowledge graph. We finally give some examples with
some interesting SPARQL queries and insightful results.
1 Introduction
Traditional medicine (TM) denotes a heterogeneous set of diagnostic- and ther-
apeutic practices that uses scientifically often unvalidated knowledge, orally
passed on through generations. Its aim is to prevent, diagnose and treat physical
and mental illnesses. The World Health Organization notes that African Tradi-
tional Medicine (ATM) plays a crucial role in Sub-Saharan Africa where it is the
first line of treatment for ca. 80% of the population [1], thanks to its accessibility,
affordability and embeddedness in the traditional belief systems [9].
As the use and availability of plants differs per region, lack of formalization
not only raises safety concerns, but also risks the loss of critical knowledge [3].
Formalizing TM knowledge can benefit the determination of medical effects of
plants in terms of diseases and symptoms [5]. Furthermore, bridging the gap
between ATM and CM by integrating the former into formal health care systems
can result in a deeper understanding of pathology and medical knowledge.
This paper presents a proof of concept on how Semantic Web technologies can
contribute to the preservation and formalization of ATM, and its integration with
CM. This is demonstrated by converting ATM datasets into machine-readable
data and by connecting them to Linked Data knowledge graphs on the Web.
Finally, the benefits of the presented concept will be shown by validating the
use cases formulated below with the use of queries.
An illustration of how Web technologies can contribute to the representation
of TM is the Traditional Chinese Medicine (TCM) Database System1 . It has been
maintained since 1984 by the Institute of Information on TCM, and contains over
1
http://cowork.cintcm.com/engine/windex1.jsp, accessed 2017-07-5
2 Gossa Lô, Victor de Boer, and Stefan Schlobach
1,100,000 items of data, divided over 40 categories. Furthermore, a semantic
eScience infrastructure was established, in response to the sheer volume and
diversity of TCM information and services that affect its interoperability [2].
The Linking Open Drug Data (LODD) project aggregated and added biomed-
ical data (e.g. about drugs, TCM, diseases, etc.) to the Linked Data Cloud. Their
focus is to facilitate the obtaining of new insights and finding unforeseen associ-
ations between entities. The data sets contain over 8.4 million RDF triples and
388,000 RDF links to external data sources [4,8].
2 Case study and methods
We investigate the potential of Semantic Web technologies for the preservation
and formalization of ATM knowledge through two use cases. The first concerns
the difference in treatment in two regions in Madagascar. Both the availability
of plant species in a region and the knowledge about treatment methods that is
passed on in a community influence treatment practice. Formalizing these meth-
ods prevents loss of knowledge and can help ATM practitioners in rural regions
to exchange knowledge and find new treatments. The second use case focuses
on investigating differences between ATM practice in Senegal and Madagascar.
This exchange of knowledge not only benefits ATM practitioners, but could also
be a valuable contribution to drug discovery and pharmacology on a global scale.
Keur Massar Traditional Hospital. One dataset used was provided by
Hôpital Traditionnel de Keur Massar2 in Senegal founded in 1980 by Professor
Yvette Parès3 , who had years of experience working with traditional practition-
ers from different ethnic groups in Senegal. This hospital has a strong focus on
phytotherapy along ATM traditions and produces medicinal plant preparations
(about 1,000 products and recipes) from their botanical garden.
Datasets. Besides this dataset, two datasets on ATM practice in Madagascar
were used. The first contains data on medicinal plants used to treat the six
most frequent diseases in the Ambalabe rural community in Madagascar [6]. The
second dataset contains data on the most used medicinal plants by communities
in Mahaboboka, Amboronabo, Mikoboka, in Southern Madagascar [7]. The most
relevant attributes that were stored are: the hierarchy of plant names, plant
parts, in addition to diseases, ailments and preparation & administration modes.
We then linked the plant- and disease data in our datasets to resources on
BioPortal4 and DBpedia5 . The BioPortal REST API was used to search for
URIs for terms across ontologies, such as the Human Disease Ontology 6 , the
2
http://www.hopitalkeurmassar.com/
3
Head of the department of Plant Biology at Cheikh Anta Diop University in Dakar
4
http://bioportal.bioontology.org/
5
http://www.dbpedia.org
6
https://bioportal.bioontology.org/ontologies/DOID
Linking African Traditional Medicine Knowledge 3
Symptom Ontology 7 and SNOMEDCT 8 . If no URI is found, a synonym of the
term is entered or linked to a proper resource on DBpedia.
Dataset conversion and linking. Our three datasets (.CSV) were trans-
lated into RDF and linked to the target datasets. First, columns from the original
tabular data are stored in a dictionary as values, and assigned to a new key ob-
ject name:
’set2 99’: [u’Emilia humifusa DC. Rakotoarivelo’, u’Asteraceae’, u’Infected wound’,
u’L’, u’Angea’, u’Cataplasm on wound’, u’400’, 0.02, u’ID’, 25L]
A new data model was created that does not use existing ontologies, with con-
verted triples of the form atmData:set2 99 atmVocab:familyName dbpg:Asteraceae.
Results. The result of this conversion is a total of 13,028 triples, of 672 plant
types that treat 1,799 health conditions. The data is available at a public GIT
repository at https://github.com/biktorrr/linkedatmdata. For live brows-
ing and querying, we provide a triple store endpoint at http://semanticweb.
cs.vu.nl/linkedatm/home. The 672 set objects all have the same structure as
the below RDF Turtle snippet shows.
atmData : s e t 2 9 9 atmVocab : a i l m e n t atmData : I n f e c t e d w o u n d ;
atmVocab : binomialName atmData : E m i l i a h u m i f u s a DC. R a k o t o a r i v e l o ;
atmVocab : familyName dbpg : A s t e r a c e a e ;
atmVocab : p l a n t P a r t s ”L”ˆˆ xsd : s t r i n g ;
atmVocab : p r e p a r a t i o n A d m i n i s t r a t i o n ” Cataplasm on wound”@en .
New and existing prefixes are used to simplify SPARQL queries with:
prefix dbr :
prefix dbo :
prefix atmData :
prefix atmVocab :
prefix p r l :
prefix snmd :
3 Validating
By querying possible outcomes of the use cases, the proof of concept is validated.
Consider the following use case:
How does Malaria treatment differs in two different regions in Madagascar?
SELECT ? a i l m e n t ? f a m i l y ? binomialName ? p a r t s ? p r e p a r a t i o n WHERE {
? s atmVocab : familyName ? f a m i l y .
? s atmVocab : binomialName ? binomialName .
? binomialName r d f s : l a b e l ? b i n o m i a l .
? s atmVocab : genusType ? genusType .
? s atmVocab : a i l m e n t | atmVocab : d i s e a s e T r e a t e d ? a i l m e n t .
? ailment r d fs : l a b e l ? ailmentLabel .
? s atmVocab : p l a n t P a r t s ? p a r t s .
? s atmVocab : p r e p a r a t i o n | atmVocab : p r e p a r a t i o n A d m i n i s t r a t i o n ? p r e p a r a t i o n .
FILTER ( ? a i l m e n t L a b e l = ” M a l a r i a ”@en ) }
GROUP BY ? a i l m e n t ? f a m i l y ? binomialName ? p a r t s ? p r e p a r a t i o n
LIMIT 6
7
https://bioportal.bioontology.org/ontologies/SYMP
8
https://bioportal.bioontology.org/ontologies/SNOMEDCT
4 Gossa Lô, Victor de Boer, and Stefan Schlobach
As Fig. 1 shows, differences in plant use and treatment of malaria occur
even between neighboring regions. While the Ambalabe community uses several
parts, the other regions predominantly use the leaves. Thus, LinkedATM could
both contribute to collaborative knowledge sharing between communities, and
preserve critical knowledge for future generations.
Fig. 1. Difference in treatment of malaria in distinct communities in Madagascar
The second case focuses on the difference in use of a specific plant species be-
tween Senegal and Madagascar.The Sclerocarya birrea tree grows in both South-
ern and West Africa. However, the treatment purposes differ per country. The
following query shows its use in Senegal:
SELECT DISTINCT ? f a m i l y ? genusType ? binomialName ? i n d i c a t i o n WHERE {
? s e t atmVocab : nomGenre ? genusType .
SERVICE { ? genusType dbo : f a m i l y ? f a m i l y } .
? s e t atmVocab : nomBinomial ? binomialName .
? s e t atmVocab : i n d i c a t i o n ? i n d i c a t i o n .
? indication rdfs : label ? indicationLabel .
FILTER ( ? genusType = dbr : S c l e r o c a r y a ) .
FILTER ( ? binomialName = atmData : S c l e r o c a r y a b i r r e a ) . }
GROUP BY ? f a m i l y ? genusType ? binomialName ? i n d i c a t i o n
The family of the tree (i.e. Anacardiaceae) is unknown and is identified by linking
the known genus type to DBpedia. Fig. 2 shows that the tree is used to treat
hypoglycemia, as an anti-infective, an antivenin or an astringent for the skin.
Fig. 2. Sclerocarya birrea in Senegal - query result
The aforementioned communities in Southern Madagascar on the other hand,
use this tree to treat malaria, in prenatal care, postpartum recovery, dizziness
during pregnancy, fever and caries, as shown in Fig. 3.
Fig. 3. Sclerocarya birrea in Madagascar - query result
Linking African Traditional Medicine Knowledge 5
4 Conclusion
This paper stipulates that formalizing ATM, and linking it to conventional
medicine, can yield innovative knowledge benefiting both ATM practitioners
and researchers on the usage of plant components in conventional pharmacol-
ogy. Our proof of concept focuses on two concrete use cases, using data from an
ATM hospital in Senegal and the results from earlier use cases in Madagascar.
This has resulted in a dataset of 13,028 RDF triples, which describe 672 plant
types and 1,799 health conditions. The data has been linked to knowledge on
the Web (BioPortal & DBpedia), and shown to be of interest in the use cases.
Further steps are required, including obtaining more comprehensive and var-
ied data, linked to pharmaceutical ontologies for comparison with conventional
medicinal substances. Making the data accessible to ATM practitioners in rural
areas in Africa would require the development of an offline and voice-based ver-
sion. Semantic Web techniques can, without a doubt, contribute to Linked ATM.
To achieve this, there are challenges to overcome. However, as was demonstrated
in this proof on concept paper, the first step has been taken.
Acknowledgements. We thank Djibril Bâ and Geneviève Baumann for their in-
put. This work is supported by the W4RA initiative (http://w4ra.org).
References
1. Antwi-Baffour, S.S., Bello, A.I., Adjei, D., Mahmood, S.A., Ayeh-Kumi, P.F.: The
place of traditional medicine in the african society. AJHR 2, 49–54 (2014)
2. Chen, H., Mao, Y., Zheng, X., Cui, M., Feng, Y., Deng, S., Yin, A., Zhou, C., Tang,
J., Jiang, X., et al.: Towards semantic e-science for traditional chinese medicine.
BMC bioinformatics 8(3), S6 (2007)
3. Firenzuoli, F., Gori, L.: Herbal medicine today: clinical and research issues.
Evidence-Based Complementary and Alternative Medicine 4(S1), 37–40 (2007)
4. Jentzsch, A., Zhao, J., Hassanzadeh, O., Cheung, K.H., Samwald, M., Andersson,
B.: Linking open drug data. In: I-SEMANTICS (2009)
5. Kamsu-Foguem, B., Diallo, G., Foguem, C.: Conceptual graph-based knowledge
representation for supporting reasoning in african traditional medicine. Engineering
Applications of Artificial Intelligence 26(4), 1348–1365 (2013)
6. Rakotoarivelo, N.H., Rakotoarivony, F., Ramarosandratana, A.V., Jeannoda, V.H.,
Kuhlman, A.R., Randrianasolo, A., Bussmann, R.W.: Medicinal plants used to treat
the most frequent diseases encountered in ambalabe rural community, eastern mada-
gascar. Journal of ethnobiology and ethnomedicine 11(1), 68 (2015)
7. Randrianarivony, T.N., Ramarosandratana, A.V., Andriamihajarivo, T.H., Rako-
toarivony, F., Jeannoda, V.H., Randrianasolo, A., Bussmann, R.W.: The most used
medicinal plants by communities in mahaboboka, amboronabo, mikoboka, south-
western madagascar. Journal of ethnobiologyðnomedicine 13(1), 19 (2017)
8. Samwald, M., Jentzsch, A., Bouton, C., Kallesøe, C.S., Willighagen, E., Hajagos,
J., Marshall, M.S., Prud’hommeaux, E., Hassanzadeh, O., Pichler, E., et al.: Linked
open drug data for pharmaceutical research and development. Journal of chemin-
formatics 3(1), 19 (2011)
9. World Health Organization: Traditional medicine strategy 2002-2005.
http://www.wpro.who.int/health_technology/book_who_traditional_
medicine_strategy_2002_2005.pdf (2002)