=Paper=
{{Paper
|id=Vol-2710/paper20
|storemode=property
|title=An ASP Approach for Arteries Classification in CT-scans
|pdfUrl=https://ceur-ws.org/Vol-2710/paper20.pdf
|volume=Vol-2710
|authors=Francesco Fabiano,Alessandro Dal Palù
|dblpUrl=https://dblp.org/rec/conf/cilc/FabianoP20
}}
==An ASP Approach for Arteries Classification in CT-scans==
An ASP Approach for Arteries Classification in CT-scans? Francesco Fabiano1 and Alessandro Dal Palù2 1 Department of Mathematics, Computer Science and Physics, University of Udine, Via delle Scienze 206, 33100 Udine, Italy francesco.fabiano@uniud.it 2 Department of Mathematical Physical and Computer Sciences, University of Parma, Parco Area delle Scienze 53/A, 43124, Parma, Italy alessandro.dalpalu@unipr.it Abstract. Automated segmentation of CT scans is the first step in the pipeline for the interpretation and identification of potential patholo- gies in human organs. Several methods based on Machine Learning are currently available, even if their precision is still outperformed by med- ical doctors. In this field there are some intrinsic limitations to ML ap- proaches, such as the cost and time to acquire high quality annotated scans for training; a considerably high variability of organs morphol- ogy due to age, health conditions, genetics; acquisition noise. This pa- per outlines a new methodology based on Answer Set Programming, which returns reliable, easy-to-program and explainable interpretations. In particular, we focus on the CT scan analysis and retrieval of tree-like structure, corresponding to main blood vessels (arteries) arrangement. The structure is compared to the knowledge base of vessels contained in anatomy text-books. The mapping of vessels names is computed by an ASP program. This preliminary step produces a robust input to a reasoner for the multi-organ labeling and localization problem. Keywords: Answer set programming · CT scan · Image processing · Blood vessels classification 1 Introduction In Italy, according to the registry of tumor [1], there have been 373 thousands new diagnosis of malign cancer in 2018. The survival rate at 5 years is still very low, e.g., 20% for liver tumours and only 8% for pancreas neoplasia. Unfortu- nately, while surgery is the only curative treatment, a small subset of patients are diagnosed at a sufficiently early stage such that surgery is still a viable option, mainly because at such stage cancer is often asymptomatic. The screening process typically involves three main categories of examina- tions: ultrasound (US), computed tomography (CT) scans and magnetic reso- nance (MR). They are performed in this specific order and in case of detected ? Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). 2 F. Fabiano and A. Dal Palù anomalies, the next examination is prescribed. Such investigations are ordered by increasing running cost, increasing sensitivity and decreasing accessibility. Usually CT scans show a very good sensitivity for cancer, but in case of small tumours (less than 2 cm) sensibility and specificity often drop because cancer is still indistinguishable from healthy tissue. Moreover, radiologist experience and training make a difference in spotting smaller tumours. Consequently, the diag- nosis is very challenging, and an early stage tumour can hide behind acquisition noise, even if other diagnostic modalities (i.e., MR) could highlight the problem. In recent years, several lines of research have been dedicated to support the radiologists’ work and to help them in processing and assessing the amount of data coming from CT scans. The goal is typically to train a system that suggests potential abnormalities during the experts’ analyses. In Section 2.2 a short overview is provided. The automated support for diagnosis is a challenging process, because of a combination of unique domain-specific features: – Acquisition noise: radiologists train their eyes to selectively remove noise from the images and to capture the tiniest shadow that could be associated to tumours; – Patients anatomy: compared to text-books, abdominal organs are subject to a high variability in terms of size, shape, position, ageing, genetics, etc.; – Annotated models: the cost for the creation of clean, complete and anno- tated model to be fed to automatic training requires some hours for a single scan. For effective training a large set of cases is needed; – Need for an explanation: a radiologist can support its diagnosis with a set of deductions coming from the observation of CT scans. This should be provided by the automated process as well, as mentioned in [9]. Machine learning approaches do not yet outperform human experts, as op- posed to other fields of AI applications, partly because of the above mentioned issues. We argue that such scenario will not drastically mutate in the future and therefore a different methodology could be more suitable and should be investigated. Our project is about the development of a software that could help radiolo- gists with the detection of early stage tumors and to avoid delayed diagnosis and treatment. The key idea is to build a system that relies on medical knowledge (anatomy and radiology) and reasons on it. Constraint programming [14] and Answer Set Programming [5] are the type of technologies suitable for this task. In particular, they produce a solution model that is compatible with a knowl- edge base and an input instance. Moreover, they can provide a high level proof that can be interpreted by experts, for both model improvement and diagnosis support. The challenging goal is to build Logic Programming (LP) programs that embed the reasoning tasks that radiologists activate during the interpretation of a scan. Typically, they refer to the text-book knowledge and they map it to the patient at hand. Differently from atlas based approaches, where a model is stretched to fit the instance, radiologist aggregate low level features found in the An ASP approach for arteries classification in CT-scans 3 scan and combine them according to a high level reasoning. As an example, many high level properties are retained among all human beings: liver is connected to aorta through the same set of vessels connections, despite their size, length and placement. In such scenario, liver identification can be easily tracked by following the correct junctions along the vessel network. In general, radiologists interleave different types of activities during the vi- sualization of scans: low level features and landmarks recognition in the scan, localization and labeling of organs and analysis of organs features. The process is often performed by looking at a single 2D scan slice at a time and then jumping among neighbor slices, creating an animation motion that helps in the removal of noise and in visualizing and highlighting 3D features. Usually, the 3D models reconstructions from raw data, available in all scan visualizer software, are not suitable for cancer diagnosis. Our expected result is to develop a software that could warn a radiologist about some potential issues, in the attempt to reduce false negative CT exams. The analysis could provide insightful information about relationships between organs and suggest whether the patient is a candidate for surgical treatment. Finally, the software should be integrated into the Picture Archive and Commu- nication System (PACS)[10], which is the standard for handling medical imaging data, in order to load it seamlessly in the radiologist workflow. This paper presents the preliminary models and results achieved so far. Sec- tion 2 covers the essential topics discussed in the paper. We shortly depict the main pipeline for the complete tool in Section 3 and we focus on the arteries de- tection and classification, namely the problem of assigning the anatomical name to each vessel in the scan. In Section 3.1 we briefly discuss the vessel extraction and in Section 3.2 we present an ASP model for the vessels’ labelling. We show some preliminary results in Section 4 and we conclude in Section 5. 2 Background In this section we review the basic notions that are covered by the project. We briefly review CT scans, some examples in automated organ recognition and some basics of Answer Set Programming. 2.1 CT scan Computer Assisted Tomography (in short CT) scan [8] is a well established X-ray based acquisition procedure for a general evaluation of pathologies and traumas. A patient’s body is acquired through a sequence of X-ray images that are shot from different positions such that they are orthogonal to a central axis (ideally placed from feet to the head). Each image shows the cumulative absorp- tion of X-rays caused by body’s content along a specific point of view, as seen in common 2D X-rays shots. Given the CT scanner controlled setup, a set of acquisitions can provide information about each single position in the body: a deconvolution procedure is run to determine the radiometric absorption of each 4 F. Fabiano and A. Dal Palù voxel (volume element). Typically, the size of a voxel is 1 mm3 . Such volumetric information is arranged in a 3D matrix, where each cell contains a scalar number which corresponds to the radiometric absorption of that specific voxel. The unit of measure is the HU (Hounsfield Unit). Different chemical elements, as well as different body structures, show different X-ray absorption. Therefore, 3D images can be used to observe organs, vessels, bones etc. Some structures show similar intensity or absorption levels and a trained radiologist is able to discriminate finer details in order to assess potential abnormalities. For example, typical tu- mours are less dense than healthy tissues and therefore they absorb less X-rays. This fact can be visualized on a 3D image as a darker area compared to the neighborhood, assuming to depict values with shades of gray ranging from black (low absorption, e.g., air) to white (high absorption, e.g., bones). Radiological images are saved with a standard file format called DICOM [6]. This image format is uncompressed and it contains 16 bit of information for each voxel (volume element). Each image usually provides a range of around 3000 distinct values among voxels. Since human vision is capable of distinguish much less variability (especially on a gray scale), an image may hide some relevant piece of information that is not visible to the eyes, even when enhancing tools are used by post processing software. Furthermore, CT images are affected by intrinsic noise related to the acquisition process and deconvolution of density, which may interfere with finer analysis. 2.2 Related work Artificial Intelligence, ranging from vision to machine learning, has been applied to CT scans, given the valuable richness of information and the high demand for increased accuracy in interpretation and diagnosis. The overall goal is not to substitute the radiologist, but to provide them with a tool that can assist their interpretation process, by highlighting regions of attention and presenting reasons that support the warning. Starting from low level vision problems, organ segmentation (see [15, 21, 18] for some examples and [11] for a survey) is a fundamental problem, where a program is asked to return a binary matrix, where the value 1 is assigned to voxels that belong to a specific organ/tissue. The extension of the request to multi-organ segmentation is rather trivial, while its implementation becomes more complex but it takes advantage of mutual classification choices. The state of the art methods lacks from satisfactory precision (up to 86% of volume correctly classified for pancreas), since the boundaries of organs are often mis-classified. Unfortunately, such regions are often the place where cancer can grow from (e.g., the tail of the pancreas) and radiologists are still able to outperform the tools. The circulatory system (including arteries and veins) can be identified by techniques similar to organ segmentation, adapted to the line-shaped structures [20, 12, 17]. In order to ease the identification of tiniest details, typical analyses require a contrast medium to be ingested by the patient before the scan. Built on top of feature extraction problem, an interesting goal is to correctly label An ASP approach for arteries classification in CT-scans 5 each vessel with its medical name. This can help in referencing the area of investigation and it can also help surgical preparation. On the high-level end, another fundamental problem is direct cancer detec- tion (e.g., [2]) customized for a selected organ. In this case, starting from a correct organ segmentation, the 3D voxels are analyzed in terms of texture and other features, typically by neural networks and classifiers. 2.3 ASP A general program P in the language ASP is a set of rules r of the form: a0 ← a1 , . . . , am , not am+1 , . . . , not an where 0 ≤ m ≤ n and each element ai , with 0 ≤ i ≤ n, is an atom of the form p(t1 , . . . , tk ), p is a predicate symbol of arity k and t1 , . . . , tk are terms built using variables, constants and function symbols. Negation-as-failure (naf) literals are of the form not a, where a is an atom. Let r be a rule, we denote with h(r) = a0 its head, and B + (r) = {a1 , . . . , am } and B − (r) = {am+1 , . . . , an } the positive and negative parts of its body, respectively; we denote the body with B(r) = {a1 , . . . , not an }. A rule is called a fact whenever B(r) = ∅; a rule is a constraint when its head is empty (h(r) = false); if m = n the rule is a definite rule. A definite program consists of only definite rules. A term, atom, rule, or program is said to be ground if it does not contain variables. Given a program P , its ground instance is the set of all ground rules obtained by substituting all variables in each rule with ground terms. In what follows we assume atoms, rules and programs to be grounded. Let M be a set of ground atoms (false ∈ / M ) and let r be a rule: we say that M |= r if B + (r) 6⊆ M − or B (r) ∩ M 6= ∅ or h(r) ∈ M . M is a model of P if M |= r for each r ∈ P . The reduct of a program P w.r.t. M , denoted by P M , is the definite program obtained from P as follows: (i) for each a ∈ M , delete all the rules r such that a ∈ B − (r), and (ii) remove all naf-literals in the the remaining rules. A set of atoms M is an answer set [5] of a program P if M is the minimal model of P M . A program P is consistent if it admits an answer set. We will make use of ASP to describe a model which will allow to classify the vessels extracted from the CT-scans. In fact, thanks to the declarative nature of ASP, it is possible to define concise and accurate anatomical rules (that follows “rigid” patterns) that allow a precise labelling even in presence of errors or uncertainty often produced by the acquisition noise. 3 The methodology outline Our system has been designed to take advantage of the best technologies for each of the three phases is composed of. It starts with a filtering process, in order to attenuate acquisition noise, and a low level feature extraction. The second phase, based on Logic Programming (LP), labels each body part in the 6 F. Fabiano and A. Dal Palù scan by reasoning on the set of features. Finally, organ evaluations can take place inside predicted organs volumes (here, original scan data are used in place of the filtered version for improved detection precision). Our vision is to build a system that is based, in the second phase, on LP. We believe that such technology proposes some distinctive advantages that can outperform other approaches. The other two phases already show promising results in the literature. The main advantage of LP is the capability of including a knowledge base, which can be build from anatomy textbooks and from radiologists interviews about their empirical interpretation procedures. Such knowledge can be trans- lated into a set of clear rules that drive the search of a model matching the set of patient’s low level features. For example, a rule can state that a specific artery extends on a specific area and it branches from another artery. A low-level feature can be a tubular shape with its interconnections. Since vessels form a tree-like structure, the process of matching boils down to a constrained graph coloring problem. Rather than training a classifier, the presence of a knowledge base allows to i) interact with Medical Doctors on a simple and natural language like level; ii) integrate the model with information extracted from already existing medical ontologies [3, 16]; iii) receive an explanation in terms of used rules to justify the answers; iv) consequently, update and refine the rules in a scientist-in-the-loop process; v) avoid to build a system that invests its training time for implicitly learning the anatomical general rules directly from low-level features. In partic- ular, information retrieval from ontologies could allow us to extract anatomical information regarding specific vessels (through specific queries). The extracted piece of information, associated to the anatomy of the vessel (e.g., its average length or radius) and/or the nomenclature of some of its component, will be then integrated to our knowledge base. As already said, thanks to the declara- tive nature of the model, this integration would be relatively easily achieved. Our system’s interpretation phase mimics the same mental procedure a ra- diologists follows when she/he observes a CT scan. We interviewed radiologists during some scans interpretations and asked them to report on the logics behind their actions and findings. In particular, the procedure appears to be stratified from a coarse to fine approach. There is a preference in ruling out geometric properties first (low-level features) and combine them to identify organs and pathologies. Such strategy allows to decouple feature extraction from the rea- soning phase. Therefore, off the shelf algorithm from computer vision can be employed and next specific LP programs can reason about features. In popular machine learning approaches the two steps are often merged together. The first radiologist action is to localize main landmarks and to find a refer- ence in the body. Even if organs position, size and shape can individually differ, there are some structural properties that are always constant (or more precisely, there is a set of known structural conformations that occur with some distribu- tion among population). Vessel organization and structure possess this property and they are exploited as a 3D map that gives directions to organs. The correct An ASP approach for arteries classification in CT-scans 7 classification of arteries (which are more visible compared to veins, even with- out a contrast drug) allows to reach main organs and unlock further deductions about neighborhood, boundaries etc. Here, radiologists use a global approach that takes advantage of the whole scan, starting from some safe landmarks. This phase resembles a constrained graph coloring, which merges radiometric prop- erties of organs, anatomy information and low level features extracted from the scan. We give a brief description of filtering and feature extraction in Section 3.1 and we focus on the the arteries classification problem in Section 3.2. Our future plan is to extend the LP analysis to the multi-organ classification phase. 3.1 Filtering and feature extraction In the literature, filtering is fundamental to remove acquisition noise and com- pensate for spurious interference (e.g. a titanium implant) [19, 13]. Total varia- tion is a rather effective filtering technique and it is anisotropic and edge pre- serving [7], so that shapes and boundaries are maintained at the correct position and contrast. We implemented an interleaved Total Variation and median filter in order to enhance arteries shape for the goals of our paper. Unlike [12, 20], where contrast medium in the arteries allows a simple thresh- olding on the image, we opted for a finer analysis. For each voxel we compute the principal axis (main eigenvector) in density arrangement and connect voxels that are oriented along compatible and strong 1D arrangements. Such procedure returns pieces of reliable tubular fragments that are merged in a bottom up pro- cess. The ones that are also connected to the thoracic Aorta (the main artery connected to the heart), are the candidates for labeling. Such set of arteries are managed as a forest of trees that branch from the Aorta. Additional features are added to tree branches, e.g. estimated diameter and body quadrant spanned. Figure 1, on the left, depicts a slice of the CT scan, where voxels values are plotted with a rainbow palette. Each voxel’s principal direction is printed with short red lines centered at density local barycenter. It can be seen that clusters of coherent lines are positioned along artieries (dark green color regions). In the same Figure on the right, the local cues are merged and tracked. The resulting yellow lines cover the arteries barycenters. The vertical green line depicts the Aorta. 3.2 ASP vessels classification We focus now on the automated process that examines the graph produced by the feature extraction phase and associates the largest subset of arcs to the corresponding anatomical names. We introduce an Answer Set Programming model that, exploiting anatomical rules and relations, performs a labeling of the main arteries that separate from the thoracic Aorta. This is a minimal test- bed domain that is sufficient to contain the general issues in finding the correct labeling. Nevertheless, the program can be extended to cover additional regions with the effort of coding new set of modular rules. Since all the rules can be 8 F. Fabiano and A. Dal Palù Fig. 1. Detection of voxels 1D directions (left) and vessel tracking (right) expanded and modified (thanks to the nature of declarative programming), we focus on the structure of the program rather than the accuracy of the rules of knowledge base in use. Given the presence of possible errors due to acquisition noise, filtering and extraction of low-level features, the ASP model needs to identify the best clas- sification while allowing the presence of both missing and incorrect information. For example, some major artery could not be detected because the junction with Aorta is outside the scan and/or some set of voxels could resemble an important vessel while only being a cluster of acquisition noise or a surface of an organ. Let us note that in this proof of concept, the set of rules in use is based onto anatomical relationships retrieved from atlas depictions. Some work will be dedicated to enrich such information thanks to a critical review by radiologists. In particular, we focus on the set of arteries that separate from the Aorta. Branching of such arteries along sub-trees, except for the branches of the celiac trunk, are not yet considered. Domain Knowledge Let us present the organization of knowledge as extracted by the procedure described above. We prepare some facts that constitute the knowledge base. Such knowledge base covers all the vessels we are interested in and that are going to be assigned or discarded by the ASP model. In particular, the factual domain knowledge is formed of three main different types of facts – m art(ID, Z, A, R): it describes a main artery vessel, i.e. a vessel that separates from the Aorta ; – b art(ID, R): it describes a bifurcation artery vessel, i.e. any vessel that branches from a main artery vessel; – edge(ID1, ID2): it encodes the tree parent-child relationship between two connected arteries. The parameters of the predicates have the following meaning: – ID/ID1/ID2: the unique id associated to a single vessel; – Z: an integer that contains the position along the z axis (i.e. the slice number) where a connection takes place. Intuitively, a higher value is closer to the head and a lower number is closer to feet; An ASP approach for arteries classification in CT-scans 9 – A: the angle, in degrees, formed by the vessel when it separates from the aorta (where the 0 corresponds with left side of the body and increases clockwise); – R: the radius, in voxels, of the vessel at the separation from the Aorta. The domain knowledge is automatically extracted. In case of additional geomet- rical properties, the procedure can be extended to produce such data. The model The ASP model objective is to associate the vessels, identified through the domain knowledge, to their anatomical description. The program also needs to discern between vessels that represents actual anatomical features from the ones that are false positives. To perform an accurate labeling we created a model comprised of anatomical rules that describe some fundamental properties that help the characterization of a specific label. In particular, several properties associated to each single label are stored: i) the relative (w.r.t. the other vessels) z-height of a vessel; ii) its radius; iii) its branching direction angle; iv) its number of entering/exiting edges; and v) its interconnections with other vessels. To achieve an optimal classification, we search for a model that verifies the highest number of rules. The model is reinforced by the presence of multiple properties for each vessel and the knowledge base uncertainty, errors and miss- ing information can be handled as well. The program needs to be functional even when some major piece of information is missing (e.g., the edge between the Aorta and one of its main branches), and that is why we chose an optimiza- tion problem on the number of rules rather than imposing strict constrains on labeling. Let us present a simplified version of the ASP model. For the sake of read- ability we show only two arteries’ description. The classification of the remaining ones follows the same pattern with according anatomical rules. First of all, let us start by introducing some predicates that are used to identify anatomical characteristics (i.e., arteries’ radius and direction) with dif- ferent degrees of precision. Not all the predicates are included for the sake of readability. rad small (R) :- rad(R), R > 0, R ≤ 20. rad big (R) :- rad(R), R > 20. rad 1 (R) :- rad(R), R > 0, R ≤ 5. rad 2 (R) :- rad(R), R > 5, R ≤ 10. ··· rad 8 (R) :- rad(R), R > 40. quad 1 (A) :- angle(A), A > 0, R ≤ 90. ··· quad 4 (A) :- angle(A), A > 270. R ≤ 360. semiquad 1 (A) :- angle(A), A > 0, A ≤ 45. ··· semiquad 8 (A) :- angle(A), R > 315. A ≤ 360. 10 F. Fabiano and A. Dal Palù where rad(1..60) and angle(1..360) are facts. Intuitively the predicates rad small(R) and rad big(R) are used to classify the radius in two main groups to perform a first partition of the vessels. Then, thanks to predicates, rad {1/2../8}(R) it is possible to refine this characteri- zation as in the model of the celiac trunk artery. The predicates related to the direction angle follow the same principle: the predicates quad {1/2/3/4}(A) give a coarse classification of the angles while the predicates semiquad {1/2../8}(A) refine it. We present now the model of the celiac trunk (referred to as celiac) and of the left gastric (referred to as gast) arteries. While the former is a branch of the Aorta (and is therefore a m art) the latter is a b art being a bifurcation of the celiac trunk itself. As before, we present a simplified version of the model. p b art (ID, R ) :- id(ID), rad(R), b art in(ID, R). p m art (ID, Z, A, R ) :- id(ID), height(Z), angle(A), rad(R), m art(ID, Z, A, R). artery (ID, N) :- id(ID), name(N), b art(ID, , N). artery (ID, N) :- id(ID), name(N), m art(ID, , , , N). m art (ID, Z, A, R, N) :- art gen(ID, N), name(N), p m ar(ID, Z, A, R), N = celiac, quad 1(A), rad big(R). conf r (celiac, 0) :- m art( , , A, , celiac), semiquad 2(A). conf r (celiac, 1) :- m art( , , , R, celiac), rad 6(R). conf r (celiac, 2) :- m art(ID, , , , celiac), edge(aorta, ID). conf r (celiac, 3) :- m art(ID1, , , , celiac), b art(ID2, , gast), edge(ID1, ID2). conf r (celiac, 4) :- m art( , , , Z1, celiac), m art( , , , Z2, sup mese), Z1 > Z2. conf r (celiac, 5) :- m art( , , , Z1, celiac), m art( , , , Z2, inf mese), Z1 > Z2 + 30. b art (ID, R, N) :- art gen(ID, N), name(N), p m ar(ID, Z, A, R), N = gast, rad big(R). conf r (gast, 0) :- m art( , , , R, celiac), rad 6(R). conf r (gast, 1) :- b art(ID2, , gast), m art(ID1, , , , celiac), edge(ID1, ID2). conf r (gast, 2) :- b art(ID2, , gast), K = 1, K = #count{ID2 : edge(ID1, ID2)}. Where the predicates id(ID/ID1/ID2) and height(Z/Z1/Z2) are extracted from the domain knowledge. The general idea behind the presented model is to generate, for each artery fact in the knowledge base (m art and b art), a predicate that associates the id of the artery to a name. This first association follows some loose anatomical rules that associate the artery labels only to vessels that respect such rules. An ASP approach for arteries classification in CT-scans 11 That is why predicates m art and b art only use the less precise radius and angle descriptors (i.e., rad {big/small} and quad {1/.../4}). If a vessel does not satisfy such general rule, it is surely not involved in such label. When the IDs are associated to the names, the program checks which of the various confidence rules (conf r) are satisfied for a certain artery. These sets of satisfied rules will be then used to give a score to each model instantiation; the more rules are satisfied, the more the labeling is optimal. Let us show how this optimization problem is encoded in our ASP program. 0{art gen(ID, N) : name(N)}1:- id(ID). rule count(K):- K = #count{N, R : conf r(N, R)}. #maximize{K : rule count(K)}. These lines of code tell the solver Clingo [4] to select the generation of predicates art gen such that the various associations ID-name satisfy the highest number of confidence rules. Line 0 {art gen(ID,N) : name(N)} 1 allows to skip some association ID-name because, as already said, the knowledge base could miss the description of all the vessels. 4 Results The results of the features extraction on a filtered CT scan (Figure 1) have been converted into facts of the knowledge base for the ASP model presented in Section 3.2. As we already said, the model is comprise of rules extracted from atlas by non medical experts and is, therefore, still not completely accurate. Our preliminary test of our model focuses on labeling the thoracic aorta, its main branches and some of the sub-arteries of such branches. We modelled a total of 20 arteries. The model correctly classified the Aorta and its main branches. On the other hand the classification of the sub-arteries is not as accurate, since we did not include a large set of rules in the model. However, the knowledge coming from the extracted vessels is richer and we believe further information can be exploited and used for labelling the whole set of arteries. Moreover, the features extraction has not been properly tuned. We prefer to keep some spurious arteries rather than missing some true positives. Therefore, wrong edges predicates are present, thus increasing the number of candidate vessels that can be labeled as sub-arteries. All the experiments were performed on a 2.20GHz Intel Core i7-8750H ma- chine with 16 GB of memory and with Ubuntu 18.04.5 LTS. The ASP program was executed exploiting the multi-thread (eight threads in parallel) capabili- ties of Clingo. The average number of found models is 15, while the average running times of the program were 0.06 and 124.75 seconds to find the op- timal model and to conclude the search, respectively. We also measured the accuracy = correctly labeled arteries total number of arteries for the optimal model. As we already men- tioned, we were able to perform the complete test on a single CT scan, obtain- ing an accuracy of 70%. It is important to notice that the accuracy restricted 12 F. Fabiano and A. Dal Palù to the main Aorta branches was 87.5%. The global accuracy is lower because some sub-arteries are unclassified and some mis-classified, because of the limited knowledge input to the model. The results of the classification are summarized in Table 1. artery type label artery type label artery type label aorta main 3 inf. mesent. main 3 2nd lumbar R main 3 celiac main 3 suprarenal R main 3 2nd lumbar L main 3 gastric bif. 7 suprarenal L main 3 3rd lumbar R main 3 splenic bif. − renal R main 3 3rd lumbar L main 3 hepatic bif. 7 renal L main 3 4th lumbar R main 3 pancreatic bif. 7 1st lumbar R main − 4th lumbar L main 3 sup. mesent. main 3 1st lumbar L main − Table 1. The result of the ASP classification. The field type is used to distinguish between main branches of the Aorta (main) and sub-trees of the main branches (bif.). The symbols 3, 7 and − identify a correctly classified, a mis-classified and an unclas- sified artery, respectively. In Figure 2 we depict the Aorta (the large vertical red pipe) and branching arteries, with a different color associated to each edge. Labelled edges are tagged with the computed name, while sub-branches are often given a not assigned label. Fig. 2. Two views of labelled artieries An ASP approach for arteries classification in CT-scans 13 5 Conclusion This paper presents a Declarative Programming methodology for CT scan au- tomated interpretation. We focus on arteries detection and classification and we introduced an ASP model to label the vessels. This model relies on anatomical rules to identify the best association of detected vessels with their anatomi- cal name. The implemented ASP program allows to identify the main arteries that branch out from the Aorta and some of their bifurcations. Our preliminary results show that the methodology is viable. More work will be dedicated to increase the knowledge base on the complete set of thoracic arteries, to extract a larger set of descriptors and to tune the labelling to different CT scans. The program contains a set of anatomical rules that are easily modifiable; currently, we populate the rules with simple spatial relationships derived from anatomical atlases. This is a key feature that allows, in future versions, to in- tegrate the experience of radiologists in the model, making it more accurate. Moreover, we presented a version of the program with only “positive” rules: that is, rules that, if activated, contribute to the support of a desired property. The program can be easily expanded to also introduce “negative” rules that penalize certain model’s instantiations, when activated. We plan to differentiate weights associated to rules, reflecting the relevance of certain properties, from anatomical or empirical viewpoint. Moreover, in a future version of the model, we plan to also add dependencies between the confidence rules. This last three future works will need supervision of expert radiologists that could help us in recognizing both “negative” anatomical rules and the weight of the confidence rules. Finally, being the model based on declarative programming, it is possible to interface with tools that would parse the output of the model in natural lan- guage. This will allow to produce an explanation for the labeling and confidence rules choices. Having explainable AI techniques [9] is, in fact, a desiderata when introducing automated tools in environments where the health of the patients is taken into consideration. Introducing tools that explain the results will allow a more confident usage of the program from the medical operators and an easier error pruning/detection. Acknowledgements The authors would like to thank the GNCS 2019 project Logic programming for early detection of pancreatic cancer and Medical Doctors Mirko D’Onofrio and Nicolò Cardobi (University of Verona) for providing CT scan data and useful discussions. References 1. AIRTUM, G.I.: Inumeri del cancro in italia: Intermedia editore (2018) 14 F. Fabiano and A. Dal Palù 2. Chu, L.C., Park, S., Kawamoto, S., Wang, Y., Zhou, Y., Shen, W., Zhu, Z., Xia, Y., Xie, L., Liu, F., et al.: Application of deep learning to pancreatic cancer detection: Lessons learned from our initial experience. Journal of the American College of Radiology 16(9), 1338–1342 (2019) 3. Deshpande, P., Rasin, A., Brown, E.T., Furst, J., Montner, S.M., Armato III, S.G., Raicu, D.S.: Augmenting medical decision making with text-based search of teaching file repositories and medical ontologies: Text-based search of radiol- ogy teaching files. International Journal of Knowledge Discovery in Bioinformatics (IJKDB) 8(2), 18–43 (2018) 4. Gebser, M., Kaminski, R., Kaufmann, B., Schaub, T.: Clingo = ASP + control: Preliminary report. CoRR abs/1405.3694 (2014) 5. Gelfond, M., Lifschitz, V.: The stable model semantics for logic programming. In: ICLP/SLP. vol. 88, pp. 1070–1080 (1988) 6. Graham, R.N., Perriss, R.W., Scarsbrook, A.F.: Dicom demystified: a review of digital file formats and their use in radiological practice. Clinical radiology 60(11), 1133–1140 (2005) 7. Grasmair, M., Lenzen, F.: Anisotropic total variation filtering. Applied Mathemat- ics & Optimization 62(3), 323–339 (2010) 8. Herman, G.T.: Fundamentals of computerized tomography: image reconstruction from projections. Springer Science & Business Media (2009) 9. Holzinger, A., Biemann, C., Pattichis, C.S., Kell, D.B.: What do we need to build explainable ai systems for the medical domain? arXiv preprint arXiv:1712.09923 (2017) 10. Hood, M.N., Scott, H.: Introduction to picture archive and communication systems. Journal of Radiology Nursing 25(3), 69–74 (2006) 11. Kardell, M.: Automatic segmentation of tissues in ct images of the pelvic region. master thesis (2014) 12. Kitasaka, T., Kagajo, M., Nimura, Y., Hayashi, Y., Oda, M., Misawa, K., Mori, K.: Automatic anatomical labeling of arteries and veins using conditional random fields. International Journal of Computer Assisted Radiology and Surgery 12(6), 1041–1048 (2017) 13. Ritschl, L., Bergner, F., Fleischmann, C., Kachelrieß, M.: Improved total variation- based ct image reconstruction applied to clinical data. Physics in Medicine & Bi- ology 56(6), 1545 (2011) 14. Rossi, F., Van Beek, P., Walsh, T.: Handbook of constraint programming. Elsevier (2006) 15. Roth, H.R., Farag, A., Lu, L., Turkbey, E.B., Summers, R.M.: Deep convolutional networks for pancreas segmentation in ct imaging. In: Medical Imaging 2015: Image Processing. vol. 9413, p. 94131G. International Society for Optics and Photonics (2015) 16. Rubin, D.L.: Creating and curating a terminology for radiology: ontology modeling and analysis. Journal of digital imaging 21(4), 355–362 (2008) 17. Trentin, C., Faggiano, E., Conti, M., Auricchio, F.: An automatic tool for thoracic aorta segmentation and 3d geometric analysis. In: 2015 9th International Sym- posium on Image and Signal Processing and Analysis (ISPA). pp. 60–65. IEEE (2015) 18. Wang, Y., Zhou, Y., Shen, W., Park, S., Fishman, E.K., Yuille, A.L.: Abdomi- nal multi-organ segmentation with organ-attention networks and statistical fusion. Medical image analysis 55, 88–102 (2019) An ASP approach for arteries classification in CT-scans 15 19. Zeng, G.L., Li, Y., Zamyatin, A.: Iterative total-variation reconstruction ver- sus weighted filtered-backprojection reconstruction with edge-preserving filtering. Physics in Medicine & Biology 58(10), 3413 (2013) 20. Zhang, W., Liu, J., Yao, J., Summers, R.M.: Automatic anatomical labeling of abdominal arteries for small bowel evaluation on 3d ct scans. In: 2013 IEEE 10th international symposium on biomedical imaging. pp. 210–213. IEEE (2013) 21. Zhou, Y., Xie, L., Shen, W., Fishman, E., Yuille, A.: Pancreas segmentation in ab- dominal ct scan: a coarse-to-fine approach. arXiv preprint arXiv:1612.08230 (2016)