<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>WOA</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Unlocking Insights and Trust: The Value of Explainable Clustering Algorithms for Cognitive Agents⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Federico Sabbatini</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roberta Calegari</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science and Engineering (DISI), Alma Mater Studiorum-University of Bologna</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Pure and Applied Sciences, University of Urbino Carlo Bo</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>24</volume>
      <fpage>6</fpage>
      <lpage>8</lpage>
      <abstract>
        <p>In the realm of cognitive agents, including both human users and AI systems, explainable clustering algorithms have gained prominence. These algorithms ofer enhanced transparency, making clustering results comprehensible to users and aiding AI systems in decision-making. They also facilitate knowledge discovery by revealing cluster characteristics, reducing cognitive load for users, and playing a vital role in ethical and bias mitigation. This paper introduces an innovative extension of the existing PSyKE framework, designed to support explainable clustering techniques and, thus, to augment cognitive agent capabilities. State-of-the-art review, experiment findings, and a synthesis of key insights are also provided.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Explainable clustering</kwd>
        <kwd>Explainable artificial intelligence</kwd>
        <kwd>Symbolic knowledge extraction</kwd>
        <kwd>PSyKE</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        In the realm of cognitive agents, which encompass both human users and artificial intelligence
(AI) systems, the advent of explainable clustering algorithms has gained significant attention [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
These algorithms ofer several advantages that amplify the eficacy and transparency of
clustering processes across diverse domains [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. This paper explores the benefits of explainable
clustering algorithms to augment the capabilities of cognitive agents.
      </p>
      <p>
        At the forefront of these advantages there is the enhanced transparency provided by
explainable clustering algorithms [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. They have the capability to yield clustering results in a
comprehensible and interpretable manner, ensuring that both human users and AI systems can
discern the rationale behind the grouping of data points. This transparency, we argue, is an
indispensable element for fostering trust in AI systems and empowering human users to engage
in the validation and comprehension of the clustering process. Moreover, explainable
clustering algorithms ofer improved decision support, which is invaluable in the realm of cognitive
agents [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. These agents often rely on clustering outcomes to make informed decisions or to
provide recommendations. The inherent transparency in explainable clustering assists these
agents in deciphering the intricate structures within data, thereby facilitating more informed
and robust decision-making processes. Beyond their utility in decision-making, explainable
clustering algorithms serve as an instrument for efective knowledge discovery [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Clustering
serves as a foundational step in knowledge discovery, and explainable clustering algorithms take
this a step further. Not only do they create clusters, but they also unravel the defining
characteristics that distinguish each cluster. This capacity empowers cognitive agents to gain insights
into complex data sets, enriching the pool of knowledge they can leverage. Further, explainable
clustering algorithms also contribute to reduce cognitive load on human users, particularly
in the face of complex tasks, such as clustering high-dimensional data sets [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. By presenting
clustering results in a more digestible and comprehensible manner, these algorithms alleviate
the cognitive burden placed on human users, promoting eficiency and accuracy. Perhaps most
significantly, explainable clustering plays a pivotal role in ethical and bias mitigation [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. It
empowers cognitive agents to identify and rectify potential biases or ethical concerns within
the data or the clustering process itself. By enabling the explainability of clustering decisions,
explainable clustering algorithms support the pursuit of fairness and equity in data-driven
processes.
      </p>
      <p>In light of these considerations, this paper introduces a groundbreaking extension of the
PSyKE framework tailored to enhance the capabilities of cognitive agents via explainable
clustering support. The paper is organised as follows. A state-of-the-art review is first provided
(Section 2), followed by our proposal, the explainable clustering support integrated within
the PSyKE Framework (Section 3). We then delve into the findings of experiments conducted
(Section 4) and conclude with a synthesis of key insights derived from our exploration.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Works</title>
      <sec id="sec-2-1">
        <title>2.1. Explainable Clustering</title>
        <p>
          Several explainable clustering techniques have been developed in the last decades and it is
possible to find in the literature their practical application in critical areas, also to tackle complex
tasks involving image data sets and medical time series [
          <xref ref-type="bibr" rid="ref7 ref8 ref9">7, 8, 9</xref>
          ].
        </p>
        <p>
          A subset of the proposed algorithms are based on tree-based clustering according to diferent
strategies that may be classified as top-down or bottom-up [
          <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref14 ref15 ref16">10, 11, 12, 13, 14, 15, 16</xref>
          ]. Explainable
clustering methods adhering to the top-down approach usually start by building the tree root
node, associated with the whole training data set. Successively, the root node’s data are
partitioned into disjoint subsets associated with the child nodes and this is recursively repeated
to grow the clustering tree. The tree expansion ends according to a stopping criterion that may
consider the predictive performance of the clustering at a given depth and/or the availability of a
ifxed, minimum amount of training samples in deep nodes. Each internal node may correspond
to a constraint on an individual input feature or a set of constraints involving all of them. A
common characteristic of top-down strategies is the input feature space partitioning via cutting
hyperplanes that are perpendicular to the data dimensions.
        </p>
        <p>
          Diferent approaches are the explanation of clusters via rectangular input space
partitioning [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], or the description of clusters in terms of centroids and distances [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. The former
may achieve a good human-interpretability extent, since it describes clusters based upon only 2
interval inclusion constraints. However, the algorithm may combine multiple input attributes
and thus consider preconditions on new, composite features. This behaviour may constitute a
hindering factor for human interpretability.
2.1.1. CLASSIX
The CLASSIX (contrived acronym defined by the authors as “CLustering by Aggregation with
Sorting-based Indexing” and the letter “X” for “eXplainability”) algorithm [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] has been recently
proposed as a novel 2-phase explainable clustering procedure. It is presented as a technique
denoted by small computational time requirements.
        </p>
        <p>The first phase of CLASSIX is a greedy aggregation aimed at creating groups of training
instances having “small” distances from each other. The distance may be tuned by users through
a dedicated input parameter. It is worth noting that a preceding sorting step is required to
perform the aggregation.</p>
        <p>
          The second phase consists of merging the groups into definitive clusters. Two merging
strategies are supported by CLASSIX, namely, density- or distance-based (see [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] for further
details).
        </p>
        <p>CLASSIX requires two used-defined parameters defining (i) a lower-bound for the accepted
cluster size, intended as the number of samples, and (ii) an upper-bound for the distance between
training instances assigned to the same group (with reference to the aggregation phase).</p>
        <p>
          The CLASSIX technique may provide explanations locally or globally. Global explanations
are built based on the coordinates of the initial points for each individual group created at the
end of the first phase of the procedure. On the other hand, two kinds of local explanations are
supported. It is possible to obtain the reason behind the cluster assignment corresponding to an
individual instance or CLASSIX may be queried to explain why two instances are assigned to
the same cluster or not. Local explanations are provided by listing the operations performed
during CLASSIX’s merging phase.
2.1.2. IMM
The IMM (Iterative Mistake Minimization) clustering procedure [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] is presented by the authors
as an accurate, eficient, and interpretable method based on the induction of decision trees.
The output decision trees are binary and their internal nodes are associated with training data
partitions. Splits corresponding to internal nodes always involve individual input attributes.
        </p>
        <p>The IMM algorithm requires growing a tree having  leaves to identify as many clusters. The
tree induction considers a set of desiderata, e.g., keeping the tree size as small as possible and
minimising the cluster’s fragmentation while deepening the tree. Fragmentation is intended as
spreading instances belonging to a single cluster over multiple subtrees.</p>
        <p>Explanations for individual cluster assignments are provided by describing the complete
paths starting from the tree root through the leaves associated with those assignments. It is
also possible to obtain global explanations for the clustering by listing all the existing paths to
the diferent leaves.</p>
        <p>As for the tree growth complexity, it is worth noting that if IMM identifies  clusters the
corresponding tree has a depth equal to  − 1 in the worst case (unbalanced tree). As a result,
any clustering assignment is described in terms of the conjunction of at most  − 1 constraints
on individual input features.</p>
        <sec id="sec-2-1-1">
          <title>2.1.3. ExACT and CREAM</title>
          <p>
            ExACT [
            <xref ref-type="bibr" rid="ref15">15</xref>
            ] and CREAM [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ] are tree-based explainable clustering techniques achieving
humaninterpretability via hypercubic approximation of the identified clusters. Both algorithms induce
a binary tree to partition the input feature space and each internal node of the tree corresponds
to a hypercube-inclusion constraint.
          </p>
          <p>Trees are built recursively, according to a top-down strategy, and each node of the tree
corresponds to an input feature space subregion. The tree root is associated with the surrounding
cube, i.e., the minimal cube enclosing all the training instances. During a single recursive
iteration an internal node is marked with a hypercube-inclusion constraint and therefore its
two child nodes represent the hypercubic partition of the input feature space denoted by the
constraint on one side and the complementary subregion on the other side.</p>
          <p>
            Both ExACT and CREAM exploit underlying instances of Gaussian mixture models [
            <xref ref-type="bibr" rid="ref19">19</xref>
            ]
to identify relevant clusters of data and instances of DBSCAN [20] to remove outliers from
the identified clusters. Clusters without outliers are then approximated via hypercubes. The
selection of the best splits to be associated with internal nodes follows diferent approaches.
ExACT tries to perform a greedy minimisation of the cluster fragmentation, whereas CREAM
opts for a greedy maximisation of the estimated predictive performance corresponding to the
available splits. The two approaches are thus based on the selection of best local alternatives,
without any guarantees of absolute optimality. The diferences between ExACT and CREAM are
depicted in Figure 1 for artificial data sets having concentric [
            <xref ref-type="bibr" rid="ref15">15</xref>
            ] or overlapping [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ] clusters.
          </p>
          <p>
            Three user-defined parameters are required by ExACT and CREAM, namely: (i) a maximum
tree depth; (ii) a predictive error threshold; and (iii) an upper-bound for the number of clusters
identifiable via Gaussian mixture models. Depth and error threshold may be automatically
tuned with the OrCHiD procedure [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ].
          </p>
          <p>It is worth noting that besides mere clustering tasks ExACT and CREAM may also be applied
to perform explainable classification and regression, given that they are able to associate to
each cluster one amongst the following outputs: cluster ids, class labels, constant values and
linear combinations of the input features [21, 22].</p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. The PSyKE Framework</title>
        <p>PSyKE is a general-purpose Python software library mainly dedicated to symbolic knowledge
extraction [23, 24], but also providing a suite of tools for data pre-processing, manipulation and
visualisation as well as for machine learning tasks. It ofers a unified interface for several
extraction techniques belonging to the pedagogical paradigm and it supports interoperability with
other widely adopted Python packages, as numpy, pandas and sklearn [25]. Interoperability
with Semantic Web tools is also provided [26]. Knowledge-extraction techniques supported by
PSyKE can be applied to any kind of supervised machine learning model without limitations
about the nature of the task at hand, i.e., classification as well as regression.</p>
        <p>At the time of writing PSyKE includes implementations of the following knowledge-extraction
procedures: Rule-extraction-as-learning (REAL) [27], Trepan [28], Cart [29], Iter [30],
GridEx [31], GridREx [32] and CReEPy [33]. These techniques provide global explanations
for the predictions obtained via opaque machine learning models in the form of a
humaninterpretable Prolog theory. Therefore, PSyKE may be exploited as a tool to achieve trustworthy
artificial intelligence [34].</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Explainable Clustering Support in the PSyKE Framework</title>
      <p>In order to support explainable clustering within the PSyKE framework, the structure of the
main project package (the psyke package) has been totally redesigned. The new structure of
the software library, depicted in Figure 2, is efective from version 0.5 1. Only the psyke package
is shown in the figure, since it is the main subject of the presented framework extension.</p>
      <sec id="sec-3-1">
        <title>3.1. The EvaluableModel Interface</title>
        <p>The current design of PSyKE’s main package is based on the notion of EvaluableModel, an
interface representing any predictive model that may be evaluated via some scoring metric
(e.g., a machine learning predictor, an interpretable model obtained via knowledge extraction, a
clustering technique). Evaluable models in PSyKE come along with information about the
preprocessing routines applied to the data sets, i.e., the parameters applied to perform normalisation
and/or discretisation. Any evaluable model should be able to provide predictions and its
predictive performance should be assessable through an adequate scoring function.</p>
        <p>Accordingly, the interface exposes two methods. The predict method is abstract and
accepts a dataframe (i.e., a pandas dataframe, but also numpy arrays are accepted) and returns
the corresponding predictions. The definition of this method depends on the specific model.
Therefore, it must be defined within other classes implementing the EvaluableModel interface.
The score method accepts a dataframe and a scoring function and then returns the scoring
function evaluated on the instances of that dataframe. The interface provides scoring functions
for classification (e.g., classification accuracy, F 1 score and confusion matrices), regression (e.g.,
mean absolute/squared error and R2 score), and clustering (e.g., adjusted Rand index, adjusted
mutual information, V-measure and Fowlkes-Mallows score).</p>
        <p>The EvaluableModel interface is extended by three other interfaces, namely:
HyperCubePredictor describing any evaluable model whose predictions are based on a
hypercubic partitioning of the input feature space. The set of hypercubes is an attribute
defined by the interface. It also defines the inherited predict method;
Extractor representing any evaluable model providing interpretable predictions by
highlighting symbolic input/output relationships extracted from an opaque predictor. Relationships
are learned via the extract method, requiring as input parameters a training dataframe
and an opaque predictor (e.g., a machine learning model from the sklearn library or any
1Code available at https://github.com/psykei/psyke-python
Figure 2: UML class diagram for the psyke package of PSyKE version 0.5.</p>
        <p>other object having a predict method). The extract method is abstract since it difers
based on the individual extraction techniques and thus it has to be defined by classes
extending the Extractor interface;
Clustering resuming the properties of any explainable clustering technique that may be
iftted on a dataframe and explained via human-interpretable descriptions of the identified
clusters. Accordingly, it exposes two abstract methods for these purposes, to be defined
by inheriting classes.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. The extraction Package</title>
        <p>The psyke package of the PSyKE library encloses the extraction sub-package, dedicated
to symbolic knowledge extraction from opaque machine learning models. The sub-package
contains an interface representing a generic pedagogical knowledge-extraction algorithm
(the PedagogicalExtractor interface, extending the Extractor interface). Three
pedagogical knowledge-extraction algorithms (namely, REAL, Trepan and Cart) implement the
PedagogicalExtractor interface. Each algorithm is enclosed in a dedicated sub-package and
the corresponding main class defines the abstract methods predict and extract inherited
from EvaluableModel and Extractor, respectively.</p>
        <p>The extraction sub-package contains an inner sub-package named hypercubic, dedicated
to hypercube-based knowledge extractors. It defines the HyperCubeExtractor interface,
representing a generic extractor of this kind and extending both the HyperCubePredictor and
the PedagogicalExtractor interfaces. HyperCubeExtractor is realised by four diferent
classes implementing as many knowledge extractors (i.e., GridEx, GridREx, Iter and CReEPy),
each one encapsulated in an individual package. Only the extract method is defined by these
classes, given that the predict method is common and already defined and inherited from
HyperCubePredictor.</p>
        <p>The features of knowledge-extraction algorithms implemented in PSyKE are listed in Table 1,
with particular focus on the translucency of the extractors, the supported machine learning task,
the kind of accepted input features and provided outputs, the shape of the extracted knowledge
and the interpretability extent achieved by the algorithms.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. The clustering Package</title>
        <p>The explainable clustering techniques ofered by PSyKE ( ExACT and CREAM) are
contained in the clustering package and realise the HyperCubeClustering interface, given
that they are both clustering procedures based on hypercubic partitioning of the input
feature space. The HyperCubeClustering interface, in turn, extends the aforementioned
HyperCubePredictor and Clustering interfaces. As a result, only the fit and explain
methods need to be defined within the classes implementing explainable clustering techniques.
Since CREAM is an extension of the ExACT algorithm, the corresponding classes follow an
adequate hierarchy. Nonetheless, each algorithm is encapsulated in an individual package.</p>
        <p>The features of the explainable clustering techniques supported by PSyKE are resumed in
Table 1.</p>
        <p>Ref. paper</p>
        <sec id="sec-3-3-1">
          <title>Trans.:</title>
          <p>Pedagogical
Decomp.</p>
        </sec>
        <sec id="sec-3-3-2">
          <title>Task:</title>
          <p>Classification
Regression
Clustering</p>
        </sec>
        <sec id="sec-3-3-3">
          <title>Input feat.:</title>
          <p>Binary
Discrete
Continuous</p>
        </sec>
        <sec id="sec-3-3-4">
          <title>Output:</title>
          <p>String label
Constant
Linear eq.
Cluster id</p>
        </sec>
        <sec id="sec-3-3-5">
          <title>Knowledge:</title>
          <p>Rule list
Decision tree</p>
        </sec>
        <sec id="sec-3-3-6">
          <title>Interpret.:</title>
          <p>Global
Local
×
×
×
×
×
× ‡</p>
          <p>× ‡
×
×
×
×
Summary of the knowledge-extraction and explainable clustering algorithms supported by PSyKE
version 0.5. Translucency is not a property of explainable clustering techniques; it is thus reported only
for knowledge extractors.</p>
          <p>REAL
[27]</p>
          <p>Trepan
[28]</p>
          <p>Knowledge extraction
Cart Iter GridEx</p>
          <p>GridREx</p>
          <p>CReEPy
[29]
[30]
[31]
[32]
[33]</p>
          <p>
            Clustering
ExACT CREAM
[
            <xref ref-type="bibr" rid="ref15">15</xref>
            ]
[
            <xref ref-type="bibr" rid="ref14">14</xref>
            ]
× S
× S
× S
× S
× S
× × ◇ × × ◇ × × ◇ × × × ◇ × ×◇ × × ◇ × ◇ × × ◇ × × ◇
* Only if binary values are encoded as numbers, e.g., 0 and 1
‡ Only if discrete values are binarised, e.g., via one-hot encoding
† Only if discrete values are numeric
S Decision trees may be linearised into an ordered list of rules
◇ Local interpretability may be achieved by considering individual items of the global explanation
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Illustrative Experiment</title>
      <p>To demonstrate the efectiveness and the human-interpretable degree of the explanations
provided by PSyKE’s clustering techniques we carried out a set of experiments on the
wellknown Iris data set [35]. The 150 data instances have been split into training and test sets (75%
+ 25%). The training set was then used to fit instances of ExACT and CREAM. Best values for
the depth and error threshold hyper-parameters of the explainable clustering techniques have
been estimated with OrCHiD. We used a value of 3 for the remaining parameter expressing the
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
*
†
×
×
×
×
×
×
×
× *
× †
×
×
×
×
×
×
×
× *
× †
×
×
×
×
×
×
× *
× †
× *
× †
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
×
× *
× †
(a) ExACT.
(b) CREAM.
Listing 1 Classification rules obtained from the ExACT clsutering on the Iris data set.</p>
      <p>Class Virginica if PetalWidth in [1.6, 2.5] and PetalLength in [4.8, 6.9] and</p>
      <p>SepalWidth in [2.5, 3.8] and SepalLength in [5.7, 7.9].</p>
      <p>Class Versicolor if PetalWidth in [0.9, 2.5] and PetalLength in [3.0, 6.9] and</p>
      <p>SepalWidth in [2.2, 3.8] and SepalLength in [4.9, 7.9].</p>
      <p>Class Setosa otherwise.</p>
      <p>Listing 2 Classification rules obtained from the CREAM clustering on the Iris data set.</p>
      <p>Class Versicolor if PetalWidth in [0.9, 1.7] and PetalLength in [3.0, 5.2] and</p>
      <p>SepalWidth in [2.0, 3.4] and SepalLength in [5.0, 7.0].</p>
      <p>Class Setosa if PetalWidth in [0.0, 0.7] and PetalLength in [0.9, 2.0] and</p>
      <p>SepalWidth in [2.3, 4.4] and SepalLength in [4.3, 5.8].</p>
      <p>Class Virginica otherwise.
maximum amount of identifiable clusters.</p>
      <p>The decision boundaries for the three Iris classes are highlighted in Figure 3. The
corresponding explanations are listed in Listings 1 and 2 for ExACT and CREAM, respectively. It is
possible to notice that the two clustering algorithms provide very diferent decision boundaries
and explanations, but they have comparable quality in terms of human-readability (1 rule per
distinct class) and predictive performance (classification accuracy of about 93% on the test set).</p>
      <p>Readability of explanations shown in Listings 1 and 2 could be improved by removing the
least relevant input features and keeping only the most relevant ones, i.e., the petal length and
width reported in Figure 3.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions</title>
      <p>The paper delves into the pivotal role of explainable clustering algorithms within the domain
of cognitive agents, encompassing both human users and AI systems. The advantages ofered
by these algorithms in the realm of cognitive agents are multifaceted, ranging from enhancing
interpretability and fostering trust to facilitating more efective decision-making processes.</p>
      <p>Motivated by the need to foster transparency and accountability in the operations of cognitive
agents, we introduce an extension of the PSyKE framework’s design to augment its capabilities
through the incorporation of explainable clustering support. The discussion of this novel
design, coupled with real world experiments, highlights the potential to significantly elevate
the performance and ethical standing of cognitive agents.</p>
      <p>The outcomes of this research pave the way for ongoing advancements in the field,
emphasising the importance of continued development and integration of explainable clustering
algorithms. By doing so, cognitive agents can be expected to evolve into more transparent,
efective, and ethically responsible entities. As we move forward, future eforts will be directed
towards the practical implementation of explainable clustering algorithms within cognitive
agents, involving rigorous testing in simulated real-world scenarios.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This work has been supported partially by the European Union’s Horizon Europe AEQUITAS
research and innovation programme under grant number 101070363 and partially by the European
Union ICT-48 2020 project TAILOR (No. 952215).
machine learning series, MIT Press, 2012.
[20] R. F. Ling, On the theory and construction of k-clusters, The Computer Journal 15 (1972)
326–332. URL: https://doi.org/10.1093/comjnl/15.4.326. doi:10.1093/comjnl/15.4.326.
arXiv:https://academic.oup.com/comjnl/article-pdf/15/4/326/1005965/150326.pdf.
[21] F. Sabbatini, G. Ciatto, R. Calegari, A. Omicini, Hypercube-based methods for symbolic
knowledge extraction: Towards a unified model, in: A. Ferrando, V. Mascardi (Eds.),
WOA 2022 – 23rd Workshop “From Objects to Agents”, volume 3261 of CEUR Workshop
Proceedings, Sun SITE Central Europe, RWTH Aachen University, 2022, pp. 48–60. URL:
http://ceur-ws.org/Vol-3261/paper4.pdf.
[22] F. Sabbatini, G. Ciatto, R. Calegari, A. Omicini, Towards a unified model for symbolic
knowledge extraction with hypercube-based methods, Intelligenza Artificiale 17 (2023)
63–75. URL: https://doi.org/10.3233/IA-230001. doi:10.3233/IA-230001.
[23] F. Sabbatini, G. Ciatto, R. Calegari, A. Omicini, On the design of PSyKE: A platform
for symbolic knowledge extraction, in: R. Calegari, G. Ciatto, E. Denti, A. Omicini,
G. Sartor (Eds.), WOA 2021 – 22nd Workshop “From Objects to Agents”, volume 2963
of CEUR Workshop Proceedings, Sun SITE Central Europe, RWTH Aachen University,
2021, pp. 29–48. 22nd Workshop “From Objects to Agents” (WOA 2021), Bologna, Italy,
1–3 September 2021. Proceedings.
[24] F. Sabbatini, G. Ciatto, R. Calegari, A. Omicini, Symbolic knowledge extraction from
opaque ML predictors in PSyKE: Platform design &amp; experiments, Intelligenza Artificiale
16 (2022) 27–48. URL: https://doi.org/10.3233/IA-210120. doi:10.3233/IA-210120.
[25] F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel,
P. Prettenhofer, R. Weiss, V. Dubourg, J. VanderPlas, A. Passos, D. Cournapeau, M. Brucher,
M. Perrot, E. Duchesnay, Scikit-learn: Machine learning in Python, Journal of
Machine Learning Research (JMLR) 12 (2011) 2825–2830. URL: https://dl.acm.org/doi/10.5555/
1953048.2078195.
[26] F. Sabbatini, G. Ciatto, A. Omicini, Semantic Web-based interoperability for intelligent
agents with PSyKE, in: D. Calvaresi, A. Najjar, M. Winikof, K. Främling (Eds.),
Explainable and Transparent AI and Multi-Agent Systems, volume 13283 of Lecture Notes
in Computer Science, Springer, 2022, pp. 124–142. URL: http://link.springer.com/10.1007/
978-3-031-15565-9_8. doi:10.1007/978-3-031-15565-9_8.
[27] M. W. Craven, J. W. Shavlik, Using sampling and queries to extract rules from trained
neural networks, in: Machine Learning Proceedings 1994, Elsevier, 1994, pp. 37–45.
doi:10.1016/B978-1-55860-335-6.50013-1.
[28] M. W. Craven, J. W. Shavlik, Extracting tree-structured representations of
trained networks, in: D. S. Touretzky, M. C. Mozer, M. E. Hasselmo (Eds.),
Advances in Neural Information Processing Systems 8. Proceedings of the 1995
Conference, The MIT Press, 1996, pp. 24–30. URL: http://papers.nips.cc/paper/
1152-extracting-tree-structured-representations-of-trained-networks.pdf.
[29] L. Breiman, J. Friedman, C. J. Stone, R. A. Olshen, Classification and Regression Trees, CRC</p>
      <p>Press, 1984.
[30] J. Huysmans, B. Baesens, J. Vanthienen, ITER: An algorithm for predictive regression
rule extraction, in: Data Warehousing and Knowledge Discovery (DaWaK 2006), Springer,
2006, pp. 270–279. doi:10.1007/11823728_26.
[31] F. Sabbatini, G. Ciatto, A. Omicini, GridEx: An algorithm for knowledge
extraction from black-box regressors, in: D. Calvaresi, A. Najjar, M. Winikof, K.
Främling (Eds.), Explainable and Transparent AI and Multi-Agent Systems. Third
International Workshop, EXTRAAMAS 2021, Virtual Event, May 3–7, 2021, Revised Selected
Papers, volume 12688 of LNCS, Springer Nature, Basel, Switzerland, 2021, pp. 18–38.
doi:10.1007/978-3-030-82017-6_2.
[32] F. Sabbatini, R. Calegari, Symbolic knowledge extraction from opaque machine
learning predictors: GridREx &amp; PEDRO, in: G. Kern-Isberner, G. Lakemeyer, T. Meyer (Eds.),
Proceedings of the 19th International Conference on Principles of Knowledge
Representation and Reasoning, KR 2022, Haifa, Israel. July 31 - August 5, 2022, 2022. URL:
https://proceedings.kr.org/2022/57/. doi:10.24963/kr.2022/57.
[33] F. Sabbatini, R. Calegari, Unveiling opaque predictors via explainable clustering: The
CReEPy algorithm, in: Proceedings of BEWARE-2023, Rome, Italy, November 6–9, 2023,
(to appear), 2023.
[34] R. Calegari, F. Sabbatini, The PSyKE technology for trustworthy artificial
intelligence 13796 (2023) 3–16. URL: https://doi.org/10.1007/978-3-031-27181-6_1. doi:10.1007/
978-3-031-27181-6_1, xXI International Conference of the Italian Association for
Artiifcial Intelligence, AIxIA 2022, Udine, Italy, November 28 – December 2, 2022, Proceedings.
[35] R. A. Fisher, The use of multiple measurements in taxonomic problems, Annals of Eugenics
7 (1936) 179–188. doi:10.1111/j.1469-1809.1936.tb02137.x.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A. B.</given-names>
            <surname>Arrieta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Díaz-Rodríguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Del</given-names>
            <surname>Ser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bennetot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tabik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Barbado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>García</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gil-López</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Molina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Benjamins</surname>
          </string-name>
          , et al.,
          <article-title>Explainable artificial intelligence (xai): Concepts, taxonomies, opportunities and challenges toward responsible ai</article-title>
          ,
          <source>Information fusion 58</source>
          (
          <year>2020</year>
          )
          <fpage>82</fpage>
          -
          <lpage>115</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Rahman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. S.</given-names>
            <surname>Hossain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Muhammad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kundu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Debnath</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rahman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. S. I.</given-names>
            <surname>Khan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Tiwari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. S.</given-names>
            <surname>Band</surname>
          </string-name>
          ,
          <article-title>Federated learning-based ai approaches in smart healthcare: concepts, taxonomies, challenges</article-title>
          and open issues,
          <source>Cluster computing 26</source>
          (
          <year>2023</year>
          )
          <fpage>2271</fpage>
          -
          <lpage>2311</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Ali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Abuhmed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>El-Sappagh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Muhammad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Alonso-Moral</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Confalonieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Guidotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Del</given-names>
            <surname>Ser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Díaz-Rodríguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Herrera</surname>
          </string-name>
          ,
          <article-title>Explainable artificial intelligence (xai): What we know and what is left to attain trustworthy artificial intelligence</article-title>
          ,
          <source>Information Fusion</source>
          <volume>99</volume>
          (
          <year>2023</year>
          )
          <fpage>101805</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Dasgupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Frost</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Moshkovitz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Rashtchian</surname>
          </string-name>
          ,
          <article-title>Explainable k-means and k-medians clustering</article-title>
          ,
          <source>in: Proceedings of the 37th International Conference on Machine Learning</source>
          , Vienna, Austria,
          <year>2020</year>
          , pp.
          <fpage>12</fpage>
          -
          <lpage>18</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S.</given-names>
            <surname>Bobek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Brzegowski</surname>
          </string-name>
          , E. Brzychczy,
          <string-name>
            <given-names>G. J.</given-names>
            <surname>Nalepa</surname>
          </string-name>
          ,
          <article-title>Knac: an approach for enhancing cluster analysis with background knowledge and explanations</article-title>
          ,
          <source>Applied Intelligence</source>
          <volume>53</volume>
          (
          <year>2023</year>
          )
          <fpage>15537</fpage>
          -
          <lpage>15560</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Endert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Ribarsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Turkay</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. W.</given-names>
            <surname>Wong</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Nabney</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. D.</given-names>
            <surname>Blanco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Rossi</surname>
          </string-name>
          ,
          <article-title>The state of the art in integrating machine learning into visual analytics</article-title>
          , in: Computer Graphics Forum, volume
          <volume>36</volume>
          , Wiley Online Library,
          <year>2017</year>
          , pp.
          <fpage>458</fpage>
          -
          <lpage>486</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>E.</given-names>
            <surname>Horel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Giesecke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Storchan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Chittar</surname>
          </string-name>
          ,
          <article-title>Explainable clustering and application to wealth management compliance</article-title>
          , in: T. Balch (Ed.),
          <source>ICAIF '20: The First ACM International Conference on AI in Finance</source>
          , New York, NY, USA, October
          <volume>15</volume>
          -
          <issue>16</issue>
          ,
          <year>2020</year>
          , ACM,
          <year>2020</year>
          , pp.
          <volume>47</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>47</lpage>
          :
          <fpage>6</fpage>
          . URL: https://doi.org/10.1145/3383455.3422530. doi:
          <volume>10</volume>
          .1145/3383455.3422530.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>L.</given-names>
            <surname>Manduchi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hüser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Faltys</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. E.</given-names>
            <surname>Vogt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Rätsch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Fortuin</surname>
          </string-name>
          ,
          <string-name>
            <surname>T-DPSOM</surname>
          </string-name>
          :
          <article-title>an interpretable clustering method for unsupervised learning of patient health states</article-title>
          , in: M.
          <string-name>
            <surname>Ghassemi</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Naumann</surname>
          </string-name>
          , E. Pierson (Eds.),
          <source>ACM CHIL '21: ACM Conference on Health, Inference, and Learning</source>
          , Virtual Event, USA, April 8-
          <issue>9</issue>
          ,
          <year>2021</year>
          , ACM,
          <year>2021</year>
          , pp.
          <fpage>236</fpage>
          -
          <lpage>245</lpage>
          . URL: https://doi.org/10.1145/3450439.3451872. doi:
          <volume>10</volume>
          .1145/3450439.3451872.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S.</given-names>
            <surname>Deshmukh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. K.</given-names>
            <surname>Behera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mulay</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. A.</given-names>
            <surname>Ahmed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Al-Kuwari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Tiwari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Farouk</surname>
          </string-name>
          ,
          <article-title>Explainable quantum clustering method to model medical data</article-title>
          ,
          <source>Knowl. Based Syst</source>
          .
          <volume>267</volume>
          (
          <year>2023</year>
          )
          <article-title>110413</article-title>
          . URL: https://doi.org/10.1016/j.knosys.
          <year>2023</year>
          .
          <volume>110413</volume>
          . doi:
          <volume>10</volume>
          .1016/j. knosys.
          <year>2023</year>
          .
          <volume>110413</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Basak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Krishnapuram</surname>
          </string-name>
          ,
          <article-title>Interpretable hierarchical clustering by constructing an unsupervised decision tree</article-title>
          ,
          <source>IEEE Trans. Knowl. Data Eng</source>
          .
          <volume>17</volume>
          (
          <year>2005</year>
          )
          <fpage>121</fpage>
          -
          <lpage>132</lpage>
          . URL: https://doi.org/10.1109/TKDE.
          <year>2005</year>
          .
          <volume>11</volume>
          . doi:
          <volume>10</volume>
          .1109/TKDE.
          <year>2005</year>
          .
          <volume>11</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>R.</given-names>
            <surname>Fraiman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Ghattas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Svarc</surname>
          </string-name>
          ,
          <article-title>Interpretable clustering using unsupervised binary trees</article-title>
          ,
          <source>Adv. Data Anal. Classif</source>
          .
          <volume>7</volume>
          (
          <year>2013</year>
          )
          <fpage>125</fpage>
          -
          <lpage>145</lpage>
          . URL: https://doi.org/10.1007/s11634-013-0129-3. doi:
          <volume>10</volume>
          .1007/s11634-013-0129-3.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>D.</given-names>
            <surname>Bertsimas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Orfanoudaki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. M.</given-names>
            <surname>Wiberg</surname>
          </string-name>
          ,
          <article-title>Interpretable clustering via optimal trees</article-title>
          ,
          <source>CoRR abs/1812</source>
          .00539 (
          <year>2018</year>
          ). URL: http://arxiv.org/abs/
          <year>1812</year>
          .00539. arXiv:
          <year>1812</year>
          .00539.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S.</given-names>
            <surname>Dasgupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Frost</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Moshkovitz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Rashtchian</surname>
          </string-name>
          ,
          <article-title>Explainable k-means and kmedians clustering</article-title>
          , CoRR abs/
          <year>2002</year>
          .12538 (
          <year>2020</year>
          ). URL: https://arxiv.org/abs/
          <year>2002</year>
          .12538. arXiv:
          <year>2002</year>
          .12538.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>F.</given-names>
            <surname>Sabbatini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Calegari</surname>
          </string-name>
          ,
          <article-title>Explainable Clustering with CREAM</article-title>
          ,
          <source>in: Proceedings of the 20th International Conference on Principles of Knowledge Representation and Reasoning</source>
          ,
          <year>2023</year>
          , pp.
          <fpage>593</fpage>
          -
          <lpage>603</lpage>
          . URL: https://doi.org/10.24963/kr.2023/58. doi:
          <volume>10</volume>
          .24963/kr.2023/58.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>F.</given-names>
            <surname>Sabbatini</surname>
          </string-name>
          , R. Calegari,
          <article-title>ExACT explainable clustering: Unravelling the intricacies of cluster formation</article-title>
          ,
          <source>in: Proceedings of the 2nd International Workshop on Knowledge Diversity, KoDis</source>
          <year>2023</year>
          , Rhodes, Greece, September 2-
          <issue>8</issue>
          ,
          <year>2023</year>
          (to appear),
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>F.</given-names>
            <surname>Sabbatini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Calegari</surname>
          </string-name>
          ,
          <article-title>Bottom-up and top-down workflows for hypercube- and clustering-based knowledge extractors</article-title>
          , in: D.
          <string-name>
            <surname>Calvaresi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Najjar</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Omicini</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Aydogan</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Carli</surname>
            , G. Ciatto,
            <given-names>K.</given-names>
          </string-name>
          Främling (Eds.),
          <source>Explainable and Transparent AI</source>
          and
          <string-name>
            <surname>Multi-Agent Systems</surname>
          </string-name>
          . Fifth International Workshop, EXTRAAMAS 2023, London, UK, May
          <volume>29</volume>
          ,
          <year>2023</year>
          , Revised Selected Papers, volume
          <volume>14127</volume>
          <source>of LNCS</source>
          , Springer Cham, Basel, Switzerland,
          <year>2023</year>
          , pp.
          <fpage>116</fpage>
          -
          <lpage>129</lpage>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>031</fpage>
          -40878-
          <issue>6</issue>
          _
          <fpage>7</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>J.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Hobbs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. J.</given-names>
            <surname>Castaldi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. H.</given-names>
            <surname>Cho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. K.</given-names>
            <surname>Silverman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. G.</given-names>
            <surname>Dy</surname>
          </string-name>
          ,
          <article-title>Interpretable clustering via discriminative rectangle mixture model</article-title>
          , in: F.
          <string-name>
            <surname>Bonchi</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Domingo-Ferrer</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Baeza-Yates</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          Wu (Eds.),
          <source>IEEE 16th International Conference on Data Mining, ICDM 2016, December 12-15</source>
          ,
          <year>2016</year>
          , Barcelona, Spain, IEEE Computer Society,
          <year>2016</year>
          , pp.
          <fpage>823</fpage>
          -
          <lpage>828</lpage>
          . URL: https://doi.org/10.1109/ICDM.
          <year>2016</year>
          .
          <volume>0097</volume>
          . doi:
          <volume>10</volume>
          .1109/ICDM.
          <year>2016</year>
          .
          <volume>0097</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>X.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Güttel</surname>
          </string-name>
          ,
          <article-title>Fast and explainable clustering based on sorting</article-title>
          ,
          <source>CoRR abs/2202</source>
          .01456 (
          <year>2022</year>
          ). URL: https://arxiv.org/abs/2202.01456. arXiv:
          <volume>2202</volume>
          .
          <fpage>01456</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>K. P.</given-names>
            <surname>Murphy</surname>
          </string-name>
          ,
          <article-title>Machine learning - A probabilistic perspective, Adaptive computation</article-title>
          and
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>