<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Data Point Interactions: A Dual Representation Approach for Enhanced Machine Learning</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mohamed Karim Belaid</string-name>
          <email>karim.belaid@idiada.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dr. Ing. h.c. F. Porsche AG</institution>
          ,
          <addr-line>Stuttgart</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>IDIADA Fahrzeugtechnik GmbH</institution>
          ,
          <addr-line>Munich</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <fpage>14</fpage>
      <lpage>16</lpage>
      <abstract>
        <p>In recent years, Explainable Artificial Intelligence (XAI) has attracted significant attention due to the growing complexity and opacity of ML models. While traditional XAI tools have focused on feature interaction analysis, there is a gap in understanding data point interactions and their impact on model performance. This research addresses this gap by studying data point interactions in ML models, specifically KNN, and proposing novel algorithms to enhance prediction performance.</p>
      </abstract>
      <kwd-group>
        <kwd>Supervised learning</kwd>
        <kwd>Meta-learning</kwd>
        <kwd>Data Interaction</kwd>
        <kwd>Explainable AI</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Related Work</title>
      <p>
        Existing literature in XAI has extensively explored feature interactions, providing insights into how different features contribute to model predictions. Previous works, such as Ribeiro et al.’s LIME [<xref ref-type="bibr" rid="ref4">4</xref>] and Lundberg and Lee’s SHAP [<xref ref-type="bibr" rid="ref5">5</xref>], have laid the groundwork for feature-based explanations, leading to a mature research literature in the field of feature interaction [<xref ref-type="bibr" rid="ref6 ref7 ref8 ref9">6, 7, 8, 9</xref>]. The domain of data valuation, however, remains relatively unexplored, apart from recent pioneering works using approximation [<xref ref-type="bibr" rid="ref10">10</xref>] or model-specific [<xref ref-type="bibr" rid="ref11">11</xref>] explanations. Moreover, the interaction between data points themselves has not been thoroughly investigated.
      </p>
      <p>
        Tynes et al. introduced the pairwise difference regressor [<xref ref-type="bibr" rid="ref12">12</xref>], a novel meta-learner for chemical tasks that enhances prediction performance compared to random forests and provides robust uncertainty quantification. In computational chemistry, estimating differences between data points helps mitigate systematic errors [<xref ref-type="bibr" rid="ref12">12</xref>]. In parallel, Wetzel et al. used twin neural network architectures for semi-supervised regression tasks, focusing on predicting differences between target values of distinct data points [<xref ref-type="bibr" rid="ref13">13</xref>].
      </p>
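      <p>The model-specific KNN valuation of [<xref ref-type="bibr" rid="ref11">11</xref>] admits an exact closed-form recursion, which is what makes extensions to pair interactions tractable. The sketch below is our own minimal illustration of that single-point recursion for an unweighted K-NN classifier; the function name and notation are ours, not the authors’ code.</p>
      <preformat>
```python
# Minimal sketch (our notation) of the closed-form, model-specific data
# valuation for unweighted K-NN from Jia et al. [11]: exact Shapley values
# for one test point via a single backward recursion over the sorted training set.
import numpy as np

def knn_shapley(X_train, y_train, x_test, y_test, K=1):
    """Shapley value of each training point for one (x_test, y_test)."""
    X_train = np.asarray(X_train, dtype=float)
    y_train = np.asarray(y_train)
    # rank training points by distance to the test point, nearest first
    order = np.argsort(np.linalg.norm(X_train - x_test, axis=1))
    N = len(y_train)
    match = (y_train[order] == y_test).astype(float)
    s = np.zeros(N)
    s[N - 1] = match[N - 1] / N                    # farthest point
    for i in range(N - 2, -1, -1):                 # recurse toward the nearest
        s[i] = s[i + 1] + (match[i] - match[i + 1]) / K * min(K, i + 1) / (i + 1)
    values = np.empty(N)
    values[order] = s                              # map back to original indexing
    return values
```
      </preformat>
      <p>By the efficiency property, the values sum to the utility of the full training set; with K=1 the nearest correctly-labeled point absorbs the entire value.</p>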
    </sec>
    <sec id="sec-3">
      <title>2. Research Questions and Challenges</title>
      <sec id="sec-3-1">
        <title>2.0.1. RQ.1: How do different XAI methods, especially data-based ones, perform in terms of interpretability, accuracy, and computational efficiency?</title>
        <p>We question the maturity and usability of explanation methods for the data science community. The abundance of xAI algorithms can be overwhelming, making it hard for practitioners to select the right one for their needs. The differing requirements and implementations of xAI algorithms pose challenges for data scientists in accurately evaluating them and staying current with their development. This issue manifests as the illusion of explanatory depth [<xref ref-type="bibr" rid="ref14">14</xref>] in interpreting xAI results [<xref ref-type="bibr" rid="ref15">15</xref>], with evidence showing that data scientists often misuse interpretability tools [16].</p>
      </sec>
      <sec id="sec-3-2">
        <title>2.0.2. RQ.2: How can we accurately quantify data point interactions within trained ML models?</title>
        <p>Existing literature primarily focuses on feature interactions, lacking methodologies for evaluating
data point interactions. How do data points interact in forming patterns? How can we measure this
interaction? And how can we leverage this explanation to improve the ML pipeline?</p>
      </sec>
      <sec id="sec-3-3">
        <title>2.0.3. RQ.3: Can data point interactions be leveraged to enhance the performance of ML classifiers?</title>
        <p>We explore whether understanding data point interactions can lead to improved model accuracy and robustness. How can we design and implement algorithms that utilize data point interactions for prediction tasks? We aim to develop and evaluate algorithms that incorporate data point interactions in their predictive mechanisms.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>3. Method and Evaluation</title>
      <sec id="sec-4-1">
        <title>3.0.1. RQ.1</title>
        <p>Given the unsolved burden of evaluating and correctly choosing xAI algorithms, we propose Compare-xAI, which mitigates two issues: the lack of a unified benchmark for xAI algorithms and the illusion of explanatory depth during the interpretation of results. Compare-xAI emerges as a unique and valuable benchmark. Its distinct contributions lie in its simplicity, scalability, ability to integrate any dataset and ML model, and, most importantly, its focus on the user’s expected explanation. By addressing the pitfalls highlighted in surveys of xAI algorithms through concrete functional tests, Compare-xAI provides a robust evaluation framework.</p>
      </sec>
      <sec id="sec-4-2">
        <title>3.0.2. RQ.2</title>
        <p>We propose STI-KNN, the first algorithm that calculates the exact pair-interaction Shapley values in O(n^2) rather than O(2^n) time. STI-KNN is the first algorithm that allows studying the exact interaction on large real-world datasets. This research is the first to consider two disjoint fields: data valuation and interaction in Explainable AI. Finally, we study various cases of positive and negative data interactions using STI-KNN.</p>
      </sec>
      <sec id="sec-4-3">
        <title>3.0.3. RQ.3</title>
        <p>Leveraging the concept of data point interactions, we introduce the Pairwise Difference Learning (PDL) classifier. This classifier employs a dual representation of the ML task, achieving better prediction performance by integrating pair-interaction data; see Figure 1. The empirical evaluation covers 99 diverse datasets, each with 25 cross-validation repetitions, using the macro F1 metric.</p>
      </sec>
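      <p>The dual-representation idea behind pairwise difference learning can be sketched as follows. This is our own minimal illustration of the general principle, not the API of the pdll package; the class name is hypothetical and the random-forest base learner is an arbitrary choice. A base learner is trained on feature differences of training pairs to predict whether a pair shares a class; a query point is then classified by pairing it with every training anchor and aggregating the per-class “same-class” probabilities.</p>
      <preformat>
```python
# Sketch of pairwise difference learning for classification:
# the task is re-represented over *pairs* of data points.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

class PairwiseDifferenceSketch:
    def fit(self, X, y):
        self.X_ = np.asarray(X, dtype=float)
        self.y_ = np.asarray(y)
        self.classes_ = np.unique(self.y_)
        n = len(self.X_)
        i, j = np.meshgrid(np.arange(n), np.arange(n), indexing="ij")
        i, j = i.ravel(), j.ravel()
        diffs = self.X_[i] - self.X_[j]           # dual representation: all pairs
        same = (self.y_[i] == self.y_[j]).astype(int)
        self.base_ = RandomForestClassifier(n_estimators=50, random_state=0)
        self.base_.fit(diffs, same)               # learn "is this pair same-class?"
        return self

    def predict(self, X):
        preds = []
        for x in np.asarray(X, dtype=float):
            diffs = x[None, :] - self.X_          # pair the query with each anchor
            p_same = self.base_.predict_proba(diffs)[:, 1]
            # score each class by the mean same-class probability of its anchors
            scores = [p_same[self.y_ == c].mean() for c in self.classes_]
            preds.append(self.classes_[int(np.argmax(scores))])
        return np.array(preds)
```
      </preformat>
      <p>Training on the n^2 pairs is the price of the dual representation; the benefit is that every prediction is backed by many anchor comparisons rather than a single forward pass.</p>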
    </sec>
    <sec id="sec-5">
      <title>4. Preliminary Results</title>
      <sec id="sec-5-1">
        <title>4.0.1. RQ.1</title>
        <p>With 15 post-hoc xAI algorithms, 25 tests, and 50 research papers indexed, Compare-xAI offers a unified benchmark that accurately reproduces experiments. Through a rigorous selection protocol, it highlights the contrast between theoretical foundations and practical implementations, making the limitations of each method transparent. Compare-xAI uses an intuitive scoring method to absorb the vast quantity of xAI-related papers and reduce human errors in interpreting xAI outputs. Its goal is to unify post-hoc xAI evaluation methods into a multi-dimensional benchmark, providing insights into the strengths and weaknesses of different approaches. Link: https://karim-53.github.io/cxai/</p>
      </sec>
      <sec id="sec-5-2">
        <title>4.0.2. RQ.2</title>
        <p>Thanks to the STI-KNN algorithm, data interactions can quickly be visualized as a heatmap of the Shapley interaction values. The matrix shows an example of interaction: we observe, first, a contrast between in-class and out-of-class interactions; second, a reduction in interaction due to data redundancy; and third, an unusual pattern when the data contains outliers.</p>
      </sec>
      <sec id="sec-5-3">
        <title>4.0.3. RQ.3</title>
        <p>Our benchmark demonstrates that PDL consistently outperforms state-of-the-art ML models, resulting in improved F1 scores in a majority of cases. This highlights PDL’s effectiveness in enhancing performance over baseline methods, facilitated through its straightforward integration via our Python package. Link: https://github.com/Karim-53/pdll</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>5. Intermediary Conclusions</title>
      <p>
        Our research indicates that data point interactions play a crucial role in the performance of ML models.
By shifting the focus from feature interactions to data interactions, we have opened up new avenues
for enhancing model interpretability and accuracy. For more detailed results, refer to the following
papers [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1, 2, 3</xref>
        ].
      </p>
    </sec>
    <sec id="sec-7">
      <title>6. Planned Next Steps</title>
      <p>We plan to confirm the efficiency of the PDL algorithm by studying its calibration and uncertainty estimation. By continuing to explore the interactions between data points, we hope to contribute significantly to the fields of xAI and ML, ultimately leading to more transparent, accurate, and robust models.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M. K.</given-names>
            <surname>Belaid</surname>
          </string-name>
          , E. Hüllermeier,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rabus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Krestel</surname>
          </string-name>
          ,
          <article-title>Do we need another explainable ai method? toward unifying post-hoc xai evaluation methods into an interactive and multi-dimensional benchmark</article-title>
          ,
          <source>arXiv preprint arXiv:2207.14160</source>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. K.</given-names>
            <surname>Belaid</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>El Mekki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rabus</surname>
          </string-name>
          , E. Hüllermeier,
          <article-title>Optimizing Data Shapley Interaction calculation from O(2^n) to O(tn^2) for KNN models</article-title>
          ,
          <source>arXiv preprint arXiv:2304.01224</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M. K.</given-names>
            <surname>Belaid</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rabus</surname>
          </string-name>
          , E. Hüllermeier,
          <article-title>Pairwise difference learning for classification</article-title>
          ,
          <source>arXiv preprint arXiv:2406.20031</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Ribeiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Guestrin</surname>
          </string-name>
          ,
          <article-title>“Why should I trust you?” Explaining the predictions of any classifier</article-title>
          ,
          <source>in: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>1135</fpage>
          -
          <lpage>1144</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Lundberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.-I.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <article-title>A unified approach to interpreting model predictions</article-title>
          ,
          <source>Advances in neural information processing systems</source>
          <volume>30</volume>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Muschalik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Fumagalli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Hammer</surname>
          </string-name>
          , E. Hüllermeier,
          <article-title>Beyond treeshap: Efficient computation of any-order shapley interactions for tree ensembles</article-title>
          ,
          <source>in: Proceedings of the AAAI Conference on Artificial Intelligence</source>
          , volume
          <volume>38</volume>
          ,
          <year>2024</year>
          , pp.
          <fpage>14388</fpage>
          -
          <lpage>14396</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>F.</given-names>
            <surname>Fumagalli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Muschalik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Kolpaczki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Hüllermeier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Hammer</surname>
          </string-name>
          ,
          <article-title>Shap-iq: Unified approximation of any-order shapley interactions</article-title>
          ,
          <source>Advances in Neural Information Processing Systems</source>
          <volume>36</volume>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>F.</given-names>
            <surname>Fumagalli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Muschalik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Kolpaczki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Hüllermeier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Hammer</surname>
          </string-name>
          ,
          <article-title>Kernelshap-iq: Weighted least-square optimization for shapley interactions</article-title>
          ,
          <source>arXiv preprint arXiv:2405.10852</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>P.</given-names>
            <surname>Kolpaczki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Muschalik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Fumagalli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Hammer</surname>
          </string-name>
          , E. Hüllermeier,
          <article-title>Svarm-iq: Efficient approximation of any-order shapley interactions through stratification</article-title>
          ,
          <source>arXiv preprint arXiv:2401.13371</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Ghorbani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zou</surname>
          </string-name>
          ,
          <article-title>Data Shapley: Equitable valuation of data for ML</article-title>
          ,
          <source>in: International Conference on ML, PMLR</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>2242</fpage>
          -
          <lpage>2251</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>R.</given-names>
            <surname>Jia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Dao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. A.</given-names>
            <surname>Hubis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. M.</given-names>
            <surname>Gurel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. J.</given-names>
            <surname>Spanos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Song</surname>
          </string-name>
          ,
          <article-title>Efficient task-specific data valuation for nearest neighbor algorithms</article-title>
          ,
          <source>arXiv preprint arXiv:1908.08619</source>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Tynes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Gao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. J.</given-names>
            <surname>Burrill</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. R.</given-names>
            <surname>Batista</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Perez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Lubbers</surname>
          </string-name>
          ,
          <article-title>Pairwise difference regression: A ML meta-algorithm for improved prediction and uncertainty quantification in chemical search</article-title>
          ,
          <source>Journal of Chemical Information and Modeling</source>
          <volume>61</volume>
          (
          <year>2021</year>
          )
          <fpage>3846</fpage>
          -
          <lpage>3857</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S. J.</given-names>
            <surname>Wetzel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. G.</given-names>
            <surname>Melko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Tamblyn</surname>
          </string-name>
          ,
          <article-title>Twin neural network regression is a semi-supervised regression algorithm</article-title>
          ,
          <source>ML: Science and Technology</source>
          <volume>3</volume>
          (
          <year>2022</year>
          )
          <fpage>045007</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>L.</given-names>
            <surname>Rozenblit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Keil</surname>
          </string-name>
          ,
          <article-title>The misunderstood limits of folk science: An illusion of explanatory depth</article-title>
          ,
          <source>Cognitive science</source>
          <volume>26</volume>
          (
          <year>2002</year>
          )
          <fpage>521</fpage>
          -
          <lpage>562</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>M.</given-names>
            <surname>Chromik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Eiband</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Buchner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Krüger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Butz</surname>
          </string-name>
          ,
          <article-title>I think I get your point, AI! The illusion of explanatory depth in explainable AI</article-title>
          ,
          <source>in: 26th International Conference on Intelligent User Interfaces</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>307</fpage>
          -
          <lpage>317</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>H.</given-names>
            <surname>Kaur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Nori</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jenkins</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Caruana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wallach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wortman Vaughan</surname>
          </string-name>
          ,
          <article-title>Interpreting interpretability: understanding data scientists’ use of interpretability tools for machine learning</article-title>
          ,
          <source>in: Proceedings of the 2020 CHI conference on human factors in computing systems</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>14</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>