Download PDFOpen PDF in browserA Business Intelligence Tool for Explaining SimilarityEasyChair Preprint 1314215 pages•Date: April 30, 2024AbstractAgile Business often requires to identify similar objects (firms, providers, end users, products) between an older business domain and a newer one. Data-driven tools for aggregating similar resources are nowa- days often used in Business Intelligence applications, and a large majority of them involve Machine Learning techniques based on similarity met- rics. However effective, the mathematics such tools are based on does not lend itself to human-readable explanations of their results, leaving a manager using them in a “take it as is”-or-not dilemma. To increase trust in such tools, we propose and implement a general method to ex- plain the similarity of a given group of RDF resources. Our tool is based on the theory of Least Common Subsumers (LCS), and can be applied to every domain requiring the comparison of RDF resources, including business organizations. Given a set of RDF resources found to be sim- ilar by Data-driven tools, we first compute the LCS of the resources, which is a generic RDF resource describing the features shared by the group recursively—i.e., at any depth in feature paths. Subsequently, we translate the LCS in English common language. Being agnostic to the ag- gregation criteria, our implementation can be pipelined with every other aggregation tool. To prove this, we cascade an implementation of our method to (i) the comparison and clustering of drugs (using k-Means) in Drugbank, and (ii) the comparison of contracting processes in Public Procurement (using TheyBuyForYou).For both applications, we present a fairly readable description of the commonalities of the cluster given as input. Keyphrases: Explainable Artificial Intelligence (XAI), Least Common Subsumer (LCS), Resource Description Framework (RDF)
|