Similarity measurement processes are a core part of most machine learning algorithms. Traditional approaches focus on either taxonomic or thematic thinking. Psychological research suggests that a combination of both is needed to model human-like similarity perception adequately. Such a combination is called a Similarity Dual Process Model (DPM). This paper describes how to construct DPMs as a linear combination of existing measures of similarity and distance. We use generalisation functions to convert distance into similarity. DPMs are similar to kernel functions. Thus, they can be integrated into any machine learning algorithm that uses kernel functions.Clearly, not all DPMs that can be formulated work equally well. Therefore we test classification performance in a real-world task: the detection of pedestrians in images. We assume that DPMs are only viable if they yield better classifiers than their constituting parts. In our experiments, we found DPM kernels that matched the performance of conventional ones for our data set. Eventually, we provide a construction kit to build such kernels to encourage further experiments in other application domains of machine learning.
Based on recent findings from the field of human similarity perception, we propose a dual process model (DPM) of taxonomic and thematic similarity assessment which can be utilised in machine learning applications. Taxonomic reasoning is related to predicate based measures (counting) whereas thematic reasoning is mostly associated with metric distances (measuring). We suggest a procedure that combines both processes into a single similarity kernel. For each feature dimension of the observational data, an optimal measure is selected by a Greedy algorithm: A set of possible measures is tested, and the one that leads to improved classification performance of the whole model is denoted. These measures are combined into a single SVM kernel by means of generalisation (converting distances into similarities) and quantisation (applying predicate based measures to interval scale data). We then demonstrate how to apply our model to a classification problem of MPEG-7 features from a test set of images. Evaluation shows that the performance of the DPM kernel is superior to those of the standard SVM kernels. This supports our theory that the DPM comes closer to human similarity judgment than any singular measure, and it motivates our suggestion to employ the DPM not only in image retrieval but also in related tasks.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.