Enhancing ensemble learning and transfer learning in multimodal data analysis by adaptive dimensionality reduction
Andrea Marinoni,
Saloua Chlaily,
Eduard Khachatrian
et al.
Abstract:Modern data analytics take advantage of ensemble learning and transfer learning approaches to tackle some of the most relevant issues in data analysis, such as lack of labeled data to use to train the analysis models, sparsity of the information, and unbalanced distributions of the records. Nonetheless, when applied to multimodal datasets (i.e., datasets acquired by means of multiple sensing techniques or strategies), the state-of-theart methods for ensemble learning and transfer learning might show some limit… Show more
“…In this work, we propose to address this task by exploring the relevance of the features at global (i.e., across all modalities) and local (i.e., across samples for each feature) scale. Following the successful approach proposed in [64], we quantify the multiscale significance of the features by using information theory-based metrics. Specifically, we consider to measure the degree of redundancy and intercorrelation between features across the whole dataset by employing mutual information [65], [66].…”
Section: B Proposed Approachmentioning
confidence: 99%
“…11 by means of state-of-the-art supervised and semisupervised methods based on the classic graph representation introduced in Section II-A [90]- [93]. Moreover, for comparison, we used a method based on ensemble learning approach [64]. It is worth noting that deeply investigating the capacity and limitations of all these methods is out of the scope of this paper.…”
Section: A Motivations Of a New Graph Representation: A Multimodal Ex...mentioning
confidence: 99%
“…In particular, focusing once again on the "apple trees" class, it is possible to compute the average misclassification error between the "apple trees" and "wood" classes obtained over all the experiments we ran: Table IV reports these results. It is therefore possible to appreciate that the algorithms based on the analysis of the graph structure defined according to the guidelines in Section II-A show a substantial increase of the error with respect to the ensemble learning-based architecture in [64]. Hence, the classic graph representation based on heat diffusion model apparently leads to a substantial degradation of the ability of the architectures to characterize and interpret the samples in the dataset.…”
Section: A Motivations Of a New Graph Representation: A Multimodal Ex...mentioning
confidence: 99%
“…At this point, we computed the Q matrix and the weight matrix W according to the classic graph representation based on heat diffusion mechanism. For the Q matrix, we considered the definition as in (13), we used the method in [64] to determine the values of the K tensor, and assumed that the distribution of the B matrix would be uniform. For the classic definition of the weight matrix W based on heat diffusion mechanism, we used a Gaussian kernel to define the function η mentioned in Section II-A [6], [31].…”
Section: Derivation Of Transition Probability Density Function In The...mentioning
Representing data by means of graph structures identifies one of the most valid approach to extract information in several data analysis applications. This is especially true when multimodal datasets are investigated, as records collected by means of diverse sensing strategies are taken into account and explored. Nevertheless, classic graph signal processing is based on a model for information propagation that is configured according to heat diffusion mechanism. This system provides several constraints and assumptions on the data properties that might be not valid for multimodal data analysis, especially when large scale datasets collected from heterogeneous sources are considered, so that the accuracy and robustness of the outcomes might be severely jeopardized. In this paper, we introduce a novel model for graph definition based on fluid diffusion. The proposed approach improves the ability of graph-based data analysis to take into account several issues of modern data analysis in operational scenarios, so to provide a platform for precise, versatile, and efficient understanding of the phenomena underlying the records under exam, and to fully exploit the potential provided by the diversity of the records in obtaining a thorough characterization of the data and their significance. In this work, we focus our attention to using this fluid diffusion model to drive a community detection scheme, i.e., to divide multimodal datasets into many groups according to similarity among nodes in an unsupervised fashion. Experimental results achieved by testing real multimodal datasets in diverse application scenarios show that our method is able to strongly outperform state-of-the-art schemes for community detection in multimodal data analysis.
“…In this work, we propose to address this task by exploring the relevance of the features at global (i.e., across all modalities) and local (i.e., across samples for each feature) scale. Following the successful approach proposed in [64], we quantify the multiscale significance of the features by using information theory-based metrics. Specifically, we consider to measure the degree of redundancy and intercorrelation between features across the whole dataset by employing mutual information [65], [66].…”
Section: B Proposed Approachmentioning
confidence: 99%
“…11 by means of state-of-the-art supervised and semisupervised methods based on the classic graph representation introduced in Section II-A [90]- [93]. Moreover, for comparison, we used a method based on ensemble learning approach [64]. It is worth noting that deeply investigating the capacity and limitations of all these methods is out of the scope of this paper.…”
Section: A Motivations Of a New Graph Representation: A Multimodal Ex...mentioning
confidence: 99%
“…In particular, focusing once again on the "apple trees" class, it is possible to compute the average misclassification error between the "apple trees" and "wood" classes obtained over all the experiments we ran: Table IV reports these results. It is therefore possible to appreciate that the algorithms based on the analysis of the graph structure defined according to the guidelines in Section II-A show a substantial increase of the error with respect to the ensemble learning-based architecture in [64]. Hence, the classic graph representation based on heat diffusion model apparently leads to a substantial degradation of the ability of the architectures to characterize and interpret the samples in the dataset.…”
Section: A Motivations Of a New Graph Representation: A Multimodal Ex...mentioning
confidence: 99%
“…At this point, we computed the Q matrix and the weight matrix W according to the classic graph representation based on heat diffusion mechanism. For the Q matrix, we considered the definition as in (13), we used the method in [64] to determine the values of the K tensor, and assumed that the distribution of the B matrix would be uniform. For the classic definition of the weight matrix W based on heat diffusion mechanism, we used a Gaussian kernel to define the function η mentioned in Section II-A [6], [31].…”
Section: Derivation Of Transition Probability Density Function In The...mentioning
Representing data by means of graph structures identifies one of the most valid approach to extract information in several data analysis applications. This is especially true when multimodal datasets are investigated, as records collected by means of diverse sensing strategies are taken into account and explored. Nevertheless, classic graph signal processing is based on a model for information propagation that is configured according to heat diffusion mechanism. This system provides several constraints and assumptions on the data properties that might be not valid for multimodal data analysis, especially when large scale datasets collected from heterogeneous sources are considered, so that the accuracy and robustness of the outcomes might be severely jeopardized. In this paper, we introduce a novel model for graph definition based on fluid diffusion. The proposed approach improves the ability of graph-based data analysis to take into account several issues of modern data analysis in operational scenarios, so to provide a platform for precise, versatile, and efficient understanding of the phenomena underlying the records under exam, and to fully exploit the potential provided by the diversity of the records in obtaining a thorough characterization of the data and their significance. In this work, we focus our attention to using this fluid diffusion model to drive a community detection scheme, i.e., to divide multimodal datasets into many groups according to similarity among nodes in an unsupervised fashion. Experimental results achieved by testing real multimodal datasets in diverse application scenarios show that our method is able to strongly outperform state-of-the-art schemes for community detection in multimodal data analysis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.