Innovations play a very important role in the modern economy. They are the key to a higher quality of life, better jobs and economy and sustainable development. The innovation policy is a key element of both national and European Union strategy. The main aim of this paper is to present an ensemble clustering of European Union countries (member states) considering their innovativeness. In the empirical section, symbolic density-based ensemble clustering is used to obtain the co-occurrence matrix. The paper uses symbolicDA, clusterSim and dbscan packages of R software for all calculations. Four different clusters where obtained in the result of clustering. Cluster 1 contains highinnovative countries (innovation leaders). This cluster is also the least homogenous. Cluster 2 contains post-communist countries mainly from central Europe. These countries can be seen as rather mid-low innovative (they try to "catch up" with innovation leaders). Cluster 3 contains moderate innovators. Cluster 4 contains two countries that are also mid-innovative.
The research background of the paper covers the development of a country, that can be measured in various ways. Simple indicators, like GDP and also complex indicators such as HDI (human development index), can be used to measure country development. However, usually countries are divided into groups via setting some arbitrary levels of final measure. What is more, the composite (complex) indices have some problems and errors.
The main purpose of the paper is the assessment of the development of the selected European OECD countries with the application of the linear ordering and ensemble clustering of symbolic data as well as comparison of the ensemble clustering with a single model.
Research methodology covers linear ordering with the application of multidimensional scaling for a visualisation of results and ensemble clustering for symbolic data.
The results are compared according to adjusted Rand and silhouette indices. The obtained results show that ensemble clustering for symbolic data can be a useful tool in country development analysis and allows reaching better results than a single model.
The novelty of the proposed approach is to use a cluster analysis to obtain the clusters of countries with similar variables’ values (indicators of development) and the application of multidimensional scaling for symbolic data in order to visualise linear ordering results.
Informacje o naborze artykułów i zasadach recenzowania znajdują się na stronie internetowej Wydawnictwa www.pracenaukowe.ue.wroc.pl www.wydawnictwo.ue.wroc.pl Publikacja udostępniona na licencji Creative Commons Uznanie autorstwa-Użycie niekomercyjne-Bez utworów zależnych 3.0 Polska (CC BY-NC-ND 3.0 PL)
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.