Anomaly detection is a demanding data analysis task that requires constant creation and improvement of data analysis algorithms. Using traditional clustering algorithms to analyse data streams is infeasible due to processing power and memory constraints. To solve this, the complexity of traditional clustering algorithms had to be reduced, which led to the creation of sequential clustering algorithms. The usual approach is two-phase clustering, which uses an online phase to reduce data detail and complexity, and an offline phase to cluster the concepts created in the online phase. Detecting anomalies in a data stream is usually handled in the online phase, as it requires unreduced data. Conversely, producing a good macro-clustering is done in the offline phase, which is why two-phase clustering algorithms have difficulty being equally good at anomaly detection and macro-clustering. In this paper, we propose a statistical hierarchical clustering algorithm equally suitable for both anomaly detection and macro-clustering. The proposed algorithm is single-phase and applies statistical inference to the input data stream, producing statistical distributions that are constantly updated. This makes the classification adaptable, allowing outliers to be agglomerated into clusters, population evolution to be tracked, and the algorithm to be used without knowing the expected number of clusters and outliers. The proposed algorithm was tested against typical clustering algorithms, including two-phase algorithms suitable for data stream analysis. A number of typical test cases were selected to show the universality and qualities of the proposed clustering algorithm.
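The two-phase approach described above can be illustrated with a minimal sketch of a generic online phase. This is not the proposed single-phase statistical algorithm; it is a conventional micro-clustering sketch, assuming cluster-feature summaries (count, linear sum, squared sum) and a hypothetical `max_radius` threshold: a point that no existing micro-cluster can absorb starts a new micro-cluster, which is how stream clusterers typically surface potential outliers for later treatment.

```python
import math


def dist(a, b):
    """Euclidean distance between two points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


class MicroCluster:
    """Cluster-feature summary: count n, linear sum ls, squared sum ss."""

    def __init__(self, point):
        self.n = 1
        self.ls = list(point)
        self.ss = [x * x for x in point]

    def absorb(self, point):
        """Update the summary incrementally; the raw point is discarded."""
        self.n += 1
        for i, x in enumerate(point):
            self.ls[i] += x
            self.ss[i] += x * x

    def centroid(self):
        return [s / self.n for s in self.ls]

    def radius(self):
        # RMS deviation from the centroid, aggregated over dimensions.
        var = sum(ss / self.n - (ls / self.n) ** 2
                  for ls, ss in zip(self.ls, self.ss))
        return math.sqrt(max(var, 0.0))


def online_phase(stream, max_radius):
    """Absorb each point into the nearest micro-cluster if it lies within
    max_radius of the centroid; otherwise open a new micro-cluster
    (a one-point cluster here plays the role of a potential outlier)."""
    clusters = []
    for point in stream:
        best = min(clusters, key=lambda c: dist(point, c.centroid()),
                   default=None)
        if best is not None and dist(point, best.centroid()) <= max_radius:
            best.absorb(point)
        else:
            clusters.append(MicroCluster(point))
    return clusters
```

An offline phase would then run a conventional clustering algorithm over the micro-cluster centroids to produce the macro-clustering, which is exactly the separation of concerns the abstract argues against for anomaly detection.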
Today, organizations capture and store an abundant amount of data from their interactions with clients, internal information systems, technical systems and sensors. Data captured this way contains many useful insights that can be discovered by various analytical procedures and methods. Discovering regular and irregular data sequences in the captured data can reveal processes performed by the organization, which can then be assessed, measured and optimized to achieve better overall performance, lower costs, resolve congestion, find potentially fraudulent activities, and similar. Besides process discovery, capturing data sequences can give additional behavioural and tendency insights into various observations in the organization, such as sales dynamics, customer behaviour and similar. The issue is that most captured data intertwines multiple processes, customers, cases and products in a single data log or data stream. In this article, we propose an evolving tokenized transducer (ETT), capable of learning data sequences from a multi-contextual data log or stream. The proposed ETT is a semi-supervised relational learning method that can be used as a classifier on an unknown data log or stream, revealing previously learned data sequences. The proposed ETT was tested on multiple synthetic and real-life cases and datasets, such as a dataset of retail sales sequences, a hospital process log involving septic patient treatment, and the BPI Challenge 2019 dataset. Test results are successful, revealing the ETT as a promising process discovery method.
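As a rough illustration of the sequence-learning problem the ETT addresses (not the ETT itself, whose construction is given in the article), the following sketch separates interleaved events by a hypothetical case identifier, records the observed token-to-token transitions, and then classifies a candidate sequence as known or unknown. All names (`learn_transitions`, `is_known_sequence`, the `(case_id, token)` event shape) are assumptions made for this sketch.

```python
from collections import defaultdict


def learn_transitions(log):
    """Learn token transitions from an event log in which events from many
    cases are interleaved. Each event is a (case_id, token) pair; grouping
    by case_id disentangles the intertwined sequences before recording
    which token follows which."""
    last = {}                        # case_id -> previously seen token
    transitions = defaultdict(set)   # token -> set of observed successors
    for case_id, token in log:
        if case_id in last:
            transitions[last[case_id]].add(token)
        last[case_id] = token
    return transitions


def is_known_sequence(seq, transitions):
    """Classify a candidate sequence: True iff every consecutive pair of
    tokens follows a transition observed during learning."""
    return all(b in transitions[a] for a, b in zip(seq, seq[1:]))
```

A single shared transition table is, of course, far weaker than a transducer with evolving states, but it shows why the case context is essential: without `case_id`, the interleaved log would yield spurious transitions between unrelated processes.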