Tsuyoshi Idé scite author profile

We report on an automated runtime anomaly detection method at the application layer of multi-node computer systems. Although several network management systems are available in the market, none of them have sufficient capabilities to detect faults in multi-tier Web-based systems with redundancy. We model a Web-based system as a weighted graph, where each node represents a "service" and each edge represents a dependency between services. Since the edge weights vary greatly over time, the problem we address is that of anomaly detection from a time sequence of graphs.In our method, we first extract a feature vector from the adjacency matrix that represents the activities of all of the services. The heart of our method is to use the principal eigenvector of the eigenclusters of the graph. Then we derive a probability distribution for an anomaly measure defined for a time-series of directional data derived from the graph sequence. Given a critical probability, the threshold value is adaptively updated using a novel online algorithm.We demonstrate that a fault in a Web application can be automatically detected and the faulty services are identified without using detailed knowledge of the behavior of the system.

show abstract

Semi-supervised local Fisher discriminant analysis for dimensionality reduction

Sugiyama

et al. 2009

View full text Add to dashboard Cite

When only a small number of labeled samples are available, supervised dimensionality reduction methods tend to perform poorly because of overfitting. In such cases, unlabeled samples could be useful in improving the performance. In this paper, we propose a semi-supervised dimensionality reduction method which preserves the global structure of unlabeled samples in addition to separating labeled samples in different classes from each other. The proposed method, which we call SEmi-supervised Local Fisher discriminant analysis (SELF), has an analytic form of the globally optimal solution and it can be computed based on eigen-decomposition. We show the usefulness of SELF through experiments with benchmark and real-world document classification datasets.Keywords Semi-supervised learning · Dimensionality reduction · Cluster assumption · Local Fisher discriminant analysis · Principal component analysis Editor: Roni Khardon.A preliminary version of this paper was previously published in Sugiyama et al. (2008). A MATLAB implementation of the proposed dimensionality reduction method SELF is available from

show abstract

Semi-Supervised Local Fisher Discriminant Analysis for Dimensionality Reduction

Sugiyama

Idé

Nakajima

et al. 2008

View full text Add to dashboard Cite

Unsupervised Change Analysis Using Supervised Learning

Hido

Idé

Kashima

et al. 2008

View full text Add to dashboard Cite

Abstract. We propose a formulation of a new problem, which we call change analysis, and a novel method for solving the problem. In contrast to the existing methods of change (or outlier) detection, the goal of change analysis goes beyond detecting whether or not any changes exist. Its ultimate goal is to find the explanation of the changes. While change analysis falls in the category of unsupervised learning in nature, we propose a novel approach based on supervised learning to achieve the goal. The key idea is to use a supervised classifier for interpreting the changes. A classifier should be able to discriminate between the two data sets if they actually come from two different data sources. In other words, we use a hypothetical label to train the supervised learner, and exploit the learner for interpreting the change. Experimental results using real data show the proposed approach is promising in change analysis as well as concept drift analysis.

show abstract

Travel-Time Prediction using Gaussian Process Regression: A Trajectory-Based Approach

Idé¹,

Kato²

2009

View full text Add to dashboard Cite

This paper is concerned with the task of travel-time prediction for an arbitrary origin-destination pair on a map. Unlike most of the existing studies, which focus only on a particular link (road segment) with heavy traffic, our method allows us to probabilistically predict the travel time along an unknown path (a sequence of links) if the similarity between paths is defined as a kernel function. Our first innovation is to use a string kernel to represent the similarity between paths. Our second new idea is to apply Gaussian process regression for probabilistic travel-time prediction. We tested our approach with realistic traffic data.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Tsuyoshi Idé

Eigenspace-based anomaly detection in computer systems

Semi-supervised local Fisher discriminant analysis for dimensionality reduction

Semi-Supervised Local Fisher Discriminant Analysis for Dimensionality Reduction

Unsupervised Change Analysis Using Supervised Learning

Travel-Time Prediction using Gaussian Process Regression: A Trajectory-Based Approach

Contact Info

Product

Resources

About