Romain Mathonat scite author profile

Romain Mathonat

7Publications

16Citation Statements Received

276Citation Statements Given

How they've been cited

How they cite others

197

276

Affiliations

Laboratoire d'Informatique en Images et Systèmes d'Information, Atos (France), Claude Bernard University Lyon 1

Publications

Order By: Most citations

SeqScout: Using a Bandit Model to Discover Interesting Subgroups in Labeled Sequences

Mathonat¹,

Nurbakova²,

Boulicaut³

et al. 2019

View full text Add to dashboard Cite

It is extremely useful to exploit labeled datasets not only to learn models but also to improve our understanding of a domain and its available targeted classes. The so-called subgroup discovery task has been considered for a long time. It concerns the discovery of patterns or descriptions, the set of supporting objects of which have interesting properties, e.g., they characterize or discriminate a given target class. Though many subgroup discovery algorithms have been proposed for transactional data, discovering subgroups within labeled sequential data and thus searching for descriptions as sequential patterns has been much less studied. In that context, exhaustive exploration strategies can not be used for real-life applications and we have to look for heuristic approaches. We propose the algorithm SeqScout to discover interesting subgroups (w.r.t. a chosen quality measure) from labeled sequences of itemsets. This is a new sampling algorithm that mines discriminant sequential patterns using a multi-armed bandit model. It is an anytime algorithm that, for a given budget, finds a collection of local optima in the search space of descriptions and thus subgroups. It requires a light configuration and it is independent from the quality measure used for pattern scoring. Furthermore, it is fairly simple to implement. We provide qualitative and quantitative experiments on several datasets to illustrate its added-value.

show abstract

A Behavioral Pattern Mining Approach to Model Player Skills in Rocket League

Mathonat¹,

Boulicaut²,

Kaytoue³

2020

View full text Add to dashboard Cite

Competitive gaming, or esports, is now wellestablished and brought the game industry in a novel era. It comes with many challenges among which evaluating the level of a player, given the strategies and skills she masters. We are interested in automatically identifying the so called skillshots from game traces of Rocket League, a "soccer with rocketpowered cars" game. From a pure data point of view, each skill execution is unique and standard pattern matching may be insufficient. We propose a non trivial data-centric approach based on pattern mining and supervised learning techniques. We show through an extensive set of experiments that most of Rocket League skillshots can be efficiently detected and used for player modelling. It unveils applications for match making, supporting game commentators and learning systems among others.

show abstract

Anytime mining of sequential discriminative patterns in labeled sequences

et al. 2020

View full text Add to dashboard Cite

Actionable Subgroup Discovery and Urban Farm Optimization

Millot

Mathonat

Cazabet

et al. 2020

View full text Add to dashboard Cite

Designing, selling and/or exploiting connected vertical urban farms is now receiving a lot of attention. In such farms, plants grow in controlled environments according to recipes that specify the different growth stages and instructions concerning many parameters (e.g., temperature, humidity, CO2, light). During the whole process, automated systems collect measures of such parameters and, at the end, we can get some global indicator about the used recipe, e.g., its yield. Looking for innovative ideas to optimize recipes, we investigate the use of a new optimal subgroup discovery method from purely numerical data. It concerns here the computation of subsets of recipes whose labels (e.g., the yield) show an interesting distribution according to a quality measure. When considering optimization, e.g., maximizing the yield, our virtuous circle optimization framework iteratively improves recipes by sampling the discovered optimal subgroup description subspace. We provide our preliminary results about the added-value of this framework thanks to a plant growth simulator that enables inexpensive experiments.

show abstract

"What makes my queries slow?": Subgroup Discovery for SQL Workload Analysis

Remil

Bendimerad²,

Mathonat³

et al. 2021

View full text Add to dashboard Cite

Anytime Subgroup Discovery in High Dimensional Numerical Data

Mathonat¹,

Nurbakova

Boulicaut

et al. 2021

View full text Add to dashboard Cite

Subgroup discovery (SD) enables one to elicit patterns that strongly discriminate a class label. When it comes to numerical data, most of the existing SD approaches perform data discretizations and thus suffer from information loss. A few algorithms avoid such a loss by considering the search space of every interval pattern built on the dataset numerical values and provide an "anytime" property: at any moment, they are able to provide a result that improves over time. Given a sufficient time/memory budget, they may eventually complete an exhaustive search. However, such approaches are often intractable when dealing with high-dimensional numerical data, for instance, when extracting features from real-life multivariate time series. To overcome such limitations, we propose MonteCloPi, an approach based on a bottom-up exploration of numerical patterns with a Monte Carlo Tree Search. It enables to have a better exploration-exploitation trade-off between exploration and exploitation when sampling huge search spaces. Our extensive set of experiments proves the efficiency of MonteCloPi on highdimensional data with hundreds of attributes. We finally discuss the actionability of discovered subgroups when looking for skill analysis from Rocket League action logs.

show abstract

"What makes my queries slow?": Subgroup Discovery for SQL Workload Analysis

Remil¹,

Bendimerad²,

Mathonat³

et al. 2021

Preprint

View full text Add to dashboard Cite

Among daily tasks of database administrators (DBAs), the analysis of query workloads to identify schema issues and improving performances is crucial. Although DBAs can easily pinpoint queries repeatedly causing performance issues, it remains challenging to automatically identify subsets of queries that share some properties only (a pattern) and simultaneously foster some target measures, such as execution time. Patterns are defined on combinations of query clauses, environment variables, database alerts and metrics and help answer questions like what makes SQL queries slow? What makes I/O communications high? Automatically discovering these patterns in a huge search space and providing them as hypotheses for helping to localize issues and root-causes is important in the context of explainable AI. To tackle it, we introduce an original approach rooted on Subgroup Discovery. We show how to instantiate and develop this generic data-mining framework to identify potential causes of SQL workloads issues. We believe that such data-mining technique is not trivial to apply for DBAs. As such, we also provide a visualization tool for interactive knowledge discovery. We analyse a one week workload from hundreds of databases from our company, make both the dataset and source code available, and experimentally show that insightful hypotheses can be discovered.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Romain Mathonat

SeqScout: Using a Bandit Model to Discover Interesting Subgroups in Labeled Sequences

A Behavioral Pattern Mining Approach to Model Player Skills in Rocket League

Anytime mining of sequential discriminative patterns in labeled sequences

Actionable Subgroup Discovery and Urban Farm Optimization

"What makes my queries slow?": Subgroup Discovery for SQL Workload Analysis

Anytime Subgroup Discovery in High Dimensional Numerical Data

"What makes my queries slow?": Subgroup Discovery for SQL Workload Analysis

Contact Info

Product

Resources

About