Machine learning (ML)-assisted solutions for quality of transmission (QoT) estimation or classification have received significant attention in recent years. However, due to the unavailability of large and well-structured datasets, individual research groups need to create and use their own datasets for validating their proposed solutions. Therefore, the reported results (obtained using different datasets) are difficult to reproduce and hardly comparable. Regardless of this limitation, the unavailability of a technique to be followed by different research groups for the explainability of the dataset makes it even harder to validate the developed ML-assisted solutions across different papers. In this work, we present a publicly available dataset collection to open the problem of data-driven QoT estimation to the ML community. The dataset collection allows various solutions presented by different research groups to be compared. Furthermore, we present techniques to visualize and evaluate datasets for QoT estimation. The presented visualizations can also deliver deep insight into the error analysis of ML models. We apply these new methods to evaluate an artificial neural network on different datasets. The results show the relevance of the presented visualizations for comparing different approaches and different datasets. The proposed methods enable the comparison and validation of different ML-based solutions and published datasets.
We propose a novel Deep Convolutional Neural Network formulation for network-wide QoT classification tasks and show its effectiveness for networks with significant topological differences. Our formulation achieves ~99% accuracy on large and diverse test datasets.
We demonstrate the development of a QoT classifier over an autonomous machine-learning pipeline, the trading of the classifier over a federated marketplace, and eventually its deployment in the customer’s network as a cloud-native micro-service.
We demonstrate a novel visualization dashboard, compatible with multiple data and telemetry sources, which offers dataset quality evaluation, dataset comparison, ML model error analysis interpretation, and network health monitoring.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.