A multi-class classification system for continuous water quality monitoring

Shakhari, Swapan; Banerjee, Indrajit

doi:10.1016/j.heliyon.2019.e01822

Cited by 17 publications

(12 citation statements)

References 16 publications

“…Accuracy is defined as the ratio of all True Positives of the matrix divided by the sum of all instances in the dataset and it expresses the ability of the model to correctly identify Low, Medium and High instances. Kappa statistics considers the fact that some of the correct predictions may be identified as such by chance, so it adjusts the reported model accuracy by considering the effect of randomness in correct predictions [61]. Comparing the four implemented machine learning algorithms, in Figure 6 we see that both for accuracy and Kappa statistics, the RF performs best.…”

Section: Evaluating the Performance Of The Machine Learning Methods (mentioning)

confidence: 99%
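
The accuracy and Kappa definitions quoted above can be illustrated with a minimal sketch. The 3x3 confusion matrix below is hypothetical (it is not data from the cited paper), and the function names `accuracy` and `cohen_kappa` are chosen for illustration only. Rows are assumed to be actual classes and columns predicted classes, in the order Low, Medium, High.

```python
from typing import List

Matrix = List[List[int]]


def accuracy(cm: Matrix) -> float:
    """Diagonal (correct predictions) divided by all instances."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(len(cm)))
    return correct / total


def cohen_kappa(cm: Matrix) -> float:
    """Accuracy adjusted for the agreement expected by chance."""
    n = len(cm)
    total = sum(sum(row) for row in cm)
    row_sums = [sum(cm[i]) for i in range(n)]                       # actual counts
    col_sums = [sum(cm[i][j] for i in range(n)) for j in range(n)]  # predicted counts
    p_observed = accuracy(cm)
    # Chance agreement: product of the marginal proportions, summed over classes.
    p_expected = sum(r * c for r, c in zip(row_sums, col_sums)) / (total * total)
    if p_expected == 1.0:
        raise ValueError("Kappa is undefined when chance agreement equals 1")
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical confusion matrix (rows: actual, columns: predicted).
example_cm = [
    [50, 5, 0],   # actual Low
    [4, 30, 6],   # actual Medium
    [1, 5, 19],   # actual High
]

if __name__ == "__main__":
    print(f"accuracy = {accuracy(example_cm):.3f}")
    print(f"kappa    = {cohen_kappa(example_cm):.3f}")
```

For this made-up matrix the accuracy is 0.825 while Kappa is lower (about 0.72), which shows how the chance correction discounts agreement that the class proportions alone would produce.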

“…(a) (b) To further analyze the modeling results, we compared in detail the performance of the four algorithms by using three metrics: precision, recall and specificity [61]. Precision is defined for each class (Low, Medium and High) as the ratio of True Positives by the sum of True Positives and False Positives for that class.…”

Section: Evaluating the Performance Of The Machine Learning Methods (mentioning)

confidence: 99%
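
The per-class precision described in the statement above, along with recall and specificity, can be computed from a confusion matrix as in the following sketch. The statement only spells out precision; the recall and specificity formulas used here are the standard one-vs-rest definitions and are an assumption about what the citing paper applies. The matrix values and the name `per_class_metrics` are hypothetical.

```python
from typing import Dict, List


def per_class_metrics(cm: List[List[int]], labels: List[str]) -> Dict[str, Dict[str, float]]:
    """One-vs-rest precision, recall and specificity for each class.

    Rows of `cm` are actual classes, columns are predicted classes.
    """
    n = len(cm)
    total = sum(sum(row) for row in cm)
    results: Dict[str, Dict[str, float]] = {}
    for k, label in enumerate(labels):
        tp = cm[k][k]
        fp = sum(cm[i][k] for i in range(n)) - tp   # predicted k, actually another class
        fn = sum(cm[k]) - tp                        # actually k, predicted another class
        tn = total - tp - fp - fn
        results[label] = {
            # Guard against empty denominators by reporting 0.0.
            "precision": tp / (tp + fp) if (tp + fp) else 0.0,
            "recall": tp / (tp + fn) if (tp + fn) else 0.0,
            "specificity": tn / (tn + fp) if (tn + fp) else 0.0,
        }
    return results


# Hypothetical confusion matrix (rows: actual, columns: predicted).
example_cm = [
    [50, 5, 0],   # actual Low
    [4, 30, 6],   # actual Medium
    [1, 5, 19],   # actual High
]
metrics = per_class_metrics(example_cm, ["Low", "Medium", "High"])

if __name__ == "__main__":
    for label, values in metrics.items():
        print(label, {name: round(v, 3) for name, v in values.items()})
```

Specificity relates to the "false alarms" mentioned in the citing paper's abstract: a class with high specificity is rarely predicted when it is not the true class.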

Machine Learning Approaches for Predicting Health Risk of Cyanobacterial Blooms in Northern European Lakes

Μέλλιος, Moe, Laspidou. 2020. Water.

Cyanobacterial blooms are considered a major threat to global water security with documented impacts on lake ecosystems and public health. Given that cyanobacteria possess highly adaptive traits that favor them to prevail under different and often complicated stressor regimes, predicting their abundance is challenging. A dataset from 822 Northern European lakes is used to determine which variables better explain the variation of cyanobacteria biomass (CBB) by means of stepwise multiple linear regression. Chlorophyll-a (Chl-a) and total nitrogen (TN) provided the best modelling structure for the entire dataset, while for subsets of shallow and deep lakes, Chl-a, mean depth, TN and TN/TP explained part of the variance in CBB. Path analysis was performed and corroborated these findings. Finally, CBB was translated to a categorical variable according to risk levels for human health associated with the use of lakes for recreational activities. Several machine learning methods, namely Decision Tree, K-Nearest Neighbors, Support-vector Machine and Random Forest, were applied showing a remarkable ability to predict the risk, while Random Forest parameters were tuned and optimized, achieving a 95.81% accuracy, exceeding the performance of all other machine learning methods tested. A confusion matrix analysis is performed for all machine learning methods, identifying the potential of each method to correctly predict CBB risk levels and assessing the extent of false alarms; random forest clearly outperforms the other methods with very promising results. cyanobacterial abundance in lakes has been the focus of several past and current studies that have highlighted different hydrological, climatic and human-oriented conditions. 
Cyanobacterial blooms are not a modern phenomenon and have been reported in scientific literature for more than 130 years [9]; however, they tend to appear much more frequently in recent decades, mainly due to anthropogenic activities that tend to change the global climatic and environmental regime. Examples of such activities include changes in hydrological flow pathways, excessive use of fertilizers and the gradual removal of natural buffering zones between terrestrial and freshwater ecosystems [10]. On the contrary, there are some anthropogenic changes, such as flooding and flushing, that tend to reduce the growth of cyanobacteria more than other algae [11]. Empirical modeling has recognized the fundamental effect of phosphorus and nitrogen on the fluctuation of cyanobacterial biomass, incriminating over-enrichment of lakes with nutrients as a major driver of cyanobacterial blooms [12][13][14][15][16]. In addition, high air or water temperature [11,17], calm weather (low wind speed) [18], high water residence time [19,20], low nitrogen-to-phosphorus ratios [21,22] and low light availability [23,24] are documented as significant factors and possible predictors that determine the dominance of cyanobacteria. However, predicting the concentration of cyanobacterial biomass in lakes is a complex and challen...

“…IoT is produced a lot from applications such as to conduct human health monitoring in real time which is better known as e-Health [2], military [3], smart city [4], [5], agriculture [6], [7], and so on [8]. The various applications of IoT technology are used to make predictions using various methods such as fuzzy logic [9]- [11], support vector machine (SVM) [12], decision tree [13], and so on.…”

Section: Introduction (mentioning)

confidence: 99%

An implementation of IoT for environmental monitoring and its analysis using k-NN algorithm

Prayitno, Fahmi, Rasyid et al. 2021. TELKOMNIKA.

Environmental monitoring is the process of observing the surroundings under various conditions. Recently, internet of things (IoT) and wireless sensor network (WSN) technologies have helped to address this task. In this paper, we implemented a system to monitor environmental conditions using IoT and WSN technology. The measured data are temperature, humidity, carbon monoxide (CO) and carbon dioxide (CO2). All sensor data are sent to and stored in the cloud through the internet in real time. We provide website and mobile phone applications for monitoring environmental conditions, so users can access them wherever and whenever. Furthermore, the data analysis evaluation confirms that the k-NN method is better than other methods, with an accuracy rate of 99.0657%.

“…Kappa statistics considers the fact that some of the correct predictions may be identified as such by chance, so it adjusts the reported model accuracy by considering the effect of randomness in correct predictions (Shakhari and Banerjee, 2019). Comparing the four implemented machine learning algorithms, in Figure 4.6 we see that both for accuracy and Kappa statistics, the RF performs best.…”

Section: Evaluating the Performance Of The Machine Learning Methods (mentioning)

confidence: 98%

“…According to (Landis and Koch, 1977), a model is considered to produce accurate predictions when Kappa exceeds 60%, which in our case is succeeded. To further analyze the modelling results, the performance of the four algorithms was compared in detail by using three metrics: precision, recall and specificity (Shakhari and Banerjee, 2019). Precision is defined for each class (Low, Medium and High) as the ratio of True Positives by the sum of True Positives and False Positives for that class.…”

Section: Evaluating the Performance Of The Machine Learning Methods (mentioning)

confidence: 99%

Environmental informatics for modelling of freshwater ecosystems

Μέλλιος

This doctoral thesis investigates the potential of ecological modelling, within the framework of environmental informatics, for identifying the factors that affect qualitative and quantitative water parameters in lake ecosystems. Lake Karla, a Greek hypertrophic lake, faces continuous degradation of its waters due to extensive nutrient loading, the unfinished works of the original reservoir design, and operational failures. Ecological modelling is used as a tool to simulate nutrient dynamics in the lake and to identify the factors affecting its trophic state, while several management scenarios and one climate change scenario are applied to assess their effects on the state of the lake. In addition, microcystins are modelled by means of a modern artificial intelligence algorithm (ANFIS). Subsequently, cyanobacterial biomass and the risk levels for human health associated with cyanobacterial abundance are modelled in a network of Northern European lakes. Stepwise linear regression, a series of machine learning algorithms and Bayesian hierarchical modelling are applied in order to evaluate their predictive performance. Finally, an analysis is carried out of which crop types export the most virtual water through trade, in relation to the benefit to the Greek economy.
