3D reconstruction of lower anterior teeth from CBCT images: automatic segmentation with manual refinements

The k-means clustering algorithm is considered one of the most powerful and popular data mining algorithms in the research community. However, despite its popularity, the algorithm has certain limitations, including problems associated with random initialization of the centroids which leads to unexpected convergence. Additionally, such a clustering algorithm requires the number of clusters to be defined beforehand, which is responsible for different cluster shapes and outlier effects. A fundamental problem of the k-means algorithm is its inability to handle various data types. This paper provides a structured and synoptic overview of research conducted on the k-means algorithm to overcome such shortcomings. Variants of the k-means algorithms including their recent developments are discussed, where their effectiveness is investigated based on the experimental analysis of a variety of datasets. The detailed experimental analysis along with a thorough comparison among different k-means clustering algorithms differentiates our work compared to other existing survey papers. Furthermore, it outlines a clear and thorough understanding of the k-means algorithm along with its different research directions.

show abstract

Approximate information state for approximate planning and reinforcement learning in partially observed systems

Subramanian¹,

Sinha²,

Seraj³

et al. 2020

Preprint

View full text Add to dashboard Cite

Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning

Islam¹,

Seraj²,

Arnob³

et al. 2019

Preprint

View full text Add to dashboard Cite

We study the problem of off-policy critic evaluation in several variants of valuebased off-policy actor-critic algorithms. Off-policy actor-critic algorithms require an off-policy critic evaluation step, to estimate the value of the new policy after every policy gradient update. Despite enormous success of off-policy policy gradients on control tasks, existing general methods suffer from high variance and instability, partly because the policy improvement depends on gradient of the estimated value function. In this work, we present a new way of off-policy policy evaluation in actor-critic, based on the doubly robust estimators. We extend the doubly robust estimator from off-policy policy evaluation (OPE) to actorcritic algorithms that consist of a reward estimator performance model. We find that doubly robust estimation of the critic can significantly improve performance in continuous control tasks. Furthermore, in cases where the reward function is stochastic that can lead to high variance, doubly robust critic estimation can improve performance under corrupted, stochastic reward signals, indicating its usefulness for robust and safe reinforcement learning.Preprint. Under review.

show abstract

Concept Drift for Big Data

Seraj

Ahmed

2020

View full text Add to dashboard Cite

Mean-field approximation for large-population beauty-contest games

Seraj

Mahajan

2021

View full text Add to dashboard Cite

Tsetlin Machine for Solving Contextual Bandit Problems

Seraj¹,

Sharma²,

Granmo³

2022

Preprint

View full text Add to dashboard Cite

This paper introduces an interpretable contextual bandit algorithm using Tsetlin Machines, which solves complex pattern recognition tasks using propositional logic. The proposed bandit learning algorithm relies on straightforward bit manipulation, thus simplifying computation and interpretation. We then present a mechanism for performing Thompson sampling with Tsetlin Machine, given its non-parametric nature. Our empirical analysis shows that Tsetlin Machine as a base contextual bandit learner outperforms other popular base learners on eight out of nine datasets. We further analyze the interpretability of our learner, investigating how arms are selected based on propositional expressions that model the context 1 .

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Raihan Seraj

The k-means Algorithm: A Comprehensive Survey and Performance Evaluation

Approximate information state for approximate planning and reinforcement learning in partially observed systems

Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning

Concept Drift for Big Data

Mean-field approximation for large-population beauty-contest games

Tsetlin Machine for Solving Contextual Bandit Problems

Contact Info

Product

Resources

About