Ali Seman scite author profile

Ali Seman

5Publications

34Citation Statements Received

50Citation Statements Given

How they've been cited

How they cite others

Affiliations

Universiti Teknologi MARA

Publications

Order By: Most citations

Towards Development of Clustering Applications for Large-Scale Comparative Genotyping and Kinship Analysis Using Y-Short Tandem Repeats

Seman

Sapawi

Salleh

2015

OMICS: A Journal of Integrative Biology

View full text Add to dashboard Cite

Y-chromosome short tandem repeats (Y-STRs) are genetic markers with practical applications in human identification. However, where mass identification is required (e.g., in the aftermath of disasters with significant fatalities), the efficiency of the process could be improved with new statistical approaches. Clustering applications are relatively new tools for large-scale comparative genotyping, and the k-Approximate Modal Haplotype (k-AMH), an efficient algorithm for clustering large-scale Y-STR data, represents a promising method for developing these tools. In this study we improved the k-AMH and produced three new algorithms: the Nk-AMH I (including a new initial cluster center selection), the Nk-AMH II (including a new dominant weighting value), and the Nk-AMH III (combining I and II). The Nk-AMH III was the superior algorithm, with mean clustering accuracy that increased in four out of six datasets and remained at 100% in the other two. Additionally, the Nk-AMH III achieved a 2% higher overall mean clustering accuracy score than the k-AMH, as well as optimal accuracy for all datasets (0.84–1.00). With inclusion of the two new methods, the Nk-AMH III produced an optimal solution for clustering Y-STR data; thus, the algorithm has potential for further development towards fully automatic clustering of any large-scale genotypic data.

show abstract

An efficient clustering algorithm for partitioning Y-short tandem repeats data

Seman¹,

Bakar²,

Isa³

2012

BMC Research Notes

View full text Add to dashboard Cite

BackgroundY-Short Tandem Repeats (Y-STR) data consist of many similar and almost similar objects. This characteristic of Y-STR data causes two problems with partitioning: non-unique centroids and local minima problems. As a result, the existing partitioning algorithms produce poor clustering results.ResultsOur new algorithm, called k-Approximate Modal Haplotypes (k-AMH), obtains the highest clustering accuracy scores for five out of six datasets, and produces an equal performance for the remaining dataset. Furthermore, clustering accuracy scores of 100% are achieved for two of the datasets. The k-AMH algorithm records the highest mean accuracy score of 0.93 overall, compared to that of other algorithms: k-Population (0.91), k-Modes-RVF (0.81), New Fuzzy k-Modes (0.80), k-Modes (0.76), k-Modes-Hybrid 1 (0.76), k-Modes-Hybrid 2 (0.75), Fuzzy k-Modes (0.74), and k-Modes-UAVM (0.70).ConclusionsThe partitioning performance of the k-AMH algorithm for Y-STR data is superior to that of other algorithms, owing to its ability to solve the non-unique centroids and local minima problems. Our algorithm is also efficient in terms of time complexity, which is recorded as O(km(n-k)) and considered to be linear.

show abstract

An efficient clustering algorithm for partitioning Y-short tandem repeats data

2012

View full text Add to dashboard Cite

show abstract

Review of dimensionality reduction techniques using clustering algorithm in reconstruction of gene regulatory networks

Pindah

Nordin

Seman

et al. 2015

View full text Add to dashboard Cite

Evaluation of k-modes-type Algorithms for Clustering Y-Short Tandem Repeats Data

Seman¹,

Bakar²,

Isa

2012

Trends in Bioinformatics

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ali Seman

Towards Development of Clustering Applications for Large-Scale Comparative Genotyping and Kinship Analysis Using Y-Short Tandem Repeats

An efficient clustering algorithm for partitioning Y-short tandem repeats data

An efficient clustering algorithm for partitioning Y-short tandem repeats data

Review of dimensionality reduction techniques using clustering algorithm in reconstruction of gene regulatory networks

Evaluation of k-modes-type Algorithms for Clustering Y-Short Tandem Repeats Data

Contact Info

Product

Resources

About