Shyam Deshmukh scite author profile

Shyam Deshmukh

5Publications

31Citation Statements Received

33Citation Statements Given

How they've been cited

How they cite others

Affiliations

Koneru Lakshmaiah Education Foundation

Publications

Order By: Most citations

Collaborative Learning Based Straggler Prevention in Large-Scale Distributed Computing Framework

Deshmukh

Rao

Shabaz

2021

Security and Communication Networks

View full text Add to dashboard Cite

Modern big data applications tend to prefer a cluster computing approach as they are linked to the distributed computing framework that serves users jobs as per demand. It performs rapid processing of tasks by subdividing them into tasks that execute in parallel. Because of the complex environment, hardware and software issues, tasks might run slowly leading to delayed job completion, and such phenomena are also known as stragglers. The performance improvement of distributed computing framework is a bottleneck by straggling nodes due to various factors like shared resources, heavy system load, or hardware issues leading to the prolonged job execution time. Many state-of-the-art approaches use independent models per node and workload. With increased nodes and workloads, the number of models would increase, and even with large numbers of nodes. Not every node would be able to capture the stragglers as there might not be sufficient training data available of straggler patterns, yielding suboptimal straggler prediction. To alleviate such problems, we propose a novel collaborative learning-based approach for straggler prediction, the alternate direction method of multipliers (ADMM), which is resource-efficient and learns how to efficiently deal with mitigating stragglers without moving data to a centralized location. The proposed framework shares information among the various models, allowing us to use larger training data and bring training time down by avoiding data transfer. We rigorously evaluate the proposed method on various datasets with high accuracy results.

show abstract

Survey on Task Assignment Techniques in Hadoop

Patil¹,

Deshmukh²

2012

IJCA

View full text Add to dashboard Cite

MapReduce is an implementation for processing large scale data parallelly. Actual benefits of MapReduce occur when this framework is implemented in large scale, shared nothing cluster. MapReduce framework abstracts the complexity of running distributed data processing across multiple nodes in cluster. Hadoop is open source implementation of MapReduce framework, which processes the vast amount of data in parallel on large clusters. In Hadoop pluggable scheduler was implemented, because of this several algorithms have been developed till now. This paper presents the different schedulers used for Hadoop.

show abstract

Comparative Efficacy Of Two Different Synchronization Protocol In Postpartum Dairy Cows

Hirole¹,

Deshmukh²,

Ingawale³

et al. 2018

Int. J. Livest. Res.

View full text Add to dashboard Cite

Job Classification for MapReduce Scheduler in Heterogeneous Environment

Deshmukh¹,

Aghav²,

Chakravarthy

2013

View full text Add to dashboard Cite

Analysis of SimpleKMeans with Multiple Dimensions using WEKA

Patil¹,

Deshmukh²,

Rajeswari³

2015

IJCA

View full text Add to dashboard Cite

Clustering techniques have more importance in data mining especially when the data size is very large. It is widely used in the fields including pattern recognition system, machine learning algorithms, analysis of images, information retrieval and bio-informatics. Different clustering algorithms are available such as Expectation Maximization (EM), Cobweb, FarthestFirst, OPTICS, SimpleKMeans etc. SimpleKMeans clustering is a simple clustering algorithm. It partitions n data tuples into k groups such that each entity in the cluster has nearest mean. This paper is about the implementation of the clustering techniques using WEKA interface. This paper includes a detailed analysis of various clustering techniques with the different standard online data sets. Analysis is based on the multiple dimensions which include time to build the model, number of attributes, number of iterations, number of clusters and error rate. General TermsData mining, Clustering, WEKA interface.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Shyam Deshmukh

Collaborative Learning Based Straggler Prevention in Large-Scale Distributed Computing Framework

Survey on Task Assignment Techniques in Hadoop

Comparative Efficacy Of Two Different Synchronization Protocol In Postpartum Dairy Cows

Job Classification for MapReduce Scheduler in Heterogeneous Environment

Analysis of SimpleKMeans with Multiple Dimensions using WEKA

Contact Info

Product

Resources

About