Sashank Dara scite author profile

Abstract-Many defect prediction techniques are proposed to improve software reliability. Change classification predicts defects at the change level, where a change is the modifications to one file in a commit. In this paper, we conduct the first study of applying change classification in practice.We identify two issues in the prediction process, both of which contribute to the low prediction performance. First, the data are imbalanced-there are much fewer buggy changes than clean changes. Second, the commonly used cross-validation approach is inappropriate for evaluating the performance of change classification. To address these challenges, we apply and adapt online change classification, resampling, and updatable classification techniques to improve the classification performance.We perform the improved change classification techniques on one proprietary and six open source projects. Our results show that these techniques improve the precision of change classification by 12.2-89.5% or 6.4-34.8 percentage points (pp.) on the seven projects. In addition, we integrate change classification in the development process of the proprietary project. We have learned the following lessons: 1) new solutions are needed to convince developers to use and believe prediction results, and prediction results need to be actionable, 2) new and improved classification algorithms are needed to explain the prediction results, and insensible and unactionable explanations need to be filtered or refined, and 3) new techniques are needed to improve the relatively low precision.

show abstract

Towards privacy preserving threat intelligence

Dara

Zargar

Muralidhara

2018

Journal of Information Security and Applications

View full text Add to dashboard Cite

Cryptography Challenges for Computational Privacyin Public Clouds

Dara¹

2013

View full text Add to dashboard Cite

Efficient Format Preserving encrypted databases

Chandrashekar

Dara²,

Muralidhara

2015

View full text Add to dashboard Cite

FNR: Arbitrary Length Small Domain Block Cipher Proposal

Dara

Fluhrer

2014

View full text Add to dashboard Cite

Abstract. We propose a practical flexible (or arbitrary) length small domain block cipher, FNR encryption scheme. FNR denotes Flexible Naor and Reingold. It can cipher small domain data formats like IPv4, Port numbers, MAC Addresses, Credit card numbers, any random short strings while preserving their input length. In addition to the classic Feistel networks, Naor and Reingold propose usage of Pair-wise independent permutation (PwIP) functions based on Galois Field GF(2 n ). Instead we propose usage of random N ×N Invertible matrices in GF(2).

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Sashank Dara

Online Defect Prediction for Imbalanced Data

Towards privacy preserving threat intelligence

Cryptography Challenges for Computational Privacyin Public Clouds

Efficient Format Preserving encrypted databases

FNR: Arbitrary Length Small Domain Block Cipher Proposal

Contact Info

Product

Resources

About