Johny Ijaq scite author profile

Annotation and curation of uncharacterized proteins- challenges

Hypothetical proteins (HPs) are proteins predicted to be expressed from an open reading frame, making up a substantial fraction of proteomes in both prokaryotes and eukaryotes. Genome projects have led to the identification of many therapeutic targets, the putative function of the protein, and their interactions. In this review, we list various methods linking annotation to structural and functional prediction of HPs that assist in the discovery of new structures and functions serving as markers and pharmacological targets for drug design, discovery, and screening. Further, we give an overview of how mass spectrometry as an analytical technique is used to validate protein characterisation. We discuss how microarrays and protein expression profiles help in understanding biological systems through a systems-wide study of proteins and their interactions with other proteins and non-proteinaceous molecules to control complex processes in cells. Finally, we articulate challenges on how next generation sequencing methods have accelerated multiple areas of genomics, with special focus on uncharacterized proteins.

Protocol for Molecular Dynamics Simulations of Proteins

Gajula, Kumar, Ijaq. 2016. BIO-PROTOCOL

A model to predict the function of hypothetical proteins through a nine-point classification scoring schema

Ijaq, Malik, Kumar, et al. 2019. BMC Bioinformatics

Background: Hypothetical proteins [HP] are those that are predicted to be expressed in an organism, but no evidence of their existence is known. In the recent past, annotation and curation efforts have helped overcome the challenge in understanding their diverse functions. Techniques to decipher the sequence-structure-function relationship, especially in terms of functional modelling of the HPs, have been developed by researchers, but using the features as classifiers for HPs has not been attempted. With the rise in the number of annotation strategies, next-generation sequencing methods have furthered the understanding of the functions of HPs.

Results: In our previous work, we developed a six-point classification scoring schema with annotation pertaining to protein family scores, orthology, protein interaction/association studies, bidirectional best BLAST hits, sorting signals, known databases and visualizers which were used to validate protein interactions. In this study, we introduced three more classifiers to our annotation system, viz. pseudogenes linked to HPs, homology modelling and non-coding RNAs associated with HPs. We discuss the challenges and performance of these classifiers using machine learning heuristics with an improved accuracy from Perceptron (81.08 to 97.67), Naive Bayes (54.05 to 96.67), Decision tree J48 (67.57 to 97.00), and SMO_npolyk (59.46 to 96.67).

Conclusion: With the introduction of three new classification features, the nine-point classification scoring schema functionally annotates the HPs with improved accuracy.

Electronic supplementary material: The online version of this article (10.1186/s12859-018-2554-y) contains supplementary material, which is available to authorized users.

Genome-Wide Mining, Characterization and Development of miRNA-SSRs in Arabidopsis thaliana

Kumar, Chauhan, Kompelli, et al. 2017. Preprint

Simple Sequence Repeats (SSRs), also known as microsatellites, are short tandem repeats of […] that their occurrence within microRNA (miRNA) genes has received attention. As is widely […] were mined. The sequences were retrieved by annotations available at EnsemblPlants using […] BatchPrimer3 server with miRNA-SSR flanking primers found to be well distributed. Our […] confirm that miRNA-SSRs were commonly spread across the full length pre-miRNAs, we […]
