José Manuel García-de la Vega scite author profile

José Manuel García-de la Vega

4Publications

15Citation Statements Received

67Citation Statements Given

How they've been cited

How they cite others

156

Affiliations

Autonomous University of Madrid

Publications

Order By: Most citations

Relational Agreement Measures for Similarity Searching of Cheminformatic Data Sets

Rivera-Borroto

Vega

Marrero-Ponce³

et al. 2016

IEEE/ACM Trans. Comput. Biol. and Bioinf.

View full text Add to dashboard Cite

Research on similarity searching of cheminformatic data sets has been focused on similarity measures using fingerprints. However, nominal scales are the least informative of all metric scales, increasing the tied similarity scores, and decreasing the effectivity of the retrieval engines. Tanimoto's coefficient has been claimed to be the most prominent measure for this task. Nevertheless, this field is far from being exhausted since the computer science no free lunch theorem predicts that "no similarity measure has overall superiority over the population of data sets". We introduce 12 relational agreement (RA) coefficients for seven metric scales, which are integrated within a group fusion-based similarity searching algorithm. These similarity measures are compared to a reference panel of 21 proximity quantifiers over 17 benchmark data sets (MUV), by using informative descriptors, a feature selection stage, a suitable performance metric, and powerful comparison tests. In this stage, RA coefficients perform favourably with repect to the state-of-the-art proximity measures. Afterward, the RA-based method outperform another four nearest neighbor searching algorithms over the same data domains. In a third validation stage, RA measures are successfully applied to the virtual screening of the NCI data set. Finally, we discuss a possible molecular interpretation for these similarity variants.

show abstract

Dunn’s index for cluster tendency assessment of pharmacological data sets

Rivera-Borroto

Rabassa-Gutiérrez

Grau-Abalo

et al. 2012

Can. J. Physiol. Pharmacol.

View full text Add to dashboard Cite

Cluster tendency assessment is an important stage in cluster analysis. In this sense, a group of promising techniques named visual assessment of tendency (VAT) has emerged in the literature. The presence of clusters can be detected easily through the direct observation of a dark blocks structure along the main diagonal of the intensity image. Alternatively, if the Dunn's index for a single linkage partition is greater than 1, then it is a good indication of the blocklike structure. In this report, the Dunn's index is applied as a novel measure of tendency on 8 pharmacological data sets, represented by machine-learning-selected molecular descriptors. In all cases, observed values are less than 1, thus indicating a weak tendency for data to form compact clusters. Other results suggest that there is an increasing relationship between the Dunn's index as a measure of cluster separability and the classification accuracy of various cluster algorithms tested on the same data sets.

show abstract

Integration of ligand and structure-based virtual screening for identification of leading anabolic steroids

Alvarez-Ginarte

Montero‐Cabrera

Vega

et al. 2013

The Journal of Steroid Biochemistry and Molecular Biology

View full text Add to dashboard Cite

Theoretical advances on coefficients of relational agreement: application to cheminformatics as k‐way biomolecular similarity measures

Rivera-Borroto

Vega

Hernández-Díaz

2013

Journal of Chemometrics

View full text Add to dashboard Cite

aWe provide formal proofs on the partial ordering among chance-corrected bivariate coefficients of relational agreement. Moreover, we prove that the non-corrected (chance-corrected) general formula of multivariate relational agreement is the weighted average of the corresponding non-corrected (chance-corrected) general formula of bivariate relational agreement, thus allowing to obtain a specific relationship between each multivariate coefficient and its corresponding bivariate coefficient for seven metric scales of measurements (absolute, difference, ratio, interval, log-ratio, log-interval, and ordinal). As a consequence, we report seven newly multivariate coefficients in the literature. Afterwards, eight multivariate coefficients are applied as k-way biomolecular similarity relations to cheminformatics in order to show their usefulness for discriminating between active and inactive biomolecules. The integration of this type of coefficients into operative virtual screening tools and the generalization to higher-degree polynomial relationships are discussed in the last part of the paper.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.