2021
DOI: 10.1186/s13321-021-00505-3
|View full text |Cite
|
Sign up to set email alerts
|

Extended similarity indices: the benefits of comparing more than two objects simultaneously. Part 1: Theory and characteristics†

Abstract: Quantification of the similarity of objects is a key concept in many areas of computational science. This includes cheminformatics, where molecular similarity is usually quantified based on binary fingerprints. While there is a wide selection of available molecular representations and similarity metrics, there were no previous efforts to extend the computational framework of similarity calculations to the simultaneous comparison of more than two objects (molecules) at the same time. The present study bridges t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
68
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7
2

Relationship

6
3

Authors

Journals

citations
Cited by 45 publications
(69 citation statements)
references
References 34 publications
1
68
0
Order By: Relevance
“…To calculate the diversity of molecules, the pairwise similarity of each combination of molecules in a set has been traditionally calculated using a binary similarity index, like the Tanimoto similarity, 54,55 and summarised in an aggregate metric. However, the recent development of extended similarity metrics 71,72 enables the simultaneous and straightforward comparison of an arbitrary number of bitvectors such as molecular ngerprints.…”
Section: Internal Similaritymentioning
confidence: 99%
“…To calculate the diversity of molecules, the pairwise similarity of each combination of molecules in a set has been traditionally calculated using a binary similarity index, like the Tanimoto similarity, 54,55 and summarised in an aggregate metric. However, the recent development of extended similarity metrics 71,72 enables the simultaneous and straightforward comparison of an arbitrary number of bitvectors such as molecular ngerprints.…”
Section: Internal Similaritymentioning
confidence: 99%
“…Additionally, the two novel methods discussed at the meeting: the Extended Similarity Indices [22,23] developed by the research group of Miranda-Quintana, and the Structure-Activity Relationships Matrix (SARM) by the group of Bajorath [25], can be helpful in multiple applications such as analog series identification (fragmentation), de novo drug design signatures study, SAR visualization, reactivity predictions, similarity searching, and visualization of chemical space.…”
Section: Ligand-based Drug Design Opportunitiesmentioning
confidence: 99%
“…Extended similarity analysis: from pair of molecules, to chemical space and beyond [22,23] Jürgen Bajorath University of Bonn (Germany)…”
Section: Introductionmentioning
confidence: 99%
“…The algorithm is inspired by the diversity pickers commonly applied in cheminformatics to sample large chemical spaces, usually based on the use of binary molecular fingerprints. 18 The various versions of the extended similarity indices [18][19][20] have shown great promise in the problems of diversity selection 21 and exploration of large and various datasets 22,23 including complex biological ensembles. 24 The keys to this success are the ability of the extended indices to quantify similarities between any number of objects, and the fact that they can do so with linear scaling.…”
Section: Introductionmentioning
confidence: 99%