2016
DOI: 10.1371/journal.pone.0146666
|View full text |Cite
|
Sign up to set email alerts
|

Average Information Content Maximization—A New Approach for Fingerprint Hybridization and Reduction

Abstract: Fingerprints, bit representations of compound chemical structure, have been widely used in cheminformatics for many years. Although fingerprints with the highest resolution display satisfactory performance in virtual screening campaigns, the presence of a relatively high number of irrelevant bits introduces noise into data and makes their application more time-consuming. In this study, we present a new method of hybrid reduced fingerprint construction, the Average Information Content Maximization algorithm (AI… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
6
1

Relationship

4
3

Authors

Journals

citations
Cited by 15 publications
(21 citation statements)
references
References 35 publications
(27 reference statements)
0
21
0
Order By: Relevance
“…Consequently, we get a third class of compounds with an intermediate activity level such that K i ∈ [10 2 , 10 3 ], which forms a neutral class. Although it is a common practice to ignore this neutral class in the learning process [26], we show that its use allows to explore the chemical space better, see Fig. 1.…”
mentioning
confidence: 81%
“…Consequently, we get a third class of compounds with an intermediate activity level such that K i ∈ [10 2 , 10 3 ], which forms a neutral class. Although it is a common practice to ignore this neutral class in the learning process [26], we show that its use allows to explore the chemical space better, see Fig. 1.…”
mentioning
confidence: 81%
“…To resolve the aforementioned difficulties with application of high resolution fingerprints, the AIC-MAX algorithm [19] was recently introduced to select features with the highest discriminatory potential in virtual screening-like experiments. AIC-MAX uses mutual information normalized by the Shannon entropy to rank a group of features with respect to their significance measured by activity label .where is a binary sequence (fingerprint of length N ) and , and denote the probabilities that , } and , , respectively.…”
Section: Methodsmentioning
confidence: 99%
“…Step 4: Update the velocity by using Equation (8) and the binary position by using Equations (10) and (11).…”
Section: Binary Particle Swarm Optimizationmentioning
confidence: 99%
“…In [7,8], the Fisher criterion is utilized to evaluate features comprehensively, considering both the intra-class distance and the inter-class distance. Information measure-based feature selection methods are utilized to select important features in [9,10]. Other methods, such as the relief method [11] and the rough set method [12], are also adopted for TSSP feature selection.…”
Section: Introductionmentioning
confidence: 99%