2023
DOI: 10.1021/acs.macromol.3c00761
|View full text |Cite
|
Sign up to set email alerts
|

Quantifying Pairwise Similarity for Complex Polymers

Jiale Shi,
Nathan J. Rebello,
Dylan Walsh
et al.

Abstract: Defining the similarity between chemical entities is an essential task in polymer informatics, enabling ranking, clustering, and classification. Despite its importance, the pairwise chemical similarity of polymers remains an open problem. Here, a similarity function for polymers with well-defined backbones is designed based on polymers’ stochastic graph representations generated from canonical BigSMILES, a structurally based line notation for describing macromolecules. The stochastic graph representations are … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 68 publications
(148 reference statements)
0
4
0
Order By: Relevance
“…To map GED(g 1 , g 2 ) onto a distance d ( g 1 , g 2 ) that is 0 and 1, first the GED is normalized to be G E D false( g 1 , g 2 false) false( N 1 + N 2 false) / 2 and then an exponential decay function is used: , d false( g 1 , g 2 false) = 1 exp true( prefix− α · normalG normalE normalD ( g 1 , g 2 ) ( N 1 + N 2 ) / 2 true) where N i denotes the number of nodes of g i and α is a tunable parameter with the default value being 1. d ( g 1 , g 2 ) is 0 when g 1 and g 2 are identical. d ( g 1 , g 2 ) is also symmetric, so d ( g 1 , g 2 ) = d ( g 2 , g 1 ).…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…To map GED(g 1 , g 2 ) onto a distance d ( g 1 , g 2 ) that is 0 and 1, first the GED is normalized to be G E D false( g 1 , g 2 false) false( N 1 + N 2 false) / 2 and then an exponential decay function is used: , d false( g 1 , g 2 false) = 1 exp true( prefix− α · normalG normalE normalD ( g 1 , g 2 ) ( N 1 + N 2 ) / 2 true) where N i denotes the number of nodes of g i and α is a tunable parameter with the default value being 1. d ( g 1 , g 2 ) is 0 when g 1 and g 2 are identical. d ( g 1 , g 2 ) is also symmetric, so d ( g 1 , g 2 ) = d ( g 2 , g 1 ).…”
Section: Methodsmentioning
confidence: 99%
“…Aldeghi et al developed a graph-based representation with “stochastic” edges to describe the average structure of repeat units . These existing text-based and graph-based stochastic representations can be used to calculate the pairwise similarity score, which captures the chemical and topological features contained in a polymer chemical structure diagram . However, these stochastic representations do not specify the weight or probability of each polymer molecule within the ensemble.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In addition to string notation, molecular graphs, where atoms and bonds can be represented as nodes and edges in graphs, have been successful (Figure b). Extended-connectivity fingerprints (ECFPs), also known as “Morgan fingerprints”, are generated through a circular approach, where the “extended connectivity” of atoms is described with increasingly large radii centered around non-hydrogen atoms . PolyGrammar is a polymer specific graph-type approach that is designed to support architecture and monomer chemistries .…”
Section: Library Synthesis Methodsmentioning
confidence: 99%