This review is devoted to the critical analysis of advantages and disadvantages of existing mixture descriptors and their usage in various QSAR/QSPR tasks. We describe good practices for the QSAR modeling of mixtures, data sources for mixtures, a discussion of various mixture descriptors and their application, recommendations about proper external validation specific for mixture QSAR modeling, and future perspectives of this field. The biggest problem in QSAR of mixtures is the lack of reliable data about the mixtures' properties. Various mixture descriptors are used for the modeling of different endpoints. However, these descriptors have certain disadvantages, such as applicability only to 1 : 1 binary mixtures, and additive nature. The field of QSAR of mixtures is still under development, and existing efforts could be considered as a foundation for future approaches and studies. The usage of non-additive mixture descriptors, which are sensitive to interaction effects, in combination with best practices of QSAR model development (e.g., thorough data collection and curation, rigorous external validation, etc.) will significantly improve the quality of QSAR studies of mixtures.
Many drug candidates fail therapeutic development because of poor aqueous solubility. We have conceived a computer-aided strategy to enable polymeric micelle-based delivery of poorly soluble drugs. We built models predicting both drug loading efficiency (LE) and loading capacity (LC) using novel descriptors of drug-polymer complexes. These models were employed for virtual screening of drug libraries, and eight drugs predicted to have either high LE and high LC or low LE and low LC were selected. Three putative positives, as well as three putative negative hits, were confirmed experimentally (implying 75% prediction accuracy). Fortuitously, simvastatin, a putative negative hit, was found to have the desired micelle solubility. Podophyllotoxin and simvastatin (LE of 95% and 87% and LC of 43% and 41%, respectively) were among the top five polymeric micelle-soluble compounds ever studied experimentally. The success of the strategy described herein suggests its broad utility for designing drug delivery systems.
Background Previous studies assessing the prevalence of COVID-19 sequelae in adults and children were performed in the absence of an agreed definition. We investigated prevalence of post-COVID-19 condition (PCC) (WHO definition), at 6- and 12-months follow-up, amongst previously hospitalised adults and children and assessed risk factors. Methods Prospective cohort study of children and adults with confirmed COVID-19 in Moscow, hospitalised between April and August, 2020. Two follow-up telephone interviews, using the International Severe Acute Respiratory and Emerging Infection Consortium survey, were performed at 6 and 12 months after discharge. Results One thousand thirteen of 2509 (40%) of adults and 360 of 849 (42%) of children discharged participated in both the 6- and 12-month follow-ups. PCC prevalence was 50% (95% CI 47–53) in adults and 20% (95% CI 16–24) in children at 6 months, with decline to 34% (95% CI 31–37) and 11% (95% CI 8–14), respectively, at 12 months. In adults, female sex was associated with PCC at 6- and 12-month follow-up (OR 2.04, 95% CI 1.57 to 2.65) and (OR 2.04, 1.54 to 2.69), respectively. Pre-existing hypertension (OR 1.42, 1.04 to 1.94) was associated with post-COVID-19 condition at 12 months. In children, neurological comorbidities were associated with PCC both at 6 months (OR 4.38, 1.36 to 15.67) and 12 months (OR 8.96, 2.55 to 34.82) while allergic respiratory diseases were associated at 12 months (OR 2.66, 1.04 to 6.47). Conclusions Although prevalence of PCC declined one year after discharge, one in three adults and one in ten children experienced ongoing sequelae. In adults, females and persons with pre-existing hypertension, and in children, persons with neurological comorbidities or allergic respiratory diseases are at higher risk of PCC.
This paper is devoted to the development of methodology for QSPR modeling of mixtures and its application to vapor/liquid equilibrium diagrams for bubble point temperatures of binary liquid mixtures. Two types of special mixture descriptors based on SiRMS and ISIDA approaches were developed. SiRMS-based fragment descriptors involve atoms belonging to both components of the mixture, whereas the ISIDA fragments belong only to one of these components. The models were built on the data set containing the phase diagrams for 167 mixtures represented by different combinations of 67 pure liquids. Consensus models were developed using nonlinear Support Vector Machine (SVM), Associative Neural Networks (ASNN), and Random Forest (RF) approaches. For SVM and ASNN calculations, the ISIDA fragment descriptors were used, whereas Simplex descriptors were employed in RF models. The models have been validated using three different protocols: "Points out", "Mixtures out" and "Compounds out", based on the specific rules to form training/test sets in each fold of cross-validation. A final validation of the models has been performed on an additional set of 94 mixtures represented by combinations of novel 34 compounds and modeling set chemicals with each other. The root mean squared error of predictions for new mixtures of already known liquids does not exceed 5.7 K, which outperforms COSMO-RS models. Developed QSAR methodology can be applied to the modeling of any nonadditive property of binary mixtures (antiviral activities, drug formulation, etc.).
This review explores the application of the Simplex representation of molecular structure (SiRMS) QSAR approach in antiviral research. We provide an introduction to and description of SiRMS, its application in antiviral research and future directions of development of the Simplex approach and the whole QSAR field. In the Simplex approach every molecule is represented as a system of different simplexes (tetratomic fragments with fixed composition, structure, chirality and symmetry). The main advantages of SiRMS are consideration of the different physical-chemical properties of atoms, high adequacy and good interpretability of models obtained and clear procedures for molecular design. The reliability of developed QSAR models as predictive virtual screening tools and their ability to serve as the basis of directed drug design was validated by subsequent synthetic and biological experiments. The SiRMS approach is realized as the complex of the computer program 'HiT QSAR', which is available on request.
Severe adverse drug reactions (ADRs) are the fourth leading cause of fatality in the U.S. with more than 100,000 deaths per year. As up to 30% of all ADRs are believed to be caused by drug-drug interactions (DDIs), typically mediated by cytochrome P450s, possibilities to predict DDIs from existing knowledge are important. We collected data from public sources on 1485, 2628, 4371, and 27,966 possible DDIs mediated by four cytochrome P450 isoforms 1A2, 2C9, 2D6, and 3A4 for 55, 73, 94, and 237 drugs, respectively. For each of these data sets, we developed and validated QSAR models for the prediction of DDIs. As a unique feature of our approach, the interacting drug pairs were represented as binary chemical mixtures in a 1:1 ratio. We used two types of chemical descriptors: quantitative neighborhoods of atoms (QNA) and simplex descriptors. Radial basis functions with self-consistent regression (RBF-SCR) and random forest (RF) were utilized to build QSAR models predicting the likelihood of DDIs for any pair of drug molecules. Our models showed balanced accuracy of 72-79% for the external test sets with a coverage of 81.36-100% when a conservative threshold for the model's applicability domain was applied. We generated virtually all possible binary combinations of marketed drugs and employed our models to identify drug pairs predicted to be instances of DDI. More than 4500 of these predicted DDIs that were not found in our training sets were confirmed by data from the DrugBank database.
In this article the Hierarchical QSAR technology (HiT QSAR) has been used for consensus QSAR modeling of Acetylcholinesterase (AChE) inhibition by various organophosphate compounds. Simplex representation of molecular structure (SiRMS) and Lattice model (LM) QSAR approaches have been used for descriptors generation. Statistical models have been obtained by partial least squares (PLS) method. Various chiral organophosphates represented by their (R)-and (S)-isomers, racemic mixtures and achiral structures have been investigated. Successful consensus model (R 2 ¼ 0.978) based on fourteen best QSAR models obtained using different QSAR approaches and training sets for several levels and methods of molecular structure representation (2.5D, double 2.5D, and 3D) has been used for the prediction of AChE inhibition of new compounds. In order to avoid chance correlations 1000 rounds of Y-scrambling were performed for each selected model. Leverage and ellipsoid Applicability domain (DA) approaches have been used for additional estimation of the quality of prognosis. Molecular fragments enhancing and interfering with inhibitory activity have been determined. It was revealed that atoms individuality and stereochemistry of chiral surroundings of the asymmetric atom of phosphorus play a vital role in AChE inhibition. Thus, (R)-isomers are always less active than the (S) isomers and the racemate. HiT QSAR proves to be a powerful tool for investigation of activity of stereo selective interactions and can be used in such subsequent studies.
The Hierarchical Technology for Quantitative Structure - Activity Relationships (HiT QSAR) was applied to 95 diverse nitroaromatic compounds (including some widely known explosives) tested for their toxicity (50% inhibition growth concentration, IGC50) against the ciliate Tetrahymena pyriformis. The dataset was divided into subsets according to putative mechanisms of toxicity. Classification and Regression Trees (CART) approach implemented within HiT QSAR has been used for prediction of mechanism of toxicity for new compounds. The resulting models were shown to have ~80% accuracy for external datasets indicating that the mechanistic dataset division was sensible. Then, Partial Least Squares (PLS) statistical approach was used for the development of 2D QSAR models. Validated PLS models were explored to (i) elucidate the effects of different substituents in nitroaromatic compounds on toxicity; (ii) differentiate compounds by probable mechanisms of toxicity based on their structural descriptors; (iii) analyze the role of various physical-chemical factors responsible for compounds’ toxicity. Models were interpreted in terms of molecular fragments promoting or interfering with toxicity. It was also shown that mutual influence of substituents in benzene ring plays the determining role in toxicity variation. Although chemical mechanism based models were statistically significant and externally predictive (R2ext=0.64 for the external set of 63 nitroaromatics identified after all calculations have been completed), they were also shown to have limited coverage (57% for modeling and 76% for external set).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.