The challenging problem of modeling blood-brain barrier partitioning is approached through topological representation of molecular structure. A QSAR model is developed for in vivo blood-brain partitioning data treated as the logarithm of the blood-brain concentration ratio. The model consists of three structure descriptors: the hydrogen E-State index for hydrogen bond donors, HS(T)(HBd); the hydrogen E-State index for aromatic CHs, HS(T)(arom); and the second order difference valence molecular connectivity index, d(2)chi(v) (q(2) = 0.62.) The model for the set of 106 compounds is validated through use of an external validation test set (20 compounds of the 106, MAE = 0.33, rms = 0.38), 5-fold cross-validation (MAE = 0.38, rms = 0.47), prediction of +/- values for an external test set (27/28 correct), and estimation of logBB values for a large data set of 20 039 drugs and drug-like compounds. Because no 3D structure information is used, computation of logBB by the model is very fast. The quality of the validation statistics supports the claim that the model may be used for estimation of logBB values for drug and drug-like molecules. Detailed structure interpretation is given for the structure indices in the model. The model indicates that molecules that penetrate the blood-brain barrier have large HS(T)(arom) values (presence of aromatic groups) but small values of HS(T)(HBd) (fewer or weaker H-Bond donors) and smaller d(2)chi(v) values (less branched molecules with fewer electronegative atoms). These three structure descriptors encode influence of molecular context of groups as well as counts of those groups.
Topological structure methods are used to model fish toxicity against three classes of organic chemicals. The models were obtained independent of 3D structure information. Further, no mechanism of partitioning was assumed, thus avoiding the problems associated with selection of partitioning system for computation of log P. QSAR models were developed for a set of 92 compounds, including phenols, anilines and substituted aromatic hydrocarbons, yielding excellent statistics: r2 = 0.87, s = 0.25 and q2 = 0.85 leave-one-out (LOO), that are better than those reported in the literature. The model is based on molecular connectivity valence chi-1 index [1chiv], the atom type E-State indices for chlorine [ST(-Cl)] and for ether oxygen [ST(-O-)], and the maximum hydrogen E-State atom value in a molecule [Hmax]. Each of the subgroups was also separately well modeled. The model for the full set is validated through use of external validation test sets and ten-fold cross-validation (repeated three times). The quality of the validation statistics supports the claim that the model may be used for estimation of pLC50 values for similar molecules. Detailed structure interpretation is given for the descriptors in the model. These four structure descriptors encode influence of molecular context of groups as well as counts of those groups, in addition to molecular skeletal structure.
ChemInform is a weekly Abstracting Service, delivering concise information at a glance that was extracted from about 100 leading journals. To access a ChemInform Abstract of an article which was published elsewhere, please select a “Full Text” option. The original article is trackable via the “References” option.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.