40 research outputs found
Quantifying intrinsic chemical reactivity of molecular structural features for protein binding and reactive toxicity, using the MOSES chemoinformatics system
QSAR Modeling: Where Have You Been? Where Are You Going To?
Quantitative Structure-Activity Relationship modeling is one of the major computational tools employed in medicinal chemistry. However, throughout its entire history it has drawn both praise and criticism concerning its reliability, limitations, successes, and failures. In this paper, we discuss: (i) the development and evolution of QSAR; (ii) the current trends, unsolved problems, and pressing challenges; and (iii) several novel and emerging applications of QSAR modeling. Throughout this discussion, we provide guidelines for QSAR development, validation, and application, which are summarized in best practices for building rigorously validated and externally predictive QSAR models. We hope that this Perspective will help communications between computational and experimental chemists towards collaborative development and use of QSAR models. We also believe that the guidelines presented here will help journal editors and reviewers apply more stringent scientific standards to manuscripts reporting new QSAR studies, as well as encourage the use of high quality, validated QSARs for regulatory decision making
Identification of Novel Functional Inhibitors of Acid Sphingomyelinase
We describe a hitherto unknown feature for 27 small drug-like molecules, namely functional inhibition of acid sphingomyelinase (ASM). These entities named FIASMAs (Functional Inhibitors of Acid SphingoMyelinAse), therefore, can be potentially used to treat diseases associated with enhanced activity of ASM, such as Alzheimer's disease, major depression, radiation- and chemotherapy-induced apoptosis and endotoxic shock syndrome. Residual activity of ASM measured in the presence of 10 µM drug concentration shows a bimodal distribution; thus the tested drugs can be classified into two groups with lower and higher inhibitory activity. All FIASMAs share distinct physicochemical properties in showing lipophilic and weakly basic properties. Hierarchical clustering of Tanimoto coefficients revealed that FIASMAs occur among drugs of various chemical scaffolds. Moreover, FIASMAs more frequently violate Lipinski's Rule-of-Five than compounds without effect on ASM. Inhibition of ASM appears to be associated with good permeability across the blood-brain barrier. In the present investigation, we developed a novel structure-property-activity relationship by using a random forest-based binary classification learner. Virtual screening revealed that only six out of 768 (0.78%) compounds of natural products functionally inhibit ASM, whereas this inhibitory activity occurs in 135 out of 2028 (6.66%) drugs licensed for medical use in humans
Comparison of Multilabel and Single-Label Classification Applied to the Prediction of the Isoform Specificity of Cytochrome P450 Substrates
Ligand-Based Models for the Isoform Specificity of Cytochrome P450 3A4, 2D6, and 2C9 Substrates
A data set of 379 drugs and drug analogs that are metabolized by human cytochrome P450 (CYP) isoforms
3A4, 2D6, and 2C9, respectively, was studied. A series of descriptor sets directly calculable from the
constitution of these drugs was systematically investigated as to their power into classifying a compound
into the CYP isoform that metabolizes it. In a four-step build-up process eventually 303 different descriptor
components were investigated for 146 compounds of a training set by various model building methods,
such as multinomal logistic regression, decision tree, or support vector machine (SVM). Automatic variable
selection algorithms were used in order to decrease the number of descriptors. A comprehensive scheme of
cross-validation (CV) experiments was applied to assess the robustness and reliability of the four models
developed. In addition, the predictive power of the four models presented in this paper was inspected by
predicting an external validation data set with 233 compounds. The best model has a leave-one-out (LOO)
cross-validated predictivity of 89% and gives 83% correct predictions for the external validation data set.
For our favored model we showed the strong influence on the predictivity of the way a data set is split into
a training and test data set
