1,022 research outputs found

    Prediction of aqueous intrinsic solubility of druglike molecules using Random Forest regression trained with Wiki-pS0 database

    Get PDF
    The accurate prediction of solubility of drugs is still problematic. It was thought for a long time that shortfalls had been due the lack of high-quality solubility data from the chemical space of drugs. This study considers the quality of solubility data, particularly of ionizable drugs. A database is described, comprising 6355 entries of intrinsic solubility for 3014 different molecules, drawing on 1325 citations. In an earlier publication, many factors affecting the quality of the measurement had been discussed, and suggestions were offered to improve ways of extracting more reliable information from legacy data. Many of the suggestions have been implemented in this study. By correcting solubility for ionization (i.e., deriving intrinsic solubility, S0) and by normalizing temperature (by transforming measurements performed in the range 10-50 °C to 25 °C), it can now be estimated that the average interlaboratory reproducibility is 0.17 log unit. Empirical methods to predict solubility at best have hovered around the root mean square error (RMSE) of 0.6 log unit. Three prediction methods are compared here: (a) Yalkowsky’s general solubility equation (GSE), (b) Abraham solvation equation (ABSOLV), and (c) Random Forest regression (RFR) statistical machine learning. The latter two methods were trained using the new database. The RFR method outperforms the other two models, as anticipated. However, the ability to predict the solubility of drugs to the level of the quality of data is still out of reach. The data quality is not the limiting factor in prediction. The statistical machine learning methodologies are probably up to the task. Possibly what’s missing are solubility data from a few sparsely-covered chemical space of drugs (particularly of research compounds). Also, new descriptors which can better differentiate the factors affecting solubility between molecules could be critical for narrowing the gap between the accuracy of the prediction models and that of the experimental data

    Solubility Temperature Dependence Predicted from 2D Structure

    Get PDF
    The objective of the study was to find a computational procedure to normalize solubility data determined at various temperatures (e.g., 10 – 50 oC) to values at a “reference” temperature (e.g., 25 °C). A simple procedure was devised to predict enthalpies of solution, ΔHsol, from which the temperature dependence of intrinsic (uncharged form) solubility, log S0, could be calculated. As dependent variables, values of ΔHsol at 25 °C were subjected to multiple linear regression (MLR) analysis, using melting points (mp) and Abraham solvation descriptors. Also, the enthalpy data were subjected to random forest regression (RFR) and recursive partition tree (RPT) analyses. A total of 626 molecules were examined, drawing on 2040 published solubility values measured at various temperatures, along with 77 direct calori metric measurements. The three different prediction methods (RFR, RPT, MLR) all indicated that the estimated standard deviations in the enthalpy data are 11-15 kJ mol-1, which is concordant with the 10 kJ mol-1 propagation error estimated from solubility measurements (assuming 0.05 log S errors), and consistent with the 7 kJ mol-1 average reproducibility in enthalpy values from interlaboratory replicates. According to the MLR model, higher values of mp, H-bond acidity, polarizability/dipolarity, and dispersion forces relate to more positive (endothermic) enthalpy values. However, molecules that are large and have high H-bond basicity are likely to possess negative (exothermic) enthalpies of solution. With log S0 values normalized to 25 oC, it was shown that the interlaboratory average standard deviations in solubility measurement are reduced to 0.06 - 0.17 log unit, with higher errors for the least-soluble druglike molecules. Such improvements in data mining are expected to contribute to more reliable in silico prediction models of solubility for use in drug discovery

    Multi-lab intrinsic solubility measurement reproducibility in CheqSol and shake-flask methods

    Get PDF
    This commentary compares 233 CheqSol intrinsic solubility values (log S0) reported in the Wiki-pS0 database for 145 different druglike molecules to the 838 log S0 values determined mostly by the saturation shake-flask (SSF) method for 124 of the molecules from the CheqSol set. The range of log S0 spans from -1.0 to -10.6 (log molar units), averaging at -3.8. The correlation plot between the two methods indicates r2 = 0.96, RMSE = 0.34 log unit, and a slight bias of -0.07 log unit. The average interlaboratory standard deviation (SDi) is slightly better for the CheqSol set than that of the SSF set: SDiCS = 0.15 and SDiSSF = 0.24. The intralaboratory errors reported in the CheqSol method (0.05 log) need to be multiplied by a factor of 3 to match the expected interlaboratory errors for the method. The scale factor, in part, relates to the hidden systematic errors in the single-lab values. It is expected that improved standardizations in the ‘gold standard’ SSF method, as suggested in the recent ‘white paper’ on solubility measurement methodology, should make the SDi of both methods be about ~0.15 log unit. The multi-lab averaged log S0 (and the corresponding SDi) values could be helpful additions to existing training-set molecules used to predict the intrinsic solubility of drugs and druglike molecules

    Do you know your r2?

    Get PDF
    The prediction of solubility of drugs usually calls on the use of several open-source/commercially-available computer programs in the various calculation steps. Popular statistics to indicate the strength of the prediction model include the coefficient of determination (r2), Pearson’s linear correlation coefficient (rPearson), and the root-mean-square error (RMSE), among many others. When a program calculates these statistics, slightly different definitions may be used. This commentary briefly reviews the definitions of three types of r2 and RMSE statistics (model validation, bias compensation, and Pearson) and how systematic errors due to shortcomings in solubility prediction models can be differently indicated by the choice of statistical indices. The indices we have employed in recently published papers on the prediction of solubility of druglike molecules were unclear, especially in cases of drugs from ‘beyond the Rule of 5’ chemical space, as simple prediction models showed distinctive ‘bias-tilt’ systematic type scatter

    Influence of LAR and VAR on Para-Aminopyridine Antimalarials Targetting Haematin in Chloroquine-Resistance

    Get PDF
    Antimalarial chloroquine (CQ) prevents haematin detoxication when CQ-base concentrates in the acidic digestive vacuole through protonation of its p-aminopyridine (pAP) basic aro- matic nitrogen and sidechain diethyl-N. CQ export through the variant vacuolar membrane export channel, PFCRT, causes CQ-resistance in Plasmodium falciparum but 3-methyl CQ (sontochin SC), des-ethyl amodiaquine (DAQ) and bis 4-aminoquinoline piperaquine (PQ) are still active. This is determined by changes in drug accumulation ratios in parasite lipid (LAR) and in vacuolar water (VAR). Higher LAR may facilitate drug binding to and blocking PFCRT and also aid haematin in lipid to bind drug. LAR for CQ is only 8.3; VAR is 143,482. More hydrophobic SC has LAR 143; VAR remains 68,523. Similarly DAQ with a phenol sub- stituent has LAR of 40.8, with VAR 89,366. In PQ, basicity of each pAP is reduced by distal piperazine N, allowing very high LAR of 973,492, retaining VAR of 104,378. In another bis quinoline, dichlorquinazine (DCQ), also active but clinically unsatisfactory, each pAP retains basicity, being insulated by a 2-carbon chain from a proximal nitrogen of the single linking piperazine. While LAR of 15,488 is still high, the lowest estimate of VAR approaches 4.9 million. DCQ may be expected to be very highly lysosomotropic and therefore potentially hepatotoxic. In 11 pAP antimalarials a quadratic relationship between logLAR and logRe- sistance Index (RI) was confirmed, while log (LAR/VAR) vs logRI for 12 was linear. Both might be used to predict the utility of structural modifications

    The bacterial cell envelope as delimiter of anti-infective bioavailability - An in vitro permeation model of the Gram-negative bacterial inner membrane.

    Get PDF
    Gram-negative bacteria possess a unique and complex cell envelope, composed of an inner and outer membrane separated by an intermediate cell wall-containing periplasm. This tripartite structure acts intrinsically as a significant biological barrier, often limiting the permeation of anti-infectives, and so preventing such drugs from reaching their target. Furthermore, identification of the specific permeation-limiting envelope component proves difficult in the case of many anti-infectives, due to the challenges associated with isolation of individual cell envelope structures in bacterial culture. The development of an in vitro permeation model of the Gram-negative inner membrane, prepared by repeated coating of physiologically-relevant phospholipids on Transwell(®) filter inserts, is therefore reported, as a first step in the development of an overall cell envelope model. Characterization and permeability investigations of model compounds as well as anti-infectives confirmed the suitability of the model for quantitative and kinetically-resolved permeability assessment, and additionally confirmed the importance of employing bacteria-specific base materials for more accurate mimicking of the inner membrane lipid composition - both advantages compared to the majority of existing in vitro approaches. Additional incorporation of further elements of the Gram-negative bacterial cell envelope could ultimately facilitate model application as a screening tool in anti-infective drug discovery or formulation development

    Can small drugs predict the intrinsic aqueous solubility of ‘beyond Rule of 5’ big drugs?

    Get PDF
    The aim of the study was to explore to what extent small molecules (mostly from the Rule of 5 chemical space) can be used to predict the intrinsic aqueous solubility, S0, of big molecules from beyond the Rule of 5 (bRo5) space. It was demonstrated that the General Solubility Equation (GSE) and the Abraham Solvation Equation (ABSOLV) underpredict solubility in systematic but slightly ways. The Random Forest regression (RFR) method predicts solubility more accurately, albeit in the manner of a ‘black box.’ It was discovered that the GSE improves considerably in the case of big molecules when the coefficient of the log P term (octanol-water partition coefficient) in the equation is set to -0.4 instead of the traditional -1 value. The traditional GSE underpredicts solubility for molecules with experimental S0 < 50 µM. In contrast, the ABSOLV equation (trained with small molecules) underpredicts the solubility of big molecules in all cases tested. It was found that the errors in the ABSOLV-predicted solubilities of big molecules correlate linearly with the number of rotatable bonds, which suggests that flexibility may be an important factor in differentiating solubility of small from big molecules. Notably, most of the 31 big molecules considered have negative enthalpy of solution: these big molecules become less soluble with increasing temperature, which is compatible with ‘molecular chameleon’ behavior associated with intramolecular hydrogen bonding. The X‑ray structures of many of these molecules reveal void spaces in their crystal lattices large enough to accommodate many water molecules when such solids are in contact with aqueous media. The water sorbed into crystals suspended in aqueous solution may enhance solubility by way of intra-lattice solute-water interactions involving the numerous H‑bond acceptors in the big molecules studied. A ‘Solubility Enhancement–Big Molecules’ index was defined, which embodies many of the above findings.</p

    Impact of capillary flow hydrodynamics on carrier-mediated transport of opioid derivatives at the blood-brain barrier, based on pH-dependent Michaelis-Menten and Crone-Renkin analyses

    Get PDF
    Most studies of blood-brain barrier (BBB) permeability and transport are conducted at a single pH, but more detailed information can be revealed by using multiple pH values. A pH-dependent biophysical model was applied to the mechanistic analysis of published pH-dependent BBB luminal uptake data from three opioid derivatives in rat: pentazocine (Suzuki et al., 2002a, 2002b), naloxone (Suzuki et al., 2010a), and oxycodone (Okura et al., 2008). Two types of data were processed: in situ brain perfusion (ISBP) and brain uptake index (BUI). The published perfusion data were converted to apparent luminal permeability values, Papp, and analyzed by the pCEL-X program (Yusof et al., 2014), using the pH-dependent Crone-Renkin equation (pH-CRE) to determine the impact of cerebrovascular flow on the Michaelis-Menten transport parameters (Avdeef and Sun, 2011). For oxycodone, the ISBP data had been measured at pH 7.4 and 8.4. The present analysis indicates a 7-fold lower value of the cerebrovascular flow velocity, Fpf, than that expected in the original study. From the pyrilamine-inhibited data, the flow-corrected passive intrinsic permeability value was determined to be P0 = 398 × 10− 6 cm·s− 1. The uptake data indicate that the neutral form of oxycodone is affected by a transporter at pH 8.4. The extent of the cation uptake was less certain from the available data. For pentazocine, the brain uptake by the BUI method had been measured at pH 5.5, 6.5, and 7.4, in a concentration range 0.1–40 mM. Under similar conditions, ISBP data were also available. The pH-CRE determined values of Fpf from both methods were nearly the same, and were smaller than the expected value in the original publication. The transport of the cationic pentazocine was not fully saturated at pH 5.5 at 40 mM. The transport of the neutral species at pH 7.4 appeared to reach saturation at 40 mM pentazocine concentration, but not at 12 mM. In the case of naloxone, a pH-dependent Michaelis-Menten equation (pH-MME) analysis of the data indicated a smooth sigmoidal transition from a higher capacity uptake process affecting cationic naloxone (pH 5.0–7.0) to a lower capacity uptake process affecting the neutral drug (pH 8.0–8.5), with cross-over point near pH 7.4. Evidently, measurements at multiple pH values can reveal important information about both cerebrovascular flow and BBB transport kinetics

    Anomalous salting-out, self-association and pKa effects in the practically-insoluble bromothymol blue

    Get PDF
    Background and Purpose: The widely-used and practically insoluble diprotic acidic dye, bromothymol blue (BTB), is a neutral molecule in strongly acidic aqueous solutions. The Schill (1964) extensive solubility-pH measurement of bromothymol blue in 0.1 and 1.0 M NaCl solutions, with pH adjusted with HCl from 0.0 to 5.4, featured several unusual findings. The data suggest that the difference in solubility of the neutral-form molecule in 1M NaCl is more than 0.7 log unit lower than the solubility in pure water. This could be considered as uncharacteristically high for a salting-out effect. Also, the study reported two apparent values of pKa1, 1.48 and 1.00, in 0.1 M and 1.0 M NaCl solutions, respectively. The only other measured value found for pKa1 in the literature is -0.66 (Gupta and Cadwallader, 1968). Experimental Approach: It was reasoned that the there can be only a single pKa1 for BTB.  Also, it was hypothesized that salting-out alone might not account for such a large difference in solubility observed at  the two levels of salt. A generalized mass action approach incorporating activity corrections for charged species using the Stokes-Robinson hydration equation and for neutral species using the Setschenow equation, was selected to analyze the Schill solubility-pH data to seek a rationalization of these unusual results. Key Results: BTB reveals complex speciation chemistry in saturated aqueous solutions which had been poorly understood for many years. The appear­ance of two different values of pKa1 at different levels of NaCl and the anomalously high value of the empirical salting-out constant could be rationalized to normal values by invoking the formation of a very stable neutral dimer (log K2 = 10.0 ± 0.1 M-1).  A ‘normal’ salting-out constant, 0.25 M-1 was then derived. It was also possible to estimate the ‘self-interaction’ constant.  The data analysis in the present study critically depended on the pKa1 = -0.66 reported by Gupta and Cadwallader. Conclusion: A more reasonable salting-out constant and a consistent single value for pKa1 have been determined by considering a self-interacting (aggregation) model involving an uncharged form of the molecule, which is likely a zwitterion, as suggested by literature spectrophotometric studies
    corecore