Document Type
Article
Publication Date
9-24-2015
Abstract
Background: 1-Octanol solubility is important in a variety of applications involving pharmacology and environmental chemistry. Current models are linear in nature and often require foreknowledge of either melting point or aqueous solubility. Here we extend the range of applicability of 1-octanol solubility models by creating a random forest model that can predict 1-octanol solubilities directly from structure. Results: We created a random forest model using CDK descriptors that has an out-of-bag (OOB) R2 value of 0.66 and an OOB mean squared error of 0.34. The model has been deployed for general use as a Shiny application. Conclusion: The 1-octanol solubility model provides reasonably accurate predictions of the 1-octanol solubility of organic solutes directly from structure. The model was developed under Open Notebook Science conditions which makes it open, reproducible, and as useful as possible
Recommended Citation
Andrew Lang and Michael A Buonaiuto. "Prediction of 1‑octanol solubilities using data from the Open Notebook Science Challenge" Chemistry Central Journal Vol. 9 Iss. 50 (2015) Available at: http://works.bepress.com/andrew-sid-lang/26/
Comments
Article in Chemistry Central Journal